Query         lcl|NC_015279.1_cdsid_YP_004322275.1 [gene=gp23] [protein=precursor of major head subunit] [protein_id=YP_004322275.1] [location=102850..104253]
Match_columns 467
No_of_seqs    167 out of 418
Neff          4.6
Searched_HMMs 1612
Date          Thu Nov 7 15:17:05 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_117 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_117_vs_rec_db.hhr

 No Hit                             Prob  E-value  P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:104915 Length: 470 100.0   3E-240   2E-243 1333.7  37.3  458   1-467   3-470 (470)
  2 protein:vir:104549 Length: 462 100.0   4E-238   2E-241 1322.4  35.6  449   1-467   1-462 (462)
  3 protein:vir:106998 Length: 468 100.0   1E-236   9E-240 1313.5  37.1  451   1-467   1-468 (468)
  4 protein:vir:103181 Length: 457 100.0   1E-233   7E-237 1297.7  36.4  444   1-467   1-457 (457)
  5 protein:vir:106286 Length: 534 100.0   6E-224   3E-227 1244.6  36.2  453   2-466   1-534 (534)
  6 protein:vir:6901 Length: 522 # 100.0   9E-223   6E-226 1237.8  35.3  453   1-466   4-522 (522)
  7 protein:vir:98143 Length: 524  100.0   4E-222   3E-225 1234.2  34.9  454   1-466   1-524 (524)
  8 protein:vir:103463 Length: 521 100.0   1E-221   8E-225 1231.7  35.2  453   1-466   3-521 (521)
  9 protein:vir:80986 Length: 528  100.0   2E-221   9E-225 1231.2  35.2  454   1-466   1-528 (528)
 10 protein:vir:5670 Length: 514 # 100.0   1E-221   7E-225 1231.9  33.4  450   5-466   1-514 (514)
 11 protein:vir:7214 Length: 521 # 100.0   2E-221   1E-224 1230.2  34.9  453   1-466   3-521 (521)
 12 protein:vir:101039 Length: 529 100.0   4E-221   3E-224 1228.7  34.2  454   1-466   2-529 (529)
 13 protein:vir:101811 Length: 529 100.0   2E-220   1E-223 1224.6  34.9  453   1-466   2-529 (529)
 14 protein:vir:6601 Length: 528 # 100.0   4E-219   2E-222 1218.0  35.2  454   1-466   1-528 (528)
 15 protein:vir:100603 Length: 529 100.0   6E-218   4E-221 1211.4  34.7  454   1-466   2-529 (529)
 16 protein:vir:107947 Length: 519 100.0   4E-217   2E-220 1207.1  35.4  453   1-466   1-519 (519)
 17 protein:vir:5942 Length: 523 # 100.0   5E-193   3E-196 1074.9  33.0  411   1-453   1-523 (523)
 18 protein:vir:4830 Length: 397 #  96.2  0.00087  5.4E-07   37.4  20.1  336   1-463   1-397 (397)
 19 protein:vir:78523 Length: 338   94.5   0.0042  2.6E-06   33.6  16.9  314  52-453   1-338 (338)
 20 protein:vir:96223 Length: 324   94.1   0.0055  3.4E-06   33.0  19.9  308  35-450   1-324 (324)
 21 protein:vir:4953 Length: 397 #  93.7   0.0066  4.1E-06   32.5  21.3  337   1-463   1-397 (397)
 22 protein:vir:3033 Length: 272 #  93.0   0.0093  5.8E-06   31.7  14.4  266 105-455   1-272 (272)
 23 protein:vir:9820 Length: 272 #  93.0   0.0093  5.8E-06   31.7  14.4  266 105-455   1-272 (272)
 24 protein:vir:103955 Length: 324  93.0   0.0093  5.8E-06   31.7  18.9  303  36-450   1-324 (324)
 25 protein:vir:9410 Length: 415 #  92.4    0.011  7.1E-06   31.2  19.3  343   1-452  28-415 (415)
 26 protein:vir:81227 Length: 413   92.4    0.012  7.1E-06   31.2  18.0  344   1-467  31-411 (413)
 27 protein:vir:1886 Length: 385 #  92.3    0.012  7.3E-06   31.2  19.9  331   1-454   1-385 (385)
 28 protein:vir:191 Length: 385 #   92.3    0.012  7.3E-06   31.2  19.9  331   1-454   1-385 (385)
 29 protein:vir:97148 Length: 324   92.3    0.012  7.5E-06   31.1  20.5  308  32-450   1-324 (324)
 30 protein:vir:10364 Length: 390   92.2    0.012  7.6E-06   31.1  18.3  324   1-464  30-390 (390)
 31 protein:vir:81100 Length: 415   92.1    0.013  7.9E-06   31.0  19.3  340   1-456  28-415 (415)
 32 protein:vir:79987 Length: 415   92.1    0.013  7.9E-06   31.0  19.3  340   1-456  28-415 (415)
 33 protein:vir:98339 Length: 415   92.1    0.013  7.9E-06   31.0  19.3  340   1-456  28-415 (415)
 34 protein:vir:99749 Length: 324   91.6    0.015  9.3E-06   30.6  18.6  309  32-450   1-324 (324)
 35 protein:vir:9309 Length: 324 #  90.6     0.02  1.2E-05   29.9  20.0  305  35-450   1-324 (324)
 36 protein:vir:1268 Length: 397 #  89.8    0.024  1.5E-05   29.4  16.0  329   1-464  39-397 (397)
 37 protein:vir:4700 Length: 415 #  88.9    0.029  1.8E-05   29.0  19.0  346   1-456  28-415 (415)
 38 protein:vir:4600 Length: 415 #  88.9    0.029  1.8E-05   29.0  19.0  346   1-456  28-415 (415)
 39 protein:vir:6212 Length: 434 #  88.7    0.031  1.9E-05   28.9  18.0  339   1-454  58-434 (434)
 40 protein:vir:7771 Length: 330 #  88.0    0.035  2.2E-05   28.6  12.2  293 105-448   1-330 (330)
 41 protein:vir:3845 Length: 395 #  88.0    0.035  2.2E-05   28.6  16.8  336   2-454   1-395 (395)
 42 protein:vir:104085 Length: 320  87.1    0.041  2.5E-05   28.2  15.0  293  43-448   1-320 (320)
 43 protein:vir:107593 Length: 392  86.7    0.044  2.7E-05   28.0  15.2  325   1-467   1-385 (392)
 44 protein:vir:105004 Length: 392  86.7    0.044  2.7E-05   28.0  15.2  325   1-467   1-385 (392)
 45 protein:vir:102082 Length: 392  86.7    0.044  2.7E-05   28.0  15.2  325   1-467   1-385 (392)
 46 protein:vir:102873 Length: 392  86.7    0.044  2.7E-05   28.0  15.2  325   1-467   1-385 (392)
 47 protein:vir:81160 Length: 371   86.1    0.048    3E-05   27.8  15.9  323   1-466   1-371 (371)
 48 protein:vir:1433 Length: 435 #  84.3    0.061  3.8E-05   27.2  15.6  381   2-434   1-435 (435)
 49 protein:vir:78830 Length: 324   83.5    0.068  4.2E-05   27.0  18.3  308  35-450   1-324 (324)
 50 protein:vir:96392 Length: 324   83.5    0.068  4.2E-05   27.0  18.3  308  35-450   1-324 (324)
 51 protein:vir:9574 Length: 300 #  83.2     0.07  4.4E-05   26.9  16.8  279  67-448   1-300 (300)
 52 protein:vir:4997 Length: 397 #  80.9     0.09  5.6E-05   26.3  20.7  339   1-454   1-397 (397)
 53 protein:vir:101650 Length: 497  80.5    0.094  5.8E-05   26.2  20.6  353   1-454  53-497 (497)
 54 protein:vir:7855 Length: 497 #  80.5    0.094  5.8E-05   26.2  20.6  353   1-454  53-497 (497)
 55 protein:vir:101607 Length: 379  79.9      0.1  6.2E-05   26.1  18.0  328   1-466  16-379 (379)
 56 protein:vir:81070 Length: 390   77.3     0.13  7.8E-05   25.5  21.3  325   1-464  19-390 (390)
 57 protein:vir:104256 Length: 458  77.3     0.13  7.8E-05   25.5  17.6  335   1-447  73-458 (458)
 58 protein:vir:4511 Length: 409 #  75.9     0.14  8.8E-05   25.2  15.7  333   1-467  41-407 (409)
 59 protein:vir:95898 Length: 274   75.0     0.15  9.4E-05   25.1   9.5  260 131-452   1-274 (274)
 60 protein:vir:96262 Length: 274   75.0     0.15  9.4E-05   25.1   9.5  260 131-452   1-274 (274)
 61 protein:vir:8420 Length: 477 #  72.7     0.18  0.00011   24.7  20.9  353   1-453  82-477 (477)
 62 protein:vir:4856 Length: 293 #  69.5     0.22  0.00014   24.2  17.3  268  54-450   1-293 (293)
 63 protein:vir:97053 Length: 390   68.4     0.24  0.00015   24.0  19.1  324   1-464  32-390 (390)
 64 protein:vir:3870 Length: 400 #  62.9     0.33   0.0002   23.2  16.0  320   1-449  41-400 (400)
 65 protein:vir:94673 Length: 419   61.9     0.34  0.00021   23.1  19.0  338   1-467  32-418 (419)
 66 protein:vir:3158 Length: 321 #  61.5     0.35  0.00022   23.1  15.7  295  31-467   1-310 (321)
 67 protein:vir:80376 Length: 435   61.3     0.36  0.00022   23.0  18.9  346   1-449  42-435 (435)
 68 protein:vir:1638 Length: 298 #  58.8      0.4  0.00025   22.7  15.4  277  69-454   1-298 (298)
 69 protein:vir:4339 Length: 395 #  58.2     0.42  0.00026   22.7  20.7  324   1-466  37-395 (395)
 70 protein:vir:2344 Length: 397 #  54.8      0.5  0.00031   22.3  11.1  306 123-467   1-330 (397)
 71 protein:vir:41 Length: 299 # N  54.1     0.51  0.00032   22.2  18.2  276  66-455   1-299 (299)
 72 protein:vir:79928 Length: 393   51.7     0.57  0.00036   21.9  12.9  348   8-467   1-392 (393)
 73 protein:vir:6242 Length: 390 #  50.1     0.62  0.00038   21.7  15.6  331   1-467   4-390 (390)
 74 protein:vir:105334 Length: 276  50.0     0.62  0.00039   21.7  11.7  264 105-448   1-276 (276)
 75 protein:vir:7409 Length: 408 #  49.7     0.63  0.00039   21.7  19.4  330   1-467  39-394 (408)
 76 protein:vir:2430 Length: 318 #  49.2     0.65   0.0004   21.6  17.6  295  48-453   1-318 (318)
 77 protein:vir:4226 Length: 326 #  48.4     0.67  0.00042   21.5  15.9  309  35-448   1-326 (326)
 78 protein:vir:2504 Length: 305 #  47.5      0.7  0.00043   21.4  16.7  287  55-454   1-305 (305)
 79 protein:vir:100135 Length: 418  46.9     0.72  0.00045   21.4  20.5  336   1-453  35-418 (418)
 80 protein:vir:102119 Length: 404  46.8     0.72  0.00045   21.4  19.5  329   1-448  37-404 (404)
 81 protein:vir:8102 Length: 543 #  46.1     0.75  0.00046   21.3  18.2  325   1-448 178-543 (543)
 82 protein:vir:105038 Length: 428  41.6     0.92  0.00057   20.8  15.8  337   1-453  30-428 (428)
 83 protein:vir:100884 Length: 389  41.2     0.94  0.00058   20.7  14.8  323   1-449  12-389 (389)
 84 protein:vir:1328 Length: 392 #  39.6        1  0.00063   20.6  15.9  334   1-449   9-392 (392)
 85 protein:vir:9704 Length: 394 #  37.8      1.1  0.00068   20.4  19.0  319   1-467  30-391 (394)
 86 protein:vir:100172 Length: 394  37.1      1.1  0.00071   20.3  13.6  329   1-454  12-394 (394)
 87 protein:vir:95763 Length: 297   36.7      1.2  0.00072   20.2  16.8  282  54-450   1-297 (297)
 88 protein:vir:96833 Length: 275   33.9      1.3  0.00082   19.9  13.3  262 113-452   1-275 (275)
 89 protein:vir:8187 Length: 311 #  32.6      1.4  0.00088   19.8  15.7  285  68-448   1-311 (311)
 90 protein:vir:8885 Length: 347 #  32.1      1.4   0.0009   19.7  11.5  306 105-448   1-347 (347)
 91 protein:vir:80930 Length: 278   31.5      1.5  0.00093   19.6  14.4  270 105-454   1-278 (278)
 92 protein:vir:94711 Length: 347   31.4      1.5  0.00093   19.6  15.3  304 105-447   1-347 (347)
 93 protein:vir:4092 Length: 390 #  29.6      1.6    0.001   19.4  20.0  346   1-454   1-390 (390)
 94 protein:vir:78223 Length: 333   28.8      1.7   0.0011   19.3  18.3  308  32-448   1-333 (333)
 95 protein:vir:96123 Length: 274   28.3      1.8   0.0011   19.2  16.6  267 119-467   1-274 (274)
 96 protein:vir:95107 Length: 270   27.7      1.8   0.0011   19.2  11.5  258 131-456   1-270 (270)
 97 protein:vir:1025 Length: 408 #  27.1      1.9   0.0012   19.1  16.6  332   1-450   4-408 (408)
 98 protein:vir:1084 Length: 437 #  26.1        2   0.0012   19.0  16.0  325   1-450  56-437 (437)
 99 protein:vir:4456 Length: 401 #  24.4      2.2   0.0014   18.7  18.2  321   1-466  18-401 (401)
100 protein:vir:100247 Length: 425  22.6      2.4   0.0015   18.5  19.0  327   1-455  64-425 (425)
101 protein:vir:94142 Length: 304   22.3      2.5   0.0015   18.4  17.4  282  61-454   1-304 (304)
102 protein:vir:105905 Length: 304  22.3      2.5   0.0015   18.4  17.4  282  61-454   1-304 (304)
103 protein:vir:3991 Length: 404 #  20.2      2.8   0.0017   18.1  20.0  331   1-454  39-404 (404)

No 1
>protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435
Probab=100.00 E-value=3e-240 Score=1333.73 Aligned_cols=458 Identities=78% Similarity=1.185 Sum_probs=422.5

Q ss_pred             CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhh-----hhhccccccccccccc
Q lcl|NC_015279.    1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQP-----TNAVGNGGYTSSGGQT   75 (467)
Q Consensus         1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~-----~~~~g~~~~~st~tg~   75 (467)
                      |+++|+|+|||+|||||||+|+|++.+||+|+++|||||||+++|++.
+|+|++ ....+.++.+||+|++ T Consensus 3 ~~~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~~-----~l~e~~~~~~~~~~~~~~i~~st~t~~ 77 (470) T protein:vir:10 3 MFNSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREERN-----FLSEAPNVNTNSGATAGFSADATAAGP 77 (470) T ss_pred cchhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhccc-----hhhhhhhcccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999994 677773 2333344558999999 Q ss_pred ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccccccccccccccccc---cc Q lcl|NC_015279. 76 VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMS---DA 152 (467) Q Consensus 76 i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~---~~ 152 (467) |++|||+||+||||++|||||+|||||||||||||||||||+||.+|+|+|+||+||++.|||.+++........ .. T Consensus 78 v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~ 157 (470) T protein:vir:10 78 VAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDDTSGFTATGAN 157 (470) T ss_pred ccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCcccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999999877655433222 22 Q ss_pred ccccCCCccccccccccccccc-ccccccccccccchhhHhhcCC-CCCccceeeeEEEEEEEEeecccccccccHHHHH Q lcl|NC_015279. 153 AAGLGTTSQAGSNPAALNPVAT-ASSTGYNVGQGMRTDEAEDLGT-SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQ 230 (467) Q Consensus 153 ~~~~~~~~~agt~p~~ln~~~~-~~~~~~~~~~Gm~TA~aE~LGs-~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQ 230 (467) ..+.+...+++++|..++.... 
.....|+++.||+|+.+|.||+ ++++|+||+|+||||+|||||||||||||||||| T Consensus 158 ~~g~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQ 237 (470) T protein:vir:10 158 NVGLGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQ 237 (470) T ss_pred cccccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHH Confidence 2333445677777776655433 3455689999999999999996 4567999999999999999999999999999999 Q ss_pred HHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_015279. 231 DLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANA 310 (467) Q Consensus 231 DLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~ 310 (467) |||||||||||+||+|||||||||||||||||+|+++|+|||+.|++++|+|||+++++|||++|+||+|+|||+||||+ T Consensus 238 DLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~ 317 (470) T protein:vir:10 238 DLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANA 317 (470) T ss_pred HHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEE Q lcl|NC_015279. 
311 IAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYV 390 (467) Q Consensus 311 i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~ 390 (467) |+|||+||+||||||||+||++|+|||||++.|++++++++|+|+++|+|+|+|||+||||||+++ +-+.|+||++ T Consensus 318 i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~d~y~~~----~~~a~~dy~~ 393 (470) T protein:vir:10 318 IAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGILQGKYRVYIDPFSAS----GGAAATQYYV 393 (470) T ss_pred HHHhhccccceEEEEchhHHhHhhhccccccccccccccccCCCCceEEEEecCceEEEeeccccc----cCcccccEEE Confidence 999999999999999999999999999999999999999999999999999999999999999763 1246889999 Q ss_pred EEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceeecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 391 VGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 391 vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) |||||++++|+||||||||||++++++||+||||++||||||||++|||+++++++++++++|+|||||||+||||| T Consensus 394 vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~i~~~~n~y~r~~~v~~l~ 470 (470) T protein:vir:10 394 VGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTRYGLVENPFSQGTTQGLGTLTRNSNRYYRRVKVANLM 470 (470) T ss_pred EEEecCcceecceeeccccccccCCCCCCccccceeeeeeeeceeecCcccCCCcccccccCCCCceeeEEEeeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=3.5e-238 Score=1322.38 Aligned_cols=449 Identities=63% Similarity=0.969 Sum_probs=416.4 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhcccccc--cccccccccc Q lcl|NC_015279. 
1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGY--TSSGGQTVAG 78 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~--~st~tg~i~~ 78 (467) |++ |+|+|||+|||||||+|+|++.|||+|+++|||||||+++|++ ++|+|++ |+||+ +|++++++++ T Consensus 1 ms~-~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~-----~~l~ea~----~~~g~~~~~~~t~~~~~ 70 (462) T protein:vir:10 1 MSI-QQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEG-----QVLNETL----QTTGYTTGDTATGPVAG 70 (462) T ss_pred Cch-HHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhcc-----cchhccc----cccCCCcCccccccccc Confidence 988 7999999999999999999999999999999999999998866 7999984 78888 6888999999 Q ss_pred cCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC------CCCCccccccccccccccccccccccccccc Q lcl|NC_015279. 79 FDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST------QGGTEALFDEADTAFAGQNEGFDLTNGMSDA 152 (467) Q Consensus 79 ~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~------qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~ 152 (467) |||+||+||||++|||||+|||||||||||||||||||+||.+ |+|+||||||||+.||+.++...... ... T Consensus 71 ~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~~~~--~~~ 148 (462) T protein:vir:10 71 FDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNEPNAGFSGGAGTGLSNY--DPT 148 (462) T ss_pred ccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhccCCcCcccccccccccc--ccc Confidence 9999999999999999999999999999999999999999985 56899999999999998765543322 122 Q ss_pred ccccCCCcccccccccccccccccccccccccccchhhHhhcCCC--CCccceeeeEEEEEEEEeecccccccccHHHHH Q lcl|NC_015279. 153 AAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTS--GDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQ 230 (467) Q Consensus 153 ~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~--g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQ 230 (467) ........+.+++|...++...+....++.+.||+|+.+|.||+. 
+++|+||+|+||||+|||||||||||||||||| T Consensus 149 ~~~~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQ 228 (462) T protein:vir:10 149 ASSSAVNDAEGANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQ 228 (462) T ss_pred cccccccccccccceeecCCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHH Confidence 233344556777887777766666666778899999999999953 457999999999999999999999999999999 Q ss_pred HHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_015279. 231 DLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANA 310 (467) Q Consensus 231 DLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~ 310 (467) |||||||||||+||+|||||||||||||||||+|+++|+|||+.|++++|+|||+++++|||++|+||+|+|||+||||+ T Consensus 229 DLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~dl~~~~~gr~~~e~~k~l~~qi~~ean~ 308 (462) T protein:vir:10 229 DLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFDLDVDSNGRWSVEKFKGLLFQIERDSNA 308 (462) T ss_pred HHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceeeeccccchHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc---cccccCCceeEEEecCceEEEecccccccchhhccCCCc Q lcl|NC_015279. 311 IAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN---LNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQ 387 (467) Q Consensus 311 i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~---~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~d 387 (467) |+|||+||+||||||||+||++|+|||||++.|++++. 
.++|+++++|+|+|+|||+||||||+.+ ++|+| T Consensus 309 i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~------ns~~d 382 (462) T protein:vir:10 309 IGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSN------VADKH 382 (462) T ss_pred HHHHhccccceEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecCceEEEEecccCC------Ccccc Confidence 99999999999999999999999999999999998743 5799999999999999999999999864 45789 Q ss_pred eEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceeecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 388 YYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 388 Y~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) |++|||||++++|+||||||||||+++|++||+||||++||||||||++|||+++++++++|+++|+|||||||+||||| T Consensus 383 y~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~t~~~~~~~~~~~~~~n~y~r~~~v~~l~ 462 (462) T protein:vir:10 383 FYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVSNPFSGGLTQGSGALTANANKYYRRVQVANLM 462 (462) T ss_pred eEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeeeeecCCCCCcCCccccccccCcceeeeEEeeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=1.5e-236 Score=1313.50 Aligned_cols=451 Identities=62% Similarity=0.999 Sum_probs=415.1 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhcccccc-------ccccc Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGY-------TSSGG 73 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~-------~st~t 73 (467) |||+|+|+|||+|||||||+|+|++.|||+|+++|||||||+++|++ .+|.|...++.|++.. 
++++| T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQe~~~~~~~-----~~~~~~~~~~~~~~~~~~~n~~~~~~~t 75 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREER-----GMLNEVAVNSLGAGTIAPAGSALGSANT 75 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhHHHHHhccc-----cccchhhHhhcCCcccchhhhhhhhccc Confidence 99999999999999999999999999999999999999999999999 4889999999998854 58889 Q ss_pred ccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccccccccccccccccccccc Q lcl|NC_015279. 74 QTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAA 153 (467) Q Consensus 74 g~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~ 153 (467) ++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|+|+||||||||++|||.++...... .... T Consensus 76 ~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EAf~nEadt~fSg~~~~~~~~~--~~~~ 153 (468) T protein:vir:10 76 GGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDY--AVRT 153 (468) T ss_pred ccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccceecccccccccccccccccc--cccc Confidence 99999999999999999999999999999999999999999999999999999999999999998765443322 2223 Q ss_pred cccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHH Q lcl|NC_015279. 154 AGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLK 233 (467) Q Consensus 154 ~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLk 233 (467) .....+.+.+++|...+. 
...+.++++.||+|+++|.||+++++|+||+|+||||+||||||||||||||||||||| T Consensus 154 ~~~~~~~~~g~~~~~~~~---a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLK 230 (468) T protein:vir:10 154 GAGVGGDSEGNNPALLND---AAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLK 230 (468) T ss_pred ccccccCCCCCccccccc---ccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHH Confidence 333445566677665544 34566889999999999999998899999999999999999999999999999999999 Q ss_pred HhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015279. 234 AIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQ 313 (467) Q Consensus 234 AiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~ 313 (467) ||||||||+||+|||||||||||||||||+|+++|+|||+++++++|+|||+++++|||++|+||+|+|||+||||+|+| T Consensus 231 AiHGLDAEtELaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~ 310 (468) T protein:vir:10 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQ 310 (468) T ss_pred HhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhhheecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccCCccEEEEchHHHHHHhhhcchhcccccccc-----cccccCCceeEEEecCceEEEecccccccchhhccCCCce Q lcl|NC_015279. 314 RTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-----LNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQY 388 (467) Q Consensus 314 ~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-----~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY 388 (467) ||+||+||||||||+||++|+|||||++.|++++. 
+++|+|+++|+|+|+|||+||||+|+.+ ++|+|| T Consensus 311 ~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~------~s~~dY 384 (468) T protein:vir:10 311 ETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN------LSDKHY 384 (468) T ss_pred hhccccccEEEechhHHHHHhhcCcceecccccccccccccccccCcceEEEEecCceEEEEcccccc------CCccce Confidence 99999999999999999999999999999999865 4899999999999999999999999863 568999 Q ss_pred EEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceeecCcccccC--ccc---cccccccccccceeee Q lcl|NC_015279. 389 YVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVANPFAEGTT--VGA---GRLRVNSNRYYRRVAV 463 (467) Q Consensus 389 ~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~--~~~---~~~~~~~n~y~r~~~v 463 (467) ++|||||++++|+||||||||||+|++++||+||||++||||||||++|||++..+ ++. ..+..++|+|||||+| T Consensus 385 ~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~g~~~~~~~~~~~N~y~r~~~v 464 (468) T protein:vir:10 385 YVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQV 464 (468) T ss_pred EEEEEecCcceeceeeeccccccccccccCCCcccceeeeeeeeceeecccceeccccCCCcccccccccccceeeeEEE Confidence 99999999999999999999999999999999999999999999999999985322 221 2356799999999999 Q ss_pred eccC Q lcl|NC_015279. 464 KNLM 467 (467) Q Consensus 464 ~~~~ 467 (467) |||| T Consensus 465 ~~l~ 468 (468) T protein:vir:10 465 TNLM 468 (468) T ss_pred eccC Confidence 9999 No 4 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=1.1e-233 Score=1297.75 Aligned_cols=444 Identities=64% Similarity=1.003 Sum_probs=417.2 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhccccccc--ccccccccc Q lcl|NC_015279. 
1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYT--SSGGQTVAG 78 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~--st~tg~i~~ 78 (467) |++ |+|+|||+|||||||+|||++.|||+|+++|||||||+++|++ ++|+|+ .|+||+. |++|++|++ T Consensus 1 m~~-~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~lenq~~~~~~~~-----~~l~ea----~~~~g~~~~s~~t~~v~~ 70 (457) T protein:vir:10 1 MSF-QNLQEKWAPVLEHDSLPEIGDSYKKGVVAQLLENQEKAIAEEG-----KILTET----LQTTGYTGGDTVTGPVAG 70 (457) T ss_pred Cch-HHHHHHhhHhhccCccchhhhhHHHHHHHHHhhhHHHHHHhcc-----cccccc----ccccCCCccccccccccc Confidence 988 7999999999999999999999999999999999999999866 799997 4899995 888999999 Q ss_pred cCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCC------Cccccccccccccccccccccccccccc Q lcl|NC_015279. 79 FDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGG------TEALFDEADTAFAGQNEGFDLTNGMSDA 152 (467) Q Consensus 79 ~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG------tEAlfnEadt~fSg~~a~~~~~~~~~~~ 152 (467) +||+||+||||++|||||+|||||||||||||||||||+||++|.+ +|||||||++.||+.+++.... T Consensus 71 ~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~------ 144 (457) T protein:vir:10 71 FDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPG------ 144 (457) T ss_pred ccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceeeeccCcccCccccccccc------ Confidence 9999999999999999999999999999999999999999999876 7999999999999876653321 Q ss_pred ccccCCCcccccccccccccccccccccccccccchhhHhhcCCC--CCccceeeeEEEEEEEEeecccccccccHHHHH Q lcl|NC_015279. 153 AAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTS--GDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQ 230 (467) Q Consensus 153 ~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~--g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQ 230 (467) .......+.+++|...++...+....++++.||+|+++|.||+. 
++.|+||+|+||||+|||||||||||||||||| T Consensus 145 -~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQ 223 (457) T protein:vir:10 145 -ATGVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQ 223 (457) T ss_pred -ccccccccccccccccCccccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHH Confidence 11123456677888888777777788899999999999999953 357999999999999999999999999999999 Q ss_pred HHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHH Q lcl|NC_015279. 231 DLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANA 310 (467) Q Consensus 231 DLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~ 310 (467) |||||||||||+||+|||||||||||||||||+|+++|+|||++|++++|+|||+++++|||++|+||+|+|||+||||+ T Consensus 224 DLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~ 303 (457) T protein:vir:10 224 DLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANA 303 (457) T ss_pred HHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc---cccccCCceeEEEecCceEEEecccccccchhhccCCCc Q lcl|NC_015279. 311 IAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN---LNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQ 387 (467) Q Consensus 311 i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~---~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~d 387 (467) |+|||+||+||||||||+||++|+|||||++.|++++. 
.++|+++.+|+|+|+|||+||||||+.+ ++|+| T Consensus 304 i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~------ns~~d 377 (457) T protein:vir:10 304 IGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSAN------VADKH 377 (457) T ss_pred HHHhhccccceEEEEchhHHHHHhhcccccccchhhccccccccccccceeEEEecCCeEEEEeccccc------CCccc Confidence 99999999999999999999999999999999999864 6789999999999999999999999964 56889 Q ss_pred eEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceeecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 388 YYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 388 Y~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) |++|||||++++|+||||||||||++++++||+||||++||||||||++|||+.+++++++++++|.|.||||++|+||| T Consensus 378 y~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~~~~~n~~~~rs~vs~ll 457 (457) T protein:vir:10 378 FYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGMVSNPFAGGLTQGSGALTVNANKYYRRVQVANLM 457 (457) T ss_pred eEEEEEeCCcceecceeecccccccccCccCCccccceeeeeeeeeeeecccccccccccccccccchhhcceeeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=5.5e-224 Score=1244.57 Aligned_cols=453 Identities=39% Similarity=0.682 Sum_probs=396.9 Q ss_pred cchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHh---------hhhhcch--------hhhhhhhhcc Q lcl|NC_015279. 
2 FQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQV---------AFEQGGM--------IAEQPTNAVG 64 (467) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~---------~~~~~~~--------~~e~~~~~~g 64 (467) ...|+|+|||+|||||||+|+|++.+||+|+++|||||||+++|++ ...+|.| |+|++++ | T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~--~ 78 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIG--G 78 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhccccccccc--c Confidence 5567999999999999999999999999999999999999987764 3455555 8887665 9 Q ss_pred cccc------cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCcccccc--c Q lcl|NC_015279. 65 NGGY------TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEALFDE--A 132 (467) Q Consensus 65 ~~~~------~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAlfnE--a 132 (467) |||| +|+++++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. ++|||||| + T Consensus 79 ~~g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~a 158 (534) T protein:vir:10 79 DHGYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGP 158 (534) T ss_pred ccccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccc Confidence 9987 4777999999999999999999999999999999999999999999999999874 67999999 9 Q ss_pred ccccccccccccccccccccccccC----------CCcccccccc-------------c------ccccccccccccccc Q lcl|NC_015279. 133 DTAFAGQNEGFDLTNGMSDAAAGLG----------TTSQAGSNPA-------------A------LNPVATASSTGYNVG 183 (467) Q Consensus 133 dt~fSg~~a~~~~~~~~~~~~~~~~----------~~~~agt~p~-------------~------ln~~~~~~~~~~~~~ 183 (467) |+.|||+++..+...+........+ ...+.|+.+. . 
......+....|+++ T Consensus 159 dt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~ 238 (534) T protein:vir:10 159 DADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETS 238 (534) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecc Confidence 9999998765433221111110000 0011111111 0 001112334568899 Q ss_pred cccchhhHhhcC----CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHH Q lcl|NC_015279. 184 QGMRTDEAEDLG----TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINRE 259 (467) Q Consensus 184 ~Gm~TA~aE~LG----s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINRE 259 (467) .||+|+.+|.|| +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+|||||||||||||| T Consensus 239 ~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINRe 318 (534) T protein:vir:10 239 SAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINRE 318 (534) T ss_pred cccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHH Confidence 999999999995 3456899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhhccccccccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHH Q lcl|NC_015279. 
260 VIRTIYKVSEQGAVSNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASA 332 (467) Q Consensus 260 II~~l~~~a~~~k~~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~ 332 (467) |||+|+++|+|||+.|+ +++|+|||+++.| +||++|+||.|++||++|+|+|+|+|+||+||||||||+||++ T Consensus 319 ii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~ 398 (534) T protein:vir:10 319 MVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAA 398 (534) T ss_pred HHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHH Confidence 99999999999999986 5689999999998 9999999999999999999999999999999999999999999 Q ss_pred Hhhhcchhcccccc--cccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccch Q lcl|NC_015279. 333 LTMAGVLDYTPALN--ANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVP 410 (467) Q Consensus 333 L~~sG~~~~~~~~~--~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~ 410 (467) |+|+|||++.|+.. .++++|+++.+|+|+|+|||+||||+|+. +||++|||||++++|+||||||||| T Consensus 399 L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~----------~dy~~vG~KG~~~~~~glfyaPYv~ 468 (534) T protein:vir:10 399 LGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVYIDQYAV----------EDYFTVGYKGASEMDAGLYYCPYVA 468 (534) T ss_pred HhhccchhccccccccccccccCCCceEEEEecCceEEEecCCCC----------cceEEEEEeCCcccccceeeccccc Confidence 99999999998765 56899999999999999999999999984 7999999999999999999999999 Q ss_pred hhcccccCCccccceeeeeeeeceeecCcccccCcccc-cccc---------ccccccceeeeecc Q lcl|NC_015279. 411 LQMVRAVGENTFQPKIGFKTRYGMVANPFAEGTTVGAG-RLRV---------NSNRYYRRVAVKNL 466 (467) Q Consensus 411 l~~~~~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~-~~~~---------~~n~y~r~~~v~~~ 466 (467) |+|+|++||+||||++||||||||++|||++++++.+. 
+|++ |+|.|||||+|||| T Consensus 469 l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 469 LTPLRGTDPKNFQPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cccccccCCccccceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 99999999999999999999999999999999999884 5554 67779999999999 No 6 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=9.5e-223 Score=1237.80 Aligned_cols=453 Identities=41% Similarity=0.698 Sum_probs=399.7 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHH-------HHHHHhhhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEK-------FMQEQVAFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~-------~~~e~~~~~~~~~~~e~~~~~~g~~~~----- 68 (467) |++.|+|+|||+|||||||+|+|++. ||+|+++||||||| ||+|++...+|.+|+||+++ ||||+ T Consensus 4 ~~~~e~l~~kw~p~l~~~~~~~~~~~-~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~--~~~~~~~~~i 80 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEGEGLPEIANS-KQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIG--GDHGYNAQNI 80 (522) T ss_pred cchHHHHHHhhHHHhcCCCCCccccc-hhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccc--cccCCCcccc Confidence 88999999999999999999999986 99999999999997 77778889999999999887 99998 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCccc--ccccccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEAL--FDEADTAFAGQNE 141 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAl--fnEadt~fSg~~a 141 (467) +|+++++|++|||+||+|+||++|||||+|||||||||||||||||||+||.+|. 
++|+| |+|+|+.|||.++ T Consensus 81 ~es~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~ 160 (522) T protein:vir:69 81 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGA 160 (522) T ss_pred cccccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCccccccccccccccccccccc Confidence 6778999999999999999999999999999999999999999999999999874 66776 5999999999876 Q ss_pred ccccccccccccc---------------------ccCCCcccccccccccc---cccccccccccccccchhhHhhc--- Q lcl|NC_015279. 142 GFDLTNGMSDAAA---------------------GLGTTSQAGSNPAALNP---VATASSTGYNVGQGMRTDEAEDL--- 194 (467) Q Consensus 142 ~~~~~~~~~~~~~---------------------~~~~~~~agt~p~~ln~---~~~~~~~~~~~~~Gm~TA~aE~L--- 194 (467) ............. .......++.++..++. ........|+++.||+|+.+|.+ T Consensus 161 ~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~l 240 (522) T protein:vir:69 161 AKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGF 240 (522) T ss_pred cccccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccC Confidence 5443222111100 00011122222222221 12344567899999999999986 Q ss_pred C-CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 
195 G-TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 195 G-s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) | +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+.+|+.+++ T Consensus 241 ggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~ 320 (522) T protein:vir:69 241 NGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKS 320 (522) T ss_pred CCCcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeecc Confidence 3 345689999999999999999999999999999999999999999999999999999999999999999888888777 Q ss_pred ccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccc Q lcl|NC_015279. 274 SNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN 346 (467) Q Consensus 274 ~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~ 346 (467) .++ +.+|+|||+++.| |||++|+||.|+|||+||+|+|+|+|+||+||||||||+||++|+|+|++++.++.+ T Consensus 321 g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~ 400 (522) T protein:vir:69 321 GMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQG 400 (522) T ss_pred ccccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhccccccccccc Confidence 665 5799999999998 999999999999999999999999999999999999999999999999999998876 Q ss_pred --cccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccc Q lcl|NC_015279. 
347 --ANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQP 424 (467) Q Consensus 347 --~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP 424 (467) .++++|+++++|+|+|+|||+||||+|+ ++||++|||||++++|+||||||||||+|+|++||+|||| T Consensus 401 ~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP 470 (522) T protein:vir:69 401 LASGFNTDTTKSVFAGVLGGKYRVYIDQYA----------KQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQP 470 (522) T ss_pred ccccccccCCCceEEEEecCceEEEecCCC----------CcceEEEEEeCCcccccceeeccccccccccccCCccccc Confidence 6789999999999999999999999998 4799999999999999999999999999999999999999 Q ss_pred eeeeeeeeceeecCcccccC-ccccccc---------cccccccceeeeecc Q lcl|NC_015279. 425 KIGFKTRYGMVANPFAEGTT-VGAGRLR---------VNSNRYYRRVAVKNL 466 (467) Q Consensus 425 ~~g~~tRY~l~~nP~~~~~~-~~~~~~~---------~~~n~y~r~~~v~~~ 466 (467) ++||||||||++|||+++.+ +.++||+ .|+|+|||||+|||| T Consensus 471 ~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 471 VMGFKTRYGIGVNPFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred eeeeeeeeceeecCcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 99999999999999999765 4566755 566999999999999 No 7 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=4.3e-222 Score=1234.19 Aligned_cols=454 Identities=42% Similarity=0.726 Sum_probs=405.5 Q ss_pred CcchHHHHHhhhhhhcc-CccchhcchhHHHHHHHHhhhHHHH-------HHHHhhhhhcchhhhhhhhhcccccc---- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNY-EGLDKISDPHRRAVTAVLLENQEKF-------MQEQVAFEQGGMIAEQPTNAVGNGGY---- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~v~~~~~enq~~~-------~~e~~~~~~~~~~~e~~~~~~g~~~~---- 68 (467) |++.|+|+|||+||||+ ||+|||++.|||+|+++|||||||+ ++|+....+|++|+|+++. 
|||++ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~--~~~~~~~~~ 78 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIA--GDHNYDQTN 78 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhccccccccc--ccccccccc Confidence 99999999999999986 8999999999999999999999994 5555678899999999886 89987 Q ss_pred --cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCC---CCCccccccc-------cccc Q lcl|NC_015279. 69 --TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQ---GGTEALFDEA-------DTAF 136 (467) Q Consensus 69 --~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~q---sGtEAlfnEa-------dt~f 136 (467) +|++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.++ .|+||+|||| |+.| T Consensus 79 i~~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~f 158 (524) T protein:vir:98 79 IASGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMY 158 (524) T ss_pred ccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCccccccccccccccccccccc Confidence 588899999999999999999999999999999999999999999999999998 4779999996 8999 Q ss_pred cccccccccccccccc---------------------ccccCCCccccccccccccccc---ccccccccccccchhhHh Q lcl|NC_015279. 137 AGQNEGFDLTNGMSDA---------------------AAGLGTTSQAGSNPAALNPVAT---ASSTGYNVGQGMRTDEAE 192 (467) Q Consensus 137 Sg~~a~~~~~~~~~~~---------------------~~~~~~~~~agt~p~~ln~~~~---~~~~~~~~~~Gm~TA~aE 192 (467) ||.++........... ....+...+++++|..++.... 
.....++++.||+|+.+| T Consensus 159 SG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aE 238 (524) T protein:vir:98 159 SGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAE 238 (524) T ss_pred CCccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhhh Confidence 9876554433222211 1112233456777776665443 344568999999999999 Q ss_pred hcC----CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhc Q lcl|NC_015279. 193 DLG----TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVS 268 (467) Q Consensus 193 ~LG----s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a 268 (467) +|+ +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+..+ T Consensus 239 aL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a 318 (524) T protein:vir:98 239 LQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTA 318 (524) T ss_pred hhccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhh Confidence 983 3567899999999999999999999999999999999999999999999999999999999999999887777 Q ss_pred cccccccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhh--hcch Q lcl|NC_015279. 
269 EQGAVSNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTM--AGVL 339 (467) Q Consensus 269 ~~~k~~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~--sG~~ 339 (467) +.++..++ +.+|+|||+++.| +||++|+||.|++||++|+|+|+|+|+||+||||||||+||++|+| +||+ T Consensus 319 ~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~ 398 (524) T protein:vir:98 319 QVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGIT 398 (524) T ss_pred eeceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccc Confidence 66655543 3369999988854 9999999999999999999999999999999999999999999999 8999 Q ss_pred hcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCC Q lcl|NC_015279. 340 DYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGE 419 (467) Q Consensus 340 ~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp 419 (467) +++++++.++++|+++.+|+|+|+|||+||||+|+ ++||++|||||++++|+||||||||||+|+|++|| T Consensus 399 ~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp 468 (524) T protein:vir:98 399 PASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQYA----------RQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDP 468 (524) T ss_pred cccchhhcccccCCccceEEEEecCceEEEecCCC----------CcceEEEEeeCCcccccceeeccccccccccccCC Confidence 99999999999999999999999999999999998 47999999999999999999999999999999999 Q ss_pred ccccceeeeeeeeceeecCcccccCcccc-cccc--------ccccccceeeeecc Q lcl|NC_015279. 
420 NTFQPKIGFKTRYGMVANPFAEGTTVGAG-RLRV--------NSNRYYRRVAVKNL 466 (467) Q Consensus 420 ~s~qP~~g~~tRY~l~~nP~~~~~~~~~~-~~~~--------~~n~y~r~~~v~~~ 466 (467) +||||++||||||||++|||+++.++.++ |+++ |+|.|||||+|||| T Consensus 469 ~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 469 KNFQPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred ccccceeeeeeeeceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 99999999999999999999999988765 8875 45789999999999 No 8 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=1.2e-221 Score=1231.71 Aligned_cols=453 Identities=40% Similarity=0.689 Sum_probs=401.1 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHH-------HHHHHhhhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEK-------FMQEQVAFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~-------~~~e~~~~~~~~~~~e~~~~~~g~~~~----- 68 (467) |++.|+|+|||+|||||||+|+|++. ||+||++||||||| |++|++...++.+|+|++++ |+|++ T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~~-~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~--~~~~~~~~~i 79 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEGEGLPEIANS-KQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIG--GDHGYNATNI 79 (521) T ss_pred cchhHHHHHhhhhhhccCCCCccccc-hhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhccc--Cccccccccc Confidence 99999999999999999999999986 99999999999996 66677788899999999887 88887 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCcccccc--cccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEALFDE--ADTAFAGQNE 141 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAlfnE--adt~fSg~~a 141 (467) +|++|++|+++||+||+||||++|||||+|||||||||||||||||||++|.+|. 
|+|+|+++ +|+.|||+++ T Consensus 80 ~es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~ 159 (521) T protein:vir:10 80 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGA 159 (521) T ss_pred cccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccccccc Confidence 6778999999999999999999999999999999999999999999999999984 67888765 9999999876 Q ss_pred ccccccccccccccc---------------------CCCcccccccccccc---cccccccccccccccchhhHhhcC-- Q lcl|NC_015279. 142 GFDLTNGMSDAAAGL---------------------GTTSQAGSNPAALNP---VATASSTGYNVGQGMRTDEAEDLG-- 195 (467) Q Consensus 142 ~~~~~~~~~~~~~~~---------------------~~~~~agt~p~~ln~---~~~~~~~~~~~~~Gm~TA~aE~LG-- 195 (467) ..+............ .....+++++..++. ........|+++.||+|+.+|+|+ T Consensus 160 at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~ 239 (521) T protein:vir:10 160 AKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESF 239 (521) T ss_pred ccccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhccC Confidence 544322211111000 001111222222221 123455678999999999999883 Q ss_pred --CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 196 --TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 196 --s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+..+++++. 
T Consensus 240 g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~ 319 (521) T protein:vir:10 240 NGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKS 319 (521) T ss_pred CCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeee Confidence 456789999999999999999999999999999999999999999999999999999999999999998888888777 Q ss_pred ccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccc Q lcl|NC_015279. 274 SNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN 346 (467) Q Consensus 274 ~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~ 346 (467) .++ +.+|+|||+++.| +||++|+||+|+|||+||||+|+|+|+||+||||||||+||++|+|+|.+++.++.+ T Consensus 320 g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~ 399 (521) T protein:vir:10 320 GMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQG 399 (521) T ss_pred eeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccc Confidence 666 4589999999888 999999999999999999999999999999999999999999999999999988885 Q ss_pred --cccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccc Q lcl|NC_015279. 
347 --ANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQP 424 (467) Q Consensus 347 --~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP 424 (467) .++++|+|+++|+|+|+|||+||||+|+ ++||++|||||++++|+||||||||||+|+|++||+|||| T Consensus 400 ~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP 469 (521) T protein:vir:10 400 LATGFNTDTTKSVFAGVLGGKYRVYIDQYA----------KQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQP 469 (521) T ss_pred ccccccccCCCceEEEEecCceEEEecCCC----------CcceEEEEEeCCcccccceeeccccccccccccCCccccc Confidence 6789999999999999999999999997 4799999999999999999999999999999999999999 Q ss_pred eeeeeeeeceeecCcccccCccccccccc----------cccccceeeeecc Q lcl|NC_015279. 425 KIGFKTRYGMVANPFAEGTTVGAGRLRVN----------SNRYYRRVAVKNL 466 (467) Q Consensus 425 ~~g~~tRY~l~~nP~~~~~~~~~~~~~~~----------~n~y~r~~~v~~~ 466 (467) ++||||||||++|||+++.++.++|+|++ +|.|||||+|||| T Consensus 470 ~~g~~tRY~l~~NP~~~~~~~~~~~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 470 VMGFKTRYGIGINPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeeeeeeceeecCcccccCCccceeecccchhhhccccccceeeeeeecCC Confidence 99999999999999999999999988764 5789999999999 No 9 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=1.5e-221 Score=1231.20 Aligned_cols=454 Identities=40% Similarity=0.657 Sum_probs=396.5 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHH-------HHHHHhhhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 
1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEK-------FMQEQVAFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~-------~~~e~~~~~~~~~~~e~~~~~~g~~~~----- 68 (467) |+++|+|+|||+|||||||+|+|++.+||+|+++||||||| ||+|++...+|++|+|++++ ||||+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~--~~~~~~~~~i 78 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVA--GDHGYDASQI 78 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccc--cccCCccccc Confidence 99999999999999999999999999999999999999999 88888999999999999887 99998 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCccc--ccccccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEAL--FDEADTAFAGQNE 141 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAl--fnEadt~fSg~~a 141 (467) +|++|++|++|||+||+||||++|||||+|||||||||||||||||||+||+++. ++||| |+++++.||+..+ T Consensus 79 ~es~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t 158 (528) T protein:vir:80 79 AAGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAA 158 (528) T ss_pred cccccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccccccc Confidence 4678999999999999999999999999999999999999999999999999874 56665 4578888876544 Q ss_pred ccccccccccccc--------------------------------ccCCCcccccccc-cccccccccccccccccccch Q lcl|NC_015279. 142 GFDLTNGMSDAAA--------------------------------GLGTTSQAGSNPA-ALNPVATASSTGYNVGQGMRT 188 (467) Q Consensus 142 ~~~~~~~~~~~~~--------------------------------~~~~~~~agt~p~-~ln~~~~~~~~~~~~~~Gm~T 188 (467) ............. ........++++. 
.......+....|+++.||+| T Consensus 159 ~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~T 238 (528) T protein:vir:80 159 KGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMAT 238 (528) T ss_pred cccccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccch Confidence 3222111100000 0000011111111 112233455667899999999 Q ss_pred hhHhhc---C-CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHH Q lcl|NC_015279. 189 DEAEDL---G-TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTI 264 (467) Q Consensus 189 A~aE~L---G-s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l 264 (467) +.+|.+ | +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||||||||||++| T Consensus 239 a~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i 318 (528) T protein:vir:80 239 SIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVI 318 (528) T ss_pred hhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Confidence 999965 4 456789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhc Q lcl|NC_015279. 
265 YKVSEQGAVSNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAG 337 (467) Q Consensus 265 ~~~a~~~k~~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG 337 (467) +..|+++++.++ ..+|+|||+++.| +||++|+||.|+|||+||+|+|+|+|+||+||||||||+||++|+|+| T Consensus 319 ~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g 398 (528) T protein:vir:80 319 NFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASAD 398 (528) T ss_pred hheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcc Confidence 999999998776 4589999997776 899999999999999999999999999999999999999999999998 Q ss_pred c--hhcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhccc Q lcl|NC_015279. 338 V--LDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVR 415 (467) Q Consensus 338 ~--~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~ 415 (467) . ..+.++++..+++|+++.+|+|+|+|||+||||+|+ ++||++|||||++++|+||||||||||.|++ T Consensus 399 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~ 468 (528) T protein:vir:80 399 QGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYA----------RQDYFTVGYKGDNEMDAGIYYAPYVALTPLR 468 (528) T ss_pred ccccccccccccccccCCCCceEEEEecCceEEEecCCC----------CcceEEEEEeCCcccccceeecccccceeeE Confidence 4 455566667899999999999999999999999998 4799999999999999999999999999999 Q ss_pred ccCCccccceeeeeeeeceeecCcccccCcc-ccccc--------cccccccceeeeecc Q lcl|NC_015279. 416 AVGENTFQPKIGFKTRYGMVANPFAEGTTVG-AGRLR--------VNSNRYYRRVAVKNL 466 (467) Q Consensus 416 ~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~-~~~~~--------~~~n~y~r~~~v~~~ 466 (467) ++||+||||++||||||||++|||+++.++. 
++|++ .|+|.|||||+|||| T Consensus 469 ~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 469 ATDPQSFHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeCCccccceeeeeeeeceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999999999999886 56776 456899999999999 No 10 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=1.1e-221 Score=1231.85 Aligned_cols=450 Identities=41% Similarity=0.701 Sum_probs=393.1 Q ss_pred HHHHHhhhhhhccCc--cchhcchhHHHHHHHHhhhHHHHHHHHhh-------hhhcchhhhhhhhhcccccc------c Q lcl|NC_015279. 5 EQLQEKWAPLLNYEG--LDKISDPHRRAVTAVLLENQEKFMQEQVA-------FEQGGMIAEQPTNAVGNGGY------T 69 (467) Q Consensus 5 ~~l~~kw~p~l~~~~--~~~i~~~~~~~v~~~~~enq~~~~~e~~~-------~~~~~~~~e~~~~~~g~~~~------~ 69 (467) -+|+|||+||||||| +|+|++.|||+|+++|||||||+++|++. ..++++|+|+++| |||++ + T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~--~~~~~~~~~ia~ 78 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVN--GDHGYDPANIAQ 78 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccc--cccccccccccc Confidence 599999999999998 89999999999999999999999988754 5788999999998 99988 5 Q ss_pred ccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCC--CCCcccc--cccccccccccccccc Q lcl|NC_015279. 70 SSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQ--GGTEALF--DEADTAFAGQNEGFDL 145 (467) Q Consensus 70 st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~q--sGtEAlf--nEadt~fSg~~a~~~~ 145 (467) |++|++|+++||+||+||||++|||||+|||||||||||||||||||++|.+| +|+|||| ||+|+.|||++++... 
T Consensus 79 s~~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~~~~~ 158 (514) T protein:vir:56 79 GVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTI 158 (514) T ss_pred ccccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccCcCcccccccccc Confidence 78899999999999999999999999999999999999999999999999998 6889999 9999999997765443 Q ss_pred cccccccccccCCC-------------------------cccccccccccccccccccccccccccchhhHhhc---C-C Q lcl|NC_015279. 146 TNGMSDAAAGLGTT-------------------------SQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDL---G-T 196 (467) Q Consensus 146 ~~~~~~~~~~~~~~-------------------------~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~L---G-s 196 (467) ...........+.. ...+...........+....|+++.||+|+.+|.+ | + T Consensus 159 ~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs 238 (514) T protein:vir:56 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCC Confidence 22211111100000 00011111111122344556889999999999985 3 4 Q ss_pred CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHh---hhcccccc Q lcl|NC_015279. 
197 SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIY---KVSEQGAV 273 (467) Q Consensus 197 ~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~---~~a~~~k~ 273 (467) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+ +|+++|++ T Consensus 239 ~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~ 318 (514) T protein:vir:56 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) T ss_pred cccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccc Confidence 567899999999999999999999999999999999999999999999999999999999999998886 56677788 Q ss_pred cccccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccc---c Q lcl|NC_015279. 274 SNTATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN---A 347 (467) Q Consensus 274 ~~~~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~---~ 347 (467) .+++++|+|||+++.| +||++|+||.|+|||++|+|+|+|+|+||+||||||||+||++|+|+|||++.++.. + T Consensus 319 ~~~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~ 398 (514) T protein:vir:56 319 QGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDG 398 (514) T ss_pred cccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCcccc Confidence 8999999999997776 799999999999999999999999999999999999999999999999998866664 4 Q ss_pred ccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceee Q lcl|NC_015279. 348 NLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIG 427 (467) Q Consensus 348 ~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g 427 (467) ++++|+++.+|+|+|+|||+||||+|+. 
+||++|||||++++|+||||||||||++++++||+||||++| T Consensus 399 ~~~~d~~~~~~aG~l~~~~~vy~D~y~~----------~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g 468 (514) T protein:vir:56 399 SMNTDTNQTVFAGVLGGRFKVYIDQYAV----------NDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIG 468 (514) T ss_pred ccccccCcceEEEEecCceEEEecCCCC----------cceEEEEEecCcceecceeeccccccccccccCCccccceee Confidence 7999999999999999999999999984 799999999999999999999999999999999999999999 Q ss_pred eeeeeceeecCcccccC-------ccccccccccccccceeeeecc Q lcl|NC_015279. 428 FKTRYGMVANPFAEGTT-------VGAGRLRVNSNRYYRRVAVKNL 466 (467) Q Consensus 428 ~~tRY~l~~nP~~~~~~-------~~~~~~~~~~n~y~r~~~v~~~ 466 (467) |||||||++|||++.+. +.+-....++|.|||||+|||| T Consensus 469 ~~tRY~l~~NPy~~~~~~~~~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 469 FKTRYGVQVNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred eeeeeceeeCCCCCccccccccCCcchhhhcccccceeeeEEEecC Confidence 99999999999988543 3333455689999999999999 No 11 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=2.3e-221 Score=1230.18 Aligned_cols=453 Identities=41% Similarity=0.690 Sum_probs=399.7 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHH-------HHHHHhhhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEK-------FMQEQVAFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~-------~~~e~~~~~~~~~~~e~~~~~~g~~~~----- 68 (467) |++.|+|+|||+|||||||+|+|++. 
||+||++||||||| |++|+...+++.+|+|++++ |+|++ T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~~-~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~--~~~~~~~~~i 79 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEGEGLPEIANS-KQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIG--GDHGYNATNI 79 (521) T ss_pred cchhHHHHHhhhhhhccCCCCccccc-hhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhccc--CccccCcccc Confidence 99999999999999999999999986 99999999999997 55566677889999999876 88887 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCcccccc--cccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEALFDE--ADTAFAGQNE 141 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAlfnE--adt~fSg~~a 141 (467) +|++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. |+|+||+| +|+.|||+++ T Consensus 80 aes~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~ 159 (521) T protein:vir:72 80 AAGQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGA 159 (521) T ss_pred cccccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhcccccccccccc Confidence 6788999999999999999999999999999999999999999999999999985 78999987 7889998876 Q ss_pred cccccccccccc--------------------cccCCCcccccccccc-c---ccccccccccccccccchhhHhhc--- Q lcl|NC_015279. 142 GFDLTNGMSDAA--------------------AGLGTTSQAGSNPAAL-N---PVATASSTGYNVGQGMRTDEAEDL--- 194 (467) Q Consensus 142 ~~~~~~~~~~~~--------------------~~~~~~~~agt~p~~l-n---~~~~~~~~~~~~~~Gm~TA~aE~L--- 194 (467) ............ .....+.+.|++.... 
+ .........|+++.||+|+.+|.+ T Consensus 160 ~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~ 239 (521) T protein:vir:72 160 AKKFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGF 239 (521) T ss_pred cccccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhccc Confidence 543222111110 0011111222221111 1 122344567899999999999986 Q ss_pred C-CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 195 G-TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 195 G-s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) | ++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+..+++++. T Consensus 240 g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~ 319 (521) T protein:vir:72 240 NGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKS 319 (521) T ss_pred CCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeee Confidence 3 345689999999999999999999999999999999999999999999999999999999999999998888887777 Q ss_pred ccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccc Q lcl|NC_015279. 
274 SNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN 346 (467) Q Consensus 274 ~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~ 346 (467) .++ +.+|+|||+++.| +||++|+||+|+|||+||||+|+|+|+||+||||||||+||++|+|+|.+++.++.+ T Consensus 320 g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~ 399 (521) T protein:vir:72 320 GMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQG 399 (521) T ss_pred eeeeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccc Confidence 666 4589999999888 999999999999999999999999999999999999999999999999999999986 Q ss_pred --cccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccc Q lcl|NC_015279. 347 --ANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQP 424 (467) Q Consensus 347 --~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP 424 (467) .+++.|+|+++|+|+|+|||+||||+|+ ++||++|||||++++|+||||||||||+|+|++||+|||| T Consensus 400 ~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP 469 (521) T protein:vir:72 400 LATGFSTDTTKSVFAGVLGGKYRVYIDQYA----------KQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQP 469 (521) T ss_pred ccccccccCCCceEEEEccCceEEEecCCC----------CcceEEEEEeCCcccccceeeccccccccccccCCccccc Confidence 6789999999999999999999999998 4799999999999999999999999999999999999999 Q ss_pred eeeeeeeeceeecCcccccCccccccccc----------cccccceeeeecc Q lcl|NC_015279. 
425 KIGFKTRYGMVANPFAEGTTVGAGRLRVN----------SNRYYRRVAVKNL 466 (467) Q Consensus 425 ~~g~~tRY~l~~nP~~~~~~~~~~~~~~~----------~n~y~r~~~v~~~ 466 (467) ++||||||||++|||+++.++.++|.|++ +|.|||||+|||| T Consensus 470 ~~g~~tRY~l~~NP~~~~~~~~~a~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 470 VMGFKTRYGIGINPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeeeeeeceeecCcccccCcccceeecCcChhhhcCccccceeeeeeecCC Confidence 99999999999999999999999988864 5679999999999 No 12 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=4.2e-221 Score=1228.75 Aligned_cols=454 Identities=40% Similarity=0.675 Sum_probs=390.7 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHh-------hhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQV-------AFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~-------~~~~~~~~~e~~~~~~g~~~~----- 68 (467) -.++|+|+|||+|||||||+|+|++.+||+|+++|||||||+++|++ ...++.+|+|++++ |+|++ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~--~~~~~~~~~i 79 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVA--GDHGYDPTNI 79 (529) T ss_pred cccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhcc--cccccccccc Confidence 34567999999999999999999999999999999999999988875 56788899999987 88876 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCcccccc--cccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEALFDE--ADTAFAGQNE 141 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAlfnE--adt~fSg~~a 141 (467) +|++|++|++|||+||+||||++|||||+|||||||||||||||||||++|.++. +.|+||++ +++.||+.+. 
T Consensus 80 ~est~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLAT 159 (529) T ss_pred ccccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccccc Confidence 6788999999999999999999999999999999999999999999999998874 34566555 4445554432 Q ss_pred cccccccccc--------------------------ccc--ccCCCccccccc-----ccccccccccccccccccccch Q lcl|NC_015279. 142 GFDLTNGMSD--------------------------AAA--GLGTTSQAGSNP-----AALNPVATASSTGYNVGQGMRT 188 (467) Q Consensus 142 ~~~~~~~~~~--------------------------~~~--~~~~~~~agt~p-----~~ln~~~~~~~~~~~~~~Gm~T 188 (467) .......... ..+ ..+...+.++++ ........+....++++.||+| T Consensus 160 ~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~T 239 (529) T protein:vir:10 160 KGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMAT 239 (529) T ss_pred cccccccCccccccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccch Confidence 2211110000 000 000111111111 1112223345677899999999 Q ss_pred hhHhhcC----CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHH Q lcl|NC_015279. 
189 DEAEDLG----TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTI 264 (467) Q Consensus 189 A~aE~LG----s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l 264 (467) +.+|.|| +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+|||||||||||||||||+| T Consensus 240 a~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l 319 (529) T protein:vir:10 240 SIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhH Confidence 9999994 345689999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhc Q lcl|NC_015279. 265 YKVSEQGAVSNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAG 337 (467) Q Consensus 265 ~~~a~~~k~~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG 337 (467) |++|+|||+.|+ +.+|+|||+++.+ +||++|+||.|++||++|+|+|+|+|+||+||||||||+||++|+|+| T Consensus 320 ~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALID 399 (529) T ss_pred hhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhh Confidence 999999999988 6689999998866 999999999999999999999999999999999999999999999999 Q ss_pred chhccccc--ccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhccc Q lcl|NC_015279. 338 VLDYTPAL--NANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVR 415 (467) Q Consensus 338 ~~~~~~~~--~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~ 415 (467) ++++.+.. 
....++|+++.+|+|+|+|||+||||+|+ ++||++|||||++++|+||||||||||+|+| T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~ 469 (529) T protein:vir:10 400 TNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYA----------RQDYFTMGYRGANNLDAGIYYCPYVALTPLR 469 (529) T ss_pred hhccccccccccccccccCCceEEEEecCceEEEecCCC----------CcceEEEEEeCCcccccceeecccccccccc Confidence 99886544 45678999999999999999999999997 4799999999999999999999999999999 Q ss_pred ccCCccccceeeeeeeeceeecCcccccCcc-cccccc--------ccccccceeeeecc Q lcl|NC_015279. 416 AVGENTFQPKIGFKTRYGMVANPFAEGTTVG-AGRLRV--------NSNRYYRRVAVKNL 466 (467) Q Consensus 416 ~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~-~~~~~~--------~~n~y~r~~~v~~~ 466 (467) ++||+||||++||||||||++|||+++.++. ++|+++ |+|.|||||+|||| T Consensus 470 ~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 470 GSDPKNFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccCCCcccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 9999999999999999999999999988775 667764 46889999999999 No 13 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=2.4e-220 Score=1224.62 Aligned_cols=453 Identities=41% Similarity=0.688 Sum_probs=394.5 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHh-------hhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 
1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQV-------AFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~-------~~~~~~~~~e~~~~~~g~~~~----- 68 (467) -.++|+|+|||+|||||||+|+|++.+||+|+++|||||||+++|++ ...++.+|+|++++ |+|++ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~--~~~~~~~~~i 79 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVA--GDHGYDPTNI 79 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccc--cccccccccc Confidence 34677999999999999999999999999999999999999998874 57788899999887 88877 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCcccccc--cccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEALFDE--ADTAFAGQNE 141 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAlfnE--adt~fSg~~a 141 (467) +|++|++|++|||+||+||||++|||||+|||||||||||||||||||++|.++. +.|+||++ +++.||+.+. T Consensus 80 ~~st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLAT 159 (529) T ss_pred ccccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccccc Confidence 5778999999999999999999999999999999999999999999999999874 57888887 6777777654 Q ss_pred cccccccccccc--------------------------c--ccCCCccccccccc-----ccccccccccccccccccch Q lcl|NC_015279. 142 GFDLTNGMSDAA--------------------------A--GLGTTSQAGSNPAA-----LNPVATASSTGYNVGQGMRT 188 (467) Q Consensus 142 ~~~~~~~~~~~~--------------------------~--~~~~~~~agt~p~~-----ln~~~~~~~~~~~~~~Gm~T 188 (467) .+..+....... . ..+.+.+.++++.. 
......+....++++.||+| T Consensus 160 ~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsT 239 (529) T protein:vir:10 160 KGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhh Confidence 333221110000 0 00111222222221 12223345667899999999 Q ss_pred hhHhhcC----CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHH Q lcl|NC_015279. 189 DEAEDLG----TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTI 264 (467) Q Consensus 189 A~aE~LG----s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l 264 (467) +.+|.|| +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+|||||||||||||||||+| T Consensus 240 a~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l 319 (529) T protein:vir:10 240 SIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHH Confidence 9999994 355789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccccc----ceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhc Q lcl|NC_015279. 
265 YKVSEQGAVSNTAT----AGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAG 337 (467) Q Consensus 265 ~~~a~~~k~~~~~~----~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG 337 (467) |++|+|||+.|+.+ +|+|||+++.+ +||++|+||.|++||++|+|+|+|+|+||+||||||||+||++|+|+| T Consensus 320 ~~~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALID 399 (529) T ss_pred hhhhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhc Confidence 99999999999854 59999998866 999999999999999999999999999999999999999999999999 Q ss_pred chhccccc---ccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcc Q lcl|NC_015279. 338 VLDYTPAL---NANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMV 414 (467) Q Consensus 338 ~~~~~~~~---~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~ 414 (467) ++ +.|++ ....++|+++.+|+|+|+|||+||||+|+ ++||++|||||++++|+|||||||||++|+ T Consensus 400 ~~-~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfy~PYv~l~~~ 468 (529) T protein:vir:10 400 TN-ISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYA----------RQDYFTMGYRGANNLDAGIYYCPYVALTPL 468 (529) T ss_pred cc-ccccccccccccccccCCceEEEEecCceEEEecCCC----------CcceEEEEEeCCcccccceeeccccccccc Confidence 65 55554 34578999999999999999999999997 479999999999999999999999999999 Q ss_pred cccCCccccceeeeeeeeceeecCcccccCcc-cccccc--------ccccccceeeeecc Q lcl|NC_015279. 415 RAVGENTFQPKIGFKTRYGMVANPFAEGTTVG-AGRLRV--------NSNRYYRRVAVKNL 466 (467) Q Consensus 415 ~~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~-~~~~~~--------~~n~y~r~~~v~~~ 466 (467) |++||+||||++||||||||++|||+++.++. 
++|+++ |+|.|||||+|||| T Consensus 469 ~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 469 RGFDPKNFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred cccCCCcccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 99999999999999999999999999987775 667764 46889999999999 No 14 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=3.9e-219 Score=1217.96 Aligned_cols=454 Identities=39% Similarity=0.644 Sum_probs=396.4 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHH-------HHHHHhhhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEK-------FMQEQVAFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~-------~~~e~~~~~~~~~~~e~~~~~~g~~~~----- 68 (467) |+++|+|+|||+|||||||+|+|++.+||+|+++||||||| ||+|+....++.+|+|++++ ||||+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~--~~~~~~~~~i 78 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVA--GDHGYDASQI 78 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhccc--ccccccchhc Confidence 99999999999999999999999999999999999999999 77778889999999999887 88887 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC-------------CCccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG-------------GTEALFDEADT 134 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs-------------GtEAlfnEadt 134 (467) +|++|++|++|||+||+||||++|||||+|||||||||||||||||||++|.++. 
|+||+|+|+++ T Consensus 79 ~es~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t 158 (528) T protein:vir:66 79 AAGQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAA 158 (528) T ss_pred cccccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccccccccccc Confidence 6788999999999999999999999999999999999999999999999998874 56889999888 Q ss_pred cccccccccccccc----------c--------cccccc-------cCCCcccccccc-cccccccccccccccccccch Q lcl|NC_015279. 135 AFAGQNEGFDLTNG----------M--------SDAAAG-------LGTTSQAGSNPA-ALNPVATASSTGYNVGQGMRT 188 (467) Q Consensus 135 ~fSg~~a~~~~~~~----------~--------~~~~~~-------~~~~~~agt~p~-~ln~~~~~~~~~~~~~~Gm~T 188 (467) .|+..++.+..... . +...++ .......++.+. .......+....++++.||+| T Consensus 159 ~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~T 238 (528) T protein:vir:66 159 KEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMAT 238 (528) T ss_pred ccccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccch Confidence 88765432211000 0 000000 000111122221 122233455667899999999 Q ss_pred hhHhhc---C-CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHH Q lcl|NC_015279. 
189 DEAEDL---G-TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTI 264 (467) Q Consensus 189 A~aE~L---G-s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l 264 (467) +.+|.+ | ++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++| T Consensus 239 a~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i 318 (528) T protein:vir:66 239 SIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVI 318 (528) T ss_pred hhhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhh Confidence 999985 4 345789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhc Q lcl|NC_015279. 265 YKVSEQGAVSNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAG 337 (467) Q Consensus 265 ~~~a~~~k~~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG 337 (467) +..|+++++.++ +.+|+|||+++.| +||++|+||.|+|||+||+|+|+|+|+||+||||||||+||++|+|+| T Consensus 319 ~~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g 398 (528) T protein:vir:66 319 NFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASAD 398 (528) T ss_pred hheeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcc Confidence 999999998776 4579999997766 699999999999999999999999999999999999999999999998 Q ss_pred ch--hcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhccc Q lcl|NC_015279. 
338 VL--DYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVR 415 (467) Q Consensus 338 ~~--~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~ 415 (467) .+ .+.++++..+++|+++.+|+|+|+|||+||||+|+ ++||++|||||++++|+|||||||||+.|++ T Consensus 399 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~ 468 (528) T protein:vir:66 399 QGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQYA----------RQDYFTVGYKGDNEMDAGIYYAPYVALTPLR 468 (528) T ss_pred ccccccccccccccccCCCCceeEEEecCceEEEecCCC----------CcceEEEEEeCCcccccceeecccccceeeE Confidence 55 45555567899999999999999999999999997 4799999999999999999999999999999 Q ss_pred ccCCccccceeeeeeeeceeecCcccccCcc-ccccc--------cccccccceeeeecc Q lcl|NC_015279. 416 AVGENTFQPKIGFKTRYGMVANPFAEGTTVG-AGRLR--------VNSNRYYRRVAVKNL 466 (467) Q Consensus 416 ~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~-~~~~~--------~~~n~y~r~~~v~~~ 466 (467) ++||+||||++||||||||++|||++++++. ++|++ .|+|.|||||+|||| T Consensus 469 ~~dp~sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 469 ATDPQSFHPVLGFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeCCccccceeeeeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999999999998665 67776 456889999999999 No 15 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=6.2e-218 Score=1211.40 Aligned_cols=454 Identities=41% Similarity=0.682 Sum_probs=395.9 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHh-------hhhhcchhhhhhhhhcccccc----- Q lcl|NC_015279. 
1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQV-------AFEQGGMIAEQPTNAVGNGGY----- 68 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~-------~~~~~~~~~e~~~~~~g~~~~----- 68 (467) -+++|+|+|||+|||||||+|+|++.+||+|+++|||||||+++|++ ...++.+|+|++++ |+||+ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~--~~~~~~~~~i 79 (529) T protein:vir:10 2 SLKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVA--GDHGYDPTNI 79 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcc--cccccccccc Confidence 46788999999999999999999999999999999999999999876 45677789998877 77765 Q ss_pred -cccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCcc--cccccccccccccc Q lcl|NC_015279. 69 -TSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEA--LFDEADTAFAGQNE 141 (467) Q Consensus 69 -~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEA--lfnEadt~fSg~~a 141 (467) +|++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. |+|+ +++|||+.|||.+. T Consensus 80 a~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~ 159 (529) T protein:vir:10 80 AAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAA 159 (529) T ss_pred cccccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccccc Confidence 6888999999999999999999999999999999999999999999999999884 5565 46899999998765 Q ss_pred ccccccccccc----------------------------ccccCCCcccccc-----cccccccccccccccccccccch Q lcl|NC_015279. 142 GFDLTNGMSDA----------------------------AAGLGTTSQAGSN-----PAALNPVATASSTGYNVGQGMRT 188 (467) Q Consensus 142 ~~~~~~~~~~~----------------------------~~~~~~~~~agt~-----p~~ln~~~~~~~~~~~~~~Gm~T 188 (467) ++......... 
....++..+.+++ +........+....++++.||+| T Consensus 160 ~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsT 239 (529) T protein:vir:10 160 KGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMAT 239 (529) T ss_pred ccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccch Confidence 44322211100 0111111222221 22222333445567899999999 Q ss_pred hhHhhcC----CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHH Q lcl|NC_015279. 189 DEAEDLG----TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTI 264 (467) Q Consensus 189 A~aE~LG----s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l 264 (467) +.+|.|+ ++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+|||||||||||||||||+| T Consensus 240 a~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i 319 (529) T protein:vir:10 240 SIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWI 319 (529) T ss_pred hhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHh Confidence 9999983 456789999999999999999999999999999999999999999999999999999999999999988 Q ss_pred hhhccccccccc----ccceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhc Q lcl|NC_015279. 
265 YKVSEQGAVSNT----ATAGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAG 337 (467) Q Consensus 265 ~~~a~~~k~~~~----~~~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG 337 (467) +.+|++++..++ +.+|+|||+++.| +||++|+||.|++||++|+|+|+|+|+||+||||||||+||++|+|.| T Consensus 320 ~~~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~ 399 (529) T protein:vir:10 320 NYTAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVD 399 (529) T ss_pred hhhceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhc Confidence 888876655443 5789999998876 899999999999999999999999999999999999999999999999 Q ss_pred chhcccccc--cccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhccc Q lcl|NC_015279. 338 VLDYTPALN--ANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVR 415 (467) Q Consensus 338 ~~~~~~~~~--~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~ 415 (467) .+++.++.. .++++|+++.+|+|+|+|||+||||+|+ ++||++|||||++++|+||||||||||+|+| T Consensus 400 ~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~----------~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~ 469 (529) T protein:vir:10 400 AGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYA----------RQDYFTMGYRGANNLDAGIYYCPYVALTPLR 469 (529) T ss_pred cccccccccccccceeecCCceEEEEecCceEEEecCCC----------CcceEEEEEeCCcccccceeecccccccccc Confidence 999888884 5688999999999999999999999997 4799999999999999999999999999999 Q ss_pred ccCCccccceeeeeeeeceeecCcccccCcc-ccccc--------cccccccceeeeecc Q lcl|NC_015279. 416 AVGENTFQPKIGFKTRYGMVANPFAEGTTVG-AGRLR--------VNSNRYYRRVAVKNL 466 (467) Q Consensus 416 ~~Dp~s~qP~~g~~tRY~l~~nP~~~~~~~~-~~~~~--------~~~n~y~r~~~v~~~ 466 (467) ++||+||||++||||||||++|||++++++. 
++|++ .|+|.|||||+|||| T Consensus 470 ~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 470 GSDPKNFQPVMGFKTRYAIGVNPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccCCCcccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 9999999999999999999999999999886 66776 567899999999999 No 16 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=3.7e-217 Score=1207.11 Aligned_cols=453 Identities=40% Similarity=0.676 Sum_probs=392.4 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHH-------Hhhhhhcchhhhhhhhhccccccc---- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQE-------QVAFEQGGMIAEQPTNAVGNGGYT---- 69 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e-------~~~~~~~~~~~e~~~~~~g~~~~~---- 69 (467) |.+ |+|+|||+|||||||+|+|++.|||+|+++|||||||++.| +....++.||+|++++ |+|+++ T Consensus 1 ~~~-~~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~--~~~~~~~t~i 77 (519) T protein:vir:10 1 MKK-NALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIG--GDHGYDATNI 77 (519) T ss_pred Cch-hHHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccC--CccccCcccc Confidence 655 59999999999999999999999999999999999996555 4567888999999887 999985 Q ss_pred --ccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC----CCccc--ccccccccccccc Q lcl|NC_015279. 70 --SSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG----GTEAL--FDEADTAFAGQNE 141 (467) Q Consensus 70 --st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----GtEAl--fnEadt~fSg~~a 141 (467) ++++++|+++||+||+|+||++|||||+|||||||||||||||||||+||.++. 
|+|+| |+|||+.|||+++ T Consensus 78 ~~~~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~ 157 (519) T protein:vir:10 78 AAGQTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGA 157 (519) T ss_pred ccccccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCcccc Confidence 557999999999999999999999999999999999999999999999999875 55554 6999999999876 Q ss_pred ccccccccccccccc--------------------CCCcccccccc-cc---cccccccccccccccccchhhHhhc--- Q lcl|NC_015279. 142 GFDLTNGMSDAAAGL--------------------GTTSQAGSNPA-AL---NPVATASSTGYNVGQGMRTDEAEDL--- 194 (467) Q Consensus 142 ~~~~~~~~~~~~~~~--------------------~~~~~agt~p~-~l---n~~~~~~~~~~~~~~Gm~TA~aE~L--- 194 (467) +.............. ..+.+++++.. .+ ..........++++.||+|+.+|.+ T Consensus 158 ~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~l 237 (519) T protein:vir:10 158 AETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGF 237 (519) T ss_pred ccccccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccC Confidence 544332221110000 01112222111 11 1223344567899999999999985 Q ss_pred C-CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 195 G-TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 195 G-s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) | +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||||||||||++|+.+|+.++. 
T Consensus 238 ggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~ 317 (519) T protein:vir:10 238 NGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKS 317 (519) T ss_pred CCccccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhccee Confidence 3 345689999999999999999999999999999999999999999999999999999999999999988777776665 Q ss_pred ccccc----ceeEEeecccc---chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccc Q lcl|NC_015279. 274 SNTAT----AGVFDLDIDSN---GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN 346 (467) Q Consensus 274 ~~~~~----~gv~Dl~~~~~---~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~ 346 (467) ..+.+ +|+|||+++.| +||++|+||+|+|||+||+|+|+|+|+||+||||||||+||++|+|+|++++.++.. T Consensus 318 g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~ 397 (519) T protein:vir:10 318 GMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQG 397 (519) T ss_pred ecccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhcccccc Confidence 55433 69999998866 999999999999999999999999999999999999999999999999998888775 Q ss_pred --cccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccc Q lcl|NC_015279. 347 --ANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQP 424 (467) Q Consensus 347 --~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP 424 (467) ..+++|+++.+|+|+|+|||+||||+|+. 
+||++|||||++++|+||||||||||+|+|++||+|||| T Consensus 398 ~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~----------~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP 467 (519) T protein:vir:10 398 LGQGFNVDTTKAVFAGVLGGKYRVYIDQYAR----------SDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQP 467 (519) T ss_pred ccccccccCCCceEEEEecCceEEEecCCCC----------cceEEEEEecCcccccceeeccccccccccccCCccccc Confidence 46899999999999999999999999984 799999999999999999999999999999999999999 Q ss_pred eeeeeeeeceeecCcccccCcc-ccccccc---------cccccceeeeecc Q lcl|NC_015279. 425 KIGFKTRYGMVANPFAEGTTVG-AGRLRVN---------SNRYYRRVAVKNL 466 (467) Q Consensus 425 ~~g~~tRY~l~~nP~~~~~~~~-~~~~~~~---------~n~y~r~~~v~~~ 466 (467) ++||||||||++|||++++++. +.+++++ .|.|||||+|||| T Consensus 468 ~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 468 VMGFKTRYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred eeeeeeeeceeecCcccccccCccceeccCchhhhccccCceeeeeeeeecC Confidence 9999999999999999887655 4577764 5899999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=4.9e-193 Score=1074.88 Aligned_cols=411 Identities=26% Similarity=0.391 Sum_probs=336.1 Q ss_pred Ccc---hHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhccccccccccccccc Q lcl|NC_015279. 
1 MFQ---SEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVA 77 (467) Q Consensus 1 ~~~---~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~ 77 (467) |++ +|+|+|||+||||.+| ++|||+||++||||||| |++ ++|.|++ .++.|+ T Consensus 1 ~~~~~~~e~l~~kw~p~l~~~~-----~~~~~~~~a~llenq~~---~~~-----~~l~e~~------------~~~~~~ 55 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEGCR-----NDWERHTLATLLENQYR---EAK-----KHLMETT------------QTTEVD 55 (523) T ss_pred CCcchhhHHHHHhhhhhhcccC-----ChhHHHHHHHHhhhhhH---HHH-----Hhhhhhh------------hccccc Confidence 765 6899999999999655 55899999999999986 222 4777753 366699 Q ss_pred ccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccc--------------ccccccccc Q lcl|NC_015279. 78 GFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADT--------------AFAGQNEGF 143 (467) Q Consensus 78 ~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt--------------~fSg~~a~~ 143 (467) +|.| ||+|+||++|||||+||||||||||||||||||||||.+|.|+||+|+++.+ .|++.+... T Consensus 56 ~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ean~~~s~~~~~~ 134 (523) T protein:vir:59 56 GWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDENARLSRREYET 134 (523) T ss_pred cccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCcccccccccccccccccccccC Confidence 9996 9999999999999999999999999999999999999999999999986554 343322111 Q ss_pred ccccccccc--------ccc--------cC--------------CC-----------------c---------------- Q lcl|NC_015279. 144 DLTNGMSDA--------AAG--------LG--------------TT-----------------S---------------- 160 (467) Q Consensus 144 ~~~~~~~~~--------~~~--------~~--------------~~-----------------~---------------- 160 (467) ......... ... .+ .. . 
T Consensus 135 ~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astA 214 (523) T protein:vir:59 135 TITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGA 214 (523) T ss_pred ccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccccchhhcccccccc Confidence 100000000 000 00 00 0 Q ss_pred -----------cccccccccccc---ccccccccccccccchhhHhhcCC------CCCccceeeeEEEEEEEEeecccc Q lcl|NC_015279. 161 -----------QAGSNPAALNPV---ATASSTGYNVGQGMRTDEAEDLGT------SGDNFNEMAFSIEKVTVTAKSRAL 220 (467) Q Consensus 161 -----------~agt~p~~ln~~---~~~~~~~~~~~~Gm~TA~aE~LGs------~g~~f~EMaFsIEK~tVtAKSRaL 220 (467) ..+++....... .......++.+.||+|+.+|.+|+ ++++|+||+|+||||+|||||||| T Consensus 215 ss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaL 294 (523) T protein:vir:59 215 VGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKL 294 (523) T ss_pred ccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccc Confidence 000000000000 001122367789999999999964 346799999999999999999999 Q ss_pred cccccHHHHHHHHHhh-CCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhH------ Q lcl|NC_015279. 221 KAEYSLELAQDLKAIH-GLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWS------ 293 (467) Q Consensus 221 KAEYT~ELAQDLkAiH-GLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~------ 293 (467) |||||||||||||||| |||||+||+|||||||||||||||||+||++|+|||+.|++++|+|||+++.+++|. T Consensus 295 KAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~ 374 (523) T protein:vir:59 295 RAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYG 374 (523) T ss_pred cccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhh Confidence 9999999999999999 999999999999999999999999999999999999999999999999999999997 Q ss_pred --HHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEec Q lcl|NC_015279. 
294 --VEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYID 371 (467) Q Consensus 294 --ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D 371 (467) +|+||.|+|||++|+|+|+|+|+||+||||||||+||++|++||||++.+ ...+|+++.+|+|+|+|||+|||| T Consensus 375 ~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~----~~~~~~~~~~~~g~l~~~~~vy~d 450 (523) T protein:vir:59 375 SKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGN----DNRDGGTGIFYVGMVQGRYRLYKN 450 (523) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCC----ccccccccceeEEEecCceEEEec Confidence 89999999999999999999999999999999999999999999997533 357788999999999999999999 Q ss_pred ccccccchhhccCCCceEEEEEec-CCCccceeEecccchhhccccc-CCccccceeeeeeeeceee-cCcccccCcccc Q lcl|NC_015279. 372 PYSSNLTSANAANGNQYYVVGYKG-TSPYDAGLFYCPYVPLQMVRAV-GENTFQPKIGFKTRYGMVA-NPFAEGTTVGAG 448 (467) Q Consensus 372 ~y~~~~~~~~~~~~~dY~~vGyKG-~~~~d~glfyaPYv~l~~~~~~-Dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~ 448 (467) +|+ ++||++||||| .+++|+||||||||||.+++.+ ||+||||+|||||||||++ |||+.+.--- T Consensus 451 ~~~----------~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~-- 518 (523) T protein:vir:59 451 IYQ----------NQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYV-- 518 (523) T ss_pred CCC----------CcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhhhh-- Confidence 997 47999999999 5699999999999999999996 9999999999999999986 9999873200 Q ss_pred ccccc Q lcl|NC_015279. 
449 RLRVN 453 (467) Q Consensus 449 ~~~~~ 453 (467) ++..- T Consensus 519 ~~~~~ 523 (523) T protein:vir:59 519 KLLQP 523 (523) T ss_pred hhcCC Confidence 00000 No 18 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=96.21 E-value=0.00087 Score=37.36 Aligned_cols=336 Identities=14% Similarity=0.119 Sum_probs=133.5 Q ss_pred CcchHHHHHhhhhhhcc-----------CccchhcchhHHHHHHHHhhhHHH--HHHHHhh------------------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNY-----------EGLDKISDPHRRAVTAVLLENQEK--FMQEQVA------------------- 48 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~-----------~~~~~i~~~~~~~v~~~~~enq~~--~~~e~~~------------------- 48 (467) |-+.++|++.|.-+=+. +...+......+++.+.+-+.+++ .+++... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcccccc Confidence 99999988777655110 000111111111222222221211 0000000 Q ss_pred -----------hhhcchhhhhhhhhcccccccccc-cccc--cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeee Q lcl|NC_015279. 
49 -----------FEQGGMIAEQPTNAVGNGGYTSSG-GQTV--AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFA 114 (467) Q Consensus 49 -----------~~~~~~~~e~~~~~~g~~~~~st~-tg~i--~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFA 114 (467) ..+..++.+............+++ .|.+ ..+.+.++.+.| +...-.+++.++||++++|-+-- T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~ 157 (397) T protein:vir:48 81 KSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVR---QYDSLQEYVNVENVTTLTGSRVY 157 (397) T ss_pred chhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHH---HHHHHHhhhceeeccCCcceEEE Confidence 000000000000000000001111 1211 123344455555 45567888999999999886653 Q ss_pred eeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhc Q lcl|NC_015279. 115 MRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDL 194 (467) Q Consensus 115 MRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~L 194 (467) .+ .....+. ..+ .++++.. T Consensus 158 ~~--~~~~~~~--------a~~---------------------------------------------------v~E~~~~ 176 (397) T protein:vir:48 158 EK--WADITGL--------AKL---------------------------------------------------DDEAGSI 176 (397) T ss_pred Ee--ecCCCcc--------eee---------------------------------------------------ecccccc Confidence 33 2111100 000 0011111 Q ss_pred CCC-CCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 195 GTS-GDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 195 Gs~-g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) .+. ...|.++.|++.|..+- ..+|-||.+|-. 
.|.+++|.+-|+..|..-+|+.||.-.- T Consensus 177 ~~~~~~~~~~v~~~~~k~~~~-------~~iS~ell~ds~----~~l~~~v~~~l~~~~~~~~d~~il~G~g-------- 237 (397) T protein:vir:48 177 GTNDDPKLYPIRYAIKRYAGI-------STVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIA-------- 237 (397) T ss_pred ccccccceeeEEeeheeeeee-------hhhHHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 111 12466666666666544 579999999843 5789999999999999999999884321 Q ss_pred cccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccccccc Q lcl|NC_015279. 274 SNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDD 353 (467) Q Consensus 274 ~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~ 353 (467) .+....++.++ +....++..+... ...+..+||+|.....|.. +....+- .-+..|. T Consensus 238 ~~~~~~~~~~~----------d~i~~~~~~l~~~---------~~~~a~~v~n~~~~~~L~~---lkd~~G~-~i~~~~~ 294 (397) T protein:vir:48 238 TLPTKPTLTKW----------DDIIDLQAKVDPA---------IKQTSFFLTNTSGFTALKK---VKNAFGD-YLMERDV 294 (397) T ss_pred ccccccccccH----------HHHHHHHHHhhhh---------hcCCCEEEECHHHHHHHHH---hhcCCCc-eeeccCc Confidence 11122222222 1223343333221 2234677899999988864 2211110 0011111 Q ss_pred CCceeEEEecCceEEEe-c-cccccc--chhhcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceee Q lcl|NC_015279. 354 TGNTFAGVLQGKYRVYI-D-PYSSNL--TSANAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIG 427 (467) Q Consensus 354 t~~~~~G~l~~~~~vy~-D-~y~~~~--~~~~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g 427 (467) ++ --.++| .|++|++ | ...... 
....++ ...+|++++..+.-....+-+...| -...+-.+- T Consensus 295 ~~-~~~~~l-~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~----------~~~~~~~~r 362 (397) T protein:vir:48 295 KS-PTGYSI-DGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGA----------FETDTTKIR 362 (397) T ss_pred CC-CCCcee-ccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhh----------hhcCceeEE Confidence 11 123466 5566654 2 111100 000011 1234555554433322211111000 112222333 Q ss_pred eeeeecee-ecC--c-----ccccCccccccccccccccceeee Q lcl|NC_015279. 428 FKTRYGMV-ANP--F-----AEGTTVGAGRLRVNSNRYYRRVAV 463 (467) Q Consensus 428 ~~tRY~l~-~nP--~-----~~~~~~~~~~~~~~~n~y~r~~~v 463 (467) ...|++.. .|| | +...++.+. . -.+-| T Consensus 363 ~~~r~d~~~~~~~a~~~~~~~~~~~~~~~--~-------~~~~~ 397 (397) T protein:vir:48 363 VIDRFDVVATDTESFVPASFKAIADQKGN--L-------GSTAV 397 (397) T ss_pred EEeeeccEEecccceEEEEecccccCCCC--c-------cccCC Confidence 33344332 122 1 000111100 0 00111 No 19 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=94.53 E-value=0.0042 Score=33.63 Aligned_cols=314 Identities=11% Similarity=0.018 Sum_probs=126.2 Q ss_pred cchhhhhhhhhccccccc-ccccccccccCchh-hhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccc Q lcl|NC_015279. 52 GGMIAEQPTNAVGNGGYT-SSGGQTVAGFDPVL-ISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALF 129 (467) Q Consensus 52 ~~~~~e~~~~~~g~~~~~-st~tg~i~~~~P~L-v~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlf 129 (467) ---|+|...++.|..... .++++. .-.-+.+ =.+++...+..+-..++.+.||+++..-|.-.. .. 
T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~-~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~----~~------- 68 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPS-DLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTV----KR------- 68 (338) T ss_pred CcchHHhhhhhcccccccceecccc-cccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe----cC------- Confidence 112455545444432221 111111 1222222 234455556667889999999998755444322 10 Q ss_pred cccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEE Q lcl|NC_015279. 130 DEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIE 209 (467) Q Consensus 130 nEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIE 209 (467) +...+-+.+. +...+++ ...++-.-+++ T Consensus 69 --~~a~~v~~~~-------------------------------------------~~~~~Eg-------~~~~~~~~~f~ 96 (338) T protein:vir:78 69 --PEVGQVGVGT-------------------------------------------SNEQREG-------GTKPLSGTAWD 96 (338) T ss_pred --ccceeecccc-------------------------------------------ccccccc-------cccccccccee Confidence 0000000000 0000111 12222233334 Q ss_pred EEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccc----cceeEEee Q lcl|NC_015279. 210 KVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTA----TAGVFDLD 285 (467) Q Consensus 210 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~----~~gv~Dl~ 285 (467) .++...+..+-...+|-||.+|-. .|.|++|.+-|+..|...||..||.---+..... ..++. ..+....+ T Consensus 97 ~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~-~~gi~~~~~~~~~~~~~ 171 (338) T protein:vir:78 97 TRSVAPIKLATIVTVSEEFARMNP----SGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSA-LQGIDTNNVIVNTTNVD 171 (338) T ss_pred EEEEEEEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc-ccccccccccccccccc Confidence 444444444445678899999833 6788999999999999999998884221110000 00000 01111111 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCc Q lcl|NC_015279. 
286 IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGK 365 (467) Q Consensus 286 ~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~ 365 (467) ....+- ...|..-.++-.-......+..+-++++|+....|...-.+....+ ..+-.+.....-.++| .| T Consensus 172 ~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g--~~l~~~~~~~~~~~~l-~G 241 (338) T protein:vir:78 172 YLQTGT-------TPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANG--NVDPTRINLAASAGDL-LG 241 (338) T ss_pred cccccc-------hhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCC--ceeecccccCCCCcee-ee Confidence 101100 0111212222222233345667789999998887754322221111 0111111111124577 46 Q ss_pred eEEEecccccccchh-------hccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc-----c---cceeeeee Q lcl|NC_015279. 366 YRVYIDPYSSNLTSA-------NAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT-----F---QPKIGFKT 430 (467) Q Consensus 366 ~~vy~D~y~~~~~~~-------~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s-----~---qP~~g~~t 430 (467) ++|+++.+....... ...-.+.++++|..+.-.++ ..+| .......||.. | |=.+=... T Consensus 242 ~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~----~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 315 (338) T protein:vir:78 242 LPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVK----MSDT--ATLTDNTSPTPQTVSMWQTNQIAILIEV 315 (338) T ss_pred eeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEE----Eeec--ccccccccccccchhhhhcCcEEEEEEE Confidence 699888664210000 00001122333333322211 1111 11111223321 1 11222355 Q ss_pred eec-eeecC--cccccCccccccccc Q lcl|NC_015279. 431 RYG-MVANP--FAEGTTVGAGRLRVN 453 (467) Q Consensus 431 RY~-l~~nP--~~~~~~~~~~~~~~~ 453 (467) |++ .+.|| |+.-+....+ +. 
T Consensus 316 r~d~~v~~~~a~~~l~~~~~~---~~ 338 (338) T protein:vir:78 316 TFGWLLGDKQAFVKFVDDEDP---DA 338 (338) T ss_pred EeccEeecccceEEEecccCC---CC Confidence 777 44555 3222221111 00 No 20 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=94.06 E-value=0.0055 Score=32.97 Aligned_cols=308 Identities=12% Similarity=0.061 Sum_probs=121.7 Q ss_pred HhhhHHHHHHHHhhhhhcchhhhhhhhhcccccccccccccccccCchhh-hhHHHHHhhhhhhhceeeccCCccceeee Q lcl|NC_015279. 35 LLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLI-SLIRRSMPNLVAYDLAGVQPMSGPTGLIF 113 (467) Q Consensus 35 ~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv-~l~Rr~~p~LI~~DI~GVQPmTGPTGLIF 113 (467) +-++|+..++.+. +.. -++..+..+ .....++.++.. ..-|.+. .+++......+..+++.+-||++++.-|. T Consensus 1 ~~~~~~~~~~~~~-f~~-~~~~~~~~~---a~~~~~~~~~~~-lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p 74 (324) T protein:vir:96 1 MEQTQKLKLNLQH-FAS-NNVKPQVFN---PDNVMMHEKKDG-TLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) T ss_pred CCcchhhhHHHHH-HHH-hhhhhhhcc---cccccccCCCcc-eechhHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 1222211111111 110 011111111 111111121211 1222232 24455556667889999999988764332 Q ss_pred eeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhh Q lcl|NC_015279. 114 AMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAED 193 (467) Q Consensus 114 AMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~ 193 (467) - +.. +.++ .| .++++. 
T Consensus 75 ~----~~~--~~~a-------~~---------------------------------------------------v~Eg~~ 90 (324) T protein:vir:96 75 F----WAD--KPGA-------YW---------------------------------------------------VGEGQK 90 (324) T ss_pred E----Eec--Ccce-------ee---------------------------------------------------ecCCcc Confidence 1 110 0000 00 011111 Q ss_pred cCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 194 LGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 194 LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) .......|.+..+.+.|..+-. ..|-||.+|-. .|.+++|.+.|...|...+++.+|.--- T Consensus 91 ~~~~~~~f~~v~~~~~k~~~~~-------~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g-------- 151 (324) T protein:vir:96 91 IETSKATWVNATMRAFKLGVIL-------PVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG-------- 151 (324) T ss_pred ccccccceeEEEEEeEEEEEee-------hhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCC-------- Confidence 1111234666666666665554 48999999853 5688999999999999999998884311 Q ss_pred cccccceeEEeecccc----chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc Q lcl|NC_015279. 274 SNTATAGVFDLDIDSN----GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL 349 (467) Q Consensus 274 ~~~~~~gv~Dl~~~~~----~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~ 349 (467) .+....|++....... +.-..+....+..++ ....+..+.++||+.....|... ....+ ..+ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i---------~~~~~~~~~~i~n~~~~~~L~~l---kd~~G--~~~ 217 (324) T protein:vir:96 152 NNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEALL---------EDDELEANAFISKTQNRSLLRKI---VDPET--KER 217 (324) T ss_pred CCCcCccccccccccceecccccchHHHHHHHHhh---------hhccCCCCEEEEcHHHHHHHHHh---hCCCC--Cee Confidence 1111122222111000 000011222222222 12235566789999998888643 21111 011 Q ss_pred ccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccce--eEecccchhhcccccCCccccceee Q lcl|NC_015279. 
350 NVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAG--LFYCPYVPLQMVRAVGENTFQPKIG 427 (467) Q Consensus 350 ~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~g--lfyaPYv~l~~~~~~Dp~s~qP~~g 427 (467) -.+.. .++| .|++|++++...........-.+.++++|..+.-+.+.+ ..+.++...+....-.-..-|=.+= T Consensus 218 ~~~~~----~~~l-~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r 292 (324) T protein:vir:96 218 IYDRN----SDSL-DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ecCCC----CCcc-cceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 11112 2345 567888765432110000000112344555543322111 0001110000000000000112222 Q ss_pred eeeeece-eecC--ccc------ccCcccccc Q lcl|NC_015279. 428 FKTRYGM-VANP--FAE------GTTVGAGRL 450 (467) Q Consensus 428 ~~tRY~l-~~nP--~~~------~~~~~~~~~ 450 (467) ..-||+. ..+| |+. +++..|+.+ T Consensus 293 ~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 293 ATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred EEEEeccEEecccceEEEecccccCCCCCCCC Confidence 3445555 3344 211 233333333 No 21 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=93.71 E-value=0.0066 Score=32.52 Aligned_cols=337 Identities=13% Similarity=0.126 Sum_probs=134.7 Q ss_pred CcchHHHHHhhhhhhccCccchhcch-------------hHHHHH---HHHhh---hHHHHHHHHhhhhhc--------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDP-------------HRRAVT---AVLLE---NQEKFMQEQVAFEQG--------- 52 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~-------------~~~~v~---~~~~e---nq~~~~~e~~~~~~~--------- 52 (467) |.+.++|+++|.-+-+. +-++.+. .-+++. ..+.+ .+++.+.+.+..... 
T Consensus 1 Mk~~~el~~~~~~~~~~--~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 78 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDK--VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKP 78 (397) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 99999998888754322 0000000 000001 11110 010111111000000 Q ss_pred -----------------chhhhhhhhhcccccccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceee Q lcl|NC_015279. 53 -----------------GMIAEQPTNAVGNGGYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLI 112 (467) Q Consensus 53 -----------------~~~~e~~~~~~g~~~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLI 112 (467) .++..-...........+++.|.+ ..+.+.++.+.| +..+..+++.++||++++|-+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~ 155 (397) T protein:vir:49 79 LTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVS---QYDSLQEYVNVENVTTLTGSR 155 (397) T ss_pred cccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHH---hhhhHHhhhceeecccCccce Confidence 000000000000000011111211 123344444555 555778889999999998854 Q ss_pred eeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHh Q lcl|NC_015279. 113 FAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAE 192 (467) Q Consensus 113 FAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE 192 (467) .=++ ..+.++ .+ .+ .++++ T Consensus 156 ~~~~--~~~~~~-~a-------~~---------------------------------------------------v~E~~ 174 (397) T protein:vir:49 156 VYEK--WTDITG-LA-------NI---------------------------------------------------DDEAG 174 (397) T ss_pred EEEe--eccCCc-ce-------ee---------------------------------------------------ecCcc Confidence 3222 111110 00 00 00001 Q ss_pred hcCC-CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccc Q lcl|NC_015279. 
193 DLGT-SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQG 271 (467) Q Consensus 193 ~LGs-~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~ 271 (467) .... +...|.++.|++.|..+ ...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+. T Consensus 175 ~~~~~~~~~~~~i~~~~~k~~~-------~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~---- 239 (397) T protein:vir:49 175 KIADVDDPKLSLIKYTIKRYAG-------ISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIAAL---- 239 (397) T ss_pred ccccccccceeeEEeeeeeEEe-------eehhHHHHHhhhH----HHHHHHHHHHHHHHHHHHHHHHHHhhcccc---- Confidence 1000 11346666666666554 4568999999852 578999999999999999999988543221 Q ss_pred cccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccccc Q lcl|NC_015279. 272 AVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNV 351 (467) Q Consensus 272 k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~ 351 (467) ....+..++ +....+.+.+... -.....+|++|.....|... ....+- .-+.. T Consensus 240 ----~~~~~~~~~----------d~i~~~~~~l~~~---------~~~~a~~vmn~~~~~~l~~l---kd~~G~-~l~~~ 292 (397) T protein:vir:49 240 ----PTKPTLTKW----------DDIIDLEAKVDPA---------IKQTSFFLTNTSGFTALKKV---KNALGD-YLMER 292 (397) T ss_pred ----ccccccccH----------HHHHHHHHhhhhh---------hcCCCEEEEcHHHHHHHHHh---hcCCCc-eeecc Confidence 122333322 2233455555322 12335788999998888652 221110 00111 Q ss_pred ccCCceeEEEecCceEEEe--ccccccc--chhhcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccce Q lcl|NC_015279. 352 DDTGNTFAGVLQGKYRVYI--DPYSSNL--TSANAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPK 425 (467) Q Consensus 352 d~t~~~~~G~l~~~~~vy~--D~y~~~~--~~~~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~ 425 (467) +.+ ....++| .|++|++ +.+.... .+..++ ...+|++++.++..+.+ +.+|... +-...+-. 
T Consensus 293 ~~~-~~~~~~l-~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~----~~~~~~~------~~~~~~~~ 360 (397) T protein:vir:49 293 DVK-SPTGYSI-DGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLL----STNIGGG------AFETDTTK 360 (397) T ss_pred CcC-CCCCcee-cceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEE----Eeccccc------hhhcCcee Confidence 111 1223567 5667765 2221100 000011 11234444444333322 2232211 11122233 Q ss_pred eeeeeeecee-ecC--cccc--cCccccccccccccccceeee Q lcl|NC_015279. 426 IGFKTRYGMV-ANP--FAEG--TTVGAGRLRVNSNRYYRRVAV 463 (467) Q Consensus 426 ~g~~tRY~l~-~nP--~~~~--~~~~~~~~~~~~n~y~r~~~v 463 (467) +-...|++.. .|| |... +.........++-+ | T Consensus 361 ~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~------~ 397 (397) T protein:vir:49 361 VRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTA------V 397 (397) T ss_pred EEEEeeeCcEEecccceEEEEeecccCCCCCccccc------C Confidence 3344444432 222 1100 00000000000000 0 No 22 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=92.97 E-value=0.0093 Score=31.72 Aligned_cols=266 Identities=14% Similarity=0.093 Sum_probs=116.4 Q ss_pred CCccceeeeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCc--cccccccccccccccccccccc Q lcl|NC_015279. 105 MSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTS--QAGSNPAALNPVATASSTGYNV 182 (467) Q Consensus 105 mTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~--~agt~p~~ln~~~~~~~~~~~~ 182 (467) |. ...++.+ ..+.+|--..+=-.. -.. .....+..... ..+. 
+ ....++ T Consensus 1 MA-----------~~~T~~~-~~~iPev~s~~v~~~--~~~----~~~~~~~~~~~~~~~g~-~----------G~tv~i 51 (272) T protein:vir:30 1 MA-----------VGTTKMA-QMLDPEVLADMIDAE--VGK----AIRFAPLAEVDTTLEGQ-P----------GTTLTV 51 (272) T ss_pred CC-----------Cccccch-heechHHHHHHHHHH--HHH----HhhhhccccccccccCC-C----------CCEEEE Confidence 11 0000000 011111000000000 000 00000000000 0000 0 000111 Q ss_pred ccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHH Q lcl|NC_015279. 183 GQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIR 262 (467) Q Consensus 183 ~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~ 262 (467) ..--....++..+. |.++..=..+.+.++++.|.++-.-++|=|++.+ -+-|.+.++.+-|+..|..+|+++|+. T Consensus 52 P~~~~~~~a~~v~e-g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~ 126 (272) T protein:vir:30 52 PKWDYIGDAEDVAE-GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLD 126 (272) T ss_pred EEecCCCCcccccC-CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 11001112222221 2333344455677777777777666777666533 247999999999999999999999998 Q ss_pred HHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcc Q lcl|NC_015279. 263 TIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYT 342 (467) Q Consensus 263 ~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~ 342 (467) .+....... .+-.++ +.+-.++.++..+ -...+++|++|.++..|......++. 
T Consensus 127 ~~~~a~~~~-------~~~~t~----------d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~ 180 (272) T protein:vir:30 127 ALSKSTQTV-------EATATV----------DGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWL 180 (272) T ss_pred Hhccccccc-------ccccCH----------HHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccc Confidence 765443221 111111 1122333444322 24567999999999999766444432 Q ss_pred cccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEe-cCCCccceeEecccchhhcccccCCcc Q lcl|NC_015279. 343 PALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYK-GTSPYDAGLFYCPYVPLQMVRAVGENT 421 (467) Q Consensus 343 ~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyK-G~~~~d~glfyaPYv~l~~~~~~Dp~s 421 (467) .......+ ....-..|.+ .|++|+++++.. ++-.+.++ |.- +++-..-+..+. --|+.+ T Consensus 181 ~~~~~~~~--~~~~g~ig~i-~G~~Vi~s~~~p-----------~~t~~~~~~~a~----~~~~~~~~~ve~--~r~~~~ 240 (272) T protein:vir:30 181 GATEVGAN--RVVSGVYGEV-LGVQIVRSRKCP-----------KGTAYMVRKGAL----RIMLKRNTMVET--DRDITK 240 (272) T ss_pred cccccccc--ccccccchhh-cCeeEEEcCCCC-----------cceEEEEcCCeE----EEEecCCceeee--cccccc Confidence 22111111 1111235677 568999998852 22222222 211 122222222111 127888 Q ss_pred ccceeeeeeeecee-ecCc--ccccCccccccccccc Q lcl|NC_015279. 422 FQPKIGFKTRYGMV-ANPF--AEGTTVGAGRLRVNSN 455 (467) Q Consensus 422 ~qP~~g~~tRY~l~-~nP~--~~~~~~~~~~~~~~~n 455 (467) ++-.+-..-|||+. .||- ...+-.. .++. 
T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~-----a~~~ 272 (272) T protein:vir:30 241 AINQIVANKHYGVYLYKAEKAVKITLKD-----AAKK 272 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecc-----cccC Confidence 88888778888875 3442 1111111 1111 No 23 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=92.97 E-value=0.0093 Score=31.72 Aligned_cols=266 Identities=14% Similarity=0.093 Sum_probs=116.4 Q ss_pred CCccceeeeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCc--cccccccccccccccccccccc Q lcl|NC_015279. 105 MSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTS--QAGSNPAALNPVATASSTGYNV 182 (467) Q Consensus 105 mTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~--~agt~p~~ln~~~~~~~~~~~~ 182 (467) |. ...++.+ ..+.+|--..+=-.. -.. .....+..... ..+. + ....++ T Consensus 1 MA-----------~~~T~~~-~~~iPev~s~~v~~~--~~~----~~~~~~~~~~~~~~~g~-~----------G~tv~i 51 (272) T protein:vir:98 1 MA-----------VGTTKMA-QMLDPEVLADMIDAE--VGK----AIRFAPLAEVDTTLEGQ-P----------GTTLTV 51 (272) T ss_pred CC-----------Cccccch-heechHHHHHHHHHH--HHH----HhhhhccccccccccCC-C----------CCEEEE Confidence 11 0000000 011111000000000 000 00000000000 0000 0 000111 Q ss_pred ccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHH Q lcl|NC_015279. 183 GQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIR 262 (467) Q Consensus 183 ~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~ 262 (467) ..--....++..+. |.++..=..+.+.++++.|.++-.-++|=|++.+ -+-|.+.++.+-|+..|..+|+++|+. 
T Consensus 52 P~~~~~~~a~~v~e-g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~ 126 (272) T protein:vir:98 52 PKWDYIGDAEDVAE-GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLD 126 (272) T ss_pred EEecCCCCcccccC-CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHH Confidence 11001112222221 2333344455677777777777666777666533 247999999999999999999999998 Q ss_pred HHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcc Q lcl|NC_015279. 263 TIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYT 342 (467) Q Consensus 263 ~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~ 342 (467) .+....... .+-.++ +.+-.++.++..+ -...+++|++|.++..|......++. T Consensus 127 ~~~~a~~~~-------~~~~t~----------d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~ 180 (272) T protein:vir:98 127 ALSKSTQTV-------EATATV----------DGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWL 180 (272) T ss_pred Hhccccccc-------ccccCH----------HHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccc Confidence 765443221 111111 1122333444322 24567999999999999766444432 Q ss_pred cccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEe-cCCCccceeEecccchhhcccccCCcc Q lcl|NC_015279. 343 PALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYK-GTSPYDAGLFYCPYVPLQMVRAVGENT 421 (467) Q Consensus 343 ~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyK-G~~~~d~glfyaPYv~l~~~~~~Dp~s 421 (467) .......+ ....-..|.+ .|++|+++++.. ++-.+.++ |.- +++-..-+..+. --|+.+ T Consensus 181 ~~~~~~~~--~~~~g~ig~i-~G~~Vi~s~~~p-----------~~t~~~~~~~a~----~~~~~~~~~ve~--~r~~~~ 240 (272) T protein:vir:98 181 GATEVGAN--RVVSGVYGEV-LGVQIVRSRKCP-----------KGTAYMVRKGAL----RIMLKRNTMVET--DRDITK 240 (272) T ss_pred cccccccc--ccccccchhh-cCeeEEEcCCCC-----------cceEEEEcCCeE----EEEecCCceeee--cccccc Confidence 22111111 1111235677 568999998852 22222222 211 122222222111 127888 Q ss_pred ccceeeeeeeecee-ecCc--ccccCccccccccccc Q lcl|NC_015279. 
422 FQPKIGFKTRYGMV-ANPF--AEGTTVGAGRLRVNSN 455 (467) Q Consensus 422 ~qP~~g~~tRY~l~-~nP~--~~~~~~~~~~~~~~~n 455 (467) ++-.+-..-|||+. .||- ...+-.. .++. T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~-----a~~~ 272 (272) T protein:vir:98 241 AINQIVANKHYGVYLYKAEKAVKITLKD-----AAKK 272 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEecc-----cccC Confidence 88888778888875 3442 1111111 1111 No 24 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=92.97 E-value=0.0093 Score=31.72 Aligned_cols=303 Identities=12% Similarity=0.095 Sum_probs=123.8 Q ss_pred hhhHHHHHHHHhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeee Q lcl|NC_015279. 36 LENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAM 115 (467) Q Consensus 36 ~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAM 115 (467) ||.-++-..|-+.|... +.. ....+.....++.++...--..+.-.+++.....-+..+++-+-||++.+.-|. . T Consensus 1 ~~~~~~~~~~~~~f~~~-~~~---~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p-~ 75 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASN-NVK---PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT-F 75 (324) T ss_pred CCCchHHHHHHHHHHHH-hhc---cceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEE-E Confidence 22211111111111100 000 000111111122211111111111223344445556788889999887653321 1 Q ss_pred eeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcC Q lcl|NC_015279. 
116 RSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLG 195 (467) Q Consensus 116 RsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LG 195 (467) ..+ +.++ .| .+ | T Consensus 76 ---~~~--~~~a-------~~---------------------------------------------------v~--E--- 87 (324) T protein:vir:10 76 ---WAD--KPGA-------YW---------------------------------------------------VG--E--- 87 (324) T ss_pred ---EeC--Ccce-------eE---------------------------------------------------ec--c--- Confidence 100 0000 00 00 1 Q ss_pred CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 196 TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 196 s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) +..+++...+++++++..|.-+-.-..|-||.+|-. .|.+++|.+.|+..|...+++.+|.---+ + T Consensus 88 --g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g~--------~ 153 (324) T protein:vir:10 88 --GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN--------N 153 (324) T ss_pred --CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCC--------C Confidence 223444455566666666766777789999999864 46899999999999999999998843211 1 Q ss_pred cccceeEEeeccc----cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccccc Q lcl|NC_015279. 276 TATAGVFDLDIDS----NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNV 351 (467) Q Consensus 276 ~~~~gv~Dl~~~~----~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~ 351 (467) ....|++...... .+--..+....++..+. ...+..+.+|++|.....|... ....+ ..+-. 
T Consensus 154 ~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~~l~---------~~~~~~~~~v~n~~~~~~L~~l---~d~~g--~~~~~ 219 (324) T protein:vir:10 154 PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLE---------DDELEANAFISKTQNRSLLRKI---VDPET--KERIY 219 (324) T ss_pred ccCccccccccccceeccccCCHHHHHHHHHhhh---------hccCCCCEEEEcHHHHHHHHHh---hccCC--ceeec Confidence 1112222111000 01111233333433331 1234456689999999888642 21111 01111 Q ss_pred ccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCc--------ccc Q lcl|NC_015279. 352 DDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGEN--------TFQ 423 (467) Q Consensus 352 d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~--------s~q 423 (467) +.. .++| .|++|++.+.........+.-.+.++++|..+.-..+- ... .......|+. +-+ T Consensus 220 ~~~----~~~l-~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~----~~~--~~~~~~~~~~~~~~~~~~~~~ 288 (324) T protein:vir:10 220 DRN----SDTL-DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DET--AQLSTVKNEDGTPVNLFEQDM 288 (324) T ss_pred CCC----Cccc-cceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEE----eec--ccccccccccccchhhhhcCc Confidence 112 2345 45688887654211100000011222334333222110 000 0000001111 112 Q ss_pred ceeeeeeeece-eecC--ccc------ccCcccccc Q lcl|NC_015279. 424 PKIGFKTRYGM-VANP--FAE------GTTVGAGRL 450 (467) Q Consensus 424 P~~g~~tRY~l-~~nP--~~~------~~~~~~~~~ 450 (467) =.+=...|||. +.|| |+. +....++.+ T Consensus 289 ~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 289 VALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred EEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 23333456775 3445 322 122334444 No 25 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=92.42 E-value=0.011 Score=31.22 Aligned_cols=343 Identities=16% Similarity=0.071 Sum_probs=127.9 Q ss_pred Cc-------------chHHHHHhhhhhhc-------cCcc-------ch---hcchhHHHHHHHHhhhH---HHHHHHHh Q lcl|NC_015279. 
1 MF-------------QSEQLQEKWAPLLN-------YEGL-------DK---ISDPHRRAVTAVLLENQ---EKFMQEQV 47 (467) Q Consensus 1 ~~-------------~~~~l~~kw~p~l~-------~~~~-------~~---i~~~~~~~v~~~~~enq---~~~~~e~~ 47 (467) +. +-+.|.++..-+-+ .... .+ .....++.-......++ +..-.|.+ T Consensus 28 ~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 107 (415) T protein:vir:94 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHH Confidence 11 11111111111100 0000 00 00000000000001110 00111111 Q ss_pred hhhhcc-hhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCc Q lcl|NC_015279. 48 AFEQGG-MIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTE 126 (467) Q Consensus 48 ~~~~~~-~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtE 126 (467) .+.... .-.+...+ ...+.+|...--....-.+++..-+..+-.+++.|+||++..+-+--.+ +.+. .+ T Consensus 108 ~~~~~~~~~~~~~~~------~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~--~~ 177 (415) T protein:vir:94 108 DFTEYLETRNDIQGG------SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEV--AA 177 (415) T ss_pred HHHHHhhhhhhhhhh------ccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEe--ecCC--cc Confidence 111100 00111111 1111122221111122234444446668899999999998776543222 1110 00 Q ss_pred ccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCC-CCccceee Q lcl|NC_015279. 127 ALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTS-GDNFNEMA 205 (467) Q Consensus 127 AlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~-g~~f~EMa 205 (467) + .+- .+++..... ...|.+.. 
T Consensus 178 ~-------~~v---------------------------------------------------~Eg~~~~~~~~~~~~~i~ 199 (415) T protein:vir:94 178 L-------EKV---------------------------------------------------EELEENPELAVKPFFQLA 199 (415) T ss_pred c-------eec---------------------------------------------------cccccccccccccceeeE Confidence 0 000 000000000 12366666 Q ss_pred eEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee Q lcl|NC_015279. 206 FSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD 285 (467) Q Consensus 206 FsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~ 285 (467) |++.|..+ .-.+|-||.+|-- +|.+++|.+-|...|..-+|+.||.-.-+-...+-.......++- . T Consensus 200 ~~~~k~~~-------~~~is~ell~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~-~- 266 (415) T protein:vir:94 200 YDINTHRG-------YFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK-L- 266 (415) T ss_pred eeheeeee-------echhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc-c- Confidence 66666654 3458999999864 478999999999999999999998654332222111111111110 0 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCc Q lcl|NC_015279. 286 IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGK 365 (467) Q Consensus 286 ~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~ 365 (467) ...+--..+....++..+.. ...+.+.+|++|.....|... ....+- .-+..+.++ -..++| .| T Consensus 267 -~~~~~~~~~~i~~~~~~~~~---------~~~~~~~~vmn~~~~~~l~~l---kd~~G~-~l~~~~~~~-~~~~~l-~G 330 (415) T protein:vir:94 267 -EVKKAKSLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDKM---KDKLGN-YLIQPDVKE-KTQQRL-LG 330 (415) T ss_pred -ccccccchHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHHh---hccCCC-eeeccCcCC-CCCcee-cc Confidence 00011111223334333321 223467789999988888652 221110 001111111 123466 56 Q ss_pred eEEEecccccccchhhccCCCceEEEEE-ecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cC--ccc Q lcl|NC_015279. 
366 YRVYIDPYSSNLTSANAANGNQYYVVGY-KGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NP--FAE 441 (467) Q Consensus 366 ~~vy~D~y~~~~~~~~~~~~~dY~~vGy-KG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP--~~~ 441 (467) ++|++.+...... . +.--+++|- + +.-+ ......+ -....|-.++|-.+-...|++..+ +| |.. T Consensus 331 ~pV~~~~~~~~~~-~----~~~~i~~gd~~-----~~~~-~~~~~~~-~v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~ 398 (415) T protein:vir:94 331 AKIEILPDEVLGQ-K----GNNTLIIGNLK-----DAIV-LFDRSQY-QASWTDYMHFGECLMIAVRQDCRILDYKSAIV 398 (415) T ss_pred eeeEEecccccCC-C----CccEEEEEehh-----ccEE-EEeecce-EEEEeccccCceEEEEEEEeccEEeccccEEE Confidence 6777765532100 0 000122221 1 0000 0000000 011123345566666777887643 44 211 Q ss_pred c----cCcccc--cccc Q lcl|NC_015279. 442 G----TTVGAG--RLRV 452 (467) Q Consensus 442 ~----~~~~~~--~~~~ 452 (467) . +..+++ .+.. T Consensus 399 ~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 399 IEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEeccCCCCCccccCC Confidence 1 111111 1111 No 26 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=92.41 E-value=0.012 Score=31.21 Aligned_cols=344 Identities=13% Similarity=0.022 Sum_probs=118.3 Q ss_pred CcchHHHHHhhhhhhccCcc-----chhcchhHHHHHHH-------HhhhH----HHHHHHHhhhhhcc----hhhhhhh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGL-----DKISDPHRRAVTAV-------LLENQ----EKFMQEQVAFEQGG----MIAEQPT 60 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-----~~i~~~~~~~v~~~-------~~enq----~~~~~e~~~~~~~~----~~~e~~~ 60 (467) |-..+++.++=+-+.+.... ....+..+++.... +-++. .+...+.+...... .+.-.+. 
T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (413) T protein:vir:81 31 EDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRV 110 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHH Confidence 11111111111111000000 00000000000000 00000 00000000000000 0000011 Q ss_pred hhccccccc---cccccc--ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccc Q lcl|NC_015279. 61 NAVGNGGYT---SSGGQT--VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTA 135 (467) Q Consensus 61 ~~~g~~~~~---st~tg~--i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~ 135 (467) ...++.... ++..+. -..+.+-++.+.| +..+..+++.|+||++++.-+.-.+ -. ..+ +.. T Consensus 111 ~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~-~~--~~~--------~~~ 176 (413) T protein:vir:81 111 KAASDPASTATLTDEFQGGYGTTWNRNIIYRRR---EKLVVADLMDNLTMTNTTIKYLMEK-AN--RVV--------EGG 176 (413) T ss_pred HhhhhhhhhcccccccccccchhhHHHHHHHHh---hhhhHHhhcceeeccCCceeEEEec-cc--ccc--------ccc Confidence 111111111 111111 1223444555555 5567889999999999875332111 00 000 000 Q ss_pred cccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCC-CccceeeeEEEEEEEE Q lcl|NC_015279. 136 FAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSG-DNFNEMAFSIEKVTVT 214 (467) Q Consensus 136 fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g-~~f~EMaFsIEK~tVt 214 (467) + ..++ +++....++ ..|.+..|.+.|..+ T Consensus 177 a-------------------------------------------~~v~------Eg~~~~~~~~~~f~~i~~~~~k~~~- 206 (413) T protein:vir:81 177 F-------------------------------------------KTVA------EGGKKPYMRFADFDIVTESLSKIAG- 206 (413) T ss_pred c-------------------------------------------ceec------CcccccccCcccceeeEeeeeeEEE- Confidence 0 0000 001111111 246666666666554 Q ss_pred eecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee------ccc Q lcl|NC_015279. 
215 AKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD------IDS 288 (467) Q Consensus 215 AKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~------~~~ 288 (467) ....|-||.+|-- +.++.|.+-|+..|..-+|+.||. |.-.+-...|++... ... T Consensus 207 ------~~~iS~ell~ds~-----~l~~~i~~~la~~~~~~~d~~~l~--------G~G~~~~~~Gi~~~~~~~~~~~~~ 267 (413) T protein:vir:81 207 ------LTKITDEMIEDYD-----FLVSYINARLLEELAIEEERQLLL--------GDGTGNNLTGLLKRDGIQTLAVSN 267 (413) T ss_pred ------eehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhc--------cCCCCCcccccccccccccccccc Confidence 4568889999862 257888888888888888887773 111111123443331 111 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhh----cchhcccccccccccccCCceeEEEecC Q lcl|NC_015279. 289 NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMA----GVLDYTPALNANLNVDDTGNTFAGVLQG 364 (467) Q Consensus 289 ~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~s----G~~~~~~~~~~~~~~d~t~~~~~G~l~~ 364 (467) +.+. +.....+-.....-..+..+-+|++|.....|..- |-.-+.+...... .+ -+....++|. T Consensus 268 -~~~~--------~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~-~~-~~~~~~~~l~- 335 (413) T protein:vir:81 268 -KDEL--------ADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQY-GS-GGIMLDPAPW- 335 (413) T ss_pred -cchh--------HHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccc-cc-cccccCceec- Confidence 1111 11111221222222344556688899887777431 1000111110000 00 0111234664 Q ss_pred ceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cCccccc Q lcl|NC_015279. 365 KYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NPFAEGT 443 (467) Q Consensus 365 ~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP~~~~~ 443 (467) |++|+++....... ...-...++++++.++... +=..+|... 
+-.+.|=.+=...||+..+ +| T Consensus 336 G~pv~~s~~~~~~~-~~~gd~~~~~~~~~~~~~~----v~~~~~~~~------~~~~~~~~~r~~~r~d~~~~~~----- 399 (413) T protein:vir:81 336 GLRTVQSQVVPVGK-PVVGAFRSAASVLRKGGVR----IDSTNTNVD------DFENNLITVRAEERVGLMVTFP----- 399 (413) T ss_pred ceeeEEcCCCCccc-EEEEecccEEEEEEecceE----EEEeccccc------hhhcCcEEEEEEEeeccEEecc----- Confidence 77888887642100 0000111233333322211 111222110 0122333444445665433 22 Q ss_pred CccccccccccccccceeeeeccC Q lcl|NC_015279. 444 TVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 444 ~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) ..|+++.++..- T Consensus 400 ------------~a~~~l~~~~~~ 411 (413) T protein:vir:81 400 ------------EAIVQLDVAEVV 411 (413) T ss_pred ------------cceEEEEecCCC Confidence 111111111111 No 27 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=92.35 E-value=0.012 Score=31.16 Aligned_cols=331 Identities=12% Similarity=0.027 Sum_probs=130.3 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHH-----HhhhHHHHHHHHhhhhhcchhhhhh---------------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAV-----LLENQEKFMQEQVAFEQGGMIAEQP---------------- 59 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~-----~~enq~~~~~e~~~~~~~~~~~e~~---------------- 59 (467) |.+-++|.++..-+.+. .-++.+..+..+-.. =|++|-+.+.++...... .+.+.+ T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:18 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGT-RLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhccccccchhhhh Confidence 99999999888766532 212222222221111 111111000000000000 000000 Q ss_pred ---------h---hhcccc---------ccccccccc--ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeee Q lcl|NC_015279. 
60 ---------T---NAVGNG---------GYTSSGGQT--VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMR 116 (467) Q Consensus 60 ---------~---~~~g~~---------~~~st~tg~--i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMR 116 (467) - .-.+.. ...++.+|. .....+.++...| +.....+++-++||+++..-+.- T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~-- 152 (385) T protein:vir:18 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGL---RRLTIRDLLAQGRTSSNALEYVR-- 152 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhh---hccchhhhcceecccCcceEEEE-- Confidence 0 000000 000111111 1122344444444 44566778888888776532211 Q ss_pred eeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC Q lcl|NC_015279. 117 SKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT 196 (467) Q Consensus 117 srY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs 196 (467) +....+ .+ . .. +| T Consensus 153 --~~~~~~-~a-------~---------------------------------------------------~v--~E---- 165 (385) T protein:vir:18 153 --EEVFTN-NA-------D---------------------------------------------------VV--AE---- 165 (385) T ss_pred --EecCCc-ce-------e---------------------------------------------------ee--cc---- Confidence 100000 00 0 00 01 Q ss_pred CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccc Q lcl|NC_015279. 197 SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNT 276 (467) Q Consensus 197 ~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~ 276 (467) +..+++-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.||.- .-.+. 
T Consensus 166 -~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G--------~g~~~ 231 (385) T protein:vir:18 166 -KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG--------DGTGD 231 (385) T ss_pred -CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc--------cCCCC Confidence 122344445556666666666667789999999842 3567777777777777777777621 11111 Q ss_pred ccceeEEeeccc------cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccc Q lcl|NC_015279. 277 ATAGVFDLDIDS------NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLN 350 (467) Q Consensus 277 ~~~gv~Dl~~~~------~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~ 350 (467) ...|++...... .+--.......++.++ ....+..+-+||||.....|... ....+ ..+- T Consensus 232 ~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l---------~~~~~~~~~~~~~~~~~~~l~~l---kd~~G--~~l~ 297 (385) T protein:vir:18 232 NLEGLNKVATAYDTSLNATGDTRADIIAHAIYQV---------TESEFSASGIVLNPRDWHNIALL---KDNEG--RYIF 297 (385) T ss_pred cccccccccccccccccccccchHHHHHHHHHhh---------ccccCCCCEEEEcHHHHHHHHHh---hcCCC--ceec Confidence 223333221111 0000011122222222 22345567899999998888542 21111 1111 Q ss_pred cccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCC-ccccceeeee Q lcl|NC_015279. 351 VDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGE-NTFQPKIGFK 429 (467) Q Consensus 351 ~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp-~s~qP~~g~~ 429 (467) .+.+ .-..++|. |++|+++++..... .......++++++..+....+-+ .. .-|+ ..-+=.+-.. T Consensus 298 ~~~~-~~~~~~l~-G~pV~~~~~~p~~~-~~~gd~~~~~~~~~~~~~~v~~~----~~-------~~~~~~~~~~~~~~~ 363 (385) T protein:vir:18 298 GGPQ-AFTSNIMW-GLPVVPTKAQAAGT-FTVGGFDMASQVWDRMDATVEVS----RE-------DRDNFVKNMLTILCE 363 (385) T ss_pred cCcc-cCCCceec-ceeeEEcCcCCCCc-EEEeecccEEEEEEecceEEEEe----cc-------ccchhhcCcEEEEEE Confidence 1111 11135674 58999998753200 00001112222222211111100 00 0011 1112223334 Q ss_pred eeece-eecC--cccccCcccccccccc Q lcl|NC_015279. 
430 TRYGM-VANP--FAEGTTVGAGRLRVNS 454 (467) Q Consensus 430 tRY~l-~~nP--~~~~~~~~~~~~~~~~ 454 (467) .||+. +.+| |+..+-. .++ T Consensus 364 ~r~~~~v~~~~a~~~~~~~------aa~ 385 (385) T protein:vir:18 364 ERLALAHYRPTAIIKGTFS------SGS 385 (385) T ss_pred EeeccEEecccceEEEEec------cCC Confidence 47776 3444 2211111 111 No 28 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=92.35 E-value=0.012 Score=31.16 Aligned_cols=331 Identities=12% Similarity=0.027 Sum_probs=130.3 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHH-----HhhhHHHHHHHHhhhhhcchhhhhh---------------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAV-----LLENQEKFMQEQVAFEQGGMIAEQP---------------- 59 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~-----~~enq~~~~~e~~~~~~~~~~~e~~---------------- 59 (467) |.+-++|.++..-+.+. .-++.+..+..+-.. =|++|-+.+.++...... .+.+.+ T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 77 (385) T protein:vir:19 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGT-RLFDLEQKLASGAENPGEKKSF 77 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhhccccccchhhhh Confidence 99999999888766532 212222222221111 111111000000000000 000000 Q ss_pred ---------h---hhcccc---------ccccccccc--ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeee Q lcl|NC_015279. 60 ---------T---NAVGNG---------GYTSSGGQT--VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMR 116 (467) Q Consensus 60 ---------~---~~~g~~---------~~~st~tg~--i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMR 116 (467) - .-.+.. ...++.+|. 
.....+.++...| +.....+++-++||+++..-+.- T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~-- 152 (385) T protein:vir:19 78 SERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGL---RRLTIRDLLAQGRTSSNALEYVR-- 152 (385) T ss_pred HHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhh---hccchhhhcceecccCcceEEEE-- Confidence 0 000000 000111111 1122344444444 44566778888888776532211 Q ss_pred eeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC Q lcl|NC_015279. 117 SKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT 196 (467) Q Consensus 117 srY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs 196 (467) +....+ .+ . .. +| T Consensus 153 --~~~~~~-~a-------~---------------------------------------------------~v--~E---- 165 (385) T protein:vir:19 153 --EEVFTN-NA-------D---------------------------------------------------VV--AE---- 165 (385) T ss_pred --EecCCc-ce-------e---------------------------------------------------ee--cc---- Confidence 100000 00 0 00 01 Q ss_pred CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccc Q lcl|NC_015279. 197 SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNT 276 (467) Q Consensus 197 ~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~ 276 (467) +..+++-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.||.- .-.+. T Consensus 166 -~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G--------~g~~~ 231 (385) T protein:vir:19 166 -KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG--------DGTGD 231 (385) T ss_pred -CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc--------cCCCC Confidence 122344445556666666666667789999999842 3567777777777777777777621 11111 Q ss_pred ccceeEEeeccc------cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccc Q lcl|NC_015279. 
277 ATAGVFDLDIDS------NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLN 350 (467) Q Consensus 277 ~~~gv~Dl~~~~------~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~ 350 (467) ...|++...... .+--.......++.++ ....+..+-+||||.....|... ....+ ..+- T Consensus 232 ~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l---------~~~~~~~~~~~~~~~~~~~l~~l---kd~~G--~~l~ 297 (385) T protein:vir:19 232 NLEGLNKVATAYDTSLNATGDTRADIIAHAIYQV---------TESEFSASGIVLNPRDWHNIALL---KDNEG--RYIF 297 (385) T ss_pred cccccccccccccccccccccchHHHHHHHHHhh---------ccccCCCCEEEEcHHHHHHHHHh---hcCCC--ceec Confidence 223333221111 0000011122222222 22345567899999998888542 21111 1111 Q ss_pred cccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCC-ccccceeeee Q lcl|NC_015279. 351 VDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGE-NTFQPKIGFK 429 (467) Q Consensus 351 ~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp-~s~qP~~g~~ 429 (467) .+.+ .-..++|. |++|+++++..... .......++++++..+....+-+ .. .-|+ ..-+=.+-.. T Consensus 298 ~~~~-~~~~~~l~-G~pV~~~~~~p~~~-~~~gd~~~~~~~~~~~~~~v~~~----~~-------~~~~~~~~~~~~~~~ 363 (385) T protein:vir:19 298 GGPQ-AFTSNIMW-GLPVVPTKAQAAGT-FTVGGFDMASQVWDRMDATVEVS----RE-------DRDNFVKNMLTILCE 363 (385) T ss_pred cCcc-cCCCceec-ceeeEEcCcCCCCc-EEEeecccEEEEEEecceEEEEe----cc-------ccchhhcCcEEEEEE Confidence 1111 11135674 58999998753200 00001112222222211111100 00 0011 1112223334 Q ss_pred eeece-eecC--cccccCcccccccccc Q lcl|NC_015279. 430 TRYGM-VANP--FAEGTTVGAGRLRVNS 454 (467) Q Consensus 430 tRY~l-~~nP--~~~~~~~~~~~~~~~~ 454 (467) .||+. +.+| |+..+-. 
.++ T Consensus 364 ~r~~~~v~~~~a~~~~~~~------aa~ 385 (385) T protein:vir:19 364 ERLALAHYRPTAIIKGTFS------SGS 385 (385) T ss_pred EeeccEEecccceEEEEec------cCC Confidence 47776 3444 2211111 111 No 29 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=92.29 E-value=0.012 Score=31.11 Aligned_cols=308 Identities=12% Similarity=0.065 Sum_probs=117.7 Q ss_pred HHHHhhhHHHHHHHHhh-hhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccce Q lcl|NC_015279. 32 TAVLLENQEKFMQEQVA-FEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTG 110 (467) Q Consensus 32 ~~~~~enq~~~~~e~~~-~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTG 110 (467) |. ++| +++++.. |.. ..+.-+. .......++.++...--....-.+++.+.+..+..+++-+.||++.+- T Consensus 1 ~~---~~~--~~~~~~~~f~~-~~~~~~~---~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:97 1 ME---QTQ--KLKLNLQHFAS-NNVKPQV---FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred Cc---cch--hHHHHHHHHHH-hhhhhhh---hccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCce Confidence 11 111 1111110 000 0000000 111112222222221111122234455556778888999999987663 Q ss_pred eeeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhh Q lcl|NC_015279. 111 LIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDE 190 (467) Q Consensus 111 LIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~ 190 (467) -|- ++.. 
+.+| .+ .++ T Consensus 72 ~ip----~~~~--~~~a-------~~---------------------------------------------------v~E 87 (324) T protein:vir:97 72 KFT----FWAD--KPGA-------YW---------------------------------------------------VGE 87 (324) T ss_pred EEE----EEec--Ccce-------eE---------------------------------------------------ecc Confidence 221 1110 0000 00 011 Q ss_pred HhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccc Q lcl|NC_015279. 191 AEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQ 270 (467) Q Consensus 191 aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~ 270 (467) ++....+...|.++.|+..|..+- ..+|-||.+|-. .|.+++|.+-|+..|...+++.||.---+ T Consensus 88 g~~~~~~~~~f~~v~~~~~k~~~~-------~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~---- 152 (324) T protein:vir:97 88 GQKIETSKATWVNATMRAFKLGVI-------LPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN---- 152 (324) T ss_pred CccccccccceeEEEEeeEEEEEe-------ehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhccCCC---- Confidence 111111122355555555555544 458999999863 57899999999999999999999853211 Q ss_pred ccccccccceeEEeeccc----cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccc Q lcl|NC_015279. 271 GAVSNTATAGVFDLDIDS----NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN 346 (467) Q Consensus 271 ~k~~~~~~~gv~Dl~~~~----~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~ 346 (467) +....|++...... .+....+....+...+. . -.+....+||+|.....|... ....+ T Consensus 153 ----~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~l~--------~-~~~~~~~~v~n~~~~~~L~~l---kd~~g-- 214 (324) T protein:vir:97 153 ----NPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLE--------D-DELEANAFISKTQNRSLLRKI---VDPET-- 214 (324) T ss_pred ----CccCccccccccccceeccccCCHHHHHHHHHhhh--------h-ccCCCCEEEEcHHHHHHHHHh---hcCCC-- Confidence 11112222211110 11111122333333332 1 223344678999998888642 21111 Q ss_pred cccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccce--eEecccchhhcccccCCccccc Q lcl|NC_015279. 
347 ANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAG--LFYCPYVPLQMVRAVGENTFQP 424 (467) Q Consensus 347 ~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~g--lfyaPYv~l~~~~~~Dp~s~qP 424 (467) ..+-.+.+ .|+| .|++|++.+...........-.+.++++|..++-..+-. .+...+...+......=..-|= T Consensus 215 ~~~~~~~~----~~tl-~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~ 289 (324) T protein:vir:97 215 KERIYDRN----SDTL-DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) T ss_pred ceeecCCC----Cccc-cceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcE Confidence 01111111 2456 456777755322100000000012233444433221100 0000000000000000000011 Q ss_pred eeeeeeeece-eecC--ccc------ccCcccccc Q lcl|NC_015279. 425 KIGFKTRYGM-VANP--FAE------GTTVGAGRL 450 (467) Q Consensus 425 ~~g~~tRY~l-~~nP--~~~------~~~~~~~~~ 450 (467) .+=+..||+. ..|| |+. +++..|+.+ T Consensus 290 ~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 290 ALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred EEEEEEEeccEEecccceEEEEeccCCCCCCCCCC Confidence 1222345553 3333 211 123444444 No 30 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=92.25 E-value=0.012 Score=31.08 Aligned_cols=324 Identities=15% Similarity=0.075 Sum_probs=119.4 Q ss_pred CcchHHHHHhhhhhhcc-Ccc-chhcchhHHHHHHHHhhhHH--------------------HHHHHHhhhhhcchhhhh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNY-EGL-DKISDPHRRAVTAVLLENQE--------------------KFMQEQVAFEQGGMIAEQ 58 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~-~~~-~~i~~~~~~~v~~~~~enq~--------------------~~~~e~~~~~~~~~~~e~ 58 (467) --.+++...++.-+... +.+ -+|. ++..++-+-+. .+..+......+...... 
T Consensus 30 ~~~~~e~~~~~~~~~~e~~~l~~~i~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (390) T protein:vir:10 30 GELNASARSKVDELFATVGNLSAEVQ-----AARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATM 104 (390) T ss_pred cccCHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhh Confidence 01112223333322110 000 0010 00000000000 000000000000000000 Q ss_pred hhhhccccccc--cccccc--ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015279. 59 PTNAVGNGGYT--SSGGQT--VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADT 134 (467) Q Consensus 59 ~~~~~g~~~~~--st~tg~--i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt 134 (467) +.......... ++..|. +...-+.++.+.| +.....+++.+.||++++.-+. | ..+.++ ++ T Consensus 105 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~--~--~~~~~~-~a------- 169 (390) T protein:vir:10 105 NIKAALNTASTDAAGSAGALTTPNRLPGFITQPD---ARLTVRDLIGSGRTDSALIEYV--Q--ETGFVN-NA------- 169 (390) T ss_pred HHHHHHHhhhcccccccccccchhHHHHHHHHHH---hhchhhhhcceeeccCCceEEE--E--EecCCc-ce------- Confidence 00000000000 111111 1112233444444 3445667899999887653222 1 111000 00 Q ss_pred ccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEE Q lcl|NC_015279. 135 AFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVT 214 (467) Q Consensus 135 ~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVt 214 (467) .+ .+| +...++-..+++++++. T Consensus 170 ~~-----------------------------------------------------v~E-----g~~~~~~~~~~~~i~~~ 191 (390) T protein:vir:10 170 AI-----------------------------------------------------VAE-----GALKPESSLKFAKKTDT 191 (390) T ss_pred ee-----------------------------------------------------ecC-----CccccccccceeEEEEe Confidence 00 001 12233344455566666 Q ss_pred eecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeecc------c Q lcl|NC_015279. 
215 AKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDID------S 288 (467) Q Consensus 215 AKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~------~ 288 (467) +|..+....+|-||.||-- |.++.|.+-|+..|...||+.||.- .-.+....|++..... . T Consensus 192 ~~k~~~~~~is~ell~d~~-----~l~~~i~~~l~~~~~~~~~~~il~G--------~G~~~~p~Gi~~~~~~~~~~~~~ 258 (390) T protein:vir:10 192 THVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAEILRG--------TGANDGLLGLIPQATTYAAPTTI 258 (390) T ss_pred eEEEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhhc--------CCCCccccccccccccccccccc Confidence 6666677889999999852 4688999999999999999888831 1111223444432111 1 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEE Q lcl|NC_015279. 289 NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRV 368 (467) Q Consensus 289 ~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~v 368 (467) .+--.......++.++ ......++-+|++|.....|.. +....+ ..+-.++.. .-.++| .|++| T Consensus 259 ~~~~~~~~~~~~~~~l---------~~~~~~~~~~v~n~~~~~~L~~---lkd~~g--~~l~~~~~~-~~~~~l-~G~pv 322 (390) T protein:vir:10 259 AGATRVDQLRLAMLQA---------SLAEYPASGIVINPIDWAAIEL---AKDANN--QYLIGNARG-TLTPTL-WGLPV 322 (390) T ss_pred cccchHHHHHHHHHhh---------ccccCCCCEEEEcHHHHHHHHH---hhcCCC--ceeecCCcC-cCCcee-cceee Confidence 1111112222233332 1234456778999998887754 221111 111111111 113456 57799 Q ss_pred EecccccccchhhccCCCceEEEE-EecCCCccceeEecccchh--hcccc-cCCccccceeeeeeeeceee-cCccccc Q lcl|NC_015279. 369 YIDPYSSNLTSANAANGNQYYVVG-YKGTSPYDAGLFYCPYVPL--QMVRA-VGENTFQPKIGFKTRYGMVA-NPFAEGT 443 (467) Q Consensus 369 y~D~y~~~~~~~~~~~~~dY~~vG-yKG~~~~d~glfyaPYv~l--~~~~~-~Dp~s~qP~~g~~tRY~l~~-nP~~~~~ 443 (467) ++++... ..-+++| ++ .+++.+...-+ ++... 
..-.+-+=.+-...|++..+ +|= T Consensus 323 ~~~~~~p----------~~~~~~gdf~------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~---- 382 (390) T protein:vir:10 323 VATQAMA----------PGEFLVGAFD------LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE---- 382 (390) T ss_pred EEcCCCC----------CCcEEEEecc------ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccc---- Confidence 9987742 2222222 11 01111111000 00000 00111121222334555432 220 Q ss_pred Cccccccccccccccceeeee Q lcl|NC_015279. 444 TVGAGRLRVNSNRYYRRVAVK 464 (467) Q Consensus 444 ~~~~~~~~~~~n~y~r~~~v~ 464 (467) .|.++-+. T Consensus 383 -------------a~~~~~~a 390 (390) T protein:vir:10 383 -------------ALISGSFA 390 (390) T ss_pred -------------cEEEEEeC Confidence 11111111 No 31 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=92.13 E-value=0.013 Score=30.98 Aligned_cols=340 Identities=16% Similarity=0.094 Sum_probs=129.1 Q ss_pred Ccch-------------HHHHHhhhhhh------ccCccchhcchhHHHHHH-HH-------------hhhHHHHHHHHh Q lcl|NC_015279. 1 MFQS-------------EQLQEKWAPLL------NYEGLDKISDPHRRAVTA-VL-------------LENQEKFMQEQV 47 (467) Q Consensus 1 ~~~~-------------~~l~~kw~p~l------~~~~~~~i~~~~~~~v~~-~~-------------~enq~~~~~e~~ 47 (467) ++.. +.|.++..-+- +.+...++....+..... +. ++..+..-.|.+ T Consensus 28 ~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:81 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 1111 11222221110 000000000000000000 00 000000000111 Q ss_pred hhhhcchhhhhhhhhcccccc---ccccccc-c--cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC Q lcl|NC_015279. 
48 AFEQGGMIAEQPTNAVGNGGY---TSSGGQT-V--AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST 121 (467) Q Consensus 48 ~~~~~~~~~e~~~~~~g~~~~---~st~tg~-i--~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~ 121 (467) .+.. .+.. +.... ..+..|. + ..+.+.++.+. .+..+-.+++.|.||++..+-+--.| ..+ T Consensus 108 ~~~~--~~~~------~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~ 174 (415) T protein:vir:81 108 DFTE--YLET------RNDIQGGSLKTDSGFVVIPEEIVTDILKLK---EVEFNLDKYVTVKRVTNGSGKYPVVR--QSE 174 (415) T ss_pred HHHH--HHhh------hhhhhhccccccccccccchHHHHHHHHHH---HhhhhhhhheeeeeccCCceeEEEEe--ecC Confidence 1100 0000 11110 0111111 1 12233344444 45567789999999999887554333 111 Q ss_pred CCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCC-Cc Q lcl|NC_015279. 122 QGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSG-DN 200 (467) Q Consensus 122 qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g-~~ 200 (467) . .++ .+ .++.+.....+ .. T Consensus 175 ~--~~~-------~~---------------------------------------------------v~E~~~~~~~~~~~ 194 (415) T protein:vir:81 175 V--AAL-------EK---------------------------------------------------VEELEENPELAVKP 194 (415) T ss_pred C--ccc-------ee---------------------------------------------------eccccccCcccccc Confidence 0 000 00 00000000011 23 Q ss_pred cceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccce Q lcl|NC_015279. 
201 FNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAG 280 (467) Q Consensus 201 f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~g 280 (467) |.+..|++.|..+ ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-+-...+-..+....+ T Consensus 195 ~~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~ 263 (415) T protein:vir:81 195 FFQLAYDINTHRG-------YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG 263 (415) T ss_pred eeeEEeeeeeeEe-------eehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc Confidence 5566666665544 456999999984 35789999999999999999999986553322111111111111 Q ss_pred eEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEE Q lcl|NC_015279. 281 VFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAG 360 (467) Q Consensus 281 v~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G 360 (467) +- . ..++--..+....++..+... -.+++.+||++.....|.. +....+- .-+..+.++ -..+ T Consensus 264 ~~-~--~~~~~~~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~---lkd~~G~-~l~~~~~~~-~~~~ 326 (415) T protein:vir:81 264 KK-L--EVKKAKSLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDK---MKDKLGN-YLIQPDVKE-KTQQ 326 (415) T ss_pred cc-c--ccccccchhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHH---hhccCCc-eeeccCcCC-CCCc Confidence 10 0 011111123333444444322 2345678899998888864 2221110 001111111 1234 Q ss_pred EecCceEEEecccccccchhhccCCCceEEEEE-ecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cC Q lcl|NC_015279. 361 VLQGKYRVYIDPYSSNLTSANAANGNQYYVVGY-KGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NP 438 (467) Q Consensus 361 ~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGy-KG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP 438 (467) +| .|++|++.++..... .+..-+++|- +. ......-..+. 
+...|-.+++..+....|++..+ +| T Consensus 327 ~l-~G~pV~~~~~~~~~~-----~~~~~~~~Gd~~~------~~~~~~~~~~~-v~~~~~~~~~~~~~~~~r~d~~v~~~ 393 (415) T protein:vir:81 327 RL-LGAKIEILPDEVLGQ-----KGNNTLIIGNLKD------AIVLFDRSQYQ-ASWTDYMHFGECLMIAVRQDCRILDY 393 (415) T ss_pred ee-cceeeEEecccccCC-----CCccEEEEEehhc------cEEEEeecceE-EEEeccccCceEEEEEEEeccEEecc Confidence 66 567787765532100 0001122220 10 00000000000 01123456677777778887643 44 Q ss_pred --cccc----cCcccccccccccc Q lcl|NC_015279. 439 --FAEG----TTVGAGRLRVNSNR 456 (467) Q Consensus 439 --~~~~----~~~~~~~~~~~~n~ 456 (467) |... +..+++-+ |-.+ T Consensus 394 ~a~~~~~~~~~~~~~~~~--~~~~ 415 (415) T protein:vir:81 394 KSAIVIEYDDSERGEGDL--GLEA 415 (415) T ss_pred ccEEEEEEeccCCCCCcc--ccCC Confidence 2111 11111111 0001 No 32 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=92.13 E-value=0.013 Score=30.98 Aligned_cols=340 Identities=16% Similarity=0.094 Sum_probs=129.1 Q ss_pred Ccch-------------HHHHHhhhhhh------ccCccchhcchhHHHHHH-HH-------------hhhHHHHHHHHh Q lcl|NC_015279. 1 MFQS-------------EQLQEKWAPLL------NYEGLDKISDPHRRAVTA-VL-------------LENQEKFMQEQV 47 (467) Q Consensus 1 ~~~~-------------~~l~~kw~p~l------~~~~~~~i~~~~~~~v~~-~~-------------~enq~~~~~e~~ 47 (467) ++.. +.|.++..-+- +.+...++....+..... +. ++..+..-.|.+ T Consensus 28 ~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:79 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 1111 11222221110 000000000000000000 00 000000000111 Q ss_pred hhhhcchhhhhhhhhcccccc---ccccccc-c--cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC Q lcl|NC_015279. 
48 AFEQGGMIAEQPTNAVGNGGY---TSSGGQT-V--AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST 121 (467) Q Consensus 48 ~~~~~~~~~e~~~~~~g~~~~---~st~tg~-i--~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~ 121 (467) .+.. .+.. +.... ..+..|. + ..+.+.++.+. .+..+-.+++.|.||++..+-+--.| ..+ T Consensus 108 ~~~~--~~~~------~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~ 174 (415) T protein:vir:79 108 DFTE--YLET------RNDIQGGSLKTDSGFVVIPEEIVTDILKLK---EVEFNLDKYVTVKRVTNGSGKYPVVR--QSE 174 (415) T ss_pred HHHH--HHhh------hhhhhhccccccccccccchHHHHHHHHHH---HhhhhhhhheeeeeccCCceeEEEEe--ecC Confidence 1100 0000 11110 0111111 1 12233344444 45567789999999999887554333 111 Q ss_pred CCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCC-Cc Q lcl|NC_015279. 122 QGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSG-DN 200 (467) Q Consensus 122 qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g-~~ 200 (467) . .++ .+ .++.+.....+ .. T Consensus 175 ~--~~~-------~~---------------------------------------------------v~E~~~~~~~~~~~ 194 (415) T protein:vir:79 175 V--AAL-------EK---------------------------------------------------VEELEENPELAVKP 194 (415) T ss_pred C--ccc-------ee---------------------------------------------------eccccccCcccccc Confidence 0 000 00 00000000011 23 Q ss_pred cceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccce Q lcl|NC_015279. 
201 FNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAG 280 (467) Q Consensus 201 f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~g 280 (467) |.+..|++.|..+ ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-+-...+-..+....+ T Consensus 195 ~~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~ 263 (415) T protein:vir:79 195 FFQLAYDINTHRG-------YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG 263 (415) T ss_pred eeeEEeeeeeeEe-------eehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc Confidence 5566666665544 456999999984 35789999999999999999999986553322111111111111 Q ss_pred eEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEE Q lcl|NC_015279. 281 VFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAG 360 (467) Q Consensus 281 v~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G 360 (467) +- . ..++--..+....++..+... -.+++.+||++.....|.. +....+- .-+..+.++ -..+ T Consensus 264 ~~-~--~~~~~~~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~---lkd~~G~-~l~~~~~~~-~~~~ 326 (415) T protein:vir:79 264 KK-L--EVKKAKSLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDK---MKDKLGN-YLIQPDVKE-KTQQ 326 (415) T ss_pred cc-c--ccccccchhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHH---hhccCCc-eeeccCcCC-CCCc Confidence 10 0 011111123333444444322 2345678899998888864 2221110 001111111 1234 Q ss_pred EecCceEEEecccccccchhhccCCCceEEEEE-ecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cC Q lcl|NC_015279. 361 VLQGKYRVYIDPYSSNLTSANAANGNQYYVVGY-KGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NP 438 (467) Q Consensus 361 ~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGy-KG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP 438 (467) +| .|++|++.++..... .+..-+++|- +. ......-..+. 
+...|-.+++..+....|++..+ +| T Consensus 327 ~l-~G~pV~~~~~~~~~~-----~~~~~~~~Gd~~~------~~~~~~~~~~~-v~~~~~~~~~~~~~~~~r~d~~v~~~ 393 (415) T protein:vir:79 327 RL-LGAKIEILPDEVLGQ-----KGNNTLIIGNLKD------AIVLFDRSQYQ-ASWTDYMHFGECLMIAVRQDCRILDY 393 (415) T ss_pred ee-cceeeEEecccccCC-----CCccEEEEEehhc------cEEEEeecceE-EEEeccccCceEEEEEEEeccEEecc Confidence 66 567787765532100 0001122220 10 00000000000 01123456677777778887643 44 Q ss_pred --cccc----cCcccccccccccc Q lcl|NC_015279. 439 --FAEG----TTVGAGRLRVNSNR 456 (467) Q Consensus 439 --~~~~----~~~~~~~~~~~~n~ 456 (467) |... +..+++-+ |-.+ T Consensus 394 ~a~~~~~~~~~~~~~~~~--~~~~ 415 (415) T protein:vir:79 394 KSAIVIEYDDSERGEGDL--GLEA 415 (415) T ss_pred ccEEEEEEeccCCCCCcc--ccCC Confidence 2111 11111111 0001 No 33 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=92.13 E-value=0.013 Score=30.98 Aligned_cols=340 Identities=16% Similarity=0.094 Sum_probs=129.1 Q ss_pred Ccch-------------HHHHHhhhhhh------ccCccchhcchhHHHHHH-HH-------------hhhHHHHHHHHh Q lcl|NC_015279. 1 MFQS-------------EQLQEKWAPLL------NYEGLDKISDPHRRAVTA-VL-------------LENQEKFMQEQV 47 (467) Q Consensus 1 ~~~~-------------~~l~~kw~p~l------~~~~~~~i~~~~~~~v~~-~~-------------~enq~~~~~e~~ 47 (467) ++.. +.|.++..-+- +.+...++....+..... +. ++..+..-.|.+ T Consensus 28 ~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:98 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 1111 11222221110 000000000000000000 00 000000000111 Q ss_pred hhhhcchhhhhhhhhcccccc---ccccccc-c--cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC Q lcl|NC_015279. 
48 AFEQGGMIAEQPTNAVGNGGY---TSSGGQT-V--AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST 121 (467) Q Consensus 48 ~~~~~~~~~e~~~~~~g~~~~---~st~tg~-i--~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~ 121 (467) .+.. .+.. +.... ..+..|. + ..+.+.++.+. .+..+-.+++.|.||++..+-+--.| ..+ T Consensus 108 ~~~~--~~~~------~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~ 174 (415) T protein:vir:98 108 DFTE--YLET------RNDIQGGSLKTDSGFVVIPEEIVTDILKLK---EVEFNLDKYVTVKRVTNGSGKYPVVR--QSE 174 (415) T ss_pred HHHH--HHhh------hhhhhhccccccccccccchHHHHHHHHHH---HhhhhhhhheeeeeccCCceeEEEEe--ecC Confidence 1100 0000 11110 0111111 1 12233344444 45567789999999999887554333 111 Q ss_pred CCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCC-Cc Q lcl|NC_015279. 122 QGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSG-DN 200 (467) Q Consensus 122 qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g-~~ 200 (467) . .++ .+ .++.+.....+ .. T Consensus 175 ~--~~~-------~~---------------------------------------------------v~E~~~~~~~~~~~ 194 (415) T protein:vir:98 175 V--AAL-------EK---------------------------------------------------VEELEENPELAVKP 194 (415) T ss_pred C--ccc-------ee---------------------------------------------------eccccccCcccccc Confidence 0 000 00 00000000011 23 Q ss_pred cceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccce Q lcl|NC_015279. 
201 FNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAG 280 (467) Q Consensus 201 f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~g 280 (467) |.+..|++.|..+ ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-+-...+-..+....+ T Consensus 195 ~~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~ 263 (415) T protein:vir:98 195 FFQLAYDINTHRG-------YFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG 263 (415) T ss_pred eeeEEeeeeeeEe-------eehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc Confidence 5566666665544 456999999984 35789999999999999999999986553322111111111111 Q ss_pred eEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEE Q lcl|NC_015279. 281 VFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAG 360 (467) Q Consensus 281 v~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G 360 (467) +- . ..++--..+....++..+... -.+++.+||++.....|.. +....+- .-+..+.++ -..+ T Consensus 264 ~~-~--~~~~~~~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~---lkd~~G~-~l~~~~~~~-~~~~ 326 (415) T protein:vir:98 264 KK-L--EVKKAKSLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDK---MKDKLGN-YLIQPDVKE-KTQQ 326 (415) T ss_pred cc-c--ccccccchhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHH---hhccCCc-eeeccCcCC-CCCc Confidence 10 0 011111123333444444322 2345678899998888864 2221110 001111111 1234 Q ss_pred EecCceEEEecccccccchhhccCCCceEEEEE-ecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cC Q lcl|NC_015279. 361 VLQGKYRVYIDPYSSNLTSANAANGNQYYVVGY-KGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NP 438 (467) Q Consensus 361 ~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGy-KG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP 438 (467) +| .|++|++.++..... .+..-+++|- +. ......-..+. 
+...|-.+++..+....|++..+ +| T Consensus 327 ~l-~G~pV~~~~~~~~~~-----~~~~~~~~Gd~~~------~~~~~~~~~~~-v~~~~~~~~~~~~~~~~r~d~~v~~~ 393 (415) T protein:vir:98 327 RL-LGAKIEILPDEVLGQ-----KGNNTLIIGNLKD------AIVLFDRSQYQ-ASWTDYMHFGECLMIAVRQDCRILDY 393 (415) T ss_pred ee-cceeeEEecccccCC-----CCccEEEEEehhc------cEEEEeecceE-EEEeccccCceEEEEEEEeccEEecc Confidence 66 567787765532100 0001122220 10 00000000000 01123456677777778887643 44 Q ss_pred --cccc----cCcccccccccccc Q lcl|NC_015279. 439 --FAEG----TTVGAGRLRVNSNR 456 (467) Q Consensus 439 --~~~~----~~~~~~~~~~~~n~ 456 (467) |... +..+++-+ |-.+ T Consensus 394 ~a~~~~~~~~~~~~~~~~--~~~~ 415 (415) T protein:vir:98 394 KSAIVIEYDDSERGEGDL--GLEA 415 (415) T ss_pred ccEEEEEEeccCCCCCcc--ccCC Confidence 2111 11111111 0001 No 34 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=91.63 E-value=0.015 Score=30.59 Aligned_cols=309 Identities=10% Similarity=0.007 Sum_probs=121.0 Q ss_pred HHHHhhhHHHHHHHHhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCcccee Q lcl|NC_015279. 32 TAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGL 111 (467) Q Consensus 32 ~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGL 111 (467) |.+--+++....+.......+. ..+.....++.+++..--..+.-.+++......+..+++.+.||++.+.- T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~--------~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 72 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQ--------VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKK 72 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhh--------hccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 2221111111111111100111 11111111222211111111112233333455567888899998876532 Q ss_pred eeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhH Q lcl|NC_015279. 
112 IFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEA 191 (467) Q Consensus 112 IFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~a 191 (467) |. ++.+ +.++ .+ .+ T Consensus 73 ~p----~~~~--~~~a-------~~---------------------------------------------------v~-- 86 (324) T protein:vir:99 73 FT----FWAD--KPGA-------YW---------------------------------------------------VG-- 86 (324) T ss_pred EE----EEec--Ccce-------eE---------------------------------------------------ec-- Confidence 21 1110 0000 00 00 Q ss_pred hhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccc Q lcl|NC_015279. 192 EDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQG 271 (467) Q Consensus 192 E~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~ 271 (467) | +..+++...++++++.+.|.-+---..|-||.+|-. .|.+++|.+.|+..|...+++.||.--- T Consensus 87 E-----g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g------ 151 (324) T protein:vir:99 87 E-----GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG------ 151 (324) T ss_pred c-----CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCC------ Confidence 1 122344444555555555555556678999999974 4689999999999999999999984211 Q ss_pred cccccccceeEEeec----cccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccc Q lcl|NC_015279. 272 AVSNTATAGVFDLDI----DSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNA 347 (467) Q Consensus 272 k~~~~~~~gv~Dl~~----~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~ 347 (467) .+....|++.... ...+.-..+....++..+ .........+|++|.....|... ....+ . 
T Consensus 152 --~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l---------~~~~~~~~~~v~n~~~~~~L~~l---~d~~g--~ 215 (324) T protein:vir:99 152 --NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALL---------EDDELEANAFISKTQNRSLLRKI---VDPET--K 215 (324) T ss_pred --CCccCccccccccccceeccccCCHHHHHHHHHhh---------hhccCCCCEEEEcHHHHHHHHHh---hcCCC--c Confidence 1111122221110 001111123333333333 12334556789999999888642 21111 1 Q ss_pred ccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccce--eEecccchhhcccccCCccccce Q lcl|NC_015279. 348 NLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAG--LFYCPYVPLQMVRAVGENTFQPK 425 (467) Q Consensus 348 ~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~g--lfyaPYv~l~~~~~~Dp~s~qP~ 425 (467) .+-.+.. .++| .|++|++.+...........-.+.++++|..+.-..+-+ .+...+...+....-.-.+-|=. T Consensus 216 ~~~~~~~----~~~l-~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~ 290 (324) T protein:vir:99 216 ERIYDRN----SDTL-DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred eeecCCC----Cccc-cceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEE Confidence 1111111 2456 447888776542110000000112223344332221100 00011100000000000011222 Q ss_pred eeeeeeecee-ecC--ccc------ccCcccccc Q lcl|NC_015279. 426 IGFKTRYGMV-ANP--FAE------GTTVGAGRL 450 (467) Q Consensus 426 ~g~~tRY~l~-~nP--~~~------~~~~~~~~~ 450 (467) +=...|++.. .|| |+. 
+++..++.+ T Consensus 291 ~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 291 LRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred EEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 2234566633 344 211 233334444 No 35 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=90.60 E-value=0.02 Score=29.90 Aligned_cols=305 Identities=12% Similarity=0.076 Sum_probs=124.8 Q ss_pred HhhhHHHHHHHHhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeee Q lcl|NC_015279. 35 LLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFA 114 (467) Q Consensus 35 ~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFA 114 (467) ..++| +-..|-+.+... +.+ ....+.....++.++...--....-.+++....+.+..+++.+-||++++--|.- T Consensus 1 ~~~~~-~~~~~~~~f~~~--~~~--~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) T protein:vir:93 1 MEQTQ-KLKLNLQHFASN--NVK--PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF 75 (324) T ss_pred CchhH-HHHHHHHHHHHh--hhh--hhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEE Confidence 22222 111111111111 111 1111111111222222111122222345555567788899999999887543321 Q ss_pred eeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhc Q lcl|NC_015279. 
115 MRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDL 194 (467) Q Consensus 115 MRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~L 194 (467) ..+ +.++ .| .+ | T Consensus 76 ----~~~--~~~a-------~~---------------------------------------------------v~--E-- 87 (324) T protein:vir:93 76 ----WAD--KPGA-------YW---------------------------------------------------VG--E-- 87 (324) T ss_pred ----Eec--Ccce-------ee---------------------------------------------------ec--C-- Confidence 100 0000 00 01 1 Q ss_pred CCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccc Q lcl|NC_015279. 195 GTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVS 274 (467) Q Consensus 195 Gs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~ 274 (467) +..+++..-++++++++.|..+-....|-||.+|-. .|.+++|.+-|+..|...+++.+|.---+ T Consensus 88 ---g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~-------- 152 (324) T protein:vir:93 88 ---GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN-------- 152 (324) T ss_pred ---CccccccccceeEEEEEeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCC-------- Confidence 122333333445555555555666778999999953 46889999999999999999988743211 Q ss_pred ccccceeEEeeccc----cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccc Q lcl|NC_015279. 275 NTATAGVFDLDIDS----NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLN 350 (467) Q Consensus 275 ~~~~~gv~Dl~~~~----~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~ 350 (467) +....|+++..... .+.-..+....++.+++. ..+....++|+|.....|... 
....+ ..+- T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~---------~~~~~~~~v~n~~~~~~L~~l---~d~~G--~~~~ 218 (324) T protein:vir:93 153 NPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLED---------DELEANAFISKTQNRSLLRKI---VDPET--KERI 218 (324) T ss_pred CCcCccccccccccceeccccccHHHHHHHHHhhhh---------ccCCCCEEEEcHHHHHHHHHh---hCCCC--Ceee Confidence 11112222221110 111112233333333321 234456789999999888642 22111 1111 Q ss_pred cccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCC------ccccc Q lcl|NC_015279. 351 VDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGE------NTFQP 424 (467) Q Consensus 351 ~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp------~s~qP 424 (467) .+.. .+.| .|++|++.+.........+.-.+.++++|..+..+.+-. .+..+......|. ..-|= T Consensus 219 ~~~~----~~~l-~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~----~~~~~~~~~~~~~~~~~~f~~n~~ 289 (324) T protein:vir:93 219 YDRN----SDSL-DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKID----ETAQLSTVKNEDGTPVNLFEQDMV 289 (324) T ss_pred cCCC----CCcc-cceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEe----ecccccccccccccchhhhhcCcE Confidence 1111 2355 457888765432110000001122344555544332211 1100000000000 01122 Q ss_pred eeeeeeeeceee-cC--ccc------ccCcccccc Q lcl|NC_015279. 425 KIGFKTRYGMVA-NP--FAE------GTTVGAGRL 450 (467) Q Consensus 425 ~~g~~tRY~l~~-nP--~~~------~~~~~~~~~ 450 (467) .+=...|||..+ +| |+. 
+++..|+.+ T Consensus 290 ~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 290 ALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred EEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 333344555432 33 111 233344444 No 36 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=89.78 E-value=0.024 Score=29.42 Aligned_cols=329 Identities=13% Similarity=0.060 Sum_probs=121.2 Q ss_pred CcchHHHHHhhhhhhccCcc--------------chhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhcc-- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGL--------------DKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVG-- 64 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~--------------~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g-- 64 (467) +-.-+.|.++...+-+.... ++......+......-+.+..++++-.....++.+.+....... T Consensus 39 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 118 (397) T protein:vir:12 39 LDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSP 118 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhh Confidence 11111222222111100000 00000000000000000111112211111112222221110000 Q ss_pred ---cccccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccccc Q lcl|NC_015279. 65 ---NGGYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAG 138 (467) Q Consensus 65 ---~~~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg 138 (467) +....++.+|.+ ..+.+.++.+.| +..+..+++.+.||+++.|-+--.|.. 
++..+ .+- T Consensus 119 ~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~a-------~~v- 183 (397) T protein:vir:12 119 EFRAMSGINDEDGGILIPEDIGRQIHEFKR---QFEPLEQYVTVEPVTTRSGTRLLEKNA----DMVPF-------SPV- 183 (397) T ss_pred hhhhccccccccCcccCchhHHHHHHHhhh---hhhhHHhhcceeeccCCceeEEEEEec----CCcce-------eee- Confidence 000111122222 223344444554 566778999999999988754322200 00000 000 Q ss_pred ccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CCCccceeeeEEEEEEEEeec Q lcl|NC_015279. 139 QNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SGDNFNEMAFSIEKVTVTAKS 217 (467) Q Consensus 139 ~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g~~f~EMaFsIEK~tVtAKS 217 (467) ++++.... +...|.++.|+..|..+- T Consensus 184 --------------------------------------------------~Eg~~~~~~~~~~~~~v~~~~~k~~~~--- 210 (397) T protein:vir:12 184 --------------------------------------------------EELGNLPEIDQPRFTKVSYSIIDYGGI--- 210 (397) T ss_pred --------------------------------------------------cccccccccccccceeEEeeheeeEee--- Confidence 00000000 113467777777777665 Q ss_pred ccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhHHHHH Q lcl|NC_015279. 218 RALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKF 297 (467) Q Consensus 218 RaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~ 297 (467) ..+|-||.+|-- +|.++.|.+.|...|...+|+-|+.-.-+ ....|+..++ .. T Consensus 211 ----~~is~e~l~ds~----~~l~~~i~~~l~~~~~~~~d~~il~G~g~---------~~~~g~~~~~----------~i 263 (397) T protein:vir:12 211 ----MTLSNSMLNDSD----QAIMTYVAKWFAKKSVVTRNNLILAAIAS---------LKKVDIDGLD----------GI 263 (397) T ss_pred ----ehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHHhcccc---------ccccccccHH----------HH Confidence 448999998854 46788999999999999998888743211 1234444321 11 Q ss_pred HHHHH-HHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEecccccc Q lcl|NC_015279. 
298 KGLLF-QIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSN 376 (467) Q Consensus 298 ~~l~~-~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~ 376 (467) ..++. .+. . -...+..++|+|.....|... ....+- .-+..+.++ -..++| .|++|++.+...- T Consensus 264 ~~~~~~~l~-~--------~~~~~a~~~~n~~~~~~L~~l---kd~~G~-~l~~~~~~~-g~~~~l-~G~pv~~~~~~~~ 328 (397) T protein:vir:12 264 KKALNVTLD-P--------MVAPGSIVLTNQDGYDWLDTL---KDGTGR-YLLQPDPTN-PTKKLL-DGRPVVPFTNRVL 328 (397) T ss_pred HHHHhhccc-h--------hhhCCCEEEEcHHHHHHHHHh---hccCCc-eeecccccC-CCCccc-cceeeEEeccccc Confidence 22221 221 1 112335578899888877542 211110 001111111 122466 4568775433210 Q ss_pred cch---hhcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cCcccccCcccccc Q lcl|NC_015279. 377 LTS---ANAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NPFAEGTTVGAGRL 450 (467) Q Consensus 377 ~~~---~~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~ 450 (467) ... ..++ ...+|++++.+.....+. .++. ..+-.+-+-.+-...|++..+ ||-+-.. T Consensus 329 ~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~----~~~~------~~~f~~~~~~~r~~~r~d~~~~~~~a~~~------- 391 (397) T protein:vir:12 329 KTQKGKAPLIIGNLKEAIVLFDREQQSIAS----TDTG------AGAFETNSTKVRGIEREDVRKWDEDAVVF------- 391 (397) T ss_pred ccCCCccEEEEEehhceEEEEeecceEEEE----eccc------cchhhcCceEEEEEEeeccEEecccceEE------- Confidence 000 0000 112334333332211110 0000 000112233444555555432 3311100 Q ss_pred ccccccccceeeee Q lcl|NC_015279. 
451 RVNSNRYYRRVAVK 464 (467) Q Consensus 451 ~~~~n~y~r~~~v~ 464 (467) =++-++ T Consensus 392 --------~~~t~~ 397 (397) T protein:vir:12 392 --------GQITVE 397 (397) T ss_pred --------EEEeeC Confidence 001111 No 37 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=88.92 E-value=0.029 Score=28.98 Aligned_cols=346 Identities=14% Similarity=0.082 Sum_probs=128.2 Q ss_pred Cc-------------chHHHHHhhhhhhcc----------Ccc----chh------cchhHHHHHHHHhhhHHHHHHHHh Q lcl|NC_015279. 1 MF-------------QSEQLQEKWAPLLNY----------EGL----DKI------SDPHRRAVTAVLLENQEKFMQEQV 47 (467) Q Consensus 1 ~~-------------~~~~l~~kw~p~l~~----------~~~----~~i------~~~~~~~v~~~~~enq~~~~~e~~ 47 (467) +. .-+.|.++..-+-+. ... .+. .+...+......+.+......|.+ T Consensus 28 ~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:47 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH Confidence 11 111122222111000 000 000 000000000011111100111111 Q ss_pred hhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcc Q lcl|NC_015279. 48 AFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEA 127 (467) Q Consensus 48 ~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEA 127 (467) .+... +... +.... ...++..|...--....-.+++...+...-.+++.+.||+++++-+.-.+ .. 
.+.++ T Consensus 108 ~~~~~--~~~~--~~~~~-~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~--~~~~~ 178 (415) T protein:vir:47 108 DFTEY--LETR--NDIQG-GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QS--EVAAL 178 (415) T ss_pred HHHHH--Hhhh--hhhhh-ccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEE--ec--CCcce Confidence 11110 0000 00000 00111122111111111234455556678889999999999887553332 10 00000 Q ss_pred cccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceee-e Q lcl|NC_015279. 128 LFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMA-F 206 (467) Q Consensus 128 lfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMa-F 206 (467) .+ .+ | +..+++.+ - T Consensus 179 -------~~---------------------------------------------------v~--E-----g~~~~~~~~~ 193 (415) T protein:vir:47 179 -------EK---------------------------------------------------VE--E-----LEENPELAVK 193 (415) T ss_pred -------ee---------------------------------------------------cc--c-----cccccccccc Confidence 00 00 0 11222222 2 Q ss_pred EEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccc-ceeEEee Q lcl|NC_015279. 207 SIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTAT-AGVFDLD 285 (467) Q Consensus 207 sIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~-~gv~Dl~ 285 (467) ++++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+-...+....... .... . T Consensus 194 ~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~--~ 267 (415) T protein:vir:47 194 PFFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL--E 267 (415) T ss_pred ceeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee--c Confidence 344555555555555689999999843 57889999999999999999999865433222111111000 0111 0 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCc Q lcl|NC_015279. 
286 IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGK 365 (467) Q Consensus 286 ~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~ 365 (467) . .+--..+....++.++.. -.++.+.+|++|.....|.. +....+- .-+..+.++ -..++| .| T Consensus 268 ~--~~~~~~~~i~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~---lkd~~G~-~i~~~~~~~-~~~~~l-~G 330 (415) T protein:vir:47 268 V--KKAKSLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDK---MKDKLGN-YLIQPDVKE-KTQQRL-LG 330 (415) T ss_pred c--ccccchHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHH---hhccCCC-eeeccCcCC-CCCccc-cc Confidence 0 111112233344444432 23456788999998888854 2221110 001112111 113466 55 Q ss_pred eEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeecee-ecC--cccc Q lcl|NC_015279. 366 YRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMV-ANP--FAEG 442 (467) Q Consensus 366 ~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--~~~~ 442 (467) ++|++.++..... .+..-+++|-- . +. +.......+ .+...|-.++|-.+-...|++.. .+| |... T Consensus 331 ~pV~~~~~~~~~~-----~~~~~~~~gd~---~-~~-~~~~~~~~~-~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~ 399 (415) T protein:vir:47 331 AKIEILPDEVLGQ-----KGNNTLIIGNL---K-DA-IVLFDRSQY-QASWTDYMHFGECLMIAVRQDCRILDYKSAIVI 399 (415) T ss_pred eeeEEeccccccC-----CCccEEEEEeh---h-cc-EEEEeecce-EEEeeccccCceEEEEEEEeccEEeccccEEEE Confidence 6776654432100 00111222200 0 00 000000000 00112334556666677788764 344 2111 Q ss_pred ----cCcccccccccccc Q lcl|NC_015279. 
443 ----TTVGAGRLRVNSNR 456 (467) Q Consensus 443 ----~~~~~~~~~~~~n~ 456 (467) ...+++-+ |-.+ T Consensus 400 ~~~~~~~~~~~~--~~~~ 415 (415) T protein:vir:47 400 EYDDSERGEGDL--GLEA 415 (415) T ss_pred EeeccCCCCCCc--cCCC Confidence 11111111 0001 No 38 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=88.92 E-value=0.029 Score=28.98 Aligned_cols=346 Identities=14% Similarity=0.082 Sum_probs=128.2 Q ss_pred Cc-------------chHHHHHhhhhhhcc----------Ccc----chh------cchhHHHHHHHHhhhHHHHHHHHh Q lcl|NC_015279. 1 MF-------------QSEQLQEKWAPLLNY----------EGL----DKI------SDPHRRAVTAVLLENQEKFMQEQV 47 (467) Q Consensus 1 ~~-------------~~~~l~~kw~p~l~~----------~~~----~~i------~~~~~~~v~~~~~enq~~~~~e~~ 47 (467) +. .-+.|.++..-+-+. ... .+. .+...+......+.+......|.+ T Consensus 28 ~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:46 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH Confidence 11 111122222111000 000 000 000000000011111100111111 Q ss_pred hhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcc Q lcl|NC_015279. 48 AFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEA 127 (467) Q Consensus 48 ~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEA 127 (467) .+... +... +.... ...++..|...--....-.+++...+...-.+++.+.||+++++-+.-.+ .. 
.+.++ T Consensus 108 ~~~~~--~~~~--~~~~~-~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~--~~~~~ 178 (415) T protein:vir:46 108 DFTEY--LETR--NDIQG-GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QS--EVAAL 178 (415) T ss_pred HHHHH--Hhhh--hhhhh-ccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEE--ec--CCcce Confidence 11110 0000 00000 00111122111111111234455556678889999999999887553332 10 00000 Q ss_pred cccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceee-e Q lcl|NC_015279. 128 LFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMA-F 206 (467) Q Consensus 128 lfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMa-F 206 (467) .+ .+ | +..+++.+ - T Consensus 179 -------~~---------------------------------------------------v~--E-----g~~~~~~~~~ 193 (415) T protein:vir:46 179 -------EK---------------------------------------------------VE--E-----LEENPELAVK 193 (415) T ss_pred -------ee---------------------------------------------------cc--c-----cccccccccc Confidence 00 00 0 11222222 2 Q ss_pred EEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccc-ceeEEee Q lcl|NC_015279. 207 SIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTAT-AGVFDLD 285 (467) Q Consensus 207 sIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~-~gv~Dl~ 285 (467) ++++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+-...+....... .... . T Consensus 194 ~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~--~ 267 (415) T protein:vir:46 194 PFFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL--E 267 (415) T ss_pred ceeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee--c Confidence 344555555555555689999999843 57889999999999999999999865433222111111000 0111 0 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCc Q lcl|NC_015279. 
286 IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGK 365 (467) Q Consensus 286 ~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~ 365 (467) . .+--..+....++.++.. -.++.+.+|++|.....|.. +....+- .-+..+.++ -..++| .| T Consensus 268 ~--~~~~~~~~i~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~---lkd~~G~-~i~~~~~~~-~~~~~l-~G 330 (415) T protein:vir:46 268 V--KKAKSLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDK---MKDKLGN-YLIQPDVKE-KTQQRL-LG 330 (415) T ss_pred c--ccccchHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHH---hhccCCC-eeeccCcCC-CCCccc-cc Confidence 0 111112233344444432 23456788999998888854 2221110 001112111 113466 55 Q ss_pred eEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeecee-ecC--cccc Q lcl|NC_015279. 366 YRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMV-ANP--FAEG 442 (467) Q Consensus 366 ~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--~~~~ 442 (467) ++|++.++..... .+..-+++|-- . +. +.......+ .+...|-.++|-.+-...|++.. .+| |... T Consensus 331 ~pV~~~~~~~~~~-----~~~~~~~~gd~---~-~~-~~~~~~~~~-~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~ 399 (415) T protein:vir:46 331 AKIEILPDEVLGQ-----KGNNTLIIGNL---K-DA-IVLFDRSQY-QASWTDYMHFGECLMIAVRQDCRILDYKSAIVI 399 (415) T ss_pred eeeEEeccccccC-----CCccEEEEEeh---h-cc-EEEEeecce-EEEeeccccCceEEEEEEEeccEEeccccEEEE Confidence 6776654432100 00111222200 0 00 000000000 00112334556666677788764 344 2111 Q ss_pred ----cCcccccccccccc Q lcl|NC_015279. 
443 ----TTVGAGRLRVNSNR 456 (467) Q Consensus 443 ----~~~~~~~~~~~~n~ 456 (467) ...+++-+ |-.+ T Consensus 400 ~~~~~~~~~~~~--~~~~ 415 (415) T protein:vir:46 400 EYDDSERGEGDL--GLEA 415 (415) T ss_pred EeeccCCCCCCc--cCCC Confidence 11111111 0001 No 39 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=88.70 E-value=0.031 Score=28.88 Aligned_cols=339 Identities=10% Similarity=0.051 Sum_probs=123.2 Q ss_pred CcchHHHHHhhhhh-----------hccCccch--hcchhHHHHHHHHhhhHHH-------HHHHHhhhhhcchhh---- Q lcl|NC_015279. 1 MFQSEQLQEKWAPL-----------LNYEGLDK--ISDPHRRAVTAVLLENQEK-------FMQEQVAFEQGGMIA---- 56 (467) Q Consensus 1 ~~~~~~l~~kw~p~-----------l~~~~~~~--i~~~~~~~v~~~~~enq~~-------~~~e~~~~~~~~~~~---- 56 (467) -++.+.-.++..-. ..+++..+ +....++......+.+... ...|.+ ..+..++. T Consensus 58 ~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r-~a~~~~l~~~~~ 136 (434) T protein:vir:62 58 KLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIR-SVFANYIVGNID 136 (434) T ss_pred HHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHH-HHHHHHhccccc Confidence 01111111222111 11111111 1111112212222111111 111111 11111111 Q ss_pred hhhhhhcccccccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccc Q lcl|NC_015279. 57 EQPTNAVGNGGYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEAD 133 (467) Q Consensus 57 e~~~~~~g~~~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEad 133 (467) +.+..+++. .|..|.. ..+... +++..-+..+...++-|.|++|..- |- ++.... . 
T Consensus 137 ~~e~~a~~~----~t~~GG~lvP~~~~~~---Ii~~l~~~~~i~~~~~~~~~~~~~~--~p---~~~~~~--~------- 195 (434) T protein:vir:62 137 EKEARALGL----VTGNGSVTIPDFLSKE---IITYAQEENFLRRLGTGVKTKENIK--YP---VLVKKA--E------- 195 (434) T ss_pred hhhhhhhcc----cccccceecchhhHHH---HHHhhhhhhhhhhhcceeccCCceE--EE---EEecCC--c------- Confidence 111111110 1111111 112222 4444445667778888888765311 11 010000 0 Q ss_pred cccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEE Q lcl|NC_015279. 134 TAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTV 213 (467) Q Consensus 134 t~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tV 213 (467) +.. .. ..+ .+...++-..++++++. T Consensus 196 -------a~~-----------------~~--------------------------~~~-----e~~~~~~~~~~f~~v~~ 220 (434) T protein:vir:62 196 -------AQG-----------------HK--------------------------NER-----TNNEMPETDIEFDEIEL 220 (434) T ss_pred -------ccc-----------------ee--------------------------ccc-----ccccccccccceeeEEe Confidence 000 00 000 01122222234555666 Q ss_pred EeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhH Q lcl|NC_015279. 214 TAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWS 293 (467) Q Consensus 214 tAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ 293 (467) .+|.-+-...+|-||.+|- .+|.+++|.+-|+..|..-+++.||.--=+. ....++.......+...... . T Consensus 221 ~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~---~~~~g~~~~~~~~~~~~~~~--~ 291 (434) T protein:vir:62 221 SPTEFDALATVTKKLLART----GLPIEQIVMDELKKAYVRKETQYMVNGDEAN---NINDGALAKKAVEFKTDEKN--L 291 (434) T ss_pred eheeeEeehhhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---ccccceeecccccccccccc--h Confidence 6666666678999999995 3578999999999999999999888311000 00001111111111111111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc-ccccCCc-eeEEEecCceEEEec Q lcl|NC_015279. 
294 VEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGN-TFAGVLQGKYRVYID 371 (467) Q Consensus 294 ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~-~~d~t~~-~~~G~l~~~~~vy~D 371 (467) .+....|.+.+.. --+..+ ..|+++.....|.. |....+ ..+ ..+.... -.-.+| .|++|+++ T Consensus 292 ~d~l~~l~~~l~~--------~~~~~a-~~v~n~~~~~~L~~---lkd~~G--~~l~~~~~~~~~g~~~tl-~G~pV~~~ 356 (434) T protein:vir:62 292 YDALVKMKNTPVK--------EVRKKA-RWVLNTAALTKIET---MKTDDG--FPLLRPFNQAEGGIGYTL-LGFPVEEE 356 (434) T ss_pred hhHHHHHHhhcch--------hhhcCC-EEEEcHHHHHHHHH---hhccCC--CEeeccCCCccCCCCcee-cceeeEEe Confidence 1222334333322 123333 45778888877754 221111 011 0000000 011246 46888887 Q ss_pred ccccccch--hh-cc--CCCceEEEEEecCCCccceeEecccchhhcccccCCc--cccceeeeeeee-ceee-cCcccc Q lcl|NC_015279. 372 PYSSNLTS--AN-AA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGEN--TFQPKIGFKTRY-GMVA-NPFAEG 442 (467) Q Consensus 372 ~y~~~~~~--~~-~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~--s~qP~~g~~tRY-~l~~-nP~~~~ 442 (467) .+...... .. +. .-.+|+++-.+|....+ +..++- .-|=.+..+.|. |..+ .|++.. T Consensus 357 ~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~--------------~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~ 422 (434) T protein:vir:62 357 DAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQ--------------KLVELFSRTNRVGFRIWNLLDAQLIHSPFEVP 422 (434) T ss_pred cCccCccCCCceEEEEeeccceEEEEeeceeEEE--------------eehhhhcccCceEEEEEeeecceeecCcccce Confidence 66421100 00 00 11233333233322221 112221 222234455666 4434 377654 Q ss_pred cCcccccccccc Q lcl|NC_015279. 443 TTVGAGRLRVNS 454 (467) Q Consensus 443 ~~~~~~~~~~~~ 454 (467) .-..+...-.+. 
T Consensus 423 ~~~~~~~~~~~~ 434 (434) T protein:vir:62 423 VYKYVLKAPTGA 434 (434) T ss_pred EEEEEeccCCCC Confidence 321111111111 No 40 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=88.02 E-value=0.035 Score=28.57 Aligned_cols=293 Identities=12% Similarity=0.032 Sum_probs=107.4 Q ss_pred CCccceeeeeeeeeecCCC--CCcccccccccccccccccccccccccccccccCCCccccccccccccccccccccccc Q lcl|NC_015279. 105 MSGPTGLIFAMRSKYSTQG--GTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNV 182 (467) Q Consensus 105 mTGPTGLIFAMRsrY~~qs--GtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~ 182 (467) |+|++ +|+...... +...+..+--..+- ......+.-..+..........+.+ T Consensus 1 m~~~~-----~~a~~~~~t~~~g~~i~~~~~~~ii--------------------~~~~~~s~l~~~~~~~~~~~~~~~~ 55 (330) T protein:vir:77 1 MAGST-----VPSTQVALTGDFSAFLTPEQSQDYF--------------------AEIEKTSIVQRIARKVPMGPTGISI 55 (330) T ss_pred Ccccc-----cchhhccccCCCcceechhHHHHHH--------------------HHHHhccchhhhcceeeccCCceEE Confidence 33332 121111110 00000000000000 0000000000000000000000000 Q ss_pred ccccchhhHhh-cCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHH Q lcl|NC_015279. 183 GQGMRTDEAED-LGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVI 261 (467) Q Consensus 183 ~~Gm~TA~aE~-LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII 261 (467) .. 
.+...++ ...++..+++-..++++++...|..+-...+|-||.+|- ..|.|++|.+-|+..|...||+-+| T Consensus 56 p~--~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai~~~~~~~~l 129 (330) T protein:vir:77 56 PH--WTGAVSASWTGEAERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAIALKFDAAAI 129 (330) T ss_pred EE--EcCCcceeEecCCCccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhh Confidence 00 0000010 012345677777788888888888888889999999984 4689999999999999999999888 Q ss_pred HH---------HhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHH Q lcl|NC_015279. 262 RT---------IYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASA 332 (467) Q Consensus 262 ~~---------l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~ 332 (467) .- |...+... ..+......+..... ....+.+..++.++ .+ .....+.+||+|..... T Consensus 130 ~G~g~~~~~~g~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~~l~~~~~~~--------~~-~~~~~~~~vmn~~~~~~ 196 (330) T protein:vir:77 130 HGIDKPSAFKGYLAETTKV--VSLADTNLTTASGPQ--GNAYLAVNNALSLL--------VN-SGKKWTGTLLDNVTEPI 196 (330) T ss_pred cccCCCCcccccccccccc--ceeeccccccccccc--chhHHHHHHHHHhh--------hh-cCCCccEEEEcHHHHHH Confidence 31 11111000 000001111111100 00011111221222 11 22344568999999888 Q ss_pred Hhhhcchhccccc---ccccccccCCceeEEEecCceEEEecccccccchh----hccCCCceEEEEEecCCCcc----c Q lcl|NC_015279. 333 LTMAGVLDYTPAL---NANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSA----NAANGNQYYVVGYKGTSPYD----A 401 (467) Q Consensus 333 L~~sG~~~~~~~~---~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~----~~~~~~dY~~vGyKG~~~~d----~ 401 (467) |.. +....+- ..............++|. |++||+..+....... 
...-.+.++++|-.+..+.+ + T Consensus 197 l~~---lkd~~G~~l~~~~~~~~~~~~~~~~~l~-G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~ 272 (330) T protein:vir:77 197 LNT---AVDGNGRPLFVESTYTEQVGAIREGRIL-GRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQA 272 (330) T ss_pred HHH---HhccCCceeecCccccccccccCCceec-ceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecc Confidence 864 2211110 000001111112234663 5899998775321100 00001122334544333321 1 Q ss_pred eeEec--ccchhhcccccCCccc---cceeeeeeeecee-ecC--cc------cccCcccc Q lcl|NC_015279. 402 GLFYC--PYVPLQMVRAVGENTF---QPKIGFKTRYGMV-ANP--FA------EGTTVGAG 448 (467) Q Consensus 402 glfya--PYv~l~~~~~~Dp~s~---qP~~g~~tRY~l~-~nP--~~------~~~~~~~~ 448 (467) .+.+. .|... ...+-+-| +=.+=...|++.. .+| |+ -+.+.++. T Consensus 273 ~~~~~~~~~~~~---~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 273 TLDFGEEQGGVW---VPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred eeeecccccccc---cccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 11111 00000 00000001 1111222344432 233 21 12333333 No 41 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=88.00 E-value=0.035 Score=28.56 Aligned_cols=336 Identities=13% Similarity=0.050 Sum_probs=126.5 Q ss_pred cchHHHHHhhhhhhccC-cc-chhc----c----hhH---HHH---HHH---HhhhHH--HHHHHHhhhhhcchh----- Q lcl|NC_015279. 2 FQSEQLQEKWAPLLNYE-GL-DKIS----D----PHR---RAV---TAV---LLENQE--KFMQEQVAFEQGGMI----- 55 (467) Q Consensus 2 ~~~~~l~~kw~p~l~~~-~~-~~i~----~----~~~---~~v---~~~---~~enq~--~~~~e~~~~~~~~~~----- 55 (467) -+-++|+++|.-+.+.- ++ .++. + .-+ ..+ -.. +.+.++ +...++......... 
T Consensus 1 M~~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (395) T protein:vir:38 1 MNINQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKP 80 (395) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 23355666555442110 00 0000 0 000 001 000 000000 000000000000000 Q ss_pred -------------hhhhhhhcccc---cccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeee Q lcl|NC_015279. 56 -------------AEQPTNAVGNG---GYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMR 116 (467) Q Consensus 56 -------------~e~~~~~~g~~---~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMR 116 (467) ...-.+..... +..++++|.+ ..+.+.++.+.| +..+..+++.+.||++++|-+-=.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~ 157 (395) T protein:vir:38 81 LPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTR---SFTSLESLANVENVTTSHGSRVYEK 157 (395) T ss_pred cchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHH---hhcchhhhcceeeccCCcceEEEEe Confidence 00000000000 0011112221 122334444444 5567888999999999988642111 Q ss_pred eeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC Q lcl|NC_015279. 117 SKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT 196 (467) Q Consensus 117 srY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs 196 (467) -.+.++ .+ . ..++++.... T Consensus 158 --~~~~~~-~a-------~---------------------------------------------------~v~E~~~~~~ 176 (395) T protein:vir:38 158 --LADITP-LK-------D---------------------------------------------------LDDESALIGD 176 (395) T ss_pred --eccCCc-cc-------c---------------------------------------------------cccccccccc Confidence 000000 00 0 0000011111 Q ss_pred -CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 
197 -SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 197 -~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) ....|.+..|+..|..+- ..+|-||.+|- +.|-++.|.+-|+..|..-||+.|+.-.- .+ T Consensus 177 ~~~~~f~~v~~~~~k~~~~-------~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~g--------~~ 237 (395) T protein:vir:38 177 NDDPELTVVKYLIHRYAGI-------TTVTNTLLKDT----VDNIIQWLVNWAAKKDVVTRNAKILEVMG--------KA 237 (395) T ss_pred ccccceeeEEeeeeeeEee-------hhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccc--------cc Confidence 113466666666666554 45999999993 35678889999988888888888874211 11 Q ss_pred cccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCC Q lcl|NC_015279. 276 TATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTG 355 (467) Q Consensus 276 ~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~ 355 (467) ....|..++ .....+++. ....--+. ...+||+|.....|.. +....+ ..+-..+.. T Consensus 238 ~~~~~~~~~----------~~i~~~~~~-------~l~~~~~~-~a~~v~n~~~~~~L~~---lkd~~G--~~l~~~~~~ 294 (395) T protein:vir:38 238 PKKPTISQF----------DNIKDLENN-------TLDPAIES-TSSFITNQSGYNILSK---VKDADG--RYLMQPDVT 294 (395) T ss_pred ccccccccH----------HHHHHHHHH-------hhhhhhcC-CCEEEEcHHHHHHHHH---hhccCC--ceeeccCcC Confidence 112222221 111222221 11111222 2457899999888854 222111 111111111 Q ss_pred ceeEEEecCceEEEecccccccch---hhcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeee Q lcl|NC_015279. 356 NTFAGVLQGKYRVYIDPYSSNLTS---ANAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKT 430 (467) Q Consensus 356 ~~~~G~l~~~~~vy~D~y~~~~~~---~~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~t 430 (467) .-..++| .|++|++......... +.++ ...++++++.++... +=+.++.. .+-..-+=.+-+.. 
T Consensus 295 ~~~~~~l-~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~----i~~~~~~~------~~~~~~~~~~r~~~ 363 (395) T protein:vir:38 295 SPDKYLI-DGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQ----IDTTNVGA------GSFEHDTTKLRFID 363 (395) T ss_pred CCCccee-ccceeEEecccccCcCCCcceEEEEeccccEEEEEecceE----EEEecccc------chhhcCceEEEEEE Confidence 1223466 4677776543211100 0000 011233333332111 11111110 01122233444555 Q ss_pred eeceee-cC--c-----ccccCcccccccccc Q lcl|NC_015279. 431 RYGMVA-NP--F-----AEGTTVGAGRLRVNS 454 (467) Q Consensus 431 RY~l~~-nP--~-----~~~~~~~~~~~~~~~ 454 (467) ||+..+ +| | +...++.++..-.|+ T Consensus 364 r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:38 364 RFDVQLIDDGAFAAASFKTVANQAQGTAGTGK 395 (395) T ss_pred eeccEEecccceEEEEeecccCCCCCccCCCC Confidence 665443 23 2 222344444444555 No 42 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=87.15 E-value=0.041 Score=28.21 Aligned_cols=293 Identities=11% Similarity=0.068 Sum_probs=113.6 Q ss_pred HHHHhhhhhcchhhhhh-hhhcccccccccccccccccCchhh-hhHHHHHhhhhhhhceeeccCCccceeeeeeeeeec Q lcl|NC_015279. 43 MQEQVAFEQGGMIAEQP-TNAVGNGGYTSSGGQTVAGFDPVLI-SLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYS 120 (467) Q Consensus 43 ~~e~~~~~~~~~~~e~~-~~~~g~~~~~st~tg~i~~~~P~Lv-~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~ 120 (467) +.|.. .|-.|.. ....++ ++.++- .-|.+. .+++......+-.+++-+.||++.+.-|. ++. T Consensus 1 ~~~~~-----~~~~~~~~~~~t~~----~~~~~~---ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p----~~~ 64 (320) T protein:vir:10 1 MAAGT-----AFQVDHAQIAQTGD----TMFKGY---LEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIP----HWI 64 (320) T ss_pred CCCCc-----cCCHHHHHhhcccc----cccccc---ccHHHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEe Confidence 11111 1111111 110111 111111 112221 13333334456788899999987653322 111 Q ss_pred CCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCc Q lcl|NC_015279. 
121 TQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDN 200 (467) Q Consensus 121 ~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~ 200 (467) ++.++ .| .+ | +.+ T Consensus 65 --~~~~a-------~~---------------------------------------------------v~--E-----~~~ 77 (320) T protein:vir:10 65 --GDVSA-------QW---------------------------------------------------IG--E-----GDM 77 (320) T ss_pred --CCcce-------EE---------------------------------------------------ec--C-----Ccc Confidence 00000 00 00 1 112 Q ss_pred cceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccc------ Q lcl|NC_015279. 201 FNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVS------ 274 (467) Q Consensus 201 f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~------ 274 (467) +++-..++++++...|..+-...+|.||.+|-. .|.|+.|.+.|...|...+|+-++.-- ..+... T Consensus 78 ~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~a~a~~~d~a~l~G~----g~~~~~~~~~~~ 149 (320) T protein:vir:10 78 KPITKGNMTSQNIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDSAALNGT----DSPFPTYLAQTT 149 (320) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHHHhcCh----HHHHHHHHHHHHHHHHHHHHHHhhccc----CCCCCccccccc Confidence 233333445555566666666779999999865 468888999999999988888886311 000000 Q ss_pred ---ccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccc---ccc Q lcl|NC_015279. 275 ---NTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPAL---NAN 348 (467) Q Consensus 275 ---~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~---~~~ 348 (467) .+...+.... +.-+..+ ..+ ..+... ..........+||+|.....|.. +....+- ... 
T Consensus 150 ~~~~~~~~~~~~~----~~~~~~~---~~~----~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~ 214 (320) T protein:vir:10 150 KSVSLADPGGATA----SDLTAYD---AVA----VNGLSL-LVNAKKKWTHTLLDDIVEPILNG---AKDKNGRPLFIES 214 (320) T ss_pred ccccceecccccc----cccccHH---HHH----HHHHhh-hhcccCCCcEEEEcHHHHHHHHH---hhccCCceeeccc Confidence 0011111111 1111111 111 111111 11223345688999999998864 2221110 000 Q ss_pred cccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc-----c- Q lcl|NC_015279. 349 LNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT-----F- 422 (467) Q Consensus 349 ~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s-----~- 422 (467) ...........++| .+++|+++++........+.-.+.++++|..+.-+++-+= +.......|+.. | T Consensus 215 ~~~~~~~~~~~~~i-~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~------~~~~~~~~~~~~~~~~~f~ 287 (320) T protein:vir:10 215 TYTDENSPFRAGRI-VSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTD------QATLNLGTPTEPNFVSLWQ 287 (320) T ss_pred cccCccccccCcee-eeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEee------cceeeeccccccccchhhh Confidence 00011112223455 5788888877531100000011223345554443322100 000000011111 1 Q ss_pred --cceeeeeeeecee-ecC--cccc--cCcccc Q lcl|NC_015279. 423 --QPKIGFKTRYGMV-ANP--FAEG--TTVGAG 448 (467) Q Consensus 423 --qP~~g~~tRY~l~-~nP--~~~~--~~~~~~ 448 (467) |=.+=...|++.. 
.+| |+.- ....++ T Consensus 288 ~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 288 HNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred cCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 1112222455443 233 2221 122212 No 43 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=86.70 E-value=0.044 Score=28.04 Aligned_cols=325 Identities=13% Similarity=0.091 Sum_probs=129.4 Q ss_pred Ccch-HHHHHhhhh-------hhccCccchhcchh--HHHHHHHHhhhHHHHHHHHhh---------------------- Q lcl|NC_015279. 1 MFQS-EQLQEKWAP-------LLNYEGLDKISDPH--RRAVTAVLLENQEKFMQEQVA---------------------- 48 (467) Q Consensus 1 ~~~~-~~l~~kw~p-------~l~~~~~~~i~~~~--~~~v~~~~~enq~~~~~e~~~---------------------- 48 (467) |.+. ++|+++=.- +++.+..-++.... -+++.+.| |.+++..+++.. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 6643 233333222 22232222221110 01111111 111111111110 Q ss_pred ---hhhcchhhhhhhhhccccc-----cccc-ccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeee Q lcl|NC_015279. 49 ---FEQGGMIAEQPTNAVGNGG-----YTSS-GGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMR 116 (467) Q Consensus 49 ---~~~~~~~~e~~~~~~g~~~-----~~st-~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMR 116 (467) ...++.+........+... ...| +.|.. 
..+.+.++.+.| +...-.+++++.||++++|-+.-.+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~ 156 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK 156 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe Confidence 0001111110000000000 0011 11211 123344455555 4445678999999999887532111 Q ss_pred eeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC Q lcl|NC_015279. 117 SKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT 196 (467) Q Consensus 117 srY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs 196 (467) .. ++.++ .| .++++.... T Consensus 157 --~~--~~~~a-------~~---------------------------------------------------v~E~~~~~~ 174 (392) T protein:vir:10 157 --NS--DMIPF-------AE---------------------------------------------------ITEMGEIPE 174 (392) T ss_pred --ec--CCccc-------ee---------------------------------------------------ecccccccc Confidence 11 10000 00 000000000 Q ss_pred -CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 197 -SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 197 -~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) +...|.++.+...|..+ ...+|-||.+|- ..|.+++|.+-|...|..-++.-|+.-.-+. T Consensus 175 ~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-------- 235 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL-------- 235 (392) T ss_pred cccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------- Confidence 11346666666666554 456899999984 2567889999999999999998887422211 Q ss_pred cccceeEEeeccccchhHHHHHHHHH-HHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-ccccc Q lcl|NC_015279. 
276 TATAGVFDLDIDSNGRWSVEKFKGLL-FQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDD 353 (467) Q Consensus 276 ~~~~gv~Dl~~~~~~r~~ve~~~~l~-~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~ 353 (467) ...++.+. +....++ +.+... -+ ..-..|++|.....|..- ....+ .. +..+. T Consensus 236 -~~~~~~~~----------d~i~~~~~~~l~~~--------~~-~~a~~vm~~~~~~~L~~l---kd~~G--~~l~~~~~ 290 (392) T protein:vir:10 236 -TKQAIKSL----------DDIKDVLNVKLDPA--------IS-PNAILLTNQDGFNYLDKL---KDKDG--KYILQSDP 290 (392) T ss_pred -cccCccCH----------HHHHHHHHHhhhhh--------hc-cCCEEEEcHHHHHHHHHh---hccCC--CeEeecCc Confidence 12222221 1122222 122111 11 224478899998888542 22111 11 11111 Q ss_pred CCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchh-------hcccccCC------c Q lcl|NC_015279. 354 TGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPL-------QMVRAVGE------N 420 (467) Q Consensus 354 t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l-------~~~~~~Dp------~ 420 (467) ..-..++|.|...|+++.... ++.+|...-+..++|+.+-.. .+.-.++| . T Consensus 291 -~~~~~~tllG~~~v~~~~~~~---------------~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 291 -TQKNKKLFAGTNPVVVVSNRF---------------LKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred -cCCccccccCcccEEEecccc---------------cCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 122346787777777653321 111111111222333332110 00001122 2 Q ss_pred cccceeeeeeeeceeecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 
421 TFQPKIGFKTRYGMVANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 421 s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) +.|=.+-...|+|..+ -....|.++.++.-= T Consensus 355 ~~~~~~r~~~r~d~~v----------------~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQM----------------WDNEAAVYGEIDLSA 385 (392) T ss_pred cCceEEEEEEeeccEE----------------ecccceEEEEecccc Confidence 3344455666666433 112345555555433 No 44 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=86.70 E-value=0.044 Score=28.04 Aligned_cols=325 Identities=13% Similarity=0.091 Sum_probs=129.4 Q ss_pred Ccch-HHHHHhhhh-------hhccCccchhcchh--HHHHHHHHhhhHHHHHHHHhh---------------------- Q lcl|NC_015279. 1 MFQS-EQLQEKWAP-------LLNYEGLDKISDPH--RRAVTAVLLENQEKFMQEQVA---------------------- 48 (467) Q Consensus 1 ~~~~-~~l~~kw~p-------~l~~~~~~~i~~~~--~~~v~~~~~enq~~~~~e~~~---------------------- 48 (467) |.+. ++|+++=.- +++.+..-++.... -+++.+.| |.+++..+++.. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 6643 233333222 22232222221110 01111111 111111111110 Q ss_pred ---hhhcchhhhhhhhhccccc-----cccc-ccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeee Q lcl|NC_015279. 49 ---FEQGGMIAEQPTNAVGNGG-----YTSS-GGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMR 116 (467) Q Consensus 49 ---~~~~~~~~e~~~~~~g~~~-----~~st-~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMR 116 (467) ...++.+........+... ...| +.|.. 
..+.+.++.+.| +...-.+++++.||++++|-+.-.+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~ 156 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK 156 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe Confidence 0001111110000000000 0011 11211 123344455555 4445678999999999887532111 Q ss_pred eeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC Q lcl|NC_015279. 117 SKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT 196 (467) Q Consensus 117 srY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs 196 (467) .. ++.++ .| .++++.... T Consensus 157 --~~--~~~~a-------~~---------------------------------------------------v~E~~~~~~ 174 (392) T protein:vir:10 157 --NS--DMIPF-------AE---------------------------------------------------ITEMGEIPE 174 (392) T ss_pred --ec--CCccc-------ee---------------------------------------------------ecccccccc Confidence 11 10000 00 000000000 Q ss_pred -CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 197 -SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 197 -~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) +...|.++.+...|..+ ...+|-||.+|- ..|.+++|.+-|...|..-++.-|+.-.-+. T Consensus 175 ~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-------- 235 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL-------- 235 (392) T ss_pred cccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------- Confidence 11346666666666554 456899999984 2567889999999999999998887422211 Q ss_pred cccceeEEeeccccchhHHHHHHHHH-HHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-ccccc Q lcl|NC_015279. 
276 TATAGVFDLDIDSNGRWSVEKFKGLL-FQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDD 353 (467) Q Consensus 276 ~~~~gv~Dl~~~~~~r~~ve~~~~l~-~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~ 353 (467) ...++.+. +....++ +.+... -+ ..-..|++|.....|..- ....+ .. +..+. T Consensus 236 -~~~~~~~~----------d~i~~~~~~~l~~~--------~~-~~a~~vm~~~~~~~L~~l---kd~~G--~~l~~~~~ 290 (392) T protein:vir:10 236 -TKQAIKSL----------DDIKDVLNVKLDPA--------IS-PNAILLTNQDGFNYLDKL---KDKDG--KYILQSDP 290 (392) T ss_pred -cccCccCH----------HHHHHHHHHhhhhh--------hc-cCCEEEEcHHHHHHHHHh---hccCC--CeEeecCc Confidence 12222221 1122222 122111 11 224478899998888542 22111 11 11111 Q ss_pred CCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchh-------hcccccCC------c Q lcl|NC_015279. 354 TGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPL-------QMVRAVGE------N 420 (467) Q Consensus 354 t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l-------~~~~~~Dp------~ 420 (467) ..-..++|.|...|+++.... ++.+|...-+..++|+.+-.. .+.-.++| . T Consensus 291 -~~~~~~tllG~~~v~~~~~~~---------------~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 291 -TQKNKKLFAGTNPVVVVSNRF---------------LKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred -cCCccccccCcccEEEecccc---------------cCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 122346787777777653321 111111111222333332110 00001122 2 Q ss_pred cccceeeeeeeeceeecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 
421 TFQPKIGFKTRYGMVANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 421 s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) +.|=.+-...|+|..+ -....|.++.++.-= T Consensus 355 ~~~~~~r~~~r~d~~v----------------~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQM----------------WDNEAAVYGEIDLSA 385 (392) T ss_pred cCceEEEEEEeeccEE----------------ecccceEEEEecccc Confidence 3344455666666433 112345555555433 No 45 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=86.70 E-value=0.044 Score=28.04 Aligned_cols=325 Identities=13% Similarity=0.091 Sum_probs=129.4 Q ss_pred Ccch-HHHHHhhhh-------hhccCccchhcchh--HHHHHHHHhhhHHHHHHHHhh---------------------- Q lcl|NC_015279. 1 MFQS-EQLQEKWAP-------LLNYEGLDKISDPH--RRAVTAVLLENQEKFMQEQVA---------------------- 48 (467) Q Consensus 1 ~~~~-~~l~~kw~p-------~l~~~~~~~i~~~~--~~~v~~~~~enq~~~~~e~~~---------------------- 48 (467) |.+. ++|+++=.- +++.+..-++.... -+++.+.| |.+++..+++.. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 6643 233333222 22232222221110 01111111 111111111110 Q ss_pred ---hhhcchhhhhhhhhccccc-----cccc-ccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeee Q lcl|NC_015279. 49 ---FEQGGMIAEQPTNAVGNGG-----YTSS-GGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMR 116 (467) Q Consensus 49 ---~~~~~~~~e~~~~~~g~~~-----~~st-~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMR 116 (467) ...++.+........+... ...| +.|.. 
..+.+.++.+.| +...-.+++++.||++++|-+.-.+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~ 156 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK 156 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe Confidence 0001111110000000000 0011 11211 123344455555 4445678999999999887532111 Q ss_pred eeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC Q lcl|NC_015279. 117 SKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT 196 (467) Q Consensus 117 srY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs 196 (467) .. ++.++ .| .++++.... T Consensus 157 --~~--~~~~a-------~~---------------------------------------------------v~E~~~~~~ 174 (392) T protein:vir:10 157 --NS--DMIPF-------AE---------------------------------------------------ITEMGEIPE 174 (392) T ss_pred --ec--CCccc-------ee---------------------------------------------------ecccccccc Confidence 11 10000 00 000000000 Q ss_pred -CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 197 -SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 197 -~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) +...|.++.+...|..+ ...+|-||.+|- ..|.+++|.+-|...|..-++.-|+.-.-+. T Consensus 175 ~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-------- 235 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL-------- 235 (392) T ss_pred cccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------- Confidence 11346666666666554 456899999984 2567889999999999999998887422211 Q ss_pred cccceeEEeeccccchhHHHHHHHHH-HHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-ccccc Q lcl|NC_015279. 
276 TATAGVFDLDIDSNGRWSVEKFKGLL-FQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDD 353 (467) Q Consensus 276 ~~~~gv~Dl~~~~~~r~~ve~~~~l~-~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~ 353 (467) ...++.+. +....++ +.+... -+ ..-..|++|.....|..- ....+ .. +..+. T Consensus 236 -~~~~~~~~----------d~i~~~~~~~l~~~--------~~-~~a~~vm~~~~~~~L~~l---kd~~G--~~l~~~~~ 290 (392) T protein:vir:10 236 -TKQAIKSL----------DDIKDVLNVKLDPA--------IS-PNAILLTNQDGFNYLDKL---KDKDG--KYILQSDP 290 (392) T ss_pred -cccCccCH----------HHHHHHHHHhhhhh--------hc-cCCEEEEcHHHHHHHHHh---hccCC--CeEeecCc Confidence 12222221 1122222 122111 11 224478899998888542 22111 11 11111 Q ss_pred CCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchh-------hcccccCC------c Q lcl|NC_015279. 354 TGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPL-------QMVRAVGE------N 420 (467) Q Consensus 354 t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l-------~~~~~~Dp------~ 420 (467) ..-..++|.|...|+++.... ++.+|...-+..++|+.+-.. .+.-.++| . T Consensus 291 -~~~~~~tllG~~~v~~~~~~~---------------~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 291 -TQKNKKLFAGTNPVVVVSNRF---------------LKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred -cCCccccccCcccEEEecccc---------------cCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 122346787777777653321 111111111222333332110 00001122 2 Q ss_pred cccceeeeeeeeceeecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 
421 TFQPKIGFKTRYGMVANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 421 s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) +.|=.+-...|+|..+ -....|.++.++.-= T Consensus 355 ~~~~~~r~~~r~d~~v----------------~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQM----------------WDNEAAVYGEIDLSA 385 (392) T ss_pred cCceEEEEEEeeccEE----------------ecccceEEEEecccc Confidence 3344455666666433 112345555555433 No 46 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=86.70 E-value=0.044 Score=28.04 Aligned_cols=325 Identities=13% Similarity=0.091 Sum_probs=129.4 Q ss_pred Ccch-HHHHHhhhh-------hhccCccchhcchh--HHHHHHHHhhhHHHHHHHHhh---------------------- Q lcl|NC_015279. 1 MFQS-EQLQEKWAP-------LLNYEGLDKISDPH--RRAVTAVLLENQEKFMQEQVA---------------------- 48 (467) Q Consensus 1 ~~~~-~~l~~kw~p-------~l~~~~~~~i~~~~--~~~v~~~~~enq~~~~~e~~~---------------------- 48 (467) |.+. ++|+++=.- +++.+..-++.... -+++.+.| |.+++..+++.. T Consensus 1 M~k~l~el~~~~~~~~~e~~~~~~~~~~~e~~~~~~e~~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEEVRSLMGEDKVAEAEQMMEEVRSLQKKI-DLQRSLDEAETEERNNGREVETRNVDGEMEYRDV 79 (392) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhccccccccCccchHHHHHH Confidence 6643 233333222 22232222221110 01111111 111111111110 Q ss_pred ---hhhcchhhhhhhhhccccc-----cccc-ccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeee Q lcl|NC_015279. 49 ---FEQGGMIAEQPTNAVGNGG-----YTSS-GGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMR 116 (467) Q Consensus 49 ---~~~~~~~~e~~~~~~g~~~-----~~st-~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMR 116 (467) ...++.+........+... ...| +.|.. 
..+.+.++.+.| +...-.+++++.||++++|-+.-.+ T Consensus 80 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~ 156 (392) T protein:vir:10 80 FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK 156 (392) T ss_pred HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe Confidence 0001111110000000000 0011 11211 123344455555 4445678999999999887532111 Q ss_pred eeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC Q lcl|NC_015279. 117 SKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT 196 (467) Q Consensus 117 srY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs 196 (467) .. ++.++ .| .++++.... T Consensus 157 --~~--~~~~a-------~~---------------------------------------------------v~E~~~~~~ 174 (392) T protein:vir:10 157 --NS--DMIPF-------AE---------------------------------------------------ITEMGEIPE 174 (392) T ss_pred --ec--CCccc-------ee---------------------------------------------------ecccccccc Confidence 11 10000 00 000000000 Q ss_pred -CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 197 -SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 197 -~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) +...|.++.+...|..+ ...+|-||.+|- ..|.+++|.+-|...|..-++.-|+.-.-+. T Consensus 175 ~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-------- 235 (392) T protein:vir:10 175 TDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL-------- 235 (392) T ss_pred cccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------- Confidence 11346666666666554 456899999984 2567889999999999999998887422211 Q ss_pred cccceeEEeeccccchhHHHHHHHHH-HHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-ccccc Q lcl|NC_015279. 
276 TATAGVFDLDIDSNGRWSVEKFKGLL-FQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDD 353 (467) Q Consensus 276 ~~~~gv~Dl~~~~~~r~~ve~~~~l~-~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~ 353 (467) ...++.+. +....++ +.+... -+ ..-..|++|.....|..- ....+ .. +..+. T Consensus 236 -~~~~~~~~----------d~i~~~~~~~l~~~--------~~-~~a~~vm~~~~~~~L~~l---kd~~G--~~l~~~~~ 290 (392) T protein:vir:10 236 -TKQAIKSL----------DDIKDVLNVKLDPA--------IS-PNAILLTNQDGFNYLDKL---KDKDG--KYILQSDP 290 (392) T ss_pred -cccCccCH----------HHHHHHHHHhhhhh--------hc-cCCEEEEcHHHHHHHHHh---hccCC--CeEeecCc Confidence 12222221 1122222 122111 11 224478899998888542 22111 11 11111 Q ss_pred CCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchh-------hcccccCC------c Q lcl|NC_015279. 354 TGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPL-------QMVRAVGE------N 420 (467) Q Consensus 354 t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l-------~~~~~~Dp------~ 420 (467) ..-..++|.|...|+++.... ++.+|...-+..++|+.+-.. .+.-.++| . T Consensus 291 -~~~~~~tllG~~~v~~~~~~~---------------~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~ 354 (392) T protein:vir:10 291 -TQKNKKLFAGTNPVVVVSNRF---------------LKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFT 354 (392) T ss_pred -cCCccccccCcccEEEecccc---------------cCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhh Confidence 122346787777777653321 111111111222333332110 00001122 2 Q ss_pred cccceeeeeeeeceeecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 
421 TFQPKIGFKTRYGMVANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 421 s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) +.|=.+-...|+|..+ -....|.++.++.-= T Consensus 355 ~~~~~~r~~~r~d~~v----------------~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 355 RNTLDLRAIQRDDVQM----------------WDNEAAVYGEIDLSA 385 (392) T ss_pred cCceEEEEEEeeccEE----------------ecccceEEEEecccc Confidence 3344455666666433 112345555555433 No 47 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=86.11 E-value=0.048 Score=27.82 Aligned_cols=323 Identities=14% Similarity=0.054 Sum_probs=125.5 Q ss_pred Cc-chHHHHHhhhhh-------hccCccchhcchhH--HHHHHHHhhhHHHHHHHHhh-------------------hhh Q lcl|NC_015279. 1 MF-QSEQLQEKWAPL-------LNYEGLDKISDPHR--RAVTAVLLENQEKFMQEQVA-------------------FEQ 51 (467) Q Consensus 1 ~~-~~~~l~~kw~p~-------l~~~~~~~i~~~~~--~~v~~~~~enq~~~~~e~~~-------------------~~~ 51 (467) |+ +.++|+|+=.-+ ++.+.+-++..... +++-.++ +.+++..++.+. ..+ T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAENKIEEAKKLKEEIVALQEKF-DVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAF 79 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhccccccccchhhHHHHHHHH Confidence 77 333444443322 22222112211000 0111111 111110010000 000 Q ss_pred cchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccc Q lcl|NC_015279. 52 GGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDE 131 (467) Q Consensus 52 ~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnE 131 (467) ..++-.-..+... ..++.+|.+.--....-.+++...+..+..+++++.||++.++-+.-.+ ..+. 
.++ T Consensus 80 ~~~l~~~~~~a~~---~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~--~~~~--~~a---- 148 (371) T protein:vir:81 80 VNHIRTRFRNAMS---EGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK--RSQQ--TGF---- 148 (371) T ss_pred HHHHHHHHHHhhc---cCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC--cce---- Confidence 0001000011010 0111122211111111234455557778889999999998877654333 1110 000 Q ss_pred cccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CCCccceeeeEEEE Q lcl|NC_015279. 132 ADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SGDNFNEMAFSIEK 210 (467) Q Consensus 132 adt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g~~f~EMaFsIEK 210 (467) .+ .++++...+ +...|.+..++..| T Consensus 149 ---~~---------------------------------------------------v~Eg~~~~~~~~~~f~~i~~~~~k 174 (371) T protein:vir:81 149 ---VE---------------------------------------------------VAEGAAIGEKATPQFTLLQYQVKK 174 (371) T ss_pred ---ee---------------------------------------------------eccccccccccccceeeEEeeeeE Confidence 00 000111111 11346666666666 Q ss_pred EEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccc Q lcl|NC_015279. 211 VTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNG 290 (467) Q Consensus 211 ~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~ 290 (467) ..+. ..+|-||.+|-. .|.++.|.+.|...|..-+|+.|+.-.-+. ...|+.+. T Consensus 175 ~~~~-------~~iS~ell~ds~----~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~---------~~~~~~~~------ 228 (371) T protein:vir:81 175 YAGF-------FRVTNELLNDST----EAIVNTLVRWIGDESRVTRNGLIINVLNTK---------AKTAIADL------ 228 (371) T ss_pred EEEe-------ehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cccccccH------ Confidence 6654 479999999853 467889999999999999998888533221 22333322 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc-ccccCCceeEEEecCceEEE Q lcl|NC_015279. 
291 RWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGNTFAGVLQGKYRVY 369 (467) Q Consensus 291 r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~-~~d~t~~~~~G~l~~~~~vy 369 (467) +..+.++. ... ...-+ ....+|++|.....|... ....+ ..+ ..+.+ .-..|+| .|++|| T Consensus 229 ----~~i~~~~~---~~l----~~~~~-~~a~~vmn~~~~~~L~~l---kd~~g--~~l~~~~~~-~~~~~~l-~G~pV~ 289 (371) T protein:vir:81 229 ----DGLKQIIN---VQL----DPVFR-STSSVIVNQDAFNWLDTL---KDQNG--QYLLQPSIS-SPTGRQL-LGLPVV 289 (371) T ss_pred ----HHHHHHHH---hhc----chhhh-cCCEEEEcHHHHHHHHHh---hccCC--CeeeecccC-CCCCcee-cceeEE Confidence 11122211 000 00111 224678899888877642 21111 011 11111 1234677 467887 Q ss_pred ecccccccchhhccCCCceEEEEEec---CCCccceeEecccch-------hhcccccCCc------cccceeeeeeeec Q lcl|NC_015279. 370 IDPYSSNLTSANAANGNQYYVVGYKG---TSPYDAGLFYCPYVP-------LQMVRAVGEN------TFQPKIGFKTRYG 433 (467) Q Consensus 370 ~D~y~~~~~~~~~~~~~dY~~vGyKG---~~~~d~glfyaPYv~-------l~~~~~~Dp~------s~qP~~g~~tRY~ 433 (467) +..+.. .|..+ ...-..-++|+.+.. ..+...+++. .-|=.+-...|++ T Consensus 290 ~~~~~~---------------~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d 354 (371) T protein:vir:81 290 IVSNKV---------------LANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMD 354 (371) T ss_pred Eecccc---------------cCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec Confidence 765432 11110 000111233332211 0111112222 2233444555665 Q ss_pred eee-cCcccccCccccccccccccccceeeeecc Q lcl|NC_015279. 
434 MVA-NPFAEGTTVGAGRLRVNSNRYYRRVAVKNL 466 (467) Q Consensus 434 l~~-nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~ 466 (467) ..+ ||- .|.++.++-= T Consensus 355 ~~~~~~~-----------------a~~~~~~~~A 371 (371) T protein:vir:81 355 VKMRDDE-----------------AFVFGEVQLA 371 (371) T ss_pred cEEeccc-----------------ceEEEEEecC Confidence 432 331 1122221111 No 48 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=84.32 E-value=0.061 Score=27.23 Aligned_cols=381 Identities=11% Similarity=0.066 Sum_probs=121.8 Q ss_pred cchHHHHHhhhhhhccC-cc-------chhcchhHHHHHHHHhhhHHHHHHHHhhhhh--cchhhhhhhhh-cccccccc Q lcl|NC_015279. 2 FQSEQLQEKWAPLLNYE-GL-------DKISDPHRRAVTAVLLENQEKFMQEQVAFEQ--GGMIAEQPTNA-VGNGGYTS 70 (467) Q Consensus 2 ~~~~~l~~kw~p~l~~~-~~-------~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~--~~~~~e~~~~~-~g~~~~~s 70 (467) -+-++|+++++.+++.- .+ .++....++++-.. +.+-+.+++++.... -....+.+... .+...... T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l--~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~ 78 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQL--SSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAA 78 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhh Confidence 56678888888876521 11 12222223322221 111112222111000 00000000000 00000000 Q ss_pred cccccccccCchhhhhHHHHHhhhhhhhceeeccCCcccee--eeeeeeeecCCCCCccccccccccc-ccccccccccc Q lcl|NC_015279. 71 SGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGL--IFAMRSKYSTQGGTEALFDEADTAF-AGQNEGFDLTN 147 (467) Q Consensus 71 t~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGL--IFAMRsrY~~qsGtEAlfnEadt~f-Sg~~a~~~~~~ 147 (467) .... -...+|.....-+..+-+.+ . +.+...|..-. -.+.+. .+..+..... .+..+.+.. . 
T Consensus 79 ~~~~-~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~t~~~gg~-~ 143 (435) T protein:vir:14 79 PAAA-PVHAQPKALEVKGAKMARMV-R---ALAAARGDAQLASKLAIER---------GFGEEVAMSLNTLSPGAGGV-L 143 (435) T ss_pred cccc-ccccccchhhhhHHHHHHHH-H---HHHhhcchhhHHHHHHHhh---------hhhhhhhhhcccCCcCCCcc-c Confidence 0000 01111111111111110000 0 00000000000 000000 0000000000 000000000 0 Q ss_pred cccccccccCCCcccccccccccc-cccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccH Q lcl|NC_015279. 148 GMSDAAAGLGTTSQAGSNPAALNP-VATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSL 226 (467) Q Consensus 148 ~~~~~~~~~~~~~~agt~p~~ln~-~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ 226 (467) .+.......-......+.-..+.. ..........+..-...+.+... .++..+++-.-++++++..++.-+-....|- T Consensus 144 vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~v-~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ 222 (435) T protein:vir:14 144 VPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGYI-GADTDIPTTQQQFDDLKLTAKKMAALVPIAN 222 (435) T ss_pred cchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceeee-ccCccccccccceeEEEeeeEEEEEeehhhH Confidence 000000000000000000000000 00000000000000000111111 1234566666777788888887777888999 Q ss_pred HHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccc---------cchhHHHHH Q lcl|NC_015279. 227 ELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDS---------NGRWSVEKF 297 (467) Q Consensus 227 ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~---------~~r~~ve~~ 297 (467) ||.+|-. ...+.|+.|.+-|+..|...+|+-||. |.-.+-...|++...... 
..-.....+ T Consensus 223 ell~ds~--~~~~l~~~i~~~l~~ai~~~~d~a~l~--------G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (435) T protein:vir:14 223 DLIKYAG--VNPNVDQIVVGDLTAAIGAREDKAFIR--------DDGTANTPKGLRFWALPSNVITASDASTLQKIETDL 292 (435) T ss_pred HHHHhhc--cCHHHHHHHHHHHHHHHHHHHHHHhhc--------cCCCCccccceeecccccceeccccccchhhHHHHH Confidence 9999932 123477888888888888888877762 111111234444321111 011111111 Q ss_pred HHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEecccccc- Q lcl|NC_015279. 298 KGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSN- 376 (467) Q Consensus 298 ~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~- 376 (467) ..++..+.. .. .......+|++|.....|... ....+ ..+-.+.+ .|+|. |++|+++++.-. T Consensus 293 ~~l~~~~~~---~~----~~~~~~~~v~n~~~~~~L~~l---kd~~G--~~l~~~~~----~g~l~-G~Pv~~~~~~p~~ 355 (435) T protein:vir:14 293 GKVILALEN---AD----ANLTQPGWIMAPRTFRFLEGL---RDGNG--NKVYPELA----NGMLK-GYPVGKTTQVPIN 355 (435) T ss_pred HHHHHHhhh---cc----ccccCCEEEEcHHHHHHHHHh---hccCC--ceeccCCC----CCeee-cceeEeecccccc Confidence 222222210 01 122334578999999888542 21111 11111112 35674 578888766411 Q ss_pred cch------hhccCCCceEEEEEecCCCccceeEecccchhhcc-----------------------cccCCccccceee Q lcl|NC_015279. 377 LTS------ANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMV-----------------------RAVGENTFQPKIG 427 (467) Q Consensus 377 ~~~------~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~-----------------------~~~Dp~s~qP~~g 427 (467) +.. ...-...+|+ +|-.+.-. +-.+||.-.... ...||+.|.+.-| T Consensus 356 ~~~~~~~~~i~~gd~s~~~-i~~~~~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~ 430 (435) T protein:vir:14 356 LGETGKESEIYFTDFGDVF-IGEEETLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAG 430 (435) T ss_pred ccCCCccceEEEeecccEE-EEEecccE----EEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEec Confidence 000 0000112333 55544433 333444211100 0124444443333 Q ss_pred eeeeece Q lcl|NC_015279. 
428 FKTRYGM 434 (467) Q Consensus 428 ~~tRY~l 434 (467) .- ||- T Consensus 431 ~~--~~~ 435 (435) T protein:vir:14 431 VA--WGA 435 (435) T ss_pred CC--CCC Confidence 21 111 No 49 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=83.48 E-value=0.068 Score=26.98 Aligned_cols=308 Identities=11% Similarity=0.035 Sum_probs=116.1 Q ss_pred HhhhHHHHHHHHhh-hhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeee Q lcl|NC_015279. 35 LLENQEKFMQEQVA-FEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIF 113 (467) Q Consensus 35 ~~enq~~~~~e~~~-~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIF 113 (467) .-++|+ +++.+. +.. .+.-....+... ...+.++...--.+..-.+++...+.....+++-+-||++++--|. T Consensus 1 ~~~~~~--~~~~~~~~~~-~~~~~~~~~a~~---~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p 74 (324) T protein:vir:78 1 MEQTQK--LKLNLQHFAS-NNVKPQVFNPDN---VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) T ss_pred CCcchh--hhHHHHHHHH-Hhhhhhhhcccc---ccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 111221 111110 000 000000001000 0111111111111122234444556667788888889887653322 Q ss_pred eeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhh Q lcl|NC_015279. 114 AMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAED 193 (467) Q Consensus 114 AMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~ 193 (467) -.. 
++.++ .+ .+ | T Consensus 75 ~~~------~~~~a-------~~---------------------------------------------------v~--E- 87 (324) T protein:vir:78 75 FWA------DKPGA-------YW---------------------------------------------------VG--E- 87 (324) T ss_pred EEe------cCcce-------eE---------------------------------------------------ec--C- Confidence 110 00000 00 00 1 Q ss_pred cCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 194 LGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 194 LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) +..+++...++++++++.+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++-+|.---+. T Consensus 88 ----g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~------ 153 (324) T protein:vir:78 88 ----GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------ 153 (324) T ss_pred ----CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC------ Confidence 122333334444445555555555669999999864 578999999999999999999888432111 Q ss_pred cccccceeEEeecc----ccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc Q lcl|NC_015279. 274 SNTATAGVFDLDID----SNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL 349 (467) Q Consensus 274 ~~~~~~gv~Dl~~~----~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~ 349 (467) ....|+...... ..+....+....+..++.. .....+.+|+||.....|... ....+ ..+ T Consensus 154 --~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~---------~~~~~~~~vmn~~~~~~L~~l---~d~~G--~~~ 217 (324) T protein:vir:78 154 --PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLED---------DELEANAFISKTQNRSLLRKI---VDPET--KER 217 (324) T ss_pred --CcCccccccccccceeccccccHHHHHHHHHhhhh---------ccCCCCEEEEcHHHHHHHHHh---hccCC--Cee Confidence 111222222111 1111112333334343321 234455689999998888542 11111 011 Q ss_pred ccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccce--eEecccchhhcccccCCccccceee Q lcl|NC_015279. 
350 NVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAG--LFYCPYVPLQMVRAVGENTFQPKIG 427 (467) Q Consensus 350 ~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~g--lfyaPYv~l~~~~~~Dp~s~qP~~g 427 (467) -.+.. .++| .+++|++++...-.......-.+.++++|+.+.-..+-+ .+...+...+-.....=.+-|=.+= T Consensus 218 ~~~~~----~~~l-~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:78 218 IYDRN----SDSL-DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ecCCC----CCcc-cceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 11112 2345 457888766532110000000112233444433222110 0000000000000000000011111 Q ss_pred eeeeeceee-cC--ccc------ccCcccccc Q lcl|NC_015279. 428 FKTRYGMVA-NP--FAE------GTTVGAGRL 450 (467) Q Consensus 428 ~~tRY~l~~-nP--~~~------~~~~~~~~~ 450 (467) ...||+..+ +| |+. .++..|+-+ T Consensus 293 ~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:78 293 ATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred EEEEEccEEecccceEEEecccccCCCCCCCC Confidence 223444322 22 111 122222222 No 50 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=83.48 E-value=0.068 Score=26.98 Aligned_cols=308 Identities=11% Similarity=0.035 Sum_probs=116.1 Q ss_pred HhhhHHHHHHHHhh-hhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeee Q lcl|NC_015279. 35 LLENQEKFMQEQVA-FEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIF 113 (467) Q Consensus 35 ~~enq~~~~~e~~~-~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIF 113 (467) .-++|+ +++.+. +.. .+.-....+... ...+.++...--.+..-.+++...+.....+++-+-||++++--|. 
T Consensus 1 ~~~~~~--~~~~~~~~~~-~~~~~~~~~a~~---~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p 74 (324) T protein:vir:96 1 MEQTQK--LKLNLQHFAS-NNVKPQVFNPDN---VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) T ss_pred CCcchh--hhHHHHHHHH-Hhhhhhhhcccc---ccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 111221 111110 000 000000001000 0111111111111122234444556667788888889887653322 Q ss_pred eeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhh Q lcl|NC_015279. 114 AMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAED 193 (467) Q Consensus 114 AMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~ 193 (467) -.. ++.++ .+ .+ | T Consensus 75 ~~~------~~~~a-------~~---------------------------------------------------v~--E- 87 (324) T protein:vir:96 75 FWA------DKPGA-------YW---------------------------------------------------VG--E- 87 (324) T ss_pred EEe------cCcce-------eE---------------------------------------------------ec--C- Confidence 110 00000 00 00 1 Q ss_pred cCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccc Q lcl|NC_015279. 194 LGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAV 273 (467) Q Consensus 194 LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~ 273 (467) +..+++...++++++++.+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++-+|.---+. T Consensus 88 ----g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~------ 153 (324) T protein:vir:96 88 ----GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN------ 153 (324) T ss_pred ----CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC------ Confidence 122333334444445555555555669999999864 578999999999999999999888432111 Q ss_pred cccccceeEEeecc----ccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc Q lcl|NC_015279. 
274 SNTATAGVFDLDID----SNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL 349 (467) Q Consensus 274 ~~~~~~gv~Dl~~~----~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~ 349 (467) ....|+...... ..+....+....+..++.. .....+.+|+||.....|... ....+ ..+ T Consensus 154 --~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~---------~~~~~~~~vmn~~~~~~L~~l---~d~~G--~~~ 217 (324) T protein:vir:96 154 --PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLED---------DELEANAFISKTQNRSLLRKI---VDPET--KER 217 (324) T ss_pred --CcCccccccccccceeccccccHHHHHHHHHhhhh---------ccCCCCEEEEcHHHHHHHHHh---hccCC--Cee Confidence 111222222111 1111112333334343321 234455689999998888542 11111 011 Q ss_pred ccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccce--eEecccchhhcccccCCccccceee Q lcl|NC_015279. 350 NVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAG--LFYCPYVPLQMVRAVGENTFQPKIG 427 (467) Q Consensus 350 ~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~g--lfyaPYv~l~~~~~~Dp~s~qP~~g 427 (467) -.+.. .++| .+++|++++...-.......-.+.++++|+.+.-..+-+ .+...+...+-.....=.+-|=.+= T Consensus 218 ~~~~~----~~~l-~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:96 218 IYDRN----SDSL-DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ecCCC----CCcc-cceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 11112 2345 457888766532110000000112233444433222110 0000000000000000000011111 Q ss_pred eeeeeceee-cC--ccc------ccCcccccc Q lcl|NC_015279. 428 FKTRYGMVA-NP--FAE------GTTVGAGRL 450 (467) Q Consensus 428 ~~tRY~l~~-nP--~~~------~~~~~~~~~ 450 (467) ...||+..+ +| |+. 
.++..|+-+ T Consensus 293 ~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 293 ATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred EEEEEccEEecccceEEEecccccCCCCCCCC Confidence 223444322 22 111 122222222 No 51 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=83.20 E-value=0.07 Score=26.90 Aligned_cols=279 Identities=12% Similarity=0.063 Sum_probs=115.6 Q ss_pred cccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccccccccccc Q lcl|NC_015279. 67 GYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGF 143 (467) Q Consensus 67 ~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~ 143 (467) =+++|+++.. ..+.+-++...| +..+..+++.+.||++-..-+. . +.. +.+| .| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~---~~s~i~~l~~~~~~~~~~~~~p-~---~~~--~~~a-------~w------- 57 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVK---GHSSIAKLSPQKPIPFNGQREF-V---FDF--DSDI-------DI------- 57 (300) T ss_pred CcccccCCcceechhhHHHHHHHHH---hhhhhhhhcceeeccCCceEEE-E---Eec--Ccce-------EE------- Confidence 1222332222 122333444444 4456678999999976432111 1 110 1111 00 Q ss_pred cccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeeccccccc Q lcl|NC_015279. 144 DLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAE 223 (467) Q Consensus 144 ~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAE 223 (467) + + | +.+.++...+++.++..+|.=+-... 
T Consensus 58 --------------------------------------v------~--E-----g~~~~~s~~~f~~v~l~~~k~~~~~~ 86 (300) T protein:vir:95 58 --------------------------------------V------A--E-----NGKKTHGGVSLDPVTIVPLKVEYGAR 86 (300) T ss_pred --------------------------------------e------e--C-----CcccccccccceeeEeeeEEEEEeeh Confidence 0 0 0 11233333444455555555555567 Q ss_pred ccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccc----ccceeEEeeccccchhHHHHHHH Q lcl|NC_015279. 224 YSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNT----ATAGVFDLDIDSNGRWSVEKFKG 299 (467) Q Consensus 224 YT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~----~~~gv~Dl~~~~~~r~~ve~~~~ 299 (467) +|-||.+.... ..+|-+++|.+-|...|...+++.++.-...- .+.-.+. ...+.........+--.-+-... T Consensus 87 iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 163 (300) T protein:vir:95 87 VSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINPR--TKQASTIIGDNCFDKKVTQTVPFKDTNPDESMED 163 (300) T ss_pred hhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccCC--CCCCcccccccccccccceeecccccchHHHHHH Confidence 88898753222 23567888888888888888888888432110 0100000 11111111111111111011112 Q ss_pred HHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc-ccccCCceeEEEecCceEEEecccccccc Q lcl|NC_015279. 300 LLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGNTFAGVLQGKYRVYIDPYSSNLT 378 (467) Q Consensus 300 l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~-~~d~t~~~~~G~l~~~~~vy~D~y~~~~~ 378 (467) ++..+ ..-.++.+-+|++|.....|... ....+ ..+ ..+.++ -..++| .+++|+++.+... 
T Consensus 164 ~~~~~---------~~~~~~~~~~vmn~~~~~~L~~l---kd~~G--~~i~~~~~~~-~~~~~l-~G~Pv~~s~~v~~-- 225 (300) T protein:vir:95 164 AVGMI---------DGSERDITGAILDPIFTTALSKM---KNAEG--GKLYPELAWG-GVPDAI-NGLAVDKNRTVSY-- 225 (300) T ss_pred HHHHh---------hhcCCCccEEEECHHHHHHHHHh---hccCC--CeeccCcccc-CCCcee-cceeeEEecCCCC-- Confidence 22222 12345666789999988877442 21111 011 111111 124678 5679998877521 Q ss_pred hhhccCCCceEEEEEecCCCccceeEecccchhh--cccccCCcc-----c---cceeeeeeeeceee-cC--cccccCc Q lcl|NC_015279. 379 SANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQ--MVRAVGENT-----F---QPKIGFKTRYGMVA-NP--FAEGTTV 445 (467) Q Consensus 379 ~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~--~~~~~Dp~s-----~---qP~~g~~tRY~l~~-nP--~~~~~~~ 445 (467) ....+.+.+++|=- ..+++|....... +..-.|+++ | |=.+=+..|+|..+ +| |+.-+.. T Consensus 226 --~~~~~~~~~~~GDf-----~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 226 --SQTDPKNTAIVGDF-----ETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKT 298 (300) T ss_pred --CCCCCccEEEEeec-----cceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecC Confidence 11112223333210 0111122211111 111113221 2 23333455887544 66 3332222 Q ss_pred ccc Q lcl|NC_015279. 446 GAG 448 (467) Q Consensus 446 ~~~ 448 (467) + + T Consensus 299 ~-g 300 (300) T protein:vir:95 299 G-G 300 (300) T ss_pred C-C Confidence 1 1 No 52 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=80.93 E-value=0.09 Score=26.31 Aligned_cols=339 Identities=13% Similarity=0.090 Sum_probs=132.8 Q ss_pred CcchHHHHHhhhhhhccCcc-----------chhcchhHHHHHHHHhhhHHH--HH----HHHhhhh------------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGL-----------DKISDPHRRAVTAVLLENQEK--FM----QEQVAFE------------- 50 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-----------~~i~~~~~~~v~~~~~enq~~--~~----~e~~~~~------------- 50 (467) |-+.++|++.|.-+.+.-.. 
.+.....-+++.+.+-+-+++ .+ .+.+... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLT 80 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 99999999998877653100 000001112222222111110 00 0000000 Q ss_pred -------------hcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeee Q lcl|NC_015279. 51 -------------QGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRS 117 (467) Q Consensus 51 -------------~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRs 117 (467) +..++..............+++.|.+.--....-.+++..-+.....+++.|+||++.+|-+-=.+ T Consensus 81 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~- 159 (397) T protein:vir:49 81 KNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEK- 159 (397) T ss_pred chhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEe- Confidence 000000000000000000111112111111111124444446667788999999998876532122 Q ss_pred eecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCC Q lcl|NC_015279. 118 KYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTS 197 (467) Q Consensus 118 rY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~ 197 (467) .....+ .+ .| .++++..... T Consensus 160 -~~~~~~-~a-------~~---------------------------------------------------v~E~~~~~~~ 179 (397) T protein:vir:49 160 -WADITG-LA-------KL---------------------------------------------------DDEGGQIGQN 179 (397) T ss_pred -eccCCc-ce-------ee---------------------------------------------------eccccccccc Confidence 111100 00 00 0000000001 Q ss_pred -CCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccc Q lcl|NC_015279. 
198 -GDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNT 276 (467) Q Consensus 198 -g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~ 276 (467) ...|.++.|++.|..+ ...+|-||.+|-. +|.+++|.+-|+..|..-+|+.||.-. -.+. T Consensus 180 ~~~~~~~v~~~~~k~~~-------~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~ail~G~--------g~~~ 240 (397) T protein:vir:49 180 DDPKLSLIRYAIKRYAG-------ISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAI--------GTLP 240 (397) T ss_pred cccceeeeEeeeeeeEe-------ehhhHHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhcc--------cccc Confidence 1235556665555544 4678999999853 578999999999999999999888321 1222 Q ss_pred ccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCc Q lcl|NC_015279. 277 ATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGN 356 (467) Q Consensus 277 ~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~ 356 (467) ...+++++ +....+...+.. .......+|++|.....|..- ....+- .-+..+.+. T Consensus 241 ~~~~~~~~----------d~i~~~~~~l~~---------~~~~~a~~v~n~~~~~~l~~l---kd~~g~-~l~~~~~~~- 296 (397) T protein:vir:49 241 NKPTLAKW----------DDIIDLQAKVDP---------AIKQTSLFLTNTSGFTALKKV---KNAMGD-YLMERDVKS- 296 (397) T ss_pred ccccccCH----------HHHHHHHHhhhh---------hhcCCCEEEEcHHHHHHHHHh---hccCCc-eeecccccC- Confidence 22333322 112233333321 223446788999988888652 211110 001111111 Q ss_pred eeEEEecCceEEEe-c-ccccccc--hhhcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeee Q lcl|NC_015279. 357 TFAGVLQGKYRVYI-D-PYSSNLT--SANAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKT 430 (467) Q Consensus 357 ~~~G~l~~~~~vy~-D-~y~~~~~--~~~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~t 430 (467) -..++|+ |++|++ + ...-... ...++ ...+|++++..+.-. +-..||.-- +-...+-.+-... 
T Consensus 297 g~~~~l~-G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~~------~~~~~~~~~~~~~ 365 (397) T protein:vir:49 297 PTGYSID-GFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLS----LLSTNIGGG------AFETDTTKVRVID 365 (397) T ss_pred CCCceec-ceeeEEecccccccccCCceeEEEeeccceEEEEeecccE----EEEeccccc------hhhcCeeeEEEEE Confidence 1134674 446553 2 1110000 00000 112344444433322 223333211 1112233334444 Q ss_pred eeceee-cC--ccc-----ccCcccccccccc Q lcl|NC_015279. 431 RYGMVA-NP--FAE-----GTTVGAGRLRVNS 454 (467) Q Consensus 431 RY~l~~-nP--~~~-----~~~~~~~~~~~~~ 454 (467) |++..+ +| |.. ..++.+..-.-|. T Consensus 366 r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 366 RFDVVSTDTEAFVPASFKAIADQKAKLSTAGA 397 (397) T ss_pred eeccEEecccceEEEEecccccccCcccccCC Confidence 554432 22 111 1111111111111 No 53 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=80.55 E-value=0.094 Score=26.22 Aligned_cols=353 Identities=15% Similarity=0.083 Sum_probs=136.1 Q ss_pred CcchHHHHHhhhhhhccCcc--chhcchhHH------HHHHHH------hhhHHHHHHHHh------h----hhhcchhh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGL--DKISDPHRR------AVTAVL------LENQEKFMQEQV------A----FEQGGMIA 56 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~--~~i~~~~~~------~v~~~~------~enq~~~~~e~~------~----~~~~~~~~ 56 (467) ..+++++..++..++..... .+|.....+ .-.... +.+++....... . ...+.... T Consensus 53 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (497) T protein:vir:10 53 HERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA 132 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Confidence 33334444444444432111 111111100 000000 000000000000 0 00000000 Q ss_pred hh---------hhhhccccccccccccc---ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCC Q lcl|NC_015279. 
57 EQ---------PTNAVGNGGYTSSGGQT---VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGG 124 (467) Q Consensus 57 e~---------~~~~~g~~~~~st~tg~---i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG 124 (467) |. ...........+++++. ...+.+-++.+.| +..+..+++.+-||+++..- |... .+... T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-~~~~--~~~~~- 205 (497) T protein:vir:10 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-YLTE--SAAHN- 205 (497) T ss_pred HHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-EEEE--cCCCC- Confidence 00 00000111111222222 2234555666665 45577899999999987532 2111 00000 Q ss_pred CcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCcccee Q lcl|NC_015279. 125 TEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEM 204 (467) Q Consensus 125 tEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EM 204 (467) ++ . .. +| +..+++. T Consensus 206 -~a-------~---------------------------------------------------wv--~E-----~~~~~~s 219 (497) T protein:vir:10 206 -NA-------A---------------------------------------------------AV--AE-----AGTYPFS 219 (497) T ss_pred -cc-------e---------------------------------------------------ee--cc-----Ccccccc Confidence 00 0 00 01 1234445 Q ss_pred eeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHH--------Hhhhccccccccc Q lcl|NC_015279. 205 AFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRT--------IYKVSEQGAVSNT 276 (467) Q Consensus 205 aFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~--------l~~~a~~~k~~~~ 276 (467) ..+++++++.+|.-+-...+|-||++|-- +.++.|.+-|+..|..-+|+.||.- |.+.+....+... 
T Consensus 220 ~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~ 294 (497) T protein:vir:10 220 SEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSA 294 (497) T ss_pred cccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccccccccccccc Confidence 55667777777776777889999999942 3789999999999999999998841 1111111111100 Q ss_pred cc--------ceeEEeeccccchhHHHH-----HHHH----------------------HHHHHHHHHHHHHhhccCCcc Q lcl|NC_015279. 277 AT--------AGVFDLDIDSNGRWSVEK-----FKGL----------------------LFQIERDANAIAQRTRRGKGN 321 (467) Q Consensus 277 ~~--------~gv~Dl~~~~~~r~~ve~-----~~~l----------------------~~~i~~ean~i~~~t~rg~gn 321 (467) .. .+..++..+..+.|.+.. .+.. ...-...+-...+++....++ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) T protein:vir:10 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) T ss_pred ccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC Confidence 00 001111111111111110 0000 000011222233455666777 Q ss_pred EEEEchHHHHHHhhh----cchhcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEE-Eec- Q lcl|NC_015279. 322 MILCSADVASALTMA----GVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVG-YKG- 395 (467) Q Consensus 322 ~~i~S~~Va~~L~~s----G~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vG-yKG- 395 (467) .+|.+|.-...|... |-.-+.|...... .......++|. |++|++.+.... .++ ++| ++- T Consensus 375 ~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~---~~~~~~~~~l~-G~pV~~t~~~~~---------~~~-~~Gd~~~~ 440 (497) T protein:vir:10 375 AVVMNPRDWELLRLTKDANGQYMGGNFFGNAY---GNPVNGGKNIW-GVPVVTTPLIPL---------GTI-LVGHFAPS 440 (497) T ss_pred eEEEchHHHHHHHHhhcCCCceeccCcccccc---cccccCCceee-ceeeEecCCCCC---------Cce-EEeecccc Confidence 788888777666432 2111111100000 00011123664 588888766421 232 222 110 Q ss_pred CC----CccceeEecccchhhcccccCCccccceeeeeeeece-eecC--cccccCcccccccccc Q lcl|NC_015279. 
396 TS----PYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGM-VANP--FAEGTTVGAGRLRVNS 454 (467) Q Consensus 396 ~~----~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l-~~nP--~~~~~~~~~~~~~~~~ 454 (467) .. ..+-.+-..||.-.++ .+.|=.+=+..|+++ +.+| |...+-...+ .++ T Consensus 441 ~~~i~~r~~~~v~~~~~~~~~f------~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~---~~~ 497 (497) T protein:vir:10 441 VIQTARREGVTMQMTNSNGTDF------VDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA---TGS 497 (497) T ss_pred eEEEEEecccEEEeecccchhh------hcCcEEEEEEEeecceeeccccEEEEEecCCc---cCC Confidence 00 0011122223311111 122334444678865 6677 3332221111 222 No 54 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=80.55 E-value=0.094 Score=26.22 Aligned_cols=353 Identities=15% Similarity=0.083 Sum_probs=136.1 Q ss_pred CcchHHHHHhhhhhhccCcc--chhcchhHH------HHHHHH------hhhHHHHHHHHh------h----hhhcchhh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGL--DKISDPHRR------AVTAVL------LENQEKFMQEQV------A----FEQGGMIA 56 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~--~~i~~~~~~------~v~~~~------~enq~~~~~e~~------~----~~~~~~~~ 56 (467) ..+++++..++..++..... .+|.....+ .-.... +.+++....... . ...+.... T Consensus 53 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (497) T protein:vir:78 53 HERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA 132 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Confidence 33334444444444432111 111111100 000000 000000000000 0 00000000 Q ss_pred hh---------hhhhccccccccccccc---ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCC Q lcl|NC_015279. 57 EQ---------PTNAVGNGGYTSSGGQT---VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGG 124 (467) Q Consensus 57 e~---------~~~~~g~~~~~st~tg~---i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG 124 (467) |. ...........+++++. ...+.+-++.+.| +..+..+++.+-||+++..- |... .+... 
T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-~~~~--~~~~~- 205 (497) T protein:vir:78 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-YLTE--SAAHN- 205 (497) T ss_pred HHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-EEEE--cCCCC- Confidence 00 00000111111222222 2234555666665 45577899999999987532 2111 00000 Q ss_pred CcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCcccee Q lcl|NC_015279. 125 TEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEM 204 (467) Q Consensus 125 tEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EM 204 (467) ++ . .. +| +..+++. T Consensus 206 -~a-------~---------------------------------------------------wv--~E-----~~~~~~s 219 (497) T protein:vir:78 206 -NA-------A---------------------------------------------------AV--AE-----AGTYPFS 219 (497) T ss_pred -cc-------e---------------------------------------------------ee--cc-----Ccccccc Confidence 00 0 00 01 1234445 Q ss_pred eeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHH--------Hhhhccccccccc Q lcl|NC_015279. 205 AFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRT--------IYKVSEQGAVSNT 276 (467) Q Consensus 205 aFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~--------l~~~a~~~k~~~~ 276 (467) ..+++++++.+|.-+-...+|-||++|-- +.++.|.+-|+..|..-+|+.||.- |.+.+....+... T Consensus 220 ~~~f~~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~ 294 (497) T protein:vir:78 220 SEEFARVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSA 294 (497) T ss_pred cccceeeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccccccccccccc Confidence 55667777777776777889999999942 3789999999999999999998841 1111111111100 Q ss_pred cc--------ceeEEeeccccchhHHHH-----HHHH----------------------HHHHHHHHHHHHHhhccCCcc Q lcl|NC_015279. 
277 AT--------AGVFDLDIDSNGRWSVEK-----FKGL----------------------LFQIERDANAIAQRTRRGKGN 321 (467) Q Consensus 277 ~~--------~gv~Dl~~~~~~r~~ve~-----~~~l----------------------~~~i~~ean~i~~~t~rg~gn 321 (467) .. .+..++..+..+.|.+.. .+.. ...-...+-...+++....++ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 374 (497) T protein:vir:78 295 SSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPN 374 (497) T ss_pred ccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCC Confidence 00 001111111111111110 0000 000011222233455666777 Q ss_pred EEEEchHHHHHHhhh----cchhcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEE-Eec- Q lcl|NC_015279. 322 MILCSADVASALTMA----GVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVG-YKG- 395 (467) Q Consensus 322 ~~i~S~~Va~~L~~s----G~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vG-yKG- 395 (467) .+|.+|.-...|... |-.-+.|...... .......++|. |++|++.+.... .++ ++| ++- T Consensus 375 ~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~---~~~~~~~~~l~-G~pV~~t~~~~~---------~~~-~~Gd~~~~ 440 (497) T protein:vir:78 375 AVVMNPRDWELLRLTKDANGQYMGGNFFGNAY---GNPVNGGKNIW-GVPVVTTPLIPL---------GTI-LVGHFAPS 440 (497) T ss_pred eEEEchHHHHHHHHhhcCCCceeccCcccccc---cccccCCceee-ceeeEecCCCCC---------Cce-EEeecccc Confidence 788888777666432 2111111100000 00011123664 588888766421 232 222 110 Q ss_pred CC----CccceeEecccchhhcccccCCccccceeeeeeeece-eecC--cccccCcccccccccc Q lcl|NC_015279. 396 TS----PYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGM-VANP--FAEGTTVGAGRLRVNS 454 (467) Q Consensus 396 ~~----~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l-~~nP--~~~~~~~~~~~~~~~~ 454 (467) .. 
..+-.+-..||.-.++ .+.|=.+=+..|+++ +.+| |...+-...+ .++ T Consensus 441 ~~~i~~r~~~~v~~~~~~~~~f------~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~---~~~ 497 (497) T protein:vir:78 441 VIQTARREGVTMQMTNSNGTDF------VDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA---TGS 497 (497) T ss_pred eEEEEEecccEEEeecccchhh------hcCcEEEEEEEeecceeeccccEEEEEecCCc---cCC Confidence 00 0011122223311111 122334444678865 6677 3332221111 222 No 55 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=79.90 E-value=0.1 Score=26.07 Aligned_cols=328 Identities=12% Similarity=0.040 Sum_probs=122.1 Q ss_pred Cc------chHHHHHhhhhhhccCc--cchhcchhHHHHHHHHhhhHHHHHHHHhhhhhc-------------------- Q lcl|NC_015279. 1 MF------QSEQLQEKWAPLLNYEG--LDKISDPHRRAVTAVLLENQEKFMQEQVAFEQG-------------------- 52 (467) Q Consensus 1 ~~------~~~~l~~kw~p~l~~~~--~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~-------------------- 52 (467) .. ..++..++...+.+... .-+..+..+.++ +.|.+..+ .++++.+.... T Consensus 16 ~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~-~~l~~~~~-~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (379) T protein:vir:10 16 QVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDM-AALQAHAD-KLDVKLKEKAKSEDKSDSLVKSITENFNDIK 93 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHH-HHHHHHHH-HHHHHHHhcccccccchhHHHHHHHHHHhHH Confidence 00 01112222211111000 000001111111 11111110 01111100000 Q ss_pred --chhhhhhhhhcccccccccccccc-cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccc Q lcl|NC_015279. 53 --GMIAEQPTNAVGNGGYTSSGGQTV-AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALF 129 (467) Q Consensus 53 --~~~~e~~~~~~g~~~~~st~tg~i-~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlf 129 (467) ...........|+....+..++.| ..+.+-++-+.|+ ...-.+++.|.||++++.-|.- .. 
T Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~---~~~i~~~~~~~~~~~~~~~~~~-------~~------ 157 (379) T protein:vir:10 94 EVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQ---MLNVSDIVGAVSISGGTYTFVR-------EN------ 157 (379) T ss_pred HHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHh---hhhHHhhceeeeccCCceEEEE-------ee------ Confidence 000000112223322222222222 2234444544543 4466788999999887543310 00 Q ss_pred cccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEE Q lcl|NC_015279. 130 DEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIE 209 (467) Q Consensus 130 nEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIE 209 (467) ++.+... .-.+| +...+++..+++ T Consensus 158 -----~~~~~~~----------------------------------------------~~v~E-----g~~~~~~~~~f~ 181 (379) T protein:vir:10 158 -----GAGEGAI----------------------------------------------GAQVE-----GATKGQKDYDIS 181 (379) T ss_pred -----cCCCccc----------------------------------------------ccccC-----Ccccccccccee Confidence 0000000 00011 122333444444 Q ss_pred EEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeecccc Q lcl|NC_015279. 210 KVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSN 289 (467) Q Consensus 210 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~ 289 (467) +++..+|.=+--...|-||.||-- +.++.|.+-|+..|+.-+|..++.-+.+....+.... .+ T Consensus 182 ~i~~~~~k~~~~~~iS~ell~D~~-----~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~------------~~ 244 (379) T protein:vir:10 182 MIDVNTDFIAGFTRYSKKMANNLP-----FLTSFIPNALRRDYAKAENAAFNAVLAANATASTEII------------TN 244 (379) T ss_pred eeEeeeeeEEeeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHhcccccccccccccc------------cC Confidence 444444444444779999999963 2678899999999998888888755443322111111 11 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-cccccCCce-eEEEecCceE Q lcl|NC_015279. 
290 GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDDTGNT-FAGVLQGKYR 367 (467) Q Consensus 290 ~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~t~~~-~~G~l~~~~~ 367 (467) . ..++..+.+++++.. .....+.+|++|.....|... ....+ .. +..+.+... -..+| .|++ T Consensus 245 ~-~~~d~i~~~~~~~~~---------~~~~~~~~vmn~~~~~~l~~l---kd~~G--~~l~~~~~~~~~~~~~~l-~G~p 308 (379) T protein:vir:10 245 K-NKVEMLINEIAKQEN---------LDFPVTAIVLRPTDYYDILVT---QKSVG--AGYGLPGVVTQDNGVLRI-NGIP 308 (379) T ss_pred c-ccHHHHHHHHHhhhh---------ccCCCCEEEEcHHHHHHHHHh---hccCC--ceeccCCccCCCCCccee-ccee Confidence 1 112333444444421 244566788999887777532 11111 00 110101000 01145 3689 Q ss_pred EEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhccccc--CCccccceeeeeeeeceee-cCcccccC Q lcl|NC_015279. 368 VYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAV--GENTFQPKIGFKTRYGMVA-NPFAEGTT 444 (467) Q Consensus 368 vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~--Dp~s~qP~~g~~tRY~l~~-nP~~~~~~ 444 (467) |+++++... .++++.-++ ..-+++--=+..+..+.. +-.+-+=.+=+..|+|+.+ +| T Consensus 309 vv~s~~~~a---------g~~~~gdf~-----~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p------ 368 (379) T protein:vir:10 309 LFRATWLAA---------NKYYVGDWT-----RVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQP------ 368 (379) T ss_pred eEecCCCCC---------CceEEeecc-----cEEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecC------ Confidence 999987531 122211111 001111100000000000 1122222222345775543 33 Q ss_pred ccccccccccccccceeeeecc Q lcl|NC_015279. 
445 VGAGRLRVNSNRYYRRVAVKNL 466 (467) Q Consensus 445 ~~~~~~~~~~n~y~r~~~v~~~ 466 (467) ..|-++-+..| T Consensus 369 -----------~a~v~~~~~~~ 379 (379) T protein:vir:10 369 -----------AALIFGDFTAV 379 (379) T ss_pred -----------ccEEEEEecCC Confidence 11233333333 No 56 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=77.35 E-value=0.13 Score=25.52 Aligned_cols=325 Identities=12% Similarity=0.028 Sum_probs=116.4 Q ss_pred Cc-----------chHHHHHhhhhh--------------------hccCcc---chhcchhHHHHHHHHhhhHHHHHHHH Q lcl|NC_015279. 1 MF-----------QSEQLQEKWAPL--------------------LNYEGL---DKISDPHRRAVTAVLLENQEKFMQEQ 46 (467) Q Consensus 1 ~~-----------~~~~l~~kw~p~--------------------l~~~~~---~~i~~~~~~~v~~~~~enq~~~~~e~ 46 (467) |- .+++..++-.-+ ++.... ++-+..........-+++..+...+. T Consensus 19 ~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (390) T protein:vir:81 19 LRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDR 98 (390) T ss_pred HHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhh Confidence 00 000011110000 000000 00000000000000000000000000 Q ss_pred hhhhhcchhhh--hhhhhccccccccccccc--ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCC Q lcl|NC_015279. 47 VAFEQGGMIAE--QPTNAVGNGGYTSSGGQT--VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQ 122 (467) Q Consensus 47 ~~~~~~~~~~e--~~~~~~g~~~~~st~tg~--i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~q 122 (467) . +..-.+ +..+.. ....++..|. .....+.++...| +..+..+++.+.||++++.-+.- ..+. 
T Consensus 99 ~----~~~~~~~~~~~~~~--~~~~~~~~g~~~~~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~----~~~~ 165 (390) T protein:vir:81 99 S----ARATMNIKAALNTA--STDAAGSAGALTTPNRLPGFITPPD---ARLTVRDLIGSGRTDSALIEYVQ----ETGF 165 (390) T ss_pred h----hhhhhHHHHHHHhh--ccccccCCcceechhhhHHHHHHHh---hhhhhhhhcceeeccCCceEEEE----EecC Confidence 0 000000 000000 0000111111 1122334444444 45567889999999887743321 1110 Q ss_pred CCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccc Q lcl|NC_015279. 123 GGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFN 202 (467) Q Consensus 123 sGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~ 202 (467) .+ .+ . ..++++........|. T Consensus 166 ~~-~a-------~---------------------------------------------------~v~Eg~~~~~~~~~~~ 186 (390) T protein:vir:81 166 VN-NA-------A---------------------------------------------------IVAEGALKPESSLKFA 186 (390) T ss_pred Cc-ce-------e---------------------------------------------------eecCCcccccccceee Confidence 00 00 0 0000000011122355 Q ss_pred eeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeE Q lcl|NC_015279. 203 EMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVF 282 (467) Q Consensus 203 EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~ 282 (467) ++.+++.|..+ ...+|-||.+|- . +.++.|.+-|+..|...+|+-||.- .-.+-...|++ T Consensus 187 ~i~~~~~k~~~-------~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~d~a~l~G--------~g~~~~~~Gi~ 246 (390) T protein:vir:81 187 KKTDTTHVIAH-------TMKATRQILSDA--P---QLASYMNNRLIRGLKVKEDAEILRG--------TGANDGLLGLI 246 (390) T ss_pred EEEEeeeEEEE-------eehhhHHHHHhH--H---HHHHHHHHHHHHHHHHHHHHHHHhc--------CCCCCccccee Confidence 55555555554 456788999984 2 4788899999999999998887732 11122234444 Q ss_pred Eeec------cccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCc Q lcl|NC_015279. 
283 DLDI------DSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGN 356 (467) Q Consensus 283 Dl~~------~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~ 356 (467) .... ...+-..++....+++++. ...+..+.+|++|.....|... ....+ ..+-.+... T Consensus 247 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~v~~~~~~~~l~~l---kd~~G--~~l~~~~~~- 311 (390) T protein:vir:81 247 PQATTYAAPTTIAGATRVDQLRLAMLQAS---------LAEYNPSGIVINPIDWAAIELA---KDANN--QYLIGNARG- 311 (390) T ss_pred ecccccccccccccchhHHHHHHHHHhhc---------cccCCCCEEEEcHHHHHHHHHh---hcCCC--ceeecCccc- Confidence 3211 1112223333444444432 2234556789999998887532 21111 111111111 Q ss_pred eeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeece-e Q lcl|NC_015279. 357 TFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGM-V 435 (467) Q Consensus 357 ~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l-~ 435 (467) .-.++| .|++|++..+..... .......++++++..++-..+.+-... | =.+-+=.+=...|++. + T Consensus 312 ~~~~~l-~G~pv~~~~~~p~~~-~~~gd~~~~~~~~~~~~~~v~~~~~~~-~----------~~~~~v~~r~~~r~d~~v 378 (390) T protein:vir:81 312 TLTPTL-WGLPVVATQAMAPGE-FLVGAFDLAAQIFDQWDARVEIGYVGE-D----------FQRNMITVLAEERLALVV 378 (390) T ss_pred ccCcee-cceeeEEcCCCCCCc-EEEEehhceEEEEEecceEEEEecccc-h----------hhcCcEEEEEEEeeccEE Confidence 112466 577888887642100 000001122222222222211110000 0 0011112223445544 2 Q ss_pred ecC--cccccCccccccccccccccceeeee Q lcl|NC_015279. 436 ANP--FAEGTTVGAGRLRVNSNRYYRRVAVK 464 (467) Q Consensus 436 ~nP--~~~~~~~~~~~~~~~~n~y~r~~~v~ 464 (467) .+| |+..+- . 
T Consensus 379 ~~~~a~v~~t~-------------------a 390 (390) T protein:vir:81 379 YRPEALISGSF-------------------A 390 (390) T ss_pred ecccceEEEEe-------------------C Confidence 333 111111 1 No 57 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=77.29 E-value=0.13 Score=25.51 Aligned_cols=335 Identities=16% Similarity=0.150 Sum_probs=118.3 Q ss_pred Cc--------chHHHHHhhhhhhc------------cCccchhcchhHHHHHHHHhhhHHHHHHHH--hhhh-----hcc Q lcl|NC_015279. 1 MF--------QSEQLQEKWAPLLN------------YEGLDKISDPHRRAVTAVLLENQEKFMQEQ--VAFE-----QGG 53 (467) Q Consensus 1 ~~--------~~~~l~~kw~p~l~------------~~~~~~i~~~~~~~v~~~~~enq~~~~~e~--~~~~-----~~~ 53 (467) +. ......++...-+. .+..+.+.+..++ .+...++...++. +.+. .+. T Consensus 73 l~ee~~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~----~~~~~~~~~~~~~e~~~~~~~~~~~~~ 148 (458) T protein:vir:10 73 LDEKSKKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAK----ALYGTQENFEDEVEKLVLLSYVMEKGV 148 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc----cchhhhhhHHHHHHHHHHHHHHHhhcc Confidence 00 00001111110000 0000111000000 0000100000000 0000 000 Q ss_pred hhhhhhhhhcccccccccc-ccc--c-cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccc Q lcl|NC_015279. 54 MIAEQPTNAVGNGGYTSSG-GQT--V-AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALF 129 (467) Q Consensus 54 ~~~e~~~~~~g~~~~~st~-tg~--i-~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlf 129 (467) ...+.............++ .+. + ..+.+-++.+.| +..+..+++-++||+++..-++ .. 
..+..| T Consensus 149 ~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~-~~-----~~~~~a-- 217 (458) T protein:vir:10 149 FETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQ---KELVVGALFEELPMSSKILTML-VE-----PDAGKA-- 217 (458) T ss_pred chhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHH---hhhhHHhhcceeecCCcceEEE-Ee-----cCCcce-- Confidence 0000000000000000111 111 1 123334445555 5667889999999988653222 11 000000 Q ss_pred cccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEE Q lcl|NC_015279. 130 DEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIE 209 (467) Q Consensus 130 nEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIE 209 (467) .|-+-+.. ...+...+ .....|.+..+ T Consensus 218 -----~~v~e~~~------------------------------------------~~~~~~~~---~~~~~~~~i~~--- 244 (458) T protein:vir:10 218 -----TWVAASTY------------------------------------------GTDTTTGE---EVKGALKEIHF--- 244 (458) T ss_pred -----eecccccc------------------------------------------cccccccc---cccccceeeEe--- Confidence 00000000 00000000 00122444444 Q ss_pred EEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee---- Q lcl|NC_015279. 210 KVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD---- 285 (467) Q Consensus 210 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~---- 285 (467) .++.-+-...+|-||.+|-- .|.+++|.+-|+..|..-||+.||.- ...+ ...|++... T Consensus 245 ----~~~k~~~~v~is~ell~ds~----~~~~~~i~~~l~~~i~~~~d~~~l~G----~G~~-----~p~Gi~~~~~~~~ 307 (458) T protein:vir:10 245 ----STYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAHAVSIEEAFMTG----DGSG-----KPKGLLTLASEDS 307 (458) T ss_pred ----eeeeEEeeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhcC----CCCC-----ccceeeecccccc Confidence 44444445678999988833 46788999999999999999988731 1111 123333221 Q ss_pred --------ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCce Q lcl|NC_015279. 
286 --------IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNT 357 (467) Q Consensus 286 --------~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~ 357 (467) .+...-...+....+.+.+.. ...+...+||+|.....|.. +....+- .-+..+..... T Consensus 308 ~~~~~~~~~~~~~~~~~~~i~~~~~~l~~---------~~~~~~~~v~~~~~~~~l~~---lkd~~G~-~i~~~~~~~~~ 374 (458) T protein:vir:10 308 AKVVTEAKADGSVLVTAKTISKLRRKLGR---------HGLKLSKLVLIVSMDAYYDL---LEDEEWQ-DVAQVGNDSVK 374 (458) T ss_pred cceeecccccccccccHHHHHHHHHhhhh---------hhcCCCEEEEcHHHHHHHHh---hcccCCc-eeecccccccc Confidence 111111111222233333311 11234567899988887753 1211110 00000001111 Q ss_pred ---eEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeee--eee Q lcl|NC_015279. 358 ---FAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFK--TRY 432 (467) Q Consensus 358 ---~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~--tRY 432 (467) -.++|+ |++|+++.+.-... ...+.++..++ + +.++.. -..+....||-+-...++|. .|. T Consensus 375 ~~~~~~~l~-G~pv~~~~~~p~~~-----~~~~~~~~~f~-~-----~~~~~~--~~~~~v~~d~~~~~~~~~~~~~~r~ 440 (458) T protein:vir:10 375 LQGQVGRIY-GLPVVVSEYFPAKA-----NSAEFAVIVYK-D-----NFVMPR--QRAVTVERERQAGKQRDAYYVTQRV 440 (458) T ss_pred ccCcCceec-ceeeEEcccccccc-----CCcceEEEEec-c-----cEEEEE--eeceEEEeecccCCCceEEEEEEEe Confidence 123675 79999987752111 01122222222 1 000100 01111123544444455655 466 Q ss_pred ce-eecC--cccccCccc Q lcl|NC_015279. 433 GM-VANP--FAEGTTVGA 447 (467) Q Consensus 433 ~l-~~nP--~~~~~~~~~ 447 (467) |+ +.+| |+.++-.+. 
T Consensus 441 ~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 441 NLQRYFANGVVSGTYAAS 458 (458) T ss_pred cceEecccceEEEeeccC Confidence 43 3455 322221111 No 58 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=75.86 E-value=0.14 Score=25.23 Aligned_cols=333 Identities=16% Similarity=0.150 Sum_probs=119.0 Q ss_pred CcchHHHHHhhh---------------------hhhccCccchhcchhHHHHHHHHhhhHHHHH-HHHhhhhhcchhhhh Q lcl|NC_015279. 1 MFQSEQLQEKWA---------------------PLLNYEGLDKISDPHRRAVTAVLLENQEKFM-QEQVAFEQGGMIAEQ 58 (467) Q Consensus 1 ~~~~~~l~~kw~---------------------p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~-~e~~~~~~~~~~~e~ 58 (467) +.+.+.|.++.+ +.+..+.-. -.+..+++.....+.+=...+ .+++ ..+.|. T Consensus 41 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~l~~~~~~~~~~e~-----~~~~~~ 114 (409) T protein:vir:45 41 KSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNS-QQDEKRAQVFDKWMRHGASELTSEER-----KALREL 114 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCcc-hhhHHHHHHHHHHHHhhhhhccHHHH-----HHHHHH Confidence 222222222222 222222211 112222222222222210000 1111 011221 Q ss_pred hhhhccccccccccccc---ccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccc Q lcl|NC_015279. 59 PTNAVGNGGYTSSGGQT---VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTA 135 (467) Q Consensus 59 ~~~~~g~~~~~st~tg~---i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~ 135 (467) ...+. .+...|. -..+.+.++.+.| +..+..+++-|-|+++.....+-... ... .. 
T Consensus 115 ~a~~~-----~~~~~gg~liP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~-~~--------- 173 (409) T protein:vir:45 115 RAQGV-----AQDEKGGYTVPETFLAKVVEKMK---SYGGIASVAQILTTSDGRTMEWATAD---GTS-EV--------- 173 (409) T ss_pred hhccC-----ccCcCCceeccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEEeec---cCc-cc--------- Confidence 11100 0111111 1223344555555 33345678888888765544432221 000 00 Q ss_pred cccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEe Q lcl|NC_015279. 136 FAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTA 215 (467) Q Consensus 136 fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtA 215 (467) +-...+++....+...|.+..|.--|.. T Consensus 174 -------------------------------------------------~~~v~E~~~~~~~~~~f~~~~l~~~k~~--- 201 (409) T protein:vir:45 174 -------------------------------------------------GVLLGENEEAGEEDTDFGMGSLGALKMT--- 201 (409) T ss_pred -------------------------------------------------cccccccccccccccccceeeeeeeeee--- Confidence 0000111111111223444444332221 Q ss_pred ecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeecccc-----c Q lcl|NC_015279. 216 KSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSN-----G 290 (467) Q Consensus 216 KSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~-----~ 290 (467) +-=..+|-||.+|- .+|.+++|.+-|+..|.+-+|+.||.-= |........|++.-..... + T Consensus 202 ---~~~i~is~ell~ds----~~~l~~~i~~~la~a~~~~~~~a~l~G~------G~~~~~~p~Gil~~~~~~~~~~~~~ 268 (409) T protein:vir:45 202 ---SKIIRVSNELLQDS----AIDMEAYLARRIAERIGRGEARYLIQGT------GAGTPKQPKGLAASVTGTTQTAAAN 268 (409) T ss_pred ---eeehhhhHHHHhcc----HHHHHHHHHHHHHHHHHHHHHHHhhccC------CCCCccccceeeecccccccccccc Confidence 11135799999994 2578999999999999999999988310 0000011233332211000 0 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhccCCccE-EEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEE Q lcl|NC_015279. 
291 RWSVEKFKGLLFQIERDANAIAQRTRRGKGNM-ILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVY 369 (467) Q Consensus 291 r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~-~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy 369 (467) --..+....|++.+. .--+..+.+ ++|++.....|.. |....+ ..-+..|.+.. -.++|. |++|+ T Consensus 269 ~~~~d~i~~l~~~l~--------~~~~~~a~~~~~~n~~~~~~l~~---lkd~~G-~~i~~~~~~~~-~~~~l~-G~PV~ 334 (409) T protein:vir:45 269 AVKWQEILALKHSID--------PAYRRGPKFRLAFNDNTLKLISE---MEDGQG-RPLWLPDIVGV-APASVL-NVPYV 334 (409) T ss_pred ccchHHHHHHHHhhh--------hhhccCCeEEEEECHHHHHHHHH---hhcCCC-ceeeccCcCCC-CCceec-ceeeE Confidence 001122333333332 223445666 5788887766643 221111 00012221111 124674 47998 Q ss_pred ecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeee--eeecee-ecCcccccCcc Q lcl|NC_015279. 370 IDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFK--TRYGMV-ANPFAEGTTVG 446 (467) Q Consensus 370 ~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~--tRY~l~-~nP~~~~~~~~ 446 (467) ++.+...+. .+.+-+++| +-. . .+...--........||-.-...++|. .||+.. .||=+ T Consensus 335 ~~~~~p~~~-----~~~~~i~~G---d~~-~--~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A------ 397 (409) T protein:vir:45 335 IDQEIDDIG-----AGKKFMFCG---DFD-R--FIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSA------ 397 (409) T ss_pred EecCcCCcc-----CCccEEEEe---ehh-h--hheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhh------ Confidence 887643110 011112222 110 0 000000001111122443323344443 366543 34421 Q ss_pred ccccccccccccceeeeeccC Q lcl|NC_015279. 
447 AGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 447 ~~~~~~~~n~y~r~~~v~~~~ 467 (467) |+.+.+|.-= T Consensus 398 -----------~~~l~~k~s~ 407 (409) T protein:vir:45 398 -----------IKALVGKGSV 407 (409) T ss_pred -----------eEEEEeccCC Confidence 1111111111 No 59 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=75.00 E-value=0.15 Score=25.07 Aligned_cols=260 Identities=12% Similarity=0.016 Sum_probs=112.4 Q ss_pred ccc--cccccccccc-----ccc-ccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CCCcc Q lcl|NC_015279. 131 EAD--TAFAGQNEGF-----DLT-NGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SGDNF 201 (467) Q Consensus 131 Ead--t~fSg~~a~~-----~~~-~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g~~f 201 (467) +++ |..+.---.- -.. .......++... .+. .+ ........+++.--.+.++|.+.. .+-.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~-----~~~-~l---~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAE-----IDN-TL---VGQPGDTLTFPAFIYSGDAKVVAEGEKIPT 71 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccce-----ecc-cc---cCCCCCEEEeeeecCCCccccccCCCccch Confidence 222 1111000000 000 000000001100 000 00 000111122221111233444422 12234 Q ss_pred ceeeeEEEEEEEEeecccccccccHHHHHHHHHhhC-CChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccce Q lcl|NC_015279. 202 NEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHG-LNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAG 280 (467) Q Consensus 202 ~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHG-LDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~g 280 (467) .++..+ +.+++.+-|+-.-+++ |+-+..+ -|.-.|..+-++..+..+++++++..+.+....- ... 
T Consensus 72 ~~lt~~--~~~~~i~~~~~a~~i~-----D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~------~~~ 138 (274) T protein:vir:95 72 DILETK--KREAKIRKIAKGTSIS-----DEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTV------EAD 138 (274) T ss_pred hhcccc--eeEEEeeeeecceeeh-----HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccc Confidence 444433 3333334443222222 5555443 5888999999999999999999998876543321 111 Q ss_pred eEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEE Q lcl|NC_015279. 281 VFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAG 360 (467) Q Consensus 281 v~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G 360 (467) .+ + .+.+-..+.++..| -..+++++++|.|++.|.-....++.+......+. ..+-..| T Consensus 139 ~~------~----~d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~--~~~G~ig 197 (274) T protein:vir:95 139 IT------K----LTGLQTAIDKFNDE---------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDV--IVKGAFG 197 (274) T ss_pred cc------C----HHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhccccccccccccccc--eeccccc Confidence 11 1 12233344444332 13678999999999999775544443332211111 1122467 Q ss_pred EecCceEEEecccccccchhhccCCCceEEEEEe-cCCCccceeEecccchhhcccc-cCCccccceeeeeeeeceee-c Q lcl|NC_015279. 361 VLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYK-GTSPYDAGLFYCPYVPLQMVRA-VGENTFQPKIGFKTRYGMVA-N 437 (467) Q Consensus 361 ~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyK-G~~~~d~glfyaPYv~l~~~~~-~Dp~s~qP~~g~~tRY~l~~-n 437 (467) .+ .|++||.|... | +|-.+-++ |.-. ||.. -+.. +.+ =||.+++=.+-..-+||+.+ | T Consensus 198 ~~-~G~~Vi~s~~~----------~-~~t~~l~~~gA~~-----~~~~-~~~~-vE~~Rd~~~~~d~i~~~~~y~~~~~~ 258 (274) T protein:vir:95 198 EA-LGAVIVRSNKL----------E-AGTAILAKKGAVK-----LITK-RDFF-LETDRDPSTKTTALYSDKHYVAYLYD 258 (274) T ss_pred ee-cCeEEEEeCCC----------C-CceEEEEecccee-----eeec-CCcc-cccccccccccCEEEEeEEEEEEEEc Confidence 77 57999999653 2 23222222 2111 1111 0111 112 28888998888888888754 4 Q ss_pred Cc-ccccCcccccccc Q lcl|NC_015279. 
438 PF-AEGTTVGAGRLRV 452 (467) Q Consensus 438 P~-~~~~~~~~~~~~~ 452 (467) |= ....+-+...+.. T Consensus 259 ~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 259 ESKAVKITKGSGSLEM 274 (274) T ss_pred CCcEEEEEcCCccccC Confidence 41 1111111111111 No 60 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=75.00 E-value=0.15 Score=25.07 Aligned_cols=260 Identities=12% Similarity=0.016 Sum_probs=112.4 Q ss_pred ccc--cccccccccc-----ccc-ccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CCCcc Q lcl|NC_015279. 131 EAD--TAFAGQNEGF-----DLT-NGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SGDNF 201 (467) Q Consensus 131 Ead--t~fSg~~a~~-----~~~-~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g~~f 201 (467) +++ |..+.---.- -.. .......++... .+. .+ ........+++.--.+.++|.+.. .+-.. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~-----~~~-~l---~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAE-----IDN-TL---VGQPGDTLTFPAFIYSGDAKVVAEGEKIPT 71 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccce-----ecc-cc---cCCCCCEEEeeeecCCCccccccCCCccch Confidence 222 1111000000 000 000000001100 000 00 000111122221111233444422 12234 Q ss_pred ceeeeEEEEEEEEeecccccccccHHHHHHHHHhhC-CChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccce Q lcl|NC_015279. 202 NEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHG-LNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAG 280 (467) Q Consensus 202 ~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHG-LDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~g 280 (467) .++..+ +.+++.+-|+-.-+++ |+-+..+ -|.-.|..+-++..+..+++++++..+.+....- ... 
T Consensus 72 ~~lt~~--~~~~~i~~~~~a~~i~-----D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~------~~~ 138 (274) T protein:vir:96 72 DILETK--KREAKIRKIAKGTSIS-----DEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTV------EAD 138 (274) T ss_pred hhcccc--eeEEEeeeeecceeeh-----HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------ccc Confidence 444433 3333334443222222 5555443 5888999999999999999999998876543321 111 Q ss_pred eEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEE Q lcl|NC_015279. 281 VFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAG 360 (467) Q Consensus 281 v~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G 360 (467) .+ + .+.+-..+.++..| -..+++++++|.|++.|.-....++.+......+. ..+-..| T Consensus 139 ~~------~----~d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~--~~~G~ig 197 (274) T protein:vir:96 139 IT------K----LTGLQTAIDKFNDE---------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDV--IVKGAFG 197 (274) T ss_pred cc------C----HHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhccccccccccccccc--eeccccc Confidence 11 1 12233344444332 13678999999999999775544443332211111 1122467 Q ss_pred EecCceEEEecccccccchhhccCCCceEEEEEe-cCCCccceeEecccchhhcccc-cCCccccceeeeeeeeceee-c Q lcl|NC_015279. 361 VLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYK-GTSPYDAGLFYCPYVPLQMVRA-VGENTFQPKIGFKTRYGMVA-N 437 (467) Q Consensus 361 ~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyK-G~~~~d~glfyaPYv~l~~~~~-~Dp~s~qP~~g~~tRY~l~~-n 437 (467) .+ .|++||.|... | +|-.+-++ |.-. ||.. -+.. +.+ =||.+++=.+-..-+||+.+ | T Consensus 198 ~~-~G~~Vi~s~~~----------~-~~t~~l~~~gA~~-----~~~~-~~~~-vE~~Rd~~~~~d~i~~~~~y~~~~~~ 258 (274) T protein:vir:96 198 EA-LGAVIVRSNKL----------E-AGTAILAKKGAVK-----LITK-RDFF-LETDRDPSTKTTALYSDKHYVAYLYD 258 (274) T ss_pred ee-cCeEEEEeCCC----------C-CceEEEEecccee-----eeec-CCcc-cccccccccccCEEEEeEEEEEEEEc Confidence 77 57999999653 2 23222222 2111 1111 0111 112 28888998888888888754 4 Q ss_pred Cc-ccccCcccccccc Q lcl|NC_015279. 
438 PF-AEGTTVGAGRLRV 452 (467) Q Consensus 438 P~-~~~~~~~~~~~~~ 452 (467) |= ....+-+...+.. T Consensus 259 ~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 259 ESKAVKITKGSGSLEM 274 (274) T ss_pred CCcEEEEEcCCccccC Confidence 41 1111111111111 No 61 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=72.73 E-value=0.18 Score=24.68 Aligned_cols=353 Identities=14% Similarity=0.089 Sum_probs=129.0 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhh---hHHH-------HHHHHhhhhhcchhhhhhhh-----hccc Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLE---NQEK-------FMQEQVAFEQGGMIAEQPTN-----AVGN 65 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~e---nq~~-------~~~e~~~~~~~~~~~e~~~~-----~~g~ 65 (467) ....+.-.. .-...+..++ ...++......+. ++.+ ..+..+.........+.... ...+ T Consensus 82 ~~~~~~~~~---~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (477) T protein:vir:84 82 KLEAETKTV---RKATVEVNEA--LTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRD 156 (477) T ss_pred cchhhhhhh---cccccccccc--hhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhcc Confidence 000000000 0000111100 0001000000000 0000 00000000000000000000 0001 Q ss_pred ccccccccccccccCchhh--hhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCC-ccccccccccccccccc Q lcl|NC_015279. 66 GGYTSSGGQTVAGFDPVLI--SLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGT-EALFDEADTAFAGQNEG 142 (467) Q Consensus 66 ~~~~st~tg~i~~~~P~Lv--~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGt-EAlfnEadt~fSg~~a~ 142 (467) ....++..|... -|..+ .++...-+..+..+++++.||++.+|-+-=-|.. +|. .+ .+. 
T Consensus 157 ~~~~~~~gg~lv--~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~----~~~~~a-------~~~----- 218 (477) T protein:vir:84 157 LDRNGGTGGYAV--PPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKIL----TGTSTA-------IQA----- 218 (477) T ss_pred ccccCCCcceee--ccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEe----cCccee-------eee----- Confidence 101111112111 12211 2444444666778999999999988854322211 110 00 000 Q ss_pred ccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccc Q lcl|NC_015279. 143 FDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKA 222 (467) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKA 222 (467) .++... .....++...+++.++..+|.-+-.. T Consensus 219 ----------------------------------------------~Eg~~~--~~~~~~~s~~~f~~i~~~~~k~~~~~ 250 (477) T protein:vir:84 219 ----------------------------------------------ADNAAL--TAPSAHEVDLTDGFVQANVKTIAGQQ 250 (477) T ss_pred ----------------------------------------------ccCccc--ccccccccccceeeEEEeeeeEEeee Confidence 000000 00123444456677777777777778 Q ss_pred cccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeec------cc-cchhHHH Q lcl|NC_015279. 223 EYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDI------DS-NGRWSVE 295 (467) Q Consensus 223 EYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~------~~-~~r~~ve 295 (467) .+|-||.+|-. .|.++.|.+-|+..|..-|++.||.- .-.+-...|++.... .. ..-|. T Consensus 251 ~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~~~l~G--------~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~-- 316 (477) T protein:vir:84 251 GIAIQLLDQAA----VSVDEFVFRDLAADYANKLNVQVISG--------TGSNNQVVGVRATAGITQVTATSAGSALE-- 316 (477) T ss_pred HHHHHHHhccc----hhHHHHHHHHHHHHHHHHHHHHHhcc--------CCCCCccceeeeccccccccccccccchh-- Confidence 89999999843 56899999999999999999988821 111112456654321 11 01111 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhh----hc---chhccccccc-ccccccCCceeEEEecCceE Q lcl|NC_015279. 
296 KFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTM----AG---VLDYTPALNA-NLNVDDTGNTFAGVLQGKYR 367 (467) Q Consensus 296 ~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~----sG---~~~~~~~~~~-~~~~d~t~~~~~G~l~~~~~ 367 (467) ....+ +.-...+..-.....+-.+..+|++|.....|.. .| |....++.+. .+....-.....|+| .|++ T Consensus 317 ~~~~~-~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l-~G~p 394 (477) T protein:vir:84 317 KHQII-YQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQM-HGLP 394 (477) T ss_pred hHHHH-HHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchh-cccc Confidence 11111 1111222222222333345667888876665533 23 1111111111 112222233346777 5789 Q ss_pred EEecccccccchhhccCCCceEEEEEecCCCc-cc--eeEecccchhhcccccCCccccceeeeeeeec-----eeecC- Q lcl|NC_015279. 368 VYIDPYSSNLTSANAANGNQYYVVGYKGTSPY-DA--GLFYCPYVPLQMVRAVGENTFQPKIGFKTRYG-----MVANP- 438 (467) Q Consensus 368 vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~-d~--glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~-----l~~nP- 438 (467) |+++++.-. +.+.-....-+++|--++.-. +. .+.-.||.= .-...+.|.+ || .+-+| T Consensus 395 Vv~s~~~p~--~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~~~----------~~~~~~~~~v-~~~~~~~~~r~~~ 461 (477) T protein:vir:84 395 VVTDPTLPT--TLGTGTDQDVIHVLRASDLALFESSVRMRALQETR----------AENLSVLLQV-YGYLAFTAARFPQ 461 (477) T ss_pred eEecCcccc--cccccCCcceEEEEEeceEEEEeeceeEEeccccc----------cccceeeeee-hhhhhhhhhcccc Confidence 999987521 000111112334443311100 00 122223210 1112222211 22 11245 Q ss_pred -cccccCccccccccc Q lcl|NC_015279. 439 -FAEGTTVGAGRLRVN 453 (467) Q Consensus 439 -~~~~~~~~~~~~~~~ 453 (467) |+..+-.+...-... 
T Consensus 462 afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 462 SVVEIGGTALTAPTFA 477 (477) T ss_pred ceEEeecccccccccC Confidence 333221111100111 No 62 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=69.53 E-value=0.22 Score=24.17 Aligned_cols=268 Identities=13% Similarity=0.108 Sum_probs=113.8 Q ss_pred hhhhhhhhhccccccccccccccc---ccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccc Q lcl|NC_015279. 54 MIAEQPTNAVGNGGYTSSGGQTVA---GFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFD 130 (467) Q Consensus 54 ~~~e~~~~~~g~~~~~st~tg~i~---~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfn 130 (467) +|..-.. .+++.|... .+.+.++.+.| +..+-.+++.+=||++.+|-+==.+ ....++ +| T Consensus 1 ~l~~~~~--------~t~~~gg~liP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~g~~~~~~--~~~~~~-~a--- 63 (293) T protein:vir:48 1 MLDSKTD--------HSGSDAGLTIPQDIRTAINTLVR---QYDSLQEYVNVENVTTLTGSRVYEK--WTDITG-LA--- 63 (293) T ss_pred Cceeecc--------cccCcCceEechhHHHHHHHHHH---hhhhhhhhceeeeccCCcceEEEEe--ecCCCc-ce--- Confidence 2222111 111122111 12233344444 5667788888888887665211111 000000 00 Q ss_pred ccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceee-eEEE Q lcl|NC_015279. 131 EADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMA-FSIE 209 (467) Q Consensus 131 Eadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMa-FsIE 209 (467) . ..+ | +..++|.+ .+++ T Consensus 64 ----~---------------------------------------------------~v~--E-----g~~~~~~~~~~~~ 81 (293) T protein:vir:48 64 ----N---------------------------------------------------IDD--E-----AGKIADIDDPKLS 81 (293) T ss_pred ----e---------------------------------------------------eec--C-----Cccccccccccee Confidence 0 001 1 12233332 3455 Q ss_pred EEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeecccc Q lcl|NC_015279. 
210 KVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSN 289 (467) Q Consensus 210 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~ 289 (467) +++..+|.-+-...+|-||.+|. .+|.|++|.+-|+..|..-+|+.|+.-+-+.+. ..+..++ T Consensus 82 ~i~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--------~~~~~~~----- 144 (293) T protein:vir:48 82 LIKYTIKRYAGISTVTNSLLADS----AENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--------KPTLTKW----- 144 (293) T ss_pred EEEEeeeEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHhHHhhccccccc--------cccccCH----- Confidence 55555565566678999999986 367899999999999999999998854433222 1122211 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEE Q lcl|NC_015279. 290 GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVY 369 (467) Q Consensus 290 ~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy 369 (467) +....++.++... -+. ....+|++.....|... +...+ ..+-.++...-..++| .|++|+ T Consensus 145 -----d~i~~~~~~l~~~--------~~~-~a~~vmn~~~~~~L~~l---kd~~g--~~l~~~~~~~~~~~~l-~G~Pv~ 204 (293) T protein:vir:48 145 -----DDIIDLEAKVDPA--------IKQ-TSFFLTNTSGFTALKKV---KNALG--DYLMERDVKSPTGYSI-AGFAVK 204 (293) T ss_pred -----HHHHHHHHhhhhh--------hcC-CCEEEEcHHHHHHHHHh---hccCC--ceEeecCcCCCCCcee-cceeeE Confidence 2233344444211 122 24567888888777542 21111 1111111111123566 456766 Q ss_pred e--cccccccch--h--hccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeec---------- Q lcl|NC_015279. 370 I--DPYSSNLTS--A--NAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYG---------- 433 (467) Q Consensus 370 ~--D~y~~~~~~--~--~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~---------- 433 (467) + |.+..+... . 
..-...++++++.++....+-+-.+.-| -.+-|=.+-...||+ T Consensus 205 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~----------~~~~~~~~r~~~r~d~~~~~~~a~~ 274 (293) T protein:vir:48 205 EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGA----------FETDTTKVRVIDRFDVVATDTEAFV 274 (293) T ss_pred EecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchh----------hhcCeEEEEEEEeeCcEEecccceE Confidence 4 322211000 0 0001123444444433222211111111 112222333334443 Q ss_pred -----eeecCcccccCcccccc Q lcl|NC_015279. 434 -----MVANPFAEGTTVGAGRL 450 (467) Q Consensus 434 -----l~~nP~~~~~~~~~~~~ 450 (467) =.+-|+.+..+-+ + T Consensus 275 ~l~~~~~~~~~~~~~~~~---~ 293 (293) T protein:vir:48 275 PASFKAIADQKGNIGSTA---V 293 (293) T ss_pred EEEeeccccCCccccccC---C Confidence 3333333222211 1 No 63 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=68.44 E-value=0.24 Score=24.01 Aligned_cols=324 Identities=12% Similarity=0.074 Sum_probs=116.4 Q ss_pred Cc------------chHHHHHhhhhh------hccCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhh--cchhhhh-- Q lcl|NC_015279. 1 MF------------QSEQLQEKWAPL------LNYEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQ--GGMIAEQ-- 58 (467) Q Consensus 1 ~~------------~~~~l~~kw~p~------l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~--~~~~~e~-- 58 (467) +. ..++|+++=.-+ ++.++.+. +...+.+-...-++.+ +.+....... +..-.+. T Consensus 32 ~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 108 (390) T protein:vir:97 32 LNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGG--DVQHVSVGDMFVASEQ-FQASTGRWNDRSARATMNIKA 108 (390) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc--ccccccchhhhhhhHH-HHHHHHHhhhhhhhhhhHHHH Confidence 10 001111111000 00000000 0000000000000000 0000000000 0000000 Q ss_pred hhhhcccccccccccccc--cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccc Q lcl|NC_015279. 
59 PTNAVGNGGYTSSGGQTV--AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAF 136 (467) Q Consensus 59 ~~~~~g~~~~~st~tg~i--~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~f 136 (467) ..+ ......++..|.+ ...-+.++.+.| +..+..+++.+-||++++.-+.- ..+.++ . ..| T Consensus 109 ~~~--~~~~~~~~~~g~lip~~~~~~ii~~~~---~~~~i~~~~~~~~~~~~~~~~~~----~~~~~~-~-------a~~ 171 (390) T protein:vir:97 109 ALN--TASTDAAGSAGALTTPNRLPGFITPPD---ARLTVRDLIGSGRTDSALIEYVQ----ETGFVN-N-------AAI 171 (390) T ss_pred HHH--hhhcccccccccccchhhhHHHHHHHh---hhhhhHhhcceeeccCCceEEEE----EecCCc-c-------eee Confidence 001 0000011111211 122334444444 55567788999999877643211 111000 0 000 Q ss_pred ccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEee Q lcl|NC_015279. 137 AGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAK 216 (467) Q Consensus 137 Sg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAK 216 (467) .++++....+...|.+..|.+.|..+ T Consensus 172 ---------------------------------------------------v~Eg~~~~~~~~~~~~i~~~~~k~~~--- 197 (390) T protein:vir:97 172 ---------------------------------------------------VAEGALKPESSLKFAKKTDTTHVIAH--- 197 (390) T ss_pred ---------------------------------------------------ecCCccccccccceeEEEEeeeeEEE--- Confidence 00000000111235555555555444 Q ss_pred cccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeec------cccc Q lcl|NC_015279. 217 SRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDI------DSNG 290 (467) Q Consensus 217 SRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~------~~~~ 290 (467) ...+|-||.+|-- +.++.|.+-|+..|...+|+.||.- .-.+-...|++.... 
...+ T Consensus 198 ----~~~is~ell~ds~-----~l~~~i~~~la~a~~~~~d~a~l~G--------~g~~~~p~Gi~~~~~~~~~~~~~~~ 260 (390) T protein:vir:97 198 ----TMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAEILRG--------TGANDGLLGLIPQATTYAAPTTIAG 260 (390) T ss_pred ----eehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhhc--------CCCCccccceeeccccccccccccc Confidence 5779999999852 4788888888888888888887731 111112344443211 0111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEe Q lcl|NC_015279. 291 RWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYI 370 (467) Q Consensus 291 r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~ 370 (467) --..+....++.++ .......+-+|++|.....|.. +....+ ..+-.+..+ .-.++| .|++|++ T Consensus 261 ~~~~d~~~~~~~~~---------~~~~~~~~~~v~n~~~~~~L~~---lkd~~G--~~l~~~~~~-~~~~~l-~G~pV~~ 324 (390) T protein:vir:97 261 ATRVDQLRLAMLQA---------SLAEYPASGIVINPIDWAAIEL---AKDANN--QYLIGNARG-TLTPTL-WGLPVVA 324 (390) T ss_pred cchHHHHHHHHHhh---------ccccCCCCEEEEcHHHHHHHHH---hhcCCC--ceeecCccC-CCCcee-cceeeEE Confidence 11112222222222 2233456678899998888863 222211 111112111 113466 5778888 Q ss_pred cccccccchhhccCCCceEEEE-EecCCCccceeEecccchhhcccccCC---ccccceeeeeeeeceee-cCcccccCc Q lcl|NC_015279. 371 DPYSSNLTSANAANGNQYYVVG-YKGTSPYDAGLFYCPYVPLQMVRAVGE---NTFQPKIGFKTRYGMVA-NPFAEGTTV 445 (467) Q Consensus 371 D~y~~~~~~~~~~~~~dY~~vG-yKG~~~~d~glfyaPYv~l~~~~~~Dp---~s~qP~~g~~tRY~l~~-nP~~~~~~~ 445 (467) ++... ..-+++| ++ .++++...-.+......+. .+-+=.+-...||++.+ +|= T Consensus 325 ~~~~~----------~~~~~~gd~~------~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~------ 382 (390) T protein:vir:97 325 TQAMA----------PGEFLVGAFD------LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE------ 382 (390) T ss_pred cCCCC----------CCcEEEEecc------ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccc------ Confidence 87642 2223333 21 0111111100000000011 12222333445666654 231 Q ss_pred cccccccccccccceeeee Q lcl|NC_015279. 
446 GAGRLRVNSNRYYRRVAVK 464 (467) Q Consensus 446 ~~~~~~~~~n~y~r~~~v~ 464 (467) .|.++-+. T Consensus 383 -----------a~v~~~~a 390 (390) T protein:vir:97 383 -----------ALITGSFA 390 (390) T ss_pred -----------cEEEEEeC Confidence 01111111 No 64 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=62.89 E-value=0.33 Score=23.25 Aligned_cols=320 Identities=12% Similarity=0.070 Sum_probs=122.8 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHH-HHHHHHhhhH---------------------------HHHHHHHhhh--h Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRR-AVTAVLLENQ---------------------------EKFMQEQVAF--E 50 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~-~v~~~~~enq---------------------------~~~~~e~~~~--~ 50 (467) +-+.+++++++.-+.+ +|.+...+ +....+.+.. .+...+.... . T Consensus 41 ~~~~~e~~~~~~~l~~-----ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 115 (400) T protein:vir:38 41 LKKAEGVRAKYDKAGK-----EIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTD 115 (400) T ss_pred HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 3333444444444322 11111000 0000111000 0000000000 0 Q ss_pred hcc----hhhhhhh-hhcccccccccccccc--cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCC Q lcl|NC_015279. 51 QGG----MIAEQPT-NAVGNGGYTSSGGQTV--AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG 123 (467) Q Consensus 51 ~~~----~~~e~~~-~~~g~~~~~st~tg~i--~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs 123 (467) ... .-.+... ..... +..++..|.+ ..+.+.++.+.| +..+..+++.+.||++.++-+--++. 
.+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~-~~~~~~gg~~vP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~----~~ 187 (400) T protein:vir:38 116 VGTFAVLRAVPTDASDAVNA-GVKAADAASTIPETISNTPQRELQ---TVVDLKPFTNVFQASTQKGTYPTVAN----AT 187 (400) T ss_pred HHHHhhhhhhhHHHHHHHhh-cccccCCcccccHHHHHHHHHHHH---hhhhhhhcceeEeccCcceEEEEEec----CC Confidence 000 0000000 00000 0111111211 123344444555 55678899999999988775433331 00 Q ss_pred CCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccce Q lcl|NC_015279. 124 GTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNE 203 (467) Q Consensus 124 GtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~E 203 (467) +.-+.+.| .+.. .+ .+...|.+ T Consensus 188 ~~~~~~~E------------------------------~~~~-------------------------~~---~~~~~f~~ 209 (400) T protein:vir:38 188 TKMVTVAE------------------------------LEKN-------------------------PA---MAKPEFKP 209 (400) T ss_pred Cccccccc------------------------------cccc-------------------------cc---ccccccee Confidence 00000000 0000 00 00123555 Q ss_pred eeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEE Q lcl|NC_015279. 204 MAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFD 283 (467) Q Consensus 204 MaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~D 283 (467) ..|.+.|. +-...+|-||.+|- ..|.+++|.+-|+..|...+|+-|+.-.-. ....|+.. T Consensus 210 i~~~~~k~-------~~~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~---------~~~~~~~~ 269 (400) T protein:vir:38 210 VNWSVETY-------RQALPVSQESIDDS----AIDLVGLIAQNGQQIKVNTTNGAVATLLKG---------FTAKTISS 269 (400) T ss_pred eEeehhhe-------eeehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhhhhcccc---------cccccccc Confidence 55554444 44677999999986 347888999999999999999888743321 12222222 Q ss_pred eeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEec Q lcl|NC_015279. 
284 LDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQ 363 (467) Q Consensus 284 l~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~ 363 (467) + .....++. .... ..+. ..+|++|.....|..- ....+- --+..+.++ -..++| T Consensus 270 ~----------~~~~~~~~-~~~~--------~~~~-a~~v~~~~~~~~l~~l---kd~~G~-~i~~~~~~~-~~~~~l- 323 (400) T protein:vir:38 270 V----------DDLKHINN-VDLD--------PAYS-RVIIASQSFYNFLDTV---KDGNGR-YLLQDSILT-PSGKSV- 323 (400) T ss_pred H----------HHHHHHHH-hhhh--------hhhC-cEEEEcHHHHHHHHHh---hccCCC-eeeecCcCC-CCcccc- Confidence 1 11111111 1111 1122 3467788888777542 211110 001111111 113466 Q ss_pred CceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cC--cc Q lcl|NC_015279. 364 GKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NP--FA 440 (467) Q Consensus 364 ~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP--~~ 440 (467) .|++|++..+.... ..+...+++|--- ..+....... ......|-..|+..+-...|++..+ +| |. T Consensus 324 ~G~pv~~~~~~~~~-----~~g~~~~~~gd~s-----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~ 392 (400) T protein:vir:38 324 LGMPIAVVSDDTLG-----AAGEAHAFLGDIK-----RAILFANRAD-FMVRWVDDQIYGQFLQAGMRFGVSVADEKAGY 392 (400) T ss_pred ccceeEEecccccC-----CCCceEEEEEecc-----ccEEEEeecc-eEEEEecccccceeEEEEEEeccEEecccceE Confidence 55666665432100 0111223322110 0000000000 0112234556666777778887654 44 21 Q ss_pred cccCccccc Q lcl|NC_015279. 441 EGTTVGAGR 449 (467) Q Consensus 441 ~~~~~~~~~ 449 (467) -. +-.++. 
T Consensus 393 ~l-~~~~~a 400 (400) T protein:vir:38 393 FL-TYTPKA 400 (400) T ss_pred EE-EeecCC Confidence 11 111111 No 65 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=61.90 E-value=0.34 Score=23.12 Aligned_cols=338 Identities=9% Similarity=0.018 Sum_probs=115.3 Q ss_pred CcchHHHHHhhhhhhccCccchhc-----------------------chhHHHHHHHHhhhHHHHHHHHhhhhhcchhhh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKIS-----------------------DPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAE 57 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~-----------------------~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e 57 (467) .-..+.+.++...-++.-.. ++. ....+.+-. .+.+-+.+.........+.+-.+ T Consensus 32 ~~e~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 109 (419) T protein:vir:94 32 VAEARGLADALQAESDRAAA-RAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQ-RFADSDGLREYRARDKRGQFQVE 109 (419) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhhhccccccccccccchhh-hhhhHHHHHHHHHhhhhhhhhHH Confidence 11111122222211111000 000 000000000 00000000000000000000000 Q ss_pred h---hhh-hcccccc-ccccccc---c-cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccc Q lcl|NC_015279. 58 Q---PTN-AVGNGGY-TSSGGQT---V-AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEAL 128 (467) Q Consensus 58 ~---~~~-~~g~~~~-~st~tg~---i-~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAl 128 (467) . ..+ ..+.... .++.++. + .-....++.+.+. .++..+++.+.||++++.-+ +| ..+.. 
T Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~---~~~i~~~~~~~~~~~~~~~~--~~--~~~~~----- 177 (419) T protein:vir:94 110 MRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL---PLLVADLLDQQNADYNVLEY--IR--DTSGT----- 177 (419) T ss_pred HHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhh---hhhhhhcceeeeccCCceee--ee--ecccc----- Confidence 0 000 0001001 1111111 1 1112222333332 23567899999998875322 22 10000 Q ss_pred ccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEE Q lcl|NC_015279. 129 FDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSI 208 (467) Q Consensus 129 fnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsI 208 (467) -+ ..+ +.+ .+. .++ | +..+++...++ T Consensus 178 --~~--~~~-------------------------~~~------------~a~------~v~--E-----g~~~~~~~~~~ 203 (419) T protein:vir:94 178 --AG--AGS-------------------------TWN------------KAA------VVP--E-----GTAKPQSTLSF 203 (419) T ss_pred --cc--ccc-------------------------cCc------------ccc------eec--C-----Cccccccccce Confidence 00 000 000 000 001 1 12344444445 Q ss_pred EEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeecc- Q lcl|NC_015279. 209 EKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDID- 287 (467) Q Consensus 209 EK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~- 287 (467) ++++..+|.=+-...+|-||.||.- +.+++|.+-|+..|...+|+.||.- ...+ ...|++-...- T Consensus 204 ~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~aii~G----~G~~-----~p~Gi~~~~~~~ 269 (419) T protein:vir:94 204 DTITTTLKTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLNG----NGST-----EMQGILTTPGIG 269 (419) T ss_pred eeEEeeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cCcc-----cccceecccccc Confidence 5555555555555679999999952 3689999999999999999999831 0011 12233221100 Q ss_pred ----ccchh------HHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCce Q lcl|NC_015279. 
288 ----SNGRW------SVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNT 357 (467) Q Consensus 288 ----~~~r~------~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~ 357 (467) ..+.. ..+....+.+.+ . ...++.+.+||+|.....|... .+ ..+-.-.+..+. ..- T Consensus 270 ~~~~~~~~~~~t~~~~~~~l~~~~~~~--------~-~~~~~~~~~v~n~~~~~~l~~~--k~-~~~~~~~~~~~~-~~~ 336 (419) T protein:vir:94 270 TYQQPKPTAPATDEPPLVDIRRAKTVA--------E-IAGFPPDGVVVHPQDWESIELD--QA-PGSGVFRVIANV-QGE 336 (419) T ss_pred cccccccccccccchhHHHHHHHHHhh--------h-hccCCCCEEEEcHHHHHHHHHH--hh-cCCCceeecCCc-ccC Confidence 00000 011112222222 1 1234567899999988877542 11 000000011111 111 Q ss_pred eEEEecCceEEEecccccccchhhccCCCceEEEE-EecC-C---CccceeEecccchhhcccccCCccccceeeeeeee Q lcl|NC_015279. 358 FAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVG-YKGT-S---PYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRY 432 (467) Q Consensus 358 ~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vG-yKG~-~---~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY 432 (467) ..++|. |++|+++..... .+ +++| ++-. . ..+-.+-..++....+ ..-+=.+=+..|+ T Consensus 337 ~~~~l~-G~pV~~~~~~~~---------~~-~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~------~~~~~~~r~~~r~ 399 (419) T protein:vir:94 337 ATPRIW-GLNVVSTVAIAQ---------GT-ALVGGFRQGATLWSRQGITVLMTDSHADFF------TANTLVILAEFRA 399 (419) T ss_pred CCcccc-ceeeEEcCCCCC---------cc-EEEeeccceEEEEEecceEEEEeccccchh------hcCcEEEEEEEee Confidence 234664 678988876431 12 2222 1100 0 0001111112111111 0122233344555 Q ss_pred ceee-cCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 
433 GMVA-NPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 433 ~l~~-nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) +..+ +|- .|.++.++-.= T Consensus 400 d~~v~~~~-----------------a~~~~~~~aa~ 418 (419) T protein:vir:94 400 NLAVYQPK-----------------AFVRVTFAAAT 418 (419) T ss_pred ccEEeccc-----------------cEEEEEeccCC Confidence 5433 221 11111111111 No 66 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=61.55 E-value=0.35 Score=23.07 Aligned_cols=295 Identities=11% Similarity=0.035 Sum_probs=111.1 Q ss_pred HHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccce Q lcl|NC_015279. 31 VTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTG 110 (467) Q Consensus 31 v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTG 110 (467) +-++.|.| ++++-+ ++ +.... ++.++...--.|..-.|+++...+-.....+-|.||+...| T Consensus 1 ~~~k~~~~---~l~~~~---------~~-----~~~~~-~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~ 62 (321) T protein:vir:31 1 MASRTINN---DLSRIT---------EK-----NALTV-DDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKT 62 (321) T ss_pred CchHHHHH---HHHHHH---------Hh-----ccccc-cccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcce Confidence 33344444 332211 11 01111 11111111112333446666555545566788888888877 Q ss_pred eeeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhh Q lcl|NC_015279. 111 LIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDE 190 (467) Q Consensus 111 LIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~ 190 (467) .|=.+- ++... ... 
...++ T Consensus 63 ~i~~~~--~~~~~-----------------~~~----------------~~e~~-------------------------- 81 (321) T protein:vir:31 63 RIPTLN--IGERH-----------------RRP----------------QDEGE-------------------------- 81 (321) T ss_pred eeeeec--cCCcc-----------------ccc----------------ccccc-------------------------- Confidence 653221 10000 000 00000 Q ss_pred HhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccc Q lcl|NC_015279. 191 AEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQ 270 (467) Q Consensus 191 aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~ 270 (467) .+. ..+...|.++.+...|+.+- ...|-||.+| ..||.|-|+.|.+.++..|.+.+++-++.-= .++.. T Consensus 82 ~~~-~~~~~~~~~~~~~~~k~~~~-------~~it~e~L~d--~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd-~~~~~ 150 (321) T protein:vir:31 82 WNE-NESDVSTGTIDISTEKATVA-------WDLPREVVQE--NPEGEALADRILNLMTDAWSADVEDLAANGD-EDAED 150 (321) T ss_pred ccc-ccccceeeeeeeeeEEEEee-------hhccHHHHHh--hhcchhHHHHHHHHHHHHHHHHHHhheeecc-ccCCC Confidence 000 00112366666666666554 3467788887 3468888888888888888887766655221 11111 Q ss_pred ccccccccceeEEee-------ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccE-EEEchHHHHHHhhhcchhcc Q lcl|NC_015279. 271 GAVSNTATAGVFDLD-------IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNM-ILCSADVASALTMAGVLDYT 342 (467) Q Consensus 271 ~k~~~~~~~gv~Dl~-------~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~-~i~S~~Va~~L~~sG~~~~~ 342 (467) .-. -..+|.+..- +...+.+..+.+..|.+.|... =|..+++ .|++++....+... +... T Consensus 151 ~~~--~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~--------yr~~~~~v~im~~~~~~~~~~~--l~~~ 218 (321) T protein:vir:31 151 SFE--NQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSK--------YRARMNPALIVSEDQLLSYHYT--LTDR 218 (321) T ss_pred ccc--ccchhhhhhhccccccccccccccCHHHHHHHHHhccHh--------HhcCCCeEEEechHHHHHHHHH--HhcC Confidence 000 0012222110 0111223334455555554322 2334565 47888876544221 1111 Q ss_pred cccccccc-cccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc Q lcl|NC_015279. 
343 PALNANLN-VDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT 421 (467) Q Consensus 343 ~~~~~~~~-~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s 421 (467) .. .+. ..-++. ...+| +|++|++.++. |.+.++++ |+++ T Consensus 219 ~~---~~~~~~l~~~-~~~tl-~G~pvv~~~~m----------P~~~il~t-------------------------~~~n 258 (321) T protein:vir:31 219 DT---PLGDNVIMGE-ADVNP-FSFPIIGSGLW----------PDDKAMFT-------------------------DPQN 258 (321) T ss_pred CC---ccccchhhcc-ccccc-cceeEEEcCCC----------CCCcEEEe-------------------------cccc Confidence 00 010 000000 01133 57777777663 33333332 3333 Q ss_pred ccceeeeeeeeceeecC-cc-cccCcccccccccccc----ccceeeeeccC Q lcl|NC_015279. 422 FQPKIGFKTRYGMVANP-FA-EGTTVGAGRLRVNSNR----YYRRVAVKNLM 467 (467) Q Consensus 422 ~qP~~g~~tRY~l~~nP-~~-~~~~~~~~~~~~~~n~----y~r~~~v~~~~ 467 (467) +...+.--+|.-....+ .. +..+.-..-...+.+. |==-++|.|+= T Consensus 259 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~ 310 (321) T protein:vir:31 259 LIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLG 310 (321) T ss_pred EEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCC Confidence 32111111111111110 00 0000000000000000 00112333333 No 67 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=61.32 E-value=0.36 Score=23.05 Aligned_cols=346 Identities=14% Similarity=0.088 Sum_probs=111.1 Q ss_pred CcchHHHHHhhhhhhc-----------------------------cCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLN-----------------------------YEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQ 51 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~-----------------------------~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~ 51 (467) +-+-+.|.++..-+-+ .+..++.+.....+....+..++............ 
T Consensus 42 ~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 121 (435) T protein:vir:80 42 SSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIE 121 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHh Confidence 2222333333332210 00000110000011111111111111000000000 Q ss_pred cchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhc-eeeccCCccceeeeeeeeeecCCCCCccccc Q lcl|NC_015279. 52 GGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDL-AGVQPMSGPTGLIFAMRSKYSTQGGTEALFD 130 (467) Q Consensus 52 ~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI-~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfn 130 (467) .. +.+...+.+ ...++..|.+.--....-.+++...+..+...+ +=+-||+.+. +-+.. .. ++.++ T Consensus 122 ~~-~~~~~~~~~---~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-~~~p~---~~--~~~~a--- 188 (435) T protein:vir:80 122 RG-FGEEVAMSL---NTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-ITIPR---LK--GGAIV--- 188 (435) T ss_pred hh-hhhhhhhhh---cccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCc-eEEEE---Ee--CCcce--- Confidence 00 011111100 000111111110011111122322233344444 1233333321 11110 00 00000 Q ss_pred ccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEE Q lcl|NC_015279. 131 EADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEK 210 (467) Q Consensus 131 Eadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK 210 (467) .| .+ | +..+++...++++ T Consensus 189 ----~~---------------------------------------------------v~--E-----~~~~~~~~~~f~~ 206 (435) T protein:vir:80 189 ----GY---------------------------------------------------IG--A-----DTDIPTTQQQFDD 206 (435) T ss_pred ----ee---------------------------------------------------ec--c-----Cccccccccceee Confidence 00 00 1 1223444445555 Q ss_pred EEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee----- Q lcl|NC_015279. 
211 VTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD----- 285 (467) Q Consensus 211 ~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~----- 285 (467) ++...+.-+-....|-||.+|-.- +.|.|+.|.+-|+..|...+++-||.- ... +-...|++... T Consensus 207 i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l~~a~~~~~d~a~l~G----~G~----~~~p~Gi~~~~~~~~~ 276 (435) T protein:vir:80 207 LKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTAAIGAREDKAFIRD----DGT----ANTPKGLRFWALPGNV 276 (435) T ss_pred EEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHHHHHHHHHHHHHhhcc----CCC----CCcccceeecccccce Confidence 555555555667789999999432 356788888888888888888877732 110 11233433221 Q ss_pred -ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecC Q lcl|NC_015279. 286 -IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQG 364 (467) Q Consensus 286 -~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~ 364 (467) ...++- .+......+.+-...+...........+|++|.....|... ....+ ..+-.+.+ .|+| . T Consensus 277 ~~~~~~~----~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l---kd~~G--~~l~~~~~----~~~l-~ 342 (435) T protein:vir:80 277 ITASDGS----TLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGL---RDGNG--NKVYPELA----NGML-K 342 (435) T ss_pred eeccccc----chhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhh---hccCC--ceeccCCC----CCeE-e Confidence 111110 01111111111111111111123345678999999888552 22111 11111222 2466 4 Q ss_pred ceEEEeccccccc-chh----hcc-CCCceEEEEEecCCCccceeEecccchhhcccccCCccc---cceeeeeeeecee Q lcl|NC_015279. 365 KYRVYIDPYSSNL-TSA----NAA-NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTF---QPKIGFKTRYGMV 435 (467) Q Consensus 365 ~~~vy~D~y~~~~-~~~----~~~-~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~---qP~~g~~tRY~l~ 435 (467) |++||++.+.-.. ... -++ -.+.++++|-.+.-..+ ..+|.-+.......-..| +=.+=..-|++.. 
T Consensus 343 G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~----~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~ 418 (435) T protein:vir:80 343 GYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEID----YSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFG 418 (435) T ss_pred eeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEE----EeccccccccccchhhhhhcCcceeeeeeeeCcE Confidence 5899988774210 000 000 00122335554443322 122211000000000000 1111223333332 Q ss_pred e-cC--cccccCccccc Q lcl|NC_015279. 436 A-NP--FAEGTTVGAGR 449 (467) Q Consensus 436 ~-nP--~~~~~~~~~~~ 449 (467) + +| |+.-+.-+-+. T Consensus 419 ~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 419 PRHVESIAVLSGVAWGA 435 (435) T ss_pred eecccceEEEeccCCCC Confidence 2 12 11111111000 No 68 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=58.85 E-value=0.4 Score=22.74 Aligned_cols=277 Identities=12% Similarity=0.085 Sum_probs=113.8 Q ss_pred ccccccccc--ccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccccccccccccc Q lcl|NC_015279. 69 TSSGGQTVA--GFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLT 146 (467) Q Consensus 69 ~st~tg~i~--~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~ 146 (467) =++.+|.+. ..-.-++.+.| +..+..+++.+.||++...-|. .. . ++.+| .| T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~---~~s~i~~l~~~~~~~~~~~~ip-~~---~--~~~~a-------~~---------- 54 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVA---GKSSIARLSAQKPIPFNGEKVF-TF---T--MDSEI-------DV---------- 54 (298) T ss_pred CcccCcceechhHHHHHHHHHH---hhhhhhhhcceeeccCCceEEE-EE---e--cCcce-------EE---------- Confidence 111222111 11123344444 6668899999999976432221 11 0 00000 00 Q ss_pred ccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccH Q lcl|NC_015279. 
147 NGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSL 226 (467) Q Consensus 147 ~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ 226 (467) + + | +.++++-..++++++..+|.-+-....|- T Consensus 55 -----------------------------------v------~--E-----~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ 86 (298) T protein:vir:16 55 -----------------------------------V------A--E-----SGKKTHGGVTLAPQTMVPIKVEYGARISD 86 (298) T ss_pred -----------------------------------e------c--C-----CccccccccceeEEEEeeeeEEEeehhhH Confidence 0 0 1 12233333444555555555555678999 Q ss_pred HHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccc-ccceeE---Eeecccc-chhH-HHHHHHH Q lcl|NC_015279. 227 ELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNT-ATAGVF---DLDIDSN-GRWS-VEKFKGL 300 (467) Q Consensus 227 ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~-~~~gv~---Dl~~~~~-~r~~-ve~~~~l 300 (467) ||.++--. -..|-+++|.+-|+..|...|+..++.-...- -++..++ ...++. ....... ..+. ......+ T Consensus 87 ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 163 (298) T protein:vir:16 87 EFMYASDE-EKINILQEFNDGFAKKVARGIDLMAFHGVNPR--LGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAIENA 163 (298) T ss_pred HHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccCC--CCcccccccccccccccccccccccccccHHHHHHHH Confidence 99875432 12456778888888888888888887432110 0111100 001111 1111111 1111 1112222 Q ss_pred HHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-cccccCCceeEEEecCceEEEecccccccch Q lcl|NC_015279. 301 LFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDDTGNTFAGVLQGKYRVYIDPYSSNLTS 379 (467) Q Consensus 301 ~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~ 379 (467) +.++. .-.++..-+|++|.....|... .+..+ .. +..+.++. -.|+|. |++|+++.+... 
T Consensus 164 ~~~~~---------~~~~~~~~~vmn~~~~~~l~~l---kd~~G--~~i~~~~~~~~-~~~~l~-G~PV~~~~~v~~--- 224 (298) T protein:vir:16 164 VELLT---------GVDADVTGIAINPSFRSALAKQ---KDLQD--NALFPELKWGA-TPDTIN-GLPVDVNKTVSD--- 224 (298) T ss_pred HHHhh---------hcCCCccEEEEcHHHHHHHHHh---hccCC--CeeecCcccCC-CCceec-ceeeEEeccccc--- Confidence 22221 1234445688899888887542 22111 01 11111111 136774 579998876531 Q ss_pred hhccCCCceEEEE-EecCCCccceeEecccc--hhhcccccCCcc-----cc-ceeee--eeeece-eecCcccccCccc Q lcl|NC_015279. 380 ANAANGNQYYVVG-YKGTSPYDAGLFYCPYV--PLQMVRAVGENT-----FQ-PKIGF--KTRYGM-VANPFAEGTTVGA 447 (467) Q Consensus 380 ~~~~~~~dY~~vG-yKG~~~~d~glfyaPYv--~l~~~~~~Dp~s-----~q-P~~g~--~tRY~l-~~nP~~~~~~~~~ 447 (467) ...++.+.+++| ++ .++.|..-- ++...+..||+. || =.++| ..|++. +.+|=+ - T Consensus 225 -~~~~~~~~~~~GDfs------~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a------~ 291 (298) T protein:vir:16 225 -MSLTQRDRAIIGDFA------NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATK------F 291 (298) T ss_pred -ccCCCccEEEEeecc------ceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccc------e Confidence 112334455544 11 111121111 112222224432 22 11333 446663 344411 0 Q ss_pred ccccccc Q lcl|NC_015279. 448 GRLRVNS 454 (467) Q Consensus 448 ~~~~~~~ 454 (467) .++.+.+ T Consensus 292 ~~l~~at 298 (298) T protein:vir:16 292 ARVTEAN 298 (298) T ss_pred EEEeecC Confidence 1111111 No 69 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=58.16 E-value=0.42 Score=22.66 Aligned_cols=324 Identities=10% Similarity=0.000 Sum_probs=116.2 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhHHH---------------HHHH-----H--hh---hhhcchh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEK---------------FMQE-----Q--VA---FEQGGMI 55 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~---------------~~~e-----~--~~---~~~~~~~ 55 (467) .-.++++.++=..+... 
+ +.+-+++.+-+.+ ...+ . .. ...++.. T Consensus 37 ~~~~~e~~~~~~~~~~~-----~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 106 (395) T protein:vir:43 37 GEMNKETRAKVDELLTA-----Q-----GELQARLSAAEQAMLANEKRDGGEEAPKTAGQMVAESLKEQGVTSSLRGSHR 106 (395) T ss_pred hhhhHHHHHHHHHHHHH-----H-----HHHHHHHHHHHHHHHhhhccccccchhhhHHHHHHHHHHHHHHHHHhhhhhh Confidence 00011111111111110 0 0000111100000 0000 0 00 0000000 Q ss_pred hhhhhhhcccccccccccccccccCc-hhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015279. 56 AEQPTNAVGNGGYTSSGGQTVAGFDP-VLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADT 134 (467) Q Consensus 56 ~e~~~~~~g~~~~~st~tg~i~~~~P-~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt 134 (467) .+.+.... ...+.+.|.+ ..| ..-.++++..+..+..+++.++||.+++.-+. | +...++. + T Consensus 107 ~~~~~~~~---~~~~~~~g~~--vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~--~--~~~~~~~-a------- 169 (395) T protein:vir:43 107 VSMPRSAI---TSIDGSGGAL--VAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYV--R--ETGFVNN-A------- 169 (395) T ss_pred hhhhhhhh---cccCCCCccc--cchhhHHHHHHHHHhhhhHHhhccceecCCCceEEE--E--EecCCCc-e------- Confidence 00000000 0001111111 111 12223444446667889999999987753221 1 1110000 0 Q ss_pred ccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEE Q lcl|NC_015279. 135 AFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVT 214 (467) Q Consensus 135 ~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVt 214 (467) . -+ +| +...++-..++++++.. T Consensus 170 ~---------------------------------------------~v--------~E-----~~~~~~~~~~~~~i~~~ 191 (395) T protein:vir:43 170 A---------------------------------------------PV--------SE-----GTQKPYSDLTFELENAP 191 (395) T ss_pred e---------------------------------------------ee--------cC-----CccccccccceeEEEEe Confidence 0 00 01 11122333344444444 Q ss_pred eecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeec-------- Q lcl|NC_015279. 
215 AKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDI-------- 286 (467) Q Consensus 215 AKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~-------- 286 (467) .|.-+-...+|-||.||.- +.++.|.+-|+..+...+|+.||.- +. .+-...|++.... T Consensus 192 ~~k~~~~~~is~ell~d~~-----~l~~~v~~~la~a~~~~~d~~~l~G----~g----~~~~~~Gi~~~~~~~~~~~~~ 258 (395) T protein:vir:43 192 VRTIAHLFKASRQILDDAS-----ALQSYIDARARYGLMLVEECQLLYG----NG----TGANLHGIIPQAQAYAPPSGV 258 (395) T ss_pred eeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cC----CCCcccccccccccccccccc Confidence 5544555779999999852 3688899999999999999888731 10 1111223322110 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCce Q lcl|NC_015279. 287 DSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKY 366 (467) Q Consensus 287 ~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~ 366 (467) ...+--.++....+++.+ ....+++..+|+||.....|..- .... ...+-.+..+ --.++|. |+ T Consensus 259 ~~~~~~~~~~i~~~~~~~---------~~~~~~~~~~vmn~~~~~~l~~l---kd~~--G~~i~~~~~~-~~~~~l~-G~ 322 (395) T protein:vir:43 259 VVTAEQRIDRIRLAILQA---------QLAEFPASGIVLNPIDWALIELN---KDAE--NRYIIGSPQN-GTTPTLW-RL 322 (395) T ss_pred ccccchhHHHHHHHHHhh---------ccccCCCcEEEEcHHHHHHHHHh---hccC--Cceecccccc-CCCceec-ce Confidence 000100111112222222 22334556789999998877431 2111 1122222111 1245774 58 Q ss_pred EEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cCcccccCc Q lcl|NC_015279. 367 RVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NPFAEGTTV 445 (467) Q Consensus 367 ~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP~~~~~~~ 445 (467) +|+++++..... .......++++++..+.....- .++... 
+-..-+=.+=+..|++..+ +|= T Consensus 323 pVv~~~~~~~~~-~~~gd~~~~~~~~~~~~~~i~~----~~~~~~------~f~~~~~~~r~~~r~d~~v~~~~------ 385 (395) T protein:vir:43 323 PVVETQAITQDE-FLTGAFSLGAQIFDRMDIEVLV----STENDK------DFENNMVTIRAEERLAFAVYRPE------ 385 (395) T ss_pred eeEEcCCCCCCc-EEEEeccceEEEEEecceEEEE----eccccc------hhhcCcEEEEEEEeeccEEeccc------ Confidence 999988753100 0000011222222222111111 111000 0011122233334666544 231 Q ss_pred cccccccccccccceeeeecc Q lcl|NC_015279. 446 GAGRLRVNSNRYYRRVAVKNL 466 (467) Q Consensus 446 ~~~~~~~~~n~y~r~~~v~~~ 466 (467) .|.++.|+-= T Consensus 386 -----------a~~~~~~taa 395 (395) T protein:vir:43 386 -----------AFVTGSLTAS 395 (395) T ss_pred -----------ceEEEEeccC Confidence 1111111111 No 70 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=54.78 E-value=0.5 Score=22.26 Aligned_cols=306 Identities=12% Similarity=-0.003 Sum_probs=106.6 Q ss_pred CCCcccccccc-ccc-ccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCc Q lcl|NC_015279. 123 GGTEALFDEAD-TAF-AGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDN 200 (467) Q Consensus 123 sGtEAlfnEad-t~f-Sg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~ 200 (467) -| ||.-. ... .+...++.. ................+.-..+............+..-.....+... .++.. T Consensus 1 ~g----~~~e~~~~~~~~t~~~~g~--l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv-~Eg~~ 73 (397) T protein:vir:23 1 MG----FSADHSQIAQTKDTMFTGY--LDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWI-GEGDM 73 (397) T ss_pred CC----cCHHHHHHhhccCCCCccc--cchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEe-cCCcc Confidence 00 11100 000 000000000 00000000000000000000000000000000111100001111111 12455 Q ss_pred cceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccce Q lcl|NC_015279. 
201 FNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAG 280 (467) Q Consensus 201 f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~g 280 (467) +++-..+++++++..|..+-.-.+|-||.+|-. .|.|++|.+-|...|...+|+.+|.-.-+ +....+ T Consensus 74 ~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~gt--------~~~~~~ 141 (397) T protein:vir:23 74 KPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDNAALHGTNA--------PSAFQG 141 (397) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhhcccC--------Cccccc Confidence 677777788888888888888899999999863 67899999999999999999999843211 000111 Q ss_pred eEEeecc---ccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhh----cchhccccccccccccc Q lcl|NC_015279. 281 VFDLDID---SNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMA----GVLDYTPALNANLNVDD 353 (467) Q Consensus 281 v~Dl~~~---~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~s----G~~~~~~~~~~~~~~d~ 353 (467) ..+.... ..+-...+....++.++. .--+..+-+|+++.....|... |-.-+.|..... . T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~l~---------~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~----~ 208 (397) T protein:vir:23 142 YLDQSNKTQSISPNAYQGLGVSGLTKLV---------TDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYES----L 208 (397) T ss_pred ccccccceeeecccchhHHHHHHHHhhh---------hcccCCCEEEEcHHHHHHHHHhhccCCceeeccccccc----c Confidence 1111000 001101111112222221 1234556789999999888752 111111111100 0 Q ss_pred CCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc-----c---cce Q lcl|NC_015279. 354 TGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT-----F---QPK 425 (467) Q Consensus 354 t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s-----~---qP~ 425 (467) ......|+| .+++|+++++..........-.+..+++|..+.-..+-+ .+.......|+.. | |=. 
T Consensus 209 ~~~~~~~tl-~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~------~e~~~~~~~~~~~~~~~lf~~d~v~ 281 (397) T protein:vir:23 209 TTPFREGRI-LGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVT------DQATLNLGSQESPNFVSLWQHNLVA 281 (397) T ss_pred cccccCcee-eeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEe------eeeeeeeccccccceeeeeecccee Confidence 111224577 688999988753211000000112233444433221100 0000000001100 0 111 Q ss_pred eeeeeeecee-ecC--ccccc--Ccc--ccccccccccccceeeeeccC Q lcl|NC_015279. 426 IGFKTRYGMV-ANP--FAEGT--TVG--AGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 426 ~g~~tRY~l~-~nP--~~~~~--~~~--~~~~~~~~n~y~r~~~v~~~~ 467 (467) +=...|++.. .+| |.... ... ..-...+....==++.++|-- T Consensus 282 ~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (397) T protein:vir:23 282 VRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLDGASAGNFTLSLDGKT 330 (397) T ss_pred EEEEeeeccceecccceEEEeeccccceeeecccccCcceEEEEecCcc Confidence 1122233321 111 10000 000 000000111111122222211 No 71 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=54.13 E-value=0.51 Score=22.18 Aligned_cols=276 Identities=11% Similarity=0.042 Sum_probs=123.9 Q ss_pred ccccc-----cccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccccccc Q lcl|NC_015279. 66 GGYTS-----SGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQN 140 (467) Q Consensus 66 ~~~~s-----t~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~ 140 (467) .|+.+ +.++...--....-.++++..+..+-.+++-+-||++.+.-+- .. 
++.+| .| T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~-----~~--~~~~a-------~~---- 62 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFT-----FM--SGVGA-------FW---- 62 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEE-----EE--cCCce-------ee---- Confidence 22221 1111111111122345666667778889999999988763221 10 00000 00 Q ss_pred ccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccc Q lcl|NC_015279. 141 EGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRAL 220 (467) Q Consensus 141 a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaL 220 (467) . +| +.++++...++++++...|..+- T Consensus 63 -----------------------------------------------v--~E-----~~~~~~~~~~f~~v~l~~~k~~~ 88 (299) T protein:vir:41 63 -----------------------------------------------V--DE-----AERIQTSKPTFTKAKMRSKKMGV 88 (299) T ss_pred -----------------------------------------------e--ec-----CccccccccceeEEEEeeEEEEE Confidence 0 11 22344455556777777777777 Q ss_pred cccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeec-----cccchhHHH Q lcl|NC_015279. 221 KAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDI-----DSNGRWSVE 295 (467) Q Consensus 221 KAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~-----~~~~r~~ve 295 (467) ...+|-||.+|-. .|.++.|.+.|+..|...+++.||.---+ ++ ..|++.... ...+--..+ T Consensus 89 ~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g~----~~-----~~gil~~~~~~~~~~~~~~~~~~ 155 (299) T protein:vir:41 89 IIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTGVES----PY-----NWNILKSATDASNLVEETANKYD 155 (299) T ss_pred eehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhcccC----cc-----cccccccccccceeeccccccHH Confidence 8889999999754 46788999999999999999888842111 11 111111100 000101112 Q ss_pred HHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEeccccc Q lcl|NC_015279. 
296 KFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSS 375 (467) Q Consensus 296 ~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~ 375 (467) ....++.++.. -.++++.+||+|+....|.. +....+ ..-+..+.++. .++|. |++|++..+.. T Consensus 156 ~l~~~~~~l~~---------~~~~~~~~v~n~~~~~~L~~---lkd~~G-~~l~~~~~~~~--~~~l~-G~PV~~~~~~~ 219 (299) T protein:vir:41 156 DLNEAIGLIEA---------EDLEPNGIATIRKQRVKYRS---TKDGNG-MPIFNTATSNG--VDDVL-GLPIAYTPKYT 219 (299) T ss_pred HHHHHHHhhhc---------ccCCcCEEEEcHHHHHHHHH---hhccCC-ceeecCCcCCC--Cceec-ceeeEEecccC Confidence 33344444422 23456779999999888864 221111 00012222221 35775 58887776642 Q ss_pred ccchh--hccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc-----ccc-eeee--eeeeceee-cC--cccc Q lcl|NC_015279. 376 NLTSA--NAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT-----FQP-KIGF--KTRYGMVA-NP--FAEG 442 (467) Q Consensus 376 ~~~~~--~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s-----~qP-~~g~--~tRY~l~~-nP--~~~~ 442 (467) ....+ ...-.+.++++|..++.+++-. .+.......||+. ||- .++| ..|+|..+ || |+.- T Consensus 220 ~~~~~~~~~~gdfs~~~i~~~~~~~i~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l 293 (299) T protein:vir:41 220 FGDKDISELVGDWNQAYYGILRGVEYEIL------TEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAV 293 (299) T ss_pred CCCCceEEEEEecccEEEEEecCcEEEEe------ecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE Confidence 11000 0000112233444433221110 0000111122221 222 2333 35666544 33 2221 Q ss_pred cCccccccccccc Q lcl|NC_015279. 443 TTVGAGRLRVNSN 455 (467) Q Consensus 443 ~~~~~~~~~~~~n 455 (467) +. 
...| T Consensus 294 ~~-------~aa~ 299 (299) T protein:vir:41 294 QP-------KAGN 299 (299) T ss_pred Ee-------ccCC Confidence 11 1111 No 72 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=51.68 E-value=0.57 Score=21.90 Aligned_cols=348 Identities=14% Similarity=0.192 Sum_probs=158.8 Q ss_pred HHhhhhhhccCccchhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhH Q lcl|NC_015279. 8 QEKWAPLLNYEGLDKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLI 87 (467) Q Consensus 8 ~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~ 87 (467) .|.|..-|...|.-+-+-...|.+ +.=||-||....-+..... |.|.++. .+-+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~k~l-r~~me~~et~~e~~~~~~~---~~~~e~e--------------------l~E~f~ 56 (393) T protein:vir:79 1 MENWLKQLKESGFTETQVQEQKSL-RTRMERGETLAEADANKLA---LNEEETQ--------------------ILESFA 56 (393) T ss_pred CchHHHHHHhccCchhHHHHHHHH-HHHhhhhhhhhhhhhhhhh---cchhHHH--------------------HHHHHH Confidence 789999998888877655444443 3345655542221111100 1222221 011222 Q ss_pred HHHHhhhhhhhceeecc---CCccceeeeeeeeeecCC-CCCc------ccccccccccccccccccccccccccccccC Q lcl|NC_015279. 88 RRSMPNLVAYDLAGVQP---MSGPTGLIFAMRSKYSTQ-GGTE------ALFDEADTAFAGQNEGFDLTNGMSDAAAGLG 157 (467) Q Consensus 88 Rr~~p~LI~~DI~GVQP---mTGPTGLIFAMRsrY~~q-sGtE------AlfnEadt~fSg~~a~~~~~~~~~~~~~~~~ 157 (467) +-.--+.=+.+ ||- ||.|+|.|--=|+--+.- -..| -++.++.-.|. 
+.-- + T Consensus 57 Kmm~G~~p~~e---V~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~G-rsm~----------F---- 118 (393) T protein:vir:79 57 KMMEGETPTNE---VNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSG-QSMI----------F---- 118 (393) T ss_pred HHhcCCCchhh---eehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcC-ccee----------c---- Confidence 20001111222 444 667777666544321110 0111 12222221111 0000 0 Q ss_pred CCcccccccccccccccccccccccccccchhhHhhcCCCCCccce--ee-eEEEEEEEEeecccccccccHHHHHHHHH Q lcl|NC_015279. 158 TTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNE--MA-FSIEKVTVTAKSRALKAEYSLELAQDLKA 234 (467) Q Consensus 158 ~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~E--Ma-FsIEK~tVtAKSRaLKAEYT~ELAQDLkA 234 (467) .+. ...-+++++.| .+|++ |. |+-|++++..+--+++=+||=|+.-| T Consensus 119 --~~~------------g~~Ra~~IgEG-------------gE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsD--- 168 (393) T protein:vir:79 119 --PSI------------GIMRAYDVAEG-------------QEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISD--- 168 (393) T ss_pred --cch------------heeeecccccc-------------ccccccchhhhcCCceeEEechhhhhhhhHHHHhhc--- Confidence 000 01122333333 22333 44 66788888888878887777777665 Q ss_pred hhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccc-----cceeEEeeccccchhHHHHHHHHHHHHHHHHH Q lcl|NC_015279. 
235 IHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTA-----TAGVFDLDIDSNGRWSVEKFKGLLFQIERDAN 309 (467) Q Consensus 235 iHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~-----~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean 309 (467) .|+|-=.-+-...-.-|..----+.++-+-+....- -.+++ -..-.|++.--+|..+.|.+-.+++++.- T Consensus 169 -Sg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtv-fDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~--- 243 (393) T protein:vir:79 169 -SQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTV-FDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMA--- 243 (393) T ss_pred -chHHHHHHHHHHHHHHHHhhhHHHHHhhhhccccee-eeccccCccceeecCCccccccccccHHHHHHHHHHHhc--- Confidence 566533222222222222221122333332222200 00011 11123344444677788888888887743 Q ss_pred HHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceE Q lcl|NC_015279. 310 AIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYY 389 (467) Q Consensus 310 ~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~ 389 (467) --+.+..++--|=+-+..+---.|+. +....+| |=+-++|.--.+ .+-| T Consensus 244 ------~hyt~svi~MHPLAWnv~AKna~me~------------~~~na~g--N~~~~~~~ts~a---------lgp~-- 292 (393) T protein:vir:79 244 ------NEYTPSDLMMHPLAWTVFAKNELMGS------------LQANPYG--NYPAKGAPSSMA---------LGPD-- 292 (393) T ss_pred ------ccCCcceEEEcCchhhhhhhhhhhcc------------eeecccc--ccCccccchhhh---------hchh-- Confidence 35777888877766555543222221 1111122 111111111000 0001 Q ss_pred EEEEecCCCccceeEecccchhhccc------c-------------------c-CCccccceeeeeeeeceeecCccccc Q lcl|NC_015279. 390 VVGYKGTSPYDAGLFYCPYVPLQMVR------A-------------------V-GENTFQPKIGFKTRYGMVANPFAEGT 443 (467) Q Consensus 390 ~vGyKG~~~~d~glfyaPYv~l~~~~------~-------------------~-Dp~s~qP~~g~~tRY~l~~nP~~~~~ 443 (467) -.||.-++-=-+.++|+||++... . + ||.--=-++=++-|||+.+ +-.+. 
T Consensus 293 --~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gv--Ln~gk 368 (393) T protein:vir:79 293 --SIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGI--LNEGK 368 (393) T ss_pred --hhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceee--eeCCc Confidence 113333333345556666655111 0 1 4444445677899999955 44556 Q ss_pred CccccccccccccccceeeeeccC Q lcl|NC_015279. 444 TVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 444 ~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) .-.-++.|.=.+.||--.++||+= T Consensus 369 aiavakNI~~~k~y~~P~~~~~~~ 392 (393) T protein:vir:79 369 AIAVAKNISMDKSYAEPMLIKNVG 392 (393) T ss_pred eEEEEecceeecccccchhhhccC Confidence 666678888999999999999998 No 73 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=50.13 E-value=0.62 Score=21.73 Aligned_cols=331 Identities=15% Similarity=0.089 Sum_probs=108.7 Q ss_pred Ccch---HHHHHhhh---hhhccCccchhcchhHHHHHH---H--HhhhHHHHHHHHh-hhhh-cchhhhhhhhhc---- Q lcl|NC_015279. 1 MFQS---EQLQEKWA---PLLNYEGLDKISDPHRRAVTA---V--LLENQEKFMQEQV-AFEQ-GGMIAEQPTNAV---- 63 (467) Q Consensus 1 ~~~~---~~l~~kw~---p~l~~~~~~~i~~~~~~~v~~---~--~~enq~~~~~e~~-~~~~-~~~~~e~~~~~~---- 63 (467) |.-. |+..++|+ -|++....-+..+..++++-. . -|+.|-+...|.. .... ...+.+.+.... T Consensus 4 ~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (390) T protein:vir:62 4 TTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQR 83 (390) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Confidence 1100 11111122 122211110111111111111 0 0111100000000 0000 000000000000 Q ss_pred ------------ccc-------------ccccccccccc---ccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeee Q lcl|NC_015279. 
64 ------------GNG-------------GYTSSGGQTVA---GFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAM 115 (467) Q Consensus 64 ------------g~~-------------~~~st~tg~i~---~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAM 115 (467) |.. ...++.++.+. .....++.+.|.. .+...++-|-||++...+-+.. T Consensus 84 ~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~---~~l~~~~~~~~~~~~~~~~~p~ 160 (390) T protein:vir:62 84 SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERS---AIMRGGATTFTTSDANPLDFTV 160 (390) T ss_pred hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhh---hhhhhcceeeecCCCceeEEEE Confidence 000 00000011000 0011111222211 1233444444444333222211 Q ss_pred eeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcC Q lcl|NC_015279. 116 RSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLG 195 (467) Q Consensus 116 RsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LG 195 (467) .. + +. . +...++ T Consensus 161 ~~---~--~~----------------------------------------------------~------a~wv~E----- 172 (390) T protein:vir:62 161 IT---G--RS----------------------------------------------------S------ASIVGE----- 172 (390) T ss_pred Ec---C--Cc----------------------------------------------------c------eeeecc----- Confidence 10 0 00 0 000111 Q ss_pred CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 
196 TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 196 s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) +..+++-.-++++++..+|.-+-....|-||.+|- .+|.+++|.+-|+..|..-+|..||.- -| T Consensus 173 --~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~l~G------~G---- 236 (390) T protein:vir:62 173 --TAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIGDAMGRHFITG------TG---- 236 (390) T ss_pred --cccccccccceeeeEeeeeeEEeehHHHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHhhhhcc------CC---- Confidence 22334444445666666666667778999999993 367899999999999999999998831 01 Q ss_pred cccceeEEeecccc--------chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccc Q lcl|NC_015279. 276 TATAGVFDLDIDSN--------GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNA 347 (467) Q Consensus 276 ~~~~gv~Dl~~~~~--------~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~ 347 (467) ...|++....... ..-.......+...+. .--+..+ ..|+++.....|.. |....+ . T Consensus 237 -~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~--------~~~~~~a-~~vmn~~~~~~L~~---lkd~~g--~ 301 (390) T protein:vir:62 237 -QPRGILTDASPATATFLATDTDSKVSDALIDLFHEVP--------SAYRANA-KYVVNDLRAAQMRK---LKDANG--Q 301 (390) T ss_pred -ccccccccccccccceecccccccchHHHHHHHHhhh--------hhhhcCC-EEEEchHHHHHHHH---hhccCC--C Confidence 1223332211000 0000011112222221 1122333 35778887777743 221111 0 Q ss_pred c-cccccCCceeEEEecCceEEEecccccccchhhcc-CCCceEEEEEecCCCccceeEecccchhhcccccCCccccce Q lcl|NC_015279. 348 N-LNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAA-NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPK 425 (467) Q Consensus 348 ~-~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~-~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~ 425 (467) . +..+.+.. .-++| .|++|+++.+.-. +.++ -.+.+++++..++...+.+. -+|. .+-|=. 
T Consensus 302 ~l~~~~~~~g-~~~~l-~G~Pv~~~~~~p~---~~i~~gd~s~~~i~~~~~~~v~~~~--~~~~----------~~~~~~ 364 (390) T protein:vir:62 302 YLWQSGLTVG-APSLF-NGKVVETDDGMPA---DKILFADLSKYRVRFAGSLRVDRSV--DAKF----------STDQIV 364 (390) T ss_pred eeecCCcCCC-cccee-cccceEEecCCCC---ccEEEeeccceeEEeecceEEEeec--cccc----------cCCcEE Confidence 1 11111111 12366 4578888876531 1111 01122334444333322211 0110 111122 Q ss_pred eeeeeeecee-ecCcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 426 IGFKTRYGMV-ANPFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 426 ~g~~tRY~l~-~nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) +=+..|++.. .||=+ |+.+.|+.== T Consensus 365 ~~~~~r~d~~~~~~~A-----------------~~~l~~~~~a 390 (390) T protein:vir:62 365 YRFLQRADGLLVDARG-----------------AKVLTVTPGA 390 (390) T ss_pred EEEEEEeCcEeechhh-----------------eEEEEeecCC Confidence 2233455432 23311 1111111111 No 74 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=49.96 E-value=0.62 Score=21.71 Aligned_cols=264 Identities=11% Similarity=0.023 Sum_probs=109.9 Q ss_pred CC-ccceeeeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccc Q lcl|NC_015279. 105 MS-GPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVG 183 (467) Q Consensus 105 mT-GPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~ 183 (467) |. +.|-|- .-+.+|--..+=-...... ....+...... 
.+ ........+++ T Consensus 1 Ma~~~T~l~-------------d~i~Pev~~~~v~~~~~~~------~~~~~~~~~~~------~l---~g~~G~ti~iP 52 (276) T protein:vir:10 1 MAQGTTTKS-------------TQIVPEVLAPMMQAELDKK------LRFAQFADIDS------TL---VGQPGDTLTFP 52 (276) T ss_pred CCcceeehh-------------hhhchHHHHHHHHHHHHhh------hhhcccceecc------cc---cCCCCCEEEee Confidence 11 011000 0011111000000000000 00000000000 00 00001111111 Q ss_pred cccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHH-hhCCChhHHHHHHHHHHHHHHhcHHHHH Q lcl|NC_015279. 184 QGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKA-IHGLNAEAELANILSSEILAEINREVIR 262 (467) Q Consensus 184 ~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkA-iHGLDAEtELaNILStEImlEINREII~ 262 (467) .--...++|.+.. +.++..=..+..+.+++.+-|.-.=++| |+-+ .-+.|.-.+..+-++.-|...++.+++. T Consensus 53 ~~~~igda~~~~e-g~~i~~~~lt~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~ 126 (276) T protein:vir:10 53 AFVYSGDATVVPE-GQKIPVDKIETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLE 126 (276) T ss_pred eecCCCccccccC-CCccCccccccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHH Confidence 1111123333332 2233333334455555555554333333 3333 2357999999999999999999999998 Q ss_pred HHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcc Q lcl|NC_015279. 263 TIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYT 342 (467) Q Consensus 263 ~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~ 342 (467) .+....... +.+.+++ +.+-..+.++..| -...++++++|.+.+.|.-....++. 
T Consensus 127 ~l~~~~~~~------~~~~~t~----------d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~ 181 (276) T protein:vir:10 127 ALRGTKLTV------SADIGTL----------AGLEAAIDTFDDE---------DLEPMVLFINPKDAGKLRSSASDNFT 181 (276) T ss_pred HHhcccccc------cccccCH----------HHHHHHHHHhccc---------cCcccEEEEcHHHHHHHHHhcccccc Confidence 876643321 1112211 1222232333222 24678999999999998543222222 Q ss_pred cccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEe-cCCCccceeEecccchhhcccc-cCCc Q lcl|NC_015279. 343 PALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYK-GTSPYDAGLFYCPYVPLQMVRA-VGEN 420 (467) Q Consensus 343 ~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyK-G~~~~d~glfyaPYv~l~~~~~-~Dp~ 420 (467) +......+ ...+-..|++ .|++|++|... | +|-.+-++ |.-. ++... +.. +.. =|++ T Consensus 182 ~~s~~g~~--~~~~G~ig~~-~G~~Vi~s~~~----------p-~~t~~l~~~gAi~----~~~~~--~~~-vE~dRd~~ 240 (276) T protein:vir:10 182 RATELGDN--IIVKGAFGEA-LGAVIVRSKKL----------D-EGEAILAKRGAVK----LITKR--DFF-LETDRDPS 240 (276) T ss_pred cccccccc--ceecccccee-cceeEEEcCCC----------C-cceEEEEecccee----eeecC--Cce-eecccchh Confidence 22111111 1222246777 57899999764 2 23222222 2211 11111 111 111 1888 Q ss_pred cccceeeeeeeeceee-cCc-------ccccCcccc Q lcl|NC_015279. 421 TFQPKIGFKTRYGMVA-NPF-------AEGTTVGAG 448 (467) Q Consensus 421 s~qP~~g~~tRY~l~~-nP~-------~~~~~~~~~ 448 (467) .++-.+--.-+||... 
||= +.++....+ T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~~~ 276 (276) T protein:vir:10 241 TKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDSGA 276 (276) T ss_pred hcccEEEEeeEEEEEEEcCcceEEEecCCcCCcCCC Confidence 8888888888888643 441 112222211 No 75 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=49.66 E-value=0.63 Score=21.68 Aligned_cols=330 Identities=12% Similarity=0.062 Sum_probs=124.3 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhh---------h-HHHHHHHHhhhhhcchhh--hh--hhhhcccc Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLE---------N-QEKFMQEQVAFEQGGMIA--EQ--PTNAVGNG 66 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~e---------n-q~~~~~e~~~~~~~~~~~--e~--~~~~~g~~ 66 (467) +-..+++..++..+.+... ++....+......... + .++-..++....+..++- +. .......+ T Consensus 39 ~e~i~e~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 116 (408) T protein:vir:74 39 AEAMSELKNKRDNEKVRRD--ALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTE 116 (408) T ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhh Confidence 2233455666654433211 1111000000000000 0 000000000000000000 00 00000111 Q ss_pred ccccccccccc---ccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccccccccccc Q lcl|NC_015279. 67 GYTSSGGQTVA---GFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGF 143 (467) Q Consensus 67 ~~~st~tg~i~---~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~ 143 (467) ...++..|.+. .+.+.++.+.| +.....++++++||++.+|-+--.+ ..+.. ..+ . 
T Consensus 117 ~~~~~~~gg~~vP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~-~~~-------~-------- 175 (408) T protein:vir:74 117 TSGSDSAAGLTIPQDIRTMINTLVR---QYDSLQQYVRVESVSTSSGSRVYEK--WTDVT-PLK-------A-------- 175 (408) T ss_pred cccccCCCceeechhHhhHHHHHHh---hhcchhhhcceeeccCCcceEEEEe--ecCCc-ccc-------c-------- Confidence 11122222221 23344455555 5556789999999999887653333 11100 000 0 Q ss_pred cccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CCCccceeeeEEEEEEEEeecccccc Q lcl|NC_015279. 144 DLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SGDNFNEMAFSIEKVTVTAKSRALKA 222 (467) Q Consensus 144 ~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g~~f~EMaFsIEK~tVtAKSRaLKA 222 (467) ..++.+.... +...|.++.|+..|..+ .. T Consensus 176 -------------------------------------------~v~E~~~~~~~~~~~~~~i~~~~~k~~~-------~~ 205 (408) T protein:vir:74 176 -------------------------------------------MDEEDGKIPDLDNPRLTIIKYLIKRYAG-------II 205 (408) T ss_pred -------------------------------------------ccccccccccccccceeeEEeeeeeEEe-------ee Confidence 0000000000 11235555555555554 45 Q ss_pred cccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhHHHHHHHHHH Q lcl|NC_015279. 223 EYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLF 302 (467) Q Consensus 223 EYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~ 302 (467) .+|-||.+|- .+|.++.|.+-|+..|..-+|+.||.- .-.+....+..++++ ...+++ T Consensus 206 ~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~~il~G--------~G~~~~~~~~~~~~~----------i~~~~~ 263 (408) T protein:vir:74 206 TATNTLLKDT----AENILAWLSSWIAKKVVVTRNQAIIAA--------MGTVPKKPTIANFDD----------VITMIN 263 (408) T ss_pred hhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhc--------ccccccccccccHHH----------HHHHHH Confidence 6999999983 357899999999999999999888732 111222233332210 111111 Q ss_pred HHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc-ccccCCceeEEEecCceEEEecccccccchh- Q lcl|NC_015279. 
303 QIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGNTFAGVLQGKYRVYIDPYSSNLTSA- 380 (467) Q Consensus 303 ~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~-~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~- 380 (467) ......-+.. -.+||+|.....|.. +....+ ..+ ..+.++ -..++| .|++||+-.+..-+... T Consensus 264 -------~~l~~~~~~~-a~~v~n~~~~~~l~~---lkd~~G--~~l~~~~~~~-~~~~~l-~G~pV~~~~~~~~~~~~~ 328 (408) T protein:vir:74 264 -------TSVDPAIIAT-SSLLTNQSGLNKLAL---VKTAEG--KYLLEPDPTK-PNSYLI-KGKQVIVVADRWLPNSGS 328 (408) T ss_pred -------HhhhhhhcCC-CEEEEcHHHHHHHHH---hhcCCC--ceEeccCcCC-CCCcee-cceeeEEecCcccccccC Confidence 1111111222 357889999888864 222111 111 111122 123567 56777653221100000 Q ss_pred ---hcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cCcccccCcccccccccc Q lcl|NC_015279. 381 ---NAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NPFAEGTTVGAGRLRVNS 454 (467) Q Consensus 381 ---~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~~~~~ 454 (467) .++ ...+|++++-++... +=..||.-. +-...+-.+-+..||+..+ +|- T Consensus 329 ~~~~i~~gd~~~~~~~~~~~~~~----i~~~~~~~~------~f~~~~~~~r~~~r~d~~~~~~~--------------- 383 (408) T protein:vir:74 329 TVYPLYYGDMSQAITLFDRENMS----LLPTNIGAG------AFETDTTKIRVIDRFDVKATDSE--------------- 383 (408) T ss_pred CcceEEEEehhccEEEEEecceE----EEEeccccc------hhhcceeeEEEEEeeCcEEeccc--------------- Confidence 000 011233333322222 222333211 1123444445555555432 221 Q ss_pred ccccceeeeeccC Q lcl|NC_015279. 
455 NRYYRRVAVKNLM 467 (467) Q Consensus 455 n~y~r~~~v~~~~ 467 (467) .|+.+.++.+= T Consensus 384 --a~~~~~~~~~~ 394 (408) T protein:vir:74 384 --ALVAGSFTAIA 394 (408) T ss_pred --ceEEEEeeccc Confidence 01111111111 No 76 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=49.21 E-value=0.65 Score=21.62 Aligned_cols=295 Identities=10% Similarity=0.009 Sum_probs=116.2 Q ss_pred hhhhcchhhhhhhh-hcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCc Q lcl|NC_015279. 48 AFEQGGMIAEQPTN-AVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTE 126 (467) Q Consensus 48 ~~~~~~~~~e~~~~-~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtE 126 (467) ......|-.|.... ..++ +..+..- -....-.+++...+..+..+++.+-||++++.-|. +..+ +.+ T Consensus 1 ~~~~~~~~~e~~~~~~~~~-----~~~~~~i-p~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~~--~~~ 68 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGD-----TMFKGYL-EPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIP----HWVG--DVS 68 (318) T ss_pred CCCCCCCCHHHHHhhcccC-----cccceee-chhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEeC--Ccc Confidence 11111222222111 0111 1111110 01121223344445667788899999987653321 1100 000 Q ss_pred ccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeee Q lcl|NC_015279. 127 ALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAF 206 (467) Q Consensus 127 AlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaF 206 (467) | .| .++ +.++++... 
T Consensus 69 a-------~~---------------------------------------------------v~E-------g~~~~~~~~ 83 (318) T protein:vir:24 69 A-------QW---------------------------------------------------IGE-------GDMKPITKG 83 (318) T ss_pred e-------EE---------------------------------------------------ecC-------Ccccccccc Confidence 0 00 011 122333444 Q ss_pred EEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee- Q lcl|NC_015279. 207 SIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD- 285 (467) Q Consensus 207 sIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~- 285 (467) ++++++.+.|..+-...+|-||.+|-. .|.+++|.+.|+..|...|++.+|.---+ ++ ..|++... T Consensus 84 ~f~~i~~~~~k~~~~~~iS~e~l~ds~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~----~~-----~~~~~~~~~ 150 (318) T protein:vir:24 84 NMTSQTIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDGAAMHGTDS----PF-----PTYIGQTTK 150 (318) T ss_pred ceeEEEEeeEEEEEeehhhHHHhhcCh----HHHHHHHHHHHHHHHHHHHHHhhhcccCC----CC-----Ccccccccc Confidence 455555666665666789999999844 57999999999999999999998732211 11 11111111 Q ss_pred ------ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCc--- Q lcl|NC_015279. 286 ------IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGN--- 356 (467) Q Consensus 286 ------~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~--- 356 (467) .....-|. .....++.... .........+||||.....|... +...+- ..+..+.++. T Consensus 151 ~~~~~~~~~~~~~~--------~~~~~~~~~~~-~~~~~~~~~~v~n~~~~~~L~~l---kd~~G~-~l~~~~~~~~~~~ 217 (318) T protein:vir:24 151 AISIADTTGATTVY--------DQVAVNGLSLL-VNDGKKWTHTLLDDITEPILNGA---KDQNGR-PLFIESTYGEAAS 217 (318) T ss_pred cccccccccccchH--------HHHHHHHHHhh-ccccCCCCEEEEcHHHHHHHHHh---hccCCc-eeecCccccCccc Confidence 00111111 11111111111 22234446789999999988642 221110 0011111111 Q ss_pred ee-EEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc-----c---cceee Q lcl|NC_015279. 
357 TF-AGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT-----F---QPKIG 427 (467) Q Consensus 357 ~~-~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s-----~---qP~~g 427 (467) .+ -+.+ .+++|++.+...........-.+.++++|..+.-+++-+ .+.......|+.. | |=.+= T Consensus 218 ~~~~~~i-~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~------~~~~~~~~~~~~~~~~~~f~~~~~~~r 290 (318) T protein:vir:24 218 PFRSGRI-VARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVT------DQATLNLGTVESPNFVSLWQHNLVAVR 290 (318) T ss_pred cccCceE-EEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEe------eccceeccccccccchhhhhcCcEEEE Confidence 11 1233 346677665542110000000112233444332221100 0000111111111 2 22333 Q ss_pred eeeeecee-ecC--cccccCccccccccc Q lcl|NC_015279. 428 FKTRYGMV-ANP--FAEGTTVGAGRLRVN 453 (467) Q Consensus 428 ~~tRY~l~-~nP--~~~~~~~~~~~~~~~ 453 (467) ...|++.. .+| |+.-+. .-+....| T Consensus 291 ~~~r~d~~v~~~~a~~~i~~-~~a~~~~~ 318 (318) T protein:vir:24 291 VEAEYAFHCNDAEAFVALTN-VVSGGGEG 318 (318) T ss_pred EEEEEccEEecccceEEEEe-eccCCCCC Confidence 45566655 344 211111 00000111 No 77 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=48.44 E-value=0.67 Score=21.54 Aligned_cols=309 Identities=11% Similarity=0.033 Sum_probs=120.4 Q ss_pred HhhhHHHHHHHHhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeee Q lcl|NC_015279. 35 LLENQEKFMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFA 114 (467) Q Consensus 35 ~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFA 114 (467) +.=|-.|. .++-. ..|...-+.+. +..|.. --.+.+=.+++...+..+-..++-+-||++++.-+. 
T Consensus 1 ~~~~~~r~------~~~~~-~~e~~a~~~~~-----~~~g~~-ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p- 66 (326) T protein:vir:42 1 MAVNPDRT------TPFLG-VNDPKVAQTGD-----SMFEGY-LEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIP- 66 (326) T ss_pred CCCCccch------hhhcC-cchhhheeccc-----cCCcce-echhhHHHHHHHHHhcchhhhhcceeeccCCceEEE- Confidence 22232111 11100 01111100010 111111 112333334444445556777888889887653221 Q ss_pred eeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhc Q lcl|NC_015279. 115 MRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDL 194 (467) Q Consensus 115 MRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~L 194 (467) +.. ++.++ .| + +| T Consensus 67 ---~~~--~~~~a-------~~---------------------------------------------v--------~E-- 79 (326) T protein:vir:42 67 ---HWT--GDVSA-------SW---------------------------------------------I--------GE-- 79 (326) T ss_pred ---EEe--CCcce-------EE---------------------------------------------e--------cC-- Confidence 010 00000 00 0 01 Q ss_pred CCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccc Q lcl|NC_015279. 195 GTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVS 274 (467) Q Consensus 195 Gs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~ 274 (467) +..++|-..+++++++.+|..+-.-.+|-||.+|-. .|.++.|.+-|+..|...+++.+|.---+-...+-.+ T Consensus 80 ---g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~----~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~ 152 (326) T protein:vir:42 80 ---GDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQ 152 (326) T ss_pred ---CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccc Confidence 233444555667777777777777889999999843 5789999999999999999999884211000000000 Q ss_pred ccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccc-- Q lcl|NC_015279. 
275 NTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVD-- 352 (467) Q Consensus 275 ~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d-- 352 (467) .....+..... ..+-+......... +..... ........++.+|++|.....|.. |....+- .-+..+ T Consensus 153 ~~~~~~~~~~~--~~~~~~~~~~~~~~--~~~~~~--~~~~~~~~~a~~v~n~~~~~~L~~---lkd~~G~-~l~~~~~~ 222 (326) T protein:vir:42 153 TTKEVSLVDPD--GTGSNADLTVYDAV--AVNALS--LLVNAGKKWTHTLLDDITEPILNG---AKDKSGR-PLFIESTY 222 (326) T ss_pred cccccceeecc--cccccccchhHHHH--HHHHHh--hhhhhccCccEEEEeHHHHHHHHH---hhccCCc-eeeccccc Confidence 00000111000 00000000000010 001111 112234456778899999888864 2221110 000001 Q ss_pred --cCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc-----cc-- Q lcl|NC_015279. 353 --DTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT-----FQ-- 423 (467) Q Consensus 353 --~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s-----~q-- 423 (467) .......|+| .+++|+++++..........-.+.++++|..+..+++-+ .+.......|+.. || T Consensus 223 ~~~~~~~~~~~l-~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~------~e~~~~~~~~~~~~~~~~~~~d 295 (326) T protein:vir:42 223 TEENSPFRLGRI-VARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVT------DQATLNLGTPQAPNFVSLWQHN 295 (326) T ss_pred cCccccccCcee-eeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEe------ecceeeecccccccchhhhhcC Confidence 1112234455 578999988753100000000111122222222211100 0000001111111 22 Q ss_pred -ceeeeeeeeceee-cC--cccc--cCcccc Q lcl|NC_015279. 
424 -PKIGFKTRYGMVA-NP--FAEG--TTVGAG 448 (467) Q Consensus 424 -P~~g~~tRY~l~~-nP--~~~~--~~~~~~ 448 (467) =.+=...|++..+ +| |+.- ...+++ T Consensus 296 ~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 296 LVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred cEEEEEEEEeccEEecccceEEEeeccccCC Confidence 2333455666543 33 2111 111111 No 78 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=47.53 E-value=0.7 Score=21.44 Aligned_cols=287 Identities=11% Similarity=0.050 Sum_probs=113.8 Q ss_pred hhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015279. 55 IAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADT 134 (467) Q Consensus 55 ~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt 134 (467) +++. +++.+...--....-.++++..+..+..+++-+.||++++--|--.. .+.+| T Consensus 1 ma~~-----------t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~------~~~~a------- 56 (305) T protein:vir:25 1 MADI-----------SRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA------TLPEA------- 56 (305) T ss_pred CCCc-----------cCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEe------CCcce------- Confidence 1111 11111111111222334555556667788899999987753221111 11111 Q ss_pred ccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEE Q lcl|NC_015279. 135 AFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVT 214 (467) Q Consensus 135 ~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVt 214 (467) .|-+ +++... ...++.-..++++++.. 
T Consensus 57 ~wv~---------------------------------------------------E~~~~~--~~~~~~s~~~f~~i~~~ 83 (305) T protein:vir:25 57 DWVG---------------------------------------------------ESATDP--KGVKPTSKVTWANRTLV 83 (305) T ss_pred EEee---------------------------------------------------cccccc--cccccccccceeeEEee Confidence 1100 000000 00111112233344444 Q ss_pred eecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee-------cc Q lcl|NC_015279. 215 AKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD-------ID 287 (467) Q Consensus 215 AKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~-------~~ 287 (467) ++..+-...+|-||.+|-. .|.|++|.+-|+..|...+++.+|.-- |+..+....++.... .. T Consensus 84 ~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~~~a~~~d~a~~~G~------g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) T protein:vir:25 84 AEEIAVIIPVHENVIDDAT----VAVLTEVAELGGQAIGKKLDQAVIFGT------DKPASWVSPALIPAAVTAGQAVEV 153 (305) T ss_pred eEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHhhhheecc------CCCCCccccccccccccccccccc Confidence 4444445679999999843 578999999999999999999998321 111111111111000 00 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCce--eEEEecCc Q lcl|NC_015279. 288 SNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNT--FAGVLQGK 365 (467) Q Consensus 288 ~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~--~~G~l~~~ 365 (467) ..+-. ..-.++.-+. .+.....+ -.+..+-++++|.-...|... . |.+|.. --++| .+ T Consensus 154 ~~~~~---~~~~~~~~~~-~~~~~~~~-~~~~~~~~v~~~~~~~~l~~l---k-----------d~~G~~i~~~~~l-~G 213 (305) T protein:vir:25 154 VGGVA---NESDIVGATN-RAAKAVAS-AGWAPDTLLSSLALRYEVANI---R-----------DANGNPVFRDDSF-AG 213 (305) T ss_pred cccch---hhhHHHHHHH-HHHHhhhh-cccccceeEecHHHHHHHHHh---h-----------ccCCceeecCCcc-cc Confidence 00000 0001111111 11111111 123444578888877777431 2 112221 12466 56 Q ss_pred eEEEecccccccchhh--ccCCCceEEEEEecCCCccceeEecccchhhcccccCCcc-cc-ceee--eeeeece-eecC Q lcl|NC_015279. 
366 YRVYIDPYSSNLTSAN--AANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENT-FQ-PKIG--FKTRYGM-VANP 438 (467) Q Consensus 366 ~~vy~D~y~~~~~~~~--~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s-~q-P~~g--~~tRY~l-~~nP 438 (467) ++|++..+......+. ..-.+.++++|..+.-+.+- ..+.-+.+ .-.|.+ || ..++ ...|||+ +.|| T Consensus 214 ~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~~~~~R~~~r~~~~v~~p 287 (305) T protein:vir:25 214 FRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKF----LDQATLGT--GENQINLAERDMVALRLKARFAYVLGVS 287 (305) T ss_pred cceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEE----eeeeeeec--CCceeeeeecCcEEEEEEEeecceeeCc Confidence 7888876642110000 00011222334443322211 11110000 001111 22 1223 4568995 5688 Q ss_pred cccc-cCcccc-cccccc Q lcl|NC_015279. 439 FAEG-TTVGAG-RLRVNS 454 (467) Q Consensus 439 ~~~~-~~~~~~-~~~~~~ 454 (467) -+-. .+..+. .+...+ T Consensus 288 ~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 288 ATAQGANKTPVAVVAPAA 305 (305) T ss_pred ccEEEEccccccccCCCC Confidence 4321 111111 111222 No 79 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=46.88 E-value=0.72 Score=21.37 Aligned_cols=336 Identities=10% Similarity=-0.011 Sum_probs=120.8 Q ss_pred CcchH-------------------------HHHHhhhhhhccC-c-------cchhcchhHHHHHHH-HhhhHHHHHHHH Q lcl|NC_015279. 1 MFQSE-------------------------QLQEKWAPLLNYE-G-------LDKISDPHRRAVTAV-LLENQEKFMQEQ 46 (467) Q Consensus 1 ~~~~~-------------------------~l~~kw~p~l~~~-~-------~~~i~~~~~~~v~~~-~~enq~~~~~e~ 46 (467) .-+.+ +|..+...+-+.. . .+........+.... 
.-++.+ ++.- T Consensus 35 ~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 112 (418) T protein:vir:10 35 GDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEE--MKGM 112 (418) T ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHH--HHHH Confidence 11111 1222222111100 0 000000000000000 000100 0000 Q ss_pred hhhhhcchhhhhhh----hhcccccccccccccccccCchhh-hhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC Q lcl|NC_015279. 47 VAFEQGGMIAEQPT----NAVGNGGYTSSGGQTVAGFDPVLI-SLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST 121 (467) Q Consensus 47 ~~~~~~~~~~e~~~----~~~g~~~~~st~tg~i~~~~P~Lv-~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~ 121 (467) ............+. ......+...+.+|. ..-|.+. .+++...+..+..+++.+-||++++.-+ .| ..+ T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~--~~--~~~ 186 (418) T protein:vir:10 113 DGSARKSVRVRVDRKSIMNVPATVGSGVSGSNS--LVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEY--TV--ETG 186 (418) T ss_pred HHHHhhhhhhhhHHHHHHHhhhhccCCCCCCcc--ccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeE--EE--Eec Confidence 00000000000000 000111111111111 1112211 3344444666788899999998875321 11 100 Q ss_pred CCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCcc Q lcl|NC_015279. 122 QGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNF 201 (467) Q Consensus 122 qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f 201 (467) .++ . ..| ++ | +... T Consensus 187 ~~~-~-------a~~---------------------------------------------------v~--E-----~~~~ 200 (418) T protein:vir:10 187 FTN-N-------AAA---------------------------------------------------VA--E-----GAQK 200 (418) T ss_pred CCC-c-------eee---------------------------------------------------ec--c-----Cccc Confidence 000 0 000 00 1 1112 Q ss_pred ceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccccee Q lcl|NC_015279. 
202 NEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGV 281 (467) Q Consensus 202 ~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv 281 (467) ++-..++++++..+|.-+-...+|-||.||.- |.++.|.+-|+..|..-+|+-||.- .-.+....|+ T Consensus 201 ~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~l~~a~~~~~d~a~l~G--------~g~~~~p~Gi 267 (418) T protein:vir:10 201 PTSDLKFNLKNQPVRTIAHLFKASRQILDDAP-----ALQSYIDGRARYGLQLTEEGQILKG--------DGTGANILGI 267 (418) T ss_pred cccccceeeEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhcc--------CCCCcccccc Confidence 22233455555566655566789999999852 4678888888888888888777621 1111112233 Q ss_pred EEee------ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCC Q lcl|NC_015279. 282 FDLD------IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTG 355 (467) Q Consensus 282 ~Dl~------~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~ 355 (467) +... ....+--.++....+++++ ....+..+-+||+|.....|..- ....+ ..+-.+.++ T Consensus 268 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~~---------~~~~~~~~~~v~n~~~~~~L~~l---kd~~G--~~i~~~~~~ 333 (418) T protein:vir:10 268 LPQASAFMPSITLANATPIDKIRLALLQA---------VLAEFPATGIVLNPIDWASIELT---KDSQG--RYIVGNPVN 333 (418) T ss_pred ccccccccccccccccccHHHHHHHHHhh---------ccccCCCCEEEEcHHHHHHHHHh---hcCCC--ceecccccc Confidence 2211 1111111122233333333 12345566799999998887542 21111 111112111 Q ss_pred ceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeecee Q lcl|NC_015279. 356 NTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMV 435 (467) Q Consensus 356 ~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~ 435 (467) . ..|+| .|++|+++.+.... ........++++++.++.... =..||.-..+ ...+=.+=+..|++.. 
T Consensus 334 ~-~~~~l-~G~pV~~~~~~p~~-~~~~gd~s~~~~~~~~~~~~i----~~~~~~~~~f------~~~~~~~r~~~~~d~~ 400 (418) T protein:vir:10 334 G-TTPRL-WNLPVVETQAMTAN-EFLVGAFSMAAQIFDRMEIEV----LLSTENVDDF------EKNMVSIRAEERLALA 400 (418) T ss_pred C-CCcee-cceeeEEcCCCCCC-cEEEeeccceEEEEEecceEE----EEecccchhh------hcCceEEEEEEeeccE Confidence 1 24577 45899998775310 000011122233333322221 1222221111 1122223334556543 Q ss_pred e-cC--cccccCccccccccc Q lcl|NC_015279. 436 A-NP--FAEGTTVGAGRLRVN 453 (467) Q Consensus 436 ~-nP--~~~~~~~~~~~~~~~ 453 (467) + +| |+..+-..++ .| T Consensus 401 ~~~~~a~~~~~~~~~~---~g 418 (418) T protein:vir:10 401 VYRPESFVTGALVEQA---GG 418 (418) T ss_pred EecccceEEEEeccCC---CC Confidence 2 34 2222111111 11 No 80 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=46.84 E-value=0.72 Score=21.36 Aligned_cols=329 Identities=12% Similarity=0.055 Sum_probs=118.6 Q ss_pred CcchHHHHHhhhhhhccCcc---------------chhc-chhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhcc Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGL---------------DKIS-DPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVG 64 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~---------------~~i~-~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g 64 (467) .-+.+.|.++..-..+.+.+ .+.. ...++.....+.++ ++.+...........|. ..+ T Consensus 37 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~e~--~a~- 110 (404) T protein:vir:10 37 SNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAIADN---LLKQKNQRGLNLSEKEI--NAI- 110 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHHHH---HHHHHHhhhhcchhhHH--hhh- Confidence 11111222222211100000 0000 00011111111111 11111111111111111 111 Q ss_pred cccccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccccccccc Q lcl|NC_015279. 
65 NGGYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNE 141 (467) Q Consensus 65 ~~~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a 141 (467) ...++++|.+ ..+.+-++.+.| +.....+++++.||+++.|-+-=.| ..+. .. ..+- T Consensus 111 --~~~~~~~gg~~vP~~~~~~ii~~~~---~~~~l~~l~~~~~~~~~~g~~~~~~--~~~~--~~-------~~~v---- 170 (404) T protein:vir:10 111 --SENIDEDGGYAVPEDIQTKINTRLK---DTTDLYNMVDYEPVFTRSGSRTYEK--RSKQ--KP-------MKPL---- 170 (404) T ss_pred --ccccCCCCceeechhHHHHHHHHHh---hhhhHhhhhceeeccCCccceEEEE--ecCC--cc-------eeec---- Confidence 0111122222 123334444444 5557788999999999998543222 1110 00 0000 Q ss_pred cccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcC-C-CCCccceeeeEEEEEEEEeeccc Q lcl|NC_015279. 142 GFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLG-T-SGDNFNEMAFSIEKVTVTAKSRA 219 (467) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LG-s-~g~~f~EMaFsIEK~tVtAKSRa 219 (467) .+++... + ....|.+..|+..|.. T Consensus 171 -----------------------------------------------~e~~~~~~~~~~~~f~~i~~~~~k~~------- 196 (404) T protein:vir:10 171 -----------------------------------------------SENQQIPTNGDNGKLERFNFKLKDLA------- 196 (404) T ss_pred -----------------------------------------------cccccccccccccceeeeEeeheeeE------- Confidence 0000000 0 0122555555555544 Q ss_pred ccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee------ccccchhH Q lcl|NC_015279. 220 LKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD------IDSNGRWS 293 (467) Q Consensus 220 LKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~------~~~~~r~~ 293 (467) -...+|-||.+|-. .+.++.|.+.|+..|...+|+.||.-. -.+....|+.... 
.....-| T Consensus 197 ~~~~iS~ell~ds~----~~l~~~i~~~la~~~~~~~~~~il~G~--------g~~~~~~gi~~~~~~~~~~~~~~~~~- 263 (404) T protein:vir:10 197 DFMSIPNDLLKFAD----KSLEDWIINWFVDKVRITRNAEILYGA--------GGDEHATGIMTANKFKKITLPKSPAL- 263 (404) T ss_pred eeehhhHHHHhhcH----HHHHHHHHHHHHHHHHHHHHHHHhhcC--------CCCCcccceeeccccceeeccccccH- Confidence 44578999998843 357788888888888888888777321 1111223333221 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCcc-EEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEecc Q lcl|NC_015279. 294 VEKFKGLLFQIERDANAIAQRTRRGKGN-MILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDP 372 (467) Q Consensus 294 ve~~~~l~~~i~~ean~i~~~t~rg~gn-~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~ 372 (467) .....+ ++.+ .. ..+.++ .+||||+....|..- ....+- ..+..+.+ ....++|+ |++|++.+ T Consensus 264 -~~~~~~-------~~~~-l~-~~~~~~~~~v~n~~~~~~L~~l---kd~~G~-~l~~~~~~-~~~~~~l~-G~PV~~~~ 327 (404) T protein:vir:10 264 -KDFKKC-------KNVE-LL-NVFKATSSWIVNQDGFNYLDSL---EDKTGR-PYLQPDPK-DPTQYRFL-GLPVIELP 327 (404) T ss_pred -HHHHHH-------HHhh-hh-ccccCCCEEEEcHHHHHHHHHh---hccCCc-eeeccCcC-CCCCcccc-ceeeEEec Confidence 111111 1111 11 223333 468999998888652 211110 00111111 11234664 56777432 Q ss_pred cccccch---hhcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cC--ccc--- Q lcl|NC_015279. 373 YSSNLTS---ANAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NP--FAE--- 441 (467) Q Consensus 373 y~~~~~~---~~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP--~~~--- 441 (467) ...-... ..++ ...++++++..+..+..- .++ ...+-...+=.+-...|++..+ +| |+. T Consensus 328 ~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~----~~~------~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~ 397 (404) T protein:vir:10 328 NDLLLSTESAIPVLLGDTKEAYKYVSDGAYELAT----TNI------GAGAFETNTTKARIIMRIDGNVKDSEALLIAEI 397 (404) T ss_pred ccccCCCCCccEEEEEeccccEEEEEecceEEEE----ecc------ccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 2110000 0000 001223333222221111 010 0011123344455666766543 33 221 Q ss_pred ccCcccc Q lcl|NC_015279. 
442 GTTVGAG 448 (467) Q Consensus 442 ~~~~~~~ 448 (467) .....|+ T Consensus 398 ~~aa~~~ 404 (404) T protein:vir:10 398 PVESVQA 404 (404) T ss_pred ecccCCC Confidence 2223333 No 81 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=46.08 E-value=0.75 Score=21.28 Aligned_cols=325 Identities=13% Similarity=0.027 Sum_probs=109.9 Q ss_pred CcchHHHHHhhhhhhccCc-----------------cchhcchhHHHHHHHHhhhHHHHHHHHhhhhhcchhhhhhhhhc Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEG-----------------LDKISDPHRRAVTAVLLENQEKFMQEQVAFEQGGMIAEQPTNAV 63 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~-----------------~~~i~~~~~~~v~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~ 63 (467) .-..+.|..+..-+-.... ...-...+++.....+.+.+...+..... ..+.+.... T Consensus 178 ~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~----~~~~~~~~~-- 251 (543) T protein:vir:81 178 LSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEK----RAINEVRAM-- 251 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhh----hhhhhhhhc-- Confidence 0011112111111100000 00000000000000111111101100000 001111100 Q ss_pred cccccccccccccc---ccCch-hhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccccccc Q lcl|NC_015279. 64 GNGGYTSSGGQTVA---GFDPV-LISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQ 139 (467) Q Consensus 64 g~~~~~st~tg~i~---~~~P~-Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~ 139 (467) ..++++|.+. .+.+. ++.+.|.. -+...++-|.|++|..- +- + . .++..+ . 
T Consensus 252 ----~~t~~~gg~lip~~~~~~ii~~~~~~~---~~l~~~~~~~~~~g~~~--~~-~--~--~~~~~a-------~---- 306 (543) T protein:vir:81 252 ----GLTKADGGYLVPFQLDPTVIITSNGSL---NDIRRFARQVVATGDVW--HG-V--S--SAAVQW-------S---- 306 (543) T ss_pred ----ccccccCcccCchhhhhHHHHHHHhhh---chhhhhcccccCCcceE--EE-E--e--cCCcce-------e---- Confidence 0111121111 11222 12333321 23344455555443321 00 0 0 000000 0 Q ss_pred cccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeeccc Q lcl|NC_015279. 140 NEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRA 219 (467) Q Consensus 140 ~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRa 219 (467) ..++ +..+++-..+++.++++++.-+ T Consensus 307 -----------------------------------------------~v~E-------g~~~~~~~~~~~~i~~~~~k~~ 332 (543) T protein:vir:81 307 -----------------------------------------------WDAE-------FEEVSDDSPEFGQPEIPVKKAQ 332 (543) T ss_pred -----------------------------------------------eccc-------CccccccccccceeeeeeeeeE Confidence 0011 1122223334455555555555 Q ss_pred ccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEE--------eeccccch Q lcl|NC_015279. 220 LKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFD--------LDIDSNGR 291 (467) Q Consensus 220 LKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~D--------l~~~~~~r 291 (467) =...+|-||.+|- + |.++.|.+-|...|...+|+-||.- .-.+-...|++. +.....+- T Consensus 333 ~~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~d~ail~G--------~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~ 399 (543) T protein:vir:81 333 GFVPISIEALQDE--A---NVTETVALLFAEGKDELEAVTLTTG--------TGQGNQPTGIVTALAGTAAEIAPVTAET 399 (543) T ss_pred eeehhhHHHHhcc--H---HHHHHHHHHHHHHHHHHHHHHHhcc--------CCCCcccccchhhccccccccccccccc Confidence 5678999999873 2 6899999999999999999988721 101111223221 11111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc-ccccCCceeEEEecCceEEEe Q lcl|NC_015279. 
292 WSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGNTFAGVLQGKYRVYI 370 (467) Q Consensus 292 ~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~-~~d~t~~~~~G~l~~~~~vy~ 370 (467) ...+.+..+...+.- .......+|++|.+...|..- ....+ ..+ .....+ .-++| .|++||+ T Consensus 400 ~~~~~~~~~~~~l~~---------~~~~~~~~v~n~~~~~~l~~l---kd~~G--~~l~~~~~~g--~~~~l-~G~pv~~ 462 (543) T protein:vir:81 400 FALADVYAVYEQLAA---------RHRRQGAWLANNLIYNKIRQF---DTQGG--AGLWTTIGNG--EPSQL-LGRPVGE 462 (543) T ss_pred ccHHHHHHHHHhhhc---------cccCCcEEEEcHHHHHHHHHh---hcCCC--ceeccCcCCC--CCccc-cceeeEE Confidence 122233334333321 111123578899988888642 21111 011 111111 13467 4578888 Q ss_pred cccccccchhhc--------cCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeecee-ecC--c Q lcl|NC_015279. 371 DPYSSNLTSANA--------ANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMV-ANP--F 439 (467) Q Consensus 371 D~y~~~~~~~~~--------~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--~ 439 (467) ..+......... .-.+.++++|..+..+. =..||+- ...|-...+=.+=+..|+|.. .|| | T Consensus 463 ~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i----~~~~~~~----~~~~~~~~~~~~~~~~r~d~~v~~~~A~ 534 (543) T protein:vir:81 463 AEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTV----EFIPHLF----GTNRRPNGSRGWFAYYRMGADVVNPNAF 534 (543) T ss_pred eccccccccccccCCcceEEEeeccceeEEeecccEE----EEecccc----ccchhhcCceEEEEEEeeccEeecccce Confidence 876421100000 00111222333222211 1122210 011222233344445566653 344 2 Q ss_pred ccccCcccc Q lcl|NC_015279. 
440 AEGTTVGAG 448 (467) Q Consensus 440 ~~~~~~~~~ 448 (467) ...+-...+ T Consensus 535 ~~l~~~~~a 543 (543) T protein:vir:81 535 RLLNVETAS 543 (543) T ss_pred EEEEecccC Confidence 111111111 No 82 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=41.65 E-value=0.92 Score=20.79 Aligned_cols=337 Identities=15% Similarity=0.110 Sum_probs=111.0 Q ss_pred Ccc-------------hHHHHHhhhhhhccCcc----------------chhcch-------hHHHHHHHHhhhHHHHHH Q lcl|NC_015279. 1 MFQ-------------SEQLQEKWAPLLNYEGL----------------DKISDP-------HRRAVTAVLLENQEKFMQ 44 (467) Q Consensus 1 ~~~-------------~~~l~~kw~p~l~~~~~----------------~~i~~~-------~~~~v~~~~~enq~~~~~ 44 (467) ++. -++|.++..-+=..|.+ +.+... ...+....+.+.+..+.+ T Consensus 30 ~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (428) T protein:vir:10 30 TLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQD 109 (428) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHHH Confidence 111 22233222211000000 000000 000000001111000000 Q ss_pred HHhhhhhcchhhhhhhhhcccccccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC Q lcl|NC_015279. 45 EQVAFEQGGMIAEQPTNAVGNGGYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST 121 (467) Q Consensus 45 e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~ 121 (467) ..... 
......+......+ .++++|.+ ....+-++.+.| +..+..++ |+...++++|-+-=.| ..+ T Consensus 110 ~~~~~-~~~~~~~~~~~~~~----~~~~~gg~liP~~~~~~ii~~l~---~~~~l~~~-~~~~~~~~~g~~~~p~--~~~ 178 (428) T protein:vir:10 110 AAKFA-SDELNDQSVSMAIS----TAAGSGGVLIPQNIHSEVIELLR---DRTIVRKL-GARSIPLPNGNMSLPR--LAG 178 (428) T ss_pred HHHHh-hhhhhhhhHhhhhc----ccccCCccccchhHHHHHHHHHh---hhchhhhh-cceeeecCCcceEEEE--EeC Confidence 00000 00000011111001 11112211 122233444444 34444555 3333333333321111 000 Q ss_pred CCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCcc Q lcl|NC_015279. 122 QGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNF 201 (467) Q Consensus 122 qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f 201 (467) +..+ + ..+ | +... T Consensus 179 --~~~a----------------------------------------------------~------~v~--E-----g~~~ 191 (428) T protein:vir:10 179 --GATA----------------------------------------------------S------YTG--E-----NQDA 191 (428) T ss_pred --Ccce----------------------------------------------------e------eec--c-----Cccc Confidence 0000 0 001 1 1233 Q ss_pred ceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccccee Q lcl|NC_015279. 
202 NEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGV 281 (467) Q Consensus 202 ~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv 281 (467) ++...++++++...|.-+-...+|-||.+|- ..|.++.|.+.|...|...+|+.||.- .-.+....|+ T Consensus 192 ~~~~~~f~~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~ai~~~~d~~~l~G--------~G~~~~p~Gi 259 (428) T protein:vir:10 192 KVSEARFDDVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTAISVREDKAFMRD--------DGTGDTPIGM 259 (428) T ss_pred cccccceeeEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhcc--------CCCCcccccc Confidence 4444455555555555555688999999884 246788888888888888888888731 1111122333 Q ss_pred EEe----------eccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccccc Q lcl|NC_015279. 282 FDL----------DIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNV 351 (467) Q Consensus 282 ~Dl----------~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~ 351 (467) +-- ......-+ .....+ .....-+..... ........|+++.....|..- ....+ ..+-. T Consensus 260 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~~~~~--~~~~~~~~v~n~~~~~~L~~l---kd~~G--~~i~~ 329 (428) T protein:vir:10 260 KARATQWNRLLPWAADAAVNL--DTIDTY-LDSIILMSMDGN--SNMISSGWGMSNRTYMKLFGL---RDGNG--NKVYP 329 (428) T ss_pred ccccccccccccccccccccH--HHHHHH-HHHHHHhhhccc--cccccCEEEEcHHHHHHHHHh---hccCC--ceecc Confidence 321 11111110 111111 111111111111 112234456788887777542 21111 01111 Q ss_pred ccCCceeEEEecCceEEEecccccc-cch------hhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccc-- Q lcl|NC_015279. 352 DDTGNTFAGVLQGKYRVYIDPYSSN-LTS------ANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTF-- 422 (467) Q Consensus 352 d~t~~~~~G~l~~~~~vy~D~y~~~-~~~------~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~-- 422 (467) +. ..|+| .|++||++.+.-. +.. ...-. 
+.++++|..|.-+.+ ..+|..........-..| T Consensus 330 ~~----~~g~l-~G~pv~~~~~~p~~~~~~~~~~~i~~gd-~s~~~i~~~~~i~i~----~~~~~~~~~~~~~~~~~f~~ 399 (428) T protein:vir:10 330 EM----AQGML-KGYPIQRTSAIPANLGEGGKESEIYFAD-FNDVVIGEDGNMKVD----FSKEASYIDTDGKLVSAFSR 399 (428) T ss_pred CC----CCCee-eceeeEEeccccccccCCCccceEEEEe-cceEEEEEecceEEE----eecccccccccccccchhhc Confidence 11 13566 6788988776421 000 00001 123345555444432 222221110000000000 Q ss_pred -cceeeeeeeeceeec-C--cccccCccccccccc Q lcl|NC_015279. 423 -QPKIGFKTRYGMVAN-P--FAEGTTVGAGRLRVN 453 (467) Q Consensus 423 -qP~~g~~tRY~l~~n-P--~~~~~~~~~~~~~~~ 453 (467) +=.+=...|+++.+. | |+..+.. .| T Consensus 400 ~~~~~R~~~r~d~~v~~p~a~~~~t~~------~~ 428 (428) T protein:vir:10 400 NQSLIRVVTEHDIGFRHPEGLVLGTGV------LF 428 (428) T ss_pred chhheeeeeeeCceeeccceEEEEecc------CC Confidence 111123344544432 3 2221111 11 No 83 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=41.20 E-value=0.94 Score=20.74 Aligned_cols=323 Identities=13% Similarity=0.122 Sum_probs=116.3 Q ss_pred CcchHHHHHhhhhhhcc--CccchhcchhHHHHH---HHH--hhhHHHHHHHHhh------------------------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNY--EGLDKISDPHRRAVT---AVL--LENQEKFMQEQVA------------------------- 48 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~--~~~~~i~~~~~~~v~---~~~--~enq~~~~~e~~~------------------------- 48 (467) --..++++++=.-.+.. +.+-++.... ..+. +.+ ++.|.+.++++.. T Consensus 12 ~~~~~e~~~~l~~~~~~~~~~~e~~~~l~-~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 90 (389) T protein:vir:10 12 SAKCADLNAQLNAKLQDENASVDDFQKIK-DDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDA 90 (389) T ss_pred HHHHHHHHHHHHHHHHhHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHH Confidence 00000111110000000 0001111100 0000 000 1111111100000 Q ss_pred --hhhcchhh-h-hhhhhccccccccccccccc---ccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC Q lcl|NC_015279. 
49 --FEQGGMIA-E-QPTNAVGNGGYTSSGGQTVA---GFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST 121 (467) Q Consensus 49 --~~~~~~~~-e-~~~~~~g~~~~~st~tg~i~---~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~ 121 (467) ..+..++- . ..... ....+++.|.+. .+.+.++.+. .+..+..+++.|.||+++++-+--++. . T Consensus 91 ~~~~~~~~lr~~~~~~~~---~~~~t~~~gg~~vP~~~~~~i~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~--~- 161 (389) T protein:vir:10 91 KKKAINDFIHSHGKVIDA---TSKVTSTEAGVLIPEEIIYDPTAEV---NSVVDLSTLVTKTPVTTPKGTYPILKR--A- 161 (389) T ss_pred HHHHHHHHhhcchhhhhh---hcccccCCcceeehHHHHHHHHHHH---HhhhhHHhhcceeeccCCeeEEEEEec--C- Confidence 00000000 0 00000 001111222221 2233344444 456677899999999988765443330 0 Q ss_pred CCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCcc Q lcl|NC_015279. 122 QGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNF 201 (467) Q Consensus 122 qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f 201 (467) ++.-+ + . +..++.--.+...| T Consensus 162 -~~~~~--------~-------------------------~-------------------------~E~~~~~~~~~~~~ 182 (389) T protein:vir:10 162 -TDRFS--------S-------------------------V-------------------------AELAENPKLAEPEF 182 (389) T ss_pred -CCccc--------c-------------------------c-------------------------cccccccccccccc Confidence 00000 0 0 00000000012346 Q ss_pred ceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccccee Q lcl|NC_015279. 202 NEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGV 281 (467) Q Consensus 202 ~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv 281 (467) .+..+++.|..+ -..+|-||.+|- ..|.+++|.+-|...+..-+|+.|+.-+-.. ...+ ..+. 
T Consensus 183 ~~i~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~----~~~~--~~~~ 245 (389) T protein:vir:10 183 NKVDWSVATYRG-------AIPLSEEAIADS----AVDLTALVGQSIKEKSVNTYNAMIAPVLQSF----TAKK--TTTD 245 (389) T ss_pred eeeeeeheeeEe-------eehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhhhccc----cccc--cccc Confidence 667777766654 356899999984 2467888999999999988898887443211 1111 1111 Q ss_pred EEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccc---cccccccccCCcee Q lcl|NC_015279. 282 FDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPA---LNANLNVDDTGNTF 358 (467) Q Consensus 282 ~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~---~~~~~~~d~t~~~~ 358 (467) ... ..+..+ +...... .+ ..-+|+++.....|... ....+ +.... .+.+.... T Consensus 246 ~~~----------d~l~~~-~~~~~~~--------~~-~a~~~~n~~~~~~L~~l---kd~~G~~i~~~~~-~~~~~~~~ 301 (389) T protein:vir:10 246 TLV----------DSLKHI-LNVDLDP--------AY-SRALVVTQSLFNTLDTL---KDKNGRYLLHDAS-DSITDGTA 301 (389) T ss_pred ccH----------HHHHHH-HHhhhhh--------hh-CcEEEecHHHHHHHHHh---hccCCCeeeecCc-cccccccc Confidence 110 112222 1111111 12 24578888888777642 21111 00001 01122233 Q ss_pred EEEecCceEEEe-cc-cccccc-hhhcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeec Q lcl|NC_015279. 359 AGVLQGKYRVYI-DP-YSSNLT-SANAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYG 433 (467) Q Consensus 359 ~G~l~~~~~vy~-D~-y~~~~~-~~~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~ 433 (467) .++| .|++||+ |. ...... +..++ .-.++++++-++.-..+ ..|-..|.-.+...-|++ T Consensus 302 ~~~l-~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~---------------~~~~~~~~~~~~~~~r~d 365 (389) T protein:vir:10 302 KGTI-LGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLA---------------WEDSKIYGKYLGAAFRFG 365 (389) T ss_pred cccc-ccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEE---------------eeccccccceEEEEEEec Confidence 4567 5566664 32 111000 00000 00122222222111111 112333444556667887 Q ss_pred ee-ecC--cc-----cccCccccc Q lcl|NC_015279. 
434 MV-ANP--FA-----EGTTVGAGR 449 (467) Q Consensus 434 l~-~nP--~~-----~~~~~~~~~ 449 (467) .. .|| |. ......+++ T Consensus 366 ~~~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 366 VQKADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred cEEecccceEEEEeeccCCCCCCC Confidence 64 344 11 111222222 No 84 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=39.62 E-value=1 Score=20.56 Aligned_cols=334 Identities=13% Similarity=0.069 Sum_probs=104.7 Q ss_pred Ccc-----hHHHHHhhhhhhccCccchhcch------hHHHHHHH---HhhhHHHHHHH--Hhhhhhc------------ Q lcl|NC_015279. 1 MFQ-----SEQLQEKWAPLLNYEGLDKISDP------HRRAVTAV---LLENQEKFMQE--QVAFEQG------------ 52 (467) Q Consensus 1 ~~~-----~~~l~~kw~p~l~~~~~~~i~~~------~~~~v~~~---~~enq~~~~~e--~~~~~~~------------ 52 (467) +.. .++|++...-+-+.+-..++... ..+.+-.+ .+|.+++...+ +.....+ T Consensus 9 l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (392) T protein:vir:13 9 NFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSGAQRSADHD 88 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccchhhhhhHH Confidence 110 11222222211111111111111 01111111 12221110000 0000000 Q ss_pred -------chhhhhhhhhcccccccccccccccccCchh-hhhHHHHHh-hhhhhhceeeccCCccceeeeeeeeeecCCC Q lcl|NC_015279. 53 -------GMIAEQPTNAVGNGGYTSSGGQTVAGFDPVL-ISLIRRSMP-NLVAYDLAGVQPMSGPTGLIFAMRSKYSTQG 123 (467) Q Consensus 53 -------~~~~e~~~~~~g~~~~~st~tg~i~~~~P~L-v~l~Rr~~p-~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs 123 (467) +.+.|...-.........|++++-...-|.+ -.++....+ ..+...++-|=|+++...+-+-.. 
T Consensus 89 ~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~------- 161 (392) T protein:vir:13 89 DDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVI------- 161 (392) T ss_pred HHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEE------- Confidence 0000000000000000011111100001111 111111111 112233333333322211111000 Q ss_pred CCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccce Q lcl|NC_015279. 124 GTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNE 203 (467) Q Consensus 124 GtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~E 203 (467) .+ ...+ ...++ +..+++ T Consensus 162 --------------------------------------~~------------~~~a------~~v~E-------~~~~~~ 178 (392) T protein:vir:13 162 --------------------------------------TG------------RATA------GIVGE-------TAEIPE 178 (392) T ss_pred --------------------------------------cC------------Ccce------eeecc-------cccccc Confidence 00 0000 00111 122333 Q ss_pred eeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEE Q lcl|NC_015279. 204 MAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFD 283 (467) Q Consensus 204 MaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~D 283 (467) -..++++++...+.-+-...+|-||.+|= ..|.++.|.+-|...|..-+|..||.- .-.+ ...|++. T Consensus 179 ~~~~f~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~l~G--------~Gt~-~p~Gil~ 245 (392) T protein:vir:13 179 SYPATTQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIGDAMGRHFLTG--------TGTG-QPRGILT 245 (392) T ss_pred cccceeeEEeeeeeEEeeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhcc--------cCCc-ccccccc Confidence 44444444444444555667899999983 367889999999999999999988831 0000 1233332 Q ss_pred eecccc--------chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-cccccC Q lcl|NC_015279. 
284 LDIDSN--------GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDDT 354 (467) Q Consensus 284 l~~~~~--------~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~t 354 (467) .....+ +.-..+....+.+.+.. --+..+ ..|+++.....|.. +....+ .. +..+.+ T Consensus 246 ~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~--------~~~~~a-~~v~n~~~~~~l~~---lkd~~G--~~l~~~~~~ 311 (392) T protein:vir:13 246 DATGANAAFGEADADSKVSDALIDLFHEVPS--------AYRKNA-KFVVNDLRAAQMRK---LKDANG--QYLWQSALT 311 (392) T ss_pred ccccccccccccccccccHHHHHHHHHhhhh--------hhhcCC-EEEEcHHHHHHHHH---hhccCC--ceeecCCcC Confidence 211100 00001112223222211 123333 45778888777753 222111 01 111111 Q ss_pred CceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceee--eeeee Q lcl|NC_015279. 355 GNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIG--FKTRY 432 (467) Q Consensus 355 ~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g--~~tRY 432 (467) .. ..++| .|++||++.+.... .-..-.+..+++|..+....+ +..|+..-...++ ...|. T Consensus 312 ~g-~~~~l-~G~Pv~~~~~~~~~--~i~~Gdf~~~~i~~~~~~~i~--------------~~~~~~~~~~~~~~r~~~r~ 373 (392) T protein:vir:13 312 VG-APDTF-NGKVVETDDGMPAD--KVLFADLSKYRVRFAGSLRVD--------------RSVDAKFSTDQIVYRFLQRA 373 (392) T ss_pred CC-CCcee-cceeeEEcCCCCCC--cEEEeeccceeEEeecceEEE--------------eeccccccCCcEEEEEEEEe Confidence 11 12366 46899998886310 000000111223333222211 1112221112222 23344 Q ss_pred ce-eecCcccc-cCccccc Q lcl|NC_015279. 433 GM-VANPFAEG-TTVGAGR 449 (467) Q Consensus 433 ~l-~~nP~~~~-~~~~~~~ 449 (467) +. ..||-+-. ..-..+. 
T Consensus 374 d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 374 DGLLVDARGAKVLTVTPAA 392 (392) T ss_pred ccEEecccceEEEEeeccC Confidence 32 33342111 1111111 No 85 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=37.76 E-value=1.1 Score=20.35 Aligned_cols=319 Identities=14% Similarity=0.109 Sum_probs=113.7 Q ss_pred CcchHHHHHhhhhhh------------------------ccCcc------c--hhcchhHHHHHHHHhhhHHHHHHH--- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLL------------------------NYEGL------D--KISDPHRRAVTAVLLENQEKFMQE--- 45 (467) Q Consensus 1 ~~~~~~l~~kw~p~l------------------------~~~~~------~--~i~~~~~~~v~~~~~enq~~~~~e--- 45 (467) .... +-.++|.-+. +.+.. + .-....|+.+ ...+..+....+. T Consensus 30 ~~~~-~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 107 (394) T protein:vir:97 30 ALES-DDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESV-NDFIRSKGKIVNDSLR 107 (394) T ss_pred hhch-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHH-HHHHHHHHHHhhhhhh Confidence 1110 1112222211 00000 0 0000111111 1111111100000 Q ss_pred ------HhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeee Q lcl|NC_015279. 46 ------QVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKY 119 (467) Q Consensus 46 ------~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY 119 (467) ...........+ .. .+ +. 
++.+|.+.--....-.+++...+..+...++.+.||+++++-+--++ T Consensus 108 ~~~~~~~~~~~~~~~~~~--~~--~~-~~-t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--- 178 (394) T protein:vir:97 108 FEGKDEVLMPINETTPVE--PQ--KD-GI-KKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQ--- 178 (394) T ss_pred hhhHHHHHHHHHhhhhhh--hh--cc-cc-ccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEe--- Confidence 000000000000 00 00 00 11111111111112224444445567788999999988876442221 Q ss_pred cCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCC Q lcl|NC_015279. 120 STQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGD 199 (467) Q Consensus 120 ~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~ 199 (467) ..+..+ .+ + +| +. T Consensus 179 --~~~~~~-------~~---------------------------------------------v--------~E-----~~ 191 (394) T protein:vir:97 179 --RATTKM-------VT---------------------------------------------V--------AE-----LE 191 (394) T ss_pred --cCCCcc-------ce---------------------------------------------e--------cc-----cc Confidence 000000 00 0 00 01 Q ss_pred cccee-eeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccc Q lcl|NC_015279. 200 NFNEM-AFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTAT 278 (467) Q Consensus 200 ~f~EM-aFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~ 278 (467) ..++. ...+++++..++.-+-...+|-||.+|- +.|.+++|.+-|+..|..-+|..||.-+-+. .+ T Consensus 192 ~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~---------~~ 258 (394) T protein:vir:97 192 KNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSESISQIKVNTTNDAIAKVLKSF---------TT 258 (394) T ss_pred cccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cc Confidence 11111 1334555555555555678999999986 3467888888888888888888877533221 11 Q ss_pred ceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCcee Q lcl|NC_015279. 
279 AGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTF 358 (467) Q Consensus 279 ~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~ 358 (467) .+...+ +....++ +.. .. .++.+. +|++|.+...|..- ....+- .-+..+.++ -. T Consensus 259 ~~~~~~----------~~~~~~~-------~~~-~~-~~~~a~-~v~n~~~~~~l~~l---kd~~G~-~i~~~~~~~-~~ 313 (394) T protein:vir:97 259 KTVKNL----------DEIKALL-------NGG-FD-PAYNVS-LIVSQSFYQTLDTL---KDGNGR-YLLQDDITA-VS 313 (394) T ss_pred cccccH----------HHHHHHH-------Hhh-hh-hhhCCE-EEEcHHHHHHHHHh---hccCCC-eeeecCcCC-CC Confidence 222211 0111111 111 11 122333 67899988887652 211110 001111111 11 Q ss_pred EEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-c Q lcl|NC_015279. 359 AGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-N 437 (467) Q Consensus 359 ~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-n 437 (467) -++|. |++|++.+... .|..-+++|=- + .++++..-..+. +...|...++..+-...|++..+ + T Consensus 314 ~~~l~-G~pv~~~~~~~--------~~~~~~~~gd~--~---~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~r~d~~v~~ 378 (394) T protein:vir:97 314 GKVLL-GKPVFVLSDEV--------LGANKAFIGDF--K---RGVLFADRKDLG-LRWADNEIYGQYLQAVLRFGVSKVD 378 (394) T ss_pred Cceec-cceeEEecccc--------cCCccEEEeec--c---ccEEEEEecceE-EEEecccccceeEEEEEEEccEEec Confidence 34674 46666522211 12222222210 0 011111111111 11223334444444556766543 3 Q ss_pred CcccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 
438 PFAEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 438 P~~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) | ..|..+.+++.- T Consensus 379 ~-----------------~a~~~~~~~~~~ 391 (394) T protein:vir:97 379 D-----------------KAGYYVTFTPEP 391 (394) T ss_pred c-----------------cceEEEEecccc Confidence 3 112222222221 No 86 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=37.08 E-value=1.1 Score=20.28 Aligned_cols=329 Identities=13% Similarity=0.075 Sum_probs=118.4 Q ss_pred CcchHHHHHhhhhhhccC-----ccchhcchhHHHHH-HHHhhhHHHHHHH----------------------------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYE-----GLDKISDPHRRAVT-AVLLENQEKFMQE----------------------------- 45 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~-----~~~~i~~~~~~~v~-~~~~enq~~~~~e----------------------------- 45 (467) --..+++.++=.-.++.+ .+.++.....+... ..-++.|.+.+++ T Consensus 12 ~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 91 (394) T protein:vir:10 12 SAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPNGTDLKKKPIDA 91 (394) T ss_pred HHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcccccchhhhHHHH Confidence 000000111000011100 00011000000000 0001111000000 Q ss_pred ----HhhhhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecC Q lcl|NC_015279. 46 ----QVAFEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYST 121 (467) Q Consensus 46 ----~~~~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~ 121 (467) -..+..+....+.... +..++..|.+.--.+..-.+++...+..+-.+++.+.||+++++-+--.+ . 
T Consensus 92 ~~~~~~~~l~~~~~~~~~~~-----~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~-- 162 (394) T protein:vir:10 92 KKKAINDFIHSHGKVIDNAA-----GHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILK--R-- 162 (394) T ss_pred HHHHHHHHHhccchhhhhhh-----cccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEe--c-- Confidence 0000000000000000 00111122222112222234555556667789999999999876655443 0 Q ss_pred CCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CCCc Q lcl|NC_015279. 122 QGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SGDN 200 (467) Q Consensus 122 qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g~~ 200 (467) .+.++ .+- .+.+.... +... T Consensus 163 -~~~~~-------~~~---------------------------------------------------~E~~~~~~~~~~~ 183 (394) T protein:vir:10 163 -ATDRF-------SSV---------------------------------------------------AELAENPALAEPE 183 (394) T ss_pred -CCCcc-------ccc---------------------------------------------------ccccccccccccc Confidence 00000 000 00000000 1134 Q ss_pred cceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccce Q lcl|NC_015279. 201 FNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAG 280 (467) Q Consensus 201 f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~g 280 (467) |.+..|.+.|..+ ...+|-||.+|- ..|.+++|.+-|+..|..-+|+.|+.-.- .+.. .+ T Consensus 184 ~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~g----~~~~-----~~ 243 (394) T protein:vir:10 184 FEQVDWSVSTYRG-------AIPLSEEAIADS----AVDLTSLVGQSINEKSVNTYNAMIAPVLQ----SFTA-----KA 243 (394) T ss_pred ceeEEeeeeeeEe-------eehhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhccc----cccc-----cc Confidence 6666666666544 467999999984 25788999999999999999999874332 1111 11 Q ss_pred eEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccc---cccccccccCCce Q lcl|NC_015279. 
281 VFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPA---LNANLNVDDTGNT 357 (467) Q Consensus 281 v~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~---~~~~~~~d~t~~~ 357 (467) +... . - ......++...... .+. ..+|++|.....|..- ....+ +.... ...++.. T Consensus 244 ~~~~---~--~--~d~l~~~~~~~~~~---------~~~-a~~vmn~~~~~~l~~l---kd~~G~~i~~~~~-~~~~~~~ 302 (394) T protein:vir:10 244 TTTD---T--L--VDSLKHILNVDLDP---------AYS-RALVVTQSLFNTLDTL---KDKNGRYLLHDAS-DSITDGT 302 (394) T ss_pred cccc---c--c--HHHHHHHHHhhhhh---------hcc-CEEEecHHHHHHHHHh---hccCCCeeeeccc-cccccCC Confidence 1111 0 0 01122221111111 122 3577888887777642 21111 00000 1112223 Q ss_pred eEEEecCceEEEecccccccchh---hcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeee Q lcl|NC_015279. 358 FAGVLQGKYRVYIDPYSSNLTSA---NAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRY 432 (467) Q Consensus 358 ~~G~l~~~~~vy~D~y~~~~~~~---~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY 432 (467) ..++| -|++|++.......... .+. .-.+|++++-.+... +- ..+...|.-.+-...|+ T Consensus 303 ~~~~L-~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~----v~-----------~~~~~~~~~~~~~~~r~ 366 (394) T protein:vir:10 303 AKGTV-LGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVT----LA-----------WEDSKIYGRYLGAAFRF 366 (394) T ss_pred ccccc-ccceeEEecccccCCCCCceEEEEeeccccEEEEeecceE----EE-----------EecccccceeEEEEEEe Confidence 34577 55677653221100000 000 001222222111111 11 12333344455556677 Q ss_pred cee-ecCccc--c---cCcccccccccc Q lcl|NC_015279. 433 GMV-ANPFAE--G---TTVGAGRLRVNS 454 (467) Q Consensus 433 ~l~-~nP~~~--~---~~~~~~~~~~~~ 454 (467) +.. .||-+- . ......--.+|. 
T Consensus 367 d~~~~~~~ai~~~~~~~~~~~~~~~~~~ 394 (394) T protein:vir:10 367 GVKQADSNAGYFVTNTDAASGSTSGTGK 394 (394) T ss_pred ccEEeccccEEEEEeecccCCCCCCCCC Confidence 653 233211 0 011111112333 No 87 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=36.71 E-value=1.2 Score=20.23 Aligned_cols=282 Identities=14% Similarity=0.051 Sum_probs=115.6 Q ss_pred hhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccc Q lcl|NC_015279. 54 MIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEAD 133 (467) Q Consensus 54 ~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEad 133 (467) |-.| .....+. -+++++...--....-.+++...+.-+-..++-+.||++++...+-.. . ++.++ T Consensus 1 m~~~--~~~~~~~--~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~---~~~~a------ 65 (297) T protein:vir:95 1 MTVQ--TFNPENV--LVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQ--T---DGISA------ 65 (297) T ss_pred CCcc--ccccccc--cccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEE--c---CCcee------ Confidence 1111 0000000 011112111111122234444445557788899999998876655332 1 00000 Q ss_pred cccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEE Q lcl|NC_015279. 134 TAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTV 213 (467) Q Consensus 134 t~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tV 213 (467) .| .+ | +..+++-..++++++. 
T Consensus 66 -~~---------------------------------------------------v~--E-----g~~~~~~~~~f~~v~l 86 (297) T protein:vir:95 66 -YW---------------------------------------------------VN--E-----TEKIKTDKPEVVPVTL 86 (297) T ss_pred -EE---------------------------------------------------ee--c-----CccccccccceeEEEE Confidence 00 01 1 1123333334455555 Q ss_pred EeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeecccc---- Q lcl|NC_015279. 214 TAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSN---- 289 (467) Q Consensus 214 tAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~---- 289 (467) ..|..+-...+|-||.+|-. .|.+..|.+-|+..|...+++.+|.--.+ ....|++....... T Consensus 87 ~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~---------~~~~gi~~~~~~~~~~~~ 153 (297) T protein:vir:95 87 KAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEAGLLGHDT---------PFANSVAKAAKDANKVIG 153 (297) T ss_pred eeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCC---------cccccccccccccceecc Confidence 55555556679999999875 46899999999999999999999832111 01222222211110 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEE Q lcl|NC_015279. 290 GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVY 369 (467) Q Consensus 290 ~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy 369 (467) +.-..+....++.++. . ..+..+-+|++|.....|..- .... +...-... .|+| .|++|+ T Consensus 154 ~~~t~~~i~~~~~~l~-------~--~~~~~~~~v~~~~~~~~L~~l---~d~~---G~~i~~~~----~~~l-~G~Pv~ 213 (297) T protein:vir:95 154 GPINYDNILKLQDALY-------D--ADVEPNAFVSKIQNRSALREA---RDGN---KVSIYDKA----ANTI-DGITTV 213 (297) T ss_pred cccCHHHHHHHHHHhh-------h--ccCCcCEEEEcHHHHHHHHHh---hccC---CceeecCC----CCcc-cceeeE Confidence 0001122223333332 1 123445689999998888641 2111 11111111 2344 356776 Q ss_pred ecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCc-----ccc-ceeee--eeeeceee-cC-- Q lcl|NC_015279. 
370 IDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGEN-----TFQ-PKIGF--KTRYGMVA-NP-- 438 (467) Q Consensus 370 ~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~-----s~q-P~~g~--~tRY~l~~-nP-- 438 (467) .-+...........-.+.++++|..+.-+.+-. .+ .......|+. -|| =.++| ..|++..+ || T Consensus 214 ~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~----~~--~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a 287 (297) T protein:vir:95 214 DLKSARFEKGDLLAGDFDNLIYGVPYNITYKIS----EE--GQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDA 287 (297) T ss_pred eecCCCCCCceEEEEecccEEEEEecCeEEEEe----ec--cccccccccCccchhhhhcCcEEEEEEEEeccEeecccc Confidence 544322111111111122333444443222111 00 0011111211 011 11222 24555443 33 Q ss_pred cccccCcccccc Q lcl|NC_015279. 439 FAEGTTVGAGRL 450 (467) Q Consensus 439 ~~~~~~~~~~~~ 450 (467) |+.-+.-.+ + T Consensus 288 ~~~l~~at~--~ 297 (297) T protein:vir:95 288 FAKLTPAER--V 297 (297) T ss_pred eEEEeecCC--C Confidence 222111111 1 No 88 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=33.91 E-value=1.3 Score=19.91 Aligned_cols=262 Identities=12% Similarity=0.031 Sum_probs=111.1 Q ss_pred eeeee--eecCCCCCcccccc------cccccccccccccccccccccccccCCCccccccccccccccccccccccccc Q lcl|NC_015279. 113 FAMRS--KYSTQGGTEALFDE------ADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQ 184 (467) Q Consensus 113 FAMRs--rY~~qsGtEAlfnE------adt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~ 184 (467) -||=. +..+---+|-+-+. ....|++-... .....| .| ....+++. 
T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~---------------~~~l~g-~~----------G~tv~iP~ 54 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADI---------------DNTLVG-QP----------GNTITFPA 54 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhccccee---------------cccccC-CC----------CCEEEeee Confidence 11111 00000000100000 00011100000 000000 00 01111111 Q ss_pred ccchhhHhhcCC-CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHh-hCCChhHHHHHHHHHHHHHHhcHHHHH Q lcl|NC_015279. 185 GMRTDEAEDLGT-SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAI-HGLNAEAELANILSSEILAEINREVIR 262 (467) Q Consensus 185 Gm~TA~aE~LGs-~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAi-HGLDAEtELaNILStEImlEINREII~ 262 (467) .-...++|.+.. ..-+..++.+ ...+++.|-|.-.-+++ |+-+. -+-|.=.|..+-++..|+..++.+++. T Consensus 55 ~~~ig~a~~~~~g~~i~~~~lt~--~~~~~~i~~~~~~~~i~-----D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~ 127 (275) T protein:vir:96 55 FVYSGDAKVVPEGEEIPIDLIET--KKRQATIRKIGKGTVLT-----DEALLSGYGDPKGEAVRQHGLAIANKVDNDVLE 127 (275) T ss_pred eccCCccccccCCCCcchhhccc--ceeeEEeehhccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHH Confidence 111123333321 1223444443 44445555554443433 33332 246888899999999999999999998 Q ss_pred HHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcc Q lcl|NC_015279. 263 TIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYT 342 (467) Q Consensus 263 ~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~ 342 (467) .+.+..... ....+ ..+.+-..+.++..| -..+++++++|.+++.|.-..-.++. 
T Consensus 128 ~l~~a~~~~------~~~~~----------~~d~i~dA~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~ 182 (275) T protein:vir:96 128 ALQGATLKV------EADIT----------KLAGLQTAIDKFNDE---------DLEPMVLFVNPLDAGKLRASATDNFT 182 (275) T ss_pred HHhcccccc------ccccc----------CHHHHHHHHHHhccc---------cCCccEEEeCHHHHHHHHhccccccc Confidence 776643321 11111 122333344444332 23678999999999988554322322 Q ss_pred cccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEe-cCCCccceeEecccchhhcccccCCcc Q lcl|NC_015279. 343 PALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYK-GTSPYDAGLFYCPYVPLQMVRAVGENT 421 (467) Q Consensus 343 ~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyK-G~~~~d~glfyaPYv~l~~~~~~Dp~s 421 (467) +......+. ..+-..|.+ .|++||+|.... +|-.+-++ |.-. ++-.+=++.+-. =|+.+ T Consensus 183 ~~~~~g~~~--~~~G~ig~~-~G~~Vi~s~~~p-----------~~t~~i~~~gA~~----~~~~~~~~vE~~--Rd~~~ 242 (275) T protein:vir:96 183 RATLLGDNV--IVKGAFGEA-LGAIIVRSNKIK-----------EGEAILAKRGAVK----LITKRDFFLETE--RHASH 242 (275) T ss_pred ccccccccc--eecccccee-cCeeEEEeCCCC-----------cceEEEEecccee----eeecCCcccccc--cchhh Confidence 222111111 112236777 788999997542 22222222 1111 111110111111 18889 Q ss_pred ccceeeeeeeecee-ecCc-ccccCcccccccc Q lcl|NC_015279. 422 FQPKIGFKTRYGMV-ANPF-AEGTTVGAGRLRV 452 (467) Q Consensus 422 ~qP~~g~~tRY~l~-~nP~-~~~~~~~~~~~~~ 452 (467) ++=.+--..+||+. .||= ....+-.|+.+-. T Consensus 243 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 243 KSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred cCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 99988888899853 4552 1112222332222 No 89 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=32.59 E-value=1.4 Score=19.76 Aligned_cols=285 Identities=12% Similarity=0.085 Sum_probs=111.6 Q ss_pred ccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCcccccccccccccccccccccc Q lcl|NC_015279. 
68 YTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTN 147 (467) Q Consensus 68 ~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~ 147 (467) --++++|.+.--....=.++++..+.-+..+++-|-||++..- -+ .++. ++.+| . T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~-~~---p~~~--~~~~a-------~------------ 55 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQ-QY---MTLT--APPRG-------E------------ 55 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCce-EE---EEEe--CCcee-------E------------ Confidence 0011122221111112234455556778889999999875421 11 0110 00000 0 Q ss_pred cccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccHH Q lcl|NC_015279. 148 GMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLE 227 (467) Q Consensus 148 ~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~E 227 (467) ..++ +..+++...++++++..+|.=+-....|-| T Consensus 56 ---------------------------------------wv~E-------g~~~~~~~~~f~~v~l~~~kl~~~~~iS~e 89 (311) T protein:vir:81 56 ---------------------------------------VVGE-------GAQKSESTATFAPVTAIPRKVQVTQRFSQE 89 (311) T ss_pred ---------------------------------------Eeec-------CcccccccceeeEEEEeeEEEEEeehhhHH Confidence 0011 112223333334444444433444578999 Q ss_pred HHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccc-cc-----ccceeEEeeccccchhHHHHHHHHH Q lcl|NC_015279. 228 LAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVS-NT-----ATAGVFDLDIDSNGRWSVEKFKGLL 301 (467) Q Consensus 228 LAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~-~~-----~~~gv~Dl~~~~~~r~~ve~~~~l~ 301 (467) |.|+--. -.++-|++|.+-|+..|...|+.-++.-.. +.-+... ++ .+..+....... .+..+.. 
T Consensus 90 ll~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~~--~~~~~~~~gi~~~~~~~~~~~~~~~~~--~~~~~~~---- 160 (311) T protein:vir:81 90 VKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGIN--PLTGAALSGSPAKILDTTNIVELTTGT--SATPDLA---- 160 (311) T ss_pred HhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhcccc--CCCCcccccccccccccceeeeecccc--cchHHHH---- Confidence 9875322 134567778888888888888777764321 0001000 00 111122221111 1111111 Q ss_pred HHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEecccccccchh- Q lcl|NC_015279. 302 FQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSA- 380 (467) Q Consensus 302 ~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~- 380 (467) |.+....+ +..++..+-+|++|.....|.. |.+..+ ..+-.+.......|+|. +++|+++.+....... T Consensus 161 --i~~~~~~~--~~~~~~~~~~vmn~~~~~~l~~---lkd~~G--~~l~~~~~~~~~~~tl~-G~Pv~~~~~i~~~~~~~ 230 (311) T protein:vir:81 161 --VEAAVGLV--LGDNLSPDGVALDNTFSFMLAT---QRDSQG--RKLYPELGFGTDVASFA-GLNAAVSDTVRGGPEAV 230 (311) T ss_pred --HHHHHHHh--hhcCCCceEEEEcHHHHHHHHh---hhccCC--CeeecCccccCCCceec-ceeEEeccccccccccc Confidence 11211222 2345677778889998888844 121111 00100101111246774 5888887654210000 Q ss_pred -------hccCCCceEEEEEecCCCccceeEecccchhhcccc--cCCcc----ccc-eeee--eeeece-eecC--ccc Q lcl|NC_015279. 381 -------NAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRA--VGENT----FQP-KIGF--KTRYGM-VANP--FAE 441 (467) Q Consensus 381 -------~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~--~Dp~s----~qP-~~g~--~tRY~l-~~nP--~~~ 441 (467) ....+...+++| +- +.+++...-++.+... .|+.. ||- .++| ..|+|. +.+| |+. T Consensus 231 ~~~~~~~~~~~~~~~~~~g---Df---s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~ 304 (311) T protein:vir:81 231 TASTGVYRTTNPNVKAIAG---DF---SAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV 304 (311) T ss_pred ccccchhcccCCccEEEEE---ec---ccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEE Confidence 000011111111 00 1123333222222221 12221 222 1333 367774 3566 433 Q ss_pred ccCcccc Q lcl|NC_015279. 
442 GTTVGAG 448 (467) Q Consensus 442 ~~~~~~~ 448 (467) -+....+ T Consensus 305 l~~a~~~ 311 (311) T protein:vir:81 305 VRDADES 311 (311) T ss_pred EEeeccC Confidence 2222222 No 90 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=32.11 E-value=1.4 Score=19.70 Aligned_cols=306 Identities=16% Similarity=0.070 Sum_probs=128.5 Q ss_pred CC-ccceeeeeeeeeecCCCCC-ccccc-----ccccccccccccccccccccccccccCCC-ccccccccccccccccc Q lcl|NC_015279. 105 MS-GPTGLIFAMRSKYSTQGGT-EALFD-----EADTAFAGQNEGFDLTNGMSDAAAGLGTT-SQAGSNPAALNPVATAS 176 (467) Q Consensus 105 mT-GPTGLIFAMRsrY~~qsGt-EAlfn-----Eadt~fSg~~a~~~~~~~~~~~~~~~~~~-~~agt~p~~ln~~~~~~ 176 (467) |- .|+|.=-+.|..+++.++. -|||= |.++.|.-. +.. .+.... ...+++..-.+..+... T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~-s~~----------~~~~~~r~i~~G~sv~~~~iG~~~ 69 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRR-SVT----------MDKHMVRTIQNGKSASFPVMGRTK 69 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHH-hhh----------hhccccccccCcceEEEeeeccee Confidence 54 5566666677777666665 35553 344444311 100 000000 00011111111111001 Q ss_pred ccccccccccchhhHhhcCCC--CCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHH Q lcl|NC_015279. 177 STGYNVGQGMRTDEAEDLGTS--GDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILA 254 (467) Q Consensus 177 ~~~~~~~~Gm~TA~aE~LGs~--g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEIml 254 (467) ...+ ..++.+..+ +.+..|.-++||+... +...+.-.-+.++ | .|--.|++.-...++.. 
T Consensus 70 ~~~~--------~~g~~l~~~~~~~~~~~~~i~ID~~~y--------~~~~Vdd~D~~q~-~-~D~r~~~~~~~g~aLA~ 131 (347) T protein:vir:88 70 GYYL--------APGENLDDKRKDIKHSEKVIQIDGLLT--------SDVLIYDIEDAMN-H-YDVRAEYSAQLGEALAI 131 (347) T ss_pred eeee--------ccccCCCCCCCCCccceEEEEEechhh--------hhhhhhhHHHHhh-c-CCchHHHHHHHHHHHHH Confidence 1111 122333322 2357889999998532 2333443333333 4 78889999999999999 Q ss_pred HhcHHHHHHHhhhcccccccccccceeEEee--------ccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEc Q lcl|NC_015279. 255 EINREVIRTIYKVSEQGAVSNTATAGVFDLD--------IDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCS 326 (467) Q Consensus 255 EINREII~~l~~~a~~~k~~~~~~~gv~Dl~--------~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S 326 (467) ++++-|+..|...+..-....-..+|..+-. +..+..-..+.....+++....+. .+-.-=.|.|+|++ T Consensus 132 ~~D~~i~~~l~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Ld---e~~VP~~gR~~vv~ 208 (347) T protein:vir:88 132 AADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLT---KNYVPAGDRRFYCA 208 (347) T ss_pred HHHHHHHHHHHHhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHh---hcCCCCCCCEEEeC Confidence 9999999888776654322221222211111 000000000111122233322222 22233458999999 Q ss_pred hHHHHHHhhhcchhcccccccccccc-cCCceeEEEecCceEEEecccccccchhhccCCCceEEEE------------E Q lcl|NC_015279. 327 ADVASALTMAGVLDYTPALNANLNVD-DTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVG------------Y 393 (467) Q Consensus 327 ~~Va~~L~~sG~~~~~~~~~~~~~~d-~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vG------------y 393 (467) |+....|-..= +.. 
+...+.+ +.-.-..|.+ .+++||.=++....+.........|-..+ | T Consensus 209 P~~y~~Ll~~~--~~~---~~~~~~~~~~~~G~vg~i-~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~ 282 (347) T protein:vir:88 209 PEDYSAILSAL--MPN---AANYAALIDPETGNIRNV-MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDD 282 (347) T ss_pred HHHHHHHhcch--hhh---hhhhccccchhcceeeee-ccceEEEeeccccccccccccccccccccccccccccccccc Confidence 99888773211 111 1111111 1111245666 68888886654211100000011121111 2 Q ss_pred ecCCCccceeEeccc----chhhc---ccccCCccccceeeeeeeece-eecCcccc--cCcccc Q lcl|NC_015279. 394 KGTSPYDAGLFYCPY----VPLQM---VRAVGENTFQPKIGFKTRYGM-VANPFAEG--TTVGAG 448 (467) Q Consensus 394 KG~~~~d~glfyaPY----v~l~~---~~~~Dp~s~qP~~g~~tRY~l-~~nP~~~~--~~~~~~ 448 (467) .++..-..+|||.|= +.+.. -...||+.|-=.|==+..||. +.+|-+-. .....+ T Consensus 283 ~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 283 RVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred ccccCcEEEEEechhhhhheecccceeeeeechhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 233333456777664 22221 112355555443333333433 23442111 111111 No 91 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=31.47 E-value=1.5 Score=19.63 Aligned_cols=270 Identities=13% Similarity=0.089 Sum_probs=110.7 Q ss_pred CCcc-c--eeeeeeee-eecCCCCCcccccccccccccccccccccccccccccccCCCccccccccccccccccccccc Q lcl|NC_015279. 105 MSGP-T--GLIFAMRS-KYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGY 180 (467) Q Consensus 105 mTGP-T--GLIFAMRs-rY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~ 180 (467) |--+ | +-+| -. .|...- .|.| . ....|+.... . .....|. + .... 
T Consensus 1 Ma~~~T~~~~~i--iPev~s~~v-~~~~-~-~~~v~~~~~~-~--------------~~~l~g~-~----------G~tv 49 (278) T protein:vir:80 1 MADLTTKLANLI--DPEVMGPMI-SAKL-P-KAIKFGKIAP-I--------------DNSLEGQ-P----------GSEI 49 (278) T ss_pred CCCcceehhhee--cHHHHHHHH-HHHH-H-Hhhhhcccce-e--------------cccccCC-C----------CCEE Confidence 1100 0 0000 00 000000 0000 0 0000100000 0 0000000 0 0001 Q ss_pred ccccccchhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHh-hCCChhHHHHHHHHHHHHHHhcHH Q lcl|NC_015279. 181 NVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAI-HGLNAEAELANILSSEILAEINRE 259 (467) Q Consensus 181 ~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAi-HGLDAEtELaNILStEImlEINRE 259 (467) ++..--...++|.+.. +..+..-..+..+++++-|-|+- + ++ .-|+-+. -+-|.-.+..+-++.-+..+++++ T Consensus 50 ~ip~~~~~g~a~~~~~-g~~i~~~~lt~~~~~~~i~~~~~-a---~~-v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ 123 (278) T protein:vir:80 50 TVPKYKYIGDAQDVAE-GAAIDYSALETESVKHGIKKAGK-G---VK-LTDESVLSGYGDPVEEAQKQIRMAIASKVDND 123 (278) T ss_pred EEeeeccCCcceeecC-CCcCcccccccceeeEeeehhhc-c---cc-ccHHHHhhccccHHHHHHHHHHHHHHHHHHHH Confidence 1111001122233321 22333334455666666666652 2 22 2344332 367899999999999999999999 Q ss_pred HHHHHhhhcccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcch Q lcl|NC_015279. 260 VIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVL 339 (467) Q Consensus 260 II~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~ 339 (467) ++..+...... +..+-..|..++ +.+.+-.++-++..+ --....+++++|.+...|...... 
T Consensus 124 l~~~l~~a~~~-----~~~~~t~~~~~~-----~~~~~~da~~~l~~~--------~~~~~~~ivv~p~~~~~L~k~~~~ 185 (278) T protein:vir:80 124 ILEEALTTTLE-----VKGAINIGLIDK-----IENTFTDAPDAIEDE--------SITTTGVLFLNYKDTAKLREEAAG 185 (278) T ss_pred HHHHHhccccc-----cccccccchhhh-----HHHHHHHHHHhhccc--------CCCcccEEEECHHHHHHHHhhhhh Confidence 99888654322 111111221100 111122221122111 111234899999999999765444 Q ss_pred hcccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccc-cC Q lcl|NC_015279. 340 DYTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRA-VG 418 (467) Q Consensus 340 ~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~-~D 418 (467) ++.+......+ ..-+-..|.+ .|++||++..... ..-|+ ++ +| . =.|+..= +.. +.. =| T Consensus 186 ~~~~~~~~g~~--~~~~G~ig~~-~G~~Vi~s~~~p~--------~t~~l-~~-~g--A---i~~~~~~-~~~-vE~~Rd 245 (278) T protein:vir:80 186 SWTKASQLGDD--LLVKGAFGEL-LGWEIVRTKKLAD--------GNALA-VK-AG--A---LKTFLKR-NLL-AESGRD 245 (278) T ss_pred hcccccccccc--ceeeccceee-cceeEEEcCCCCc--------ceEEE-Ee-cc--c---eeeeecC-Ccc-cccccc Confidence 43322211111 1112347787 6799999977421 11222 11 12 1 0122111 111 111 28 Q ss_pred Cccccceeeeeeeeceee-cCccc-ccCcccccccccc Q lcl|NC_015279. 419 ENTFQPKIGFKTRYGMVA-NPFAE-GTTVGAGRLRVNS 454 (467) Q Consensus 419 p~s~qP~~g~~tRY~l~~-nP~~~-~~~~~~~~~~~~~ 454 (467) |..++-.+-...+||+.+ ||-.. ..+-. .|. 
T Consensus 246 ~~~~~d~i~~~~~yg~~v~~~~~~v~it~~-----a~~ 278 (278) T protein:vir:80 246 MDHKLTKFNADQHYAVALVDETKAVKVVPV-----AGN 278 (278) T ss_pred hhhccceeeeeeEEEEEEEcCcceEEEeec-----cCC Confidence 889999888888998865 55311 11111 111 No 92 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=31.39 E-value=1.5 Score=19.62 Aligned_cols=304 Identities=17% Similarity=0.110 Sum_probs=126.2 Q ss_pred CCccceeeeeeeeeecCCCCC------cccccccccccccccccccccccccccccccCCCc-ccccccccccccccccc Q lcl|NC_015279. 105 MSGPTGLIFAMRSKYSTQGGT------EALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTS-QAGSNPAALNPVATASS 177 (467) Q Consensus 105 mTGPTGLIFAMRsrY~~qsGt------EAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~-~agt~p~~ln~~~~~~~ 177 (467) |.--++--.+.|.-+++.+|. |-|-+|..+.|.-.. - ..+....- -.+++..-.+..+.... T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s-~----------~~~~~~~r~i~~G~sv~i~~iG~~tv 69 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRS-V----------TADKHIVRTIQNGKSAQFPVMGRTSG 69 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHH-h----------hhcccccccccccceEEEecccceee Confidence 555555555555555444443 223344444442110 0 00000000 00111111111111111 Q ss_pred cccccccccchhhHhhc-CC-CCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHH Q lcl|NC_015279. 178 TGYNVGQGMRTDEAEDL-GT-SGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAE 255 (467) Q Consensus 178 ~~~~~~~Gm~TA~aE~L-Gs-~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlE 255 (467) ..++. 
++.+ |+ ....-.|+-++||++.+ +.+-+.-.-|.++ | .|-..|++.-....+..+ T Consensus 70 ~~~t~--------G~~l~~~~~~~~~~e~~itID~~~~--------~~~~VddiD~~q~-~-~D~~~~~~~~~g~aLa~~ 131 (347) T protein:vir:94 70 VYLAP--------GERLSDKRKGIKHTEKVITIDGLLT--------ADVMIFDIEDAMN-H-YDVAGEYSNQLGEALAIA 131 (347) T ss_pred eeecC--------CCCcCCCCCCCCcceEEEEecchhh--------hhHHhhhHHHHhc-C-cchHHHHHHHHHHHHHHH Confidence 11111 2222 21 12345677888887632 3344554445555 3 788889999999999999 Q ss_pred hcHHHHHHHhhhccc-cc----ccccccceeEEeeccccchhH---HHHHHHHHHHHHHHHHHHHHhhccCCccEEEEch Q lcl|NC_015279. 256 INREVIRTIYKVSEQ-GA----VSNTATAGVFDLDIDSNGRWS---VEKFKGLLFQIERDANAIAQRTRRGKGNMILCSA 327 (467) Q Consensus 256 INREII~~l~~~a~~-~k----~~~~~~~gv~Dl~~~~~~r~~---ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~ 327 (467) +.+-|++.+..++.. .. ..+....-+++.....+.--. ...+-..+++....++ .+-.--.|.|+|.+| T Consensus 132 ~D~~i~~~~~~~aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Ld---e~~VP~~~R~~vv~P 208 (347) T protein:vir:94 132 ADGAVLAEMAILCNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLT---SNYVPAGDRYFYTTP 208 (347) T ss_pred HHHHHHHHHHHHhccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHh---hcCCCCCCcEEEeCH Confidence 999999877544322 11 112222223332222211100 1111122233222222 222333589999999 Q ss_pred HHHHHHhhhcchhccccccc-cccc-ccCCceeEEEecCceEEEecccccccchhhccCCCceEEEE------------- Q lcl|NC_015279. 328 DVASALTMAGVLDYTPALNA-NLNV-DDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVG------------- 392 (467) Q Consensus 328 ~Va~~L~~sG~~~~~~~~~~-~~~~-d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vG------------- 392 (467) +.-++|-.. +.++. .... ...-+-..|.+ .+++||.-+....-..-....+..|-++. 
T Consensus 209 ~~~~~Ll~~------~~~~~~~~~~~~~~~~G~Vg~i-~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~ 281 (347) T protein:vir:94 209 DNYSAILAA------LMPNAANYAALIDPETGNIRNV-MGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSD 281 (347) T ss_pred HHHHHHhcc------chhhhhhccccccccccceEEE-eceEEEecCcccccccccccccCcceecCcccccccccchhh Confidence 999988432 11111 1110 01111246787 78999987654211000001111222211 Q ss_pred EecCCCccceeEecccc----hhhc---ccccCCccccceeeeeeeece-eecCcccc---cCccc Q lcl|NC_015279. 393 YKGTSPYDAGLFYCPYV----PLQM---VRAVGENTFQPKIGFKTRYGM-VANPFAEG---TTVGA 447 (467) Q Consensus 393 yKG~~~~d~glfyaPYv----~l~~---~~~~Dp~s~qP~~g~~tRY~l-~~nP~~~~---~~~~~ 447 (467) |+|+-.-..+|||-|=- .+.. -.-.|+..|-=.|==+..||- +.+|-+-+ .+.++ T Consensus 282 ~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 282 VKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hcccccceeEEEeehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 33433334678887751 2111 111244444432222222322 12331111 11111 No 93 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=29.55 E-value=1.6 Score=19.39 Aligned_cols=346 Identities=10% Similarity=0.019 Sum_probs=126.8 Q ss_pred CcchHHHHHhhhhhhccCccchhcchh-----HHHHHHHHhhhHH---H-----HHHHHhhhhh-----cchhhhhhhhh Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPH-----RRAVTAVLLENQE---K-----FMQEQVAFEQ-----GGMIAEQPTNA 62 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~-----~~~v~~~~~enq~---~-----~~~e~~~~~~-----~~~~~e~~~~~ 62 (467) |=+-+++++|..-+-..+ +-++.... .+.+......-|+ + ..++.+.... ..-|.+.+-.. 
T Consensus 1 ik~L~e~~~e~~e~~~~~-~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 79 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAF-LNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKY 79 (390) T ss_pred CchHHHHHHHHHHHHHHH-HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHH Confidence 555554444443322110 01111111 1111110000000 0 0000000000 00000000000 Q ss_pred ccccccc-ccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccccc Q lcl|NC_015279. 63 VGNGGYT-SSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAG 138 (467) Q Consensus 63 ~g~~~~~-st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg 138 (467) .=..... +++.+.. ..+..-++.+.|+. -+-.+++-+.||++....|.. ..+. .++ .+- T Consensus 80 ~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~---s~i~~~~~~~~~~~~~~~i~~----~~~~--~~a-------~~~- 142 (390) T protein:vir:40 80 YNEVIAGNGFAGVTALLPPTVFERVFEDLTVE---HPLLSKINFVNTTATTEWIIS----VGDV--ATA-------WWG- 142 (390) T ss_pred HHHHHhccCcccCcccccHHHHHHHHHHHHhh---hhhhhhceeeecCCceeEEEE----EcCC--cce-------eee- Confidence 0000000 1111111 11111233344433 345678999999886555431 1110 000 000 Q ss_pred ccccccccccccccccccCCCcccccccccccccccccccccccccccchhh-HhhcCCCCCccceeeeEEEEEEEEeec Q lcl|NC_015279. 139 QNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDE-AEDLGTSGDNFNEMAFSIEKVTVTAKS 217 (467) Q Consensus 139 ~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~-aE~LGs~g~~f~EMaFsIEK~tVtAKS 217 (467) ++ ++.-.+....|.+..|++.|..+- T Consensus 143 --------------------------------------------------~E~~~~~~~~~~~f~~i~l~~~k~~~~--- 169 (390) T protein:vir:40 143 --------------------------------------------------PLCAEIKEVLDNGFDKIQTGMYKLSAY--- 169 (390) T ss_pred --------------------------------------------------ccccccCccccccceeeEeeeeeEEEe--- Confidence 00 000001123588888888887653 Q ss_pred ccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee---------ccc Q lcl|NC_015279. 
218 RALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD---------IDS 288 (467) Q Consensus 218 RaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~---------~~~ 288 (467) ...|-||.+|-- .|.|++|.+.|+..|..-+|+.||.-= |+ + ...|++--. ... T Consensus 170 ----i~iS~ell~ds~----~~l~~~i~~~la~~i~~~~~~a~l~G~------G~--~-~P~Gil~~~~~~~~~~~~~~~ 232 (390) T protein:vir:40 170 ----IPVCNAMLDLGP----SWLDQYVRTILGEAMALGLEAGIVNGS------GK--D-QPIGMMRDLNNVTAGEHPVKT 232 (390) T ss_pred ----ehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHhhhhccc------CC--C-ccceeeecccccccccccccc Confidence 457889999863 468999999999999999999998410 00 0 122222100 000 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEE Q lcl|NC_015279. 289 NGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRV 368 (467) Q Consensus 289 ~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~v 368 (467) .+-..-+-...++..+..-......+. .+++.|++-....+..|...-++. |.+|....+.+.-+++| T Consensus 233 ~~~~t~~~~~~~~~~l~~~~~~~~~~~-~~~a~~i~n~~t~~~~l~~~~~~~-----------d~~G~~v~~~~~~g~pv 300 (390) T protein:vir:40 233 ATPLTDLTPATLATKVMLPLTDNGKKS-VSDAILVINPADYWSKIYAATSYM-----------TPQGVWVTGILPVPLEI 300 (390) T ss_pred ccccchhhHHHHHHHHHHHhhcchhhh-hcCceEEEcchhHHHHHHHHhhcc-----------CCCCccccccCCCceeE Confidence 000000111123333332222222222 234455544445555554332332 33344333444457888 Q ss_pred EecccccccchhhccCCCceEEEEEecCCCccceeEecccchhh----------cccccCCccccceeeeeeeec-eeec Q lcl|NC_015279. 369 YIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQ----------MVRAVGENTFQPKIGFKTRYG-MVAN 437 (467) Q Consensus 369 y~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~----------~~~~~Dp~s~qP~~g~~tRY~-l~~n 437 (467) +++++..... ...-...+ +++|-.+....+.+-. .|-.-+ ....+||++|. ++=++.==| -.+. 
T Consensus 301 v~~~~~p~~~-i~~Gd~s~-~~i~~~~~~~v~~~~~--~~f~~~~~~~r~~~r~dg~v~~~~A~~-~l~~~~~~~~~~~~ 375 (390) T protein:vir:40 301 VQSVAVPVGK-AVAGRAKD-YFMGIGSEQVIRTSTE--YRLLDDETLYYAKQYANGRPKDNSSFL-VFDITGLEGSPAID 375 (390) T ss_pred EEcCCCCCCc-EEEEeece-EEEEeecceEEEecch--hhhhcCcEEEEEEEEeCCEEecccceE-EEEeeccCCCCCCC Confidence 8887753211 00111223 3445444443332110 010111 11123676665 111111101 1334 Q ss_pred Ccccc-cCcccccccccc Q lcl|NC_015279. 438 PFAEG-TTVGAGRLRVNS 454 (467) Q Consensus 438 P~~~~-~~~~~~~~~~~~ 454 (467) ||... ..+.|.. +. T Consensus 376 ~~~~~~~~~~~~~---~~ 390 (390) T protein:vir:40 376 VNVVNNATPSETP---AE 390 (390) T ss_pred cceeeCCCCCCCC---CC Confidence 44332 1111111 11 No 94 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=28.78 E-value=1.7 Score=19.30 Aligned_cols=308 Identities=12% Similarity=0.065 Sum_probs=123.3 Q ss_pred HHHHhhhHHHHHHHHhhhhhcchhhhhhhhhccc---ccccccccccccccCc-hhhhhHHHHHhhhhhhhceeeccCCc Q lcl|NC_015279. 32 TAVLLENQEKFMQEQVAFEQGGMIAEQPTNAVGN---GGYTSSGGQTVAGFDP-VLISLIRRSMPNLVAYDLAGVQPMSG 107 (467) Q Consensus 32 ~~~~~enq~~~~~e~~~~~~~~~~~e~~~~~~g~---~~~~st~tg~i~~~~P-~Lv~l~Rr~~p~LI~~DI~GVQPmTG 107 (467) |+ .|+|-..++.|. .++.|..++ ..-+ +.-.+++...+..+..+++-+.||++ T Consensus 1 ~a--------------------~l~el~~~~~~~~~~g~~~~~~~~---liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~ 57 (333) T protein:vir:78 1 MA--------------------TLNELLPNSAGSNHQGRLAHVPSD---LLPKEIVGPIFDKAQESSLVLRMGEQIPISY 57 (333) T ss_pred Cc--------------------hhHHhhhhcccccccCceecCCcc---ccchhHHHHHHHHHHhhchhhhhcceeeccC Confidence 11 133333332222 122222222 1111 11224455556777888999999876 Q ss_pred cceeeeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccc Q lcl|NC_015279. 
108 PTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMR 187 (467) Q Consensus 108 PTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~ 187 (467) ..--|.-.. . +. ...|-+.+. ... T Consensus 58 ~~~~~p~~~--~----~~-------~a~~v~eg~-------------------------------------------~~~ 81 (333) T protein:vir:78 58 GETIIPTTV--K----RP-------EVGQVGVGT-------------------------------------------SNE 81 (333) T ss_pred CceEEEEEe--C----Cc-------eeEeecCcc-------------------------------------------ccc Confidence 433222111 0 00 001100000 000 Q ss_pred hhhHhhcCCCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhh Q lcl|NC_015279. 188 TDEAEDLGTSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKV 267 (467) Q Consensus 188 TA~aE~LGs~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~ 267 (467) .+++|....++..|.+..++..|..+- ...|-||.+|-. .|.|++|.+.|...|...|+..+|.--- T Consensus 82 ~~e~~~~~~~~~~f~~i~l~~~kl~~~-------~~is~ell~~s~----~~~~~~i~~~la~ai~~~~d~~~l~G~g-- 148 (333) T protein:vir:78 82 QREGGLKPLSGTAWDTRSVSPIKLATI-------VTVSEEFARMNP----SGLYTKLQGDLAYAIGRGIDLAVFHGKS-- 148 (333) T ss_pred ccccccccccccceeEEEEeeEEEEEe-------ehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccC-- Confidence 011111111234567776666666654 457778887754 4689999999999999999999883111 Q ss_pred cccccccccccceeEE------eec-cccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchh Q lcl|NC_015279. 268 SEQGAVSNTATAGVFD------LDI-DSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLD 340 (467) Q Consensus 268 a~~~k~~~~~~~gv~D------l~~-~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~ 340 (467) +.......|+.. ... ...+... ...+.-...+-.....-....++.+|++|.-...|.....+. 
T Consensus 149 ----~~~~~~~~g~~~~~~~~~~~~~~~~~~~~-----~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~ 219 (333) T protein:vir:78 149 ----PLTGSALQGIDTDNVIANTTNVDYLQETG-----DPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYR 219 (333) T ss_pred ----CCCCccccccccccccccccccccccccc-----chhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhc Confidence 111111112111 100 0000000 011111122212222224566778888998877775443332 Q ss_pred cccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEE--------EecCCCccceeEecccchhh Q lcl|NC_015279. 341 YTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVG--------YKGTSPYDAGLFYCPYVPLQ 412 (467) Q Consensus 341 ~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vG--------yKG~~~~d~glfyaPYv~l~ 412 (467) ...+- .-+..+..+ .-.|+|. |++|+++.+..... .....+...+++| ..+..+ +-..+|.-.. T Consensus 220 d~~G~-~i~~~~~~~-~~~~~l~-G~Pv~~~~~i~~~~-~~~~~~~~~~~~gD~~~~~~g~~~~~~----i~~~~~~~~~ 291 (333) T protein:vir:78 220 DANGN-VDPSRINLA-AQTGDVL-GLPAQFGRAVGGDL-GAAVDSKTRIIGGDFSQLKFGFADEIR----IKMSDTATLT 291 (333) T ss_pred CCCCc-eeecCcccc-CCCceee-ceeeEEccccCCCc-cccCCCccEEEEEecccEEEEEeeccE----EEEecccccc Confidence 21110 001111111 1136775 57898887642100 0000111223333 322222 1122321111 Q ss_pred cccccCCcccc-ceee--eeeeecee-ecC--cccccCcccc Q lcl|NC_015279. 413 MVRAVGENTFQ-PKIG--FKTRYGMV-ANP--FAEGTTVGAG 448 (467) Q Consensus 413 ~~~~~Dp~s~q-P~~g--~~tRY~l~-~nP--~~~~~~~~~~ 448 (467) .....--.-|| -.++ ...|++.. .+| |+.-+..... 
T Consensus 292 ~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 292 DSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred ccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 11110001122 1122 34577744 566 4332222211 No 95 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=28.29 E-value=1.8 Score=19.23 Aligned_cols=267 Identities=8% Similarity=0.007 Sum_probs=111.4 Q ss_pred ecCCC-C-CcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcC- Q lcl|NC_015279. 119 YSTQG-G-TEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLG- 195 (467) Q Consensus 119 Y~~qs-G-tEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LG- 195 (467) -.+.. . ..-+.+|.-..+--. .... ....++....... + ........++..--.+.++|.+. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~--~~~~----~~~~~~~~~~~~~------l---~g~~G~tv~ip~~~~~g~~~~~~~ 65 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQA--ELDK----KLRFAQFADIDST------L---VGQPGDTLTFPAFTYSGDAQVIAE 65 (274) T ss_pred CCccccchhhhhhhHHHHHHHHH--HHHh----hhhhccccccccc------c---cCCCCCEEEEEeeccCCCccccCC Confidence 11100 0 001111110000000 0000 0000000000000 0 00000111111110112223332 Q ss_pred CCCCccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccc Q lcl|NC_015279. 196 TSGDNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSN 275 (467) Q Consensus 196 s~g~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~ 275 (467) ....++.++.++ ..+++.|-|+-.-+++=|. ++..+-|.-.+..+-++..++.+++++|+..|....... 
T Consensus 66 g~~i~~~~it~~--~~~~~i~~~~~~~~i~D~~----~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~---- 135 (274) T protein:vir:96 66 GEKIPVDQIGTS--KREAKVRKIGKGTELTDEA----VLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTV---- 135 (274) T ss_pred CCcCchhhcccc--eeEEEEEeeeceeeecHHH----HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc---- Confidence 112234444433 3344445454222333222 123467889999999999999999999998875543221 Q ss_pred cccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCC Q lcl|NC_015279. 276 TATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTG 355 (467) Q Consensus 276 ~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~ 355 (467) ..+.-| .+.+-.++.++..+ -..+++++++|.+++.|..-...++.+.....-+ ... T Consensus 136 -----------~~~~~~-~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~--~~~ 192 (274) T protein:vir:96 136 -----------EADITK-LDGLQTAIDKFNDE---------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDN--IIV 192 (274) T ss_pred -----------Cccccc-HHHHHHHHHHhccc---------CCCceEEEeCHHHHHHHHhccccccccccccccc--cee Confidence 111111 23333343444322 2367899999999999977544433322211111 111 Q ss_pred ceeEEEecCceEEEecccccccchhhccCCC-ceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeece Q lcl|NC_015279. 356 NTFAGVLQGKYRVYIDPYSSNLTSANAANGN-QYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGM 434 (467) Q Consensus 356 ~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~-dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l 434 (467) +-..|.+ .|++|++|... |+ .=+++| +|.-. |+.. .+...-.--||.+++-.+-...+||. T Consensus 193 ~g~ig~~-~G~~Vi~s~~~----------p~~t~~l~~-~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~ 254 (274) T protein:vir:96 193 KGAFGEA-LGAVIVRSNKL----------NKGEALLAK-KGAVK-----LITK-RDFFLEKDRDASRKSTALYSDKHYVA 254 (274) T ss_pred eccccee-cCeeEEEcCCC----------CcceEEEEe-Cccee-----eeec-CCcccccccchhhcccEEEEeeEEEE Confidence 2237777 57899999764 22 112222 12111 1111 01111111389999998888889998 Q ss_pred ee-cCc--ccccCccccccccccccccceeeeeccC Q lcl|NC_015279. 
435 VA-NPF--AEGTTVGAGRLRVNSNRYYRRVAVKNLM 467 (467) Q Consensus 435 ~~-nP~--~~~~~~~~~~~~~~~n~y~r~~~v~~~~ 467 (467) .. ||= ..-+...-..+ | T Consensus 255 ~~~~~~~vv~~t~~~~~~~----------------~ 274 (274) T protein:vir:96 255 YLYDESKVVKITKGAGDEV----------------M 274 (274) T ss_pred EEEcCccEEEEEcCccccc----------------C Confidence 65 551 11122211111 1 No 96 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=27.70 E-value=1.8 Score=19.16 Aligned_cols=258 Identities=10% Similarity=-0.018 Sum_probs=112.7 Q ss_pred cccccccccccccccccc------ccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCC-CCccce Q lcl|NC_015279. 131 EADTAFAGQNEGFDLTNG------MSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTS-GDNFNE 203 (467) Q Consensus 131 Eadt~fSg~~a~~~~~~~------~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~-g~~f~E 203 (467) +|.|.++.---.--.+.. ......+...... .+ .......-+++..=...++|.+... .-+..+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~------~L---~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~ 71 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDD------TL---VGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQ 71 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhcccccccc------cc---CCCCCCEEEeeeecCCCccccccCCCccchhh Confidence 222222110000000000 0000000000000 00 0000111111111122344555421 223444 Q ss_pred eeeEEEEEEEEeecccccccccHHHHHHHHHhh-CCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeE Q lcl|NC_015279. 
204 MAFSIEKVTVTAKSRALKAEYSLELAQDLKAIH-GLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVF 282 (467) Q Consensus 204 MaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiH-GLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~ 282 (467) ++ ..+.+++.|-|+-.-++| ||.+.- |-|.=.|..+-++.-|+.+++.++|..|..+...- +..+ T Consensus 72 lt--~~~~~a~i~~~gk~~~it-----D~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~-------~~~~ 137 (270) T protein:vir:95 72 MS--MTTTKVTVKETGKAVEVT-----QTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA-------TVSA 137 (270) T ss_pred cc--cchheeeeehhhCcceec-----HHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------cccc Confidence 44 556666667777555555 443322 45999999999999999999999998877543221 1111 Q ss_pred EeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEe Q lcl|NC_015279. 283 DLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVL 362 (467) Q Consensus 283 Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l 362 (467) + .+-+-..+.++.-| -..-++++|.|.+++.|....|+++...-++. ..+-..|.+ T Consensus 138 t----------~~~~~dA~~~lgd~---------~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~-----~~~G~ig~~ 193 (270) T protein:vir:95 138 D----------ATGILDAIEVFNSE---------NDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRA-----ISKGDLVEI 193 (270) T ss_pred C----------HHHHHHHHHHhccc---------cCCCcEEEEcHHHHHHHHhhhcccccccccch-----hccccccee Confidence 1 12232333334322 34458999999999999887777643222111 111236676 Q ss_pred cCceEEEecccccccchhhccCCCceEEEEEe-cCCCccceeEecccchhhcccc-cCCccccceeeeeeeeceee-cCc Q lcl|NC_015279. 363 QGKYRVYIDPYSSNLTSANAANGNQYYVVGYK-GTSPYDAGLFYCPYVPLQMVRA-VGENTFQPKIGFKTRYGMVA-NPF 439 (467) Q Consensus 363 ~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyK-G~~~~d~glfyaPYv~l~~~~~-~Dp~s~qP~~g~~tRY~l~~-nP~ 439 (467) .|++|++|... +.+|-..-+| |+-. |+-.= +.. ... =|+..++-.+--..+|++.. 
||= T Consensus 194 -~G~~Viv~s~~----------~~~~~~~l~~~gAi~-----~~~~~-~~~-vEtdRd~~~~~d~i~~~~~y~v~~~~~s 255 (270) T protein:vir:95 194 -VGVSDIVKSKR----------VSENTAFLQRYGAME-----IVNKK-KPE-AYTDFDILKRTHLLSTNYHYSVNLKDET 255 (270) T ss_pred -cceeEEEeCCC----------CCceeEEEEecccee-----eeecC-Cce-eeeccchhhcccEEEeeeEEEEEEEccc Confidence 46899887553 2344333333 1111 11000 000 111 16777776666666676643 211 Q ss_pred -ccccCcccccccccccc Q lcl|NC_015279. 440 -AEGTTVGAGRLRVNSNR 456 (467) Q Consensus 440 -~~~~~~~~~~~~~~~n~ 456 (467) ....+-.+++ -.+| T Consensus 256 kvv~~t~~~a~---~~~~ 270 (270) T protein:vir:95 256 GVVKVTFKPSG---SLEM 270 (270) T ss_pred eEEEEEecCCC---CcCC Confidence 0001111111 1122 No 97 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=27.08 E-value=1.9 Score=19.08 Aligned_cols=332 Identities=12% Similarity=0.066 Sum_probs=124.8 Q ss_pred CcchHHHHHhhhhhhc--------------cCcc--chhcchhHHHHHHHH-----hhhHHHHHHHHhh----------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLN--------------YEGL--DKISDPHRRAVTAVL-----LENQEKFMQEQVA----------- 48 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~--------------~~~~--~~i~~~~~~~v~~~~-----~enq~~~~~e~~~----------- 48 (467) |-+-++|.++|.-+.+ .+.. -+|.. .++++.... ++.|-+.+.++.. T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPL 82 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 3356677777643321 1111 12211 111111100 1111111100000 Q ss_pred ------------hhhcchhhhh----hhhhccccccccccccc---ccccCchhhhhHHHHHhhhhhhhceeeccCCccc Q lcl|NC_015279. 
49 ------------FEQGGMIAEQ----PTNAVGNGGYTSSGGQT---VAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPT 109 (467) Q Consensus 49 ------------~~~~~~~~e~----~~~~~g~~~~~st~tg~---i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPT 109 (467) ..+..++-.. ...........++..|. -..+.+.++.+.| +.....+++.+.||+++. T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~---~~~~l~~~~~~~~~~~~~ 159 (408) T protein:vir:10 83 NKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR---QYDSLQQYVRVESVSTSN 159 (408) T ss_pred ccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHH---hhchhhhhcceeeccCCc Confidence 0000000000 00000000001111111 1122334444444 455678999999999998 Q ss_pred eeeeeeeeeecCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchh Q lcl|NC_015279. 110 GLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTD 189 (467) Q Consensus 110 GLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA 189 (467) |-+--.| ..+.++ . ..+ .+ T Consensus 160 ~~~~~~~--~~~~~~-~-------a~~---------------------------------------------------v~ 178 (408) T protein:vir:10 160 GSRVYEK--WTDVTP-L-------TVM---------------------------------------------------DA 178 (408) T ss_pred ceEEEee--cccccc-c-------eee---------------------------------------------------ec Confidence 8765443 100000 0 000 00 Q ss_pred hHhhcCCCC-CccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhc Q lcl|NC_015279. 190 EAEDLGTSG-DNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVS 268 (467) Q Consensus 190 ~aE~LGs~g-~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a 268 (467) +++.....+ ..|.++.|...|..+- ..+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-+. 
T Consensus 179 E~~~~~~~~~~~~~~i~~~~~k~~~~-------~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~- 246 (408) T protein:vir:10 179 EDGKIPDLDNPQLTIIKYLIKRYAGI-------ITATNTSLKDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAA- 246 (408) T ss_pred CccccccccCcceeeEEeeeeeEEee-------ehhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccc- Confidence 001000111 2477777777766654 45999999994 3577889999999999998888887432221 Q ss_pred ccccccccccceeEEeeccccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc Q lcl|NC_015279. 269 EQGAVSNTATAGVFDLDIDSNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN 348 (467) Q Consensus 269 ~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~ 348 (467) ....++.++ +....+++.. ....-+..+ .+|||+.....|... ....+- .- T Consensus 247 -------~~~~~~~~~----------~~l~~~~~~~-------~~~~~~~~a-~~v~n~~~~~~l~~l---kd~~G~-~i 297 (408) T protein:vir:10 247 -------PKKPTIAKF----------DDVITMINTA-------VDPAIIATS-SLLTNQSGLNKLALV---KTAEGK-YL 297 (408) T ss_pred -------ccccccccH----------HHHHHHHHHh-------hhhhhccCC-EEEEcHHHHHHHHHh---hccCCc-eE Confidence 112222221 1112222111 111112222 467999988888652 221110 00 Q ss_pred cccccCCceeEEEecCceEEEecccccccc--h----hhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccc Q lcl|NC_015279. 349 LNVDDTGNTFAGVLQGKYRVYIDPYSSNLT--S----ANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTF 422 (467) Q Consensus 349 ~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~--~----~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~ 422 (467) +..+.++ -..++| .|++|++-.+..... + ...-...++++++-++..... +.++.- .+-.+. T Consensus 298 ~~~~~~~-~~~~~l-~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~----~~~~~~------~~f~~~ 365 (408) T protein:vir:10 298 LEPDPTK-PNSYLI-KGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLL----PTNIGA------GAFETD 365 (408) T ss_pred eccCcCC-CCCcee-cceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEE----Eccccc------chhhcC Confidence 1111111 112466 566666532211000 0 000011234444444332211 222110 000112 Q ss_pred cceeeeeeeeceee-cC----------cccc--cCcccc--cc Q lcl|NC_015279. 
423 QPKIGFKTRYGMVA-NP----------FAEG--TTVGAG--RL 450 (467) Q Consensus 423 qP~~g~~tRY~l~~-nP----------~~~~--~~~~~~--~~ 450 (467) +=.+-+..||+..+ +| -+.. .+..++ .+ T Consensus 366 ~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 366 TTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred ceEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCcccC Confidence 22333334444331 22 1110 111111 11 No 98 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=26.08 E-value=2 Score=18.95 Aligned_cols=325 Identities=14% Similarity=0.058 Sum_probs=107.7 Q ss_pred CcchHHHHHhhhhhhccCc------cc---------hhcchh-HHHHHHHHhhhHHHHHHHHhh---------------- Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEG------LD---------KISDPH-RRAVTAVLLENQEKFMQEQVA---------------- 48 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~------~~---------~i~~~~-~~~v~~~~~enq~~~~~e~~~---------------- 48 (467) .-+.+++.+...-.++... .. ++.... ...-.....+.+.+...+... T Consensus 56 ~~~l~~~~~~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 135 (437) T protein:vir:10 56 RSNIEVLEQASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGE 135 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHH Confidence 0000111111111110000 00 000000 000000111111110000000 Q ss_pred ------hhhcchhhhhhhhhcccccccccccccccccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCC Q lcl|NC_015279. 49 ------FEQGGMIAEQPTNAVGNGGYTSSGGQTVAGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQ 122 (467) Q Consensus 49 ------~~~~~~~~e~~~~~~g~~~~~st~tg~i~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~q 122 (467) ..+...+.+........ ..++..|.+. -..+...++.........+++.|.||+.+.+-+--.+. 
T Consensus 136 ~~~~~~~~~~~~~~~~e~~~~~~--~~~~~~g~lv--p~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----- 206 (437) T protein:vir:10 136 IADKKVTAFADYLKTGEVRDVTG--IALKDGKVII--PETILTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNN----- 206 (437) T ss_pred HHHhhhhhhHHHHHhhhhhhhhh--cccccccccc--hHHHHHHHHHhhhhhhhhhcceeEeeccCceeeEEeec----- Confidence 00000011000000000 0011111110 01111112111111134566777777776654333321 Q ss_pred CCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcC-CCCCcc Q lcl|NC_015279. 123 GGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLG-TSGDNF 201 (467) Q Consensus 123 sGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LG-s~g~~f 201 (467) ++..+ .+ ..+.+... ++...| T Consensus 207 ~~~~~-------~~---------------------------------------------------~~e~~~~~e~~~~~~ 228 (437) T protein:vir:10 207 STDLL-------TA---------------------------------------------------HTEYGQTTKNATPVI 228 (437) T ss_pred ccccc-------cc---------------------------------------------------ccccccccccccccc Confidence 00000 00 00000001 012347 Q ss_pred ceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccccee Q lcl|NC_015279. 202 NEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGV 281 (467) Q Consensus 202 ~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv 281 (467) .++.|.+.|..+ -..+|-||.+|- ..|.+++|.+.|+..|..-+|..||.-+-+ +....+++... T Consensus 229 ~~v~~~~~k~~~-------~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~----~~~~~~~~~~~ 293 (437) T protein:vir:10 229 TPILWDLKTYTG-------GYVFSQELISDS----SYDWQAELQSRLIELRDNTDDSLIITALTD----GIKKTTSTYLL 293 (437) T ss_pred eeeeeehhheee-------ehhhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhhhcc----cccccccccch Confidence 777777766654 367899999984 357888999999999999999998864422 11111111111 Q ss_pred EEeeccccchhHHHHHHHH-HHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc-ccccCCceeE Q lcl|NC_015279. 
282 FDLDIDSNGRWSVEKFKGL-LFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGNTFA 359 (467) Q Consensus 282 ~Dl~~~~~~r~~ve~~~~l-~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~-~~d~t~~~~~ 359 (467) .|+ ..+ -+.+... -+. +-..||+|.....|... ....+ ..+ ..+.++. .. T Consensus 294 ~~~-------------~~~~~~~l~~~--------~~~-~~~~~~~~~~~~~l~~l---kd~~g--~~~~~~~~~~~-~~ 345 (437) T protein:vir:10 294 GDL-------------KKVLNVTLKPQ--------DSA-AASIVMSQSAYNLFDMA---TDAMG--RPLLQPNVTAA-TG 345 (437) T ss_pred hhH-------------HHHHHhhhhhh--------hhc-CCEEEEcHHHHHHHHHh---hccCC--CeeeccCccCC-CC Confidence 111 111 0111111 112 23568899888877542 11111 011 1111111 13 Q ss_pred EEecCceEEEecccccccch----hhccC--CCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeec Q lcl|NC_015279. 360 GVLQGKYRVYIDPYSSNLTS----ANAAN--GNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYG 433 (467) Q Consensus 360 G~l~~~~~vy~D~y~~~~~~----~~~~~--~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~ 433 (467) ++|. |++|++......+.. ..++. -.+|+++....... ....-+-+.++..+.+..||+ T Consensus 346 ~~l~-G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~--------------~~~~~~~~~~~~~~~~~~r~d 410 (437) T protein:vir:10 346 YTLL-GKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEIT--------------GQFQDTYDIWYKQLGIFLRQN 410 (437) T ss_pred cccc-cceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceE--------------EEEecccccccceeeEEEEEc Confidence 4674 466665332210000 00000 01222221110000 000012233444555566876 Q ss_pred ee-ecC--ccc-------ccCcccccc Q lcl|NC_015279. 434 MV-ANP--FAE-------GTTVGAGRL 450 (467) Q Consensus 434 l~-~nP--~~~-------~~~~~~~~~ 450 (467) .. ++| |+. 
.+...++.+ T Consensus 411 ~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 411 VVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred cEEecccceEEEEeeccccccCCCCCC Confidence 53 344 211 011111212 No 99 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=24.43 E-value=2.2 Score=18.73 Aligned_cols=321 Identities=16% Similarity=0.191 Sum_probs=119.9 Q ss_pred Cc---------chHHHHHhhhhhhc------c------------------------CccchhcchhHHHHHHHHhhhHHH Q lcl|NC_015279. 1 MF---------QSEQLQEKWAPLLN------Y------------------------EGLDKISDPHRRAVTAVLLENQEK 41 (467) Q Consensus 1 ~~---------~~~~l~~kw~p~l~------~------------------------~~~~~i~~~~~~~v~~~~~enq~~ 41 (467) .+ ..+++.+....+-+ . .+.......+|+.+...|..-++. T Consensus 18 ~~~~~k~~~~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~lr~~~~~ 97 (401) T protein:vir:44 18 KFDDFKAKNDKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKVAAEHKDAFVGFLRKGRED 97 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhHHHHHHHHHHHhhhhhh Confidence 00 00011111111000 0 000111112222222222111111 Q ss_pred HHHHHhhhhhcchhhhhhhhhccccccccccccc-c-cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeee Q lcl|NC_015279. 42 FMQEQVAFEQGGMIAEQPTNAVGNGGYTSSGGQT-V-AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKY 119 (467) Q Consensus 42 ~~~e~~~~~~~~~~~e~~~~~~g~~~~~st~tg~-i-~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY 119 (467) . +.+.+....... .++..|. | ..+.+-++.+.|.. .+..+++-+.||++++..+.-.. 
T Consensus 98 ~------------~~~~e~~a~~~~--~~~~GG~~iP~~~~~~ii~~~~~~---~~l~~~~~~~~~~~~~~~~~~~~--- 157 (401) T protein:vir:44 98 G------------LRDLERKALQVG--TDEDGGYAVPEELDRSILSLLKDE---VVMRQEATVITVGGSDYKKLVNL--- 157 (401) T ss_pred h------------hHHHHHHHhhcC--CCCCCceeccHhHHHHHHHHHHhh---hhhhhhceeeecCCCceEEEEec--- Confidence 1 111111110000 0111111 1 34566677777744 35678899999998864332111 Q ss_pred cCCCCCcccccccccccccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CC Q lcl|NC_015279. 120 STQGGTEALFDEADTAFAGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SG 198 (467) Q Consensus 120 ~~qsGtEAlfnEadt~fSg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g 198 (467) ++..+ .+ .++++.... .. T Consensus 158 ---~~~~a-------~w---------------------------------------------------v~E~~~~~~~~~ 176 (401) T protein:vir:44 158 ---GGTAS-------GW---------------------------------------------------VGETDTRSQTAT 176 (401) T ss_pred ---CCccc-------ee---------------------------------------------------eccccccCcccc Confidence 00000 00 000000000 11 Q ss_pred CccceeeeEEEEEEEEeecccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhccccccccccc Q lcl|NC_015279. 199 DNFNEMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTAT 278 (467) Q Consensus 199 ~~f~EMaFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~ 278 (467) ..|.+..|.+.|..+ -..+|-||.+|- .+|.+++|.+-|+..|...+++.+|. |.-.+ .. T Consensus 177 ~~~~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~ai~~~~~~~~l~--------G~G~~-~p 236 (401) T protein:vir:44 177 SRLGLIEPFMGEIYG-------NPQATQKMLDDA----FFNVEAWINSELATEFAEQEEIAFTT--------GDGTK-KP 236 (401) T ss_pred ccceeeeeehhheee-------ehhhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhhhc--------cCCCC-cc Confidence 246666666665544 356899999984 35789999999999999888888883 11110 22 Q ss_pred ceeEEeecc------------------ccchhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchh Q lcl|NC_015279. 
279 AGVFDLDID------------------SNGRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLD 340 (467) Q Consensus 279 ~gv~Dl~~~------------------~~~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~ 340 (467) .|++..... ..+.-..+....+.+.+..+ -+. +..+|+++.....|.. |. T Consensus 237 ~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~--------~~~-~a~~v~n~~~~~~L~~---lk 304 (401) T protein:vir:44 237 KGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKA--------HRT-GAKFMMNNNSLFAIRL---LK 304 (401) T ss_pred ceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchh--------hhc-CCEEEEcHHHHHHHHH---hh Confidence 333322110 00111112223333333221 122 2356788888877753 22 Q ss_pred cccccccccccccCCceeEEEecCceEEEecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCc Q lcl|NC_015279. 341 YTPALNANLNVDDTGNTFAGVLQGKYRVYIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGEN 420 (467) Q Consensus 341 ~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~ 420 (467) ...+- .-+..+.+. --.++| -|++|+++...... -.+.+.+++| +-. -+|-=+..-.+....||- T Consensus 305 d~~G~-~l~~~~~~~-g~~~~l-~G~PVv~~~~~p~~-----~~~~~~i~~G---d~~----~~~~i~~~~~~~~~~~~~ 369 (401) T protein:vir:44 305 DTEGN-YLWRPGLEL-GQPSSL-AGYGIAENEQMPDI-----AADAKAIAFG---NFK----RGYTIVDRIGTRILRDPY 369 (401) T ss_pred ccCCc-eeecCCcCC-CCCcee-cceeeEEecCcCCc-----cCCccEEEEe---ehh----ccEEEEEecceEEeeecc Confidence 21110 001111111 112467 46777776553110 0111222222 110 000000000111112332 Q ss_pred cccceeeeee--eeceee-cCcccccCccccccccccccccceeeeecc Q lcl|NC_015279. 421 TFQPKIGFKT--RYGMVA-NPFAEGTTVGAGRLRVNSNRYYRRVAVKNL 466 (467) Q Consensus 421 s~qP~~g~~t--RY~l~~-nP~~~~~~~~~~~~~~~~n~y~r~~~v~~~ 466 (467) .=+-.++|.. 
|+|..+ +| ..|+.+.|+-- T Consensus 370 ~~~~~v~~~a~~r~d~~~~~~-----------------~a~~~l~~~aa 401 (401) T protein:vir:44 370 TNKPFVGFYTTKRTGGMLVDS-----------------QAIKLLKIAAA 401 (401) T ss_pred ccCCcEEEEEEEEeccEEecc-----------------cceEEEEeecC Confidence 2233333333 444322 22 22333333333 No 100 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=22.56 E-value=2.4 Score=18.47 Aligned_cols=327 Identities=12% Similarity=0.108 Sum_probs=118.7 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHH---HHHHhh-----hHHHHH-HHHhhhhhcchhhhhhhhhccccccccc Q lcl|NC_015279. 1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAV---TAVLLE-----NQEKFM-QEQVAFEQGGMIAEQPTNAVGNGGYTSS 71 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v---~~~~~e-----nq~~~~-~e~~~~~~~~~~~e~~~~~~g~~~~~st 71 (467) .....+.+++=.-+ + -+|... ++.+ ...+-+ ++.+.. +++....+..++...+.... .....+ T Consensus 64 ~~~~~e~~~~~~~~-~----~ei~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e~~~a--l~~~t~ 135 (425) T protein:vir:10 64 GLPTSDALAKVDKV-S----ADLEAL-QAAVDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRGDVQAA--LNKGED 135 (425) T ss_pred hhccHHHHHHHHHH-H----HHHHHH-HHHHHHHHHHHHhhhcccccccccccHHHHHHHHHHhhhhhhHHH--hhcCcC Confidence 11111111110000 0 011100 0000 000000 000000 00001111112211111000 000011 Q ss_pred ccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccccccccccccccc Q lcl|NC_015279. 72 GGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQNEGFDLTNG 148 (467) Q Consensus 72 ~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~a~~~~~~~ 148 (467) +.|.+ ..+.+- +++...+..+..+++.|-||+++..-+.-. 
.++..+ .+ T Consensus 136 ~~gG~lvP~~~~~~---ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~------~~~~~a-------~w------------ 187 (425) T protein:vir:10 136 SEGGYLTPIEWDRT---ITNKLVLISPMRQLCRVQPVSKAGFSKLFN------MGGTTS-------GW------------ 187 (425) T ss_pred CCCceeccHhHHHH---HHHHHHhhhhhhhhceeeeccCCceEEEEE------cCCcce-------ee------------ Confidence 11111 122233 444444556778899999998776533310 001000 00 Q ss_pred ccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCC-CccceeeeEEEEEEEEeecccccccccHH Q lcl|NC_015279. 149 MSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSG-DNFNEMAFSIEKVTVTAKSRALKAEYSLE 227 (467) Q Consensus 149 ~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g-~~f~EMaFsIEK~tVtAKSRaLKAEYT~E 227 (467) .++++.....+ ..|.++.|++-|..+ ...+|-| T Consensus 188 ---------------------------------------v~E~~~~~~~~~~~f~~v~~~~~k~~~-------~i~iS~e 221 (425) T protein:vir:10 188 ---------------------------------------VGEASQRPQTNAATFQPLSFASGEIYA-------NPAATQQ 221 (425) T ss_pred ---------------------------------------eccccccccccccccceeeeeheeeEe-------ehHhHHH Confidence 00111111011 247777777777655 4568999 Q ss_pred HHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeec------------------ccc Q lcl|NC_015279. 228 LAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDI------------------DSN 289 (467) Q Consensus 228 LAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~------------------~~~ 289 (467) |.+|-. .|.+++|.+-|+..|..-+|+-||.- .-.+ ...|++.... ... T Consensus 222 ll~ds~----~~l~~~i~~~la~ai~~~~d~~~l~G--------~G~~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 288 (425) T protein:vir:10 222 ILDDAE----IDLESWLATEVQTEFAKQEGKAFLAG--------DGTN-KPNGLLTYIAGGANAAKHPFGAIEVVNSGAA 288 (425) T ss_pred HHhcch----hHHHHHHHHHHHHHHHHHHHhhhhcc--------cCCC-Ccceeeecccccccccccccccccccccccc Confidence 999853 56889999999999999999988731 0000 1223322110 001 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccc-cccccCCceeEEEecCceEE Q lcl|NC_015279. 
290 GRWSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-LNVDDTGNTFAGVLQGKYRV 368 (467) Q Consensus 290 ~r~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~-~~~d~t~~~~~G~l~~~~~v 368 (467) +--..+....|.+.+... -+.. ..+|++|.....|... ....+ .. +..+.+. -..++|. |++| T Consensus 289 ~~~~~d~l~~l~~~l~~~--------~~~~-a~~vmn~~~~~~L~~l---kD~~G--~~l~~~~~~~-g~~~~l~-G~PV 352 (425) T protein:vir:10 289 ADITSDGIIDLVYDLPSA--------FTGN-ARFAMNRNTQRQVRKL---KDGQG--NYLWQPSYVA-GQPATLA-GYPV 352 (425) T ss_pred ccccHHHHHHHHhhhhhh--------hccC-CEEEEchHHHHHHHHh---hcCCC--ceeeccCccC-CCCceec-ceee Confidence 111112222333332211 2233 3468899888887542 22111 01 1111111 1135774 5788 Q ss_pred EecccccccchhhccCCCceEEEEEecCCCccceeEecccchhhcccccCCccccceee--eeeeecee-ecCcccccCc Q lcl|NC_015279. 369 YIDPYSSNLTSANAANGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIG--FKTRYGMV-ANPFAEGTTV 445 (467) Q Consensus 369 y~D~y~~~~~~~~~~~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g--~~tRY~l~-~nP~~~~~~~ 445 (467) +++.+..... .+.+-+++| +-. ...+.+. ...+....||-.-+-.++ ...||+.. .+|-+-..-. T Consensus 353 ~~~~~~p~~~-----~~~~~i~~G---d~~--~~~~i~~--~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~ 420 (425) T protein:vir:10 353 TEVPDMPDVA-----ANSTPILFG---DFQ--QTYLIID--RIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMK 420 (425) T ss_pred EEecCcCCcc-----CCccEEEEE---ehh--ccEEEEE--ecceEEEecccccCCcEEEEEEEEeccEeecccceEEEE Confidence 8886643211 122333333 110 0001110 001111123222222233 33466543 3443221100 Q ss_pred cccccccccc Q lcl|NC_015279. 446 GAGRLRVNSN 455 (467) Q Consensus 446 ~~~~~~~~~n 455 (467) . ..+. 
T Consensus 421 ~-----~as~ 425 (425) T protein:vir:10 421 V-----AASE 425 (425) T ss_pred e-----eccC Confidence 0 0111 No 101 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=22.30 E-value=2.5 Score=18.43 Aligned_cols=282 Identities=12% Similarity=0.046 Sum_probs=119.3 Q ss_pred hhcccccc---cccccccccccCchh-hhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccc Q lcl|NC_015279. 61 NAVGNGGY---TSSGGQTVAGFDPVL-ISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAF 136 (467) Q Consensus 61 ~~~g~~~~---~st~tg~i~~~~P~L-v~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~f 136 (467) ..+++.-. ..|.++.. ..-+.+ -.+++...++.+..+++-+=||++..--|. ++.+ +.++ .+ T Consensus 1 ma~~~~~~~~~~~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~~--~~~a-------~~ 66 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT----YLAK--GVGA-------YW 66 (304) T ss_pred CcccccccccccccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEeC--Ccce-------EE Confidence 11111111 11112221 122222 245555556667788888888877542211 1110 0000 00 Q ss_pred ccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEee Q lcl|NC_015279. 137 AGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAK 216 (467) Q Consensus 137 Sg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAK 216 (467) . +| +.++++-.-+++++++..| T Consensus 67 ---------------------------------------------------v--~E-----~~~~~~~~~~~~~i~~~~~ 88 (304) T protein:vir:94 67 ---------------------------------------------------V--SE-----TERIQTSKPEYAQAEMEAK 88 (304) T ss_pred ---------------------------------------------------e--ec-----CcccccccceeeEEEEEEE Confidence 0 01 1123333444556666666 Q ss_pred cccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee-----ccccch Q lcl|NC_015279. 
217 SRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD-----IDSNGR 291 (467) Q Consensus 217 SRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~-----~~~~~r 291 (467) ..+-...+|-||.+|- .+|.|+.|.+-|...|...||+.+|.---+. +-.+....+.+.-. ...++. T Consensus 89 k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (304) T protein:vir:94 89 KIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSP----YNTSTSGKPLVEGAEEKGNVVTDTN 160 (304) T ss_pred EEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCCC----ccccccccccccccccccccccccc Confidence 6666778999999875 3678899999999999999998887421110 00111111111100 001111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEec Q lcl|NC_015279. 292 WSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYID 371 (467) Q Consensus 292 ~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D 371 (467) ...+....++.++. . ......-+||+|.....|... .... +...-.+. .|+| .|++||++ T Consensus 161 ~~~~~i~~~~~~l~--------~-~~~~~~~~v~~~~~~~~L~~l---kd~~---G~~l~~~~----~~~l-~G~PV~~~ 220 (304) T protein:vir:94 161 NLYVDLSALMATIE--------D-EELDPNGVLTTRSFRSKMRNA---LDAN---DRPLFDAN----GNEI-MGLPLSYT 220 (304) T ss_pred chHHHHHHHHHHhh--------h-ccCCcCEEEEcHHHHHHHHHh---hccC---CcEeecCC----Cccc-cceeeEEe Confidence 12222223322321 1 223344578999999888642 2111 11111111 2566 46899988 Q ss_pred ccccccchhhc--cCCCceEEEEEecCCCccceeEecccchhh--cccccCCcc-----cc---ceeeeeeeeceee-cC Q lcl|NC_015279. 372 PYSSNLTSANA--ANGNQYYVVGYKGTSPYDAGLFYCPYVPLQ--MVRAVGENT-----FQ---PKIGFKTRYGMVA-NP 438 (467) Q Consensus 372 ~y~~~~~~~~~--~~~~dY~~vGyKG~~~~d~glfyaPYv~l~--~~~~~Dp~s-----~q---P~~g~~tRY~l~~-nP 438 (467) .+......+.. .-.+.++++|..+..+.+- ..+.. +....|++. 
|| =.+=...||++.+ || T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~------~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~ 294 (304) T protein:vir:94 221 GADVYDKKKSLALMGDWDYARYGILQGIEYAI------SEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP 294 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecceEEEE------eecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc Confidence 77532111100 0012234455544333210 00000 111112221 22 2233345776543 33 Q ss_pred cccccCcccccccccc Q lcl|NC_015279. 439 FAEGTTVGAGRLRVNS 454 (467) Q Consensus 439 ~~~~~~~~~~~~~~~~ 454 (467) = .-..+.+.+ T Consensus 295 ~------a~~~l~~a~ 304 (304) T protein:vir:94 295 E------AFATLKPTE 304 (304) T ss_pred c------ceEEEEecC Confidence 1 001111111 No 102 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=22.30 E-value=2.5 Score=18.43 Aligned_cols=282 Identities=12% Similarity=0.046 Sum_probs=119.3 Q ss_pred hhcccccc---cccccccccccCchh-hhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccc Q lcl|NC_015279. 61 NAVGNGGY---TSSGGQTVAGFDPVL-ISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAF 136 (467) Q Consensus 61 ~~~g~~~~---~st~tg~i~~~~P~L-v~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~f 136 (467) ..+++.-. ..|.++.. ..-+.+ -.+++...++.+..+++-+=||++..--|. ++.+ +.++ .+ T Consensus 1 ma~~~~~~~~~~~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~~--~~~a-------~~ 66 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT----YLAK--GVGA-------YW 66 (304) T ss_pred CcccccccccccccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEeC--Ccce-------EE Confidence 11111111 11112221 122222 245555556667788888888877542211 1110 0000 00 Q ss_pred ccccccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCCCCCccceeeeEEEEEEEEee Q lcl|NC_015279. 
137 AGQNEGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGTSGDNFNEMAFSIEKVTVTAK 216 (467) Q Consensus 137 Sg~~a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs~g~~f~EMaFsIEK~tVtAK 216 (467) . +| +.++++-.-+++++++..| T Consensus 67 ---------------------------------------------------v--~E-----~~~~~~~~~~~~~i~~~~~ 88 (304) T protein:vir:10 67 ---------------------------------------------------V--SE-----TERIQTSKPEYAQAEMEAK 88 (304) T ss_pred ---------------------------------------------------e--ec-----CcccccccceeeEEEEEEE Confidence 0 01 1123333444556666666 Q ss_pred cccccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEee-----ccccch Q lcl|NC_015279. 217 SRALKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLD-----IDSNGR 291 (467) Q Consensus 217 SRaLKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~-----~~~~~r 291 (467) ..+-...+|-||.+|- .+|.|+.|.+-|...|...||+.+|.---+. +-.+....+.+.-. ...++. T Consensus 89 k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~----~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (304) T protein:vir:10 89 KIGVIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSP----YNTSTSGKPLVEGAEEKGNVVTDTN 160 (304) T ss_pred EEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCCC----ccccccccccccccccccccccccc Confidence 6666778999999875 3678899999999999999998887421110 00111111111100 001111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhcccccccccccccCCceeEEEecCceEEEec Q lcl|NC_015279. 292 WSVEKFKGLLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANLNVDDTGNTFAGVLQGKYRVYID 371 (467) Q Consensus 292 ~~ve~~~~l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~~~d~t~~~~~G~l~~~~~vy~D 371 (467) ...+....++.++. . ......-+||+|.....|... .... +...-.+. 
.|+| .|++||++ T Consensus 161 ~~~~~i~~~~~~l~--------~-~~~~~~~~v~~~~~~~~L~~l---kd~~---G~~l~~~~----~~~l-~G~PV~~~ 220 (304) T protein:vir:10 161 NLYVDLSALMATIE--------D-EELDPNGVLTTRSFRSKMRNA---LDAN---DRPLFDAN----GNEI-MGLPLSYT 220 (304) T ss_pred chHHHHHHHHHHhh--------h-ccCCcCEEEEcHHHHHHHHHh---hccC---CcEeecCC----Cccc-cceeeEEe Confidence 12222223322321 1 223344578999999888642 2111 11111111 2566 46899988 Q ss_pred ccccccchhhc--cCCCceEEEEEecCCCccceeEecccchhh--cccccCCcc-----cc---ceeeeeeeeceee-cC Q lcl|NC_015279. 372 PYSSNLTSANA--ANGNQYYVVGYKGTSPYDAGLFYCPYVPLQ--MVRAVGENT-----FQ---PKIGFKTRYGMVA-NP 438 (467) Q Consensus 372 ~y~~~~~~~~~--~~~~dY~~vGyKG~~~~d~glfyaPYv~l~--~~~~~Dp~s-----~q---P~~g~~tRY~l~~-nP 438 (467) .+......+.. .-.+.++++|..+..+.+- ..+.. +....|++. || =.+=...||++.+ || T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~------~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~ 294 (304) T protein:vir:10 221 GADVYDKKKSLALMGDWDYARYGILQGIEYAI------SEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP 294 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecceEEEE------eecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc Confidence 77532111100 0012234455544333210 00000 111112221 22 2233345776543 33 Q ss_pred cccccCcccccccccc Q lcl|NC_015279. 439 FAEGTTVGAGRLRVNS 454 (467) Q Consensus 439 ~~~~~~~~~~~~~~~~ 454 (467) = .-..+.+.+ T Consensus 295 ~------a~~~l~~a~ 304 (304) T protein:vir:10 295 E------AFATLKPTE 304 (304) T ss_pred c------ceEEEEecC Confidence 1 001111111 No 103 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=20.18 E-value=2.8 Score=18.12 Aligned_cols=331 Identities=11% Similarity=0.083 Sum_probs=130.2 Q ss_pred CcchHHHHHhhhhhhccCccchhcchhHHHHHHHHhhhH---------------HHHHHHHhhhhhcc--hhhhhhhhhc Q lcl|NC_015279. 
1 MFQSEQLQEKWAPLLNYEGLDKISDPHRRAVTAVLLENQ---------------EKFMQEQVAFEQGG--MIAEQPTNAV 63 (467) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~v~~~~~enq---------------~~~~~e~~~~~~~~--~~~e~~~~~~ 63 (467) .=+.+++.++|........ ++....++.........+ ++++++-..+..++ .+...+.+.. T Consensus 39 ~ee~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~ 116 (404) T protein:vir:39 39 AEAMSELKNKRDNEKVRRD--ALREQLVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTE 116 (404) T ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhh Confidence 2223355555644322111 121111111110000000 00111111111000 0000011100 Q ss_pred ccccccccccccc---cccCchhhhhHHHHHhhhhhhhceeeccCCccceeeeeeeeeecCCCCCccccccccccccccc Q lcl|NC_015279. 64 GNGGYTSSGGQTV---AGFDPVLISLIRRSMPNLVAYDLAGVQPMSGPTGLIFAMRSKYSTQGGTEALFDEADTAFAGQN 140 (467) Q Consensus 64 g~~~~~st~tg~i---~~~~P~Lv~l~Rr~~p~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsGtEAlfnEadt~fSg~~ 140 (467) ...++++|.+ ..+.+.++.+.| +.....+++.+.||+++++-+--.| ..+.++ + ..+- T Consensus 117 ---~~~t~~~gg~~iP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~-~-------a~~v--- 177 (404) T protein:vir:39 117 ---TSGSDSAAGLTIPQDIRTMINTLVR---QYDSLQQYVRVESVSTSNGSRVYEK--WTDVTP-L-------TVMD--- 177 (404) T ss_pred ---hcccccCCceeccHHHHHHHHHHHH---hhhhHHhhcceeeccCCcceEEEEe--ecCCcc-c-------eeee--- Confidence 0111122211 123344555555 5557888999999999887653322 111100 0 0000 Q ss_pred ccccccccccccccccCCCcccccccccccccccccccccccccccchhhHhhcCC-CCCccceeeeEEEEEEEEeeccc Q lcl|NC_015279. 141 EGFDLTNGMSDAAAGLGTTSQAGSNPAALNPVATASSTGYNVGQGMRTDEAEDLGT-SGDNFNEMAFSIEKVTVTAKSRA 219 (467) Q Consensus 141 a~~~~~~~~~~~~~~~~~~~~agt~p~~ln~~~~~~~~~~~~~~Gm~TA~aE~LGs-~g~~f~EMaFsIEK~tVtAKSRa 219 (467) ++++.... +...|.++.|++.|..+-. 
T Consensus 178 ------------------------------------------------~Eg~~~~~~~~~~f~~i~~~~~k~~~~~---- 205 (404) T protein:vir:39 178 ------------------------------------------------AEDGKIPDLDNPRLTIIKYLIKRYAGII---- 205 (404) T ss_pred ------------------------------------------------cCccccccccccceeeEEeeeeeEEeee---- Confidence 00000000 1245788888888877654 Q ss_pred ccccccHHHHHHHHHhhCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhcccccccccccceeEEeeccccchhHHHHHHH Q lcl|NC_015279. 220 LKAEYSLELAQDLKAIHGLNAEAELANILSSEILAEINREVIRTIYKVSEQGAVSNTATAGVFDLDIDSNGRWSVEKFKG 299 (467) Q Consensus 220 LKAEYT~ELAQDLkAiHGLDAEtELaNILStEImlEINREII~~l~~~a~~~k~~~~~~~gv~Dl~~~~~~r~~ve~~~~ 299 (467) .+|-||.+|-. .|.+++|.+-|+..|..-+|..||.-. -.+....+..+++ .... T Consensus 206 ---~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~~il~g~--------g~~~~~~~~~~~~----------~i~~ 260 (404) T protein:vir:39 206 ---TATNTLLKDTA----ENILAWLSSWIAKKVVVTRNQAIIAAM--------GTVPKKPTIAKFD----------DVIT 260 (404) T ss_pred ---hhHHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHHhcc--------cccccccccccHH----------HHHH Confidence 48999999842 577999999999999999999888422 1122223333321 1112 Q ss_pred HHHHHHHHHHHHHHhhccCCccEEEEchHHHHHHhhhcchhccccccccc-ccccCCceeEEEecCceEEEecccccccc Q lcl|NC_015279. 300 LLFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALNANL-NVDDTGNTFAGVLQGKYRVYIDPYSSNLT 378 (467) Q Consensus 300 l~~~i~~ean~i~~~t~rg~gn~~i~S~~Va~~L~~sG~~~~~~~~~~~~-~~d~t~~~~~G~l~~~~~vy~D~y~~~~~ 378 (467) +++..... .......+||+|.....|... ....+ ..+ ..+.++ -..++| .|++|++-.+..-+. T Consensus 261 ~~~~~~~~--------~~~~~a~~v~n~~~~~~L~~l---kd~~G--~~l~~~~~~~-~~~~~l-~G~pV~~~~~~~~~~ 325 (404) T protein:vir:39 261 MINTSVDP--------AIIATSSLLTNQSGLNKLALV---KTAEG--KYLLEPDPTK-PNSYLI-KGKKVIVVADRWLPN 325 (404) T ss_pred HHHHhhhh--------hhccCCEEEEcHHHHHHHHHh---hccCC--ceeeccCcCC-CCccee-cceeEEEecccccCc Confidence 22111011 111234688999998888642 22111 011 111111 112466 455666532211000 Q ss_pred hh----hcc--CCCceEEEEEecCCCccceeEecccchhhcccccCCccccceeeeeeeeceee-cCc--cc----ccCc Q lcl|NC_015279. 
379 SA----NAA--NGNQYYVVGYKGTSPYDAGLFYCPYVPLQMVRAVGENTFQPKIGFKTRYGMVA-NPF--AE----GTTV 445 (467) Q Consensus 379 ~~----~~~--~~~dY~~vGyKG~~~~d~glfyaPYv~l~~~~~~Dp~s~qP~~g~~tRY~l~~-nP~--~~----~~~~ 445 (467) .. .++ ...+|++++.++.. .+=..+|+...+ ...|=.+-...||+..+ +|- .. ...+ T Consensus 326 ~~~~~~~~~~gd~~~~~~~~~~~~~----~i~~~~~~~~~~------~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~ 395 (404) T protein:vir:39 326 SGSTVYPLYYGDMSQAITLFDRENM----SLLPTNIGAGAF------ETDTTKIRVIDRFDVKTTDSEALVAGSFTAIAD 395 (404) T ss_pred cCCCccEEEEEeccccEEEEeecce----EEEEeccchhhh------hhceeeEEEEeeeccEEecccceEEEEeecccc Confidence 00 000 00122222222111 122223221111 12334455566776543 442 11 1122 Q ss_pred ccccccccc Q lcl|NC_015279. 446 GAGRLRVNS 454 (467) Q Consensus 446 ~~~~~~~~~ 454 (467) ..+....|+ T Consensus 396 ~~~~~~~~~ 404 (404) T protein:vir:39 396 QVGNFTAGK 404 (404) T ss_pred CCCCCCCCC Confidence 222233344 Done!