Query lcl|NC_016762.1_cdsid_YP_005098074.1 [gene=phi297_00047] [protein=putative coat protein] [protein_id=YP_005098074.1] [location=29614..30708] Match_columns 364 No_of_seqs 17 out of 23 Neff 3.6 Searched_HMMs 1612 Date Thu Nov 7 13:26:15 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_47 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_47_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105778 Length: 358 100.0 2E-191 1E-194 1066.2 28.1 358 1-364 1-358 (358) 2 protein:vir:80068 Length: 301 100.0 7.3E-35 4.6E-38 207.7 11.9 291 49-363 1-301 (301) 3 protein:vir:107687 Length: 319 99.9 4.4E-28 2.7E-31 170.6 11.0 311 23-363 1-319 (319) 4 protein:vir:79642 Length: 329 99.9 3.3E-26 2.1E-29 160.3 10.4 317 18-360 1-329 (329) 5 protein:vir:5255 Length: 304 # 99.6 9.6E-19 5.9E-22 119.4 10.3 282 50-364 1-298 (304) 6 protein:vir:103285 Length: 296 99.6 1.9E-18 1.2E-21 117.7 10.2 285 43-364 1-294 (296) 7 protein:vir:104342 Length: 314 99.5 3.9E-17 2.4E-20 110.5 9.9 301 21-364 1-312 (314) 8 protein:vir:94070 Length: 339 99.0 4.2E-12 2.6E-15 82.9 9.0 311 1-363 1-339 (339) 9 protein:vir:107732 Length: 379 98.9 2.9E-11 1.8E-14 78.3 12.7 308 1-363 19-379 (379) 10 protein:vir:78558 Length: 336 98.9 1.5E-11 9.2E-15 79.9 10.7 292 12-363 1-336 (336) 11 protein:vir:99576 Length: 388 98.9 2.4E-11 1.5E-14 78.8 9.2 332 1-363 23-388 (388) 12 protein:vir:101557 Length: 336 98.8 5.1E-11 3.2E-14 77.0 10.7 310 12-363 1-336 (336) 13 protein:vir:3643 Length: 336 # 98.8 1.1E-10 6.9E-14 75.2 9.8 310 12-363 1-336 (336) 14 protein:vir:106734 Length: 336 98.6 3.8E-10 2.4E-13 72.2 9.0 294 12-363 1-336 (336) 15 protein:vir:96079 Length: 382 98.3 4.3E-08 2.7E-11 61.0 12.3 322 1-363 17-382 (382) 16 protein:vir:78739 Length: 332 79.9 0.067 4.2E-05 27.0 7.7 297 30-361 1-332 (332) 17 protein:vir:97031 Length: 402 67.9 0.18 0.00011 24.7 6.9 266 43-364 1-287 (402) 18 protein:vir:97255 Length: 310 62.8 0.28 0.00017 23.6 6.9 275 43-359 1-310 (310) 19 protein:vir:100135 Length: 418 58.1 0.42 0.00026 22.6 15.4 297 1-364 96-414 (418) 20 protein:vir:7771 Length: 330 # 54.8 0.49 0.00031 22.3 9.0 282 35-364 1-325 (330) 21 protein:vir:96262 Length: 274 54.7 0.5 0.00031 22.2 7.4 247 43-364 1-257 (274) 22 protein:vir:95898 Length: 274 54.7 0.5 0.00031 22.2 7.4 247 43-364 1-257 (274) 23 protein:vir:739 Length: 231 # 54.2 0.35 0.00022 23.1 5.9 201 88-364 1-218 (231) 24 protein:vir:94622 Length: 341 50.2 0.62 0.00038 21.7 8.7 278 43-364 1-326 (341) 25 protein:vir:80930 Length: 278 44.7 0.8 0.00049 21.1 8.3 258 43-364 1-264 (278) 26 protein:vir:78223 Length: 333 42.8 0.87 0.00054 20.9 8.4 292 22-363 1-333 (333) 27 protein:vir:6324 Length: 335 # 31.5 1.5 0.00092 19.6 8.0 271 27-364 1-315 (335) 28 protein:vir:7409 Length: 408 # 30.5 1.6 0.00098 19.5 8.5 297 1-364 30-394 (408) 29 protein:vir:4092 Length: 390 # 28.3 1.8 0.0011 19.2 9.4 316 1-364 25-369 (390) 30 protein:vir:96123 Length: 274 26.7 1.9 0.0012 19.0 6.7 246 43-364 1-257 (274) 31 protein:vir:80180 Length: 381 26.3 2 0.0012 19.0 10.5 296 36-364 1-323 (381) 32 protein:vir:96833 Length: 275 25.4 2.1 0.0013 18.9 8.4 256 35-364 1-264 (275) 33 protein:vir:4339 Length: 395 # 22.9 2.4 0.0015 18.5 13.6 303 1-364 37-394 (395) 34 protein:vir:95107 Length: 270 22.8 2.4 0.0015 18.5 7.5 246 43-364 1-252 (270) 35 protein:vir:9820 Length: 272 # 22.6 2.4 0.0015 18.5 6.1 257 43-355 1-272 (272) 36 protein:vir:3033 Length: 272 # 22.6 2.4 0.0015 18.5 6.1 257 43-355 1-272 (272) 37 protein:vir:104256 Length: 458 22.5 2.4 0.0015 18.5 14.2 309 1-364 105-457 (458) 38 protein:vir:102605 Length: 273 21.9 2.5 0.0016 18.4 11.4 239 49-364 1-260 (273) 39 protein:vir:105822 Length: 273 21.9 2.5 0.0016 18.4 11.4 239 49-364 1-260 (273) 40 protein:vir:97053 Length: 390 21.6 2.6 0.0016 18.3 14.7 292 1-364 75-390 (390) 41 protein:vir:103323 Length: 364 20.3 2.8 0.0017 18.1 7.0 271 43-364 1-326 (364) No 1 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=100.00 E-value=1.9e-191 Score=1066.16 Aligned_cols=358 Identities=70% Similarity=1.159 Sum_probs=356.1 Q ss_pred CccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhH Q lcl|NC_016762. 1 MFLTQQAIAAHPRLMGHFQELQANRNIWNNQNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEI 80 (364) Q Consensus 1 ~~ftke~~~~~~~~~~q~~~L~~~R~~~~~~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i 80 (364) |+|+||+++|++++|+||+|||++||+||++|++|+|++++++++...+|||++++|+|+|||||++++|+|++|+|++| T Consensus 1 ~~f~K~~~an~~~~~~qw~~L~~~Rna~n~~~~a~maan~a~~~~~~~~~NAv~~v~~D~wr~~D~~~~q~fr~e~~~~l 80 (358) T protein:vir:10 1 MYFSKETLATNSRLGGHWNELWANRNMWNAQHDAMIAANRSNMTPEWLAVNAVGGFTRDFWAEIDRQVLQLRDQEVGMEI 80 (358) T ss_pred CeechhhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHhhhHHHhhhhhheecccccCCHHHHHHHhhhhhhhcccchhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcccccceeeccCCccceeecCCccchhhhhhcCccccchhh Q lcl|NC_016762. 81 VNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVL 160 (364) Q Consensus 81 ~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~ 160 (364) |||||+||+|||||||||+|+++|||+|+||+|||||+|++|||++|+|||||||||||||+||||||.+|+|+|||+++ T Consensus 81 ~NDLm~ls~sv~Igktv~~y~~~gd~~~~v~~SmsGQ~~~~lD~~~y~~dGtpiPIfdsg~~f~WR~~~~~~~~g~d~~~ 160 (358) T protein:vir:10 81 VNDLIGVQTVLPVGKTAKLYNVIGDIADDVSVSIDGQAPFSFDHTEYASDGDPIPVFTAGYGVNWRHAAGLNSLGIDLVL 160 (358) T ss_pred HhhhhhccccccHHHHHHHHhhhcCCCceEEEEecccCcccccceeeeccCCEeeeeccCccccccchhhcCccccchhH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhc Q lcl|NC_016762. 161 DSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAART 240 (364) Q Consensus 161 D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~ 240 (364) |+|++|+|||+||||||+||||++|+|+|||||||||||||+|++||+++||+||||||||+++++.||++++|++++++ T Consensus 161 daQ~~~~~kv~~~~vdy~lNG~~~I~v~g~t~~Glrn~~n~~qv~l~~~s~g~NiDlttat~~a~~~~f~~~l~~~~~~~ 240 (358) T protein:vir:10 161 DSQMAKMRKFNQKRVNYYLNGDPNIQVQSYPAQGIKNHRNTKKINLGSGSGGANIDLTTADMTALFAFFGKGAFGTLARA 240 (358) T ss_pred HHHHHHHHHHHHHHHhhhhccCCceeecCcccccccCCcceeEEEeccCCCcceeeeccCCHHHHHHHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecce Q lcl|NC_016762. 241 NKVDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMA 320 (364) Q Consensus 241 N~v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp 320 (364) ||++++++||||||||+||+|||+ ++||++||||++|++|++|+||+|+++|+|||||+|+|+++||+|||||| T Consensus 241 N~~~~~~~~~vs~ei~~n~~r~Y~------~~~~~~gTIl~~vl~~~~va~I~~~~~LsgNeii~~~~~~~vi~plvG~~ 314 (358) T protein:vir:10 241 NKVAQYDVMWVSPEIWANLAQPYV------VNGVVSGNVLNAVLPFAPVREIRQTFALSGNEFIAYVRRQDIISPLVGMA 314 (358) T ss_pred cccceeeEEEEcHHHHhhhhcccc------cccccchhhHHHhhcccCcccccccccCCCccEEEEEeCCceeeeeecce Confidence 999999999999999999999997 48999999999999999999999999999999999999999999999999 Q ss_pred eecccccccCCCCchhhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 321 TGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 321 ~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) +||||+|||||||||+|+||||+|||||+|++||||||||++|- T Consensus 315 ~gt~~~pR~~p~ddY~f~vwsA~glqik~D~~Gks~Vv~~~~~~ 358 (358) T protein:vir:10 315 VGVVPLPRPLPNVNYNFQIMSAEGLQITADDQGLSGVVYGANLV 358 (358) T ss_pred eeeecCCCCCCCcchhhhhhhhhceeeeeccccceeeEeecccC Confidence 99999999999999999999999999999999999999999999 No 2 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=7.3e-35 Score=207.72 Aligned_cols=291 Identities=12% Similarity=0.129 Sum_probs=247.0 Q ss_pred hhhhhcc-ccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCccccccee Q lcl|NC_016762. 49 ACNALAG-LGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYSFDHTEY 127 (364) Q Consensus 49 a~Na~a~-lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~~D~~~Y 127 (364) .+|...+ |+...|..||.++++...+.. . -.+|+++.++++.|.....|.+... .|.+.+.-++...-++....+ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l--~-~r~l~~v~~~~~~~~~~~~~~~~~~-~G~~~~~~~~~~dip~~~~~~ 76 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEEL--T-ARSVFPQKFDVNEGAESYSFDVMTR-SGAAKIIANGADDLPLVDVDM 76 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhh--h-hhhhcccccCCCCceEEEEEeeecc-ceeEEEecCcccccccccccc Confidence 3454433 799999999999999955443 3 5599999999999999999998876 899999888887778888889 Q ss_pred eccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCcccccccc Q lcl|NC_016762. 128 NSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLG 207 (364) Q Consensus 128 ~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg 207 (364) +.+..|||.|.+||+++|||+...+..|+++.++.+.++.+++.+++-+++++|+..+ ..|||-|||+......+ T Consensus 77 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~-----g~~GLlN~p~~~~~~~~ 151 (301) T protein:vir:80 77 VRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKY-----AIKGAFEATGIQIDVSP 151 (301) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccc-----cceeeecCCCccccccc Confidence 9999999999999999999999999999999999999999999999999999997654 57999999999988888 Q ss_pred ccCCCcceeecccCHHHHHHHHhhhhHHHH-HhccccccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc Q lcl|NC_016762. 208 SGAGGANIDLTTATPQQIIDFFTKGAFGQA-ARTNKVDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF 286 (364) Q Consensus 208 ~~~gGaNidlttA~~~~i~~~f~~~~f~~~-~~~N~v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~ 286 (364) ....|.+-+-.++|+++|++.+.+ ++.++ ...+++..+.+|++||+.+..|.++++ +++. ..||+++|.+. T Consensus 152 ~~~~~~~~~w~~~t~~ei~~di~~-~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~------~~~~-~~tvl~~l~~~ 223 (301) T protein:vir:80 152 TTGVGNVSKWEKKTAEQIIDEIGE-AHTKITVLPGYGTASLKLCLPPKQFELINKKRY------SNED-SRSVLKVLQDN 223 (301) T ss_pred CcccccccccccCCHHHHHHHHHH-HHHHHHHhcCceecccEEEecHHHHHhhhhccc------cCCC-CeeHHHHHHHH Confidence 777888999999999999999986 56555 567888899999999999999999875 4555 67999999998 Q ss_pred cchhhhccccccCC------CeEEEEEcCcceeeheecceeecccccccCCC--CchhhhhhhhcchhhhcccccceeeE Q lcl|NC_016762. 287 IPAREVRQTFALSG------NEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQ--VNYNFQIMSAMGIQVKKDDEGLSGVI 358 (364) Q Consensus 287 ~~V~~I~~~~~Ltg------Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~p~--~nY~f~v~~A~glqiK~D~~G~sgVv 358 (364) .+..+|+....|.+ +-++.|.++++++...++||+-+.|.-+++++ .+|.++++ |++||. -..++ T Consensus 224 ~~~~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~---Gv~i~~----P~ai~ 296 (301) T protein:vir:80 224 AWFSAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFPRTKVPFEERTA---GVVVRF----PAAIV 296 (301) T ss_pred cCcceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCceeeecceecCceeEeeeeeeeE---EEEEEc----cceEE Confidence 88889999988865 44888999999999999999988877666654 23444444 556665 45667 Q ss_pred eeccC Q lcl|NC_016762. 359 YGANL 363 (364) Q Consensus 359 ~~~~l 363 (364) |-.|+ T Consensus 297 ~~~GI 301 (301) T protein:vir:80 297 RVDGI 301 (301) T ss_pred EEecC Confidence 77777 No 3 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=99.91 E-value=4.4e-28 Score=170.56 Aligned_cols=311 Identities=12% Similarity=0.068 Sum_probs=226.6 Q ss_pred HHHHHHHHHHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHh Q lcl|NC_016762. 23 ANRNIWNNQNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNV 102 (364) Q Consensus 23 ~~R~~~~~~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~ 102 (364) +--..|+.+...-|+.+-..|+..-+|+..++.|+.+.|.+||.++++. ....++. ..|+++.++++.|.....|.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~--~~~~l~~-~~~i~v~~~~~~~~~~~~~~~ 77 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEE--DYPVGSA-LRVFPVTTELSPTDKTFEYMT 77 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhh--hhcceec-hhhcccccCCCCceEEEEeee Confidence 1112355444454555555666666677777779999999999999998 4444444 346799999999987766666 Q ss_pred hccCCCeEEEeeCCCCCc-ccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhcc Q lcl|NC_016762. 103 VGDIADDVSVSIDGQAPY-SFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDG 181 (364) Q Consensus 103 ~gd~a~~v~~SmdGq~~~-~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG 181 (364) .-- .|.+.+ +++.+.. ++=-..++++.-||+-+..+|+++|||....+..|+++.++.+.+..+++.++.-+.+++| T Consensus 78 ~~~-~G~a~~-~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 155 (319) T protein:vir:10 78 FDK-VGTAQI-IADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKG 155 (319) T ss_pred ecc-ccceee-ecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEee Confidence 533 454442 3333221 2222335666779999999999999999999999999999999999999999999999999 Q ss_pred CCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhc Q lcl|NC_016762. 182 ATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQ 261 (364) Q Consensus 182 ~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~ 261 (364) +.. ...|||-|||+....+.+.+++ ..|+++++|++.+.+..-...+..+.+..+.+|.+||+.+..|.+ T Consensus 156 ~~~-----~g~~GLlN~p~~~~~~~~~~~~-----~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~ 225 (319) T protein:vir:10 156 SAP-----HKIVSVFNHPNITKITSGKWID-----VSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAI 225 (319) T ss_pred ccc-----ccceeEEeCCCceeeecCCCCC-----ccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhc Confidence 654 4569999999988777654333 347899999998876433334456788899999999999999999 Q ss_pred ceeeecccccccccchhHHHHHhhccchhhhccccccCC------CeEEEEEcCcceeeheecceeecccccccCCCCch Q lcl|NC_016762. 262 PYMITMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSG------NEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNY 335 (364) Q Consensus 262 py~~~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg------Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY 335 (364) ++ +++ ..|+++.|.+..+=-.|+....|.+ +-++.|.++++++...++||+.+.|. +--.-+| T Consensus 226 ~~--------~~~-~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~--e~~~l~~ 294 (319) T protein:vir:10 226 RM--------PET-TMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA--QPKDLHF 294 (319) T ss_pred cc--------CCC-CeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeee--eecCceE Confidence 86 444 5799999998755556777777764 67899999999999999999987764 2222333 Q ss_pred hhhhhhh-cchhhhcccccceeeEeeccC Q lcl|NC_016762. 336 NFQIMSA-MGIQVKKDDEGLSGVIYGANL 363 (364) Q Consensus 336 ~f~v~~A-~glqiK~D~~G~sgVv~~~~l 363 (364) ......+ .|+.||. -..+.|-.|+ T Consensus 295 ~~~~~~r~~Gv~i~~----P~ai~~~dGI 319 (319) T protein:vir:10 295 KVPCTSKCTGLTIYR----PMTIVLITGV 319 (319) T ss_pred EEeeeeeeEEEEEEc----cceeEeeecC Confidence 3222332 2456665 4556677777 No 4 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=99.88 E-value=3.3e-26 Score=160.28 Aligned_cols=317 Identities=13% Similarity=0.108 Sum_probs=224.3 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHhhcccchhhhhhhh-hccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhH Q lcl|NC_016762. 18 FQELQAN-RNIWNNQNAAMLAEHRGAMTPGMLACNA-LAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGK 95 (364) Q Consensus 18 ~~~L~~~-R~~~~~~~~~m~a~~~~~~t~~~~a~Na-~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igk 95 (364) -+-+..+ +-.+++.+.+.+|... .+ +++.+.|. .+.+--.-|..||.++++. +...++.. .|+++.++++.|. T Consensus 1 ~~~~~~~~~~~~d~~~~~~~a~~~-~~-~~~~~~~~~~~~f~~~ql~~id~~v~e~--~~~~l~~~-~~i~i~~~~~~~~ 75 (329) T protein:vir:79 1 MRGNIMSKEMKYDEFEANVIANHM-QL-RGAKNDASDMGIWTSQELHKIKAQAYEK--EYPAGSAL-RVFPVTSELSDTD 75 (329) T ss_pred CccchhhhhhccchhhhhhHhhhc-cc-ccceeccchhhHHHHHHHHHHHHHHHhh--hhcccchh-hhcccccCCCCce Confidence 2223333 3345555555555522 21 23333332 2224456699999999998 56666666 4679999999999 Q ss_pred HHHHHHhhccCCCeEEEeeCCCC-CcccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHH Q lcl|NC_016762. 96 SAKLYNVVGDIADDVSVSIDGQA-PYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRI 174 (364) Q Consensus 96 tv~~y~~~gd~a~~v~~SmdGq~-~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~ 174 (364) ....|.+.-. .|.+.. +++.+ --++-...++++--||+.|.+||+++|||....+..|+++..+.+.+..+++.+++ T Consensus 76 ~~~t~~~~~~-~G~a~~-~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~ 153 (329) T protein:vir:79 76 KTFEYQTFDK-VGHAKI-IADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLV 153 (329) T ss_pred eEEEeeeeec-ceeeee-ecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhh Confidence 8887777654 565553 33322 22333344667778999999999999999999999999999999999999999999 Q ss_pred HhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHH Q lcl|NC_016762. 175 VAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPE 254 (364) Q Consensus 175 ~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~e 254 (364) -+.+++|+.. ...|||-|||++....-| .+-+-+..++|+++|++.+...........+++..+.+|.+||+ T Consensus 154 n~i~f~G~~~-----~g~~GLlN~p~v~~~~~~---~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~ 225 (329) T protein:vir:79 154 NHLVFKGSKP-----HKIISVFEHPNLTTINSA---GWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPS 225 (329) T ss_pred ccEEEeeccc-----ccceeeecCCCccccccC---CCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHH Confidence 9999999654 556999999999765543 23445566889999999998644444455677788999999999 Q ss_pred HHHhhhcceeeecccccccccchhHHHHHhhccchhhhccccccCC------CeEEEEEcCcceeeheecceeecccccc Q lcl|NC_016762. 255 INANLAQPYMITMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSG------NEFLGYQRRRDVVSPLVGMATGVIPLPR 328 (364) Q Consensus 255 I~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg------Ne~lg~v~~~dvI~plvGmp~gt~p~pR 328 (364) -+.-|.+++ +++ ..|+++.|.+..+--.|+....|.+ +.++.|...+++++..++||+-+.|.-+ T Consensus 226 ~~~~L~~~~--------~~~-~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~ 296 (329) T protein:vir:79 226 MRKVLMVRM--------PET-TMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQP 296 (329) T ss_pred HHHHhhccc--------CCC-CccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecCcceeeeecee Confidence 999998886 444 5899999998765556777777754 7889999999999999999998776544 Q ss_pred cCCCCchhhhhhhh-cchhhhccc--ccceeeEee Q lcl|NC_016762. 329 PLPQVNYNFQIMSA-MGIQVKKDD--EGLSGVIYG 360 (364) Q Consensus 329 ~~p~~nY~f~v~~A-~glqiK~D~--~G~sgVv~~ 360 (364) +. .+|.....++ .|..||.=. --..|++-| T Consensus 297 ~~--~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 297 KD--LHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred cC--ceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 33 3333322222 235566532 333445544 No 5 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=99.63 E-value=9.6e-19 Score=119.37 Aligned_cols=282 Identities=14% Similarity=0.141 Sum_probs=194.0 Q ss_pred hhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEe-eCCCCCcccccceee Q lcl|NC_016762. 50 CNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVS-IDGQAPYSFDHTEYN 128 (364) Q Consensus 50 ~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~S-mdGq~~~~~D~~~Y~ 128 (364) +-++|=|-+++ ..+|+++.+. +..++. .-+|+|+.+.++.+.+...|.+..- .|.+.-. ++++ ..++....++ T Consensus 1 ~~~lafl~~qL-~~id~~vye~--~~~~~~-~~~lipv~t~~~~~~~~~~~~~~d~-~G~a~~~~i~~~-a~dip~vd~~ 74 (304) T protein:vir:52 1 MSLLAYVKNGL-TAVSKDIAET--KYPEIV-FPQFVYVDQQTAVGITEKLHYGADE-HGSLDDGLITVG-TSTLDQVEVG 74 (304) T ss_pred CchHHHHHHHH-HHHhhhhhcc--ccccch-hhhhccccCCCCcccceEEEeeeec-cCcccccccCCc-CCccceeecc Confidence 22333232322 4567766655 344444 3578999999999998877777655 6767633 3444 4668888899 Q ss_pred ccCCccceeecCCccch--hhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccc Q lcl|NC_016762. 129 SDGDPIPVFTAGYGVNW--RHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNL 206 (364) Q Consensus 129 ~dGtPiPIf~sgy~~~W--R~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~l 206 (364) ++-.-.||+..|-+++| .|....+-.|+++..+-+.+..+++.+++-+.++.|+.. .+..|||-|||+.-..+. T Consensus 75 ~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~----~~g~~GllN~p~v~~~~~ 150 (304) T protein:vir:52 75 FTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAK----DSRLTGLLNNKSVEVYAI 150 (304) T ss_pred cceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeecc----ccceEEEEeCCCcceeee Confidence 99999999887766666 888888999999999999999999999999999999753 355899999999986654 Q ss_pred cccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc Q lcl|NC_016762. 207 GSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF 286 (364) Q Consensus 207 g~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~ 286 (364) - ++|++-+..++|+++|++.+....-......+++..+.+|.++|+.++-+..=. +... ..|||+.|++- T Consensus 151 ~--~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~-------~~~~-~~Tvl~~l~~n 220 (304) T protein:vir:52 151 K--GAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQ-------RANT-DTTALEFLTKH 220 (304) T ss_pred c--CCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhcc-------CCCC-CchHHHHHHHh Confidence 3 235556677999999999998654444566777788999999999998886432 1223 57999999985 Q ss_pred cchh-----hhcccc-ccC--C----CeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhcccccc Q lcl|NC_016762. 287 IPAR-----EVRQTF-ALS--G----NEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGL 354 (364) Q Consensus 287 ~~V~-----~I~~~~-~Lt--g----Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~ 354 (364) .+-. .|+... .+. | +.++.|.+++++++..+-||+-..|. .|++...| +.+..+| T Consensus 221 ~~~~~g~~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~---q~~~~~~~----------~vp~~~r 287 (304) T protein:vir:52 221 LSAAAGRQVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDA---QPKGLLAF----------ESGLRMA 287 (304) T ss_pred cccccCCcceEEEecccccccCCCCceEEEEEecChhheEEecCccccccch---hhcCCceE----------Eecceee Confidence 4311 344332 221 2 46899999999999999888866552 23332223 3333222 Q ss_pred e-eeEeeccCC Q lcl|NC_016762. 355 S-GVIYGANLA 364 (364) Q Consensus 355 s-gVv~~~~l~ 364 (364) + ||.-=.-+| T Consensus 288 ~gGv~v~~P~a 298 (304) T protein:vir:52 288 FGGVTFMEPDS 298 (304) T ss_pred eeeEEEEccce Confidence 2 222112222 No 6 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=99.62 E-value=1.9e-18 Score=117.72 Aligned_cols=285 Identities=11% Similarity=0.057 Sum_probs=202.6 Q ss_pred cchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCc-c Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPY-S 121 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~-~ 121 (364) |+-. .+-..+-+-..=|..||+++++. ....++. -.|+++.++++.|-....|.+.-- .|.+. .+++.+.. + T Consensus 1 ~~~~--~a~~~~~f~~~ql~~id~~v~e~--~~~~l~~-~~~i~v~~~~~~~~~~~~~~~~~~-~G~a~-~~~~~~~dip 73 (296) T protein:vir:10 1 MGVD--KADAAGIWTVKQLTASLNKAYET--EYDQNSV-VNLFPVSNEIPGYAKYFEYPVFDG-VGIAQ-IVADYTDDLP 73 (296) T ss_pred Cccc--chhhhHHHHHHHHHHHHHHHHhh--hhccccc-ceecccccCCCCceeEEEeeeeec-cCcee-EeCCCccccc Confidence 4322 22232235577899999999987 3334433 346788888888866555544322 44443 23333210 1 Q ss_pred cccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCcc Q lcl|NC_016762. 122 FDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNT 201 (364) Q Consensus 122 ~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~ 201 (364) +=-..++.+--||+-+..+|+++|||....+-.|+++..+.+.+..+++.++.-+.++.|+. ++..|||=|||+. T Consensus 74 ~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~-----~~g~~GLlN~p~v 148 (296) T protein:vir:10 74 LVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGST-----AHGIPSVFDYPNI 148 (296) T ss_pred eeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc-----cccceeEeecCCC Confidence 10112344555777778999999999999999999999999999999999999999999954 4568999999997 Q ss_pred ccccccccCCCcceeecccCHHHHHHHHhhhhHHHHH-hccccccceEEEEcHHHHHhhhcceeeecccccccccchhHH Q lcl|NC_016762. 202 IKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAA-RTNKVDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVL 280 (364) Q Consensus 202 ~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~-~~N~v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl 280 (364) ...+-++ | | +++.+|++-+.+ ++.++. ..+.+..+.+|-++|+...-|.+.+ +++ ..|++ T Consensus 149 ~~~~~~~-------~-W-~~~t~i~~Di~~-~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~--------~~~-~~t~l 209 (296) T protein:vir:10 149 NNVVSGG-------S-W-SQPTTAVSDITS-LLDIIETSTNGQHRATHLLLPTTARRIMQNLV--------PGT-SVSYG 209 (296) T ss_pred ccccccC-------C-c-cCHHHHHHHHHH-HHHHHHHhhCceecceeEEeCHHHHHHHhhcc--------CCC-CccHH Confidence 5443321 1 2 455577666664 455444 4567889999999999999999886 444 57999 Q ss_pred HHHhhccchhhhccccccCC------CeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhc-chhhhccccc Q lcl|NC_016762. 281 DAVMRFIPAREVRQTFALSG------NEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAM-GIQVKKDDEG 353 (364) Q Consensus 281 ~~i~~~~~V~~I~~~~~Ltg------Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~-glqiK~D~~G 353 (364) +.|.+..+=-.|+....|.+ +-++.|...++++...++||+.+-|. +--+.+|.++++.+. |+.||. T Consensus 210 ~~ik~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~--e~~~l~~~~~~~~~~~Gv~i~~---- 283 (296) T protein:vir:10 210 EFFRQNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALPA--QPKDLHFKIPVTSKATGLIVYR---- 283 (296) T ss_pred HHHHHhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCcceeeecc--cccCceEEEeeEeeEEEEEEEC---- Confidence 99999877667787777765 45788899999999999999977654 445578888888876 588887 Q ss_pred ceeeEeeccCC Q lcl|NC_016762. 354 LSGVIYGANLA 364 (364) Q Consensus 354 ~sgVv~~~~l~ 364 (364) -..++|-.|+. T Consensus 284 P~ai~~~dGI~ 294 (296) T protein:vir:10 284 PLTMAVMKGIT 294 (296) T ss_pred CceeEEEeeee Confidence 44566666655 No 7 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=99.53 E-value=3.9e-17 Score=110.55 Aligned_cols=301 Identities=12% Similarity=0.084 Sum_probs=192.0 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccchh-hhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHH Q lcl|NC_016762. 21 LQANRNIWNNQNAAMLAEHRGAMTPG-MLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKL 99 (364) Q Consensus 21 L~~~R~~~~~~~~~m~a~~~~~~t~~-~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~ 99 (364) .-++ |+...+.+.++ .++.+ +.+--+.+ +--.=|..||.++++. +..+++.- .|+++.+.++.+..... T Consensus 1 ~~~~---~~~~~~~~~~~---~~~~~~~~~d~~~~-fl~~ql~~id~~v~e~--~~~~~~~~-~~i~v~~~~~~~~et~~ 70 (314) T protein:vir:10 1 MAIK---FDAEQAKITTH---LEQMGVEKADAAGI-WAVSQLTAALNRAYEK--EYAENSVV-NIFPVTNEIPGHAKYFE 70 (314) T ss_pred Cccc---hHHHHHHHHHH---HHhhcccchhhhHH-HHHHHHHHHHHHHhhh--hccccccc-eeeccccCCCCceeEEE Confidence 1111 22111111111 11111 11111111 3344588999999998 55555443 46788888888765444 Q ss_pred HHhhccCCCeEEEeeCCCCCcccccc--eeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhh Q lcl|NC_016762. 100 YNVVGDIADDVSVSIDGQAPYSFDHT--EYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAY 177 (364) Q Consensus 100 y~~~gd~a~~v~~SmdGq~~~~~D~~--~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy 177 (364) |.+. |..|.+. .+++.+. ++--. .+..+--||.-+..||+++|||....+--|+++..+-+.+..+++.+++-+. T Consensus 71 ~~~~-e~~G~a~-~~~d~~~-dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i 147 (314) T protein:vir:10 71 YPEF-DGVGIAQ-IIADYSD-DLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKL 147 (314) T ss_pred eeee-cccccee-eeCCccc-ccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceE Confidence 4443 2245444 3333321 11111 2445556777778999999999999999999999999999999999999999 Q ss_pred hhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHH-hccccccceEEEEcHHHH Q lcl|NC_016762. 178 TLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAA-RTNKVDAYDVLWVSPEIN 256 (364) Q Consensus 178 ~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~-~~N~v~~~~~lyvS~eI~ 256 (364) ++.|+.. +..|||-|||+.....- +-+ | +|+++|++-+.. ++.++- .++.+..+.+|-++|+-. T Consensus 148 ~f~G~~~-----~g~~GLlN~p~v~~~~~-------~~~-W-aT~~ei~~Di~~-~~~~l~~~s~g~~~p~~l~Lpp~~~ 212 (314) T protein:vir:10 148 VWSGSAP-----HGIVSVFDQPNINNVVA-------TPN-W-SVPQNAIDDVTA-MIDAVESSTQGLHHVTDILLPASAR 212 (314) T ss_pred EEeeccc-----ccceeEeecCCCccccC-------CCC-c-ccHHHHHHHHHH-HHHHHHHhcCccccceeEEecHHHH Confidence 9999544 56899999999643221 112 4 688999888875 455444 456777889999999999 Q ss_pred HhhhcceeeecccccccccchhHHHHHhhccchhhhccccccCC------CeEEEEEcCcceeeheecceeecccccccC Q lcl|NC_016762. 257 ANLAQPYMITMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSG------NEFLGYQRRRDVVSPLVGMATGVIPLPRPL 330 (364) Q Consensus 257 ~N~~~py~~~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg------Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~ 330 (364) .-|.+++ +.. ..||++.|.+-.+=-+|+....|.+ +-++.|...++++...+.||+-..|. +- T Consensus 213 ~~L~~~~--------~~~-~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~--e~ 281 (314) T protein:vir:10 213 RVMQGLV--------PQT-NLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPA--QP 281 (314) T ss_pred Hhhcccc--------cCC-CccHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeecc--ee Confidence 8887763 222 5799999998543334555555542 23788999999999999999876553 33 Q ss_pred CCCchhhhhhhhc-chhhhcccccceeeEeeccCC Q lcl|NC_016762. 331 PQVNYNFQIMSAM-GIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 331 p~~nY~f~v~~A~-glqiK~D~~G~sgVv~~~~l~ 364 (364) ...+|.....++. |+.||. -..+.|-.|+. T Consensus 282 ~~~~~~~~~~~r~~Gv~i~~----P~ai~~~dGI~ 312 (314) T protein:vir:10 282 KDLHFRYPVTSKATGLIVYR----PLTMAVIKGIT 312 (314) T ss_pred cCceEEEcceeeeEEEEEEC----cceeEeeeeee Confidence 3344444434443 577776 34455555555 No 8 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=98.97 E-value=4.2e-12 Score=82.93 Aligned_cols=311 Identities=13% Similarity=0.090 Sum_probs=173.7 Q ss_pred CccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH--HHhhccc-----chhhhhhhhhccccHHHHHHHHHHHHHHhc Q lcl|NC_016762. 1 MFLTQQAIAAHPRLMGHFQELQANRNIWNNQNAAML--AEHRGAM-----TPGMLACNALAGLGREFWAEIDAQIIQYRN 73 (364) Q Consensus 1 ~~ftke~~~~~~~~~~q~~~L~~~R~~~~~~~~~m~--a~~~~~~-----t~~~~a~Na~a~lprD~W~e~D~~~~q~~~ 73 (364) |-.+-+ ++.-++|...=..|+.....++ .-...+| .|.+... +..+||..+=.-+|.++++... T Consensus 1 ~~~~~~--------~~~~~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~-~~~~i~a~~~~~i~~~vy~~~~ 71 (339) T protein:vir:94 1 MSINND--------RTDIKQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTT-ANAGIPAWMTTFVDRRVIDIQL 71 (339) T ss_pred Cceech--------HHHHHHHHhhceeeccchhhhcchhhHhhhccccccccccccc-cccchhhhhhhhhchhheeecc Confidence 222222 2222222222222221111111 0011111 1112211 1235777777788888888733 Q ss_pred ccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcc-cccceeeccCCccceeecCCccchhhhhhcC Q lcl|NC_016762. 74 QETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS-FDHTEYNSDGDPIPVFTAGYGVNWRHAAGMS 152 (364) Q Consensus 74 q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~-~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~ 152 (364) + .+.. -.|.|+.+.-+-+.....|.+. |..|.+.+= |..... +=-..+++.=-.++++..||.++.+|....+ T Consensus 72 ~--~~~~-~~l~pv~t~g~w~~~t~~y~~~-e~~G~a~~y--gd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~ 145 (339) T protein:vir:94 72 A--PMAA-AKIFPEVKKGDWTTTYGVFIIA-EPVGQVATY--SDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMATYG 145 (339) T ss_pred c--ccch-hhhcccccCCCCcccEEEEeee-ecccceEEc--ccccCCCcccccceeeEEeEEEEEEEEeecHHHHHHHH Confidence 3 3322 3466665543322222222222 224444432 221111 1112344555678899999999999999999 Q ss_pred ccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhh Q lcl|NC_016762. 153 TVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKG 232 (364) Q Consensus 153 t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~ 232 (364) --|+|+..+-+.+..+++.+++-..++.|+. ++..|||-|||+.....- ++-+-.++|+++|++-+.. T Consensus 146 ~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~-----~~~~~GLlN~P~l~~~v~------~s~~Wa~kT~~eI~~Di~~- 213 (339) T protein:vir:94 146 EAGIDYVARQEISASLVMAKFANSSYLLGVA-----GIANYGLMNDPSLPAPVA------ATVNWATAAPEDIANDVVA- 213 (339) T ss_pred hhCCChHHHHHHHHHHHHHHhhceEEeeeec-----ccceEEEEeCCCcccccc------CCCCcccCCHHHHHHHHHH- Confidence 9999999999999999999999999999964 467799999999854331 1234568899999998864 Q ss_pred hHHHHHhcc----ccccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc-cchhhhcccccc---CCCeEE Q lcl|NC_016762. 233 AFGQAARTN----KVDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF-IPAREVRQTFAL---SGNEFL 304 (364) Q Consensus 233 ~f~~~~~~N----~v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~-~~V~~I~~~~~L---tgNe~l 304 (364) ++.++-... ++..+.+|.++||-...|.++ +.+ ..||++.|.+. |.+ .|+....| .|+-+. T Consensus 214 ~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~---------n~~-~~Tvl~~lk~n~pnl-~i~~~~el~~a~g~~~~ 282 (339) T protein:vir:94 214 MVGRLISQSGGLITGQERMVMALAPSALNNVNRT---------NNF-GLSAGAKIAQTYPNI-QFVAVPEFDTASGRLVQ 282 (339) T ss_pred HHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC---------CcC-CccHHHHHHHhcCCc-EEEEccccccCCCceEE Confidence 666664433 334577999999999999887 233 46999999985 322 23333333 233333 Q ss_pred EE---EcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhcccccc---------eeeEeeccC Q lcl|NC_016762. 305 GY---QRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGL---------SGVIYGANL 363 (364) Q Consensus 305 g~---v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~---------sgVv~~~~l 363 (364) .+ ...++.....+-||+-..|- + .-++..|.+..++ ..+.|..|+ T Consensus 283 ~~~~~~~~~~~~~~~~p~~~~~lpv--q------------~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 283 LWVPEVNGQPTGEVAFAEKLRSHSI--E------------RYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred EEEEeccCCcceEEEcchhhhcccc--E------------EcCceEEecceeeeeeEEEEccceeeeeecC Confidence 33 33355665555555422221 1 1233334443333 234455555 No 9 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=98.95 E-value=2.9e-11 Score=78.34 Aligned_cols=308 Identities=14% Similarity=0.120 Sum_probs=162.4 Q ss_pred CccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccchh------------hhhhhhhccccH--HHHHHHHH Q lcl|NC_016762. 1 MFLTQQAIAAHPRLMGHFQELQANRNIWNNQNAAMLAEHRGAMTPG------------MLACNALAGLGR--EFWAEIDA 66 (364) Q Consensus 1 ~~ftke~~~~~~~~~~q~~~L~~~R~~~~~~~~~m~a~~~~~~t~~------------~~a~Na~a~lpr--D~W~e~D~ 66 (364) |-+.+..+. .+++++|-.-=..|+..+..++-+...+|.+. ..+.| .++|. ..| +.. T Consensus 19 ~~~~~~~~~-----~~~~~~l~~~gi~~~~~~~~~~~~~~~amd~~~~~~~~~~~~~l~~~~~--~g~~~~l~~~--~p~ 89 (379) T protein:vir:10 19 MVMDSADVT-----LDNLKHLESYGIHLNGRKNKLFELMQFAMDSNDIGPIPTPLSPLSPVSI--PGLIQFLQNW--LPG 89 (379) T ss_pred hhhcccccc-----HHHHHHHHhcCccccchhhhhhhhhhhhhccccccccccccCccccccc--cchHHHHHhh--cch Confidence 222222111 22334443332333333332222222222111 11111 23444 333 333 Q ss_pred HHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEE----eeCCCCCcccccceeeccCCccc------- Q lcl|NC_016762. 67 QIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSV----SIDGQAPYSFDHTEYNSDGDPIP------- 135 (364) Q Consensus 67 ~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~----SmdGq~~~~~D~~~Y~~dGtPiP------- 135 (364) .+-.+..++... +|.|+++ .||-.++..+ ...|++ ..| .||+-+| T Consensus 90 ~i~~~tap~~a~----~l~pv~t-------------~g~W~~~~~~~~v~e~~G~A------~~y-gd~~d~pl~d~~~~ 145 (379) T protein:vir:10 90 HVRILTAVREAD----EFLGLST-------------VGQWDDEQIVQRVLEGLGTA------QPY-TDGGNMALMSWTPT 145 (379) T ss_pred HHHHHhhhhhhh----hhccccc-------------CCCceeeeEEEeeeeeeeee------EEe-ccccCCCeeeeeee Confidence 332232333222 2333333 3441111111 112332 233 2333333 Q ss_pred -------eeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccc Q lcl|NC_016762. 136 -------VFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGS 208 (364) Q Consensus 136 -------If~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~ 208 (364) -|..||.++++|....+--|+++..+-+.+..+++.+++-..+|.|+. ..++..|||-|||+.....-.. T Consensus 146 ~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~---d~~~~~yGllNdP~l~a~~t~a 222 (379) T protein:vir:10 146 FETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYN---DGSGRTFGFLNDPNLPAYVAVP 222 (379) T ss_pred eeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeec---CCCcceEEEEeCCCCccccccc Confidence 355788889999998889999999999999999999999999999932 2478999999999986433233 Q ss_pred cCCCcceeecccCHHHHHHHHhhhhHHHHHhccc-c----ccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHH Q lcl|NC_016762. 209 GAGGANIDLTTATPQQIIDFFTKGAFGQAARTNK-V----DAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAV 283 (364) Q Consensus 209 ~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~-v----~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i 283 (364) +-.|.+-+..++|+++|++-+. .+|.++-...+ + ....+|-++|+....|.++ +.| ..||++.+ T Consensus 223 tg~~~~t~Wa~kT~~eI~~Di~-~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~---------n~~-g~Tvl~~l 291 (379) T protein:vir:10 223 NGAGGSPLWAQKTTLEIIADLR-NGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP---------TEL-GYSVAQYM 291 (379) T ss_pred CCcccccccccCCHHHHHHHHH-HHHHHHHHhhCCeecccccceeEEecHHHHHhhccc---------ccc-CccHHHHH Confidence 3345566667889999999876 46776654433 2 3466999999999999987 344 57999999 Q ss_pred hhc-cchhhhccccccCC-----CeEEEEEcCcce--------eeheecceeecccccccCC--CCchhhhhhhhcchhh Q lcl|NC_016762. 284 MRF-IPAREVRQTFALSG-----NEFLGYQRRRDV--------VSPLVGMATGVIPLPRPLP--QVNYNFQIMSAMGIQV 347 (364) Q Consensus 284 ~~~-~~V~~I~~~~~Ltg-----Ne~lg~v~~~dv--------I~plvGmp~gt~p~pR~~p--~~nY~f~v~~A~glqi 347 (364) .+- |.+ .|+....|.+ +.++.|.++.+- +.-.+-|++-+.|.-++.. .-+|..++||+ .| T Consensus 292 k~n~Pnl-~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv---~i 367 (379) T protein:vir:10 292 RESYPNV-TFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGA---ML 367 (379) T ss_pred HHhcCCc-EEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccceecCceeEeccccceeee---ee Confidence 974 444 3555555532 346666665442 2222233332222211111 11334444433 23 Q ss_pred hcccccceeeEeeccC Q lcl|NC_016762. 348 KKDDEGLSGVIYGANL 363 (364) Q Consensus 348 K~D~~G~sgVv~~~~l 363 (364) |.= ..++|..|- T Consensus 368 r~P----~Ai~~~~G~ 379 (379) T protein:vir:10 368 KRP----FATYRQTGA 379 (379) T ss_pred ecc----hhhheecCC Confidence 221 123333333 No 10 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=98.94 E-value=1.5e-11 Score=79.94 Aligned_cols=292 Identities=13% Similarity=0.045 Sum_probs=165.1 Q ss_pred HHHHHHHHHHHHHHHHHHH-------HH--HHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHH Q lcl|NC_016762. 12 PRLMGHFQELQANRNIWNN-------QN--AAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVN 82 (364) Q Consensus 12 ~~~~~q~~~L~~~R~~~~~-------~~--~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~n 82 (364) ++..++.++|-.-=..|.. .. -+|-|. ...|++...+. .++|.=+=.-+|.+++++..+... -. T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~---d~~~~~~t~~~-~g~~~~l~~~i~p~~~~~~~~~~~---~~ 73 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAA---DLSPHLSSTGS-SGIPNYLTTYVDPSVIDILVAPMK---AA 73 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhh---hhccccccCCC-cchHHHHHHhcccceeeehhhhhh---hh Confidence 2223344444332222321 11 111111 11222222222 246665555566666655222211 23 Q ss_pred HHHHhhhccchhHHHHHHHhhccCCCe--E----EEeeCCCCCcccccceeeccCCccce--------------eecCCc Q lcl|NC_016762. 83 DLLQVQTVLPIGKSAKLYNVVGDIADD--V----SVSIDGQAPYSFDHTEYNSDGDPIPV--------------FTAGYG 142 (364) Q Consensus 83 DLm~l~~~v~igktv~~y~~~gd~a~~--v----~~SmdGq~~~~~D~~~Y~~dGtPiPI--------------f~sgy~ 142 (364) .|.|+++ +|| |. . ...+.|++ ..| .||+-+|. |-.||. T Consensus 74 ~l~~v~t-------------~g~--W~~~~~~~~~~e~~G~a------~~y-gd~~D~P~vd~~~~~~~~~v~~~~~g~~ 131 (336) T protein:vir:78 74 ELVGESK-------------KGD--WTTLVAAFITAEPTTTV------ATY-GDYSSDGDSGTNINYPQRQSYFFQTWTR 131 (336) T ss_pred hhccccc-------------CCC--ccccEEEEeeeecceee------EEe-ecccCCCeeecceeeEEEEEEEEEeeee Confidence 4445544 233 21 1 11223332 234 34444443 558999 Q ss_pred cchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCH Q lcl|NC_016762. 143 VNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATP 222 (364) Q Consensus 143 ~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~ 222 (364) ++++|....+--|+++..+-+.+..+.+.+++-.+++.|+ .++..|||-|||+.....-++ ++ ...++|+ T Consensus 132 yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd-----~~~~~~GllN~P~l~a~~t~~--~~---~w~~~T~ 201 (336) T protein:vir:78 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGV-----AGLENYGLINDPSLSAPITAT--TP---WSGSPAV 201 (336) T ss_pred ecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEec-----cccceEEEEeCCCCCcccccC--cC---cccccCH Confidence 9999999999999999999999999999999999999996 468899999999987544222 11 1236899 Q ss_pred HHHHHHHhhhhHHHHHhccc----cccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc-cchhhhccccc Q lcl|NC_016762. 223 QQIIDFFTKGAFGQAARTNK----VDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF-IPAREVRQTFA 297 (364) Q Consensus 223 ~~i~~~f~~~~f~~~~~~N~----v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~-~~V~~I~~~~~ 297 (364) ++|++.+.+ +|.++....+ +-.+.+|.++|+-...|+++ +.| ..||++.+.+- |.+ .|..... T Consensus 202 ~~I~~Di~~-~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~---------n~~-g~tv~~~lk~n~Pnl-~i~t~pe 269 (336) T protein:vir:78 202 EAVVNEVVT-LFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT---------NQY-GLSAAAKLKEIFPKL-EFVTIPE 269 (336) T ss_pred HHHHHHHHH-HHHHHHHhcCCeeeeccceEEEechHHHHhccCC---------Ccc-CccHHHHHHHhcCcc-EEEEccc Confidence 999999874 7777755553 23578999999999999887 344 47999999974 333 2333333 Q ss_pred cC---CCeEEEEEcC---cceeeheecceeeccccccc----CCCCchhhhhhhhcchhhhcccccceeeEeeccC Q lcl|NC_016762. 298 LS---GNEFLGYQRR---RDVVSPLVGMATGVIPLPRP----LPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANL 363 (364) Q Consensus 298 Lt---gNe~lg~v~~---~dvI~plvGmp~gt~p~pR~----~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l 363 (364) |. |+-+..++.. .+.....+=|++ +..|.+ ...-+|.+++||+ .||. -..+.+..|+ T Consensus 270 l~~Agg~~~~~~~~~~~~~~t~~~~~p~~f--~~lpvq~~~~~~~v~~~~rt~Gv---~i~~----P~ai~~~~GI 336 (336) T protein:vir:78 270 YDTASGRLVQLWAPRVEGKDTATCGFTEKM--RAHSIERYSSYFRQKKSAGTWGA---VIFR----PFAVAQMIGV 336 (336) T ss_pred ccccCcceEEEEEeeccCCcceeeecchhh--hccceeecCceeEeccccceeee---eeec----cchheeeccC Confidence 33 6656555444 334443333333 223322 1223555555544 3332 2334455555 No 11 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=98.86 E-value=2.4e-11 Score=78.82 Aligned_cols=332 Identities=13% Similarity=0.037 Sum_probs=162.4 Q ss_pred CccchhhhhhhHH--HHH---HHHHHHHHHHH--HHHHHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhc Q lcl|NC_016762. 1 MFLTQQAIAAHPR--LMG---HFQELQANRNI--WNNQNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRN 73 (364) Q Consensus 1 ~~ftke~~~~~~~--~~~---q~~~L~~~R~~--~~~~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~ 73 (364) +-+.|-.+.+... +.. +|...+..+.. +......+.|.- ++.....++.| .++|.-+=.-||.+++++.. T Consensus 23 ~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~d-a~~~~~~t~~~--~gip~~~~~~~~p~~~~~~~ 99 (388) T protein:vir:99 23 NGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFD-SAYVAPTTQAS--IPTPIQFLQQWLPGFVKVLT 99 (388) T ss_pred cCCcceeeechhhHhhhhcceeccCccchhhhhhhhhhhhhhcccC-cccccccccCc--ccHHHHHhhhhccceeeeee Confidence 1111111111010 111 12111111111 111111111110 01101112222 36888888889999888843 Q ss_pred ccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCc-ccccceeeccCCccceeecCCccchhhhhhcC Q lcl|NC_016762. 74 QETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPY-SFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMS 152 (364) Q Consensus 74 q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~-~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~ 152 (364) +-- --.+|.|+++.=.=......|.+. |..|.+.+- |.... .+=-..+++.=-++=.+..||.++++|....+ T Consensus 100 ~p~---~~~~l~pv~t~g~W~~~~~~f~v~-e~~G~A~~y--gd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~ 173 (388) T protein:vir:99 100 SAR---KIDEILGVKTVGSWEDQEIVQGIV-EPAGTAMEY--GDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRAS 173 (388) T ss_pred chh---hhhhhccccccCCccceeEEEeee-ecceeEEEe--ecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHH Confidence 322 122444554421100001111111 212222211 11000 00001111122223345689999999999999 Q ss_pred ccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhh Q lcl|NC_016762. 153 TVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKG 232 (364) Q Consensus 153 t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~ 232 (364) --|+|+..+-+.+..|.+.+++-+.+|.|+.. -..+.+|||-|||+.-+.. +..-+|.+-...++|+++|++-+.. T Consensus 174 ~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g--~~~~~~yGllNdP~l~a~v-~at~~~~~~~Wa~kT~~eI~~Di~~- 249 (388) T protein:vir:99 174 AMRINSAEVKRQGAAVQLEIMRNAIGFYGWEG--KNGNRTFGFLNDPSLLPAI-ASTTPGGWVSGGANAFQGIVGDLRL- 249 (388) T ss_pred hhCCCcHHHHHHHHHHHHHhhhceEEEEeecC--CCccceEEEeeCCCccccc-ccccCCcCcccccCCHHHHHHHHHH- Confidence 99999999999999999999999999999641 1125799999999975432 1111222333457899999998874 Q ss_pred hHHHHHhccc-c----ccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc-cchhhhccccccC-----CC Q lcl|NC_016762. 233 AFGQAARTNK-V----DAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF-IPAREVRQTFALS-----GN 301 (364) Q Consensus 233 ~f~~~~~~N~-v----~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~-~~V~~I~~~~~Lt-----gN 301 (364) +|.++-...+ + ..+-+|=++|+-+..|+++ +.| ..||++.|.+- |.+ .|+....|. |+ T Consensus 250 ~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~---------n~~-g~Tvl~~lk~n~Pnl-~i~t~pEl~~a~~tgg 318 (388) T protein:vir:99 250 MLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV---------TDL-GISVRDWLKQTYPRV-RVMSAPELQGGNPDDG 318 (388) T ss_pred HHHHHHHhcCCeeeecccceEEEechHHHHhcccc---------CcC-CccHHHHHHHhcCCc-EEEEecccccccccCC Confidence 7777644443 2 2234799999999999877 334 57999999985 333 344444443 33 Q ss_pred eEEEEEcCcceeeheecce----eeccccccc-----------CCCCchhhhhhhhcchhhhcccccceeeEeeccC Q lcl|NC_016762. 302 EFLGYQRRRDVVSPLVGMA----TGVIPLPRP-----------LPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANL 363 (364) Q Consensus 302 e~lg~v~~~dvI~plvGmp----~gt~p~pR~-----------~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l 363 (364) .-+.+-..+++-....|+| +.....|++ ...-+|..++|| ..||. -..++|..|+ T Consensus 319 ~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~G---v~ir~----P~Ai~~~~GI 388 (388) T protein:vir:99 319 KDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAG---VMLKR----PWAVVRLIGL 388 (388) T ss_pred ceeEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceee---eEEec----cchhheeccC Confidence 3333333333332222222 222222221 112233334442 22332 2234555555 No 12 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=98.85 E-value=5.1e-11 Score=76.99 Aligned_cols=310 Identities=12% Similarity=0.049 Sum_probs=166.4 Q ss_pred HHHHHHHHHHHHHHHHHH-------H--HHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHH Q lcl|NC_016762. 12 PRLMGHFQELQANRNIWN-------N--QNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVN 82 (364) Q Consensus 12 ~~~~~q~~~L~~~R~~~~-------~--~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~n 82 (364) +.-.+..++|-.-=-.|+ . .+-+|-|. .-+|++..+.. +++|.=+=.=+|.+++++..+.-. -. T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~---d~~~~~~~~~~-~~i~~~l~~~i~p~~~~~~~~p~~---a~ 73 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAA---DLSPHLSSTGS-SGIPNYLTTYVDPAVIDILVAPMK---AA 73 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhhHHHhhhhhh---hccCccccCCC-chhHHHHHhhcccceeeehhhhhh---hh Confidence 111233333322111121 1 11222221 22233333333 357764444566777665322211 12 Q ss_pred HHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcc-cccceeeccCCccceeecCCccchhhhhhcCccccchhhh Q lcl|NC_016762. 83 DLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS-FDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLD 161 (364) Q Consensus 83 DLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~-~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D 161 (364) .|.|+++.=.=......|++ -|..|.+.+- |..... +=-..++++=.+|..|-+||.++++|....+--|+|+..+ T Consensus 74 ~l~pv~t~g~W~~~~~~~~~-~e~~G~a~~y--gd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~ 150 (336) T protein:vir:10 74 ELVGESKKGDWTTLVAAFIT-AEPTTKVATY--GDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) T ss_pred hhccccccCCccceeEEEee-eeceeeEEEe--eccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHH Confidence 34555442111101111221 2223333322 211100 0011123333456677899999999999999999999999 Q ss_pred hHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeec-ccCHHHHHHHHhhhhHHHHHhc Q lcl|NC_016762. 162 SQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLT-TATPQQIIDFFTKGAFGQAART 240 (364) Q Consensus 162 ~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlt-tA~~~~i~~~f~~~~f~~~~~~ 240 (364) -+.+..|.+.+++-.+++.|+ .++..|||-|||+..+..-.+ -+-| ++|+++|++-+. .+|.++... T Consensus 151 Ka~aA~~ale~~~N~i~~~Gd-----~~~~~yGllN~P~l~a~~t~~------t~~~~~~t~eei~~Di~-~~~~~l~~q 218 (336) T protein:vir:10 151 LNYSSALGLAKFLNGSYLFGV-----AGLENYGLINDPSLSAPITAT------TPWSGSPAVEAVVNEVV-ALFQVLQTQ 218 (336) T ss_pred HHHHHHHHHHHhhCcEEEEec-----cccceEEEEeCCCCccccccC------CCcccccCHHHHHHHHH-HHHHHHHHh Confidence 999999999999999999996 468999999999986433211 1222 568899988887 478887764 Q ss_pred cc----cccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc-cchhhhccccccC---CCeEEEEEcC--- Q lcl|NC_016762. 241 NK----VDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF-IPAREVRQTFALS---GNEFLGYQRR--- 309 (364) Q Consensus 241 N~----v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~-~~V~~I~~~~~Lt---gNe~lg~v~~--- 309 (364) .+ .-.+.+|.++|+-...|+++ +.| ..||++.+.+- |.+ .|+....|. |+.+..+... T Consensus 219 s~G~i~~~~~~tL~LP~~~~~~Ls~~---------n~~-g~Tvl~~lk~n~Pnl-~i~t~pEl~~a~G~~~~l~~~~~~~ 287 (336) T protein:vir:10 219 SQGIITQEDVLRMGLPPTAMSDLSKT---------NQY-GLAAAAKLKDIFPKL-EFVTIPEYDTASGRLVQLWAPRVEG 287 (336) T ss_pred cCCeecccCcceEEecHHHHHhccCC---------Ccc-CccHHHHHHHhcCcc-EEEEccccccCCCceEEEEEEecCC Confidence 43 23589999999999999887 344 57999999985 444 344444444 6666555433 Q ss_pred cceeeheecceeecccccccC----CCCchhhhhhhhcchhhhcccccceeeEeeccC Q lcl|NC_016762. 310 RDVVSPLVGMATGVIPLPRPL----PQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANL 363 (364) Q Consensus 310 ~dvI~plvGmp~gt~p~pR~~----p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l 363 (364) .+.... ++|.-.+..|-+. ...+|..++||+ .||. -..+.+..|+ T Consensus 288 ~~t~~~--~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv---~i~~----P~ai~~~~GI 336 (336) T protein:vir:10 288 KDTATC--GFTEKMRAHSIERYSSYFRQKKSAGTWGA---VIFR----PFAVAQMIGV 336 (336) T ss_pred Ccceee--ecchhhhccceeecCceeEeccccceeee---eeec----cchheeeecC Confidence 222222 2332122222111 112334444433 3332 2234555555 No 13 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=98.76 E-value=1.1e-10 Score=75.15 Aligned_cols=310 Identities=12% Similarity=0.053 Sum_probs=165.0 Q ss_pred HHHHHHHHHHHHHHHHHHH---------HHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHH Q lcl|NC_016762. 12 PRLMGHFQELQANRNIWNN---------QNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVN 82 (364) Q Consensus 12 ~~~~~q~~~L~~~R~~~~~---------~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~n 82 (364) +.-.+..++|-.-=-.|+. .+-+|-|. .-+|++..+ ..+++|.=+=.-+|.++++...+. +. -. T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~---d~~~~~~~~-~~~~~~~~l~~~i~p~~~~~~~~~--~~-~~ 73 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAA---DLSPHLSST-GSSGIPNYLTTYVDPSVIDILVAP--MK-AA 73 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHHhhhhhh---hccCccccC-CCcchHHHHHHhhccceEeeecch--hh-hh Confidence 1122333333221112211 11122121 122333333 233577655455665555552222 11 12 Q ss_pred HHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcc-cccceeeccCCccceeecCCccchhhhhhcCccccchhhh Q lcl|NC_016762. 83 DLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS-FDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLD 161 (364) Q Consensus 83 DLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~-~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D 161 (364) .|.|+++.=.=......|++ -|..|.+.+- |..... +=-..++++=.+|..|-.||.++++|....+--|+|+..+ T Consensus 74 ~l~pv~t~g~W~~~~~~~~~-~e~~G~a~~y--gd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~ 150 (336) T protein:vir:36 74 ELVGESKKGDWTTLVAAFIT-AEPTTKVATY--GDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) T ss_pred hhccccccCCccceeEEEee-eeceeeEEEe--eccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHH Confidence 45555542111101111221 2223333322 211100 0011123333456677899999999999999999999999 Q ss_pred hHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeec-ccCHHHHHHHHhhhhHHHHHhc Q lcl|NC_016762. 162 SQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLT-TATPQQIIDFFTKGAFGQAART 240 (364) Q Consensus 162 ~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlt-tA~~~~i~~~f~~~~f~~~~~~ 240 (364) -+.+..|.+.+++-.+++.|+ .++..|||-|||+.....-.+ -+-| ++|+++|++-+.+ +|.++... T Consensus 151 Ka~aA~~ale~~~N~i~~~Gd-----~~~~~yGllNdP~l~a~~t~~------t~~~~~~t~~ei~~Di~~-~~~~l~~q 218 (336) T protein:vir:36 151 LNYSSALGLAKFLNGSYLFGV-----AGLENYGLINDPSLSAPITAT------TPWSGSPAVEAVVNEVVA-LFQVLQTQ 218 (336) T ss_pred HHHHHHHHHHHhhCcEEEEec-----cccceEEEEecCCCccccccC------CCcccccCHHHHHHHHHH-HHHHHHHh Confidence 999999999999999999996 468999999999986433211 1222 5688999888874 78777765 Q ss_pred cc----cccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc-cchhhhccccccC---CCeEEEEEcC--- Q lcl|NC_016762. 241 NK----VDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF-IPAREVRQTFALS---GNEFLGYQRR--- 309 (364) Q Consensus 241 N~----v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~-~~V~~I~~~~~Lt---gNe~lg~v~~--- 309 (364) .+ +-.+.+|.++|+-...|+++ +.| ..||++.+.+- |.+ .|+....|. |+.+..+... T Consensus 219 t~G~i~~~~~~tL~LP~~~~~~Ls~~---------n~~-g~Tvl~~lk~n~Pnl-~i~t~pEl~~a~g~~~~l~~~~~~~ 287 (336) T protein:vir:36 219 SQGIITQEDVLRMGLPPTAMSDLSKT---------NQY-GLAAAAKLKDIFPKL-EFVTIPEYDTASGRLVQLWAPRVEG 287 (336) T ss_pred cCCeeeeccccEEEechHHHHhccCC---------Ccc-CccHHHHHHHhcCcc-EEEEccccccCCCceEEEEEEecCC Confidence 53 24589999999999999887 344 57999999985 444 333344443 6666555433 Q ss_pred cceeeheecceeecccccccC----CCCchhhhhhhhcchhhhcccccceeeEeeccC Q lcl|NC_016762. 310 RDVVSPLVGMATGVIPLPRPL----PQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANL 363 (364) Q Consensus 310 ~dvI~plvGmp~gt~p~pR~~----p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l 363 (364) .+.... ++|.-.+..|-+. ...+|..++||+ .||. -..+.+..|+ T Consensus 288 ~~t~~~--~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv---~i~~----P~ai~~~~GI 336 (336) T protein:vir:36 288 KDTATC--GFTEKMRAHSIERYSSYFRQKKSAGTWGA---VIFR----PFAVAQMIGV 336 (336) T ss_pred Ccceee--ecchhhhccceeecCceeEeccccceeee---eeec----cchheeeecC Confidence 222222 2332122222111 112334444433 3332 2234555555 No 14 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=98.62 E-value=3.8e-10 Score=72.23 Aligned_cols=294 Identities=13% Similarity=0.052 Sum_probs=163.3 Q ss_pred HHHHHHHHHHHHHHHHHHH-------HH--HHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHH Q lcl|NC_016762. 12 PRLMGHFQELQANRNIWNN-------QN--AAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVN 82 (364) Q Consensus 12 ~~~~~q~~~L~~~R~~~~~-------~~--~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~n 82 (364) ++..++.++|-.-=..|.. .. -+|-|. ...|++..... .++|.=+=.-+|.+++++.-+... .. T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~---d~~~~~~t~~~-~g~~~~l~~~i~p~~~~~~~~~~~---~~ 73 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAA---DLSPHLSSTGS-SGIPNYLTTYVDPSVIDILVAPMK---AA 73 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhh---hhccccccCCC-cchHHHHHhhcCcceeeeeechhc---hh Confidence 2223334444332222321 11 111111 11222222222 245664445556565555222211 22 Q ss_pred HHHHhhhccchhHHHHHHHhhccCCCeEE----EeeCCCCCcccccceeeccCCccce--------------eecCCccc Q lcl|NC_016762. 83 DLLQVQTVLPIGKSAKLYNVVGDIADDVS----VSIDGQAPYSFDHTEYNSDGDPIPV--------------FTAGYGVN 144 (364) Q Consensus 83 DLm~l~~~v~igktv~~y~~~gd~a~~v~----~SmdGq~~~~~D~~~Y~~dGtPiPI--------------f~sgy~~~ 144 (364) +|.|+++ +||-.++.. ..+.|++ ..| -|++-+|. |-.||.++ T Consensus 74 ~l~~v~t-------------~g~w~~~~~~~~~~e~~G~a------~~y-gd~~d~P~~d~~~~~~~~~v~~~~~g~~yg 133 (336) T protein:vir:10 74 ELVGESK-------------KGDWTTLVAAFITAEPTTKV------ATY-GDYSSDGDSGTNINYPQRQSYFFQTWTRWG 133 (336) T ss_pred hhccccc-------------CCCcceeeEEEEeeeeeeeE------EEc-cccCCCcceeeeeeeeeeeEEEEEEEEeeC Confidence 3444443 344111111 1233443 344 24455554 44677888 Q ss_pred hhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHH Q lcl|NC_016762. 145 WRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQ 224 (364) Q Consensus 145 WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~ 224 (364) =+|....+--|+++..+-+.+..+.+.+++-.+++-|+. ++..|||-|||+.-...-++ ++ ...++|+++ T Consensus 134 ~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~-----~~~~~GllN~P~l~a~~t~~--~~---~w~~~T~~e 203 (336) T protein:vir:10 134 ERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-----GLENYGLINDPSLSAPITAT--TP---WSGSPAVEA 203 (336) T ss_pred HHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeec-----ccceEEEeecCCCCcccccC--cC---cccccCHHH Confidence 889888889999999999999999999999999999954 68899999999987544222 11 123688999 Q ss_pred HHHHHhhhhHHHHHhccc----cccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc-cchhhhccccccC Q lcl|NC_016762. 225 IIDFFTKGAFGQAARTNK----VDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF-IPAREVRQTFALS 299 (364) Q Consensus 225 i~~~f~~~~f~~~~~~N~----v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~-~~V~~I~~~~~Lt 299 (364) |++.+.+ +|.++....+ +-.+.+|.++|+-...|+++ +.| ..||++.+.+- |.+ .|.....|. T Consensus 204 I~~Di~~-~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~---------n~~-g~tv~~~lk~n~Pnl-~i~t~pel~ 271 (336) T protein:vir:10 204 VVNEVVT-LFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT---------NQY-GLSAAAKLKEIFPKL-EFVTIPEYD 271 (336) T ss_pred HHHHHHH-HHHHHHHhcCCeeeeccceEEEechHHHHhccCC---------Ccc-CccHHHHHHHhCCcc-EEEEccccc Confidence 9999874 7777755553 23578999999999999876 344 47999999974 333 244444443 Q ss_pred ---CCeEEEEEcC---cceeeheecceeecccccccC----CCCchhhhhhhhcchhhhcccccceeeEeeccC Q lcl|NC_016762. 300 ---GNEFLGYQRR---RDVVSPLVGMATGVIPLPRPL----PQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANL 363 (364) Q Consensus 300 ---gNe~lg~v~~---~dvI~plvGmp~gt~p~pR~~----p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l 363 (364) |+-+..+++. .+..... +|.-.+..|.+. ...+|.+++||+ .||+ -..+.+..|+ T Consensus 272 ~Agg~~~~~~~~~~~~~~t~~~~--~P~~f~~lpvq~~~~~~~v~~~~rt~Gv---~i~r----P~ai~~~~GI 336 (336) T protein:vir:10 272 TASGRLVQLWAPRVEGKDTATCG--FTEKMRAHSIERYSSYFRQKKSAGTWGA---VIFR----PFAVAQMLGV 336 (336) T ss_pred ccCCceEEEEEecccCCcceeee--cChhhhccceeecCceeEeccccceeee---eeec----cchheeeccC Confidence 6666665444 2333333 333223333322 223455555544 3332 2234445555 No 15 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=98.28 E-value=4.3e-08 Score=60.98 Aligned_cols=322 Identities=12% Similarity=0.009 Sum_probs=163.5 Q ss_pred CccchhhhhhhHHHHHHHHHHHHHHHHHHH-----HHHHHHHH----hhcccch---hhhhhhhhccccHHHHHHHHHHH Q lcl|NC_016762. 1 MFLTQQAIAAHPRLMGHFQELQANRNIWNN-----QNAAMLAE----HRGAMTP---GMLACNALAGLGREFWAEIDAQI 68 (364) Q Consensus 1 ~~ftke~~~~~~~~~~q~~~L~~~R~~~~~-----~~~~m~a~----~~~~~t~---~~~a~Na~a~lprD~W~e~D~~~ 68 (364) .=|.+..+ -+...++|..-=-.|+. +-.+++.. ...+|-+ +..... ..++|.-+=.-||.++ T Consensus 17 ~~~~~~~~-----~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~-~~g~p~~~l~~~~p~~ 90 (382) T protein:vir:96 17 KPFDLKNV-----THEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTP-SIPTPIQFLQTWLPGF 90 (382) T ss_pred cchhhhcc-----cHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCccccC-CccHHHHHHhhhhhhh Confidence 00111111 01112222222122211 00111111 1112221 111112 3357899999999999 Q ss_pred HHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcc-cccceeeccCCccceeecCCccchhh Q lcl|NC_016762. 69 IQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS-FDHTEYNSDGDPIPVFTAGYGVNWRH 147 (364) Q Consensus 69 ~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~-~D~~~Y~~dGtPiPIf~sgy~~~WR~ 147 (364) +++..+... -.+|.|+++.=.=......|. +-|..|.+.+- |..... +=-..+++.=-++-.+..||.++=+| T Consensus 91 ~~~~~~p~~---~~~l~pv~t~g~W~~~t~ty~-~~e~~G~A~~y--gd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~lE 164 (382) T protein:vir:96 91 VKVMTAARK---IDEIIGIDTVGSWEDQEIVQG-IVEPAGTAVEY--GDHTNIPLTSWNANFERRTIVRGELGLLVGTLE 164 (382) T ss_pred hhhhhhhhh---hhhhccccccCCccceEEEEe-eeecccceEEe--ecccCCCccccccceeEEEEEEEEEeeeecHHH Confidence 888443321 124455544211000011111 11222322221 111000 00011112223345556666666566 Q ss_pred hhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecC--eeeeeecCCCccccccccccCCCcceeecccCHHHH Q lcl|NC_016762. 148 AAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVEN--YPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQI 225 (364) Q Consensus 148 ~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~--~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i 225 (364) ..-..--|+|+..+-+.+..+.+.+++-+.+|.|+. +| ...|||-|||+...... +++-+..++|+++| T Consensus 165 ~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~----~g~~~~~yGllNdP~l~a~~t-----~a~~~Wa~kT~~eI 235 (382) T protein:vir:96 165 EGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQ----SGLGNRTYGFLNDPNLPPFQT-----PPSQGWATADWAGI 235 (382) T ss_pred HHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeee----cCcCcceEEEEeCCCcccccc-----cCCCCcccccHHHH Confidence 666666699999999999999999999999999973 33 67899999998542222 33444668899999 Q ss_pred HHHHhhhhHHHHHhcccc-----ccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhc-cchhhhccccccC Q lcl|NC_016762. 226 IDFFTKGAFGQAARTNKV-----DAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRF-IPAREVRQTFALS 299 (364) Q Consensus 226 ~~~f~~~~f~~~~~~N~v-----~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~-~~V~~I~~~~~Lt 299 (364) ++-+.. ++.++.....- ..+-+|=++|+-+..|+++ +.| ..||++.+.+- |.+ .|+....|. T Consensus 236 ~~Di~~-l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~---------n~~-g~Tvl~~lk~n~Pnl-~i~t~peL~ 303 (382) T protein:vir:96 236 IGDIRE-AVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT---------TPY-GISVSDWIEQTYPKM-RIVSAPELS 303 (382) T ss_pred HHHHHH-HHHHHHhccCCeeeecccceEEeechHHHhhcccc---------Ccc-CccHHHHHHHhcCCc-EEEEccccc Confidence 998874 77777554431 2344788999999988876 344 57999999986 444 566666664 Q ss_pred -------CCeEEEEEcCccee-----eheecceeecccccccCCCCchhhhhhhh--cchhhhcccccc-ee-------- Q lcl|NC_016762. 300 -------GNEFLGYQRRRDVV-----SPLVGMATGVIPLPRPLPQVNYNFQIMSA--MGIQVKKDDEGL-SG-------- 356 (364) Q Consensus 300 -------gNe~lg~v~~~dvI-----~plvGmp~gt~p~pR~~p~~nY~f~v~~A--~glqiK~D~~G~-sg-------- 356 (364) |..-+.|...+++- ++-..||+.-.. | =.|++.+. .++-.|.+..++ .| T Consensus 304 ~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~--p------~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~a 375 (382) T protein:vir:96 304 GVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLV--Q------SKFITLGVEKRAKSYVEDFSNGTAGALCKRPWA 375 (382) T ss_pred cccCCCccceeEEEEecchhhhhcccccccCcceeccc--c------ceeeeccceeecceeEeccccceeeeEEEcchh Confidence 24667777777653 444444442100 0 00111111 223333333222 22 Q ss_pred eEeeccC Q lcl|NC_016762. 357 VIYGANL 363 (364) Q Consensus 357 Vv~~~~l 363 (364) ++|..|+ T Consensus 376 i~~~~GI 382 (382) T protein:vir:96 376 VVRYLGI 382 (382) T ss_pred hhhccCC Confidence 3344444 No 16 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=79.94 E-value=0.067 Score=27.02 Aligned_cols=297 Identities=16% Similarity=0.103 Sum_probs=124.8 Q ss_pred HHHHHHHHHhhcccchhhhhhhhhcccc-HHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCC Q lcl|NC_016762. 30 NQNAAMLAEHRGAMTPGMLACNALAGLG-REFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIAD 108 (364) Q Consensus 30 ~~~~~m~a~~~~~~t~~~~a~Na~a~lp-rD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~ 108 (364) |. .+++---.+....-+.|+.++.. .=|=++|...+..-|+.. .+|-+|..+. .+.-||||+...+ |+..= T Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~---s~~~~~~~~r-~i~~G~tv~i~~i-g~~~~ 72 (332) T protein:vir:78 1 MT---TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNA---SIFKGLVRSY-DLRGGKSKQFMFT-GKLSA 72 (332) T ss_pred Cc---ccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHH---hhhhhccccc-cccccceEEEEec-cceeE Confidence 00 00000000000111345554431 123366677777776633 6677887774 4556888776554 44100 Q ss_pred eEE---EeeCCCCCcccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCce Q lcl|NC_016762. 109 DVS---VSIDGQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNI 185 (364) Q Consensus 109 ~v~---~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I 185 (364) ..+ .++.++. .+++++-. |-|=+.-|--.+=+-+---..-+|+++.--.++-.++.+++-.+++.=-.+ T Consensus 73 ~~~~~g~~l~~~~--~~~~~~~~-----l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~- 144 (332) T protein:vir:78 73 GYHTPGTPIVGDA--GIKANEKT-----LVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAK- 144 (332) T ss_pred eeecCCCCCCCCC--CCCCceEE-----EEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHh- Confidence 000 0122211 12222111 222221221112111122244567877777788888888888777642111 Q ss_pred eecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccce-EEEEcHHHHHhhhc--- Q lcl|NC_016762. 186 QVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYD-VLWVSPEINANLAQ--- 261 (364) Q Consensus 186 ~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~-~lyvS~eI~~N~~~--- 261 (364) ..-+......+|-...+.++. ++ +-+++.+++.+.. ++ +.++.++|.... .+.|+||....|.+ T Consensus 145 --aa~~~~~~~~~~g~~~~~~~~--~~------~~~~~~~~~~i~~-a~-~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d 212 (332) T protein:vir:78 145 --ASAEASPVTGEPGGFHVNIGA--GN------TNDAQAIVDGFFE-AA-AVLDERSAPQEGRVAVLSPRQYYSLISSVD 212 (332) T ss_pred --hhcccCcccccccccccccCC--cc------ccCHHHHHHHHHH-HH-HHHhhcCCCccCCEEEeCHHHHHHHHhhcC Confidence 000111111122222233221 11 2367888888763 34 456666675444 57789999999964 Q ss_pred ceeeec-ccc-cccccchhHHHHHhhccchhhhccccccCCCeE----------------------EEEEcCcceeehee Q lcl|NC_016762. 262 PYMITM-GGG-ANAVVAGTVLDAVMRFIPAREVRQTFALSGNEF----------------------LGYQRRRDVVSPLV 317 (364) Q Consensus 262 py~~~~-~~~-S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~----------------------lg~v~~~dvI~plv 317 (364) +=+.+. ..+ +.....|++ |.++.|+. |.++..||.+-. .+++..++.+.-.. T Consensus 213 ~~~~n~~~~~~~~~~~~g~~---i~~i~G~~-V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~ 288 (332) T protein:vir:78 213 TNILNREIGNSQGDMNSGKG---LYSIAGIR-ILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQ 288 (332) T ss_pred ceeeeeeccccccceeccee---eeEEeeeE-EEecCccccCcccccccccccccccccccccccceEEeecccceeeee Confidence 333221 112 222334433 34444443 677777764321 12222222221111 Q ss_pred cceeecccccccCCCCchh-hhh--hhhcchhhhcccccceeeEeec Q lcl|NC_016762. 318 GMATGVIPLPRPLPQVNYN-FQI--MSAMGIQVKKDDEGLSGVIYGA 361 (364) Q Consensus 318 Gmp~gt~p~pR~~p~~nY~-f~v--~~A~glqiK~D~~G~sgVv~~~ 361 (364) .+.+- ...-|.....+|. ..| .-++|- |-.----.++++++ T Consensus 289 ~~~~~-~~~t~~~~~~~~~~d~i~~~~~~G~--~v~rPe~~v~l~~a 332 (332) T protein:vir:78 289 SVAPT-IQTTSGDFNVQYQGDLIVGKLAMGC--GSLRTSVAGSFQAA 332 (332) T ss_pred eeccc-hhhhhcccchhhhHhhhhhhhhhcC--ceecccceEEEeeC Confidence 11110 0111212222222 222 113331 22222335566666 No 17 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=67.92 E-value=0.18 Score=24.66 Aligned_cols=266 Identities=12% Similarity=0.086 Sum_probs=117.6 Q ss_pred cchh----hhhhhhhccccHHHH-HHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEE---ee Q lcl|NC_016762. 43 MTPG----MLACNALAGLGREFW-AEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSV---SI 114 (364) Q Consensus 43 ~t~~----~~a~Na~a~lprD~W-~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~---Sm 114 (364) |+.. .-+.|+.+.- .++. +++..++.+-|+. -+++-+++.+ +.|.=|||++.+.+ |...=+.+. ++ T Consensus 1 Ms~~n~~t~~~~~~s~~~-~al~le~f~geV~taF~~---~si~~~~~~v-rti~~GkS~qf~~i-G~~~a~y~~~G~~l 74 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEV-DSLLIEKFNGKVNEQYLK---GENILSYFDV-QTVTGTNTVSNKYL-GETELQVLAPGQSP 74 (402) T ss_pred CCCcccccccccccccch-hhhhhhhhhhhHHHHHHH---HHhhcCccee-eeecccceEEEEEE-eeeEEeeecccccc Confidence 3332 1122332222 4444 5566666666544 4677788887 57888888886655 442111222 45 Q ss_pred CCCCCcccccceeeccCCccceeecCCccchhhhhhcCccccc-hhhhhHHHHHHHHHHHHHhhhhccCCceeecCe-ee Q lcl|NC_016762. 115 DGQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGID-LVLDSQAAKLRKFNKRIVAYTLDGATNIQVENY-PA 192 (364) Q Consensus 115 dGq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D-~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~-t~ 192 (364) +|+.+. -|+.+-.-|+.- |..+|=.-+---..-|| +++.=-.+.=..+.+.+=++++ +-|...+. .+ T Consensus 75 dg~~~~-~~k~~ItID~lL-------~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii---~~i~~aa~a~t 143 (402) T protein:vir:97 75 NATPTQ-ADKNQLVIDTTV-------IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAI---QQMLLGGIANT 143 (402) T ss_pred CCCCcc-cccEEEEeCcee-------echhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHH---HHHHHhhcccc Confidence 554333 344444444422 11111100000022344 2222112223344444433332 11111110 01 Q ss_pred eeecCCCccccccccccCCCcceeec------ccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc-eee Q lcl|NC_016762. 193 QGLRNHRNTIKVNLGSGAGGANIDLT------TATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP-YMI 265 (364) Q Consensus 193 ~GLrnh~n~~~~~lg~~~gGaNidlt------tA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p-y~~ 265 (364) .++..-|.... + |.++..+ ..+++.|+.+|. .++.++-+.+=-...-..+|+|+.+..|.+- -+. T Consensus 144 ~~~~~~~~~~~--~-----g~s~~~~~t~~~a~~~~~~l~~ai~-~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~ 215 (402) T protein:vir:97 144 KAERNKPRVKG--H-----GFSINVNVTESEALANPQYVMAAVE-YALEQQLEQEVDISDVAIMMPWKFFNALRDADRIV 215 (402) T ss_pred ccccccCcccc--c-----ccccccccccchhhcCHHHHHHHHH-HHHHHHHhcCCCccccEEEeChHHHHHHhhccccc Confidence 11111111110 1 1122222 247788888877 4688888777777888999999999998864 111 Q ss_pred ec---ccccccccchhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhh Q lcl|NC_016762. 266 TM---GGGANAVVAGTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSA 342 (364) Q Consensus 266 ~~---~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A 342 (364) +. .+++..+..|.| ..+.|| +|.++..|+.. |..+..-++........|+ T Consensus 216 n~d~~~~~~g~~~~G~v----~~v~Gv-~Vv~SnnlP~~----------------a~~it~~~ls~a~~G~~y~------ 268 (402) T protein:vir:97 216 DKTYTISQSGATINGFV----LSSYNC-PVIPSNRFPTF----------------AQDQAHHLLSNEDNGYRYD------ 268 (402) T ss_pred chhhccccCCcccccee----EEEece-EEEecCccccc----------------cccccccccccCCCCccCC------ Confidence 11 122333443333 334554 34556555521 1111111122222233344 Q ss_pred cchhhhcccccceeeEeecc-CC Q lcl|NC_016762. 343 MGIQVKKDDEGLSGVIYGAN-LA 364 (364) Q Consensus 343 ~glqiK~D~~G~sgVv~~~~-l~ 364 (364) ++.|+...+++++=-. |. T Consensus 269 ----~t~d~t~~~~~~f~~~Av~ 287 (402) T protein:vir:97 269 ----PIAEMNGAVAVLFTSDALL 287 (402) T ss_pred ----cCcccceeEEEEEecceEE Confidence 4466666666664221 11 No 18 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=62.79 E-value=0.28 Score=23.61 Aligned_cols=275 Identities=15% Similarity=0.166 Sum_probs=122.2 Q ss_pred cchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeC------- Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSID------- 115 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~Smd------- 115 (364) |+ .++-+=|. .+.. ..+.+.++..|.++.. +| |+|+--. +....+.|++.--+.+.--+.+. T Consensus 1 mp-altLaea~-k~~~---d~l~~~ViE~~~~~s~--lL-~~LpF~~---veg~~~~ynR~~~~~~~~~~~v~~~~~~~g 69 (310) T protein:vir:97 1 MA-SVTLAESA-KLAQ---DELVAGVIENIITVNR--MF-DVLPFDS---IEGNSLAYNRENVLGDVIMAGVGTTFSGAG 69 (310) T ss_pred Cc-ccchHHHh-hcCc---chHHHHHHHHHhccch--HH-HhCCccc---ccCCcceeeEeeccCCcccccccccccCCC Confidence 32 11111111 1111 1346677777655433 33 4444211 22223444433211121111111 Q ss_pred -CCCCcccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeee Q lcl|NC_016762. 116 -GQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQG 194 (364) Q Consensus 116 -Gq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~G 194 (364) .++++.+++.+|. +=|+-+.+.++-|-...-+++..|.+.---..+++.++++.++=++|||..-+ .-+| T Consensus 70 ~~~~~~t~~~~~~~-----L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n----~F~G 140 (310) T protein:vir:97 70 AGKAAATFTKVNSN-----LTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGN----EFAG 140 (310) T ss_pred ccccccccceeeee-----eeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCC----cccc Confidence 2566666666665 55788888888876554445555655555566779999999999999998642 4668 Q ss_pred ecCC-CccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHH---HHhhhcceeeecccc Q lcl|NC_016762. 195 LRNH-RNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEI---NANLAQPYMITMGGG 270 (364) Q Consensus 195 Lrnh-~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI---~~N~~~py~~~~~~~ 270 (364) |... .-...++.|+. |.. +|-++.|+++++.-. +.-+++-+..+|.. ++.+.|-- + T Consensus 141 L~~~~~~~q~i~~~~~--gg~--~t~d~LDeLl~~v~~----------~~g~p~~~l~~~~~~r~i~A~~R~~------~ 200 (310) T protein:vir:97 141 LIQLCASGQKATTGAT--GSA--ISFAILDELMDLVVD----------KDGQVDYLTMHARTLRSYKALLRAL------G 200 (310) T ss_pred hhhcCCccceeecCCC--CCC--CCHHHHHHHHHHHhc----------CCCCCCEEEecHHHHHHHHHHHHHh------c Confidence 8654 22344555543 333 333466666665531 11346789999964 66666632 1 Q ss_pred cccccchhHHH---HHhhccchhhhccccccCCCe----------EEEEEcCcc-eeeheecceeecc--cccccCC--- Q lcl|NC_016762. 271 ANAVVAGTVLD---AVMRFIPAREVRQTFALSGNE----------FLGYQRRRD-VVSPLVGMATGVI--PLPRPLP--- 331 (364) Q Consensus 271 S~~~~~gTIl~---~i~~~~~V~~I~~~~~LtgNe----------~lg~v~~~d-vI~plvGmp~gt~--p~pR~~p--- 331 (364) ..++-+-++.. .|..+.|| .|.+...++.++ ++.+-.-.| ..+=++|-+.+-- ...|.+- T Consensus 201 ~~g~~~~~~~~~G~~v~~~~Gi-Pi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~ 279 (310) T protein:vir:97 201 GASINEVVELPSGAEVPAYSGT-PIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESE 279 (310) T ss_pred CCCCCCccccCCCCEEeeeCCe-EEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCccc Confidence 12332333322 34555555 244445555443 222222111 1112223221110 0112211 Q ss_pred -CCchhhhh-hhhcchhhhcc--cccceeeEe Q lcl|NC_016762. 332 -QVNYNFQI-MSAMGIQVKKD--DEGLSGVIY 359 (364) Q Consensus 332 -~~nY~f~v-~~A~glqiK~D--~~G~sgVv~ 359 (364) ..=+-|+| | =.|+-++.+ .-...+|-+ T Consensus 280 ~~~v~~~~V~~-Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 280 DSDEHIWRVKW-YCGLALFSEKGLACADGITN 310 (310) T ss_pred CCcceeEEEEE-eeeEEEecccceeeeccccC Confidence 11111222 1 011111111 111222222 No 19 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=58.06 E-value=0.42 Score=22.64 Aligned_cols=297 Identities=10% Similarity=-0.006 Sum_probs=120.3 Q ss_pred CccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhH Q lcl|NC_016762. 1 MFLTQQAIAAHPRLMGHFQELQANRNIWNNQNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEI 80 (364) Q Consensus 1 ~~ftke~~~~~~~~~~q~~~L~~~R~~~~~~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i 80 (364) .-..++....... ...+..-...+..+.. ....+...+..++ ....+.-..+|.++=.+|...+.+. T Consensus 96 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~--~~~~~~g~lvp~~~~~~ii~~~~~~--------- 162 (418) T protein:vir:10 96 PKTLGQLVTESEE-MKGMDGSARKSVRVRV-DRKSIMNVPATVG--SGVSGSNSLVVADRQAGIIAPPQRK--------- 162 (418) T ss_pred hhhhhHHhhhHHH-HHHHHHHHhhhhhhhh-HHHHHHHhhhhcc--CCCCCCccccchhHHHHHHHHHhhh--------- Confidence 0000011110000 1111110111111110 0000111011111 1112223347776544433322222 Q ss_pred HHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcccccceeeccCCccceeecCCc---cch----------hh Q lcl|NC_016762. 81 VNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYSFDHTEYNSDGDPIPVFTAGYG---VNW----------RH 147 (364) Q Consensus 81 ~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~---~~W----------R~ 147 (364) ..|+.+.+++|++..-..|.+..+....+ .+--+|+++|--+..|. +.. ++ T Consensus 163 -~~l~~~~~~~~~~~~~~~~~~~~~~~~~a---------------~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~e 226 (418) T protein:vir:10 163 -MTIRDLLMPGQTSSSSIEYTVETGFTNNA---------------AAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQ 226 (418) T ss_pred -hhHHhhcceeeccCCceeEEEEecCCCce---------------eeeccCccccccccceeeEEEeeeeEEEeehhhHH Confidence 23455566666643211222211111111 12234555554433222 111 22 Q ss_pred hhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHH Q lcl|NC_016762. 148 AAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIID 227 (364) Q Consensus 148 ~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~ 227 (364) ++ +.. .++..-=+..-.+++.+++...+|+|+++ +....||.+.........+. -+..+.++|++ T Consensus 227 ll--~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~----~~~p~Gi~~~~~~~~~~~~~--------~~~~~~~~i~~ 291 (418) T protein:vir:10 227 IL--DDA-PALQSYIDGRARYGLQLTEEGQILKGDGT----GANILGILPQASAFMPSITL--------ANATPIDKIRL 291 (418) T ss_pred HH--HhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCC----Cccccccccccccccccccc--------cccccHHHHHH Confidence 22 111 24444445556677889999999999774 23356887765443333221 01224566655 Q ss_pred HHhhhhHHHHHhccccccceEEEEcHHHHHhhhc-------ceeeecccccccccchhHHHHHhhccchhhhccccccCC Q lcl|NC_016762. 228 FFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQ-------PYMITMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSG 300 (364) Q Consensus 228 ~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~-------py~~~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg 300 (364) +.. .+.. .+ +....|++||+.+.-|.. |.+.+. .+ -.++|+ +.+| ++.+..++. T Consensus 292 ~~~-----~~~~-~~-~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~---~~-~~~~~l----~G~p----V~~~~~~p~ 352 (418) T protein:vir:10 292 ALL-----QAVL-AE-FPATGIVLNPIDWASIELTKDSQGRYIVGNP---VN-GTTPRL----WNLP----VVETQAMTA 352 (418) T ss_pred HHH-----hhcc-cc-CCCCEEEEcHHHHHHHHHhhcCCCceecccc---cc-CCCcee----ccee----eEEcCCCCC Confidence 543 2222 22 344579999999876643 222110 01 113332 3332 456778888 Q ss_pred CeEEEEEcCcce-eeheecceeeccccccc-CCCCchhhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 301 NEFLGYQRRRDV-VSPLVGMATGVIPLPRP-LPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 301 Ne~lg~v~~~dv-I~plvGmp~gt~p~pR~-~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) ++++.-.-+..+ |.-..|+.+-+-+..+. +-++-.-|++..-++..+... .+++++.--+ T Consensus 353 ~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~----~a~~~~~~~~ 414 (418) T protein:vir:10 353 NEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRP----ESFVTGALVE 414 (418) T ss_pred CcEEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecc----cceEEEEecc Confidence 887766555433 33344555533222221 112222344444455544443 3455544443 No 20 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=54.80 E-value=0.49 Score=22.26 Aligned_cols=282 Identities=8% Similarity=0.030 Sum_probs=113.6 Q ss_pred HHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEee Q lcl|NC_016762. 35 MLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSI 114 (364) Q Consensus 35 m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~Sm 114 (364) |..+..-+..- ..-.++-+.+|.+++.++= +..++.. -|+++.+.++++..-..|.+..+ ...+. T Consensus 1 m~~~~~~a~~~-~~t~~~g~~i~~~~~~~ii----~~~~~~s------~l~~~~~~~~~~~~~~~~p~~~~-~~~a~--- 65 (330) T protein:vir:77 1 MAGSTVPSTQV-ALTGDFSAFLTPEQSQDYF----AEIEKTS------IVQRIARKVPMGPTGISIPHWTG-AVSAS--- 65 (330) T ss_pred Ccccccchhhc-cccCCCcceechhHHHHHH----HHHHhcc------chhhhcceeeccCCceEEEEEcC-Cccee--- Confidence 43331111100 1112334448888775543 3322222 15556666776543333444322 11121 Q ss_pred CCCCCcccccceeeccCCccceeecCC---ccchhhhhhc--------CccccchhhhhHHHHHHHHHHHHHhhhhccCC Q lcl|NC_016762. 115 DGQAPYSFDHTEYNSDGDPIPVFTAGY---GVNWRHAAGM--------STVGIDLVLDSQAAKLRKFNKRIVAYTLDGAT 183 (364) Q Consensus 115 dGq~~~~~D~~~Y~~dGtPiPIf~sgy---~~~WR~~~~~--------~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~ 183 (364) +--+|.++|--+..| .++.+...+. +-..+++..--...-.+.+.++++..+|+|++ T Consensus 66 ------------~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g 133 (330) T protein:vir:77 66 ------------WTGEAERKPITKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGID 133 (330) T ss_pred ------------EecCCCccccccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccC Confidence 122444444333222 1222222111 12234444444455566778888899999987 Q ss_pred ceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhc-- Q lcl|NC_016762. 184 NIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQ-- 261 (364) Q Consensus 184 ~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~-- 261 (364) + +....||-+.........+. ..++. ++...++++.+. .+...+.+.+. ....|+++|..+.-|.. T Consensus 134 ~----~~~~~g~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~l~-~~~~~~~~~~~--~~~~~vmn~~~~~~l~~lk 201 (330) T protein:vir:77 134 K----PSAFKGYLAETTKVVSLADT----NLTTA-SGPQGNAYLAVN-NALSLLVNSGK--KWTGTLLDNVTEPILNTAV 201 (330) T ss_pred C----CCccccccccccccceeecc----ccccc-ccccchhHHHHH-HHHHhhhhcCC--CccEEEEcHHHHHHHHHHh Confidence 4 56667777776544333321 11121 122222222223 23444444432 34478999999876653 Q ss_pred -----ceeeecccc--cccccchhHHHHHhhccchhhhccccccCC----Ce--EEEEEcCcceeeheecceeecccccc Q lcl|NC_016762. 262 -----PYMITMGGG--ANAVVAGTVLDAVMRFIPAREVRQTFALSG----NE--FLGYQRRRDVVSPLVGMATGVIPLPR 328 (364) Q Consensus 262 -----py~~~~~~~--S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg----Ne--~lg~v~~~dvI~plvGmp~gt~p~pR 328 (364) |.+...... ..+...+| |+.+| |..+..++. |. ++....+.-+|....|+.+-.-...- T Consensus 202 d~~G~~l~~~~~~~~~~~~~~~~~----l~G~P----V~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~ 273 (330) T protein:vir:77 202 DGNGRPLFVESTYTEQVGAIREGR----ILGRP----TYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQAT 273 (330) T ss_pred ccCCceeecCccccccccccCCce----eccee----eEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecce Confidence 333210000 00111222 22222 223333332 22 22224444444444455442211110 Q ss_pred -----------------cCCCCchhhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 329 -----------------PLPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 329 -----------------~~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) .+.++-.-||+..-++.++.++ +.-+.+--.+-=+ T Consensus 274 ~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~-~a~~~i~~~~~~~ 325 (330) T protein:vir:77 274 LDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDK-DAFVKLTDQVAGT 325 (330) T ss_pred eeecccccccccccccchhhcCcEEEEEEEEeccEEecc-cceEEEEeccCCc Confidence 0122223344444445444433 1111111111111 No 21 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=54.72 E-value=0.5 Score=22.25 Aligned_cols=247 Identities=15% Similarity=0.096 Sum_probs=111.6 Q ss_pred cchhhhhhhhhccccHHHHHHH-HHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcc Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEI-DAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS 121 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~-D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~ 121 (364) |+.+++-.+.+ |=++.|..+ ..+.......---..+.++|-+ + .|+||+.- .-+++ |++..=-+| .+ T Consensus 1 m~~~~T~l~d~--i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g--~---~G~tv~iP-~~~~i-g~a~~~~~g---~~ 68 (274) T protein:vir:96 1 MAQGMTKLTNQ--IVPEVLAPMMQAELEKKLRFASFAEIDNTLVG--Q---PGDTLTFP-AFIYS-GDAKVVAEG---EK 68 (274) T ss_pred CCcceeehhhe--echHHHHHHHHHHHHhhhhccccceecccccC--C---CCCEEEee-eecCC-CccccccCC---Cc Confidence 54443332222 335677644 2222222111111222334432 1 27777542 11232 223321122 23 Q ss_pred cccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCcc Q lcl|NC_016762. 122 FDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNT 201 (364) Q Consensus 122 ~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~ 201 (364) ++-..-.+..+.+.|...|++|.+-+...++. +-|++.......-+.+.+++...++.=-.+ .. T Consensus 69 i~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~-------------a~-- 132 (274) T protein:vir:96 69 IPTDILETKKREAKIRKIAKGTSISDEALLSG-YGDPQGEQVRQHGLAHANKVDDDVLEALKS-------------AK-- 132 (274) T ss_pred cchhhcccceeEEEeeeeecceeehHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHHHHHhc-------------cc-- Confidence 44445566677888988899998888766664 446666666666666667766666632111 00 Q ss_pred ccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc----eeeecccccccccch Q lcl|NC_016762. 202 IKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP----YMITMGGGANAVVAG 277 (364) Q Consensus 202 ~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p----y~~~~~~~S~~~~~g 277 (364) ..++-.+.+.+.++++.- .|+. .+ ...-.+.|+|++...|.+- |......+.+...+| T Consensus 133 -----------~~~~~~~~~~d~i~~A~~--~lgd---~~--~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:96 133 -----------LTVEADITKLTGLQTAID--KFND---ED--LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKG 194 (274) T ss_pred -----------ccccccccCHHHHHHHHH--Hhcc---cc--ccccEEEeCHHHHHHHHhhccccccccccccccceecc Confidence 011112335666665543 3442 22 2566899999999998773 211011111111122 Q ss_pred hHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhccccccee- Q lcl|NC_016762. 278 TVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSG- 356 (364) Q Consensus 278 TIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sg- 356 (364) .|-++.|+. |.++..++.+..+.+-+ +-+.-..++++. -+. .+|...++- T Consensus 195 ----~ig~~~G~~-Vi~s~~~~~~t~~l~~~--gA~~~~~~~~~~--------vE~--------------~Rd~~~~~d~ 245 (274) T protein:vir:96 195 ----AFGEALGAV-IVRSNKLEAGTAILAKK--GAVKLITKRDFF--------LET--------------DRDPSTKTTA 245 (274) T ss_pred ----ccceecCeE-EEEeCCCCCceEEEEec--cceeeeecCCcc--------ccc--------------ccccccccCE Confidence 123344543 56667776555433321 111111111111 111 112222222 Q ss_pred ----eEeeccCC Q lcl|NC_016762. 357 ----VIYGANLA 364 (364) Q Consensus 357 ----Vv~~~~l~ 364 (364) .+||..+- T Consensus 246 i~~~~~y~~~~~ 257 (274) T protein:vir:96 246 LYSDKHYVAYLY 257 (274) T ss_pred EEEeEEEEEEEE Confidence 23444444 No 22 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=54.72 E-value=0.5 Score=22.25 Aligned_cols=247 Identities=15% Similarity=0.096 Sum_probs=111.6 Q ss_pred cchhhhhhhhhccccHHHHHHH-HHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcc Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEI-DAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS 121 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~-D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~ 121 (364) |+.+++-.+.+ |=++.|..+ ..+.......---..+.++|-+ + .|+||+.- .-+++ |++..=-+| .+ T Consensus 1 m~~~~T~l~d~--i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g--~---~G~tv~iP-~~~~i-g~a~~~~~g---~~ 68 (274) T protein:vir:95 1 MAQGMTKLTNQ--IVPEVLAPMMQAELEKKLRFASFAEIDNTLVG--Q---PGDTLTFP-AFIYS-GDAKVVAEG---EK 68 (274) T ss_pred CCcceeehhhe--echHHHHHHHHHHHHhhhhccccceecccccC--C---CCCEEEee-eecCC-CccccccCC---Cc Confidence 54443332222 335677644 2222222111111222334432 1 27777542 11232 223321122 23 Q ss_pred cccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCcc Q lcl|NC_016762. 122 FDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNT 201 (364) Q Consensus 122 ~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~ 201 (364) ++-..-.+..+.+.|...|++|.+-+...++. +-|++.......-+.+.+++...++.=-.+ .. T Consensus 69 i~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~-------------a~-- 132 (274) T protein:vir:95 69 IPTDILETKKREAKIRKIAKGTSISDEALLSG-YGDPQGEQVRQHGLAHANKVDDDVLEALKS-------------AK-- 132 (274) T ss_pred cchhhcccceeEEEeeeeecceeehHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHHHHHhc-------------cc-- Confidence 44445566677888988899998888766664 446666666666666667766666632111 00 Q ss_pred ccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc----eeeecccccccccch Q lcl|NC_016762. 202 IKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP----YMITMGGGANAVVAG 277 (364) Q Consensus 202 ~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p----y~~~~~~~S~~~~~g 277 (364) ..++-.+.+.+.++++.- .|+. .+ ...-.+.|+|++...|.+- |......+.+...+| T Consensus 133 -----------~~~~~~~~~~d~i~~A~~--~lgd---~~--~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:95 133 -----------LTVEADITKLTGLQTAID--KFND---ED--LEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKG 194 (274) T ss_pred -----------ccccccccCHHHHHHHHH--Hhcc---cc--ccccEEEeCHHHHHHHHhhccccccccccccccceecc Confidence 011112335666665543 3442 22 2566899999999998773 211011111111122 Q ss_pred hHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhccccccee- Q lcl|NC_016762. 278 TVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSG- 356 (364) Q Consensus 278 TIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sg- 356 (364) .|-++.|+. |.++..++.+..+.+-+ +-+.-..++++. -+. .+|...++- T Consensus 195 ----~ig~~~G~~-Vi~s~~~~~~t~~l~~~--gA~~~~~~~~~~--------vE~--------------~Rd~~~~~d~ 245 (274) T protein:vir:95 195 ----AFGEALGAV-IVRSNKLEAGTAILAKK--GAVKLITKRDFF--------LET--------------DRDPSTKTTA 245 (274) T ss_pred ----ccceecCeE-EEEeCCCCCceEEEEec--cceeeeecCCcc--------ccc--------------ccccccccCE Confidence 123344543 56667776555433321 111111111111 111 112222222 Q ss_pred ----eEeeccCC Q lcl|NC_016762. 357 ----VIYGANLA 364 (364) Q Consensus 357 ----Vv~~~~l~ 364 (364) .+||..+- T Consensus 246 i~~~~~y~~~~~ 257 (274) T protein:vir:95 246 LYSDKHYVAYLY 257 (274) T ss_pred EEEeEEEEEEEE Confidence 23444444 No 23 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=54.22 E-value=0.35 Score=23.07 Aligned_cols=201 Identities=19% Similarity=0.211 Sum_probs=100.6 Q ss_pred hhccchhHHHHHHHhhccCCCeEEEeeCCCCCcccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHH Q lcl|NC_016762. 88 QTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKL 167 (364) Q Consensus 88 ~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~ 167 (364) ---++.|+|++.-.-+|| +-++ ..| .+++-..-++.-+-.=|...|.+|.+.+..-++.-| |.+.+.. T Consensus 1 ~~~~~~Gdtit~P~~iGd-a~~v---~eG---~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~g-----Dp~~ea~ 68 (231) T protein:vir:73 1 ENGINLANLCEYPNDIGD-AADV---AEG---GEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYG-----DPIGESN 68 (231) T ss_pred CccccCCceEEecccccc-hhhh---cCC---CcCChhhccccceeeeEeeeccceeeeHHHHhhccC-----chHHHHH Confidence 456788888887554666 4322 233 234444455666667788999999999988776433 6666666 Q ss_pred HHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccce Q lcl|NC_016762. 168 RKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYD 247 (364) Q Consensus 168 rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~ 247 (364) |.+...+.+.+=+- |- .... + -+|.. . +..+.+.|.++.- .|+. ...... T Consensus 69 ~Q~~~~iA~kvD~d---i~---~~~~-------~--a~l~~-----~---~~~t~d~i~~A~~--~fgd-----e~~~~~ 118 (231) T protein:vir:73 69 KQLGLSLANKVDDD---LL---KAAK-------T--TSQTV-----S---TKANVDGVQAALD--IFND-----EDAQAY 118 (231) T ss_pred HHHHHHHHHhhhHH---HH---Hhhc-------c--ccccc-----c---ccccHHHHHHHHH--Hhcc-----ccccce Confidence 66666666553221 00 0000 0 01110 1 1225666766554 3543 224566 Q ss_pred EEEEcHHHHHhhhcceeee----cccccccccchhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeec Q lcl|NC_016762. 248 VLWVSPEINANLAQPYMIT----MGGGANAVVAGTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGV 323 (364) Q Consensus 248 ~lyvS~eI~~N~~~py~~~----~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt 323 (364) .++|+|.....|.+ +--. ...+.+-...|+| -++.|+ .|..|.+++.+..+... T Consensus 119 vivv~p~~~~~Lrk-~~~~~~~~~~~g~~i~~~G~i----G~i~G~-~Vi~S~~~~~~~~~~~~---------------- 176 (231) T protein:vir:73 119 VLIVNPKDAAKIRK-DANAKNIGSEVGANALINGTY----ADVLGA-QIVRSKKLAEGSALMFK---------------- 176 (231) T ss_pred EEEEcchHHHhhhh-ccchhhhhhhhccceeeeccc----ceEcce-EEEEcCCCCCCceeeee---------------- Confidence 79999988777754 2100 0001111222222 233333 34444444443332110 Q ss_pred ccccccCCCCchhhhhhhhcchhhhccc--------cccee-----eEeeccCC Q lcl|NC_016762. 324 IPLPRPLPQVNYNFQIMSAMGIQVKKDD--------EGLSG-----VIYGANLA 364 (364) Q Consensus 324 ~p~pR~~p~~nY~f~v~~A~glqiK~D~--------~G~sg-----Vv~~~~l~ 364 (364) |.+ .=+|+++..|+|. .-++- ..||..|. T Consensus 177 -----------~i~-~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~ 218 (231) T protein:vir:73 177 -----------IVS-NSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLY 218 (231) T ss_pred -----------EEe-eccceeeeecccceeeccccccccccEEEEeEEEEEEEE Confidence 111 1366776666553 33333 34555555 No 24 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=50.17 E-value=0.62 Score=21.73 Aligned_cols=278 Identities=17% Similarity=0.182 Sum_probs=112.7 Q ss_pred cchhhhhhhhhcc----------ccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccch--hHHHHHHHhhccCCCeE Q lcl|NC_016762. 43 MTPGMLACNALAG----------LGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPI--GKSAKLYNVVGDIADDV 110 (364) Q Consensus 43 ~t~~~~a~Na~a~----------lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~i--gktv~~y~~~gd~a~~v 110 (364) |..+ |-+.+ +=.+.|.. .+++.|++.. ++-+|.. ..+.-+ |+||+.- +.|+. . + T Consensus 1 ~~~~----~~~~~~~~~t~~v~~fipei~s~---~i~~~l~~~~---v~~~~~~-d~~~~~~~Gdtv~ip-~~g~~-~-~ 66 (341) T protein:vir:94 1 MALG----NTITGPSINTQRGQQFIPEQWLS---EVQMFRKAKM---LDTSVVK-TWGAQVKKGDTFHVP-RISEL-G-V 66 (341) T ss_pred Ccch----hhhccccccchhHHHHHHHHHHH---HHHHHHHhhc---chhhccc-cccccccCCceEEEe-ccCcc-e-e Confidence 4444 44444 22356654 4455544432 2222321 111111 4555443 22331 1 1 Q ss_pred EEeeCCCCCcccccceeeccCCccce-eecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecC Q lcl|NC_016762. 111 SVSIDGQAPYSFDHTEYNSDGDPIPV-FTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVEN 189 (364) Q Consensus 111 ~~SmdGq~~~~~D~~~Y~~dGtPiPI-f~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~ 189 (364) . ..... ..++-......-.+|.| -+..+.+.+-+..-.+. .+|++..-...+.+.+++++.++++.-.... T Consensus 67 ~-d~~~~--~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~-~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~---- 138 (341) T protein:vir:94 67 E-DKATD--VPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQA-SYDLRAPYLEAMGYALAKDMTGSILGLRAAV---- 138 (341) T ss_pred e-eecCC--CccccccccCceEEEEEeeeeecceeechHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHhhhc---- Confidence 1 01111 11111122222223333 23345566665444433 5688888888999999999999887542211 Q ss_pred eeeeeecCCCccccccccccCCCcceeecccCHHHH-HHHHhhhhHHHHHhccccc-cceEEEEcHHHHHhhhcc--eee Q lcl|NC_016762. 190 YPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQI-IDFFTKGAFGQAARTNKVD-AYDVLWVSPEINANLAQP--YMI 265 (364) Q Consensus 190 ~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i-~~~f~~~~f~~~~~~N~v~-~~~~lyvS~eI~~N~~~p--y~~ 265 (364) +... .+.+ + ...+...+ .+.+.+ .+.++ .+. +.++.++|- ..-.++|+|+....|..- |.- T Consensus 139 -~~~~---~~~~-----~---~~~~~~~t-~~~~~~~~~~i~-~a~-~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~ 203 (341) T protein:vir:94 139 -QNTA---SQNV-----F---SSSNGAIT-GNGQAFSFAVFL-AAR-RLLLEADVPEEKIVLLISPGQESALFTIPQFIS 203 (341) T ss_pred -cccc---cCcc-----c---cCcccccc-CchhhhhHHHHH-HHH-HHHhhcCCCccCCEEEeCHHHHHHHhhchhhhh Confidence 1110 0110 1 11122222 222222 23333 223 334555553 456799999999988642 211 Q ss_pred ecccccccccchhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheeccee--ecccccccCCCCchhh-hhh-- Q lcl|NC_016762. 266 TMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMAT--GVIPLPRPLPQVNYNF-QIM-- 340 (364) Q Consensus 266 ~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~--gt~p~pR~~p~~nY~f-~v~-- 340 (364) .-..++.....|. |.++.|+ .|.++..|+.+...+++....-..|.-..|. ++.+.++ +..+|.- +-+ T Consensus 204 ~~~~g~~~l~~G~----ig~i~G~-~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~gl~~ 276 (341) T protein:vir:94 204 KDFINNAPIAQGQ----IGSLMGV-RVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPK--QDSFTSLPATFTG 276 (341) T ss_pred hhccccchhheee----eeeEece-EEEEeccccccccccccccccceecccccccccccccccc--cccccccEEEEEE Confidence 0001112233332 2233444 3666777877776666655544443322221 1111111 1111100 000 Q ss_pred ----------------hhcchh---hhcccc----cc---eeeEeeccCC Q lcl|NC_016762. 341 ----------------SAMGIQ---VKKDDE----GL---SGVIYGANLA 364 (364) Q Consensus 341 ----------------~A~glq---iK~D~~----G~---sgVv~~~~l~ 364 (364) .+-.+| .-.+.+ +. ..++||+++- T Consensus 277 ~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~l 326 (341) T protein:vir:94 277 NSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLY 326 (341) T ss_pred ecccccceeeecchhhhccccccccccccchhhhhhhhhhhhhhhccccc Confidence 000111 001111 01 2346777665 No 25 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=44.71 E-value=0.8 Score=21.13 Aligned_cols=258 Identities=16% Similarity=0.118 Sum_probs=116.1 Q ss_pred cchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhcc--chhHHHHHHHhhccCCCeEEEeeCCCCCc Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVL--PIGKSAKLYNVVGDIADDVSVSIDGQAPY 120 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v--~igktv~~y~~~gd~a~~v~~SmdGq~~~ 120 (364) |+.+.+-. ...|=.+.|..+ +++-+++. -.+-.+......+ ..|++|+. +..+.+ |++..=-+| . T Consensus 1 Ma~~~T~~--~~~iiPev~s~~---v~~~~~~~---~v~~~~~~~~~~l~g~~G~tv~i-p~~~~~-g~a~~~~~g---~ 67 (278) T protein:vir:80 1 MADLTTKL--ANLIDPEVMGPM---ISAKLPKA---IKFGKIAPIDNSLEGQPGSEITV-PKYKYI-GDAQDVAEG---A 67 (278) T ss_pred CCCcceeh--hheecHHHHHHH---HHHHHHHh---hhhcccceecccccCCCCCEEEE-eeeccC-CcceeecCC---C Confidence 43332221 112335666543 22222111 1111221111111 13666643 222232 223211112 1 Q ss_pred ccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCc Q lcl|NC_016762. 121 SFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRN 200 (364) Q Consensus 121 ~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n 200 (364) +++-..-+++-..+.|...++.|..-+...++ .+.|++.......-+.+.+++...+++.-... . T Consensus 68 ~i~~~~lt~~~~~~~i~~~~~a~~v~D~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a----------~---- 132 (278) T protein:vir:80 68 AIDYSALETESVKHGIKKAGKGVKLTDESVLS-GYGDPVEEAQKQIRMAIASKVDNDILEEALTT----------T---- 132 (278) T ss_pred cCcccccccceeeEeeehhhccccccHHHHhh-ccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc----------c---- Confidence 23333445666678888889988888866555 47788777777777788888888887643210 0 Q ss_pred cccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc---eeeeccccccc-ccc Q lcl|NC_016762. 201 TIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP---YMITMGGGANA-VVA 276 (364) Q Consensus 201 ~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p---y~~~~~~~S~~-~~~ 276 (364) .+. +++ .+..+.++..+.|.. +.. +....++.....+.|+|++...|.+- -+.....+.++ ... T Consensus 133 ---~~~----~~~---~t~~~~~~~~~~~~d-a~~-~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~ 200 (278) T protein:vir:80 133 ---LEV----KGA---INIGLIDKIENTFTD-APD-AIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVK 200 (278) T ss_pred ---ccc----ccc---cccchhhhHHHHHHH-HHH-hhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceee Confidence 000 011 112234445555542 233 33444555556799999999888532 11211121122 111 Q ss_pred hhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhccccccee Q lcl|NC_016762. 277 GTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSG 356 (364) Q Consensus 277 gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sg 356 (364) | .|-++.|+ .|..+..++.+..+.+-+. -+.-..++++.+... |+-- .+.=.| -++ T Consensus 201 G----~ig~~~G~-~Vi~s~~~p~~t~~l~~~g--Ai~~~~~~~~~vE~~-Rd~~----------~~~d~i----~~~-- 256 (278) T protein:vir:80 201 G----AFGELLGW-EIVRTKKLADGNALAVKAG--ALKTFLKRNLLAESG-RDMD----------HKLTKF----NAD-- 256 (278) T ss_pred c----cceeecce-eEEEcCCCCcceEEEEecc--ceeeeecCCcccccc-cchh----------hcccee----eee-- Confidence 2 23344555 5677778887766555433 233333443322222 1100 000011 111 Q ss_pred eEeeccCC Q lcl|NC_016762. 357 VIYGANLA 364 (364) Q Consensus 357 Vv~~~~l~ 364 (364) .+||..+- T Consensus 257 ~~yg~~v~ 264 (278) T protein:vir:80 257 QHYAVALV 264 (278) T ss_pred eEEEEEEE Confidence 23555554 No 26 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=42.76 E-value=0.87 Score=20.91 Aligned_cols=292 Identities=14% Similarity=0.105 Sum_probs=102.9 Q ss_pred HHHHHHHHHHHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHH-HHHH Q lcl|NC_016762. 22 QANRNIWNNQNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKS-AKLY 100 (364) Q Consensus 22 ~~~R~~~~~~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igkt-v~~y 100 (364) ++.. ..+.++.......+-...+-.+.||..+..+| ++..+++. -|+.+.+.+|++.- .++. T Consensus 1 ~a~l-------~el~~~~~~~~~~g~~~~~~~~liP~~~~~~i----i~~l~~~s------~l~~~~~~~~~~~~~~~~p 63 (333) T protein:vir:78 1 MATL-------NELLPNSAGSNHQGRLAHVPSDLLPKEIVGPI----FDKAQESS------LVLRMGEQIPISYGETIIP 63 (333) T ss_pred Cchh-------HHhhhhcccccccCceecCCccccchhHHHHH----HHHHHhhc------hhhhhcceeeccCCceEEE Confidence 1110 01111111122222111223335888776555 33322222 14556666666532 2222 Q ss_pred HhhccCCCeEEEeeCCCCCcccccceeeccCCccceeecCCc---cchhhhhhcCc--------cccchhhhhHHHHHHH Q lcl|NC_016762. 101 NVVGDIADDVSVSIDGQAPYSFDHTEYNSDGDPIPVFTAGYG---VNWRHAAGMST--------VGIDLVLDSQAAKLRK 169 (364) Q Consensus 101 ~~~gd~a~~v~~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~---~~WR~~~~~~t--------~g~D~~~D~q~~~~rk 169 (364) ..+++ ..+..-=.|+ ....-+|+-+|--+..|+ +......+.-+ ..+++..-=+..-.++ T Consensus 64 ~~~~~--~~a~~v~eg~-------~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~a 134 (333) T protein:vir:78 64 TTVKR--PEVGQVGVGT-------SNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYA 134 (333) T ss_pred EEeCC--ceeEeecCcc-------cccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHH Confidence 22222 1121111122 222333444443333331 22222222221 2233334444455677 Q ss_pred HHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEE Q lcl|NC_016762. 170 FNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVL 249 (364) Q Consensus 170 v~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~l 249 (364) +.+++++-+|+|+++-. +....||.+.....+.+-.. .-.+-...+.++|++... . ...|+-+....| T Consensus 135 i~~~~d~~~l~G~g~~~--~~~~~g~~~~~~~~~~~~~~----~~~~~~~~~~~~i~~~~~-----~-~~~~~~~~~~~~ 202 (333) T protein:vir:78 135 IGRGIDLAVFHGKSPLT--GSALQGIDTDNVIANTTNVD----YLQETGDPLLDRLLDGYD-----L-VSANTDVEFNGW 202 (333) T ss_pred HHHHHHHHHhcccCCCC--Cccccccccccccccccccc----ccccccchhHHHHHHHHH-----h-hccccccCceEE Confidence 78888899999987532 22234454433222221100 000111123344433322 2 234444555578 Q ss_pred EEcHHHHHhhhcc----------eeeecccccccccchhHHHHHhhccchhhhccccccCCC---------eEEEEEcCc Q lcl|NC_016762. 250 WVSPEINANLAQP----------YMITMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSGN---------EFLGYQRRR 310 (364) Q Consensus 250 yvS~eI~~N~~~p----------y~~~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgN---------e~lg~v~~~ 310 (364) +++|..+.-|.+- .+. ....+-.++||+ .+| +..+..++.| .++....+. T Consensus 203 vmn~~~~~~L~~~~~~~d~~G~~i~~---~~~~~~~~~~l~----G~P----v~~~~~i~~~~~~~~~~~~~~~~gD~~~ 271 (333) T protein:vir:78 203 AVDPRFRAHLLRAQAYRDANGNVDPS---RINLAAQTGDVL----GLP----AQFGRAVGGDLGAAVDSKTRIIGGDFSQ 271 (333) T ss_pred EEcchHHHHHHHHhhhcCCCCceeec---CccccCCCceee----cee----eEEccccCCCccccCCCccEEEEEeccc Confidence 8899887655432 111 000011122322 221 2333344333 233334444 Q ss_pred ceeeheecceeeccccc----------ccCCCCchhhhhhhhcchhhhcccccceeeEeeccC Q lcl|NC_016762. 311 DVVSPLVGMATGVIPLP----------RPLPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANL 363 (364) Q Consensus 311 dvI~plvGmp~gt~p~p----------R~~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l 363 (364) -+|-..-|+.+-+-+.. +.+.++---||...-++..|+.. +.-..+..++-- T Consensus 272 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~-~a~~~l~~~~a~ 333 (333) T protein:vir:78 272 LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDK-QAFVKFVDDEQP 333 (333) T ss_pred EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecc-cceEEEeccCCC Confidence 33333334433222111 01111111122222222222111 111111111100 No 27 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=31.55 E-value=1.5 Score=19.63 Aligned_cols=271 Identities=12% Similarity=0.071 Sum_probs=122.1 Q ss_pred HHHHHHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccC Q lcl|NC_016762. 27 IWNNQNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDI 106 (364) Q Consensus 27 ~~~~~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~ 106 (364) +.+ -+ .-+++ +.-+.++-..| +=++|..++..-|+. -.+|-+|+.+.+ +.=|||++...+ |. T Consensus 1 ms~---~~--~~tr~----~~~~s~~d~al---~le~f~geV~~af~~---~s~~~~~~~~rt-i~~g~s~~~~~i-G~- 62 (335) T protein:vir:63 1 MSF---LN--DLTRP----NYAGKNADVDI---HLEEHLGIVDKHFAY---TSKFAPLMNIRD-LRGSNVVRLDRL-GN- 62 (335) T ss_pred CCC---cc--cchhh----hcccccchhhe---ehhhhhhhHHHHHHh---hhhhccccceee-eccceeEEEeee-ee- Confidence 000 00 00011 11011111111 225566666665554 356668887754 444888765544 65 Q ss_pred CCeEE-----EeeCCCCCcccccceeeccCCccceeecCCccchhhhhhcC---ccccchhhhhHHHHHHHHHHHHHhhh Q lcl|NC_016762. 107 ADDVS-----VSIDGQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMS---TVGIDLVLDSQAAKLRKFNKRIVAYT 178 (364) Q Consensus 107 a~~v~-----~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~---t~g~D~~~D~q~~~~rkv~~k~~dy~ 178 (364) -++. .+|+||.+.. |+..-.-|+.- .+|++.--. ..-+|+++--..+.=+.+.+.+-+.+ T Consensus 63 -~~~~~~~pG~~l~~~~~~~-~k~~itVD~ll----------~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~ 130 (335) T protein:vir:63 63 -VEAKGRRAGEELERSRVVN-DKWNLTVDTLL----------YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQAC 130 (335) T ss_pred -eeeecccCCcCcCCCCccc-cceEEEeccee----------echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHH Confidence 2244 5677776544 55555555532 233333222 44568887777788888888888876 Q ss_pred h----ccCCceeecCeeeeeecC--CCccccccccccCCCcceeecc----cCHHHHHHHHhhhhHHHHHhcccc---cc Q lcl|NC_016762. 179 L----DGATNIQVENYPAQGLRN--HRNTIKVNLGSGAGGANIDLTT----ATPQQIIDFFTKGAFGQAARTNKV---DA 245 (364) Q Consensus 179 l----nG~~~I~v~~~t~~GLrn--h~n~~~~~lg~~~gGaNidltt----A~~~~i~~~f~~~~f~~~~~~N~v---~~ 245 (364) + .+.+. ..-.++++ |+.. +-+++++. .++++|+++|. .++.++-+.+-- +. T Consensus 131 ~~~i~~aa~~-----~a~~~~~~~~~~G~----------~~~~~~tg~~~~~~~~~l~~a~~-~a~~~L~e~dVP~~~~~ 194 (335) T protein:vir:63 131 LIQVIKAAAM-----DAPVDLEDAFSPGV----------LEKLDLTGLTAKQAADKIVRMHR-RVVETFIDRDLGDAVYS 194 (335) T ss_pred HHHHHhhccc-----cCccccCCCcCCCc----------ceeeeeccCcccccHHHHHHHHH-HHHHHHHhccCCCcccC Confidence 5 22111 01111111 1111 12233321 36888887776 355555544432 23 Q ss_pred ceEEEEcHHHHHhhhcc-eeeec---ccccccccchhHHHHHhhccchhhhccccccCCCeE------------------ Q lcl|NC_016762. 246 YDVLWVSPEINANLAQP-YMITM---GGGANAVVAGTVLDAVMRFIPAREVRQTFALSGNEF------------------ 303 (364) Q Consensus 246 ~~~lyvS~eI~~N~~~p-y~~~~---~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~------------------ 303 (364) .-..+|+|+....|..- -+++. ++++. .+++--+|..+.||. |.++..|+..-. T Consensus 195 dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~---~~~~~g~v~~v~Gv~-V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~ 270 (335) T protein:vir:63 195 EGLTPMSPRVFSLLLEHDKLMNVEYQATGAT---NDYVKSRVAILNGVK-VLETPRFATKAIAAHPLGRHFNVSAEESER 270 (335) T ss_pred ceEEEeChHHHHHHhcccccccccccccccc---ccccCceeEEeeceE-EEeeccCCCCCcccccccccCCccccccce Confidence 47899999999998764 22221 12221 123334567777776 777777764321 Q ss_pred -EEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 304 -LGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 304 -lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) .++.-.++.+--..-|++ .++.++.... |. | .| .+...||+++- T Consensus 271 ~~~~~~~~~Al~t~~~~~v----t~e~~~~~~~-~~-~-----~i------~~~~a~G~g~l 315 (335) T protein:vir:63 271 QIALFLPSKTLITAQVAPV----QAKLWEDNEK-FS-W-----VL------DTFQMYNIGAR 315 (335) T ss_pred eEEEEEecceEEEEEEeec----ccceeeccch-hh-H-----Hh------HHHHHcCCccc Confidence 111111111111111111 1111111111 00 0 00 12334555554 No 28 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=30.45 E-value=1.6 Score=19.50 Aligned_cols=297 Identities=9% Similarity=0.014 Sum_probs=86.4 Q ss_pred Cccchhh--hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ch----------------------------- Q lcl|NC_016762. 1 MFLTQQA--IAAHPRLMGHFQELQANRNIWNNQNAAMLAEHRGAM----TP----------------------------- 45 (364) Q Consensus 1 ~~ftke~--~~~~~~~~~q~~~L~~~R~~~~~~~~~m~a~~~~~~----t~----------------------------- 45 (364) .....++ ...-...+..+.++..+++.-..+.....+...... .. T Consensus 30 ~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (408) T protein:vir:74 30 MALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLN 109 (408) T ss_pred HHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhcchhhhh Confidence 1110000 001112233333333333322222111111111000 00 Q ss_pred --h---hhhhhhh---ccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEE-eeCC Q lcl|NC_016762. 46 --G---MLACNAL---AGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSV-SIDG 116 (364) Q Consensus 46 --~---~~a~Na~---a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~-SmdG 116 (364) + +...+.. ..||.++-.+|-..+. ..-+|-+.+...++++. +++..+ ..++ T Consensus 110 ~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~-------------------~~~~l~~~~~~~~~~~~-~~~~~~~~~~~ 169 (408) T protein:vir:74 110 TVSSKTETSGSDSAAGLTIPQDIRTMINTLVR-------------------QYDSLQQYVRVESVSTS-SGSRVYEKWTD 169 (408) T ss_pred hhhhhhhcccccCCCceeechhHhhHHHHHHh-------------------hhcchhhhcceeeccCC-cceEEEEeecC Confidence 0 0000000 0144444333222222 22223333333333333 222221 1111 Q ss_pred CCCcccccceeeccCCccce-------------eecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCC Q lcl|NC_016762. 117 QAPYSFDHTEYNSDGDPIPV-------------FTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGAT 183 (364) Q Consensus 117 q~~~~~D~~~Y~~dGtPiPI-------------f~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~ 183 (364) ..+ ...+--+|+.+|= ++-+-.+.+=+ ..++-..+|+..--...-.+++.+++...+|+|++ T Consensus 170 ~~~----~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~-ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G 244 (408) T protein:vir:74 170 VTP----LKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATN-TLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMG 244 (408) T ss_pred Ccc----cccccccccccccccccceeeEEeeeeeEEeeehhHH-HHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 111 0112223333331 11111111111 11223455555666666778888999999999976 Q ss_pred ceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcce Q lcl|NC_016762. 184 NIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQPY 263 (364) Q Consensus 184 ~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~py 263 (364) +-+- .+ +..+.++|+..+.. ....++ .....|+.+|..+.-|..-= T Consensus 245 ~~~~------------------~~----------~~~~~~~i~~~~~~-----~l~~~~-~~~a~~v~n~~~~~~l~~lk 290 (408) T protein:vir:74 245 TVPK------------------KP----------TIANFDDVITMINT-----SVDPAI-IATSSLLTNQSGLNKLALVK 290 (408) T ss_pred cccc------------------cc----------ccccHHHHHHHHHH-----hhhhhh-cCCCEEEEcHHHHHHHHHhh Confidence 4210 11 12345666654421 112222 22346888998877665310 Q ss_pred eeecccccccc----cchhHHHHHhhccch-h--hhccccccCCCe--EEEEEcCcc-eeeheecceeecccccc-cCCC Q lcl|NC_016762. 264 MITMGGGANAV----VAGTVLDAVMRFIPA-R--EVRQTFALSGNE--FLGYQRRRD-VVSPLVGMATGVIPLPR-PLPQ 332 (364) Q Consensus 264 ~~~~~~~S~~~----~~gTIl~~i~~~~~V-~--~I~~~~~LtgNe--~lg~v~~~d-vI~plvGmp~gt~p~pR-~~p~ 332 (364) + +.|..-+ ..++- ..|+.+|=+ . ...|+ ...+. ++....+.. ++-...|+.+-+-+..- .+-+ T Consensus 291 --d-~~G~~l~~~~~~~~~~-~~l~G~pV~~~~~~~~~~--~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~ 364 (408) T protein:vir:74 291 --T-AEGKYLLEPDPTKPNS-YLIKGKQVIVVADRWLPN--SGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFET 364 (408) T ss_pred --c-CCCceEeccCcCCCCC-ceecceeeEEecCccccc--ccCCcceEEEEehhccEEEEEecceEEEEeccccchhhc Confidence 0 0111111 01100 011222100 0 00111 11111 111122221 12223333332211100 0001 Q ss_pred CchhhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 333 VNYNFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 333 ~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) +-..|++..-++..+... +.-+.+=+ ...+ T Consensus 365 ~~~~~r~~~r~d~~~~~~-~a~~~~~~-~~~~ 394 (408) T protein:vir:74 365 DTTKIRVIDRFDVKATDS-EALVAGSF-TAIA 394 (408) T ss_pred ceeeEEEEEeeCcEEecc-cceEEEEe-eccc Confidence 111122222222222111 00000000 0001 No 29 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=28.26 E-value=1.8 Score=19.23 Aligned_cols=316 Identities=12% Similarity=0.033 Sum_probs=117.2 Q ss_pred CccchhhhhhhHHHHHHH-HHHHH-HHHHHHHHH------------------HHHHHHhhcccchhhhhhhhhccccHHH Q lcl|NC_016762. 1 MFLTQQAIAAHPRLMGHF-QELQA-NRNIWNNQN------------------AAMLAEHRGAMTPGMLACNALAGLGREF 60 (364) Q Consensus 1 ~~ftke~~~~~~~~~~q~-~~L~~-~R~~~~~~~------------------~~m~a~~~~~~t~~~~a~Na~a~lprD~ 60 (364) ..-..+....-..+..+- .+... .|....... ..+..+.....+ ..+.-..+|.++ T Consensus 25 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~~~~~~~----~~~gg~lvP~~~ 100 (390) T protein:vir:40 25 GATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYYNEVIAGNG----FAGVTALLPPTV 100 (390) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHHHHhccC----cccCcccccHHH Confidence 000000000000000000 00000 000000000 000000000000 012222378877 Q ss_pred HHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeE-EEeeCCCCC----cccccceeecc--CCc Q lcl|NC_016762. 61 WAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDV-SVSIDGQAP----YSFDHTEYNSD--GDP 133 (364) Q Consensus 61 W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v-~~SmdGq~~----~~~D~~~Y~~d--GtP 133 (364) ..+|=..+.+. ..|+.+.+.+|++..-..+.+..+ .+.+ -+.=.|..+ .+|+...|..+ +.- T Consensus 101 ~~~I~~~~~~~----------s~i~~~~~~~~~~~~~~~i~~~~~-~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~ 169 (390) T protein:vir:40 101 FERVFEDLTVE----------HPLLSKINFVNTTATTEWIISVGD-VATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAY 169 (390) T ss_pred HHHHHHHHHhh----------hhhhhhceeeecCCceeEEEEEcC-CcceeeeccccccCccccccceeeEeeeeeEEEe Confidence 76553322222 335566777777643222233222 1111 111112221 12222222110 001 Q ss_pred cceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCc Q lcl|NC_016762. 134 IPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGA 213 (364) Q Consensus 134 iPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGa 213 (364) +|| - ++++- -..+|+..==...-.+++.+++.+.+|+|+++ +.+ -||-+...........+ .. T Consensus 170 i~i-------S-~ell~--ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~----~~P-~Gil~~~~~~~~~~~~~--~~ 232 (390) T protein:vir:40 170 IPV-------C-NAMLD--LGPSWLDQYVRTILGEAMALGLEAGIVNGSGK----DQP-IGMMRDLNNVTAGEHPV--KT 232 (390) T ss_pred ehh-------h-HHHHh--cchHHHHHHHHHHHHHHHHHHHHhhhhcccCC----Ccc-ceeeecccccccccccc--cc Confidence 111 0 22222 22333334344555677888888999999984 333 48876554333322221 11 Q ss_pred ceeecccCHHHHHHHHhhhhHHHHHhccc--cccceEEEEcHHHHHhhhcceeeecccccccccchhHHHHHhhccchhh Q lcl|NC_016762. 214 NIDLTTATPQQIIDFFTKGAFGQAARTNK--VDAYDVLWVSPEINANLAQPYMITMGGGANAVVAGTVLDAVMRFIPARE 291 (364) Q Consensus 214 NidlttA~~~~i~~~f~~~~f~~~~~~N~--v~~~~~lyvS~eI~~N~~~py~~~~~~~S~~~~~gTIl~~i~~~~~V~~ 291 (364) ...++.+++.+++.-+... ...+. .+..-+|+++|.-..++.+... ...+....| +...+.+ |+ . T Consensus 233 ~~~~t~~~~~~~~~~l~~~-----~~~~~~~~~~~a~~i~n~~t~~~~l~~~~-~~~d~~G~~-----v~~~~~~-g~-p 299 (390) T protein:vir:40 233 ATPLTDLTPATLATKVMLP-----LTDNGKKSVSDAILVINPADYWSKIYAAT-SYMTPQGVW-----VTGILPV-PL-E 299 (390) T ss_pred ccccchhhHHHHHHHHHHH-----hhcchhhhhcCceEEEcchhHHHHHHHHh-hccCCCCcc-----ccccCCC-ce-e Confidence 2233333333333222211 11121 2345568888875444433211 011111122 1111112 22 4 Q ss_pred hccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 292 VRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 292 I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) |+++..++.+.++.-..+.-+|-..-||-+-+-+ -..+.++...|++..-++.+++.+-- .-++==+..+ T Consensus 300 vv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~-~~~f~~~~~~~r~~~r~dg~v~~~~A--~~~l~~~~~~ 369 (390) T protein:vir:40 300 IVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTST-EYRLLDDETLYYAKQYANGRPKDNSS--FLVFDITGLE 369 (390) T ss_pred EEEcCCCCCCcEEEEeeceEEEEeecceEEEecc-hhhhhcCcEEEEEEEEeCCEEecccc--eEEEEeeccC Confidence 5677888888886666665455544455443322 22344555566666555555554321 0000001111 No 30 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=26.65 E-value=1.9 Score=19.03 Aligned_cols=246 Identities=15% Similarity=0.115 Sum_probs=110.9 Q ss_pred cchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhcc--chhHHHHHHHhhccCCCeEEEeeCCCCCc Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVL--PIGKSAKLYNVVGDIADDVSVSIDGQAPY 120 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v--~igktv~~y~~~gd~a~~v~~SmdGq~~~ 120 (364) |+.+.+... ..+-+++|..+ +++.++.. -.+-.|......+ ..|+||+.... +.+ |++..--.| . T Consensus 1 ma~~~T~~~--d~i~Pev~s~~---v~~~~~~~---~~~~~~~~~~~~l~g~~G~tv~ip~~-~~~-g~~~~~~~g---~ 67 (274) T protein:vir:96 1 MAQGTTKVS--NLIVPEVLAPM---MQAELDKK---LRFAQFADIDSTLVGQPGDTLTFPAF-TYS-GDAQVIAEG---E 67 (274) T ss_pred CCccccchh--hhhhhHHHHHH---HHHHHHhh---hhhcccccccccccCCCCCEEEEEee-ccC-CCccccCCC---C Confidence 443333211 22346777754 22222111 1111111111111 12666655332 221 222211111 1 Q ss_pred ccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCc Q lcl|NC_016762. 121 SFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRN 200 (364) Q Consensus 121 ~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n 200 (364) +++-..-...-..+.|...|++|.+-+-...+ .+.|++.......-+.+.+++..++++--. + .+ T Consensus 68 ~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~-----~--------a~- 132 (274) T protein:vir:96 68 KIPVDQIGTSKREAKVRKIGKGTELTDEAVLS-GFGDPQGEAVRQHGLAIANKVDNDVLEALK-----G--------AT- 132 (274) T ss_pred cCchhhcccceeEEEEEeeeceeeecHHHHHh-hcchHHHHHHHHHHHHHHHHHHHHHHHHHh-----c--------CC- Confidence 22333344555667888878888888866655 466777776666667777777777775311 0 00 Q ss_pred cccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc----eeeecccccccccc Q lcl|NC_016762. 201 TIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP----YMITMGGGANAVVA 276 (364) Q Consensus 201 ~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p----y~~~~~~~S~~~~~ 276 (364) +. ++=.+-+.+.|+++.. .|+. ++ .....+.|+|++...|.+- |......+.+.... T Consensus 133 -----~~-------~~~~~~~~d~i~dA~~--~l~d----~~-~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 193 (274) T protein:vir:96 133 -----LT-------VEADITKLDGLQTAID--KFND----ED-LEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVK 193 (274) T ss_pred -----CC-------cCcccccHHHHHHHHH--Hhcc----cC-CCceEEEeCHHHHHHHHhcccccccccccccccceee Confidence 10 0001124566655443 2432 23 2567899999999888542 32211111111112 Q ss_pred hhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhccccccee Q lcl|NC_016762. 277 GTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSG 356 (364) Q Consensus 277 gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sg 356 (364) | .|-++.|+ .|..+..++.+..+.+-+ +-+.-..++++.+ +++ +|...++- T Consensus 194 g----~ig~~~G~-~Vi~s~~~p~~t~~l~~~--gA~~~~~~~~~~v--------E~~--------------Rd~~~~~d 244 (274) T protein:vir:96 194 G----AFGEALGA-VIVRSNKLNKGEALLAKK--GAVKLITKRDFFL--------EKD--------------RDASRKST 244 (274) T ss_pred c----ccceecCe-eEEEcCCCCcceEEEEeC--cceeeeecCCccc--------ccc--------------cchhhccc Confidence 2 23344555 477788888776554432 2232233333221 111 11111111 Q ss_pred -----eEeeccCC Q lcl|NC_016762. 357 -----VIYGANLA 364 (364) Q Consensus 357 -----Vv~~~~l~ 364 (364) .+||.++- T Consensus 245 ~i~~~~~yg~~~~ 257 (274) T protein:vir:96 245 ALYSDKHYVAYLY 257 (274) T ss_pred EEEEeeEEEEEEE Confidence 24555554 No 31 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=26.26 E-value=2 Score=18.97 Aligned_cols=296 Identities=16% Similarity=0.118 Sum_probs=120.7 Q ss_pred HHHh---hcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEE Q lcl|NC_016762. 36 LAEH---RGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSV 112 (364) Q Consensus 36 ~a~~---~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~ 112 (364) .|+- ...|..++...+...-+| +.| ...+++.|+++..+.-+-+-..+ ..--||||+.. +.|.. . +. T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiP-ev~---s~~v~~~l~~~lv~~~l~~~~~~--~~~~GdTV~ip-~~g~~-~-a~- 70 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIP-EVW---SSEVRMFRDQKFAALEATKKIPF--EGKKGDLIHIP-NISRA-A-VY- 70 (381) T ss_pred CceecccccccCcccchhhHHhhhh-HHH---HHHHHHHHHHhhhhhhccccccc--eeecCceEEee-ccCcc-e-ee- Confidence 1333 346667777777777676 344 45667777665444221111000 11125555432 23331 1 10 Q ss_pred eeCCCCCcccccceeeccCCccce-eecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCee Q lcl|NC_016762. 113 SIDGQAPYSFDHTEYNSDGDPIPV-FTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYP 191 (364) Q Consensus 113 SmdGq~~~~~D~~~Y~~dGtPiPI-f~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t 191 (364) ......+-..+. -...-..|.| -+..+.+...+..-. ..-+|++..--..+...+++++.+.++.-...+ T Consensus 71 d~~~g~~i~~~~--~~~~~~~itID~~~~~~~~Idd~D~~-~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~------ 141 (381) T protein:vir:80 71 DKQPQTPVNLQA--RTDSEFTFTVTKYKESSFMIEDIVNT-QASYTLRQYYTKEAGYALARDMDNFALAHRAVI------ 141 (381) T ss_pred eecCCCcccccc--cCCceEEEEEeeeeecceeechHHHH-hhccChHHHHHHHHHHHHHHHHHHHHHHHHhhc------ Confidence 001111111110 0000111222 112223333332222 223477777777788899999988887543222 Q ss_pred eeeecCCCccccccccccCCCccee--ecccCHHHHHHHHhhhhHHHHHhcccc-ccceEEEEcHHHHHhhhcc-eeeec Q lcl|NC_016762. 192 AQGLRNHRNTIKVNLGSGAGGANID--LTTATPQQIIDFFTKGAFGQAARTNKV-DAYDVLWVSPEINANLAQP-YMITM 267 (364) Q Consensus 192 ~~GLrnh~n~~~~~lg~~~gGaNid--lttA~~~~i~~~f~~~~f~~~~~~N~v-~~~~~lyvS~eI~~N~~~p-y~~~~ 267 (364) .+.+.......+++..+++.. .++.+.+..++.++. ..++++.++| .++-.++|+|++...|.+- =+.+. T Consensus 142 ----~~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~--a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~a 215 (381) T protein:vir:80 142 ----NAFPSQRIYSYDTTLGDGTVNAHLTGTPAPLTYAALLL--AKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISV 215 (381) T ss_pred ----ccccccccccccccccccccccccccchhhHHHHHHHH--HHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhh Confidence 222222333333333333333 233344555555552 2345555666 3556899999999988752 11111 Q ss_pred cc-ccccccchhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheeccee--ecccccccCCCCchhhhhhhhcc Q lcl|NC_016762. 268 GG-GANAVVAGTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMAT--GVIPLPRPLPQVNYNFQIMSAMG 344 (364) Q Consensus 268 ~~-~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~--gt~p~pR~~p~~nY~f~v~~A~g 344 (364) .. +++....|. |.++.|+ .|.++..|+-+...++...... |....|. ++.-.++.-...+ .-+.....+ T Consensus 216 d~~~~~~l~~G~----Ig~i~G~-~Vv~Sn~lp~~~~t~~~~~aga--p~~~~~~~~~~~~~g~~s~~a~-av~~~k~yd 287 (381) T protein:vir:80 216 DFSQVKPVTSGV----VGTILGM-EVIVTTQIGINSLTGYVNGQGA--PTQPTPGVLGSPYLPDQAGTAN-VVNTGSASD 287 (381) T ss_pred hhccchhhhcee----eeEEcce-EEEeecccccccccceeeeccc--ccccccccccccccccccccee-eeeeeeeec Confidence 11 111222222 3344444 3566777776655554433321 1111111 1111111100000 001122223 Q ss_pred hhhhccccc----------------ceeeEeeccCC Q lcl|NC_016762. 345 IQVKKDDEG----------------LSGVIYGANLA 364 (364) Q Consensus 345 lqiK~D~~G----------------~sgVv~~~~l~ 364 (364) +.+..|..+ -+|.+-++.=. T Consensus 288 ~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 323 (381) T protein:vir:80 288 LAVSLSYFGLPVFSGAGATAADGGQTLGSFGGANRW 323 (381) T ss_pred eeeeeeeccceeeecceeeecCCCceeeeehhhhhh Confidence 333333322 22322221111 No 32 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=25.44 E-value=2.1 Score=18.87 Aligned_cols=256 Identities=13% Similarity=0.071 Sum_probs=113.7 Q ss_pred HHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhccc----chhhHHHHHHHhhhccchhHHHHHHHhhccCCCeE Q lcl|NC_016762. 35 MLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQE----TGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDV 110 (364) Q Consensus 35 m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~----~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v 110 (364) |+.. -. +=..+.|=.++|..+ +++.+... -...+.++|.+ ..|+||+. +.-.++ |++ T Consensus 1 ~~~~------~~---T~l~d~i~PEv~~~~---v~~~~~~~~~~~~~~~~~~~l~g-----~~G~tv~i-P~~~~i-g~a 61 (275) T protein:vir:96 1 MALE------NM---TKLANMVNPEVLAPM---MQAELDKKLKFAQFADIDNTLVG-----QPGNTITF-PAFVYS-GDA 61 (275) T ss_pred CCCc------cc---chhhhhhchHHHHHH---HHHHHHHhhhhcccceecccccC-----CCCCEEEe-eeeccC-Ccc Confidence 2221 11 011112336677665 22221111 11112233322 12666653 122222 223 Q ss_pred EEeeCCCCCcccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCe Q lcl|NC_016762. 111 SVSIDGQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENY 190 (364) Q Consensus 111 ~~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~ 190 (364) ..--+| .+++-..-++.-+-+.|...|++|...+...++. +.|++.......-+.+.+++...+++--.. T Consensus 62 ~~~~~g---~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~-~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~------ 131 (275) T protein:vir:96 62 KVVPEG---EEIPIDLIETKKRQATIRKIGKGTVLTDEALLSG-YGDPKGEAVRQHGLAIANKVDNDVLEALQG------ 131 (275) T ss_pred ccccCC---CCcchhhcccceeeEEeehhcccccccHHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHHhc------ Confidence 321122 2344445556666788899899988888765554 557777777777777888888777742111 Q ss_pred eeeeecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhc----ceeee Q lcl|NC_016762. 191 PAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQ----PYMIT 266 (364) Q Consensus 191 t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~----py~~~ 266 (364) ... .++-..-+.+.|+++.- .|+. .+ ...-.+.|+|++...|.+ -|... T Consensus 132 -------a~~-------------~~~~~~~~~d~i~dA~~--~lgd---~~--~~~~~ivv~p~~~~~L~k~~~~~f~~~ 184 (275) T protein:vir:96 132 -------ATL-------------KVEADITKLAGLQTAID--KFND---ED--LEPMVLFVNPLDAGKLRASATDNFTRA 184 (275) T ss_pred -------ccc-------------cccccccCHHHHHHHHH--Hhcc---cc--CCccEEEeCHHHHHHHHhccccccccc Confidence 000 01111224666665543 3432 22 356789999999888844 23211 Q ss_pred cccccccccchhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchh Q lcl|NC_016762. 267 MGGGANAVVAGTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQ 346 (364) Q Consensus 267 ~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glq 346 (364) -..+.+...+|. |-++.|+ .|.++..++.+..+.+-+ + ++++...+...-+.+..- ....=. T Consensus 185 ~~~g~~~~~~G~----ig~~~G~-~Vi~s~~~p~~t~~i~~~--g--------A~~~~~~~~~~vE~~Rd~---~~~~d~ 246 (275) T protein:vir:96 185 TLLGDNVIVKGA----FGEALGA-IIVRSNKIKEGEAILAKR--G--------AVKLITKRDFFLETERHA---SHKSTA 246 (275) T ss_pred ccccccceeccc----cceecCe-eEEEeCCCCcceEEEEec--c--------ceeeeecCCcccccccch---hhcCcE Confidence 111222222332 2334555 566777888776654432 1 222222111111111110 011111 Q ss_pred hhcccccceeeEeeccCC Q lcl|NC_016762. 347 VKKDDEGLSGVIYGANLA 364 (364) Q Consensus 347 iK~D~~G~sgVv~~~~l~ 364 (364) |+.+.---++++..++++ T Consensus 247 i~~~~~y~~~~~~~~~vv 264 (275) T protein:vir:96 247 LFSDKHYVAYLYDESKVV 264 (275) T ss_pred EEEeEEEEEEEEcCccEE Confidence 122211122333333333 No 33 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=22.92 E-value=2.4 Score=18.52 Aligned_cols=303 Identities=12% Similarity=0.022 Sum_probs=115.4 Q ss_pred CccchhhhhhhHHH---------------------------------H-HHHHHHHHHHHHHHH--HHHHHHHHhhcccc Q lcl|NC_016762. 1 MFLTQQAIAAHPRL---------------------------------M-GHFQELQANRNIWNN--QNAAMLAEHRGAMT 44 (364) Q Consensus 1 ~~ftke~~~~~~~~---------------------------------~-~q~~~L~~~R~~~~~--~~~~m~a~~~~~~t 44 (364) --.++|.-.....+ . ....+.. .++.|.+ ....++...+.+.+ T Consensus 37 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 115 (395) T protein:vir:43 37 GEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEAPKTAGQMVAESL-KEQGVTSSLRGSHRVSMPRSAIT 115 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccchhhhHHHHHHHHH-HHHHHHHHhhhhhhhhhhhhhhc Confidence 00000000000000 0 0000000 1111110 00111111111111 Q ss_pred hhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCccccc Q lcl|NC_016762. 45 PGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYSFDH 124 (364) Q Consensus 45 ~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~~D~ 124 (364) . ...++-..+|.++-.+ ++...++. ..|+.+.+.+|++.....|.+..+....+ T Consensus 116 ~--~~~~~g~~vp~~~~~~----ii~~~~~~------~~l~~l~~~~~~~~~~~~~~~~~~~~~~a-------------- 169 (395) T protein:vir:43 116 S--IDGSGGALVAPDRRPG----VVAAPQRR------LTIRDLVAPGTTESNSVEYVRETGFVNNA-------------- 169 (395) T ss_pred c--cCCCCccccchhhHHH----HHHHHHhh------hhHHhhccceecCCCceEEEEEecCCCce-------------- Confidence 1 1122222367665443 33332222 33556666666654433333322211111 Q ss_pred ceeeccCCccceeecCCc---cchhhhhhcCc-------cccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeee Q lcl|NC_016762. 125 TEYNSDGDPIPVFTAGYG---VNWRHAAGMST-------VGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQG 194 (364) Q Consensus 125 ~~Y~~dGtPiPIf~sgy~---~~WR~~~~~~t-------~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~G 194 (364) .+-.+|+.+|--+..|+ +++....+.-. +.-++..-=+..-.+.+.+++...+|+|++. +-+..| T Consensus 170 -~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~----~~~~~G 244 (395) T protein:vir:43 170 -APVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDASALQSYIDARARYGLMLVEECQLLYGNGT----GANLHG 244 (395) T ss_pred -eeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC----CCcccc Confidence 11224445553333222 11111111111 1112333334445567777888889999763 445677 Q ss_pred ecCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhh-------cceeeec Q lcl|NC_016762. 195 LRNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLA-------QPYMITM 267 (364) Q Consensus 195 Lrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~-------~py~~~~ 267 (364) |.+..........++ .. +....++|+++. ..+... + .....|++||..+.-|. +|.+.+. T Consensus 245 i~~~~~~~~~~~~~~---~~---~~~~~~~i~~~~-----~~~~~~-~-~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~ 311 (395) T protein:vir:43 245 IIPQAQAYAPPSGVV---VT---AEQRIDRIRLAI-----LQAQLA-E-FPASGIVLNPIDWALIELNKDAENRYIIGSP 311 (395) T ss_pred ccccccccccccccc---cc---cchhHHHHHHHH-----Hhhccc-c-CCCcEEEEcHHHHHHHHHhhccCCceecccc Confidence 776554433332211 00 011234443333 222222 2 23457999999986654 3333211 Q ss_pred ccccccccchhHHHHHhhccchhhhccccccCCCeEEEEEcCcce-eeheecceeeccccccc-CCCCchhhhhhhhcch Q lcl|NC_016762. 268 GGGANAVVAGTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDV-VSPLVGMATGVIPLPRP-LPQVNYNFQIMSAMGI 345 (364) Q Consensus 268 ~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dv-I~plvGmp~gt~p~pR~-~p~~nY~f~v~~A~gl 345 (364) . ++ ..+| |+.+| |+.+..++.+.++.-.-+..+ +--..|+.+-.-+.... +-++-.-|+++.-++. T Consensus 312 ~---~~-~~~~----l~G~p----Vv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~ 379 (395) T protein:vir:43 312 Q---NG-TTPT----LWRLP----VVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAF 379 (395) T ss_pred c---cC-CCce----eccee----eEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeecc Confidence 0 01 1222 23332 577888888887655544422 33344666643332221 1222234455555555 Q ss_pred hhhcccccceeeEeeccCC Q lcl|NC_016762. 346 QVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 346 qiK~D~~G~sgVv~~~~l~ 364 (364) .+.... .+++..==+ T Consensus 380 ~v~~~~----a~~~~~~ta 394 (395) T protein:vir:43 380 AVYRPE----AFVTGSLTA 394 (395) T ss_pred EEeccc----ceEEEEecc Confidence 553322 233221111 No 34 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=22.81 E-value=2.4 Score=18.51 Aligned_cols=246 Identities=13% Similarity=0.142 Sum_probs=105.7 Q ss_pred cchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhcc--chhHHHHH--HHhhccCCCeEEEeeCCCC Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTVL--PIGKSAKL--YNVVGDIADDVSVSIDGQA 118 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v--~igktv~~--y~~~gd~a~~v~~SmdGq~ 118 (364) |+ .+-.++.|=.++|..+=++-+ .+.-.|-.+..+...+ .-|++|++ |.-+|| +..=-+| T Consensus 1 Ma----~T~~~d~I~Pev~~~~V~e~~------~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igd----ae~~~eg-- 64 (270) T protein:vir:95 1 MT----QTKKANLINPEVLANVVSAQM------QNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGA----AEDLQEG-- 64 (270) T ss_pred CC----ceehhhhcchHHHHHHHHHHH------HhHHhhccccccccccCCCCCCEEEeeeecCCCc----cccccCC-- Confidence 22 122223334666665422211 1111111111111111 13777654 444444 2221222 Q ss_pred CcccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCC Q lcl|NC_016762. 119 PYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNH 198 (364) Q Consensus 119 ~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh 198 (364) .+++-..-.+.-+-..|...|.+|..-+...+.+ +-|++.......-+.+.+++..-++. ...| T Consensus 65 -~~i~~~~lt~~~~~a~i~~~gk~~~itD~a~~~~-~~dp~~~~~~q~a~~~a~~~d~~li~----------~l~~---- 128 (270) T protein:vir:95 65 -VAMDTTQMSMTTTKVTVKETGKAVEVTQTAIITN-VNGTLQEASRQLAMSLADKVEIDYIA----------ELNK---- 128 (270) T ss_pred -CccchhhcccchheeeeehhhCcceecHHHHhhh-ccchHHHHHHHHHHHHHHHHHHHHHH----------Hhcc---- Confidence 3455666677788888999999999999766654 33544444333333344444333321 1111 Q ss_pred CccccccccccCCCcceee-cccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhc-ceeeecccccccccc Q lcl|NC_016762. 199 RNTIKVNLGSGAGGANIDL-TTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQ-PYMITMGGGANAVVA 276 (364) Q Consensus 199 ~n~~~~~lg~~~gGaNidl-ttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~-py~~~~~~~S~~~~~ 276 (364) +...- .+.+.+.++++.- .|+. .......+.|.|...+.|.+ .++ .-..++++... T Consensus 129 --------------a~~~~~~~~t~~~~~dA~~--~lgd-----~~~~~~~i~vhs~~~~~Lrk~~~~-~~~~~~~~~~~ 186 (270) T protein:vir:95 129 --------------SKQTATVSADATGILDAIE--VFNS-----ENDEDYVLYVNPKDYNKLVKSLFK-VGGNVQDRAIS 186 (270) T ss_pred --------------cccccccccCHHHHHHHHH--Hhcc-----ccCCCcEEEEcHHHHHHHHhhhcc-cccccccchhc Confidence 11111 1245566655543 3443 23446679999999999864 322 22233333321 Q ss_pred hhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhhhccccccee Q lcl|NC_016762. 277 GTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQVKKDDEGLSG 356 (364) Q Consensus 277 gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqiK~D~~G~sg 356 (364) .. .|-.+.|+.-|+.+-.++....+.+ .++-|.-.++.++.+.+. |+--. ..=.+..| T Consensus 187 ~G---~ig~~~G~~Viv~s~~~~~~~~~l~--~~gAi~~~~~~~~~vEtd-Rd~~~----------~~d~i~~~------ 244 (270) T protein:vir:95 187 KG---DLVEIVGVSDIVKSKRVSENTAFLQ--RYGAMEIVNKKKPEAYTD-FDILK----------RTHLLSTN------ 244 (270) T ss_pred cc---ccceecceeEEEeCCCCCceeEEEE--eccceeeeecCCceeeec-cchhh----------cccEEEee------ Confidence 11 2334455555555555555444332 233333333333222111 11000 00011111 Q ss_pred eEeeccCC Q lcl|NC_016762. 357 VIYGANLA 364 (364) Q Consensus 357 Vv~~~~l~ 364 (364) ..||..|. T Consensus 245 ~~y~v~~~ 252 (270) T protein:vir:95 245 YHYSVNLK 252 (270) T ss_pred eEEEEEEE Confidence 22333333 No 35 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=22.59 E-value=2.4 Score=18.47 Aligned_cols=257 Identities=16% Similarity=0.140 Sum_probs=106.8 Q ss_pred cchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhc-c-chhHHHHHHHhhccCCCeEEEeeCCCCCc Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTV-L-PIGKSAKLYNVVGDIADDVSVSIDGQAPY 120 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~-v-~igktv~~y~~~gd~a~~v~~SmdGq~~~ 120 (364) |+..-+-.. +.|-.+.|..+ +++.+++. -.+..|.-+..- . .-|++|+..... .+ +++..-=.|. T Consensus 1 MA~~~T~~~--~~~iPev~s~~---v~~~~~~~---~~~~~~~~~~~~~~g~~G~tv~iP~~~-~~-~~a~~v~eg~--- 67 (272) T protein:vir:98 1 MAVGTTKMA--QMLDPEVLADM---IDAEVGKA---IRFAPLAEVDTTLEGQPGTTLTVPKWD-YI-GDAEDVAEGE--- 67 (272) T ss_pred CCCccccch--heechHHHHHH---HHHHHHHH---hhhhccccccccccCCCCCEEEEEEec-CC-CCcccccCCC--- Confidence 443322212 22335666543 33322111 112222211100 0 024455443221 11 1122111221 Q ss_pred ccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCc Q lcl|NC_016762. 121 SFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRN 200 (364) Q Consensus 121 ~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n 200 (364) .++-..-.++..-+.+..-+..|..-+-...++ ..|++..-.....+.+.+++...+++--.. . T Consensus 68 ~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s-~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-------------a-- 131 (272) T protein:vir:98 68 AIPMTQLGFKKTTMTIKKAGKGVEITDEAILSG-YGDPVGQAAKQIVEAIDHKVDADVLDALSK-------------S-- 131 (272) T ss_pred cccccccccceEEEEeeeeeeeeeecHHHHhhc-cccHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------c-- Confidence 223333445555666777666676666554443 456666666677777777887777753111 0 Q ss_pred cccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc---eeeeccccccc-ccc Q lcl|NC_016762. 201 TIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP---YMITMGGGANA-VVA 276 (364) Q Consensus 201 ~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p---y~~~~~~~S~~-~~~ 276 (364) ++..+ ..++.+.|.++.. ++.+.+ .....|.|+|+....|.+- =+...+.+..+ ... T Consensus 132 ----~~~~~--------~~~t~d~i~da~~-----~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:98 132 ----TQTVE--------ATATVDGVSKALD-----IFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred ----ccccc--------cccCHHHHHHHHH-----HHhccC--CCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 00000 1235566655432 232333 4456899999999887542 11111111111 112 Q ss_pred hhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCC-Cc-------hhhhhhh-hcchhh Q lcl|NC_016762. 277 GTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQ-VN-------YNFQIMS-AMGIQV 347 (364) Q Consensus 277 gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~-~n-------Y~f~v~~-A~glqi 347 (364) |. +-.+.|+ .|..+..++.+..+.+... -+....+.++.+... |+--. .+ |.+.+.. .....+ T Consensus 193 g~----ig~i~G~-~Vi~s~~~p~~t~~~~~~~--a~~~~~~~~~~ve~~-r~~~~~~~~i~~~~~~~~~v~~~~~vv~~ 264 (272) T protein:vir:98 193 GV----YGEVLGV-QIVRSRKCPKGTAYMVRKG--ALRIMLKRNTMVETD-RDITKAINQIVANKHYGVYLYKAEKAVKI 264 (272) T ss_pred cc----chhhcCe-eEEEcCCCCcceEEEEcCC--eEEEEecCCceeeec-cccccceeEEEEEEEEEEEEEcCCceEEE Confidence 21 2234565 4788888888777665443 222233333322211 22100 00 1111111 111222 Q ss_pred hcccccce Q lcl|NC_016762. 348 KKDDEGLS 355 (364) Q Consensus 348 K~D~~G~s 355 (364) |.+.-||. T Consensus 265 t~~~a~~~ 272 (272) T protein:vir:98 265 TLKDAAKK 272 (272) T ss_pred EecccccC Confidence 22222322 No 36 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=22.59 E-value=2.4 Score=18.47 Aligned_cols=257 Identities=16% Similarity=0.140 Sum_probs=106.8 Q ss_pred cchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhhhc-c-chhHHHHHHHhhccCCCeEEEeeCCCCCc Q lcl|NC_016762. 43 MTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQTV-L-PIGKSAKLYNVVGDIADDVSVSIDGQAPY 120 (364) Q Consensus 43 ~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~~~-v-~igktv~~y~~~gd~a~~v~~SmdGq~~~ 120 (364) |+..-+-.. +.|-.+.|..+ +++.+++. -.+..|.-+..- . .-|++|+..... .+ +++..-=.|. T Consensus 1 MA~~~T~~~--~~~iPev~s~~---v~~~~~~~---~~~~~~~~~~~~~~g~~G~tv~iP~~~-~~-~~a~~v~eg~--- 67 (272) T protein:vir:30 1 MAVGTTKMA--QMLDPEVLADM---IDAEVGKA---IRFAPLAEVDTTLEGQPGTTLTVPKWD-YI-GDAEDVAEGE--- 67 (272) T ss_pred CCCccccch--heechHHHHHH---HHHHHHHH---hhhhccccccccccCCCCCEEEEEEec-CC-CCcccccCCC--- Confidence 443322212 22335666543 33322111 112222211100 0 024455443221 11 1122111221 Q ss_pred ccccceeeccCCccceeecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCc Q lcl|NC_016762. 121 SFDHTEYNSDGDPIPVFTAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRN 200 (364) Q Consensus 121 ~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n 200 (364) .++-..-.++..-+.+..-+..|..-+-...++ ..|++..-.....+.+.+++...+++--.. . T Consensus 68 ~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s-~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-------------a-- 131 (272) T protein:vir:30 68 AIPMTQLGFKKTTMTIKKAGKGVEITDEAILSG-YGDPVGQAAKQIVEAIDHKVDADVLDALSK-------------S-- 131 (272) T ss_pred cccccccccceEEEEeeeeeeeeeecHHHHhhc-cccHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------c-- Confidence 223333445555666777666676666554443 456666666677777777887777753111 0 Q ss_pred cccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc---eeeeccccccc-ccc Q lcl|NC_016762. 201 TIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP---YMITMGGGANA-VVA 276 (364) Q Consensus 201 ~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p---y~~~~~~~S~~-~~~ 276 (364) ++..+ ..++.+.|.++.. ++.+.+ .....|.|+|+....|.+- =+...+.+..+ ... T Consensus 132 ----~~~~~--------~~~t~d~i~da~~-----~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:30 132 ----TQTVE--------ATATVDGVSKALD-----IFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred ----ccccc--------cccCHHHHHHHHH-----HHhccC--CCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 00000 1235566655432 232333 4456899999999887542 11111111111 112 Q ss_pred hhHHHHHhhccchhhhccccccCCCeEEEEEcCcceeeheecceeecccccccCCC-Cc-------hhhhhhh-hcchhh Q lcl|NC_016762. 277 GTVLDAVMRFIPAREVRQTFALSGNEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQ-VN-------YNFQIMS-AMGIQV 347 (364) Q Consensus 277 gTIl~~i~~~~~V~~I~~~~~LtgNe~lg~v~~~dvI~plvGmp~gt~p~pR~~p~-~n-------Y~f~v~~-A~glqi 347 (364) |. +-.+.|+ .|..+..++.+..+.+... -+....+.++.+... |+--. .+ |.+.+.. .....+ T Consensus 193 g~----ig~i~G~-~Vi~s~~~p~~t~~~~~~~--a~~~~~~~~~~ve~~-r~~~~~~~~i~~~~~~~~~v~~~~~vv~~ 264 (272) T protein:vir:30 193 GV----YGEVLGV-QIVRSRKCPKGTAYMVRKG--ALRIMLKRNTMVETD-RDITKAINQIVANKHYGVYLYKAEKAVKI 264 (272) T ss_pred cc----chhhcCe-eEEEcCCCCcceEEEEcCC--eEEEEecCCceeeec-cccccceeEEEEEEEEEEEEEcCCceEEE Confidence 21 2234565 4788888888777665443 222233333322211 22100 00 1111111 111222 Q ss_pred hcccccce Q lcl|NC_016762. 348 KKDDEGLS 355 (364) Q Consensus 348 K~D~~G~s 355 (364) |.+.-||. T Consensus 265 t~~~a~~~ 272 (272) T protein:vir:30 265 TLKDAAKK 272 (272) T ss_pred EecccccC Confidence 22222322 No 37 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=22.48 E-value=2.4 Score=18.46 Aligned_cols=309 Identities=9% Similarity=0.008 Sum_probs=117.2 Q ss_pred Cccchh-------hhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----------ccchhhhhhhhhccccHHHH Q lcl|NC_016762. 1 MFLTQQ-------AIAAH--PRLMGHFQELQANRNIWNNQNAAMLAEHRG----------AMTPGMLACNALAGLGREFW 61 (364) Q Consensus 1 ~~ftke-------~~~~~--~~~~~q~~~L~~~R~~~~~~~~~m~a~~~~----------~~t~~~~a~Na~a~lprD~W 61 (364) ..-+++ ..... ....... ....+.++.+ .+...... +.......-+....+|..+ T Consensus 105 ~~~~~e~~~~~~~~~~~~~~~~~~~~~--~~~e~~~~~~---~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~- 178 (458) T protein:vir:10 105 LLTAREGRSFVGDSVAKALYGTQENFE--DEVEKLVLLS---YVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIF- 178 (458) T ss_pred HHHHHHhhhhhhhhhhccchhhhhhHH--HHHHHHHHHH---HHHhhccchhhhhhhhhhhhhhcccCccccceehhhH- Confidence 000000 00000 0000000 1111112211 00000000 0000000001112356544 Q ss_pred HHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcc---------cccceeeccCC Q lcl|NC_016762. 62 AEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS---------FDHTEYNSDGD 132 (364) Q Consensus 62 ~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~---------~D~~~Y~~dGt 132 (364) .+.++...++.. =|+.+.+.+|++-....|.+..+...-..+.-.+..|.. |+... T Consensus 179 ---~~~ii~~~~~~~------~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~------ 243 (458) T protein:vir:10 179 ---SQRIIRDLQKEL------VVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIH------ 243 (458) T ss_pred ---hHHHHHHHHhhh------hHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccceeeE------ Confidence 444444433322 144556667775555555554442221122333333322 22222 Q ss_pred ccceeecCCccch-hhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCC Q lcl|NC_016762. 133 PIPVFTAGYGVNW-RHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAG 211 (364) Q Consensus 133 PiPIf~sgy~~~W-R~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~g 211 (364) ++.++-+-.+.. ++++-.. .+++..==+..-.+++.+++...+|+|+++ | .-.||-+++.....+.-.+ T Consensus 244 -~~~~k~~~~v~is~ell~ds--~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~----~-~p~Gi~~~~~~~~~~~~~~-- 313 (458) T protein:vir:10 244 -FSTYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEAFMTGDGS----G-KPKGLLTLASEDSAKVVTE-- 313 (458) T ss_pred -eeeeeEEeeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHhhcCCCC----C-ccceeeecccccccceeec-- Confidence 222221111111 2333222 244445455566788888999999999985 3 3468888876554333221 Q ss_pred CcceeecccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhc-------ceee-ecccccccccchhHHHHH Q lcl|NC_016762. 212 GANIDLTTATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQ-------PYMI-TMGGGANAVVAGTVLDAV 283 (364) Q Consensus 212 GaNidlttA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~-------py~~-~~~~~S~~~~~gTIl~~i 283 (364) ....+.++.+.++|+++.. .+.. ++ ...-.|+++|..+.-|.. |-+. +......+..++| | T Consensus 314 ~~~~~~~~~~~~~i~~~~~-----~l~~-~~-~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~----l 382 (458) T protein:vir:10 314 AKADGSVLVTAKTISKLRR-----KLGR-HG-LKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGR----I 382 (458) T ss_pred ccccccccccHHHHHHHHH-----hhhh-hh-cCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCce----e Confidence 1222334557788877554 2222 22 234679999999876542 1110 0000001111222 2 Q ss_pred hhccchhhhccccccCC-----CeEEEEEcCcceeeheecceeecccccccCC-CCch-hhhhhhhcchhhhccccccee Q lcl|NC_016762. 284 MRFIPAREVRQTFALSG-----NEFLGYQRRRDVVSPLVGMATGVIPLPRPLP-QVNY-NFQIMSAMGIQVKKDDEGLSG 356 (364) Q Consensus 284 ~~~~~V~~I~~~~~Ltg-----Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~p-~~nY-~f~v~~A~glqiK~D~~G~sg 356 (364) +.+| |..+..++. .-++|.-.+.=+|-...|+-+-+ ..+ ..+. -|+...-+|+++... ++ T Consensus 383 ~G~p----v~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~-----d~~~~~~~~~~~~~~r~~~~v~~~----~a 449 (458) T protein:vir:10 383 YGLP----VVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVER-----ERQAGKQRDAYYVTQRVNLQRYFA----NG 449 (458) T ss_pred ccee----eEEccccccccCCcceEEEEecccEEEEEeeceEEEe-----ecccCCCceEEEEEEEecceEecc----cc Confidence 2221 122222222 11233222222233333444321 111 1111 244445556665544 45 Q ss_pred eEeeccCC Q lcl|NC_016762. 357 VIYGANLA 364 (364) Q Consensus 357 Vv~~~~l~ 364 (364) +|.++=-| T Consensus 450 ~v~~~~aa 457 (458) T protein:vir:10 450 VVSGTYAA 457 (458) T ss_pred eEEEeecc Confidence 55543323 No 38 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=21.95 E-value=2.5 Score=18.38 Aligned_cols=239 Identities=13% Similarity=0.180 Sum_probs=96.6 Q ss_pred hhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhh--hccchhHHHHHHHhhccCCCeEEEeeCCCCCcc----- Q lcl|NC_016762. 49 ACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQ--TVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS----- 121 (364) Q Consensus 49 a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~--~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~----- 121 (364) ++|..- + .+.|. ..+++.|+++.- +..|.... ....-|+| |++-..|..+.. T Consensus 1 MA~~~~-~-pe~~~---~~v~~~~~~~lv---~~~l~~~~~~~~~~~Gdt-------------v~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:10 1 MAFNNF-I-PELWS---DMLLEEWTAQTV---FANLVNREYEGTASKGNV-------------VHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccchhh-h-HHHHH---HHHHHHHHhhhc---cchhhccccccccccCce-------------EEEeecccccccccccC Confidence 334331 4 44664 344555544322 22332211 11222444 444433332211 Q ss_pred ---cccceeeccCCccce-e--ecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeee Q lcl|NC_016762. 122 ---FDHTEYNSDGDPIPV-F--TAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGL 195 (364) Q Consensus 122 ---~D~~~Y~~dGtPiPI-f--~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GL 195 (364) .+-... ..+.+|+ + +..+.+.+-+..-.+.. .|+.. -.......+.+++..++++= + .+ T Consensus 60 ~~~~~~~~~--~~~~~~~tid~~~~~~~~i~d~d~~~~~-~~~~~-~~~~~~~alA~~vD~~i~~~---~-------~~- 124 (273) T protein:vir:10 60 GRQTSADAI--SDTGVDLLIDQEKSIDFLVDDIDRVQVA-GSLEA-YTRAGATALATDTDKFIADM---L-------VD- 124 (273) T ss_pred CCccCcccc--ccceEEEEEeeeeecceEeecHHHhhhh-ccHHH-HHHHHHHHHHHHHHHHHHHH---H-------hc- Confidence 111111 2222333 2 23455555543333322 34433 34445567788887777641 0 00 Q ss_pred cCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhcccc-ccceEEEEcHHHHHhhhc--ceeeec--ccc Q lcl|NC_016762. 196 RNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKV-DAYDVLWVSPEINANLAQ--PYMITM--GGG 270 (364) Q Consensus 196 rnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v-~~~~~lyvS~eI~~N~~~--py~~~~--~~~ 270 (364) +.... .++ ++.+++.+++.|.. +. +.++.+++ ...-.++|+|+....|.. -++.+. ... T Consensus 125 --a~~~~-------~~~-----~~~~~~~~~~~i~~-a~-~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~ 188 (273) T protein:vir:10 125 --NGTAL-------TGS-----APTDADDAFDLIAK-AL-KELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD 188 (273) T ss_pred --ccccc-------ccc-----cccchhHHHHHHHH-HH-HHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccc Confidence 00000 011 12244555666553 33 33445554 456679999999999854 233211 111 Q ss_pred cccccchhHHHHHhhccchhhhccccccCC---CeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhh Q lcl|NC_016762. 271 ANAVVAGTVLDAVMRFIPAREVRQTFALSG---NEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQV 347 (364) Q Consensus 271 S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg---Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqi 347 (364) +.....|. |.++.|+ .|.++..|+- .+++...+++-.... +-..+-+...+....++ | T Consensus 189 ~~~l~~G~----ig~i~G~-~v~~s~~lp~~~~~~~~~~~~~A~~~a~---q~~~~e~~r~~~~~~~~-----------v 249 (273) T protein:vir:10 189 AAGLRAGT----IGNLLGA-RIVESNNLRDTDDEQFVAFHPSAAAYVS---QIDTVEALRDQDSFSDR-----------I 249 (273) T ss_pred ccceeeee----eeEEece-EEEEecccccCCccEEEEEeccceeeee---eeehhhcccCCCcceee-----------e Confidence 12233444 4456665 6777777753 345555443311100 00011111111110111 1 Q ss_pred hcccccceeeEeeccCC Q lcl|NC_016762. 348 KKDDEGLSGVIYGANLA 364 (364) Q Consensus 348 K~D~~G~sgVv~~~~l~ 364 (364) .|+ .+||+++- T Consensus 250 ----~~~--~~yg~~v~ 260 (273) T protein:vir:10 250 ----RAL--HVYGGKVV 260 (273) T ss_pred ----eee--eeeeeeEe Confidence 111 34555444 No 39 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=21.95 E-value=2.5 Score=18.38 Aligned_cols=239 Identities=13% Similarity=0.180 Sum_probs=96.6 Q ss_pred hhhhhccccHHHHHHHHHHHHHHhcccchhhHHHHHHHhh--hccchhHHHHHHHhhccCCCeEEEeeCCCCCcc----- Q lcl|NC_016762. 49 ACNALAGLGREFWAEIDAQIIQYRNQETGMEIVNDLLQVQ--TVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYS----- 121 (364) Q Consensus 49 a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i~nDLm~l~--~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~----- 121 (364) ++|..- + .+.|. ..+++.|+++.- +..|.... ....-|+| |++-..|..+.. T Consensus 1 MA~~~~-~-pe~~~---~~v~~~~~~~lv---~~~l~~~~~~~~~~~Gdt-------------v~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:10 1 MAFNNF-I-PELWS---DMLLEEWTAQTV---FANLVNREYEGTASKGNV-------------VHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccchhh-h-HHHHH---HHHHHHHHhhhc---cchhhccccccccccCce-------------EEEeecccccccccccC Confidence 334331 4 44664 344555544322 22332211 11222444 444433332211 Q ss_pred ---cccceeeccCCccce-e--ecCCccchhhhhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeee Q lcl|NC_016762. 122 ---FDHTEYNSDGDPIPV-F--TAGYGVNWRHAAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGL 195 (364) Q Consensus 122 ---~D~~~Y~~dGtPiPI-f--~sgy~~~WR~~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GL 195 (364) .+-... ..+.+|+ + +..+.+.+-+..-.+.. .|+.. -.......+.+++..++++= + .+ T Consensus 60 ~~~~~~~~~--~~~~~~~tid~~~~~~~~i~d~d~~~~~-~~~~~-~~~~~~~alA~~vD~~i~~~---~-------~~- 124 (273) T protein:vir:10 60 GRQTSADAI--SDTGVDLLIDQEKSIDFLVDDIDRVQVA-GSLEA-YTRAGATALATDTDKFIADM---L-------VD- 124 (273) T ss_pred CCccCcccc--ccceEEEEEeeeeecceEeecHHHhhhh-ccHHH-HHHHHHHHHHHHHHHHHHHH---H-------hc- Confidence 111111 2222333 2 23455555543333322 34433 34445567788887777641 0 00 Q ss_pred cCCCccccccccccCCCcceeecccCHHHHHHHHhhhhHHHHHhcccc-ccceEEEEcHHHHHhhhc--ceeeec--ccc Q lcl|NC_016762. 196 RNHRNTIKVNLGSGAGGANIDLTTATPQQIIDFFTKGAFGQAARTNKV-DAYDVLWVSPEINANLAQ--PYMITM--GGG 270 (364) Q Consensus 196 rnh~n~~~~~lg~~~gGaNidlttA~~~~i~~~f~~~~f~~~~~~N~v-~~~~~lyvS~eI~~N~~~--py~~~~--~~~ 270 (364) +.... .++ ++.+++.+++.|.. +. +.++.+++ ...-.++|+|+....|.. -++.+. ... T Consensus 125 --a~~~~-------~~~-----~~~~~~~~~~~i~~-a~-~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~ 188 (273) T protein:vir:10 125 --NGTAL-------TGS-----APTDADDAFDLIAK-AL-KELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD 188 (273) T ss_pred --ccccc-------ccc-----cccchhHHHHHHHH-HH-HHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccc Confidence 00000 011 12244555666553 33 33445554 456679999999999854 233211 111 Q ss_pred cccccchhHHHHHhhccchhhhccccccCC---CeEEEEEcCcceeeheecceeecccccccCCCCchhhhhhhhcchhh Q lcl|NC_016762. 271 ANAVVAGTVLDAVMRFIPAREVRQTFALSG---NEFLGYQRRRDVVSPLVGMATGVIPLPRPLPQVNYNFQIMSAMGIQV 347 (364) Q Consensus 271 S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg---Ne~lg~v~~~dvI~plvGmp~gt~p~pR~~p~~nY~f~v~~A~glqi 347 (364) +.....|. |.++.|+ .|.++..|+- .+++...+++-.... +-..+-+...+....++ | T Consensus 189 ~~~l~~G~----ig~i~G~-~v~~s~~lp~~~~~~~~~~~~~A~~~a~---q~~~~e~~r~~~~~~~~-----------v 249 (273) T protein:vir:10 189 AAGLRAGT----IGNLLGA-RIVESNNLRDTDDEQFVAFHPSAAAYVS---QIDTVEALRDQDSFSDR-----------I 249 (273) T ss_pred ccceeeee----eeEEece-EEEEecccccCCccEEEEEeccceeeee---eeehhhcccCCCcceee-----------e Confidence 12233444 4456665 6777777753 345555443311100 00011111111110111 1 Q ss_pred hcccccceeeEeeccCC Q lcl|NC_016762. 348 KKDDEGLSGVIYGANLA 364 (364) Q Consensus 348 K~D~~G~sgVv~~~~l~ 364 (364) .|+ .+||+++- T Consensus 250 ----~~~--~~yg~~v~ 260 (273) T protein:vir:10 250 ----RAL--HVYGGKVV 260 (273) T ss_pred ----eee--eeeeeeEe Confidence 111 34555444 No 40 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=21.64 E-value=2.6 Score=18.34 Aligned_cols=292 Identities=15% Similarity=0.016 Sum_probs=107.4 Q ss_pred CccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccchhhhhhhhhccccHHHHHHHHHHHHHHhcccchhhH Q lcl|NC_016762. 1 MFLTQQAIAAHPRLMGHFQELQANRNIWNNQNAAMLAEHRGAMTPGMLACNALAGLGREFWAEIDAQIIQYRNQETGMEI 80 (364) Q Consensus 1 ~~ftke~~~~~~~~~~q~~~L~~~R~~~~~~~~~m~a~~~~~~t~~~~a~Na~a~lprD~W~e~D~~~~q~~~q~~~~~i 80 (364) ---..+....+......+......+.... .......+ ...+. ...++-..+|.++..+| ++..++.. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~--~~~~~g~lip~~~~~~i----i~~~~~~~---- 141 (390) T protein:vir:97 75 HVSVGDMFVASEQFQASTGRWNDRSARAT-MNIKAALN--TASTD--AAGSAGALTTPNRLPGF----ITPPDARL---- 141 (390) T ss_pred cccchhhhhhhHHHHHHHHHhhhhhhhhh-hHHHHHHH--hhhcc--cccccccccchhhhHHH----HHHHhhhh---- Confidence 00001111111111111111111000000 00000000 01111 11222233677766544 33322221 Q ss_pred HHHHHHhhhccchhHHHHHHHhhccCCCeEEEeeCCCCCcccccceeeccCCccceeecCCc---cch----------hh Q lcl|NC_016762. 81 VNDLLQVQTVLPIGKSAKLYNVVGDIADDVSVSIDGQAPYSFDHTEYNSDGDPIPVFTAGYG---VNW----------RH 147 (364) Q Consensus 81 ~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~SmdGq~~~~~D~~~Y~~dGtPiPIf~sgy~---~~W----------R~ 147 (364) -|.++.+.+|++.....|.+....++.+ .+--+|+.+|--+..|+ ++. |+ T Consensus 142 --~i~~~~~~~~~~~~~~~~~~~~~~~~~a---------------~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e 204 (390) T protein:vir:97 142 --TVRDLIGSGRTDSALIEYVQETGFVNNA---------------AIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQ 204 (390) T ss_pred --hhHhhcceeeccCCceEEEEEecCCcce---------------eeecCCccccccccceeEEEEeeeeEEEeehhhHH Confidence 2334556666543333333322211211 12224444443332221 111 12 Q ss_pred hhhcCccccchhhhhHHHHHHHHHHHHHhhhhccCCceeecCeeeeeecCCCccccccccccCCCcceeecccCHHHHHH Q lcl|NC_016762. 148 AAGMSTVGIDLVLDSQAAKLRKFNKRIVAYTLDGATNIQVENYPAQGLRNHRNTIKVNLGSGAGGANIDLTTATPQQIID 227 (364) Q Consensus 148 ~~~~~t~g~D~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~t~~GLrnh~n~~~~~lg~~~gGaNidlttA~~~~i~~ 227 (364) ++- .+ .++..-=...-.+.+.+++...+|+|++. +....||.+.........+.+ .....++|++ T Consensus 205 ll~-ds--~~l~~~i~~~la~a~~~~~d~a~l~G~g~----~~~p~Gi~~~~~~~~~~~~~~--------~~~~~d~~~~ 269 (390) T protein:vir:97 205 ILS-DA--PQLASYMNNRLIRGLKVKEDAEILRGTGA----NDGLLGLIPQATTYAAPTTIA--------GATRVDQLRL 269 (390) T ss_pred HHH-hH--HHHHHHHHHHHHHHHHHHHHHHHhhcCCC----Cccccceeecccccccccccc--------ccchHHHHHH Confidence 211 11 13333334445677888888999999764 234568877654332222110 0112344433 Q ss_pred HHhhhhHHHHHhccccccceEEEEcHHHHHhhh-------cceeeecccccccccchhHHHHHhhccchhhhccccccCC Q lcl|NC_016762. 228 FFTKGAFGQAARTNKVDAYDVLWVSPEINANLA-------QPYMITMGGGANAVVAGTVLDAVMRFIPAREVRQTFALSG 300 (364) Q Consensus 228 ~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~-------~py~~~~~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg 300 (364) +. ..++. ++ .....|++||..+.-|. +|.+. +...+++ ..|+.+| ++.+..++. T Consensus 270 ~~-----~~~~~-~~-~~~~~~v~n~~~~~~L~~lkd~~G~~l~~------~~~~~~~--~~l~G~p----V~~~~~~~~ 330 (390) T protein:vir:97 270 AM-----LQASL-AE-YPASGIVINPIDWAAIELAKDANNQYLIG------NARGTLT--PTLWGLP----VVATQAMAP 330 (390) T ss_pred HH-----Hhhcc-cc-CCCCEEEEcHHHHHHHHHhhcCCCceeec------CccCCCC--ceeccee----eEEcCCCCC Confidence 33 22332 22 34567999999987776 33321 1111110 1233332 455667777 Q ss_pred CeEEEEEcCcce-eeheecceeecccccccC--CCCch-hhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 301 NEFLGYQRRRDV-VSPLVGMATGVIPLPRPL--PQVNY-NFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 301 Ne~lg~v~~~dv-I~plvGmp~gt~p~pR~~--p~~nY-~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) ++++....+..+ +-...|+.+-+ .+.. -..|+ .|++..-++..+... +. ++++. || T Consensus 331 ~~~~~gd~~~~~~~~~~~~~~i~~---~~~~~~f~~~~~~~r~~~r~d~~v~~~-~a---~v~~~-~a 390 (390) T protein:vir:97 331 GEFLVGAFDLAAQIFDQWDARVEI---GYVNDDFQRNMVTVLAEERLALVVYRP-EA---LITGS-FA 390 (390) T ss_pred CcEEEEeccceEEEEEecceEEEE---eecccccccCcEEEEEEEeeccEEecc-cc---EEEEE-eC Confidence 777766655433 23333333321 1111 11111 133333333333222 11 11111 11 No 41 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=20.34 E-value=2.8 Score=18.14 Aligned_cols=271 Identities=14% Similarity=0.079 Sum_probs=114.8 Q ss_pred cchh----hhhhhhhccccHHHH-HHHHHHHHHHhcccchhhHHHHHHHhhhccchhHHHHHHHhhccCCCeEEE---ee Q lcl|NC_016762. 43 MTPG----MLACNALAGLGREFW-AEIDAQIIQYRNQETGMEIVNDLLQVQTVLPIGKSAKLYNVVGDIADDVSV---SI 114 (364) Q Consensus 43 ~t~~----~~a~Na~a~lprD~W-~e~D~~~~q~~~q~~~~~i~nDLm~l~~~v~igktv~~y~~~gd~a~~v~~---Sm 114 (364) |+.. .-+.++.+.- .++. +++..++.+-|+. -+++-+++.+ |.|.=|||++.+++ |...=+.+. ++ T Consensus 1 ms~~n~~t~~~~~~~~~~-~al~le~f~geV~taf~~---~s~~~~~~~~-rti~~gkS~q~~~i-G~~~~~~~~~G~~l 74 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEV-DSLLIEKFNNRVHEQYLK---GENLLQWFDV-QEVVGTNSVSNKYI-GETELQVLSPGKSP 74 (364) T ss_pred CCCcccccccccccccch-hhhhhhhhhhhHHHHHHH---HHhhcCccee-eeecccceEEeeee-eeeEEeeeccCccc Confidence 3332 1122322221 3444 5566666665544 4677788887 67888999887665 441111111 45 Q ss_pred CCCCCcccccceeeccCCccceeecCCccchhhhhhc---Cccccc-hhhhhHHHHHHHHHHHHHhhhhccCCceeecCe Q lcl|NC_016762. 115 DGQAPYSFDHTEYNSDGDPIPVFTAGYGVNWRHAAGM---STVGID-LVLDSQAAKLRKFNKRIVAYTLDGATNIQVENY 190 (364) Q Consensus 115 dGq~~~~~D~~~Y~~dGtPiPIf~sgy~~~WR~~~~~---~t~g~D-~~~D~q~~~~rkv~~k~~dy~lnG~~~I~v~~~ 190 (364) +|+.+. -|+..-.-|+.- -+|.+.-- -..-+| +++.=-.+.=..+.+.+-++++ +-+. T Consensus 75 d~~~~~-~~k~~itID~ll----------~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~---~~v~---- 136 (364) T protein:vir:10 75 DASPTE-FDKNRLVVDTTV----------IARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVI---QQLV---- 136 (364) T ss_pred CCCCcc-cCcEEEEeccee----------eechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHH---HHHH---- Confidence 554333 233333333321 12222100 012233 1211111112233333333332 0000 Q ss_pred eeeeecCCC-ccccccccccCCCcceeec------ccCHHHHHHHHhhhhHHHHHhccccccceEEEEcHHHHHhhhcc- Q lcl|NC_016762. 191 PAQGLRNHR-NTIKVNLGSGAGGANIDLT------TATPQQIIDFFTKGAFGQAARTNKVDAYDVLWVSPEINANLAQP- 262 (364) Q Consensus 191 t~~GLrnh~-n~~~~~lg~~~gGaNidlt------tA~~~~i~~~f~~~~f~~~~~~N~v~~~~~lyvS~eI~~N~~~p- 262 (364) .-++.|-. .+.... +.+ +|.+|++. ..+++.++++|. .++.++-+.|=-...-..+|+|+.+..|.+- T Consensus 137 -~aa~a~~~~~~~~~~-~~~-~g~~i~~~~~a~~~~~~~~~l~~ai~-~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~ 212 (364) T protein:vir:10 137 -LGGISNTEAIRKNPR-VAG-HGFSIHIVGLASSFLTSPQYMMAAIE-MAMEQQTEQEVDTSELCGLMPWTAFNCLRDAD 212 (364) T ss_pred -hhhhhcccccccCCc-ccC-CcceeeecccCcchhhhHHHHHHHHH-HHHHHHhhcCCCccccEEEeChHHHHHHhcCC Confidence 01222211 111111 111 24455553 224567777776 5788887777777788999999999988764 Q ss_pred eeeec---ccccccccchhHHHHHhhccchhhhccccccCC----------------------CeE---------EEEEc Q lcl|NC_016762. 263 YMITM---GGGANAVVAGTVLDAVMRFIPAREVRQTFALSG----------------------NEF---------LGYQR 308 (364) Q Consensus 263 y~~~~---~~~S~~~~~gTIl~~i~~~~~V~~I~~~~~Ltg----------------------Ne~---------lg~v~ 308 (364) -+++. .+++.++..| +|..+.||. |.++..||- |.+ .++.= T Consensus 213 ~lvn~d~~~~~~~~~~~G----~v~~v~Gv~-Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f 287 (364) T protein:vir:10 213 RIVDKSYTIAASDNTVDG----FVLKSWNTP-IVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLF 287 (364) T ss_pred ccccccccccCCCccccc----eeEEEeceE-EEeccccccccccccccccccccccccccCCcccccccccceeEEEEE Confidence 11211 1233445444 455677776 788877752 111 01111 Q ss_pred CcceeeheecceeecccccccCCCCch-hhhhhhhcchhhhcccccceeeEeeccCC Q lcl|NC_016762. 309 RRDVVSPLVGMATGVIPLPRPLPQVNY-NFQIMSAMGIQVKKDDEGLSGVIYGANLA 364 (364) Q Consensus 309 ~~dvI~plvGmp~gt~p~pR~~p~~nY-~f~v~~A~glqiK~D~~G~sgVv~~~~l~ 364 (364) .++-+--..-|++ ..+-++...+ -|.+ -+...||+++- T Consensus 288 ~~~Al~tv~~~~~----t~e~~~~~~~~~~~i--------------da~~a~G~g~l 326 (364) T protein:vir:10 288 TQDALLVGRTISI----TGDIFYEKKEKTWYI--------------DTFLAEGAIPD 326 (364) T ss_pred ecceEEEEEEecc----eeeeeeccceeeeee--------------eeehcccCccc Confidence 1111111111112 1122211111 1111 11223333332 Done!