Query         lcl|NC_011222.1_cdsid_YP_002221581.1 [gene=B40-8043] [protein=MP1] [protein_id=YP_002221581.1] [location=41213..42529]
Match_columns 438
No_of_seqs    4 out of 7
Neff          2.1
Searched_HMMs 1612
Date          Thu Nov 7 12:46:21 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_43 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_43_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:96123 Length: 274   95.1  0.0029 1.8E-06   34.5  12.9  262     1-338  1-274 (274)
  2 protein:vir:9820 Length: 272 #  94.7  0.0027 1.7E-06   34.6  11.5  259     1-334  1-272 (272)
  3 protein:vir:3033 Length: 272 #  94.7  0.0027 1.7E-06   34.6  11.5  259     1-334  1-272 (272)
  4 protein:vir:97433 Length: 274   94.2  0.0024 1.5E-06   34.9  10.2  263     1-342  1-274 (274)
  5 protein:vir:94494 Length: 274   94.2  0.0024 1.5E-06   34.9  10.2  263     1-342  1-274 (274)
  6 protein:vir:93742 Length: 274   94.1  0.0024 1.5E-06   34.9  10.1  262     1-340  1-274 (274)
  7 protein:vir:96833 Length: 275   93.9   0.001 6.4E-07   37.0   7.7  255     1-351  1-275 (275)
  8 protein:vir:96262 Length: 274   93.8  0.0022 1.4E-06   35.1   9.2  261     1-306  1-274 (274)
  9 protein:vir:95898 Length: 274   93.8  0.0022 1.4E-06   35.1   9.2  261     1-306  1-274 (274)
 10 protein:vir:80930 Length: 278   93.6  0.0057 3.6E-06   32.9  11.2  259     1-301  1-278 (278)
 11 protein:vir:7990 Length: 273 #  92.9  0.0094 5.8E-06   31.7  12.7  253     1-305  1-273 (273)
 12 protein:vir:105334 Length: 276  92.8  0.0071 4.4E-06   32.4  10.4  258     1-333  1-276 (276)
 13 protein:vir:1239 Length: 274 #  90.2   0.015 9.3E-06   30.6   9.4  260     1-318  1-274 (274)
 14 protein:vir:3613 Length: 272 #  82.4   0.051 3.1E-05   27.7   7.9  253     1-311  1-272 (272)
 15 protein:vir:94622 Length: 341   79.9     0.1 6.2E-05   26.1  15.3  307     1-333  1-341 (341)
 16 protein:vir:9927 Length: 295 #  72.0   0.051 3.2E-05   27.7   4.8  262     1-300  1-295 (295)
 17 protein:vir:94142 Length: 304   59.8    0.39 0.00024   22.9  10.4  273     1-310  1-304 (304)
 18 protein:vir:105905 Length: 304  59.8    0.39 0.00024   22.9  10.4  273     1-310  1-304 (304)
 19 protein:vir:102605 Length: 273  58.9     0.4 0.00025   22.7  12.1  249     1-305  1-273 (273)
 20 protein:vir:105822 Length: 273  58.9     0.4 0.00025   22.7  12.1  249     1-305  1-273 (273)
 21 protein:vir:78739 Length: 332   54.8    0.49 0.00031   22.3  10.4  284     1-345  1-332 (332)
 22 protein:vir:9759 Length: 303 #  50.4    0.61 0.00038   21.8  12.0  258    35-311  1-303 (303)
 23 protein:vir:8885 Length: 347 #  39.5       1 0.00063   20.5   9.1  296     1-330  1-347 (347)
 24 protein:vir:104085 Length: 320  25.5       2  0.0013   18.9  11.7  276    22-330  1-320 (320)
 25 protein:vir:8187 Length: 311 #  24.9     2.1  0.0013   18.8  13.7  261     1-312  1-311 (311)

No 1
>protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103
Probab=95.06 E-value=0.0029 Score=34.54 Aligned_cols=262 Identities=14% Similarity=0.153 Sum_probs=125.9

Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecceE
Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNARTC 80 (438)
Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart~ 80 (438)
|.-..|+..++=. -.-|..++ .+-++. ..-...+.... ..+....|..+++|.+++.+....-+. ..
T Consensus 1 ma~~~T~~~d~i~----Pev~s~~v-----~~~~~~-~~~~~~~~~~~--~~l~g~~G~tv~ip~~~~~g~~~~~~~-g~ 67 (274)
T protein:vir:96 1 MAQGTTKVSNLIV----PEVLAPMM-----QAELDK-KLRFAQFADID--STLVGQPGDTLTFPAFTYSGDAQVIAE-GE 67 (274)
T ss_pred CCccccchhhhhh----hHHHHHHH-----HHHHHh-hhhhccccccc--ccccCCCCCEEEEEeeccCCCccccCC-CC
Confidence 5544455444321 11122221 111111 11011111111 123345688999999986555544332 33

Q ss_pred EecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccceeec
Q lcl|NC_011222.
81 VIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYYTK 160 (438) Q Consensus 81 vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~t~ 160 (438) -|++++.+..-.+.+-..++++|.+.=-. ...+..+-+....+..-+.+...+|+++++.|..-...+-.+.+ T Consensus 68 ~i~~~~it~~~~~~~i~~~~~~~~i~D~~---~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~---- 140 (274) T protein:vir:96 68 KIPVDQIGTSKREAKVRKIGKGTELTDEA---VLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADIT---- 140 (274) T ss_pred cCchhhcccceeEEEEEeeeceeeecHHH---HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcccc---- Confidence 56777666655555555556666552111 12222233444445556677788999999998765544433222 Q ss_pred cCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCcccc--ccee-EEeeeeeeeec Q lcl|NC_011222. 161 NGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH------GLYNDV--NKQL-EYANKVFHFTN 231 (438) Q Consensus 161 ~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq------G~~N~t--Nlq~-Qy~N~~~~~sN 231 (438) . =+ .+-+....|-.+++.++. ++=+..++..|+|.... +.+|+. |-++ .|+++.+..+| T Consensus 141 -----~-----~d-~i~dA~~~l~d~~~~~~~-ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~ 208 (274) T protein:vir:96 141 -----K-----LD-GLQTAIDKFNDEDLEPMV-LFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSN 208 (274) T ss_pred -----c-----HH-HHHHHHHHhcccCCCceE-EEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcC Confidence 1 12 222333445556666665 67788999999997532 222221 1122 57889999999 Q ss_pred cccchhhhchhhcccccccchhhhhhh--hhhhccccCCccccceeeccccc-ceeeeeEeeccccccccchhHHHhhhh Q lcl|NC_011222. 232 NMTLESENFAQMYAVESGNVGLLTRVD--RAAYNNTKSGTHEFGKVVLPYFG-KEVGTHYYEEVGDQSAIAGEATADMTC 308 (438) Q Consensus 232 ~vat~A~n~at~Y~mpsG~vGm~twid--Rea~~n~~S~t~~ft~vvdP~fG-~~~g~hyye~vgD~Sgi~Ge~tA~~t~ 308 (438) .+. ..+.|-+-.|++|+...-+ -|.+.+..++. |=+.+ +-||..+++. 
T Consensus 209 ~~p-----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~-------d~i~~~~~yg~~~~~~----------------- 259 (274) T protein:vir:96 209 KLN-----KGEALLAKKGAVKLITKRDFFLEKDRDASRKS-------TALYSDKHYVAYLYDE----------------- 259 (274) T ss_pred CCC-----cceEEEEeCcceeeeecCCcccccccchhhcc-------cEEEEeeEEEEEEEcC----------------- Confidence 985 3467888888888765522 22222222221 10000 1123333221 Q ss_pred hhhhcCceEEEeEEEEeecCCcccCccceE Q lcl|NC_011222. 309 DVKHFYGFSVDIAFVVAFNSDPATIANPIM 338 (438) Q Consensus 309 a~kh~Y~~~aD~sfvVAfnSD~A~~~~~Il 338 (438) .=+|......|. ++| T Consensus 260 ------------~~vv~~t~~~~~---~~~ 274 (274) T protein:vir:96 260 ------------SKVVKITKGAGD---EVM 274 (274) T ss_pred ------------ccEEEEEcCccc---ccC Confidence 112222222111 111 No 2 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=94.67 E-value=0.0027 Score=34.63 Aligned_cols=259 Identities=15% Similarity=0.106 Sum_probs=123.0 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhH--HHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecc Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAY--DFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNAR 78 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~--d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nar 78 (438) |...-|+.-.+ .=|..|+.+ +.+..... ...+.. .-..+....|..++||++++....+.-+- T Consensus 1 MA~~~T~~~~~-----------~iPev~s~~v~~~~~~~~~-~~~~~~--~~~~~~g~~G~tv~iP~~~~~~~a~~v~e- 65 (272) T protein:vir:98 1 MAVGTTKMAQM-----------LDPEVLADMIDAEVGKAIR-FAPLAE--VDTTLEGQPGTTLTVPKWDYIGDAEDVAE- 65 (272) T ss_pred CCCccccchhe-----------echHHHHHHHHHHHHHHhh-hhcccc--ccccccCCCCCEEEEEEecCCCCcccccC- Confidence 44333433221 122333332 11111110 000000 00112334577799999987666654332 Q ss_pred eEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCcccee Q lcl|NC_011222. 
79 TCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYY 158 (438) Q Consensus 79 t~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~ 158 (438) ...|+.++.+.--+++.-..++..|.+.=-...-+-.+..+...+++- +.+...+|++.++.|..-.+.+-+. . T Consensus 66 g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~---~~~a~~~d~~i~~~~~~a~~~~~~~-~-- 139 (272) T protein:vir:98 66 GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIV---EAIDHKVDADVLDALSKSTQTVEAT-A-- 139 (272) T ss_pred CCcccccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHH---HHHHHHHHHHHHHHhcccccccccc-c-- Confidence 345666666666666666666666655322222233344444444444 4445678899988875443322111 1 Q ss_pred eccCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhc--Ccccc------ccee-EEeeeeeee Q lcl|NC_011222. 159 TKNGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQHG--LYNDV------NKQL-EYANKVFHF 229 (438) Q Consensus 159 t~~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~KleaqG--~~N~t------Nlq~-Qy~N~~~~~ 229 (438) . -+.+.+....|..++... -.++-+..++.+|+|.+..- .+++. |-++ .+++..++. T Consensus 140 -----------t--~d~i~da~~~l~~~~~~~-~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~ 205 (272) T protein:vir:98 140 -----------T--VDGVSKALDIFNDEDDAE-TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVR 205 (272) T ss_pred -----------C--HHHHHHHHHHHhccCCCc-cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEE Confidence 0 123344445565565544 46888899999999875321 11221 1112 578899999 Q ss_pred eccccchhhhchhhcccccccchhhhhhh--hhhhccccCCccccceeecccccceeeeeEeeccccccccchhHHHhhh Q lcl|NC_011222. 230 TNNMTLESENFAQMYAVESGNVGLLTRVD--RAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMT 307 (438) Q Consensus 230 sN~vat~A~n~at~Y~mpsG~vGm~twid--Rea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t 307 (438) ||.|. -.+.|..-.|.+|+..+-+ -|.+.+..++. 
.....+ .-||.|+++ .+....++ T Consensus 206 s~~~p-----~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~--~~i~~~----~~~~~~v~~---------~~~vv~~t 265 (272) T protein:vir:98 206 SRKCP-----KGTAYMVRKGALRIMLKRNTMVETDRDITKAI--NQIVAN----KHYGVYLYK---------AEKAVKIT 265 (272) T ss_pred cCCCC-----cceEEEEcCCeEEEEecCCceeeeccccccce--eEEEEE----EEEEEEEEc---------CCceEEEE Confidence 99985 3467888888888876622 22222222221 111111 124555532 11111111 Q ss_pred hhhhhcCceEEEeEEEEeecCCcccCc Q lcl|NC_011222. 308 CDVKHFYGFSVDIAFVVAFNSDPATIA 334 (438) Q Consensus 308 ~a~kh~Y~~~aD~sfvVAfnSD~A~~~ 334 (438) + ++|--+ T Consensus 266 ~--------------------~~a~~~ 272 (272) T protein:vir:98 266 L--------------------KDAAKK 272 (272) T ss_pred e--------------------cccccC Confidence 1 111111 No 3 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=94.67 E-value=0.0027 Score=34.63 Aligned_cols=259 Identities=15% Similarity=0.106 Sum_probs=123.0 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhH--HHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecc Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAY--DFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNAR 78 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~--d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nar 78 (438) |...-|+.-.+ .=|..|+.+ +.+..... ...+.. .-..+....|..++||++++....+.-+- T Consensus 1 MA~~~T~~~~~-----------~iPev~s~~v~~~~~~~~~-~~~~~~--~~~~~~g~~G~tv~iP~~~~~~~a~~v~e- 65 (272) T protein:vir:30 1 MAVGTTKMAQM-----------LDPEVLADMIDAEVGKAIR-FAPLAE--VDTTLEGQPGTTLTVPKWDYIGDAEDVAE- 65 (272) T ss_pred CCCccccchhe-----------echHHHHHHHHHHHHHHhh-hhcccc--ccccccCCCCCEEEEEEecCCCCcccccC- Confidence 44333433221 122333332 11111110 000000 00112334577799999987666654332 Q ss_pred eEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCcccee Q lcl|NC_011222. 
79 TCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYY 158 (438) Q Consensus 79 t~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~ 158 (438) ...|+.++.+.--+++.-..++..|.+.=-...-+-.+..+...+++- +.+...+|++.++.|..-.+.+-+. . T Consensus 66 g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~---~~~a~~~d~~i~~~~~~a~~~~~~~-~-- 139 (272) T protein:vir:30 66 GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIV---EAIDHKVDADVLDALSKSTQTVEAT-A-- 139 (272) T ss_pred CCcccccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHH---HHHHHHHHHHHHHHhcccccccccc-c-- Confidence 345666666666666666666666655322222233344444444444 4445678899988875443322111 1 Q ss_pred eccCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhc--Ccccc------ccee-EEeeeeeee Q lcl|NC_011222. 159 TKNGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQHG--LYNDV------NKQL-EYANKVFHF 229 (438) Q Consensus 159 t~~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~KleaqG--~~N~t------Nlq~-Qy~N~~~~~ 229 (438) . -+.+.+....|..++... -.++-+..++.+|+|.+..- .+++. |-++ .+++..++. T Consensus 140 -----------t--~d~i~da~~~l~~~~~~~-~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~ 205 (272) T protein:vir:30 140 -----------T--VDGVSKALDIFNDEDDAE-TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVR 205 (272) T ss_pred -----------C--HHHHHHHHHHHhccCCCc-cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEE Confidence 0 123344445565565544 46888899999999875321 11221 1112 578899999 Q ss_pred eccccchhhhchhhcccccccchhhhhhh--hhhhccccCCccccceeecccccceeeeeEeeccccccccchhHHHhhh Q lcl|NC_011222. 230 TNNMTLESENFAQMYAVESGNVGLLTRVD--RAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMT 307 (438) Q Consensus 230 sN~vat~A~n~at~Y~mpsG~vGm~twid--Rea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t 307 (438) ||.|. -.+.|..-.|.+|+..+-+ -|.+.+..++. 
.....+ .-||.|+++ .+....++ T Consensus 206 s~~~p-----~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~--~~i~~~----~~~~~~v~~---------~~~vv~~t 265 (272) T protein:vir:30 206 SRKCP-----KGTAYMVRKGALRIMLKRNTMVETDRDITKAI--NQIVAN----KHYGVYLYK---------AEKAVKIT 265 (272) T ss_pred cCCCC-----cceEEEEcCCeEEEEecCCceeeeccccccce--eEEEEE----EEEEEEEEc---------CCceEEEE Confidence 99985 3467888888888876622 22222222221 111111 124555532 11111111 Q ss_pred hhhhhcCceEEEeEEEEeecCCcccCc Q lcl|NC_011222. 308 CDVKHFYGFSVDIAFVVAFNSDPATIA 334 (438) Q Consensus 308 ~a~kh~Y~~~aD~sfvVAfnSD~A~~~ 334 (438) + ++|--+ T Consensus 266 ~--------------------~~a~~~ 272 (272) T protein:vir:30 266 L--------------------KDAAKK 272 (272) T ss_pred e--------------------cccccC Confidence 1 111111 No 4 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=94.20 E-value=0.0024 Score=34.90 Aligned_cols=263 Identities=17% Similarity=0.131 Sum_probs=128.4 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecceE Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNARTC 80 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart~ 80 (438) |.--.|+..++= .-.-|..++.+==--.++-.|-+..+ ..+....|..+++|++++-+..+.- +... T Consensus 1 ma~~~T~~~d~i----iPev~~~~v~~~~~~~l~~~~~~~~d--------~~l~g~~G~tv~iP~~~~~g~a~~~-~~g~ 67 (274) T protein:vir:97 1 MPQGLTKTSDQI----IPEVLAPMMQAQLEKKLRFASFAEVD--------STLQGQPGDTLTFPAFVYSGDAQVV-AEGE 67 (274) T ss_pred CCccceehhhee----chHHHHHHHHHhhhhhhhhcccceec--------ccccCCCCCEEEEeeecCCCccccc-cCCC Confidence 655555444431 11223333221000001111111111 1234456899999999865544432 2334 Q ss_pred EecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccceeec Q lcl|NC_011222. 
81 VIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYYTK 160 (438) Q Consensus 81 vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~t~ 160 (438) -|++++-+..-.+.+-...+++|.+.=-. ...++.+-+....+..-+.+-..+|+++++.|...++.+-.+.+ T Consensus 68 ~i~~~~lt~~~~~~~i~~~~~~~~i~D~~---~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~---- 140 (274) T protein:vir:97 68 KIPTDILETKKREAKIRKIAKGTSITDEA---LLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADIT---- 140 (274) T ss_pred cccccccccceeEEEeeeecceecccHHH---HHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccc---- Confidence 57777766555555545555555552111 11122223334445555778889999999999888876644332 Q ss_pred cCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-EEeeeeeeeec Q lcl|NC_011222. 161 NGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH------GLYND--VNKQL-EYANKVFHFTN 231 (438) Q Consensus 161 ~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq------G~~N~--tNlq~-Qy~N~~~~~sN 231 (438) . -+ .+-+....|-.++..+. .++=+..+++.|+|.... ++++. +|-++ .|+++.++.+| T Consensus 141 -----~-----~d-~i~dA~~~l~d~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~ 208 (274) T protein:vir:97 141 -----K-----LN-GLQSAIDKFNDEDLEPM-VLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN 208 (274) T ss_pred -----C-----HH-HHHHHHHHhhccCCCce-EEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcC Confidence 1 11 22233344555566655 466788899999986422 22222 12223 68999999999 Q ss_pred cccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeeccccccccchhHHHhhhhh Q lcl|NC_011222. 232 NMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMTCD 309 (438) Q Consensus 232 ~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t~a 309 (438) .+. ..+.|-.-+|++|+..+ +.-|.+-+...+.... ..+ .-||.++++. 
+++ ..++++ T Consensus 209 ~~p-----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i--~~~----~~y~~~~~~~----~~v-----v~~t~~ 268 (274) T protein:vir:97 209 KLE-----AGTAILAKKGAVKLILKRDFFLEVARDASTKTTAL--YSD----KHYVAYLYDE----SKA-----VKITKG 268 (274) T ss_pred CCC-----cceEEEEeCcceEeeecCCceeccccchhhcccEE--EEE----EEEEEEEEcC----Cce-----EEEecC Confidence 985 35778888999997655 2222222222221100 000 1133333221 111 111111 Q ss_pred hhhcCceEEEeEEEEeecCCcccCccceEEEEe Q lcl|NC_011222. 310 VKHFYGFSVDIAFVVAFNSDPATIANPIMKVEV 342 (438) Q Consensus 310 ~kh~Y~~~aD~sfvVAfnSD~A~~~~~IlK~~~ 342 (438) . +. +|. T Consensus 269 ~---~~------------------------~~~ 274 (274) T protein:vir:97 269 S---GS------------------------LEM 274 (274) T ss_pred c---cc------------------------ccC Confidence 0 00 000 No 5 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=94.20 E-value=0.0024 Score=34.90 Aligned_cols=263 Identities=17% Similarity=0.131 Sum_probs=128.4 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecceE Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNARTC 80 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart~ 80 (438) |.--.|+..++= .-.-|..++.+==--.++-.|-+..+ ..+....|..+++|++++-+..+.- +... T Consensus 1 ma~~~T~~~d~i----iPev~~~~v~~~~~~~l~~~~~~~~d--------~~l~g~~G~tv~iP~~~~~g~a~~~-~~g~ 67 (274) T protein:vir:94 1 MPQGLTKTSDQI----IPEVLAPMMQAQLEKKLRFASFAEVD--------STLQGQPGDTLTFPAFVYSGDAQVV-AEGE 67 (274) T ss_pred CCccceehhhee----chHHHHHHHHHhhhhhhhhcccceec--------ccccCCCCCEEEEeeecCCCccccc-cCCC Confidence 655555444431 11223333221000001111111111 1234456899999999865544432 2334 Q ss_pred EecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccceeec Q lcl|NC_011222. 
81 VIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYYTK 160 (438) Q Consensus 81 vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~t~ 160 (438) -|++++-+..-.+.+-...+++|.+.=-. ...++.+-+....+..-+.+-..+|+++++.|...++.+-.+.+ T Consensus 68 ~i~~~~lt~~~~~~~i~~~~~~~~i~D~~---~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~---- 140 (274) T protein:vir:94 68 KIPTDILETKKREAKIRKIAKGTSITDEA---LLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADIT---- 140 (274) T ss_pred cccccccccceeEEEeeeecceecccHHH---HHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccc---- Confidence 57777766555555545555555552111 11122223334445555778889999999999888876644332 Q ss_pred cCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-EEeeeeeeeec Q lcl|NC_011222. 161 NGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH------GLYND--VNKQL-EYANKVFHFTN 231 (438) Q Consensus 161 ~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq------G~~N~--tNlq~-Qy~N~~~~~sN 231 (438) . -+ .+-+....|-.++..+. .++=+..+++.|+|.... ++++. +|-++ .|+++.++.+| T Consensus 141 -----~-----~d-~i~dA~~~l~d~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~ 208 (274) T protein:vir:94 141 -----K-----LN-GLQSAIDKFNDEDLEPM-VLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN 208 (274) T ss_pred -----C-----HH-HHHHHHHHhhccCCCce-EEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcC Confidence 1 11 22233344555566655 466788899999986422 22222 12223 68999999999 Q ss_pred cccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeeccccccccchhHHHhhhhh Q lcl|NC_011222. 232 NMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMTCD 309 (438) Q Consensus 232 ~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t~a 309 (438) .+. ..+.|-.-+|++|+..+ +.-|.+-+...+.... ..+ .-||.++++. 
+++ ..++++ T Consensus 209 ~~p-----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i--~~~----~~y~~~~~~~----~~v-----v~~t~~ 268 (274) T protein:vir:94 209 KLE-----AGTAILAKKGAVKLILKRDFFLEVARDASTKTTAL--YSD----KHYVAYLYDE----SKA-----VKITKG 268 (274) T ss_pred CCC-----cceEEEEeCcceEeeecCCceeccccchhhcccEE--EEE----EEEEEEEEcC----Cce-----EEEecC Confidence 985 35778888999997655 2222222222221100 000 1133333221 111 111111 Q ss_pred hhhcCceEEEeEEEEeecCCcccCccceEEEEe Q lcl|NC_011222. 310 VKHFYGFSVDIAFVVAFNSDPATIANPIMKVEV 342 (438) Q Consensus 310 ~kh~Y~~~aD~sfvVAfnSD~A~~~~~IlK~~~ 342 (438) . +. +|. T Consensus 269 ~---~~------------------------~~~ 274 (274) T protein:vir:94 269 S---GS------------------------LEM 274 (274) T ss_pred c---cc------------------------ccC Confidence 0 00 000 No 6 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=94.13 E-value=0.0024 Score=34.92 Aligned_cols=262 Identities=16% Similarity=0.153 Sum_probs=127.4 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecceE Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNARTC 80 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart~ 80 (438) |.--.|+..++=. | .-|..++.+==--.+...|....+ ..+....|.+++||++++-+....- +... T Consensus 1 ma~~~T~~~~~ii--P--ev~~~~v~~~~~~~~~~~~~~~~~--------~~l~g~~G~tv~ip~~~~~g~~~~~-~eg~ 67 (274) T protein:vir:93 1 MPQGITKTSNQII--P--EVLAPMMQAQLEKKLRFASFAEVD--------STLQGQPGDTLTFPAFVYSGDAQVV-AEGE 67 (274) T ss_pred CCccceehhheec--h--HHHHHHHHHHHHhhhhhccccccc--------ccccCCCCCEEEEEeeccCCCcccc-cCCC Confidence 5554454443311 1 122222210000000111111111 1234456889999999865444322 2234 Q ss_pred EecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccceeec Q lcl|NC_011222. 
81 VIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYYTK 160 (438) Q Consensus 81 vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~t~ 160 (438) -|++++.+..-.+.+-..++++|.+.=-... -+..+-+....+..-+.+-..+|++++++|...+..+-.+.+ T Consensus 68 ~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~---~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~---- 140 (274) T protein:vir:93 68 KIPTDILETKKREAKIRKIAKGTSITDEALL---SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADIT---- 140 (274) T ss_pred cccccccccceeEEEeeeecccccccHHHHH---hhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc---- Confidence 5777777666666666667777666221111 112222334445556777889999999999877655533322 Q ss_pred cCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCcccc--ccee-EEeeeeeeeec Q lcl|NC_011222. 161 NGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH------GLYNDV--NKQL-EYANKVFHFTN 231 (438) Q Consensus 161 ~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq------G~~N~t--Nlq~-Qy~N~~~~~sN 231 (438) . -+. +-+....|-.++..++ .++=+..++..|+|.... ++++.. |-++ .|+++.++.+| T Consensus 141 -----~-----~d~-i~dA~~~l~d~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~ 208 (274) T protein:vir:93 141 -----K-----LNG-LQSAIDKFNDEDLEPM-VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN 208 (274) T ss_pred -----C-----HHH-HHHHHHHhhhccCCcc-EEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcC Confidence 1 112 2233344445666666 467788999999986422 222221 2223 68999999999 Q ss_pred cccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEe-eccccccccchhHHHhhhh Q lcl|NC_011222. 232 NMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYY-EEVGDQSAIAGEATADMTC 308 (438) Q Consensus 232 ~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyy-e~vgD~Sgi~Ge~tA~~t~ 308 (438) .+. ..+.|..-.|++|+... +.-|.+.+..++. 
| ..++.++| -.+-+-+ ....+++ T Consensus 209 ~~p-----~~t~~l~~~gai~~~~~~~~~vE~~Rd~~~~~-------d----~i~~~~~y~~~~~~~~-----~~v~~t~ 267 (274) T protein:vir:93 209 KLE-----AGTAILAKKGAVKLILKRDFFLEVARDASTKT-------T----ALYSDKHYVAYLYDES-----KAVKITK 267 (274) T ss_pred CCC-----cceEEEEeCCeEEEEecCCcccccccchhhcc-------c----EEEEEEEEEEEEEcCC-----ceEEEee Confidence 975 35778888999997655 2222222322221 1 12233322 0111111 1111111 Q ss_pred hhhhcCceEEEeEEEEeecCCcccCccceEEE Q lcl|NC_011222. 309 DVKHFYGFSVDIAFVVAFNSDPATIANPIMKV 340 (438) Q Consensus 309 a~kh~Y~~~aD~sfvVAfnSD~A~~~~~IlK~ 340 (438) +.- =|.. T Consensus 268 ~~~-------------------------s~~~ 274 (274) T protein:vir:93 268 GSG-------------------------SLEM 274 (274) T ss_pred Ccc-------------------------ccCC Confidence 100 0000 No 7 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=93.93 E-value=0.001 Score=36.95 Aligned_cols=255 Identities=16% Similarity=0.158 Sum_probs=120.6 Q ss_pred Ccccc-cchhhhcccCCCcchhhhhhhhhhhH--HHhh-----ccCccccccCChHHHHHHHhhcCCceeEEEEeccceE Q lcl|NC_011222. 1 MSLIA-TRTQEFRLKNPNIDKNMARMTEWGAY--DFFL-----SQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNV 72 (438) Q Consensus 1 ~~~~~-~~~q~~r~K~~n~dK~e~R~s~wGA~--d~f~-----~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~ 72 (438) |...+ |+..++= =|.-|+.+ +-+. .|.+..+ ..+....|..++||+++|-+.. T Consensus 1 ~~~~~~T~l~d~i-----------~PEv~~~~v~~~~~~~~~~~~~~~~~--------~~l~g~~G~tv~iP~~~~ig~a 61 (275) T protein:vir:96 1 MALENMTKLANMV-----------NPEVLAPMMQAELDKKLKFAQFADID--------NTLVGQPGNTITFPAFVYSGDA 61 (275) T ss_pred CCCcccchhhhhh-----------chHHHHHHHHHHHHHhhhhcccceec--------ccccCCCCCEEEeeeeccCCcc Confidence 55544 4443321 12222221 1121 1222111 1234456889999999876555 Q ss_pred eeeecceEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Q lcl|NC_011222. 
73 TVSNARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVF 152 (438) Q Consensus 73 Tv~nart~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf 152 (438) +.-+ ....|++++-+..-.+..-..++++|.+.=-. ...++.+-+....+..=+.+-..+|.+++++|..-.-.+- T Consensus 62 ~~~~-~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~---~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~ 137 (275) T protein:vir:96 62 KVVP-EGEEIPIDLIETKKRQATIRKIGKGTVLTDEA---LLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVE 137 (275) T ss_pred cccc-CCCCcchhhcccceeeEEeehhcccccccHHH---HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 5432 23456777665555555545566666651111 1111112222223333344567889999988865432221 Q ss_pred CccceeeccCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHH----hcC--ccc--cccee-EEe Q lcl|NC_011222. 153 GNLLYYTKNGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQ----HGL--YND--VNKQL-EYA 223 (438) Q Consensus 153 ~d~Ly~t~~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Klea----qG~--~N~--tNlq~-Qy~ 223 (438) .+++. - +.+-+...+|-..++.++ .++=+..+++.|+|... +.. ++. +|-+| .|+ T Consensus 138 ---------~~~~~-----~-d~i~dA~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~ 201 (275) T protein:vir:96 138 ---------ADITK-----L-AGLQTAIDKFNDEDLEPM-VLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEAL 201 (275) T ss_pred ---------ccccC-----H-HHHHHHHHHhccccCCcc-EEEeCHHHHHHHHhcccccccccccccccceeccccceec Confidence 11211 1 223344455555555655 47778999999999852 121 211 13233 589 Q ss_pred eeeeeeeccccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeeccccc-ceeeeeEeeccccccccch Q lcl|NC_011222. 224 NKVFHFTNNMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFG-KEVGTHYYEEVGDQSAIAG 300 (438) Q Consensus 224 N~~~~~sN~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG-~~~g~hyye~vgD~Sgi~G 300 (438) ++.+..||.+. ..+.|-.-.|++|+..+ +.-|...+..+++ |=+.+ ..||+++++. 
T Consensus 202 G~~Vi~s~~~p-----~~t~~i~~~gA~~~~~~~~~~vE~~Rd~~~~~-------d~i~~~~~y~~~~~~~--------- 260 (275) T protein:vir:96 202 GAIIVRSNKIK-----EGEAILAKRGAVKLITKRDFFLETERHASHKS-------TALFSDKHYVAYLYDE--------- 260 (275) T ss_pred CeeEEEeCCCC-----cceEEEEeccceeeeecCCcccccccchhhcC-------cEEEEeEEEEEEEEcC--------- Confidence 99999999875 34678888899887655 2223222222221 11111 1123333211 Q ss_pred hHHHhhhhhhhhcCceEEEeEEEEeecCCcccCccceEEEEeeccCcccCC Q lcl|NC_011222. 301 EATADMTCDVKHFYGFSVDIAFVVAFNSDPATIANPIMKVEVNKENSQFGG 351 (438) Q Consensus 301 e~tA~~t~a~kh~Y~~~aD~sfvVAfnSD~A~~~~~IlK~~~~~e~~~~~~ 351 (438) ..+.|+..+. |-.|- T Consensus 261 ----------------------------------~~vv~~t~~~--~~~~~ 275 (275) T protein:vir:96 261 ----------------------------------SKVVKITKSA--SGLGV 275 (275) T ss_pred ----------------------------------ccEEEEEecc--cccCC Confidence 1111111110 00110 No 8 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=93.76 E-value=0.0022 Score=35.12 Aligned_cols=261 Identities=16% Similarity=0.115 Sum_probs=128.4 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecceE Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNARTC 80 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart~ 80 (438) |.---|+..++= .-.-|..++. +-+.. ..-...+.. .-..+....|..++||+++|-+..+.- .... 
T Consensus 1 m~~~~T~l~d~i----~Pev~~~~v~-----~~~~~-~l~~~~~~~--~~~~l~g~~G~tv~iP~~~~ig~a~~~-~~g~ 67 (274) T protein:vir:96 1 MAQGMTKLTNQI----VPEVLAPMMQ-----AELEK-KLRFASFAE--IDNTLVGQPGDTLTFPAFIYSGDAKVV-AEGE 67 (274) T ss_pred CCcceeehhhee----chHHHHHHHH-----HHHHh-hhhccccce--ecccccCCCCCEEEeeeecCCCccccc-cCCC Confidence 655555544432 1122322221 11110 000001100 012244456899999999876555543 3344 Q ss_pred EecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccceeec Q lcl|NC_011222. 81 VIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYYTK 160 (438) Q Consensus 81 vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~t~ 160 (438) -|++++-+..-.+..-..++.+|.+.=- =...++.+-+....+..-+.+-..+|.+++++|...+-.+-.+.+ T Consensus 68 ~i~~~~lt~~~~~~~i~~~~~a~~i~D~---~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~---- 140 (274) T protein:vir:96 68 KIPTDILETKKREAKIRKIAKGTSISDE---ALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADIT---- 140 (274) T ss_pred ccchhhcccceeEEEeeeeecceeehHH---HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc---- Confidence 5677666655555555556666665211 112222233333444444556688999999998765544433322 Q ss_pred cCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-EEeeeeeeeec Q lcl|NC_011222. 161 NGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH------GLYND--VNKQL-EYANKVFHFTN 231 (438) Q Consensus 161 ~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq------G~~N~--tNlq~-Qy~N~~~~~sN 231 (438) . -+.+-+...+|-..+..++ .++=+..++..|+|.... .+++. 
.|-+| .|++..++.|| T Consensus 141 -----~------~d~i~~A~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~ 208 (274) T protein:vir:96 141 -----K------LTGLQTAIDKFNDEDLEPM-VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSN 208 (274) T ss_pred -----C------HHHHHHHHHHhcccccccc-EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeC Confidence 1 1112233344444555665 567788999999996422 11111 12233 58999999999 Q ss_pred cccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeeccc--cccccchhHHHhh Q lcl|NC_011222. 232 NMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVG--DQSAIAGEATADM 306 (438) Q Consensus 232 ~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vg--D~Sgi~Ge~tA~~ 306 (438) .+. ..+.|-.-+|++|.... +.-|...+..+++.. .+.+ ..||+++++.=+ =..-..|..+ | T Consensus 209 ~~~-----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~--i~~~----~~y~~~~~~~~~~v~~tk~~~~~~--~ 274 (274) T protein:vir:96 209 KLE-----AGTAILAKKGAVKLITKRDFFLETDRDPSTKTTA--LYSD----KHYVAYLYDESKAVKITKGSGSLE--M 274 (274) T ss_pred CCC-----CceEEEEeccceeeeecCCcccccccccccccCE--EEEe----EEEEEEEEcCCcEEEEEcCCcccc--C Confidence 874 35667778899997665 222333333332210 0111 335666654311 0011122111 1 No 9 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=93.76 E-value=0.0022 Score=35.12 Aligned_cols=261 Identities=16% Similarity=0.115 Sum_probs=128.4 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecceE Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNARTC 80 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart~ 80 (438) |.---|+..++= .-.-|..++. +-+.. ..-...+.. .-..+....|..++||+++|-+..+.- .... 
T Consensus 1 m~~~~T~l~d~i----~Pev~~~~v~-----~~~~~-~l~~~~~~~--~~~~l~g~~G~tv~iP~~~~ig~a~~~-~~g~ 67 (274) T protein:vir:95 1 MAQGMTKLTNQI----VPEVLAPMMQ-----AELEK-KLRFASFAE--IDNTLVGQPGDTLTFPAFIYSGDAKVV-AEGE 67 (274) T ss_pred CCcceeehhhee----chHHHHHHHH-----HHHHh-hhhccccce--ecccccCCCCCEEEeeeecCCCccccc-cCCC Confidence 655555544432 1122322221 11110 000001100 012244456899999999876555543 3344 Q ss_pred EecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccceeec Q lcl|NC_011222. 81 VIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLYYTK 160 (438) Q Consensus 81 vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly~t~ 160 (438) -|++++-+..-.+..-..++.+|.+.=- =...++.+-+....+..-+.+-..+|.+++++|...+-.+-.+.+ T Consensus 68 ~i~~~~lt~~~~~~~i~~~~~a~~i~D~---~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~---- 140 (274) T protein:vir:95 68 KIPTDILETKKREAKIRKIAKGTSISDE---ALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADIT---- 140 (274) T ss_pred ccchhhcccceeEEEeeeeecceeehHH---HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc---- Confidence 5677666655555555556666665211 112222233333444444556688999999998765544433322 Q ss_pred cCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-EEeeeeeeeec Q lcl|NC_011222. 161 NGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH------GLYND--VNKQL-EYANKVFHFTN 231 (438) Q Consensus 161 ~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq------G~~N~--tNlq~-Qy~N~~~~~sN 231 (438) . -+.+-+...+|-..+..++ .++=+..++..|+|.... .+++. 
.|-+| .|++..++.|| T Consensus 141 -----~------~d~i~~A~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~ 208 (274) T protein:vir:95 141 -----K------LTGLQTAIDKFNDEDLEPM-VLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSN 208 (274) T ss_pred -----C------HHHHHHHHHHhcccccccc-EEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeC Confidence 1 1112233344444555665 567788999999996422 11111 12233 58999999999 Q ss_pred cccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeeccc--cccccchhHHHhh Q lcl|NC_011222. 232 NMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVG--DQSAIAGEATADM 306 (438) Q Consensus 232 ~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vg--D~Sgi~Ge~tA~~ 306 (438) .+. ..+.|-.-+|++|.... +.-|...+..+++.. .+.+ ..||+++++.=+ =..-..|..+ | T Consensus 209 ~~~-----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~--i~~~----~~y~~~~~~~~~~v~~tk~~~~~~--~ 274 (274) T protein:vir:95 209 KLE-----AGTAILAKKGAVKLITKRDFFLETDRDPSTKTTA--LYSD----KHYVAYLYDESKAVKITKGSGSLE--M 274 (274) T ss_pred CCC-----CceEEEEeccceeeeecCCcccccccccccccCE--EEEe----EEEEEEEEcCCcEEEEEcCCcccc--C Confidence 874 35667778899997665 222333333332210 0111 335666654311 0011122111 1 No 10 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=93.61 E-value=0.0057 Score=32.87 Aligned_cols=259 Identities=12% Similarity=0.072 Sum_probs=121.8 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhh-----ccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeee Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFL-----SQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVS 75 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~-----~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~ 75 (438) |.--.|+..++ +.-..|..++.+ -|. +|..... 
..+....|..+++|.+++-+..+.- T Consensus 1 Ma~~~T~~~~~----iiPev~s~~v~~-----~~~~~~v~~~~~~~~--------~~l~g~~G~tv~ip~~~~~g~a~~~ 63 (278) T protein:vir:80 1 MADLTTKLANL----IDPEVMGPMISA-----KLPKAIKFGKIAPID--------NSLEGQPGSEITVPKYKYIGDAQDV 63 (278) T ss_pred CCCcceehhhe----ecHHHHHHHHHH-----HHHHhhhhcccceec--------ccccCCCCCEEEEeeeccCCcceee Confidence 54333444332 122233333211 111 1222111 1233456889999999865544421 Q ss_pred ecceEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCcc Q lcl|NC_011222. 76 NARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNL 155 (438) Q Consensus 76 nart~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~ 155 (438) ....-|++++-+..-.+..-..++.+|.+.=-...=...+. +....+..=+.+...+|+++++.|...+..+-+.. T Consensus 64 -~~g~~i~~~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~---~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~ 139 (278) T protein:vir:80 64 -AEGAAIDYSALETESVKHGIKKAGKGVKLTDESVLSGYGDP---VEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAI 139 (278) T ss_pred -cCCCcCcccccccceeeEeeehhhccccccHHHHhhccccH---HHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 22335666666555555444444444443111111112222 33333444455667889999999987766554332 Q ss_pred ceeeccCceeEccccchhHHHhcchHHHh-ccCcccceEEeecchhHHHHHHHHHhcC------ccc--cccee-EEeee Q lcl|NC_011222. 156 LYYTKNGNDVQVKFTQRNDILSDLHPMFR-ANDYSGQLHIIGDTSVDSMLRKLEQHGL------YND--VNKQL-EYANK 225 (438) Q Consensus 156 Ly~t~~gna~q~p~aqR~e~~~dl~am~R-~NDy~Gpl~~iadT~v~s~l~KleaqG~------~N~--tNlq~-Qy~N~ 225 (438) -..+... -.+.+.+.--.|- .+.....+ ++=+..++..|+|...... +++ +|-++ .|++. 
T Consensus 140 t~~~~~~---------~~~~~~da~~~l~~~~~~~~~~-ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~ 209 (278) T protein:vir:80 140 NIGLIDK---------IENTFTDAPDAIEDESITTTGV-LFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGW 209 (278) T ss_pred ccchhhh---------HHHHHHHHHHhhcccCCCcccE-EEECHHHHHHHHhhhhhhccccccccccceeeccceeecce Confidence 1111111 1223333333333 33333333 4457889999998864322 121 12233 68999 Q ss_pred eeeeeccccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeec--cccccccchh Q lcl|NC_011222. 226 VFHFTNNMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEE--VGDQSAIAGE 301 (438) Q Consensus 226 ~~~~sN~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~--vgD~Sgi~Ge 301 (438) .++.||.+.. ++.|-+-.|++|.... +.-|.+.+..+++ ...+.+ .-||.++++. +--....+|. T Consensus 210 ~Vi~s~~~p~-----~t~~l~~~gAi~~~~~~~~~vE~~Rd~~~~~--d~i~~~----~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 210 EIVRTKKLAD-----GNALAVKAGALKTFLKRNLLAESGRDMDHKL--TKFNAD----QHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred eEEEcCCCCc-----ceEEEEeccceeeeecCCcccccccchhhcc--ceeeee----eEEEEEEEcCcceEEEeeccCC Confidence 9999999863 5778888999986554 2222222222221 111111 2245555432 1112222222 No 11 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=92.95 E-value=0.0094 Score=31.71 Aligned_cols=253 Identities=13% Similarity=0.137 Sum_probs=117.9 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhh--HHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeee-- Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGA--YDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSN-- 76 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA--~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~n-- 76 (438) |++ ++| -|.-|+. .+.|..+.--..++-.+ - ......|.+++||++-. 
+++.+ T Consensus 1 MA~-----~~~------------~pei~~~~v~~~~~~~lv~~~l~~~~--~-~~~~~~GdTv~ip~~~~---~~~~d~~ 57 (273) T protein:vir:79 1 MAF-----NNF------------IPELWSDMLLEEWTAQTVFANLVNRE--Y-EGIASKGNVVHIAGVVA---PTVKDYK 57 (273) T ss_pred Ccc-----hhh------------hHHHHHHHHHHHHHhhccchhhhhcc--c-cccccCCcEEEEeecCc---ccccccc Confidence 443 111 1334432 34555443211111000 0 01234588999999841 22221 Q ss_pred cceEEecCcccceeeeeeeeee-eeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCcc Q lcl|NC_011222. 77 ARTCVIADAENTSRLIGVTWKT-YAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNL 155 (438) Q Consensus 77 art~vi~~~entsr~iT~v~~T-~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~ 155 (438) .+.-.|..++-+..-++++--. -.|+|.+.-. .-=+-.+ |++.-++.+-+.+...+|.+.++.+.+..|.+-.- T Consensus 58 ~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~d~--d~~~~~~--~~~~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~- 132 (273) T protein:vir:79 58 AAGRQTSADAISDTGVDLLIDQEKSIDFLVDDI--DRVQVAG--SLEAYTRAGATALATDTDKFIADMLVDNGTALTGS- 132 (273) T ss_pred cCCCccCccccccceEEEEEeeecccceeeccH--HHHhhcc--cHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Confidence 1222344555444555555544 3566655211 1111122 34455566778888999999998887766544211 Q ss_pred ceeeccCceeEccccch--hHHHhcchHHHhccCc--ccceEEeecchhHHHHHHHHH----hcCccccc----ce-eEE Q lcl|NC_011222. 156 LYYTKNGNDVQVKFTQR--NDILSDLHPMFRANDY--SGQLHIIGDTSVDSMLRKLEQ----HGLYNDVN----KQ-LEY 222 (438) Q Consensus 156 Ly~t~~gna~q~p~aqR--~e~~~dl~am~R~NDy--~Gpl~~iadT~v~s~l~Klea----qG~~N~tN----lq-~Qy 222 (438) .+.... -+.+-++...|..+|+ .||+-+|. 
...+++|.+..+ ....++.+ =+ -.+ T Consensus 133 -----------~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~-p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~ 200 (273) T protein:vir:79 133 -----------APSDADDAFDLIASALKELTKANVPNVGRVVVVN-AEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL 200 (273) T ss_pred -----------cccchhhHHHHHHHHHHHhhhccCCccCcEEEEC-HHHHHHHhhchhhhhhhhhcccccceeeeEeeEE Confidence 111111 1345556667778887 67776655 888888877543 22222221 11 147 Q ss_pred eeeeeeeeccccchhhhchhhcccccccchhhhhhh-hhhhccccCCccccceeecccccceeeeeEeeccccccccch- Q lcl|NC_011222. 223 ANKVFHFTNNMTLESENFAQMYAVESGNVGLLTRVD-RAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAG- 300 (438) Q Consensus 223 ~N~~~~~sN~vat~A~n~at~Y~mpsG~vGm~twid-Rea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~G- 300 (438) .+..|++||++..... .+.++.-.+++|+...++ -|.+.+ . +.|...+.=+ +-||....+ -.|+.= T Consensus 201 ~G~~i~~s~~lp~~~~--~~~~a~~~~A~~~a~~~~~~e~~r~---~-~~~~~~v~~~--~~yg~~v~~----p~~vv~~ 268 (273) T protein:vir:79 201 LGARIVESNNLRDTDD--EQFVAFHPSAAAYVSQIDTVEALRD---Q-DSFSDRIRAL--HVYGGKVVR----PTGVVVF 268 (273) T ss_pred eceEEEecccccccCc--eEEEEEeccceeeeeehhhhhcccC---c-ccceeeeeee--eeeeeEEec----CceEEEE Confidence 7999999999975443 223344445556666553 222211 1 2233332211 112222211 111100 Q ss_pred hHHHh Q lcl|NC_011222. 301 EATAD 305 (438) Q Consensus 301 e~tA~ 305 (438) .+++. T Consensus 269 ~~~g~ 273 (273) T protein:vir:79 269 NKTGS 273 (273) T ss_pred eccCC Confidence 00000 No 12 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=92.82 E-value=0.0071 Score=32.37 Aligned_cols=258 Identities=14% Similarity=0.110 Sum_probs=122.2 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhH--HHh-----hccCccccccCChHHHHHHHhhcCCceeEEEEeccceEe Q lcl|NC_011222. 
1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAY--DFF-----LSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVT 73 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~--d~f-----~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~T 73 (438) |.-.-|+...+= =|.-|+.+ +-+ ..|.++.+ ..|....|..+++|..+|-+..+ T Consensus 1 Ma~~~T~l~d~i-----------~Pev~~~~v~~~~~~~~~~~~~~~~~--------~~l~g~~G~ti~iP~~~~igda~ 61 (276) T protein:vir:10 1 MAQGTTTKSTQI-----------VPEVLAPMMQAELDKKLRFAQFADID--------STLVGQPGDTLTFPAFVYSGDAT 61 (276) T ss_pred CCcceeehhhhh-----------chHHHHHHHHHHHHhhhhhcccceec--------ccccCCCCCEEEeeeecCCCccc Confidence 543344443331 11222221 111 11222111 12344579999999998766555 Q ss_pred eeecceEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_011222. 74 VSNARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFG 153 (438) Q Consensus 74 v~nart~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~ 153 (438) .-. ....|++++.+..-.+.+-..++++|.+. .+=...++.+-+..-.+..=+.+-..+|.++++.|...+-.+- T Consensus 62 ~~~-eg~~i~~~~lt~~~~~a~i~~~~k~~~~t---D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~- 136 (276) T protein:vir:10 62 VVP-EGQKIPVDKIETNRREAKIHKIGKGTDIT---DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVS- 136 (276) T ss_pred ccc-CCCccCccccccceeeEEeehcccccccc---HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc- Confidence 322 22356676665554444445556666652 1112222222222233333334557889999988876543332 Q ss_pred ccceeeccCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhcCc------cc--cccee-EEee Q lcl|NC_011222. 154 NLLYYTKNGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQHGLY------ND--VNKQL-EYAN 224 (438) Q Consensus 154 d~Ly~t~~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~KleaqG~~------N~--tNlq~-Qy~N 224 (438) .++.. -+.+-+.-.+|..+++.+ ..++=+..++.+|+|+.....- +. 
.|-+| .|++ T Consensus 137 --------~~~~t------~d~i~~A~~~lgd~~~~~-~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G 201 (276) T protein:vir:10 137 --------ADIGT------LAGLEAAIDTFDDEDLEP-MVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALG 201 (276) T ss_pred --------ccccC------HHHHHHHHHHhccccCcc-cEEEEcHHHHHHHHHhccccccccccccccceeccccceecc Confidence 22222 123445555666666654 4677889999999998533321 11 22223 5788 Q ss_pred eeeeeeccccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeeccccccccchhH Q lcl|NC_011222. 225 KVFHFTNNMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEA 302 (438) Q Consensus 225 ~~~~~sN~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~ 302 (438) +.+..++++. ..+.|-.-.|++|+..- +.-|...+...++.. ...+ ..||+.+++.- . T Consensus 202 ~~Vi~s~~~p-----~~t~~l~~~gAi~~~~~~~~~vE~dRd~~~~~d~--i~~~----~~y~~~~~~~~---------~ 261 (276) T protein:vir:10 202 AVIVRSKKLD-----EGEAILAKRGAVKLITKRDFFLETDRDPSTKTTA--LYSD----KHYVAYLYDES---------K 261 (276) T ss_pred eeEEEcCCCC-----cceEEEEeccceeeeecCCceeecccchhhcccE--EEEe----eEEEEEEEcCc---------c Confidence 9999999874 35678888888886443 111222222222100 0000 11333333221 1 Q ss_pred HHhhhhhhhhcCceEEEeEEEEeecCCcccC Q lcl|NC_011222. 303 TADMTCDVKHFYGFSVDIAFVVAFNSDPATI 333 (438) Q Consensus 303 tA~~t~a~kh~Y~~~aD~sfvVAfnSD~A~~ 333 (438) ...++++. =|+|+.- T Consensus 262 vv~~t~~~----------------~~~~~~~ 276 (276) T protein:vir:10 262 AVKVTKGA----------------GTTDSGA 276 (276) T ss_pred eEEEecCC----------------cCCcCCC Confidence 11111111 0111111 No 13 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=90.24 E-value=0.015 Score=30.59 Aligned_cols=260 Identities=15% Similarity=0.110 Sum_probs=119.1 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHH-HhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecce Q lcl|NC_011222. 
1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYD-FFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNART 79 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d-~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart 79 (438) |.---|+..++= .-.-|....+. .... ++.++-...+ ..+....|..++||.++|-+..+.-. .. T Consensus 1 ma~~~T~l~d~i----iPev~~~~v~~-~~~~~l~~~~~~~~d--------~~l~g~~G~tv~iP~~~~ig~a~~~~-~g 66 (274) T protein:vir:12 1 MAQGLTKTSNQI----IPEVLAPMMQA-QLEKKLRFASFAEVD--------STLQGQPGDTLTFPAFVYSGDAQVVA-EG 66 (274) T ss_pred CCcceeehhhhh----chHHHHHHHHH-HHHhhhhhcccceec--------ccccCCCCCEEEEeeecCCCcccccc-CC Confidence 655555544432 11112221110 0000 0001111111 12344579999999998655444332 23 Q ss_pred EEecCcccceeee--eeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccce Q lcl|NC_011222. 80 CVIADAENTSRLI--GVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLLY 157 (438) Q Consensus 80 ~vi~~~entsr~i--T~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~Ly 157 (438) ..|++++-+..-. ++-.+-.+|+++=+=.+ .++.+-+....+..-+.+-..+|.++++.|.+.+..+-.+.+ T Consensus 67 ~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~-----~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~- 140 (274) T protein:vir:12 67 EKIPTDILETKKREAKIRKIAKGTSITDEALL-----SGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADIT- 140 (274) T ss_pred CccchhhcccceeeEEeeeecceeeecHHHHH-----hcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc- Confidence 3455555444333 33344344444332111 112222223333334556788999999998776555443322 Q ss_pred eeccCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh------cCccc--cccee-EEeeeeee Q lcl|NC_011222. 158 YTKNGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH------GLYND--VNKQL-EYANKVFH 228 (438) Q Consensus 158 ~t~~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq------G~~N~--tNlq~-Qy~N~~~~ 228 (438) . -+.+-+...+|-.++..++ .++=+..++..|+|.... .+++. 
.|-+| .|++..++ T Consensus 141 --------~------~d~i~dA~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:12 141 --------K------LNGLQSAIDKFNDEDLEPM-VLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIV 205 (274) T ss_pred --------C------HHHHHHHHHHhcccccccc-EEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEE Confidence 1 1223333444555556555 467788899999986421 11111 12222 48999999 Q ss_pred eeccccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeeccccccccchhHHHhh Q lcl|NC_011222. 229 FTNNMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADM 306 (438) Q Consensus 229 ~sN~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~ 306 (438) .+|.+. ..+.|-.-+|++|+..- +.-|...+...+.. ..+.+ ..||+++++. +++ ..+ T Consensus 206 ~s~~~p-----~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d--~i~~~----~~y~~~~~~~----~~v-----v~~ 265 (274) T protein:vir:12 206 RSNKLE-----AGTAILAKKGAVKLILKRDFFLEVARDASTKTT--ALYSD----KHYVAYLYDE----SKA-----VKI 265 (274) T ss_pred EeCCCC-----cceEEEEeccceeeeecCCceeccccchhhccc--EEEee----eEEEEEEEcC----Cce-----EEE Confidence 999875 35678888899887544 22222223332211 00111 2245555432 111 111 Q ss_pred hhhhhhcCceEE Q lcl|NC_011222. 307 TCDVKHFYGFSV 318 (438) Q Consensus 307 t~a~kh~Y~~~a 318 (438) +++.- ..-. T Consensus 266 t~~~~---~~~~ 274 (274) T protein:vir:12 266 TKGSG---SLEM 274 (274) T ss_pred EcCCc---cccC Confidence 11110 0000 No 14 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=82.42 E-value=0.051 Score=27.69 Aligned_cols=253 Identities=13% Similarity=0.074 Sum_probs=117.1 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhH--HHhh-----ccCccccccCChHHHHHHHhhcCCceeEEEEeccceEe Q lcl|NC_011222. 
1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAY--DFFL-----SQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVT 73 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~--d~f~-----~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~T 73 (438) |.-..|+..++=. |.-|+.+ +-++ +|.+..+ ..+....|..+++|.+++-+..+ T Consensus 1 ma~~~T~~~d~ii-----------Pev~~~~v~~~~~~~~~~~~~~~~~--------~~l~g~~G~ti~iP~~~~~gda~ 61 (272) T protein:vir:36 1 MSKQKTTLADLVN-----------PEVLAPIVSYELNKALRFAPLAQVD--------TTLQGQPGNTLKFPAFTYIGDAA 61 (272) T ss_pred CCCcceehhhhhc-----------hHHHHHHHHHHHHhhhhhccccccc--------cccccCCCCEEEEeeeccCcccc Confidence 6655566554421 1222211 1111 1222111 12344568899999998654444 Q ss_pred eeecceEEecCccccee--eeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_011222. 74 VSNARTCVIADAENTSR--LIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKV 151 (438) Q Consensus 74 v~nart~vi~~~entsr--~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqV 151 (438) .- .....|++++-+.- -.++-++-.+|+++=+=... ...+-.+...+++- +.+...+|++++++|...+..+ T Consensus 62 ~~-~eg~~i~~~~lt~~~~~~~i~~~~k~~~vtD~~~~~--~~~d~~~~~~~~~a---~~~a~~~d~~i~~~l~~~~~~~ 135 (272) T protein:vir:36 62 DV-AEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS--GYGDPIGESNKQLG---LSLANKVDDDLLSAAKTTSQTV 135 (272) T ss_pred cc-CCCCccChhhcCCcceeEeeehhhccccccHHHHhh--ccchHHHHHHHHHH---HHHHHHHHHHHHHHhccccccc Confidence 21 11234555544333 33444555555554322111 11222333333443 3445788899998887655443 Q ss_pred cCccceeeccCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHh---cCcccc----ccee-EEe Q lcl|NC_011222. 152 FGNLLYYTKNGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQH---GLYNDV----NKQL-EYA 223 (438) Q Consensus 152 f~d~Ly~t~~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Kleaq---G~~N~t----Nlq~-Qy~ 223 (438) -+.. .-+ -+-+....|...+.... .++=+..++..|+|...- +.+++. 
|-++ .|+ T Consensus 136 ~~~~---------------~~d-~i~~A~~~lgd~~~~~~-~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~ 198 (272) T protein:vir:36 136 STKA---------------NVD-GVQAALDIFNDEDAQAY-VLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVL 198 (272) T ss_pred cccc---------------cHH-HHHHHHHHhhhcCCCce-EEEEcHHHHHHHhcccccccccccccccceeeeccceec Confidence 2211 111 23344445555666654 467788899999986531 222222 2223 589 Q ss_pred eeeeeeeccccchhhhchhhcccccccchhhhh--hhhhhhccccCCccccceeecccccceeeeeEeeccccccccchh Q lcl|NC_011222. 224 NKVFHFTNNMTLESENFAQMYAVESGNVGLLTR--VDRAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGE 301 (438) Q Consensus 224 N~~~~~sN~vat~A~n~at~Y~mpsG~vGm~tw--idRea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge 301 (438) ++.++.||.+..... +.+.|..-.|.+|+..- +.=|.+-+..+++.. .+.+ .-||+++++ -+ T Consensus 199 G~~Vv~s~~~p~~~~-~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~--i~~~----~~y~~~v~~----~~----- 262 (272) T protein:vir:36 199 GAQIVRSKKLAEGSA-LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTV--ITAD----EHYAAYLYD----LT----- 262 (272) T ss_pred CeeEEEeCCCCCCce-eEEEEEecccceeeeecCCcccccccchhhcCcE--EEEE----EEEEEEEEc----Cc----- Confidence 999999999864432 34445555777775432 111222222222110 0001 123444422 11 Q ss_pred HHHhhhhhhh Q lcl|NC_011222. 302 ATADMTCDVK 311 (438) Q Consensus 302 ~tA~~t~a~k 311 (438) ..+.++++-- T Consensus 263 ~vv~~t~~g~ 272 (272) T protein:vir:36 263 KVVNITFTGV 272 (272) T ss_pred cEEEEeecCC Confidence 1122222211 No 15 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=79.92 E-value=0.1 Score=26.07 Aligned_cols=307 Identities=12% Similarity=0.039 Sum_probs=138.3 Q ss_pred Ccccccchh-hhcccCCCcchhhhhhhhhhh--HHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeec Q lcl|NC_011222. 
1 MSLIATRTQ-EFRLKNPNIDKNMARMTEWGA--YDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNA 77 (438) Q Consensus 1 ~~~~~~~~q-~~r~K~~n~dK~e~R~s~wGA--~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~na 77 (438) ||+--|-|- .+- ..+..-|- |..|+. .+.|........+. .+- -.+...|.+++||++. +.++.-. + T Consensus 1 ~~~~~~~~~~~~~--t~~v~~fi--pei~s~~i~~~l~~~~v~~~~~-~d~---~~~~~~Gdtv~ip~~g-~~~~~d~-~ 70 (341) T protein:vir:94 1 MALGNTITGPSIN--TQRGQQFI--PEQWLSEVQMFRKAKMLDTSVV-KTW---GAQVKKGDTFHVPRIS-ELGVEDK-A 70 (341) T ss_pred Ccchhhhcccccc--chhHHHHH--HHHHHHHHHHHHHhhcchhhcc-ccc---cccccCCceEEEeccC-cceeeee-c Confidence 777655442 000 01111121 344553 35555444322221 110 0122338899999985 4443322 3 Q ss_pred ceEEecCcccceeeeeeeeeee-eeeeeeeee-eecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCcc Q lcl|NC_011222. 78 RTCVIADAENTSRLIGVTWKTY-AFGFTMIPN-MYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNL 155 (438) Q Consensus 78 rt~vi~~~entsr~iT~v~~T~-~w~vti~P~-i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~ 155 (438) +.-.|+.++.+..=++++--++ .|++.+..- ..+. ..+ -+..-.+.+-+.+...+|.+.++.+.+-+.+..+.. T Consensus 71 ~~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~-~~d---~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~ 146 (341) T protein:vir:94 71 TDVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQA-SYD---LRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNV 146 (341) T ss_pred CCCccccccccCceEEEEEeeeeecceeechHHHHhh-ccc---hHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcc Confidence 4445555544443344443332 344444210 0011 011 134445666778888899888887766554444433 Q ss_pred c---eeeccCceeEccccchhHHHhcchHHHhccCc--ccceEEeecchhHHHHHHHHH---hcCccc---ccce-eEEe Q lcl|NC_011222. 156 L---YYTKNGNDVQVKFTQRNDILSDLHPMFRANDY--SGQLHIIGDTSVDSMLRKLEQ---HGLYND---VNKQ-LEYA 223 (438) Q Consensus 156 L---y~t~~gna~q~p~aqR~e~~~dl~am~R~NDy--~Gpl~~iadT~v~s~l~Klea---qG~~N~---tNlq-~Qy~ 223 (438) + -...+++. 
-.--.+.+-++...|.++++ .||+.+| +...+++|.+..+ ....++ +|=+ -.++ T Consensus 147 ~~~~~~~~t~~~----~~~~~~~i~~a~~~Lde~~VP~~gR~lvv-~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~ 221 (341) T protein:vir:94 147 FSSSNGAITGNG----QAFSFAVFLAARRLLLEADVPEEKIVLLI-SPGQESALFTIPQFISKDFINNAPIAQGQIGSLM 221 (341) T ss_pred ccCccccccCch----hhhhHHHHHHHHHHHhhcCCCccCCEEEe-CHHHHHHHhhchhhhhhhccccchhheeeeeeEe Confidence 3 00111111 11123556677888888887 6887766 7899999876421 111111 1111 2688 Q ss_pred eeeeeeeccccchhhhchhhcccccc----------cchhhhhhhhhhhc------cccCCccccceeecccccceeeee Q lcl|NC_011222. 224 NKVFHFTNNMTLESENFAQMYAVESG----------NVGLLTRVDRAAYN------NTKSGTHEFGKVVLPYFGKEVGTH 287 (438) Q Consensus 224 N~~~~~sN~vat~A~n~at~Y~mpsG----------~vGm~twidRea~~------n~~S~t~~ft~vvdP~fG~~~g~h 287 (438) +..|+.||++...+. +.|..-.| ..|.-.+-...+-. .-|.+-=.--++.||...-.+..- T Consensus 222 G~~V~~Sn~lp~~~~---~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~ 298 (341) T protein:vir:94 222 GVRVIRTSLIGNNSA---TGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSK 298 (341) T ss_pred ceEEEEecccccccc---ccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccc Confidence 999999999976542 22211111 11111110000000 000000011233444444444443 Q ss_pred EeeccccccccchhHHHhhhhhhhhcCceEE-EeEEEEeecCCcccC Q lcl|NC_011222. 
288 YYEEVGDQSAIAGEATADMTCDVKHFYGFSV-DIAFVVAFNSDPATI 333 (438) Q Consensus 288 yye~vgD~Sgi~Ge~tA~~t~a~kh~Y~~~a-D~sfvVAfnSD~A~~ 333 (438) ..+..++.+ .++-++++.++- -||.-+ +-...|-+..+.+++ T Consensus 299 ~~~~~~~~~---~~~~~~~i~~~~-~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 299 APRVTQSFE---NREQVWLMVGRQ-AYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred cccccccch---hhhhhhhhhhhh-hhcccccCcceeEEEecCcCCC Confidence 333333332 345666666653 233322 222345555555555 No 16 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=72.03 E-value=0.051 Score=27.68 Aligned_cols=262 Identities=12% Similarity=0.060 Sum_probs=129.4 Q ss_pred Ccccc-cchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEeeeecce Q lcl|NC_011222. 1 MSLIA-TRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVTVSNART 79 (438) Q Consensus 1 ~~~~~-~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~Tv~nart 79 (438) |.--. |..-.| .++--=-|-.|.+ .|..++++ +|. .+|+.--+.|++|++|=-.|-+..+-. +-- T Consensus 1 mAe~nlt~~~dL--~~~~sidfv~~f~-~~i~~L~~--------~Lg--i~r~~p~a~G~tIt~pK~~~tgda~dV-aEG 66 (295) T protein:vir:99 1 MAEKNLNTMADL--GDIKSIDFVNKFS-KNINDLLK--------LLG--VTRRETLTNDLKIQTYKWEVTLDQTDP-GEG 66 (295) T ss_pred CCCcccccHhhc--cCceeehhhHHhh-hhHHHHHH--------Hhc--cccccccccCCeEEeeeeeeecccccc-cCC Confidence 21100 000000 0000000111111 11111111 110 124444566999999986655444332 234 Q ss_pred EEecCccccee---eeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccc Q lcl|NC_011222. 80 CVIADAENTSR---LIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLL 156 (438) Q Consensus 80 ~vi~~~entsr---~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~L 156 (438) .+||-+..+.. 
-.++.++-+.=++|- -+| ...+|.+-...-=++-++.+...+|.+.+++|.+.++++ T Consensus 67 e~Iplskvt~~~~~t~t~kikK~rK~tTd-EAI---qlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~----- 137 (295) T protein:vir:99 67 ETIPLSKVTRTKDKDYTVKWFKKRRATTA-EAI---ARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKV----- 137 (295) T ss_pred cccchhhheeeeeeeeEEEeeeecccccH-HHH---HhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceee----- Confidence 56777776654 366666666655543 111 112333333333344455566789999999998888776 Q ss_pred eeeccCceeEccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhcCccc--cccee--EEeee-eeeeec Q lcl|NC_011222. 157 YYTKNGNDVQVKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQHGLYND--VNKQL--EYANK-VFHFTN 231 (438) Q Consensus 157 y~t~~gna~q~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~KleaqG~~N~--tNlq~--Qy~N~-~~~~sN 231 (438) +|+.+|.-++.=...+..|.. +++-|.+..-|..=.+.+|+-++.-.-.. -...| .|++. ++-+|+ T Consensus 138 ----tg~~lq~a~a~~~~al~~f~E-----e~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~nfLG~q~II~S~ 208 (295) T protein:vir:99 138 ----KGVGLQKALSASWAKLATFNE-----FEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKNFLGMQNVIVMP 208 (295) T ss_pred ----ehhhHHHHHHHhhhhhhhccc-----ccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhhhhccceEEEcc Confidence 344455544433333333322 34457788888888888888887753322 11111 36776 588888 Q ss_pred cc------cchhhhchhhccccc---------------ccchhhhhhhhhhhc---cccCCccccceeecccccceeeee Q lcl|NC_011222. 
232 NM------TLESENFAQMYAVES---------------GNVGLLTRVDRAAYN---NTKSGTHEFGKVVLPYFGKEVGTH 287 (438) Q Consensus 232 ~v------at~A~n~at~Y~mps---------------G~vGm~twidRea~~---n~~S~t~~ft~vvdP~fG~~~g~h 287 (438) .| +|+++|--..|.=++ |+|||+-=++....- ..-|+-..|..+.| |+..++- T Consensus 209 kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~d---giv~~tI 285 (295) T protein:vir:99 209 SVPEGKIYSTAVENLVFASLNVKGGDLGGLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPE---GVVEATI 285 (295) T ss_pred cCCCceEEEeeccceEEEEecCCchhhhhhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccc---eEEEEEE Confidence 76 466677766666555 455554332211110 22233334444444 3333333 Q ss_pred Eeeccccccccch Q lcl|NC_011222. 288 YYEEVGDQSAIAG 300 (438) Q Consensus 288 yye~vgD~Sgi~G 300 (438) +.+-.+++-| T Consensus 286 ---~~~~~~~~~~ 295 (295) T protein:vir:99 286 ---EAAAVPGIGG 295 (295) T ss_pred ---ecCcCCCCCC Confidence 4456677777 No 17 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=59.76 E-value=0.39 Score=22.85 Aligned_cols=273 Identities=10% Similarity=0.034 Sum_probs=105.4 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHH----HHHHHhh-----------c-CCceeEE Q lcl|NC_011222. 1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDET----KRRAFAS-----------M-GSDIKIP 64 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et----~dkl~AS-----------~-~splkip 64 (438) |.. |...+++.+- +.+-...+|++. ++.++.. 
+ +..+++| T Consensus 1 ma~-----~~~~~~~~~~-------------------t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip 56 (304) T protein:vir:94 1 MAT-----PTYTPGNVIL-------------------SDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT 56 (304) T ss_pred Ccc-----cccccccccc-------------------cCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE Confidence 211 1111111110 111112233333 2222211 1 2347788 Q ss_pred EEeccceEeeeecceEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011222. 65 VIDYDKNVTVSNARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAAL 144 (438) Q Consensus 65 VlnYr~s~Tv~nart~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaL 144 (438) +++=...+....- ...++..+..-.-+++..++++=.+.+.=.+...+.++.++-+.+.+. +++...+|...+.-= T Consensus 57 ~~~~~~~a~~v~E-~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~---~~ia~~~d~~~l~G~ 132 (304) T protein:vir:94 57 YLAKGVGAYWVSE-TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIA---EAFYKAFDQAVIFGT 132 (304) T ss_pred EEeCCcceEEeec-CcccccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHH---HHHHHHHHhhheecc Confidence 8862323322221 234555444444444444444422222111222233333333333333 345556666655321 Q ss_pred hhc-cccccCccceeeccC--ceeE---ccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHH-hcCccccc Q lcl|NC_011222. 145 EAN-KTKVFGNLLYYTKNG--NDVQ---VKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQ-HGLYNDVN 217 (438) Q Consensus 145 eAN-KTqVf~d~Ly~t~~g--na~q---~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Klea-qG~~N~tN 217 (438) -++ -++.++..++..... .... ..+.+=.+++..+++ .++ .+-.++-+...+..|+++.. 
+|-+.=+- T Consensus 133 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~----~~~-~~~~~v~~~~~~~~L~~lkd~~G~~l~~~ 207 (304) T protein:vir:94 133 KSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIED----EEL-DPNGVLTTRSFRSKMRNALDANDRPLFDA 207 (304) T ss_pred CCCcccccccccccccccccccccccccchHHHHHHHHHHhhh----ccC-CcCEEEEcHHHHHHHHHhhccCCcEeecC Confidence 111 123333433322211 1111 111111222333333 222 23358889999999999863 22221111 Q ss_pred ceeEEeeeeeeeeccccchhhhchhhcccccccchhhhhhhhhhhc--cccCCccccceeeccc------ccceeeeeEe Q lcl|NC_011222. 218 KQLEYANKVFHFTNNMTLESENFAQMYAVESGNVGLLTRVDRAAYN--NTKSGTHEFGKVVLPY------FGKEVGTHYY 289 (438) Q Consensus 218 lq~Qy~N~~~~~sN~vat~A~n~at~Y~mpsG~vGm~twidRea~~--n~~S~t~~ft~vvdP~------fG~~~g~hyy 289 (438) -.-.+.++-.+.++.|+..+...--.+|-.+-. .|.+|++.. .+...+.+...+.||- |..---.+.- T Consensus 208 ~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~ 283 (304) T protein:vir:94 208 NGNEIMGLPLSYTGADVYDKKKSLALMGDWDYA----RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRA 283 (304) T ss_pred CCccccceeeEEecccccCCCCcEEEEEehhhE----EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEE Confidence 123456677788888876554333333333322 123333221 1111122222222221 1111011111 Q ss_pred eccccccccchhHHHhhhhhh Q lcl|NC_011222. 290 EEVGDQSAIAGEATADMTCDV 310 (438) Q Consensus 290 e~vgD~Sgi~Ge~tA~~t~a~ 310 (438) +.--|....+.++.+.++.|+ T Consensus 284 ~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 284 TMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEeccEeecccceEEEEecC Confidence 112255666666667777776 No 18 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=59.76 E-value=0.39 Score=22.85 Aligned_cols=273 Identities=10% Similarity=0.034 Sum_probs=105.4 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHH----HHHHHhh-----------c-CCceeEE Q lcl|NC_011222. 
1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDET----KRRAFAS-----------M-GSDIKIP 64 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et----~dkl~AS-----------~-~splkip 64 (438) |.. |...+++.+- +.+-...+|++. ++.++.. + +..+++| T Consensus 1 ma~-----~~~~~~~~~~-------------------t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip 56 (304) T protein:vir:10 1 MAT-----PTYTPGNVIL-------------------SDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT 56 (304) T ss_pred Ccc-----cccccccccc-------------------cCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE Confidence 211 1111111110 111112233333 2222211 1 2347788 Q ss_pred EEeccceEeeeecceEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011222. 65 VIDYDKNVTVSNARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAAL 144 (438) Q Consensus 65 VlnYr~s~Tv~nart~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaL 144 (438) +++=...+....- ...++..+..-.-+++..++++=.+.+.=.+...+.++.++-+.+.+. +++...+|...+.-= T Consensus 57 ~~~~~~~a~~v~E-~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~---~~ia~~~d~~~l~G~ 132 (304) T protein:vir:10 57 YLAKGVGAYWVSE-TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIA---EAFYKAFDQAVIFGT 132 (304) T ss_pred EEeCCcceEEeec-CcccccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHH---HHHHHHHHhhheecc Confidence 8862323322221 234555444444444444444422222111222233333333333333 345556666655321 Q ss_pred hhc-cccccCccceeeccC--ceeE---ccccchhHHHhcchHHHhccCcccceEEeecchhHHHHHHHHH-hcCccccc Q lcl|NC_011222. 145 EAN-KTKVFGNLLYYTKNG--NDVQ---VKFTQRNDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQ-HGLYNDVN 217 (438) Q Consensus 145 eAN-KTqVf~d~Ly~t~~g--na~q---~p~aqR~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~Klea-qG~~N~tN 217 (438) -++ -++.++..++..... .... ..+.+=.+++..+++ .++ .+-.++-+...+..|+++.. 
+|-+.=+- T Consensus 133 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~----~~~-~~~~~v~~~~~~~~L~~lkd~~G~~l~~~ 207 (304) T protein:vir:10 133 KSPYNTSTSGKPLVEGAEEKGNVVTDTNNLYVDLSALMATIED----EEL-DPNGVLTTRSFRSKMRNALDANDRPLFDA 207 (304) T ss_pred CCCcccccccccccccccccccccccccchHHHHHHHHHHhhh----ccC-CcCEEEEcHHHHHHHHHhhccCCcEeecC Confidence 111 123333433322211 1111 111111222333333 222 23358889999999999863 22221111 Q ss_pred ceeEEeeeeeeeeccccchhhhchhhcccccccchhhhhhhhhhhc--cccCCccccceeeccc------ccceeeeeEe Q lcl|NC_011222. 218 KQLEYANKVFHFTNNMTLESENFAQMYAVESGNVGLLTRVDRAAYN--NTKSGTHEFGKVVLPY------FGKEVGTHYY 289 (438) Q Consensus 218 lq~Qy~N~~~~~sN~vat~A~n~at~Y~mpsG~vGm~twidRea~~--n~~S~t~~ft~vvdP~------fG~~~g~hyy 289 (438) -.-.+.++-.+.++.|+..+...--.+|-.+-. .|.+|++.. .+...+.+...+.||- |..---.+.- T Consensus 208 ~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~----~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~ 283 (304) T protein:vir:10 208 NGNEIMGLPLSYTGADVYDKKKSLALMGDWDYA----RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRA 283 (304) T ss_pred CCccccceeeEEecccccCCCCcEEEEEehhhE----EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEE Confidence 123456677788888876554333333333322 123333221 1111122222222221 1111011111 Q ss_pred eccccccccchhHHHhhhhhh Q lcl|NC_011222. 290 EEVGDQSAIAGEATADMTCDV 310 (438) Q Consensus 290 e~vgD~Sgi~Ge~tA~~t~a~ 310 (438) +.--|....+.++.+.++.|+ T Consensus 284 ~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 284 TMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEeccEeecccceEEEEecC Confidence 112255666666667777776 No 19 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=58.86 E-value=0.4 Score=22.74 Aligned_cols=249 Identities=14% Similarity=0.158 Sum_probs=113.6 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhh--HHHhhccCccccccCChHHHHH--HHhhcCCceeEEEEeccceEeeee Q lcl|NC_011222. 
1 MSLIATRTQEFRLKNPNIDKNMARMTEWGA--YDFFLSQTNAMDSMLSDETKRR--AFASMGSDIKIPVIDYDKNVTVSN 76 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA--~d~f~~Qta~~~lMLs~Et~dk--l~AS~~splkipVlnYr~s~Tv~n 76 (438) |++ ++| -+..|+. .+.|..+.- +++-.-+. ..-..|.+++||++. .+++.. T Consensus 1 MA~-----~~~------------~pe~~~~~v~~~~~~~lv-----~~~l~~~~~~~~~~~Gdtv~ip~~~---~~~~~d 55 (273) T protein:vir:10 1 MAF-----NNF------------IPELWSDMLLEEWTAQTV-----FANLVNREYEGTASKGNVVHIAGVV---APTVKD 55 (273) T ss_pred Ccc-----hhh------------hHHHHHHHHHHHHHhhhc-----cchhhccccccccccCceEEEeecc---cccccc Confidence 332 000 1344543 455654432 22211010 112357899999984 122221 Q ss_pred --cceEEecCcccceeeeeeeeeee-eeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_011222. 77 --ARTCVIADAENTSRLIGVTWKTY-AFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFG 153 (438) Q Consensus 77 --art~vi~~~entsr~iT~v~~T~-~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~ 153 (438) ...-.+..++.+-.-++++-... .|+|.+. .+=. =+..+ |++.-.+.+-+.+...+|.+..+.+.+..+.+-. T Consensus 56 ~~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~-d~d~-~~~~~--~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~ 131 (273) T protein:vir:10 56 YKAAGRQTSADAISDTGVDLLIDQEKSIDFLVD-DIDR-VQVAG--SLEAYTRAGATALATDTDKFIADMLVDNGTALTG 131 (273) T ss_pred cccCCCccCccccccceEEEEEeeeeecceEee-cHHH-hhhhc--cHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 11222444454445555555443 5566542 2111 11111 2444445566778889999999888776555422 Q ss_pred ccceeeccCceeEccccchhHHHhc---chHHHhccCc--ccceEEeecchhHHHHHHHHH----hcCccccc-c---e- Q lcl|NC_011222. 154 NLLYYTKNGNDVQVKFTQRNDILSD---LHPMFRANDY--SGQLHIIGDTSVDSMLRKLEQ----HGLYNDVN-K---Q- 219 (438) Q Consensus 154 d~Ly~t~~gna~q~p~aqR~e~~~d---l~am~R~NDy--~Gpl~~iadT~v~s~l~Klea----qG~~N~tN-l---q- 219 (438) . .+. ...+++.. 
+...|..+++ .||+ ++=+...+.+|.+..+ ...+++.+ + + T Consensus 132 ~------------~~~-~~~~~~~~i~~a~~~ld~~~vP~~~R~-lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~i 197 (273) T protein:vir:10 132 S------------APT-DADDAFDLIAKALKELTKANVPNVGRV-VVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI 197 (273) T ss_pred c------------ccc-chhHHHHHHHHHHHHhhhcCCCcCCCE-EEECHHHHHHHhcchhhhhhhhccccccceeeeee Confidence 1 111 22344444 4445677777 5655 4567888888877543 12222221 1 1 Q ss_pred eEEeeeeeeeeccccchhhhchhhcccccccchhhhhhh-hhhhccccCCccccceeecccccceeeeeEeec-cccccc Q lcl|NC_011222. 220 LEYANKVFHFTNNMTLESENFAQMYAVESGNVGLLTRVD-RAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEE-VGDQSA 297 (438) Q Consensus 220 ~Qy~N~~~~~sN~vat~A~n~at~Y~mpsG~vGm~twid-Rea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~-vgD~Sg 297 (438) -++.+..|++||++..... .+.++.-.+++|+...++ -|.+ ++. +.|... ..|.|+|-. |=+-.| T Consensus 198 g~i~G~~v~~s~~lp~~~~--~~~~~~~~~A~~~a~q~~~~e~~---r~~-~~~~~~-------v~~~~~yg~~v~~~~~ 264 (273) T protein:vir:10 198 GNLLGARIVESNNLRDTDD--EQFVAFHPSAAAYVSQIDTVEAL---RDQ-DSFSDR-------IRALHVYGGKVVRPTG 264 (273) T ss_pred eEEeceEEEEecccccCCc--cEEEEEeccceeeeeeeehhhcc---cCC-Ccceee-------eeeeeeeeeeEeccce Confidence 3467899999999965433 334444455566655543 1111 111 122222 223333210 011111 Q ss_pred cch-hHHHh Q lcl|NC_011222. 298 IAG-EATAD 305 (438) Q Consensus 298 i~G-e~tA~ 305 (438) +.= .+++. T Consensus 265 ~~~l~~~g~ 273 (273) T protein:vir:10 265 VVVFNKTGS 273 (273) T ss_pred EEEEeccCC Confidence 100 00000 No 20 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=58.86 E-value=0.4 Score=22.74 Aligned_cols=249 Identities=14% Similarity=0.158 Sum_probs=113.6 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhh--HHHhhccCccccccCChHHHHH--HHhhcCCceeEEEEeccceEeeee Q lcl|NC_011222. 
1 MSLIATRTQEFRLKNPNIDKNMARMTEWGA--YDFFLSQTNAMDSMLSDETKRR--AFASMGSDIKIPVIDYDKNVTVSN 76 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA--~d~f~~Qta~~~lMLs~Et~dk--l~AS~~splkipVlnYr~s~Tv~n 76 (438) |++ ++| -+..|+. .+.|..+.- +++-.-+. ..-..|.+++||++. .+++.. T Consensus 1 MA~-----~~~------------~pe~~~~~v~~~~~~~lv-----~~~l~~~~~~~~~~~Gdtv~ip~~~---~~~~~d 55 (273) T protein:vir:10 1 MAF-----NNF------------IPELWSDMLLEEWTAQTV-----FANLVNREYEGTASKGNVVHIAGVV---APTVKD 55 (273) T ss_pred Ccc-----hhh------------hHHHHHHHHHHHHHhhhc-----cchhhccccccccccCceEEEeecc---cccccc Confidence 332 000 1344543 455654432 22211010 112357899999984 122221 Q ss_pred --cceEEecCcccceeeeeeeeeee-eeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccC Q lcl|NC_011222. 77 --ARTCVIADAENTSRLIGVTWKTY-AFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFG 153 (438) Q Consensus 77 --art~vi~~~entsr~iT~v~~T~-~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~ 153 (438) ...-.+..++.+-.-++++-... .|+|.+. .+=. =+..+ |++.-.+.+-+.+...+|.+..+.+.+..+.+-. T Consensus 56 ~~~~~~~~~~~~~~~~~~~~tid~~~~~~~~i~-d~d~-~~~~~--~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~ 131 (273) T protein:vir:10 56 YKAAGRQTSADAISDTGVDLLIDQEKSIDFLVD-DIDR-VQVAG--SLEAYTRAGATALATDTDKFIADMLVDNGTALTG 131 (273) T ss_pred cccCCCccCccccccceEEEEEeeeeecceEee-cHHH-hhhhc--cHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 11222444454445555555443 5566542 2111 11111 2444445566778889999999888776555422 Q ss_pred ccceeeccCceeEccccchhHHHhc---chHHHhccCc--ccceEEeecchhHHHHHHHHH----hcCccccc-c---e- Q lcl|NC_011222. 154 NLLYYTKNGNDVQVKFTQRNDILSD---LHPMFRANDY--SGQLHIIGDTSVDSMLRKLEQ----HGLYNDVN-K---Q- 219 (438) Q Consensus 154 d~Ly~t~~gna~q~p~aqR~e~~~d---l~am~R~NDy--~Gpl~~iadT~v~s~l~Klea----qG~~N~tN-l---q- 219 (438) . .+. ...+++.. 
+...|..+++ .||+ ++=+...+.+|.+..+ ...+++.+ + + T Consensus 132 ~------------~~~-~~~~~~~~i~~a~~~ld~~~vP~~~R~-lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~i 197 (273) T protein:vir:10 132 S------------APT-DADDAFDLIAKALKELTKANVPNVGRV-VVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI 197 (273) T ss_pred c------------ccc-chhHHHHHHHHHHHHhhhcCCCcCCCE-EEECHHHHHHHhcchhhhhhhhccccccceeeeee Confidence 1 111 22344444 4445677777 5655 4567888888877543 12222221 1 1 Q ss_pred eEEeeeeeeeeccccchhhhchhhcccccccchhhhhhh-hhhhccccCCccccceeecccccceeeeeEeec-cccccc Q lcl|NC_011222. 220 LEYANKVFHFTNNMTLESENFAQMYAVESGNVGLLTRVD-RAAYNNTKSGTHEFGKVVLPYFGKEVGTHYYEE-VGDQSA 297 (438) Q Consensus 220 ~Qy~N~~~~~sN~vat~A~n~at~Y~mpsG~vGm~twid-Rea~~n~~S~t~~ft~vvdP~fG~~~g~hyye~-vgD~Sg 297 (438) -++.+..|++||++..... .+.++.-.+++|+...++ -|.+ ++. +.|... ..|.|+|-. |=+-.| T Consensus 198 g~i~G~~v~~s~~lp~~~~--~~~~~~~~~A~~~a~q~~~~e~~---r~~-~~~~~~-------v~~~~~yg~~v~~~~~ 264 (273) T protein:vir:10 198 GNLLGARIVESNNLRDTDD--EQFVAFHPSAAAYVSQIDTVEAL---RDQ-DSFSDR-------IRALHVYGGKVVRPTG 264 (273) T ss_pred eEEeceEEEEecccccCCc--cEEEEEeccceeeeeeeehhhcc---cCC-Ccceee-------eeeeeeeeeeEeccce Confidence 3467899999999965433 334444455566655543 1111 111 122222 223333210 011111 Q ss_pred cch-hHHHh Q lcl|NC_011222. 298 IAG-EATAD 305 (438) Q Consensus 298 i~G-e~tA~ 305 (438) +.= .+++. T Consensus 265 ~~~l~~~g~ 273 (273) T protein:vir:10 265 VVVFNKTGS 273 (273) T ss_pred EEEEeccCC Confidence 100 00000 No 21 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=54.82 E-value=0.49 Score=22.26 Aligned_cols=284 Identities=13% Similarity=0.058 Sum_probs=124.7 Q ss_pred Ccccc--cchhhhc--ccCCCcc-hhhhhhhhhhh--HHHhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceEe Q lcl|NC_011222. 
1 MSLIA--TRTQEFR--LKNPNID-KNMARMTEWGA--YDFFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNVT 73 (438) Q Consensus 1 ~~~~~--~~~q~~r--~K~~n~d-K~e~R~s~wGA--~d~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~T 73 (438) |+.++ +|.+-+| .++++.| +.-.=+..|+. ..-|.-++-...++.. .++ + -|..++||.|. ++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~-r~i---~--~G~tv~i~~ig---~~~ 71 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRS-YDL---R--GGKSKQFMFTG---KLS 71 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcccc-ccc---c--ccceEEEEecc---cee Confidence 66554 4555554 4555665 22334455653 3345444332222222 222 2 48889999885 444 Q ss_pred eee--cceEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011222. 74 VSN--ARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWN-------RKLQKHIRKFMDTVDKDAIAAL 144 (438) Q Consensus 74 v~n--art~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~-------~kfqk~L~k~~~tvD~~~~AaL 144 (438) +.- .++-..++++.+..-++ ++|-=.-|.+..|+..+++. .-.+.+=++|....|.+.+++| T Consensus 72 ~~~~~~g~~l~~~~~~~~~~~~---------l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l 142 (332) T protein:vir:78 72 AGYHTPGTPIVGDAGIKANEKT---------LVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVL 142 (332) T ss_pred EeeecCCCCCCCCCCCCCceEE---------EEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 443 33333344333322223 33333456666665544433 2456777889999999888877 Q ss_pred hhc------cccccCccceeeccCceeEccccchhHHHhcchHHHhccCc--ccceEEeecchhHHHHHHHHH-----h- Q lcl|NC_011222. 145 EAN------KTKVFGNLLYYTKNGNDVQVKFTQRNDILSDLHPMFRANDY--SGQLHIIGDTSVDSMLRKLEQ-----H- 210 (438) Q Consensus 145 eAN------KTqVf~d~Ly~t~~gna~q~p~aqR~e~~~dl~am~R~NDy--~Gpl~~iadT~v~s~l~Klea-----q- 210 (438) ..- -|.+.+.+-..-..+ -+.+=..=-+.+-++...|.++|+ .||+.++ +...++.|-|... . 
T Consensus 143 ~~aa~~~~~~~~~~g~~~~~~~~~--~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv-~P~~y~~Ll~~~d~~~~n~~ 219 (332) T protein:vir:78 143 AKASAEASPVTGEPGGFHVNIGAG--NTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVL-SPRQYYSLISSVDTNILNRE 219 (332) T ss_pred HhhhcccCcccccccccccccCCc--cccCHHHHHHHHHHHHHHHhhcCCCccCCEEEe-CHHHHHHHHhhcCceeeeee Confidence 532 122222222111111 121111112345677888999998 5766655 5677877766421 1 Q ss_pred -cCcc--cccc--eeEEeeeeeeeeccccchhhhchhhcccc-------------cccchhhhhhhhhhhccccCCcccc Q lcl|NC_011222. 211 -GLYN--DVNK--QLEYANKVFHFTNNMTLESENFAQMYAVE-------------SGNVGLLTRVDRAAYNNTKSGTHEF 272 (438) Q Consensus 211 -G~~N--~tNl--q~Qy~N~~~~~sN~vat~A~n~at~Y~mp-------------sG~vGm~twidRea~~n~~S~t~~f 272 (438) |+.+ -.|. --++++..+++||++.+.+ ++.+.+. +..+|+.+ |...=.+ T Consensus 220 ~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~---g~~~~~~~~~~~~n~~~~~~~~~~~~~~----------h~~a~~~ 286 (332) T protein:vir:78 220 IGNSQGDMNSGKGLYSIAGIRILKSNNLAGLY---GQDLSSAAVTGENNDYQVDASALAGLIF----------HREAAGC 286 (332) T ss_pred ccccccceecceeeeEEeeeEEEecCccccCc---ccccccccccccccccccccccceEEee----------cccceee Confidence 2222 1222 2377899999999997544 1222110 11122111 1111112 Q ss_pred ceeecccccceeeeeEeeccccccccchhHHHhhhhhhhhcCceEEEeEEEEeecCCcccCccceEEEEeecc Q lcl|NC_011222. 273 GKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMTCDVKHFYGFSVDIAFVVAFNSDPATIANPIMKVEVNKE 345 (438) Q Consensus 273 t~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t~a~kh~Y~~~aD~sfvVAfnSD~A~~~~~IlK~~~~~e 345 (438) .+.+||-+.++.+-+.-+.-+| .|.| .- -||..+. 
..|.+ +.+++- T Consensus 287 v~~~~~~~~~t~~~~~~~~~~d--~i~~---------~~-~~G~~v~-------rPe~~--------v~l~~a 332 (332) T protein:vir:78 287 IQSVAPTIQTTSGDFNVQYQGD--LIVG---------KL-AMGCGSL-------RTSVA--------GSFQAA 332 (332) T ss_pred eeeeccchhhhhcccchhhhHh--hhhh---------hh-hhcCcee-------cccce--------EEEeeC Confidence 2223333333222221111111 1111 10 1222110 00000 000000 No 22 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=50.40 E-value=0.61 Score=21.76 Aligned_cols=258 Identities=10% Similarity=0.066 Sum_probs=104.5 Q ss_pred hccCccccccCChHH----HHHHHh-----------hc-CCceeEEEEeccceEeeeecceEEecCcccceeeeeeeeee Q lcl|NC_011222. 35 LSQTNAMDSMLSDET----KRRAFA-----------SM-GSDIKIPVIDYDKNVTVSNARTCVIADAENTSRLIGVTWKT 98 (438) Q Consensus 35 ~~Qta~~~lMLs~Et----~dkl~A-----------S~-~splkipVlnYr~s~Tv~nart~vi~~~entsr~iT~v~~T 98 (438) |.-+.+-...+|+|. +++++. .+ +..+++|+......+.... ....++.++-+-.-+++..+. T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~-E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVA-ENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEee-cCccccccccceeeEEeeeEE Confidence 444344445555554 333331 12 2457888886444444333 234445444433333333333 Q ss_pred eeeeeeeeeeeec-------ccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-ccccc--cCccceeeccCceeEcc Q lcl|NC_011222. 99 YAFGFTMIPNMYS-------NNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEA-NKTKV--FGNLLYYTKNGNDVQVK 168 (438) Q Consensus 99 ~~w~vti~P~i~~-------NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeA-NKTqV--f~d~Ly~t~~gna~q~p 168 (438) .+.-+ .++. .-.++.++-+...+. +++.+.+|...+..... ..|.. .+..++...+++++.. 
T Consensus 80 --l~~~~--~iS~ell~~~~d~~~~l~~~i~~~la---~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~- 151 (303) T protein:vir:97 80 --VEYGA--RLSDEFLYATEEEKIDILKAFNEGFA---KKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKF- 151 (303) T ss_pred --EEEee--hhhHHHhhcCccchHHHHHHHHHHHH---HHHHHHHHhhhhcccccCCcccccccccccccccccccccc- Confidence 33222 2211 112223333333333 45667788777755421 22222 2222344444433322 Q ss_pred ccchhHHHhcchHH---HhccCcccceEEeecchhHHHHHHHH-HhcCcc---cccc--e-eEEeeeeeeeeccccchhh Q lcl|NC_011222. 169 FTQRNDILSDLHPM---FRANDYSGQLHIIGDTSVDSMLRKLE-QHGLYN---DVNK--Q-LEYANKVFHFTNNMTLESE 238 (438) Q Consensus 169 ~aqR~e~~~dl~am---~R~NDy~Gpl~~iadT~v~s~l~Kle-aqG~~N---~tNl--q-~Qy~N~~~~~sN~vat~A~ 238 (438) +.-...+.+|..+ +..++ ..+=.++-+.....+|+++. ++|.|= +... + -.+.+.-.++++.|..... T Consensus 152 -~~~~~~~~~i~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 229 (303) T protein:vir:97 152 -TESEDADANIEAAVNLIQGAE-GVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGAD 229 (303) T ss_pred -ccccchHHHHHHHHHHHhhcC-CCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccc Confidence 1111222233322 22332 33335889999999999986 344321 1111 1 2567788899988764321 Q ss_pred hchhhcccccccc-hhhhhhhhhhhc-cc-----cCCc--cccceeecccccceeeeeEeeccccccccchhHHHhhhhh Q lcl|NC_011222. 239 NFAQMYAVESGNV-GLLTRVDRAAYN-NT-----KSGT--HEFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMTCD 309 (438) Q Consensus 239 n~at~Y~mpsG~v-Gm~twidRea~~-n~-----~S~t--~~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t~a 309 (438) -......+==|.. ..+.|..|+... .+ .+++ +.|.+. +..+..-.+| |....+-++-+.++.+ T Consensus 230 ~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n---~~~~r~~~r~-----~~~v~~p~af~~l~~~ 301 (303) T protein:vir:97 230 EAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYN---QIYLRAEAYI-----GWGILDAKSFARVTKG 301 (303) T ss_pred cCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcC---cEEEEEEEEe-----ccEeecccceEEeeCC Confidence 1000000000000 112233332221 00 0111 122221 1111112222 4444444544444444 Q ss_pred hh Q lcl|NC_011222. 
310 VK 311 (438) Q Consensus 310 ~k 311 (438) += T Consensus 302 ~~ 303 (303) T protein:vir:97 302 EV 303 (303) T ss_pred CC Confidence 43 No 23 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=39.46 E-value=1 Score=20.54 Aligned_cols=296 Identities=13% Similarity=0.010 Sum_probs=132.3 Q ss_pred Cc------ccccchhhhcccCCCcchhhhhhhhhhhHH--HhhccCccccccCChHHHHHHHhhcCCceeEEEEeccceE Q lcl|NC_011222. 1 MS------LIATRTQEFRLKNPNIDKNMARMTEWGAYD--FFLSQTNAMDSMLSDETKRRAFASMGSDIKIPVIDYDKNV 72 (438) Q Consensus 1 ~~------~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d--~f~~Qta~~~lMLs~Et~dkl~AS~~splkipVlnYr~s~ 72 (438) |. .+.||-.- -..+.|+--+=+..|+..= .|..++-.+.++- -+++ .-|..+++|.|. +.++ T Consensus 1 ~a~~~~~~~~~~~~g~---~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~-~r~i-----~~G~sv~~~~iG-~~~~ 70 (347) T protein:vir:88 1 MANATGGQQIGANQGK---GQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHM-VRTI-----QNGKSASFPVMG-RTKG 70 (347) T ss_pred CCCcccchhhhccCCC---CccccchHHHHHHHHHHHHHHHHHHHhhhhhccc-cccc-----cCcceEEEeeec-ceee Confidence 33 22233222 2355666666667777543 3554443222222 2222 238899999986 4444 Q ss_pred eeeecceEEecCcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011222. 73 TVSNARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNR-------KLQKHIRKFMDTVDKDAIAALE 145 (438) Q Consensus 73 Tv~nart~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~-------kfqk~L~k~~~tvD~~~~AaLe 145 (438) .-...++-...+-++. .+--+-++|--.-|.+..|+..+++.- -.+.+-++|....|...+.+|. 
T Consensus 71 ~~~~~g~~l~~~~~~~--------~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~ 142 (347) T protein:vir:88 71 YYLAPGENLDDKRKDI--------KHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMA 142 (347) T ss_pred eeeccccCCCCCCCCC--------ccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3333444332221111 112234555556778888888887652 3677788888999998887774 Q ss_pred h--cccc-----cc---CccceeeccCceeEccccchhHH---HhcchHHHhccCcc--cceEEeecchhHHHHHHHHHh Q lcl|NC_011222. 146 A--NKTK-----VF---GNLLYYTKNGNDVQVKFTQRNDI---LSDLHPMFRANDYS--GQLHIIGDTSVDSMLRKLEQH 210 (438) Q Consensus 146 A--NKTq-----Vf---~d~Ly~t~~gna~q~p~aqR~e~---~~dl~am~R~NDy~--Gpl~~iadT~v~s~l~Kleaq 210 (438) . +.+- +. +...-...++.....|-.-.+.+ +-++...|.++|+- ||+.+| +...+++|.+..+- T Consensus 143 ~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv-~P~~y~~Ll~~~~~ 221 (347) T protein:vir:88 143 KLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYC-APEDYSAILSALMP 221 (347) T ss_pred HhhccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEe-CHHHHHHHhcchhh Confidence 2 2111 11 11111112222222222223333 34466778888874 666665 45556677553211 Q ss_pred --cCcc----cccce-eEEeeeeeeeeccccchhh---h-------chhhcccccccchhhhh-hhhhhhccccCCcccc Q lcl|NC_011222. 211 --GLYN----DVNKQ-LEYANKVFHFTNNMTLESE---N-------FAQMYAVESGNVGLLTR-VDRAAYNNTKSGTHEF 272 (438) Q Consensus 211 --G~~N----~tNlq-~Qy~N~~~~~sN~vat~A~---n-------~at~Y~mpsG~vGm~tw-idRea~~n~~S~t~~f 272 (438) +-++ -.|-+ -.+....+++||++..++. . -+..++.+++..+=+.| +++....--|..-=+. 
T Consensus 222 ~~~~~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~ 301 (347) T protein:vir:88 222 NAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGT 301 (347) T ss_pred hhhhhccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhh Confidence 1111 11222 2466889999999975442 1 12223333333333332 1111111000000001 Q ss_pred ceeecccccceeeeeEeeccccccccchhHHHhhhhhhhhcCceEE---EeEEEEeecCCc Q lcl|NC_011222. 273 GKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMTCDVKHFYGFSV---DIAFVVAFNSDP 330 (438) Q Consensus 273 t~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t~a~kh~Y~~~a---D~sfvVAfnSD~ 330 (438) -+.+||-.+..+.. +.-++.+.+ ++-||..+ |.+.++.+++=+ T Consensus 302 v~~~d~~~e~~r~~--------------~~~~d~i~~-~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 302 VKLKDMALERARRP--------------EFQADQIIG-KYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred eecccceeeeeech--------------hhHHHHhhh-hhhhcCceeccceEEEEEeCCCC Confidence 11222222211111 122232222 23355443 333334333322 No 24 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=25.54 E-value=2 Score=18.88 Aligned_cols=276 Identities=15% Similarity=0.079 Sum_probs=105.3 Q ss_pred hhhhhhhhhHHHhhccCccc--cccCChHH----HHHHHh------------hcCCceeEEEEeccceEeeeecceEEec Q lcl|NC_011222. 22 MARMTEWGAYDFFLSQTNAM--DSMLSDET----KRRAFA------------SMGSDIKIPVIDYDKNVTVSNARTCVIA 83 (438) Q Consensus 22 e~R~s~wGA~d~f~~Qta~~--~lMLs~Et----~dkl~A------------S~~splkipVlnYr~s~Tv~nart~vi~ 83 (438) |.|-+.+-+-..-+.++.+. ...|+++. ++.++. 
.-+..+++||++-..++...+- ...++ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E-~~~~~ 79 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGE-GDMKP 79 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecC-Ccccc Confidence 44444342222222222221 12344444 222221 1134578888865555554432 34566 Q ss_pred CcccceeeeeeeeeeeeeeeeeeeeeecccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccCccc----eee Q lcl|NC_011222. 84 DAENTSRLIGVTWKTYAFGFTMIPNMYSNNEIDYQQDWNRKLQKHIRKFMDTVDKDAIAALEANKTKVFGNLL----YYT 159 (438) Q Consensus 84 ~~entsr~iT~v~~T~~w~vti~P~i~~NNeisyq~D~~~kfqk~L~k~~~tvD~~~~AaLeANKTqVf~d~L----y~t 159 (438) ..+..-.-+++.+.+++=.+.+.=.+...+.++.++.+.++|.++ +...+|...+.--.+.+-.-+...+ ... T Consensus 80 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a---~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~ 156 (320) T protein:vir:10 80 ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATA---FAMAFDSAALNGTDSPFPTYLAQTTKSVSLAD 156 (320) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHH---HHHHHHHHhhcccCCCCCccccccccccccee Confidence 666555556666666665555433344445566666666666544 4456777664321111100000000 011 Q ss_pred ccCceeEccccch-hHHHhcchHHHhccCcccceEEeecchhHHHHHHHHHhcC--------cccccc---eeEEeeeee Q lcl|NC_011222. 160 KNGNDVQVKFTQR-NDILSDLHPMFRANDYSGQLHIIGDTSVDSMLRKLEQHGL--------YNDVNK---QLEYANKVF 227 (438) Q Consensus 160 ~~gna~q~p~aqR-~e~~~dl~am~R~NDy~Gpl~~iadT~v~s~l~KleaqG~--------~N~tNl---q~Qy~N~~~ 227 (438) .++ .+...... +..+.+.-. ...+.+.++-.++-+...+.+|+++....+ .++... +..+..... 
T Consensus 157 ~~~--~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv 233 (320) T protein:vir:10 157 PGG--ATASDLTAYDAVAVNGLS-LLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPT 233 (320) T ss_pred ccc--ccccccccHHHHHHHHHh-hhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeee Confidence 111 11111111 122323222 233456777899999999999999876321 111111 123444444 Q ss_pred eeeccccchhhhchhhcccccccc-----hhhhhhhhhhhccccCCc-----cccceeecccccceeeeeEeeccccccc Q lcl|NC_011222. 228 HFTNNMTLESENFAQMYAVESGNV-----GLLTRVDRAAYNNTKSGT-----HEFGKVVLPYFGKEVGTHYYEEVGDQSA 297 (438) Q Consensus 228 ~~sN~vat~A~n~at~Y~mpsG~v-----Gm~twidRea~~n~~S~t-----~~ft~vvdP~fG~~~g~hyye~vgD~Sg 297 (438) +.++.+... ...-.||=.+..+ |+...++|++........ ..|.+.+ ..+-.-.++ |..- T Consensus 234 ~~~~~~~~~--~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~---~~~r~~~~~-----d~~v 303 (320) T protein:vir:10 234 ILSDHVADG--TTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNL---VAVRVEAEY-----AFHN 303 (320) T ss_pred EecCCCCCC--ceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCc---EEEEEEEee-----ccEE Confidence 555544221 1000111111111 111123333322111110 1122111 111111111 2211 Q ss_pred cchhHHHhhhhhhhhcCceEEEeEEEEeecCCc Q lcl|NC_011222. 298 IAGEATADMTCDVKHFYGFSVDIAFVVAFNSDP 330 (438) Q Consensus 298 i~Ge~tA~~t~a~kh~Y~~~aD~sfvVAfnSD~ 330 (438) .+.++.+.| .-++| .|+ T Consensus 304 ~~~~a~~~l--------------~~~~a--p~~ 320 (320) T protein:vir:10 304 NDKDAFVKL--------------TNVVT--PDA 320 (320) T ss_pred ecccceEEE--------------EeccC--CCC Confidence 111111111 11111 111 No 25 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=24.91 E-value=2.1 Score=18.80 Aligned_cols=261 Identities=8% Similarity=-0.036 Sum_probs=96.9 Q ss_pred CcccccchhhhcccCCCcchhhhhhhhhhhHHHhhccCccccccCChHH----HHHHHh------------hcCCceeEE Q lcl|NC_011222. 
1 MSLIATRTQEFRLKNPNIDKNMARMTEWGAYDFFLSQTNAMDSMLSDET----KRRAFA------------SMGSDIKIP 64 (438) Q Consensus 1 ~~~~~~~~q~~r~K~~n~dK~e~R~s~wGA~d~f~~Qta~~~lMLs~Et----~dkl~A------------S~~splkip 64 (438) |.-+++ -...+|++. +++++. --+..+++| T Consensus 1 mat~~~----------------------------------gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p 46 (311) T protein:vir:81 1 MVALAT----------------------------------GTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYM 46 (311) T ss_pred CceecC----------------------------------CceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEE Confidence 221111 112222222 122211 112458889 Q ss_pred EEeccceEeeeecceEEecCcccceeeeeeeeeeeeeeeeeeeeeeccccc-----CHHHHHHHHHHHHH-HHHHHHHHH Q lcl|NC_011222. 65 VIDYDKNVTVSNARTCVIADAENTSRLIGVTWKTYAFGFTMIPNMYSNNEI-----DYQQDWNRKLQKHI-RKFMDTVDK 138 (438) Q Consensus 65 VlnYr~s~Tv~nart~vi~~~entsr~iT~v~~T~~w~vti~P~i~~NNei-----syq~D~~~kfqk~L-~k~~~tvD~ 138 (438) +++-...+.... -...++..+-+-.-+++..+..+ -+..+ -||+ +...++...+.+.| +++.+.+|. T Consensus 47 ~~~~~~~a~wv~-Eg~~~~~~~~~f~~v~l~~~kl~----~~~~i--S~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~ 119 (311) T protein:vir:81 47 TLTAPPRGEVVG-EGAQKSESTATFAPVTAIPRKVQ----VTQRF--SQEVKWADESRQLGVLQTMADLSGVALGRALDL 119 (311) T ss_pred EEeCCceeEEee-cCcccccccceeeEEEEeeEEEE----Eeehh--hHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHH Confidence 886555444332 23344444444344444444443 21122 2222 22223333333333 456777888 Q ss_pred HHHHHHhhccccccCccc-eeeccCceeEccccchhHHHhcchH---HHhccCcccceEEeecchhHHHHHHHH-HhcCc Q lcl|NC_011222. 139 DAIAALEANKTKVFGNLL-YYTKNGNDVQVKFTQRNDILSDLHP---MFRANDYSGQLHIIGDTSVDSMLRKLE-QHGLY 213 (438) Q Consensus 139 ~~~AaLeANKTqVf~d~L-y~t~~gna~q~p~aqR~e~~~dl~a---m~R~NDy~Gpl~~iadT~v~s~l~Kle-aqG~~ 213 (438) ..+.-..+.+-..+.-.+ .-+.+.+..+..-........++.. .++.+++.. -.++-|+....+|++|. 
++|-+ T Consensus 120 a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~vmn~~~~~~l~~lkd~~G~~ 198 (311) T protein:vir:81 120 IGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSP-DGVALDNTFSFMLATQRDSQGRK 198 (311) T ss_pred hhhccccCCCCcccccccccccccceeeeecccccchHHHHHHHHHHHhhhcCCCc-eEEEEcHHHHHHHHhhhccCCCe Confidence 777654433322221111 1122233333332222223333333 334444433 24888999999999986 33322 Q ss_pred --ccc---cceeEEeeeeeeeeccccchh-----hhchhhcccccccchh-----hhhhhhhhhccccC-------Cc-c Q lcl|NC_011222. 214 --NDV---NKQLEYANKVFHFTNNMTLES-----ENFAQMYAVESGNVGL-----LTRVDRAAYNNTKS-------GT-H 270 (438) Q Consensus 214 --N~t---Nlq~Qy~N~~~~~sN~vat~A-----~n~at~Y~mpsG~vGm-----~twidRea~~n~~S-------~t-~ 270 (438) +.. --+..+.+.-.++++.|...- .......+...+.+-+ +.|-.|.. ..+.. ++ . T Consensus 199 l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~-~~~~~~~~~~~~~~~~ 277 (311) T protein:vir:81 199 LYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVS-IPLELIEFGDPDGLGD 277 (311) T ss_pred eecCccccCCCceecceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEecc-ceEEEeccCCCCcchh Confidence 111 112345556667776664221 1111112222211100 11101111 00000 00 1 Q ss_pred ccceeecccccceeeeeEeeccccccccchhHHHhhhhhhhh Q lcl|NC_011222. 271 EFGKVVLPYFGKEVGTHYYEEVGDQSAIAGEATADMTCDVKH 312 (438) Q Consensus 271 ~ft~vvdP~fG~~~g~hyye~vgD~Sgi~Ge~tA~~t~a~kh 312 (438) .|.+. +..+..-.. -|..-.+-++.+.++.|.+. T Consensus 278 ~~~~~---~v~~r~~~r-----~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 278 LKRQN---QIAIRAEVV-----YGIGIMSTDAFAVVRDADES 311 (311) T ss_pred hhhcC---cEEEEEEEE-----eccEeecccceEEEEeeccC Confidence 12111 011111111 13333333444444444333 Done!