Query lcl|NC_019417.1_cdsid_YP_006990300.1 [gene=D855_gp53] [protein=hypothetical protein] [protein_id=YP_006990300.1] [location=complement(39602..40105)] Match_columns 167 No_of_seqs 9 out of 11 Neff 3.0 Searched_HMMs 1612 Date Thu Nov 7 17:52:55 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_53 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_53_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:6374 Length: 179 # 100.0 2.2E-80 1.4E-83 457.3 9.9 165 1-167 4-171 (179) 2 protein:vir:79247 Length: 157 90.6 0.015 9.3E-06 30.6 9.7 136 1-167 1-150 (157) 3 protein:vir:99226 Length: 157 80.3 0.096 6E-05 26.2 9.2 137 1-167 1-150 (157) 4 protein:vir:3428 Length: 131 # 36.7 1.2 0.00072 20.2 10.0 125 1-164 1-131 (131) 5 protein:vir:107704 Length: 132 21.6 1.6 0.001 19.4 3.8 57 93-167 1-71 (132) 6 protein:vir:79571 Length: 137 18.7 3.1 0.0019 17.9 10.7 128 1-164 5-137 (137) 7 protein:vir:99874 Length: 154 14.6 4.2 0.0026 17.2 7.1 132 1-164 1-154 (154) 8 protein:vir:79637 Length: 130 12.9 3.9 0.0024 17.3 3.7 58 93-167 1-72 (130) 9 protein:vir:103883 Length: 159 12.1 5.2 0.0033 16.6 9.3 136 1-167 3-152 (159) 10 protein:vir:397 Length: 132 # 10.8 6 0.0037 16.3 10.4 126 1-164 1-132 (132) No 1 >protein:vir:6374 Length: 179 # NCBI annotation: hypothetical protein # Family: family:all:29418 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918987;genbank:gi:34610162;genbank:gi:91214208;genbank:GeneID:2559591 Probab=100.00 E-value=2.2e-80 Score=457.25 Aligned_cols=165 Identities=44% Similarity=0.789 Sum_probs=162.9 Q ss_pred CchHHHHHHHHHHHhhccccccceecccCCceeeccEEeccCCCCccceeEeecccCCCccCCCCccCceeecceEEEEe Q lcl|NC_019417. 1 MSQRLDILKALTAHLEQITIENGYAYDLKGKVYRGRDRFGADFTSKLPIVSILEAKATDYGAFANEEQTVRMDDWVLLVQ 80 (167) Q Consensus 1 ~s~rL~ilk~LTa~Le~IT~aNGY~~Dla~~VfRGR~~fg~n~~~~iP~v~IlE~p~~~~~~~~~~~~~~~~~~w~llvq 80 (167) =-|||+|||+||+||++|||||||||||+.+|||||++||+| +|+|||||||+|+|+++++++++++++.++|.++|| T Consensus 4 ~p~~l~i~k~LTs~L~~iT~aNGy~fDl~~~vfRgR~~fg~~--~p~P~vsilE~~~p~~~lg~d~ng~vq~~~w~~l~Q 81 (179) T protein:vir:63 4 DPKKLVILKKLTAHLEGVTPTNGFQFDLSSGIYRNRVQFGAE--TPAPAVSILEAQRPDHGLDADENGQAQSEDWLLLVQ 81 (179) T ss_pred CchhhhhhHHHHHHhhhcccccccccchhhhhhhcceeecCC--CCCcEEEeecccCCccccCCCCCCcccccchhhhhh Confidence 468999999999999999999999999999999999999999 999999999999999999999999999999999999 Q ss_pred eeee--cCCCCCchHHHHHHHHHHHHHhhhhhcccc-ccccccchhcccceeeeeeecCCCCCCchhhccceeEEEEEEE Q lcl|NC_019417. 81 GWVK--DDPRNPTDPAYELLAEVEKRLAMLVAKDEN-GQPMYPALYRLGGKIAKLTLAQPVVRPPEDGLSDTAFFFLPVR 157 (167) Q Consensus 81 G~V~--dD~~~PtDpA~~L~ADV~k~L~~~~~~d~~-~~p~~~~~~~l~~~V~~lti~~~v~rppedgvs~~A~F~l~v~ 157 (167) |||+ +|++||||+||+|||||||||+++++.++. |||+||++|||+|+|++|+||++||||||||+|++||||||++ T Consensus 82 g~V~~aed~~hPtD~Ah~lmADVkkrL~~~~~~~~~~~np~~~~~~~~~n~i~~~~~gpgv~r~p~e~~s~~~yf~l~l~ 161 (179) T protein:vir:63 82 GWVNHAEGDKNPTDEAYRLMADVQVRLGELIAIDSSSGNPQYPSVYMLENLIAGMRAGPGVCRAPAEGASGRSYFYLPLN 161 (179) T ss_pred hhhccccCCCCCccHHHHHHHHHHHHHhhhhccccCCCCCCCcchHHHHHHHhhhccCCccccCchhhcccceeEEEEeE Confidence 9998 999999999999999999999999999998 6999999999999999999999999999999999999999999 Q ss_pred EEEEecCCCC Q lcl|NC_019417. 158 VGLKVDIRNP 167 (167) Q Consensus 158 l~i~ed~~~p 167 (167) |+|+||++|| T Consensus 162 l~i~~~~~dp 171 (179) T protein:vir:63 162 LKIANNTTDP 171 (179) T ss_pred EEEeccCCCc Confidence 9999999999 No 2 >protein:vir:79247 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469166;genbank:gi:157835008;genbank:GeneID:5648828 Probab=90.55 E-value=0.015 Score=30.57 Aligned_cols=136 Identities=15% Similarity=0.173 Sum_probs=70.3 Q ss_pred CchHHHHH---HHHHHHhh-ccccccceecccCCceeeccEEec--cCCCCccceeEeecccCCC----ccCCCCccCce Q lcl|NC_019417. 1 MSQRLDIL---KALTAHLE-QITIENGYAYDLKGKVYRGRDRFG--ADFTSKLPIVSILEAKATD----YGAFANEEQTV 70 (167) Q Consensus 1 ~s~rL~il---k~LTa~Le-~IT~aNGY~~Dla~~VfRGR~~fg--~n~~~~iP~v~IlE~p~~~----~~~~~~~~~~~ 70 (167) ||--++-+ +.|.+||+ ++. +|. +|+ |-.-+. .+..+.-|++=++....-. +...+...-+- T Consensus 1 ~~~~~d~~a~~~~IierLka~v~-------~l~-~V~-~aadla~i~e~~q~tPaayVv~~gd~~~~~~~~~~~~~~~Q~ 71 (157) T protein:vir:79 1 MSDPFDYLFLEPLLIERIRSEVP-------GLA-IVS-GVPDLAALSEQDQPAPSVYVVYLGDEIGTGADYQGGRRAIQA 71 (157) T ss_pred CCCchhhhhhhHHHHHHHHhhhh-------hhh-hhc-cccchhhhhhhcCCCcEEEEEecccccCCCcccccCcceeee Confidence 88887765 77777776 332 221 221 111000 1222456777776554422 22222222233 Q ss_pred eecceEEEE--eeeeecCCCC--CchHHHHHHHHHHHHHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhc Q lcl|NC_019417. 71 RMDDWVLLV--QGWVKDDPRN--PTDPAYELLAEVEKRLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGL 146 (167) Q Consensus 71 ~~~~w~llv--qG~V~dD~~~--PtDpA~~L~ADV~k~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgv 146 (167) .++.|.+.| +.+= ++++. =.|.|++|+..|+++|.+-.- ++...|+ .++.. +... . T Consensus 72 vtq~f~Vvlavrn~~-~~~~~~a~~d~ag~ll~~v~~AL~GW~P-~~~~~pl------------~~~~~-----~~~~-~ 131 (157) T protein:vir:79 72 IGQQWAVVLVVHYAD-SSNSGEGARREAGPLLGRLVKALTGWAP-AIDVAPL------------ARSAR-----QSPV-T 131 (157) T ss_pred eeeeEEEEEEEeccc-cccccchhHHHHHHHHHHHHHHhcCccc-cccCCce------------eeeec-----CCcc-c Confidence 466776543 2221 22232 345699999999999995533 3332221 12211 1111 1 Q ss_pred cceeEEEEEEEEEEEecCCCC Q lcl|NC_019417. 147 SDTAFFFLPVRVGLKVDIRNP 167 (167) Q Consensus 147 s~~A~F~l~v~l~i~ed~~~p 167 (167) ..--|+|.|+.|.+ +.--| T Consensus 132 y~~gf~yypl~F~~--~~~~~ 150 (157) T protein:vir:79 132 YASGYFYFPLVFTA--RFVYP 150 (157) T ss_pred ccCCeEEEEEEEEE--eeecc Confidence 44568889999988 67777 No 3 >protein:vir:99226 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950461;genbank:gi:119953662;genbank:GeneID:4643086 Probab=80.31 E-value=0.096 Score=26.16 Aligned_cols=137 Identities=17% Similarity=0.205 Sum_probs=67.3 Q ss_pred CchHHHHH---HHHHHHhh-ccccccceecccCCceeeccEEec--cCCCCccceeEeecccCCCcc-C---CCCccCce Q lcl|NC_019417. 1 MSQRLDIL---KALTAHLE-QITIENGYAYDLKGKVYRGRDRFG--ADFTSKLPIVSILEAKATDYG-A---FANEEQTV 70 (167) Q Consensus 1 ~s~rL~il---k~LTa~Le-~IT~aNGY~~Dla~~VfRGR~~fg--~n~~~~iP~v~IlE~p~~~~~-~---~~~~~~~~ 70 (167) ||--++-+ +.|.+||+ ++. +|. +| .|..-+. .+..+.-|++=++.....+.. . .+...-|- T Consensus 1 ~~~~~d~~a~~~~IierLka~vp-------~l~-~V-~~aadla~i~~~~q~tPaayVi~~gd~~~~~~~~~~~~~~~Q~ 71 (157) T protein:vir:99 1 MSDPFDYLFLEPLLIERIRSEVP-------GLA-IV-SGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADHQGGRRAIQA 71 (157) T ss_pred CCCchhhhhhhHHHHHHHHhhhh-------HHH-hh-hcccchHHHhhccCCCcEEEEEecccccCCCcccccccceeee Confidence 88777754 67777775 332 111 22 1111110 122355577777655543221 1 11111133 Q ss_pred eecceEEEE--eeeeec-CCCCCchHHHHHHHHHHHHHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhcc Q lcl|NC_019417. 71 RMDDWVLLV--QGWVKD-DPRNPTDPAYELLAEVEKRLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGLS 147 (167) Q Consensus 71 ~~~~w~llv--qG~V~d-D~~~PtDpA~~L~ADV~k~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgvs 147 (167) .++.|.+.| +-+=.. |-..=.|.|++|+..|+++|.+-.- |+...| |++.. ++.. -.. T Consensus 72 i~q~~~Vvlavr~~~~~~~g~~a~d~ag~ll~~v~~AL~GW~P-~~~~~p--------------l~~~~---~~~~-~~y 132 (157) T protein:vir:99 72 IGQQWAVVLVVHYADSSNSGEGARREAGPLLGRLVKALTGWAP-AIDVAP--------------LARSA---RQSP-VTY 132 (157) T ss_pred eeeeEEEEEEEeccccccccchhHHHHHHHHHHHHHHhcCCcC-cccCCc--------------eeeee---cCCc-ccc Confidence 456665533 111111 1122346699999999999995533 222222 22221 1111 124 Q ss_pred ceeEEEEEEEEEEEecCCCC Q lcl|NC_019417. 148 DTAFFFLPVRVGLKVDIRNP 167 (167) Q Consensus 148 ~~A~F~l~v~l~i~ed~~~p 167 (167) .--|+|.|+.|.+ +.--| T Consensus 133 ~~gf~yypl~F~~--~~~~~ 150 (157) T protein:vir:99 133 ASGYFYFPLVFTA--RFVYP 150 (157) T ss_pred cCceEEEEEEEEE--eeecc Confidence 4568888999888 66667 No 4 >protein:vir:3428 Length: 131 # NCBI annotation: tail component # Family: family:all:911 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040591;genbank:gi:9626255;genbank:GeneID:2703486 Probab=36.68 E-value=1.2 Score=20.23 Aligned_cols=125 Identities=18% Similarity=0.251 Sum_probs=76.3 Q ss_pred Cch---HHHHHHHHHHHhhccccccceecccCCceeeccEEeccCCCCccceeEeecccCCCccCCCCccCceeecceE- Q lcl|NC_019417. 1 MSQ---RLDILKALTAHLEQITIENGYAYDLKGKVYRGRDRFGADFTSKLPIVSILEAKATDYGAFANEEQTVRMDDWV- 76 (167) Q Consensus 1 ~s~---rL~ilk~LTa~Le~IT~aNGY~~Dla~~VfRGR~~fg~n~~~~iP~v~IlE~p~~~~~~~~~~~~~~~~~~w~- 76 (167) |+. |-.|++.|+++|.+|+ +|-||-.|.++ +++|.|++-=..+-..+..-. -+.|. T Consensus 1 ~~ht~IR~~Vid~L~~~l~~v~------------~fdG~P~fide--~ElPAVAV~l~d~~~~~~~ld------~~~w~A 60 (131) T protein:vir:34 1 MKHTELRAAVLDALEKHDTGAT------------FFDGRPAVFDE--ADFPAVAVYLTGAEYTGEELD------SDTWQA 60 (131) T ss_pred CchHHHHHHHHHHHhccCCceE------------EecCCceeecc--ccCcEEEEEeecCCCCcceec------CCeeEE Confidence 664 6677777777776642 56699888887 899998875333333333222 23343 Q ss_pred -EEEeeeeecCCCCCchHHHHHHHHHH-HHHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhccceeEEEE Q lcl|NC_019417. 77 -LLVQGWVKDDPRNPTDPAYELLAEVE-KRLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGLSDTAFFFL 154 (167) Q Consensus 77 -llvqG~V~dD~~~PtDpA~~L~ADV~-k~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgvs~~A~F~l 154 (167) |-|.=|.+- .+|-+.+-++|..+. -.+. + ...|.+.+..++..+...+==++. .++... T Consensus 61 ~LhI~iyLka--~~~ds~LD~~~E~~i~~v~~-----~---------~~~l~~l~~~~~~~gy~Y~rD~e~---~tW~sa 121 (131) T protein:vir:34 61 ELHIEVFLPA--QVPDSELDAWMESRIYPVMS-----D---------IPALSDLITSMVASGYDYRRDDDA---GLWSSA 121 (131) T ss_pred EEEEEEEeec--CCCHHHHHHHHHHHhHHHhh-----c---------chhhhhHhhhhhhccCCccccccc---ceEEEE Confidence 334555554 367777777777643 3332 1 234455566666666665544433 467778 Q ss_pred EEEEEEEecC Q lcl|NC_019417. 155 PVRVGLKVDI 164 (167) Q Consensus 155 ~v~l~i~ed~ 164 (167) -+++.|+=.| T Consensus 122 dL~y~ItY~~ 131 (131) T protein:vir:34 122 DLTYVITYEM 131 (131) T ss_pred EEEEEEEEeC Confidence 8888888888 No 5 >protein:vir:107704 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5121 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003903;genbank:gi:45686319;genbank:GeneID:2773044 Probab=21.58 E-value=1.6 Score=19.42 Aligned_cols=57 Identities=19% Similarity=0.218 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhccceeEEEEEE---EEEEEecCC---- Q lcl|NC_019417. 93 PAYELLAEVEKRLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGLSDTAFFFLPV---RVGLKVDIR---- 165 (167) Q Consensus 93 pA~~L~ADV~k~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgvs~~A~F~l~v---~l~i~ed~~---- 165 (167) --||+++.+|++|+.+.. + .++....+.+.||.+|=.=..+|++|- +..|.-|.+ T Consensus 1 ~hyE~~~a~r~~la~~~~-~-----------------lpVA~eNv~F~Pp~~G~~yLr~~~lpa~T~~~~L~~d~r~y~G 62 (132) T protein:vir:10 1 MHYELSAAARAAFLSKYR-D-----------------FPHYMENRNFTPPKDGGMWLRFNYIEGDTLYLSIDRKCKSYIA 62 (132) T ss_pred CchHHHHHHHHHHHhhhc-C-----------------CcEeecCCCcCCCCCCceEEEEEEccCCceeeeccCcCcEEEE Confidence 669999999999993222 1 245588999999999966667777764 233322221 Q ss_pred -------CC Q lcl|NC_019417. 166 -------NP 167 (167) Q Consensus 166 -------~p 167 (167) -| T Consensus 63 v~QI~Vv~p 71 (132) T protein:vir:10 63 IVQIGVVFP 71 (132) T ss_pred EEEEEEEec Confidence 11 No 6 >protein:vir:79571 Length: 137 # NCBI annotation: putative tail component # Family: family:all:911 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272522;genbank:gi:148609391;genbank:GeneID:5204407 Probab=18.73 E-value=3.1 Score=17.89 Aligned_cols=128 Identities=16% Similarity=0.217 Sum_probs=71.2 Q ss_pred Cch----HHHHHHHHHHHhhccccccceecccCCceeeccEEeccCCCCccceeEeecccCCCccCCCCccCceeecceE Q lcl|NC_019417. 1 MSQ----RLDILKALTAHLEQITIENGYAYDLKGKVYRGRDRFGADFTSKLPIVSILEAKATDYGAFANEEQTVRMDDWV 76 (167) Q Consensus 1 ~s~----rL~ilk~LTa~Le~IT~aNGY~~Dla~~VfRGR~~fg~n~~~~iP~v~IlE~p~~~~~~~~~~~~~~~~~~w~ 76 (167) |.. |-.|++.|+++|..| -.+|-||-.|.|. +++|.|++-=..+-..+..-.+.. |+-. T Consensus 5 M~iht~IR~~Vid~L~~~l~~~-----------~~ffdGrP~fiDe--~ElPAVAV~l~da~~~~~~ld~~~--W~A~-- 67 (137) T protein:vir:79 5 MNRHTQIRQVVLARLREQCGDS-----------ATFFDGLPAFVDA--QELPAVSVWLSDAQYTGKMTDEDD--WQAV-- 67 (137) T ss_pred hHHHHHHHHHHHHHHHhhcCCc-----------EEEeCCccceech--hhCcEEEEEeecCCCCcceecCCe--eEEE-- Confidence 655 445555555555442 2367899999998 899998875443333333322222 2222 Q ss_pred EEEeeeeecCCCCCchHHHHHHHH-HHHHHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhccceeEEEEE Q lcl|NC_019417. 77 LLVQGWVKDDPRNPTDPAYELLAE-VEKRLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGLSDTAFFFLP 155 (167) Q Consensus 77 llvqG~V~dD~~~PtDpA~~L~AD-V~k~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgvs~~A~F~l~ 155 (167) |-|.=|.+- ..|-+.+-+++.. |.-++. + ...|.+.+..++..+...+==++ ..++...- T Consensus 68 LhI~iyLka--~~~ds~LD~~~E~~I~~v~~-----~---------~~~l~~l~~~~~~~gY~Y~rD~e---~~tW~sad 128 (137) T protein:vir:79 68 LHIAVFIRA--QAPDSELDMWMESTIFPALN-----D---------VPALSGLIDTLIPLGFNYQRDNE---MATWAMAE 128 (137) T ss_pred EEEEEEeec--CCCHHHHHHHHHHHHHHhhc-----c---------hhhhhhHhhhhhcccCCcccccc---cceeEEEE Confidence 333445553 5666666677776 333444 1 22344455555555555443333 24577777 Q ss_pred EEEEEEecC Q lcl|NC_019417. 156 VRVGLKVDI 164 (167) Q Consensus 156 v~l~i~ed~ 164 (167) +++.|+=.+ T Consensus 129 L~y~ItYe~ 137 (137) T protein:vir:79 129 ITYQITYTN 137 (137) T ss_pred EEEEEEEcC Confidence 777777544 No 7 >protein:vir:99874 Length: 154 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164078;genbank:gi:56692610;genbank:GeneID:3192602 Probab=14.64 E-value=4.2 Score=17.17 Aligned_cols=132 Identities=14% Similarity=0.104 Sum_probs=55.9 Q ss_pred CchHHHH---HHHHHHHhhccccccceecccCCceeeccEEeccCCC-------CccceeEeecccCCCc-cCCCCccC- Q lcl|NC_019417. 1 MSQRLDI---LKALTAHLEQITIENGYAYDLKGKVYRGRDRFGADFT-------SKLPIVSILEAKATDY-GAFANEEQ- 68 (167) Q Consensus 1 ~s~rL~i---lk~LTa~Le~IT~aNGY~~Dla~~VfRGR~~fg~n~~-------~~iP~v~IlE~p~~~~-~~~~~~~~- 68 (167) |+--|.- +.-+.+||+.--+ .|+ |+.-..++. =+.|++-++-...... .......+ T Consensus 1 ~~~~~~~pfdl~~Vi~RLra~~p-----------~l~-~V~gaadlAal~~~~~~p~PaAyVlp~~d~~~~~~~~~~~g~ 68 (154) T protein:vir:99 1 MADGLCAPFDHNLVIERLRDQVK-----------VLK-HVGGAAELGTITQLRDFRTPAAYVLLAQETLSPKPAGHAGGA 68 (154) T ss_pred CCCCccCCcccHHHHHHHHHhCc-----------chh-hhhhhhhhhhhhhhcCCCCceEEEEecccccCCCCCCccccc Confidence 2211100 0112222221111 122 222222111 2566666665554332 22222222 Q ss_pred --ceeecceEEEEeeeeecC-C-CCCchHHHHHHHHHHHHHhhhhhcccccccc-ccc---hhcccceeeeeeecCCCCC Q lcl|NC_019417. 69 --TVRMDDWVLLVQGWVKDD-P-RNPTDPAYELLAEVEKRLAMLVAKDENGQPM-YPA---LYRLGGKIAKLTLAQPVVR 140 (167) Q Consensus 69 --~~~~~~w~llvqG~V~dD-~-~~PtDpA~~L~ADV~k~L~~~~~~d~~~~p~-~~~---~~~l~~~V~~lti~~~v~r 140 (167) |.....|-++|==-...| + ..=.|.+|.|+++|+++|.+- .|- .++ +.--+|.+.+++- T Consensus 69 ~~Q~i~~~f~Vvl~v~~~~d~~G~~a~d~l~~lr~~v~~AL~GW-------~P~~~~G~~pi~~~gG~l~d~~~------ 135 (154) T protein:vir:99 69 TRQMANVHFAITVAVRNYRDNKGVTAADDLRPVLGDVRKALIGW-------TPPGLAGARDCQLVQGQVVDYDA------ 135 (154) T ss_pred eeeeeeeEEEEEEEeeccCcccchhhHHHHHHHHHHHHHHHhCC-------CCCcccCCceeeecCcceeeccC------ Confidence 444555555442111122 2 245689999999999999944 442 222 2233566665542 Q ss_pred CchhhccceeEEE--EEEEEEEEecC Q lcl|NC_019417. 141 PPEDGLSDTAFFF--LPVRVGLKVDI 164 (167) Q Consensus 141 ppedgvs~~A~F~--l~v~l~i~ed~ 164 (167) +..|+- ..+.++|--+. T Consensus 136 -------g~l~y~~~F~~~~~lgr~~ 154 (154) T protein:vir:99 136 -------SVLIWTDLYQTQHAIGRTS 154 (154) T ss_pred -------cEEEEeeeeeeeeecCCCC Confidence 222322 23445555555 No 8 >protein:vir:79637 Length: 130 # NCBI annotation: gp41 # Family: family:all:5121 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285530;genbank:gi:148734513;genbank:GeneID:5219995 Probab=12.94 E-value=3.9 Score=17.32 Aligned_cols=58 Identities=19% Similarity=0.134 Sum_probs=37.0 Q ss_pred HHHHHHHHHHHHHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhccceeEEEEEEE---EEEEecCC---- Q lcl|NC_019417. 93 PAYELLAEVEKRLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGLSDTAFFFLPVR---VGLKVDIR---- 165 (167) Q Consensus 93 pA~~L~ADV~k~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgvs~~A~F~l~v~---l~i~ed~~---- 165 (167) --|||++..||.++...+ + +.++-...+.+.||.+|=.=..+|++|-. .+|.-|.+ T Consensus 1 ~~~e~~~aaR~~~~~~~~----------~-------~lpVA~ENv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~G 63 (130) T protein:vir:79 1 MHYELSVAARMALAQEYE----------S-------EYMIAYENVEFTPPKGGGIWLKYDYKEADTIIHDLKRKCISYIG 63 (130) T ss_pred CcchhhHHHHHHHHhhhh----------h-------hCceeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEE Confidence 669999999997763322 1 13566788899999998556777877743 22222211 Q ss_pred -------CC Q lcl|NC_019417. 166 -------NP 167 (167) Q Consensus 166 -------~p 167 (167) -| T Consensus 64 v~QI~VV~p 72 (130) T protein:vir:79 64 MVQIGIEFP 72 (130) T ss_pred EEEEEEEec Confidence 11 No 9 >protein:vir:103883 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938246;genbank:gi:38229151;genbank:GeneID:2648198 Probab=12.09 E-value=5.2 Score=16.63 Aligned_cols=136 Identities=17% Similarity=0.208 Sum_probs=60.5 Q ss_pred CchHHHH---HHHHHHHhh----ccccccceecccCCceeeccEEeccCCCCccceeEeecccCCC-cc---CCCCccCc Q lcl|NC_019417. 1 MSQRLDI---LKALTAHLE----QITIENGYAYDLKGKVYRGRDRFGADFTSKLPIVSILEAKATD-YG---AFANEEQT 69 (167) Q Consensus 1 ~s~rL~i---lk~LTa~Le----~IT~aNGY~~Dla~~VfRGR~~fg~n~~~~iP~v~IlE~p~~~-~~---~~~~~~~~ 69 (167) |+.-++- -+.|.+||+ ++-.. +=.-||+. | .... +--|++=++.....+ .. ..+...-+ T Consensus 3 ~~~~~n~lav~~~IieRLka~v~~lr~V-~~aadla~-i----~el~----q~tPaayV~~~g~~~~~~~~~~~~~~~~q 72 (159) T protein:vir:10 3 TAEPFDYLFLETLLVERIRAEVPGLQDV-SGVPDLAT-L----DEQR----QGSPCVYVVYLGDEIGTGASHQGGSRAIQ 72 (159) T ss_pred cccchhhhhhhHHHHHHHHhhhhHHHhh-hcccchHH-H----Hhhh----CCCcEEEEEecccccCCCcccccccceee Confidence 3333332 234444443 22111 11122221 1 0111 223666665443322 11 11222223 Q ss_pred eeecceEEEE-eeeeec--CCCCCchHHHHHHHHHHHHHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhc Q lcl|NC_019417. 70 VRMDDWVLLV-QGWVKD--DPRNPTDPAYELLAEVEKRLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGL 146 (167) Q Consensus 70 ~~~~~w~llv-qG~V~d--D~~~PtDpA~~L~ADV~k~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgv 146 (167) -.++.|.+.| --.--+ |-..=.|.|.+|+..|+++|.+-.- |+.--| |+.+...+ +.. T Consensus 73 ~v~q~w~Vvlavr~~~~q~~~~a~~d~aG~ll~~v~~AL~GW~P-~~~~~P--------------l~r~~~~~---~~~- 133 (159) T protein:vir:10 73 TVTQHWAAVLTLYYADAQGDGQGARREAGPLLGRLLKALTGWVP-DQGVTP--------------LARSPQAS---PVS- 133 (159) T ss_pred eeeeEEEEEEEEecccccCccchhhHHHHHHHHHHHHHhcCccc-CCcCCC--------------eeecccCC---Ccc- Confidence 3356665543 111112 2334456799999999999996554 211111 23222111 112 Q ss_pred cceeEEEEEEEEEEEecCCCC Q lcl|NC_019417. 147 SDTAFFFLPVRVGLKVDIRNP 167 (167) Q Consensus 147 s~~A~F~l~v~l~i~ed~~~p 167 (167) ..--|+|.|+.|.+ +.-=| T Consensus 134 y~~gfayyPl~F~~--~~~~~ 152 (159) T protein:vir:10 134 YSNGFFYFPLVFTA--NFVFP 152 (159) T ss_pred ccCCEEEeeeeEEe--eeecc Confidence 33568888999988 55556 No 10 >protein:vir:397 Length: 132 # NCBI annotation: gp12 # Family: family:all:911 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046907;genbank:gi:9630477;genbank:GeneID:1261651 Probab=10.75 E-value=6 Score=16.32 Aligned_cols=126 Identities=17% Similarity=0.240 Sum_probs=74.4 Q ss_pred Cch---HHHHHHHHHHHhhccccccceecccCCceeeccEEeccCCCCccceeEeecccCCCccCCCCccCceeecceE- Q lcl|NC_019417. 1 MSQ---RLDILKALTAHLEQITIENGYAYDLKGKVYRGRDRFGADFTSKLPIVSILEAKATDYGAFANEEQTVRMDDWV- 76 (167) Q Consensus 1 ~s~---rL~ilk~LTa~Le~IT~aNGY~~Dla~~VfRGR~~fg~n~~~~iP~v~IlE~p~~~~~~~~~~~~~~~~~~w~- 76 (167) |+. |-.|++.|+++|..+ -.+|-||=.|.++ +++|.|++-=..+-..+..-.+ +.|. T Consensus 1 ~~ht~IR~~Vid~L~~~l~~~-----------~~ffdGrP~fiDe--~elPAVAV~l~d~~~~~~~ld~------~~w~A 61 (132) T protein:vir:39 1 MKHRDIRKVIIDALESAIGTD-----------AIYFDGRPAVLEE--GDFPAVAVYLTDAEYTGEELDA------DTWQA 61 (132) T ss_pred CchHHHHHHHHHHHHhhCCCc-----------eEEecCcceeecc--ccCcEEEEEeecCCCCcceecC------CeeEE Confidence 664 667888888888553 3468899999997 8999988753333333332222 2343 Q ss_pred -EEEeeeeecCCCCCchHHHHHHHHHHH-HHhhhhhccccccccccchhcccceeeeeeecCCCCCCchhhccceeEEEE Q lcl|NC_019417. 77 -LLVQGWVKDDPRNPTDPAYELLAEVEK-RLAMLVAKDENGQPMYPALYRLGGKIAKLTLAQPVVRPPEDGLSDTAFFFL 154 (167) Q Consensus 77 -llvqG~V~dD~~~PtDpA~~L~ADV~k-~L~~~~~~d~~~~p~~~~~~~l~~~V~~lti~~~v~rppedgvs~~A~F~l 154 (167) |-|.=|.+- .+|-+.+-+++..+.. .+. + .-+|++.+..+...+...+==++ ..++... T Consensus 62 ~LhI~iyLka--~~~ds~LD~~aE~~i~p~i~-----~---------~~~l~~l~~~~~~~gy~Y~rD~~---~atW~sa 122 (132) T protein:vir:39 62 ILHIEVFLEA--QVPDSELDDWMETRVYPVLA-----E---------VPGLESLITTMVQQGYDYQRDDD---MALWSSA 122 (132) T ss_pred EEEEEEEeec--CCCHHHHHHHHHHHhHhhhc-----c---------cchhhhHhhhhhhcCCCcccccc---cceEEEE Confidence 334445554 4575555555554522 222 1 11355555555555444443333 3568888 Q ss_pred EEEEEEEecC Q lcl|NC_019417. 155 PVRVGLKVDI 164 (167) Q Consensus 155 ~v~l~i~ed~ 164 (167) -++..|+=.| T Consensus 123 dL~y~ItY~~ 132 (132) T protein:vir:39 123 DLKYSITYDM 132 (132) T ss_pred EEEEEEEEeC Confidence 8888998888 Done!