Query lcl|NC_020858.1_cdsid_YP_007675398.1 [gene=SUAG_00006] [protein=hypothetical protein] [protein_id=YP_007675398.1] [location=complement(3849..4478)] Match_columns 209 No_of_seqs 37 out of 39 Neff 5.1 Searched_HMMs 1612 Date Thu Nov 7 16:15:10 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_6 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_6_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:8841 Length: 208 # 100.0 1.1E-75 6.7E-79 431.6 20.6 192 17-209 1-208 (208) 2 protein:vir:823 Length: 216 # 95.1 0.0028 1.7E-06 34.6 13.8 180 20-207 1-216 (216) 3 protein:vir:3302 Length: 216 # 95.1 0.0028 1.7E-06 34.6 13.8 180 20-207 1-216 (216) 4 protein:vir:2779 Length: 216 # 95.1 0.0028 1.7E-06 34.6 13.8 180 20-207 1-216 (216) 5 protein:vir:105573 Length: 225 93.2 0.0085 5.3E-06 31.9 14.9 188 21-209 1-225 (225) 6 protein:vir:351 Length: 242 # 90.6 0.02 1.2E-05 29.9 11.7 184 18-209 1-227 (242) 7 protein:vir:5111 Length: 234 # 85.7 0.051 3.1E-05 27.7 12.1 180 20-200 1-234 (234) 8 protein:vir:104383 Length: 166 64.2 0.3 0.00019 23.4 12.0 145 42-209 1-155 (166) 9 protein:vir:103760 Length: 207 48.3 0.67 0.00042 21.5 12.3 169 20-209 1-203 (207) 10 protein:vir:10130 Length: 223 14.3 4.3 0.0027 17.1 10.8 181 1-209 21-212 (223) No 1 >protein:vir:8841 Length: 208 # NCBI annotation: constituent protein # Family: family:all:6621 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775249;genbank:gi:27476047;genbank:GeneID:2700595 Probab=100.00 E-value=1.1e-75 Score=431.59 Aligned_cols=192 Identities=27% Similarity=0.424 Sum_probs=180.0 Q ss_pred cccccCHHHHHHHHHHHhcccChhH-HHHHHHHHHHHHHhhhhcccccceEEEEEecCCeeecChhhhhhhhhcccC--- Q lcl|NC_020858. 17 TGAFRDFLDLRTAVIEYVQRPDVAD-VFTRFVALAESRLNRTLRMREQVKTVTLTIASGVAALPEDFAEMIGLYAPG--- 92 (209) Q Consensus 17 ~~ai~~Y~~L~~av~~w~~R~Dlt~-~ip~FI~lAE~~lnR~LR~~~mE~~~tl~i~~g~~~LP~Dfle~r~i~~~~--- 92 (209) .++|+||++||+++++|++|+|+|+ +||+||++||+||||+||+|+|||+++|++++|+++||+||+|+|+|+..+ T Consensus 1 ~~~i~~Y~~L~~~~~~~l~R~Dlt~~~~p~FI~lAE~~inr~Lr~~~~e~~vtl~~~~g~~~LP~Df~e~r~i~~~~~~~ 80 (208) T protein:vir:88 1 MATINNVTDLAIAAIQWSDRQDLTQELLMLFIGNTTDRLNRLLRVRENEHFETLMAFGGGIEIPEHFVALRSITGDALIG 80 (208) T ss_pred CCCcCCHHHHHHHHHHhhccccchhhhhhHHHHHHHHHHHHHhcccccceEEEEEecCceeeCChhHHHHHHHhhcCCCC Confidence 8999999999999999999999995 799999999999999999999999999999999999999999999999654 Q ss_pred Cceeeccchhhcc---------CCceEEEEEcCEEE---ecCCCceEEEEeeecCcccCCCCCcchHHHHhchHHHHHHH Q lcl|NC_020858. 93 GGEYVQQTSQQVN---------AGGCYYAIEGGNIV---APHIAGDVEVSYYAMLPPLSASTTTTNWLLNRYPDIYLYGV 160 (209) Q Consensus 93 g~~~~~~~~~~~~---------~~p~~yti~g~~i~---~P~~~~~v~l~YY~~iP~LS~D~~~sNWLL~~~PdiYLyg~ 160 (209) |+.+..++.+.++ |+|+||+|+|++|+ +|+++.+++|+||++||+|| |+|||||||++|||+||||+ T Consensus 81 g~~l~~v~~~~~~~~~~~~~~~g~p~~yti~g~~i~~~P~p~~~~~~~i~Yy~~iP~LS-~~n~tNwLl~~aPD~YLy~~ 159 (208) T protein:vir:88 81 GRTLQYITQDIFSHYVNYNYQPQGVTYYTRLGNFWRVYPVVPDGAPFIVNYWTVLPELS-LDNPTTWALTKYPQIYLYGV 159 (208) T ss_pred CceecccCHHHhhhhhhhccCccccceEEEEcCeEEEeeecCCCceEEEEeeeccCCCC-CCCchhHHhhcchhHHHHHH Confidence 4666666655543 67899999999997 46888889999999999999 78999999999999999999 Q ss_pred HHHHHHhhcChhHHHHHHHHHHHHHHHHHHhhhhhhcCCCceeeCCCCC Q lcl|NC_020858. 161 GFEAAKYVRDAELAQASKFMLDDAIATARSDDFDARYSRARVRVSGVTP 209 (209) Q Consensus 161 l~ea~~f~~D~er~~~w~~~~~~al~~l~~~d~~ar~~~s~l~v~g~Tp 209 (209) |+|||+|+||++|++.|+|+++++|++|+++|+++||+|++|+|||+.- T Consensus 160 L~ea~~f~~D~~ra~~~~~~~~~al~~L~~~d~~~~~~~~~l~i~~~~~ 208 (208) T protein:vir:88 160 LEQIYLYTMDEERSMFWGQKLERAVMELQNEENAADFASSRLAIKDIER 208 (208) T ss_pred HHHHHhcccCHhHHHHHHHHHHHHHHHHHhhhhhHHhcCCceeeecccC Confidence 9999999999999999999999999999999999999999999999999 No 2 >protein:vir:823 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050556;genbank:gi:9633453;genbank:GeneID:1262282 Probab=95.09 E-value=0.0028 Score=34.58 Aligned_cols=180 Identities=13% Similarity=0.136 Sum_probs=102.0 Q ss_pred ccCHHHHHHHHHHHhcccC---hh-HHHHHHHHHHHHHHhhhhcc-cccceEEEEEecCCe-eecChhhhhhhhhccc-C Q lcl|NC_020858. 20 FRDFLDLRTAVIEYVQRPD---VA-DVFTRFVALAESRLNRTLRM-REQVKTVTLTIASGV-AALPEDFAEMIGLYAP-G 92 (209) Q Consensus 20 i~~Y~~L~~av~~w~~R~D---lt-~~ip~FI~lAE~~lnR~LR~-~~mE~~~tl~i~~g~-~~LP~Dfle~r~i~~~-~ 92 (209) ..|-++|..-+..=+.=.+ -+ ..+-+|++=|-..+= ||- .--.+..++.+..|+ -.||.|-+.++.+... . T Consensus 1 ~~t~~~lI~r~~~~l~D~~~~rW~~~el~~~lNdAv~e~~--l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~v~r~~~ 78 (216) T protein:vir:82 1 MTTITEIIGRVNTQLVDPMMVRWPLQELCDYYNDAVRAVI--LARPDAGASLETISCVPGARQVLPDGVIQLLDVICLSD 78 (216) T ss_pred CccHHHHHHHHHHhhhcccccccChHHHHHHHHHHHHHHH--hhcCCCCcceeeEeccccccccccchhhhhhhhhhhCC Confidence 7777777776665443322 00 114455555544332 332 335677889999999 5699999988877543 3 Q ss_pred Cceeeccch-----------hhccCCceEEEEEcCEE----Ee--cCCCceEEEEeeecCc---ccCCCCCcchHHHHhc Q lcl|NC_020858. 93 GGEYVQQTS-----------QQVNAGGCYYAIEGGNI----VA--PHIAGDVEVSYYAMLP---PLSASTTTTNWLLNRY 152 (209) Q Consensus 93 g~~~~~~~~-----------~~~~~~p~~yti~g~~i----~~--P~~~~~v~l~YY~~iP---~LS~D~~~sNWLL~~~ 152 (209) |+ -+++.+ +...|-|.+|+--.... +. |..++.|++.||.... .++.|.+.. . .- T Consensus 79 g~-a~~~vsre~LD~~~P~W~~~~g~p~~~i~de~~pr~f~l~P~p~~~~~vel~~~r~P~a~~~~~~~dd~~---~-~i 153 (216) T protein:vir:82 79 GS-AVRPLSREVLDAQYPEWPTMKGIPECFISNDLSPRVFWLFPAPDKEISIDAVVSRIPEAVYVLTQDDDTP---V-PL 153 (216) T ss_pred CC-ceeeecHHHhcccCCCcCCCCCCceEEEecCCCceEEEEeccCCCCcEEEEEEEecCcchhhccCCCCCC---C-Cc Confidence 43 233322 22346688775544332 23 5667889999997642 344333222 1 13 Q ss_pred hHHHHHHHHHHHH--Hhh------cChhHHHHHHHHHHHHHHHHHHhhhhhhcCCC-ceeeCCC Q lcl|NC_020858. 153 PDIYLYGVGFEAA--KYV------RDAELAQASKFMLDDAIATARSDDFDARYSRA-RVRVSGV 207 (209) Q Consensus 153 PdiYLyg~l~ea~--~f~------~D~er~~~w~~~~~~al~~l~~~d~~ar~~~s-~l~v~g~ 207 (209) |++|.=.++.+++ .|. .|.+|++..-|+|.++++.-+..|. +|..+- +-.=+|+ T Consensus 154 ~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~-~~~~r~~~~~~~~~ 216 (216) T protein:vir:82 154 EEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADS-ALYARKKVFNGGGV 216 (216) T ss_pred chhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHh-HHHHHHhhccCCCC Confidence 5666655555543 244 4566999999999999997766654 221111 1111244 No 3 >protein:vir:3302 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049518;genbank:gi:9632524;genbank:GeneID:1262013 Probab=95.09 E-value=0.0028 Score=34.58 Aligned_cols=180 Identities=13% Similarity=0.136 Sum_probs=102.0 Q ss_pred ccCHHHHHHHHHHHhcccC---hh-HHHHHHHHHHHHHHhhhhcc-cccceEEEEEecCCe-eecChhhhhhhhhccc-C Q lcl|NC_020858. 20 FRDFLDLRTAVIEYVQRPD---VA-DVFTRFVALAESRLNRTLRM-REQVKTVTLTIASGV-AALPEDFAEMIGLYAP-G 92 (209) Q Consensus 20 i~~Y~~L~~av~~w~~R~D---lt-~~ip~FI~lAE~~lnR~LR~-~~mE~~~tl~i~~g~-~~LP~Dfle~r~i~~~-~ 92 (209) ..|-++|..-+..=+.=.+ -+ ..+-+|++=|-..+= ||- .--.+..++.+..|+ -.||.|-+.++.+... . T Consensus 1 ~~t~~~lI~r~~~~l~D~~~~rW~~~el~~~lNdAv~e~~--l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~v~r~~~ 78 (216) T protein:vir:33 1 MTTITEIIGRVNTQLVDPMMVRWPLQELCDYYNDAVRAVI--LARPDAGASLETISCVPGARQVLPDGVIQLLDVICLSD 78 (216) T ss_pred CccHHHHHHHHHHhhhcccccccChHHHHHHHHHHHHHHH--hhcCCCCcceeeEeccccccccccchhhhhhhhhhhCC Confidence 7777777776665443322 00 114455555544332 332 335677889999999 5699999988877543 3 Q ss_pred Cceeeccch-----------hhccCCceEEEEEcCEE----Ee--cCCCceEEEEeeecCc---ccCCCCCcchHHHHhc Q lcl|NC_020858. 93 GGEYVQQTS-----------QQVNAGGCYYAIEGGNI----VA--PHIAGDVEVSYYAMLP---PLSASTTTTNWLLNRY 152 (209) Q Consensus 93 g~~~~~~~~-----------~~~~~~p~~yti~g~~i----~~--P~~~~~v~l~YY~~iP---~LS~D~~~sNWLL~~~ 152 (209) |+ -+++.+ +...|-|.+|+--.... +. |..++.|++.||.... .++.|.+.. . .- T Consensus 79 g~-a~~~vsre~LD~~~P~W~~~~g~p~~~i~de~~pr~f~l~P~p~~~~~vel~~~r~P~a~~~~~~~dd~~---~-~i 153 (216) T protein:vir:33 79 GS-AVRPLSREVLDAQYPEWPTMKGIPECFISNDLSPRVFWLFPAPDKEISIDAVVSRIPEAVYVLTQDDDTP---V-PL 153 (216) T ss_pred CC-ceeeecHHHhcccCCCcCCCCCCceEEEecCCCceEEEEeccCCCCcEEEEEEEecCcchhhccCCCCCC---C-Cc Confidence 43 233322 22346688775544332 23 5667889999997642 344333222 1 13 Q ss_pred hHHHHHHHHHHHH--Hhh------cChhHHHHHHHHHHHHHHHHHHhhhhhhcCCC-ceeeCCC Q lcl|NC_020858. 153 PDIYLYGVGFEAA--KYV------RDAELAQASKFMLDDAIATARSDDFDARYSRA-RVRVSGV 207 (209) Q Consensus 153 PdiYLyg~l~ea~--~f~------~D~er~~~w~~~~~~al~~l~~~d~~ar~~~s-~l~v~g~ 207 (209) |++|.=.++.+++ .|. .|.+|++..-|+|.++++.-+..|. +|..+- +-.=+|+ T Consensus 154 ~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~-~~~~r~~~~~~~~~ 216 (216) T protein:vir:33 154 EEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADS-ALYARKKVFNGGGV 216 (216) T ss_pred chhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHh-HHHHHHhhccCCCC Confidence 5666655555543 244 4566999999999999997766654 221111 1111244 No 4 >protein:vir:2779 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612896;genbank:gi:20065813;genbank:GeneID:935638 Probab=95.09 E-value=0.0028 Score=34.58 Aligned_cols=180 Identities=13% Similarity=0.136 Sum_probs=102.0 Q ss_pred ccCHHHHHHHHHHHhcccC---hh-HHHHHHHHHHHHHHhhhhcc-cccceEEEEEecCCe-eecChhhhhhhhhccc-C Q lcl|NC_020858. 20 FRDFLDLRTAVIEYVQRPD---VA-DVFTRFVALAESRLNRTLRM-REQVKTVTLTIASGV-AALPEDFAEMIGLYAP-G 92 (209) Q Consensus 20 i~~Y~~L~~av~~w~~R~D---lt-~~ip~FI~lAE~~lnR~LR~-~~mE~~~tl~i~~g~-~~LP~Dfle~r~i~~~-~ 92 (209) ..|-++|..-+..=+.=.+ -+ ..+-+|++=|-..+= ||- .--.+..++.+..|+ -.||.|-+.++.+... . T Consensus 1 ~~t~~~lI~r~~~~l~D~~~~rW~~~el~~~lNdAv~e~~--l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~v~r~~~ 78 (216) T protein:vir:27 1 MTTITEIIGRVNTQLVDPMMVRWPLQELCDYYNDAVRAVI--LARPDAGASLETISCVPGARQVLPDGVIQLLDVICLSD 78 (216) T ss_pred CccHHHHHHHHHHhhhcccccccChHHHHHHHHHHHHHHH--hhcCCCCcceeeEeccccccccccchhhhhhhhhhhCC Confidence 7777777776665443322 00 114455555544332 332 335677889999999 5699999988877543 3 Q ss_pred Cceeeccch-----------hhccCCceEEEEEcCEE----Ee--cCCCceEEEEeeecCc---ccCCCCCcchHHHHhc Q lcl|NC_020858. 93 GGEYVQQTS-----------QQVNAGGCYYAIEGGNI----VA--PHIAGDVEVSYYAMLP---PLSASTTTTNWLLNRY 152 (209) Q Consensus 93 g~~~~~~~~-----------~~~~~~p~~yti~g~~i----~~--P~~~~~v~l~YY~~iP---~LS~D~~~sNWLL~~~ 152 (209) |+ -+++.+ +...|-|.+|+--.... +. |..++.|++.||.... .++.|.+.. . .- T Consensus 79 g~-a~~~vsre~LD~~~P~W~~~~g~p~~~i~de~~pr~f~l~P~p~~~~~vel~~~r~P~a~~~~~~~dd~~---~-~i 153 (216) T protein:vir:27 79 GS-AVRPLSREVLDAQYPEWPTMKGIPECFISNDLSPRVFWLFPAPDKEISIDAVVSRIPEAVYVLTQDDDTP---V-PL 153 (216) T ss_pred CC-ceeeecHHHhcccCCCcCCCCCCceEEEecCCCceEEEEeccCCCCcEEEEEEEecCcchhhccCCCCCC---C-Cc Confidence 43 233322 22346688775544332 23 5667889999997642 344333222 1 13 Q ss_pred hHHHHHHHHHHHH--Hhh------cChhHHHHHHHHHHHHHHHHHHhhhhhhcCCC-ceeeCCC Q lcl|NC_020858. 153 PDIYLYGVGFEAA--KYV------RDAELAQASKFMLDDAIATARSDDFDARYSRA-RVRVSGV 207 (209) Q Consensus 153 PdiYLyg~l~ea~--~f~------~D~er~~~w~~~~~~al~~l~~~d~~ar~~~s-~l~v~g~ 207 (209) |++|.=.++.+++ .|. .|.+|++..-|+|.++++.-+..|. +|..+- +-.=+|+ T Consensus 154 ~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~-~~~~r~~~~~~~~~ 216 (216) T protein:vir:27 154 EEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADS-ALYARKKVFNGGGV 216 (216) T ss_pred chhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHh-HHHHHHhhccCCCC Confidence 5666655555543 244 4566999999999999997766654 221111 1111244 No 5 >protein:vir:105573 Length: 225 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164310;genbank:gi:56692957;genbank:GeneID:3197184 Probab=93.17 E-value=0.0085 Score=31.93 Aligned_cols=188 Identities=11% Similarity=0.026 Sum_probs=107.2 Q ss_pred cCHHHHHHHHHHHhccc-C---hh-HHHHHHHHHHHHHHh---hhhcccccceEEEEEecCCee-ecChhhhhhhhh-c- Q lcl|NC_020858. 21 RDFLDLRTAVIEYVQRP-D---VA-DVFTRFVALAESRLN---RTLRMREQVKTVTLTIASGVA-ALPEDFAEMIGL-Y- 89 (209) Q Consensus 21 ~~Y~~L~~av~~w~~R~-D---lt-~~ip~FI~lAE~~ln---R~LR~~~mE~~~tl~i~~g~~-~LP~Dfle~r~i-~- 89 (209) -|-++|...+..=++=. + -+ ..+-+|++=|-..+= |+||=..--+..++.+..|+. .+|.+-+.++.+ + T Consensus 1 m~~~~lI~r~~~~l~D~~~~~rW~~~el~~~lNdAv~e~~~r~rL~rpda~~~~~~i~l~~Gt~q~~~~~~~~~I~~~~~ 80 (225) T protein:vir:10 1 MTLADLIRRVRTDANDMVEPYFWSDQDVADWLNDAVREAAVRGRLIHESQADAVCRIEVVAGTAVYQLHASLYELSHLGF 80 (225) T ss_pred CCHHHHHHHHHHHhccccccccCChHHHHHHHHHHHHHHHHhcccccccCCCceeeeeecCccccccCchHHHHHHHHhh Confidence 45566666555544321 1 00 125577777665554 355655556778899999985 677775553333 3 Q ss_pred -ccCCceeeccc--h-----------hhccCCceEEEEEcCEEE-e--cCCCceEEEEeeecCc---ccCCCCCc-chHH Q lcl|NC_020858. 90 -APGGGEYVQQT--S-----------QQVNAGGCYYAIEGGNIV-A--PHIAGDVEVSYYAMLP---PLSASTTT-TNWL 148 (209) Q Consensus 90 -~~~g~~~~~~~--~-----------~~~~~~p~~yti~g~~i~-~--P~~~~~v~l~YY~~iP---~LS~D~~~-sNWL 148 (209) ..+|.+...+. + +...|.|.||+-.-++++ + |..++.++|.||...- +.+ +.+. .=++ T Consensus 81 ~~~~~~~~~~~~~~s~e~LD~~~P~W~~~tg~p~~~~~d~~~~~l~P~p~~~~~vel~~~r~P~~~~~~~-~~D~~~p~i 159 (225) T protein:vir:10 81 YPADMSRPTMPVLKSAEVLDVELPEWRACTGKPLYAIQGDTSLRLVPTPDRAGILRVEGYRTPLADMALA-DKDTAQPEI 159 (225) T ss_pred cCcccCCceecccccHHHhcccCCCcccCCCCceEEEeCCcEEEEEecCCCceEEEEEEEeecchhhhcc-ccccccCcc Confidence 34444432211 1 233577888776555565 3 5667889999998731 122 1111 1233 Q ss_pred HHhchHHHHHHHHHHHH----HhhcChhHHHHHHHHHHHHHHHHHHhhh-hhhcCCCceeeCCCCC Q lcl|NC_020858. 149 LNRYPDIYLYGVGFEAA----KYVRDAELAQASKFMLDDAIATARSDDF-DARYSRARVRVSGVTP 209 (209) Q Consensus 149 L~~~PdiYLyg~l~ea~----~f~~D~er~~~w~~~~~~al~~l~~~d~-~ar~~~s~l~v~g~Tp 209 (209) .+.|=...+.=+|+-|+ ....|.++++..-|+|+++++.-+..|. +..=.+.+-..++.=| T Consensus 160 ~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~alG~k~~ad~~r~~r~~~p~~~~~~~~ 225 (225) T protein:vir:10 160 HAEHHRHLVQWALYRGFSIPDMESFDPNRAALAEAAFTAYFGERPDSDLRRITREDVPHHVEAFWP 225 (225) T ss_pred chhhHHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchhHHHHHhccccCcccccccCC Confidence 33333333333333332 2345678999999999999988776553 2333566677777777 No 6 >protein:vir:351 Length: 242 # NCBI annotation: hypothetical protein # Family: family:all:3196 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203465;genbank:gi:15320621;genbank:GeneID:921727 Probab=90.61 E-value=0.02 Score=29.90 Aligned_cols=184 Identities=15% Similarity=0.117 Sum_probs=88.9 Q ss_pred ccccCHHHHHHHHHHH--hcccCh------hH-H---HHHHHHHHHHHHhhhhcccccceEEEEEecCCe--eecChhhh Q lcl|NC_020858. 18 GAFRDFLDLRTAVIEY--VQRPDV------AD-V---FTRFVALAESRLNRTLRMREQVKTVTLTIASGV--AALPEDFA 83 (209) Q Consensus 18 ~ai~~Y~~L~~av~~w--~~R~Dl------t~-~---ip~FI~lAE~~lnR~LR~~~mE~~~tl~i~~g~--~~LP~Dfl 83 (209) -|-.|-.++...|..= |.+.+- || . +-.+.+.+-.++=|..-=+++-+..+++...|. -+||+||. T Consensus 1 ~~~~t~lsiin~v~~~i~L~~~~~a~v~sstD~~~~~l~ala~~~g~eia~~~dW~~l~~~~~~t~~~~~~~y~lP~D~~ 80 (242) T protein:vir:35 1 MAWDTAASIINDAAVELGLLATDVADPYASADVNLVQLCRLLKSLGQDMVRDYQWTHLQQQWTFATQVGLANYEMPPDYN 80 (242) T ss_pred CchHHHHHHHHHHHHHhcccCCccccccccchHHHHHHHHHHHHHHHHHHHhcCCcchheeeeecccccCCCCCcchhhH Confidence 2333333333333333 333331 22 1 334444444555555555667777777665444 58999999 Q ss_pred hhhhhcc---cCCceeecc---------chhhccCC-ceEEEEEcCEEEe---cCCCceEEEEeeecCcccC-CCCCcch Q lcl|NC_020858. 84 EMIGLYA---PGGGEYVQQ---------TSQQVNAG-GCYYAIEGGNIVA---PHIAGDVEVSYYAMLPPLS-ASTTTTN 146 (209) Q Consensus 84 e~r~i~~---~~g~~~~~~---------~~~~~~~~-p~~yti~g~~i~~---P~~~~~v~l~YY~~iP~LS-~D~~~sN 146 (209) .|..=+. ...+.+.-+ ......+. +.+|.|+|++|.. |.++.++.+.|+.+-.=.+ ....+.+ T Consensus 81 R~v~~~~w~rt~~~p~~gP~s~~~W~~l~~~~sa~~~~~~~ri~ggqi~~~P~paa~~~~~f~YiSknWv~~~~~~~k~~ 160 (242) T protein:vir:35 81 RFVDQTGWNRTQRMPLLGPLSAQGWQLLQVLTSAGTVDVMYRLVGGEFVLHPTPESVADIAYEYVSSHWVGTGGSETPNA 160 (242) T ss_pred HhhcCcccceeecceecCCcChhhhhhhhhhccCCCCCceEEEEcCEEEeecCcccccceeEeeecCccccCCCCccccc Confidence 9873111 111111111 11111223 4689999999973 5778999999999987444 3333356 Q ss_pred HHHHhc------hHHHHHHHHHHHHHhhcChhHHHH--HHHHHHHHHHHHHHhhhhhhcCCCc-eee-C--CCCC Q lcl|NC_020858. 147 WLLNRY------PDIYLYGVGFEAAKYVRDAELAQA--SKFMLDDAIATARSDDFDARYSRAR-VRV-S--GVTP 209 (209) Q Consensus 147 WLL~~~------PdiYLyg~l~ea~~f~~D~er~~~--w~~~~~~al~~l~~~d~~ar~~~s~-l~v-~--g~Tp 209 (209) |-.+.. |+=.|=-.| +-.|.+- .|..+ -...|+.+|+... .|=+|++ |.+ + +-|| T Consensus 161 ~t~~ad~Dt~~l~erLL~LGl--IWRWkra-KGldy~e~l~~YE~aL~~~~-----~~d~G~~~l~l~~~~~~~p 227 (242) T protein:vir:35 161 DAPESGGDTLFFDRRLLVCGL--KLRWQRA-KGFDSTACQDDYDKALERAQ-----GGDGAAPVLSLNRRPFATN 227 (242) T ss_pred cccccCCCceechHHHHhHhH--HHHHHhh-cCCCHHHHHHHHHHHHHHHH-----hhcCCCceecCCCCccCCc Confidence 875553 333332222 2233321 11111 1122444444432 2333333 333 2 3577 No 7 >protein:vir:5111 Length: 234 # NCBI annotation: unknown # Family: family:all:1423 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542268;genbank:gi:18071240;genbank:GeneID:929347 Probab=85.72 E-value=0.051 Score=27.68 Aligned_cols=180 Identities=12% Similarity=0.114 Sum_probs=96.2 Q ss_pred ccCHHHHHHHHHHHhcccC---hh-HHHHHHHHHHHHHHhhhhcccccceEEEEEecCCe-eecChhh-----hhhhhhc Q lcl|NC_020858. 20 FRDFLDLRTAVIEYVQRPD---VA-DVFTRFVALAESRLNRTLRMREQVKTVTLTIASGV-AALPEDF-----AEMIGLY 89 (209) Q Consensus 20 i~~Y~~L~~av~~w~~R~D---lt-~~ip~FI~lAE~~lnR~LR~~~mE~~~tl~i~~g~-~~LP~Df-----le~r~i~ 89 (209) ..|-++|...+..=++=.+ -+ ..+-+|++-|-..+= +.|=.--.+..+|.+..|+ ..||.|+ ++++.|. T Consensus 1 m~t~~~lI~r~~~~l~D~~~~rW~~~el~~~lNdAv~e~~-l~rp~a~~~~~~i~l~~Gt~q~lP~d~~~~~~l~li~i~ 79 (234) T protein:vir:51 1 MPKASEIMRLAGIQLLDEDHIRWPLIELADWVNEGVKAIV-LAKPSASSKSAAIQLVKGTHQTLPGTIDGKATLQLIGIN 79 (234) T ss_pred CccHHHHHHHHHHHhccccccccChHHHHHHHHHHHHHHH-hhcCCCCccceeEeeccCCccccccccccchheehhhhh Confidence 5567777776665554321 00 124456665554443 2244556778889999997 8999883 5666664 Q ss_pred c---c-----CCceeeccch-----------hhccCCce-------EEEEEcCEE--Ee--cCCCceEEEEeeecCccc- Q lcl|NC_020858. 90 A---P-----GGGEYVQQTS-----------QQVNAGGC-------YYAIEGGNI--VA--PHIAGDVEVSYYAMLPPL- 138 (209) Q Consensus 90 ~---~-----~g~~~~~~~~-----------~~~~~~p~-------~yti~g~~i--~~--P~~~~~v~l~YY~~iP~L- 138 (209) . + .+++-+++.+ +...|.|. -|...+... +. |..++.|++.||...+++ T Consensus 80 rn~~s~~~~~~~grav~~vsre~LD~~~P~W~~~tg~P~~~~v~~y~~d~~~p~~~~l~P~p~~~g~v~~~~~r~P~~v~ 159 (234) T protein:vir:51 80 RNLVSAAEPRQGLRAIRTCARDVLDAQEPNWHTASYVPFRKEVRQVIYDENLPTEFYVYPGNDGSGFVEAAFSFLPTSVK 159 (234) T ss_pred hhhccccccccCcceeeecCHHHhcccCCCccccCCCCchhhhhhhhccCCCCeEEEEeccCCCCceEEEEEEeecchhh Confidence 2 1 1122233222 12245553 234444443 33 456788999999876554 Q ss_pred ----CCCCCcchHHHH-hchHHHHHHHHHHHH--Hhhc-----ChhHHHHHHHHHHHHHHHHHHhhhhh-hcCCC Q lcl|NC_020858. 139 ----SASTTTTNWLLN-RYPDIYLYGVGFEAA--KYVR-----DAELAQASKFMLDDAIATARSDDFDA-RYSRA 200 (209) Q Consensus 139 ----S~D~~~sNWLL~-~~PdiYLyg~l~ea~--~f~~-----D~er~~~w~~~~~~al~~l~~~d~~a-r~~~s 200 (209) +-+..-.+|=-+ .=|++|.=.++.+++ .|.| |.+|++..-|+|.++++.=+..|... ...+. T Consensus 160 ~~~~~d~~~~a~~~~~~~i~~~y~~~Lvdw~lyRa~skD~e~~d~~rA~~h~q~F~~~lG~k~~~d~~~~pn~r~ 234 (234) T protein:vir:51 160 VANGADPEKIASWDIDVGLPEPYSVPLLDYVLYRCHQKDDTAADLGKATSHYQLFATAVGIKVQSEGTSNPNRRR 234 (234) T ss_pred hhhcCCccccccccccCCccchhhHHHHHHHhhhhcCccccccchHHHHHHHHHHHHHhCCcchhhhhccccccC Confidence 211111233000 124555555555443 2445 56699999999999998655444221 11111 No 8 >protein:vir:104383 Length: 166 # NCBI annotation: hypothetical protein # Family: family:all:2410 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794066;genbank:gi:116222011;genbank:GeneID:4397445 Probab=64.23 E-value=0.3 Score=23.42 Aligned_cols=145 Identities=19% Similarity=0.159 Sum_probs=85.7 Q ss_pred HHHHHHHHHHHHHhhhhcccccceEEEEEecCCeee--cChhhhh--hhhhcccCCceeeccchhhccCCceEEEEEcCE Q lcl|NC_020858. 42 VFTRFVALAESRLNRTLRMREQVKTVTLTIASGVAA--LPEDFAE--MIGLYAPGGGEYVQQTSQQVNAGGCYYAIEGGN 117 (209) Q Consensus 42 ~ip~FI~lAE~~lnR~LR~~~mE~~~tl~i~~g~~~--LP~Dfle--~r~i~~~~g~~~~~~~~~~~~~~p~~yti~g~~ 117 (209) +|-+-|..|--+|=|.-.+-..+.+...+.....+. =|+|.-. ++++... |..+. ...+|++.++. T Consensus 1 ~m~~Al~~AaieFCreS~~~r~t~t~~~~~~~~~v~~~~~~d~~~~~i~~v~~~-~~~L~---------~g~~~~~~s~~ 70 (166) T protein:vir:10 1 MMTDALSMAAVAFSRQSLVCRREVTVVPVAGKEIVLPYDKDDEECVHIIRISDD-NHELF---------VGRDVDISSGR 70 (166) T ss_pred ChHHHHHHHHHHHHhhccceeeeecccccCCcceEEecCCccceeeeEEeeecC-Cceee---------ccccceecCCc Confidence 777777777777777765554444433332222222 2233221 2222221 22222 12358887755 Q ss_pred -EE-ecCCCceEEEEeeecCcccCCCCCcchHHHHhchHHHHHHHHHHHHHhh----cChhHHHHHHHHHHHHHHHHHHh Q lcl|NC_020858. 118 -IV-APHIAGDVEVSYYAMLPPLSASTTTTNWLLNRYPDIYLYGVGFEAAKYV----RDAELAQASKFMLDDAIATARSD 191 (209) Q Consensus 118 -i~-~P~~~~~v~l~YY~~iP~LS~D~~~sNWLL~~~PdiYLyg~l~ea~~f~----~D~er~~~w~~~~~~al~~l~~~ 191 (209) +. .|.. +++++.+--+ |..+. ....--|++ ++|+.-+|++.+..+=- .|++++++..++|.+++.+-.. T Consensus 71 ~L~~~~~~-~~l~V~~a~~-Ps~~a-~~lPd~L~d-y~~aI~~GA~~~L~mmP~k~WsnP~la~y~~~~F~eg~r~A~r- 145 (166) T protein:vir:10 71 SLRFACSP-GEVSVLYAVA-PKAGR-SQIPDELLT-WPEEVAAGALERLFMQTGVSWSDPLRAQYFSVQFSEGIRRAYR- 145 (166) T ss_pred EEEEecCC-CEEEEEEEec-cCCCC-cccchHHhh-HHHHHHHHHHHHHHhCCCCCCCChhhhHHHHHHHHHHHHHHHH- Confidence 55 4444 5788877666 88884 444556764 99999999999887655 6999999999999877755432 Q ss_pred hhhhhcCCCceeeCCCCC Q lcl|NC_020858. 192 DFDARYSRARVRVSGVTP 209 (209) Q Consensus 192 d~~ar~~~s~l~v~g~Tp 209 (209) .+ +-.++.|+ T Consensus 146 --~a------~e~~~~~~ 155 (166) T protein:vir:10 146 --HT------LATSPYSS 155 (166) T ss_pred --Hh------hhhcCccc Confidence 22 22333444 No 9 >protein:vir:103760 Length: 207 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Probab=48.29 E-value=0.67 Score=21.52 Aligned_cols=169 Identities=18% Similarity=0.144 Sum_probs=89.2 Q ss_pred ccCHHHHHHHHHHHhcccChhH---------H---HHHHHHHHHHHHhhh---hcccccceEEEEEecCC---eeecChh Q lcl|NC_020858. 20 FRDFLDLRTAVIEYVQRPDVAD---------V---FTRFVALAESRLNRT---LRMREQVKTVTLTIASG---VAALPED 81 (209) Q Consensus 20 i~~Y~~L~~av~~w~~R~Dlt~---------~---ip~FI~lAE~~lnR~---LR~~~mE~~~tl~i~~g---~~~LP~D 81 (209) .++=-++-+.+..+++-+-+.+ . +-+.+..+.-+...- .|.... ...+-.-.-| .-.||.| T Consensus 1 M~S~v~IcN~AL~~lGa~~I~s~~e~s~~A~~c~~~Y~~~r~~~L~~~pW~FA~~r~~L-a~~~~~P~~~~~yaY~LP~D 79 (207) T protein:vir:10 1 MASQVGICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQL-AALAEAPLFGFSYQYRLPTD 79 (207) T ss_pred CCCHHHHHHHHHHhhchhhhcccccCCHHHHHHHHhhHHHHHHHHhccChhhHhhhhhh-cccccCCCCCCcccccCccc Confidence 4444455555555555321211 1 222222111111100 010000 0111111113 2579999 Q ss_pred hhhhhhhcccCCceeeccchhhccCCceEEEEEcCEEEecCCCceEEEEeeecCcccCCCCCcchHHHHhchHHHHHH-- Q lcl|NC_020858. 82 FAEMIGLYAPGGGEYVQQTSQQVNAGGCYYAIEGGNIVAPHIAGDVEVSYYAMLPPLSASTTTTNWLLNRYPDIYLYG-- 159 (209) Q Consensus 82 fle~r~i~~~~g~~~~~~~~~~~~~~p~~yti~g~~i~~P~~~~~v~l~YY~~iP~LS~D~~~sNWLL~~~PdiYLyg-- 159 (209) ||.++++...+... ..+...-|.|.|+.|.+ .....+.|.|=.+++.- + ++|+++.-+ T Consensus 80 clrv~~v~~~~~~~--------~~~~~~~~~v~g~~ll~-~~~~~~~l~Y~~~v~d~------~-----~fd~~F~~ala 139 (207) T protein:vir:10 80 FIRLLQVGQFDVYP--------RTDTRGLFSIENGNILT-DMQAPLYIRYAKRVTDP------N-----AMDALFREAFA 139 (207) T ss_pred ceEeeeecCCCCcc--------ccccccceEecCCeEEe-cCCCcEEEEEeecCCCh------h-----hhhHHHHHHHH Confidence 99999998754321 11223468999999876 44567889998886622 2 234443332 Q ss_pred --HHHHHH-HhhcChhHHHHHHHHHHHHHHHHHHhhhhhhc-----------CCCceeeCCCCC Q lcl|NC_020858. 160 --VGFEAA-KYVRDAELAQASKFMLDDAIATARSDDFDARY-----------SRARVRVSGVTP 209 (209) Q Consensus 160 --~l~ea~-~f~~D~er~~~w~~~~~~al~~l~~~d~~ar~-----------~~s~l~v~g~Tp 209 (209) +=.+.| ...++..+++...|+++.++.+-...|...+- .+.-...+|-|| T Consensus 140 ~~LAa~lA~pLt~~~~~~~~~~q~~~~~l~~A~~~da~e~~~~~~~~~~~l~aR~~~~~~~~~~ 203 (207) T protein:vir:10 140 CRLAAEACESLTQSATKRQGAWAEHDQAIAAAIRVNAIERPAQPLGDDTWLESRNGVAFPGETP 203 (207) T ss_pred HHHHHHhhHhhcCChHHHHHHHHHHHHHHHHHHhcccccCcccccCCcchhhhcccccccccCC Confidence 223333 45588899999999999988877665543321 123356778888 No 10 >protein:vir:10130 Length: 223 # NCBI annotation: hypothetical protein # Family: family:all:2410 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859260;genbank:gi:32171016;genbank:GeneID:2653415 Probab=14.27 E-value=4.3 Score=17.09 Aligned_cols=181 Identities=19% Similarity=0.165 Sum_probs=97.6 Q ss_pred CeEeccccccCCCccccccccCHHHHHHHHHHHhcccChhHHHHHHHHHHHHHHhhhhcccccceEEEEEecCC-e---e Q lcl|NC_020858. 1 MVVVVPQAVVPDDVPGTGAFRDFLDLRTAVIEYVQRPDVADVFTRFVALAESRLNRTLRMREQVKTVTLTIASG-V---A 76 (209) Q Consensus 1 ~~~~~~~~~~~~~~~~~~ai~~Y~~L~~av~~w~~R~Dlt~~ip~FI~lAE~~lnR~LR~~~mE~~~tl~i~~g-~---~ 76 (209) --.|---+||-++ .+..++.++++-.|..+..-- +.-++-+.|..|--+|=|.-++-. .+.++....+ . + T Consensus 21 ~~~~~~~~~~~~~---~~~Mv~~d~fLP~Vr~~vp~~-l~~~~~~aLr~AAieFCReS~~~R--~t~~~~~~~~~~~~~~ 94 (223) T protein:vir:10 21 SGCVRHFAVVLRR---LNSMAELSDFLPYVRRHISGP-LNIMMTDALSMAAVAFSRQSLVCR--REVTVVPVAGKEIVLP 94 (223) T ss_pred HHHHHHHHHHHHH---HhhhccHHHHHHHHhhhCCCc-hhHHHHHHHHHHHHHHHhhcccee--ecccccCCCCcceEEe Confidence 0001111222222 345556666666665554321 122456677777777766655442 2334433333 2 2 Q ss_pred ecChhh--hhhhhhcccCCceeeccchhhccCCceEEEEEc-CEEEecCCCceEEEEeeecCcccCCCCCcchHHHHhch Q lcl|NC_020858. 77 ALPEDF--AEMIGLYAPGGGEYVQQTSQQVNAGGCYYAIEG-GNIVAPHIAGDVEVSYYAMLPPLSASTTTTNWLLNRYP 153 (209) Q Consensus 77 ~LP~Df--le~r~i~~~~g~~~~~~~~~~~~~~p~~yti~g-~~i~~P~~~~~v~l~YY~~iP~LS~D~~~sNWLL~~~P 153 (209) ..|+|= -+++++.. +|.++.. .+.|++.+ ++++.-+..+++++.+--+ |..+.+.=| --|++ +. T Consensus 95 ~~~~~~~~~rI~~v~~-~g~~L~~---------G~~~t~~s~d~l~l~~~~~tl~V~~alk-Ps~~a~~lP-D~L~d-y~ 161 (223) T protein:vir:10 95 YDKDDEECVHIIRISD-DNHELFV---------GRDVDISSGRSLRFACSPGEVSVLYAVA-PKAGRSQIP-DELLT-WP 161 (223) T ss_pred eccccccceeeEEeec-CCeeeec---------ccceeecCCceEEEEeCCCeEEEEEEec-cCcchhhhh-HHHHH-HH Confidence 344431 22333332 2333221 23487777 5565322336788877655 777654444 45665 99 Q ss_pred HHHHHHHHHHHHHhh----cChhHHHHHHHHHHHHHHHHHHhhhhhhcCCCceeeCCCCC Q lcl|NC_020858. 154 DIYLYGVGFEAAKYV----RDAELAQASKFMLDDAIATARSDDFDARYSRARVRVSGVTP 209 (209) Q Consensus 154 diYLyg~l~ea~~f~----~D~er~~~w~~~~~~al~~l~~~d~~ar~~~s~l~v~g~Tp 209 (209) ++.=+|++.+.-+=- .|++++++|.|.|.+++.+-... + +-.++.|+ T Consensus 162 eaI~~GA~arLlmmP~kpWsNP~lA~y~~~~F~eg~r~Akr~---a------le~~~~~~ 212 (223) T protein:vir:10 162 EEVAAGALERLFMQTGVSWSDPLRAQYFSVQFSEGIRRAYRH---T------LATSPYSS 212 (223) T ss_pred HHHHHHHHHHHHhCCCCCCCChhHHHHHHHHHHHHHHHHHHH---H------HhhcCccc Confidence 999999998877655 68999999999998877655332 2 22333444 Done!