Query lcl|NC_020849.1_cdsid_YP_007674304.1 [gene=PYDG_00097] [protein=hypothetical protein] [protein_id=YP_007674304.1] [location=complement(72338..73045)] Match_columns 235 No_of_seqs 12 out of 14 Neff 3.9 Searched_HMMs 1612 Date Thu Nov 7 16:09:02 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_94 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_94_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95880 Length: 236 100.0 3E-108 2E-111 610.2 18.0 233 1-235 2-236 (236) 2 protein:vir:105573 Length: 225 97.8 3.5E-06 2.2E-09 50.5 13.8 216 1-235 1-225 (225) 3 protein:vir:5111 Length: 234 # 97.5 1.3E-05 7.9E-09 47.4 14.1 202 1-231 2-234 (234) 4 protein:vir:2779 Length: 216 # 96.5 0.0003 1.8E-07 39.9 13.1 202 1-234 2-216 (216) 5 protein:vir:823 Length: 216 # 96.5 0.0003 1.8E-07 39.9 13.1 202 1-234 2-216 (216) 6 protein:vir:3302 Length: 216 # 96.5 0.0003 1.8E-07 39.9 13.1 202 1-234 2-216 (216) 7 protein:vir:103760 Length: 207 85.0 0.042 2.6E-05 28.1 8.6 188 1-235 1-190 (207) 8 protein:vir:107803 Length: 223 58.1 0.42 0.00026 22.6 9.7 192 1-235 1-202 (223) 9 protein:vir:107429 Length: 223 58.1 0.42 0.00026 22.6 9.7 192 1-235 1-202 (223) 10 protein:vir:98502 Length: 223 58.1 0.42 0.00026 22.6 9.7 192 1-235 1-202 (223) 11 protein:vir:93692 Length: 197 46.0 0.75 0.00047 21.3 11.1 165 1-224 3-197 (197) 12 protein:vir:95323 Length: 201 42.2 0.89 0.00055 20.9 10.4 189 1-235 1-196 (201) 13 protein:vir:7328 Length: 201 # 31.3 1.5 0.00093 19.6 8.2 189 1-235 1-196 (201) No 1 >protein:vir:95880 Length: 236 # NCBI annotation: 30 kDa protein # Family: family:all:31944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950545;genbank:gi:119952236;genbank:GeneID:5075707 Probab=100.00 E-value=2.9e-108 Score=610.17 Aligned_cols=233 Identities=30% Similarity=0.432 Sum_probs=230.0 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhcccccccEEEEEecCceeccccchhcc-cc Q lcl|NC_020849. 1 MKLSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSRFALKHSEVIVQQFEHITLYPLRSDYAV-SN 79 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF~Lk~~~i~v~~~e~~t~YpL~~~ya~-~~ 79 (235) .+|+|+++.||+||||||++||+|+|+|+|+++|+|++|+|+||||||+||.||+|||+|+|.||+++|||+|+||| |+ T Consensus 2 ~~lkev~~~La~gqL~N~~~V~~d~g~i~~~~~p~ii~a~N~gl~~Lh~Rf~lk~~~i~vem~eg~~~Y~L~~~y~v~~~ 81 (236) T protein:vir:95 2 YYIEELFCRLANGVLNNTGIVTDDRGDIEDDSKPFIIVAANEALTRLHGRFNMRNNNVVVEMQEGRTNYPLLAKYAVQSY 81 (236) T ss_pred chHHHHHHHHhcceecceeeeecccccccccccchHHHHHhHHHHHHhhhhhhccCcEEEEEeeCceecccchhhhhccC Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999995 99 Q ss_pred CCCCcccceeccCCcccchhHHHHHHhhhcccCcccccCCCCCceeeeecCCceEEEEecCCCceEEEEeeccCcceecc Q lcl|NC_020849. 80 TSSTEPYKWISDTIERPFQDDIILIESVIDEGGNEIKLNTENDILSVYSPQPDVLQIVSPKNENAVAVMYKANHTKIDLS 159 (235) Q Consensus 80 ~~~~~~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lNd~~~~~s~~tP~~~~lQi~~P~~~~~l~V~YqA~h~~l~~~ 159 (235) +++++.+|||||...|.|.+|+|||++|+||+|++++|||+++|||+|||++++|||.||++|+||+|+|||+||++.+| T Consensus 82 ~p~~~~~~fI~d~~~~~~~~~ilri~~V~dd~G~~~~Lnd~~~~~sv~~P~~nvLqi~~~~~~~~l~vkyq~~~~~l~~~ 161 (236) T protein:vir:95 82 DPNEVKCPFIMDLAGEKFAEDVIRILEVYDDKGRRRPLNDRNNPCSLFTPRPNVLQNNAPKAWEVLNVMYQAKHPKLSTA 161 (236) T ss_pred CCCCcccchhhccccchhHHHHHHHHhhccCCCcccccCCCCCCceeeeCCCcceeeecCCCcceEEEEeecCCCceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999887 Q ss_pred cCCCCcccccCchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecC-CCceeeeeecCcC Q lcl|NC_020849. 160 TDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGSIEGFQTGASKMIEYETACIQIDLLGLIHKE-DWVNENIWRNGWV 235 (235) Q Consensus 160 ~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~-~~~n~~f~~rGwv 235 (235) +| ..++||||+||++||.|||||||+||||+|||||++.+|+++||+||++|++++|.+++ ++++++|++|||- T Consensus 162 eD--~~~~idlP~t~~~aL~~yVA~r~~T~ig~~EnTAk~~~y~~~Yes~c~~v~~~~l~s~~~v~~~~~f~r~Gw~ 236 (236) T protein:vir:95 162 ED--GYNEIDIPDTLDPALDAYIAYRYYTSLNTPESSAKAAEYLSFYDSICREVVEYDLTSDTEVDTNTLFRKRGWR 236 (236) T ss_pred eC--CcccccCCcchHHHHHHHHHHHhhccCCCcccchhhhhHHHHHHHHHhhHHhhccccccccccccccccCCCC Confidence 76 99999999999999999999999999999999999999999999999999999999998 9999999999999 No 2 >protein:vir:105573 Length: 225 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164310;genbank:gi:56692957;genbank:GeneID:3197184 Probab=97.75 E-value=3.5e-06 Score=50.49 Aligned_cols=216 Identities=13% Similarity=0.076 Sum_probs=115.1 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhh-----cccccccEEEEEecCceeccccch- Q lcl|NC_020849. 1 MKLSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSR-----FALKHSEVIVQQFEHITLYPLRSD- 74 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~R-----F~Lk~~~i~v~~~e~~t~YpL~~~- 74 (235) |+++++++.-.. +|. |...=.-=.-++|+.++|+++.+.+.| .+-..+...++...|+.-|++-.. T Consensus 1 m~~~~lI~r~~~-~l~-------D~~~~~rW~~~el~~~lNdAv~e~~~r~rL~rpda~~~~~~i~l~~Gt~q~~~~~~~ 72 (225) T protein:vir:10 1 MTLADLIRRVRT-DAN-------DMVEPYFWSDQDVADWLNDAVREAAVRGRLIHESQADAVCRIEVVAGTAVYQLHASL 72 (225) T ss_pred CCHHHHHHHHHH-Hhc-------cccccccCChHHHHHHHHHHHHHHHHhcccccccCCCceeeeeecCccccccCchHH Confidence 999999987542 222 222112345689999999999999984 556666678888888887777321 Q ss_pred hccccCCCCcccceeccCCcccchhHHHHHHhhhcccCcccccCCCCCceeeeecCCceEEEEecCC--CceEEEEeecc Q lcl|NC_020849. 75 YAVSNTSSTEPYKWISDTIERPFQDDIILIESVIDEGGNEIKLNTENDILSVYSPQPDVLQIVSPKN--ENAVAVMYKAN 152 (235) Q Consensus 75 ya~~~~~~~~~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lNd~~~~~s~~tP~~~~lQi~~P~~--~~~l~V~YqA~ 152 (235) |-+.. ..|+.-+.-+|=.-.+-.++ +.|..=-.-. ++++.|- -++=.++.+=++ |.+ ..+|++.|-+. T Consensus 73 ~~I~~------~~~~~~~~~~~~~~~~~s~e-~LD~~~P~W~-~~tg~p~-~~~~d~~~~~l~-P~p~~~~~vel~~~r~ 142 (225) T protein:vir:10 73 YELSH------LGFYPADMSRPTMPVLKSAE-VLDVELPEWR-ACTGKPL-YAIQGDTSLRLV-PTPDRAGILRVEGYRT 142 (225) T ss_pred HHHHH------HhhcCcccCCceecccccHH-HhcccCCCcc-cCCCCce-EEEeCCcEEEEE-ecCCCceEEEEEEEee Confidence 11110 01110000000000000000 1111100000 1111111 111122344333 333 24577777777 Q ss_pred CcceecccCCCCcccccCchHHHHHHHHHHHHHHhccCC-CccchHHHHHHHHHHHHHHHHHHHcCceecCCCceeeeee Q lcl|NC_020849. 153 HTKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVG-SIEGFQTGASKMIEYETACIQIDLLGLIHKEDWVNENIWR 231 (235) Q Consensus 153 h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~-~~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~~f~~ 231 (235) +-+-- -...+++.+.+||.-|.++|..|+-||+|+-=+ ...+.++|+.++|.++..-+.=.+-++........-.+++ T Consensus 143 P~~~~-~~~~~D~~~p~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~alG~k~~ad~~r~~r~~~p~~~~ 221 (225) T protein:vir:10 143 PLADM-ALADKDTAQPEIHAEHHRHLVQWALYRGFSIPDMESFDPNRAALAEAAFTAYFGERPDSDLRRITREDVPHHVE 221 (225) T ss_pred cchhh-hccccccccCccchhhHHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchhHHHHHhccccCccccc Confidence 64321 122345677899999999999999999999865 3355689999999999877653333333322222222333 Q ss_pred cCcC Q lcl|NC_020849. 232 NGWV 235 (235) Q Consensus 232 rGwv 235 (235) .-|- T Consensus 222 ~~~~ 225 (225) T protein:vir:10 222 AFWP 225 (225) T ss_pred ccCC Confidence 3344 No 3 >protein:vir:5111 Length: 234 # NCBI annotation: unknown # Family: family:all:1423 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542268;genbank:gi:18071240;genbank:GeneID:929347 Probab=97.54 E-value=1.3e-05 Score=47.41 Aligned_cols=202 Identities=17% Similarity=0.193 Sum_probs=108.6 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccccEEEEEecCc-eeccc----cch Q lcl|NC_020849. 1 MKLSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSRF-ALKHSEVIVQQFEHI-TLYPL----RSD 74 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~v~~~e~~-t~YpL----~~~ 74 (235) ++++++++.-.. +|.. . .=.-=.-++|+.++|+++.+.+-|= ....+...|+..+|+ -.||. .+- T Consensus 2 ~t~~~lI~r~~~-~l~D-------~-~~~rW~~~el~~~lNdAv~e~~l~rp~a~~~~~~i~l~~Gt~q~lP~d~~~~~~ 72 (234) T protein:vir:51 2 PKASEIMRLAGI-QLLD-------E-DHIRWPLIELADWVNEGVKAIVLAKPSASSKSAAIQLVKGTHQTLPGTIDGKAT 72 (234) T ss_pred ccHHHHHHHHHH-Hhcc-------c-cccccChHHHHHHHHHHHHHHHhhcCCCCccceeEeeccCCccccccccccchh Confidence 889999887542 2322 1 1123346899999999999999774 445666788989995 34443 122 Q ss_pred h---ccccCCCCcccceeccCCcccchh-HHHHHHh--hhccc-CcccccCCCCCc---------eeeeecCCceEEEEe Q lcl|NC_020849. 75 Y---AVSNTSSTEPYKWISDTIERPFQD-DIILIES--VIDEG-GNEIKLNTENDI---------LSVYSPQPDVLQIVS 138 (235) Q Consensus 75 y---a~~~~~~~~~~~~I~d~~~kpF~d-d~lkI~~--V~d~~-G~~~~lNd~~~~---------~s~~tP~~~~lQi~~ 138 (235) + .+-...... -+++.+ -.+++-+ +-|.. .+-. +++..| .+..+|+ .+= +- T Consensus 73 l~li~i~rn~~s~---------~~~~~~grav~~vsre~LD~~~P~W~--~~tg~P~~~~v~~y~~d~~~p~--~~~-l~ 138 (234) T protein:vir:51 73 LQLIGINRNLVSA---------AEPRQGLRAIRTCARDVLDAQEPNWH--TASYVPFRKEVRQVIYDENLPT--EFY-VY 138 (234) T ss_pred eehhhhhhhhccc---------cccccCcceeeecCHHHhcccCCCcc--ccCCCCchhhhhhhhccCCCCe--EEE-Ee Confidence 2 221110000 011110 1111110 00100 0000 111111 1223332 221 22 Q ss_pred cCCC--ceEEEEeeccCcceecccCCCCc------ccccCchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHH Q lcl|NC_020849. 139 PKNE--NAVAVMYKANHTKIDLSTDVPSN------IEIEIPPQLVRPLALYVSSLAHTSVGSIEGFQTGASKMIEYETAC 210 (235) Q Consensus 139 P~~~--~~l~V~YqA~h~~l~~~~d~~~~------~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~Ye~~c 210 (235) |-+. -.|.+.|-+.++..++....+++ .+++||..|.++|..|+-||+|+.=....+.++|+.++|.++.- T Consensus 139 P~p~~~g~v~~~~~r~P~~v~~~~~~d~~~~a~~~~~~~i~~~y~~~Lvdw~lyRa~skD~e~~d~~rA~~h~q~F~~~- 217 (234) T protein:vir:51 139 PGNDGSGFVEAAFSFLPTSVKVANGADPEKIASWDIDVGLPEPYSVPLLDYVLYRCHQKDDTAADLGKATSHYQLFATA- 217 (234) T ss_pred ccCCCCceEEEEEEeecchhhhhhcCCccccccccccCCccchhhHHHHHHHhhhhcCccccccchHHHHHHHHHHHHH- Confidence 3332 34666776666655443333333 48999999999999999999999754455677999999999865 Q ss_pred HHHHHcCceecCCC-ceeeeee Q lcl|NC_020849. 211 IQIDLLGLIHKEDW-VNENIWR 231 (235) Q Consensus 211 ~~l~~~~L~~~~~~-~n~~f~~ 231 (235) .|++.+... .+-+.|| T Consensus 218 -----lG~k~~~d~~~~pn~r~ 234 (234) T protein:vir:51 218 -----VGIKVQSEGTSNPNRRR 234 (234) T ss_pred -----hCCcchhhhhccccccC Confidence 555555422 2333344 No 4 >protein:vir:2779 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612896;genbank:gi:20065813;genbank:GeneID:935638 Probab=96.54 E-value=0.0003 Score=39.92 Aligned_cols=202 Identities=8% Similarity=-0.015 Sum_probs=110.5 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccccEEEEEecCceeccccchhc--- Q lcl|NC_020849. 1 MKLSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSRF-ALKHSEVIVQQFEHITLYPLRSDYA--- 76 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~v~~~e~~t~YpL~~~ya--- 76 (235) ++++++++.-. .+|... + +. -=.-++|+.++|+++.+.+-|= ....+...++...|+.-|.+-..+. T Consensus 2 ~t~~~lI~r~~-~~l~D~-----~-~~--rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~ 72 (216) T protein:vir:27 2 TTITEIIGRVN-TQLVDP-----M-MV--RWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLLD 72 (216) T ss_pred ccHHHHHHHHH-Hhhhcc-----c-cc--ccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhhh Confidence 89999998753 344331 1 11 1234679999999999999885 5566777888888887765533332 Q ss_pred cccCCCCcccceeccCCcccch---hHHHHHHhhhcccCcccccCCCCCce-eeeecC-CceEEEEecCCC--ceEEEEe Q lcl|NC_020849. 77 VSNTSSTEPYKWISDTIERPFQ---DDIILIESVIDEGGNEIKLNTENDIL-SVYSPQ-PDVLQIVSPKNE--NAVAVMY 149 (235) Q Consensus 77 ~~~~~~~~~~~~I~d~~~kpF~---dd~lkI~~V~d~~G~~~~lNd~~~~~-s~~tP~-~~~lQi~~P~~~--~~l~V~Y 149 (235) +-...+ .++.+ .++|- .-+-+|= ++++.|. -++..+ |... .+-|-+. -.|.+.| T Consensus 73 v~r~~~-----------g~a~~~vsre~LD--~~~P~W~-----~~~g~p~~~i~de~~pr~f-~l~P~p~~~~~vel~~ 133 (216) T protein:vir:27 73 VICLSD-----------GSAVRPLSREVLD--AQYPEWP-----TMKGIPECFISNDLSPRVF-WLFPAPDKEISIDAVV 133 (216) T ss_pred hhhhCC-----------CCceeeecHHHhc--ccCCCcC-----CCCCCceEEEecCCCceEE-EEeccCCCCcEEEEEE Confidence 111100 01111 01111 0011111 1112221 122221 1122 1223333 3467777 Q ss_pred eccCcceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCCc-cchHHHHHHHHHHHHHHHHHHHcCceecCCCcee- Q lcl|NC_020849. 150 KANHTKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGSI-EGFQTGASKMIEYETACIQIDLLGLIHKEDWVNE- 227 (235) Q Consensus 150 qA~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~-e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~- 227 (235) -+.+..... .+.+++.+++||..|.++|..|+-||+|+-=..- .+.++|+.++|.++..-+.=.+-+.. -.+-. T Consensus 134 ~r~P~a~~~-~~~~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~~---~~~r~~ 209 (216) T protein:vir:27 134 SRIPEAVYV-LTQDDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADSA---LYARKK 209 (216) T ss_pred EecCcchhh-ccCCCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHhH---HHHHHh Confidence 777754321 2344556789999999999999999999976663 56789999999999876642222110 00111 Q ss_pred eeeecCc Q lcl|NC_020849. 228 NIWRNGW 234 (235) Q Consensus 228 ~f~~rGw 234 (235) -++-+|- T Consensus 210 ~~~~~~~ 216 (216) T protein:vir:27 210 VFNGGGV 216 (216) T ss_pred hccCCCC Confidence 1222333 No 5 >protein:vir:823 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050556;genbank:gi:9633453;genbank:GeneID:1262282 Probab=96.54 E-value=0.0003 Score=39.92 Aligned_cols=202 Identities=8% Similarity=-0.015 Sum_probs=110.5 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccccEEEEEecCceeccccchhc--- Q lcl|NC_020849. 1 MKLSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSRF-ALKHSEVIVQQFEHITLYPLRSDYA--- 76 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~v~~~e~~t~YpL~~~ya--- 76 (235) ++++++++.-. .+|... + +. -=.-++|+.++|+++.+.+-|= ....+...++...|+.-|.+-..+. T Consensus 2 ~t~~~lI~r~~-~~l~D~-----~-~~--rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~ 72 (216) T protein:vir:82 2 TTITEIIGRVN-TQLVDP-----M-MV--RWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLLD 72 (216) T ss_pred ccHHHHHHHHH-Hhhhcc-----c-cc--ccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhhh Confidence 89999998753 344331 1 11 1234679999999999999885 5566777888888887765533332 Q ss_pred cccCCCCcccceeccCCcccch---hHHHHHHhhhcccCcccccCCCCCce-eeeecC-CceEEEEecCCC--ceEEEEe Q lcl|NC_020849. 77 VSNTSSTEPYKWISDTIERPFQ---DDIILIESVIDEGGNEIKLNTENDIL-SVYSPQ-PDVLQIVSPKNE--NAVAVMY 149 (235) Q Consensus 77 ~~~~~~~~~~~~I~d~~~kpF~---dd~lkI~~V~d~~G~~~~lNd~~~~~-s~~tP~-~~~lQi~~P~~~--~~l~V~Y 149 (235) +-...+ .++.+ .++|- .-+-+|= ++++.|. -++..+ |... .+-|-+. -.|.+.| T Consensus 73 v~r~~~-----------g~a~~~vsre~LD--~~~P~W~-----~~~g~p~~~i~de~~pr~f-~l~P~p~~~~~vel~~ 133 (216) T protein:vir:82 73 VICLSD-----------GSAVRPLSREVLD--AQYPEWP-----TMKGIPECFISNDLSPRVF-WLFPAPDKEISIDAVV 133 (216) T ss_pred hhhhCC-----------CCceeeecHHHhc--ccCCCcC-----CCCCCceEEEecCCCceEE-EEeccCCCCcEEEEEE Confidence 111100 01111 01111 0011111 1112221 122221 1122 1223333 3467777 Q ss_pred eccCcceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCCc-cchHHHHHHHHHHHHHHHHHHHcCceecCCCcee- Q lcl|NC_020849. 150 KANHTKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGSI-EGFQTGASKMIEYETACIQIDLLGLIHKEDWVNE- 227 (235) Q Consensus 150 qA~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~-e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~- 227 (235) -+.+..... .+.+++.+++||..|.++|..|+-||+|+-=..- .+.++|+.++|.++..-+.=.+-+.. -.+-. T Consensus 134 ~r~P~a~~~-~~~~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~~---~~~r~~ 209 (216) T protein:vir:82 134 SRIPEAVYV-LTQDDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADSA---LYARKK 209 (216) T ss_pred EecCcchhh-ccCCCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHhH---HHHHHh Confidence 777754321 2344556789999999999999999999976663 56789999999999876642222110 00111 Q ss_pred eeeecCc Q lcl|NC_020849. 228 NIWRNGW 234 (235) Q Consensus 228 ~f~~rGw 234 (235) -++-+|- T Consensus 210 ~~~~~~~ 216 (216) T protein:vir:82 210 VFNGGGV 216 (216) T ss_pred hccCCCC Confidence 1222333 No 6 >protein:vir:3302 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049518;genbank:gi:9632524;genbank:GeneID:1262013 Probab=96.54 E-value=0.0003 Score=39.92 Aligned_cols=202 Identities=8% Similarity=-0.015 Sum_probs=110.5 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccccEEEEEecCceeccccchhc--- Q lcl|NC_020849. 1 MKLSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSRF-ALKHSEVIVQQFEHITLYPLRSDYA--- 76 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~v~~~e~~t~YpL~~~ya--- 76 (235) ++++++++.-. .+|... + +. -=.-++|+.++|+++.+.+-|= ....+...++...|+.-|.+-..+. T Consensus 2 ~t~~~lI~r~~-~~l~D~-----~-~~--rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~ 72 (216) T protein:vir:33 2 TTITEIIGRVN-TQLVDP-----M-MV--RWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLLD 72 (216) T ss_pred ccHHHHHHHHH-Hhhhcc-----c-cc--ccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhhh Confidence 89999998753 344331 1 11 1234679999999999999885 5566777888888887765533332 Q ss_pred cccCCCCcccceeccCCcccch---hHHHHHHhhhcccCcccccCCCCCce-eeeecC-CceEEEEecCCC--ceEEEEe Q lcl|NC_020849. 77 VSNTSSTEPYKWISDTIERPFQ---DDIILIESVIDEGGNEIKLNTENDIL-SVYSPQ-PDVLQIVSPKNE--NAVAVMY 149 (235) Q Consensus 77 ~~~~~~~~~~~~I~d~~~kpF~---dd~lkI~~V~d~~G~~~~lNd~~~~~-s~~tP~-~~~lQi~~P~~~--~~l~V~Y 149 (235) +-...+ .++.+ .++|- .-+-+|= ++++.|. -++..+ |... .+-|-+. -.|.+.| T Consensus 73 v~r~~~-----------g~a~~~vsre~LD--~~~P~W~-----~~~g~p~~~i~de~~pr~f-~l~P~p~~~~~vel~~ 133 (216) T protein:vir:33 73 VICLSD-----------GSAVRPLSREVLD--AQYPEWP-----TMKGIPECFISNDLSPRVF-WLFPAPDKEISIDAVV 133 (216) T ss_pred hhhhCC-----------CCceeeecHHHhc--ccCCCcC-----CCCCCceEEEecCCCceEE-EEeccCCCCcEEEEEE Confidence 111100 01111 01111 0011111 1112221 122221 1122 1223333 3467777 Q ss_pred eccCcceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCCc-cchHHHHHHHHHHHHHHHHHHHcCceecCCCcee- Q lcl|NC_020849. 150 KANHTKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGSI-EGFQTGASKMIEYETACIQIDLLGLIHKEDWVNE- 227 (235) Q Consensus 150 qA~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~-e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~- 227 (235) -+.+..... .+.+++.+++||..|.++|..|+-||+|+-=..- .+.++|+.++|.++..-+.=.+-+.. -.+-. T Consensus 134 ~r~P~a~~~-~~~~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~~---~~~r~~ 209 (216) T protein:vir:33 134 SRIPEAVYV-LTQDDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADSA---LYARKK 209 (216) T ss_pred EecCcchhh-ccCCCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHhH---HHHHHh Confidence 777754321 2344556789999999999999999999976663 56789999999999876642222110 00111 Q ss_pred eeeecCc Q lcl|NC_020849. 228 NIWRNGW 234 (235) Q Consensus 228 ~f~~rGw 234 (235) -++-+|- T Consensus 210 ~~~~~~~ 216 (216) T protein:vir:33 210 VFNGGGV 216 (216) T ss_pred hccCCCC Confidence 1222333 No 7 >protein:vir:103760 Length: 207 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Probab=85.00 E-value=0.042 Score=28.12 Aligned_cols=188 Identities=10% Similarity=0.095 Sum_probs=104.8 Q ss_pred Cc-HHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhcccccccEEEEEecCceeccccchhcccc Q lcl|NC_020849. 1 MK-LSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSRFALKHSEVIVQQFEHITLYPLRSDYAVSN 79 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF~Lk~~~i~v~~~e~~t~YpL~~~ya~~~ 79 (235) |. .-+ ++++|++.|-+=.|.+-+-+......-..+-..+...+++.| .+.--.+-+.+. ++ .. T Consensus 1 M~S~v~-IcN~AL~~lGa~~I~s~~e~s~~A~~c~~~Y~~~r~~~L~~~-pW~FA~~r~~La--------~~------~~ 64 (207) T protein:vir:10 1 MASQVG-ICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAH-VWSFTKARAQLA--------AL------AE 64 (207) T ss_pred CCCHHH-HHHHHHHhhchhhhcccccCCHHHHHHHHhhHHHHHHHHhcc-ChhhHhhhhhhc--------cc------cc Confidence 43 333 788888888775555445555555556666677777888777 333333333332 12 11 Q ss_pred CCCCc-ccceeccCCcccchhHHHHHHhhhcccCcccccCCCCCceeeeecCCceEEEEecCCCceEEEEeeccCcceec Q lcl|NC_020849. 80 TSSTE-PYKWISDTIERPFQDDIILIESVIDEGGNEIKLNTENDILSVYSPQPDVLQIVSPKNENAVAVMYKANHTKIDL 158 (235) Q Consensus 80 ~~~~~-~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lNd~~~~~s~~tP~~~~lQi~~P~~~~~l~V~YqA~h~~l~~ 158 (235) .+..+ .|+| ..+.|-|||.+|.+.... +.-+.-.++.+. -..|.- .+-..+.+.|-++=+ T Consensus 65 ~P~~~~~yaY-------~LP~Dclrv~~v~~~~~~--~~~~~~~~~~v~---g~~ll~---~~~~~~~l~Y~~~v~---- 125 (207) T protein:vir:10 65 APLFGFSYQY-------RLPTDFIRLLQVGQFDVY--PRTDTRGLFSIE---NGNILT---DMQAPLYIRYAKRVT---- 125 (207) T ss_pred CCCCCCcccc-------cCcccceEeeeecCCCCc--cccccccceEec---CCeEEe---cCCCcEEEEEeecCC---- Confidence 11112 1333 236788999999875432 111111112221 112211 111235555544322 Q ss_pred ccCCCCcccccCchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecCCCceeeeeecCcC Q lcl|NC_020849. 159 STDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGSIEGFQTGASKMIEYETACIQIDLLGLIHKEDWVNENIWRNGWV 235 (235) Q Consensus 159 ~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~~f~~rGwv 235 (235) .+=..|+...+||++..|+.+=-++ .++..++.+.+|+|+.+..+-...|-. .....++..-+|+ T Consensus 126 -------d~~~fd~~F~~ala~~LAa~lA~pL--t~~~~~~~~~~q~~~~~l~~A~~~da~---e~~~~~~~~~~~l 190 (207) T protein:vir:10 126 -------DPNAMDALFREAFACRLAAEACESL--TQSATKRQGAWAEHDQAIAAAIRVNAI---ERPAQPLGDDTWL 190 (207) T ss_pred -------ChhhhhHHHHHHHHHHHHHHhhHhh--cCChHHHHHHHHHHHHHHHHHHhcccc---cCcccccCCcchh Confidence 1224799999999999999998887 466679999999999887664433332 2223455555555 No 8 >protein:vir:107803 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996624;genbank:gi:45580758;genbank:GeneID:2767879 Probab=58.09 E-value=0.42 Score=22.65 Aligned_cols=192 Identities=16% Similarity=0.218 Sum_probs=97.7 Q ss_pred Cc-HHhhHhhhhhhhhhccccccC---CccccCccchhHHHHHHHHHHHHHHh-hcccccccEEEEEecCceeccccchh Q lcl|NC_020849. 1 MK-LSEFFSLLTYGELANLKVGGK---DCGGIYPKYADEVTSYIRQGLTDLHS-RFALKHSEVIVQQFEHITLYPLRSDY 75 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~---d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~v~~~e~~t~YpL~~~y 75 (235) |. .-+ ++++|++-|-+=.++.+ +-+.-....-..+-..+...+++.|- .|..|--.+ . + T Consensus 1 M~S~v~-IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~L--a--------~----- 64 (223) T protein:vir:10 1 MASEVD-ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL--A--------A----- 64 (223) T ss_pred CCCHHH-HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh--h--------h----- Confidence 44 333 67777766644444422 12222333334445556666776662 333332222 1 1 Q ss_pred ccccCCCCc-ccceeccCCcccchhHHHHHHhhhcccCcccccC-CCCCceee-eec-CCceEEEEecCCCceEEEEeec Q lcl|NC_020849. 76 AVSNTSSTE-PYKWISDTIERPFQDDIILIESVIDEGGNEIKLN-TENDILSV-YSP-QPDVLQIVSPKNENAVAVMYKA 151 (235) Q Consensus 76 a~~~~~~~~-~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lN-d~~~~~s~-~tP-~~~~lQi~~P~~~~~l~V~YqA 151 (235) ...++++ .|+| ..+.|-|||.+|.+..-..+.-. +...+..+ ++. ....| +. +...+.+.|-+ T Consensus 65 --~a~p~~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i--~t--d~~~~~l~Y~~ 131 (223) T protein:vir:10 65 --MGISRPEWRFAY-------AQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADII--LT--NQVNAVARYIS 131 (223) T ss_pred --cccCCCCccccc-------cccccceeeeeeccccccccccccccccceEEeeccccceee--ee--cCCceEEEEee Confidence 1122222 1333 34678999999987643322221 11111111 111 11233 22 33456677755 Q ss_pred cCcceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCC-ccchHHHHHHHHHHHHHHHHHHHcCceecCCCceeeee Q lcl|NC_020849. 152 NHTKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGS-IEGFQTGASKMIEYETACIQIDLLGLIHKEDWVNENIW 230 (235) Q Consensus 152 ~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~-~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~~f~ 230 (235) +=+. +=-.|+...+||++..||.+=-++=. ....+++.+.+|+|+.+..+-...|-..+ ....+. T Consensus 132 ~v~d-----------~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~---~~~~~~ 197 (223) T protein:vir:10 132 LVKD-----------TTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQR---KTKPAH 197 (223) T ss_pred cCCC-----------hhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhcccccC---cccccc Confidence 4321 22479999999999999998666533 33455777889999998777444443222 233333 Q ss_pred ecCcC Q lcl|NC_020849. 231 RNGWV 235 (235) Q Consensus 231 ~rGwv 235 (235) .-.|+ T Consensus 198 ~~~~l 202 (223) T protein:vir:10 198 MPEWM 202 (223) T ss_pred cchhh Confidence 44444 No 9 >protein:vir:107429 Length: 223 # NCBI annotation: Bbp14 # Family: family:all:1524 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958683;genbank:gi:41179375;genbank:GeneID:2717223 Probab=58.09 E-value=0.42 Score=22.65 Aligned_cols=192 Identities=16% Similarity=0.218 Sum_probs=97.7 Q ss_pred Cc-HHhhHhhhhhhhhhccccccC---CccccCccchhHHHHHHHHHHHHHHh-hcccccccEEEEEecCceeccccchh Q lcl|NC_020849. 1 MK-LSEFFSLLTYGELANLKVGGK---DCGGIYPKYADEVTSYIRQGLTDLHS-RFALKHSEVIVQQFEHITLYPLRSDY 75 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~---d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~v~~~e~~t~YpL~~~y 75 (235) |. .-+ ++++|++-|-+=.++.+ +-+.-....-..+-..+...+++.|- .|..|--.+ . + T Consensus 1 M~S~v~-IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~L--a--------~----- 64 (223) T protein:vir:10 1 MASEVD-ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL--A--------A----- 64 (223) T ss_pred CCCHHH-HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh--h--------h----- Confidence 44 333 67777766644444422 12222333334445556666776662 333332222 1 1 Q ss_pred ccccCCCCc-ccceeccCCcccchhHHHHHHhhhcccCcccccC-CCCCceee-eec-CCceEEEEecCCCceEEEEeec Q lcl|NC_020849. 76 AVSNTSSTE-PYKWISDTIERPFQDDIILIESVIDEGGNEIKLN-TENDILSV-YSP-QPDVLQIVSPKNENAVAVMYKA 151 (235) Q Consensus 76 a~~~~~~~~-~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lN-d~~~~~s~-~tP-~~~~lQi~~P~~~~~l~V~YqA 151 (235) ...++++ .|+| ..+.|-|||.+|.+..-..+.-. +...+..+ ++. ....| +. +...+.+.|-+ T Consensus 65 --~a~p~~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i--~t--d~~~~~l~Y~~ 131 (223) T protein:vir:10 65 --MGISRPEWRFAY-------AQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADII--LT--NQVNAVARYIS 131 (223) T ss_pred --cccCCCCccccc-------cccccceeeeeeccccccccccccccccceEEeeccccceee--ee--cCCceEEEEee Confidence 1122222 1333 34678999999987643322221 11111111 111 11233 22 33456677755 Q ss_pred cCcceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCC-ccchHHHHHHHHHHHHHHHHHHHcCceecCCCceeeee Q lcl|NC_020849. 152 NHTKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGS-IEGFQTGASKMIEYETACIQIDLLGLIHKEDWVNENIW 230 (235) Q Consensus 152 ~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~-~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~~f~ 230 (235) +=+. +=-.|+...+||++..||.+=-++=. ....+++.+.+|+|+.+..+-...|-..+ ....+. T Consensus 132 ~v~d-----------~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~---~~~~~~ 197 (223) T protein:vir:10 132 LVKD-----------TTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQR---KTKPAH 197 (223) T ss_pred cCCC-----------hhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhcccccC---cccccc Confidence 4321 22479999999999999998666533 33455777889999998777444443222 233333 Q ss_pred ecCcC Q lcl|NC_020849. 231 RNGWV 235 (235) Q Consensus 231 ~rGwv 235 (235) .-.|+ T Consensus 198 ~~~~l 202 (223) T protein:vir:10 198 MPEWM 202 (223) T ss_pred cchhh Confidence 44444 No 10 >protein:vir:98502 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996576;genbank:gi:45569507;genbank:GeneID:2767830 Probab=58.09 E-value=0.42 Score=22.65 Aligned_cols=192 Identities=16% Similarity=0.218 Sum_probs=97.7 Q ss_pred Cc-HHhhHhhhhhhhhhccccccC---CccccCccchhHHHHHHHHHHHHHHh-hcccccccEEEEEecCceeccccchh Q lcl|NC_020849. 1 MK-LSEFFSLLTYGELANLKVGGK---DCGGIYPKYADEVTSYIRQGLTDLHS-RFALKHSEVIVQQFEHITLYPLRSDY 75 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~---d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~v~~~e~~t~YpL~~~y 75 (235) |. .-+ ++++|++-|-+=.++.+ +-+.-....-..+-..+...+++.|- .|..|--.+ . + T Consensus 1 M~S~v~-IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~L--a--------~----- 64 (223) T protein:vir:98 1 MASEVD-ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL--A--------A----- 64 (223) T ss_pred CCCHHH-HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh--h--------h----- Confidence 44 333 67777766644444422 12222333334445556666776662 333332222 1 1 Q ss_pred ccccCCCCc-ccceeccCCcccchhHHHHHHhhhcccCcccccC-CCCCceee-eec-CCceEEEEecCCCceEEEEeec Q lcl|NC_020849. 76 AVSNTSSTE-PYKWISDTIERPFQDDIILIESVIDEGGNEIKLN-TENDILSV-YSP-QPDVLQIVSPKNENAVAVMYKA 151 (235) Q Consensus 76 a~~~~~~~~-~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lN-d~~~~~s~-~tP-~~~~lQi~~P~~~~~l~V~YqA 151 (235) ...++++ .|+| ..+.|-|||.+|.+..-..+.-. +...+..+ ++. ....| +. +...+.+.|-+ T Consensus 65 --~a~p~~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i--~t--d~~~~~l~Y~~ 131 (223) T protein:vir:98 65 --MGISRPEWRFAY-------AQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADII--LT--NQVNAVARYIS 131 (223) T ss_pred --cccCCCCccccc-------cccccceeeeeeccccccccccccccccceEEeeccccceee--ee--cCCceEEEEee Confidence 1122222 1333 34678999999987643322221 11111111 111 11233 22 33456677755 Q ss_pred cCcceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCC-ccchHHHHHHHHHHHHHHHHHHHcCceecCCCceeeee Q lcl|NC_020849. 152 NHTKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGS-IEGFQTGASKMIEYETACIQIDLLGLIHKEDWVNENIW 230 (235) Q Consensus 152 ~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~-~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~~f~ 230 (235) +=+. +=-.|+...+||++..||.+=-++=. ....+++.+.+|+|+.+..+-...|-..+ ....+. T Consensus 132 ~v~d-----------~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e~---~~~~~~ 197 (223) T protein:vir:98 132 LVKD-----------TTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQR---KTKPAH 197 (223) T ss_pred cCCC-----------hhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhcccccC---cccccc Confidence 4321 22479999999999999998666533 33455777889999998777444443222 233333 Q ss_pred ecCcC Q lcl|NC_020849. 231 RNGWV 235 (235) Q Consensus 231 ~rGwv 235 (235) .-.|+ T Consensus 198 ~~~~l 202 (223) T protein:vir:98 198 MPEWM 202 (223) T ss_pred cchhh Confidence 44444 No 11 >protein:vir:93692 Length: 197 # NCBI annotation: Bcep22gp58 # Family: family:all:1423 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944287;genbank:gi:38640364;genbank:GeneID:2658345 Probab=45.96 E-value=0.75 Score=21.26 Aligned_cols=165 Identities=11% Similarity=0.138 Sum_probs=85.0 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhh-cccccccEEEEEecCceeccccchhcccc Q lcl|NC_020849. 1 MKLSEFFSLLTYGELANLKVGGKDCGGIYPKYADEVTSYIRQGLTDLHSR-FALKHSEVIVQQFEHITLYPLRSDYAVSN 79 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~R-F~Lk~~~i~v~~~e~~t~YpL~~~ya~~~ 79 (235) |+++|+++.-.. +|.. . +=.-=.-++|+.++|+++.+..-| =+...+...++...| |.+.|= T Consensus 3 ~t~~~lI~r~~~-~L~D-------~-~~~rW~~~el~dwlNdAv~ei~l~rPda~~~~~~i~l~aG-t~q~LP------- 65 (197) T protein:vir:93 3 IAATDLIARAGN-VLQD-------E-DHIRWEVPELIEWINDAARETIVRRPAARSVAAVLELAAG-TRQAIP------- 65 (197) T ss_pred ccHHHHHHHHHH-hhcc-------c-cccccCcHHHHHHHHHHHHHHHhhCCCCCcceeEEeeccC-cccccc------- Confidence 999999987542 3322 1 113334689999999999998776 223333667777777 433331 Q ss_pred CCCCcccceeccCCcccchhHHHHHHhhhcc-------cCccc------ccCCCCCc--------------eeeeecCCc Q lcl|NC_020849. 80 TSSTEPYKWISDTIERPFQDDIILIESVIDE-------GGNEI------KLNTENDI--------------LSVYSPQPD 132 (235) Q Consensus 80 ~~~~~~~~~I~d~~~kpF~dd~lkI~~V~d~-------~G~~~------~lNd~~~~--------------~s~~tP~~~ 132 (235) .|.++.+.|+.+ .|+.+ -|+...=- .+..+|+ T Consensus 66 -------------------~~~~~Li~vvrn~~~~~~~~gr~vr~vsre~LD~~~P~W~~~~~~~~v~~y~~~e~~p~-- 124 (197) T protein:vir:93 66 -------------------ERGVELLDVVRNMGADGVTPGRIVRRVDRQLLDDQNPDWHAARAKNVVKHFTFDERAPR-- 124 (197) T ss_pred -------------------hhHHHHHHHHHhhhhcccCCCcccccccHHHhcccCCCCccCCCCCceEEEEeecCCCc-- Confidence 112222222211 12211 11111111 1222332 Q ss_pred eEEEEec-CCCceEEEEeeccCcceecccCCCCcccccCchHHHHHHH-HHHHHHHhccCCCccchHHHHHHHHHHHHHH Q lcl|NC_020849. 133 VLQIVSP-KNENAVAVMYKANHTKIDLSTDVPSNIEIEIPPQLVRPLA-LYVSSLAHTSVGSIEGFQTGASKMIEYETAC 210 (235) Q Consensus 133 ~lQi~~P-~~~~~l~V~YqA~h~~l~~~~d~~~~~~IdLP~~l~~AL~-~~VAsr~~t~i~~~e~~ak~~~y~q~Ye~~c 210 (235) .+=+|=| ...-.|.+.|-+-++.+. .+.+..+||..|.++|. +|+-||+|+-=+....-| -- | T Consensus 125 ~~~vyP~p~~~~~ve~~~~r~P~~v~-----~~~~~~~ipeiy~~~lv~~~~lyRa~sKd~~~~a~a--~~--------~ 189 (197) T protein:vir:93 125 IFYVYPPAVAGTKVETLHSELPPAIA-----ASGDTLDMGAEYMNVLVSTSATARCRRTASSRTARS--PP--------C 189 (197) T ss_pred EEEEeecCCCCceEEEEEeeCChhhh-----ccCCCCCCchhhhhhhHHhhhhhhhcCCCCCCcccC--Cc--------c Confidence 3333222 222447777777777663 24577779999999987 699999998754322111 10 1 Q ss_pred HHHHHcCceecCCC Q lcl|NC_020849. 211 IQIDLLGLIHKEDW 224 (235) Q Consensus 211 ~~l~~~~L~~~~~~ 224 (235) -+...+.. T Consensus 190 ------~~~~~~~~ 197 (197) T protein:vir:93 190 ------TIRRSSTP 197 (197) T ss_pred ------ccccCCCC Confidence 01111111 No 12 >protein:vir:95323 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512268;genbank:gi:89152435;genbank:GeneID:3952992 Probab=42.23 E-value=0.89 Score=20.85 Aligned_cols=189 Identities=10% Similarity=0.086 Sum_probs=103.7 Q ss_pred Cc-HHhhHhhhhhhhhhccc-cccCCccccCccchhHHHHHHHHHHHHHHh-hcccccccEEEEEecCceeccccchhcc Q lcl|NC_020849. 1 MK-LSEFFSLLTYGELANLK-VGGKDCGGIYPKYADEVTSYIRQGLTDLHS-RFALKHSEVIVQQFEHITLYPLRSDYAV 77 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~-iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~v~~~e~~t~YpL~~~ya~ 77 (235) |. .-+ ++++|++-|-|-. |-+-+-+.-....-..+-..+...+++.|- +|.-|--.+ . + T Consensus 1 M~S~v~-IcN~AL~~iG~a~~I~s~~e~s~~A~~C~~~Y~~~r~~~L~~~pW~FA~~r~~L--a--------~------- 62 (201) T protein:vir:95 1 MASVVE-ICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVAL--A--------D------- 62 (201) T ss_pred CCCHHH-HHHHHHHHhCCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhhhhhhhc--c--------c------- Confidence 44 333 6788888877643 333343444444445555666677777663 455443332 1 1 Q ss_pred ccCCCCc-ccceeccCCcccchhHHHHHHhhhcccCcccccCCCCCceeee---ecCCceEEEEecCCCceEEEEeeccC Q lcl|NC_020849. 78 SNTSSTE-PYKWISDTIERPFQDDIILIESVIDEGGNEIKLNTENDILSVY---SPQPDVLQIVSPKNENAVAVMYKANH 153 (235) Q Consensus 78 ~~~~~~~-~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lNd~~~~~s~~---tP~~~~lQi~~P~~~~~l~V~YqA~h 153 (235) ...++.+ .|+| ..+.|-|||.+|.+...+ .+-.+..-++.+. .-+-..| ++ +...+.+.|-++- T Consensus 63 ~a~~~~~~~yay-------~LP~Dclrv~~v~~~g~~-~~~~~~~~~f~v~~~~~~~g~~l--~t--d~~~~~l~Yv~~v 130 (201) T protein:vir:95 63 TSNPPPDWEYAY-------QYPSDCLRITEIMLPGVR-NPTAAMRVQYEVGADTNGTGKLI--YT--DQPQAWLKYVSRV 130 (201) T ss_pred ccCCCCCCcccc-------cccchhhhhhhhccCCcc-ccccccchhhhccccccccCcee--ee--cCCceEEEEeecC Confidence 1111222 2333 346789999999765322 1111111111111 0011223 22 2334556765543 Q ss_pred cceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecCCCceeeeeecC Q lcl|NC_020849. 154 TKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGSIEGFQTGASKMIEYETACIQIDLLGLIHKEDWVNENIWRNG 233 (235) Q Consensus 154 ~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~~f~~rG 233 (235) +. +=..|+...+||++..|+.+=-++= .+..++.+.+|+|+.+..+-...|-. ....+.+..-. T Consensus 131 ~d-----------~~~fd~~F~~ala~~LAa~la~plt--~~~~~~~~~~q~~~~~l~~A~~~da~---e~~~~~~~~~~ 194 (201) T protein:vir:95 131 TD-----------VNMFDAIFMEALAWRLAAAINMALT--GNADLGTFALNMYNRVILSAGSHSQN---ESQEPQPPVDE 194 (201) T ss_pred CC-----------hhhccHHHHHHHHHHHHHHhhHhhc--CChHHHHHHHHHHHHHHHHHHhcccc---cCcccCCCcch Confidence 21 1247999999999999999876664 55668999999999987654433332 22344667788 Q ss_pred cC Q lcl|NC_020849. 234 WV 235 (235) Q Consensus 234 wv 235 (235) |+ T Consensus 195 ~l 196 (201) T protein:vir:95 195 FT 196 (201) T ss_pred hh Confidence 99 No 13 >protein:vir:7328 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848219;genbank:gi:30387390;genbank:GeneID:2641866 Probab=31.33 E-value=1.5 Score=19.61 Aligned_cols=189 Identities=10% Similarity=0.080 Sum_probs=100.4 Q ss_pred Cc-HHhhHhhhhhhhhhccc-cccCCccccCccchhHHHHHHHHHHHHHHh-hcccccccEEEEEecCceeccccchhcc Q lcl|NC_020849. 1 MK-LSEFFSLLTYGELANLK-VGGKDCGGIYPKYADEVTSYIRQGLTDLHS-RFALKHSEVIVQQFEHITLYPLRSDYAV 77 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~-iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~v~~~e~~t~YpL~~~ya~ 77 (235) |. .-+ ++++|++-|-+-. |-.-+-+.-....-..+-..+...+++.|- .|.-|--.+ . + T Consensus 1 M~S~v~-IcN~AL~~iG~a~~I~s~~e~s~~A~~c~~~Y~~~r~~~Lr~~pW~FA~~r~~L--a--------~------- 62 (201) T protein:vir:73 1 MASVIE-ICNRALSNIGNSRSINSLIEASKEAGQCSLHFDACRDAALADFDWNFATKRVAL--A--------D------- 62 (201) T ss_pred CCCHHH-HHHHHHHhhcCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh--h--------h------- Confidence 43 333 6788888776543 443344444444445555566666766663 444333222 1 1 Q ss_pred ccCCCCc-ccceeccCCcccchhHHHHHHhhhcccCcccccCCCCCceee---eecCCceEEEEecCCCceEEEEeeccC Q lcl|NC_020849. 78 SNTSSTE-PYKWISDTIERPFQDDIILIESVIDEGGNEIKLNTENDILSV---YSPQPDVLQIVSPKNENAVAVMYKANH 153 (235) Q Consensus 78 ~~~~~~~-~~~~I~d~~~kpF~dd~lkI~~V~d~~G~~~~lNd~~~~~s~---~tP~~~~lQi~~P~~~~~l~V~YqA~h 153 (235) ...++.+ .|+| ..+.|-|||.+|.+....-.+-... .|.-+ +.-.-..|.. +...+.+.|-++- T Consensus 63 ~a~~p~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~-~~~~~~~~~~ieg~~i~t----d~~~~~l~Y~~~v 130 (201) T protein:vir:73 63 TNNPPPDWQYAY-------QYPSDCVRITEIMPTGIRNPTAAQR-IEYVVGSNEDLTGKLIYT----DQPKAWLKYMARV 130 (201) T ss_pred cccCCCCCcccc-------cccccceeeeeeccccccccccccc-cchhccccccccCCEeee----cCCceeEEEeecC Confidence 1122222 1333 4567899999998655431111100 00001 1111122322 2234446665432 Q ss_pred cceecccCCCCcccccCchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecCCCceeeeeecC Q lcl|NC_020849. 154 TKIDLSTDVPSNIEIEIPPQLVRPLALYVSSLAHTSVGSIEGFQTGASKMIEYETACIQIDLLGLIHKEDWVNENIWRNG 233 (235) Q Consensus 154 ~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~Ye~~c~~l~~~~L~~~~~~~n~~f~~rG 233 (235) + .+=..|+...+||++..||.+=-++= .+...+.+.+|+|+.+..+-...|-..+ ....+.... T Consensus 131 ~-----------d~~~fd~lF~~ala~~LAa~lA~plt--~~~~~~~~~~q~~~~~~~~A~~~d~~e~---~~~~~~~~~ 194 (201) T protein:vir:73 131 T-----------DVNMYDAIFMEALSWRLAAAINMALT--GSADLGNNALTMYNRVILSAGSHSQNES---QEPQPPVDE 194 (201) T ss_pred C-----------CcccccHHHHHHHHHHHHHHhhHhhc--CChHHHHHHHHHHHHHHHHHHHhhhccc---cCCCCCCch Confidence 2 12247999999999999999876663 4456788899999987766433333222 233566677 Q ss_pred cC Q lcl|NC_020849. 234 WV 235 (235) Q Consensus 234 wv 235 (235) |+ T Consensus 195 ~l 196 (201) T protein:vir:73 195 FT 196 (201) T ss_pred HH Confidence 87 Done!