Query lcl|NC_020848.1_cdsid_YP_007674158.1 [gene=VPNG_00063] [protein=hypothetical protein] [protein_id=YP_007674158.1] [location=complement(56663..57370)] Match_columns 235 No_of_seqs 12 out of 14 Neff 3.9 Searched_HMMs 1612 Date Thu Nov 7 15:54:02 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_63 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_63_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95880 Length: 236 100.0 3E-107 2E-110 604.5 17.9 233 1-235 2-236 (236) 2 protein:vir:105573 Length: 225 98.2 2.6E-07 1.6E-10 56.7 15.2 212 1-235 1-225 (225) 3 protein:vir:5111 Length: 234 # 97.5 2E-05 1.2E-08 46.3 14.3 205 1-232 2-234 (234) 4 protein:vir:3302 Length: 216 # 97.0 0.00017 1E-07 41.3 15.0 199 1-234 2-216 (216) 5 protein:vir:823 Length: 216 # 97.0 0.00017 1E-07 41.3 15.0 199 1-234 2-216 (216) 6 protein:vir:2779 Length: 216 # 97.0 0.00017 1E-07 41.3 15.0 199 1-234 2-216 (216) 7 protein:vir:103760 Length: 207 66.1 0.27 0.00017 23.7 10.2 186 1-235 1-190 (207) 8 protein:vir:7328 Length: 201 # 53.9 0.52 0.00032 22.2 7.9 190 1-235 1-196 (201) 9 protein:vir:98502 Length: 223 45.0 0.79 0.00049 21.2 9.6 192 1-235 1-202 (223) 10 protein:vir:107803 Length: 223 45.0 0.79 0.00049 21.2 9.6 192 1-235 1-202 (223) 11 protein:vir:107429 Length: 223 45.0 0.79 0.00049 21.2 9.6 192 1-235 1-202 (223) 12 protein:vir:95323 Length: 201 36.6 1.2 0.00072 20.2 10.2 189 1-235 1-196 (201) 13 protein:vir:93692 Length: 197 20.5 2.8 0.0017 18.2 11.2 178 1-224 3-197 (197) No 1 >protein:vir:95880 Length: 236 # NCBI annotation: 30 kDa protein # Family: family:all:31944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950545;genbank:gi:119952236;genbank:GeneID:5075707 Probab=100.00 E-value=3.1e-107 Score=604.48 Aligned_cols=233 Identities=28% Similarity=0.477 Sum_probs=231.3 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhcccccCcEEEEEcCCcccccccchhh-ccc Q lcl|NC_020848. 1 MKLSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHSRFALKHSEVIIQQYANITMYPLRYEYA-QSN 79 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF~Lk~~~i~i~~~eg~t~Y~L~~~ya-~s~ 79 (235) .+|+|+|+.||+||||||++||+|+|+|+|+++|+|++|+|+||||||+||.||+|||+|||.||+++|||+|+|| ||+ T Consensus 2 ~~lkev~~~La~gqL~N~~~V~~d~g~i~~~~~p~ii~a~N~gl~~Lh~Rf~lk~~~i~vem~eg~~~Y~L~~~y~v~~~ 81 (236) T protein:vir:95 2 YYIEELFCRLANGVLNNTGIVTDDRGDIEDDSKPFIIVAANEALTRLHGRFNMRNNNVVVEMQEGRTNYPLLAKYAVQSY 81 (236) T ss_pred chHHHHHHHHhcceecceeeeecccccccccccchHHHHHhHHHHHHhhhhhhccCcEEEEEeeCceecccchhhhhccC Confidence 8999999999999999999999999999999999999999999999999999999999999999999999999999 899 Q ss_pred CCCcccceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCceeEEecCCcEEEEecCCCceeEEEEeccCCCccccc Q lcl|NC_020848. 80 TTSTQPYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDKCSIYTPQPDVIQLTYPEDENAIAILYKANHAKIDLS 159 (235) Q Consensus 80 ~~~~~~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~~si~tP~~~~lql~~P~~~~~~~v~YqA~h~~l~~~ 159 (235) +||++++|||||...+.|.+|+||||+|+||+|+|++|||+++|||+|||++++|||++|++|+|++|+|||+||++.+| T Consensus 82 ~p~~~~~~fI~d~~~~~~~~~ilri~~V~dd~G~~~~Lnd~~~~~sv~~P~~nvLqi~~~~~~~~l~vkyq~~~~~l~~~ 161 (236) T protein:vir:95 82 DPNEVKCPFIMDLAGEKFAEDVIRILEVYDDKGRRRPLNDRNNPCSLFTPRPNVLQNNAPKAWEVLNVMYQAKHPKLSTA 161 (236) T ss_pred CCCCcccchhhccccchhHHHHHHHHhhccCCCcccccCCCCCCceeeeCCCcceeeecCCCcceEEEEeecCCCceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCcccccchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecC-CCcceeeeccCcC Q lcl|NC_020848. 160 ANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGSPEGMNTGATKMIEYETACLQIEVFGLIHKE-EWTNDRVGRNGWV 235 (235) Q Consensus 160 ~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~YE~~c~~le~~~L~~~~-~~tn~~f~~~Gwv 235 (235) +| .|++||||+||++||.|||||||+||||+|||||++.+|+++||+||++|++++|.+++ .+++++|++|||- T Consensus 162 eD--~~~~idlP~t~~~aL~~yVA~r~~T~ig~~EnTAk~~~y~~~Yes~c~~v~~~~l~s~~~v~~~~~f~r~Gw~ 236 (236) T protein:vir:95 162 ED--GYNEIDIPDTLDPALDAYIAYRYYTSLNTPESSAKAAEYLSFYDSICREVVEYDLTSDTEVDTNTLFRKRGWR 236 (236) T ss_pred eC--CcccccCCcchHHHHHHHHHHHhhccCCCcccchhhhhHHHHHHHHHhhHHhhccccccccccccccccCCCC Confidence 97 89999999999999999999999999999999999999999999999999999999999 7999999999999 No 2 >protein:vir:105573 Length: 225 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164310;genbank:gi:56692957;genbank:GeneID:3197184 Probab=98.21 E-value=2.6e-07 Score=56.67 Aligned_cols=212 Identities=12% Similarity=0.043 Sum_probs=119.4 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHh-----hcccccCcEEEEEcCCcccccccch- Q lcl|NC_020848. 1 MKLSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHS-----RFALKHSEVIIQQYANITMYPLRYE- 74 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~-----RF~Lk~~~i~i~~~eg~t~Y~L~~~- 74 (235) |+++++++.-.. +|. |...=.-=.-++|+.++|+++.+.+. |.+-..+...|++.+|..-|++... T Consensus 1 m~~~~lI~r~~~-~l~-------D~~~~~rW~~~el~~~lNdAv~e~~~r~rL~rpda~~~~~~i~l~~Gt~q~~~~~~~ 72 (225) T protein:vir:10 1 MTLADLIRRVRT-DAN-------DMVEPYFWSDQDVADWLNDAVREAAVRGRLIHESQADAVCRIEVVAGTAVYQLHASL 72 (225) T ss_pred CCHHHHHHHHHH-Hhc-------cccccccCChHHHHHHHHHHHHHHHHhcccccccCCCceeeeeecCccccccCchHH Confidence 999999987542 222 22211334568999999999999998 4566667778999999888877642 Q ss_pred hh---ccc-CCCcccceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCceeEEecCCcEEEEecCCC-ceeEEEEe Q lcl|NC_020848. 75 YA---QSN-TTSTQPYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDKCSIYTPQPDVIQLTYPED-ENAIAILY 149 (235) Q Consensus 75 ya---~s~-~~~~~~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~~si~tP~~~~lql~~P~~-~~~~~v~Y 149 (235) |- ..+ +.+..+.. ... .-.++. .|..=-.-. +++-.| --++=.++.+-+.-|.+ .-++.+.| T Consensus 73 ~~I~~~~~~~~~~~~~~-~~~---------~~s~e~-LD~~~P~W~-~~tg~p-~~~~~d~~~~~l~P~p~~~~~vel~~ 139 (225) T protein:vir:10 73 YELSHLGFYPADMSRPT-MPV---------LKSAEV-LDVELPEWR-ACTGKP-LYAIQGDTSLRLVPTPDRAGILRVEG 139 (225) T ss_pred HHHHHHhhcCcccCCce-ecc---------cccHHH-hcccCCCcc-cCCCCc-eEEEeCCcEEEEEecCCCceEEEEEE Confidence 10 000 00000000 000 000011 111100000 000011 11122345555543333 34466655 Q ss_pred ccCCC-ccccccCCCCCcccccchHHHHHHHHHHHHHHhccCC-CccchHHHHHHHHHHHHHHHHHHHcCceecCCCcce Q lcl|NC_020848. 150 KANHA-KIDLSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVG-SPEGMNTGATKMIEYETACLQIEVFGLIHKEEWTND 227 (235) Q Consensus 150 qA~h~-~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~-~~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~tn~ 227 (235) -+..- +... ...+....+||.-|.++|..|+-||+|+-=+ ...+.++|+.++|.++...+.-.+-++........- T Consensus 140 ~r~P~~~~~~--~~~D~~~p~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~alG~k~~ad~~r~~r~~~p 217 (225) T protein:vir:10 140 YRTPLADMAL--ADKDTAQPEIHAEHHRHLVQWALYRGFSIPDMESFDPNRAALAEAAFTAYFGERPDSDLRRITREDVP 217 (225) T ss_pred Eeecchhhhc--cccccccCccchhhHHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchhHHHHHhccccCc Confidence 55542 2321 1234667889999999999999999999865 446778999999999887766665555544444444 Q ss_pred eeeccCcC Q lcl|NC_020848. 228 RVGRNGWV 235 (235) Q Consensus 228 ~f~~~Gwv 235 (235) -+++.-|- T Consensus 218 ~~~~~~~~ 225 (225) T protein:vir:10 218 HHVEAFWP 225 (225) T ss_pred ccccccCC Confidence 44445555 No 3 >protein:vir:5111 Length: 234 # NCBI annotation: unknown # Family: family:all:1423 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542268;genbank:gi:18071240;genbank:GeneID:929347 Probab=97.47 E-value=2e-05 Score=46.34 Aligned_cols=205 Identities=15% Similarity=0.149 Sum_probs=106.1 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccCcEEEEEcCCc-cccccc----ch Q lcl|NC_020848. 1 MKLSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHSRF-ALKHSEVIIQQYANI-TMYPLR----YE 74 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~i~~~eg~-t~Y~L~----~~ 74 (235) ++++++++.-.. +|.. . .=.-=.-++|+.++|+++.+..-|= ....+...|++.+|. -.+|.+ +- T Consensus 2 ~t~~~lI~r~~~-~l~D-------~-~~~rW~~~el~~~lNdAv~e~~l~rp~a~~~~~~i~l~~Gt~q~lP~d~~~~~~ 72 (234) T protein:vir:51 2 PKASEIMRLAGI-QLLD-------E-DHIRWPLIELADWVNEGVKAIVLAKPSASSKSAAIQLVKGTHQTLPGTIDGKAT 72 (234) T ss_pred ccHHHHHHHHHH-Hhcc-------c-cccccChHHHHHHHHHHHHHHHhhcCCCCccceeEeeccCCccccccccccchh Confidence 899999887542 3322 1 1133456899999999999999774 445667799999995 344433 22 Q ss_pred hh----cccCCCcccceeeecCcccch---hHHHHHHH--HhhccCCccc------ccCCCCCceeEEecCCcEEEEecC Q lcl|NC_020848. 75 YA----QSNTTSTQPYKFIQDTVEAPF---QEDIILIE--SAVDEGGCEI------EINRENDKCSIYTPQPDVIQLTYP 139 (235) Q Consensus 75 ya----~s~~~~~~~~~~I~d~~~~~F---~~dilkI~--~V~d~~G~e~------~lNd~~~~~si~tP~~~~lql~~P 139 (235) ++ +-+-.+.... -+ ...++ ...+|--+ .-+.+.|... .+-|. .+| ..+-++=| T Consensus 73 l~li~i~rn~~s~~~~---~~-~grav~~vsre~LD~~~P~W~~~tg~P~~~~v~~y~~d~------~~p--~~~~l~P~ 140 (234) T protein:vir:51 73 LQLIGINRNLVSAAEP---RQ-GLRAIRTCARDVLDAQEPNWHTASYVPFRKEVRQVIYDE------NLP--TEFYVYPG 140 (234) T ss_pred eehhhhhhhhcccccc---cc-CcceeeecCHHHhcccCCCccccCCCCchhhhhhhhccC------CCC--eEEEEecc Confidence 21 1000000000 00 00000 00011000 0000111100 01111 111 12222222 Q ss_pred CCc-eeEEEEeccCCCccccccCCCC------CcccccchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHH Q lcl|NC_020848. 140 EDE-NAIAILYKANHAKIDLSANVPS------NIDIEIPAQLVRPLALYVSSLAHTAVGSPEGMNTGATKMIEYETACLQ 212 (235) Q Consensus 140 ~~~-~~~~v~YqA~h~~l~~~~d~~~------~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~YE~~c~~ 212 (235) .+. -.+.+.|-+....+++....++ ..+++||..|.++|..|+-||+|+.=....+.++|+.++|.++. T Consensus 141 p~~~g~v~~~~~r~P~~v~~~~~~d~~~~a~~~~~~~i~~~y~~~Lvdw~lyRa~skD~e~~d~~rA~~h~q~F~~---- 216 (234) T protein:vir:51 141 NDGSGFVEAAFSFLPTSVKVANGADPEKIASWDIDVGLPEPYSVPLLDYVLYRCHQKDDTAADLGKATSHYQLFAT---- 216 (234) T ss_pred CCCCceEEEEEEeecchhhhhhcCCccccccccccCCccchhhHHHHHHHhhhhcCccccccchHHHHHHHHHHHH---- Confidence 222 3355555444433322222222 35789999999999999999999985555777899999999965 Q ss_pred HHHcCceecCCCcceeeecc Q lcl|NC_020848. 213 IEVFGLIHKEEWTNDRVGRN 232 (235) Q Consensus 213 le~~~L~~~~~~tn~~f~~~ 232 (235) .+|++.++.....+=.|+ T Consensus 217 --~lG~k~~~d~~~~pn~r~ 234 (234) T protein:vir:51 217 --AVGIKVQSEGTSNPNRRR 234 (234) T ss_pred --HhCCcchhhhhccccccC Confidence 566666664333333333 No 4 >protein:vir:3302 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049518;genbank:gi:9632524;genbank:GeneID:1262013 Probab=97.01 E-value=0.00017 Score=41.28 Aligned_cols=199 Identities=11% Similarity=0.016 Sum_probs=110.4 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccCcEEEEEcCCcccccccchhh--- Q lcl|NC_020848. 1 MKLSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHSRF-ALKHSEVIIQQYANITMYPLRYEYA--- 76 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~i~~~eg~t~Y~L~~~ya--- 76 (235) ++++++++.-. .+|... + + .-=.-++|+.++|+++.+..-|= ....+...|++.+|+.-|.+...+. T Consensus 2 ~t~~~lI~r~~-~~l~D~-----~-~--~rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~ 72 (216) T protein:vir:33 2 TTITEIIGRVN-TQLVDP-----M-M--VRWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLLD 72 (216) T ss_pred ccHHHHHHHHH-Hhhhcc-----c-c--cccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhhh Confidence 89999998753 344331 1 1 12235779999999999999885 5667778899999988776654431 Q ss_pred -cccCCCcccceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCc------eeEEecC-CcEEEEecCCCc-eeEEE Q lcl|NC_020848. 77 -QSNTTSTQPYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDK------CSIYTPQ-PDVIQLTYPEDE-NAIAI 147 (235) Q Consensus 77 -~s~~~~~~~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~------~si~tP~-~~~lql~~P~~~-~~~~v 147 (235) ..+- +++..-+. ..++|- . .+++|-. |-++... |...=++=|.+. -.+.+ T Consensus 73 v~r~~--~g~a~~~v-------sre~LD-----~-------~~P~W~~~~g~p~~~i~de~~pr~f~l~P~p~~~~~vel 131 (216) T protein:vir:33 73 VICLS--DGSAVRPL-------SREVLD-----A-------QYPEWPTMKGIPECFISNDLSPRVFWLFPAPDKEISIDA 131 (216) T ss_pred hhhhC--CCCceeee-------cHHHhc-----c-------cCCCcCCCCCCceEEEecCCCceEEEEeccCCCCcEEEE Confidence 1110 00000000 011111 0 0111111 2233322 222222222222 33566 Q ss_pred EeccCCCccc-cccCCCCCcccccchHHHHHHHHHHHHHHhccCCC-ccchHHHHHHHHHHHHHHHHHHHcCceecCCCc Q lcl|NC_020848. 148 LYKANHAKID-LSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGS-PEGMNTGATKMIEYETACLQIEVFGLIHKEEWT 225 (235) Q Consensus 148 ~YqA~h~~l~-~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~-~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~t 225 (235) .|-+...... ... ++...++||..|.++|..|+-||+|+-=.. ..+.++|+.++|.++...+.--+-+... .+ T Consensus 132 ~~~r~P~a~~~~~~--~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~~~---~~ 206 (216) T protein:vir:33 132 VVSRIPEAVYVLTQ--DDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADSAL---YA 206 (216) T ss_pred EEEecCcchhhccC--CCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHhHH---HH Confidence 6655543221 222 234578999999999999999999998776 3777899999999987766433332200 11 Q ss_pred ce-eeeccCc Q lcl|NC_020848. 226 ND-RVGRNGW 234 (235) Q Consensus 226 n~-~f~~~Gw 234 (235) -. -++-+|- T Consensus 207 r~~~~~~~~~ 216 (216) T protein:vir:33 207 RKKVFNGGGV 216 (216) T ss_pred HHhhccCCCC Confidence 11 2333343 No 5 >protein:vir:823 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050556;genbank:gi:9633453;genbank:GeneID:1262282 Probab=97.01 E-value=0.00017 Score=41.28 Aligned_cols=199 Identities=11% Similarity=0.016 Sum_probs=110.4 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccCcEEEEEcCCcccccccchhh--- Q lcl|NC_020848. 1 MKLSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHSRF-ALKHSEVIIQQYANITMYPLRYEYA--- 76 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~i~~~eg~t~Y~L~~~ya--- 76 (235) ++++++++.-. .+|... + + .-=.-++|+.++|+++.+..-|= ....+...|++.+|+.-|.+...+. T Consensus 2 ~t~~~lI~r~~-~~l~D~-----~-~--~rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~ 72 (216) T protein:vir:82 2 TTITEIIGRVN-TQLVDP-----M-M--VRWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLLD 72 (216) T ss_pred ccHHHHHHHHH-Hhhhcc-----c-c--cccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhhh Confidence 89999998753 344331 1 1 12235779999999999999885 5667778899999988776654431 Q ss_pred -cccCCCcccceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCc------eeEEecC-CcEEEEecCCCc-eeEEE Q lcl|NC_020848. 77 -QSNTTSTQPYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDK------CSIYTPQ-PDVIQLTYPEDE-NAIAI 147 (235) Q Consensus 77 -~s~~~~~~~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~------~si~tP~-~~~lql~~P~~~-~~~~v 147 (235) ..+- +++..-+. ..++|- . .+++|-. |-++... |...=++=|.+. -.+.+ T Consensus 73 v~r~~--~g~a~~~v-------sre~LD-----~-------~~P~W~~~~g~p~~~i~de~~pr~f~l~P~p~~~~~vel 131 (216) T protein:vir:82 73 VICLS--DGSAVRPL-------SREVLD-----A-------QYPEWPTMKGIPECFISNDLSPRVFWLFPAPDKEISIDA 131 (216) T ss_pred hhhhC--CCCceeee-------cHHHhc-----c-------cCCCcCCCCCCceEEEecCCCceEEEEeccCCCCcEEEE Confidence 1110 00000000 011111 0 0111111 2233322 222222222222 33566 Q ss_pred EeccCCCccc-cccCCCCCcccccchHHHHHHHHHHHHHHhccCCC-ccchHHHHHHHHHHHHHHHHHHHcCceecCCCc Q lcl|NC_020848. 148 LYKANHAKID-LSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGS-PEGMNTGATKMIEYETACLQIEVFGLIHKEEWT 225 (235) Q Consensus 148 ~YqA~h~~l~-~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~-~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~t 225 (235) .|-+...... ... ++...++||..|.++|..|+-||+|+-=.. ..+.++|+.++|.++...+.--+-+... .+ T Consensus 132 ~~~r~P~a~~~~~~--~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~~~---~~ 206 (216) T protein:vir:82 132 VVSRIPEAVYVLTQ--DDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADSAL---YA 206 (216) T ss_pred EEEecCcchhhccC--CCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHhHH---HH Confidence 6655543221 222 234578999999999999999999998776 3777899999999987766433332200 11 Q ss_pred ce-eeeccCc Q lcl|NC_020848. 226 ND-RVGRNGW 234 (235) Q Consensus 226 n~-~f~~~Gw 234 (235) -. -++-+|- T Consensus 207 r~~~~~~~~~ 216 (216) T protein:vir:82 207 RKKVFNGGGV 216 (216) T ss_pred HHhhccCCCC Confidence 11 2333343 No 6 >protein:vir:2779 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612896;genbank:gi:20065813;genbank:GeneID:935638 Probab=97.01 E-value=0.00017 Score=41.28 Aligned_cols=199 Identities=11% Similarity=0.016 Sum_probs=110.4 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccCcEEEEEcCCcccccccchhh--- Q lcl|NC_020848. 1 MKLSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHSRF-ALKHSEVIIQQYANITMYPLRYEYA--- 76 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~RF-~Lk~~~i~i~~~eg~t~Y~L~~~ya--- 76 (235) ++++++++.-. .+|... + + .-=.-++|+.++|+++.+..-|= ....+...|++.+|+.-|.+...+. T Consensus 2 ~t~~~lI~r~~-~~l~D~-----~-~--~rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI~ 72 (216) T protein:vir:27 2 TTITEIIGRVN-TQLVDP-----M-M--VRWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLLD 72 (216) T ss_pred ccHHHHHHHHH-Hhhhcc-----c-c--cccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhhh Confidence 89999998753 344331 1 1 12235779999999999999885 5667778899999988776654431 Q ss_pred -cccCCCcccceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCc------eeEEecC-CcEEEEecCCCc-eeEEE Q lcl|NC_020848. 77 -QSNTTSTQPYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDK------CSIYTPQ-PDVIQLTYPEDE-NAIAI 147 (235) Q Consensus 77 -~s~~~~~~~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~------~si~tP~-~~~lql~~P~~~-~~~~v 147 (235) ..+- +++..-+. ..++|- . .+++|-. |-++... |...=++=|.+. -.+.+ T Consensus 73 v~r~~--~g~a~~~v-------sre~LD-----~-------~~P~W~~~~g~p~~~i~de~~pr~f~l~P~p~~~~~vel 131 (216) T protein:vir:27 73 VICLS--DGSAVRPL-------SREVLD-----A-------QYPEWPTMKGIPECFISNDLSPRVFWLFPAPDKEISIDA 131 (216) T ss_pred hhhhC--CCCceeee-------cHHHhc-----c-------cCCCcCCCCCCceEEEecCCCceEEEEeccCCCCcEEEE Confidence 1110 00000000 011111 0 0111111 2233322 222222222222 33566 Q ss_pred EeccCCCccc-cccCCCCCcccccchHHHHHHHHHHHHHHhccCCC-ccchHHHHHHHHHHHHHHHHHHHcCceecCCCc Q lcl|NC_020848. 148 LYKANHAKID-LSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGS-PEGMNTGATKMIEYETACLQIEVFGLIHKEEWT 225 (235) Q Consensus 148 ~YqA~h~~l~-~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~-~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~t 225 (235) .|-+...... ... ++...++||..|.++|..|+-||+|+-=.. ..+.++|+.++|.++...+.--+-+... .+ T Consensus 132 ~~~r~P~a~~~~~~--~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~~~---~~ 206 (216) T protein:vir:27 132 VVSRIPEAVYVLTQ--DDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADSAL---YA 206 (216) T ss_pred EEEecCcchhhccC--CCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHhHH---HH Confidence 6655543221 222 234578999999999999999999998776 3777899999999987766433332200 11 Q ss_pred ce-eeeccCc Q lcl|NC_020848. 226 ND-RVGRNGW 234 (235) Q Consensus 226 n~-~f~~~Gw 234 (235) -. -++-+|- T Consensus 207 r~~~~~~~~~ 216 (216) T protein:vir:27 207 RKKVFNGGGV 216 (216) T ss_pred HHhhccCCCC Confidence 11 2333343 No 7 >protein:vir:103760 Length: 207 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Probab=66.07 E-value=0.27 Score=23.67 Aligned_cols=186 Identities=10% Similarity=0.109 Sum_probs=103.4 Q ss_pred Cc-HHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHh-hcccccCcEEEEEcCCcccccccchhhcc Q lcl|NC_020848. 1 MK-LSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHS-RFALKHSEVIIQQYANITMYPLRYEYAQS 78 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~i~~~eg~t~Y~L~~~ya~s 78 (235) |. .-+ ++++|++.|-+=.|.+-+-+......-..+-..+...+++.|- .|..|--.+ . .. T Consensus 1 M~S~v~-IcN~AL~~lGa~~I~s~~e~s~~A~~c~~~Y~~~r~~~L~~~pW~FA~~r~~L--a---------------~~ 62 (207) T protein:vir:10 1 MASQVG-ICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAHVWSFTKARAQL--A---------------AL 62 (207) T ss_pred CCCHHH-HHHHHHHhhchhhhcccccCCHHHHHHHHhhHHHHHHHHhccChhhHhhhhhh--c---------------cc Confidence 44 333 7888888887755554454555555555666677777887773 343333322 1 11 Q ss_pred cCCCc-c-cceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCceeEEecCCcEEEEecCCCceeEEEEeccCCCcc Q lcl|NC_020848. 79 NTTST-Q-PYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDKCSIYTPQPDVIQLTYPEDENAIAILYKANHAKI 156 (235) Q Consensus 79 ~~~~~-~-~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~~si~tP~~~~lql~~P~~~~~~~v~YqA~h~~l 156 (235) ..++. + .|+| ..+.|-|||..|.+...-.. ..+...|.-.-..|..+... .+.+.|-++=+ T Consensus 63 ~~~P~~~~~yaY-------~LP~Dclrv~~v~~~~~~~~-----~~~~~~~~v~g~~ll~~~~~---~~~l~Y~~~v~-- 125 (207) T protein:vir:10 63 AEAPLFGFSYQY-------RLPTDFIRLLQVGQFDVYPR-----TDTRGLFSIENGNILTDMQA---PLYIRYAKRVT-- 125 (207) T ss_pred ccCCCCCCcccc-------cCcccceEeeeecCCCCccc-----cccccceEecCCeEEecCCC---cEEEEEeecCC-- Confidence 11111 1 2222 12566788888887643211 11112233222333222111 13333333211 Q ss_pred ccccCCCCCcccccchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecCCCcceeeeccCcC Q lcl|NC_020848. 157 DLSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGSPEGMNTGATKMIEYETACLQIEVFGLIHKEEWTNDRVGRNGWV 235 (235) Q Consensus 157 ~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~tn~~f~~~Gwv 235 (235) .+=..|....+||.+..|+.+=-++ .++..++...+|+|+.+..+-...|- .+....++..-+|+ T Consensus 126 ---------d~~~fd~~F~~ala~~LAa~lA~pL--t~~~~~~~~~~q~~~~~l~~A~~~da---~e~~~~~~~~~~~l 190 (207) T protein:vir:10 126 ---------DPNAMDALFREAFACRLAAEACESL--TQSATKRQGAWAEHDQAIAAAIRVNA---IERPAQPLGDDTWL 190 (207) T ss_pred ---------ChhhhhHHHHHHHHHHHHHHhhHhh--cCChHHHHHHHHHHHHHHHHHHhccc---ccCcccccCCcchh Confidence 1224789999999999999988777 46778999999999887766655443 33344555566666 No 8 >protein:vir:7328 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848219;genbank:gi:30387390;genbank:GeneID:2641866 Probab=53.88 E-value=0.52 Score=22.15 Aligned_cols=190 Identities=13% Similarity=0.114 Sum_probs=99.0 Q ss_pred Cc-HHhhHhhhhhhhhhccc-cccCCccccCccchhHHHHHHHHHHHHHHh-hcccccCcEEEEEcCCcccccccchhhc Q lcl|NC_020848. 1 MK-LSEFFSLLSYGELANLK-VGGKDCGGIYPKHSDEVVSYIRQGLTDLHS-RFALKHSEVIIQQYANITMYPLRYEYAQ 77 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~-iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~i~~~eg~t~Y~L~~~ya~ 77 (235) |. .-+ ++++|++-|-+-. |-.-+-+.-....-..+-..+...+++.|- .|.-|--.+ +. T Consensus 1 M~S~v~-IcN~AL~~iG~a~~I~s~~e~s~~A~~c~~~Y~~~r~~~Lr~~pW~FA~~r~~L-----------------a~ 62 (201) T protein:vir:73 1 MASVIE-ICNRALSNIGNSRSINSLIEASKEAGQCSLHFDACRDAALADFDWNFATKRVAL-----------------AD 62 (201) T ss_pred CCCHHH-HHHHHHHhhcCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh-----------------hh Confidence 43 333 6788888776543 443344444444444455566666666663 454443333 11 Q ss_pred ccCCCcc-cceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCce--eEEecCCcEEEEecCCCceeEEEEeccCCC Q lcl|NC_020848. 78 SNTTSTQ-PYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDKC--SIYTPQPDVIQLTYPEDENAIAILYKANHA 154 (235) Q Consensus 78 s~~~~~~-~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~~--si~tP~~~~lql~~P~~~~~~~v~YqA~h~ 154 (235) ..++..+ .|+| ..+.|-|||..|.+....-++-...-..+ +.+.-.-..|....| .+.+.|-+.= T Consensus 63 ~a~~p~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~~~~ieg~~i~td~~----~~~l~Y~~~v- 130 (201) T protein:vir:73 63 TNNPPPDWQYAY-------QYPSDCVRITEIMPTGIRNPTAAQRIEYVVGSNEDLTGKLIYTDQP----KAWLKYMARV- 130 (201) T ss_pred cccCCCCCcccc-------cccccceeeeeeccccccccccccccchhccccccccCCEeeecCC----ceeEEEeecC- Confidence 1121111 2222 34678899999987654322111110000 001111122222222 1223333221 Q ss_pred ccccccCCCCCcccccchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecCCCcceeeeccCc Q lcl|NC_020848. 155 KIDLSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGSPEGMNTGATKMIEYETACLQIEVFGLIHKEEWTNDRVGRNGW 234 (235) Q Consensus 155 ~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~tn~~f~~~Gw 234 (235) +.+=..|....+||....||.+=-++ .++...+...+|+|+.+..+-...|-.-+ ....+....| T Consensus 131 ----------~d~~~fd~lF~~ala~~LAa~lA~pl--t~~~~~~~~~~q~~~~~~~~A~~~d~~e~---~~~~~~~~~~ 195 (201) T protein:vir:73 131 ----------TDVNMYDAIFMEALSWRLAAAINMAL--TGSADLGNNALTMYNRVILSAGSHSQNES---QEPQPPVDEF 195 (201) T ss_pred ----------CCcccccHHHHHHHHHHHHHHhhHhh--cCChHHHHHHHHHHHHHHHHHHHhhhccc---cCCCCCCchH Confidence 11224789999999999999986665 35566888899999887765555444333 3336666778 Q ss_pred C Q lcl|NC_020848. 235 V 235 (235) Q Consensus 235 v 235 (235) + T Consensus 196 l 196 (201) T protein:vir:73 196 T 196 (201) T ss_pred H Confidence 7 No 9 >protein:vir:98502 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996576;genbank:gi:45569507;genbank:GeneID:2767830 Probab=45.01 E-value=0.79 Score=21.16 Aligned_cols=192 Identities=15% Similarity=0.160 Sum_probs=95.2 Q ss_pred Cc-HHhhHhhhhhhhhhccccccC---CccccCccchhHHHHHHHHHHHHHHh-hcccccCcEEEEEcCCcccccccchh Q lcl|NC_020848. 1 MK-LSEFFSLLSYGELANLKVGGK---DCGGIYPKHSDEVVSYIRQGLTDLHS-RFALKHSEVIIQQYANITMYPLRYEY 75 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~---d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~i~~~eg~t~Y~L~~~y 75 (235) |. .-+ ++++|++-|-+=.++.+ +-+.-....-..+-..+...+++.|- .|..|--.+ T Consensus 1 M~S~v~-IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~L----------------- 62 (223) T protein:vir:98 1 MASEVD-ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL----------------- 62 (223) T ss_pred CCCHHH-HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh----------------- Confidence 44 333 67777766644444422 11222222333444556666676662 344443333 Q ss_pred hcccCCCcc-cceeeecCcccchhHHHHHHHHhhccCCcccccC-CCCCce-eEEec-CCcEEEEecCCCceeEEEEecc Q lcl|NC_020848. 76 AQSNTTSTQ-PYKFIQDTVEAPFQEDIILIESAVDEGGCEIEIN-RENDKC-SIYTP-QPDVIQLTYPEDENAIAILYKA 151 (235) Q Consensus 76 a~s~~~~~~-~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lN-d~~~~~-si~tP-~~~~lql~~P~~~~~~~v~YqA 151 (235) +.+..|..+ .|+| ..+.|-|||..|.+..-..+.-. +...+. ..++. ....|.. +...+.+.|-+ T Consensus 63 a~~a~p~~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~t----d~~~~~l~Y~~ 131 (223) T protein:vir:98 63 AAMGISRPEWRFAY-------AQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILT----NQVNAVARYIS 131 (223) T ss_pred hhcccCCCCccccc-------cccccceeeeeeccccccccccccccccceEEeeccccceeeee----cCCceEEEEee Confidence 111111111 2222 34578899999887653322221 111111 11111 1111222 11234444433 Q ss_pred CCCccccccCCCCCcccccchHHHHHHHHHHHHHHhccCCCc-cchHHHHHHHHHHHHHHHHHHHcCceecCCCcceeee Q lcl|NC_020848. 152 NHAKIDLSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGSP-EGMNTGATKMIEYETACLQIEVFGLIHKEEWTNDRVG 230 (235) Q Consensus 152 ~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~-e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~tn~~f~ 230 (235) +=+ .+=-.|....+||++..||.+=-|+=.. ....++.+.+|+|+.+..+-...|-.- .....+. T Consensus 132 ~v~-----------d~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e---~~~~~~~ 197 (223) T protein:vir:98 132 LVK-----------DTTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQ---RKTKPAH 197 (223) T ss_pred cCC-----------ChhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhccccc---Ccccccc Confidence 321 1224789999999999999986665333 334466688999998877766655543 3344444 Q ss_pred ccCcC Q lcl|NC_020848. 231 RNGWV 235 (235) Q Consensus 231 ~~Gwv 235 (235) .-.|+ T Consensus 198 ~~~~l 202 (223) T protein:vir:98 198 MPEWM 202 (223) T ss_pred cchhh Confidence 44455 No 10 >protein:vir:107803 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996624;genbank:gi:45580758;genbank:GeneID:2767879 Probab=45.01 E-value=0.79 Score=21.16 Aligned_cols=192 Identities=15% Similarity=0.160 Sum_probs=95.2 Q ss_pred Cc-HHhhHhhhhhhhhhccccccC---CccccCccchhHHHHHHHHHHHHHHh-hcccccCcEEEEEcCCcccccccchh Q lcl|NC_020848. 1 MK-LSEFFSLLSYGELANLKVGGK---DCGGIYPKHSDEVVSYIRQGLTDLHS-RFALKHSEVIIQQYANITMYPLRYEY 75 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~---d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~i~~~eg~t~Y~L~~~y 75 (235) |. .-+ ++++|++-|-+=.++.+ +-+.-....-..+-..+...+++.|- .|..|--.+ T Consensus 1 M~S~v~-IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~L----------------- 62 (223) T protein:vir:10 1 MASEVD-ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL----------------- 62 (223) T ss_pred CCCHHH-HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh----------------- Confidence 44 333 67777766644444422 11222222333444556666676662 344443333 Q ss_pred hcccCCCcc-cceeeecCcccchhHHHHHHHHhhccCCcccccC-CCCCce-eEEec-CCcEEEEecCCCceeEEEEecc Q lcl|NC_020848. 76 AQSNTTSTQ-PYKFIQDTVEAPFQEDIILIESAVDEGGCEIEIN-RENDKC-SIYTP-QPDVIQLTYPEDENAIAILYKA 151 (235) Q Consensus 76 a~s~~~~~~-~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lN-d~~~~~-si~tP-~~~~lql~~P~~~~~~~v~YqA 151 (235) +.+..|..+ .|+| ..+.|-|||..|.+..-..+.-. +...+. ..++. ....|.. +...+.+.|-+ T Consensus 63 a~~a~p~~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~t----d~~~~~l~Y~~ 131 (223) T protein:vir:10 63 AAMGISRPEWRFAY-------AQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILT----NQVNAVARYIS 131 (223) T ss_pred hhcccCCCCccccc-------cccccceeeeeeccccccccccccccccceEEeeccccceeeee----cCCceEEEEee Confidence 111111111 2222 34578899999887653322221 111111 11111 1111222 11234444433 Q ss_pred CCCccccccCCCCCcccccchHHHHHHHHHHHHHHhccCCCc-cchHHHHHHHHHHHHHHHHHHHcCceecCCCcceeee Q lcl|NC_020848. 152 NHAKIDLSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGSP-EGMNTGATKMIEYETACLQIEVFGLIHKEEWTNDRVG 230 (235) Q Consensus 152 ~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~-e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~tn~~f~ 230 (235) +=+ .+=-.|....+||++..||.+=-|+=.. ....++.+.+|+|+.+..+-...|-.- .....+. T Consensus 132 ~v~-----------d~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e---~~~~~~~ 197 (223) T protein:vir:10 132 LVK-----------DTTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQ---RKTKPAH 197 (223) T ss_pred cCC-----------ChhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhccccc---Ccccccc Confidence 321 1224789999999999999986665333 334466688999998877766655543 3344444 Q ss_pred ccCcC Q lcl|NC_020848. 231 RNGWV 235 (235) Q Consensus 231 ~~Gwv 235 (235) .-.|+ T Consensus 198 ~~~~l 202 (223) T protein:vir:10 198 MPEWM 202 (223) T ss_pred cchhh Confidence 44455 No 11 >protein:vir:107429 Length: 223 # NCBI annotation: Bbp14 # Family: family:all:1524 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958683;genbank:gi:41179375;genbank:GeneID:2717223 Probab=45.01 E-value=0.79 Score=21.16 Aligned_cols=192 Identities=15% Similarity=0.160 Sum_probs=95.2 Q ss_pred Cc-HHhhHhhhhhhhhhccccccC---CccccCccchhHHHHHHHHHHHHHHh-hcccccCcEEEEEcCCcccccccchh Q lcl|NC_020848. 1 MK-LSEFFSLLSYGELANLKVGGK---DCGGIYPKHSDEVVSYIRQGLTDLHS-RFALKHSEVIIQQYANITMYPLRYEY 75 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~iv~~---d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~i~~~eg~t~Y~L~~~y 75 (235) |. .-+ ++++|++-|-+=.++.+ +-+.-....-..+-..+...+++.|- .|..|--.+ T Consensus 1 M~S~v~-IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~pW~FA~~r~~L----------------- 62 (223) T protein:vir:10 1 MASEVD-ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELHTWGFATKCAQL----------------- 62 (223) T ss_pred CCCHHH-HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh----------------- Confidence 44 333 67777766644444422 11222222333444556666676662 344443333 Q ss_pred hcccCCCcc-cceeeecCcccchhHHHHHHHHhhccCCcccccC-CCCCce-eEEec-CCcEEEEecCCCceeEEEEecc Q lcl|NC_020848. 76 AQSNTTSTQ-PYKFIQDTVEAPFQEDIILIESAVDEGGCEIEIN-RENDKC-SIYTP-QPDVIQLTYPEDENAIAILYKA 151 (235) Q Consensus 76 a~s~~~~~~-~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lN-d~~~~~-si~tP-~~~~lql~~P~~~~~~~v~YqA 151 (235) +.+..|..+ .|+| ..+.|-|||..|.+..-..+.-. +...+. ..++. ....|.. +...+.+.|-+ T Consensus 63 a~~a~p~~~~~yaY-------~LP~Dclrv~~v~~~~~~~~~~~~~~~~~~~~e~~~~g~~~i~t----d~~~~~l~Y~~ 131 (223) T protein:vir:10 63 AAMGISRPEWRFAY-------AQPADAIKIVAVLPHDAANIEAGIDNAQPFSCEIDNTGADIILT----NQVNAVARYIS 131 (223) T ss_pred hhcccCCCCccccc-------cccccceeeeeeccccccccccccccccceEEeeccccceeeee----cCCceEEEEee Confidence 111111111 2222 34578899999887653322221 111111 11111 1111222 11234444433 Q ss_pred CCCccccccCCCCCcccccchHHHHHHHHHHHHHHhccCCCc-cchHHHHHHHHHHHHHHHHHHHcCceecCCCcceeee Q lcl|NC_020848. 152 NHAKIDLSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGSP-EGMNTGATKMIEYETACLQIEVFGLIHKEEWTNDRVG 230 (235) Q Consensus 152 ~h~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~-e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~tn~~f~ 230 (235) +=+ .+=-.|....+||++..||.+=-|+=.. ....++.+.+|+|+.+..+-...|-.- .....+. T Consensus 132 ~v~-----------d~~~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~a~~~~~~y~~~l~~A~~~da~e---~~~~~~~ 197 (223) T protein:vir:10 132 LVK-----------DTTKFSPLFVQALAWHLASMLAGPLLKGDVGAAESKRCVGAMQAYLSQAMVSDANQ---RKTKPAH 197 (223) T ss_pred cCC-----------ChhcccHHHHHHHHHHHHHHhhHhhcCCcchHHHHHHHHHHHHHHHHHHHhccccc---Ccccccc Confidence 321 1224789999999999999986665333 334466688999998877766655543 3344444 Q ss_pred ccCcC Q lcl|NC_020848. 231 RNGWV 235 (235) Q Consensus 231 ~~Gwv 235 (235) .-.|+ T Consensus 198 ~~~~l 202 (223) T protein:vir:10 198 MPEWM 202 (223) T ss_pred cchhh Confidence 44455 No 12 >protein:vir:95323 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512268;genbank:gi:89152435;genbank:GeneID:3952992 Probab=36.62 E-value=1.2 Score=20.22 Aligned_cols=189 Identities=12% Similarity=0.115 Sum_probs=101.1 Q ss_pred Cc-HHhhHhhhhhhhhhccc-cccCCccccCccchhHHHHHHHHHHHHHHh-hcccccCcEEEEEcCCcccccccchhhc Q lcl|NC_020848. 1 MK-LSEFFSLLSYGELANLK-VGGKDCGGIYPKHSDEVVSYIRQGLTDLHS-RFALKHSEVIIQQYANITMYPLRYEYAQ 77 (235) Q Consensus 1 mk-l~E~~~~La~geLsN~~-iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~-RF~Lk~~~i~i~~~eg~t~Y~L~~~ya~ 77 (235) |. .-+ ++++|++-|-|-. |-+-+-+.-....-..+-..+...+++.|- +|.-|--.+ +. T Consensus 1 M~S~v~-IcN~AL~~iG~a~~I~s~~e~s~~A~~C~~~Y~~~r~~~L~~~pW~FA~~r~~L-----------------a~ 62 (201) T protein:vir:95 1 MASVVE-ICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVAL-----------------AD 62 (201) T ss_pred CCCHHH-HHHHHHHHhCCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhhhhhhhc-----------------cc Confidence 44 333 6788888877643 333343333444444555566677777664 565554444 11 Q ss_pred ccCCCc-ccceeeecCcccchhHHHHHHHHhhccCCcccccCCCCCceeE---EecCCcEEEEecCCCceeEEEEeccCC Q lcl|NC_020848. 78 SNTTST-QPYKFIQDTVEAPFQEDIILIESAVDEGGCEIEINRENDKCSI---YTPQPDVIQLTYPEDENAIAILYKANH 153 (235) Q Consensus 78 s~~~~~-~~~~~I~d~~~~~F~~dilkI~~V~d~~G~e~~lNd~~~~~si---~tP~~~~lql~~P~~~~~~~v~YqA~h 153 (235) ...+.. -.|+| ..+.|-|||.+|.+....-. -.+..-++.+ +.-+-..|....| .+.+.|-++= T Consensus 63 ~a~~~~~~~yay-------~LP~Dclrv~~v~~~g~~~~-~~~~~~~f~v~~~~~~~g~~l~td~~----~~~l~Yv~~v 130 (201) T protein:vir:95 63 TSNPPPDWEYAY-------QYPSDCLRITEIMLPGVRNP-TAAMRVQYEVGADTNGTGKLIYTDQP----QAWLKYVSRV 130 (201) T ss_pred ccCCCCCCcccc-------cccchhhhhhhhccCCcccc-ccccchhhhccccccccCceeeecCC----ceEEEEeecC Confidence 111111 12333 33578899999976532211 1100011111 1101122222222 2234443321 Q ss_pred CccccccCCCCCcccccchHHHHHHHHHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecCCCcceeeeccC Q lcl|NC_020848. 154 AKIDLSANVPSNIDIEIPAQLVRPLALYVSSLAHTAVGSPEGMNTGATKMIEYETACLQIEVFGLIHKEEWTNDRVGRNG 233 (235) Q Consensus 154 ~~l~~~~d~~~~~~IdLP~~l~~AL~~~VAsr~~t~i~~~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~~tn~~f~~~G 233 (235) + .+=..|....+||.+..|+.+=-|+= ++..++...+|+|+.+..+-...|- .+.....+..-. T Consensus 131 ~-----------d~~~fd~~F~~ala~~LAa~la~plt--~~~~~~~~~~q~~~~~l~~A~~~da---~e~~~~~~~~~~ 194 (201) T protein:vir:95 131 T-----------DVNMFDAIFMEALAWRLAAAINMALT--GNADLGTFALNMYNRVILSAGSHSQ---NESQEPQPPVDE 194 (201) T ss_pred C-----------ChhhccHHHHHHHHHHHHHHhhHhhc--CChHHHHHHHHHHHHHHHHHHhccc---ccCcccCCCcch Confidence 1 12247899999999999999876653 6667899999999987765544443 344455677788 Q ss_pred cC Q lcl|NC_020848. 234 WV 235 (235) Q Consensus 234 wv 235 (235) |+ T Consensus 195 ~l 196 (201) T protein:vir:95 195 FT 196 (201) T ss_pred hh Confidence 99 No 13 >protein:vir:93692 Length: 197 # NCBI annotation: Bcep22gp58 # Family: family:all:1423 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944287;genbank:gi:38640364;genbank:GeneID:2658345 Probab=20.53 E-value=2.8 Score=18.17 Aligned_cols=178 Identities=10% Similarity=0.079 Sum_probs=84.5 Q ss_pred CcHHhhHhhhhhhhhhccccccCCccccCccchhHHHHHHHHHHHHHHhh-cccccCcEEEEEcCCcccccccchhh--- Q lcl|NC_020848. 1 MKLSEFFSLLSYGELANLKVGGKDCGGIYPKHSDEVVSYIRQGLTDLHSR-FALKHSEVIIQQYANITMYPLRYEYA--- 76 (235) Q Consensus 1 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~~~l~~~iN~GLt~L~~R-F~Lk~~~i~i~~~eg~t~Y~L~~~ya--- 76 (235) |+++|+++.-.. +|.. . +=.-=..++|+.++|+++.+..-| =+...+...|++..| |.+.|-+.+. T Consensus 3 ~t~~~lI~r~~~-~L~D-------~-~~~rW~~~el~dwlNdAv~ei~l~rPda~~~~~~i~l~aG-t~q~LP~~~~~Li 72 (197) T protein:vir:93 3 IAATDLIARAGN-VLQD-------E-DHIRWEVPELIEWINDAARETIVRRPAARSVAAVLELAAG-TRQAIPERGVELL 72 (197) T ss_pred ccHHHHHHHHHH-hhcc-------c-cccccCcHHHHHHHHHHHHHHHhhCCCCCcceeEEeeccC-cccccchhHHHHH Confidence 999999987542 3322 2 113345689999999999998776 223344778888888 4433322111 Q ss_pred -cccCCCcccceeeecCcc-cchhHHHHHHHHhhccCCcccccCCCCCce--e------EEec-CCcEEEEecCCC-cee Q lcl|NC_020848. 77 -QSNTTSTQPYKFIQDTVE-APFQEDIILIESAVDEGGCEIEINRENDKC--S------IYTP-QPDVIQLTYPED-ENA 144 (235) Q Consensus 77 -~s~~~~~~~~~~I~d~~~-~~F~~dilkI~~V~d~~G~e~~lNd~~~~~--s------i~tP-~~~~lql~~P~~-~~~ 144 (235) +-+.. ..++..++.. .+=. .|..-. .+.+|--+ + +|-. .|..+-++=|.+ .-. T Consensus 73 ~vvrn~---~~~~~~~gr~vr~vs-----re~LD~-------~~P~W~~~~~~~~v~~y~~~e~~p~~~~vyP~p~~~~~ 137 (197) T protein:vir:93 73 DVVRNM---GADGVTPGRIVRRVD-----RQLLDD-------QNPDWHAARAKNVVKHFTFDERAPRIFYVYPPAVAGTK 137 (197) T ss_pred HHHHhh---hhcccCCCccccccc-----HHHhcc-------cCCCCccCCCCCceEEEEeecCCCcEEEEeecCCCCce Confidence 00000 0011111110 0000 000000 01111100 1 1111 122333432222 233 Q ss_pred EEEEeccCCCccccccCCCCCcccccchHHHHHHH-HHHHHHHhccCCCccchHHHHHHHHHHHHHHHHHHHcCceecCC Q lcl|NC_020848. 145 IAILYKANHAKIDLSANVPSNIDIEIPAQLVRPLA-LYVSSLAHTAVGSPEGMNTGATKMIEYETACLQIEVFGLIHKEE 223 (235) Q Consensus 145 ~~v~YqA~h~~l~~~~d~~~~~~IdLP~~l~~AL~-~~VAsr~~t~i~~~e~~ak~~~y~q~YE~~c~~le~~~L~~~~~ 223 (235) +.+.|-+-.+.+. -+.+..+||..|.++|. +|+-||+|+-=+....-| -- | -+...|. T Consensus 138 ve~~~~r~P~~v~-----~~~~~~~ipeiy~~~lv~~~~lyRa~sKd~~~~a~a--~~--------~------~~~~~~~ 196 (197) T protein:vir:93 138 VETLHSELPPAIA-----ASGDTLDMGAEYMNVLVSTSATARCRRTASSRTARS--PP--------C------TIRRSST 196 (197) T ss_pred EEEEEeeCChhhh-----ccCCCCCCchhhhhhhHHhhhhhhhcCCCCCCcccC--Cc--------c------ccccCCC Confidence 5555555555553 23457789999999987 699999998754432211 10 1 1111111 Q ss_pred C Q lcl|NC_020848. 224 W 224 (235) Q Consensus 224 ~ 224 (235) . T Consensus 197 ~ 197 (197) T protein:vir:93 197 P 197 (197) T ss_pred C Confidence 1 Done!