Query         lcl|NC_013692.1_cdsid_YP_003358480.1 [gene=PP-LIT1_gp83] [protein=N4 gp67-like protein] [protein_id=YP_003358480.1] [location=complement(66434..67168)]
Match_columns 244
No_of_seqs    11 out of 14
Neff          3.7
Searched_HMMs 1612
Date          Thu Nov 7 13:57:46 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_83 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_83_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:95880 Length: 236  100.0  2E-108  1E-111  611.3  17.1  233    2-244     1-236 (236)
  2 protein:vir:105573 Length: 225  98.2   1E-07 6.3E-11   58.9  12.5  216    3-244     1-225 (225)
  3 protein:vir:5111 Length: 234 #  97.7 7.9E-06 4.9E-09   48.6  14.4  212    1-240     1-234 (234)
  4 protein:vir:823 Length: 216 #   97.4   2E-05 1.2E-08   46.4  13.2  209    2-243     1-216 (216)
  5 protein:vir:2779 Length: 216 #  97.4   2E-05 1.2E-08   46.4  13.2  209    2-243     1-216 (216)
  6 protein:vir:3302 Length: 216 #  97.4   2E-05 1.2E-08   46.4  13.2  209    2-243     1-216 (216)
  7 protein:vir:93692 Length: 197   93.0  0.0089 5.5E-06   31.8  11.3  188    1-233     1-197 (197)
  8 protein:vir:103760 Length: 207  87.3   0.013 8.1E-06   30.9   7.0  190    1-244     1-190 (207)
  9 protein:vir:107803 Length: 223  84.5   0.046 2.9E-05   27.9   8.6  194    1-244     1-202 (223)
 10 protein:vir:107429 Length: 223  84.5   0.046 2.9E-05   27.9   8.6  194    1-244     1-202 (223)
 11 protein:vir:98502 Length: 223   84.5   0.046 2.9E-05   27.9   8.6  194    1-244     1-202 (223)
 12 protein:vir:95323 Length: 201   57.2    0.44 0.00027   22.5   9.3  191    1-244     1-196 (201)
 13 protein:vir:108311 Length: 249  41.5    0.93 0.00057   20.8  14.0  223    1-244     1-241 (249)
 14 protein:vir:7328 Length: 201 #  29.3     1.7   0.001   19.4   6.4  191    1-244     1-196 (201)

No 1
>protein:vir:95880 Length: 236 # NCBI annotation: 30 kDa protein # Family: family:all:31944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950545;genbank:gi:119952236;genbank:GeneID:5075707
Probab=100.00 E-value=1.8e-108 Score=611.29 Aligned_cols=233 Identities=27%
Similarity=0.415 Sum_probs=230.3 Q ss_pred eeehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHhhcccccceEEEEEecCcccccccccccc-c Q lcl|NC_013692. 2 TIQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHKRFLLKEGMLKIQLEEGRRLYPLRPAYQV-G 80 (244) Q Consensus 2 ~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~RF~Lk~~~i~vem~eg~~~Y~L~~~~~~-s 80 (244) -.+|+|+++.||+||||||++||+|+|+|+++++|+|++|+|+|||||||||+||+|||+|||+|||+.|||+|+||+ | T Consensus 1 ~~~lkev~~~La~gqL~N~~~V~~d~g~i~~~~~p~ii~a~N~gl~~Lh~Rf~lk~~~i~vem~eg~~~Y~L~~~y~v~~ 80 (236) T protein:vir:95 1 MYYIEELFCRLANGVLNNTGIVTDDRGDIEDDSKPFIIVAANEALTRLHGRFNMRNNNVVVEMQEGRTNYPLLAKYAVQS 80 (236) T ss_pred CchHHHHHHHHhcceecceeeeecccccccccccchHHHHHhHHHHHHhhhhhhccCcEEEEEeeCceecccchhhhhcc Confidence 689999999999999999999999999999999999999999999999999999999999999999999999999995 9 Q ss_pred ccCCCcccceecC-CcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeChhhhcCCCcceeeEEecccCC Q lcl|NC_013692. 81 QKPKPGVPQFITE-GNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISDEFYCHSSSKTLEVRYRRAPT 159 (244) Q Consensus 81 ~~~k~~~~~~I~d-~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~~~~~~~~~~vl~v~Y~a~hp 159 (244) ++|||++++|||+ +.+.|+.|||||++|+||+|++++|||+++|||+|||++++|||+ |++||+||+|+|||+|| T Consensus 81 ~~p~~~~~~fI~d~~~~~~~~~ilri~~V~dd~G~~~~Lnd~~~~~sv~~P~~nvLqi~----~~~~~~~l~vkyq~~~~ 156 (236) T protein:vir:95 81 YDPNEVKCPFIMDLAGEKFAEDVIRILEVYDDKGRRRPLNDRNNPCSLFTPRPNVLQNN----APKAWEVLNVMYQAKHP 156 (236) T ss_pred CCCCCcccchhhccccchhHHHHHHHHhhccCCCcccccCCCCCCceeeeCCCcceeee----cCCCcceEEEEeecCCC Confidence 9999999999988 999999999999999999999999999999999999999999999 99999999999999999 Q ss_pred ccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccC-CCCccee Q lcl|NC_013692. 
160 PMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDP-VGNQDRF 238 (244) Q Consensus 160 ~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds-~~~~~~f 238 (244) ++++|+| +|++||||+||+|||.|||||||+||||+|||||||| +|+++||+||++|++++|++|+ .+++++| T Consensus 157 ~l~~~eD-----~~~~idlP~t~~~aL~~yVA~r~~T~ig~~EnTAk~~-~y~~~Yes~c~~v~~~~l~s~~~v~~~~~f 230 (236) T protein:vir:95 157 KLSTAED-----GYNEIDIPDTLDPALDAYIAYRYYTSLNTPESSAKAA-EYLSFYDSICREVVEYDLTSDTEVDTNTLF 230 (236) T ss_pred ceeeeeC-----CcccccCCcchHHHHHHHHHHHhhccCCCcccchhhh-hHHHHHHHHHhhHHhhcccccccccccccc Confidence 9999998 8999999999999999999999999999999999999 9999999999999999999999 9999999 Q ss_pred eecCcC Q lcl|NC_013692. 239 TRGGWV 244 (244) Q Consensus 239 ~~rGwi 244 (244) ++|||- T Consensus 231 ~r~Gw~ 236 (236) T protein:vir:95 231 RKRGWR 236 (236) T ss_pred ccCCCC Confidence 999999 No 2 >protein:vir:105573 Length: 225 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164310;genbank:gi:56692957;genbank:GeneID:3197184 Probab=98.19 E-value=1e-07 Score=58.93 Aligned_cols=216 Identities=13% Similarity=0.086 Sum_probs=120.7 Q ss_pred eehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHh-----hcccccceEEEEEecCcccccccccc Q lcl|NC_013692. 3 IQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHK-----RFLLKEGMLKIQLEEGRRLYPLRPAY 77 (244) Q Consensus 3 mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~-----RF~Lk~~~i~vem~eg~~~Y~L~~~~ 77 (244) |+++++++.-.. +|.. ...=--=.-+.|+..+|+++.+.+. 
|.+-..+...|++.+|+.-|++...- T Consensus 1 m~~~~lI~r~~~-~l~D-------~~~~~rW~~~el~~~lNdAv~e~~~r~rL~rpda~~~~~~i~l~~Gt~q~~~~~~~ 72 (225) T protein:vir:10 1 MTLADLIRRVRT-DAND-------MVEPYFWSDQDVADWLNDAVREAAVRGRLIHESQADAVCRIEVVAGTAVYQLHASL 72 (225) T ss_pred CCHHHHHHHHHH-Hhcc-------ccccccCChHHHHHHHHHHHHHHHHhcccccccCCCceeeeeecCccccccCchHH Confidence 999999987542 3332 1111123458999999999999998 56677888899999999998887520 Q ss_pred cccccCCCcccceecCCc-ccchHHHHHHHHHhhhcccccCCCCCcce-eeeeccCcceE-eeChhhhcCCCcceeeEEe Q lcl|NC_013692. 78 QVGQKPKPGVPQFITEGN-KLDRQSILKIEKIIGDNGVEYYLNDTWQP-LNITTPEFDVL-EISDEFYCHSSSKTLEVRY 154 (244) Q Consensus 78 ~~s~~~k~~~~~~I~d~~-~~f~~dvLki~~V~~~~g~~~~lNd~~~~-~~i~tP~~~~l-~~~~~~~~~~~~~vl~v~Y 154 (244) . .-....+|..+. .+-...+-+++. .|..--.-. +++..| |-+..| ..+ ..|. +..+.++++.| T Consensus 73 ~-----~I~~~~~~~~~~~~~~~~~~~s~e~-LD~~~P~W~-~~tg~p~~~~~d~--~~~~l~P~----p~~~~~vel~~ 139 (225) T protein:vir:10 73 Y-----ELSHLGFYPADMSRPTMPVLKSAEV-LDVELPEWR-ACTGKPLYAIQGD--TSLRLVPT----PDRAGILRVEG 139 (225) T ss_pred H-----HHHHHhhcCcccCCceecccccHHH-hcccCCCcc-cCCCCceEEEeCC--cEEEEEec----CCCceEEEEEE Confidence 0 000111111110 111111111111 111100000 001111 111111 111 1121 22446788988 Q ss_pred cccCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCC-ccccHHHHHHHHHHHHHHHHHHHHcCccccCCC Q lcl|NC_013692. 155 RRAPTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGF-MENTAQEGFNFSQKYEAECANLDAQNLRIDPVG 233 (244) Q Consensus 155 ~a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~-~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~~ 233 (244) -+.+-.... ..+.+...+.||..|.++|..|+-||+|+-=+. ..+..+ |+.++|.++.-.+.-.+-++...... 
T Consensus 140 ~r~P~~~~~----~~~~D~~~p~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~r-A~~h~q~F~~alG~k~~ad~~r~~r~ 214 (225) T protein:vir:10 140 YRTPLADMA----LADKDTAQPEIHAEHHRHLVQWALYRGFSIPDMESFDPNR-AALAEAAFTAYFGERPDSDLRRITRE 214 (225) T ss_pred Eeecchhhh----ccccccccCccchhhHHHHHHHHHHHHhcCcCccccChHH-HHHHHHHHHHHhCCchhHHHHHhccc Confidence 888643222 222344456789999999999999999998643 334455 77999999887766666665555555 Q ss_pred CcceeeecCcC Q lcl|NC_013692. 234 NQDRFTRGGWV 244 (244) Q Consensus 234 ~~~~f~~rGwi 244 (244) +.--+++.-|- T Consensus 215 ~~p~~~~~~~~ 225 (225) T protein:vir:10 215 DVPHHVEAFWP 225 (225) T ss_pred cCcccccccCC Confidence 54455666666 No 3 >protein:vir:5111 Length: 234 # NCBI annotation: unknown # Family: family:all:1423 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542268;genbank:gi:18071240;genbank:GeneID:929347 Probab=97.66 E-value=7.9e-06 Score=48.56 Aligned_cols=212 Identities=17% Similarity=0.168 Sum_probs=113.3 Q ss_pred CeeehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccceEEEEEecCc-cccccc---- Q lcl|NC_013692. 1 MTIQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHKRF-LLKEGMLKIQLEEGR-RLYPLR---- 74 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~RF-~Lk~~~i~vem~eg~-~~Y~L~---- 74 (244) |+ +++++++.-.. +|... +. --=.-+.|+..+|+++.+.+-|= ....+..+|++.+|+ -.||.+ T Consensus 1 m~-t~~~lI~r~~~-~l~D~------~~--~rW~~~el~~~lNdAv~e~~l~rp~a~~~~~~i~l~~Gt~q~lP~d~~~~ 70 (234) T protein:vir:51 1 MP-KASEIMRLAGI-QLLDE------DH--IRWPLIELADWVNEGVKAIVLAKPSASSKSAAIQLVKGTHQTLPGTIDGK 70 (234) T ss_pred Cc-cHHHHHHHHHH-Hhccc------cc--cccChHHHHHHHHHHHHHHHhhcCCCCccceeEeeccCCccccccccccc Confidence 77 99999987542 33321 11 12345789999999999999774 566788999999995 455543 Q ss_pred cccc-------ccccCCCcccceecCCcccchHHHHHHHH--HhhhcccccCCCCC--cceeeeeccCcceEeeChhhhc Q lcl|NC_013692. 
75 PAYQ-------VGQKPKPGVPQFITEGNKLDRQSILKIEK--IIGDNGVEYYLNDT--WQPLNITTPEFDVLEISDEFYC 143 (244) Q Consensus 75 ~~~~-------~s~~~k~~~~~~I~d~~~~f~~dvLki~~--V~~~~g~~~~lNd~--~~~~~i~tP~~~~l~~~~~~~~ 143 (244) +-++ .++..++...- -+..+-..++|--+. -+.+.|.. ...- .+-.+..+|+...| .|.|. T Consensus 71 ~~l~li~i~rn~~s~~~~~~~g---rav~~vsre~LD~~~P~W~~~tg~P--~~~~v~~y~~d~~~p~~~~l-~P~p~-- 142 (234) T protein:vir:51 71 ATLQLIGINRNLVSAAEPRQGL---RAIRTCARDVLDAQEPNWHTASYVP--FRKEVRQVIYDENLPTEFYV-YPGND-- 142 (234) T ss_pred hheehhhhhhhhccccccccCc---ceeeecCHHHhcccCCCccccCCCC--chhhhhhhhccCCCCeEEEE-eccCC-- Confidence 2111 01122221100 000111111111100 00111210 0000 11223445533322 23233 Q ss_pred CCCcceeeEEecccCCccccccccCCCcc----ccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHH Q lcl|NC_013692. 144 HSSSKTLEVRYRRAPTPMKICVDNLDSWG----CIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAEC 219 (244) Q Consensus 144 ~~~~~vl~v~Y~a~hp~~~i~~D~l~~~~----~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec 219 (244) .+-++.+.|-+.++.+++.... ++++ ..++.||..|.++|..|+-||+|+.=....+.++ |+.++|.++. T Consensus 143 --~~g~v~~~~~r~P~~v~~~~~~-d~~~~a~~~~~~~i~~~y~~~Lvdw~lyRa~skD~e~~d~~r-A~~h~q~F~~-- 216 (234) T protein:vir:51 143 --GSGFVEAAFSFLPTSVKVANGA-DPEKIASWDIDVGLPEPYSVPLLDYVLYRCHQKDDTAADLGK-ATSHYQLFAT-- 216 (234) T ss_pred --CCceEEEEEEeecchhhhhhcC-CccccccccccCCccchhhHHHHHHHhhhhcCccccccchHH-HHHHHHHHHH-- Confidence 3345678887776654433332 2221 1256789999999999999999997543444455 6699999954 Q ss_pred HHHHHcCccccCCCCcc-eeee Q lcl|NC_013692. 
220 ANLDAQNLRIDPVGNQD-RFTR 240 (244) Q Consensus 220 ~~l~~~~L~~ds~~~~~-~f~~ 240 (244) ..|++.++-.+.+ ..|+ T Consensus 217 ----~lG~k~~~d~~~~pn~r~ 234 (234) T protein:vir:51 217 ----AVGIKVQSEGTSNPNRRR 234 (234) T ss_pred ----HhCCcchhhhhccccccC Confidence 5566666655444 3444 No 4 >protein:vir:823 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050556;genbank:gi:9633453;genbank:GeneID:1262282 Probab=97.36 E-value=2e-05 Score=46.37 Aligned_cols=209 Identities=11% Similarity=-0.013 Sum_probs=115.5 Q ss_pred eeehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccceEEEEEecCccccccccccc-- Q lcl|NC_013692. 2 TIQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHKRF-LLKEGMLKIQLEEGRRLYPLRPAYQ-- 78 (244) Q Consensus 2 ~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~RF-~Lk~~~i~vem~eg~~~Y~L~~~~~-- 78 (244) -++++++++.-. .+|.... +. -=.-+.|+..+|+++.+.+-|= ....+...|++.+|+.-|.+...+. T Consensus 1 ~~t~~~lI~r~~-~~l~D~~------~~--rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI 71 (216) T protein:vir:82 1 MTTITEIIGRVN-TQLVDPM------MV--RWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLL 71 (216) T ss_pred CccHHHHHHHHH-Hhhhccc------cc--ccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhh Confidence 468999998753 3444321 11 1234679999999999999886 6778899999999998887755432 Q ss_pred -c-cccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeChhhhcCCCcceeeEEecc Q lcl|NC_013692. 79 -V-GQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISDEFYCHSSSKTLEVRYRR 156 (244) Q Consensus 79 -~-s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~~~~~~~~~~vl~v~Y~a 156 (244) + .+-+.-++ .+-..++|.- -+-+|=..-+- ...+-.+..+|+.+. ..|.|. 
.+-++++.|-+ T Consensus 72 ~v~r~~~g~a~--------~~vsre~LD~--~~P~W~~~~g~-p~~~i~de~~pr~f~-l~P~p~----~~~~vel~~~r 135 (216) T protein:vir:82 72 DVICLSDGSAV--------RPLSREVLDA--QYPEWPTMKGI-PECFISNDLSPRVFW-LFPAPD----KEISIDAVVSR 135 (216) T ss_pred hhhhhCCCCce--------eeecHHHhcc--cCCCcCCCCCC-ceEEEecCCCceEEE-EeccCC----CCcEEEEEEEe Confidence 1 11000001 1111222211 01111110000 011122233343221 223333 33456888888 Q ss_pred cCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccCCCCc- Q lcl|NC_013692. 157 APTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDPVGNQ- 235 (244) Q Consensus 157 ~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~~~~- 235 (244) .+.-.+ ++..++..+|.||..|.++|..|+-||+|+-=..-..+..-|+.++|.++.-.+.-.+-+. -.++ T Consensus 136 ~P~a~~----~~~~~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~----~~~~r 207 (216) T protein:vir:82 136 IPEAVY----VLTQDDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADS----ALYAR 207 (216) T ss_pred cCcchh----hccCCCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHh----HHHHH Confidence 765322 1333345567889999999999999999998765433444477999999876653322222 1111 Q ss_pred -ceeeecCc Q lcl|NC_013692. 236 -DRFTRGGW 243 (244) Q Consensus 236 -~~f~~rGw 243 (244) +-|+-+|- T Consensus 208 ~~~~~~~~~ 216 (216) T protein:vir:82 208 KKVFNGGGV 216 (216) T ss_pred HhhccCCCC Confidence 24444555 No 5 >protein:vir:2779 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612896;genbank:gi:20065813;genbank:GeneID:935638 Probab=97.36 E-value=2e-05 Score=46.37 Aligned_cols=209 Identities=11% Similarity=-0.013 Sum_probs=115.5 Q ss_pred eeehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccceEEEEEecCccccccccccc-- Q lcl|NC_013692. 
2 TIQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHKRF-LLKEGMLKIQLEEGRRLYPLRPAYQ-- 78 (244) Q Consensus 2 ~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~RF-~Lk~~~i~vem~eg~~~Y~L~~~~~-- 78 (244) -++++++++.-. .+|.... +. -=.-+.|+..+|+++.+.+-|= ....+...|++.+|+.-|.+...+. T Consensus 1 ~~t~~~lI~r~~-~~l~D~~------~~--rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI 71 (216) T protein:vir:27 1 MTTITEIIGRVN-TQLVDPM------MV--RWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLL 71 (216) T ss_pred CccHHHHHHHHH-Hhhhccc------cc--ccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhh Confidence 468999998753 3444321 11 1234679999999999999886 6778899999999998887755432 Q ss_pred -c-cccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeChhhhcCCCcceeeEEecc Q lcl|NC_013692. 79 -V-GQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISDEFYCHSSSKTLEVRYRR 156 (244) Q Consensus 79 -~-s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~~~~~~~~~~vl~v~Y~a 156 (244) + .+-+.-++ .+-..++|.- -+-+|=..-+- ...+-.+..+|+.+. ..|.|. .+-++++.|-+ T Consensus 72 ~v~r~~~g~a~--------~~vsre~LD~--~~P~W~~~~g~-p~~~i~de~~pr~f~-l~P~p~----~~~~vel~~~r 135 (216) T protein:vir:27 72 DVICLSDGSAV--------RPLSREVLDA--QYPEWPTMKGI-PECFISNDLSPRVFW-LFPAPD----KEISIDAVVSR 135 (216) T ss_pred hhhhhCCCCce--------eeecHHHhcc--cCCCcCCCCCC-ceEEEecCCCceEEE-EeccCC----CCcEEEEEEEe Confidence 1 11000001 1111222211 01111110000 011122233343221 223333 33456888888 Q ss_pred cCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccCCCCc- Q lcl|NC_013692. 157 APTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDPVGNQ- 235 (244) Q Consensus 157 ~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~~~~- 235 (244) .+.-.+ ++..++..+|.||..|.++|..|+-||+|+-=..-..+..-|+.++|.++.-.+.-.+-+. 
-.++ T Consensus 136 ~P~a~~----~~~~~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~----~~~~r 207 (216) T protein:vir:27 136 IPEAVY----VLTQDDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADS----ALYAR 207 (216) T ss_pred cCcchh----hccCCCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHh----HHHHH Confidence 765322 1333345567889999999999999999998765433444477999999876653322222 1111 Q ss_pred -ceeeecCc Q lcl|NC_013692. 236 -DRFTRGGW 243 (244) Q Consensus 236 -~~f~~rGw 243 (244) +-|+-+|- T Consensus 208 ~~~~~~~~~ 216 (216) T protein:vir:27 208 KKVFNGGGV 216 (216) T ss_pred HhhccCCCC Confidence 24444555 No 6 >protein:vir:3302 Length: 216 # NCBI annotation: hypothetical protein # Family: family:all:1423 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049518;genbank:gi:9632524;genbank:GeneID:1262013 Probab=97.36 E-value=2e-05 Score=46.37 Aligned_cols=209 Identities=11% Similarity=-0.013 Sum_probs=115.5 Q ss_pred eeehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHhhc-ccccceEEEEEecCccccccccccc-- Q lcl|NC_013692. 2 TIQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHKRF-LLKEGMLKIQLEEGRRLYPLRPAYQ-- 78 (244) Q Consensus 2 ~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~RF-~Lk~~~i~vem~eg~~~Y~L~~~~~-- 78 (244) -++++++++.-. .+|.... +. -=.-+.|+..+|+++.+.+-|= ....+...|++.+|+.-|.+...+. T Consensus 1 ~~t~~~lI~r~~-~~l~D~~------~~--rW~~~el~~~lNdAv~e~~l~rpda~~~~~~i~l~~Gt~q~~p~~~~~lI 71 (216) T protein:vir:33 1 MTTITEIIGRVN-TQLVDPM------MV--RWPLQELCDYYNDAVRAVILARPDAGASLETISCVPGARQVLPDGVIQLL 71 (216) T ss_pred CccHHHHHHHHH-Hhhhccc------cc--ccChHHHHHHHHHHHHHHHhhcCCCCcceeeEeccccccccccchhhhhh Confidence 468999998753 3444321 11 1234679999999999999886 6778899999999998887755432 Q ss_pred -c-cccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeChhhhcCCCcceeeEEecc Q lcl|NC_013692. 
79 -V-GQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISDEFYCHSSSKTLEVRYRR 156 (244) Q Consensus 79 -~-s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~~~~~~~~~~vl~v~Y~a 156 (244) + .+-+.-++ .+-..++|.- -+-+|=..-+- ...+-.+..+|+.+. ..|.|. .+-++++.|-+ T Consensus 72 ~v~r~~~g~a~--------~~vsre~LD~--~~P~W~~~~g~-p~~~i~de~~pr~f~-l~P~p~----~~~~vel~~~r 135 (216) T protein:vir:33 72 DVICLSDGSAV--------RPLSREVLDA--QYPEWPTMKGI-PECFISNDLSPRVFW-LFPAPD----KEISIDAVVSR 135 (216) T ss_pred hhhhhCCCCce--------eeecHHHhcc--cCCCcCCCCCC-ceEEEecCCCceEEE-EeccCC----CCcEEEEEEEe Confidence 1 11000001 1111222211 01111110000 011122233343221 223333 33456888888 Q ss_pred cCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccCCCCc- Q lcl|NC_013692. 157 APTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDPVGNQ- 235 (244) Q Consensus 157 ~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~~~~- 235 (244) .+.-.+ ++..++..+|.||..|.++|..|+-||+|+-=..-..+..-|+.++|.++.-.+.-.+-+. -.++ T Consensus 136 ~P~a~~----~~~~~dd~~~~i~~~y~~~Lvdw~lyRa~skd~~~~~d~~rA~~h~q~F~~~lG~k~~ad~----~~~~r 207 (216) T protein:vir:33 136 IPEAVY----VLTQDDDTPVPLEEAYVNPLVEWMLFRAFSKDAAGGAESGLAAQHYQSFVEQLGIKQGADS----ALYAR 207 (216) T ss_pred cCcchh----hccCCCCCCCCcchhhhHHHHHHHHHHHhcCcCccccChHHHHHHHHHHHHHhCCchHHHh----HHHHH Confidence 765322 1333345567889999999999999999998765433444477999999876653322222 1111 Q ss_pred -ceeeecCc Q lcl|NC_013692. 
236 -DRFTRGGW 243 (244) Q Consensus 236 -~~f~~rGw 243 (244) +-|+-+|- T Consensus 208 ~~~~~~~~~ 216 (216) T protein:vir:33 208 KKVFNGGGV 216 (216) T ss_pred HhhccCCCC Confidence 24444555 No 7 >protein:vir:93692 Length: 197 # NCBI annotation: Bcep22gp58 # Family: family:all:1423 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944287;genbank:gi:38640364;genbank:GeneID:2658345 Probab=93.01 E-value=0.0089 Score=31.82 Aligned_cols=188 Identities=15% Similarity=0.121 Sum_probs=94.5 Q ss_pred CeeehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHhh-cccccceEEEEEecCcccccccccccc Q lcl|NC_013692. 1 MTIQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHKR-FLLKEGMLKIQLEEGRRLYPLRPAYQV 79 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~R-F~Lk~~~i~vem~eg~~~Y~L~~~~~~ 79 (244) |-|+++||++.-.. +|... +. --=.-+.|+..+|+++.+..-| =+...+..+|++..| +.+.|-+.+.. T Consensus 1 M~~t~~~lI~r~~~-~L~D~-----~~---~rW~~~el~dwlNdAv~ei~l~rPda~~~~~~i~l~aG-t~q~LP~~~~~ 70 (197) T protein:vir:93 1 MPIAATDLIARAGN-VLQDE-----DH---IRWEVPELIEWINDAARETIVRRPAARSVAAVLELAAG-TRQAIPERGVE 70 (197) T ss_pred CcccHHHHHHHHHH-hhccc-----cc---cccCcHHHHHHHHHHHHHHHhhCCCCCcceeEEeeccC-cccccchhHHH Confidence 99999999987542 33321 11 1234578999999999998776 234455888988888 55544332210 Q ss_pred ----cccCCCcccceecC--CcccchHHHHHHHHHhhhcccccCCCC-CcceeeeeccCcceEeeChhhhcCCCcceeeE Q lcl|NC_013692. 80 ----GQKPKPGVPQFITE--GNKLDRQSILKIEKIIGDNGVEYYLND-TWQPLNITTPEFDVLEISDEFYCHSSSKTLEV 152 (244) Q Consensus 80 ----s~~~k~~~~~~I~d--~~~~f~~dvLki~~V~~~~g~~~~lNd-~~~~~~i~tP~~~~l~~~~~~~~~~~~~vl~v 152 (244) -+.- + -++.++ ...+-..++|-.. 
+-+|=...+=.+ ..+-.+..+|+-.-+ .|.|- .+.++++ T Consensus 71 Li~vvrn~--~-~~~~~~gr~vr~vsre~LD~~--~P~W~~~~~~~~v~~y~~~e~~p~~~~v-yP~p~----~~~~ve~ 140 (197) T protein:vir:93 71 LLDVVRNM--G-ADGVTPGRIVRRVDRQLLDDQ--NPDWHAARAKNVVKHFTFDERAPRIFYV-YPPAV----AGTKVET 140 (197) T ss_pred HHHHHHhh--h-hcccCCCcccccccHHHhccc--CCCCccCCCCCceEEEEeecCCCcEEEE-eecCC----CCceEEE Confidence 0000 0 000000 0011111111110 011110000000 001224455543333 23233 4466788 Q ss_pred EecccCCccccccccCCCccccccccchHHHHHHH-HHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccC Q lcl|NC_013692. 153 RYRRAPTPMKICVDNLDSWGCIDIDLPYTHLQALL-YFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDP 231 (244) Q Consensus 153 ~Y~a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~-~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds 231 (244) .|-+.++.+.... ...+||..|.++|. +|+-||+||-=+.....|- - | -|...| T Consensus 141 ~~~r~P~~v~~~~--------~~~~ipeiy~~~lv~~~~lyRa~sKd~~~~a~a~-~----------~------~~~~~~ 195 (197) T protein:vir:93 141 LHSELPPAIAASG--------DTLDMGAEYMNVLVSTSATARCRRTASSRTARSP-P----------C------TIRRSS 195 (197) T ss_pred EEeeCChhhhccC--------CCCCCchhhhhhhHHhhhhhhhcCCCCCCcccCC-c----------c------ccccCC Confidence 8888877444332 35568999999987 6999999998653333222 0 2 233333 Q ss_pred CC Q lcl|NC_013692. 232 VG 233 (244) Q Consensus 232 ~~ 233 (244) -. T Consensus 196 ~~ 197 (197) T protein:vir:93 196 TP 197 (197) T ss_pred CC Confidence 33 No 8 >protein:vir:103760 Length: 207 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024931;genbank:gi:48697201;genbank:GeneID:2846084 Probab=87.32 E-value=0.013 Score=30.90 Aligned_cols=190 Identities=11% Similarity=0.070 Sum_probs=101.4 Q ss_pred CeeehhhHHhhhhhhhhccceeccCCccccCccchhHHHHHHHHHHHHHHhhcccccceEEEEEecCccccccccccccc Q lcl|NC_013692. 1 MTIQLKQVIDLLAEGELSNIKYVNIDTGALVLERVPSLIRAINLGVLDLHKRFLLKEGMLKIQLEEGRRLYPLRPAYQVG 80 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~RF~Lk~~~i~vem~eg~~~Y~L~~~~~~s 80 (244) |+=++. 
++++|++.|-+=.|.+-+-+..+...-..+-..+=..+++.| .+.--.+.+.+ -...+..+- .|.+- T Consensus 1 M~S~v~--IcN~AL~~lGa~~I~s~~e~s~~A~~c~~~Y~~~r~~~L~~~-pW~FA~~r~~L--a~~~~~P~~--~~~ya 73 (207) T protein:vir:10 1 MASQVG--ICNRALTKIGDKRITSLDEDSKAAATLNSMYDDVLDACLRAH-VWSFTKARAQL--AALAEAPLF--GFSYQ 73 (207) T ss_pred CCCHHH--HHHHHHHhhchhhhcccccCCHHHHHHHHhhHHHHHHHHhcc-ChhhHhhhhhh--cccccCCCC--CCccc Confidence 764444 889999888875555444444444444444445555666666 33333333322 111111111 11111 Q ss_pred ccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeChhhhcCCCcceeeEEecccCCc Q lcl|NC_013692. 81 QKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISDEFYCHSSSKTLEVRYRRAPTP 160 (244) Q Consensus 81 ~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~~~~~~~~~~vl~v~Y~a~hp~ 160 (244) + ..+.|-|||..|.+.... +.-+.-.++.+.- .-| ++... ..+.+.|-++-+ T Consensus 74 Y---------------~LP~Dclrv~~v~~~~~~--~~~~~~~~~~v~g---~~l------l~~~~-~~~~l~Y~~~v~- 125 (207) T protein:vir:10 74 Y---------------RLPTDFIRLLQVGQFDVY--PRTDTRGLFSIEN---GNI------LTDMQ-APLYIRYAKRVT- 125 (207) T ss_pred c---------------cCcccceEeeeecCCCCc--cccccccceEecC---CeE------EecCC-CcEEEEEeecCC- Confidence 1 156788999888775432 1111112222221 111 12111 124677877622 Q ss_pred cccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccCCCCcceeee Q lcl|NC_013692. 161 MKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDPVGNQDRFTR 240 (244) Q Consensus 161 ~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~~~~~~f~~ 240 (244) ++. ..|....+||.+..|+.+=-++= ++..+ +.+.+|.|+.+..+-...| ..+....++.. 
T Consensus 126 ---------d~~----~fd~~F~~ala~~LAa~lA~pLt--~~~~~-~~~~~q~~~~~l~~A~~~d---a~e~~~~~~~~ 186 (207) T protein:vir:10 126 ---------DPN----AMDALFREAFACRLAAEACESLT--QSATK-RQGAWAEHDQAIAAAIRVN---AIERPAQPLGD 186 (207) T ss_pred ---------Chh----hhhHHHHHHHHHHHHHHhhHhhc--CChHH-HHHHHHHHHHHHHHHHhcc---cccCcccccCC Confidence 111 25899999999999999998883 34445 5589999987776655443 33334446666 Q ss_pred cCcC Q lcl|NC_013692. 241 GGWV 244 (244) Q Consensus 241 rGwi 244 (244) -+|+ T Consensus 187 ~~~l 190 (207) T protein:vir:10 187 DTWL 190 (207) T ss_pred cchh Confidence 6666 No 9 >protein:vir:107803 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996624;genbank:gi:45580758;genbank:GeneID:2767879 Probab=84.52 E-value=0.046 Score=27.91 Aligned_cols=194 Identities=15% Similarity=0.114 Sum_probs=96.2 Q ss_pred CeeehhhHHhhhhhhhhccceeccCC---ccccCccchhHHHHHHHHHHHHHHhhcccccceEEEEEecCcccccccccc Q lcl|NC_013692. 1 MTIQLKQVIDLLAEGELSNIKYVNID---TGALVLERVPSLIRAINLGVLDLHKRFLLKEGMLKIQLEEGRRLYPLRPAY 77 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~iv~~d---~g~I~~~~~p~l~~~iN~GLt~L~~RF~Lk~~~i~vem~eg~~~Y~L~~~~ 77 (244) |+=++. ++++|++-|-+=.++.+. -+.-....-..+-..+=..+++.| ...--.+.+.+ -+..++ .+.| T Consensus 1 M~S~v~--IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~-pW~FA~~r~~L--a~~a~p---~~~~ 72 (223) T protein:vir:10 1 MASEVD--ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELH-TWGFATKCAQL--AAMGIS---RPEW 72 (223) T ss_pred CCCHHH--HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhc-CchhHhhhhhh--hhcccC---CCCc Confidence 763333 788888777555555321 122222222223333344555555 11111111111 000000 0112 Q ss_pred cccccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeCh----hhhcCCCcceeeEE Q lcl|NC_013692. 78 QVGQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISD----EFYCHSSSKTLEVR 153 (244) Q Consensus 78 ~~s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~----~~~~~~~~~vl~v~ 153 (244) .+. 
=..+.|-|||.+|.+..-..+.-. ....++++++... -.++. -..+.+. T Consensus 73 ~ya---------------Y~LP~Dclrv~~v~~~~~~~~~~~-------~~~~~~~~~e~~~~g~~~i~td--~~~~~l~ 128 (223) T protein:vir:10 73 RFA---------------YAQPADAIKIVAVLPHDAANIEAG-------IDNAQPFSCEIDNTGADIILTN--QVNAVAR 128 (223) T ss_pred ccc---------------ccccccceeeeeeccccccccccc-------cccccceEEeeccccceeeeec--CCceEEE Confidence 222 226789999999876543221111 1122233333221 12222 1245688 Q ss_pred ecccCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccC-CCccccHHHHHHHHHHHHHHHHHHHHcCccccCC Q lcl|NC_013692. 154 YRRAPTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPI-GFMENTAQEGFNFSQKYEAECANLDAQNLRIDPV 232 (244) Q Consensus 154 Y~a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~i-n~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~ 232 (244) |-++=+ ++. ..|....+||.+..|+.+=-++ +......+ +.+.+|+|+.+..+-...|-. + T Consensus 129 Y~~~v~----------d~~----~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~-a~~~~~~y~~~l~~A~~~da~---e 190 (223) T protein:vir:10 129 YISLVK----------DTT----KFSPLFVQALAWHLASMLAGPLLKGDVGAAE-SKRCVGAMQAYLSQAMVSDAN---Q 190 (223) T ss_pred EeecCC----------Chh----cccHHHHHHHHHHHHHHhhHhhcCCcchHHH-HHHHHHHHHHHHHHHHhcccc---c Confidence 877722 111 2689999999999999997777 43333333 448899998877766655544 3 Q ss_pred CCcceeeecCcC Q lcl|NC_013692. 233 GNQDRFTRGGWV 244 (244) Q Consensus 233 ~~~~~f~~rGwi 244 (244) ..+..+..-.|+ T Consensus 191 ~~~~~~~~~~~l 202 (223) T protein:vir:10 191 RKTKPAHMPEWM 202 (223) T ss_pred Ccccccccchhh Confidence 334444454555 No 10 >protein:vir:107429 Length: 223 # NCBI annotation: Bbp14 # Family: family:all:1524 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958683;genbank:gi:41179375;genbank:GeneID:2717223 Probab=84.52 E-value=0.046 Score=27.91 Aligned_cols=194 Identities=15% Similarity=0.114 Sum_probs=96.2 Q ss_pred CeeehhhHHhhhhhhhhccceeccCC---ccccCccchhHHHHHHHHHHHHHHhhcccccceEEEEEecCcccccccccc Q lcl|NC_013692. 
1 MTIQLKQVIDLLAEGELSNIKYVNID---TGALVLERVPSLIRAINLGVLDLHKRFLLKEGMLKIQLEEGRRLYPLRPAY 77 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~iv~~d---~g~I~~~~~p~l~~~iN~GLt~L~~RF~Lk~~~i~vem~eg~~~Y~L~~~~ 77 (244) |+=++. ++++|++-|-+=.++.+. -+.-....-..+-..+=..+++.| ...--.+.+.+ -+..++ .+.| T Consensus 1 M~S~v~--IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~-pW~FA~~r~~L--a~~a~p---~~~~ 72 (223) T protein:vir:10 1 MASEVD--ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELH-TWGFATKCAQL--AAMGIS---RPEW 72 (223) T ss_pred CCCHHH--HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhc-CchhHhhhhhh--hhcccC---CCCc Confidence 763333 788888777555555321 122222222223333344555555 11111111111 000000 0112 Q ss_pred cccccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeCh----hhhcCCCcceeeEE Q lcl|NC_013692. 78 QVGQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISD----EFYCHSSSKTLEVR 153 (244) Q Consensus 78 ~~s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~----~~~~~~~~~vl~v~ 153 (244) .+. =..+.|-|||.+|.+..-..+.-. ....++++++... -.++. -..+.+. T Consensus 73 ~ya---------------Y~LP~Dclrv~~v~~~~~~~~~~~-------~~~~~~~~~e~~~~g~~~i~td--~~~~~l~ 128 (223) T protein:vir:10 73 RFA---------------YAQPADAIKIVAVLPHDAANIEAG-------IDNAQPFSCEIDNTGADIILTN--QVNAVAR 128 (223) T ss_pred ccc---------------ccccccceeeeeeccccccccccc-------cccccceEEeeccccceeeeec--CCceEEE Confidence 222 226789999999876543221111 1122233333221 12222 1245688 Q ss_pred ecccCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccC-CCccccHHHHHHHHHHHHHHHHHHHHcCccccCC Q lcl|NC_013692. 154 YRRAPTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPI-GFMENTAQEGFNFSQKYEAECANLDAQNLRIDPV 232 (244) Q Consensus 154 Y~a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~i-n~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~ 232 (244) |-++=+ ++. ..|....+||.+..|+.+=-++ +......+ +.+.+|+|+.+..+-...|-. 
+ T Consensus 129 Y~~~v~----------d~~----~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~-a~~~~~~y~~~l~~A~~~da~---e 190 (223) T protein:vir:10 129 YISLVK----------DTT----KFSPLFVQALAWHLASMLAGPLLKGDVGAAE-SKRCVGAMQAYLSQAMVSDAN---Q 190 (223) T ss_pred EeecCC----------Chh----cccHHHHHHHHHHHHHHhhHhhcCCcchHHH-HHHHHHHHHHHHHHHHhcccc---c Confidence 877722 111 2689999999999999997777 43333333 448899998877766655544 3 Q ss_pred CCcceeeecCcC Q lcl|NC_013692. 233 GNQDRFTRGGWV 244 (244) Q Consensus 233 ~~~~~f~~rGwi 244 (244) ..+..+..-.|+ T Consensus 191 ~~~~~~~~~~~l 202 (223) T protein:vir:10 191 RKTKPAHMPEWM 202 (223) T ss_pred Ccccccccchhh Confidence 334444454555 No 11 >protein:vir:98502 Length: 223 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1524 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996576;genbank:gi:45569507;genbank:GeneID:2767830 Probab=84.52 E-value=0.046 Score=27.91 Aligned_cols=194 Identities=15% Similarity=0.114 Sum_probs=96.2 Q ss_pred CeeehhhHHhhhhhhhhccceeccCC---ccccCccchhHHHHHHHHHHHHHHhhcccccceEEEEEecCcccccccccc Q lcl|NC_013692. 1 MTIQLKQVIDLLAEGELSNIKYVNID---TGALVLERVPSLIRAINLGVLDLHKRFLLKEGMLKIQLEEGRRLYPLRPAY 77 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~iv~~d---~g~I~~~~~p~l~~~iN~GLt~L~~RF~Lk~~~i~vem~eg~~~Y~L~~~~ 77 (244) |+=++. ++++|++-|-+=.++.+. -+.-....-..+-..+=..+++.| ...--.+.+.+ -+..++ .+.| T Consensus 1 M~S~v~--IcN~AL~~lG~~~i~~~~s~~E~s~~A~~C~~~Y~~~r~~~Lr~~-pW~FA~~r~~L--a~~a~p---~~~~ 72 (223) T protein:vir:98 1 MASEVD--ICNLALAYLGDEATVAGINPPEGSVQAEYCARFYPFARDSLLELH-TWGFATKCAQL--AAMGIS---RPEW 72 (223) T ss_pred CCCHHH--HHHHHHHhcccchhhcccCCCCCCHHHHHHHHhhHHHHHHHHhhc-CchhHhhhhhh--hhcccC---CCCc Confidence 763333 788888777555555321 122222222223333344555555 11111111111 000000 0112 Q ss_pred cccccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeeeccCcceEeeCh----hhhcCCCcceeeEE Q lcl|NC_013692. 
78 QVGQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITTPEFDVLEISD----EFYCHSSSKTLEVR 153 (244) Q Consensus 78 ~~s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~tP~~~~l~~~~----~~~~~~~~~vl~v~ 153 (244) .+. =..+.|-|||.+|.+..-..+.-. ....++++++... -.++. -..+.+. T Consensus 73 ~ya---------------Y~LP~Dclrv~~v~~~~~~~~~~~-------~~~~~~~~~e~~~~g~~~i~td--~~~~~l~ 128 (223) T protein:vir:98 73 RFA---------------YAQPADAIKIVAVLPHDAANIEAG-------IDNAQPFSCEIDNTGADIILTN--QVNAVAR 128 (223) T ss_pred ccc---------------ccccccceeeeeeccccccccccc-------cccccceEEeeccccceeeeec--CCceEEE Confidence 222 226789999999876543221111 1122233333221 12222 1245688 Q ss_pred ecccCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccC-CCccccHHHHHHHHHHHHHHHHHHHHcCccccCC Q lcl|NC_013692. 154 YRRAPTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPI-GFMENTAQEGFNFSQKYEAECANLDAQNLRIDPV 232 (244) Q Consensus 154 Y~a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~i-n~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~ 232 (244) |-++=+ ++. ..|....+||.+..|+.+=-++ +......+ +.+.+|+|+.+..+-...|-. + T Consensus 129 Y~~~v~----------d~~----~fd~lF~~Ala~~LAa~lA~pLt~~~~~~q~-a~~~~~~y~~~l~~A~~~da~---e 190 (223) T protein:vir:98 129 YISLVK----------DTT----KFSPLFVQALAWHLASMLAGPLLKGDVGAAE-SKRCVGAMQAYLSQAMVSDAN---Q 190 (223) T ss_pred EeecCC----------Chh----cccHHHHHHHHHHHHHHhhHhhcCCcchHHH-HHHHHHHHHHHHHHHHhcccc---c Confidence 877722 111 2689999999999999997777 43333333 448899998877766655544 3 Q ss_pred CCcceeeecCcC Q lcl|NC_013692. 
233 GNQDRFTRGGWV 244 (244) Q Consensus 233 ~~~~~f~~rGwi 244 (244) ..+..+..-.|+ T Consensus 191 ~~~~~~~~~~~l 202 (223) T protein:vir:98 191 RKTKPAHMPEWM 202 (223) T ss_pred Ccccccccchhh Confidence 334444454555 No 12 >protein:vir:95323 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512268;genbank:gi:89152435;genbank:GeneID:3952992 Probab=57.19 E-value=0.44 Score=22.54 Aligned_cols=191 Identities=13% Similarity=0.079 Sum_probs=98.2 Q ss_pred CeeehhhHHhhhhhhhhccce-eccCCccccCccchhHHHHHHHHHHHHHHh-hcccccceEEEEEecCccccccccccc Q lcl|NC_013692. 1 MTIQLKQVIDLLAEGELSNIK-YVNIDTGALVLERVPSLIRAINLGVLDLHK-RFLLKEGMLKIQLEEGRRLYPLRPAYQ 78 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~-iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~-RF~Lk~~~i~vem~eg~~~Y~L~~~~~ 78 (244) |+=++. ++++|++-|-|-. |-+-+-+.-+...-..+-..+=..+++.|- +|=-+--.+- +- +.-+ +.|. T Consensus 1 M~S~v~--IcN~AL~~iG~a~~I~s~~e~s~~A~~C~~~Y~~~r~~~L~~~pW~FA~~r~~La----~~-a~~~--~~~~ 71 (201) T protein:vir:95 1 MASVVE--ICNRALSNIGNSRSINSLTEASKEAGECSLHFEACRDAVLSDFDWNFATKRVALA----DT-SNPP--PDWE 71 (201) T ss_pred CCCHHH--HHHHHHHHhCCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhhhhhhhcc----cc-cCCC--CCCc Confidence 763333 7899998887643 332232222222223333334444555442 2332222221 00 0001 1122 Q ss_pred ccccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeeee---ccCcceEeeChhhhcCCCcceeeEEec Q lcl|NC_013692. 79 VGQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNIT---TPEFDVLEISDEFYCHSSSKTLEVRYR 155 (244) Q Consensus 79 ~s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~---tP~~~~l~~~~~~~~~~~~~vl~v~Y~ 155 (244) +.+ ..+.|-|||.+|-+.- .+.+-.+...+..+. .-.-.+|... 
-+ .+.+.|- T Consensus 72 yay---------------~LP~Dclrv~~v~~~g-~~~~~~~~~~~f~v~~~~~~~g~~l~td------~~--~~~l~Yv 127 (201) T protein:vir:95 72 YAY---------------QYPSDCLRITEIMLPG-VRNPTAAMRVQYEVGADTNGTGKLIYTD------QP--QAWLKYV 127 (201) T ss_pred ccc---------------cccchhhhhhhhccCC-ccccccccchhhhccccccccCceeeec------CC--ceEEEEe Confidence 222 2678999999986542 211111111111111 0011223332 12 2348887 Q ss_pred ccCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccCCCCc Q lcl|NC_013692. 156 RAPTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDPVGNQ 235 (244) Q Consensus 156 a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~~~~ 235 (244) ++-+ ++. ..|....+||.+..|+.+=-++= ++..+ +.+.+|+|+.+..+-... +..+..+ T Consensus 128 ~~v~----------d~~----~fd~~F~~ala~~LAa~la~plt--~~~~~-~~~~~q~~~~~l~~A~~~---da~e~~~ 187 (201) T protein:vir:95 128 SRVT----------DVN----MFDAIFMEALAWRLAAAINMALT--GNADL-GTFALNMYNRVILSAGSH---SQNESQE 187 (201) T ss_pred ecCC----------Chh----hccHHHHHHHHHHHHHHhhHhhc--CChHH-HHHHHHHHHHHHHHHHhc---ccccCcc Confidence 7622 111 26899999999999999998883 34456 458999998877644433 3445556 Q ss_pred ceeeecCcC Q lcl|NC_013692. 236 DRFTRGGWV 244 (244) Q Consensus 236 ~~f~~rGwi 244 (244) +.+..-.|+ T Consensus 188 ~~~~~~~~l 196 (201) T protein:vir:95 188 PQPPVDEFT 196 (201) T ss_pred cCCCcchhh Confidence 677778899 No 13 >protein:vir:108311 Length: 249 # NCBI annotation: hypothetical protein # Family: family:all:28027 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552279;genbank:gi:160700604;genbank:GeneID:5758827 Probab=41.48 E-value=0.93 Score=20.77 Aligned_cols=223 Identities=15% Similarity=0.065 Sum_probs=123.6 Q ss_pred CeeehhhHHhhhhhhhhccceeccCCccccCcc-chhHHHHHHHHHHHHHHh-h-cccccceEEEEEecCccccccc--- Q lcl|NC_013692. 
1 MTIQLKQVIDLLAEGELSNIKYVNIDTGALVLE-RVPSLIRAINLGVLDLHK-R-FLLKEGMLKIQLEEGRRLYPLR--- 74 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~iv~~d~g~I~~~-~~p~l~~~iN~GLt~L~~-R-F~Lk~~~i~vem~eg~~~Y~L~--- 74 (244) |.=++. .+..+-|+.++++ +.|+-.+. -..--+.+.|.=|-.-.. | ..-+.+.+++=..+|+..|.+- T Consensus 1 ~sqt~~----~II~~ALk~aGvl--a~Getp~aee~~DA~~~Ln~Ml~~W~~~rl~V~~~~~~t~vl~~G~~~YtVGi~~ 74 (249) T protein:vir:10 1 MARTVG----DIIRSSMRKIGVL--AAGEPLPANEGDDALEVFAQMVDAWTNETLLIPVVNVVTKVLVENQPEYTIGIYP 74 (249) T ss_pred CccCHH----HHHHHHHHHcccc--ccCCCCCHhHHHHHHHHHHHHHHHHHhCceeEEeeeeeeeeccCCcceEEeeecc Confidence 555555 5566788999998 56654432 334445555554443322 2 1223444554489999999988 Q ss_pred -------cccc-ccccCCCcccceecC-CcccchHHHHHHHHHhhhcccccCCCCCcceeeeec-cCcceEeeChhhhcC Q lcl|NC_013692. 75 -------PAYQ-VGQKPKPGVPQFITE-GNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNITT-PEFDVLEISDEFYCH 144 (244) Q Consensus 75 -------~~~~-~s~~~k~~~~~~I~d-~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i~t-P~~~~l~~~~~~~~~ 144 (244) |-.. .-.+|.--.-.||.+ +++-+-+++|.+|. |+..+.+ .+ +.+|..+|- |+ .-+-+..=|-.+ T Consensus 75 ~~~~~~~p~~~i~~~RP~~i~sA~~r~~~d~~~~~~~i~~Ed-Y~rI~~K-t~--~~~ps~~f~d~g-~p~g~i~vwP~P 149 (249) T protein:vir:10 75 EPVPDPLPSNHIETGRPERILSAFIRDRYDTDYIQEIIDVET-YSRISRK-TN--TSRPSRFYVSKG-WPLNTILFESVP 149 (249) T ss_pred ccccccCCCCceEeecchheeeeeeecccccchhhhhhchhh-hhhcCCC-CC--CCCceEEEEcCC-CCcceEEEEecC Confidence 5444 233355446778844 66778888877754 4444331 22 234444332 21 111111112234 Q ss_pred CCcceeeEEecccCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_013692. 145 SSSKTLEVRYRRAPTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDA 224 (244) Q Consensus 145 ~~~~vl~v~Y~a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~ 224 (244) +++++| |++...|-..+-. +-+-+-.||||.-|.-||.+=.|-++..--|.+-..+-++ .=++-...++. 
T Consensus 150 ~~~~tl--~~~v~~pL~~f~~---~a~ltd~i~LPpey~~AL~~NLavel~peyG~ev~~~v~~-----~A~~a~~~ikr 219 (249) T protein:vir:10 150 YQDETL--HLEVVQPLSEILP---TACLTDVINLPPGYERALIYNLCLELASEWGKEVTALVAT-----QAVEGKKWLKR 219 (249) T ss_pred CCCceE--EEEEEeehhhhcc---ccccchhccCCHHHHHHHHHHHHHhhccccCCccCHHHHH-----HHHHHHHHHHh Confidence 466664 5555555433310 0112236889999999999999999988889777766544 12333455666 Q ss_pred cCccccCCCCc--ceeeecCcC Q lcl|NC_013692. 225 QNLRIDPVGNQ--DRFTRGGWV 244 (244) Q Consensus 225 ~~L~~ds~~~~--~~f~~rGwi 244 (244) -|+..--.++. .-..++|-- T Consensus 220 aN~q~~~l~~~~~~~~~r~G~~ 241 (249) T protein:vir:10 220 NNYRPLVLGADRAVATQRKGIG 241 (249) T ss_pred ccCcceeeecccccccccCcce Confidence 66655554322 244455543 No 14 >protein:vir:7328 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:1524 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848219;genbank:gi:30387390;genbank:GeneID:2641866 Probab=29.33 E-value=1.7 Score=19.36 Aligned_cols=191 Identities=12% Similarity=0.060 Sum_probs=96.4 Q ss_pred CeeehhhHHhhhhhhhhccce-eccCCccccCccchhHHHHHHHHHHHHHHh-hcccccceEEEEEecCccccccccccc Q lcl|NC_013692. 1 MTIQLKQVIDLLAEGELSNIK-YVNIDTGALVLERVPSLIRAINLGVLDLHK-RFLLKEGMLKIQLEEGRRLYPLRPAYQ 78 (244) Q Consensus 1 ~~mkl~E~~~~La~geLsN~~-iv~~d~g~I~~~~~p~l~~~iN~GLt~L~~-RF~Lk~~~i~vem~eg~~~Y~L~~~~~ 78 (244) |+=++. ++++|++-|-+-. |-+-+-+.-....-..+-..+=..+++.|- .|=-|--.+ -+ ....| +.|. T Consensus 1 M~S~v~--IcN~AL~~iG~a~~I~s~~e~s~~A~~c~~~Y~~~r~~~Lr~~pW~FA~~r~~L----a~-~a~~p--~~~~ 71 (201) T protein:vir:73 1 MASVIE--ICNRALSNIGNSRSINSLIEASKEAGQCSLHFDACRDAALADFDWNFATKRVAL----AD-TNNPP--PDWQ 71 (201) T ss_pred CCCHHH--HHHHHHHhhcCcccccccccCCHHHHHHHHhhHHHHHHHHhhcCchhHhhhhhh----hh-cccCC--CCCc Confidence 764444 8899998887643 443343333333233333334444555441 222221111 11 01111 1222 Q ss_pred ccccCCCcccceecCCcccchHHHHHHHHHhhhcccccCCCCCcceeee---eccCcceEeeChhhhcCCCcceeeEEec Q lcl|NC_013692. 
79 VGQKPKPGVPQFITEGNKLDRQSILKIEKIIGDNGVEYYLNDTWQPLNI---TTPEFDVLEISDEFYCHSSSKTLEVRYR 155 (244) Q Consensus 79 ~s~~~k~~~~~~I~d~~~~f~~dvLki~~V~~~~g~~~~lNd~~~~~~i---~tP~~~~l~~~~~~~~~~~~~vl~v~Y~ 155 (244) +.+ ..+.|-|||.+|.+.-..-.+.. ...+..+ +.-.-.+|. +..+ . +.+.|- T Consensus 72 yaY---------------~LP~Dclrv~~v~~~~~~~~~~~-~~~~~~~~~~~~ieg~~i~------td~~-~-~~l~Y~ 127 (201) T protein:vir:73 72 YAY---------------QYPSDCVRITEIMPTGIRNPTAA-QRIEYVVGSNEDLTGKLIY------TDQP-K-AWLKYM 127 (201) T ss_pred ccc---------------cccccceeeeeeccccccccccc-cccchhccccccccCCEee------ecCC-c-eeEEEe Confidence 222 27789999999876543311111 1111111 000112232 2222 2 247777 Q ss_pred ccCCccccccccCCCccccccccchHHHHHHHHHHHHHHhccCCCccccHHHHHHHHHHHHHHHHHHHHcCccccCCCCc Q lcl|NC_013692. 156 RAPTPMKICVDNLDSWGCIDIDLPYTHLQALLYFVASRCQTPIGFMENTAQEGFNFSQKYEAECANLDAQNLRIDPVGNQ 235 (244) Q Consensus 156 a~hp~~~i~~D~l~~~~~~~IdLP~tl~~AL~~~VAsr~~t~in~~entak~~~~Y~q~Ye~ec~~l~~~~L~~ds~~~~ 235 (244) ++-+ ++. ..|....+||.+..|+.+=-++= +++.+ +.+.+|.|+.+..+-...|-. ++.+ T Consensus 128 ~~v~----------d~~----~fd~lF~~ala~~LAa~lA~plt--~~~~~-~~~~~q~~~~~~~~A~~~d~~---e~~~ 187 (201) T protein:vir:73 128 ARVT----------DVN----MYDAIFMEALSWRLAAAINMALT--GSADL-GNNALTMYNRVILSAGSHSQN---ESQE 187 (201) T ss_pred ecCC----------Ccc----cccHHHHHHHHHHHHHHhhHhhc--CChHH-HHHHHHHHHHHHHHHHHhhhc---cccC Confidence 6522 111 26899999999999999998883 23445 447899998766554444333 3344 Q ss_pred ceeeecCcC Q lcl|NC_013692. 236 DRFTRGGWV 244 (244) Q Consensus 236 ~~f~~rGwi 244 (244) ..+..-.|+ T Consensus 188 ~~~~~~~~l 196 (201) T protein:vir:73 188 PQPPVDEFT 196 (201) T ss_pred CCCCCchHH Confidence 577777788 Done!