Query lcl|NC_018276.1_cdsid_YP_006560631.1 [gene=B620_gp11] [protein=putative phage capsid protein] [protein_id=YP_006560631.1] [location=6635..8275] Match_columns 546 No_of_seqs 4 out of 7 Neff 2.2 Searched_HMMs 1612 Date Thu Nov 7 13:28:05 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_11 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_11_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:94956 Length: 452 98.9 9.8E-09 6.1E-12 64.5 26.0 399 1-483 1-452 (452) 2 protein:vir:97265 Length: 513 98.9 2.5E-08 1.5E-11 62.3 25.7 434 1-546 17-513 (513) 3 protein:vir:95149 Length: 501 98.6 2.1E-07 1.3E-10 57.2 22.7 418 1-546 1-500 (501) 4 protein:vir:96783 Length: 488 98.1 5.4E-06 3.3E-09 49.5 27.9 404 1-480 14-488 (488) 5 protein:vir:95014 Length: 491 97.7 2.9E-05 1.8E-08 45.5 24.3 415 1-500 9-491 (491) 6 protein:vir:78393 Length: 489 97.1 0.00017 1.1E-07 41.2 20.9 412 1-501 9-489 (489) 7 protein:vir:80453 Length: 535 96.6 0.0005 3.1E-07 38.7 24.1 431 1-546 32-532 (535) 8 protein:vir:96179 Length: 468 89.8 0.024 1.5E-05 29.5 26.0 394 16-492 1-468 (468) 9 protein:vir:95806 Length: 440 87.4 0.039 2.4E-05 28.3 26.8 407 9-482 1-440 (440) 10 protein:vir:9871 Length: 429 # 86.5 0.045 2.8E-05 28.0 24.6 405 1-482 1-429 (429) 11 protein:vir:9922 Length: 489 # 83.9 0.065 4E-05 27.1 24.8 437 1-512 15-489 (489) 12 protein:vir:94101 Length: 474 77.5 0.12 7.7E-05 25.6 24.7 408 1-483 10-474 (474) 13 protein:vir:105889 Length: 474 77.5 0.12 7.7E-05 25.6 24.7 408 1-483 10-474 (474) 14 protein:vir:99522 Length: 470 58.2 0.42 0.00026 22.7 25.7 403 1-483 25-470 (470) 15 protein:vir:3964 Length: 453 # 55.8 0.47 0.00029 22.4 23.8 403 1-482 13-453 (453) 16 protein:vir:96266 Length: 474 55.4 0.48 0.0003 22.3 25.9 406 1-487 23-474 (474) 17 protein:vir:95899 Length: 474 55.4 0.48 0.0003 22.3 25.9 406 1-487 23-474 (474) 18 protein:vir:102950 Length: 471 49.0 0.65 0.0004 21.6 26.1 396 1-472 1-471 (471) 19 protein:vir:3609 Length: 452 # 40.2 0.98 0.00061 20.6 28.1 407 1-500 17-452 (452) 20 protein:vir:733 Length: 453 # 36.2 1.2 0.00074 20.2 26.3 408 1-490 17-453 (453) 21 protein:vir:94546 Length: 506 25.7 2 0.0013 18.9 24.7 439 1-487 22-506 (506) 22 protein:vir:102330 Length: 451 21.1 2.7 0.0016 18.3 26.6 410 1-480 1-451 (451) 23 protein:vir:4898 Length: 502 # 20.2 2.8 0.0017 18.1 24.4 427 1-506 49-502 (502) No 1 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.93 E-value=9.8e-09 Score=64.48 Aligned_cols=399 Identities=14% Similarity=0.137 Sum_probs=233.4 Q ss_pred CCcccchhHHHHHHH-------------HHHH----hccc------hhhHHHHHHhhcCCCccchHHHHHHHH-HHHhcc Q lcl|NC_018276. 1 MNAADARAVVNDFLQ-------------WVKQ----LLPK------DKYNVFLQLFKFPVSTNELTEEIFNAL-EKVYDG 56 (546) Q Consensus 1 ~~a~d~~~~~~~fls-------------~vk~----~l~k------dky~~f~~~f~fpv~tn~lt~~if~~l-skvfd~ 56 (546) |+-+.++.-|..-+- .|+. .||| +.|..+++.--|+- .+..+-+.| -+|| . T Consensus 1 m~V~~~hp~y~a~~~~W~~~rd~~~G~~~~r~~g~~YLpk~~~E~~~~Y~~rl~rA~~~n----~~~~t~~~~~G~vf-~ 75 (452) T protein:vir:94 1 MPIETKHPEYLAYENDWIDCRVASLGQREVKKKGVRFLPKLSGQTDDMYNAYKQRALFYS----ITSKTLSALSGMVL-D 75 (452) T ss_pred CCCCCcCHHHHHHHHHHHHHHHHhcChHHHHcCCcccCCCCCCCCHHHHHHHHhhccCCc----hHHHHHHHHhchhh-c Confidence 776666554443332 1222 4553 56888888766653 333333333 3344 2 Q ss_pred cCccccccccCccchhhHHHHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhccc Q lcl|NC_018276. 57 KDAYEDYNFVNPDYLQDWNDYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNA 136 (546) Q Consensus 57 ~n~~~~yqf~~~e~~~d~~~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~ 136 (546) +.+ ++..|+.+++++ .--.+.+..+|-+.-+=+....-. +-++||+|.. ...||+.+++-++|+..++.. T Consensus 76 k~p----~~~~p~~l~~~~-~D~~G~~L~~~~~~~~~~~l~~G~-~~ilVD~p~~----g~rPy~~~~~~~~Ii~W~~~~ 145 (452) T protein:vir:94 76 QPP----VITHPDAMSKYF-EDQSGIQFYEVFTRAVEETLLMGR-VGVFIDRPLT----GGDPYISVYTTENILNWEEDE 145 (452) T ss_pred CCc----eecccHHHHHHH-hcccCCCHHHHHHHHHHHHHhcCe-EEEEEeeccC----CCceEEEEechhhhcCccccc Confidence 344 345688888874 345678888877765556666654 5566799964 336999999999999999999 Q ss_pred ccceEEEee-----ecCCC---------cEEE--EecCccee--eccCCccce--eeehhhhhh-hhccccceeeEeccc Q lcl|NC_018276. 137 SGAIEWIML-----PQGDN---------QLAV--IDDEHYSI--YQLDEKGEI--SAEPLTQSA-HDLGYCPATMFWGDP 195 (546) Q Consensus 137 ~~~I~fi~~-----~~dqn---------~i~~--IDde~y~~--y~~d~e~~~--sie~l~dn~-h~lGy~PAr~~~~D~ 195 (546) +|.+.++-+ ..|.. +++| |++..|++ |..++.+.. +-+..++.. |-||+.|..++-+.. T Consensus 146 ~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~l~~IP~v~~~~~~ 225 (452) T protein:vir:94 146 DGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTSTIQNVGVTMDYIPFFCITPSG 225 (452) T ss_pred cCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccceeecCCCcccceeEEEEEcCCC Confidence 998766655 22221 3455 77888876 544444332 223334444 899999999884443 Q ss_pred cccccccceecchhHHHHHHHHHHHHhhh-hhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeecccc Q lcl|NC_018276. 196 LMDSQPALKKSPLSNQLAALDWLLFFQTS-KKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGE 274 (546) Q Consensus 196 i~~skP~Ik~Spis~~L~A~D~ll~~~ts-~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~ 274 (546) +.+++-.+||- .|+-+-.-||.+-| ..|.--.++.|+=+.+.-+ + T Consensus 226 ---~~~~~~~pPLl-~LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~~----~-------------------------- 271 (452) T protein:vir:94 226 ---LSMTPAKPPMI-DIVDINYSHYRTSADLEHGRHFTGLPTPWITGAE----S-------------------------- 271 (452) T ss_pred ---CCCCCCccchH-HHHHHHHHHhcchhHHHHHHHHcccceeEeecCc----C-------------------------- Confidence 34578888987 66666777887766 6676667778877764211 0 Q ss_pred ccccCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeec-CCCCcccchhhh Q lcl|NC_018276. 275 VSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVG-FGGDISMNKAFN 353 (546) Q Consensus 275 ~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G-f~~d~q~~ka~n 353 (546) +++-..|+++.+.+|-| ..+ .+++.++-.++.-..++++++.+.|...=-= +....+. ++.+ T Consensus 272 -------~~~i~iG~~~~~~lpe~-----~~~----~~yie~~g~~i~~~~~~l~~le~~m~~~Ga~ll~~~~~~-~~s~ 334 (452) T protein:vir:94 272 -------QSTMHIGSTKAWVIPEV-----AAK----VGFLEFTGQGLQSLEKALSEKQAQLASLSARLIDNSTRG-SEAT 334 (452) T ss_pred -------CCceEecccccccCCCC-----CCc----ceEEccCchhHHHHHHHHHHHHHHHHHHHHHhhccCCCc-chHH Confidence 01235688888766633 222 5799999999998999999999998542100 1222211 2222 Q ss_pred hhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhccccccccccccccccccc--CHHHHHHHHHHHHhhc-ccHHH Q lcl|NC_018276. 354 KDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYGSDFYLH--NVEELTAQYEAAKKAG-ATDYQ 430 (546) Q Consensus 354 e~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yGt~fyl~--t~EeLte~~~~ak~~g-aS~~e 430 (546) +. +.....+..+.|..+-.|.|+|-.-+++.+|+.. |.. .+-.|.+-++|-+. +++++.+-+ .|..+| +|... T Consensus 335 ea-~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~-g~~-~~~~v~~n~dF~~~~~~~~~~~al~-~~~~~G~is~~t 410 (452) T protein:vir:94 335 ET-VKLRYMSETASLKSVTRAVEALLNKAYSCIMDME-SMG-GTLNIKLNSAFLDSKLTAAELKAWV-EAYLSGGISKEI 410 (452) T ss_pred HH-HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc-CCC-CceEEEeccccccccCCHHHHHHHH-HHHhcCCCcHHH Confidence 21 1234455568888889999999888888777633 543 35578888999665 466555544 456666 55543 Q ss_pred H-HHHHH-HHHhhhhcCCHHHHHHHHHHhhcCCCccccHHHHHHHHhc-CCCchhh Q lcl|NC_018276. 431 L-DVIQD-QIIETENRNNPNAMERAQVLKHLEPYRHQTRKEVLEMLEA-GFGDMEL 483 (546) Q Consensus 431 i-~~iq~-qi~etEyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~e~-~~~~~El 483 (546) + ..||. .++..|+ +-.| +..+++ +++ ....-. +=.+-+- T Consensus 411 ~~~~L~~~gvl~~~~-----e~~~--i~~E~~------~~~-~~~~~~~~~~~~~~ 452 (452) T protein:vir:94 411 YIHALKVGKVLPPPG-----ESMG--VIPDPP------APE-PSPSNTPPNPSSKA 452 (452) T ss_pred HHHHHHhCCCCCCcc-----CHHH--HHHHhh------ccC-cccCCCCCCCccCC Confidence 3 34433 3433221 1111 111111 000 000000 0011111 No 2 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=98.86 E-value=2.5e-08 Score=62.28 Aligned_cols=434 Identities=16% Similarity=0.179 Sum_probs=245.1 Q ss_pred CCcccchhHHHHHH---HHH----HHhccc------hhhHHHHHHhhcCCCccchHHHHHHHHHHHhcccCccccccccC Q lcl|NC_018276. 1 MNAADARAVVNDFL---QWV----KQLLPK------DKYNVFLQLFKFPVSTNELTEEIFNALEKVYDGKDAYEDYNFVN 67 (546) Q Consensus 1 ~~a~d~~~~~~~fl---s~v----k~~l~k------dky~~f~~~f~fpv~tn~lt~~if~~lskvfd~~n~~~~yqf~~ 67 (546) -.++..=.-+.|.+ ..| ++.||| +.|..+++.--|+--|.+.++.+- -+||- +.+..+ ... T Consensus 17 ~a~~~~W~~ird~~~G~~~~r~~g~~YLPk~~~E~~~~Y~~rl~rA~~~n~~~~tl~~l~---G~vf~-k~p~~~--~~~ 90 (513) T protein:vir:97 17 DQMLPRWHVIETLLGGTEAMREAGETYLPRHQEETDKGYQERLASAVLLNMVEQTLDTLS---GKPFS-EPIKLN--EDV 90 (513) T ss_pred HHHHHHHHHHHHHhcChHHHHhhcccCCCCCCCCCHHHHHHHHhcccCCChHHHHHHHHh---hhhhh-cCcccC--cCc Confidence 11111111122222 112 244554 568888877666544443333322 23333 444332 234 Q ss_pred ccchhh-H-HHHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhc-------------CCCCcceEEecchhhhhhh Q lcl|NC_018276. 68 PDYLQD-W-NDYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQT-------------SERPAPYFYFMPITAVKDF 132 (546) Q Consensus 68 ~e~~~d-~-~~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~-------------s~kp~py~y~l~i~~v~a~ 132 (546) |....+ | +|==-.+.+..+|-+.-+=.....- =+-++||+|..-. ...--||+..++-++|+.. T Consensus 91 p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G-~~~ilVD~P~~~~~~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW 169 (513) T protein:vir:97 91 PKAIEETILPDVDLQGNNLDVFARQWFREGMAKA-LCHVLIDMPRPAPREDGQPRTLADDRREGLRPYWVMIKPECLLFA 169 (513) T ss_pred hHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcC-eEEEEEecCCCCCccchhHHhHHHHHhhccCceEEEecHhhhcCc Confidence 454443 2 3444466777777765554555544 4556899985422 1222499999999999999 Q ss_pred hccccc---ceEEEee-----ecC------CCcEEEEecCcceeeccCCccce--e-eehhhhhhhhccccceeeEeccc Q lcl|NC_018276. 133 KTNASG---AIEWIML-----PQG------DNQLAVIDDEHYSIYQLDEKGEI--S-AEPLTQSAHDLGYCPATMFWGDP 195 (546) Q Consensus 133 r~n~~~---~I~fi~~-----~~d------qn~i~~IDde~y~~y~~d~e~~~--s-ie~l~dn~h~lGy~PAr~~~~D~ 195 (546) |+.-++ -+.++-+ ..| ..+++|+++..|++|-..+.+.- + ..++...-|-||+.|..++-+.. T Consensus 170 ~~~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~~~~~ 249 (513) T protein:vir:97 170 RSEVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEPVKKSNAQKEEWALADEWATGLNYVPLVTFYADR 249 (513) T ss_pred ceeccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEeecCCCccccceEEecCCCCcCCceeEEEEecCC Confidence 975544 4555544 111 23478999999999876544321 1 23445566999999999996554 Q ss_pred cccccccceecchhHHHHHHHHHHHHhhh-hhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeecccc Q lcl|NC_018276. 196 LMDSQPALKKSPLSNQLAALDWLLFFQTS-KKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGE 274 (546) Q Consensus 196 i~~skP~Ik~Spis~~L~A~D~ll~~~ts-~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~ 274 (546) +.+++-.+||- .|+-+..-||.+-| ..|+--.++.|.=+.+-- ..+.++ T Consensus 250 ---~~~~~~~pPLl-~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G~------------------~~~~~~-------- 299 (513) T protein:vir:97 250 ---QGFMMGKPPLL-DLAHLNVAHWQSASDQRHILTVSRFPILACSGA------------------SGEDSD-------- 299 (513) T ss_pred ---CCCCCCccchH-HHHHHHHHHHhhhhhHHHHHHhcccceeeeecC------------------CcCCCC-------- Confidence 45678889987 78888888887766 456555677777766421 011111 Q ss_pred ccccCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeec--CCCCcccchhh Q lcl|NC_018276. 275 VSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVG--FGGDISMNKAF 352 (546) Q Consensus 275 ~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G--f~~d~q~~ka~ 352 (546) ....|+++.+.+|-| .. ..+|+.++-.++.-..++++++.+.|.. .| +-.....++++ T Consensus 300 ---------~i~iG~~~~~~lpe~-----~~----~~~yie~~g~~i~~~~~~l~~le~qm~~--~Ga~ll~~~~~~~Ta 359 (513) T protein:vir:97 300 ---------PVVVGPNKVLYNPDP-----AG----RFYYVEHTGQAIAAGRTDLKDLEEQMAG--YGAEFLKRKTGGQTA 359 (513) T ss_pred ---------ceEeeccccccCCCC-----CC----cceeeccCchhHHHHHHHHHHHHHHHHH--HHHHhhccCCccccH Confidence 133577787766632 22 3579999988888888889998888843 33 11111223333 Q ss_pred hhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccccccccccccccccccCH-HHHHHHHHHHHhhc-ccHHH Q lcl|NC_018276. 353 NKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYGSDFYLHNV-EELTAQYEAAKKAG-ATDYQ 430 (546) Q Consensus 353 ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yGt~fyl~t~-EeLte~~~~ak~~g-aS~~e 430 (546) ++..-...+..+.|-.+-.|+|.|-.-+++.+|.- .|.--=..+|.+-++|-+... .+..+.+-.|..+| +|.+. T Consensus 360 --~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w-lg~~~~~~~v~in~dF~~~~~~~~~~~al~~a~~~G~is~~t 436 (513) T protein:vir:97 360 --TARALDSAEATSDLSAMTGLFEDALAQALDITADW-LRLGPNGGTVELVKDYDLEEMDAPGLQALQVAREKRDISRKT 436 (513) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCCCCccEEEeccccCcccCCHHHHHHHHHHHhCCCCCHHH Confidence 33445566777889999999999999888887765 342222467888889988774 34556666666677 55433 Q ss_pred HHHHHHHHHhhhhcCCHHHHHHHHHHhhcCCCccccHHHHHHHH----hc--CCCchhheeeeeccchhheeecccCCch Q lcl|NC_018276. 431 LDVIQDQIIETENRNNPNAMERAQVLKHLEPYRHQTRKEVLEML----EA--GFGDMELIAVKLNFSTFVLRFERENTDI 504 (546) Q Consensus 431 i~~iq~qi~etEyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~----e~--~~~~~Ell~vk~nf~~fv~rfe~en~~i 504 (546) +- .+|+|+.||. ++++.++..+-. ++ |..+.++= T Consensus 437 ~~---------------~~L~r~gvl~-----~d~d~~~~~e~~~~~~~~~~~~~~~d~~-------------------- 476 (513) T protein:vir:97 437 YL---------------NGLRLRGVLP-----EDFDEDEDWEELMEEISEAMGRAGLDLD-------------------- 476 (513) T ss_pred HH---------------HHHHhccCCC-----ccCCHHHHHHHHHHhhhhccCCCCcccc-------------------- Confidence 31 2566666664 344544332221 11 11111110 Q ss_pred hhhhhhcChhhhHHHHHHHHHhhhcchhhcCCCCCCCC-------CCCC Q lcl|NC_018276. 505 VEFGNALPFDQKISIILNTFKSYGKTEYTSQPADGGAG-------SGES 546 (546) Q Consensus 505 ~efg~~l~~~~kieii~ntl~sygn~e~~~qp~~~~~~-------~~~~ 546 (546) ..+. .| |. .+=|+.|=.+-++|||.+ .||| T Consensus 477 -~~~~-~~---------~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 513 (513) T protein:vir:97 477 -PAQK-NP---------PE-GGEGEGEGEGEGGEGGEGGEGGGNPGGES 513 (513) T ss_pred -ccCC-CC---------CC-CCCCCCCCCCCCCCCCCccccCCCCCCCC Confidence 0000 00 00 012334444555654322 2444 No 3 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=98.57 E-value=2.1e-07 Score=57.20 Aligned_cols=418 Identities=17% Similarity=0.230 Sum_probs=213.7 Q ss_pred CC-cccchhH----------HHHHHH---HHH----Hhccc-----------hhhHHHHHHhhcCCCccchHHHHHHHHH Q lcl|NC_018276. 1 MN-AADARAV----------VNDFLQ---WVK----QLLPK-----------DKYNVFLQLFKFPVSTNELTEEIFNALE 51 (546) Q Consensus 1 ~~-a~d~~~~----------~~~fls---~vk----~~l~k-----------dky~~f~~~f~fpv~tn~lt~~if~~ls 51 (546) |- -+..+.- +.|.+. .|| +.||| ..|..+++---|+--|... +-.-+. T Consensus 1 m~~V~~~hp~y~~~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~e~~~~e~~~~Y~~rl~rA~~~n~~~~t---~~~l~G 77 (501) T protein:vir:95 1 MPNVSFIRPELGKLLPLYYLIRDAIAGEPTVKGARTTYLPMPNAEDQSKENKARYEAYLKRAVFYNVARRT---LFGLVG 77 (501) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCcCCCCCCCcccchHHHHHHhhccccCchHHHH---HHHHhh Confidence 43 2222221 223332 232 47776 3588888875555333332 223445 Q ss_pred HHhcccCccccccccCccchhhH-HHHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhc----------CCCCcce Q lcl|NC_018276. 52 KVYDGKDAYEDYNFVNPDYLQDW-NDYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQT----------SERPAPY 120 (546) Q Consensus 52 kvfd~~n~~~~yqf~~~e~~~d~-~~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~----------s~kp~py 120 (546) +|| .+.|.. ..|...+.| +|==-.+.+..+|-+.-+=.....-.=+ ++||+|..-. ...-.|| T Consensus 78 ~vf-~k~p~~----~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~-ilVD~P~~~~~~~~t~a~~~~~~~rPy 151 (501) T protein:vir:95 78 QVF-MRDPVV----KVPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAG-LLVDYPTTEAEGGASIADLEAGRIRPT 151 (501) T ss_pred hhh-cCCcce----eCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEE-EEEeecCCCCcccccHHHHHhccCCcE Confidence 666 355544 346666553 2333456677777765555555554444 4569985411 1112499 Q ss_pred EEecchhhhhhhhcccccc---eEEEee----ecCCCc--------EEEE--e-cC--cceeeccCCcc----------c Q lcl|NC_018276. 121 FYFMPITAVKDFKTNASGA---IEWIML----PQGDNQ--------LAVI--D-DE--HYSIYQLDEKG----------E 170 (546) Q Consensus 121 ~y~l~i~~v~a~r~n~~~~---I~fi~~----~~dqn~--------i~~I--D-de--~y~~y~~d~e~----------~ 170 (546) +..++-++|+..|+.-++. +.++-+ .++++. ++++ | |. .+++|..+..+ . T Consensus 152 ~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~~v~r~~~~~~~~~~~~~~~~ 231 (501) T protein:vir:95 152 LYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVHEIWREPQPTKADGSKIPKGN 231 (501) T ss_pred EEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEEEEEEecCCcccCcceecCCc Confidence 9999999999999766653 555444 233332 4444 3 23 34667655432 1 Q ss_pred ----eeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHHHHHhhh-hhhhhhhcCCcccccCcccCc Q lcl|NC_018276. 171 ----ISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTS-KKHLDLYAPYPIYSGFEQDCT 245 (546) Q Consensus 171 ----~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts-~~hldlya~YpiYs~yeedC~ 245 (546) .-..++..+-|.|||.|..|+-+. .+.|++-.+||- .|+-+---||.+-| ..|+--+++.|.=|.+.-+=- T Consensus 232 ~~~~~~~~~~~~g~~~l~~IPfv~~~~~---~~~~~~~~pPLl-~lA~lni~hy~~ssd~~~~l~~~~~P~l~i~G~~~~ 307 (501) T protein:vir:95 232 YQQYVVYKPTDAQGKRLTEIPFMFIGSE---NNDSNPDNPNFY-DLASLNMAHYRNSADYEESCYIVGQPTPVLIGLTEE 307 (501) T ss_pred ccccceeeeeccCCCcCCeeeEEEEecC---CCCCCCCccchH-HHHHHHHHHHhhhhHHHHHHHHcccceeeeeCCccc Confidence 112333445699999998887443 345667778887 56556666777655 566666777777665422111 Q ss_pred ccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHH Q lcl|NC_018276. 246 YENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNV 325 (546) Q Consensus 246 ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~ 325 (546) +..++ -+.....|+++.+ ++|. ++| ++++.++-..+. . T Consensus 308 ---------------~~~~~---------------~~~~i~~G~~~~~--~lP~----~~~----~~~ie~~~~~i~--~ 345 (501) T protein:vir:95 308 ---------------WVTNV---------------LKGSVNFGSRGGI--PLPV----GAD----AKLLQASENTML--K 345 (501) T ss_pred ---------------ccccC---------------CCCceeecccccc--cCCC----CCc----eeEEecChhhHH--H Confidence 00000 0111233555555 5552 233 577777766664 6 Q ss_pred HHHHHHhhhheeeeec--CCCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhccccccccccccc Q lcl|NC_018276. 326 EECERIYDKIYTACVG--FGGDISMNKAFNKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYG 403 (546) Q Consensus 326 ~e~kri~d~i~~s~~G--f~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yG 403 (546) ++++++.+.|... | +-...+.++++++.. -...+..+.|-.+-.|+|.|-.-+++.+|+- .|..--+-+|.+- T Consensus 346 ~~l~~l~~~m~~~--Ga~ll~~~~~~~Ta~~~~--~~~~~~~S~L~~~a~~le~al~~~l~~~a~w-~g~~~~~~~v~i~ 420 (501) T protein:vir:95 346 EAMDTKERQMVAL--GAKLVEQKEVQRTATEAE--LEAASEGSTLSSATKNVSAAFEWALKWAARW-VGQADSGVKFELN 420 (501) T ss_pred HHHHHHHHHHHHH--HHhhccCCccchhHHHHH--HHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCceEEEEe Confidence 7788888887653 4 111112334444443 3344556788889999999988888877764 4543333456667 Q ss_pred ccccccC--HHHHHHHHHHHHhhc-ccHHHHHHHHHHHHhhhhcCCHHHHHHHHHHhhcCCCccccHHHHHHHHhcCCCc Q lcl|NC_018276. 404 SDFYLHN--VEELTAQYEAAKKAG-ATDYQLDVIQDQIIETENRNNPNAMERAQVLKHLEPYRHQTRKEVLEMLEAGFGD 480 (546) Q Consensus 404 t~fyl~t--~EeLte~~~~ak~~g-aS~~ei~~iq~qi~etEyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~e~~~~~ 480 (546) .+|-... ++++. .+-.|..+| +|...+= .+|+|+.|+. .++++.. T Consensus 421 ~df~~~~~~~~~~~-al~~~~~~G~is~~t~~---------------~~L~~~~v~~-------~~~~~e~--------- 468 (501) T protein:vir:95 421 TDFDIARMTPDERR-SLVEEWQKGAITFEEMR---------------TGLRKAGVAT-------EDDSKAK--------- 468 (501) T ss_pred cccccccCCHHHHH-HHHHHHhCCCCcHHHHH---------------HHHHhCCCCC-------hhHHHHH--------- Confidence 8886654 33333 333444444 3332220 1233332221 1111111 Q ss_pred hhheeeeeccchhheeecccCCchhhhhhhcChhhhHHHHHHHHHhhhcchh--hcCCCCCCCCCCCC Q lcl|NC_018276. 481 MELIAVKLNFSTFVLRFERENTDIVEFGNALPFDQKISIILNTFKSYGKTEY--TSQPADGGAGSGES 546 (546) Q Consensus 481 ~Ell~vk~nf~~fv~rfe~en~~i~efg~~l~~~~kieii~ntl~sygn~e~--~~qp~~~~~~~~~~ 546 (546) ++|++..+...- +.+. ...+.+||.--|.+ T Consensus 469 ----------------------------------e~i~~~~~~~~~--~~~~~~~~~~~~gg~~~~~~ 500 (501) T protein:vir:95 469 ----------------------------------EKIAKDTAEAMA--LATPANVPGDGSGGDNVGNS 500 (501) T ss_pred ----------------------------------HHHHhhhcCccc--ccccCCCCCCCcccccccCC Confidence 222222221100 0001 11223344332333 No 4 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=98.07 E-value=5.4e-06 Score=49.48 Aligned_cols=404 Identities=11% Similarity=0.122 Sum_probs=207.0 Q ss_pred CCcccchhHHHH-----------HHHHHH----Hhccc-----------------------hhhHHHHHHhhcCCCccch Q lcl|NC_018276. 1 MNAADARAVVND-----------FLQWVK----QLLPK-----------------------DKYNVFLQLFKFPVSTNEL 42 (546) Q Consensus 1 ~~a~d~~~~~~~-----------fls~vk----~~l~k-----------------------dky~~f~~~f~fpv~tn~l 42 (546) |--+..+.-|.. .-..+| +.||| ++|++.++.--|+- . T Consensus 14 m~V~~~hp~y~a~~~~W~~~~d~g~~~~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~~~~~y~~~~~~rA~~~n----~ 89 (488) T protein:vir:96 14 MLTPIYHPDYLVNAPQWLRNLDCVMDNIKRKKQTYLPNLGAIPPEAKTDPKVTALAAKIEKDWEDLTWRLANYVN----I 89 (488) T ss_pred ecccccCHHHHHHhhhhhHhhhhhhHHHHHhhhhcCCCCCCccccccCcchhhhhhccchhhhHhhhhhccccCc----h Confidence 211111111111 112233 46776 12222222222322 2 Q ss_pred HHHHHHHHHHHhcccCccccccccCccchhhH-HHHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhc------CC Q lcl|NC_018276. 43 TEEIFNALEKVYDGKDAYEDYNFVNPDYLQDW-NDYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQT------SE 115 (546) Q Consensus 43 t~~if~~lskvfd~~n~~~~yqf~~~e~~~d~-~~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~------s~ 115 (546) +..+.+.|.-.-=.+.|..+ ...|.-.+.| +|==-.+.+..+|-+.-+=..... =-+-++||+|.-.. .. T Consensus 90 ~~~tl~~l~G~vfrk~p~~~--~~~~~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~-G~~~ilVD~P~~~~T~ade~~~ 166 (488) T protein:vir:96 90 VNPTMNAITGAVMRREPEFD--TMDNPVLIGLRDNIDGKGNGIDQECKQALNALQWG-SRCGWLVRSHPESATMADWNKG 166 (488) T ss_pred hHHHHHHhcchhhccCceec--cCCcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhc-CeEEEEEecCCCcCCHHHHHHh Confidence 33334444322223344332 2222123332 223335666777665444444443 34557899984211 11 Q ss_pred CCcceEEecchhhhhhhhcccccc---eEEEee-----ecCCC------c--EEEEecCcceeeccCCcc-ceeeehhhh Q lcl|NC_018276. 116 RPAPYFYFMPITAVKDFKTNASGA---IEWIML-----PQGDN------Q--LAVIDDEHYSIYQLDEKG-EISAEPLTQ 178 (546) Q Consensus 116 kp~py~y~l~i~~v~a~r~n~~~~---I~fi~~-----~~dqn------~--i~~IDde~y~~y~~d~e~-~~sie~l~d 178 (546) .-.||+..++-++|+..|+.-++. +.++-+ .+|.. + ++..++-.|+++-..+++ .....++.. T Consensus 167 ~~rPy~~~~~a~~IinW~~~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~~~l~~g~~~v~~~~~~~~~~e~~~~~~ 246 (488) T protein:vir:96 167 KKLPTAAFYDALHIIDWEVEYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLINHRLVDGLCEFQEVTDDEYSDEWTPVLI 246 (488) T ss_pred cCCcEEEEechhhhcCcceeccCCceeeEEEEEEEEEEeccCCCcccceEEEEEEEECcEEEEEEEecCCcccceEeecC Confidence 224999999999999999876664 665544 22322 2 333677778887663332 222344455 Q ss_pred hhhhccccceeeEeccccccccccceecchhHHHHHHHHHHHHhhhh-hhhhhhcCCcccccCcccCcccccCCCcccCC Q lcl|NC_018276. 179 SAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTSK-KHLDLYAPYPIYSGFEQDCTYENKENGDYCDS 257 (546) Q Consensus 179 n~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts~-~hldlya~YpiYs~yeedC~ye~~~~~~~C~~ 257 (546) +-|-||+.|..++-+.. +.+++-.+||- .|+-+---||.+.|- .|+=-.+..|+.-..-. T Consensus 247 g~~~l~~IP~v~~~~~~---~~~~~~~pPLl-dLA~lnl~Hy~~ssd~~~il~~~~~p~lv~~~~--------------- 307 (488) T protein:vir:96 247 NSKQSDTIPFFLASSQS---NEWCIDSTPLT-SLAEISLSIYVMNAYSNKAMILANEAKWMVDMG--------------- 307 (488) T ss_pred CCcccCeeEEEEEecCC---CCCCCCCCchH-HHHHHHHHHHhhhhHHHHHHHhcCCceeeeccC--------------- Confidence 67999999999995443 55678888987 677777777776552 34333455554322110 Q ss_pred ceeeccCCceEeeccccccccCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhhee Q lcl|NC_018276. 258 GFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYT 337 (546) Q Consensus 258 G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~ 337 (546) | +....-++ ++ . .|++....+|.|.. ..|+ +++.++.+++ ..++++++.+.|.. T Consensus 308 ~-~~~~~~~~-~~-------------~--~g~~~~~~~~~~~~---~g~~----~~~e~~~~~l--~~~~l~~l~~qm~~ 361 (488) T protein:vir:96 308 D-MNKTMASE-MN-------------P--LGFTLAGRMPYYVK---NGDV----KVIQAQFSPE--TENKVEKLFEQAVK 361 (488) T ss_pred C-CCcccccc-cc-------------c--ceeeeccccccccc---CCce----eecCCchhHH--HHHHHHHHHHHHHH Confidence 0 00000000 00 0 12222223343321 1343 3445555544 34556666665533 Q ss_pred eeecCCCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhccc-----ccccccccccccccccC-- Q lcl|NC_018276. 338 ACVGFGGDISMNKAFNKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGE-----LYVNNTIHYGSDFYLHN-- 410 (546) Q Consensus 338 s~~Gf~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~-----~~~~~tv~yGt~fyl~t-- 410 (546) .|. .-++.+.+.+.++..-...+..+.|-.+-.|.|.|-.-+++.+|+- .|. .-.+-.|.+-.+|-... T Consensus 362 --~Ga-~l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w-~g~~~~~~~~~~~~~~in~dF~~~~ld 437 (488) T protein:vir:96 362 --VGA-SLFTQQSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRY-FEGTNLYVNPDELVFKLNRDYFDVEVN 437 (488) T ss_pred --HhH-hhccCCCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCCcCccceEEEeccCCCCccCC Confidence 220 0111111233445555566678889999999999998888887763 221 11234677788888755 Q ss_pred HHHHHHHHHHHHhhcccHHHHHHHHHHHHhhhhcCCHHHHHHHHHHhhcCCCccccHHHHHHHHhc-CCCc Q lcl|NC_018276. 411 VEELTAQYEAAKKAGATDYQLDVIQDQIIETENRNNPNAMERAQVLKHLEPYRHQTRKEVLEMLEA-GFGD 480 (546) Q Consensus 411 ~EeLte~~~~ak~~gaS~~ei~~iq~qi~etEyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~e~-~~~~ 480 (546) ++.+.+-++...+-.+|.+.+- .+|+|+.+|. ++.|-++..+-++. |++= T Consensus 438 ~~~~~al~~~~~~G~Is~~t~~---------------~~L~~~gvl~-----~d~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 438 PQMLQVAYAAMMEGNLPQVSWF---------------ELLKRARVVR-----GDMSKEEFDEHIAELGFGM 488 (488) T ss_pred HHHHHHHHHHHhcCCCCHHHHH---------------HHHHhCCcCC-----ccCCHHHHHHHHhhcCCCC Confidence 6656555555544446655542 3567766554 56777777777666 4432 No 5 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=97.68 E-value=2.9e-05 Score=45.46 Aligned_cols=415 Identities=13% Similarity=0.140 Sum_probs=210.8 Q ss_pred CCcccch----------hHHHHHHH-------------HHHHhccchhhHHHHHHhhcCCCccchHHHHHHHHH-HHhcc Q lcl|NC_018276. 1 MNAADAR----------AVVNDFLQ-------------WVKQLLPKDKYNVFLQLFKFPVSTNELTEEIFNALE-KVYDG 56 (546) Q Consensus 1 ~~a~d~~----------~~~~~fls-------------~vk~~l~kdky~~f~~~f~fpv~tn~lt~~if~~ls-kvfd~ 56 (546) -|-+..+ ..+.|.+. +..+.--++.|..+++.--|+- .+..+-+.|. +|| . T Consensus 9 ~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n----~~~~tl~~l~G~vf-r 83 (491) T protein:vir:95 9 SGVKTKHREWLHYAPKWQKVRHALAGDLVGYLRNVGLNEPDKAYGEARQAEYEAGGIVYN----FTRRTLSGMVGSVM-R 83 (491) T ss_pred CCCCccCHHHHHHHHHHHHHHHHhcCcchhhcccCCCcCCCCCCCHHHHHHHHhcccCCC----hHHHHHHHHhchhh-c Confidence 0000000 00111111 0001111234888887766653 3333333333 333 3 Q ss_pred cCccccccccCccchhhHH-HHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhc-------CCCCcceEEecchhh Q lcl|NC_018276. 57 KDAYEDYNFVNPDYLQDWN-DYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQT-------SERPAPYFYFMPITA 128 (546) Q Consensus 57 ~n~~~~yqf~~~e~~~d~~-~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~-------s~kp~py~y~l~i~~ 128 (546) +.+.. ..|+.++.|. |==-.+.+..+|-+.-+=.... .=-+-++||+|...+ -..-.||+..++-++ T Consensus 84 k~p~~----~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~-~G~~~ilVD~P~~~~~T~Ade~~~~~rPy~~~~~~~~ 158 (491) T protein:vir:95 84 KEPEI----NIPKELEYLLKNADGSGVGLIQHAQDTLMEIDS-VGRGGLLVDAPETAAATAAEQNAGLLNPTIAFYTTEN 158 (491) T ss_pred CCcee----eccHHHHHHHhccCCCCCCHHHHHHHHHHHHHH-cCeEEEEEecCCCcccCHHHHHHhcCCcEEEEechhh Confidence 44443 3466555432 2223566777776654444444 444557899996531 012259999999999 Q ss_pred hhhhhc---ccccceEEEeeec-----C-CC--------cEEEE--e-cCcc--eeeccCCccc---eeeehhhh-hhhh Q lcl|NC_018276. 129 VKDFKT---NASGAIEWIMLPQ-----G-DN--------QLAVI--D-DEHY--SIYQLDEKGE---ISAEPLTQ-SAHD 182 (546) Q Consensus 129 v~a~r~---n~~~~I~fi~~~~-----d-qn--------~i~~I--D-de~y--~~y~~d~e~~---~sie~l~d-n~h~ 182 (546) |+..|+ ++.+.+.++-+-+ | .+ +++|+ | |..| ++|.++.++. .+.+...+ ..|- T Consensus 159 IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g~~~~~~~~~~~~~g~~~ 238 (491) T protein:vir:95 159 IVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEGGAQEEVVEIYPDLGESL 238 (491) T ss_pred hcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCCcceeeeeeeeecCCCcc Confidence 999995 4545677766633 1 12 24454 2 4433 4565544432 22222223 3489 Q ss_pred ccccceeeEeccccccccccceecchhHHHHHHHHHHHHhhh-hhhhhhhcCCcccccCcccCcccccCCCcccCCceee Q lcl|NC_018276. 183 LGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTS-KKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIK 261 (546) Q Consensus 183 lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts-~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ik 261 (546) ||+.|..++-+.. +.+++-.+||- .|+-+---||.+-| ..|+--.++.|.-+.+..|=- ..+.++ T Consensus 239 l~~IPfv~~~~~~---~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~~----------~~~~~~ 304 (491) T protein:vir:95 239 RGVIPFTFIGATN---NDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDNL----------TPQSFK 304 (491) T ss_pred cCeeEEEEEecCC---CCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccc----------Ccchhh Confidence 9999999995543 45677888876 56666556666554 344445666776665432210 001111 Q ss_pred ccCCceEeeccccccccCCcccccccCccceeeeeCCccccccccccCccceeccc-----HHHHHHHHHHHHHHhhhhe Q lcl|NC_018276. 262 NSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTID-----KASLDYNVEECERIYDKIY 336 (546) Q Consensus 262 n~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d-----~~sley~~~e~kri~d~i~ 336 (546) ... +....-|+++.+.+ |.. + . .+++.+. .+.|+--.+...+++-.|+ T Consensus 305 ~~~-----------------~~~i~~g~~~~~~l--P~~-~-~------~~~ie~~~~~~~~~~l~~~e~qm~~~Ga~l~ 357 (491) T protein:vir:95 305 EAN-----------------PNGIKFGSRCGHNL--GYG-G-S------AQLIQAGENNLARQNMLDKEQQAIQIGAQLI 357 (491) T ss_pred ccC-----------------cceeEecCcCCcCC--CCC-C-c------cceeecCcchHHHHHHHHHHHHHHHHHHHhc Confidence 100 11233455555433 421 1 1 2233332 3444444444455555543 Q ss_pred eeeecCCCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccc-cccccccccccccccC--HHH Q lcl|NC_018276. 337 TACVGFGGDISMNKAFNKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGEL-YVNNTIHYGSDFYLHN--VEE 413 (546) Q Consensus 337 ~s~~Gf~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~-~~~~tv~yGt~fyl~t--~Ee 413 (546) . .+. +.+.++..-...+..+.|-.+-.|.|.|-.-+++-+|+- .|.. --...|.+-.+|-+.. +++ T Consensus 358 ~--------~~~--~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-~G~~~~~~v~i~~n~dF~~~~~~~~~ 426 (491) T protein:vir:95 358 T--------PSQ--QITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMM-LGKPEDSEVEFQLNMDFFLQPMTAQD 426 (491) T ss_pred c--------CCc--chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCCceEEEeecccccccCCHHH Confidence 2 122 334455556666778899999999999999999988887 6853 3456778888987665 787 Q ss_pred HHHHHHHHHhhcccHHHHHHHHHHHHhhhhcCCHHHHHHHHHHhhcCCCccccHHHHHHHHhc-CCCchhheeeeeccch Q lcl|NC_018276. 414 LTAQYEAAKKAGATDYQLDVIQDQIIETENRNNPNAMERAQVLKHLEPYRHQTRKEVLEMLEA-GFGDMELIAVKLNFST 492 (546) Q Consensus 414 Lte~~~~ak~~gaS~~ei~~iq~qi~etEyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~e~-~~~~~Ell~vk~nf~~ 492 (546) +.+-++....-.+|...+ =.+|+|+.|+ +.+.+++..-+++ ++..--.-.|---.+. T Consensus 427 ~~all~~~~~G~is~~t~---------------~~~L~~~~vl-------~~~~e~~~~~ie~~~~~~~~~~~~~~~~~~ 484 (491) T protein:vir:95 427 RAAWMADINAGLLPATAY---------------YAALRKAGVT-------DWTDEDILNAIEDAPLPSGAVTQVAGEIPQ 484 (491) T ss_pred HHHHHHHHhcCCCCHHHH---------------HHHHHhCCCC-------CccHHHHHHHHHhcCCCCCccccccccchh Confidence 777776666544554433 1356776664 2355665555443 3210000000000000 Q ss_pred hheeeccc Q lcl|NC_018276. 493 FVLRFERE 500 (546) Q Consensus 493 fv~rfe~e 500 (546) =+..=| | T Consensus 485 ~~~~~~-~ 491 (491) T protein:vir:95 485 AAQQQQ-E 491 (491) T ss_pred hhhhcc-C Confidence 000000 0 No 6 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=97.10 E-value=0.00017 Score=41.22 Aligned_cols=412 Identities=13% Similarity=0.124 Sum_probs=200.3 Q ss_pred CCcccchh----------HHHHHHHH-------------HHHhccchhhHHHHHHhhcCCCccchHHHHHHHHHHHhccc Q lcl|NC_018276. 1 MNAADARA----------VVNDFLQW-------------VKQLLPKDKYNVFLQLFKFPVSTNELTEEIFNALEKVYDGK 57 (546) Q Consensus 1 ~~a~d~~~----------~~~~fls~-------------vk~~l~kdky~~f~~~f~fpv~tn~lt~~if~~lskvfd~~ 57 (546) -|-+..+. .+.|.+.- ..+.--++.|..+++.--|+- .|..+-+.|.-.-=.+ T Consensus 9 ~~V~~~hp~y~a~~~~W~~ird~~~G~~~~~~r~~yl~~~~~~~~e~~Y~~rl~rA~~~n----~~~~tl~~l~G~vfrk 84 (489) T protein:vir:78 9 SGVKTKHREWLHYAPKWQKVRHALAGELVSYLRNVGLNEPDKAYGEARQAEYEAGGIVYN----FTRRTLSGMVGSVMRK 84 (489) T ss_pred CCCCccCHHHHHHHHHHHHHHHHhcCcccccccCCCCCCCCCCCChHHHHHHHhccccCC----hHHHHHHHHhchhhcC Confidence 00000000 01111110 001111234888887766653 3334444433222234 Q ss_pred CccccccccCccchhhHH-HHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhc-------CCCCcceEEecchhhh Q lcl|NC_018276. 58 DAYEDYNFVNPDYLQDWN-DYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQT-------SERPAPYFYFMPITAV 129 (546) Q Consensus 58 n~~~~yqf~~~e~~~d~~-~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~-------s~kp~py~y~l~i~~v 129 (546) .+.. ..|+.++.|. |==-.+.+..+|-+.-+=.... .=-+-++||+|.... -..-.||+..++-++| T Consensus 85 ~p~~----~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~-~G~~~ilVD~P~~~~~T~ade~~~~~rPy~~~~~~~~I 159 (489) T protein:vir:78 85 EPEI----NIPKELEYLLKNADGSGVGLIQHAQDTLMEIDS-VGRGGLLVDAPETGAATAAEQNAGLLNPTIAFYTTENI 159 (489) T ss_pred Ccce----eccHHHHHHHhccCCCCCCHHHHHHHHHHHHHh-cCeEEEEEeeCCCCCcCHHHHHHhcCCcEEEEechhhh Confidence 4433 3465554432 2223466777776654444444 334556899995421 0122599999999999 Q ss_pred hhhhccccc---ceEEEeeec-----CC-Cc--------EEEE--e-cCcc--eeeccCCccce--ee-ehh-hhhhhhc Q lcl|NC_018276. 130 KDFKTNASG---AIEWIMLPQ-----GD-NQ--------LAVI--D-DEHY--SIYQLDEKGEI--SA-EPL-TQSAHDL 183 (546) Q Consensus 130 ~a~r~n~~~---~I~fi~~~~-----dq-n~--------i~~I--D-de~y--~~y~~d~e~~~--si-e~l-~dn~h~l 183 (546) +..|+.-++ .+.++-+-+ |. +. ++|+ | |..| ++|..+.+|.- +. +++ ....|-| T Consensus 160 inW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~~~~~~~~~~~~~g~~~l 239 (489) T protein:vir:78 160 VNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGGAQEDVVEIYPDLGESLR 239 (489) T ss_pred cCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCcccceeeEEeccCCCCcc Confidence 999965444 577666533 21 22 5554 3 3444 45666555432 22 222 3445999 Q ss_pred cccceeeEeccccccccccceecchhHHHHHHHHHHHHhhh-hhhhhhhcCCcccccCcccCcccccCCCcccCCceeec Q lcl|NC_018276. 184 GYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTS-KKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKN 262 (546) Q Consensus 184 Gy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts-~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn 262 (546) |+.|..++-+.. +.+++-.+||- .|+-+---||.+.| ..|.--.++.|.-+.+.-|=- ..+.++. T Consensus 240 ~~IPfv~~~~~~---~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~----------~~~~~~~ 305 (489) T protein:vir:78 240 GVIPFTFIGATN---NDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENL----------TPQAFKE 305 (489) T ss_pred CeeeEEEEecCC---CCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccC----------Ccccccc Confidence 999999995543 45678888987 56666666666554 345555677777665422100 1111221 Q ss_pred cCCceEeeccccccccCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeec- Q lcl|NC_018276. 263 SHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVG- 341 (546) Q Consensus 263 ~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G- 341 (546) .. +.-..-|+++.+ ++|..+ ..+++.+.-..+- .++++.+.+.|... | T Consensus 306 ~~-----------------~~~i~~g~~~~~--~lp~~~--------~~~~ie~~~~~~~--r~~l~~le~qm~~l--Ga 354 (489) T protein:vir:78 306 AN-----------------PNGIKFGSRRGH--NLGYGG--------SAQLIQAGENNLA--RQNMLDKEQQAIQI--GA 354 (489) T ss_pred cC-----------------ccceeeCCcccc--cCCCCC--------CcceeccCcchHH--HHHHHHHHHHHHHH--hh Confidence 10 011223444444 444211 1233333322221 34444444444321 1 Q ss_pred -CCCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccc-cccccccccccccccC--HHHHHHH Q lcl|NC_018276. 342 -FGGDISMNKAFNKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGEL-YVNNTIHYGSDFYLHN--VEELTAQ 417 (546) Q Consensus 342 -f~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~-~~~~tv~yGt~fyl~t--~EeLte~ 417 (546) .- +.+. +.+.++..-...+..+.|..+-.|.|.|-.-+++-+|+- .|.. --...|.+-.+|-+.. ++.+.+- T Consensus 355 ~l~-~~~~--~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-~G~~~~~~~~i~~n~dF~~~~~d~~~~~al 430 (489) T protein:vir:78 355 QLI-TPTQ--QITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVM-LGKPEDTEVEFRLNMDFFLEPMTAQDRAAW 430 (489) T ss_pred hhc-cCCc--chhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCCceEEEeecccCcccCCHHHHHHH Confidence 00 1122 334445555666778899999999999999999988887 7753 3445667777776654 5656555 Q ss_pred HHHHHhhcccHHHHHHHHHHHHhhhhcCCHHHHHHHHHHhhcCCCccccHHHHHHHH-hcCC-----Cchhheeeeeccc Q lcl|NC_018276. 418 YEAAKKAGATDYQLDVIQDQIIETENRNNPNAMERAQVLKHLEPYRHQTRKEVLEML-EAGF-----GDMELIAVKLNFS 491 (546) Q Consensus 418 ~~~ak~~gaS~~ei~~iq~qi~etEyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~-e~~~-----~~~Ell~vk~nf~ 491 (546) +++..+-.+|.+.+-. +|+|+.|+ +| |.+++.+=+ ++|. ++-|.= T Consensus 431 ~~~~~~G~is~~t~~~---------------~L~~~gv~---d~----~~e~~~~ei~~~~~~~~~~~~g~~~------- 481 (489) T protein:vir:78 431 MADINAGLLPATAYYA---------------ALRKAGVT---DW----TDADIKDAVADQPLPVATEVQGEIP------- 481 (489) T ss_pred HHHHhcCCCCHHHHHH---------------HHHhCCCC---Cc----cHHHHHHHHhhcCCCcccCCcccCC------- Confidence 5555443455443321 23444332 11 233333222 2221 000000 Q ss_pred hhheeecccC Q lcl|NC_018276. 492 TFVLRFEREN 501 (546) Q Consensus 492 ~fv~rfe~en 501 (546) .=-.. .|- T Consensus 482 ~~~q~--~~~ 489 (489) T protein:vir:78 482 QSAQQ--QEK 489 (489) T ss_pred CCccc--ccC Confidence 00000 000 No 7 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=96.57 E-value=0.0005 Score=38.67 Aligned_cols=431 Identities=15% Similarity=0.143 Sum_probs=204.2 Q ss_pred CC-cccchhH----------HHHHHH---HH----HHhccc-----------hhhHHHHHHhhcCCCccchHHHHHHHHH Q lcl|NC_018276. 1 MN-AADARAV----------VNDFLQ---WV----KQLLPK-----------DKYNVFLQLFKFPVSTNELTEEIFNALE 51 (546) Q Consensus 1 ~~-a~d~~~~----------~~~fls---~v----k~~l~k-----------dky~~f~~~f~fpv~tn~lt~~if~~ls 51 (546) |. -+..+.- +.|.+. .| ++.||| ..|..+++.--|+=-|.+.++- -.- T Consensus 32 m~dV~~~hp~y~a~~~~W~~ird~~~G~~~~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~rA~~~n~~~~tl~~---l~G 108 (535) T protein:vir:80 32 LPNVGYQRVEFGEMLPKWRKIMDCLSGQEAIKAKREEYLPMPSVDSRDEEQRRRYETYLQRAIFYNVTARTLDG---MMG 108 (535) T ss_pred CCCCCcCCHHHHHHHHHHHHHHHHhcChHHHHhcccccCCCCCcccCCcCCHHHHHHHHhhccCCChhHHHHHH---Hhc Confidence 43 2222222 222221 12 235777 2388888886665433333322 334 Q ss_pred HHhcccCccccccccCccchhhHH-HHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhh--------cCCCCcceEE Q lcl|NC_018276. 52 KVYDGKDAYEDYNFVNPDYLQDWN-DYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQ--------TSERPAPYFY 122 (546) Q Consensus 52 kvfd~~n~~~~yqf~~~e~~~d~~-~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q--------~s~kp~py~y 122 (546) +|| .+.+. +..|+..+.|. |==-.+.+..+|-+.-+=.....-. +-++||+|... ......||+. T Consensus 109 ~vf-rk~p~----~~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~-~~iLVD~P~~~~~~t~ade~~~~~rPy~~ 182 (535) T protein:vir:80 109 QVF-SRDPI----RQLPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGR-AAIFTDYPNVGRPVTVLEQKLGLYRPTIT 182 (535) T ss_pred hhh-cCCcc----eeccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCe-EEEEEeecCCCCcccHHHHHhcCCCcEEE Confidence 455 33443 33465555543 2223456677776654444444443 44577998542 2224469999 Q ss_pred ecchhhhhhhhcccccc---eEEEee----ecCCC--------cEEEE---ecCccee--eccCCcc--ceee--ehhhh Q lcl|NC_018276. 123 FMPITAVKDFKTNASGA---IEWIML----PQGDN--------QLAVI---DDEHYSI--YQLDEKG--EISA--EPLTQ 178 (546) Q Consensus 123 ~l~i~~v~a~r~n~~~~---I~fi~~----~~dqn--------~i~~I---Dde~y~~--y~~d~e~--~~si--e~l~d 178 (546) +++-++|+..|+.-++. +.++-+ .++++ +++|+ .|..|++ |..++++ ..+. ...++ T Consensus 183 ~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~~~~~~~~~~~~~~~~~~~~~ 262 (535) T protein:vir:80 183 LVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVERWRRETQEEMYYSYSKHVPTD 262 (535) T ss_pred EechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEEEEeecCCccccccceeeccc Confidence 99999999999776554 555554 22222 35554 2334443 3222221 1111 12223 Q ss_pred -hhhhccccceeeEeccccccccccceecchhHHHHHHHHHHHHhhh-hhhhhhhcCCcccccCcccCcccccCCCcccC Q lcl|NC_018276. 179 -SAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTS-KKHLDLYAPYPIYSGFEQDCTYENKENGDYCD 256 (546) Q Consensus 179 -n~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts-~~hldlya~YpiYs~yeedC~ye~~~~~~~C~ 256 (546) ..|-|||.|..++-+. .+.+++-.+||- .|+-+---||.+-| ..|+--+++.|.=+.+.-+=- .. + T Consensus 263 ~g~~~l~~IPfv~~~~~---~~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~il~~~~~P~l~i~G~~~~--~~------~ 330 (535) T protein:vir:80 263 GNGNPFKEIPFQFIGPL---DNNADIDHPPLL-DLCEVNIGHYRNSADYEEMAFVAGQPTAFFTGLTKD--WV------E 330 (535) T ss_pred CCCcccCeeEEEEeecC---CCCCCCCccchH-HHHHHHHHHhhchhHHHHHHHHhcCceeeeecCchh--hh------h Confidence 5599999999988333 244678888987 56666666777776 566666777776665421100 00 0 Q ss_pred CceeeccCCceEeeccccccccCCcccccccCccceeeeeCCccccccccccCcccee--cccHHHHHHHHHHHHHHhhh Q lcl|NC_018276. 257 SGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQIT--TIDKASLDYNVEECERIYDK 334 (546) Q Consensus 257 ~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~--s~d~~sley~~~e~kri~d~ 334 (546) .+..+.- -..|+++.+. +|..+. +. .+++. ++..+.++-..+...+++-. T Consensus 331 ----~~~~~~~-----------------i~iG~~~~~~--lP~~~~--~~---~~e~~~~~~a~~~l~~~e~qM~~lGa~ 382 (535) T protein:vir:80 331 ----DVFKDFK-----------------VHLGSRAIIP--LPQGAT--AG---ILQITPNSVPFEAMTHKESQMIAMGAN 382 (535) T ss_pred ----cCCCCcc-----------------eEecCccccc--CCCCCC--cc---eeeeccchhHHHHHHHHHHHHHHHHHH Confidence 0000111 2235666654 442222 22 11221 22233333333333344333 Q ss_pred heeeeecCCCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccccc--ccccccccccccc--C Q lcl|NC_018276. 335 IYTACVGFGGDISMNKAFNKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYV--NNTIHYGSDFYLH--N 410 (546) Q Consensus 335 i~~s~~Gf~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~--~~tv~yGt~fyl~--t 410 (546) ++..- ..++++++.. -...+..+.|-.+-.|.|.|-.-+++.+|+ -.|.-.- +-.|.+-.+|-.. + T Consensus 383 ll~~~-------~~~~Ta~~a~--~~~~~~~S~L~~~a~~le~al~~aL~~~A~-w~G~~~~~~~~~i~~n~dF~~~~ld 452 (535) T protein:vir:80 383 LLVKS-------GGNRTFGEAQ--QEEASEQSILSACTKNVSMAFRKALRWANQ-FQTGIVNDETVEYNLNTDFPAARLT 452 (535) T ss_pred hhccC-------cccccHHHHH--HHHHHHhHHHHHHHHHHHHHHHHHHHHHHH-HcCCccCCCceEEEeccccccccCC Confidence 33222 2245555443 345556788888899999998888885544 3553221 2235677777655 4 Q ss_pred HHHHHHHHHHHHhhcccHHHHHHHHHHHHhhhhcCCHHHHHHHHHHhhcCCCccccHHHHHHHHhcCCCchhheeeeecc Q lcl|NC_018276. 411 VEELTAQYEAAKKAGATDYQLDVIQDQIIETENRNNPNAMERAQVLKHLEPYRHQTRKEVLEMLEAGFGDMELIAVKLNF 490 (546) Q Consensus 411 ~EeLte~~~~ak~~gaS~~ei~~iq~qi~etEyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~e~~~~~~Ell~vk~nf 490 (546) ++++.+-++...+-.+|.+.+- .+|+|+.||- |.... +|... T Consensus 453 ~~~~~all~~~~~G~Is~et~~---------------~~L~r~gvl~---~~~~~--eee~~------------------ 494 (535) T protein:vir:80 453 PNERAELILEWQQGAITFKEMR---------------AGLRRAGVAS---EDDAK--AETEG------------------ 494 (535) T ss_pred HHHHHHHHHHHhcCCCCHHHHH---------------HHHHhCCCCC---cccch--HHHHH------------------ Confidence 6766666655554335553321 1355554432 32222 22111 Q ss_pred chhheeecccCCchhhhhhhcChhhhHHHHHHHHHhhhcchhhcCCCCCCCCCCCC Q lcl|NC_018276. 491 STFVLRFERENTDIVEFGNALPFDQKISIILNTFKSYGKTEYTSQPADGGAGSGES 546 (546) Q Consensus 491 ~~fv~rfe~en~~i~efg~~l~~~~kieii~ntl~sygn~e~~~qp~~~~~~~~~~ 546 (546) |.|.|..+ .+.+.... |+ ..+|.+. .-|+|-|-|.|.. T Consensus 495 -----ri~~E~~~---~~~~~g~~-------~d-~~~~g~~--~~~~~~~~~~~~~ 532 (535) T protein:vir:80 495 -----KATVEFIA---KTAAAGKV-------GD-AASGGTN--KAKLNNGNGGGNQ 532 (535) T ss_pred -----HHHhhhhh---ccccCCCC-------CC-CCCCCCC--cCcccCCcccccc Confidence 11112111 00000000 00 0111111 1233322222222 No 8 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=89.83 E-value=0.024 Score=29.45 Aligned_cols=394 Identities=14% Similarity=0.060 Sum_probs=169.6 Q ss_pred HHHHhccchhhHHHHHHhhcCCCccchHHHHHH--------------HHHHHhcccCccc-------------------- Q lcl|NC_018276. 16 WVKQLLPKDKYNVFLQLFKFPVSTNELTEEIFN--------------ALEKVYDGKDAYE-------------------- 61 (546) Q Consensus 16 ~vk~~l~kdky~~f~~~f~fpv~tn~lt~~if~--------------~lskvfd~~n~~~-------------------- 61 (546) -++.-.|-+|= .|-++|.+|-....++.+... +|.+.|+|++... T Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~k 79 (468) T protein:vir:96 1 MIDIFWPNEKP-YHERVVEQIKPQYETQEEMILRLITKHKENVEDITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDWR 79 (468) T ss_pred CccccCCcCce-eehheeecccccccCcHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccccccccccccccccccccc Confidence 00000111111 233333333222222222221 1222333332110 Q ss_pred --------------ccccc--------CccchhhHHHHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhcCCCCcc Q lcl|NC_018276. 62 --------------DYNFV--------NPDYLQDWNDYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQTSERPAP 119 (546) Q Consensus 62 --------------~yqf~--------~~e~~~d~~~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~p 119 (546) +|-|. +.+..+-+..++.. ++.+.- .+..+.--.-=-+.+.|.+. +.+.| T Consensus 80 i~~n~~~~Iv~~~~~~l~g~p~~~~~~d~~~~~~l~~~~~n--~~~~~~-~~~~~~~~~~G~~~~~v~~d-----~~~~~ 151 (468) T protein:vir:96 80 MYTNYHQNLVDQKVAYAVANPVTYGTEDEKSLKTIQEVLNH--KWDDKL-VDILTAASNKGVEWIQPYVD-----EQGEF 151 (468) T ss_pred cccchHHHHHHHHHhhhccCCceeccCChHHHHHHHHHHhc--CHHHHH-HHHHHHHhhcCeEEEEEEEc-----CCCce Confidence 11111 11111212222111 111110 11111111111233444442 23455 Q ss_pred eEEecchhhhhhhhccc-cc-ceEEEee--ecCCCcEEEEecCcceeeccCCccce-----------eeehhhhhhhhcc Q lcl|NC_018276. 120 YFYFMPITAVKDFKTNA-SG-AIEWIML--PQGDNQLAVIDDEHYSIYQLDEKGEI-----------SAEPLTQSAHDLG 184 (546) Q Consensus 120 y~y~l~i~~v~a~r~n~-~~-~I~fi~~--~~dqn~i~~IDde~y~~y~~d~e~~~-----------sie~l~dn~h~lG 184 (546) -+-+++-..++.+--+. .+ -++++-+ .++...+-+..+.++..|...+.+.+ .........|.|| T Consensus 152 ~i~~~~p~~~~~v~~~~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 231 (468) T protein:vir:96 152 KTFRVPAEQAIPIWTNKERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSMSWN 231 (468) T ss_pred EEEEEcccceEEEEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccccCC Confidence 55555555554442221 12 2344433 23333454555555555554332211 1111233459999 Q ss_pred ccceeeEeccccccccccceecchhHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccC Q lcl|NC_018276. 185 YCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSH 264 (546) Q Consensus 185 y~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~ 264 (546) .||.-.|.-+..+.|. .+++-..+||+|+.+.- ...--.|++-|+++...- .-++ .+++..+ T Consensus 232 ~iPvv~~~n~~~g~sd----~e~v~~liDa~d~~~S~---~~~~~~~~~~p~lv~~g~--~~~~-------~~~~~~~-- 293 (468) T protein:vir:96 232 RVPFIPFKNNPQEVSD----LFMYKTIIDAMDKRLSD---TQNTFDEATELIYVLKGY--EGED-------LEEFMYN-- 293 (468) T ss_pred cccEEEecCCCCCCCc----hHHHHHHHHHHHHHHHH---HHHHHHHhcCceeeeecC--Cccc-------cchhhhh-- Confidence 9999999766655543 24555556777766542 111113456677664421 1000 0111111 Q ss_pred CceEeeccccccccCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeec--C Q lcl|NC_018276. 265 GDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVG--F 342 (546) Q Consensus 265 gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G--f 342 (546) +++ +.++.++-. .+++ +++++.+.. .+-..+.++++.+.||..+-+ + T Consensus 294 ---------------------~~~-~~~i~~~~d----~~~~----~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~ 342 (468) T protein:vir:96 294 ---------------------LKY-YKAINVDGD----GSGG----VDTIQIDVP-VQSAKEYLDMLRDYVIEFGQGVDF 342 (468) T ss_pred ---------------------hhc-CceEEecCC----CCCc----ceEEeecCC-hHHHHHHHHHHHHHHHHHhCcccc Confidence 112 223333221 1222 555555442 244445567777777776433 2 Q ss_pred CCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccccccccccccccccccCHHHHHHHHHHHH Q lcl|NC_018276. 343 GGDISMNKAFNKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYGSDFYLHNVEELTAQYEAAK 422 (546) Q Consensus 343 ~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yGt~fyl~t~EeLte~~~~ak 422 (546) ..+. .+-+.+-.-+++.+..+..|..+..+-|+++.+-+...|+++. |..+=.-.| .-.|-...|.-++++...|+ T Consensus 343 ~~~~-~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~-g~~~d~~~i--~i~f~~~~p~d~~e~a~~~~ 418 (468) T protein:vir:96 343 QQDK-FGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY-KLSIKVQDV--EITFNFNVMVNELEQSQIGV 418 (468) T ss_pred cccc-cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCccccee--eEEecCCCCcCHHHHHHHHH Confidence 2111 1123444556777777778888888889999888888888763 543321122 23466667777888888888 Q ss_pred hhcccHHHHHHHHHHHHhhhhcCCH-HHHHHHHHHhhcCCCccccHHHHHHHHhcCCCchhheeeeeccch Q lcl|NC_018276. 423 KAGATDYQLDVIQDQIIETENRNNP-NAMERAQVLKHLEPYRHQTRKEVLEMLEAGFGDMELIAVKLNFST 492 (546) Q Consensus 423 ~~gaS~~ei~~iq~qi~etEyrNdP-~qmqR~~iL~~lEPf~~lT~~EvveL~e~~~~~~Ell~vk~nf~~ 492 (546) .+|+=+-|-.. + ++ =+-.|| .+|+|+.. + ++...-.+.++++.+.= =|| T Consensus 419 ~~g~iS~et~i-~-~l---~~v~D~~~E~~ri~~--E---------~~~~~~~~~~~~~~~~~-----~~~ 468 (468) T protein:vir:96 419 NSQYLSKETVV-T-NH---PWVDDPVAEMERIDQ--E---------ELALPSIEEGLNGKENN-----EPT 468 (468) T ss_pred hcCCCchHHHH-H-hC---CCCCCHHHHHHHHHH--H---------HHHHHHHhhccCCCCCC-----CCC Confidence 88853333222 1 11 123466 45666543 1 11111112255554432 233 No 9 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=87.41 E-value=0.039 Score=28.31 Aligned_cols=407 Identities=9% Similarity=-0.010 Sum_probs=174.6 Q ss_pred HHHHHHHHHHHhccchhhHHHHHH----hhcCCC-----------ccchHHHHHHHHHHHhcccCccccc-cccCccchh Q lcl|NC_018276. 9 VVNDFLQWVKQLLPKDKYNVFLQL----FKFPVS-----------TNELTEEIFNALEKVYDGKDAYEDY-NFVNPDYLQ 72 (546) Q Consensus 9 ~~~~fls~vk~~l~kdky~~f~~~----f~fpv~-----------tn~lt~~if~~lskvfd~~n~~~~y-qf~~~e~~~ 72 (546) .|++|.+.-++.+- |+.++..- ..-|.. .+.+...|-+...-.+-|..+.+.+ .-...+.++ T Consensus 1 ~~~~~~~~~~~r~~--~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~ 78 (440) T protein:vir:95 1 MLAAFLGSQKQRLA--ILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLS 78 (440) T ss_pred ChhhHHHHHHHHHH--HHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHH Confidence 66677665444332 22222211 111111 1122233333333333333322211 001111122 Q ss_pred hHHHHHHhhcchhhhHH--HHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhccc-cc-ceEEEee--e Q lcl|NC_018276. 73 DWNDYRSSVLNAYHFWR--YEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNA-SG-AIEWIML--P 146 (546) Q Consensus 73 d~~~y~s~vLn~~~fw~--~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~-~~-~I~fi~~--~ 146 (546) -|.++ +..-+|.. .++.+.--.-=.+.++|-. .+.+.|-+-+++-..+..+.-.. .+ -+.++-+ . T Consensus 79 ~l~~~----~~~n~~~~~~~~~~~~~~~~G~a~~~~~~-----d~~~~~~i~~~~p~~~~~~~d~~~~~~~~~~i~~~~~ 149 (440) T protein:vir:95 79 TIKDI----EWQNDINALNSDLAFDASVYGRAYEYHFR-----DKDKVDRVVLISPLEMFVIRDLTVEQNIIAAVHLPIY 149 (440) T ss_pred HHHHH----HHhcCHhHHHHHHHHHHhhcCeEEEEEEe-----cCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEe Confidence 22222 22222222 1222221111123334432 23445555555555555443221 22 2555554 3 Q ss_pred cCCCcEEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHHHHHhhhhh Q lcl|NC_018276. 147 QGDNQLAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTSKK 226 (546) Q Consensus 147 ~dqn~i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts~~ 226 (546) .++..+-+-++....+|...+.+..-.+.-....|.||.||.-.|..+..+.|. + .++-...+|+|+.+.-.---. T Consensus 150 ~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd--~--e~v~~lida~~~~~s~~~~~~ 225 (440) T protein:vir:95 150 ADKVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNRFRMGD--Y--ESEISLIDAYDAGQSDTANYM 225 (440) T ss_pred cCceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCCCCCCc--h--hhhHHHHHHHHHHHHHHHHHH Confidence 333446567777777776655443334433455699999999988665444332 3 355555688888663322111 Q ss_pred hhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceeeeeC--Ccccccc Q lcl|NC_018276. 227 HLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPL--PSRENEG 304 (546) Q Consensus 227 hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~pi--P~~~~n~ 304 (546) . ....|+..-+|+...+.....+ ++.+ +.++-+...+. +..+... T Consensus 226 ~-~~~~~~~v~~g~~~~~~~~~e~------~~~~--------------------------~~~~~~~~~~~~~~~~~~~~ 272 (440) T protein:vir:95 226 S-DLNDAMLLVKGDLDGIKLSPED------AAKM--------------------------KDANMLFLKTGISTTGQQTT 272 (440) T ss_pred H-HhhcceeeeecccccCCCCccc------hhhh--------------------------hhccceecccccccccCCCC Confidence 1 1123344445655555422110 0111 11111111110 1111122 Q ss_pred ccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCCCCccc---chhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHH Q lcl|NC_018276. 305 ADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFGGDISM---NKAFNKDQVKANTESRRNVLLSVKKNFERVQAW 381 (546) Q Consensus 305 ~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~~d~q~---~ka~ne~qV~s~~es~r~~l~~lkknfer~qkf 381 (546) + .|++++.+. ..+.....++++.+.|+...-. ++.+. +-+.+-.-+++.+-.+..+..+..+.|+++..- T Consensus 273 ~----~~~~lt~~~-~~~~~~~~~~~l~~~i~~~s~~--p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~ 345 (440) T protein:vir:95 273 A----DASYIYKQY-DVNGTEAYKNRLANDIHRFSRI--PNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKALRR 345 (440) T ss_pred c----ceeEEeecC-CHHHHHHHHHHHHHHHHHHhCC--cccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 256666553 2344556677777777765422 11111 112334446666667777777777888888887 Q ss_pred HHHHHHHHh---cccc--cccccccccccccccCHHHHHHHHHHHHh-hcccHHHHHHHHHHHHhhhhcCCHHHHHHHHH Q lcl|NC_018276. 382 VEDTVCRLR---YGEL--YVNNTIHYGSDFYLHNVEELTAQYEAAKK-AGATDYQLDVIQDQIIETENRNNPNAMERAQV 455 (546) Q Consensus 382 V~dti~~Lr---yg~~--~~~~tv~yGt~fyl~t~EeLte~~~~ak~-~gaS~~ei~~iq~qi~etEyrNdP~qmqR~~i 455 (546) +...+|.+. .|.. ++..+| .|-...|.-+.+....+.+ +|+=+-|-..-+. -+=.+|.+|+|++- T Consensus 346 ~~~li~~~~~~~~~~~~~~~~v~i----~f~~~~p~~~~~~ad~~~kl~g~iS~et~~~~l-----~~~d~~~E~~ri~~ 416 (440) T protein:vir:95 346 RYELISNIHKAINGPVIEANKLTF----TFHPNIPQDVWTEIKAYIEAGGEISQETLMENA-----SFTDYKTEHSRILK 416 (440) T ss_pred HHHHHHHHHhhcCCcccccccceE----EeCCCCCCCHHHHHHHHHHHhccCcHHHHHHhC-----CCCCcHHHHHHHHH Confidence 777777652 3322 223333 4445555555555444333 3433322221111 12235777887664 Q ss_pred HhhcCCCccccHHHHHHHHhcCCCchh Q lcl|NC_018276. 456 LKHLEPYRHQTRKEVLEMLEAGFGDME 482 (546) Q Consensus 456 L~~lEPf~~lT~~EvveL~e~~~~~~E 482 (546) =+. .- .+..++..--...|-.+.| T Consensus 417 E~~--~~-~~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 417 QGG--SS-DLEIGQIVGDADVGQADTE 440 (440) T ss_pred HHH--Hh-hhhHHhhccCCCCCCcCCC Confidence 333 10 1111111111112334444 No 10 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=86.52 E-value=0.045 Score=27.97 Aligned_cols=405 Identities=11% Similarity=0.029 Sum_probs=169.7 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHH--HhhcCC----CccchHH----HHHHHHHHHhcccCccccccccCccc Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQ--LFKFPV----STNELTE----EIFNALEKVYDGKDAYEDYNFVNPDY 70 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~--~f~fpv----~tn~lt~----~if~~lskvfd~~n~~~~yqf~~~e~ 70 (546) |++-.-++.+..+..++..+-.-.+|-.=-+ +-+.+- +.+.++. .|-+...-..=|..+. |.-.+.+. T Consensus 1 l~~~~l~~~i~~~~~~~~r~~~l~~yy~g~~~il~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~--~~~~~~~~ 78 (429) T protein:vir:98 1 MTKDLLSELIQKHRSFNLSYSAYKQLYEGDHAILQQKQKEQYKPDNRLVVNFAKYIVDTFNGYFIGVPVQ--TSHENKQV 78 (429) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccCCCcceeecchHHHHHHHHhhhhcccCce--eecCChHH Confidence 8777766666666555544443333311000 001111 1122222 2222221111121111 22122222 Q ss_pred hhhHHHHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhccc-cc-ceEEEeeecC Q lcl|NC_018276. 71 LQDWNDYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNA-SG-AIEWIMLPQG 148 (546) Q Consensus 71 ~~d~~~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~-~~-~I~fi~~~~d 148 (546) .+-|..+.+. .++.. .-.++.+.--.-=-+.++|-.. +.+.|-+=|++=..+..+-.+. .+ -+.++-|..+ T Consensus 79 ~~~l~~~~~~-n~~~~-~~~~~~~~~~~~G~~~~~v~~d-----~~g~~~~~~~~p~~~~~v~dd~~~~~~~~~i~~~~~ 151 (429) T protein:vir:98 79 SNYLELLDGY-NDQDD-NNAELSKICSIYGHGYELVFND-----ENAEAGITYLTPLEAFIVYDDSIRQKPLFAVRYFYN 151 (429) T ss_pred HHHHHHHHhh-cCHhH-HHHHHHHHHhhcCeEEEEEEec-----CCCcEEEEEEcccceEEEEeCCCCCceEEEEEEEEe Confidence 3334443333 11211 1223333322222344455432 3455656555444444433222 22 3555555344 Q ss_pred CCc---EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHHHHHhhhh Q lcl|NC_018276. 149 DNQ---LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTSK 225 (546) Q Consensus 149 qn~---i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts~ 225 (546) ++. .++.+++....|...+.+ .++.....|.||.||--.|.-+..+.|. + +++-..+||+|+.+.-. T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~~g~sd--~--e~v~~liD~~d~~~s~~--- 221 (429) T protein:vir:98 152 KGGVLEGSYSDASNITYFKDGEKG---IEIGESEPHPFDGVPMIEYVENEERQSL--L--ASVVTLINAFNKAISEK--- 221 (429) T ss_pred cCceEEEEEEeCceEEEEEecCCc---eEecccccccCCccceEEecCCCCCCCc--H--HHHHHHHHHHHHHHHHH--- Confidence 333 555666666666544442 3333556699999998887555554433 3 35555557777755432 Q ss_pred hhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceeeeeCCccccccc Q lcl|NC_018276. 226 KHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGA 305 (546) Q Consensus 226 ~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~ 305 (546) ..---|++-|+..... +. . ++.++++ +-.+.++ .+|+.+...+ T Consensus 222 ~~~~~~~~~p~~~i~g--~~-----~----~~~~~~~------------------------~~~~~~~--~~~~~~~~~~ 264 (429) T protein:vir:98 222 ANDVEYFADAYLKILG--AE-----L----DDETLKS------------------------LRDTRII--NLKDTDAQQL 264 (429) T ss_pred HHHHHHhcCceeeeec--CC-----C----Ccchhhh------------------------HhhCcee--eccCCCCCCc Confidence 2222456677766431 00 0 0011110 0112233 3344333334 Q ss_pred cccCccceecccHHHHHHHHHHHHHHhhhheeeeecCCCCcccch--hhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHH Q lcl|NC_018276. 306 DLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFGGDISMNK--AFNKDQVKANTESRRNVLLSVKKNFERVQAWVE 383 (546) Q Consensus 306 Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~~d~q~~k--a~ne~qV~s~~es~r~~l~~lkknfer~qkfV~ 383 (546) | +++++.+. ..+...+.++++.+-|+...-+ ++.+.+. ..+-.-+++.+.++..+..+..+-|+++.+-+. T Consensus 265 ~----~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~--p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~ 337 (429) T protein:vir:98 265 T----VEFLQKPD-ADATQEHLLDRLENLIFRTAMV--ANISDESFGTASGIALRYRLQAMDNLAKTKERKFMSGMNRRY 337 (429) T ss_pred c----eeEEeecC-CHHHHHHHHHHHHHHHHHHhCc--cccCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4 45554433 2333444566777777655422 1111111 123333445555555555566666777666666 Q ss_pred HHHHHH--hcccc--cccccccccccccccCHHHHHHHHHHHHh-hcccHHHHHHHHHHHHhhhhcCCH-HHHHHHHHHh Q lcl|NC_018276. 384 DTVCRL--RYGEL--YVNNTIHYGSDFYLHNVEELTAQYEAAKK-AGATDYQLDVIQDQIIETENRNNP-NAMERAQVLK 457 (546) Q Consensus 384 dti~~L--ryg~~--~~~~tv~yGt~fyl~t~EeLte~~~~ak~-~gaS~~ei~~iq~qi~etEyrNdP-~qmqR~~iL~ 457 (546) ..++++ .-|.. +...+|. |-...|.-+.++...+.+ +|+=+.|-.. .++ -+-.|| .+|+|++.=+ T Consensus 338 ~li~~~~~~~~~~~d~~~i~v~----f~~~~p~~~~~~a~~~~kl~g~is~et~~--~~l---~~v~d~~~E~~ri~~E~ 408 (429) T protein:vir:98 338 KLIASYPTSKIGPKDWIGIKYK----FTRNLPANLLEESQIAGNLAGIVSEETQV--GVL---SIVENPQKEIERKNSDK 408 (429) T ss_pred HHHHHHhccCCCccccccceEE----eCCCCCcCHHHHHHHHHHHhccCchHHHH--HhC---CCCCCHHHHHHHHHHHH Confidence 666664 11212 2233443 444445445554443333 3332222111 000 122355 4566666544 Q ss_pred hcCCCccccHHHHHHHHhc-CCCchh Q lcl|NC_018276. 458 HLEPYRHQTRKEVLEMLEA-GFGDME 482 (546) Q Consensus 458 ~lEPf~~lT~~EvveL~e~-~~~~~E 482 (546) +- ...++...|-.. +=++.| T Consensus 409 ~~-----~~~~~~~~~~~~~~~~~~~ 429 (429) T protein:vir:98 409 ST-----LISRQAGGLNGQNTTTILE 429 (429) T ss_pred HH-----HHHHHHhhhcCCCCCCCCC Confidence 31 011122222221 222333 No 11 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=83.87 E-value=0.065 Score=27.09 Aligned_cols=437 Identities=9% Similarity=0.027 Sum_probs=163.4 Q ss_pred CCcccchhHHHHHHH-HHHHhccchhhHHHHH----HhhcCCCc----------cchHHHHHHHHHHHhcccCccccccc Q lcl|NC_018276. 1 MNAADARAVVNDFLQ-WVKQLLPKDKYNVFLQ----LFKFPVST----------NELTEEIFNALEKVYDGKDAYEDYNF 65 (546) Q Consensus 1 ~~a~d~~~~~~~fls-~vk~~l~kdky~~f~~----~f~fpv~t----------n~lt~~if~~lskvfd~~n~~~~yqf 65 (546) ++..+-.+.+..+.. +++.+ +|+..+.. +.+-|.+. ..+...|-+.+..-+=|..+. | T Consensus 15 ~~~~~~~~~i~~~~~~~~~r~---~~~~~yy~g~~~i~~~~~~~~~~~~~~ki~~n~~~~iv~~~~~~l~g~~~~----~ 87 (489) T protein:vir:99 15 LWIDQLKNYISRFKAEQLERL---KELKRYYLGDNNIKYRPAKTDKYAADNRIASDFAKYITVFEQGYMLGVPVE----Y 87 (489) T ss_pred CCHHHHHHHHHHHHHHHHHHH---HHHHHHhcccCccccccccccccCCcceeecchHHHHHHHHhhhhccCCce----e Confidence 233333333333321 11111 33333332 11222211 122233333332222222221 1 Q ss_pred cCccchhhHHHHHHhhcchhhhH--HHHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhcccc-c-ceE Q lcl|NC_018276. 66 VNPDYLQDWNDYRSSVLNAYHFW--RYEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNAS-G-AIE 141 (546) Q Consensus 66 ~~~e~~~d~~~y~s~vLn~~~fw--~~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~~-~-~I~ 141 (546) +.+ + ++-..++.++++.-.|. -.+..+.--..=-+..+|-+..- ..+.+.|.+.+++-.++..+.-+.. + -+. T Consensus 88 ~~~-d-~~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~~~~~v~~~~~-~d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~ 164 (489) T protein:vir:99 88 KNE-N-KDLQAAIDLMSVRNNEDYHNVKIKTDLSIYGRAYELLTVEKI-DDKKTEVKLYQLPAEQTFVIYDDTYQRNSLM 164 (489) T ss_pred ecC-C-hhHHHHHHHHHhhcChhHHHHHHHHHHhhCCeEEEEEeeccC-cCCCcceEEEEEcccceEEEEcCCCCCceEE Confidence 111 1 12233344433322222 22333332221133444444322 2467888888888777766653322 2 344 Q ss_pred EEee---ecCCCc----EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHH Q lcl|NC_018276. 142 WIML---PQGDNQ----LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAA 214 (546) Q Consensus 142 fi~~---~~dqn~----i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A 214 (546) ++-+ ..+++. +.+.+++.-..|...+.+..-.++....-|.||.||.--|.-..-+.|. + .++-..+|| T Consensus 165 ~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~s~--~--~~v~~liDa 240 (489) T protein:vir:99 165 AVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYANNEERTGA--Y--ESVLDNIDA 240 (489) T ss_pred EEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEeecCCCCCCc--h--hhhHHHHHH Confidence 4433 122222 4455555444454433222223333455699999998777443332221 2 234444566 Q ss_pred HHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceee Q lcl|NC_018276. 215 LDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIE 294 (546) Q Consensus 215 ~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~ 294 (546) +|+.+.-. .+---|++-|+....-.... ..+.++.=+ +...+..|.+-.. .-...+.++. T Consensus 241 ~d~~~s~~---~~~~~~~~~~~l~i~g~~~~--~~~~~~~~~-~~~~~~~~~~~~~--------------~~~~~~~~~~ 300 (489) T protein:vir:99 241 YDLSQSEL---ANFQQDSVNALLVIAGNAYT--GADENDYLD-DGRLNPNGRLAIS--------------IGFKKAQVLI 300 (489) T ss_pred HHHHHHHH---HHHHHHhhhhhhhhccCCcc--cccchhhhh-hcccccccccccc--------------cccccceeee Confidence 66655332 22223455666554332222 110000000 0001111111111 0001111111 Q ss_pred -eeCCccccccccccCccceeccc--HHHHHHHHHHHHHHhhhheeeeec--CCCCcccchhhhhhhhhhhhhHHHHHHH Q lcl|NC_018276. 295 -VPLPSRENEGADLRNPVQITTID--KASLDYNVEECERIYDKIYTACVG--FGGDISMNKAFNKDQVKANTESRRNVLL 369 (546) Q Consensus 295 -~piP~~~~n~~Dl~~pv~i~s~d--~~sley~~~e~kri~d~i~~s~~G--f~~d~q~~ka~ne~qV~s~~es~r~~l~ 369 (546) .+-++..... .-|++++.+ .++++ ..++++...||...-+ +..+ +...+.+..-+++.+-++..+.. T Consensus 301 ~~~~~~~~~~~----~~~~~l~~~~~~~~~~---~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Al~~~~~~l~~k~~ 372 (489) T protein:vir:99 301 LDDNPNPNGVK----PQAYFLKKEYDTAGSE---AYKNRLVADILRFTFTPDTQDM-KFSGVQSGESMKYKLMASDNYRE 372 (489) T ss_pred eccccCccccc----cceeeeeecCChHHHH---HHHHHHHHHHHHHhCCcccccc-cccccchHHHHHHHHHHHHHHHH Confidence 1111111111 225555433 44444 4566777777754322 1111 11122334444444444555545 Q ss_pred HHhhcHHHHHHHHHHHHHHH---hccccccccccc-ccccccccCHHHHHHHHHHHHhh-cccHHHHHHHHHHHHhhhhc Q lcl|NC_018276. 370 SVKKNFERVQAWVEDTVCRL---RYGELYVNNTIH-YGSDFYLHNVEELTAQYEAAKKA-GATDYQLDVIQDQIIETENR 444 (546) Q Consensus 370 ~lkknfer~qkfV~dti~~L---ryg~~~~~~tv~-yGt~fyl~t~EeLte~~~~ak~~-gaS~~ei~~iq~qi~etEyr 444 (546) +-.+-|+++-+-+...++.+ ..+......... -.-.|....|.-++++...+.+. |+-+-|-..-+...+. - T Consensus 373 ~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~giis~et~~~~l~~v~---~ 449 (489) T protein:vir:99 373 KQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLYGIVSDQTIFEILNTVT---G 449 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhcCCCC---c Confidence 44555555555555555443 333322222111 13466777777777776665554 4333222111111111 1 Q ss_pred CCHHH-HHHHHHHhhcCCCccccHHHHHHHHhcC-CCchhheeeeeccchhheeecccCCchhhhhhhcC Q lcl|NC_018276. 445 NNPNA-MERAQVLKHLEPYRHQTRKEVLEMLEAG-FGDMELIAVKLNFSTFVLRFERENTDIVEFGNALP 512 (546) Q Consensus 445 NdP~q-mqR~~iL~~lEPf~~lT~~EvveL~e~~-~~~~Ell~vk~nf~~fv~rfe~en~~i~efg~~l~ 512 (546) .||.+ |+|++.=+.-+ .+..++...| ..+++- +.++. | T Consensus 450 ~d~~~E~~ri~~E~~~~-------~~~~~~~~~~~~~~~~~--------------~~~~~---------p 489 (489) T protein:vir:99 450 VDAEAELKRLKEEADKK-------QSLPEPRLVGDASGQEE--------------PTAEK---------P 489 (489) T ss_pred hhHHHHHHHHHHHHHHH-------hccccccccCCCCCCcC--------------CCCCC---------C Confidence 25654 77775422110 0000110000 000000 00000 0 No 12 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=77.52 E-value=0.12 Score=25.56 Aligned_cols=408 Identities=10% Similarity=0.049 Sum_probs=170.8 Q ss_pred CCccc-chhHHHHHHHH----HHHhccchh-hHHHHH---HhhcC-------------------CCccchH----HHHHH Q lcl|NC_018276. 1 MNAAD-ARAVVNDFLQW----VKQLLPKDK-YNVFLQ---LFKFP-------------------VSTNELT----EEIFN 48 (546) Q Consensus 1 ~~a~d-~~~~~~~fls~----vk~~l~kdk-y~~f~~---~f~fp-------------------v~tn~lt----~~if~ 48 (546) ++..+ +...++.|+.. .+++....+ |..... ..+=| -+.|-|+ ..|-+ T Consensus 10 ~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd 89 (474) T protein:vir:94 10 IEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVD 89 (474) T ss_pred ccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHH Confidence 22222 12233444333 333322111 111111 00000 0111121 12211 Q ss_pred HHHHHhcccCccccccccC-ccchhhHHHHHHhhcchhhhHH--HHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecc Q lcl|NC_018276. 49 ALEKVYDGKDAYEDYNFVN-PDYLQDWNDYRSSVLNAYHFWR--YEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMP 125 (546) Q Consensus 49 ~lskvfd~~n~~~~yqf~~-~e~~~d~~~y~s~vLn~~~fw~--~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~ 125 (546) ...-.+=|+ |. .|..-. .+.-+....++..+++.-+|+. .+..+.--.-=.+.++|-. .+.+.|.+-+++ T Consensus 90 ~~~~yl~g~-pv-~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~-----d~~~~~~~~~i~ 162 (474) T protein:vir:94 90 TRVGYLHGV-PV-TYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYI-----DTNGDIRIKNID 162 (474) T ss_pred hHhhheecc-ce-eEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe-----CCCCeeEEEEEc Confidence 111111121 11 122211 1223333444444444444443 2333333333345555543 345567787887 Q ss_pred hhhhhhhhcccccceEEEeee----cCCCc----EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccc Q lcl|NC_018276. 126 ITAVKDFKTNASGAIEWIMLP----QGDNQ----LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLM 197 (546) Q Consensus 126 i~~v~a~r~n~~~~I~fi~~~----~dqn~----i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~ 197 (546) -..+..+.-+..+.+.++-+. ..++. +-+-+++.+.+|..++.+.... .....|.||.+|---|.....+ T Consensus 163 p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~--~~~~~~~~g~vPvv~~~n~~~g 240 (474) T protein:vir:94 163 PYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQE--VGRYEHLFDYNPLFGVPNNKEM 240 (474) T ss_pred ccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccc--cccccCCCCccceEEecCCCCC Confidence 777777664444455555441 22332 3345777777777665443322 2445699999998877666554 Q ss_pred cccccceecchhHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccc Q lcl|NC_018276. 198 DSQPALKKSPLSNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSK 277 (546) Q Consensus 198 ~skP~Ik~Spis~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~ 277 (546) .|. + .++-..+||+|+.+--.- +---+++-|+++.... ...+ .++ T Consensus 241 ~sd--~--e~v~~liDa~d~~~S~~~---~~~~~~~~~~l~i~g~--~~~~---------~~~----------------- 285 (474) T protein:vir:94 241 IGD--A--EKVIHLIDAYDLTMSDAS---SEISQTRLAYLVLRGM--GMSE---------EMI----------------- 285 (474) T ss_pred CCc--h--HHHHHHHHHHHHHHHHHH---HHHHHhhcchhhhccC--CCCc---------hhh----------------- Confidence 443 2 345555677777654322 2223456676653210 0000 000 Q ss_pred cCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCCCCccc---chhhhh Q lcl|NC_018276. 278 CPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFGGDISM---NKAFNK 354 (546) Q Consensus 278 Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~~d~q~---~ka~ne 354 (546) +..+.+|.++..+ +.+| +++++.+.. .+-..+.++++.+.|+...-+ ++.+. +-+.+- T Consensus 286 ------~~~~~~~~i~~~~------~~~~----~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~--p~~~~~~~~~n~Sg 346 (474) T protein:vir:94 286 ------QETQKSGAFELFD------KDMD----VKYLTKDVN-DTMIENHLDRIEKNIMRFAKS--VNFNSDEFNGNVPI 346 (474) T ss_pred ------hhhhhcceeEecC------CCCc----eeEEeccCC-HHHHHHHHHHHHHHHHHHhCC--cccccccccccchH Confidence 0112233333311 1122 455554442 244556678888888875433 22221 123344 Q ss_pred hhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHH---hc-c---cccccccccccccccccCHHHHHHHHHHHHh-hcc Q lcl|NC_018276. 355 DQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRL---RY-G---ELYVNNTIHYGSDFYLHNVEELTAQYEAAKK-AGA 426 (546) Q Consensus 355 ~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~L---ry-g---~~~~~~tv~yGt~fyl~t~EeLte~~~~ak~-~ga 426 (546) .-+++.+-.+..+.....+-|+++-+-+...++++ .. | ..+..-++ .|-...|.-+++...++++ .|+ T Consensus 347 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~----~f~~~~p~d~~e~a~~~~kl~g~ 422 (474) T protein:vir:94 347 IGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIF----KFTRNIPVNKLEESQVLINLKGQ 422 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceE----EeCCCCCCCHHHHHHHHHHHhcc Confidence 44566666666676777777777777666666654 11 1 12222333 3444455555555554443 342 Q ss_pred cHHHHHHHHHHHHhhhhcCCH-HHHHHHHHHhh--cCCCccccHHHHHHHHhcCCCchhh Q lcl|NC_018276. 427 TDYQLDVIQDQIIETENRNNP-NAMERAQVLKH--LEPYRHQTRKEVLEMLEAGFGDMEL 483 (546) Q Consensus 427 S~~ei~~iq~qi~etEyrNdP-~qmqR~~iL~~--lEPf~~lT~~EvveL~e~~~~~~El 483 (546) =+-|-..-+. -+-.|| .+|+|+..=+. ....+++...+ .-+++-.++-. T Consensus 423 iS~et~~~~l-----~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~---~~~~~~~~~s~ 474 (474) T protein:vir:94 423 VSERTRLGQS-----QLVDDVDYELDEMEKESLEFNDKLPDIDEGD---ANDKSQNNQSE 474 (474) T ss_pred CchHHHHHhC-----CCCCCHHHHHHHHHHHHHHHHhhcccccCCC---cCCCCccccCC Confidence 2211111110 112355 44666643221 11111110000 00000111111 No 13 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=77.52 E-value=0.12 Score=25.56 Aligned_cols=408 Identities=10% Similarity=0.049 Sum_probs=170.8 Q ss_pred CCccc-chhHHHHHHHH----HHHhccchh-hHHHHH---HhhcC-------------------CCccchH----HHHHH Q lcl|NC_018276. 1 MNAAD-ARAVVNDFLQW----VKQLLPKDK-YNVFLQ---LFKFP-------------------VSTNELT----EEIFN 48 (546) Q Consensus 1 ~~a~d-~~~~~~~fls~----vk~~l~kdk-y~~f~~---~f~fp-------------------v~tn~lt----~~if~ 48 (546) ++..+ +...++.|+.. .+++....+ |..... ..+=| -+.|-|+ ..|-+ T Consensus 10 ~~~~~~~~e~i~~~i~~~~~~~~r~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd 89 (474) T protein:vir:10 10 IEAQGILPKHIEALIESHKDDRERMVNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDVSVNNKLNNSFDSEIVD 89 (474) T ss_pred ccccCCCHHHHHHHHHHhhhhhHHHHHHHHHHhhhcchhhhhcchhhhhhhhhhhcccccccccCcccccccchHHHHHH Confidence 22222 12233444333 333322111 111111 00000 0111121 12211 Q ss_pred HHHHHhcccCccccccccC-ccchhhHHHHHHhhcchhhhHH--HHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecc Q lcl|NC_018276. 49 ALEKVYDGKDAYEDYNFVN-PDYLQDWNDYRSSVLNAYHFWR--YEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMP 125 (546) Q Consensus 49 ~lskvfd~~n~~~~yqf~~-~e~~~d~~~y~s~vLn~~~fw~--~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~ 125 (546) ...-.+=|+ |. .|..-. .+.-+....++..+++.-+|+. .+..+.--.-=.+.++|-. .+.+.|.+-+++ T Consensus 90 ~~~~yl~g~-pv-~~~~~~~~~~~e~~~~~l~~~~~~n~~~~~~~~~~~~~~~~G~a~~~~~~-----d~~~~~~~~~i~ 162 (474) T protein:vir:10 90 TRVGYLHGV-PV-TYDLDENAEKNEKLKKFITNFAIRNSVDDEDSEIGKMAAICGYGARLAYI-----DTNGDIRIKNID 162 (474) T ss_pred hHhhheecc-ce-eEeeCCCCcchHHHHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe-----CCCCeeEEEEEc Confidence 111111121 11 122211 1223333444444444444443 2333333333345555543 345567787887 Q ss_pred hhhhhhhhcccccceEEEeee----cCCCc----EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccc Q lcl|NC_018276. 126 ITAVKDFKTNASGAIEWIMLP----QGDNQ----LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLM 197 (546) Q Consensus 126 i~~v~a~r~n~~~~I~fi~~~----~dqn~----i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~ 197 (546) -..+..+.-+..+.+.++-+. ..++. +-+-+++.+.+|..++.+.... .....|.||.+|---|.....+ T Consensus 163 p~~~~~v~d~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~--~~~~~~~~g~vPvv~~~n~~~g 240 (474) T protein:vir:10 163 PYNVIFVGDNILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQE--VGRYEHLFDYNPLFGVPNNKEM 240 (474) T ss_pred ccceEEEEcCCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccc--cccccCCCCccceEEecCCCCC Confidence 777777664444455555441 22332 3345777777777665443322 2445699999998877666554 Q ss_pred cccccceecchhHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccc Q lcl|NC_018276. 198 DSQPALKKSPLSNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSK 277 (546) Q Consensus 198 ~skP~Ik~Spis~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~ 277 (546) .|. + .++-..+||+|+.+--.- +---+++-|+++.... ...+ .++ T Consensus 241 ~sd--~--e~v~~liDa~d~~~S~~~---~~~~~~~~~~l~i~g~--~~~~---------~~~----------------- 285 (474) T protein:vir:10 241 IGD--A--EKVIHLIDAYDLTMSDAS---SEISQTRLAYLVLRGM--GMSE---------EMI----------------- 285 (474) T ss_pred CCc--h--HHHHHHHHHHHHHHHHHH---HHHHHhhcchhhhccC--CCCc---------hhh----------------- Confidence 443 2 345555677777654322 2223456676653210 0000 000 Q ss_pred cCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCCCCccc---chhhhh Q lcl|NC_018276. 278 CPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFGGDISM---NKAFNK 354 (546) Q Consensus 278 Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~~d~q~---~ka~ne 354 (546) +..+.+|.++..+ +.+| +++++.+.. .+-..+.++++.+.|+...-+ ++.+. +-+.+- T Consensus 286 ------~~~~~~~~i~~~~------~~~~----~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~--p~~~~~~~~~n~Sg 346 (474) T protein:vir:10 286 ------QETQKSGAFELFD------KDMD----VKYLTKDVN-DTMIENHLDRIEKNIMRFAKS--VNFNSDEFNGNVPI 346 (474) T ss_pred ------hhhhhcceeEecC------CCCc----eeEEeccCC-HHHHHHHHHHHHHHHHHHhCC--cccccccccccchH Confidence 0112233333311 1122 455554442 244556678888888875433 22221 123344 Q ss_pred hhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHH---hc-c---cccccccccccccccccCHHHHHHHHHHHHh-hcc Q lcl|NC_018276. 355 DQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRL---RY-G---ELYVNNTIHYGSDFYLHNVEELTAQYEAAKK-AGA 426 (546) Q Consensus 355 ~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~L---ry-g---~~~~~~tv~yGt~fyl~t~EeLte~~~~ak~-~ga 426 (546) .-+++.+-.+..+.....+-|+++-+-+...++++ .. | ..+..-++ .|-...|.-+++...++++ .|+ T Consensus 347 ~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~~~i~~----~f~~~~p~d~~e~a~~~~kl~g~ 422 (474) T protein:vir:10 347 IGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSYLNLIF----KFTRNIPVNKLEESQVLINLKGQ 422 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceE----EeCCCCCCCHHHHHHHHHHHhcc Confidence 44566666666676777777777777666666654 11 1 12222333 3444455555555554443 342 Q ss_pred cHHHHHHHHHHHHhhhhcCCH-HHHHHHHHHhh--cCCCccccHHHHHHHHhcCCCchhh Q lcl|NC_018276. 427 TDYQLDVIQDQIIETENRNNP-NAMERAQVLKH--LEPYRHQTRKEVLEMLEAGFGDMEL 483 (546) Q Consensus 427 S~~ei~~iq~qi~etEyrNdP-~qmqR~~iL~~--lEPf~~lT~~EvveL~e~~~~~~El 483 (546) =+-|-..-+. -+-.|| .+|+|+..=+. ....+++...+ .-+++-.++-. T Consensus 423 iS~et~~~~l-----~~v~d~~~E~eri~~E~~e~~~~~~~~~~~~---~~~~~~~~~s~ 474 (474) T protein:vir:10 423 VSERTRLGQS-----QLVDDVDYELDEMEKESLEFNDKLPDIDEGD---ANDKSQNNQSE 474 (474) T ss_pred CchHHHHHhC-----CCCCCHHHHHHHHHHHHHHHHhhcccccCCC---cCCCCccccCC Confidence 2211111110 112355 44666643221 11111110000 00000111111 No 14 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=58.18 E-value=0.42 Score=22.66 Aligned_cols=403 Identities=11% Similarity=0.122 Sum_probs=164.3 Q ss_pred CCcccchhHHHHHHHHHHH-hccc-hhhHHHHHH----hhcCC----Ccc----chHHHHHHHHHHHhcccCcccccccc Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQ-LLPK-DKYNVFLQL----FKFPV----STN----ELTEEIFNALEKVYDGKDAYEDYNFV 66 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~-~l~k-dky~~f~~~----f~fpv----~tn----~lt~~if~~lskvfd~~n~~~~yqf~ 66 (546) +... .+.+|++.-+. ..|+ +|...+..- .+-|- +.+ .+...|-+...--+-|+.. .|. T Consensus 25 ~~~~----~i~~~i~~~~~~~~~~~~~l~~Yy~g~~~i~~~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~----~~~ 96 (470) T protein:vir:99 25 LTSN----ELLGFIAYNETVLKPRYRENMKLYLGKHKILTAPEKETGADNRIVVNSAKYVVDVYNGYFCGIEP----KLA 96 (470) T ss_pred cCHH----HHHHHHHHHHHhhHHHHHHHHHHhccccccccCcccccCCcceeecchHHHHHHHHhhhhccCCe----eEe Confidence 2222 23333332211 1121 122221110 00000 011 1222222222221112211 122 Q ss_pred CccchhhHHHHHHhhcchhhhHHHHHHHH---------HhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhcccc Q lcl|NC_018276. 67 NPDYLQDWNDYRSSVLNAYHFWRYEAFKQ---------FKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNAS 137 (546) Q Consensus 67 ~~e~~~d~~~y~s~vLn~~~fw~~e~fk~---------~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~~ 137 (546) .+++-+... . +..+|....|.. ...- .+-++|-. .+.+.|.+-+++-..+..+.-+.. T Consensus 97 ~~~d~~~~~-~------l~~~~~~n~~~~~~~~~~~~~~~~G-~~~~~v~~-----d~dg~~~i~~~~p~~~~~i~d~~~ 163 (470) T protein:vir:99 97 LLNDSSKID-E------IARWNRQENFFDTINEISKQCDIFG-RSIASIYQ-----GEDARPHLMYSSPNHAFIIYDDTV 163 (470) T ss_pred eCCchhHHH-H------HHHHHHhcCHhHHHHHHHHHHHhcC-eeEEEEEe-----CCCCeEEEEEEccceeEEEEcCCC Confidence 233221111 1 122333322221 1111 23334422 234567777777777766654433 Q ss_pred c--ceEEEeee--cCCCc----E-EEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecch Q lcl|NC_018276. 138 G--AIEWIMLP--QGDNQ----L-AVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPL 208 (546) Q Consensus 138 ~--~I~fi~~~--~dqn~----i-~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spi 208 (546) + -+.++-+. .+.+. + ++.++..|.....+... ..+......|.+|.+|---|.-+..+.|. + +++ T Consensus 164 ~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~g~vPvv~~~n~~~g~sd--~--e~v 237 (470) T protein:vir:99 164 QRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEE--DTNAAGYAINPYGLVPAVEFFENEERQGI--F--DSI 237 (470) T ss_pred CcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEeccccc--ccccccccccCCCccceEeecCCCCCCcc--h--HhH Confidence 2 24445442 22222 2 34455555444333222 22222345599999998877666555443 3 355 Q ss_pred hHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccC Q lcl|NC_018276. 209 SNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAG 288 (546) Q Consensus 209 s~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~g 288 (546) -..+||+|+.+... ..---+++-|+.+... ++...- .+|++ ..-+. T Consensus 238 ~~liDa~~~~~s~~---~~~~~~~~~~~~~i~g--~~~~~~---------------------~~g~~--------~~~~~ 283 (470) T protein:vir:99 238 KTLINALDKVISQK---ANQVEYFDNAYMYMIG--FKLPED---------------------DEGNP--------KFDFK 283 (470) T ss_pred HHHHHHHHHHHHHH---HHHHHHhcCceeeeec--CCcccc---------------------cccch--------hhhhh Confidence 55567777765432 2222355677776432 111100 01111 01122 Q ss_pred ccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeec--CCCCcccchhhhhhhhhhhhhHHHH Q lcl|NC_018276. 289 VGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVG--FGGDISMNKAFNKDQVKANTESRRN 366 (546) Q Consensus 289 ags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G--f~~d~q~~ka~ne~qV~s~~es~r~ 366 (546) ..-++.+|.. .+...+| |++++.+. ..+-..+.++++...|+...-+ +..+ +.+-+.+-.-+++.+..+-. T Consensus 284 ~~~~~~~~~~-~~~~~~~----~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~-~~~~n~Sg~Ai~~~~~~l~~ 356 (470) T protein:vir:99 284 NNRVLYVSQL-DPDTNPQ----IGFIAKPD-ADQMQENLIQHLTDFIFMMAMVPNIQDK-NFAGNSSGVALQYKLFAMKN 356 (470) T ss_pred hcceeeecCC-CCCCCCc----ceEEeecC-ChHHHHHHHHHHHHHHHHHhCCcccccc-ccccCchHHHHHHHHHHHHH Confidence 2334445443 2333344 55665442 2233334567777777655322 1111 11112344456666777777 Q ss_pred HHHHHhhcHHHHHHHHHHHHHHHhc--cc---ccccccccccccccccCHHHHHHHHHHHHh-hcccHHHHHHHHHHHHh Q lcl|NC_018276. 367 VLLSVKKNFERVQAWVEDTVCRLRY--GE---LYVNNTIHYGSDFYLHNVEELTAQYEAAKK-AGATDYQLDVIQDQIIE 440 (546) Q Consensus 367 ~l~~lkknfer~qkfV~dti~~Lry--g~---~~~~~tv~yGt~fyl~t~EeLte~~~~ak~-~gaS~~ei~~iq~qi~e 440 (546) +..+..+-|+++-+-+...++.+.- |. .+..-+|. |-...|.-+++...++.+ +|+=+.|-. |++ + T Consensus 357 k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~i~v~----f~~~~p~~~~e~a~~~~kl~giis~et~-l~~-l-- 428 (470) T protein:vir:99 357 KADSKERKFDKSLMQLYRIVLATLFNNKQDQELWSELDFK----FTRNLPEDMASAIDNAKNAEGIVSKKTQ-LGM-I-- 428 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccccceEE----eCCCCCcCHHHHHHHHHHHhccCCHHHH-HHh-C-- Confidence 7777788888888888888776631 11 22333443 444444444444443332 343222211 111 1 Q ss_pred hhhcCCHH-HHHHHHHHhhcCCCccccHHHHH--HHHhcCCCchhh Q lcl|NC_018276. 441 TENRNNPN-AMERAQVLKHLEPYRHQTRKEVL--EMLEAGFGDMEL 483 (546) Q Consensus 441 tEyrNdP~-qmqR~~iL~~lEPf~~lT~~Evv--eL~e~~~~~~El 483 (546) --=||. +|+|++.=+. .-...+.+.+. ...+..-.++|. T Consensus 429 --~~vd~~~E~eri~~E~~--~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 429 --PDIEPDAEMKQIAKEKA--DAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred --CCCCHHHHHHHHHHHHH--HHHHHHHhhcCCCCcCCCCCCccCC Confidence 111443 4666543321 11111111111 111222334444 No 15 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=55.84 E-value=0.47 Score=22.38 Aligned_cols=403 Identities=13% Similarity=0.034 Sum_probs=170.3 Q ss_pred CCcccchhHHHHHHHHHHHhccc-hhhHHHHHHh----hcCC--Cc---cchHHHHHHHHHHHhcccCccccccccCc-- Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPK-DKYNVFLQLF----KFPV--ST---NELTEEIFNALEKVYDGKDAYEDYNFVNP-- 68 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~k-dky~~f~~~f----~fpv--~t---n~lt~~if~~lskvfd~~n~~~~yqf~~~-- 68 (546) -|..-+...+..++++-..-+++ +|+.++..-- .=|. ++ +.++ .+-..++-|.. -+|-|.+| T Consensus 13 ~d~~~~~~~l~~~i~~~~~~~~r~~~~~~yy~g~~~i~~~~~~~~~~~~~ki~---~n~~~~ivd~~---~~~l~g~~~~ 86 (453) T protein:vir:39 13 KDEPITNEVVTKFMEKHRLEVARYEYLKNMYRGIMAIDAEPTKDLWKPDNRLT---VNFTKYIVDTF---TGYFNGIPVK 86 (453) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCchhcCCCccccCccceee---cchHHHHHHHH---hhhhcccCce Confidence 22223333444444333322221 2222232211 1111 11 1221 11111111111 11222222 Q ss_pred ---cchhhHHHHHHhhcchhhhH--HHHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhcccc--cceE Q lcl|NC_018276. 69 ---DYLQDWNDYRSSVLNAYHFW--RYEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNAS--GAIE 141 (546) Q Consensus 69 ---e~~~d~~~y~s~vLn~~~fw--~~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~~--~~I~ 141 (546) ++- +-...+..+++.-.|. -.++.+.--.-=-+...|-. .+.+.|-+-+++-.++..+--+.. .-+. T Consensus 87 ~~~~d~-~~~~~l~~i~~~N~~~~~~~~~~~~~~~~G~~~~~v~~-----d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~ 160 (453) T protein:vir:39 87 KSHSDK-ETLSKLQEFDNLNDMEDEESELAKMACIYGRAFELLYQ-----NEETQTNVIYNTPENMFMVYDDTIKQEPLF 160 (453) T ss_pred eccCCh-HHHHHHHHHHHhcChhHHHHHHHHHHhhcCeEEEEEEe-----cCCCceEEEEEcccceEEEecCCCCCeEEE Confidence 111 1112233222222222 22222222221224445543 345677777777777776654322 2355 Q ss_pred EEeeecCCCc---EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHH Q lcl|NC_018276. 142 WIMLPQGDNQ---LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWL 218 (546) Q Consensus 142 fi~~~~dqn~---i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~l 218 (546) ++-+..+++. +-+-++.+...|..++.+. .++ ....|.+|.||---|.-+..+.|. + .++-...||+|+. T Consensus 161 ~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~-~~~--~~~~~~~g~vPvv~~~n~~~g~sd--~--e~v~~liDa~~~~ 233 (453) T protein:vir:39 161 AVRYGYDDDYKLYGEVYTKETTYALNGTMGFY-NMT--EQAPNPFDDLPVVEFYFNEERMSI--F--ESVISLVNAFNKA 233 (453) T ss_pred EEEEEEeCCeEEEEEEEeCCeEEEEEecCCce-eee--cccccCCCceeEEEecCCCCCCcc--h--hhhHHHHHHHHHH Confidence 5656444443 3344555555565554432 222 445699999999888766555443 3 3556666888887 Q ss_pred HHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceeeeeCC Q lcl|NC_018276. 219 LFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLP 298 (546) Q Consensus 219 l~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP 298 (546) +.-.-.- --|++-|+++.+.- ..+.. .++ ..++ +.++.++-. T Consensus 234 ~s~~~~~---~~~~~~p~~~~~g~--~~~~~---------~~~-----------------------~~~~-~~~~~~~~~ 275 (453) T protein:vir:39 234 ISEKAND---VDYFSDQYLTFLGA--AVEEE---------DLK-----------------------NIRS-NRVINYYGE 275 (453) T ss_pred HHHHHHH---HHHhhCceeeeecC--CCCch---------hhh-----------------------hhhh-cceeeecCC Confidence 6543221 22556676654321 11000 000 0011 123333322 Q ss_pred ccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCCCCcccch--hhhhhhhhhhhhHHHHHHHHHhhcHH Q lcl|NC_018276. 299 SRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFGGDISMNK--AFNKDQVKANTESRRNVLLSVKKNFE 376 (546) Q Consensus 299 ~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~~d~q~~k--a~ne~qV~s~~es~r~~l~~lkknfe 376 (546) ..+...+| |++++.+. ..+...+.++++.+.||.-+-. ++++.+. ..+-.-+++.+.++..+..+..+-|+ T Consensus 276 ~~~~~~~~----~~~lt~~~-~~~~~~~~~~~l~~~I~~~s~~--p~~~~~~~gn~Sg~Al~~~~~~l~~ka~~~~~~~~ 348 (453) T protein:vir:39 276 SSEAKNVD----VKFLEKPD-SDSQTENLLDRLTKLIFQTTMV--ANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQ 348 (453) T ss_pred CCCCCCCc----eeEEeecC-CHHHHHHHHHHHHHHHHHHhCC--cccccccccCChHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222333 45555443 2355566777888877664422 1111111 12333456666666677777777788 Q ss_pred HHHHHHHHHHHHH--hcccc--cccccccccccccccCHHHHHHHHHHH-HhhcccHHHHHHHHHHHHhhhhcCCH-HHH Q lcl|NC_018276. 377 RVQAWVEDTVCRL--RYGEL--YVNNTIHYGSDFYLHNVEELTAQYEAA-KKAGATDYQLDVIQDQIIETENRNNP-NAM 450 (546) Q Consensus 377 r~qkfV~dti~~L--ryg~~--~~~~tv~yGt~fyl~t~EeLte~~~~a-k~~gaS~~ei~~iq~qi~etEyrNdP-~qm 450 (546) ++.+-+...++.+ +.|.. +...+|.+ ....|.-++++..++ |.+|+=+.|-..-+. -+-.|| .+| T Consensus 349 ~~l~~~~~li~~~~~~~~~~~~~~~i~v~f----~~~~p~~~~~~a~~~~kl~g~is~et~l~~l-----~~v~D~~~E~ 419 (453) T protein:vir:39 349 SSLNSRYKLYCELSTNVSNKEAWKDIEYTF----TRNEPKDIKEQAETANILMGITSQETALSVI-----SVIPDVQAEM 419 (453) T ss_pred HHHHHHHHHHHHHHhccCCccccccceEEe----CCCCCcCHHHHHHHHHHHhccCChHHHHHhC-----CCCCCHHHHH Confidence 8777777777665 23433 24455554 344444444444333 334532222211111 112233 456 Q ss_pred HHHHHHhhcCCCccccHHHHHHHHh--------cCCCchh Q lcl|NC_018276. 451 ERAQVLKHLEPYRHQTRKEVLEMLE--------AGFGDME 482 (546) Q Consensus 451 qR~~iL~~lEPf~~lT~~EvveL~e--------~~~~~~E 482 (546) +|++.=++ + ..+..-.... .+=.++| T Consensus 420 ~ri~~E~~--~----~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 420 EKIKKEEA--S----TAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred HHHHHHHH--H----HHHHHHhccCCCCCCCCCCCCcCCC Confidence 66553222 0 0000000000 1233444 No 16 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=55.39 E-value=0.48 Score=22.33 Aligned_cols=406 Identities=13% Similarity=0.034 Sum_probs=176.4 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHHHhhc-----CCC---------------------ccchHHHHHHHHHHHh Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQLFKF-----PVS---------------------TNELTEEIFNALEKVY 54 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~~f~f-----pv~---------------------tn~lt~~if~~lskvf 54 (546) -++-+....+..|.++-.+-++ +...+.+| ++. .+.+..-|-+...-.+ T Consensus 23 ~~~~~~~~~i~~~i~~~~~~~~-----~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl 97 (474) T protein:vir:96 23 PKVETQEEMIIRLINNHKQKLK-----DINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYV 97 (474) T ss_pred ccccchHHHHHHHHHHHHHHHH-----HHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhh Confidence 3444444455555444333222 11122211 110 0112222222221112 Q ss_pred cccCccccccccCccchhhHHHHHHhhcchhhhHH--HHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhh Q lcl|NC_018276. 55 DGKDAYEDYNFVNPDYLQDWNDYRSSVLNAYHFWR--YEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDF 132 (546) Q Consensus 55 d~~n~~~~yqf~~~e~~~d~~~y~s~vLn~~~fw~--~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~ 132 (546) =|+.+. |.=-+.+..+.|..++. ..|.. .+..+.--.-=.+-..|-. .+.+.|-+-+++-..+..+ T Consensus 98 ~g~p~~--~~~~~~~~~~~l~~~~~-----n~~~~~~~~l~~~~~~~G~~~~~~~~-----d~~~~~~i~~~~p~~~~~v 165 (474) T protein:vir:96 98 AGKPVT--YAHDDDKVLDVIHQVLD-----TRWDNKLIDILTAASNKGIDWLQVYI-----NEDGELKLFRVPAEQAIPI 165 (474) T ss_pred cccCce--eccCChHHHHHHHHHHh-----ccHHHHHHHHHHHHhhCCeEEEEeee-----CCCCceEEEEEcccceEEE Confidence 122211 11112222333333332 12322 1222222222223344433 2345677777777777766 Q ss_pred hccc--ccceEEEeeecCCC--cEEEEecCcceeeccCCccceee-------ehhhhhhhhccccceeeEeccccccccc Q lcl|NC_018276. 133 KTNA--SGAIEWIMLPQGDN--QLAVIDDEHYSIYQLDEKGEISA-------EPLTQSAHDLGYCPATMFWGDPLMDSQP 201 (546) Q Consensus 133 r~n~--~~~I~fi~~~~dqn--~i~~IDde~y~~y~~d~e~~~si-------e~l~dn~h~lGy~PAr~~~~D~i~~skP 201 (546) --+. ..-++|+-+.+.++ .+-+-++.....|-..+.+...- .......|.+|.||..-|+.+..+.|- T Consensus 166 ~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d- 244 (474) T protein:vir:96 166 WTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSD- 244 (474) T ss_pred EcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCc- Confidence 4332 23366665533333 34444555444444333221111 111223499999999999877665543 Q ss_pred cceecchhHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCc Q lcl|NC_018276. 202 ALKKSPLSNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVC 281 (546) Q Consensus 202 ~Ik~Spis~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c 281 (546) + .++-..+||+|+.+--.- .---|++-|+.+... ++-++ .+ ++..+ T Consensus 245 -~--e~v~~liDa~d~~~S~~~---~~~~~~~~p~lv~~g--~~~~~--~~-----~~~~~------------------- 290 (474) T protein:vir:96 245 -I--WMYKSFVDAIDKRLSDVQ---NMFDESVELIYILRG--YEGED--LS-----EFMEG------------------- 290 (474) T ss_pred -h--HHHHHHHHHHHHHHHHHH---HHHHHhhcchhhhcC--CCccc--cc-----chhhh------------------- Confidence 2 355556677776554221 111345567666432 11111 01 11110 Q ss_pred ccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCC-CCcccchhhhhhhhhhh Q lcl|NC_018276. 282 ADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFG-GDISMNKAFNKDQVKAN 360 (546) Q Consensus 282 ~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~-~d~q~~ka~ne~qV~s~ 360 (546) +..+.++.+ |+ ..| |++++.+.. .+...+.++++.+.|+...-+.. .+.+.+.+.+..-++.- T Consensus 291 -----~~~~~~i~~--~~----~~~----~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~ 354 (474) T protein:vir:96 291 -----LKYYKAINV--SS----DGG----VETIQVEVP-VASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFL 354 (474) T ss_pred -----hhccceeec--cC----CCc----eeEEeccCC-HHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHH Confidence 011112222 21 112 344433321 23344567777777776553321 11112223445556666 Q ss_pred hhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccccccccccccccccccCHHHHHHHHHHHHhhcccHHHHHHHHHHHHh Q lcl|NC_018276. 361 TESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYGSDFYLHNVEELTAQYEAAKKAGATDYQLDVIQDQIIE 440 (546) Q Consensus 361 ~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yGt~fyl~t~EeLte~~~~ak~~gaS~~ei~~iq~qi~e 440 (546) +-.+..+.....+-|+++-.-+...++++. |-.+=...| .-.|....|.-++++...|+++|+-+-|-..-+. T Consensus 355 ~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~-g~~~d~~~i--~i~f~~~~p~~~~e~a~~~~~~giiS~et~~~~l---- 427 (474) T protein:vir:96 355 YTNLNLKANKLKNKANVALQELMQFILDFN-KIKLDAKEI--EITFNFNVMVNDLEQSQIGAQSQYLSKETLVRHH---- 427 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCccccee--eEEecCCCccCHHHHHHHHHHcCCCChHHHHHhC---- Confidence 777777777788888888888888887752 433322222 2457777888889999999888876544322111 Q ss_pred hhhcCCHH-HHHHHHHH-----hhcCCCccccHHHHHHHHhcCCCchhheeee Q lcl|NC_018276. 441 TENRNNPN-AMERAQVL-----KHLEPYRHQTRKEVLEMLEAGFGDMELIAVK 487 (546) Q Consensus 441 tEyrNdP~-qmqR~~iL-----~~lEPf~~lT~~EvveL~e~~~~~~Ell~vk 487 (546) -+=.||. +|+|++.= +.+....+ .++-...+.+ +.|.=--| T Consensus 428 -p~v~D~~~E~eri~~E~~~~~~~~~~~~~---~~~~~~~~~~--~~~~~e~~ 474 (474) T protein:vir:96 428 -PWVDDPKAELERLDEEQLELNKQLPNLDD---GGADGAQQQQ--QSENNQSK 474 (474) T ss_pred -CCCCCHHHHHHHHHHHHHHHHhhcccccc---ccCCCCCCcC--CCCccccC Confidence 1223553 46665322 12211111 1111111100 00000011 No 17 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=55.39 E-value=0.48 Score=22.33 Aligned_cols=406 Identities=13% Similarity=0.034 Sum_probs=176.4 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHHHhhc-----CCC---------------------ccchHHHHHHHHHHHh Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQLFKF-----PVS---------------------TNELTEEIFNALEKVY 54 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~~f~f-----pv~---------------------tn~lt~~if~~lskvf 54 (546) -++-+....+..|.++-.+-++ +...+.+| ++. .+.+..-|-+...-.+ T Consensus 23 ~~~~~~~~~i~~~i~~~~~~~~-----~~~~l~~Yy~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl 97 (474) T protein:vir:95 23 PKVETQEEMIIRLINNHKQKLK-----DINVGQKYYDKDNDINYQAYKQDLHGNIDYTKPDWRITTNFHQNLVDQKVSYV 97 (474) T ss_pred ccccchHHHHHHHHHHHHHHHH-----HHHHHHHHhcccCccccccchhhhcccccccccccccccchHHHHHHhhhhhh Confidence 3444444455555444333222 11122211 110 0112222222221112 Q ss_pred cccCccccccccCccchhhHHHHHHhhcchhhhHH--HHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhh Q lcl|NC_018276. 55 DGKDAYEDYNFVNPDYLQDWNDYRSSVLNAYHFWR--YEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDF 132 (546) Q Consensus 55 d~~n~~~~yqf~~~e~~~d~~~y~s~vLn~~~fw~--~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~ 132 (546) =|+.+. |.=-+.+..+.|..++. ..|.. .+..+.--.-=.+-..|-. .+.+.|-+-+++-..+..+ T Consensus 98 ~g~p~~--~~~~~~~~~~~l~~~~~-----n~~~~~~~~l~~~~~~~G~~~~~~~~-----d~~~~~~i~~~~p~~~~~v 165 (474) T protein:vir:95 98 AGKPVT--YAHDDDKVLDVIHQVLD-----TRWDNKLIDILTAASNKGIDWLQVYI-----NEDGELKLFRVPAEQAIPI 165 (474) T ss_pred cccCce--eccCChHHHHHHHHHHh-----ccHHHHHHHHHHHHhhCCeEEEEeee-----CCCCceEEEEEcccceEEE Confidence 122211 11112222333333332 12322 1222222222223344433 2345677777777777766 Q ss_pred hccc--ccceEEEeeecCCC--cEEEEecCcceeeccCCccceee-------ehhhhhhhhccccceeeEeccccccccc Q lcl|NC_018276. 133 KTNA--SGAIEWIMLPQGDN--QLAVIDDEHYSIYQLDEKGEISA-------EPLTQSAHDLGYCPATMFWGDPLMDSQP 201 (546) Q Consensus 133 r~n~--~~~I~fi~~~~dqn--~i~~IDde~y~~y~~d~e~~~si-------e~l~dn~h~lGy~PAr~~~~D~i~~skP 201 (546) --+. ..-++|+-+.+.++ .+-+-++.....|-..+.+...- .......|.+|.||..-|+.+..+.|- T Consensus 166 ~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d- 244 (474) T protein:vir:95 166 WTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSD- 244 (474) T ss_pred EcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCc- Confidence 4332 23366665533333 34444555444444333221111 111223499999999999877665543 Q ss_pred cceecchhHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCc Q lcl|NC_018276. 202 ALKKSPLSNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVC 281 (546) Q Consensus 202 ~Ik~Spis~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c 281 (546) + .++-..+||+|+.+--.- .---|++-|+.+... ++-++ .+ ++..+ T Consensus 245 -~--e~v~~liDa~d~~~S~~~---~~~~~~~~p~lv~~g--~~~~~--~~-----~~~~~------------------- 290 (474) T protein:vir:95 245 -I--WMYKSFVDAIDKRLSDVQ---NMFDESVELIYILRG--YEGED--LS-----EFMEG------------------- 290 (474) T ss_pred -h--HHHHHHHHHHHHHHHHHH---HHHHHhhcchhhhcC--CCccc--cc-----chhhh------------------- Confidence 2 355556677776554221 111345567666432 11111 01 11110 Q ss_pred ccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCC-CCcccchhhhhhhhhhh Q lcl|NC_018276. 282 ADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFG-GDISMNKAFNKDQVKAN 360 (546) Q Consensus 282 ~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~-~d~q~~ka~ne~qV~s~ 360 (546) +..+.++.+ |+ ..| |++++.+.. .+...+.++++.+.|+...-+.. .+.+.+.+.+..-++.- T Consensus 291 -----~~~~~~i~~--~~----~~~----~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Alk~~ 354 (474) T protein:vir:95 291 -----LKYYKAINV--SS----DGG----VETIQVEVP-VASTKEYLDMMRAYIVEFGQGVDFQTDKFGSATSGIALKFL 354 (474) T ss_pred -----hhccceeec--cC----CCc----eeEEeccCC-HHHHHHHHHHHHHHHHHHhCCcCccccccccccHHHHHHHH Confidence 011112222 21 112 344433321 23344567777777776553321 11112223445556666 Q ss_pred hhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccccccccccccccccccCHHHHHHHHHHHHhhcccHHHHHHHHHHHHh Q lcl|NC_018276. 361 TESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYGSDFYLHNVEELTAQYEAAKKAGATDYQLDVIQDQIIE 440 (546) Q Consensus 361 ~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yGt~fyl~t~EeLte~~~~ak~~gaS~~ei~~iq~qi~e 440 (546) +-.+..+.....+-|+++-.-+...++++. |-.+=...| .-.|....|.-++++...|+++|+-+-|-..-+. T Consensus 355 ~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~-g~~~d~~~i--~i~f~~~~p~~~~e~a~~~~~~giiS~et~~~~l---- 427 (474) T protein:vir:95 355 YTNLNLKANKLKNKANVALQELMQFILDFN-KIKLDAKEI--EITFNFNVMVNDLEQSQIGAQSQYLSKETLVRHH---- 427 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCccccee--eEEecCCCccCHHHHHHHHHHcCCCChHHHHHhC---- Confidence 777777777788888888888888887752 433322222 2457777888889999999888876544322111 Q ss_pred hhhcCCHH-HHHHHHHH-----hhcCCCccccHHHHHHHHhcCCCchhheeee Q lcl|NC_018276. 441 TENRNNPN-AMERAQVL-----KHLEPYRHQTRKEVLEMLEAGFGDMELIAVK 487 (546) Q Consensus 441 tEyrNdP~-qmqR~~iL-----~~lEPf~~lT~~EvveL~e~~~~~~Ell~vk 487 (546) -+=.||. +|+|++.= +.+....+ .++-...+.+ +.|.=--| T Consensus 428 -p~v~D~~~E~eri~~E~~~~~~~~~~~~~---~~~~~~~~~~--~~~~~e~~ 474 (474) T protein:vir:95 428 -PWVDDPKAELERLDEEQLELNKQLPNLDD---GGADGAQQQQ--QSENNQSK 474 (474) T ss_pred -CCCCCHHHHHHHHHHHHHHHHhhcccccc---ccCCCCCCcC--CCCccccC Confidence 1223553 46665322 12211111 1111111100 00000011 No 18 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=49.01 E-value=0.65 Score=21.60 Aligned_cols=396 Identities=11% Similarity=0.081 Sum_probs=164.1 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHHHhhc------------------CC---------CccchHHHHHHHHHHH Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQLFKF------------------PV---------STNELTEEIFNALEKV 53 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~~f~f------------------pv---------~tn~lt~~if~~lskv 53 (546) |+.....+.+.+++.+-++.++ +|...-..+.= ++ +-+-++.-.+..+... T Consensus 1 ~~~e~~~~~i~~~~~~~~~~~~--~~~~~~~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~ 78 (471) T protein:vir:10 1 MEIEVIKKIISSQMVKHGKFVS--QAAEAEKYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADNRISHNWHQLLLDQ 78 (471) T ss_pred CCHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhccccccccccchhhhhcccccccccccccccccceeccchhHHHHHh Confidence 9999999999998876555443 23222222210 00 0111222222222221 Q ss_pred hcccCccccccccCc--------cchhhHHHHHHhhcchhhhHHHHHHHHHhcCCceE----EEEeehhhhcCCCCcceE Q lcl|NC_018276. 54 YDGKDAYEDYNFVNP--------DYLQDWNDYRSSVLNAYHFWRYEAFKQFKTNINGV----MVVDLPAEQTSERPAPYF 121 (546) Q Consensus 54 fd~~n~~~~yqf~~~--------e~~~d~~~y~s~vLn~~~fw~~e~fk~~k~~iN~v----~vVDm~~~Q~s~kp~py~ 121 (546) . -+|-|.+| +..+-|..+... -.+-.-.+.-+. .-+-|. +.+| .+.+.|-+ T Consensus 79 ~------~~yl~G~p~~~~~~~~~~~~~l~~~~~n---~~~~~~~~~~~~--~~~~G~~~~~v~~d------~~~g~~~~ 141 (471) T protein:vir:10 79 K------KAYALTYPPTFDVDDKKVNDMIVDVLGD---DYERISKQLCVN--AGNAGIAWLHVWKD------ASDNSFRY 141 (471) T ss_pred h------hhhhcccCceeccCChHHHHHHHHHHhc---CHHHHHHHHHHH--HhhCCeEEEEEEee------CCCCeeEE Confidence 1 12222222 223333333221 001111111111 111121 1122 12234444 Q ss_pred Eecchhhhhhhhccc-cc-ceEEEee-e---cCCCc----EEEEecCcceeeccCCcccee-----------------ee Q lcl|NC_018276. 122 YFMPITAVKDFKTNA-SG-AIEWIML-P---QGDNQ----LAVIDDEHYSIYQLDEKGEIS-----------------AE 174 (546) Q Consensus 122 y~l~i~~v~a~r~n~-~~-~I~fi~~-~---~dqn~----i~~IDde~y~~y~~d~e~~~s-----------------ie 174 (546) -.++=..++.+--.. .+ -+.++-+ . .+.+. +-+-+++..-.|...+.+..- .. T Consensus 142 ~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 221 (471) T protein:vir:10 142 ACVDSKEVIPIYSKSLDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDTMNGDRS 221 (471) T ss_pred EEEcccceEEEEcCCCCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccccccccc Confidence 444444444333211 11 2334322 1 11222 222345444445444332111 11 Q ss_pred hhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcc Q lcl|NC_018276. 175 PLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDY 254 (546) Q Consensus 175 ~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~ 254 (546) ....-.|.||.||.--|.....+.|- + +++-..+||+|+.+-. ...--.|++-|++...--++. +.+ T Consensus 222 ~~~~~~~~~g~iPvv~~~n~~~~~sd--~--e~v~~liDa~d~~~S~---~~~~~~~~~~~~lv~~g~~~~----~~~-- 288 (471) T protein:vir:10 222 SDNSFKHDFGLVPFIPFKNNEIETND--L--KPIKDLVDVYDKVFSG---FVNDTDDVQEVIFVLTNYGGQ----DKQ-- 288 (471) T ss_pred ccccccCCCCceeEEEeccCCCCCCc--h--HHHHHHHHHHHHHHHH---HHHHHHHhhCceeeeecCCcc----ccc-- Confidence 11223599999998888655444332 3 3455555777765432 222224556777754332221 000 Q ss_pred cCCceeeccCCceEeeccccccccCCcccccccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhh Q lcl|NC_018276. 255 CDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDK 334 (546) Q Consensus 255 C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~ 334 (546) .+.+ ..++ +.++.++-+. +...+| |++++.+.. .+-....++++.+. T Consensus 289 ---~~~~-----------------------~~~~-~~~i~~~~~~-~~~~~~----~~~l~~~~~-~~~~~~~~~~l~~~ 335 (471) T protein:vir:10 289 ---EFLE-----------------------DLKR-YKMIKMDNDG-MGDQSG----VTTIAIDIP-TEARNLILERTKKQ 335 (471) T ss_pred ---hhHH-----------------------Hhhc-CCeEEecCCC-CccCcc----ceEEeecCC-hHHHHHHHHHHHHH Confidence 0000 0111 1222232221 112223 455554432 24455667788888 Q ss_pred heeeeecCCCCcccch--hhhhhhhhhhhhHHHHHHHHHhhcHHHHHHHHHHHHHHHhcccccccccccccccccccCHH Q lcl|NC_018276. 335 IYTACVGFGGDISMNK--AFNKDQVKANTESRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYGSDFYLHNVE 412 (546) Q Consensus 335 i~~s~~Gf~~d~q~~k--a~ne~qV~s~~es~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yGt~fyl~t~E 412 (546) ||...-+ ++.+.++ ..+..-+++.+-.+..|.....+.|+++-+-+...++++.=...+..-+ -.|-...|. T Consensus 336 I~~~s~t--p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~d~~~i~----i~f~~~~p~ 409 (471) T protein:vir:10 336 IFISGQG--VNPETDKLGNSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLSDKLKIK----QTWTRNSIN 409 (471) T ss_pred HHHHhCC--cCCCcccccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCceeE----EEeCCCCCC Confidence 8875533 1221111 1233446666777777777778888888777777777653222222222 345555566 Q ss_pred HHHHHHHHHHh-hcccHHHHHHHHHHHHhhhhcCCH-HHHHHHHH-----HhhcCCCccccHHHHHH Q lcl|NC_018276. 413 ELTAQYEAAKK-AGATDYQLDVIQDQIIETENRNNP-NAMERAQV-----LKHLEPYRHQTRKEVLE 472 (546) Q Consensus 413 eLte~~~~ak~-~gaS~~ei~~iq~qi~etEyrNdP-~qmqR~~i-----L~~lEPf~~lT~~Evve 472 (546) -+++....|.+ +|+=+.|-. |++ .-+-.|| .+|+|++. .....++..-..++=++ T Consensus 410 n~~e~~~~~~kl~g~iS~et~-~~~----~p~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~e~~ 471 (471) T protein:vir:10 410 NDTEMAQVVSTLATITSRENV-AKS----NPIVEDWQDELRLQKAEQEGRSEKLYDMEEVEHESEVE 471 (471) T ss_pred CHHHHHHHHHHHhccCchHHH-HHh----CCCCCCHHHHHHHHHHHHHHHHhcccccCCCCCccccC Confidence 56665555443 332221111 111 1123355 45666654 22222332222222222 No 19 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=40.18 E-value=0.98 Score=20.62 Aligned_cols=407 Identities=11% Similarity=0.036 Sum_probs=165.5 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHHHhh--------cC-CCccc----hHHHHHHHHHHHhcccCccccccccC Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQLFK--------FP-VSTNE----LTEEIFNALEKVYDGKDAYEDYNFVN 67 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~~f~--------fp-v~tn~----lt~~if~~lskvfd~~n~~~~yqf~~ 67 (546) |+..+-.+.+.....++..+. |...+.+--| .+ -+.+. +...|-+...-..-|..+.+. -.+ T Consensus 17 ~~~~~i~~~i~~~~~~~~r~~---~~~~Yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~--~~d 91 (452) T protein:vir:36 17 ITVEVVTKFMEKHKLEVARYE---YLKNMYLGIMAIDDEPAKDSWKPDNRLAVNFTKYIVDTFTGYFNGIPVKKS--HSD 91 (452) T ss_pred CCHHHHHHHHHHHHHHHHHHH---HHHHHhccccccccCccccccCccceeecchHHHHHHHHhhhhcccCceee--cCC Confidence 554444444443333333322 1222222110 01 01111 222333333222223332221 122 Q ss_pred ccchhhHHHHHHhhcchhhhHH--HHHHHH-HhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhcccc--cceEE Q lcl|NC_018276. 68 PDYLQDWNDYRSSVLNAYHFWR--YEAFKQ-FKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNAS--GAIEW 142 (546) Q Consensus 68 ~e~~~d~~~y~s~vLn~~~fw~--~e~fk~-~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~~--~~I~f 142 (546) .+..+-|..+ ++.-.|.. .++.+. ...-. +-.+|=. .+.+.|-+-+++-.++..+--+.. .-+.+ T Consensus 92 ~~~~~~l~~~----~~~n~~~~~~~~~~~~~~~~G~-~~~~v~~-----d~~g~~~i~~~~p~~~~~v~d~~~~~~~~~~ 161 (452) T protein:vir:36 92 KEILTKLQEF----DNLNDMEDEESELAKMACIYGR-AFEFLYQ-----DEDTQTNVVYNSPENMFMVYDDTVKQEPLFA 161 (452) T ss_pred hhHHHHHHHH----HhhcChhHHHHHHHHHHHhcCe-EEEEEEe-----cCCCeeEEEEEcccceEEEEcCCCCCceEEE Confidence 2222333332 22222222 112221 11111 2222211 245677777777777776654432 23666 Q ss_pred EeeecCCCc---EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHHH Q lcl|NC_018276. 143 IMLPQGDNQ---LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLL 219 (546) Q Consensus 143 i~~~~dqn~---i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll 219 (546) +-+..+... +-+-++...-.|..++.+ ..+.....|.+|.||...|+....+.|. + +++-..+||+|+.+ T Consensus 162 i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~---~~~~~~~~~~~g~iPvv~~~n~~~g~sd--~--e~v~~liDa~d~~~ 234 (452) T protein:vir:36 162 VRYGVDEDKKLQGEVYTLLETIKISGENDE---ISFGEGTYNPYPDLPVVEFYFNEERMSI--F--ESVISLVNAFNKAI 234 (452) T ss_pred EEEEEecCceEEEEEEecCeEEEEEEcCCc---eEEecceeccCCcccEEEecCCCCCCcc--h--HHHHHHHHHHHHHH Confidence 655333322 333444444444433332 2222456699999999888766554443 3 35555567777765 Q ss_pred HHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceeeeeCCc Q lcl|NC_018276. 220 FFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPS 299 (546) Q Consensus 220 ~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~ 299 (546) .- ...---|++-|+.+.+. ++-+. .+.++ .+ .+.++.++.. T Consensus 235 s~---~~~~~~~~~~p~~~~~g--~~~~~---------~~~~~-----------------------~~-~~~~~~~~~~- 275 (452) T protein:vir:36 235 SE---KANDVDYFSDQYLTFLG--AAVEE---------EDLKN-----------------------IR-SNRVINYYAD- 275 (452) T ss_pred HH---HHHHHHHhcCceeEeec--CCcCc---------hhhhh-----------------------hh-hcceEEecCC- Confidence 43 22212366778777642 11000 00000 01 1223333332 Q ss_pred cccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCCCCcccch--hhhhhhhhhhhhHHHHHHHHHhhcHHH Q lcl|NC_018276. 300 RENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFGGDISMNK--AFNKDQVKANTESRRNVLLSVKKNFER 377 (546) Q Consensus 300 ~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~~d~q~~k--a~ne~qV~s~~es~r~~l~~lkknfer 377 (546) .+..++| |++++.+. ..+-..+.++++.+.|+...-+ ++++.+. ..+-.-+++.+.++..|..+..+-|++ T Consensus 276 ~~~~~~~----~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~--p~~~~~~~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~ 348 (452) T protein:vir:36 276 GEGKNVD----VKFLEKPD-SDSQTENLLDRLTKLIFQTTMV--ANISDESFGSSSGVSLAYKLQAMSNLALSFQRKFQS 348 (452) T ss_pred CCccCCc----ceeEeecC-CHHHHHHHHHHHHHHHHHHhCc--cccCcccccCCcHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222333 55555432 1233345566777777654422 1111111 223344667777777777777788888 Q ss_pred HHHHHHHHHHHHhc--cc--ccccccccccccccccCHHHHHHHHHH-HHhhcccHHHHHHHHHHHHhhhhcCCH-HHHH Q lcl|NC_018276. 378 VQAWVEDTVCRLRY--GE--LYVNNTIHYGSDFYLHNVEELTAQYEA-AKKAGATDYQLDVIQDQIIETENRNNP-NAME 451 (546) Q Consensus 378 ~qkfV~dti~~Lry--g~--~~~~~tv~yGt~fyl~t~EeLte~~~~-ak~~gaS~~ei~~iq~qi~etEyrNdP-~qmq 451 (546) +..-+...|+++.- |. .+...+|.+ ....|.-++++... +|.+|+=+.|-.. .++ -+-.|| .+|+ T Consensus 349 ~l~~~~~li~~~~~~~~~~~~~~~i~i~f----~~~~p~d~~~~a~~~~k~~g~iS~et~~--~~~---~~~~d~~~E~~ 419 (452) T protein:vir:36 349 SLNSRYKLFCELSTNVSNKDSWKDIEYTF----TRNEPKDIKEQAETANILMGITSQETAL--SVI---SVIPDVQAEME 419 (452) T ss_pred HHHHHHHHHHHHHhccCCccccccceEEe----CCCCCcCHHHHHHHHHHHhccCChHHHH--HhC---CCCCCHHHHHH Confidence 87777777766532 22 234445555 23334333333332 3345543322111 011 122243 3455 Q ss_pred HHHHHhhcCCCccccHHHHHHHHhcCCCchhheeeeeccchhheeeccc Q lcl|NC_018276. 452 RAQVLKHLEPYRHQTRKEVLEMLEAGFGDMELIAVKLNFSTFVLRFERE 500 (546) Q Consensus 452 R~~iL~~lEPf~~lT~~EvveL~e~~~~~~Ell~vk~nf~~fv~rfe~e 500 (546) |++.=+. ++. .+...+..+.+.. ..-.-.=+.| T Consensus 420 ri~~E~~---------~~~-~~~~~~~~~~~~~------~~~~~~~~~e 452 (452) T protein:vir:36 420 KIKKEEA---------STA-IFDKDKQPSEKGT------DTVVSETNEE 452 (452) T ss_pred HHHHHHH---------HHH-HHHhhccCCCCcc------cccCccccCC Confidence 5543221 111 1111111111100 0000000011 No 20 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=36.18 E-value=1.2 Score=20.17 Aligned_cols=408 Identities=9% Similarity=0.003 Sum_probs=162.0 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHHH----h-hcCC----Cccc----hHHHHHHHHHHHhcccCccccccccC Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQL----F-KFPV----STNE----LTEEIFNALEKVYDGKDAYEDYNFVN 67 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~~----f-~fpv----~tn~----lt~~if~~lskvfd~~n~~~~yqf~~ 67 (546) |...+-++.+...-.+++.+- |+..+..- . +.+. +.+. +...|-+...-.+-|..+.+ .-.+ T Consensus 17 ~~~~~i~~~i~~~~~~~~r~~---~~~~yy~g~~~i~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~--~~~d 91 (453) T protein:vir:73 17 ITDKVVNDFMKKHQEEVERYE---YLGNMYKGIMEISSQKAKDSWKPDNRLTNNFAKYIVDTFVGYFNGIPIKK--THDD 91 (453) T ss_pred CCHHHHHHHHHHHHHHHHHHH---HHHHHhccccchhcCCCCCccCccceeecchHHHHHHHhhhhhcccCcee--ecCC Confidence 444444444444333332221 11222111 0 1111 1122 22233333333333433221 1112 Q ss_pred ccchhhHHHHHHhhcchhhhHH--HHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhcccccc--eEEE Q lcl|NC_018276. 68 PDYLQDWNDYRSSVLNAYHFWR--YEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNASGA--IEWI 143 (546) Q Consensus 68 ~e~~~d~~~y~s~vLn~~~fw~--~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~~~~--I~fi 143 (546) .+..+-|..+... -+|++ .+..+.--.-=.+-++|-. .+.+.|-+-+++=..+..+.-...+. +..+ T Consensus 92 ~~~~~~l~~~~~~----n~~~~~~~~~~~~~~~~G~~~~~v~~-----d~~~~~~i~~~~p~~~~~v~dd~~~~~~~~~i 162 (453) T protein:vir:73 92 KSVLEAMQLFDNL----NDMEDEESELAKIACVYGRAYELMYQ-----NESTESEVIYCSPLNVFMVYDDSIKQKPLFAV 162 (453) T ss_pred hHHHHHHHHHHHh----cChhHHHHHHHHHHHhcCeEEEEEEe-----CCCCceEEEEEcccceEEEEeCCCCceeEEEE Confidence 2223334444332 22222 1222221111123334432 23455656555555554444333222 3444 Q ss_pred eeecCCCc---EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHHHH Q lcl|NC_018276. 144 MLPQGDNQ---LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLF 220 (546) Q Consensus 144 ~~~~dqn~---i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~ 220 (546) -|..+.+. +.+-+++....|..++.+ ........|.+|.||---|..+..+.|. | +++-...||+|+.+- T Consensus 163 ~~~~~~~~~~~~~vyt~~~i~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~~g~s~--~--~~v~~liDa~~~~~S 235 (453) T protein:vir:73 163 YYGFDEEGNLSGTVYTLLETISITGKAGE---VKFGESTYNVYSDLPIVEYNFNEERQSI--F--EPVHSLINSYNKVTS 235 (453) T ss_pred EEEEecCceEEEEEEeCCeEEEEEecCCc---eEEccceeccCCceeEEEecCCCCCCcc--h--hhHHHHHHHHHHHHH Confidence 34333332 334456555555544432 2222344599999999888766655443 4 356666688888754 Q ss_pred HhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceeeeeCCcc Q lcl|NC_018276. 221 FQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSR 300 (546) Q Consensus 221 ~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~ 300 (546) ..-- -+ .|++-|++..+. ++-. +..+++- ..+.++..+.+.. T Consensus 236 ~~~~--~~-~~~~~~~l~~~g--~~~~---------~~~~~~~------------------------~~~~~~~~~~~~~ 277 (453) T protein:vir:73 236 EKAN--DV-EYFSDQYLVFLG--AEVD---------EEDAKNI------------------------KDNRLINFFDKNS 277 (453) T ss_pred HHHH--HH-HHhccceeeeec--CCCC---------chhhhcc------------------------ccccccccccccc Confidence 3221 22 356777775431 1100 0111110 0111111111111 Q ss_pred cccccccc-CccceecccHHHHHHHHHHHHHHhhhheeeeec--CCCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHH Q lcl|NC_018276. 301 ENEGADLR-NPVQITTIDKASLDYNVEECERIYDKIYTACVG--FGGDISMNKAFNKDQVKANTESRRNVLLSVKKNFER 377 (546) Q Consensus 301 ~~n~~Dl~-~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G--f~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer 377 (546) ..+..+-. .-|++++.+.. .+-....++++.+-|+...-+ ++.+.. ...+..-+++.+..+-.+..+..+-|++ T Consensus 278 ~~~~~~~~~~d~~~l~~~~~-~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--gn~Sg~Al~~~~~~l~~ka~~~~~~~~~ 354 (453) T protein:vir:73 278 NGQGTNAAKVDVKFLDKPDS-DVQTENLLNRLERSIFQFTMAANISDENF--GNSSGVALAYKLQAMSNLALSFQRKFQS 354 (453) T ss_pred ccccccccCceeEEeeecCC-HHHHHHHHHHHHHHHHHHhCCcccCcccc--cCccHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11111111 11555554432 244455677777777664422 222211 1223344555566555566666666777 Q ss_pred HHHHHHHHHHHHh--ccc--ccccccccccccccccCHHHHHHHHHHHHhhcccHHHHHHHHHHHHhhhhcCCHH-HHHH Q lcl|NC_018276. 378 VQAWVEDTVCRLR--YGE--LYVNNTIHYGSDFYLHNVEELTAQYEAAKKAGATDYQLDVIQDQIIETENRNNPN-AMER 452 (546) Q Consensus 378 ~qkfV~dti~~Lr--yg~--~~~~~tv~yGt~fyl~t~EeLte~~~~ak~~gaS~~ei~~iq~qi~etEyrNdP~-qmqR 452 (546) +..-+...++.+. -|. .+...+|+++ +--..+..++...+ +|..|+=+.|-.. + ++ -+=.||. +++| T Consensus 355 ~l~~~~~li~~~~~~~~~~~~~~~i~v~f~-~~~p~~~~~~a~~~--~k~~giis~et~~-~-~~---~~~~d~~~E~~r 426 (453) T protein:vir:73 355 ALNRRYSLWSSLSTNASNKDAWKDIEYTFT-RNEPKDIKEQAETA--NILKGITSEETAL-S-VI---SVIPDVQAEMEK 426 (453) T ss_pred HHHHHHHHHHHHHhccCCccccccceEEeC-CCCCCCHHHHHHHH--HHHhccCcHHHHH-H-hC---CCCCCHHHHHHH Confidence 6666666666552 222 2345566652 22223444444433 3344543322211 1 11 0112443 4555 Q ss_pred HHHHhhcCCCccccHHHHHHHHhc-CCCchhheeeeecc Q lcl|NC_018276. 453 AQVLKHLEPYRHQTRKEVLEMLEA-GFGDMELIAVKLNF 490 (546) Q Consensus 453 ~~iL~~lEPf~~lT~~EvveL~e~-~~~~~Ell~vk~nf 490 (546) ++.=++ |-..+-.. ....++.. |=|+ T Consensus 427 i~~E~~----------~~~~~~~~~~~~~~~~~--~~~~ 453 (453) T protein:vir:73 427 IKKKKL----------LQLSLTRTSNLVRMKQM--RGNL 453 (453) T ss_pred HHHHHH----------HHHHHHHhccCCcchhh--hcCC Confidence 443211 11111111 22222211 1222 No 21 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=25.69 E-value=2 Score=18.90 Aligned_cols=439 Identities=9% Similarity=0.023 Sum_probs=170.3 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHHHhh-----c----C---CCc----cchHHHHHHHHHHHhcccCcccccc Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQLFK-----F----P---VST----NELTEEIFNALEKVYDGKDAYEDYN 64 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~~f~-----f----p---v~t----n~lt~~if~~lskvfd~~n~~~~yq 64 (546) |+.-+-.+.++.++.+-+.-+ +|+.++..--| = + .+. ..+...|-+...-.+-|+.+.++ T Consensus 22 l~~~~i~~li~~~~~~~~~r~--~~l~~YY~g~~~~i~~~~~~~~~~~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~-- 97 (506) T protein:vir:94 22 LTPNKIMKFITHHFNYQRPRL--EMLDDYYQGYNLKILDKQSRRHEDGKADHRATHSFAKYIADFQTSYSVGNPINVK-- 97 (506) T ss_pred CCHHHHHHHHHHHHHHHHHHH--HHHHHHhcCCCccccccccccccccCCcceeecchHHHHHHHhhhhhcccCceee-- Confidence 555545555555444322111 22222221111 0 0 011 12334444443333334332221 Q ss_pred ccCccchhhHHHHHHhhcchhhhH--HHHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhcccc-c-ce Q lcl|NC_018276. 65 FVNPDYLQDWNDYRSSVLNAYHFW--RYEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNAS-G-AI 140 (546) Q Consensus 65 f~~~e~~~d~~~y~s~vLn~~~fw--~~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~~-~-~I 140 (546) -.+.+..+-|..+... -.|. -.+..+.--.-=.+...|.. .+.+.|-+-+.+-..++.+.-+.. + -+ T Consensus 98 ~~d~~~~~~l~~~~~~----N~~~~~~~~~~~~~~~~G~a~~~v~~-----ded~~~~i~~~~p~~~~~v~dd~~~~~~~ 168 (506) T protein:vir:94 98 LPDDGSNSGFDTFNKA----NDVDAENYDLFLDMSRYGRAYEYVYR-----GEDNEEHLAKLDPLDTFVIYSTDVDPKPI 168 (506) T ss_pred cCcchHHHHHHHHHhc----cCHhHHHHHHHHHHHhcCeEEEEEEe-----cCCCeeEEEEEcccceEEEecCCCCCceE Confidence 1122222333333322 1222 22233322221223444443 234566666666666655543322 2 24 Q ss_pred EEEee----ecCCCc---EE----EEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchh Q lcl|NC_018276. 141 EWIML----PQGDNQ---LA----VIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLS 209 (546) Q Consensus 141 ~fi~~----~~dqn~---i~----~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis 209 (546) .++-+ ..+++. +. +-.++.+.+|.....+ .+......|.||.||..-|.-..-+.|- + .++- T Consensus 169 ~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~~~---~~~~~~~~~~~g~vPvv~~~n~~~~~sd--~--e~~~ 241 (506) T protein:vir:94 169 MAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTPIM---GKMQVDTTKPITTFPVVEFKNSNFRLGD--F--ENVL 241 (506) T ss_pred EEEEEEeeeeccCCceeEEEEEEEEEeCceEEEeccccCc---cceeccccccCCccceEEecCCCCCCCc--h--hhhH Confidence 44433 223333 21 2355566666544432 2333456799999999877433322221 2 2344 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCc Q lcl|NC_018276. 210 NQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGV 289 (546) Q Consensus 210 ~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~ga 289 (546) ...||+|+.+--.---.. +...|+..-+|++..=. +..+......+. ..+|..........+ .+..+. T Consensus 242 ~liDa~d~~~S~~~~~~~-~~~~~~l~~~g~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~~~~~~------~~~~~~- 309 (506) T protein:vir:94 242 PLIDLYDAAQSDTANYMT-DLNEAMLIIQGDIDTLF-EGSDMMNTIDPN---DEDAMAKLAKDKLEL------IKEMKD- 309 (506) T ss_pred HHHHHHHHHHHHHHHHHH-HhhhHHHHHhcCccccc-cchhcccccccc---ccccccccccchhHH------Hhhhhh- Confidence 445777776532211111 22334444444433221 111111111110 001111111000000 001111 Q ss_pred cceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeec--CCCCcccchhhhhhhhhhhhhHHHHH Q lcl|NC_018276. 290 GSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVG--FGGDISMNKAFNKDQVKANTESRRNV 367 (546) Q Consensus 290 gs~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G--f~~d~q~~ka~ne~qV~s~~es~r~~ 367 (546) +.++.++-. ...++..-...+++++.+. ..+-..+.++++...|+...-+ +..+... -..+-.-+++.+-++..+ T Consensus 310 ~~~~~~~~~-~~~~~~~~~~d~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~n~Sg~Aik~~~~~l~~k 386 (506) T protein:vir:94 310 ANMLLLKSG-MTVNGTQTSVDAKYINKTY-DVVGSEAYKKRVAGDIHKFSHTPDLTDENFA-SNSSGVAMQYKVLGTVEL 386 (506) T ss_pred cCeeeeccc-ccccCccccccceeeeecC-CHHHHHHHHHHHHHHHHHHhCcccccccccc-ccchHHHHHHHHHHHHHH Confidence 112222211 1111111112356666543 2334445567777887754321 2211111 122333456666667777 Q ss_pred HHHHhhcHHHHHHHHHHHHHHH---hccccc---ccccccccccccccCHHHHHHHHHHHHhh-cccHHHHHHHHHHHHh Q lcl|NC_018276. 368 LLSVKKNFERVQAWVEDTVCRL---RYGELY---VNNTIHYGSDFYLHNVEELTAQYEAAKKA-GATDYQLDVIQDQIIE 440 (546) Q Consensus 368 l~~lkknfer~qkfV~dti~~L---ryg~~~---~~~tv~yGt~fyl~t~EeLte~~~~ak~~-gaS~~ei~~iq~qi~e 440 (546) ..+..+-|+++-.-+...|+.+ ..|... ...+| .|-...|.-+++....+.+. |+=+-|-..- . T Consensus 387 ~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i----~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~-----~ 457 (506) T protein:vir:94 387 ASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQELTF----TFRDNLPADNISQIKALVQAGATLPQKYLYQ-----Q 457 (506) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccceE----EeCCCCCcCHHHHHHHHHHHhccCChHHHHH-----h Confidence 7777777888877777776665 223222 22233 35666676666666655543 4322221111 1 Q ss_pred hhhcCCH-HHHHHHHHHhh-----cCCCccccHHHHHHHHhcCCCchhheeee Q lcl|NC_018276. 441 TENRNNP-NAMERAQVLKH-----LEPYRHQTRKEVLEMLEAGFGDMELIAVK 487 (546) Q Consensus 441 tEyrNdP-~qmqR~~iL~~-----lEPf~~lT~~EvveL~e~~~~~~Ell~vk 487 (546) --+=.|| .+|+|+..=+. .+.+...+-++.. ...-+++.=-|| T Consensus 458 lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~e~~ 506 (506) T protein:vir:94 458 LPGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQT----NTTATQTDEEVR 506 (506) T ss_pred CCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCc----cccccccccCCC Confidence 1122244 34555543221 1111111111110 012222233355 No 22 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=21.10 E-value=2.7 Score=18.26 Aligned_cols=410 Identities=10% Similarity=-0.003 Sum_probs=173.0 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhH-----HHHHH-------hhcCC-Cc----cchHHHHHHHHHHHhcccCccccc Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYN-----VFLQL-------FKFPV-ST----NELTEEIFNALEKVYDGKDAYEDY 63 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~-----~f~~~-------f~fpv-~t----n~lt~~if~~lskvfd~~n~~~~y 63 (546) |++.+-++.+..+..+++.+.--.+|- +...- ...+. +. +.+...|-+.....+-|..+.+ T Consensus 1 l~~~~i~~~i~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~ki~~n~~~~Ivd~~~~yl~G~p~~~-- 78 (451) T protein:vir:10 1 MELEKIRAIISADAARRQEILQAKSYYYNKNDILKKGVVVQNRDENPLRNADNRISHNFHEILVDEKASYMFTYPVLF-- 78 (451) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhcccCccccccccccccccccccccccccccchHHHHHHhhhhheeccccee-- Confidence 888877777777666655442222221 00000 00111 11 1233333333222333333321 Q ss_pred cccCc-cchhhHHHHHHhhcchhhhHHHHHHHHHhcCCceEEEEeehh---hhcCCCCcceEEecchhhhhhhhccc--c Q lcl|NC_018276. 64 NFVNP-DYLQDWNDYRSSVLNAYHFWRYEAFKQFKTNINGVMVVDLPA---EQTSERPAPYFYFMPITAVKDFKTNA--S 137 (546) Q Consensus 64 qf~~~-e~~~d~~~y~s~vLn~~~fw~~e~fk~~k~~iN~v~vVDm~~---~Q~s~kp~py~y~l~i~~v~a~r~n~--~ 137 (546) .-... +..+-|+.|+..-+.....+ .-+.-..-=.+-.+|-+.+ -.+-..+.+-+-+++-..++.+--.. . T Consensus 79 ~~~~~~~~~~~~~~~~~n~~~~~~~~---~~~~~~~~G~a~~~~y~de~~~~~~~~~~~~~~~~i~p~~~~~vydd~~~~ 155 (451) T protein:vir:10 79 DIDNNKELNEKVTDVLGNEFTRKAKN---LAIEASNCGSAWLHYWIDEEYSGEQVTNQTFKYGVVNTEEIIPIYRNGIER 155 (451) T ss_pred ecCCcHHHHHHHHHHhccCHHHHHHH---HHHHHhhcCeEEEEEeecCCcccccccccceeEEEEcccceEEEEcCCCCC Confidence 11111 11222443332212111111 1111111112223332211 11111122234444444444332221 1 Q ss_pred cceEEEee---ecCCC---------cEEEEecCcceeeccCCcccee-eehhhhhhhhccccceeeEeccccccccccce Q lcl|NC_018276. 138 GAIEWIML---PQGDN---------QLAVIDDEHYSIYQLDEKGEIS-AEPLTQSAHDLGYCPATMFWGDPLMDSQPALK 204 (546) Q Consensus 138 ~~I~fi~~---~~dqn---------~i~~IDde~y~~y~~d~e~~~s-ie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik 204 (546) .-+.++-| ..+.+ .+-+.+++..-.|.-.+.+... .......-|.+|.||.--|+....+.|. + T Consensus 156 ~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~~~d--~- 232 (451) T protein:vir:10 156 ELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQHRFNSVPFVEFSNNIKKQSD--L- 232 (451) T ss_pred ceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCccccccccccccCCCCeeeEEEeccCCCCCCc--h- Confidence 22455543 12222 1334555555455432222111 1112233499999999888765544332 3 Q ss_pred ecchhHHHHHHHHHHHHhhhhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccc Q lcl|NC_018276. 205 KSPLSNQLAALDWLLFFQTSKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADK 284 (546) Q Consensus 205 ~Spis~~L~A~D~ll~~~ts~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k 284 (546) +++-...||+|+.+--. -.--.|++-|+....--+.. + .+++.+ T Consensus 233 -e~v~~liDa~~~~~S~~---~~~~~~~~~~~l~~~g~~~~--~-------~~~~~~----------------------- 276 (451) T protein:vir:10 233 -SKYKKILDLYDRVMSGF---ANDLEDIQQIIYILENFGGE--D-------TSEFLK----------------------- 276 (451) T ss_pred -hhHHHHHHHHHHHHHHH---HHHHHHhccceeeeecCCcc--c-------chhhHH----------------------- Confidence 45555567777654322 11123555666543211000 0 000000 Q ss_pred cccCccceeeeeCCccccccccccCccceecccHHHHHHHHHHHHHHhhhheeeeecCCCCcccch--hhhhhhhhhhhh Q lcl|NC_018276. 285 RLAGVGSFIEVPLPSRENEGADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVGFGGDISMNK--AFNKDQVKANTE 362 (546) Q Consensus 285 ~~~gags~~~~piP~~~~n~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~Gf~~d~q~~k--a~ne~qV~s~~e 362 (546) . ...+.++.++.. .+..++| |++++.+. ..+-..+.++++.+.|+...-+ ++++.+. ..+-.-+++.+- T Consensus 277 ~-~~~~~~i~~~~~-~~~~~~~----~~~l~~~~-~~~~~~~~~~~l~~~I~~~s~~--p~~~~~~~gn~Sg~Alk~~~~ 347 (451) T protein:vir:10 277 E-LKRYKTIKTETD-SEGDSGG----LKTMQIEI-PTEARKIILEILKKQIYESGQG--LQQDTENFGNASGVALKFFYR 347 (451) T ss_pred H-HhhCCeEEecCc-CCccCCc----ceEEeecC-CHHHHHHHHHHHHHHHHHHhCc--ccccccccccccHHHHHHHHH Confidence 1 122233444422 3334444 55665554 2355566788888888776432 2222211 233444677777 Q ss_pred HHHHHHHHHhhcHHHHHHHHHHHHHHHhcccccccccccccccccccCHHHHHHHHHHHHhh-cccHHHHHHHHHHHHhh Q lcl|NC_018276. 363 SRRNVLLSVKKNFERVQAWVEDTVCRLRYGELYVNNTIHYGSDFYLHNVEELTAQYEAAKKA-GATDYQLDVIQDQIIET 441 (546) Q Consensus 363 s~r~~l~~lkknfer~qkfV~dti~~Lryg~~~~~~tv~yGt~fyl~t~EeLte~~~~ak~~-gaS~~ei~~iq~qi~et 441 (546) ++..|.....+-|+++..-+...++++.=...+..-+ -.|...-|.-+++....+.+. |+=+.|-..-+. T Consensus 348 ~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~d~~~i~----i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~~----- 418 (451) T protein:vir:10 348 KLELKSGLLETEFRTSFDKLIKAILYFLGVTDYKKIQ----QTYTRNMMSNDLEDADIATKSVGIIPTKIILRHH----- 418 (451) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCcccee----EEecCCCCCCHHHHHHHHHHHhccCchHHHHHhC----- Confidence 8888888888999999999999888764222233333 356666676677666655543 432222111111 Q ss_pred hhcCCHHHHHHHHHHhhcCCCccccHHHHHHHHhc--CCCc Q lcl|NC_018276. 442 ENRNNPNAMERAQVLKHLEPYRHQTRKEVLEMLEA--GFGD 480 (546) Q Consensus 442 EyrNdP~qmqR~~iL~~lEPf~~lT~~EvveL~e~--~~~~ 480 (546) -+=.||.+..++..-.+- ++.-++... +++| T Consensus 419 p~v~d~~~e~~~~~ee~~--------~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 419 PWVDDVEEAEKLYLEEKK--------IQASKVSDDYNNFTE 451 (451) T ss_pred CCCCCHHHHHHHHHHHHH--------HHHHHHHhhcCCCCC Confidence 122356554433322211 111122121 3333 No 23 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=20.20 E-value=2.8 Score=18.12 Aligned_cols=427 Identities=9% Similarity=0.022 Sum_probs=160.7 Q ss_pred CCcccchhHHHHHHHHHHHhccchhhHHHHHHhhcCC---Ccc----chHHHHHHHHHHHhcccCccccccccCccchhh Q lcl|NC_018276. 1 MNAADARAVVNDFLQWVKQLLPKDKYNVFLQLFKFPV---STN----ELTEEIFNALEKVYDGKDAYEDYNFVNPDYLQD 73 (546) Q Consensus 1 ~~a~d~~~~~~~fls~vk~~l~kdky~~f~~~f~fpv---~tn----~lt~~if~~lskvfd~~n~~~~yqf~~~e~~~d 73 (546) ++.-..+ -..-|++.+++.--+-..+...- ..+. +.+ .+...|-+.+.-.+=|.... |...+.++.+. T Consensus 49 i~~h~~~--~~~rl~~l~~yY~g~~~~i~~~~-~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~--~~~~d~~~~~~ 123 (502) T protein:vir:48 49 INHHKLR--QAPRIQELLDYARGENHDVLKSG-RRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIR--VEYDDNEDNSQ 123 (502) T ss_pred HHHHHHH--HHHHHHHHHHHhcCCCccccccc-cccccccccceeecchHHHHHHHHhhhhcccCee--EecCCccchhH Confidence 2211100 01111222221110000000000 0000 011 22222222222222222221 11112222222 Q ss_pred HHHHHHhhcchhhh--HHHHHHHHHhcCCceEEEEeehhhhcCCCCcceEEecchhhhhhhhccc--ccceEEEee-ec- Q lcl|NC_018276. 74 WNDYRSSVLNAYHF--WRYEAFKQFKTNINGVMVVDLPAEQTSERPAPYFYFMPITAVKDFKTNA--SGAIEWIML-PQ- 147 (546) Q Consensus 74 ~~~y~s~vLn~~~f--w~~e~fk~~k~~iN~v~vVDm~~~Q~s~kp~py~y~l~i~~v~a~r~n~--~~~I~fi~~-~~- 147 (546) -..++..+++.-+| .-.++.+.--.-=.+-++|-+ .+.+.|-+-+++-.++..+--+. ..-+.++-| .. T Consensus 124 ~~~~l~~~~~~N~~~~~~~~~~~~~~~~G~a~~~v~~-----dedg~~~i~~~~p~~~~~vydd~~~~~~~~~ir~~~~~ 198 (502) T protein:vir:48 124 NDDAIKRIGRINDIDTHNRNLIRDLSQTGRAYEVIYR-----SEYDETRIKRLSPLETFVIYDNSLEDNSIAAVRYYNRG 198 (502) T ss_pred HHHHHHHHHhhcCHhHHHHHHHHHHhhcCeEEEEEEe-----CCCCceEEEEEcccceEEEEcCCCCCceEEEEEEEEEe Confidence 22222221111111 112222222211123344443 23455666666655555544332 233555554 21 Q ss_pred -CCCc---EEEEecCcceeeccCCccceeeehhhhhhhhccccceeeEeccccccccccceecchhHHHHHHHHHHHHhh Q lcl|NC_018276. 148 -GDNQ---LAVIDDEHYSIYQLDEKGEISAEPLTQSAHDLGYCPATMFWGDPLMDSQPALKKSPLSNQLAALDWLLFFQT 223 (546) Q Consensus 148 -dqn~---i~~IDde~y~~y~~d~e~~~sie~l~dn~h~lGy~PAr~~~~D~i~~skP~Ik~Spis~~L~A~D~ll~~~t 223 (546) +++. +-+.+++..-.|...+. ... ....-|.+|.||.--|+-+..+.|. + +++-..+||+|+.+.-.- T Consensus 199 ~~~~~~~~~~iyt~~~i~~~~~~~~--~~~--~~~~~~~~g~vPvv~~~nn~~g~sd--~--e~v~~liDa~d~~~S~~~ 270 (502) T protein:vir:48 199 TLQNAKDVVEIYTNQHIYTLDASDS--FNE--ISVTPHAFGTVPITEFLNNADGIGD--Y--ETELYLIDLYDSAESDTA 270 (502) T ss_pred ecCCcEEEEEEEeCCeEEEEEeCCc--eee--ccceecCCCccceEEecCCCCCCCc--h--hhhHHHHHHHHHHHHHHH Confidence 2222 32344444334433222 211 1344599999999988766555443 3 355556688888764332 Q ss_pred hhhhhhhhcCCcccccCcccCcccccCCCcccCCceeeccCCceEeeccccccccCCcccccccCccceeeeeCCccccc Q lcl|NC_018276. 224 SKKHLDLYAPYPIYSGFEQDCTYENKENGDYCDSGFIKNSHGDYYVTRTGEVSKCPVCADKRLAGVGSFIEVPLPSRENE 303 (546) Q Consensus 224 s~~hldlya~YpiYs~yeedC~ye~~~~~~~C~~G~ikn~~gd~~t~~tg~~~~Cp~c~~k~~~gags~~~~piP~~~~n 303 (546) .-++ +++-|++...... ..+.+. ++. ......++-++.|..... T Consensus 271 --~~~~-~~~~~~lv~~g~~----------~~~~~~------------~~~-----------~~~~~~~~~~~~~~~~~~ 314 (502) T protein:vir:48 271 --NHMS-DMADAILAIYGDL----------ALPQGM------------QAS-----------DMKRTRLMQLKPPKSADG 314 (502) T ss_pred --HHHH-HhcCceeeeecCc----------cccccc------------chh-----------hhhhcceeeccccccccc Confidence 1233 5567776653211 000000 000 011112333333322211 Q ss_pred cccccCccceecccHHHHHHHHHHHHHHhhhheeeeec--CCCCcccchhhhhhhhhhhhhHHHHHHHHHhhcHHHHHHH Q lcl|NC_018276. 304 GADLRNPVQITTIDKASLDYNVEECERIYDKIYTACVG--FGGDISMNKAFNKDQVKANTESRRNVLLSVKKNFERVQAW 381 (546) Q Consensus 304 ~~Dl~~pv~i~s~d~~sley~~~e~kri~d~i~~s~~G--f~~d~q~~ka~ne~qV~s~~es~r~~l~~lkknfer~qkf 381 (546) ..+ ...+++++.+.. .+-....++++...|+...-. +..+ +.+-..+-.-++..+..+..+..+..+-|+++..- T Consensus 315 ~~~-~~d~~~l~~~~~-~~~~~~~~~~L~~~I~~~s~~p~~~~~-~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~ 391 (502) T protein:vir:48 315 KEG-TVKAEYLTKSYD-VSGAEAYKTRLNKDIHVFTNTPDMSDN-HFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKR 391 (502) T ss_pred ccc-CcceeEeeecCC-HHHHHHHHHHHHHHHHHHhCCCCcCcc-ccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111 133556654432 133444567777777653211 1111 11112344445666666666666777777777777 Q ss_pred HHHHHHHHh--cc--c--ccccccccccccccccCHHHHHHHHHHH-HhhcccHHHHHHHHHHHHhhhhcCCHHH-HHHH Q lcl|NC_018276. 382 VEDTVCRLR--YG--E--LYVNNTIHYGSDFYLHNVEELTAQYEAA-KKAGATDYQLDVIQDQIIETENRNNPNA-MERA 453 (546) Q Consensus 382 V~dti~~Lr--yg--~--~~~~~tv~yGt~fyl~t~EeLte~~~~a-k~~gaS~~ei~~iq~qi~etEyrNdP~q-mqR~ 453 (546) +...++++- .| . .+...+|.+ -..-|.-++++..++ |.+|+=+-|- .|++ + -+-.||.+ |+|+ T Consensus 392 ~~~li~~~~~~~~~~~~~d~~~i~i~f----~~~~p~d~~e~a~~~~kl~g~iS~et-~l~~-l---~~v~D~~~E~~ri 462 (502) T protein:vir:48 392 RYRLAARIGSLVNEFKDFDESRLKITF----TPNLPKSLYEQVSILNDLGGQVSQET-ALSL-S---GLVENPTEELDKI 462 (502) T ss_pred HHHHHHHHHhhcccccccccccceEEe----CCCCCcCHHHHHHHHHHHhccCcHHH-HHHh-C---CCCCCHHHHHHHH Confidence 777766652 12 1 223344544 333333344444333 3345322221 1111 1 13346654 8887 Q ss_pred HHHhhcCCCccccHHHHHHHHhcCCCchhheeeeeccchhheeecccCCchhh Q lcl|NC_018276. 454 QVLKHLEPYRHQTRKEVLEMLEAGFGDMELIAVKLNFSTFVLRFERENTDIVE 506 (546) Q Consensus 454 ~iL~~lEPf~~lT~~EvveL~e~~~~~~Ell~vk~nf~~fv~rfe~en~~i~e 506 (546) ..-+.-........ +.......|. +.=..-=++|--+++| T Consensus 463 ~~E~~~~~~~~~~~-~~~~~~~~~~------------d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 463 NEESSKIDFKGYPS-YFYDNVGKYT------------DEVKETHTDDFERVYE 502 (502) T ss_pred HHHHHhhhhhcccc-cccccccccC------------CCccCCCCcCcCCCCC Confidence 65443211111110 0010100111 1111122445556666 Done!