Query lcl|NC_011222.1_cdsid_YP_002221578.1 [gene=B40-8040] [protein=MP3] [protein_id=YP_002221578.1] [location=38151..39884] Match_columns 577 No_of_seqs 5 out of 8 Neff 2.5 Searched_HMMs 1612 Date Thu Nov 7 12:46:15 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_40 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_40_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:97265 Length: 513 98.9 7.3E-09 4.5E-12 65.2 25.2 450 1-577 1-495 (513) 2 protein:vir:94956 Length: 452 98.8 1.9E-08 1.1E-11 63.0 24.7 415 1-577 1-450 (452) 3 protein:vir:96783 Length: 488 98.7 9.5E-08 5.9E-11 59.1 26.8 416 1-518 14-488 (488) 4 protein:vir:95149 Length: 501 98.7 1.1E-07 6.9E-11 58.7 23.8 440 1-548 1-501 (501) 5 protein:vir:95014 Length: 491 98.3 1.9E-06 1.1E-09 52.0 23.9 444 1-555 5-491 (491) 6 protein:vir:95806 Length: 440 97.2 0.00014 9E-08 41.6 26.2 407 46-519 1-440 (440) 7 protein:vir:96179 Length: 468 97.1 0.00018 1.1E-07 41.1 18.1 419 22-529 1-468 (468) 8 protein:vir:80453 Length: 535 96.7 0.00039 2.4E-07 39.3 26.2 450 1-546 28-535 (535) 9 protein:vir:78393 Length: 489 96.6 0.00046 2.9E-07 38.9 23.7 433 1-524 5-489 (489) 10 protein:vir:80680 Length: 441 95.5 0.002 1.2E-06 35.4 27.5 419 3-520 1-441 (441) 11 protein:vir:97336 Length: 492 95.3 0.0024 1.5E-06 35.0 23.3 434 12-519 1-492 (492) 12 protein:vir:95113 Length: 474 94.0 0.0058 3.6E-06 32.9 22.5 430 8-519 1-474 (474) 13 protein:vir:95899 Length: 474 93.3 0.0081 5E-06 32.0 19.3 428 8-520 1-474 (474) 14 protein:vir:96266 Length: 474 93.3 0.0081 5E-06 32.0 19.3 428 8-520 1-474 (474) 15 protein:vir:9871 Length: 429 # 93.2 0.0083 5.1E-06 32.0 25.1 414 3-543 1-429 (429) 16 protein:vir:733 Length: 453 # 91.5 0.016 9.8E-06 30.5 22.3 420 1-527 11-453 (453) 17 protein:vir:3964 Length: 453 # 90.6 0.02 1.2E-05 29.9 22.1 417 1-519 11-453 (453) 18 protein:vir:105461 Length: 470 90.4 0.021 1.3E-05 29.7 22.1 428 3-519 1-470 (470) 19 protein:vir:105889 Length: 474 88.3 0.033 2.1E-05 28.7 22.5 440 1-520 3-474 (474) 20 protein:vir:94101 Length: 474 88.3 0.033 2.1E-05 28.7 22.5 440 1-520 3-474 (474) 21 protein:vir:94805 Length: 492 86.8 0.043 2.7E-05 28.1 24.4 436 12-543 1-492 (492) 22 protein:vir:93747 Length: 472 85.7 0.051 3.2E-05 27.7 21.7 435 1-519 5-472 (472) 23 protein:vir:94546 Length: 506 85.2 0.055 3.4E-05 27.5 19.8 458 1-524 1-506 (506) 24 protein:vir:96494 Length: 501 84.2 0.062 3.9E-05 27.2 21.6 456 1-543 1-501 (501) 25 protein:vir:3609 Length: 452 # 84.1 0.063 3.9E-05 27.2 24.6 405 1-519 15-452 (452) 26 protein:vir:9922 Length: 489 # 84.1 0.063 3.9E-05 27.2 21.9 443 1-528 13-489 (489) 27 protein:vir:102950 Length: 471 81.6 0.084 5.2E-05 26.5 24.2 428 3-522 1-471 (471) 28 protein:vir:99522 Length: 470 79.7 0.1 6.3E-05 26.0 18.0 442 1-520 6-470 (470) 29 protein:vir:2732 Length: 501 # 78.0 0.12 7.4E-05 25.6 23.5 454 1-543 1-501 (501) 30 protein:vir:1236 Length: 483 # 76.6 0.13 8.3E-05 25.4 22.4 435 1-519 15-483 (483) 31 protein:vir:102330 Length: 451 74.0 0.16 0.0001 24.9 24.8 419 3-518 1-451 (451) 32 protein:vir:99072 Length: 479 66.4 0.27 0.00017 23.7 22.8 411 69-553 1-479 (479) 33 protein:vir:4898 Length: 502 # 65.8 0.28 0.00017 23.6 29.5 459 1-543 9-502 (502) 34 protein:vir:5961 Length: 503 # 62.7 0.33 0.0002 23.2 24.5 449 1-542 7-503 (503) 35 protein:vir:38 Length: 496 # N 62.5 0.33 0.00021 23.2 26.4 440 1-520 2-496 (496) 36 protein:vir:97447 Length: 474 61.1 0.36 0.00022 23.0 20.9 435 1-543 1-474 (474) 37 protein:vir:94498 Length: 474 61.1 0.36 0.00022 23.0 20.9 435 1-543 1-474 (474) 38 protein:vir:96839 Length: 474 54.3 0.51 0.00031 22.2 23.5 432 8-540 1-474 (474) 39 protein:vir:105292 Length: 478 37.5 1.1 0.00069 20.3 23.8 438 1-543 8-478 (478) 40 protein:vir:99781 Length: 511 26.5 1.9 0.0012 19.0 20.1 438 1-520 31-511 (511) 41 protein:vir:78907 Length: 518 25.8 2 0.0012 18.9 27.7 446 1-539 1-518 (518) 42 protein:vir:106571 Length: 499 24.3 2.2 0.0014 18.7 22.6 436 1-523 1-499 (499) No 1 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=98.93 E-value=7.3e-09 Score=65.18 Aligned_cols=450 Identities=12% Similarity=0.092 Sum_probs=250.6 Q ss_pred CC-cchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcc------hhHHHHHHHhhcc Q lcl|NC_011222. 1 MG-KSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAK------DKYDIFLSMFHFP 73 (577) Q Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~k------dky~~f~~~f~fp 73 (577) |- ++.+.+. .+||+=. ++..|=++|+ -++.-.+ .|+ .+=++.||| +.|..+++.--|+ T Consensus 1 m~~~~~~~v~--~~h~~y~----a~~~~W~~ir---d~~~G~~---~~r---~~g~~YLPk~~~E~~~~Y~~rl~rA~~~ 65 (513) T protein:vir:97 1 MADKDPKSPA--TTSGAYD----QMLPRWHVIE---TLLGGTE---AMR---EAGETYLPRHQEETDKGYQERLASAVLL 65 (513) T ss_pred CCCCCCCCCC--cCCHHHH----HHHHHHHHHH---HHhcChH---HHH---hhcccCCCCCCCCCHHHHHHHHhcccCC Confidence 21 2222221 3455422 2222222222 2221111 111 122234443 4588887665554 Q ss_pred CCCccchHHHHHHHHHHhccCCccccccccChhhhhh-h-hHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccC- Q lcl|NC_011222. 74 VKTNGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDD-W-EYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQV- 150 (577) Q Consensus 74 v~t~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d-~-~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~- 150 (577) - .|...-+.|.-..=.+++.. +...|..+.+ | ++==.+++++.+|.+.-+=.++..- -+.++||||.... T Consensus 66 n----~~~~tl~~l~G~vf~k~p~~--~~~~p~~~~~~l~~d~D~~G~~L~~f~~~~~~~~l~~G-~~~ilVD~P~~~~~ 138 (513) T protein:vir:97 66 N----MVEQTLDTLSGKPFSEPIKL--NEDVPKAIEETILPDVDLQGNNLDVFARQWFREGMAKA-LCHVLIDMPRPAPR 138 (513) T ss_pred C----hHHHHHHHHhhhhhhcCccc--CcCchHHHHHHHhhccCCCCCCHHHHHHHHHHHHHhcC-eEEEEEecCCCCCc Confidence 2 23333333332222245533 2334444432 2 2222488899999888777777655 5667899986432 Q ss_pred -CCc-----------cccchhhhhHHHHHHHHhhccccc---ceeeeee-----ecC------CCEEEEEeccceeeecc Q lcl|NC_011222. 151 -GEK-----------PEPYFFWLPIANVLSYRTCGKDCN---LMAYIMY-----VTD------ENKIVYIDEERYVRFDK 204 (577) Q Consensus 151 -~~r-----------pqpyf~~~pie~V~~y~~~~~~~~---~i~~i~~-----~qD------~N~i~~IDd~~y~~y~k 204 (577) ++. .-||+...+-++|+.+++ ..++ -..++-+ ..| ..++++.+...|.+|.+ T Consensus 139 ~~~~~~T~Ade~~~~~rPy~~~~~~e~IinW~~--~~v~G~~~L~~v~l~E~~~~~Dgf~~~~~~q~rvL~~g~~~v~r~ 216 (513) T protein:vir:97 139 EDGQPRTLADDRREGLRPYWVMIKPECLLFARS--EVINGVEVLQHVRIIEHYMEQDGFAEVCKRRIRVLEPGLVQLWEP 216 (513) T ss_pred cchhHHhHHHHHhhccCceEEEecHhhhcCcce--eccCcceeeeeEEEEEEEeecCCCcceEEEEEEEEeCceEEEEEe Confidence 222 349999999999999986 2222 2333322 122 23478889999998876 Q ss_pred CCcc-----ceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHH-HHHHHhhcCCceeeccc Q lcl|NC_011222. 205 TREN-----DLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTA-KKHLDLYASYPIYSGYE 278 (577) Q Consensus 205 n~k~-----ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts-~~hldlyA~YPiYs~y~ 278 (577) .+.+ +|++....-|-||++|..++ |-. .+.+.+-.+||- .|+-+.--|+.+.| .+|.--.++.|.-+.+- T Consensus 217 ~~~~~~~~~e~~~~~~g~~~l~~IP~v~~-~~~--~~~~~~~~pPLl-~LA~ln~~hy~~~Sd~~~il~~~~~P~l~~~G 292 (513) T protein:vir:97 217 VKKSNAQKEEWALADEWATGLNYVPLVTF-YAD--RQGFMMGKPPLL-DLAHLNVAHWQSASDQRHILTVSRFPILACSG 292 (513) T ss_pred ecCCCccccceEEecCCCCcCCceeEEEE-ecC--CCCCCCCccchH-HHHHHHHHHHhhhhhHHHHHHhcccceeeeec Confidence 5433 67777777799999999554 533 344558889987 56666666666655 45666677888877641 Q ss_pred ccCCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHH Q lcl|NC_011222. 279 RDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSL 358 (577) Q Consensus 279 ~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sl 358 (577) - ... + ++.-.+|+++.+.+|- .+.+ .+++.|+-+++ T Consensus 293 ~--------~~~-------------~--------------~~~i~iG~~~~~~lpe-----~~~~----~~yie~~g~~i 328 (513) T protein:vir:97 293 A--------SGE-------------D--------------SDPVVVGPNKVLYNPD-----PAGR----FYYVEHTGQAI 328 (513) T ss_pred C--------CcC-------------C--------------CCceEeeccccccCCC-----CCCc----ceeeccCchhH Confidence 0 000 0 0113377777776664 2333 68999999999 Q ss_pred HHHhhHHHHHHHHHHhhhhcCCCccc-cchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccc Q lcl|NC_011222. 359 EYNVNEEKRLRDELVRSVTGGEGELN-RSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNI 437 (577) Q Consensus 359 eY~~~~~kri~d~i~~s~~Gf~~d~q-~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti 437 (577) .=.+++++++.+.|... |-. .+. ..-+.+.++...++.+..+.|..+-.+++.+-+-+++.++.. +|.--=+.+| T Consensus 329 ~~~~~~l~~le~qm~~~--Ga~-ll~~~~~~~Ta~a~~~~~~~~~S~L~~~a~~le~al~~~l~~~a~w-lg~~~~~~~v 404 (513) T protein:vir:97 329 AAGRTDLKDLEEQMAGY--GAE-FLKRKTGGQTATARALDSAEATSDLSAMTGLFEDALAQALDITADW-LRLGPNGGTV 404 (513) T ss_pred HHHHHHHHHHHHHHHHH--HHH-hhccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCCCCccEE Confidence 99999999999999443 311 111 122456666677788888999999999999999999988876 4522225678 Q ss_pred cCCcccccCCH-HHHHHHHHHHHHcC-CCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcCccHHHHHHHHhcCC Q lcl|NC_011222. 438 NYGTEFYIYTP-EELSERYKIMKETG-ASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSHLTREEAVNLYKENV 515 (577) Q Consensus 438 ~yGskFy~~t~-eeL~~~i~~Ak~~G-as~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~ 515 (577) .+-..|...+. .+.+..+..|...| +|...+- .+|+|+.||. ++++.+++.+....- T Consensus 405 ~in~dF~~~~~~~~~~~al~~a~~~G~is~~t~~---------------~~L~r~gvl~-----~d~d~~~~~e~~~~~- 463 (513) T protein:vir:97 405 ELVKDYDLEEMDAPGLQALQVAREKRDISRKTYL---------------NGLRLRGVLP-----EDFDEDEDWEELMEE- 463 (513) T ss_pred EeccccCcccCCHHHHHHHHHHHhCCCCCHHHHH---------------HHHHhccCCC-----ccCCHHHHHHHHHHh- Confidence 88899999885 45667777777777 5543331 2566666665 355544443331110 Q ss_pred CchhheeeeecchhhhhhhhccCCchhhhhhccChhhhHHHHHHHHHHhhcccccCccccCC Q lcl|NC_011222. 516 ISEEDLRVKLNLPTFVRRFERENMNIIEFGSALDYKKKIEIIINTLKKYANGLQNGSVRPTE 577 (577) Q Consensus 516 ~~eEdl~vk~n~~~fv~rfe~en~~i~efg~~l~~~~ki~ii~n~~~~y~n~~~~~~~~~~~ 577 (577) .+...+ +.|.-++--.. |+ |-|-+.-.+-++| T Consensus 464 ------------------~~~~~~---~~~~d~~~~~~-----~~----~~~~~~~~~~~~~ 495 (513) T protein:vir:97 464 ------------------ISEAMG---RAGLDLDPAQK-----NP----PEGGEGEGEGEGE 495 (513) T ss_pred ------------------hhhccC---CCCccccccCC-----CC----CCCCCCCCCCCCC Confidence 000000 00000000000 00 1111100011222 No 2 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=98.85 E-value=1.9e-08 Score=62.97 Aligned_cols=415 Identities=16% Similarity=0.147 Sum_probs=235.1 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhc------chhHHHHHHHhhccC Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIA------KDKYDIFLSMFHFPV 74 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~------kdky~~f~~~f~fpv 74 (577) |.-+ .+||+=. .+.++=++++ -++.-.+ .++. .=++.|| ++.|..+++.--|+ T Consensus 1 m~V~-------~~hp~y~----a~~~~W~~~r---d~~~G~~---~~r~---~g~~YLpk~~~E~~~~Y~~rl~rA~~~- 59 (452) T protein:vir:94 1 MPIE-------TKHPEYL----AYENDWIDCR---VASLGQR---EVKK---KGVRFLPKLSGQTDDMYNAYKQRALFY- 59 (452) T ss_pred CCCC-------CcCHHHH----HHHHHHHHHH---HHhcChH---HHHc---CCcccCCCCCCCCHHHHHHHHhhccCC- Confidence 6643 4677532 2333333332 1221111 1110 0111233 35688888765554 Q ss_pred CCccchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCcc Q lcl|NC_011222. 75 KTNGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKP 154 (577) Q Consensus 75 ~t~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rp 154 (577) ..|...-+.|.-.+=.+++. +..|+.+..++. -.++.++.+|.+.-+=+++..-. +-++||+|+- + - T Consensus 60 ---n~~~~t~~~~~G~vf~k~p~----~~~p~~l~~~~~-D~~G~~L~~~~~~~~~~~l~~G~-~~ilVD~p~~--g--~ 126 (452) T protein:vir:94 60 ---SITSKTLSALSGMVLDQPPV----ITHPDAMSKYFE-DQSGIQFYEVFTRAVEETLLMGR-VGVFIDRPLT--G--G 126 (452) T ss_pred ---chHHHHHHHHhchhhcCCce----ecccHHHHHHHh-cccCCCHHHHHHHHHHHHHhcCe-EEEEEeeccC--C--C Confidence 23333334333332234553 356777777753 35899999999888878777665 5667799864 2 3 Q ss_pred ccchhhhhHHHHHHHHhhcccccceeeeee-----ecCCC---------EEEE--Eeccceee--eccCCccceeee--- Q lcl|NC_011222. 155 EPYFFWLPIANVLSYRTCGKDCNLMAYIMY-----VTDEN---------KIVY--IDEERYVR--FDKTRENDLILE--- 213 (577) Q Consensus 155 qpyf~~~pie~V~~y~~~~~~~~~i~~i~~-----~qD~N---------~i~~--IDd~~y~~--y~kn~k~ei~~e--- 213 (577) .||+..-+-++++.+++ +.+|...++-+ ..|.. ++++ |++..|+. |...+++.|... T Consensus 127 rPy~~~~~~~~Ii~W~~--~~~g~l~~v~lre~~~~~d~~d~f~~~~~~~yRvL~l~~g~~~v~~~~~~~~~~~~~~~~~ 204 (452) T protein:vir:94 127 DPYISVYTTENILNWEE--DEDGRLLMVVLREFYTVRDTADRYVQNIRVRYRCLELVDGLLQITVHETQDGKVWELAKTS 204 (452) T ss_pred ceEEEEechhhhcCccc--cccCCeeEEEEEEEEEEecCCCcccceeEEEEEEEEEeCCeEEEEEEEccCCceeeeccce Confidence 58998889999999998 44576666655 22332 3443 66766654 666665556532 Q ss_pred -hhhh-hhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHH-HHHHHhhcCCceeecccccCCccCCCCCc Q lcl|NC_011222. 214 -VDNM-HDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTA-KKHLDLYASYPIYSGYERDCHYESHDGKE 290 (577) Q Consensus 214 -~~~~-H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts-~~hldlyA~YPiYs~y~~DC~~~~~~G~~ 290 (577) ++.- |-|||+|.. ++|-. .+.+.+-++||-+ |+-+--=|+.+.| ..|.--.++.|.-+...- + T Consensus 205 ~~~~~~~~l~~IP~v-~~~~~--~~~~~~~~pPLl~-LA~ln~~hy~~~sd~~~~l~~~~~P~l~~~g~----------~ 270 (452) T protein:vir:94 205 TIQNVGVTMDYIPFF-CITPS--GLSMTPAKPPMID-IVDINYSHYRTSADLEHGRHFTGLPTPWITGA----------E 270 (452) T ss_pred eecCCCcccceeEEE-EEcCC--CCCCCCCccchHH-HHHHHHHHhcchhHHHHHHHHcccceeEeecC----------c Confidence 3333 889999994 44532 3345588899874 4444444555555 566666778887776421 1 Q ss_pred cccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHH Q lcl|NC_011222. 291 RCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRD 370 (577) Q Consensus 291 ~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d 370 (577) + +++-..|+++.+.+|-| +.+ .+++.|+-..+.=++++++++.+ T Consensus 271 ---~------------------------~~~i~iG~~~~~~lpe~-----~~~----~~yie~~g~~i~~~~~~l~~le~ 314 (452) T protein:vir:94 271 ---S------------------------QSTMHIGSTKAWVIPEV-----AAK----VGFLEFTGQGLQSLEKALSEKQA 314 (452) T ss_pred ---C------------------------CCceEecccccccCCCC-----CCc----ceEEccCchhHHHHHHHHHHHHH Confidence 0 01235788888776632 233 68999999999999999999999 Q ss_pred HHHhhhhcCCCccccchhh--cccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCC- Q lcl|NC_011222. 371 ELVRSVTGGEGELNRSEAI--NEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYT- 447 (577) Q Consensus 371 ~i~~s~~Gf~~d~q~~kA~--ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t- 447 (577) .|...= ...+....+. +..++..++.+..+.|..+..+++++-.-+++++|+.. |.. .+..|.+..+|.+.. T Consensus 315 ~m~~~G---a~ll~~~~~~~~s~ea~~~~~~~~~s~L~~~a~~~e~al~~~l~~~a~w~-g~~-~~~~v~~n~dF~~~~~ 389 (452) T protein:vir:94 315 QLASLS---ARLIDNSTRGSEATETVKLRYMSETASLKSVTRAVEALLNKAYSCIMDME-SMG-GTLNIKLNSAFLDSKL 389 (452) T ss_pred HHHHHH---HHhhccCCCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc-CCC-CceEEEeccccccccC Confidence 996532 0122222222 22234456666678999999999999888999777754 543 456888899997654 Q ss_pred -HHHHHHHHHHHHHcC-CCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeee Q lcl|NC_011222. 448 -PEELSERYKIMKETG-ASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKL 525 (577) Q Consensus 448 -~eeL~~~i~~Ak~~G-as~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~ 525 (577) ++++. .+..|...| +|...+- .+|+|..|| || T Consensus 390 ~~~~~~-al~~~~~~G~is~~t~~---------------~~L~~~gvl---~~--------------------------- 423 (452) T protein:vir:94 390 TAAELK-AWVEAYLSGGISKEIYI---------------HALKVGKVL---PP--------------------------- 423 (452) T ss_pred CHHHHH-HHHHHHhcCCCcHHHHH---------------HHHHhCCCC---CC--------------------------- Confidence 44433 333455555 3332211 133333332 11 Q ss_pred cchhhhhhhhccCCchhhhhhccChhhhHHHHHHHHHHhhcccccCccccCC Q lcl|NC_011222. 526 NLPTFVRRFERENMNIIEFGSALDYKKKIEIIINTLKKYANGLQNGSVRPTE 577 (577) Q Consensus 526 n~~~fv~rfe~en~~i~efg~~l~~~~ki~ii~n~~~~y~n~~~~~~~~~~~ 577 (577) +...+.|+..+..=|-..+|+-.+|.. T Consensus 424 -------------------------~~e~~~i~~E~~~~~~~~~~~~~~~~~ 450 (452) T protein:vir:94 424 -------------------------PGESMGVIPDPPAPEPSPSNTPPNPSS 450 (452) T ss_pred -------------------------ccCHHHHHHHhhccCcccCCCCCCCcc Confidence 011122223333222222333222222 No 3 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=98.70 E-value=9.5e-08 Score=59.08 Aligned_cols=416 Identities=12% Similarity=0.074 Sum_probs=229.9 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcc-------------------- Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAK-------------------- 60 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~k-------------------- 60 (577) |.-+ .+||+= .++.++=++++ +.-.++ .-..=++.||| T Consensus 14 m~V~-------~~hp~y----~a~~~~W~~~~-----d~g~~~------~k~~g~~YLPk~~~~~~~~~~d~~y~~~~~~ 71 (488) T protein:vir:96 14 MLTP-------IYHPDY----LVNAPQWLRNL-----DCVMDN------IKRKKQTYLPNLGAIPPEAKTDPKVTALAAK 71 (488) T ss_pred eccc-------ccCHHH----HHHhhhhhHhh-----hhhhHH------HHHhhhhcCCCCCCccccccCcchhhhhhcc Confidence 4422 456653 33444445543 122221 11122356775 Q ss_pred ---hhHHHHHHHhhccCCCccchHHHHHHHHHHhccCCccccccccCh----hhhhhhhHHHhhccchhHHHHHHHHHHH Q lcl|NC_011222. 61 ---DKYDIFLSMFHFPVKTNGVTSEIFDKLSRVFDGRNPVYNYQFKSS----EDRDDWEYYRKDVLKEPSVWSTDGWDNF 133 (577) Q Consensus 61 ---dky~~f~~~f~fpv~t~~lt~~iF~~L~kV~dgqd~~~~y~f~~~----e~~~d~~~y~se~ln~~~fw~~~~fk~~ 133 (577) ++|++.++.--|+ ..|....+.|.-..=.+++..+ ...+ +.++|.. .++.++.+|.+.-+=.++ T Consensus 72 ~~~~y~~~~~~rA~~~----n~~~~tl~~l~G~vfrk~p~~~--~~~~~~l~~l~~d~D---~~G~~L~~f~~~~~~~~l 142 (488) T protein:vir:96 72 IEKDWEDLTWRLANYV----NIVNPTMNAITGAVMRREPEFD--TMDNPVLIGLRDNID---GKGNGIDQECKQALNALQ 142 (488) T ss_pred chhhhHhhhhhccccC----chhHHHHHHhcchhhccCceec--cCCcHHHHHHHhccC---CCCCCHHHHHHHHHHHHH Confidence 1222222222222 2344444444443333555443 3322 2333333 388899888887766666 Q ss_pred HhCCCeEEEEeeccccCC------CccccchhhhhHHHHHHHHhhcccccc---eeeeee-----ecCCC------E--E Q lcl|NC_011222. 134 KHRINSVLVVDMPEVQVG------EKPEPYFFWLPIANVLSYRTCGKDCNL---MAYIMY-----VTDEN------K--I 191 (577) Q Consensus 134 ~~~~NgviVVDm~~i~~~------~rpqpyf~~~pie~V~~y~~~~~~~~~---i~~i~~-----~qD~N------~--i 191 (577) .. =-+-++||||.-.+. ..-.||+...+-++|+.+++ ..++. ..++-+ .+|+. + + T Consensus 143 ~~-G~~~ilVD~P~~~~T~ade~~~~~rPy~~~~~a~~IinW~~--~~v~G~~~L~~v~lrE~~~~~D~~~~~~~~~~~~ 219 (488) T protein:vir:96 143 WG-SRCGWLVRSHPESATMADWNKGKKLPTAAFYDALHIIDWEV--EYIDGEEKLTYLSLLEDYQERDGGTYVSKQRLIN 219 (488) T ss_pred hc-CeEEEEEecCCCcCCHHHHHHhcCCcEEEEechhhhcCcce--eccCCceeeEEEEEEEEEEeccCCCcccceEEEE Confidence 54 445678899942211 23359999999999999987 32232 444433 23432 1 2 Q ss_pred EEEeccceeeeccCCc---cceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHH-HHHHh Q lcl|NC_011222. 192 VYIDEERYVRFDKTRE---NDLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAK-KHLDL 267 (577) Q Consensus 192 ~~IDd~~y~~y~kn~k---~ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~-~hldl 267 (577) ....+-.|..+-+.++ .+|+.+...-|-||++|..++ |- . .+.+.+-++||- .|+-+---|+.++|. +|+=- T Consensus 220 ~~l~~g~~~v~~~~~~~~~~e~~~~~~g~~~l~~IP~v~~-~~-~-~~~~~~~~pPLl-dLA~lnl~Hy~~ssd~~~il~ 295 (488) T protein:vir:96 220 HRLVDGLCEFQEVTDDEYSDEWTPVLINSKQSDTIPFFLA-SS-Q-SNEWCIDSTPLT-SLAEISLSIYVMNAYSNKAMI 295 (488) T ss_pred EEEECcEEEEEEEecCCcccceEeecCCCcccCeeEEEEE-ec-C-CCCCCCCCCchH-HHHHHHHHHHhhhhHHHHHHH Confidence 2244555555544221 256666556699999999555 42 2 345568888887 455555556665554 33322 Q ss_pred hcCCceeecccccCCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCc Q lcl|NC_011222. 268 YASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNP 347 (577) Q Consensus 268 yA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~p 347 (577) .+..|+.-.. +.|.+. + .-+.+ .+.-.++|+..-.+.| +.| T Consensus 296 ~~~~p~lv~~--------~~~~~~---~-----~~~~~--------------~~~g~~~~~~~~~~~~-----~g~---- 336 (488) T protein:vir:96 296 LANEAKWMVD--------MGDMNK---T-----MASEM--------------NPLGFTLAGRMPYYVK-----NGD---- 336 (488) T ss_pred hcCCceeeec--------cCCCCc---c-----ccccc--------------ccceeeeccccccccc-----CCc---- Confidence 4555543221 111110 0 00000 0112344554444444 134 Q ss_pred ceeecccHHHHHHHhhHHHHHHHHHHhhhhcCCCccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh Q lcl|NC_011222. 348 ITMLSADTGSLEYNVNEEKRLRDELVRSVTGGEGELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR 427 (577) Q Consensus 348 v~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR 427 (577) .+++-+..+.+ .+++.+++.+.|.. .|. ..++.+.+.+.+++..++..-.+.|..+-.+++++-+-+++.+|+.= T Consensus 337 ~~~~e~~~~~l--~~~~l~~l~~qm~~--~Ga-~l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~~~A~w~ 411 (488) T protein:vir:96 337 VKVIQAQFSPE--TENKVEKLFEQAVK--VGA-SLFTQQSNETATGAAIRSGSSTASMATLGNNVEDTVRNMLRFIMRYF 411 (488) T ss_pred eeecCCchhHH--HHHHHHHHHHHHHH--HhH-hhccCCCcchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHc Confidence 45666777666 47778888888854 231 23344445677788888888899999999999999999999888752 Q ss_pred cC----cccccccccCCcccccCC--HHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcC Q lcl|NC_011222. 428 YG----DSFVSCNINYGTEFYIYT--PEELSERYKIMKETGASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSH 501 (577) Q Consensus 428 yg----~~~~~~ti~yGskFy~~t--~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~ 501 (577) =. ..-.+..|....+|...+ ++++.+-++...+-.+|...+- .+|+|+.||. ++ T Consensus 412 g~~~~~~~~~~~~~~in~dF~~~~ld~~~~~al~~~~~~G~Is~~t~~---------------~~L~~~gvl~-----~d 471 (488) T protein:vir:96 412 EGTNLYVNPDELVFKLNRDYFDVEVNPQMLQVAYAAMMEGNLPQVSWF---------------ELLKRARVVR-----GD 471 (488) T ss_pred CCCCCCcCccceEEEeccCCCCccCCHHHHHHHHHHHhcCCCCHHHHH---------------HHHHhCCcCC-----cc Confidence 11 112244677888888865 6655555555544446765543 2567766655 57 Q ss_pred ccHHHHHHHHhcCCCch Q lcl|NC_011222. 502 LTREEAVNLYKENVISE 518 (577) Q Consensus 502 LT~~Ev~~l~e~g~~~e 518 (577) .|-+++.+-.+.+-+++ T Consensus 472 ~~~e~~~~~ie~~g~~~ 488 (488) T protein:vir:96 472 MSKEEFDEHIAELGFGM 488 (488) T ss_pred CCHHHHHHHHhhcCCCC Confidence 78888888877766666 No 4 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=98.67 E-value=1.1e-07 Score=58.72 Aligned_cols=440 Identities=14% Similarity=0.139 Sum_probs=231.5 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcc-----------hhHHHHHHH Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAK-----------DKYDIFLSM 69 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~k-----------dky~~f~~~ 69 (577) |- .| -.+||+ .+.+..|=+.|+ .++.-. ..|+. .=++.||| ..|..+++- T Consensus 1 m~----~V--~~~hp~----y~~~~~~W~~ir---d~~~G~---~~~r~---~g~~YLP~~~~e~~~~e~~~~Y~~rl~r 61 (501) T protein:vir:95 1 MP----NV--SFIRPE----LGKLLPLYYLIR---DAIAGE---PTVKG---ARTTYLPMPNAEDQSKENKARYEAYLKR 61 (501) T ss_pred CC----CC--CCCCHH----HHHHHHHHHHHH---HHhcCh---HHHHh---cccccCcCCCCCCCcccchHHHHHHhhc Confidence 32 01 156776 233333333332 222211 11211 11345665 458888887 Q ss_pred hhccCCCccchHHHHHH-HHHHhccCCccccccccChhhhhhhh-HHHhhccchhHHHHHHHHHHHHhCCCeEEEEeecc Q lcl|NC_011222. 70 FHFPVKTNGVTSEIFDK-LSRVFDGRNPVYNYQFKSSEDRDDWE-YYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPE 147 (577) Q Consensus 70 f~fpv~t~~lt~~iF~~-L~kV~dgqd~~~~y~f~~~e~~~d~~-~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~ 147 (577) --|+- .|..+-+. +.+|| .+++..+ -|.-++.|. +==-++.++.+|.+.-+=..+..--=|| +||+|. T Consensus 62 A~~~n----~~~~t~~~l~G~vf-~k~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~~~i-lVD~P~ 131 (501) T protein:vir:95 62 AVFYN----VARRTLFGLVGQVF-MRDPVVK----VPALLNPLVANATGSGINLTQLAKRAVSLNLAYSRAGL-LVDYPT 131 (501) T ss_pred cccCc----hHHHHHHHHhhhhh-cCCccee----CcHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCeEEE-EEeecC Confidence 55442 23333333 33444 3555442 455555532 2113788999998887777766655554 569996 Q ss_pred ccC----------CCccccchhhhhHHHHHHHHhhcc-cccceeeeee----ecCCC--------EEEEEe--cc---ce Q lcl|NC_011222. 148 VQV----------GEKPEPYFFWLPIANVLSYRTCGK-DCNLMAYIMY----VTDEN--------KIVYID--EE---RY 199 (577) Q Consensus 148 i~~----------~~rpqpyf~~~pie~V~~y~~~~~-~~~~i~~i~~----~qD~N--------~i~~ID--d~---~y 199 (577) ... ..+.-||+...+-++++.+++.-- -.+.+.++-+ ..+++ +++++. +. .+ T Consensus 132 ~~~~~~~t~a~~~~~~~rPy~~~~~~~~IinW~~~~v~g~~~l~~v~l~E~~~~~d~~f~~~~~~q~RvL~~~~~g~~~~ 211 (501) T protein:vir:95 132 TEAEGGASIADLEAGRIRPTLYVYSPTEIINWRTTDRGAEEVLSLVVLFETWCAADDGFEMKTSGQFRVLRLDEEGYYVH 211 (501) T ss_pred CCCcccccHHHHHhccCCcEEEEecHhhhcCcceeccCCceeeeEEEEEEEEeecCCCcccceeEEEEEEeeCCCceEEE Confidence 522 123359999999999999986211 1123443333 22222 244543 32 35 Q ss_pred eeeccCCcc----------------ceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHH-H Q lcl|NC_011222. 200 VRFDKTREN----------------DLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTA-K 262 (577) Q Consensus 200 ~~y~kn~k~----------------ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts-~ 262 (577) ++|-.++.. +|......-|-|||+|+.++ |- ..+.+.+-++||-+ |+-+--=||.++| . T Consensus 212 ~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~-~~--~~~~~~~~~pPLl~-lA~lni~hy~~ssd~ 287 (501) T protein:vir:95 212 EIWREPQPTKADGSKIPKGNYQQYVVYKPTDAQGKRLTEIPFMFI-GS--ENNDSNPDNPNFYD-LASLNMAHYRNSADY 287 (501) T ss_pred EEEEecCCcccCcceecCCcccccceeeeeccCCCcCCeeeEEEE-ec--CCCCCCCCccchHH-HHHHHHHHHhhhhHH Confidence 667665432 34444455599999998544 33 24445577788763 3333334555544 5 Q ss_pred HHHHhhcCCceeecccccCCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCC Q lcl|NC_011222. 263 KHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVP 342 (577) Q Consensus 263 ~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~ 342 (577) .|.--.++.|.-+.+.-+= +|-..... +.-..|+++.+ ++| .++ T Consensus 288 ~~~l~~~~~P~l~i~G~~~---------------------~~~~~~~~---------~~i~~G~~~~~--~lP----~~~ 331 (501) T protein:vir:95 288 EESCYIVGQPTPVLIGLTE---------------------EWVTNVLK---------GSVNFGSRGGI--PLP----VGA 331 (501) T ss_pred HHHHHHcccceeeeeCCcc---------------------cccccCCC---------Cceeecccccc--cCC----CCC Confidence 5666678888776651110 01111111 11234555555 555 344 Q ss_pred CccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCCCccccc-hhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHH Q lcl|NC_011222. 343 DLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGEGELNRS-EAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDS 421 (577) Q Consensus 343 Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~-kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvld 421 (577) | ++++-++-..+. +++++++.+.|... | ...++.. -+.+.++.........+.|..+-.|++.+-.-+++ T Consensus 332 ~----~~~ie~~~~~i~--~~~l~~l~~~m~~~--G-a~ll~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~le~al~~~l~ 402 (501) T protein:vir:95 332 D----AKLLQASENTML--KEAMDTKERQMVAL--G-AKLVEQKEVQRTATEAELEAASEGSTLSSATKNVSAAFEWALK 402 (501) T ss_pred c----eeEEecChhhHH--HHHHHHHHHHHHHH--H-HhhccCCccchhHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH Confidence 5 577777766664 78899999998764 4 2223322 23455666677777788999999999999999999 Q ss_pred HHHHhhcCcccccccccCCcccccCCH-HHHHHHHHHHHHcC-CCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCC Q lcl|NC_011222. 422 TICLLRYGDSFVSCNINYGTEFYIYTP-EELSERYKIMKETG-ASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPY 499 (577) Q Consensus 422 ti~~lRyg~~~~~~ti~yGskFy~~t~-eeL~~~i~~Ak~~G-as~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~ 499 (577) .+|.. +|..--+.+|.+..+|..... .+++..+..|...| +|...+- .+|+|+ ..++|- T Consensus 403 ~~a~w-~g~~~~~~~v~i~~df~~~~~~~~~~~al~~~~~~G~is~~t~~---------------~~L~~~---~v~~~~ 463 (501) T protein:vir:95 403 WAARW-VGQADSGVKFELNTDFDIARMTPDERRSLVEEWQKGAITFEEMR---------------TGLRKA---GVATED 463 (501) T ss_pred HHHHH-cCCCCCceEEEEecccccccCCHHHHHHHHHHHhCCCCcHHHHH---------------HHHHhC---CCCChh Confidence 87775 565444457777888988763 33455566667777 4544331 123443 444454 Q ss_pred cCccHHHHHHHHhcCCCchhheeeeecchhhhhhhhccCCchhhhhhcc Q lcl|NC_011222. 500 SHLTREEAVNLYKENVISEEDLRVKLNLPTFVRRFERENMNIIEFGSAL 548 (577) Q Consensus 500 ~~LT~~Ev~~l~e~g~~~eEdl~vk~n~~~fv~rfe~en~~i~efg~~l 548 (577) .....++|.+........++.-.. --=+....+ .|+.= T Consensus 464 ~~~e~e~i~~~~~~~~~~~~~~~~--------~~~~~gg~~---~~~~~ 501 (501) T protein:vir:95 464 DSKAKEKIAKDTAEAMALATPANV--------PGDGSGGDN---VGNSE 501 (501) T ss_pred HHHHHHHHHhhhcCcccccccCCC--------CCCCccccc---ccCCC Confidence 444455554443321111110000 000000000 11111 No 5 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=98.27 E-value=1.9e-06 Score=52.01 Aligned_cols=444 Identities=11% Similarity=0.089 Sum_probs=229.2 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhccccc----chHHHHHHHHHHHhhcchhHHHHHHHhhccCCC Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDR----NKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVKT 76 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~----~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~t 76 (577) -||--| | -.+||+= .++.+|=+.|+ -++.. ++ +..+.-++- +.--++.|..+++.--|+ T Consensus 5 ~~~~~~-V--~~~hp~y----~a~~~~W~~ir---d~~~G-~~~~~~r~~yl~~~~---~~~~e~~Y~~rl~rA~~~--- 67 (491) T protein:vir:95 5 NGQGSG-V--KTKHREW----LHYAPKWQKVR---HALAG-DLVGYLRNVGLNEPD---KAYGEARQAEYEAGGIVY--- 67 (491) T ss_pred CCccCC-C--CccCHHH----HHHHHHHHHHH---HHhcC-cchhhcccCCCcCCC---CCCCHHHHHHHHhcccCC--- Confidence 233222 1 1466653 23333333332 22222 21 111111111 111134488888876665 Q ss_pred ccchHHHHHHHHHHhccCCccccccccChhhhhhhhH-HHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccC----- Q lcl|NC_011222. 77 NGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEY-YRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQV----- 150 (577) Q Consensus 77 ~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~-y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~----- 150 (577) ..|...-+.|.-.+=.+++..+ -|+.+++|.. ==.++.++.+|.+.-+=.++. .=-+-++||+|.... T Consensus 68 -n~~~~tl~~l~G~vfrk~p~~~----~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~-~G~~~ilVD~P~~~~~T~Ad 141 (491) T protein:vir:95 68 -NFTRRTLSGMVGSVMRKEPEIN----IPKELEYLLKNADGSGVGLIQHAQDTLMEIDS-VGRGGLLVDAPETAAATAAE 141 (491) T ss_pred -ChHHHHHHHHhchhhcCCceee----ccHHHHHHHhccCCCCCCHHHHHHHHHHHHHH-cCeEEEEEecCCCcccCHHH Confidence 2334444444433333555432 3454544321 113788998888877655555 445667899997532 Q ss_pred --CCccccchhhhhHHHHHHHHhhc-ccccceeeeeeec-----C---------CCEEEEE--e-ccc--eeeeccCCcc Q lcl|NC_011222. 151 --GEKPEPYFFWLPIANVLSYRTCG-KDCNLMAYIMYVT-----D---------ENKIVYI--D-EER--YVRFDKTREN 208 (577) Q Consensus 151 --~~rpqpyf~~~pie~V~~y~~~~-~~~~~i~~i~~~q-----D---------~N~i~~I--D-d~~--y~~y~kn~k~ 208 (577) ....-||+...+-++++.+++.- +-.+.+.++-+-+ | ..+++++ + +.. ..+|.+++++ T Consensus 142 e~~~~~rPy~~~~~~~~IinW~~~~v~g~~~L~~v~l~E~~~~~d~~~~f~~~~~~qyRvL~l~~~g~~~~~v~r~~~~g 221 (491) T protein:vir:95 142 QNAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYHEPGNEFETKYGEQYRVLDIDTDGNYRQRLFRFDAEG 221 (491) T ss_pred HHHhcCCcEEEEechhhhcCceeeeeCCceeeeEEEEEEeEEeecCCCCcccceEEEEEEEeecCCCceEEEEEEEcCCC Confidence 12335999999999999998511 1123455555422 2 1224443 2 333 3567666544 Q ss_pred ce------eeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHH-HHHHHhhcCCceeecccccC Q lcl|NC_011222. 209 DL------ILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTA-KKHLDLYASYPIYSGYERDC 281 (577) Q Consensus 209 ei------~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts-~~hldlyA~YPiYs~y~~DC 281 (577) .- ..+...-|-||++|..++ |-. .+.+.+-++||- .|+-+---||.++| .+|.--.++.|.-+.+.-|= T Consensus 222 ~~~~~~~~~~~~~g~~~l~~IPfv~~-~~~--~~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~~~G~d~ 297 (491) T protein:vir:95 222 GAQEEVVEIYPDLGESLRGVIPFTFI-GAT--NNDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGDN 297 (491) T ss_pred cceeeeeeeeecCCCcccCeeEEEEE-ecC--CCCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCcc Confidence 22 222223489999999444 532 344557888877 34434344554444 34555567777776544221 Q ss_pred CccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHH Q lcl|NC_011222. 282 HYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYN 361 (577) Q Consensus 282 ~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~ 361 (577) ... ++....+. .....|+++.+.+|- ..+ .+++-+.-..+ - T Consensus 298 ~~~------------------~~~~~~~~---------~~i~~g~~~~~~lP~------~~~----~~~ie~~~~~~--~ 338 (491) T protein:vir:95 298 LTP------------------QSFKEANP---------NGIKFGSRCGHNLGY------GGS----AQLIQAGENNL--A 338 (491) T ss_pred cCc------------------chhhccCc---------ceeEecCcCCcCCCC------CCc----cceeecCcchH--H Confidence 110 11111110 112345555544442 122 33444432223 2 Q ss_pred hhHHHHHHHHHHhhhhcCCCccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcc-cccccccCC Q lcl|NC_011222. 362 VNEEKRLRDELVRSVTGGEGELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDS-FVSCNINYG 440 (577) Q Consensus 362 ~~~~kri~d~i~~s~~Gf~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~-~~~~ti~yG 440 (577) +++++++.+.|...=- ..+..+.+.+.+++..++..-.+.|..+-.|++.+-.-+++.+|.. .|.. --+..|.+. T Consensus 339 ~~~l~~~e~qm~~~Ga---~l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-~G~~~~~~v~i~~n 414 (491) T protein:vir:95 339 RQNMLDKEQQAIQIGA---QLITPSQQITAESARIQRGADTSVMATIARNVSQAYTDALRWVAMM-LGKPEDSEVEFQLN 414 (491) T ss_pred HHHHHHHHHHHHHHHH---HhccCCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCCceEEEee Confidence 6677777777765411 2233344677788888888899999999999999999999999988 7864 345577888 Q ss_pred cccccCC--HHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcCccHHHHHHHHhcCCCch Q lcl|NC_011222. 441 TEFYIYT--PEELSERYKIMKETGASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSHLTREEAVNLYKENVISE 518 (577) Q Consensus 441 skFy~~t--~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~e 518 (577) .+|.+.. ++++.+-++...+-.+|...+- .+|+|+.|+ |+ |.+++.+..+.--+.- T Consensus 415 ~dF~~~~~~~~~~~all~~~~~G~is~~t~~---------------~~L~~~~vl---~~----~~e~~~~~ie~~~~~~ 472 (491) T protein:vir:95 415 MDFFLQPMTAQDRAAWMADINAGLLPATAYY---------------AALRKAGVT---DW----TDEDILNAIEDAPLPS 472 (491) T ss_pred cccccccCCHHHHHHHHHHHhcCCCCHHHHH---------------HHHHhCCCC---Cc----cHHHHHHHHHhcCCCC Confidence 8997766 7777666666665555655431 346666654 32 4555555433221100 Q ss_pred -hheeeeecchhhhhhhhccCCchhhhhhccChhhhHH Q lcl|NC_011222. 519 -EDLRVKLNLPTFVRRFERENMNIIEFGSALDYKKKIE 555 (577) Q Consensus 519 -Edl~vk~n~~~fv~rfe~en~~i~efg~~l~~~~ki~ 555 (577) -...|=--.+.=+.. +-| T Consensus 473 ~~~~~~~~~~~~~~~~-------------------~~~ 491 (491) T protein:vir:95 473 GAVTQVAGEIPQAAQQ-------------------QQE 491 (491) T ss_pred Cccccccccchhhhhh-------------------ccC Confidence 000000000000000 000 No 6 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=97.17 E-value=0.00014 Score=41.64 Aligned_cols=407 Identities=11% Similarity=0.061 Sum_probs=187.7 Q ss_pred HHHHHHHHHHHhhcchhHHHHHHH----hhccCC-------C----ccchHHHHHHHHHHhccCCccccc-cccChhhhh Q lcl|NC_011222. 46 PVIDFLSKVKTWIAKDKYDIFLSM----FHFPVK-------T----NGVTSEIFDKLSRVFDGRNPVYNY-QFKSSEDRD 109 (577) Q Consensus 46 ~~~~fl~~v~~~l~kdky~~f~~~----f~fpv~-------t----~~lt~~iF~~L~kV~dgqd~~~~y-~f~~~e~~~ 109 (577) .|++|.+.-++.+- |+.++..- ..-|.. . +.+.+-|=+...-.+-|....+++ .-.+.+.++ T Consensus 1 ~~~~~~~~~~~r~~--~l~~yy~g~~~~~~~~~~~~~~~~~~~ki~~n~~~~ivd~~~~~l~g~~~~~~~~~~~~~~~~~ 78 (440) T protein:vir:95 1 MLAAFLGSQKQRLA--ILASYAQGDNFSILSGHRRLDDEKADYRVRHKWGGYISSFATGYVIGNPVSIGVMEGGSADQLS 78 (440) T ss_pred ChhhHHHHHHHHHH--HHHHHhccCCcccccccccccccCCcceeecchHHHHHHhhhhheeccCceEeeCCCccHHHHH Confidence 44444443222221 11111110 001111 1 112222222322333445545443 344556666 Q ss_pred hhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecC Q lcl|NC_011222. 110 DWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTD 187 (577) Q Consensus 110 d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD 187 (577) -|..+.+. +++.+....-+-++.. -=.++++|- ++.+++|. .-..-|.+....|...+ +.+-+.++-+ ..+ T Consensus 79 ~l~~~~~~-n~~~~~~~~~~~~~~~-~G~a~~~~~---~d~~~~~~-i~~~~p~~~~~~~d~~~-~~~~~~~i~~~~~~~ 151 (440) T protein:vir:95 79 TIKDIEWQ-NDINALNSDLAFDASV-YGRAYEYHF---RDKDKVDR-VVLISPLEMFVIRDLTV-EQNIIAAVHLPIYAD 151 (440) T ss_pred HHHHHHHh-cCHhHHHHHHHHHHhh-cCeEEEEEE---ecCCCceE-EEEEcccceEEEEcCCC-CCceEEEEEEEEecC Confidence 66655423 3555544444434443 334566664 44555543 22223444433343322 2234555554 333 Q ss_pred CCEEEEEeccceeeeccCCc--cceeeehhhhhhhcccceeeEecccccccCcceeeccchhHH---HHHHHHHHHHHHH Q lcl|NC_011222. 188 ENKIVYIDEERYVRFDKTRE--NDLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSEL---DSFDWYLYYSTAK 262 (577) Q Consensus 188 ~N~i~~IDd~~y~~y~kn~k--~ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L---~a~D~ll~~~ts~ 262 (577) +.++-+-++..+.+|...+. +.+..+....|.||.||.--|.-.. .-.|=++... +|+|+.++.. T Consensus 152 ~~~~~vyt~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~-------~g~sd~e~v~~lida~~~~~s~~--- 221 (440) T protein:vir:95 152 KVNMTVYTKDKVITYKPYSNNSVRLVVDDVKKHSYNDVPVVEWWNNR-------FRMGDYESEISLIDAYDAGQSDT--- 221 (440) T ss_pred ceEEEEEeCCeEEEEEEecCCccceeecceeeccCceeeEEEeeCCC-------CCCCchhhhHHHHHHHHHHHHHH--- Confidence 34455668888888875543 3566666666999999995443222 2234444444 5555555433 Q ss_pred HHHHhhcCCce--eecccccCCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeec---Ccc Q lcl|NC_011222. 263 KHLDLYASYPI--YSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPI---PDE 337 (577) Q Consensus 263 ~hldlyA~YPi--Ys~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPi---P~~ 337 (577) ..--.+++-|+ ..|+...+..-..++.. .+.++ ++-++. +.. T Consensus 222 ~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~--------------------------------~~~~~-~~~~~~~~~~~~ 268 (440) T protein:vir:95 222 ANYMSDLNDAMLLVKGDLDGIKLSPEDAAK--------------------------------MKDAN-MLFLKTGISTTG 268 (440) T ss_pred HHHHHHhhcceeeeecccccCCCCccchhh--------------------------------hhhcc-ceeccccccccc Confidence 33223444554 45544433322221111 11111 122211 111 Q ss_pred cccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHH Q lcl|NC_011222. 338 MHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQ 416 (577) Q Consensus 338 ~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k 416 (577) +..++| |++++.+.. .+-.....+++.+.|+...-.-. ...+..-+.++++..+-.-.+.+++.+..+.|++.- T Consensus 269 ~~~~~~----~~~lt~~~~-~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l 343 (440) T protein:vir:95 269 QQTTAD----ASYIYKQYD-VNGTEAYKNRLANDIHRFSRIPNLDDDRFNSTSSGIALLYKMIGLEQVRKDKETYFTKAL 343 (440) T ss_pred CCCCcc----eeEEeecCC-HHHHHHHHHHHHHHHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112223 666666542 34445567778888877652211 111223467888888888899999999999999998 Q ss_pred HHHHHHHHHhh---cCcccccccccCCcccccCCHHHHHHHHHHHHHc-CCCHHHHHHHHHHHHHHhhcCCHHHHHHHHH Q lcl|NC_011222. 417 TFVDSTICLLR---YGDSFVSCNINYGTEFYIYTPEELSERYKIMKET-GASEAELDALRQQIIETEYRNDPTQMQRLLI 492 (577) Q Consensus 417 ~Fvldti~~lR---yg~~~~~~ti~yGskFy~~t~eeL~~~i~~Ak~~-Gas~~~i~~L~~qi~e~EyrNnP~qmqr~~v 492 (577) .-+...++.+. .|..+- ...-.-.|-...|..+.+.++++.+. |+=+-+-. +=.--+=.+|.+++|++- T Consensus 344 ~~~~~li~~~~~~~~~~~~~--~~~v~i~f~~~~p~~~~~~ad~~~kl~g~iS~et~-----~~~l~~~d~~~E~~ri~~ 416 (440) T protein:vir:95 344 RRRYELISNIHKAINGPVIE--ANKLTFTFHPNIPQDVWTEIKAYIEAGGEISQETL-----MENASFTDYKTEHSRILK 416 (440) T ss_pred HHHHHHHHHHHhhcCCcccc--cccceEEeCCCCCCCHHHHHHHHHHHhccCcHHHH-----HHhCCCCCcHHHHHHHHH Confidence 88888877662 222111 11223456667777777777665443 32111111 111112234666666543 Q ss_pred HHhhCCCcCccHHHHHHHHhcCCCchh Q lcl|NC_011222. 493 LNEIEPYSHLTREEAVNLYKENVISEE 519 (577) Q Consensus 493 L~~leP~~~LT~~Ev~~l~e~g~~~eE 519 (577) +=+.-. +..++..--.+.|-.++| T Consensus 417 --E~~~~~-~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 417 --QGGSSD-LEIGQIVGDADVGQADTE 440 (440) T ss_pred --HHHHhh-hhHHhhccCCCCCCcCCC Confidence 322211 111111111122333344 No 7 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=97.07 E-value=0.00018 Score=41.06 Aligned_cols=419 Identities=13% Similarity=0.064 Sum_probs=189.1 Q ss_pred HHHHHHHHHHHHHhhhhc-cccc----chHHHHHHHHHHHhhcchhHHHHHHHhh--ccC----------------C-Cc Q lcl|NC_011222. 22 AKAKEHEERIAFHTRVRT-SDDR----NKPVIDFLSKVKTWIAKDKYDIFLSMFH--FPV----------------K-TN 77 (577) Q Consensus 22 aka~~heeri~fh~~~~t-a~d~----~~~~~~fl~~v~~~l~kdky~~f~~~f~--fpv----------------~-t~ 77 (577) -+-+.|+-+=.+|-++-- ..+- ...+..+++.-..-++ +|....+.+. .++ . -| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~~~yY~g~~~i~~~~~~~~~~~~~~~~~~~~ 78 (468) T protein:vir:96 1 MIDIFWPNEKPYHERVVEQIKPQYETQEEMILRLITKHKENVE--DITVGERYYNHQPDVLFNAPKRNVKGEIDPFKPDW 78 (468) T ss_pred CccccCCcCceeehheeecccccccCcHHHHHHHHHHHHHHHH--HHHHHHHHhcCCCcccccccccccccccccccccc Confidence 223334444445554432 2222 2233333333222222 2222211111 010 0 01 Q ss_pred cchHHHHHHHHHH----hccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCc Q lcl|NC_011222. 78 GVTSEIFDKLSRV----FDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEK 153 (577) Q Consensus 78 ~lt~~iF~~L~kV----~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~r 153 (577) -|+--....+... +=|....++ -.+.+..+-+..+++ .|+.+.... ..+....-=-+++.|. ++.+++ T Consensus 79 ki~~n~~~~Iv~~~~~~l~g~p~~~~--~~d~~~~~~l~~~~~--n~~~~~~~~-~~~~~~~~G~~~~~v~---~d~~~~ 150 (468) T protein:vir:96 79 RMYTNYHQNLVDQKVAYAVANPVTYG--TEDEKSLKTIQEVLN--HKWDDKLVD-ILTAASNKGVEWIQPY---VDEQGE 150 (468) T ss_pred ccccchHHHHHHHHHhhhccCCceec--cCChHHHHHHHHHHh--cCHHHHHHH-HHHHHhhcCeEEEEEE---EcCCCc Confidence 1111111111111 112333322 233444444444442 244444332 3344333444677776 444554 Q ss_pred cccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCcc-------------ceeeehhhhh Q lcl|NC_011222. 154 PEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTREN-------------DLILEVDNMH 218 (577) Q Consensus 154 pqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~-------------ei~~e~~~~H 218 (577) +. +.+.-|.+....|.. ....+.+.++-+ .++..++-+-.+.++..|...+.. .+..+....| T Consensus 151 ~~-i~~~~p~~~~~v~~~-~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 228 (468) T protein:vir:96 151 FK-TFRVPAEQAIPIWTN-KERDELKAFIRLYELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNKSM 228 (468) T ss_pred eE-EEEEcccceEEEEcC-CCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccccc Confidence 42 222234333222322 112234444433 334445555556665555543211 2233344559 Q ss_pred hhcccceeeEecccccccCcceeeccchhHH---HHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCc Q lcl|NC_011222. 219 DLGYCPARFFWSDSISLSEPDIKISPITSEL---DSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDG 295 (577) Q Consensus 219 ~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L---~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G 295 (577) .||.||.--|--.. .-.|=++.+. +|+|+.++. ...-=.+++-|+++... ..+.+. .+ T Consensus 229 ~~~~iPvv~~~n~~-------~g~sd~e~v~~liDa~d~~~S~---~~~~~~~~~~p~lv~~g-------~~~~~~--~~ 289 (468) T protein:vir:96 229 SWNRVPFIPFKNNP-------QEVSDLFMYKTIIDAMDKRLSD---TQNTFDEATELIYVLKG-------YEGEDL--EE 289 (468) T ss_pred cCCcccEEEecCCC-------CCCCchHHHHHHHHHHHHHHHH---HHHHHHHhcCceeeeec-------CCcccc--ch Confidence 99999995553322 2344455544 555555443 33322455677776441 111110 00 Q ss_pred ceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhh Q lcl|NC_011222. 296 FLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRS 375 (577) Q Consensus 296 ~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s 375 (577) +.. . +..+..+.++- .++.| |++++++... +-....++++.+.|+.. T Consensus 290 ~~~-----------------------~-~~~~~~i~~~~----d~~~~----~~~l~~~~~~-~~~~~~~~~l~~~I~~~ 336 (468) T protein:vir:96 290 FMY-----------------------N-LKYYKAINVDG----DGSGG----VDTIQIDVPV-QSAKEYLDMLRDYVIEF 336 (468) T ss_pred hhh-----------------------h-hhcCceEEecC----CCCCc----ceEEeecCCh-HHHHHHHHHHHHHHHHH Confidence 100 0 11233444433 12333 5666666532 44445567788888877 Q ss_pred hhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHHHHHHH Q lcl|NC_011222. 376 VTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPEELSER 454 (577) Q Consensus 376 ~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~eeL~~~ 454 (577) .-+.. ...+..-+.++++.+.-+-.+..++.+..+.|+++-+-++..++++. |..+=.-+| --.|....|..+++. T Consensus 337 s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~-g~~~d~~~i--~i~f~~~~p~d~~e~ 413 (468) T protein:vir:96 337 GQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY-KLSIKVQDV--EITFNFNVMVNELEQ 413 (468) T ss_pred hCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCccccee--eEEecCCCCcCHHHH Confidence 63311 11122246788889988999999999999999999998888888873 433211122 345777888889999 Q ss_pred HHHHHHcCC-CHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeeecchh Q lcl|NC_011222. 455 YKIMKETGA-SEAELDALRQQIIETEYRNDP-TQMQRLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKLNLPT 529 (577) Q Consensus 455 i~~Ak~~Ga-s~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~n~~~ 529 (577) +++|+..|+ |...+.. +.- +-.|| .+|+|++- +-+ +... .+.++.+.++ +=|| T Consensus 414 a~~~~~~g~iS~et~i~-~l~-----~v~D~~~E~~ri~~--E~~--------~~~~-~~~~~~~~~~-----~~~~ 468 (468) T protein:vir:96 414 SQIGVNSQYLSKETVVT-NHP-----WVDDPVAEMERIDQ--EEL--------ALPS-IEEGLNGKEN-----NEPT 468 (468) T ss_pred HHHHHhcCCCchHHHHH-hCC-----CCCCHHHHHHHHHH--HHH--------HHHH-HhhccCCCCC-----CCCC Confidence 999999885 4333211 111 22354 33444322 111 1111 1122322221 1122 No 8 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=96.71 E-value=0.00039 Score=39.28 Aligned_cols=450 Identities=13% Similarity=0.146 Sum_probs=217.4 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcc-----------hhHHHHHHH Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAK-----------DKYDIFLSM 69 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~k-----------dky~~f~~~ 69 (577) +|-|.-.|. .+||+ .+.+.+|=+.| ..++...++ .-..=++.||| ..|..+++. T Consensus 28 ~~~~m~dV~--~~hp~----y~a~~~~W~~i---rd~~~G~~~------~r~~g~~YLP~~~~~~~~~E~~~~Y~~rl~r 92 (535) T protein:vir:80 28 LGPSLPNVG--YQRVE----FGEMLPKWRKI---MDCLSGQEA------IKAKREEYLPMPSVDSRDEEQRRRYETYLQR 92 (535) T ss_pred CCCCCCCCC--cCCHH----HHHHHHHHHHH---HHHhcChHH------HHhcccccCCCCCcccCCcCCHHHHHHHHhh Confidence 444443321 46665 23333333333 233432221 11122345666 238888887 Q ss_pred hhccCCCccchHHHHHHHH-HHhccCCccccccccChhhhhhhhH-HHhhccchhHHHHHHHHHHHHhCCCeEEEEeecc Q lcl|NC_011222. 70 FHFPVKTNGVTSEIFDKLS-RVFDGRNPVYNYQFKSSEDRDDWEY-YRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPE 147 (577) Q Consensus 70 f~fpv~t~~lt~~iF~~L~-kV~dgqd~~~~y~f~~~e~~~d~~~-y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~ 147 (577) --|+= .|..+-+.|. +|| .+++.. .-|+-++.|.. ==.++.++.+|.+.-+=..+..-- +-++||+|. T Consensus 93 A~~~n----~~~~tl~~l~G~vf-rk~p~~----~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~~G~-~~iLVD~P~ 162 (535) T protein:vir:80 93 AIFYN----VTARTLDGMMGQVF-SRDPIR----QLPPALEAIVEDIDGEGVSLDQQAKKALGYTMGFGR-AAIFTDYPN 162 (535) T ss_pred ccCCC----hhHHHHHHHhchhh-cCCcce----eccHHHHHHHhccCCCCCCHHHHHHHHHHHHHhcCe-EEEEEeecC Confidence 55542 2333333333 333 345533 33555555541 114788898888877766666544 456679986 Q ss_pred ccC--------CCccccchhhhhHHHHHHHHhhcc-cccceeeeee-----ecCCC-------EEEEEec---ccee--e Q lcl|NC_011222. 148 VQV--------GEKPEPYFFWLPIANVLSYRTCGK-DCNLMAYIMY-----VTDEN-------KIVYIDE---ERYV--R 201 (577) Q Consensus 148 i~~--------~~rpqpyf~~~pie~V~~y~~~~~-~~~~i~~i~~-----~qD~N-------~i~~IDd---~~y~--~ 201 (577) ... ....-||+...+-++|+.+++.-. -.+...++-+ .+|+. +++++.- ..|. + T Consensus 163 ~~~~~t~ade~~~~~rPy~~~y~ae~IinW~~~~v~G~~~Lt~v~lrE~~~~~dd~f~~~~~~q~RvL~~~~~G~y~v~~ 242 (535) T protein:vir:80 163 VGRPVTVLEQKLGLYRPTITLVHPTSIINWRTKLVGGKSVISLVVIQENVLAQDDGFETTYVQQWRVLQLNAEGNYQVER 242 (535) T ss_pred CCCcccHHHHHhcCCCcEEEEechhhccCccccccCCccceeEEEEEEEEEecCCCcccceeEEEEEEEecCCceEEEEE Confidence 421 124459999999999999997321 1123444433 22322 3555532 2333 3 Q ss_pred eccCCcc-------ceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHH-HHHHHhhcCCce Q lcl|NC_011222. 202 FDKTREN-------DLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTA-KKHLDLYASYPI 273 (577) Q Consensus 202 y~kn~k~-------ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts-~~hldlyA~YPi 273 (577) |...+++ ++......-|-|||+|..++ | ... +.+.+-++||- .|+-+--=||.+.| ..|.--.++.|. T Consensus 243 ~~~~~~~~~~~~~~~~~~~~~g~~~l~~IPfv~~-~-~~~-~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~il~~~~~P~ 318 (535) T protein:vir:80 243 WRRETQEEMYYSYSKHVPTDGNGNPFKEIPFQFI-G-PLD-NNADIDHPPLL-DLCEVNIGHYRNSADYEEMAFVAGQPT 318 (535) T ss_pred EEeecCCccccccceeecccCCCcccCeeEEEEe-e-cCC-CCCCCCccchH-HHHHHHHHHhhchhHHHHHHHHhcCce Confidence 4433322 12222335599999999544 5 333 34557888887 34434444666666 556666778887 Q ss_pred eecccccCCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecc Q lcl|NC_011222. 274 YSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSA 353 (577) Q Consensus 274 Ys~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~ 353 (577) -+.+ |.+. +|.....+ ...-..|+++.+.+|- ..+ -..+.+.-. T Consensus 319 l~i~----------G~~~-----------~~~~~~~~--------~~~i~iG~~~~~~lP~------~~~-~~~~e~~~~ 362 (535) T protein:vir:80 319 AFFT----------GLTK-----------DWVEDVFK--------DFKVHLGSRAIIPLPQ------GAT-AGILQITPN 362 (535) T ss_pred eeee----------cCch-----------hhhhcCCC--------CcceEecCcccccCCC------CCC-cceeeeccc Confidence 6655 2111 01000000 0112356666664442 233 123444333 Q ss_pred cHHHHHHHhhHHHHHHHHHHhhhhcCCCccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccc Q lcl|NC_011222. 354 DTGSLEYNVNEEKRLRDELVRSVTGGEGELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFV 433 (577) Q Consensus 354 di~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~ 433 (577) .++ .+.++++-+.|...=-.. .-+..-+.+.++...++.+..+.|..+-.|++.+-.-+++.+|.. +|.-.- T Consensus 363 ~~a-----~~~l~~~e~qM~~lGa~l--l~~~~~~~Ta~~a~~~~~~~~S~L~~~a~~le~al~~aL~~~A~w-~G~~~~ 434 (535) T protein:vir:80 363 SVP-----FEAMTHKESQMIAMGANL--LVKSGGNRTFGEAQQEEASEQSILSACTKNVSMAFRKALRWANQF-QTGIVN 434 (535) T ss_pred hhH-----HHHHHHHHHHHHHHHHHh--hccCcccccHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH-cCCccC Confidence 333 244556666665531111 111111233334445777778889999999999988889866554 453221 Q ss_pred --cccccCCcccccCC--HHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcCccHHHHHH Q lcl|NC_011222. 434 --SCNINYGTEFYIYT--PEELSERYKIMKETGASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSHLTREEAVN 509 (577) Q Consensus 434 --~~ti~yGskFy~~t--~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~LT~~Ev~~ 509 (577) +..|.+..+|.... ++++.+-++...+-.+|...+- .+|+|..| |+|......++.+. T Consensus 435 ~~~~~i~~n~dF~~~~ld~~~~~all~~~~~G~Is~et~~---------------~~L~r~gv---l~~~~~~eee~~ri 496 (535) T protein:vir:80 435 DETVEYNLNTDFPAARLTPNERAELILEWQQGAITFKEMR---------------AGLRRAGV---ASEDDAKAETEGKA 496 (535) T ss_pred CCceEEEeccccccccCCHHHHHHHHHHHhcCCCCHHHHH---------------HHHHhCCC---CCcccchHHHHHHH Confidence 22467778886664 6766665555444446654331 13555444 44444443322221 Q ss_pred HHh-------cCCCchhheeeeecchhhhhhhhccCCchhhhhh Q lcl|NC_011222. 510 LYK-------ENVISEEDLRVKLNLPTFVRRFERENMNIIEFGS 546 (577) Q Consensus 510 l~e-------~g~~~eEdl~vk~n~~~fv~rfe~en~~i~efg~ 546 (577) -.| .|...+. -+=-+=+-..-+-|+---.-|+ T Consensus 497 ~~E~~~~~~~~g~~~d~-----~~~g~~~~~~~~~~~~~~~~~~ 535 (535) T protein:vir:80 497 TVEFIAKTAAAGKVGDA-----ASGGTNKAKLNNGNGGGNQAGN 535 (535) T ss_pred HhhhhhccccCCCCCCC-----CCCCCCcCcccCCccccccCCC Confidence 111 1211100 0000000111112222222233 No 9 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=96.62 E-value=0.00046 Score=38.87 Aligned_cols=433 Identities=11% Similarity=0.091 Sum_probs=222.4 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccc---cchHHHHHHHHHHHhhcchhHHHHHHHhhccCCCc Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDD---RNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVKTN 77 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d---~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~t~ 77 (577) -||--| | -.+||+= .++.+|=+.|+ -++.... .+..+..++. +.--++.|..+++.--|+ T Consensus 5 ~~~~~~-V--~~~hp~y----~a~~~~W~~ir---d~~~G~~~~~~r~~yl~~~~---~~~~e~~Y~~rl~rA~~~---- 67 (489) T protein:vir:78 5 NGQGSG-V--KTKHREW----LHYAPKWQKVR---HALAGELVSYLRNVGLNEPD---KAYGEARQAEYEAGGIVY---- 67 (489) T ss_pred CCccCC-C--CccCHHH----HHHHHHHHHHH---HHhcCcccccccCCCCCCCC---CCCChHHHHHHHhccccC---- Confidence 333222 1 1467653 23333333332 2222211 1111222221 121134588888776555 Q ss_pred cchHHHHHHHHHHhccCCccccccccChhhhhhhhH-HHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccC------ Q lcl|NC_011222. 78 GVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEY-YRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQV------ 150 (577) Q Consensus 78 ~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~-y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~------ 150 (577) ..|...-+.|.-.+=.+++.. .-|+.++.|.. ==.++.++.+|.+.-+=.++. .=-+-++||+|.... T Consensus 68 n~~~~tl~~l~G~vfrk~p~~----~~p~~l~~l~~d~D~~G~~L~~f~~~~~~~~l~-~G~~~ilVD~P~~~~~T~ade 142 (489) T protein:vir:78 68 NFTRRTLSGMVGSVMRKEPEI----NIPKELEYLLKNADGSGVGLIQHAQDTLMEIDS-VGRGGLLVDAPETGAATAAEQ 142 (489) T ss_pred ChHHHHHHHHhchhhcCCcce----eccHHHHHHHhccCCCCCCHHHHHHHHHHHHHh-cCeEEEEEeeCCCCCcCHHHH Confidence 233444444443333355543 34555554421 113788998888877655555 445667889996531 Q ss_pred -CCccccchhhhhHHHHHHHHhhcc-cccceeeeeeec-----CC-C--------EEEEE--e-ccce--eeeccCCcc- Q lcl|NC_011222. 151 -GEKPEPYFFWLPIANVLSYRTCGK-DCNLMAYIMYVT-----DE-N--------KIVYI--D-EERY--VRFDKTREN- 208 (577) Q Consensus 151 -~~rpqpyf~~~pie~V~~y~~~~~-~~~~i~~i~~~q-----D~-N--------~i~~I--D-d~~y--~~y~kn~k~- 208 (577) ....-||+...+-++++.+++.-. -.+.+.++-+-+ |. + +++++ + +..| .+|.+++++ T Consensus 143 ~~~~~rPy~~~~~~~~IinW~~~~v~G~~~Lt~v~lrE~~~~~d~~~~f~~~~~~q~RvL~~~~~g~~~~~~~r~~~~g~ 222 (489) T protein:vir:78 143 NAGLLNPTIAFYTTENIVNWRLTRVGSVNRVTMVVLRETWEYNEPGNEFETKYGEQYRVLDIDSDGNYRQRLFRFDAEGG 222 (489) T ss_pred HHhcCCcEEEEechhhhcCceeeeeCCccceeEEEEEEeEEeecCCCCccceeEEEEEEEecCCCcceEEEEEEeecCCc Confidence 123369999999999999986221 112455544422 21 2 24454 3 3333 456666555 Q ss_pred ---cee-e-ehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHH-HHHHHhhcCCceeecccccCC Q lcl|NC_011222. 209 ---DLI-L-EVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTA-KKHLDLYASYPIYSGYERDCH 282 (577) Q Consensus 209 ---ei~-~-e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts-~~hldlyA~YPiYs~y~~DC~ 282 (577) +++ . +...-|-||++|..++ |-. .+.+.+-++||- .|+-+---||.++| .+|.--.++.|.-+.+.-|=. T Consensus 223 ~~~~~~~~~~~~g~~~l~~IPfv~~-~~~--~~~~~~~~pPLl-~LA~lni~Hy~~ssd~~~~l~~~~~P~l~i~G~d~~ 298 (489) T protein:vir:78 223 AQEDVVEIYPDLGESLRGVIPFTFI-GAT--NNDATIDDAPLL-PLAELNIGHYRNSADNEESSFVVGQPTLFIYPGENL 298 (489) T ss_pred ccceeeEEeccCCCCccCeeeEEEE-ecC--CCCCCCCcCchH-HHHHHHHHHhhhhhHHHHHHHHcccceeeeecCccC Confidence 221 1 1223399999999444 532 344557888887 34444444554444 455566788888776521111 Q ss_pred ccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHh Q lcl|NC_011222. 283 YESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNV 362 (577) Q Consensus 283 ~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~ 362 (577) . ++ +.+-.+-.+ ...|+++.+.+|- ..+ .+++-+.-..+. + T Consensus 299 ~----------~~--------~~~~~~~~~---------i~~g~~~~~~lp~------~~~----~~~ie~~~~~~~--r 339 (489) T protein:vir:78 299 T----------PQ--------AFKEANPNG---------IKFGSRRGHNLGY------GGS----AQLIQAGENNLA--R 339 (489) T ss_pred C----------cc--------cccccCccc---------eeeCCcccccCCC------CCC----cceeccCcchHH--H Confidence 0 01 111111111 1234455443332 122 244444433332 6 Q ss_pred hHHHHHHHHHHhhhhcCCCccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcc-cccccccCCc Q lcl|NC_011222. 363 NEEKRLRDELVRSVTGGEGELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDS-FVSCNINYGT 441 (577) Q Consensus 363 ~~~kri~d~i~~s~~Gf~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~-~~~~ti~yGs 441 (577) ++++++.+.|... | ...+..+.+.+.+++..++..-.+.|..+-.|++.+-.-+++.+|+. +|.. -.+.+|.+-. T Consensus 340 ~~l~~le~qm~~l--G-a~l~~~~~~~Ta~~~~~~~~~~~S~L~~~a~~~e~al~~~l~~~a~w-~G~~~~~~~~i~~n~ 415 (489) T protein:vir:78 340 QNMLDKEQQAIQI--G-AQLITPTQQITAQSARIQRGADTSVMATIARNVSQAYTDALRWVAVM-LGKPEDTEVEFRLNM 415 (489) T ss_pred HHHHHHHHHHHHH--h-hhhccCCcchhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH-cCCCCCCceEEEeec Confidence 7777887777642 2 12233344677788888888889999999999999999999999998 8864 3445777788 Q ss_pred ccccCC--HHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcCccHHHHHHHH-hcCC--- Q lcl|NC_011222. 442 EFYIYT--PEELSERYKIMKETGASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSHLTREEAVNLY-KENV--- 515 (577) Q Consensus 442 kFy~~t--~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~LT~~Ev~~l~-e~g~--- 515 (577) +|.+.+ ++++.+-++...+-.+|...+-. +|+|+.|+ |+ |.+++.+-. +.+. T Consensus 416 dF~~~~~d~~~~~al~~~~~~G~is~~t~~~---------------~L~~~gv~---d~----~~e~~~~ei~~~~~~~~ 473 (489) T protein:vir:78 416 DFFLEPMTAQDRAAWMADINAGLLPATAYYA---------------ALRKAGVT---DW----TDADIKDAVADQPLPVA 473 (489) T ss_pred ccCcccCCHHHHHHHHHHHhcCCCCHHHHHH---------------HHHhCCCC---Cc----cHHHHHHHHhhcCCCcc Confidence 887765 45454444444433366543322 33444332 32 334433322 2211 Q ss_pred ------Cchhh-eeee Q lcl|NC_011222. 516 ------ISEED-LRVK 524 (577) Q Consensus 516 ------~~eEd-l~vk 524 (577) +.++- =.=| T Consensus 474 ~~~~g~~~~~~q~~~~ 489 (489) T protein:vir:78 474 TEVQGEIPQSAQQQEK 489 (489) T ss_pred cCCcccCCCCcccccC Confidence 11000 0001 No 10 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=95.50 E-value=0.002 Score=35.43 Aligned_cols=419 Identities=11% Similarity=0.007 Sum_probs=179.2 Q ss_pred cchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccCCCccchHH Q lcl|NC_011222. 3 KSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVKTNGVTSE 82 (577) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~t~~lt~~ 82 (577) -+.|+... |..+.+ .|+.++.-+..+.---+....+. . ...-.|+ .+-.+-.+..||-. +++. T Consensus 1 ~~~~~~~~-------i~~l~~--~~~~~~~r~~~l~~Yy~G~~~i~-~---~~~~~~~-~~~~~k~~~n~~~~---ivd~ 63 (441) T protein:vir:80 1 MNSDELAL-------IEGMYD--RIQRLSSWHCCIEGYYEGSNRVR-D---LGVAIPP-ELQRVQTVVSWPGI---AVDA 63 (441) T ss_pred CCccHHHH-------HHHHHH--HHHHHHHHHHHHHHHHhcCCcch-h---cCcccch-hhhhhhhhcchHHH---HHHH Confidence 22222111 111111 23332222222211000000000 0 0000111 11111112222211 1222 Q ss_pred HHHHHHHHhccCCccccccccC--hhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCccccchhh Q lcl|NC_011222. 83 IFDKLSRVFDGRNPVYNYQFKS--SEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKPEPYFFW 160 (577) Q Consensus 83 iF~~L~kV~dgqd~~~~y~f~~--~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rpqpyf~~ 160 (577) .-+.| +++| |+. .+++++|.. .+++.+..+..+-.+..... ++.+|= .+.+| +|-+-. T Consensus 64 ~~~~l--~~~g--------~~~~d~~~l~~i~~----~n~~~~~~~~~~~~~~~~G~-a~~~v~---~d~~g--~~~i~~ 123 (441) T protein:vir:80 64 LEERL--DWLG--------WTNGDGYGLDGVYA----ANRLATASCDVHLDALIFGL-SFVAII---PHGDG--TVSVRP 123 (441) T ss_pred HHhhh--cccc--------ccCCChHHHHHHHH----hcCHHHHHHHHHHHHhhcCe-eEEEEE---eCCCC--ceEEEE Confidence 22222 1222 223 234544422 35677777666666666553 666552 33344 444444 Q ss_pred hhHHHHHH-HHhhcccccceeeeeeecCCCE--EEEEeccceeeeccCCccceeeehhhhhhhcccceeeEecccccccC Q lcl|NC_011222. 161 LPIANVLS-YRTCGKDCNLMAYIMYVTDENK--IVYIDEERYVRFDKTRENDLILEVDNMHDLGYCPARFFWSDSISLSE 237 (577) Q Consensus 161 ~pie~V~~-y~~~~~~~~~i~~i~~~qD~N~--i~~IDd~~y~~y~kn~k~ei~~e~~~~H~lGy~PA~~~~gD~~~~sk 237 (577) .+-++++. |............+.+..++.. +.+-.......|..++.+.|......-|.||.||.--|. -+-..+. T Consensus 124 ~~p~~~~~i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-n~~~~~~ 202 (441) T protein:vir:80 124 QSPKNCTGKFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIV-NRRRTSR 202 (441) T ss_pred EccceEEEEEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEee-ccccCCc Confidence 45555442 3322221111112222333332 334456666677777777776655666999999994442 1112233 Q ss_pred cceeeccchhH-H---HHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceeccccccccccccccc Q lcl|NC_011222. 238 PDIKISPITSE-L---DSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPM 313 (577) Q Consensus 238 p~ik~SpL~~~-L---~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~ 313 (577) | +-+|-|.+. . +|+|+.++- ...--.+++.|+....- + +-+ .+- T Consensus 203 ~-~G~s~l~~~v~~liDa~~~~~s~---~~~~~~~~~~~~~~i~G--~------~~~----~~~---------------- 250 (441) T protein:vir:80 203 I-DGRSEITRSIRAYTDEAVRTLLG---QSVNRDFYAYPQRWVTG--V------SAD----EFS---------------- 250 (441) T ss_pred c-CCcccchhhHHHHHHHHHHHHHH---HHHHHHhhcCceeeeec--C------Ccc----ccc---------------- Confidence 3 456766652 2 555555553 33333456677654421 0 000 000 Q ss_pred ccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCCCc---cccchhhc Q lcl|NC_011222. 314 ACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGEGE---LNRSEAIN 390 (577) Q Consensus 314 ~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~~d---~q~~kA~n 390 (577) ..+....+|.++.+|-. +..+ -+.+.+.+.++++-+.+..+.....|. +.++...+ .+.+-.++ T Consensus 251 -----~~~~~~~~~~i~~~~~~-~~~~------~~~~~~~~~~~~~~~~~~l~~~i~~~~-~~~~~p~~~~g~~~~~~~S 317 (441) T protein:vir:80 251 -----QPGWVLSMASVWAVDKD-DDGD------TPNVGSFPVNSPTPYSDQMRLLAQLTA-GEAAVPERYFGFITSNPPS 317 (441) T ss_pred -----cchhhhcccccccCCCC-CCCC------cceeEecCccchHHHHHHHHHHHHHHh-cccCCCHHHhccCCCcchH Confidence 11233456666655542 1111 234444455666666665555444443 11121000 01122346 Q ss_pred ccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcc---c--ccccccCCcccccCCHHHHHHHHHHHH---HcC Q lcl|NC_011222. 391 EKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDS---F--VSCNINYGTEFYIYTPEELSERYKIMK---ETG 462 (577) Q Consensus 391 e~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~---~--~~~ti~yGskFy~~t~eeL~~~i~~Ak---~~G 462 (577) +.+.+.-...+.+++....+-|++.-+=+...++++.-... . ...++ +|..--|..++++.+++. .+| T Consensus 318 g~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~~~~~~~~~~~~~i~~----~f~~~~~~~~~e~ad~~~kl~~~g 393 (441) T protein:vir:80 318 GEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKALDSRVDEADFFGDVGL----RWRDASTPTRAATADAVTKLVGAG 393 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccceeeeE----EeCCCCCcCHHHHHHHHHHHHhcC Confidence 77777888888888888888888877766677777642211 1 12233 466666777777776654 356 Q ss_pred CCHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcCccHHHHHHH--HhcCCCchhh Q lcl|NC_011222. 463 ASEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSHLTREEAVNL--YKENVISEED 520 (577) Q Consensus 463 as~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~LT~~Ev~~l--~e~g~~~eEd 520 (577) .+......+... .-..|.+++|++-.+. . ..+++..+ ..++-. ++. T Consensus 394 ~~~~s~~~~~~~-----l~~~~~e~~~~~~e~~--e----~~~~~~~~~~~~~~~~-~~~ 441 (441) T protein:vir:80 394 ILPADSRTVLEM-----LGLDDVQVEAVMRHRA--E----SSDPLAVLAGAISRQT-NEV 441 (441) T ss_pred cccccHHHHHHh-----CCCCHHHHHHHHHHHH--H----HHHHHHHHhhhhhccc-ccC Confidence 544333333221 1334556665543321 1 01111111 011111 122 No 11 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=95.29 E-value=0.0024 Score=34.98 Aligned_cols=434 Identities=13% Similarity=0.083 Sum_probs=193.6 Q ss_pred hhCchhHHHHHHHHH------------HHHHHHHHhhhhc---ccccchHHHHHHHHHHHhhcc-hhHHHHHHH----hh Q lcl|NC_011222. 12 YRHPEGISQIAKAKE------------HEERIAFHTRVRT---SDDRNKPVIDFLSKVKTWIAK-DKYDIFLSM----FH 71 (577) Q Consensus 12 ~~~~~~~~~~aka~~------------heeri~fh~~~~t---a~d~~~~~~~fl~~v~~~l~k-dky~~f~~~----f~ 71 (577) ...-.-|||||.|.- |.+ .|+..... ..++...+..|+++-++-+++ +|...+..- ++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~i~~ 78 (492) T protein:vir:97 1 MQFIQLISQVAQALIKGGNILYPSQPTQTE--IFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVK 78 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeeccchhhhh--HhhhcccCCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccc Confidence 333456888888753 333 34444433 233334444444333222221 111111111 11 Q ss_pred ccCCC----------------ccchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHh Q lcl|NC_011222. 72 FPVKT----------------NGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKH 135 (577) Q Consensus 72 fpv~t----------------~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~ 135 (577) -|.+. ..+.+-|=+...--+=|.-. +|.=.+.+..+-|..|++ +++-+..+.-+=++... T Consensus 79 ~~~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~--~~~~~d~~~~~~l~~~~~--n~~~~~~~~~~~~~~~~ 154 (492) T protein:vir:97 79 EPKPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPI--AFKHTDDEVVKRIDEVLG--NRFDDKLHSVLTGASNK 154 (492) T ss_pred ccccccccccccccccccccccchHHHHHHHHhhhhcccCc--eeccCchHHHHHHHHHHh--ccHHHHHHHHHHHHhhc Confidence 11110 11111111111111112222 233345566666666653 35556665444444444 Q ss_pred CCCeEEEEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCcccee-- Q lcl|NC_011222. 136 RINSVLVVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTRENDLI-- 211 (577) Q Consensus 136 ~~NgviVVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~ei~-- 211 (577) =.++++|- .+.+++|. +-.++-++++..-......+.+.++-+ .++..++-+.++.....|...+ +++. T Consensus 155 -G~a~~~v~---~d~dg~~~--~~~~~p~~~~~i~d~~~~~~~~~~vr~~~~~~~~~~~~y~~~~v~~~~~~~-~~~~~~ 227 (492) T protein:vir:97 155 -GIEWLHPY---LDEEGEFK--LFRVPAEQGIPIWTDKEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN-GSLIPD 227 (492) T ss_pred -CeEEEEEE---ecCCCceE--EEEEcccceEEEEcCCCCCceEEEEEEEeeccceeEEEEecCeEEEEEEec-Ceeeec Confidence 35777775 44555543 222233444332221112345555544 4555566666776666665444 2211 Q ss_pred -------eehh-hhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCc Q lcl|NC_011222. 212 -------LEVD-NMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHY 283 (577) Q Consensus 212 -------~e~~-~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~ 283 (577) .+++ ..|.||.||.--|..+. .-.|=++.+++..|.|=...+....-=.+++-|+.+...- T Consensus 228 ~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-------~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~---- 296 (492) T protein:vir:97 228 YSNNLENSKTHFSTGSWGKIPFIPFKNND-------LEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNY---- 296 (492) T ss_pred ccccccccccccccCCCCCcceEEecCCC-------CCCCchHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecC---- Confidence 1222 33999999985553332 2344455555444544444444444335678888776421 Q ss_pred cCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhh Q lcl|NC_011222. 284 ESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVN 363 (577) Q Consensus 284 ~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~ 363 (577) .+.. .+ ++.. -...+..+.+ | ++.| |++++.+.. .+-... T Consensus 297 ---~~~~---~~-------~~~~----------------~~~~~~~~~~--~----~~~~----~~~l~~~~~-~~~~~~ 336 (492) T protein:vir:97 297 ---DDQE---LP-------EFKR----------------LLRYYGAIKV--S----DNGG----VDTIQVEVP-VENSKK 336 (492) T ss_pred ---Cccc---ch-------hHHH----------------HHhhccceec--C----CCCc----ceeEeccCC-HHHHHH Confidence 0100 00 0000 0112222222 2 1223 344443332 133344 Q ss_pred HHHHHHHHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcc Q lcl|NC_011222. 364 EEKRLRDELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTE 442 (577) Q Consensus 364 ~~kri~d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGsk 442 (577) ..+++.+.|+...-.-. .+.+.+-..++.+.+.-+-.+..++.+..+-|++.-.-+..+++++. |.+.=..+|. -. T Consensus 337 ~~~~L~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~-~~~~~~~~i~--v~ 413 (492) T protein:vir:97 337 YLDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-DIKGEHKDVD--IS 413 (492) T ss_pred HHHHHHHHHHHHhCCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCcccceee--EE Confidence 46677777776652211 11123345788888888899999999999999998888888888763 3221111222 45 Q ss_pred cccCCHHHHHHHHHHHHH-cCC-CHHHHHHHHHHHHHHh-hcCCH-HHHHHHHH-----HHhhCCCcCccHHHHHHHHhc Q lcl|NC_011222. 443 FYIYTPEELSERYKIMKE-TGA-SEAELDALRQQIIETE-YRNDP-TQMQRLLI-----LNEIEPYSHLTREEAVNLYKE 513 (577) Q Consensus 443 Fy~~t~eeL~~~i~~Ak~-~Ga-s~~~i~~L~~qi~e~E-yrNnP-~qmqr~~v-----L~~leP~~~LT~~Ev~~l~e~ 513 (577) |....|..+++.++++.+ .|+ |...+ ++.- +=.|| .+|+|++. .+.+....+-..+...+.-+. T Consensus 414 f~~~~p~~~~e~a~~~~kl~G~iS~et~-------l~~l~~v~d~~~Eleri~~E~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (492) T protein:vir:97 414 FNYNKVANTELQVQTAQQSMGIVSHETV-------LENHPFVEDLQAELERIEQEQTEYNKQLPNLDDGGADSAQQQERS 486 (492) T ss_pred ecCCCCCCHHHHHHHHHHHhccCchHHH-------HHhCCCCCCHHHHHHHHHHHHHHHHHhhhccccCCCCCCcccccc Confidence 777788888887776654 352 22211 1100 11232 23444322 111111111111111111111 Q ss_pred CCCchh Q lcl|NC_011222. 514 NVISEE 519 (577) Q Consensus 514 g~~~eE 519 (577) +--.+| T Consensus 487 ~~~~~e 492 (492) T protein:vir:97 487 NNKESE 492 (492) T ss_pred cccccC Confidence 111222 No 12 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=93.98 E-value=0.0058 Score=32.86 Aligned_cols=430 Identities=11% Similarity=0.049 Sum_probs=186.9 Q ss_pred HHHHhhCchhHH---HHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcc-hhHHHHHHHhh----ccC----- Q lcl|NC_011222. 8 IREIYRHPEGIS---QIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAK-DKYDIFLSMFH----FPV----- 74 (577) Q Consensus 8 ~~~~~~~~~~~~---~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~k-dky~~f~~~f~----fpv----- 74 (577) .-+++|-|-+-+ ++-+++.-| ..++...+..++..-++-+++ +|+..+..-.| -+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~r~~~~~~~ 69 (474) T protein:vir:95 1 MFNIIRMPWDKPYGEEVVEQLKPQ-----------FETQEEMIIRLIDDHRKQLDKITVGQRYYDKDNDIVKQMKKVDVY 69 (474) T ss_pred CcceeecCCCCchhhHHHHhhhhc-----------cCChHHHHHHHHHHHHHHHHHHHHHHHHhcccCchhccccccccc Confidence 345555555411 111111111 111122222222211111100 11222211111 000 Q ss_pred -------CCc----cchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEE Q lcl|NC_011222. 75 -------KTN----GVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVV 143 (577) Q Consensus 75 -------~t~----~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVV 143 (577) +.| .+...|=+....-+-|.... |.=.+.+..+-|..|+. .++.+....- .+....-=.+++.| T Consensus 70 ~~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~--~~~~d~~~~~~l~~~~~--n~~~~~~~e~-~~~~~~~G~~~~~v 144 (474) T protein:vir:95 70 GNIDYDKPDWRITTNFHQNLVDQKVSYVASKPVT--YSCEDESVLKIIHDVLD--TRWDNKLIDI-LTATSNKGIDWLQV 144 (474) T ss_pred cccccccccceeccchHHHHHHHHHhhhccCCce--eccCchHHHHHHHHHHh--ccHHHHHHHH-HHHHhhcCcEEEEE Confidence 011 12222222222222233332 33334444445555542 3444433333 33333333567777 Q ss_pred eeccccCCCccccchhhhhHHHHH-HHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCccceee-------- Q lcl|NC_011222. 144 DMPEVQVGEKPEPYFFWLPIANVL-SYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTRENDLIL-------- 212 (577) Q Consensus 144 Dm~~i~~~~rpqpyf~~~pie~V~-~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~ei~~-------- 212 (577) . ++.++++. +...-| .+++ .|.. ....+.+.++-+ ..+..++-+.+++.+..|...+ +.+.. T Consensus 145 ~---~d~~~~~~-i~~~~p-~~~~~v~d~-~~~~~~~~~i~~~~~~~~~~~~~y~~~~~~~~~~~~-~~~~~~~~~~~~~ 217 (474) T protein:vir:95 145 Y---INENGEMK-LFRVPA-EQAIPIWVD-KEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLEN-GGLIPDYYYGANH 217 (474) T ss_pred E---ecCCCceE-EEEEcc-cceEEEEcC-CCCCceEEEEEEEEEcCeeEEEEEeCCeEEEEEEcC-CccccccccCccc Confidence 5 55666653 222223 3333 2322 112234555555 4455556667777666676555 22221 Q ss_pred -e-hhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCc Q lcl|NC_011222. 213 -E-VDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKE 290 (577) Q Consensus 213 -e-~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~ 290 (577) + ....|.||.||---|+... .-.|=++.+.+..|+|=...+....-=.+++-|++.... ..|.+ T Consensus 218 ~~~~~~~~~~g~iPvv~~~nn~-------~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g-------~~~~~ 283 (474) T protein:vir:95 218 IQSHFSNGNWGRVPFIAFKNNP-------EEVSDIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKG-------YEGQD 283 (474) T ss_pred ccccccccCCCccceEeecCCC-------CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-------CCccc Confidence 1 2234999999985554333 334445555544444433333333322466777766431 11111 Q ss_pred cccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHH Q lcl|NC_011222. 291 RCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRD 370 (577) Q Consensus 291 ~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d 370 (577) . .++.. -...+..+.+ | ++.| |++++.++ ..+-....++++.+ T Consensus 284 ~--~~~~~------------------------~~~~~~~i~~--~----~~~~----~~~l~~~~-~~~~~~~~~~~l~~ 326 (474) T protein:vir:95 284 L--EEFMR------------------------GLKYYKAINV--D----GDGG----VETIQVEV-PVSSTKEYIDLMRA 326 (474) T ss_pred c--hhhhh------------------------hhhccceeec--c----CCCc----eeEEeecC-CHHHHHHHHHHHHH Confidence 0 00000 0112223332 2 2223 44554443 23444455677888 Q ss_pred HHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHH Q lcl|NC_011222. 371 ELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPE 449 (577) Q Consensus 371 ~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~e 449 (577) .|+...-+.. ...+.+-+.|+++.+..+-++..++.+..+.|++.-+-++.+++++. |..+=...|. -.|....|. T Consensus 327 ~i~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~-g~~~d~~~i~--v~f~~~~p~ 403 (474) T protein:vir:95 327 YIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFN-NLKMDVKDIE--ISFNFNRMM 403 (474) T ss_pred HHHHHhCCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcccceee--EEeccCCCc Confidence 8887653322 11223346788899999999999999999999999888888888874 3221111221 347777888 Q ss_pred HHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCH-HHHHHHHH-----HHhhCCCcCccHHHHHHHHhcCCCchh Q lcl|NC_011222. 450 ELSERYKIMKETGASEAELDALRQQIIETEYRNDP-TQMQRLLI-----LNEIEPYSHLTREEAVNLYKENVISEE 519 (577) Q Consensus 450 eL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP-~qmqr~~v-----L~~leP~~~LT~~Ev~~l~e~g~~~eE 519 (577) .+++++++|.++|+=+.+-..-+. -+=.|| .+|+|++- .+.+.++.....+.-.+--+.+--..| T Consensus 404 d~~e~a~~~~~~g~iS~et~i~~l-----~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~ 474 (474) T protein:vir:95 404 NDAEQSQIIAQSQYLSRETLVKSS-----PLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNDKESE 474 (474) T ss_pred CHHHHHHHHHhcCCCchHHHHHhC-----CCCCCHHHHHHHHHHHHHHHHhcccccccccCCCCcCCCCCccCCCC Confidence 899999999988853322111111 122244 34444331 111222211111000000000001111 No 13 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=93.27 E-value=0.0081 Score=32.04 Aligned_cols=428 Identities=14% Similarity=0.074 Sum_probs=179.3 Q ss_pred HHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhc--------------- Q lcl|NC_011222. 8 IREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHF--------------- 72 (577) Q Consensus 8 ~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~f--------------- 72 (577) .-++.|.|-+=.-.+.. =+|+.+ .+.+....+..|.++-.+-++ +...+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~~~-----~~~~~~~~i~~~i~~~~~~~~-----~~~~l~~Yy~g~~~i~~~~~~~~ 67 (474) T protein:vir:95 1 MINIIRMPWDKPYGEEV---VEQMKP-----KVETQEEMIIRLINNHKQKLK-----DINVGQKYYDKDNDINYQAYKQD 67 (474) T ss_pred CcccccCCCCCCCCcch---hhhccc-----cccchHHHHHHHHHHHHHHHH-----HHHHHHHHhcccCccccccchhh Confidence 22333333221111000 001100 011111222222221111111 00011100 Q ss_pred ------cC-CCccch----HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEE Q lcl|NC_011222. 73 ------PV-KTNGVT----SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVL 141 (577) Q Consensus 73 ------pv-~t~~lt----~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~Ngvi 141 (577) +- +.+-|+ +-|=+...-.+=|+...++. .+.+..+-++.|+. .++.+....- .+....-=.+++ T Consensus 68 ~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~--~~~~~~~~l~~~~~--n~~~~~~~~l-~~~~~~~G~~~~ 142 (474) T protein:vir:95 68 LHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH--DDDKVLDVIHQVLD--TRWDNKLIDI-LTAASNKGIDWL 142 (474) T ss_pred hcccccccccccccccchHHHHHHhhhhhhcccCceecc--CChHHHHHHHHHHh--ccHHHHHHHH-HHHHhhCCeEEE Confidence 00 111122 11211111112233333322 34444455555542 3444444443 344444444666 Q ss_pred EEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeeeecCCC--EEEEEeccceeeeccCCcc---------ce Q lcl|NC_011222. 142 VVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMYVTDEN--KIVYIDEERYVRFDKTREN---------DL 210 (577) Q Consensus 142 VVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~~qD~N--~i~~IDd~~y~~y~kn~k~---------ei 210 (577) .|- ++.++++. +-..+-++++..-......+.+.++-+.+... ++-+-++..+..|...+.+ ++ T Consensus 143 ~~~---~d~~~~~~--i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~ 217 (474) T protein:vir:95 143 QVY---INEDGELK--LFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEH 217 (474) T ss_pred Eee---eCCCCceE--EEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeecccccccc Confidence 665 44555543 32234444443322222345566665533333 3444455555555433321 11 Q ss_pred eeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCc Q lcl|NC_011222. 211 ILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKE 290 (577) Q Consensus 211 ~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~ 290 (577) ......-|.||.||..-|+....+.+ ++ .++-..++|+|+.++.. ..-=.+++-|+...... .|.. T Consensus 218 ~~~~~~~~~~~~vPvv~~~nn~~~~~--d~--e~v~~liDa~d~~~S~~---~~~~~~~~~p~lv~~g~-------~~~~ 283 (474) T protein:vir:95 218 IQTHFSTGSWERVPFIAFKNNPEEVS--DI--WMYKSFVDAIDKRLSDV---QNMFDESVELIYILRGY-------EGED 283 (474) T ss_pred ccCcccccCCCccceEEecCCCCCCC--ch--HHHHHHHHHHHHHHHHH---HHHHHHhhcchhhhcCC-------Cccc Confidence 11122239999999966655433333 22 23334445555555433 22223555666654320 1100 Q ss_pred cccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHH Q lcl|NC_011222. 291 RCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRD 370 (577) Q Consensus 291 ~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d 370 (577) .| +.... ...+..+.+ | ++.| |++++++... +......+++.+ T Consensus 284 ---~~-------~~~~~----------------~~~~~~i~~--~----~~~~----~~~l~~~~~~-~~~~~~~~~l~~ 326 (474) T protein:vir:95 284 ---LS-------EFMEG----------------LKYYKAINV--S----SDGG----VETIQVEVPV-ASTKEYLDMMRA 326 (474) T ss_pred ---cc-------chhhh----------------hhccceeec--c----CCCc----eeEEeccCCH-HHHHHHHHHHHH Confidence 00 00000 111122222 2 2233 4444443322 334445677777 Q ss_pred HHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHH Q lcl|NC_011222. 371 ELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPE 449 (577) Q Consensus 371 ~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~e 449 (577) .|+.-.-+.. .+.+.+-+.|+++.+.-+-.+.+++....+-|+++-.-++.+++++. |..+=...| .-.|..-.|. T Consensus 327 ~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~-g~~~d~~~i--~i~f~~~~p~ 403 (474) T protein:vir:95 327 YIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFN-KIKLDAKEI--EITFNFNVMV 403 (474) T ss_pred HHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCccccee--eEEecCCCcc Confidence 7877653321 22233346788888888888889988888889888888888888753 322211122 2457888889 Q ss_pred HHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCH-HHHHHHHH-----HHhhCCCcCccHHHHHHHHhcC--CCchhh Q lcl|NC_011222. 450 ELSERYKIMKETGASEAELDALRQQIIETEYRNDP-TQMQRLLI-----LNEIEPYSHLTREEAVNLYKEN--VISEED 520 (577) Q Consensus 450 eL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP-~qmqr~~v-----L~~leP~~~LT~~Ev~~l~e~g--~~~eEd 520 (577) .+++..+.|++.|+-+-+-..-+.- +=.|| .+|+|++- .+.+....+ .++-...+.+ --+++. T Consensus 404 ~~~e~a~~~~~~giiS~et~~~~lp-----~v~D~~~E~eri~~E~~~~~~~~~~~~~---~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 404 NDLEQSQIGAQSQYLSKETLVRHHP-----WVDDPKAELERLDEEQLELNKQLPNLDD---GGADGAQQQQQSENNQSK 474 (474) T ss_pred CHHHHHHHHHHcCCCChHHHHHhCC-----CCCCHHHHHHHHHHHHHHHHhhcccccc---ccCCCCCCcCCCCccccC Confidence 9999999998888654332211111 11233 33454421 111111111 1111111100 001111 No 14 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=93.27 E-value=0.0081 Score=32.04 Aligned_cols=428 Identities=14% Similarity=0.074 Sum_probs=179.3 Q ss_pred HHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhc--------------- Q lcl|NC_011222. 8 IREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHF--------------- 72 (577) Q Consensus 8 ~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~f--------------- 72 (577) .-++.|.|-+=.-.+.. =+|+.+ .+.+....+..|.++-.+-++ +...+.+| T Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~~~-----~~~~~~~~i~~~i~~~~~~~~-----~~~~l~~Yy~g~~~i~~~~~~~~ 67 (474) T protein:vir:96 1 MINIIRMPWDKPYGEEV---VEQMKP-----KVETQEEMIIRLINNHKQKLK-----DINVGQKYYDKDNDINYQAYKQD 67 (474) T ss_pred CcccccCCCCCCCCcch---hhhccc-----cccchHHHHHHHHHHHHHHHH-----HHHHHHHHhcccCccccccchhh Confidence 22333333221111000 001100 011111222222221111111 00011100 Q ss_pred ------cC-CCccch----HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEE Q lcl|NC_011222. 73 ------PV-KTNGVT----SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVL 141 (577) Q Consensus 73 ------pv-~t~~lt----~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~Ngvi 141 (577) +- +.+-|+ +-|=+...-.+=|+...++. .+.+..+-++.|+. .++.+....- .+....-=.+++ T Consensus 68 ~~~~~~~~~~~~ki~~n~~k~Iv~~~~~yl~g~p~~~~~--~~~~~~~~l~~~~~--n~~~~~~~~l-~~~~~~~G~~~~ 142 (474) T protein:vir:96 68 LHGNIDYTKPDWRITTNFHQNLVDQKVSYVAGKPVTYAH--DDDKVLDVIHQVLD--TRWDNKLIDI-LTAASNKGIDWL 142 (474) T ss_pred hcccccccccccccccchHHHHHHhhhhhhcccCceecc--CChHHHHHHHHHHh--ccHHHHHHHH-HHHHhhCCeEEE Confidence 00 111122 11211111112233333322 34444455555542 3444444443 344444444666 Q ss_pred EEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeeeecCCC--EEEEEeccceeeeccCCcc---------ce Q lcl|NC_011222. 142 VVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMYVTDEN--KIVYIDEERYVRFDKTREN---------DL 210 (577) Q Consensus 142 VVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~~qD~N--~i~~IDd~~y~~y~kn~k~---------ei 210 (577) .|- ++.++++. +-..+-++++..-......+.+.++-+.+... ++-+-++..+..|...+.+ ++ T Consensus 143 ~~~---~d~~~~~~--i~~~~p~~~~~v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~ 217 (474) T protein:vir:96 143 QVY---INEDGELK--LFRVPAEQAIPIWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEH 217 (474) T ss_pred Eee---eCCCCceE--EEEEcccceEEEEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeecccccccc Confidence 665 44555543 32234444443322222345566665533333 3444455555555433321 11 Q ss_pred eeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCc Q lcl|NC_011222. 211 ILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKE 290 (577) Q Consensus 211 ~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~ 290 (577) ......-|.||.||..-|+....+.+ ++ .++-..++|+|+.++.. ..-=.+++-|+...... .|.. T Consensus 218 ~~~~~~~~~~~~vPvv~~~nn~~~~~--d~--e~v~~liDa~d~~~S~~---~~~~~~~~~p~lv~~g~-------~~~~ 283 (474) T protein:vir:96 218 IQTHFSTGSWERVPFIAFKNNPEEVS--DI--WMYKSFVDAIDKRLSDV---QNMFDESVELIYILRGY-------EGED 283 (474) T ss_pred ccCcccccCCCccceEEecCCCCCCC--ch--HHHHHHHHHHHHHHHHH---HHHHHHhhcchhhhcCC-------Cccc Confidence 11122239999999966655433333 22 23334445555555433 22223555666654320 1100 Q ss_pred cccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHH Q lcl|NC_011222. 291 RCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRD 370 (577) Q Consensus 291 ~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d 370 (577) .| +.... ...+..+.+ | ++.| |++++++... +......+++.+ T Consensus 284 ---~~-------~~~~~----------------~~~~~~i~~--~----~~~~----~~~l~~~~~~-~~~~~~~~~l~~ 326 (474) T protein:vir:96 284 ---LS-------EFMEG----------------LKYYKAINV--S----SDGG----VETIQVEVPV-ASTKEYLDMMRA 326 (474) T ss_pred ---cc-------chhhh----------------hhccceeec--c----CCCc----eeEEeccCCH-HHHHHHHHHHHH Confidence 00 00000 111122222 2 2233 4444443322 334445677777 Q ss_pred HHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHH Q lcl|NC_011222. 371 ELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPE 449 (577) Q Consensus 371 ~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~e 449 (577) .|+.-.-+.. .+.+.+-+.|+++.+.-+-.+.+++....+-|+++-.-++.+++++. |..+=...| .-.|..-.|. T Consensus 327 ~I~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~-g~~~d~~~i--~i~f~~~~p~ 403 (474) T protein:vir:96 327 YIVEFGQGVDFQTDKFGSATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFN-KIKLDAKEI--EITFNFNVMV 403 (474) T ss_pred HHHHHhCCcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCccccee--eEEecCCCcc Confidence 7877653321 22233346788888888888889988888889888888888888753 322211122 2457888889 Q ss_pred HHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCH-HHHHHHHH-----HHhhCCCcCccHHHHHHHHhcC--CCchhh Q lcl|NC_011222. 450 ELSERYKIMKETGASEAELDALRQQIIETEYRNDP-TQMQRLLI-----LNEIEPYSHLTREEAVNLYKEN--VISEED 520 (577) Q Consensus 450 eL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP-~qmqr~~v-----L~~leP~~~LT~~Ev~~l~e~g--~~~eEd 520 (577) .+++..+.|++.|+-+-+-..-+.- +=.|| .+|+|++- .+.+....+ .++-...+.+ --+++. T Consensus 404 ~~~e~a~~~~~~giiS~et~~~~lp-----~v~D~~~E~eri~~E~~~~~~~~~~~~~---~~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 404 NDLEQSQIGAQSQYLSKETLVRHHP-----WVDDPKAELERLDEEQLELNKQLPNLDD---GGADGAQQQQQSENNQSK 474 (474) T ss_pred CHHHHHHHHHHcCCCChHHHHHhCC-----CCCCHHHHHHHHHHHHHHHHhhcccccc---ccCCCCCCcCCCCccccC Confidence 9999999998888654332211111 11233 33454421 111111111 1111111100 001111 No 15 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=93.24 E-value=0.0083 Score=32.01 Aligned_cols=414 Identities=12% Similarity=0.061 Sum_probs=182.0 Q ss_pred cchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccCCCccchHH Q lcl|NC_011222. 3 KSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVKTNGVTSE 82 (577) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~t~~lt~~ 82 (577) .+.+.|++...+ |..|+.=..++.- ||.-= +.|.+--+++++ -+.+.++.- T Consensus 1 l~~~~l~~~i~~------------~~~~~~r~~~l~~------yy~g~-~~il~~~~~~~~----------~~~~ki~~n 51 (429) T protein:vir:98 1 MTKDLLSELIQK------------HRSFNLSYSAYKQ------LYEGD-HAILQQKQKEQY----------KPDNRLVVN 51 (429) T ss_pred CCHHHHHHHHHH------------HHHHHHHHHHHHH------Hhccc-cccccccccccC----------CCcceeecc Confidence 455666655432 3332211111100 11100 000000111111 111222222 Q ss_pred HHHHHHHHh----ccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCccccch Q lcl|NC_011222. 83 IFDKLSRVF----DGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKPEPYF 158 (577) Q Consensus 83 iF~~L~kV~----dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rpqpyf 158 (577) ....+.... =|.... |.-.+.+..+-|..+.+ .+++-.....-+=++.... -++++|- .+.+|+|.=- T Consensus 52 ~~~~ivd~~~~~l~g~~~~--~~~~~~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~G-~~~~~v~---~d~~g~~~~~- 123 (429) T protein:vir:98 52 FAKYIVDTFNGYFIGVPVQ--TSHENKQVSNYLELLDG-YNDQDDNNAELSKICSIYG-HGYELVF---NDENAEAGIT- 123 (429) T ss_pred hHHHHHHHHhhhhcccCce--eecCChHHHHHHHHHHh-hcCHhHHHHHHHHHHhhcC-eEEEEEE---ecCCCcEEEE- Confidence 222222211 122222 22344445555555543 3566665555444444444 5777775 4556665422 Q ss_pred hhhhHHHHHHHHhhcccccceeeeeeecCC--CE-EEEEeccceeeeccCCccceeeehhhhhhhcccceeeEecccccc Q lcl|NC_011222. 159 FWLPIANVLSYRTCGKDCNLMAYIMYVTDE--NK-IVYIDEERYVRFDKTRENDLILEVDNMHDLGYCPARFFWSDSISL 235 (577) Q Consensus 159 ~~~pie~V~~y~~~~~~~~~i~~i~~~qD~--N~-i~~IDd~~y~~y~kn~k~ei~~e~~~~H~lGy~PA~~~~gD~~~~ 235 (577) ..-|.+....|.... +.+.+.++-+..+. .. ..+.+++.+..|... .+.+.+....-|.||.||---+-- T Consensus 124 ~~~p~~~~~v~dd~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~g~vPvv~~~n----- 196 (429) T protein:vir:98 124 YLTPLEAFIVYDDSI-RQKPLFAVRYFYNKGGVLEGSYSDASNITYFKDG-EKGIEIGESEPHPFDGVPMIEYVE----- 196 (429) T ss_pred EEcccceEEEEeCCC-CCceEEEEEEEEecCceEEEEEEeCceEEEEEec-CCceEecccccccCCccceEEecC----- Confidence 233544433343222 22345555553332 22 445555656666544 466777777779999999843322 Q ss_pred cCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceeccccccccccccccccc Q lcl|NC_011222. 236 SEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMAC 315 (577) Q Consensus 236 skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~C 315 (577) + + .-.|=++.+.+..|+|=...+....--.|++-|+....- ..+.. .++.+ T Consensus 197 ~-~-~g~sd~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g-------~~~~~----~~~~~---------------- 247 (429) T protein:vir:98 197 N-E-ERQSLLASVVTLINAFNKAISEKANDVEYFADAYLKILG-------AELDD----ETLKS---------------- 247 (429) T ss_pred C-C-CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-------CCCCc----chhhh---------------- Confidence 2 1 344555555555555444444443333566777766421 11111 11111 Q ss_pred CCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCCCccccch--hhcccc Q lcl|NC_011222. 316 PICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGEGELNRSE--AINEKQ 393 (577) Q Consensus 316 p~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~k--A~ne~~ 393 (577) +..+.+ +.+|+.+..++| |++++++.. .+-.....+++.+.|+...-+ ++++.+. ..|+.+ T Consensus 248 --------~~~~~~--~~~~~~~~~~~~----~~~l~~~~~-~~~~~~~~~~l~~~i~~~s~~--p~~~~~~~gn~Sg~A 310 (429) T protein:vir:98 248 --------LRDTRI--INLKDTDAQQLT----VEFLQKPDA-DATQEHLLDRLENLIFRTAMV--ANISDESFGTASGIA 310 (429) T ss_pred --------HhhCce--eeccCCCCCCcc----eeEEeecCC-HHHHHHHHHHHHHHHHHHhCc--cccCccccccchHHH Confidence 111122 334444444444 555554442 233444566777777766532 2222222 347777 Q ss_pred eeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHh--hcCccc--ccccccCCcccccCCHHHHHHHHHHHHHc-CCCHHHH Q lcl|NC_011222. 394 VKAGFESLTTKLNRIKRGFEEAQTFVDSTICLL--RYGDSF--VSCNINYGTEFYIYTPEELSERYKIMKET-GASEAEL 468 (577) Q Consensus 394 V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~l--Ryg~~~--~~~ti~yGskFy~~t~eeL~~~i~~Ak~~-Gas~~~i 468 (577) .....-.+.+++.+..+-|++.-+-+...++++ .-+.++ ...+ -+|....|..+.+...++.+. |+=+-+- T Consensus 311 l~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~d~~~i~----v~f~~~~p~~~~~~a~~~~kl~g~is~et 386 (429) T protein:vir:98 311 LRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYPTSKIGPKDWIGIK----YKFTRNLPANLLEESQIAGNLAGIVSEET 386 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccccccce----EEeCCCCCcCHHHHHHHHHHHhccCchHH Confidence 888888888888888888888777777777765 111111 1112 346666677777776655443 2211111 Q ss_pred HHHHHHHHHHhhcCCH-HHHHHHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeeecchhhhhhhhccCCchhh Q lcl|NC_011222. 469 DALRQQIIETEYRNDP-TQMQRLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKLNLPTFVRRFERENMNIIE 543 (577) Q Consensus 469 ~~L~~qi~e~EyrNnP-~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~n~~~fv~rfe~en~~i~e 543 (577) .. .. --+=.|| .+|+|++.=+.- ..-.+...+- +.++.++.| T Consensus 387 ~~--~~---l~~v~d~~~E~~ri~~E~~~-----~~~~~~~~~~-----------------------~~~~~~~~~ 429 (429) T protein:vir:98 387 QV--GV---LSIVENPQKEIERKNSDKST-----LISRQAGGLN-----------------------GQNTTTILE 429 (429) T ss_pred HH--Hh---CCCCCCHHHHHHHHHHHHHH-----HHHHHHhhhc-----------------------CCCCCCCCC Confidence 00 00 0122233 344444432210 0001111121 223333333 No 16 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=91.45 E-value=0.016 Score=30.46 Aligned_cols=420 Identities=12% Similarity=0.092 Sum_probs=176.1 Q ss_pred CC----cchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccCCC Q lcl|NC_011222. 1 MG----KSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVKT 76 (577) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~t 76 (577) |- -.-++|+++.+ .|.+|+. |+.+. ..||.- -+.|.+--++.+. -+. T Consensus 11 ~~~~~~~~~~~i~~~i~------------~~~~~~~---r~~~~---~~yy~g-~~~i~~~~~~~~~----------~~~ 61 (453) T protein:vir:73 11 YSRDEEITDKVVNDFMK------------KHQEEVE---RYEYL---GNMYKG-IMEISSQKAKDSW----------KPD 61 (453) T ss_pred ccccccCCHHHHHHHHH------------HHHHHHH---HHHHH---HHHhcc-ccchhcCCCCCcc----------Ccc Confidence 11 11233333322 2444331 11111 111111 0111111111110 012 Q ss_pred ccch----HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCC Q lcl|NC_011222. 77 NGVT----SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGE 152 (577) Q Consensus 77 ~~lt----~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~ 152 (577) +-++ .-|=+...-.+-|....++ -.+.+..+.|..+. ..+++.+..+.-+-++.... .++++|- ...++ T Consensus 62 ~ki~~n~~~~ivd~~~~~l~g~~~~~~--~~d~~~~~~l~~~~-~~n~~~~~~~~~~~~~~~~G-~~~~~v~---~d~~~ 134 (453) T protein:vir:73 62 NRLTNNFAKYIVDTFVGYFNGIPIKKT--HDDKSVLEAMQLFD-NLNDMEDEESELAKIACVYG-RAYELMY---QNEST 134 (453) T ss_pred ceeecchHHHHHHHhhhhhcccCceee--cCChHHHHHHHHHH-HhcChhHHHHHHHHHHHhcC-eEEEEEE---eCCCC Confidence 2222 2222222222334443333 34555556666665 33567666666555555544 4666663 34455 Q ss_pred ccccchhhhhHHHHHHHHhhcccccceeeeeeecCC-CE--EEEEeccceeeeccCCccceeeehhhhhhhcccceeeEe Q lcl|NC_011222. 153 KPEPYFFWLPIANVLSYRTCGKDCNLMAYIMYVTDE-NK--IVYIDEERYVRFDKTRENDLILEVDNMHDLGYCPARFFW 229 (577) Q Consensus 153 rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~~qD~-N~--i~~IDd~~y~~y~kn~k~ei~~e~~~~H~lGy~PA~~~~ 229 (577) .|. .-..-|.+....|.. ..+...+..+.+..+. +. +.+-++..+.+|..++ ++|.+....-|.||.||.--|. T Consensus 135 ~~~-i~~~~p~~~~~v~dd-~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~-~~~~~~~~~~~~~g~vPvv~~~ 211 (453) T protein:vir:73 135 ESE-VIYCSPLNVFMVYDD-SIKQKPLFAVYYGFDEEGNLSGTVYTLLETISITGKA-GEVKFGESTYNVYSDLPIVEYN 211 (453) T ss_pred ceE-EEEEcccceEEEEeC-CCCceeEEEEEEEEecCceEEEEEEeCCeEEEEEecC-CceEEccceeccCCceeEEEec Confidence 443 222344444344433 2233445555553333 32 4445677777777655 6676666666999999984443 Q ss_pred cccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceeccccccccccc Q lcl|NC_011222. 230 SDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVD 309 (577) Q Consensus 230 gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~ 309 (577) -+..+.| +| +++-...+|+|+.++... .--.+++-|++...- ..+. +..+++.. T Consensus 212 n~~~g~s--~~--~~v~~liDa~~~~~S~~~---~~~~~~~~~~l~~~g-------~~~~----~~~~~~~~-------- 265 (453) T protein:vir:73 212 FNEERQS--IF--EPVHSLINSYNKVTSEKA---NDVEYFSDQYLVFLG-------AEVD----EEDAKNIK-------- 265 (453) T ss_pred CCCCCCc--ch--hhHHHHHHHHHHHHHHHH---HHHHHhccceeeeec-------CCCC----chhhhccc-------- Confidence 3332222 22 233334455555554333 322366778765421 1100 11111110 Q ss_pred ccccccCCCccccccCccceeeeecCcccccCCCcc-CcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCCCccccc-- Q lcl|NC_011222. 310 GKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLK-NPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGEGELNRS-- 386 (577) Q Consensus 310 ~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr-~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~-- 386 (577) .+..+..+.+....+..+-. .-|++++.+... +-....++++.+.|+...-+ ++++.+ T Consensus 266 ----------------~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~--p~~~~~~~ 326 (453) T protein:vir:73 266 ----------------DNRLINFFDKNSNGQGTNAAKVDVKFLDKPDSD-VQTENLLNRLERSIFQFTMA--ANISDENF 326 (453) T ss_pred ----------------ccccccccccccccccccccCceeEEeeecCCH-HHHHHHHHHHHHHHHHHhCC--cccCcccc Confidence 00111111111111111100 115666655432 33355567777777765422 122211 Q ss_pred hhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh-c-Cccc--ccccccCCcccccCCHHHHHHHHHHH-HHc Q lcl|NC_011222. 387 EAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR-Y-GDSF--VSCNINYGTEFYIYTPEELSERYKIM-KET 461 (577) Q Consensus 387 kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR-y-g~~~--~~~ti~yGskFy~~t~eeL~~~i~~A-k~~ 461 (577) -..|+++.++-.-.+-.++.+..+.|++.-+-+...++++. . |.++ ...+| .|....|..+++...++ |.. T Consensus 327 gn~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~v----~f~~~~p~~~~~~a~~~~k~~ 402 (453) T protein:vir:73 327 GNSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTNASNKDAWKDIEY----TFTRNEPKDIKEQAETANILK 402 (453) T ss_pred cCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceE----EeCCCCCCCHHHHHHHHHHHh Confidence 13577788888888888888888888887777777776652 2 2221 12233 45555566666655544 334 Q ss_pred CCCHHHHHHHHHHHHHHhhcC--CH-HHHHHHHHHHhhCCCcCccHHHHHHHHhc-CCCchhheeeeecc Q lcl|NC_011222. 462 GASEAELDALRQQIIETEYRN--DP-TQMQRLLILNEIEPYSHLTREEAVNLYKE-NVISEEDLRVKLNL 527 (577) Q Consensus 462 Gas~~~i~~L~~qi~e~EyrN--nP-~qmqr~~vL~~leP~~~LT~~Ev~~l~e~-g~~~eEdl~vk~n~ 527 (577) |+=+-+- .+ ....+ || .+|+|++- +=+ |-..+-.. ....+++.- =|+ T Consensus 403 giis~et------~~-~~~~~~~d~~~E~~ri~~--E~~--------~~~~~~~~~~~~~~~~~~--~~~ 453 (453) T protein:vir:73 403 GITSEET------AL-SVISVIPDVQAEMEKIKK--KKL--------LQLSLTRTSNLVRMKQMR--GNL 453 (453) T ss_pred ccCcHHH------HH-HhCCCCCCHHHHHHHHHH--HHH--------HHHHHHHhccCCcchhhh--cCC Confidence 5322111 11 11112 33 23344332 111 11011111 111111111 111 No 17 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=90.62 E-value=0.02 Score=29.91 Aligned_cols=417 Identities=12% Similarity=0.065 Sum_probs=177.2 Q ss_pred CCc----chHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccCCC Q lcl|NC_011222. 1 MGK----SLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVKT 76 (577) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~t 76 (577) |-+ +.+.|.+..+ .|+.++. |+.. ..+||.-- +.|..--++++.. +. T Consensus 11 ~p~d~~~~~~~l~~~i~------------~~~~~~~---r~~~---~~~yy~g~-~~i~~~~~~~~~~----------~~ 61 (453) T protein:vir:39 11 FPKDEPITNEVVTKFME------------KHRLEVA---RYEY---LKNMYRGI-MAIDAEPTKDLWK----------PD 61 (453) T ss_pred cCCCCCCCHHHHHHHHH------------HHHHHHH---HHHH---HHHHhhcc-CchhcCCCccccC----------cc Confidence 222 2222333322 1222211 1100 01111110 0000000111110 11 Q ss_pred ccch----HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCC Q lcl|NC_011222. 77 NGVT----SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGE 152 (577) Q Consensus 77 ~~lt----~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~ 152 (577) +.++ .-|=+.+.-..=|....++. .+.+..+.+..+.+. +++-.....-+=.+.... -++++|- ...++ T Consensus 62 ~ki~~n~~~~ivd~~~~~l~g~~~~~~~--~d~~~~~~l~~i~~~-N~~~~~~~~~~~~~~~~G-~~~~~v~---~d~~g 134 (453) T protein:vir:39 62 NRLTVNFTKYIVDTFTGYFNGIPVKKSH--SDKETLSKLQEFDNL-NDMEDEESELAKMACIYG-RAFELLY---QNEET 134 (453) T ss_pred ceeecchHHHHHHHHhhhhcccCceecc--CChHHHHHHHHHHHh-cChhHHHHHHHHHHhhcC-eEEEEEE---ecCCC Confidence 1121 11111111111122222222 233444444444433 456555555555555555 4666664 34455 Q ss_pred ccccchhhhhHHHHHHHHhhcccccceeeeeeecCCCE---EEEEeccceeeeccCCccceeeehhhhhhhcccceeeEe Q lcl|NC_011222. 153 KPEPYFFWLPIANVLSYRTCGKDCNLMAYIMYVTDENK---IVYIDEERYVRFDKTRENDLILEVDNMHDLGYCPARFFW 229 (577) Q Consensus 153 rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~~qD~N~---i~~IDd~~y~~y~kn~k~ei~~e~~~~H~lGy~PA~~~~ 229 (577) .|. .-..-|.+....|.... ..+.+.++-+..+++. +-+-++..+..|..+. +.|.++...-|.||.||.--|. T Consensus 135 ~~~-i~~~~p~~~~~v~d~~~-~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~-~~~~~~~~~~~~~g~vPvv~~~ 211 (453) T protein:vir:39 135 QTN-VIYNTPENMFMVYDDTI-KQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTM-GFYNMTEQAPNPFDDLPVVEFY 211 (453) T ss_pred ceE-EEEEcccceEEEecCCC-CCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecC-CceeeecccccCCCceeEEEec Confidence 443 11123433333343222 3345555656444443 3344666666676555 6777776667999999984443 Q ss_pred cccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceeccccccccccc Q lcl|NC_011222. 230 SDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVD 309 (577) Q Consensus 230 gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~ 309 (577) -+..+.| +| +++-...+|+|+.++... .-=.+++-|++...-- +..+ ..+++ T Consensus 212 n~~~g~s--d~--e~v~~liDa~~~~~s~~~---~~~~~~~~p~~~~~g~-----~~~~------~~~~~---------- 263 (453) T protein:vir:39 212 FNEERMS--IF--ESVISLVNAFNKAISEKA---NDVDYFSDQYLTFLGA-----AVEE------EDLKN---------- 263 (453) T ss_pred CCCCCCc--ch--hhhHHHHHHHHHHHHHHH---HHHHHhhCceeeeecC-----CCCc------hhhhh---------- Confidence 3333222 22 233334456666555433 2223556676653221 0000 00000 Q ss_pred ccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCCCccccch-- Q lcl|NC_011222. 310 GKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGEGELNRSE-- 387 (577) Q Consensus 310 ~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~k-- 387 (577) ...+.++.++-...+..++| |+.++.+.. .+...+..+++.+.|+.-.-+ ++++.+. T Consensus 264 --------------~~~~~~~~~~~~~~~~~~~~----~~~lt~~~~-~~~~~~~~~~l~~~I~~~s~~--p~~~~~~~g 322 (453) T protein:vir:39 264 --------------IRSNRVINYYGESSEAKNVD----VKFLEKPDS-DSQTENLLDRLTKLIFQTTMV--ANISDESFG 322 (453) T ss_pred --------------hhhcceeeecCCCCCCCCCc----eeEEeecCC-HHHHHHHHHHHHHHHHHHhCC--ccccccccc Confidence 01122333333222223333 555555432 345556677888888775522 2222211 Q ss_pred hhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh--cCcccccccccCCcccccCCHHHHHHHHHHHHHc-C-C Q lcl|NC_011222. 388 AINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR--YGDSFVSCNINYGTEFYIYTPEELSERYKIMKET-G-A 463 (577) Q Consensus 388 A~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR--yg~~~~~~ti~yGskFy~~t~eeL~~~i~~Ak~~-G-a 463 (577) ..++.+.++....+..++.+..+.|++.-+-++..++++. .|.+.-.-.| .-.|....|..+++..+++.+. | + T Consensus 323 n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i--~v~f~~~~p~~~~~~a~~~~kl~g~i 400 (453) T protein:vir:39 323 SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCELSTNVSNKEAWKDI--EYTFTRNEPKDIKEQAETANILMGIT 400 (453) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccc--eEEeCCCCCcCHHHHHHHHHHHhccC Confidence 3577788888888889999888999888888888777762 3332211111 1346666677777776655443 3 3 Q ss_pred CHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHhhCCCcCccHHHHHHHHh--------cCCCchh Q lcl|NC_011222. 464 SEAELDALRQQIIETEYRNDP-TQMQRLLILNEIEPYSHLTREEAVNLYK--------ENVISEE 519 (577) Q Consensus 464 s~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~leP~~~LT~~Ev~~l~e--------~g~~~eE 519 (577) |...+..+- -+=.|| .+|+|++. +-+. ..+.-.+... ....++| T Consensus 401 s~et~l~~l------~~v~D~~~E~~ri~~--E~~~----~~~~~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 401 SQETALSVI------SVIPDVQAEMEKIKK--EEAS----TAIFDKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred ChHHHHHhC------CCCCCHHHHHHHHHH--HHHH----HHHHHHhccCCCCCCCCCCCCcCCC Confidence 332221110 011122 34444332 1110 0000000000 1122233 No 18 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=90.35 E-value=0.021 Score=29.75 Aligned_cols=428 Identities=14% Similarity=0.050 Sum_probs=173.0 Q ss_pred cchHHHHHHhh-----CchhHHHHHHHHHH---HHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccC Q lcl|NC_011222. 3 KSLDEIREIYR-----HPEGISQIAKAKEH---EERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPV 74 (577) Q Consensus 3 ~~~~~~~~~~~-----~~~~~~~~aka~~h---eeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv 74 (577) .-+++|.++.. |.+-+....+..+. +--|-++. ..+.|.+.....-..=- T Consensus 1 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~Yy~g~~~I~~~~----------------------~~~~~~~~~~~~~~~~~ 58 (470) T protein:vir:10 1 MELDALKKLIQNTSTSRNDLINNYKQAVNYYENKTDITTRN----------------------NGKAKLNKEGKKDPLRS 58 (470) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhccc----------------------cchhccccccccccccc Confidence 34445544421 22222211111110 00000000 00000000000000000 Q ss_pred CCccchHHHHHHHHHHhc----cCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccC Q lcl|NC_011222. 75 KTNGVTSEIFDKLSRVFD----GRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQV 150 (577) Q Consensus 75 ~t~~lt~~iF~~L~kV~d----gqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~ 150 (577) +.|-++.-.+..|....- |+...+ .-.+.+..+-+..+... ++.+.+..-+-.+.. -=.+++.|= ++. T Consensus 59 ~~~ki~~n~~k~Iv~~~~~yl~G~p~~~--~~~d~~~~~~l~~~~~~--~~~~~~~~l~~~~~~-~G~a~~~~y---~d~ 130 (470) T protein:vir:10 59 ADNRIPSNFYQLLVDQEAGYVASVFPDI--DVGKDADNKKIIDVLGD--DRALTLNGLLVDSSN-AGRAWLHYW---IDE 130 (470) T ss_pred CCcccccchHHHHHHhhhhheeccceee--ecCchHHHHHHHHHHhh--hHHHHHHHHHHHHhh-cCeeEEEEE---ecC Confidence 111111111111111111 111111 11111111111111111 222222211222222 222233321 334 Q ss_pred CCccccchhhhhHHHHHHHHhhcccccceeeeee-e---cCCCE----EEEEeccceeeeccCCccceeee--------- Q lcl|NC_011222. 151 GEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY-V---TDENK----IVYIDEERYVRFDKTRENDLILE--------- 213 (577) Q Consensus 151 ~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~-~---qD~N~----i~~IDd~~y~~y~kn~k~ei~~e--------- 213 (577) ++++. +-..+-+.++-+-....+.+.+.++-+ . .+++. +-+-++..+..|...+.+....+ T Consensus 131 ~~~~~--~~~~~p~~~~~v~d~~~~~~~~a~ir~y~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (470) T protein:vir:10 131 DGNFR--YGIIQPDQITPIYATTLDNKLLGILRSYKQLDPDSGKYFTVHEYWTDKEAQFFRTNATDSTVIEPYNIITSYD 208 (470) T ss_pred CCceE--EEEEcccceEEEEcCCCCCceEEEEEEEEeeecCCceEEEEEEEEcCCcEEEEEeecCcceeccccccccccc Confidence 44433 221222333322222223345555533 2 22222 22334455555555544433222 Q ss_pred ---------hhh-hhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCc Q lcl|NC_011222. 214 ---------VDN-MHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHY 283 (577) Q Consensus 214 ---------~~~-~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~ 283 (577) .+. -|.||+||..-|+-.. .-+|=+....+..|.|-...+....-=.+++-|++....- T Consensus 209 ~~~~~~~~~~~~~~~~~g~vPvv~~~nn~-------~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~---- 277 (470) T protein:vir:10 209 LSAGYETGQSNTLKHNFGRVPFIEFSKNK-------YRLPELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNY---- 277 (470) T ss_pred cccccccccccccccCCCeeeEEEeecCC-------CCCCchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecC---- Confidence 122 3999999996554433 3445555555444444444444333334566676664310 Q ss_pred cCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhh Q lcl|NC_011222. 284 ESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVN 363 (577) Q Consensus 284 ~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~ 363 (577) .+.+ .+ .. ..-...+..+.++-. ....++| |+++++++.+ +-... T Consensus 278 ---~~~~---~~---------------~~--------~~~~~~~~~i~~~~~-~~~~~~~----~~~lt~~~~~-~~~~~ 322 (470) T protein:vir:10 278 ---GGAD---LH---------------QF--------MNDLRKYKSIKINNT-GNGDNSG----VDKLQIDIPV-EARDD 322 (470) T ss_pred ---Cccc---cc---------------hh--------hhhhhhcCeEeccCC-CCCcCce----eEEEeecCCh-HHHHH Confidence 0100 00 00 000111122333321 1122334 5677766643 55566 Q ss_pred HHHHHHHHHHhhhhcCCCccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCccc Q lcl|NC_011222. 364 EEKRLRDELVRSVTGGEGELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEF 443 (577) Q Consensus 364 ~~kri~d~i~~s~~Gf~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskF 443 (577) ..+++.+.|+...-+........-+.|+++++.-+-.+.+++++..+.|+++=.-++..++++. |...+.. ..-.-.| T Consensus 323 ~~~~L~~~I~~~s~~p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~l-~~~~~d~-~~i~i~f 400 (470) T protein:vir:10 323 ALKITRKNIFLFGQGIDPANFESSNASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRYL-NFSDADK-RHISQHW 400 (470) T ss_pred HHHHHHHHHHHHhCCCCCCccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cccCccc-ceeeEEe Confidence 7788999999876553221111136788899999999999999999989888888888887643 2211111 1224677 Q ss_pred ccCCHHHHHHHHHHHHHc-CCCHHHHHHHHHHHHHHhhcCCHH-HHHHHHHHHh-hCCCcCccHHHHHHHHhcCCCchh Q lcl|NC_011222. 444 YIYTPEELSERYKIMKET-GASEAELDALRQQIIETEYRNDPT-QMQRLLILNE-IEPYSHLTREEAVNLYKENVISEE 519 (577) Q Consensus 444 y~~t~eeL~~~i~~Ak~~-Gas~~~i~~L~~qi~e~EyrNnP~-qmqr~~vL~~-leP~~~LT~~Ev~~l~e~g~~~eE 519 (577) ....|..+++.++++.+. |+=+-+- -+-..-+=.||. +|+|++.=+. -.|+ .++..++...|.-+|| T Consensus 401 ~~~~p~d~~e~~~~~~~~~g~iS~et-----~l~~~p~v~D~~~E~eri~~E~~e~~~~----~~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 401 TRTKVEDSLTKAQIVSTVANYSSKEA-----VAKANPIVDDWQQELKDLAKDKEENDPY----SNQADELNGKGVNDEQ 470 (470) T ss_pred ccCCCCCHHHHHHHHHHHhccCcHHH-----HHHhCCCCCCHHHHHHHHHHHHHHHHHh----hccccccCCCCCCCCC Confidence 888888888888776553 4222221 111111233442 3444432110 0111 1233444556777777 No 19 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=88.30 E-value=0.033 Score=28.69 Aligned_cols=440 Identities=12% Similarity=0.049 Sum_probs=185.1 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHH-HHHHHHHHhhhhcccccchHHHHHHHHHHHhh--cchhHHHHHHHh---hcc- Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKE-HEERIAFHTRVRTSDDRNKPVIDFLSKVKTWI--AKDKYDIFLSMF---HFP- 73 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~-heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l--~kdky~~f~~~f---~fp- 73 (577) |-|-+++|.+-=--||-|.. .++ |.++ ..|+ .....+++-....++.+- +..++..+.+.- .-+ T Consensus 3 ~~~~~~~~~~~~~~~e~i~~---~i~~~~~~---~~r~---~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (474) T protein:vir:10 3 LYKLIDDIEAQGILPKHIEA---LIESHKDD---RERM---VNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDV 73 (474) T ss_pred hHHHHhhccccCCCHHHHHH---HHHHhhhh---hHHH---HHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccccc Confidence 44555555433223333222 211 1110 0000 111112211111111000 000111111100 000 Q ss_pred CCCccchHHHHHHHHH----HhccCCccccc---cccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeec Q lcl|NC_011222. 74 VKTNGVTSEIFDKLSR----VFDGRNPVYNY---QFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMP 146 (577) Q Consensus 74 v~t~~lt~~iF~~L~k----V~dgqd~~~~y---~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~ 146 (577) -+.|-|+--....+.. .+=|+...++. .-.+++..+-|+.+.+. +++.+.-+.-+-.+.... .++++|- T Consensus 74 ~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G-~a~~~~~-- 149 (474) T protein:vir:10 74 SVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIR-NSVDDEDSEIGKMAAICG-YGARLAY-- 149 (474) T ss_pred CcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcC-eEEEEEE-- Confidence 0112222221111211 11133333333 11344444444444423 355554444444444444 5777774 Q ss_pred cccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--e--cCCCE----EEEEeccceeeeccCCccceeeehhhhh Q lcl|NC_011222. 147 EVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--V--TDENK----IVYIDEERYVRFDKTRENDLILEVDNMH 218 (577) Q Consensus 147 ~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~--qD~N~----i~~IDd~~y~~y~kn~k~ei~~e~~~~H 218 (577) ++.++++ .+-..+-++++..-. +..+.+.++-+ . .+++. +-+-+++.+.+|..++.+.+......-| T Consensus 150 -~d~~~~~--~~~~i~p~~~~~v~d--~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:10 150 -IDTNGDI--RIKNIDPYNVIFVGD--NILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEH 224 (474) T ss_pred -eCCCCee--EEEEEcccceEEEEc--CCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccC Confidence 4455543 343334444332211 22233444433 1 22332 4455888888898888777777666779 Q ss_pred hhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCccee Q lcl|NC_011222. 219 DLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLK 298 (577) Q Consensus 219 ~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~ 298 (577) .||+||.--|.-.. .-.|=+..+.+..|.|=...+...+--.+++-|++...-. .+.. + ++ T Consensus 225 ~~g~vPvv~~~n~~-------~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~-------~~~~---~-~~- 285 (474) T protein:vir:10 225 LFDYNPLFGVPNNK-------EMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM-------GMSE---E-MI- 285 (474) T ss_pred CCCccceEEecCCC-------CCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC-------CCCc---h-hh- Confidence 99999984443322 3445555555444444443333333333556666543210 1111 0 00 Q ss_pred cccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhc Q lcl|NC_011222. 299 NEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTG 378 (577) Q Consensus 299 n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~G 378 (577) +.....|.+...+ ++.| +++++++... +-.....+++.+.|+...-+ T Consensus 286 ----------------------~~~~~~~~i~~~~------~~~~----~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~ 332 (474) T protein:vir:10 286 ----------------------QETQKSGAFELFD------KDMD----VKYLTKDVND-TMIENHLDRIEKNIMRFAKS 332 (474) T ss_pred ----------------------hhhhhcceeEecC------CCCc----eeEEeccCCH-HHHHHHHHHHHHHHHHHhCC Confidence 0111223332211 2333 5666655532 44455677888888886533 Q ss_pred CC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHh---hcCcc-cccccccCCcccccCCHHHHHH Q lcl|NC_011222. 379 GE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLL---RYGDS-FVSCNINYGTEFYIYTPEELSE 453 (577) Q Consensus 379 f~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~l---Ryg~~-~~~~ti~yGskFy~~t~eeL~~ 453 (577) .. .+.+.+-+.++++.++.+-.+.+++....+.|++.-+-++..++++ ..+.. -+.. ..-.-.|-...|..+++ T Consensus 333 p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~-~~i~~~f~~~~p~d~~e 411 (474) T protein:vir:10 333 VNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSY-LNLIFKFTRNIPVNKLE 411 (474) T ss_pred cccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc-ccceEEeCCCCCCCHHH Confidence 21 1122334678888888888888888888888888777777666654 11111 1111 12234566777777877 Q ss_pred HHHHHHHc-C-CCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHH--hhCCCcCccHHHHHHHHhcCCCchhh Q lcl|NC_011222. 454 RYKIMKET-G-ASEAELDALRQQIIETEYRNDP-TQMQRLLILN--EIEPYSHLTREEAVNLYKENVISEED 520 (577) Q Consensus 454 ~i~~Ak~~-G-as~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~--~leP~~~LT~~Ev~~l~e~g~~~eEd 520 (577) ..+++.+. | +|...+... --+=.|| .+|+|++.=+ ..+...++--. +.-+..--++-| T Consensus 412 ~a~~~~kl~g~iS~et~~~~------l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~---~~~~~~~~~~s~ 474 (474) T protein:vir:10 412 ESQVLINLKGQVSERTRLGQ------SQLVDDVDYELDEMEKESLEFNDKLPDIDEG---DANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHhccCchHHHHHh------CCCCCCHHHHHHHHHHHHHHHHhhcccccCC---CcCCCCccccCC Confidence 77766543 3 222111110 0112344 3344442211 01111111000 000111111222 No 20 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=88.30 E-value=0.033 Score=28.69 Aligned_cols=440 Identities=12% Similarity=0.049 Sum_probs=185.1 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHH-HHHHHHHHhhhhcccccchHHHHHHHHHHHhh--cchhHHHHHHHh---hcc- Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKE-HEERIAFHTRVRTSDDRNKPVIDFLSKVKTWI--AKDKYDIFLSMF---HFP- 73 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~-heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l--~kdky~~f~~~f---~fp- 73 (577) |-|-+++|.+-=--||-|.. .++ |.++ ..|+ .....+++-....++.+- +..++..+.+.- .-+ T Consensus 3 ~~~~~~~~~~~~~~~e~i~~---~i~~~~~~---~~r~---~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (474) T protein:vir:94 3 LYKLIDDIEAQGILPKHIEA---LIESHKDD---RERM---VNLYNRYKTHIDYVPIFKRRPIEEKEDFETGGNVRRLDV 73 (474) T ss_pred hHHHHhhccccCCCHHHHHH---HHHHhhhh---hHHH---HHHHHHHhhhcchhhhhcchhhhhhhhhhhccccccccc Confidence 44555555433223333222 211 1110 0000 111112211111111000 000111111100 000 Q ss_pred CCCccchHHHHHHHHH----HhccCCccccc---cccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeec Q lcl|NC_011222. 74 VKTNGVTSEIFDKLSR----VFDGRNPVYNY---QFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMP 146 (577) Q Consensus 74 v~t~~lt~~iF~~L~k----V~dgqd~~~~y---~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~ 146 (577) -+.|-|+--....+.. .+=|+...++. .-.+++..+-|+.+.+. +++.+.-+.-+-.+.... .++++|- T Consensus 74 ~~~~ki~~n~~~~ivd~~~~yl~g~pv~~~~~~~~~~~e~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G-~a~~~~~-- 149 (474) T protein:vir:94 74 SVNNKLNNSFDSEIVDTRVGYLHGVPVTYDLDENAEKNEKLKKFITNFAIR-NSVDDEDSEIGKMAAICG-YGARLAY-- 149 (474) T ss_pred CcccccccchHHHHHHhHhhheeccceeEeeCCCCcchHHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcC-eEEEEEE-- Confidence 0112222221111211 11133333333 11344444444444423 355554444444444444 5777774 Q ss_pred cccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--e--cCCCE----EEEEeccceeeeccCCccceeeehhhhh Q lcl|NC_011222. 147 EVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--V--TDENK----IVYIDEERYVRFDKTRENDLILEVDNMH 218 (577) Q Consensus 147 ~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~--qD~N~----i~~IDd~~y~~y~kn~k~ei~~e~~~~H 218 (577) ++.++++ .+-..+-++++..-. +..+.+.++-+ . .+++. +-+-+++.+.+|..++.+.+......-| T Consensus 150 -~d~~~~~--~~~~i~p~~~~~v~d--~~~~~~~~i~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~ 224 (474) T protein:vir:94 150 -IDTNGDI--RIKNIDPYNVIFVGD--NILEPTYSLRYFYEKDDDNGTDYVYAEFYDNAYYYVFRGEGIDALQEVGRYEH 224 (474) T ss_pred -eCCCCee--EEEEEcccceEEEEc--CCCceEEEEEEEEEeeCCCceEEEEEEEEcCceEEEEeecCCCcccccccccC Confidence 4455543 343334444332211 22233444433 1 22332 4455888888898888777777666779 Q ss_pred hhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCccee Q lcl|NC_011222. 219 DLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLK 298 (577) Q Consensus 219 ~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~ 298 (577) .||+||.--|.-.. .-.|=+..+.+..|.|=...+...+--.+++-|++...-. .+.. + ++ T Consensus 225 ~~g~vPvv~~~n~~-------~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~-------~~~~---~-~~- 285 (474) T protein:vir:94 225 LFDYNPLFGVPNNK-------EMIGDAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGM-------GMSE---E-MI- 285 (474) T ss_pred CCCccceEEecCCC-------CCCCchHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccC-------CCCc---h-hh- Confidence 99999984443322 3445555555444444443333333333556666543210 1111 0 00 Q ss_pred cccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhc Q lcl|NC_011222. 299 NEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTG 378 (577) Q Consensus 299 n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~G 378 (577) +.....|.+...+ ++.| +++++++... +-.....+++.+.|+...-+ T Consensus 286 ----------------------~~~~~~~~i~~~~------~~~~----~~~l~~~~~~-~~~~~~~~~l~~~I~~~s~~ 332 (474) T protein:vir:94 286 ----------------------QETQKSGAFELFD------KDMD----VKYLTKDVND-TMIENHLDRIEKNIMRFAKS 332 (474) T ss_pred ----------------------hhhhhcceeEecC------CCCc----eeEEeccCCH-HHHHHHHHHHHHHHHHHhCC Confidence 0111223332211 2333 5666655532 44455677888888886533 Q ss_pred CC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHh---hcCcc-cccccccCCcccccCCHHHHHH Q lcl|NC_011222. 379 GE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLL---RYGDS-FVSCNINYGTEFYIYTPEELSE 453 (577) Q Consensus 379 f~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~l---Ryg~~-~~~~ti~yGskFy~~t~eeL~~ 453 (577) .. .+.+.+-+.++++.++.+-.+.+++....+.|++.-+-++..++++ ..+.. -+.. ..-.-.|-...|..+++ T Consensus 333 p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~~~~~~~~~~-~~i~~~f~~~~p~d~~e 411 (474) T protein:vir:94 333 VNFNSDEFNGNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRKGYNLDDDSY-LNLIFKFTRNIPVNKLE 411 (474) T ss_pred cccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc-ccceEEeCCCCCCCHHH Confidence 21 1122334678888888888888888888888888777777666654 11111 1111 12234566777777877 Q ss_pred HHHHHHHc-C-CCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHH--hhCCCcCccHHHHHHHHhcCCCchhh Q lcl|NC_011222. 454 RYKIMKET-G-ASEAELDALRQQIIETEYRNDP-TQMQRLLILN--EIEPYSHLTREEAVNLYKENVISEED 520 (577) Q Consensus 454 ~i~~Ak~~-G-as~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~--~leP~~~LT~~Ev~~l~e~g~~~eEd 520 (577) ..+++.+. | +|...+... --+=.|| .+|+|++.=+ ..+...++--. +.-+..--++-| T Consensus 412 ~a~~~~kl~g~iS~et~~~~------l~~v~d~~~E~eri~~E~~e~~~~~~~~~~~---~~~~~~~~~~s~ 474 (474) T protein:vir:94 412 ESQVLINLKGQVSERTRLGQ------SQLVDDVDYELDEMEKESLEFNDKLPDIDEG---DANDKSQNNQSE 474 (474) T ss_pred HHHHHHHHhccCchHHHHHh------CCCCCCHHHHHHHHHHHHHHHHhhcccccCC---CcCCCCccccCC Confidence 77766543 3 222111110 0112344 3344442211 01111111000 000111111222 No 21 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=86.76 E-value=0.043 Score=28.06 Aligned_cols=436 Identities=12% Similarity=0.058 Sum_probs=189.4 Q ss_pred hhCchhHHHHHHHHHHHHHH----------HHHhhhhcc---cccchHHHHHHHHHHHhhcc-hhHHHHHHH----hhcc Q lcl|NC_011222. 12 YRHPEGISQIAKAKEHEERI----------AFHTRVRTS---DDRNKPVIDFLSKVKTWIAK-DKYDIFLSM----FHFP 73 (577) Q Consensus 12 ~~~~~~~~~~aka~~heeri----------~fh~~~~ta---~d~~~~~~~fl~~v~~~l~k-dky~~f~~~----f~fp 73 (577) ...-.-|||||.|.-.---| .|+.....- .++...+..|+++-++-+++ +|..+|..- ++-| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~r~~~l~~YY~g~~~I~~~~ 80 (492) T protein:vir:94 1 MQFIQLISQVAQALIKGGNILYPSQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQEYYEQRPDIVKEP 80 (492) T ss_pred ChHHHHHHHHHHHHhcCCceeecCccchhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 33345678888875321111 233333332 23333444443332222111 111111110 0111 Q ss_pred CC-----------C-----ccchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCC Q lcl|NC_011222. 74 VK-----------T-----NGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRI 137 (577) Q Consensus 74 v~-----------t-----~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~ 137 (577) .+ . ..+.+.|=+...--+-|....++ -.+.+..+-|..|+. +++-+..+ +..+....-= T Consensus 81 ~~~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~G~p~~~~--~~d~~~~~~l~~~~~--n~~~~~~~-~~~~~a~~~G 155 (492) T protein:vir:94 81 KPVDATGAVDPLKPDDRMITNFHANLVDQKVSYIVGKPIAFK--HTDDEVVKRIDEVLG--NRFDDKLH-SVLTGASNKG 155 (492) T ss_pred ccccccccccccccccccccchHHHHHHHHHhhhcccCceec--cCchHHHHHHHHHHh--ccHHHHHH-HHHHHHhhCC Confidence 11 0 12222222222222223333332 345555566666653 35555554 3445555556 Q ss_pred CeEEEEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCcc------- Q lcl|NC_011222. 138 NSVLVVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTREN------- 208 (577) Q Consensus 138 NgviVVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~------- 208 (577) .++++|. .+.+++|.=- ..-|.+..-.|.. ....+.+.++-+ .++..++-+-++.....|...+.. T Consensus 156 ~a~~~v~---~d~dg~~~~~-~~~p~~~~~v~d~-~~~~~~~a~ir~~~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~ 230 (492) T protein:vir:94 156 IEWLHPY---LDEEGEFKLF-RVPAEQGIPIWTD-KEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGSLIPDYSN 230 (492) T ss_pred eEEEEEE---ecCCCceEEE-EEcccceEEEEcC-CCCCceEEEEEEEeeccceeEEEEecCeEEEEEEecCeeeecccc Confidence 7777886 3445554311 1223332222321 112344555544 445555666677666666544421 Q ss_pred --ceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCC Q lcl|NC_011222. 209 --DLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESH 286 (577) Q Consensus 209 --ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~ 286 (577) .+.......|.||.||.--+..+. .-.|=+..+++..|.|=...+....--.+++-|+++...- T Consensus 231 ~~~~~~~~~~~~~~g~vPvv~~~nn~-------~~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~------- 296 (492) T protein:vir:94 231 NLENSKTHFSTGSWGKIPFIPFKNND-------LEISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNY------- 296 (492) T ss_pred ccccccccccccCCCccceEEecCCC-------CCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC------- Confidence 111122334999999995554433 2344455555444444444444444445677788775321 Q ss_pred CCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecc--cHHHHHHHhhH Q lcl|NC_011222. 287 DGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSA--DTGSLEYNVNE 364 (577) Q Consensus 287 ~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~--di~sleY~~~~ 364 (577) .+.. .+ +... -...+..+.+ | ++.| |++++. +.+.++.+ T Consensus 297 ~~~~---~~-------~~~~----------------~~~~~~~~~~--~----~~~~----~~~l~~~~~~~~~~~~--- 337 (492) T protein:vir:94 297 DDQE---LP-------EFKR----------------LLRYYGAIKV--S----DNGG----VDTIQVEVPVENSKKY--- 337 (492) T ss_pred Cccc---ch-------hhHH----------------HHhhccceec--C----CCCc----ceeEeccCCHHHHHHH--- Confidence 1110 00 0000 0111122222 2 1223 333332 34444444 Q ss_pred HHHHHHHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCccc Q lcl|NC_011222. 365 EKRLRDELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEF 443 (577) Q Consensus 365 ~kri~d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskF 443 (577) .+++.+.|+.-.-.-. .+.+.+-..++.+.+.-+-.+.+++.+..+.|++.-.-+...++++. |...-...| ...| T Consensus 338 ~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~-~~~~~~~~i--~v~f 414 (492) T protein:vir:94 338 LDELYQKIMLFGQAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-DIKGEHKDV--DISF 414 (492) T ss_pred HHHHHHHHHHHhCCcCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCccccee--eEEe Confidence 4566676766552211 11122345788888888899999999999999888888888877752 322111112 2457 Q ss_pred ccCCHHHHHHHHHHHHHc-CCCHHHHHHHHHHHHHH-hhcCCH-HHHHHHHH-----HHhhCCCcCccHHHHHHHHhcCC Q lcl|NC_011222. 444 YIYTPEELSERYKIMKET-GASEAELDALRQQIIET-EYRNDP-TQMQRLLI-----LNEIEPYSHLTREEAVNLYKENV 515 (577) Q Consensus 444 y~~t~eeL~~~i~~Ak~~-Gas~~~i~~L~~qi~e~-EyrNnP-~qmqr~~v-----L~~leP~~~LT~~Ev~~l~e~g~ 515 (577) ....|..+.+..+++.+. |+=+.+- .++. -+-.|| .+|+|++- ++.+.++.+--.+.-.+ T Consensus 415 ~~~~p~~~~e~~~~~~kl~giiS~et------~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~------ 482 (492) T protein:vir:94 415 NYNKVANTELQVQTAQQSMGIVSHET------VLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADSAQQ------ 482 (492) T ss_pred cCCCCCCHHHHHHHHHHHhccCchHH------HHHhCCCCCCHHHHHHHHHHHHHHHHhhccccccccCCCCcc------ Confidence 777777777777766543 5322111 1111 111133 24444322 12221111100000000 Q ss_pred CchhheeeeecchhhhhhhhccCCchhh Q lcl|NC_011222. 516 ISEEDLRVKLNLPTFVRRFERENMNIIE 543 (577) Q Consensus 516 ~~eEdl~vk~n~~~fv~rfe~en~~i~e 543 (577) =|.+|-.--| T Consensus 483 ------------------~~~~~~~e~e 492 (492) T protein:vir:94 483 ------------------QERSNNKESE 492 (492) T ss_pred ------------------ccCCccccCC Confidence 0111111111 No 22 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=85.69 E-value=0.051 Score=27.67 Aligned_cols=435 Identities=11% Similarity=0.039 Sum_probs=183.5 Q ss_pred CCcchHHHHHHhhCchhH---HHH-HHHH-HHHHHHHHHhhhhcccccchHHHHHHHHHHHhhc-chhH------HHHHH Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGI---SQI-AKAK-EHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIA-KDKY------DIFLS 68 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~---~~~-aka~-~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~-kdky------~~f~~ 68 (577) |--+.+...++.+-+... .++ .+.+ .|.+|+.-..++.- ||.- + -.++. ++|+ ..... T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~~~~~~~~------YY~g---~-~~i~~~~~~~~~~~~~~~~~~ 74 (472) T protein:vir:93 5 QPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQE------YYEQ---R-PDIVKEPKPVDATGAVDPLKP 74 (472) T ss_pred CCcchhhhhceeeecCchhhHHHHHHHHHHHHHHHHHHHHHHHH------Hhcc---c-cccccccchhhcccccccccc Confidence 333334444444433222 221 1111 33333321111110 1100 0 00000 0111 00000 Q ss_pred HhhccCCCccchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccc Q lcl|NC_011222. 69 MFHFPVKTNGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEV 148 (577) Q Consensus 69 ~f~fpv~t~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i 148 (577) -++. ...+...|=+.+.--+=|.-..+ .=.+.+..+-|..|++ +++-+..+.-+=++.... .++++|. . T Consensus 75 ~~ri---~~n~~~~ivd~~~~~l~g~~~~~--~~~d~~~~~~l~~~~~--n~~~~~~~~~~~~~~~~G-~~~~~v~---~ 143 (472) T protein:vir:93 75 DDRM---ITNFHANLVDQKVSYIVGKPIAF--KHTDDEVVKRIDEVLG--NRFDDKLHSVLTGASNKG-IEWLHPY---L 143 (472) T ss_pred cccc---ccchHHHHHHHHhhhhcccCeee--ccCChHHHHHHHHHHh--ccHHHHHHHHHHHHhhcC-eEEEEEE---E Confidence 0000 01222222222222222322222 2234555555666653 356666665554455554 4777776 4 Q ss_pred cCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCcccee---------eeh-hh Q lcl|NC_011222. 149 QVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTRENDLI---------LEV-DN 216 (577) Q Consensus 149 ~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~ei~---------~e~-~~ 216 (577) +.+++|. .-..-|.+....|.. ....+.+.++-+ .++++++-+.++....+|..++.. ++ -++ .. T Consensus 144 d~d~~~~-i~~~~p~~~~~i~d~-~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 220 (472) T protein:vir:93 144 DEEGEFK-LFRVPAEQGIPIWTD-KEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYENGS-LIPDYSNNLENSKTHFS 220 (472) T ss_pred CCCCceE-EEEEcccceEEEEcC-CCCCceEEEEEEEEeecceeEEEEecCeEEEEEEecCe-eeecccccccccccccc Confidence 5566554 222233333333332 123345666555 556667777777776667655422 11 112 23 Q ss_pred hhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcc Q lcl|NC_011222. 217 MHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGF 296 (577) Q Consensus 217 ~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~ 296 (577) -|.||.||---|..+.. -.|=++.+.+..|.|-...+....-=.+++-|+.+.... .+.. .+ T Consensus 221 ~~~~~~vPvv~~~nn~~-------g~s~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~-------~~~~---~~- 282 (472) T protein:vir:93 221 TGSWGKIPFIPFKNNDL-------EISDIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNY-------DDQE---LP- 282 (472) T ss_pred cCCCCCcceEEecCCCC-------CCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecC-------Cccc---ch- Confidence 39999999854433322 334445544444444333333333234567777764311 0100 00 Q ss_pred eecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhh Q lcl|NC_011222. 297 LKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSV 376 (577) Q Consensus 297 i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~ 376 (577) ++.. ....+..+.+ | ++.| |++++.+.. .+-....++++.+.|+... T Consensus 283 ------~~~~----------------~~~~~~~~~~--~----~~~~----~~~l~~~~~-~~~~~~~~~~l~~~i~~~s 329 (472) T protein:vir:93 283 ------EFKR----------------LLRYYGAIKV--S----DNGG----VDTIQVEVP-VENSKKYLDELYQKIMLFG 329 (472) T ss_pred ------hhHH----------------HHhhcccccc--C----CCCc----ceeEeecCC-HHHHHHHHHHHHHHHHHHh Confidence 0100 0112222222 2 2223 334433322 1333444667777776654 Q ss_pred hcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHHHHHHHH Q lcl|NC_011222. 377 TGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPEELSERY 455 (577) Q Consensus 377 ~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~eeL~~~i 455 (577) -... .+.+.+-..++++.+.-+.++..++.+..+.|++.-+-++.+++++. |.+.=...| .-.|....|..+++.+ T Consensus 330 ~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~-~~~~~~~~i--~v~f~~~~p~~~~~~~ 406 (472) T protein:vir:93 330 QAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-DIKGEHKDV--DISFNYNKVANTELQV 406 (472) T ss_pred CCCCCCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCccccee--eEEeCCCCCCCHHHHH Confidence 2211 11122335688888888899999999999999998888888887753 332211122 2346677777777777 Q ss_pred HHHHHc-CCCHHHHHHHHHHHHHHh-hcCCH-HHHHHHHH-----HHhhCCCcCccHHHHHHHHhcCCCchh Q lcl|NC_011222. 456 KIMKET-GASEAELDALRQQIIETE-YRNDP-TQMQRLLI-----LNEIEPYSHLTREEAVNLYKENVISEE 519 (577) Q Consensus 456 ~~Ak~~-Gas~~~i~~L~~qi~e~E-yrNnP-~qmqr~~v-----L~~leP~~~LT~~Ev~~l~e~g~~~eE 519 (577) .++.+. |+=+ +...++.- +-.|| .+|+|++- ++.+.++.+.-.+.-.+--+.|=-+.| T Consensus 407 ~~~~k~~giis------~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~e 472 (472) T protein:vir:93 407 QTAQQSMGIVS------HETVLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 472 (472) T ss_pred HHHHHHhccCc------hHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhccCcCcccCCCCCCCCCCCcccCC Confidence 665543 5311 11112211 11232 34444322 222333222111100000011111112 No 23 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=85.20 E-value=0.055 Score=27.51 Aligned_cols=458 Identities=12% Similarity=0.052 Sum_probs=181.4 Q ss_pred CCcchHHHHH-HhhCchhHH-----HHHHHHHH-----HHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHH Q lcl|NC_011222. 1 MGKSLDEIRE-IYRHPEGIS-----QIAKAKEH-----EERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSM 69 (577) Q Consensus 1 ~~~~~~~~~~-~~~~~~~~~-----~~aka~~h-----eeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~ 69 (577) |--.|..=|+ -.+-|+++. +|-|-++| -.|++-..+.-... +. +-+.+-.+. .++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~l~~~~i~~li~~~~~~~~~r~~~l~~YY~g~--~~---~i~~~~~~~--~~~------- 66 (506) T protein:vir:94 1 MDYDLTEHKQANLIYQESLENLTPNKIMKFITHHFNYQRPRLEMLDDYYQGY--NL---KILDKQSRR--HED------- 66 (506) T ss_pred CCcchhhhhcceeecccchhcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcCC--Cc---ccccccccc--ccc------- Confidence 2111111000 011122211 11111111 01111111100000 00 000000000 000 Q ss_pred hhccCCCc----cchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEee Q lcl|NC_011222. 70 FHFPVKTN----GVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDM 145 (577) Q Consensus 70 f~fpv~t~----~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm 145 (577) ..+.+ .+...|=+...-.+-|+...++. .+.+..+-|+.+.+ -+++.+....-+=++.... .+++.|. T Consensus 67 ---~~~~~ki~~n~~~~Iv~~~~~~l~G~p~~~~~--~d~~~~~~l~~~~~-~N~~~~~~~~~~~~~~~~G-~a~~~v~- 138 (506) T protein:vir:94 67 ---GKADHRATHSFAKYIADFQTSYSVGNPINVKL--PDDGSNSGFDTFNK-ANDVDAENYDLFLDMSRYG-RAYEYVY- 138 (506) T ss_pred ---cCCcceeecchHHHHHHHhhhhhcccCceeec--CcchHHHHHHHHHh-ccCHhHHHHHHHHHHHhcC-eEEEEEE- Confidence 01111 22233333333333444444433 23334444555542 2455554444444444444 4666665 Q ss_pred ccccCCCccccchhhhhHHHHHHHHhhcccccceeeeee----ecCCCE-------EEEEeccceeeeccCCccceeeeh Q lcl|NC_011222. 146 PEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY----VTDENK-------IVYIDEERYVRFDKTRENDLILEV 214 (577) Q Consensus 146 ~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~----~qD~N~-------i~~IDd~~y~~y~kn~k~ei~~e~ 214 (577) ++.+++|. .-..+-++++..--...+.+.+.++-+ ..+++. .-+-.++.+.+|.... +.+.++. T Consensus 139 --~ded~~~~--i~~~~p~~~~~v~dd~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~yt~~~~~~~~~~~-~~~~~~~ 213 (506) T protein:vir:94 139 --RGEDNEEH--LAKLDPLDTFVIYSTDVDPKPIMAVRYHQIELVDDNQVSTINYVPETWTADTYTLYNPTP-IMGKMQV 213 (506) T ss_pred --ecCCCeeE--EEEEcccceEEEecCCCCCceEEEEEEEeeeeccCCceeEEEEEEEEEeCceEEEecccc-Cccceec Confidence 45555543 212233333332222222344555433 234443 1123556666665443 5677777 Q ss_pred hhhhhhcccceeeEecccccccCcceeeccchhHH---HHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCcc Q lcl|NC_011222. 215 DNMHDLGYCPARFFWSDSISLSEPDIKISPITSEL---DSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKER 291 (577) Q Consensus 215 ~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L---~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~ 291 (577) ...|.||.||..-| - ++ + .-.|-+.... +|+|+.++-..--.. +...++...+|++..=.. ...+... T Consensus 214 ~~~~~~g~vPvv~~-~----n~-~-~~~sd~e~~~~liDa~d~~~S~~~~~~~-~~~~~~l~~~g~~~~~~~-~~~~~~~ 284 (506) T protein:vir:94 214 DTTKPITTFPVVEF-K----NS-N-FRLGDFENVLPLIDLYDAAQSDTANYMT-DLNEAMLIIQGDIDTLFE-GSDMMNT 284 (506) T ss_pred cccccCCccceEEe-c----CC-C-CCCCchhhhHHHHHHHHHHHHHHHHHHH-HhhhHHHHHhcCcccccc-chhcccc Confidence 77899999998434 2 12 1 2344455544 555554443322222 334445555554432221 1111111 Q ss_pred ccCcceecccccccccccccccccCCCccccc--cCccceeeeecC---cccccCCCccCcceeecccHHHHHHHhhHHH Q lcl|NC_011222. 292 CDDGFLKNEKNEWITGVDGKPMACPICSSKRL--RGAGSYVEIPIP---DEMHNVPDLKNPITMLSADTGSLEYNVNEEK 366 (577) Q Consensus 292 C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~--~g~gs~~evPiP---~~~~d~~Dlr~pv~i~n~di~sleY~~~~~k 366 (577) .... ...|....+ .=..... +.-+-++.++-. ......+ .+++++.+.. .+-....++ T Consensus 285 ~~~~--------~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----d~~~l~~~~~-~~~~~~~~~ 347 (506) T protein:vir:94 285 IDPN--------DEDAMAKLA----KDKLELIKEMKDANMLLLKSGMTVNGTQTSV----DAKYINKTYD-VVGSEAYKK 347 (506) T ss_pred cccc--------ccccccccc----cchhHHHhhhhhcCeeeecccccccCccccc----cceeeeecCC-HHHHHHHHH Confidence 1111 000000000 0000000 001112222210 0011222 3566665432 233444567 Q ss_pred HHHHHHHhhhhc--CCCccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh---cCcccccc-cccCC Q lcl|NC_011222. 367 RLRDELVRSVTG--GEGELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR---YGDSFVSC-NINYG 440 (577) Q Consensus 367 ri~d~i~~s~~G--f~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR---yg~~~~~~-ti~yG 440 (577) ++.+.|+...-+ +.. .+..-..++++.++-.-.+.+++.+..+-|++.-+-++.+++++- -+...++. +| - T Consensus 348 ~l~~~I~~~s~~p~~~~-~~~~~n~Sg~Aik~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i--~ 424 (506) T protein:vir:94 348 RVAGDIHKFSHTPDLTD-ENFASNSSGVAMQYKVLGTVELASTKRRMFERGLYARYQIISDIENSIHGDWTFDPQEL--T 424 (506) T ss_pred HHHHHHHHHhCcccccc-ccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccccc--e Confidence 788888765422 211 122235678888888888899999999999998888888777752 23333332 11 2 Q ss_pred cccccCCHHHHHHHHHHHHHc-C-CCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHh-----hCCCcCccHHHHHHHHh Q lcl|NC_011222. 441 TEFYIYTPEELSERYKIMKET-G-ASEAELDALRQQIIETEYRNDP-TQMQRLLILNE-----IEPYSHLTREEAVNLYK 512 (577) Q Consensus 441 skFy~~t~eeL~~~i~~Ak~~-G-as~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~-----leP~~~LT~~Ev~~l~e 512 (577) -.|....|..+++.++++.+. | +|...+... --+=.|| .+|+|++.=+. .+.+...+.++-. T Consensus 425 i~f~~~~p~d~~e~a~~~~kl~g~iS~et~~~~------lp~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~---- 494 (506) T protein:vir:94 425 FTFRDNLPADNISQIKALVQAGATLPQKYLYQQ------LPGVTNPQDIVDMMKEQSANGDYSFDQNGVISNDGQT---- 494 (506) T ss_pred EEeCCCCCcCHHHHHHHHHHHhccCChHHHHHh------CCCCCCHHHHHHHHHHHHHHHhhcchhhcCCCcccCc---- Confidence 357888888888888877664 3 233222111 1122233 34444432111 1111111111100 Q ss_pred cCCCchhheeee Q lcl|NC_011222. 513 ENVISEEDLRVK 524 (577) Q Consensus 513 ~g~~~eEdl~vk 524 (577) ...-++++=-|| T Consensus 495 ~~~~~~~~~e~~ 506 (506) T protein:vir:94 495 NTTATQTDEEVR 506 (506) T ss_pred cccccccccCCC Confidence 011122222344 No 24 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=84.21 E-value=0.062 Score=27.20 Aligned_cols=456 Identities=13% Similarity=0.096 Sum_probs=190.4 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHH-HHHHHHHhhh-hc-ccccchHHHHHHHHHHHhhcchhHHHHHHHhh---c-- Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEH-EERIAFHTRV-RT-SDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFH---F-- 72 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~h-eeri~fh~~~-~t-a~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~---f-- 72 (577) |.+. +|..-.+-....+-+.| |.++.||-.- .. ..|....++.|.++-+.-. ..+|.+....+. . T Consensus 1 ~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~-~~r~~~~~~yY~g~~~~i 73 (501) T protein:vir:96 1 MEQT------LFTDSTGQERVLNLRFHRESRIRYRADNLEELMVNNWELLKNFINHHKLRQ-APRIQELLDYARGENHDV 73 (501) T ss_pred Ccee------eeeecccceeccccccchhHHhhhcccccccccCChHHHHHHHHHHHHHHH-HHHHHHHHHHhcCCCCcc Confidence 5443 23333333333333333 4466776322 21 2222223444433222110 012222222221 1 Q ss_pred ------cCC---CccchHHHHHHHHHHh----ccCCccccccccChhh----hhhhhHHHhhccchhHHHHHHHHHHHHh Q lcl|NC_011222. 73 ------PVK---TNGVTSEIFDKLSRVF----DGRNPVYNYQFKSSED----RDDWEYYRKDVLKEPSVWSTDGWDNFKH 135 (577) Q Consensus 73 ------pv~---t~~lt~~iF~~L~kV~----dgqd~~~~y~f~~~e~----~~d~~~y~se~ln~~~fw~~~~fk~~~~ 135 (577) +-. .+-++--....+.... =|+...+ ...+.+. .+-++.+.+ .+++.+..+.-+=++... T Consensus 74 ~~~~~~~~~~~~~~ri~~n~~k~Ivd~~~~yl~g~p~~~--~~~~~~~~~~~~~~l~~~~~-~n~~~~~~~~~~~~~~~~ 150 (501) T protein:vir:96 74 LKSGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRV--EYDDNDDNSQNDDAIKRIGR-INDLDSLNRTLIRDLSQT 150 (501) T ss_pred cCccccCccccccceeecchHHHHHHHHhhhhcccCeeE--eeCCccchhHHHHHHHHHHH-hcCHHHHHHHHHHHHhhc Confidence 000 1111111111222211 1232222 2222222 222333332 245655555555555555 Q ss_pred CCCeEEEEeeccccCCCccccchhhhhHHHHH-HHHhhcccccceeeeee-ec-C--CC--EEEEEeccceeeeccCCcc Q lcl|NC_011222. 136 RINSVLVVDMPEVQVGEKPEPYFFWLPIANVL-SYRTCGKDCNLMAYIMY-VT-D--EN--KIVYIDEERYVRFDKTREN 208 (577) Q Consensus 136 ~~NgviVVDm~~i~~~~rpqpyf~~~pie~V~-~y~~~~~~~~~i~~i~~-~q-D--~N--~i~~IDd~~y~~y~kn~k~ 208 (577) . .++++|= .+.+++|. +-..+-++++ .|... ...+.+.++.+ .. + ++ .+-+.+++.+.+|...+ T Consensus 151 G-~a~~~v~---~dedg~~~--i~~~~p~~~~~v~d~~-~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~-- 221 (501) T protein:vir:96 151 G-RAYEVIY---RSEYDETR--IKRLSPLETFVIYDNS-LEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASD-- 221 (501) T ss_pred C-eEEEEEE---EcCCCceE--EEEEccceeEEEEcCC-CCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCC-- Confidence 4 4666662 33445442 2222333332 23221 12345666655 21 1 22 24455666666665433 Q ss_pred ceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCC Q lcl|NC_011222. 209 DLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDG 288 (577) Q Consensus 209 ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G 288 (577) .+.......|.||.||.--|. |.| .-.|-+....+..|.|-...+....--.+++-|+....-. T Consensus 222 ~~~~~~~~~~~~g~vPvv~~~------nn~-~g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~--------- 285 (501) T protein:vir:96 222 DFNEISVTTHAFGTVPITEYL------NNI-DGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGD--------- 285 (501) T ss_pred CceeccccccCCCccceEEec------CCc-cCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecc--------- Confidence 333334456999999974332 223 4566666666666655555555544334666777655221 Q ss_pred CccccCcceecccccccccccccccccCCCccccccCccceeeeecCcc---cccCCCccCcceeecc--cHHHHHHHhh Q lcl|NC_011222. 289 KERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDE---MHNVPDLKNPITMLSA--DTGSLEYNVN 363 (577) Q Consensus 289 ~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~---~~d~~Dlr~pv~i~n~--di~sleY~~~ 363 (577) .-.+.| + . ...+.....+.++.|.- +...+ -+++++. +.++++.+ T Consensus 286 -~~~~~~-------~-----~-----------~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~l~~~~~~~~~~~~-- 335 (501) T protein:vir:96 286 -LALPKG-------M-----Q-----------ASDMKRTRLMQLKPPKSADGKEGTV----KAEYLTKSYDVSGAEAY-- 335 (501) T ss_pred -cccCcc-------c-----c-----------hhhhhhcCeeeecccccccccccCc----ceeeEeccCCHHHHHHH-- Confidence 001111 0 0 00111222334444321 11222 2556554 34555544 Q ss_pred HHHHHHHHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh---cCcccccccccC Q lcl|NC_011222. 364 EEKRLRDELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR---YGDSFVSCNINY 439 (577) Q Consensus 364 ~~kri~d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR---yg~~~~~~ti~y 439 (577) ++++.+.|+...-... .+.+.+-+.++++.+.....+.+++.+..+.|++.-+-+...++++- .....+.- ..- T Consensus 336 -~~~l~~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~-~~i 413 (501) T protein:vir:96 336 -KTRLNRDIHIFTNTPDMSDTNFSGNTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRLAARIGSLVNEFKDFDE-SLL 413 (501) T ss_pred -HHHHHHHHHHHhCCcccCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc-ccc Confidence 4566677766542211 11122345788888888889999998888888888877777766552 22112211 011 Q ss_pred CcccccCCHHHHHHHHHHHHH-cC-CCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHh-hCCCcCccHHHHHHHHhcCC Q lcl|NC_011222. 440 GTEFYIYTPEELSERYKIMKE-TG-ASEAELDALRQQIIETEYRNDP-TQMQRLLILNE-IEPYSHLTREEAVNLYKENV 515 (577) Q Consensus 440 GskFy~~t~eeL~~~i~~Ak~-~G-as~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~-leP~~~LT~~Ev~~l~e~g~ 515 (577) .-.|....|..+++.+.++.+ .| +|...+..+ . -+=.|| .+|+|++-=+. .++-.. ..+..+.. |- T Consensus 414 ~i~f~~~~p~n~~e~ad~~~kl~g~iS~et~~~~-l-----~~v~D~~~E~~ri~~E~~~~~~~~~--~~~~~~~~--~~ 483 (501) T protein:vir:96 414 KITFTPNLPKSLNEQVSILTGLGGQVSQETALSL-S-----GLVESPNEELDKINKEMSEIDFKGY--SNDFNEHV--GK 483 (501) T ss_pred eEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHh-C-----CCCCCHHHHHHHHHHHHHHhhcccc--ccchhhcc--cc Confidence 234666677777777665433 44 333221111 1 122343 34555432221 111111 11111111 11 Q ss_pred CchhheeeeecchhhhhhhhccCCchhh Q lcl|NC_011222. 516 ISEEDLRVKLNLPTFVRRFERENMNIIE 543 (577) Q Consensus 516 ~~eEdl~vk~n~~~fv~rfe~en~~i~e 543 (577) -+++ .--.=.+|-.+..| T Consensus 484 ~~~~----------~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 484 YTDE----------VKETHTDDFEREYE 501 (501) T ss_pred cCCc----------CCCCCCCccccccC Confidence 1111 00011123334444 No 25 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=84.13 E-value=0.063 Score=27.17 Aligned_cols=405 Identities=12% Similarity=0.076 Sum_probs=173.4 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhc---chhHHHHHHHhhccCCCc Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIA---KDKYDIFLSMFHFPVKTN 77 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~---kdky~~f~~~f~fpv~t~ 77 (577) -..+.+.|+..++ .|+.|+ .|+.+. .+||.- --.++- +++. -+.+ T Consensus 15 ~~~~~~~i~~~i~------------~~~~~~---~r~~~~---~~Yy~g----~~~i~~~~~~~~~----------~~~~ 62 (452) T protein:vir:36 15 EPITVEVVTKFME------------KHKLEV---ARYEYL---KNMYLG----IMAIDDEPAKDSW----------KPDN 62 (452) T ss_pred cCCCHHHHHHHHH------------HHHHHH---HHHHHH---HHHhcc----ccccccCcccccc----------Cccc Confidence 1112333433322 133332 111110 111111 001111 1110 0112 Q ss_pred cch----HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCc Q lcl|NC_011222. 78 GVT----SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEK 153 (577) Q Consensus 78 ~lt----~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~r 153 (577) .+. .-|=+...-..-|....++ -.+.+..+-|+.+.+. +++.+..+.-+=.+....- ++.+|= +..+++ T Consensus 63 ki~~n~~~~ivd~~~~~l~g~~~~~~--~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~~~G~-~~~~v~---~d~~g~ 135 (452) T protein:vir:36 63 RLAVNFTKYIVDTFTGYFNGIPVKKS--HSDKEILTKLQEFDNL-NDMEDEESELAKMACIYGR-AFEFLY---QDEDTQ 135 (452) T ss_pred eeecchHHHHHHHHhhhhcccCceee--cCChhHHHHHHHHHhh-cChhHHHHHHHHHHHhcCe-EEEEEE---ecCCCe Confidence 222 1222222122224444433 2345555555555533 4666666555555555443 545542 344555 Q ss_pred cccchhhhhHHHHH-HHHhhcccccceeeeeeec--CCCE-EEEEeccceeeeccCCccceeeehhhhhhhcccceeeEe Q lcl|NC_011222. 154 PEPYFFWLPIANVL-SYRTCGKDCNLMAYIMYVT--DENK-IVYIDEERYVRFDKTRENDLILEVDNMHDLGYCPARFFW 229 (577) Q Consensus 154 pqpyf~~~pie~V~-~y~~~~~~~~~i~~i~~~q--D~N~-i~~IDd~~y~~y~kn~k~ei~~e~~~~H~lGy~PA~~~~ 229 (577) |. +-..+-++++ .|... .+.+.+.++-+.. ++.. +-+-++..+.+|..++ +.|.+....-|.||.||..-+. T Consensus 136 ~~--i~~~~p~~~~~v~d~~-~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~-~~~~~~~~~~~~~g~iPvv~~~ 211 (452) T protein:vir:36 136 TN--VVYNSPENMFMVYDDT-VKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGEN-DEISFGEGTYNPYPDLPVVEFY 211 (452) T ss_pred eE--EEEEcccceEEEEcCC-CCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcC-CceEEecceeccCCcccEEEec Confidence 43 2222333332 33221 2334566665533 2232 3344555555555443 6677777777999999985443 Q ss_pred cccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceeccccccccccc Q lcl|NC_011222. 230 SDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVD 309 (577) Q Consensus 230 gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~ 309 (577) .... -.|=+....+..|+|=...+....-=.+++-|+..... ..... .+..+ T Consensus 212 n~~~-------g~sd~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g-------~~~~~----~~~~~---------- 263 (452) T protein:vir:36 212 FNEE-------RMSIFESVISLVNAFNKAISEKANDVDYFSDQYLTFLG-------AAVEE----EDLKN---------- 263 (452) T ss_pred CCCC-------CCcchHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeec-------CCcCc----hhhhh---------- Confidence 3322 33444444444444433333333323466778777531 00000 00000 Q ss_pred ccccccCCCccccccCccceeeeecCcccccCCCccCcceeeccc--HHHHHHHhhHHHHHHHHHHhhhhcCCCccccch Q lcl|NC_011222. 310 GKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSAD--TGSLEYNVNEEKRLRDELVRSVTGGEGELNRSE 387 (577) Q Consensus 310 ~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~d--i~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~k 387 (577) ...+.++.++.. .+..++| |++++.+ .+.++ ...+++.+.|+...-+ ++++-+. T Consensus 264 --------------~~~~~~~~~~~~-~~~~~~~----~~~l~~~~~~~~~~---~~~~~l~~~I~~~s~~--p~~~~~~ 319 (452) T protein:vir:36 264 --------------IRSNRVINYYAD-GEGKNVD----VKFLEKPDSDSQTE---NLLDRLTKLIFQTTMV--ANISDES 319 (452) T ss_pred --------------hhhcceEEecCC-CCccCCc----ceeEeecCCHHHHH---HHHHHHHHHHHHHhCc--cccCccc Confidence 111233444432 1223334 5555544 34444 4455666777655422 1222111 Q ss_pred --hhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhc--Cccc--ccccccCCcccccCCHHHHHHHHHHHHH- Q lcl|NC_011222. 388 --AINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRY--GDSF--VSCNINYGTEFYIYTPEELSERYKIMKE- 460 (577) Q Consensus 388 --A~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRy--g~~~--~~~ti~yGskFy~~t~eeL~~~i~~Ak~- 460 (577) ..++++.++..-++.+++.+..+.|+++-+-++.+++++.- |.+. ...+| .|....|..+++..+++.+ T Consensus 320 ~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~~i~i----~f~~~~p~d~~~~a~~~~k~ 395 (452) T protein:vir:36 320 FGSSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCELSTNVSNKDSWKDIEY----TFTRNEPKDIKEQAETANIL 395 (452) T ss_pred ccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccccceE----EeCCCCCcCHHHHHHHHHHH Confidence 45778888888899999998888888888877777776532 2221 12233 3555566666666654433 Q ss_pred cCC-CHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHhhCCCcCccHHHHHHHHh-----------cCCCchh Q lcl|NC_011222. 461 TGA-SEAELDALRQQIIETEYRNDP-TQMQRLLILNEIEPYSHLTREEAVNLYK-----------ENVISEE 519 (577) Q Consensus 461 ~Ga-s~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~leP~~~LT~~Ev~~l~e-----------~g~~~eE 519 (577) +|+ |...+.. ++ -+=.|| .+|+|++.=+ .++.....+ .+-.++| T Consensus 396 ~g~iS~et~~~---~~---~~~~d~~~E~~ri~~E~---------~~~~~~~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 396 MGITSQETALS---VI---SVIPDVQAEMEKIKKEE---------ASTAIFDKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred hccCChHHHHH---hC---CCCCCHHHHHHHHHHHH---------HHHHHHHhhccCCCCcccccCccccCC Confidence 342 2211110 00 111232 3344433211 111111111 1112222 No 26 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=84.09 E-value=0.063 Score=27.16 Aligned_cols=443 Identities=12% Similarity=0.038 Sum_probs=170.5 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccCCCccch Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVKTNGVT 80 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~t~~lt 80 (577) +.-..++|++...+-.. .+ -+|+....+ ||..--...+.-...+++..- ..+. ..+. T Consensus 13 ~~~~~~~~~~~i~~~~~-~~-------~~r~~~~~~---------yy~g~~~i~~~~~~~~~~~~~-----~ki~-~n~~ 69 (489) T protein:vir:99 13 SKLWIDQLKNYISRFKA-EQ-------LERLKELKR---------YYLGDNNIKYRPAKTDKYAAD-----NRIA-SDFA 69 (489) T ss_pred CCCCHHHHHHHHHHHHH-HH-------HHHHHHHHH---------HhcccCccccccccccccCCc-----ceee-cchH Confidence 33344566655432110 01 111111111 111000000000000111000 0011 1222 Q ss_pred HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCccccchhh Q lcl|NC_011222. 81 SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKPEPYFFW 160 (577) Q Consensus 81 ~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rpqpyf~~ 160 (577) +.|=+.+..-+=|....++. .+.+.-+-+..+.+. +++..... +..+....-=.++.+|-+. ...++.+++.+-+ T Consensus 70 ~~iv~~~~~~l~g~~~~~~~--~d~~~~~~l~~~~~~-n~~~~~~~-~~~~~~~~~G~~~~~v~~~-~~~d~~~~~~i~~ 144 (489) T protein:vir:99 70 KYITVFEQGYMLGVPVEYKN--ENKDLQAAIDLMSVR-NNEDYHNV-KIKTDLSIYGRAYELLTVE-KIDDKKTEVKLYQ 144 (489) T ss_pred HHHHHHHhhhhccCCceeec--CChhHHHHHHHHHhh-cChhHHHH-HHHHHHhhCCeEEEEEeec-cCcCCCcceEEEE Confidence 22222222222233332221 222222222222212 24444333 3334433333455555321 2234555665554 Q ss_pred hhHHHHHHHHhhcccccceeeeee--ecCC--C---EEEEEeccceeeeccCCc--cceeeehhhhhhhcccceeeEecc Q lcl|NC_011222. 161 LPIANVLSYRTCGKDCNLMAYIMY--VTDE--N---KIVYIDEERYVRFDKTRE--NDLILEVDNMHDLGYCPARFFWSD 231 (577) Q Consensus 161 ~pie~V~~y~~~~~~~~~i~~i~~--~qD~--N---~i~~IDd~~y~~y~kn~k--~ei~~e~~~~H~lGy~PA~~~~gD 231 (577) .+-.+++..-......+-+.++-+ ..++ + .+.+.+++...+|..... +.+.++...-|.||.||.--|. T Consensus 145 ~~p~~~~~v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~-- 222 (489) T protein:vir:99 145 LPAEQTFVIYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDYEGHFFKGVPVNEYA-- 222 (489) T ss_pred EcccceEEEEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceecccccccCCceeEEEee-- Confidence 444444332221112234444433 2222 2 255566666666655432 3444444555999999984332 Q ss_pred cccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceeccccccccccccc Q lcl|NC_011222. 232 SISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGK 311 (577) Q Consensus 232 ~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~ 311 (577) ++ + .-.|-+..+.+..|.|=...+....-=.|.+-|+....-...+.... ... T Consensus 223 ---n~-~-~~~s~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~~~~--~~~-------------------- 275 (489) T protein:vir:99 223 ---NN-E-ERTGAYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTGADE--NDY-------------------- 275 (489) T ss_pred ---cC-C-CCCCchhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCcccccc--hhh-------------------- Confidence 22 1 34455555554444443333333332234445554432221111110 000 Q ss_pred ccccCCCccccccCccceeeeec-------------CcccccCCCccCcceeecccH--HHHHHHhhHHHHHHHHHHhhh Q lcl|NC_011222. 312 PMACPICSSKRLRGAGSYVEIPI-------------PDEMHNVPDLKNPITMLSADT--GSLEYNVNEEKRLRDELVRSV 376 (577) Q Consensus 312 ~~~Cp~C~~K~~~g~gs~~evPi-------------P~~~~d~~Dlr~pv~i~n~di--~sleY~~~~~kri~d~i~~s~ 376 (577) .+....+++....++. ++..... .-|++++.++ ++++ ..++++.+.|+... T Consensus 276 -------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~l~~~~~~~~~~---~~~~~l~~~i~~~s 341 (489) T protein:vir:99 276 -------LDDGRLNPNGRLAISIGFKKAQVLILDDNPNPNGVK----PQAYFLKKEYDTAGSE---AYKNRLVADILRFT 341 (489) T ss_pred -------hhhcccccccccccccccccceeeeeccccCccccc----cceeeeeecCChHHHH---HHHHHHHHHHHHHh Confidence 0000111111111111 1111111 2255555443 3333 44567777777544 Q ss_pred hcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHh---hcCcccccccc-cCCcccccCCHHHH Q lcl|NC_011222. 377 TGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLL---RYGDSFVSCNI-NYGTEFYIYTPEEL 451 (577) Q Consensus 377 ~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~l---Ryg~~~~~~ti-~yGskFy~~t~eeL 451 (577) -+-. .+.+...+.++++.+.-.-.+.+++.+-.+.|+++-+-+...++++ ..+........ .-.-.|....|..+ T Consensus 342 ~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~ 421 (489) T protein:vir:99 342 FTPDTQDMKFSGVQSGESMKYKLMASDNYREKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQND 421 (489) T ss_pred CCcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCH Confidence 2211 1223334668888888888888888877777777777666666654 22221111111 11347888888888 Q ss_pred HHHHHHHHHcC--CCHHHHHHHHHHHHHHhhcCCHHH-HHHHHHHHhhCCCcCccHHHHHHHHhcCCCch-hh-eeeeec Q lcl|NC_011222. 452 SERYKIMKETG--ASEAELDALRQQIIETEYRNDPTQ-MQRLLILNEIEPYSHLTREEAVNLYKENVISE-ED-LRVKLN 526 (577) Q Consensus 452 ~~~i~~Ak~~G--as~~~i~~L~~qi~e~EyrNnP~q-mqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~e-Ed-l~vk~n 526 (577) +++++++.+.. +|...+... ...+. -.||.+ |+|++- +-+.-..+ .++...|-.++ +. -.-| T Consensus 422 ~~~~~~~~kl~giis~et~~~~-l~~v~---~~d~~~E~~ri~~--E~~~~~~~-----~~~~~~~~~~~~~~~~~~~-- 488 (489) T protein:vir:99 422 NEIVTAAQNLYGIVSDQTIFEI-LNTVT---GVDAEAELKRLKE--EADKKQSL-----PEPRLVGDASGQEEPTAEK-- 488 (489) T ss_pred HHHHHHHHHHhccCCHHHHHHh-cCCCC---chhHHHHHHHHHH--HHHHHhcc-----ccccccCCCCCCcCCCCCC-- Confidence 88888776653 332222111 01111 014443 665543 21111110 11110111100 00 1111 Q ss_pred ch Q lcl|NC_011222. 527 LP 528 (577) Q Consensus 527 ~~ 528 (577) | T Consensus 489 -p 489 (489) T protein:vir:99 489 -P 489 (489) T ss_pred -C Confidence 1 No 27 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=81.63 E-value=0.084 Score=26.48 Aligned_cols=428 Identities=12% Similarity=0.019 Sum_probs=172.2 Q ss_pred cchHHHHHHhhCchhHHHHHHHH-HHHHHHHHHhhhhcccccchHHHH---HHHHHHHhhcchhHHHHHHHhhccC-CCc Q lcl|NC_011222. 3 KSLDEIREIYRHPEGISQIAKAK-EHEERIAFHTRVRTSDDRNKPVID---FLSKVKTWIAKDKYDIFLSMFHFPV-KTN 77 (577) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~aka~-~heeri~fh~~~~ta~d~~~~~~~---fl~~v~~~l~kdky~~f~~~f~fpv-~t~ 77 (577) ..++.|.++. .+.+ .|.+|+.-..++. .||.- -+.+-+.-..+.+-.........+. +-+ T Consensus 1 ~~~e~~~~~i---------~~~~~~~~~~~~~~~~~~------~Yy~g~hdi~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 65 (471) T protein:vir:10 1 MEIEVIKKII---------SSQMVKHGKFVSQAAEAE------KYYRNENDIKRKRKPADKKGAENEAKAEDNAFRNADN 65 (471) T ss_pred CCHHHHHHHH---------HHHHHHHHHHHHHHHHHH------HHhccccccccccchhhhhcccccccccccccccccc Confidence 2222332221 1111 2334432221111 11110 0000000000000000000001111 112 Q ss_pred cchHHHHHHHHHHhc----cCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccC-CC Q lcl|NC_011222. 78 GVTSEIFDKLSRVFD----GRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQV-GE 152 (577) Q Consensus 78 ~lt~~iF~~L~kV~d----gqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~-~~ 152 (577) -+.--.+..+....- |....+ .=.+.+..+-|..|.. +++.+..+.-+ +....-=.+++.|= +.. ++ T Consensus 66 ki~~n~~~~Ivd~~~~yl~G~p~~~--~~~~~~~~~~l~~~~~--n~~~~~~~~~~-~~~~~~G~~~~~v~---~d~~~g 137 (471) T protein:vir:10 66 RISHNWHQLLLDQKKAYALTYPPTF--DVDDKKVNDMIVDVLG--DDYERISKQLC-VNAGNAGIAWLHVW---KDASDN 137 (471) T ss_pred eeccchhHHHHHhhhhhhcccCcee--ccCChHHHHHHHHHHh--cCHHHHHHHHH-HHHhhCCeEEEEEE---eeCCCC Confidence 233233333333211 233332 2234455555555542 34444433333 33332224444441 221 34 Q ss_pred ccccchhhhhHHHHHHHHhhcccccceeeeee-e----cCCCEE---EEEeccceeeeccCCcc---------------- Q lcl|NC_011222. 153 KPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY-V----TDENKI---VYIDEERYVRFDKTREN---------------- 208 (577) Q Consensus 153 rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~-~----qD~N~i---~~IDd~~y~~y~kn~k~---------------- 208 (577) ++. +...-|.+.+--|... .+.+.+.+|-+ . .+...+ -+-++..+..|...+.+ T Consensus 138 ~~~-~~~~~p~~~~~i~d~~-~~~~~~~~ir~~~~~~~~~~~~~~~~~vy~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~ 215 (471) T protein:vir:10 138 SFR-YACVDSKEVIPIYSKS-LDKKSIGVLRVYSSIDETDGKNYTVYEYWNDKECSFYRHEKEKPLEELETFQAISLIDT 215 (471) T ss_pred eeE-EEEEcccceEEEEcCC-CCCceEEEEEEEEeeccCCCceeEEEEEEeCCcEEEEEecCCccccccccccccccccc Confidence 432 2222343333233332 23345555533 2 122222 22345555556544322 Q ss_pred ---ceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccC Q lcl|NC_011222. 209 ---DLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYES 285 (577) Q Consensus 209 ---ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~ 285 (577) ++..+....|.||.||.--|..... -.|=+....+..|.|=...+....-=.+++-|++...--++. T Consensus 216 ~~~~~~~~~~~~~~~g~iPvv~~~n~~~-------~~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~--- 285 (471) T protein:vir:10 216 MNGDRSSDNSFKHDFGLVPFIPFKNNEI-------ETNDLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGGQ--- 285 (471) T ss_pred ccccccccccccCCCCceeEEEeccCCC-------CCCchHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCcc--- Confidence 2223334459999999844433222 234444444444444333333333334566776643321110 Q ss_pred CCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHH Q lcl|NC_011222. 286 HDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEE 365 (577) Q Consensus 286 ~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~ 365 (577) . .+ . ...-+..+.++.++-+. +..++| |.+++++... +-....+ T Consensus 286 ----~---~~---------------~--------~~~~~~~~~~i~~~~~~-~~~~~~----~~~l~~~~~~-~~~~~~~ 329 (471) T protein:vir:10 286 ----D---KQ---------------E--------FLEDLKRYKMIKMDNDG-MGDQSG----VTTIAIDIPT-EARNLIL 329 (471) T ss_pred ----c---cc---------------h--------hHHHhhcCCeEEecCCC-CccCcc----ceEEeecCCh-HHHHHHH Confidence 0 00 0 00011122233333321 122333 5566655432 4445566 Q ss_pred HHHHHHHHhhhhcCCC-ccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccc Q lcl|NC_011222. 366 KRLRDELVRSVTGGEG-ELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFY 444 (577) Q Consensus 366 kri~d~i~~s~~Gf~~-d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy 444 (577) +++.+.|+...-+..- +.+.+ ..++++.+.-+-.+..++++..+.|++.-+-+...++++.=...+. .| --.|. T Consensus 330 ~~l~~~I~~~s~tp~~~~~~~g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~d~~--~i--~i~f~ 404 (471) T protein:vir:10 330 ERTKKQIFISGQGVNPETDKLG-NSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKHLGLSDKL--KI--KQTWT 404 (471) T ss_pred HHHHHHHHHHhCCcCCCccccc-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc--ee--EEEeC Confidence 7888888886633211 11122 3577888888888888888989999988888888887753111111 12 24577 Q ss_pred cCCHHHHHHHHHHHHHc-C-CCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHH--HhhCCCcCccHHHHHHHHhcCCCchh Q lcl|NC_011222. 445 IYTPEELSERYKIMKET-G-ASEAELDALRQQIIETEYRNDP-TQMQRLLIL--NEIEPYSHLTREEAVNLYKENVISEE 519 (577) Q Consensus 445 ~~t~eeL~~~i~~Ak~~-G-as~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL--~~leP~~~LT~~Ev~~l~e~g~~~eE 519 (577) ...|..+++.+++|.+. | +|..-+... --+=.|| .+|+|++.= ...+..++++ |...++ T Consensus 405 ~~~p~n~~e~~~~~~kl~g~iS~et~~~~------~p~v~D~~~E~eri~~E~~~~~~~~~~~~----------~~~~~~ 468 (471) T protein:vir:10 405 RNSINNDTEMAQVVSTLATITSRENVAKS------NPIVEDWQDELRLQKAEQEGRSEKLYDME----------EVEHES 468 (471) T ss_pred CCCCCCHHHHHHHHHHHhccCchHHHHHh------CCCCCCHHHHHHHHHHHHHHHHhcccccC----------CCCCcc Confidence 77788888877766553 3 232221110 0122344 345555331 1122111111 111111 Q ss_pred hee Q lcl|NC_011222. 520 DLR 522 (577) Q Consensus 520 dl~ 522 (577) +.- T Consensus 469 e~~ 471 (471) T protein:vir:10 469 EVE 471 (471) T ss_pred ccC Confidence 111 No 28 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=79.68 E-value=0.1 Score=26.02 Aligned_cols=442 Identities=13% Similarity=0.112 Sum_probs=174.3 Q ss_pred CCcchHHHHHHhhCchh----HHHHHHHHH-HHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccCC Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEG----ISQIAKAKE-HEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPVK 75 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~----~~~~aka~~-heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv~ 75 (577) -|.---|-.+.|+-|.. ...|++.+. |..|.+ +++.... .||.-= +.|.+. ++.++.. .--+. T Consensus 6 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~--~~~~~l~---~Yy~g~-~~i~~~-~~~~~~~-----~~ki~ 73 (470) T protein:vir:99 6 YGRDKVTGNSSFIFPKGEKLTSNELLGFIAYNETVLK--PRYRENM---KLYLGK-HKILTA-PEKETGA-----DNRIV 73 (470) T ss_pred CCcccccCCceEEeCCCCCcCHHHHHHHHHHHHHhhH--HHHHHHH---HHhccc-cccccC-cccccCC-----cceee Confidence 00000011111111100 112333322 221111 1111100 011000 000000 0001000 00000 Q ss_pred CccchHHHHHHHHHHhccCCccccccccC--hhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCc Q lcl|NC_011222. 76 TNGVTSEIFDKLSRVFDGRNPVYNYQFKS--SEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEK 153 (577) Q Consensus 76 t~~lt~~iF~~L~kV~dgqd~~~~y~f~~--~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~r 153 (577) +.+.+.|=+...--+=|+...++..=.+ .+.+++|. ..+++.+..+.-+=.+.... .++++|- +..+++ T Consensus 74 -~n~~~~Ivd~~~~~l~g~p~~~~~~~d~~~~~~l~~~~----~~n~~~~~~~~~~~~~~~~G-~~~~~v~---~d~dg~ 144 (470) T protein:vir:99 74 -VNSAKYVVDVYNGYFCGIEPKLALLNDSSKIDEIARWN----RQENFFDTINEISKQCDIFG-RSIASIY---QGEDAR 144 (470) T ss_pred -cchHHHHHHHHhhhhccCCeeEeeCCchhHHHHHHHHH----HhcCHhHHHHHHHHHHHhcC-eeEEEEE---eCCCCe Confidence 0122222222211222232222221001 11222221 12355555555555555554 4566663 445555 Q ss_pred cccchhhhhHHHHHHHHhhcccccceeeeee-ecCCC-E----E-EEEeccceeeeccCCcc-ceeeehhhhhhhcccce Q lcl|NC_011222. 154 PEPYFFWLPIANVLSYRTCGKDCNLMAYIMY-VTDEN-K----I-VYIDEERYVRFDKTREN-DLILEVDNMHDLGYCPA 225 (577) Q Consensus 154 pqpyf~~~pie~V~~y~~~~~~~~~i~~i~~-~qD~N-~----i-~~IDd~~y~~y~kn~k~-ei~~e~~~~H~lGy~PA 225 (577) |. ..+.-|.+..-.|...+ +.+.+.++-+ ..+++ . + ++.++..| .|...+-+ +.......-|.||.||- T Consensus 145 ~~-i~~~~p~~~~~i~d~~~-~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~vPv 221 (470) T protein:vir:99 145 PH-LMYSSPNHAFIIYDDTV-QRQPLAFVHYQIDNSNNWTDAYGVIQYADKFY-KFKGYDIEEDTNAAGYAINPYGLVPA 221 (470) T ss_pred EE-EEEEccceeEEEEcCCC-CcceEEEEEEEEEecCCeeEEEEEEEecCeEE-EEEecccccccccccccccCCCccce Confidence 43 22223333222232222 2334555544 23322 2 3 33444444 44443322 33333344499999997 Q ss_pred eeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceeccccccc Q lcl|NC_011222. 226 RFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWI 305 (577) Q Consensus 226 ~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~ 305 (577) --+.-+. .-.|=+..+++..|.|-...+....--.+++-|+....-- . ..+. ++| T Consensus 222 v~~~n~~-------~g~sd~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~--~---~~~~---~~g---------- 276 (470) T protein:vir:99 222 VEFFENE-------ERQGIFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGF--K---LPED---DEG---------- 276 (470) T ss_pred EeecCCC-------CCCcchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecC--C---cccc---ccc---------- Confidence 4432222 3445555555555555444444444334566777664220 0 0000 011 Q ss_pred ccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCC-Cccc Q lcl|NC_011222. 306 TGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGE-GELN 384 (577) Q Consensus 306 tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~-~d~q 384 (577) .+ ......+-++.+|.. .+..++| |+.++++. ..+-....++++.+.|+...-+.. ...+ T Consensus 277 -----~~--------~~~~~~~~~~~~~~~-~~~~~~~----~~~l~~~~-~~~~~~~~~~~l~~~i~~~s~~p~~~~~~ 337 (470) T protein:vir:99 277 -----NP--------KFDFKNNRVLYVSQL-DPDTNPQ----IGFIAKPD-ADQMQENLIQHLTDFIFMMAMVPNIQDKN 337 (470) T ss_pred -----ch--------hhhhhhcceeeecCC-CCCCCCc----ceEEeecC-ChHHHHHHHHHHHHHHHHHhCCccccccc Confidence 11 011222334444433 2233344 56666553 222233446677777776542211 1122 Q ss_pred cchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh--cCcccccccccCCcccccCCHHHHHHHHHHHHHc- Q lcl|NC_011222. 385 RSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR--YGDSFVSCNINYGTEFYIYTPEELSERYKIMKET- 461 (577) Q Consensus 385 ~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR--yg~~~~~~ti~yGskFy~~t~eeL~~~i~~Ak~~- 461 (577) .+-+.|+++.++.+..+.+++.+..+.|+++-+-+...++++. -+...++. ..-.-.|....|..+++..+++.+. T Consensus 338 ~~~n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~~-~~i~v~f~~~~p~~~~e~a~~~~kl~ 416 (470) T protein:vir:99 338 FAGNSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLATLFNNKQDQELW-SELDFKFTRNLPEDMASAIDNAKNAE 416 (470) T ss_pred cccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCccccc-ccceEEeCCCCCcCHHHHHHHHHHHh Confidence 2335688888888889999999999999998888888887763 12222211 1123446666666666666655442 Q ss_pred C-CCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHhhCCCcCccHHHH--HHHHhcCCCchhh Q lcl|NC_011222. 462 G-ASEAELDALRQQIIETEYRNDP-TQMQRLLILNEIEPYSHLTREEA--VNLYKENVISEED 520 (577) Q Consensus 462 G-as~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~leP~~~LT~~Ev--~~l~e~g~~~eEd 520 (577) | +|...+.... - +- || .+|+|++-=+ +.-...+.+.. .+.-+...-+||| T Consensus 417 giis~et~l~~l-~-----~v-d~~~E~eri~~E~--~~~~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 417 GIVSKKTQLGMI-P-----DI-EPDAEMKQIAKEK--ADAIKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred ccCCHHHHHHhC-C-----CC-CHHHHHHHHHHHH--HHHHHHHHhhcCCCCcCCCCCCccCC Confidence 3 3332221110 0 11 32 3355543221 11111111111 1223344556666 No 29 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=77.97 E-value=0.12 Score=25.65 Aligned_cols=454 Identities=13% Similarity=0.073 Sum_probs=183.2 Q ss_pred CCcch-------HHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhc--ccccchHHHHHHHHHHHh-hcc-hhHHHHHHH Q lcl|NC_011222. 1 MGKSL-------DEIREIYRHPEGISQIAKAKEHEERIAFHTRVRT--SDDRNKPVIDFLSKVKTW-IAK-DKYDIFLSM 69 (577) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~t--a~d~~~~~~~fl~~v~~~-l~k-dky~~f~~~ 69 (577) |..+| +..++.-.|+ |.+..||-.-.. ..+....++.|++..+.- .|+ +|+.+|..- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~r~~~l~~yY~g 68 (501) T protein:vir:27 1 MEQTLFTDSTGQDLVLNLRFHR------------ESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARG 68 (501) T ss_pred CCceeEEeccchhhhhhcccCh------------hHHHhhccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 32222 2233333333 234444433221 222222344444433211 111 222222211 Q ss_pred hhccCCC-----------ccchHHHHHHHHHH----hccCCccccccccCh---hhhhh-hhHHHhhccchhHHHHHHHH Q lcl|NC_011222. 70 FHFPVKT-----------NGVTSEIFDKLSRV----FDGRNPVYNYQFKSS---EDRDD-WEYYRKDVLKEPSVWSTDGW 130 (577) Q Consensus 70 f~fpv~t-----------~~lt~~iF~~L~kV----~dgqd~~~~y~f~~~---e~~~d-~~~y~se~ln~~~fw~~~~f 130 (577) =+-.+.+ +-++--....+... +=|....++. .+. +.+++ |..+. ..+++.+..+.-+= T Consensus 69 ~~~~i~~~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~--~d~~~~~~~~~~l~~~~-~~n~~~~~~~~~~~ 145 (501) T protein:vir:27 69 ENHDVLQFGRRKDREMADKRAVHNYGRMISKFKTGYLAGNPIRVEY--DDNDNNSQNDDTIKRIG-RINDIDSHNRTLIR 145 (501) T ss_pred CCccccccCccCccccccceeccchHHHHHHHHhhhhcccCeeEec--CCccchHHHHHHHHHHH-HhcChhHHHHHHHH Confidence 1111110 11111111111111 2223333322 221 11222 33333 22467776666666 Q ss_pred HHHHhCCCeEEEEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeeeec----CCC--EEEEEeccceeeecc Q lcl|NC_011222. 131 DNFKHRINSVLVVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMYVT----DEN--KIVYIDEERYVRFDK 204 (577) Q Consensus 131 k~~~~~~NgviVVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~~q----D~N--~i~~IDd~~y~~y~k 204 (577) ++..... ++++|- .+.+++|.=-+ .-|.+....|... ...+.+.++.+.. +++ .+-+-+++...+|.. T Consensus 146 ~~~~~G~-a~~~vy---~ded~~~~i~~-~~p~~~~~v~d~~-~~~~~~~~ir~~~~~~~~~~~~~~~vyt~~~v~~~~~ 219 (501) T protein:vir:27 146 DLSQTGR-AYEVIY---RNEYDETRIKR-LNPLETFVIYDNS-LEDNSIAAVRYYNRGTLQNAKDVVEIYTNEHIYTLDA 219 (501) T ss_pred HHhhCCe-EEEEEE---eCCCCceEEEE-EccceeEEEecCC-CCCceEEEEEEEEeeecCCcEEEEEEEeCCeEEEEEe Confidence 6666554 666663 34445442111 1233222223221 1234566665521 122 144455666666665 Q ss_pred CCccceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCcc Q lcl|NC_011222. 205 TRENDLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYE 284 (577) Q Consensus 205 n~k~ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~ 284 (577) ++ .+.......|.||.||.--|. ++ + .-.|-+..+++..|.|-...+....--.+.+-|+....... T Consensus 220 ~~--~~~~~~~~~~~~g~vPvv~~~-----nn-~-~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~---- 286 (501) T protein:vir:27 220 SD--DFNEISVTTHAFGTVPITEFL-----NN-V-DGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGDL---- 286 (501) T ss_pred CC--ceeeccccccCCCcccEEEec-----CC-C-CCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc---- Confidence 54 333334456999999984442 22 2 45666666665555555544444443345566666543210 Q ss_pred CCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccH--HHHHHHh Q lcl|NC_011222. 285 SHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADT--GSLEYNV 362 (577) Q Consensus 285 ~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di--~sleY~~ 362 (577) .+ +.| ..+.. +....++.++.|.......+ .-.+++++++. ++++.+ T Consensus 287 ----~~--~~~------------~~~~~-----------~~~~~~~~~~~~~~~~~~~~-~~~~~~l~~~~~~~~~~~~- 335 (501) T protein:vir:27 287 ----AL--PKG------------MQASD-----------MKRTRLMQLKPPKSADGKEG-TVKAEYLTKSYDVSGAEAY- 335 (501) T ss_pred ----cC--Ccc------------cchhh-----------hhhcCceeecccccccCCCC-CcceeeeeccCCHHHHHHH- Confidence 00 001 11111 11223444444322111111 12366666543 445544 Q ss_pred hHHHHHHHHHHhhhhc--CCCccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHh---hcCcccccccc Q lcl|NC_011222. 363 NEEKRLRDELVRSVTG--GEGELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLL---RYGDSFVSCNI 437 (577) Q Consensus 363 ~~~kri~d~i~~s~~G--f~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~l---Ryg~~~~~~ti 437 (577) ++++.+.|+...-+ +.. .+..-..++++.+.....+.+++....+.|++.-+-+...++++ ......+.. . T Consensus 336 --~~~l~~~I~~~s~~p~~~~-~~~~~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~-~ 411 (501) T protein:vir:27 336 --KTRLNRDIHIFTNIPDMSD-TNFSGNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDE-S 411 (501) T ss_pred --HHHHHHHHHHHhCCcccCc-cccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-c Confidence 45555566554322 111 12223468888888888888888888888888888777777764 221111111 0 Q ss_pred cCCcccccCCHHHHHHHHHHHHH-cC-CCHHHHHHHHHHHHHH-hhcCCH-HHHHHHHHHHhhCCCcCccHHHHHHHHhc Q lcl|NC_011222. 438 NYGTEFYIYTPEELSERYKIMKE-TG-ASEAELDALRQQIIET-EYRNDP-TQMQRLLILNEIEPYSHLTREEAVNLYKE 513 (577) Q Consensus 438 ~yGskFy~~t~eeL~~~i~~Ak~-~G-as~~~i~~L~~qi~e~-EyrNnP-~qmqr~~vL~~leP~~~LT~~Ev~~l~e~ 513 (577) .-.-.|....|..+++.+.++.+ +| +|... .++. -+=.|| .+|+|++--+ +.- ..+.. . T Consensus 412 ~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~et-------~l~~l~~v~D~~~E~eri~~E~--~e~---~~~~~-----~ 474 (501) T protein:vir:27 412 LLKITFTPNLPKSLNEQVSILTGLGGQVSQET-------ALSLSGLVESPNEELDKINKEV--SEI---DFKGY-----S 474 (501) T ss_pred cceEEeCCCCCcCHHHHHHHHHHHhccCcHHH-------HHHhCCCCCCHHHHHHHHHHHH--Hhh---hHhhh-----c Confidence 01234666667777776665543 33 22211 1111 122232 2344433221 100 00000 0 Q ss_pred CCCchhheeeeecchhhhhhhhccCCchhh Q lcl|NC_011222. 514 NVISEEDLRVKLNLPTFVRRFERENMNIIE 543 (577) Q Consensus 514 g~~~eEdl~vk~n~~~fv~rfe~en~~i~e 543 (577) +-.+++ ++=.=+.-=..=++|.-+-+| T Consensus 475 ~~~~~~---~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 475 NDFNEH---VGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred Cccccc---cccccCCCCCCccccccccCC Confidence 000000 000000000111333333444 No 30 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=76.60 E-value=0.13 Score=25.37 Aligned_cols=435 Identities=12% Similarity=0.064 Sum_probs=178.7 Q ss_pred CCcch-HHHHHHhhCch---hHHHH-HHHH-HHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcc-hhHHHHHHHhhcc Q lcl|NC_011222. 1 MGKSL-DEIREIYRHPE---GISQI-AKAK-EHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAK-DKYDIFLSMFHFP 73 (577) Q Consensus 1 ~~~~~-~~~~~~~~~~~---~~~~~-aka~-~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~k-dky~~f~~~f~fp 73 (577) .|..- ++.-.+.+-|. -+.+. .+.+ .|.+|+.=..++. .||.- + -.++.+ +|+.. ..-..+ T Consensus 15 ~~~~~~~~~~~~~~~~~~~e~~~~~i~~~i~~~~~~~~r~~~l~------~YY~g---~-~~i~~~~~~~~~--~~~~~~ 82 (483) T protein:vir:12 15 SQPTQTEIFDAIVRTNNKPETLEEMIVRYIKQHLEKLPEISIGQ------EYYEQ---R-PDIVKEPKPVDA--TGAVDP 82 (483) T ss_pred CcchhhhhhhcccccCCchhhHHHHHHHHHHHHHHHHHHHHHHH------HHhcc---c-cccccccccccc--cccccc Confidence 23322 22223333222 22221 1221 3344432111111 01110 0 001110 11100 000001 Q ss_pred CCC-----ccchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccc Q lcl|NC_011222. 74 VKT-----NGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEV 148 (577) Q Consensus 74 v~t-----~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i 148 (577) .+. ..+.+-|=+...--+=|.... |.=.+.+..+-|..|+. +++-+..+.-+=++... =.++++|- + T Consensus 83 ~~~~~ki~~n~~k~Ivd~~~~~l~G~p~~--~~~~d~~~~~~l~~~~~--n~~~~~~~~~~~~~~~~-G~~y~~v~---~ 154 (483) T protein:vir:12 83 LKPDDRMITNFHANLVDQKVSYIVGKPIA--FKHTDDEVVKRIDEVLG--NRFDDKLHSVLTGASNK-GIEWLHPY---L 154 (483) T ss_pred cccccccccchHHHHHHHHhhhhcccCce--eccCChHHHHHHHHHHh--ccHHHHHHHHHHHHhhC-CeEEEEEE---E Confidence 000 111111111111111222222 22345555555666653 34555554433333333 35777775 4 Q ss_pred cCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCcccee---------eeh-hh Q lcl|NC_011222. 149 QVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTRENDLI---------LEV-DN 216 (577) Q Consensus 149 ~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~ei~---------~e~-~~ 216 (577) +.+++|. +-..-|.+....|.. ....+.+.++-+ .+++.++-+.++.....|..++ +.+. .++ .. T Consensus 155 d~d~~~~-i~~~~p~~~~~v~d~-~~~~~~~~~ir~~~~~~~~~~~~y~~~~v~~~~~~~-~~~~~~~~~~~~~~~~~~~ 231 (483) T protein:vir:12 155 DEEGEFK-LFRVPAEQGIPIWTD-KEHEELEAFIRMYKLENETKVEYWDKVTVNYYVYEN-GSLIPDYSNNLENSKTHFS 231 (483) T ss_pred cCCCceE-EEEEcccceEEEEcC-CCCCceEEEEEEEEeecceEEEEEecCeEEEEEEeC-Ceeeecccccccccccccc Confidence 5566543 223334443333432 112345666655 4566667677776666664443 2111 112 22 Q ss_pred hhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcc Q lcl|NC_011222. 217 MHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGF 296 (577) Q Consensus 217 ~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~ 296 (577) -|.||.||.--|..+..+.| ++ +++-..++|+|+.++..... =.+.+-|+.+... ..+.. .+ T Consensus 232 ~~~~g~vPvv~~~nn~~g~s--d~--e~v~~liDa~d~~~S~~~~~---~~~~~~~~lv~~g-------~~~~~---~~- 293 (483) T protein:vir:12 232 TGSWGKIPFIPFKNNDLEIS--DI--FMYKTLIDAYNRRLSDLSNT---FKDSNELTYVLTN-------YDDQE---LP- 293 (483) T ss_pred cCCCCccceEEecCCCCCCC--ch--hhHHHHHHHHHHHHHHHHHH---HHHhcCceeeeec-------CCccc---ch- Confidence 39999999855544333333 22 23444446666655443332 2456777765321 00100 00 Q ss_pred eecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhh Q lcl|NC_011222. 297 LKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSV 376 (577) Q Consensus 297 i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~ 376 (577) ++. .....+..+.+ | ++.| |++++.+... +-....++++.+.|+... T Consensus 294 ------~~~----------------~~~~~~~~~~~--~----~~~~----~~~l~~~~~~-~~~~~~~~~l~~~I~~~s 340 (483) T protein:vir:12 294 ------EFK----------------RLLRYYGAIKV--S----DNGG----VDTIQVEVPV-ENSKKYLDELYQKIMLFG 340 (483) T ss_pred ------hHH----------------Hhhhhcccccc--C----CCCc----ceEEeecCCH-HHHHHHHHHHHHHHHHHh Confidence 000 00112222222 1 1223 4444433322 334445667777777654 Q ss_pred hcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHHHHHHHH Q lcl|NC_011222. 377 TGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPEELSERY 455 (577) Q Consensus 377 ~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~eeL~~~i 455 (577) -... .+.+.+-+.++++.+.-.-++.+++.+..+-|++.-+-+..+++++. |...-...| --.|....|..+++.+ T Consensus 341 ~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~-~~~~~~~~i--~v~f~~~~p~~~~~~a 417 (483) T protein:vir:12 341 QAVDFSSDKFGSAPSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHF-DIKGEHKDV--DISFNYNKVANTELQV 417 (483) T ss_pred CCCCCCccccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCCcccee--eEEeCCCCCCCHHHHH Confidence 2211 11122345788888888999999999999999999888888888763 322111122 2456777777777777 Q ss_pred HHHHH-cCC-CHHHHHHHHHHHHHH-hhcCCH-HHHHHHHHH-----HhhCCCcCccHHHHHHHHhcCCCchh Q lcl|NC_011222. 456 KIMKE-TGA-SEAELDALRQQIIET-EYRNDP-TQMQRLLIL-----NEIEPYSHLTREEAVNLYKENVISEE 519 (577) Q Consensus 456 ~~Ak~-~Ga-s~~~i~~L~~qi~e~-EyrNnP-~qmqr~~vL-----~~leP~~~LT~~Ev~~l~e~g~~~eE 519 (577) +++.+ +|+ |... .++. -+-.|| .+|+|++.= +...++.+---+.-.+--+.|=...| T Consensus 418 ~~~~kl~GiiS~et-------~~~~~~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~d~~~~~~~~~~~e~e 483 (483) T protein:vir:12 418 QTAQQSMGIVSHET-------VLENHPFVEDLQAELERIEQEQMEYNKQLPNLDDGGADGAQQQERSNNKESE 483 (483) T ss_pred HHHHHHhccCchHH-------HHHhCCCCCCHHHHHHHHHHHHHHHHhhcccccccccCCcccCCCCCcccCC Confidence 66543 442 2211 1111 111233 244443321 11111111000000011111222222 No 31 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=74.03 E-value=0.16 Score=24.90 Aligned_cols=419 Identities=13% Similarity=0.054 Sum_probs=175.8 Q ss_pred cchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccC-CCccc-- Q lcl|NC_011222. 3 KSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPV-KTNGV-- 79 (577) Q Consensus 3 ~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv-~t~~l-- 79 (577) -..++|++..++ |.+|+.-..++.- ||.. +.-+++.+-...-.-...+. +.|-| T Consensus 1 l~~~~i~~~i~~------------~~~~~~r~~~~~~------YY~g-----~~~i~~~~~~~~~~~~~~~~~~~~ki~~ 57 (451) T protein:vir:10 1 MELEKIRAIISA------------DAARRQEILQAKS------YYYN-----KNDILKKGVVVQNRDENPLRNADNRISH 57 (451) T ss_pred CCHHHHHHHHHH------------HHHHHHHHHHHHH------Hhcc-----cCcccccccccccccccccccccccccc Confidence 445555554322 3333211111110 1110 01111100000000011111 11112 Q ss_pred --hHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCcc--- Q lcl|NC_011222. 80 --TSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKP--- 154 (577) Q Consensus 80 --t~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rp--- 154 (577) .+-|=+..-.-+-|....|+.. .+.+..+-|+.|+. +++.+.-.. ..+....-=.++.+|-+.+ ....+. T Consensus 58 n~~~~Ivd~~~~yl~G~p~~~~~~-~~~~~~~~~~~~~~--n~~~~~~~~-~~~~~~~~G~a~~~~y~de-~~~~~~~~~ 132 (451) T protein:vir:10 58 NFHEILVDEKASYMFTYPVLFDID-NNKELNEKVTDVLG--NEFTRKAKN-LAIEASNCGSAWLHYWIDE-EYSGEQVTN 132 (451) T ss_pred chHHHHHHhhhhheecccceeecC-CcHHHHHHHHHHhc--cCHHHHHHH-HHHHHhhcCeEEEEEeecC-Ccccccccc Confidence 2222222222223444433321 11222233444432 344443333 3333344445566654222 111111 Q ss_pred -ccchhhhhHHHHH-HHHhhcccccceeeeeee---cCCC---------EEEEEeccceeeecc---CCccceeeehhhh Q lcl|NC_011222. 155 -EPYFFWLPIANVL-SYRTCGKDCNLMAYIMYV---TDEN---------KIVYIDEERYVRFDK---TRENDLILEVDNM 217 (577) Q Consensus 155 -qpyf~~~pie~V~-~y~~~~~~~~~i~~i~~~---qD~N---------~i~~IDd~~y~~y~k---n~k~ei~~e~~~~ 217 (577) ..-+-..+-++++ -|.. ..+.+.+.+|-+. .+.+ ++-+.+++.+-.|.- ...++...+...- T Consensus 133 ~~~~~~~i~p~~~~~vydd-~~~~~~~~~ir~~~~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (451) T protein:vir:10 133 QTFKYGVVNTEEIIPIYRN-GIERELEAVIRYYIQLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHITVQ 211 (451) T ss_pred cceeEEEEcccceEEEEcC-CCCCceEEEEEEEEeeecccccccceEEEEEEEEeCCeEEEEEecccCcccccccccccc Confidence 1112112223332 2322 1233455666441 2222 233455555555542 2223444444445 Q ss_pred hhhcccceeeEecccccccCcceeeccchhHH---HHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccC Q lcl|NC_011222. 218 HDLGYCPARFFWSDSISLSEPDIKISPITSEL---DSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDD 294 (577) Q Consensus 218 H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L---~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~ 294 (577) |.||.||.--|+.... ..|=++.+. +|.|..++-. ..-=.|++-|+....- ..|.. .. T Consensus 212 ~~~g~vPvv~~~nn~~-------~~~d~e~v~~liDa~~~~~S~~---~~~~~~~~~~~l~~~g-------~~~~~--~~ 272 (451) T protein:vir:10 212 HRFNSVPFVEFSNNIK-------KQSDLSKYKKILDLYDRVMSGF---ANDLEDIQQIIYILEN-------FGGED--TS 272 (451) T ss_pred CCCCeeeEEEeccCCC-------CCCchhhHHHHHHHHHHHHHHH---HHHHHHhccceeeeec-------CCccc--ch Confidence 9999999854433222 234444444 5555544433 3222355666654321 01100 00 Q ss_pred cceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHh Q lcl|NC_011222. 295 GFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVR 374 (577) Q Consensus 295 G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~ 374 (577) ++. ..+..+.++.++. ..+..++| |.+++.+.. .+-..+.++++.+.|+. T Consensus 273 ~~~------------------------~~~~~~~~i~~~~-~~~~~~~~----~~~l~~~~~-~~~~~~~~~~l~~~I~~ 322 (451) T protein:vir:10 273 EFL------------------------KELKRYKTIKTET-DSEGDSGG----LKTMQIEIP-TEARKIILEILKKQIYE 322 (451) T ss_pred hhH------------------------HHHhhCCeEEecC-cCCccCCc----ceEEeecCC-HHHHHHHHHHHHHHHHH Confidence 000 0122223444442 23334455 566666542 34455677888888887 Q ss_pred hhhcCCCccccc--hhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHHHHH Q lcl|NC_011222. 375 SVTGGEGELNRS--EAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPEELS 452 (577) Q Consensus 375 s~~Gf~~d~q~~--kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~eeL~ 452 (577) ..-+ ++++.+ -..++++.+..+-.+..++....+.|+++-.-++..++++. |..-+ ..| ...|...-|..++ T Consensus 323 ~s~~--p~~~~~~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~-~~~d~-~~i--~i~f~~~~p~n~~ 396 (451) T protein:vir:10 323 SGQG--LQQDTENFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYFL-GVTDY-KKI--QQTYTRNMMSNDL 396 (451) T ss_pred HhCc--ccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCCc-cce--eEEecCCCCCCHH Confidence 6533 222221 14688889999999999999999999999999999998873 32111 122 3468888888888 Q ss_pred HHHHHHHHc-CC-CHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHhhCCCcCccHHHHHHHHhcCCCch Q lcl|NC_011222. 453 ERYKIMKET-GA-SEAELDALRQQIIETEYRNDPTQMQRLLILNEIEPYSHLTREEAVNLYKENVISE 518 (577) Q Consensus 453 ~~i~~Ak~~-Ga-s~~~i~~L~~qi~e~EyrNnP~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~e 518 (577) +.++++.+. |+ |..-+..+ . -+=.||.+..++..-.+-+ ..++..+ .-|-+++ T Consensus 397 e~~~~~~kl~g~iS~et~~~~-~-----p~v~d~~~e~~~~~ee~~~-----~~~~~~~--~~~~~~~ 451 (451) T protein:vir:10 397 EDADIATKSVGIIPTKIILRH-H-----PWVDDVEEAEKLYLEEKKI-----QASKVSD--DYNNFTE 451 (451) T ss_pred HHHHHHHHHhccCchHHHHHh-C-----CCCCCHHHHHHHHHHHHHH-----HHHHHHh--hcCCCCC Confidence 888877654 42 22211111 1 1222444332221111100 1111111 1233333 No 32 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=66.43 E-value=0.27 Score=23.72 Aligned_cols=411 Identities=10% Similarity=0.019 Sum_probs=163.9 Q ss_pred HhhccCCCccchHHH-------------------HHHHHHHhccCCcccccc---------------------------- Q lcl|NC_011222. 69 MFHFPVKTNGVTSEI-------------------FDKLSRVFDGRNPVYNYQ---------------------------- 101 (577) Q Consensus 69 ~f~fpv~t~~lt~~i-------------------F~~L~kV~dgqd~~~~y~---------------------------- 101 (577) |-.||.. +||.+- .++|.+-|+|+..-..+. T Consensus 1 ~~~~p~~--~l~~~~~~~~~~~~l~~~~~~~~~r~~~~~~YY~g~~~i~~~~~~~~~~~~~~~~~~~~~n~~~~iVd~~~ 78 (479) T protein:vir:99 1 MIDLPDE--DLSSEGLAKYLETKVFPKMNTECERLDDFEAWTKNGQEVPDLATRHKNKEREVLQQLSRKPWMGLMVNSFA 78 (479) T ss_pred CccCCcc--cCChhHHHHHHHHHHHHHHHHHhHHHHHHHHHHhcCCcccccccccCChhHHHHHHHhhcCcHHHHHHHHH Confidence 4444432 122110 223334444433221111 Q ss_pred -------ccChh--hhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCccccchhhh-hHHHHHHHHh Q lcl|NC_011222. 102 -------FKSSE--DRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKPEPYFFWL-PIANVLSYRT 171 (577) Q Consensus 102 -------f~~~e--~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rpqpyf~~~-pie~V~~y~~ 171 (577) |+.++ ..+.+....++ +++-..-+.-+-.+.... .++++|--..-..++.+.|.+-.. |.+.+-.|.. T Consensus 79 ~~l~~~gf~~~d~~~~~~~~~i~~~-N~~d~~~~~~~~~a~~~G-~af~~v~~~~~~~d~~g~~~i~~~~p~~~~~iydd 156 (479) T protein:vir:99 79 QQLIVDGYRKTGTNENAKGWDTWRL-NQMDKQQFWLNRAVLTFG-YAFIKVTSGISPLDGTTVARIKCIDPRDAFAIWED 156 (479) T ss_pred hhcccccccCCCchhhHHHHHHHHh-cChhHHHHHHHHHHhhcC-ceEEEEecCCCCcCCCCceEEEEechhheEEEecC Confidence 11111 01000000001 122222222222333333 367777421112355666655333 4333334433 Q ss_pred hcccccceeeeeeecCCCEEEEEeccceeeeccCCccceeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHH Q lcl|NC_011222. 172 CGKDCNLMAYIMYVTDENKIVYIDEERYVRFDKTRENDLILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDS 251 (577) Q Consensus 172 ~~~~~~~i~~i~~~qD~N~i~~IDd~~y~~y~kn~k~ei~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a 251 (577) ... .+...|.....+.+...+-.+..+.+|.... +.|..+...-|.||.||.--| -.+-.. .+ .-.|=|...++. T Consensus 157 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~h~~g~vPvv~f-~n~~~~-~~-~g~sd~e~v~~l 231 (479) T protein:vir:99 157 PYW-DEWPKYLLERQPNGQYWWWTEEDYSIFEFKQ-GKFIYRETVSHDYGHIPFVRY-VNVMDL-RG-VCYGDVEPLVTV 231 (479) T ss_pred Ccc-cceeeEEEeecCceeEEEEecceEEEEEecC-CceeeccccccCCCCcceEEe-ecCCCc-Cc-CCcchhHHHHHH Confidence 222 1222232223444455556666666776654 678776666699999998433 222111 12 456666666666 Q ss_pred HHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceee Q lcl|NC_011222. 252 FDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVE 331 (577) Q Consensus 252 ~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~e 331 (577) .|.|-...+....--.+.+.|+.+..- + ...+ =+++ .. .+..+..+.++. T Consensus 232 iDa~~~~~s~~~~~~~~~a~p~~~i~G--~-----~~~~-~~~~---------------~~-------~~~~~~~~~i~~ 281 (479) T protein:vir:99 232 AKAIDKTGLDILLVQHHQSFQIRWATG--L-----MLPE-GANA---------------DQ-------EKMRFAQESMLI 281 (479) T ss_pred HHHHHHHHHHHHHHHHHhhchhhhhcC--C-----Cccc-cccc---------------ch-------hcccccccccee Confidence 666655555544444556777655431 0 0000 0000 00 011122222332 Q ss_pred eecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhcCCCccccchh--hcccceeccHHHHHHHHHHHh Q lcl|NC_011222. 332 IPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTGGEGELNRSEA--INEKQVKAGFESLTTKLNRIK 409 (577) Q Consensus 332 vPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~Gf~~d~q~~kA--~ne~~V~s~~ds~~~~l~~ik 409 (577) ++= .. +++...+-++++-+.+..+.+.-.|. +++|.. +-..+.+ .++.+.+.-...+.+++.+.. T Consensus 282 ~~~-----~~------~~~~q~~~~~~~~~~~~l~~~i~~i~-~~t~~p-~~~~g~~~n~Sg~Al~~~~~~l~~ka~~~~ 348 (479) T protein:vir:99 282 SQN-----EK------ASFGAIPAAPLDGLLNAYKESLLEFL-ALAQLP-PHIAGQIVNVAADALAAGTRQTMQKLFEKQ 348 (479) T ss_pred ecC-----CC------ceEEEecccchHHHHHHHHHHHHHHh-ccCCCC-HHHcccccchHHHHHHHHHHHHHHHHHHHH Confidence 211 11 23444445556656555554443332 333321 1111111 467778888888888888888 Q ss_pred hhHHHHHHHHHHHHHHhhcCcc---cccccccCCcccccCCHHHHHHHHHHHHH---c-CCCHHHHHHHHHHHHHHhhcC Q lcl|NC_011222. 410 RGFEEAQTFVDSTICLLRYGDS---FVSCNINYGTEFYIYTPEELSERYKIMKE---T-GASEAELDALRQQIIETEYRN 482 (577) Q Consensus 410 knfe~~k~Fvldti~~lRyg~~---~~~~ti~yGskFy~~t~eeL~~~i~~Ak~---~-Gas~~~i~~L~~qi~e~EyrN 482 (577) +-|.++-+=++..+++++=+.. .+..++.+. +..++. +.++..++.+ + ++|...+..+.- .+ T Consensus 349 ~~f~~al~~~~~l~~~~~~~~~~~~~~~i~~~w~---~~~~~s-~~~~ad~~~kl~~ag~is~et~l~~l~-gv------ 417 (479) T protein:vir:99 349 ATWKASHNQTMRLVNKIEGRTEEATDLDFTITWQ---DVTIQS-LAQFADAWAKMVESLKIPAEGVWDMIP-NL------ 417 (479) T ss_pred HHHHHHHHHHHHHHHHHcCCCccccceeeeEEec---CCCCCC-HHHHHHHHHHHHhcCCCCHHHHHHhcC-CC------ Confidence 8888877777777777764322 133344432 111222 3333333332 3 456544432210 11 Q ss_pred CHHHHHHHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeeecchhhhhhhhccCCchhhhh--hccChhhh Q lcl|NC_011222. 483 DPTQMQRLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKLNLPTFVRRFERENMNIIEFG--SALDYKKK 553 (577) Q Consensus 483 nP~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~n~~~fv~rfe~en~~i~efg--~~l~~~~k 553 (577) ++.|++|..-.++-++= .++..+....|....+.=.-.=+-+.-- +-+| ..| ++|+-.-- T Consensus 418 ~~~~~e~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~----~~~~~~~~~~~~~ 479 (479) T protein:vir:99 418 DQSTVNGWKEIYDREGD----FGKYMRKLQNGPDPAEQRGGPNGATNMQ---QANN----KTGEPASLNKSGA 479 (479) T ss_pred CHHHHHHHHHHHHHHHH----HHHHHHHHhcccCcccccCCCCCCCCCC---CCCC----CCcchhccCCCCC Confidence 34556655444433211 1111111111111100000000000000 0000 000 00000000 No 33 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=65.76 E-value=0.28 Score=23.63 Aligned_cols=459 Identities=12% Similarity=0.057 Sum_probs=187.3 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhc--ccccchHHHHHHHHHHHh-hcc-hhHHHHHHH-----hh Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRT--SDDRNKPVIDFLSKVKTW-IAK-DKYDIFLSM-----FH 71 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~t--a~d~~~~~~~fl~~v~~~-l~k-dky~~f~~~-----f~ 71 (577) .|--+...++.-.|+ |.++.||....+ ..+....++.|.++-+.- +|+ +|+..+..- .+ T Consensus 9 ~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~i~~~i~~h~~~~~~rl~~l~~yY~g~~~~i~~ 76 (502) T protein:vir:48 9 DSTGQDLVLNLRFHR------------ESRIRYRADNLEELMVNNWELLKNFINHHKLRQAPRIQELLDYARGENHDVLK 76 (502) T ss_pred ecchhHHHhhcccCh------------hHHhhhcccchhhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccccc Confidence 233334444444443 234555544433 222223344444332211 111 122222111 11 Q ss_pred ccCC------CccchHHHHHHHHHH----hccCCcccccc-c-cChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCe Q lcl|NC_011222. 72 FPVK------TNGVTSEIFDKLSRV----FDGRNPVYNYQ-F-KSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINS 139 (577) Q Consensus 72 fpv~------t~~lt~~iF~~L~kV----~dgqd~~~~y~-f-~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~Ng 139 (577) -+.. .+-++--....+... +=|....+++. - .+..-.+-|+.+.+. +++-+..+.-+=++.... .+ T Consensus 77 ~~~~~~~~~~~~ki~~n~~k~Ivd~~~~yl~g~p~~~~~~d~~~~~~~~~~l~~~~~~-N~~~~~~~~~~~~~~~~G-~a 154 (502) T protein:vir:48 77 SGRRKDNEMADKRAVHNYGRMISKFKTGYLAGNPIRVEYDDNEDNSQNDDAIKRIGRI-NDIDTHNRNLIRDLSQTG-RA 154 (502) T ss_pred cccccccccccceeecchHHHHHHHHhhhhcccCeeEecCCccchhHHHHHHHHHHhh-cCHhHHHHHHHHHHhhcC-eE Confidence 1111 011111111122221 12233323221 0 111122224444433 466665555444444444 56 Q ss_pred EEEEeeccccCCCccccchhhhhHHHH-HHHHhhcccccceeeeee-ec--C-CCE--EEEEeccceeeeccCCccceee Q lcl|NC_011222. 140 VLVVDMPEVQVGEKPEPYFFWLPIANV-LSYRTCGKDCNLMAYIMY-VT--D-ENK--IVYIDEERYVRFDKTRENDLIL 212 (577) Q Consensus 140 viVVDm~~i~~~~rpqpyf~~~pie~V-~~y~~~~~~~~~i~~i~~-~q--D-~N~--i~~IDd~~y~~y~kn~k~ei~~ 212 (577) +++|- .+.++++. +-.++-+++ ..|.... ..+.+.++-+ .. + ++. +-+.+++...+|. +.+.+.. T Consensus 155 ~~~v~---~dedg~~~--i~~~~p~~~~~vydd~~-~~~~~~~ir~~~~~~~~~~~~~~~iyt~~~i~~~~--~~~~~~~ 226 (502) T protein:vir:48 155 YEVIY---RSEYDETR--IKRLSPLETFVIYDNSL-EDNSIAAVRYYNRGTLQNAKDVVEIYTNQHIYTLD--ASDSFNE 226 (502) T ss_pred EEEEE---eCCCCceE--EEEEcccceEEEEcCCC-CCceEEEEEEEEEeecCCcEEEEEEEeCCeEEEEE--eCCceee Confidence 66664 34445432 222222222 2233211 2345555554 21 1 221 3344555555554 3344554 Q ss_pred ehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccc Q lcl|NC_011222. 213 EVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERC 292 (577) Q Consensus 213 e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C 292 (577) ....-|.||.||.--|+-.. .-.|-++.+++..|.|-...+....-=.+.+-|++..... ... T Consensus 227 ~~~~~~~~g~vPvv~~~nn~-------~g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~----------~~~ 289 (502) T protein:vir:48 227 ISVTPHAFGTVPITEFLNNA-------DGIGDYETELYLIDLYDSAESDTANHMSDMADAILAIYGD----------LAL 289 (502) T ss_pred ccceecCCCccceEEecCCC-------CCCCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecC----------ccc Confidence 44555999999985554322 3445555555444444333333333233556776654221 001 Q ss_pred cCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHH Q lcl|NC_011222. 293 DDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDEL 372 (577) Q Consensus 293 ~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i 372 (577) +++ ..+. ......++-++.|.......+ .-.+++++.+... +-....++++...| T Consensus 290 ~~~------------~~~~-----------~~~~~~~~~~~~~~~~~~~~~-~~d~~~l~~~~~~-~~~~~~~~~L~~~I 344 (502) T protein:vir:48 290 PQG------------MQAS-----------DMKRTRLMQLKPPKSADGKEG-TVKAEYLTKSYDV-SGAEAYKTRLNKDI 344 (502) T ss_pred ccc------------cchh-----------hhhhcceeecccccccccccc-CcceeEeeecCCH-HHHHHHHHHHHHHH Confidence 111 0000 011112233333322211111 1235666655432 23344467777777 Q ss_pred HhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh--cCc-ccccccccCCcccccCCH Q lcl|NC_011222. 373 VRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR--YGD-SFVSCNINYGTEFYIYTP 448 (577) Q Consensus 373 ~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR--yg~-~~~~~ti~yGskFy~~t~ 448 (577) +...-... .+.+..-..++++.+.-.-.+.+++.+..+-|++.-.-+...++++- .|. ..++. ..-.-.|-..-| T Consensus 345 ~~~s~~p~~~~~~~~~n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~d~-~~i~i~f~~~~p 423 (502) T protein:vir:48 345 HVFTNTPDMSDNHFSGNASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGSLVNEFKDFDE-SRLKITFTPNLP 423 (502) T ss_pred HHHhCCCCcCccccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-ccceEEeCCCCC Confidence 75432111 11122335688888888889999999988999988888888777652 121 11211 001124555556 Q ss_pred HHHHHHHHHHHH-cC-CCHHHHHHHHHHHHHHhhcCCHH-HHHHHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeee Q lcl|NC_011222. 449 EELSERYKIMKE-TG-ASEAELDALRQQIIETEYRNDPT-QMQRLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKL 525 (577) Q Consensus 449 eeL~~~i~~Ak~-~G-as~~~i~~L~~qi~e~EyrNnP~-qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~ 525 (577) ..++++.+++.+ .| +|...+..+- -+=.||. +|+|++.=+.=...... ....+=.++- T Consensus 424 ~d~~e~a~~~~kl~g~iS~et~l~~l------~~v~D~~~E~~ri~~E~~~~~~~~~-------------~~~~~~~~~~ 484 (502) T protein:vir:48 424 KSLYEQVSILNDLGGQVSQETALSLS------GLVENPTEELDKINEESSKIDFKGY-------------PSYFYDNVGK 484 (502) T ss_pred cCHHHHHHHHHHHhccCcHHHHHHhC------CCCCCHHHHHHHHHHHHHhhhhhcc-------------cccccccccc Confidence 666666665443 34 3432221111 1223443 36665433221000000 0000001111 Q ss_pred cchhhhhhhhccCCchhh Q lcl|NC_011222. 526 NLPTFVRRFERENMNIIE 543 (577) Q Consensus 526 n~~~fv~rfe~en~~i~e 543 (577) +-+.=..-=++|--+++| T Consensus 485 ~~d~~~e~~~~~~~~~~~ 502 (502) T protein:vir:48 485 YTDEVKETHTDDFERVYE 502 (502) T ss_pred cCCCccCCCCcCcCCCCC Confidence 111111122344455666 No 34 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=62.73 E-value=0.33 Score=23.23 Aligned_cols=449 Identities=13% Similarity=0.058 Sum_probs=183.3 Q ss_pred CCcchH-HHHH-HhhCchhH-----HHHHHHHHHHHHHHHHhhhhcccccchHH---HHHHHHHHHhhcchhHHHHHHHh Q lcl|NC_011222. 1 MGKSLD-EIRE-IYRHPEGI-----SQIAKAKEHEERIAFHTRVRTSDDRNKPV---IDFLSKVKTWIAKDKYDIFLSMF 70 (577) Q Consensus 1 ~~~~~~-~~~~-~~~~~~~~-----~~~aka~~heeri~fh~~~~ta~d~~~~~---~~fl~~v~~~l~kdky~~f~~~f 70 (577) .|+..- .... +.+-+..+ ..|.+.+.+..+- ++.+.. .|| .+.+.+..+... ++...+. T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~----~~~~~~---~YY~g~~~i~~~~~~~~~-~~~~~~~--- 75 (503) T protein:vir:59 7 LGKTHTEELNEIIVESAKEIAEPDTTMIQKLIDEHNPE----PLLKGV---RYYMCENDIEKKRRTYYD-AAGQQLV--- 75 (503) T ss_pred CChhhHHhHHHhhhhhhhhccchhHHHHHHHHHhhcHH----HHHHHH---HHhccccchhhccchhcc-ccccccc--- Confidence 121110 0000 11111111 1222222222110 111100 011 111111111000 0000000 Q ss_pred hccCCCccchHHHHHHHHH----HhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeec Q lcl|NC_011222. 71 HFPVKTNGVTSEIFDKLSR----VFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMP 146 (577) Q Consensus 71 ~fpv~t~~lt~~iF~~L~k----V~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~ 146 (577) ....+-+-++--.+..+.. -+-|+...++ =.+.+..+-|+.|+. +++.+..+.-+=++. .-=.+++.|. T Consensus 76 ~~~~~~~ri~~n~~~~ivd~~~~yl~g~~~~~~--~~d~~~~~~l~~~~~--n~~~~~~~~~~~~~~-~~G~~~~~v~-- 148 (503) T protein:vir:59 76 DDTKTNNRTSHAWHKLFVDQKTQYLVGEPVTFT--SDNKTLLEYVNELAD--DDFDDILNETVKNMS-NKGIEYWHPF-- 148 (503) T ss_pred ccccccceeecchHHHHHHHHHhhhhcCCeeec--cCcHHHHHHHHHHHh--cCHHHHHHHHHHHHh-hCCeEEEEEe-- Confidence 0011112222222333322 2223443333 234455445555553 456665555433333 3444667776 Q ss_pred cccCCCccccchhhhhHHHHHHHHhhcccccceeeeee-e--cCCC----EEEEEeccceeeeccCCccc---------- Q lcl|NC_011222. 147 EVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY-V--TDEN----KIVYIDEERYVRFDKTREND---------- 209 (577) Q Consensus 147 ~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~-~--qD~N----~i~~IDd~~y~~y~kn~k~e---------- 209 (577) ++.++++ -+-..+-++++.......+.+.+.++-+ . .+++ ++-+-+++.+..|...+.+. T Consensus 149 -~d~dg~~--~i~~~~p~~~~~i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~ 225 (503) T protein:vir:59 149 -VDEEGEF--DYVIFPAEEMIVVYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENN 225 (503) T ss_pred -ecCCCce--EEEEEccceeEEEEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccc Confidence 4445543 3433344444433332233455555543 1 1222 35466777777776654221 Q ss_pred ---eeeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCC Q lcl|NC_011222. 210 ---LILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESH 286 (577) Q Consensus 210 ---i~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~ 286 (577) ...+....|.||.||---|.... .-.|=+..+.+..|.|=...+...+-=.+++-|+...... T Consensus 226 ~~~~~~~~~~~~~~~~vPiv~~~nn~-------~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~------- 291 (503) T protein:vir:59 226 PRPHMTKGGQAIGWGRVPIIPFKNNE-------EMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNY------- 291 (503) T ss_pred cccceeecceeccCCccceEEecCCC-------CCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecC------- Confidence 11233455999999985553322 3445455555444444433344444335677777664211 Q ss_pred CCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHH Q lcl|NC_011222. 287 DGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEK 366 (577) Q Consensus 287 ~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~k 366 (577) .|.. +..+.. ....+.++.+ | +..| |++++.++.. +-.....+ T Consensus 292 ~~~~--~~~~~~------------------------~~~~~~~~~~--~----~~~~----~~~l~~~~~~-~~~~~~~~ 334 (503) T protein:vir:59 292 DGEN--PKEFTA------------------------NLRYHSVIKV--S----GDGG----VDTLRAEIPV-DSAAKELE 334 (503) T ss_pred Cccc--cchhhh------------------------hhhcccceec--c----CCCc----ceeEeccCCH-HHHHHHHH Confidence 1111 000000 1112223332 2 2233 4455554432 33345566 Q ss_pred HHHHHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHh---hcCcccccccccCCcc Q lcl|NC_011222. 367 RLRDELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLL---RYGDSFVSCNINYGTE 442 (577) Q Consensus 367 ri~d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~l---Ryg~~~~~~ti~yGsk 442 (577) ++.+.|+..+-+.+ ......-..|++++++....+..++.+..+.|+++-+-++.+++++ ..+..+.+. ..-.-. T Consensus 335 ~l~~~i~~~s~~p~~~~~~~~~~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~-~~i~i~ 413 (503) T protein:vir:59 335 RIQDELYKSAQAVDNSPETIGGGATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPD-KELTMT 413 (503) T ss_pred HHHHHHHHHhcccCCCcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc-cceeEE Confidence 78888888775532 1223345678899999999999999999999998888888877664 222221111 011223 Q ss_pred cccCCHHHHHHHHHHH---HHcCC-CHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHH--hhCCCcCccHHHHHHHHhcCC Q lcl|NC_011222. 443 FYIYTPEELSERYKIM---KETGA-SEAELDALRQQIIETEYRNDP-TQMQRLLILN--EIEPYSHLTREEAVNLYKENV 515 (577) Q Consensus 443 Fy~~t~eeL~~~i~~A---k~~Ga-s~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~--~leP~~~LT~~Ev~~l~e~g~ 515 (577) |...-|..++++..++ ..+|+ |...+..+ .- +=-|| .+|+|++-=+ ..+....++-. +.|. T Consensus 414 f~~~~p~d~~~~~~~~~kl~~~GiiS~et~l~~-l~-----~v~d~~~E~~ri~~E~~~~~~~~~~~~~~------~~~~ 481 (503) T protein:vir:59 414 FTRTRIQNDSEIVQSLVQGVTGGIMSKETAVAR-NP-----FVQDPEEELARIEEEMNQYAEMQGNLLDD------EGGD 481 (503) T ss_pred eCCCCCCCHHHHHHHHHHHHhCCCCchHHHHHh-CC-----CCCCHHHHHHHHHHHHHHHHhhhccccCc------cCCC Confidence 5555555555555544 44675 65443322 11 11244 4455543211 01111111000 0000 Q ss_pred Cchhheeeeecchhhhhhhhcc---CCchh Q lcl|NC_011222. 516 ISEEDLRVKLNLPTFVRRFERE---NMNII 542 (577) Q Consensus 516 ~~eEdl~vk~n~~~fv~rfe~e---n~~i~ 542 (577) -+++. |=++ --+.+ +|.+. T Consensus 482 ~~~~~-----~~~~---~~~~~~~~~g~~~ 503 (503) T protein:vir:59 482 DDLEE-----DDPN---AGAAESGGAGQVS 503 (503) T ss_pred CCCCc-----CCCC---CCcccCCCCCCcC Confidence 00000 0000 00111 11222 No 35 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=62.49 E-value=0.33 Score=23.19 Aligned_cols=440 Identities=10% Similarity=0.084 Sum_probs=160.8 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhh--cchhHHHHHH---Hh-hccC Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWI--AKDKYDIFLS---MF-HFPV 74 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l--~kdky~~f~~---~f-~fpv 74 (577) .++=...||+..|-- |..+-.|++-+.-+|..|+. ..++|++|. =+.|+..+.. .- .=|. T Consensus 2 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-------------~~~~i~~~~~yy~g~~~~~~~~~~~~~~~~~ 67 (496) T protein:vir:38 2 INQIIAGVKGVMRRM-GLLKALKDVKDHKKVNANDE-------------DYKYIDMWKRLYQGHYAEWHNLNYEHNGNPV 67 (496) T ss_pred hhHHHHHHHHHHHHh-ccchhhHHHHhcCCCcCCHH-------------HHHHHHHHHHHhcCCCchhhcchhccCCCcc Confidence 233333344433321 11111111111112222111 111111110 0001111000 00 0111 Q ss_pred CC----ccchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccC Q lcl|NC_011222. 75 KT----NGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQV 150 (577) Q Consensus 75 ~t----~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~ 150 (577) .+ ..+.+-|=+.+...+=|+-+.. ...+++..+-|+.+.+ ..+|.+.++.-+-++...- .+++-| -+++ T Consensus 68 ~~~~~~~n~~k~i~~~~a~~l~~~p~~i--~~~d~~~~e~l~~~~~-~n~f~~~~~~~~~~a~~~G-~~~~~~---~~D~ 140 (496) T protein:vir:38 68 NRRQLSMNLPKVTAKYMSKLLFNEKVKI--NIDDKAAEEFVLNVLK-TNGFTKNMERYIEYGEAMG-GFVIKV---YHDG 140 (496) T ss_pred ccceeecchHHHHHHHHhhhhhCCcceE--eeCChHHHHHHHHHHh-ccCHHHHHHHHHHHHhhhC-cEEEEE---EEcC Confidence 12 2344555555555554544432 3344444443343331 2344444444444444443 233322 2455 Q ss_pred CCccccchhhhhHHHHHHHHhhcccccceeeee-eecCCCEEE------EEeccceee---eccCCccceeeeh------ Q lcl|NC_011222. 151 GEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIM-YVTDENKIV------YIDEERYVR---FDKTRENDLILEV------ 214 (577) Q Consensus 151 ~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~-~~qD~N~i~------~IDd~~y~~---y~kn~k~ei~~e~------ 214 (577) +++|.--+ .|-++++-.....++...+.|+- ++.++.... |.|..++++ |+.+..+++--++ T Consensus 141 ~~~~~i~~--v~~~~~~P~~~~~~~~~~~~f~~~~~~~~~~y~~le~h~~~~~~~~I~~~~y~~~~~~~~g~~v~~~~~~ 218 (496) T protein:vir:38 141 NKNVKVSF--ATADCMYPLSNDSENVDECVIANSFHKNNKYYTLLEWNEWQGDVYTVTTELYQSDDPNELGTKVSLTLLF 218 (496) T ss_pred CCcEEEEE--EcccceEEEEecCCcEEEEEEEEEEEeCCeEEEEEEEEEEeCceEEEEEEEEecCCccccCccccccccc Confidence 66543322 24444442222224444455553 345544332 223344333 3333323221111 Q ss_pred ------hhhhhhcccceeeEeccccccc----CcceeeccchhHH---HHHHHHHHHHHHHHHHHhhcC-Cceeeccccc Q lcl|NC_011222. 215 ------DNMHDLGYCPARFFWSDSISLS----EPDIKISPITSEL---DSFDWYLYYSTAKKHLDLYAS-YPIYSGYERD 280 (577) Q Consensus 215 ------~~~H~lGy~PA~~~~gD~~~~s----kp~ik~SpL~~~L---~a~D~ll~~~ts~~hldlyA~-YPiYs~y~~D 280 (577) ...+.++-.|- +.+--++.++ .| +=.|-+++++ +++|+.++.-.. ++..+ ..++- +.+ T Consensus 219 ~~~~~~~~~~~~~~~~f-~~~~~~~~N~~~~~~p-~G~Sd~~~~~~lid~ld~~~s~~~~----~~~~~~~~i~v--~~~ 290 (496) T protein:vir:38 219 DDIEPVVPLPDFTRPTF-IYIKPNIANNKNLTSP-LGISVYANALDTLKTLDLMFDSYYQ----EFKLGKKKVLV--PSS 290 (496) T ss_pred cccccceeecCCCcceE-EEecCCcccccccCCc-CCCchHhhHHHHHHHHHHHHHHHHH----HHhhcccceec--chH Confidence 11111211111 1112122221 12 2345566666 555555554433 23333 33332 222 Q ss_pred CCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHH Q lcl|NC_011222. 281 CHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEY 360 (577) Q Consensus 281 C~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY 360 (577) +..-.+.++ |..... +..+-..+..+.. ...+. .+.++..++++-+ +. T Consensus 291 ~l~~~~~~~-----------------g~~~~~---------~~~~~~~~~~~~~--~~~~~---~~~i~~~~~~i~~-e~ 338 (496) T protein:vir:38 291 FVKTAVNLD-----------------GSTTQY---------FDSTDEAFFLYQG--DQDDN---GKAIKDISVEIRS-TE 338 (496) T ss_pred HhhccCCCC-----------------CccccC---------CCCccceEEEeec--CCCcc---cccceeeccccCH-HH Confidence 211111110 000000 0001111222221 11222 2357788888765 55 Q ss_pred HhhHHHHHHHHHHhhhhcCC-----CccccchhhcccceeccHHHHHHHHHH----HhhhHHHHHHHHHHHHHHh-hcC- Q lcl|NC_011222. 361 NVNEEKRLRDELVRSVTGGE-----GELNRSEAINEKQVKAGFESLTTKLNR----IKRGFEEAQTFVDSTICLL-RYG- 429 (577) Q Consensus 361 ~~~~~kri~d~i~~s~~Gf~-----~d~q~~kA~ne~~V~s~~ds~~~~l~~----ikknfe~~k~Fvldti~~l-Ryg- 429 (577) +....+.+.+.|+..| |++ .+.+. +.|.+.+.+-...+.+.+.+ +++.|.++=.+++.+..++ +++ T Consensus 339 ~~~~l~~~l~~i~~~~-g~~~~~f~~~~~g--~~tAtei~~~~~~l~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~g 415 (496) T protein:vir:38 339 FIESINAMLRIYAMQV-GLSAGTFTFDENG--LKTATEVVSEKSETYQTKNSHSQLIEQGIKEMIVSILEVGKFIEAYSG 415 (496) T ss_pred HHHHHHHHHHHHHHhh-CCChhhcCCCccc--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 5666777777776555 644 33322 33445555555555554444 4455555655666655432 232 Q ss_pred cccccccccCCcccccC---CHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCHHHH-HHHHHHHhhCCCcCccHH Q lcl|NC_011222. 430 DSFVSCNINYGTEFYIY---TPEELSERYKIMKETGASEAELDALRQQIIETEYRNDPTQM-QRLLILNEIEPYSHLTRE 505 (577) Q Consensus 430 ~~~~~~ti~yGskFy~~---t~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP~qm-qr~~vL~~leP~~~LT~~ 505 (577) .+.-..+|. -.|... +.++....+..+..+|+-+-+ .++...+-=++++. +..+-++.=.+ ..+ T Consensus 416 ~~~~~~~i~--v~f~d~i~~d~~~~~~~~~~~~~~GiiS~e------t~l~~~~~~~d~ea~~el~ri~~E~~-~~~--- 483 (496) T protein:vir:38 416 EVVELDTIT--VDFDDSIAQDEDTTINRYTNAKNQGMIPLK------IALQRAWNITEAEADEWAEMLAKEKQ-AEM--- 483 (496) T ss_pred CCCCccceE--EEeCCCCCCCHHHHHHHHHHHHhcCCCCHH------HHHHhcCCCChHHHHHHHHHHHHhhh-ccC--- Confidence 222222221 234443 444555555556667985432 23334433233332 22222221000 000 Q ss_pred HHHHHHhcCCCchhh Q lcl|NC_011222. 506 EAVNLYKENVISEED 520 (577) Q Consensus 506 Ev~~l~e~g~~~eEd 520 (577) +..++ .|+.+||+ T Consensus 484 ~~~d~--~~~~~~~e 496 (496) T protein:vir:38 484 PNNDM--NGIFGEEE 496 (496) T ss_pred ccccc--cCCCCCCC Confidence 00011 25666666 No 36 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=61.12 E-value=0.36 Score=23.02 Aligned_cols=435 Identities=12% Similarity=0.073 Sum_probs=183.1 Q ss_pred CCcch---------HHHHHHh--hCchhHHHHHHHHH-HHHHHHHHhhhhcccccchHHH---HHHHHHHHhhcchhHHH Q lcl|NC_011222. 1 MGKSL---------DEIREIY--RHPEGISQIAKAKE-HEERIAFHTRVRTSDDRNKPVI---DFLSKVKTWIAKDKYDI 65 (577) Q Consensus 1 ~~~~~---------~~~~~~~--~~~~~~~~~aka~~-heeri~fh~~~~ta~d~~~~~~---~fl~~v~~~l~kdky~~ 65 (577) |=+-| .|+-+++ ++-.-...|.+.+. |.+|+. ++.+.. .||. +.|.+.+...++-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~---~~~~~~---~YY~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:97 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLD---KITVGQ---RYYDKDNDIVKQMKKVDVHGNIDY 74 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHH---HHHHHH---HHhccccchhcccchhcccccccc Confidence 11111 1111111 11111122333322 333331 111100 0111 11222111100000000 Q ss_pred HHHHhhccCCCcc----chHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEE Q lcl|NC_011222. 66 FLSMFHFPVKTNG----VTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVL 141 (577) Q Consensus 66 f~~~f~fpv~t~~----lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~Ngvi 141 (577) . -+.|- +.+.|=+.+..-+=|+.. +|+=.+++..+-|..|.. +++.+....-+=++.... .+++ T Consensus 75 ~-------~~~~ki~~n~~k~Ivd~~~~~l~g~p~--~~~~~d~~~~~~l~~~~~--n~~~~~~~e~~~~~~~~G-~~~~ 142 (474) T protein:vir:97 75 D-------KPDWRITTNFHQNLVDQKVSYVASKPV--TYSCEDENVLKVIHDVLD--TRWDNKLIDILTATSNKG-IDWL 142 (474) T ss_pred c-------cCcceeecchHHHHHHHHHhhhhcCCc--eeccCcHHHHHHHHHHHh--ccHHHHHHHHHHHHhhcC-ceEE Confidence 0 01111 222222222222223333 233345556666666653 355554444433333333 4666 Q ss_pred EEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCccceee------- Q lcl|NC_011222. 142 VVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTRENDLIL------- 212 (577) Q Consensus 142 VVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~ei~~------- 212 (577) +|. ++.+++|. +-+.+-.+++..--.......+.++-+ .++..++.+.+++.+.+|..++.+ +.. T Consensus 143 ~~~---~d~~~~~~--i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~-~~~~~~~~~~ 216 (474) T protein:vir:97 143 QVY---INENGEMK--LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGG-LIPDYYYGAN 216 (474) T ss_pred EEE---ecCCCeeE--EEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCc-cccccccCcC Confidence 665 45566543 322333444332221222344555554 344445666777777677655422 111 Q ss_pred --eh-hhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCC Q lcl|NC_011222. 213 --EV-DNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGK 289 (577) Q Consensus 213 --e~-~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~ 289 (577) +. ...|.||.||.--|--.. .-.|=+..+++..|.|=...+....-=.+++-|++.... ..|. T Consensus 217 ~~~~~~~~~~~g~vPvv~~~nn~-------~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g-------~~~~ 282 (474) T protein:vir:97 217 HVQSHFSNGNWGRVPFIAFKNNP-------EEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG-------YEGE 282 (474) T ss_pred cccccccccCCCccceEEecCCc-------CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-------CCcc Confidence 11 223999999984442222 334445555544444433333333322355666665321 1111 Q ss_pred ccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHH Q lcl|NC_011222. 290 ERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLR 369 (577) Q Consensus 290 ~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~ 369 (577) .. .++..+ +..+.++.++ ++.| |++++.++ ..+-....++++. T Consensus 283 ~~--~~~~~~------------------------~~~~~~i~~~------~~~~----~~~l~~~~-~~~~~~~~~~~l~ 325 (474) T protein:vir:97 283 DL--EEFMRG------------------------LKYYKAINVD------GDGG----VETIQVEV-PVSSTKEYIDLMR 325 (474) T ss_pred cc--hhhhhh------------------------hhccceeecc------CCCc----eeEEeecC-CHHHHHHHHHHHH Confidence 10 011110 1123333321 1222 44444433 2244445567788 Q ss_pred HHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccc-cccCCcccccCC Q lcl|NC_011222. 370 DELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSC-NINYGTEFYIYT 447 (577) Q Consensus 370 d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~-ti~yGskFy~~t 447 (577) +.|+...-+.. .+.+.+-+.++++.+..+-.+..++.+..+.|+++-+-+..+++++. |-. ++. +|. -.|.... T Consensus 326 ~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~-~~~-~d~~~i~--v~f~~~~ 401 (474) T protein:vir:97 326 VYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFN-NLK-TDVKDIE--ISFNFNR 401 (474) T ss_pred HHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCC-cccceee--EEeccCc Confidence 88887653322 12223346788899999999999999999999999998888888762 322 121 222 2467777 Q ss_pred HHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHH-----HhhCCCcCccHHHHHHHHhcCCCchhhe Q lcl|NC_011222. 448 PEELSERYKIMKETGASEAELDALRQQIIETEYRNDP-TQMQRLLIL-----NEIEPYSHLTREEAVNLYKENVISEEDL 521 (577) Q Consensus 448 ~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL-----~~leP~~~LT~~Ev~~l~e~g~~~eEdl 521 (577) |..+++..++|.+.|+-+-+-.. .++ -+=.|| .+|+|++.= +.+.++.+--...-.+ T Consensus 402 p~~~~e~a~~~~~~g~iS~et~l--~~l---~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~------------ 464 (474) T protein:vir:97 402 MMNDAEQSQIIAQSQYLSRETLV--KSS---PLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQ------------ 464 (474) T ss_pred ccCHHHHHHHHHHcCCCCHHHHH--HhC---CCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCccc------------ Confidence 88888888888888753322111 111 011233 234443321 1111111110000000 Q ss_pred eeeecchhhhhhhhccCCchhh Q lcl|NC_011222. 522 RVKLNLPTFVRRFERENMNIIE 543 (577) Q Consensus 522 ~vk~n~~~fv~rfe~en~~i~e 543 (577) -|..+..-.| T Consensus 465 ------------~~~~~~~~~e 474 (474) T protein:vir:97 465 ------------QEGSNNKESE 474 (474) T ss_pred ------------CCCCcccccC Confidence 0111111111 No 37 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=61.12 E-value=0.36 Score=23.02 Aligned_cols=435 Identities=12% Similarity=0.073 Sum_probs=183.1 Q ss_pred CCcch---------HHHHHHh--hCchhHHHHHHHHH-HHHHHHHHhhhhcccccchHHH---HHHHHHHHhhcchhHHH Q lcl|NC_011222. 1 MGKSL---------DEIREIY--RHPEGISQIAKAKE-HEERIAFHTRVRTSDDRNKPVI---DFLSKVKTWIAKDKYDI 65 (577) Q Consensus 1 ~~~~~---------~~~~~~~--~~~~~~~~~aka~~-heeri~fh~~~~ta~d~~~~~~---~fl~~v~~~l~kdky~~ 65 (577) |=+-| .|+-+++ ++-.-...|.+.+. |.+|+. ++.+.. .||. +.|.+.+...++-.... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~i~~~~~~~~---~~~~~~---~YY~g~~~i~~~~~~~~~~~~~~~ 74 (474) T protein:vir:94 1 MFNIIRMPWDKPYGEEVVEQLKPQFETQEEMIVRLIDDHRKQLD---KITVGQ---RYYDKDNDIVKQMKKVDVHGNIDY 74 (474) T ss_pred CcccccccCCCchhhHHHHhhhhcccCHHHHHHHHHHHHHHHHH---HHHHHH---HHhccccchhcccchhcccccccc Confidence 11111 1111111 11111122333322 333331 111100 0111 11222111100000000 Q ss_pred HHHHhhccCCCcc----chHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEE Q lcl|NC_011222. 66 FLSMFHFPVKTNG----VTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVL 141 (577) Q Consensus 66 f~~~f~fpv~t~~----lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~Ngvi 141 (577) . -+.|- +.+.|=+.+..-+=|+.. +|+=.+++..+-|..|.. +++.+....-+=++.... .+++ T Consensus 75 ~-------~~~~ki~~n~~k~Ivd~~~~~l~g~p~--~~~~~d~~~~~~l~~~~~--n~~~~~~~e~~~~~~~~G-~~~~ 142 (474) T protein:vir:94 75 D-------KPDWRITTNFHQNLVDQKVSYVASKPV--TYSCEDENVLKVIHDVLD--TRWDNKLIDILTATSNKG-IDWL 142 (474) T ss_pred c-------cCcceeecchHHHHHHHHHhhhhcCCc--eeccCcHHHHHHHHHHHh--ccHHHHHHHHHHHHhhcC-ceEE Confidence 0 01111 222222222222223333 233345556666666653 355554444433333333 4666 Q ss_pred EEeeccccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCccceee------- Q lcl|NC_011222. 142 VVDMPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTRENDLIL------- 212 (577) Q Consensus 142 VVDm~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~ei~~------- 212 (577) +|. ++.+++|. +-+.+-.+++..--.......+.++-+ .++..++.+.+++.+.+|..++.+ +.. T Consensus 143 ~~~---~d~~~~~~--i~~~~p~~~~~v~d~~~~~~~~~~ir~~~~~~~~~~~~yt~~~~~~y~~~~~~-~~~~~~~~~~ 216 (474) T protein:vir:94 143 QVY---INENGEMK--LFRVPAEQAIPIWVDKEREELKSFIRYYKFNNEEKVEFWTDTTVTYYVLENGG-LIPDYYYGAN 216 (474) T ss_pred EEE---ecCCCeeE--EEEEcccceEEEEcCCCCCceEEEEEEEEecCeEEEEEEeCCeEEEEEEcCCc-cccccccCcC Confidence 665 45566543 322333444332221222344555554 344445666777777677655422 111 Q ss_pred --eh-hhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCC Q lcl|NC_011222. 213 --EV-DNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGK 289 (577) Q Consensus 213 --e~-~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~ 289 (577) +. ...|.||.||.--|--.. .-.|=+..+++..|.|=...+....-=.+++-|++.... ..|. T Consensus 217 ~~~~~~~~~~~g~vPvv~~~nn~-------~g~sd~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g-------~~~~ 282 (474) T protein:vir:94 217 HVQSHFSNGNWGRVPFIAFKNNP-------EEVSDIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKG-------YEGE 282 (474) T ss_pred cccccccccCCCccceEEecCCc-------CCCCcHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeec-------CCcc Confidence 11 223999999984442222 334445555544444433333333322355666665321 1111 Q ss_pred ccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHH Q lcl|NC_011222. 290 ERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLR 369 (577) Q Consensus 290 ~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~ 369 (577) .. .++..+ +..+.++.++ ++.| |++++.++ ..+-....++++. T Consensus 283 ~~--~~~~~~------------------------~~~~~~i~~~------~~~~----~~~l~~~~-~~~~~~~~~~~l~ 325 (474) T protein:vir:94 283 DL--EEFMRG------------------------LKYYKAINVD------GDGG----VETIQVEV-PVSSTKEYIDLMR 325 (474) T ss_pred cc--hhhhhh------------------------hhccceeecc------CCCc----eeEEeecC-CHHHHHHHHHHHH Confidence 10 011110 1123333321 1222 44444433 2244445567788 Q ss_pred HHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccc-cccCCcccccCC Q lcl|NC_011222. 370 DELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSC-NINYGTEFYIYT 447 (577) Q Consensus 370 d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~-ti~yGskFy~~t 447 (577) +.|+...-+.. .+.+.+-+.++++.+..+-.+..++.+..+.|+++-+-+..+++++. |-. ++. +|. -.|.... T Consensus 326 ~~I~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~-~~~-~d~~~i~--v~f~~~~ 401 (474) T protein:vir:94 326 VYIMEFGQGVDFQTDKFGSAPSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFN-NLK-TDVKDIE--ISFNFNR 401 (474) T ss_pred HHHHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCC-cccceee--EEeccCc Confidence 88887653322 12223346788899999999999999999999999998888888762 322 121 222 2467777 Q ss_pred HHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHH-----HhhCCCcCccHHHHHHHHhcCCCchhhe Q lcl|NC_011222. 448 PEELSERYKIMKETGASEAELDALRQQIIETEYRNDP-TQMQRLLIL-----NEIEPYSHLTREEAVNLYKENVISEEDL 521 (577) Q Consensus 448 ~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL-----~~leP~~~LT~~Ev~~l~e~g~~~eEdl 521 (577) |..+++..++|.+.|+-+-+-.. .++ -+=.|| .+|+|++.= +.+.++.+--...-.+ T Consensus 402 p~~~~e~a~~~~~~g~iS~et~l--~~l---~~v~D~~~E~eri~~E~~~~~~~~~~~~~~~~~~~~~------------ 464 (474) T protein:vir:94 402 MMNDAEQSQIIAQSQYLSRETLV--KSS---PLVDDYKAELERIEQEQMEYNKQLPNLDDGGADGAQQ------------ 464 (474) T ss_pred ccCHHHHHHHHHHcCCCCHHHHH--HhC---CCCCCHHHHHHHHHHHHHHHHhhccccCCCCCCCccc------------ Confidence 88888888888888753322111 111 011233 234443321 1111111110000000 Q ss_pred eeeecchhhhhhhhccCCchhh Q lcl|NC_011222. 522 RVKLNLPTFVRRFERENMNIIE 543 (577) Q Consensus 522 ~vk~n~~~fv~rfe~en~~i~e 543 (577) -|..+..-.| T Consensus 465 ------------~~~~~~~~~e 474 (474) T protein:vir:94 465 ------------QEGSNNKESE 474 (474) T ss_pred ------------CCCCcccccC Confidence 0111111111 No 38 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=54.31 E-value=0.51 Score=22.20 Aligned_cols=432 Identities=12% Similarity=0.054 Sum_probs=176.7 Q ss_pred HHHHhh--CchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcc-hhHHHHHHH--------h------ Q lcl|NC_011222. 8 IREIYR--HPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAK-DKYDIFLSM--------F------ 70 (577) Q Consensus 8 ~~~~~~--~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~k-dky~~f~~~--------f------ 70 (577) .-+|.+ .+.-+..+. +||.+- ..+....+..|.+.-++-++. +|+.++..- . T Consensus 1 ~~~~~~~~~~~~~~~~~------~~~~~~-----~~~~~~~i~~~i~~~~~~~~~~~~~~~Yy~g~~~i~~~~~~~~~~~ 69 (474) T protein:vir:96 1 MIVIFWPNEKPYHERVV------EQIKPK-----YETQEEMIIRLINDHKPKIDDITVGERYYNHDPDVLRLAPKLDNKG 69 (474) T ss_pred CeeeccCCCchhhhhHH------HHhhhc-----cCChHHHHHHHHHHHHHHHHHHHHHHHHhccCCcchhccchhcccc Confidence 111100 000000000 111110 111111111121111100000 011111000 0 Q ss_pred -hccCC-Cccch----HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEe Q lcl|NC_011222. 71 -HFPVK-TNGVT----SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVD 144 (577) Q Consensus 71 -~fpv~-t~~lt----~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVD 144 (577) ..|-. -|-|. +-|=+...--+=|....++. .+.+..+-+..+... ++.+.-.-.+=.+... =-++++|. T Consensus 70 ~~~~~~~~~ki~~n~~~~Ivd~~~~~l~g~p~~~~~--~d~~~~~~l~~~~~n--~~~~~~~~~~~~~~~~-G~~~~~~y 144 (474) T protein:vir:96 70 EIDPLKPDWRMFTNYHQNLVDQKVAYAVANPVTFSS--DDDKSLKTIQEVLNH--KWDDKLVDILTAASNK-GIEWLQPY 144 (474) T ss_pred cccccccchhcccchHHHHHHhhhhhhcccCceeec--CchHHHHHHHHHHhc--CHHHHHHHHHHHHHhc-CeeEEEEE Confidence 01111 11122 11211111122233333332 344444444444321 3322222222223333 23566665 Q ss_pred eccccCCCccccchhhhhHHHHHHHHhhcccccceeeeee--ecCCCEEEEEeccceeeeccCCcccee----------- Q lcl|NC_011222. 145 MPEVQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIMY--VTDENKIVYIDEERYVRFDKTRENDLI----------- 211 (577) Q Consensus 145 m~~i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~~--~qD~N~i~~IDd~~y~~y~kn~k~ei~----------- 211 (577) ++.++++. +.+.-|....-.|.. ......+.++-+ .++..++-+-.+..+..|...+.+.+. T Consensus 145 ---~d~~~~~~-i~~~~p~~~~~v~d~-~~~~~~~~~vr~~~~~~~~~~~~yt~~~v~~~~~~~~~~~~~~~~~~~~~~~ 219 (474) T protein:vir:96 145 ---IDENGEFK-TFRVPAEQAIPIWTN-KERDTLKAFIRYYRLDGAERVEYWTDSDVTYYEYQDGILIPDYYHGEEHIQS 219 (474) T ss_pred ---ecCCCceE-EEEEcccceEEEEcC-CCCCceEEEEEEEeecCceEEEEEeCCeEEEEEecCCceeeccccccccccc Confidence 55566543 223233332222322 122344555544 333344444445555555544322211 Q ss_pred -eeh-hhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCC Q lcl|NC_011222. 212 -LEV-DNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGK 289 (577) Q Consensus 212 -~e~-~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~ 289 (577) .++ ..-|.+|.||..-|..... -.|=+..+.+..|.|=...+....-=.+++-|++...- ..|. T Consensus 220 ~~~~~~~~~~~g~iPvv~~~nn~~-------g~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g-------~~~~ 285 (474) T protein:vir:96 220 HYYVGNKRVSWGRVPFIPFKNNPQ-------EMSDLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKG-------YEGQ 285 (474) T ss_pred cccccccccCCCceeEEEeccCCC-------CCCcHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeec-------CCcc Confidence 112 2339999999965544333 34445555544444444444444433456677765321 1111 Q ss_pred ccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeeccc--HHHHHHHhhHHHH Q lcl|NC_011222. 290 ERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSAD--TGSLEYNVNEEKR 367 (577) Q Consensus 290 ~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~d--i~sleY~~~~~kr 367 (577) + .+|+.+ -+..+-.+.+|- ++.| |++++.+ .+.++ ...++ T Consensus 286 ~--~~~~~~------------------------~~~~~~~i~~~~-----~~~~----~~~l~~~~~~~~~~---~~~~~ 327 (474) T protein:vir:96 286 D--LDEFMR------------------------NLKYYKAINVDG-----DGSG----VDTIQIEVPVQSSK---EYLDM 327 (474) T ss_pred c--ccchhh------------------------hhhcCceEEecC-----CCCc----eeEEeecCChHHHH---HHHHH Confidence 0 001111 112233344432 3344 5666544 34444 44567 Q ss_pred HHHHHHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccC Q lcl|NC_011222. 368 LRDELVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIY 446 (577) Q Consensus 368 i~d~i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~ 446 (577) +.+.|+...-+.. .+.+.+-+.++++.+..+-.+.+++.+..+-|++.-+-++.+++++. |-.+=.-+|. -.|... T Consensus 328 l~~~i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~-~~~~~~~~i~--i~f~~~ 404 (474) T protein:vir:96 328 LRDYVIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY-KLNIKVQDVE--ITFNFN 404 (474) T ss_pred HHHHHHHHhCCccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcccceee--EEeccC Confidence 7888887663311 11223346789999999999999999999999998888888888774 3221111222 357888 Q ss_pred CHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhhcCCHH-HHHHHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeee Q lcl|NC_011222. 447 TPEELSERYKIMKETGASEAELDALRQQIIETEYRNDPT-QMQRLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKL 525 (577) Q Consensus 447 t~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~EyrNnP~-qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~ 525 (577) .|..+++.+++++.+|+-+-+-..=+.-++ .||. +|+|++ ++-+-+.. ....+........+| T Consensus 405 ~p~~~~e~~~~~~~ag~iS~et~~~~~~~v-----~d~~~E~~ri~--~E~~e~~~----~~~~~~~~~~~~~~d----- 468 (474) T protein:vir:96 405 VMVNELEQSQIGVQSQYLSKETVVTNHPWV-----DDPVAELERIE--QDNIDFNK----QLPPLEGDANGRAQD----- 468 (474) T ss_pred CCcCHHHHHHHHHhcCCCchHHHHHhCCCC-----CCHHHHHHHHH--HHHHHHHh----cccccccccccccCC----- Confidence 899999999999998843332221111122 2443 344432 22111111 111111111111111 Q ss_pred cchhhhhhhhccCCc Q lcl|NC_011222. 526 NLPTFVRRFERENMN 540 (577) Q Consensus 526 n~~~fv~rfe~en~~ 540 (577) +.+.+| T Consensus 469 ---------~~~e~~ 474 (474) T protein:vir:96 469 ---------NESETN 474 (474) T ss_pred ---------CcccCC Confidence 112222 No 39 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=37.51 E-value=1.1 Score=20.33 Aligned_cols=438 Identities=14% Similarity=0.067 Sum_probs=174.6 Q ss_pred CCcc-----hHHHHHHhhCc-hhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccC Q lcl|NC_011222. 1 MGKS-----LDEIREIYRHP-EGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPV 74 (577) Q Consensus 1 ~~~~-----~~~~~~~~~~~-~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv 74 (577) .+|+ ++.|+..++-. +=|++.. .+|..|+.-..++ ..||.- +.=+.+.+.......-..+- T Consensus 8 ~~~~~~~e~~~~~~~~~~~~~~~i~~~i--~~~~~~~~~~~~~------~~yY~g-----~~~i~~~~~~~~~~~~~~~~ 74 (478) T protein:vir:10 8 WDKPYHEQVVEQIKPKYETQEEMILRLV--REHKENIDNITMG------ERYYNH-----HPDILDAPPKRDVNGDYDET 74 (478) T ss_pred CCchhHHHHHHHHhhccCCcHHHHHHHH--HHHHHHHHHHHHH------HHHhcC-----CCchhccccccccccccccc Confidence 1221 22333322211 1111111 1222222211111 111110 00000000000001000110 Q ss_pred -CCccch----HHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeecccc Q lcl|NC_011222. 75 -KTNGVT----SEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQ 149 (577) Q Consensus 75 -~t~~lt----~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~ 149 (577) +.+-|. ..|=+.+.--+=|+...+ .-.+.+..+.+..+.. +++.+....-+=++.... -++++|. .. T Consensus 75 ~~~~ki~~n~~~~ivd~~~~~l~g~~~~~--~~~~d~~~~~l~~~~~--n~~~~~~~~~~~~~~~~G-~~~~~~~---~d 146 (478) T protein:vir:10 75 KPDWRMYTNYHQNLVDQKVAYAVANPVTF--GVDNDKALKQIQHTLN--HKWDDKLVDILTAASNKG-IEWVQPY---VD 146 (478) T ss_pred cccceeccchHHHHHHHHHhhhccCCeee--ecCChHHHHHHHHHHh--cCHHHHHHHHHHHHHhcC-eEEEEEE---ec Confidence 111222 222222212222343333 3345555544444442 355555554444444444 4666665 45 Q ss_pred CCCccccchhhhhHHHHHHHHhhcccccceeee-eee-cCCCEEEEEeccceeeeccCC-------------ccceeeeh Q lcl|NC_011222. 150 VGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYI-MYV-TDENKIVYIDEERYVRFDKTR-------------ENDLILEV 214 (577) Q Consensus 150 ~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i-~~~-qD~N~i~~IDd~~y~~y~kn~-------------k~ei~~e~ 214 (577) .++++. ..+.-|.+..-.|.-... ...+.++ .|. .+..++.+-.+..+..|...+ ...+.... T Consensus 147 ~~g~~~-~~~~~p~~~~~i~d~~~~-~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 224 (478) T protein:vir:10 147 EEGEFK-TFRVPAEQAVPIWTNKER-DELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQG 224 (478) T ss_pred CCCeeE-EEEEcccceEEEEcCCCC-CceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecc Confidence 566553 222233332222222111 1223333 232 333334444444444443322 11233334 Q ss_pred hhhhhhcccceeeEecccccccCcceeeccchhHH---HHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCcc Q lcl|NC_011222. 215 DNMHDLGYCPARFFWSDSISLSEPDIKISPITSEL---DSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKER 291 (577) Q Consensus 215 ~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L---~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~ 291 (577) ...|.+|.||..-|.-.. .-.|=++.+. +|+|+.++. ...-=.+++-|++...-. .+.. T Consensus 225 ~~~~~~~~vPvv~~~n~~-------~g~sd~~~v~~liDa~~~~~S~---~~~~~~~~~~p~~~~~g~-------~~~~- 286 (478) T protein:vir:10 225 NKLMSWGRVPFIPFKNNP-------QEVSDLFMYKTIIDALDKRLSD---TQNTFDESVELIYILKGY-------EGED- 286 (478) T ss_pred cccccCCccceEEeccCC-------CCCCcHHHHHHHHHHHHHHHHH---HHHHHHHhhCceeeeecC-------Cccc- Confidence 445999999984443333 3344445444 555554443 333223566777764221 1100 Q ss_pred ccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHH Q lcl|NC_011222. 292 CDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDE 371 (577) Q Consensus 292 C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~ 371 (577) .+ . ...-+..+.++.++- .++.| |+++++++. .+-....++++.+. T Consensus 287 --~~-----------~------------~~~~~~~~~~~~~~~----~~~~~----~~~l~~~~~-~~~~~~~~~~l~~~ 332 (478) T protein:vir:10 287 --MK-----------D------------FMHNLKYYKAISVAG----ESGSG----VDTIKVEVP-IDSVKEYTKMLRDY 332 (478) T ss_pred --cc-----------h------------hhhhhhhcceEEecC----CCCCc----ceEEeecCC-hHHHHHHHHHHHHH Confidence 00 0 000112233444432 23344 556655432 23334556667777 Q ss_pred HHhhhhcCC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhcCcccccccccCCcccccCCHHH Q lcl|NC_011222. 372 LVRSVTGGE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLRYGDSFVSCNINYGTEFYIYTPEE 450 (577) Q Consensus 372 i~~s~~Gf~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lRyg~~~~~~ti~yGskFy~~t~ee 450 (577) |+...-+.. ...+.+-+.++++.++..-.+..++.+..+.|++.-+-++.+++++. |..+=..+| --.|....|.. T Consensus 333 i~~~s~~p~~~~~~~~~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~-g~~~~~~~i--~i~f~~~~p~d 409 (478) T protein:vir:10 333 IIEFGQGVDFQQDKFGNSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFY-RLDVKVQDI--EITFNFNVMVN 409 (478) T ss_pred HHHHhCccccCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-CCCcccccc--eEEecCCCCCC Confidence 777653321 12223346788999999999999999999999999999999988875 321111122 24466666766 Q ss_pred HHHHHHHHHH-cCC-CHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeeecc Q lcl|NC_011222. 451 LSERYKIMKE-TGA-SEAELDALRQQIIETEYRNDP-TQMQRLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKLNL 527 (577) Q Consensus 451 L~~~i~~Ak~-~Ga-s~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~n~ 527 (577) ++++.+++.+ +|+ |...+.. +.- +=.|| .+|+|+. .+-+.. .+++.++ ..|..+++.- T Consensus 410 ~~e~a~~~~kl~g~iS~et~~~-~l~-----~v~D~~~E~~ri~--~E~~~~----~~~~~~~-~~~~~~~~~~------ 470 (478) T protein:vir:10 410 ELENSQIAMNSTGLLSKETILS-NHA-----WVEDPVAEMERIE--QENIEL----NQQLPDI-EEGLNGEQQR------ 470 (478) T ss_pred HHHHHHHHHHHhCCCChHHHHH-hCC-----CCCCHHHHHHHHH--HHHHHH----Hhhcccc-ccccCCCCCC------ Confidence 7666665433 232 2211110 000 11232 2333332 221111 1111111 1222222110 Q ss_pred hhhhhhhhccCCchhh Q lcl|NC_011222. 528 PTFVRRFERENMNIIE 543 (577) Q Consensus 528 ~~fv~rfe~en~~i~e 543 (577) +++|.+ -| T Consensus 471 -------~~~~~~-~~ 478 (478) T protein:vir:10 471 -------QSENNQ-PE 478 (478) T ss_pred -------CCCCCC-CC Confidence 011100 00 No 40 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=26.55 E-value=1.9 Score=19.01 Aligned_cols=438 Identities=14% Similarity=0.083 Sum_probs=167.7 Q ss_pred CCcchHHHHHHhhCchhHHHHHHHHHH-HHHHH-HHhhhhcccccchHHHHH---HHHHHHhhcchhHHHHHHHhhccCC Q lcl|NC_011222. 1 MGKSLDEIREIYRHPEGISQIAKAKEH-EERIA-FHTRVRTSDDRNKPVIDF---LSKVKTWIAKDKYDIFLSMFHFPVK 75 (577) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~aka~~h-eeri~-fh~~~~ta~d~~~~~~~f---l~~v~~~l~kdky~~f~~~f~fpv~ 75 (577) |- +.++. ..-.++ .|.+.+.| ..+.+ -..++. .||.-- |.+... .++++.. - .-+. T Consensus 31 ~~-~~e~~--~~~~~~---~i~~~i~~~~~~~~~r~~~l~------~Yy~g~~~i~~~~~~--~~~~~~~---~--~ki~ 91 (511) T protein:vir:99 31 YD-GTESD--LLQNVN---EVSKYIEHHMDYQRPRLKVLS------DYYEGKTKNLVELTR--RKEEYMA---D--NRVA 91 (511) T ss_pred cc-hhhhh--hhccHH---HHHHHHHHHHHhhHHHHHHHH------HHhcccCccccccCc--ccccccC---c--ceee Confidence 10 00110 001111 12222222 11110 000000 011100 000000 0111100 0 0000 Q ss_pred CccchHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccCCCccc Q lcl|NC_011222. 76 TNGVTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQVGEKPE 155 (577) Q Consensus 76 t~~lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~~~rpq 155 (577) ..+.+-|=+...--+-|....+ .-.+.+..+-|..+.+. +++.+..+..+=++.... .++++|- ...++++. T Consensus 92 -~n~~k~Iv~~~~~yl~g~p~~~--~~~d~~~~~~l~~~~~~-n~~~~~~~~~~~~~~i~G-~a~~~vy---~ded~~~~ 163 (511) T protein:vir:99 92 -HDYASYISDFINGYFLGNPIQY--QDDDKDVLEAIEAFNDL-NDVESHNRSLGLDLSIYG-KAYELMI---RNQDDETR 163 (511) T ss_pred -cchHHHHHHHHHhhhcccCcee--ecCchHHHHHHHHHHhh-cCHhHHHHHHHHHHHhcC-eeEEEEE---eCCCCceE Confidence 0111222222222222333322 22344444455555433 355555555444444444 5666664 34444432 Q ss_pred cchhhhhHHHHHHHHhhcccccceeeeeee--c-----CCC---EEEEEeccceeeeccCCccceee----ehhhhhhhc Q lcl|NC_011222. 156 PYFFWLPIANVLSYRTCGKDCNLMAYIMYV--T-----DEN---KIVYIDEERYVRFDKTRENDLIL----EVDNMHDLG 221 (577) Q Consensus 156 pyf~~~pie~V~~y~~~~~~~~~i~~i~~~--q-----D~N---~i~~IDd~~y~~y~kn~k~ei~~----e~~~~H~lG 221 (577) +-..+-+.++.......+.+.+.++-+- + +.+ ++-+-+++.+.+|..++-+.+.. ....-|.|| T Consensus 164 --i~~~~p~~~~~vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g 241 (511) T protein:vir:99 164 --LYKSDAMSTFVIYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFE 241 (511) T ss_pred --EEEEccceeEEEEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCC Confidence 2222333333222222223445555441 1 112 23345666666666554332221 223349999 Q ss_pred ccceeeEecccccccCcceeeccchhHH---HHHHHHHHHHHHHHHHHhhcCCceeecccccCCccCCCCCccccCccee Q lcl|NC_011222. 222 YCPARFFWSDSISLSEPDIKISPITSEL---DSFDWYLYYSTAKKHLDLYASYPIYSGYERDCHYESHDGKERCDDGFLK 298 (577) Q Consensus 222 y~PA~~~~gD~~~~skp~ik~SpL~~~L---~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~ 298 (577) .||.--|.- | + .-.|-+...+ +|+|+.++-...-.. +...+|....|+... . .+.+ T Consensus 242 ~vPvv~~~n-----n-~-~g~sd~e~v~~liDa~d~~~S~~~~~~~-~~~~~~lv~~G~~~~--------~----~~~~- 300 (511) T protein:vir:99 242 RMPITEFSN-----N-E-RRKGDYEKVITLIDLYDNAESDTANYMS-DLNDAMLLIKGNLNL--------D----PVEV- 300 (511) T ss_pred ccceEEecC-----C-C-CCCCchhhhHHHHHHHHHHHHHHHHHHH-HhhchhhhhccCccc--------C----chhh- Confidence 999955532 2 1 3445555555 555555544332222 223344444443110 0 0000 Q ss_pred cccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHhhHHHHHHHHHHhhhhc Q lcl|NC_011222. 299 NEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNVNEEKRLRDELVRSVTG 378 (577) Q Consensus 299 n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~~~~kri~d~i~~s~~G 378 (577) .+.. .++....+...+..+....+..++| +++++.+.. .+-....++++.+-|+...-. T Consensus 301 ----------~~~~------~~~~~~~~~~~~~~~~~~~~~~~~d----~~~l~~~~~-~~~~e~~~~~L~~~I~~~s~~ 359 (511) T protein:vir:99 301 ----------RKQK------EANVLFLEPTVYADSEGRETEGSVD----GGYIYKQYD-VQGTEAYKDRLNSDIHMFTNT 359 (511) T ss_pred ----------cccc------cccceecccccccccccccCCCCcc----eeEEeecCC-HHHHHHHHHHHHHHHHHHhCC Confidence 0000 0000111111111111112233444 555554432 122234466677777653311 Q ss_pred CC-CccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHhh--cCcc-----cccccccCCcccccCCHHH Q lcl|NC_011222. 379 GE-GELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLLR--YGDS-----FVSCNINYGTEFYIYTPEE 450 (577) Q Consensus 379 f~-~d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~lR--yg~~-----~~~~ti~yGskFy~~t~ee 450 (577) .. .+.+.+-..++++.+.-+-.+.+++.+..+.|++.-+-+...++++- .+.. +...++ .|....|.. T Consensus 360 P~~~~~~~~gn~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i----~f~~~~p~n 435 (511) T protein:vir:99 360 PNMKDDNFSGTQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRY----VYNRNLPKS 435 (511) T ss_pred cccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceE----EeCCCCCcC Confidence 10 11122235688888888999999999999989888888888777652 2221 122233 577777888 Q ss_pred HHHHHHHHHHc-C-CCHHHHHHHHHHHHHHhhcCCH-HHHHHHHHHHhh------CC-CcCc---cHHHHHHHHhcCCCc Q lcl|NC_011222. 451 LSERYKIMKET-G-ASEAELDALRQQIIETEYRNDP-TQMQRLLILNEI------EP-YSHL---TREEAVNLYKENVIS 517 (577) Q Consensus 451 L~~~i~~Ak~~-G-as~~~i~~L~~qi~e~EyrNnP-~qmqr~~vL~~l------eP-~~~L---T~~Ev~~l~e~g~~~ 517 (577) ++++++++.+. | +|...+..+ . -+=.|| .+|+|++-=+.- ++ +.+. .-++.- -.+..=.+ T Consensus 436 ~~e~~~~~~kl~GiiS~et~l~~-l-----~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~d 508 (511) T protein:vir:99 436 LIEELKAYIDSGGKISQTTLMSL-F-----SFFQDPELEVKKIEEDEKESIKKAQKNMYQDPRNINDDEQD-DSTKDSID 508 (511) T ss_pred HHHHHHHHHHHhccCCHHHHHHh-C-----CCCCCHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCCCCC-CCCcCccc Confidence 88887766543 5 232221111 0 122233 245554432210 00 0000 000000 00011111 Q ss_pred hhh Q lcl|NC_011222. 518 EED 520 (577) Q Consensus 518 eEd 520 (577) +|. T Consensus 509 ~~e 511 (511) T protein:vir:99 509 KKE 511 (511) T ss_pred ccC Confidence 222 No 41 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=25.81 E-value=2 Score=18.92 Aligned_cols=446 Identities=13% Similarity=0.057 Sum_probs=175.0 Q ss_pred CCcch--HH-HHHHhh-CchhHHHHHHHHHHHHHHHHHhhhhcccccchHHHHH-HHHHHHhhcchhHHHHHHHhhccCC Q lcl|NC_011222. 1 MGKSL--DE-IREIYR-HPEGISQIAKAKEHEERIAFHTRVRTSDDRNKPVIDF-LSKVKTWIAKDKYDIFLSMFHFPVK 75 (577) Q Consensus 1 ~~~~~--~~-~~~~~~-~~~~~~~~aka~~heeri~fh~~~~ta~d~~~~~~~f-l~~v~~~l~kdky~~f~~~f~fpv~ 75 (577) ||--- ++ |..-|| -|+| .-=|||.+-.++- +++...+-+.+ +.+ .|-.. |- .-.|--.. T Consensus 1 ~~~~~~~~~~i~~w~~~~~~~--------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~--~w~~~--~~---~~~~~~~~ 64 (518) T protein:vir:78 1 MGVWSVMTRFIKGWLNGKPNG--------SEPELIPKYLPLV-PDNQKEWSKDSYLTS--LWAQG--YV---PTVHDKLM 64 (518) T ss_pred CcchhhHHHHHHHhhcCCCCc--------cchhccHHHhhhc-ccchhhhhhhhhhhh--hcccC--CC---Cccccccc Confidence 65322 21 222222 1222 1124555544432 22222222221 111 01000 00 00011122 Q ss_pred CccchHHHHHHHHHHhccCCccccc---cccChhhhhhhhHHHhhcc---chhHHHHHHHHHHHHhCCCeEE--EEeecc Q lcl|NC_011222. 76 TNGVTSEIFDKLSRVFDGRNPVYNY---QFKSSEDRDDWEYYRKDVL---KEPSVWSTDGWDNFKHRINSVL--VVDMPE 147 (577) Q Consensus 76 t~~lt~~iF~~L~kV~dgqd~~~~y---~f~~~e~~~d~~~y~se~l---n~~~fw~~~~fk~~~~~~Ngvi--VVDm~~ 147 (577) +..|...|=+++....=|.-+..+. +..+.|.++. ++.+++ +|...++.-+=.+... =.+++ .+|=.. T Consensus 65 ~~~l~~~i~~~~A~ll~~e~~~i~v~~~~~~d~e~~~~---~l~~il~~n~f~~~~~~~~e~a~a~-G~~~~k~~~d~~~ 140 (518) T protein:vir:78 65 NSGTGNEIVVVAAEYISGKPLSIDVTGVNGSKDENLTK---QLKEALRIDNFDSKSVKIVELAGGS-GVSAVKINILNGR 140 (518) T ss_pred cCChHHHHHHHHHHhhcCCCceEEecCccccCcHHHHH---HHHHHHHhccHHHHHHHHHHHhhcc-CceEEEEEEECCe Confidence 3455666777777755554333222 1223343333 233444 3333333332222222 22332 233222 Q ss_pred ccCCCccccchhhhhHHHHHHHHhhcccccceeeee-eecCCC-EE---------------EEEeccceeeeccCC---c Q lcl|NC_011222. 148 VQVGEKPEPYFFWLPIANVLSYRTCGKDCNLMAYIM-YVTDEN-KI---------------VYIDEERYVRFDKTR---E 207 (577) Q Consensus 148 i~~~~rpqpyf~~~pie~V~~y~~~~~~~~~i~~i~-~~qD~N-~i---------------~~IDd~~y~~y~kn~---k 207 (577) +...--+...|| |+. .+|+.. .+.|+- .+.++. ++ .+.+..++++|.+-+ . T Consensus 141 ~~i~~v~ad~~~--P~~------~~g~~~-~~~f~~~~~~~~k~~~y~~lE~he~~~~~~~~~~~~~~~I~n~ly~~~~~ 211 (518) T protein:vir:78 141 PSISVHSSSQFW--IDF------KNNEPF-RFNFFEEIPTSNKADIYYLVESREIKQWDKEGKKLSGGFVTYSVIKIDGD 211 (518) T ss_pred eEEEEEcCCeeE--EEe------ecCcEE-EEEEEEEeecCCcceeEEEEEeeccccccceeecccceeEEEEEeeecCc Confidence 222223444455 532 234432 233332 223222 22 123334455555421 1 Q ss_pred ccee-------e------ehh------hhhhhcccceeeEec----ccccccCcceeeccchhHHHHHHHHHHHHHHHHH Q lcl|NC_011222. 208 NDLI-------L------EVD------NMHDLGYCPARFFWS----DSISLSEPDIKISPITSELDSFDWYLYYSTAKKH 264 (577) Q Consensus 208 ~ei~-------~------e~~------~~H~lGy~PA~~~~g----D~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~h 264 (577) ++.. . +.+ .++-.+-.|.-+++- .......| +=.|-++++.+.+|.|=-..++..+ T Consensus 212 ~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~~~~~~n~~~N~~~~~sp-lG~S~~~~~~~~id~lD~~~s~~~~ 290 (518) T protein:vir:78 212 KTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMGAYLINNSPSNTRYPHLN-LGESDLSQCTNYLFAVDYFFTVYMR 290 (518) T ss_pred ccccccccccccccccccccccCccceeeccCCccceEEeeccccccccccCCC-cCcchHhhhhHHHHHHHHHHHHHHH Confidence 1110 0 000 011122233222211 11111223 4567777777555555444444444 Q ss_pred HHhhcCCceeecccccCCccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCc Q lcl|NC_011222. 265 LDLYASYPIYSGYERDCHYESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDL 344 (577) Q Consensus 265 ldlyA~YPiYs~y~~DC~~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dl 344 (577) ++..+=+.+-.-+.=.. -++.|.. .+ .....+.+..+-+.++..+..+.+. T Consensus 291 -e~~~g~~~i~v~~~~l~-~~~~~~~-----------------~~----------~~~~fd~~~~~y~~i~~~~~~~~~~ 341 (518) T protein:vir:78 291 -EGEKTKTKIAASERMFR-KKVNKST-----------------DK----------EEWSMNVDEDYFMQFKGTLDAGAKL 341 (518) T ss_pred -HHHhCCceeeechhHhc-cCCCCCC-----------------Cc----------cccccCCCCceEEEecCcCCCCCcc Confidence 33334333322111111 0111110 00 0111334455555555455556666 Q ss_pred cCcceeecccHHHHHHHhhHHHHHHHHHHhhhhc-----CCCccccchhhcccceeccHHHHHHHHHHHhh----hHHHH Q lcl|NC_011222. 345 KNPITMLSADTGSLEYNVNEEKRLRDELVRSVTG-----GEGELNRSEAINEKQVKAGFESLTTKLNRIKR----GFEEA 415 (577) Q Consensus 345 r~pv~i~n~di~sleY~~~~~kri~d~i~~s~~G-----f~~d~q~~kA~ne~~V~s~~ds~~~~l~~ikk----nfe~~ 415 (577) .+.++.++|++-+=+| ......+.+.|...| | |+.+ ...+ +.+.|++...-+++.+.+..+ .+++| T Consensus 342 ~~~i~~~~~~Ir~e~~-~~~~~~~l~~~~~~~-G~s~~tfg~~-~~~~--TATei~s~~~~~~~t~~~~~~~~e~al~~l 416 (518) T protein:vir:78 342 NDMIQFMQGDFRDGSY-RETMEYFAQKAVSKS-GYNPATFNLG-NREV--KATEIWSLQDATVRKIEKKKRLIQNVYEQM 416 (518) T ss_pred ccceeeeecccChHHH-HHHHHHHHHHHHHhh-CCChhhcCcc-cccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7899999999987554 445666677776666 4 3322 2233 344455544444444444444 44444 Q ss_pred HHHHHHHHHHhhc------CcccccccccCCcccccCCHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhh-cCCHHHHH Q lcl|NC_011222. 416 QTFVDSTICLLRY------GDSFVSCNINYGTEFYIYTPEELSERYKIMKETGASEAELDALRQQIIETEY-RNDPTQMQ 488 (577) Q Consensus 416 k~Fvldti~~lRy------g~~~~~~ti~yGskFy~~t~eeL~~~i~~Ak~~Gas~~~i~~L~~qi~e~Ey-rNnP~qmq 488 (577) -..++.+...+-- +.+.++-+|+.+- --..+.++..++.+.+.++|+-+-+. .+...+ -=+++|.+ T Consensus 417 ~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D-~i~~D~~~~~~~~~~~v~aGimS~e~------~i~~~~~~~~deea~ 489 (518) T protein:vir:78 417 LWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPD-PMSVNLNELSSTLNNMNSALAMSVEE------KVKLIHPKWEDEEIQ 489 (518) T ss_pred HHHHHHHHHhhcCccccccCCCceeEEEEeCC-CCCCCHHHHHHHHHHHHhcCCCCHHH------HHHHhCCCCCHHHHH Confidence 4444443332210 0112334555443 34566777778888888899744221 111111 11222221 Q ss_pred HHHHHHhhCCCcCccHHHHHHHHhcCCCchhheeeeecchhhhhhhhccCC Q lcl|NC_011222. 489 RLLILNEIEPYSHLTREEAVNLYKENVISEEDLRVKLNLPTFVRRFERENM 539 (577) Q Consensus 489 r~~vL~~leP~~~LT~~Ev~~l~e~g~~~eEdl~vk~n~~~fv~rfe~en~ 539 (577) +||..++++.-..+. .-|.=+.-|+.+.| T Consensus 490 ----------------~e~~ri~~E~~~~~~------~~p~~~~g~~~~~g 518 (518) T protein:vir:78 490 ----------------AEVKRIYLENAIGEV------PDPEAIGGMETKGG 518 (518) T ss_pred ----------------HHHHHHHHHhcccCC------CCCccccCCCCCCC Confidence 244444444322211 11222234666666 No 42 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=24.34 E-value=2.2 Score=18.72 Aligned_cols=436 Identities=12% Similarity=0.053 Sum_probs=183.5 Q ss_pred CC-----cchHHHHHHhhCchhHHHHHHHHH-HHHHHHHHhhhhcccccchHHHHHHHHHHHhhcchhHHHHHHHhhccC Q lcl|NC_011222. 1 MG-----KSLDEIREIYRHPEGISQIAKAKE-HEERIAFHTRVRTSDDRNKPVIDFLSKVKTWIAKDKYDIFLSMFHFPV 74 (577) Q Consensus 1 ~~-----~~~~~~~~~~~~~~~~~~~aka~~-heeri~fh~~~~ta~d~~~~~~~fl~~v~~~l~kdky~~f~~~f~fpv 74 (577) |. +.|+++.++- +..|.+.+. |.+|+ .++... ..||.-- +.|.+--+++++ - T Consensus 1 ~~~~~~~~~~~~~~~~~-----~~~i~~~i~~~~~~~---~~~~~l---~~Yy~g~-~~i~~~~~~~~~----------~ 58 (499) T protein:vir:10 1 MAVVIDKDLLDDVNEPN-----IEAINYAIRELQNRK---KRLDKL---SDYYNGK-QEIEKHEFDNAT----------V 58 (499) T ss_pred CccchhhhHHhhhhcCC-----HHHHHHHHHHHHHHH---HHHHHH---HHHhccc-cchhcCCcCcCC----------C Confidence 33 2334443321 223333332 33332 222111 1111110 111111111111 1 Q ss_pred CCcc----chHHHHHHHHHHhccCCccccccccChhhhhhhhHHHhhccchhHHHHHHHHHHHHhCCCeEEEEeeccccC Q lcl|NC_011222. 75 KTNG----VTSEIFDKLSRVFDGRNPVYNYQFKSSEDRDDWEYYRKDVLKEPSVWSTDGWDNFKHRINSVLVVDMPEVQV 150 (577) Q Consensus 75 ~t~~----lt~~iF~~L~kV~dgqd~~~~y~f~~~e~~~d~~~y~se~ln~~~fw~~~~fk~~~~~~NgviVVDm~~i~~ 150 (577) +.+- +.+-|=+...-.+=|+...++ -.+.+..+.+..+.+. +++-.+...-+ +....-=.++++|- +.. T Consensus 59 ~~~ki~~n~~~~Iv~~~~~~l~g~p~~~~--~~~~~~~~~l~~~~~~-n~~~~~~~~~~-~~~~~~G~~~~~v~---~~~ 131 (499) T protein:vir:10 59 EAANVMVNHAKYITDMNVGFMTGNPVKYV--AEKGKNIDDILEVFNQ-IDIHKHDIELE-KDLSVFGYGYELLY---LKK 131 (499) T ss_pred CcceeecchHHHHHHHHhhhhcccCceee--cCChhHHHHHHHHHhh-cCHhHHHHHHH-HHHHhcCceEEEEE---ecc Confidence 1222 222222222222334444433 3456666666665533 45554444433 33333445677765 444 Q ss_pred CCcccc----------------chhhhhHHHHHHHHhhcccccceeeeee--ec--CCCE----EEEEeccceeeeccCC Q lcl|NC_011222. 151 GEKPEP----------------YFFWLPIANVLSYRTCGKDCNLMAYIMY--VT--DENK----IVYIDEERYVRFDKTR 206 (577) Q Consensus 151 ~~rpqp----------------yf~~~pie~V~~y~~~~~~~~~i~~i~~--~q--D~N~----i~~IDd~~y~~y~kn~ 206 (577) +|.+-. +...-|.+....|.. ..+.+.+.++-+ .. +++. +-+-+++...+|...+ T Consensus 132 ~g~~~~~~~~~~~~~~~~~~~~~~~v~p~~~~~v~~d-~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~ 210 (499) T protein:vir:10 132 TDPISVRDELGNEKLTPNTELKIEVIDPRATVVVCDD-TVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKT 210 (499) T ss_pred cccccccccccccccccccceEEEEEcccceEEEecC-CCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecC Confidence 443211 111122222222221 112234444433 12 2222 2233455555565444 Q ss_pred ccce----eeehhhhhhhcccceeeEecccccccCcceeeccchhHHHHHHHHHHHHHHHHHHHhhcCCceeecccccCC Q lcl|NC_011222. 207 ENDL----ILEVDNMHDLGYCPARFFWSDSISLSEPDIKISPITSELDSFDWYLYYSTAKKHLDLYASYPIYSGYERDCH 282 (577) Q Consensus 207 k~ei----~~e~~~~H~lGy~PA~~~~gD~~~~skp~ik~SpL~~~L~a~D~ll~~~ts~~hldlyA~YPiYs~y~~DC~ 282 (577) .+.+ ......-|.||.||.-.|.. + ..-.|=++.+++..|+|=...+....-=.+++-|++...- T Consensus 211 ~~~~~~~~~~~~~~~~~~g~vPvv~~~n-----~--~~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G---- 279 (499) T protein:vir:10 211 TMEVSANDPIVYDGENLFGAVPIIEFRN-----N--EERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFG---- 279 (499) T ss_pred CccccCcceecccccCCCCccceEEecC-----C--CCCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeec---- Confidence 3222 22233349999999944422 2 2344555666555555555444444433467788877531 Q ss_pred ccCCCCCccccCcceecccccccccccccccccCCCccccccCccceeeeecCcccccCCCccCcceeecccHHHHHHHh Q lcl|NC_011222. 283 YESHDGKERCDDGFLKNEKNEWITGVDGKPMACPICSSKRLRGAGSYVEIPIPDEMHNVPDLKNPITMLSADTGSLEYNV 362 (577) Q Consensus 283 ~~~~~G~~~C~~G~i~n~~~d~~tg~~~~~~~Cp~C~~K~~~g~gs~~evPiP~~~~d~~Dlr~pv~i~n~di~sleY~~ 362 (577) ..+.+ ++ ........|..+.++. .+++| +++++++.. .+-.. T Consensus 280 ---~~~~~---~~-----------------------~~~~~~~~~~~~~~~~----~~~~d----~~~l~~~~~-~~~~~ 321 (499) T protein:vir:10 280 ---FGLGD---DK-----------------------DDIQRLKRGAIEAPPR----EEGAD----IEWLTKSFD-ETQVN 321 (499) T ss_pred ---Ccccc---cc-----------------------chhhhhhhcceeccCC----CCCCc----ceEEeccCC-HHHHH Confidence 11111 00 0011123344443333 24455 556655442 23345 Q ss_pred hHHHHHHHHHHhhhhcCCC-ccccchhhcccceeccHHHHHHHHHHHhhhHHHHHHHHHHHHHHh-hc-CcccccccccC Q lcl|NC_011222. 363 NEEKRLRDELVRSVTGGEG-ELNRSEAINEKQVKAGFESLTTKLNRIKRGFEEAQTFVDSTICLL-RY-GDSFVSCNINY 439 (577) Q Consensus 363 ~~~kri~d~i~~s~~Gf~~-d~q~~kA~ne~~V~s~~ds~~~~l~~ikknfe~~k~Fvldti~~l-Ry-g~~~~~~ti~y 439 (577) ..++++.+.|+..+-...- ..+..-+.++++.+.-+..+.+++.+..+.|+++-+-++..++++ +. |.. ++- ..- T Consensus 322 ~~~~~l~~~I~~~s~~p~~~~~~~~gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~~~-~d~-~~i 399 (499) T protein:vir:10 322 LLSQSIENDIHKISYVPNMNDEKFMGNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNIKGAN-DDA-SGC 399 (499) T ss_pred HHHHHHHHHHHHHhCcccCCchhhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCc-ccc-ccc Confidence 6677888888876532211 112234568889899999999999999999999888888888876 11 211 100 011 Q ss_pred CcccccCCHHHHHHHHHHHHHc-C-CCHHHHHHHHHHHHHHhhcCCHH-HHHHHHHHHh------hCCCcCc-----cHH Q lcl|NC_011222. 440 GTEFYIYTPEELSERYKIMKET-G-ASEAELDALRQQIIETEYRNDPT-QMQRLLILNE------IEPYSHL-----TRE 505 (577) Q Consensus 440 GskFy~~t~eeL~~~i~~Ak~~-G-as~~~i~~L~~qi~e~EyrNnP~-qmqr~~vL~~------leP~~~L-----T~~ 505 (577) ...|....|..+++.+.++.+. | +|...+... . -+=.||. +|+|++-=+. .+++... +.+ T Consensus 400 ~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~-l-----~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~ 473 (499) T protein:vir:10 400 KISLVANIPSNLSDVVNNVKNADGIIPRKYTYSW-L-----PDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDRLELE 473 (499) T ss_pred eEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHh-C-----CCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCCCCCC Confidence 2345556667777777665543 3 333221111 0 1222333 4555432211 1222111 111 Q ss_pred HHHHHHhcCCCch--------hheee Q lcl|NC_011222. 506 EAVNLYKENVISE--------EDLRV 523 (577) Q Consensus 506 Ev~~l~e~g~~~e--------Edl~v 523 (577) +..+--+.+=.+. --=|| T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 474 DKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred CCCcccCCCCCCCccccccCCCCCCC Confidence 1100000000000 00123 Done!