Query lcl|NC_018838.1_cdsid_YP_006906408.1 [gene=7] [protein=hypothetical protein] [protein_id=YP_006906408.1] [location=5630..6109] Match_columns 159 No_of_seqs 13 out of 15 Neff 2.8 Searched_HMMs 1612 Date Thu Nov 7 12:43:10 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_7 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_7_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:80668 Length: 153 100.0 6.9E-92 4.3E-95 520.4 10.8 153 7-159 1-153 (153) 2 protein:vir:99922 Length: 165 100.0 1.4E-86 8.8E-90 491.2 10.4 157 1-159 1-160 (165) 3 protein:vir:8189 Length: 151 # 100.0 1.2E-82 7.6E-86 469.7 8.8 147 7-159 1-149 (151) 4 protein:vir:9576 Length: 131 # 98.4 5.7E-09 3.6E-12 65.8 9.6 115 7-127 1-131 (131) 5 protein:vir:9761 Length: 140 # 98.3 9.1E-09 5.6E-12 64.7 8.7 123 7-141 1-140 (140) 6 protein:vir:98481 Length: 136 98.3 5.3E-09 3.3E-12 66.0 6.7 127 7-147 1-136 (136) 7 protein:vir:94761 Length: 132 98.2 2.2E-08 1.4E-11 62.6 9.0 112 7-127 1-132 (132) 8 protein:vir:1640 Length: 132 # 97.9 1.6E-07 1E-10 57.8 9.4 115 7-127 1-132 (132) 9 protein:vir:78254 Length: 149 97.9 1.9E-07 1.2E-10 57.4 8.5 135 7-159 1-148 (149) 10 protein:vir:78478 Length: 149 97.9 1.9E-07 1.2E-10 57.4 8.5 135 7-159 1-148 (149) 11 protein:vir:7773 Length: 123 # 97.7 4.1E-07 2.5E-10 55.6 8.5 114 7-129 1-123 (123) 12 protein:vir:2432 Length: 124 # 97.7 3.2E-07 2E-10 56.2 6.9 115 7-129 1-124 (124) 13 protein:vir:99002 Length: 158 97.6 1.4E-06 8.7E-10 52.7 9.9 136 7-159 1-147 (158) 14 protein:vir:2345 Length: 125 # 97.6 7.6E-07 4.7E-10 54.1 7.9 115 7-129 1-125 (125) 15 protein:vir:104088 Length: 125 97.1 3.1E-06 1.9E-09 50.8 6.4 115 7-129 1-125 (125) 16 protein:vir:4228 Length: 125 # 97.0 5.7E-06 3.5E-09 49.4 6.7 115 7-129 1-125 (125) 17 protein:vir:2505 Length: 128 # 94.7 0.00011 6.7E-08 42.3 3.8 116 7-129 1-128 (128) 18 protein:vir:108221 Length: 150 87.8 0.0071 4.4E-06 32.4 5.9 127 13-159 1-150 (150) 19 protein:vir:106583 Length: 105 72.2 0.12 7.5E-05 25.6 6.9 100 6-119 1-105 (105) 20 protein:vir:79640 Length: 134 58.7 0.4 0.00025 22.8 7.0 112 7-131 1-134 (134) 21 protein:vir:78106 Length: 236 58.5 0.08 5E-05 26.6 3.1 127 9-159 1-153 (236) 22 protein:vir:105776 Length: 133 52.2 0.44 0.00027 22.5 6.1 127 10-153 1-133 (133) 23 protein:vir:107702 Length: 136 49.4 0.64 0.0004 21.6 7.4 110 7-131 1-136 (136) 24 protein:vir:5256 Length: 119 # 45.4 0.77 0.00048 21.2 7.9 105 6-119 1-119 (119) 25 protein:vir:104344 Length: 132 42.4 0.89 0.00055 20.9 7.9 112 7-145 1-132 (132) 26 protein:vir:107756 Length: 147 23.6 2.3 0.0014 18.6 5.9 119 7-133 1-147 (147) 27 protein:vir:103283 Length: 125 23.0 2.4 0.0015 18.5 5.8 109 1-131 1-125 (125) No 1 >protein:vir:80668 Length: 153 # NCBI annotation: gp7 # Family: family:all:7267 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285583;genbank:gi:148727089;genbank:GeneID:5247039 Probab=100.00 E-value=6.9e-92 Score=520.37 Aligned_cols=153 Identities=93% Similarity=1.477 Sum_probs=152.3 Q ss_pred cccccchhhcccccCCchHHHHHHHHhHHHHHHHhccccCCCCCchHHHHHHHHHHHHHhhhhcccchhhhhhcccceee Q lcl|NC_018838. 7 MGIILKPEDIEPFADIPKDKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAKAILRRALLRWNDTGVSGQVQYESAGPFAQ 86 (159) Q Consensus 7 M~~~it~~Dl~pFa~I~e~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~tAGpf~q 86 (159) |+++||++||+||+||+++|+++||+|++|||++|||||+||||+|+++||+|||||||||||+|+||++||||||||+| T Consensus 1 m~v~i~~~Dl~pF~dI~~~k~~ami~D~~a~A~~vAPCi~~~~f~~~~aAKaIlrgAiLRW~e~G~SGait~~taGpf~q 80 (153) T protein:vir:80 1 MGIILKPEDIEPFADIPREKLEAMIADVEAVAVSVAPCIAKPDFKYKDAAKAILRRALLRWNDTGVSGQVQYESAGPFAQ 80 (153) T ss_pred CceeechhhccccccCCHHHHHHHHHhhhhhhhhhccccCCCCcccHHHHHHHHHHHhhhhhhcCcccceeeecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecccCcceechHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccccceeccCCcccccC Q lcl|NC_018838. 87 TTRSNTPTNLLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSCGSNINGHDGPLWEI 159 (159) Q Consensus 87 T~~~~~~r~~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsCGa~l~g~~~plwe~ 159 (159) |+|+|++|+|||||||+|||+||++++++|+||+|||+|++++.|||+||+|||++|||||+||||+|||||| T Consensus 81 T~dtrs~r~lfwPSEItqLqklC~~~~~~g~Af~id~t~~~~v~Hs~~Cs~~fGg~CSCGa~l~g~~gplwe~ 153 (153) T protein:vir:80 81 TTRSNTPTNLLWPSEIAALKKLCEGDGGAGKAFTITPTMRSSVNHSEVCSTVWGEGCSCGSDINGYAGPLWEI 153 (153) T ss_pred eeccCCceeccChhhHHHHHHHhcCCCCCcceeEeecCCCCccccccccceeecCccccchhhcccCcccccC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:99922 Length: 165 # NCBI annotation: gp9 # Family: family:all:7267 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655526;genbank:gi:109392296;genbank:GeneID:4157091 Probab=100.00 E-value=1.4e-86 Score=491.24 Aligned_cols=157 Identities=43% Similarity=0.754 Sum_probs=147.8 Q ss_pred Cc--ceeecccccchhhcccccCCchHHHHHHHHhHHHHHHHhccccCCCCCchHHHHHHHHHHHHHhhhhcccchhhhh Q lcl|NC_018838. 1 MQ--GVVLMGIILKPEDIEPFADIPKDKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAKAILRRALLRWNDTGVSGQVQY 78 (159) Q Consensus 1 ~~--~~~lM~~~it~~Dl~pFa~I~e~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAKaILRgAILRW~d~G~SGavt~ 78 (159) |. --.-+.++||++||+||++|+++|+++||+|++|||++|||||+||||+|+++||+|||||||||||+| ||+||| T Consensus 1 ~~~~~~~~p~~ii~~eDl~Pf~~i~~~ka~~mI~da~A~A~~vAPCi~~~~f~~~~aAKaIlrgAiLRW~e~G-SGAit~ 79 (165) T protein:vir:99 1 MTEPTPTEPEPLLTAEDLAPFATIPKAKADEMIEDALGMAEVHAPCINDPGFAHRRAAKAILRGAILRWNEAG-AGAATT 79 (165) T ss_pred CCCCCCCCcceeeehhhccccccCCHHHHHHHHhhhhhhhhhhccccCCCCcccHHHHHHHHHHhhhhhhccc-Cceeee Confidence 21 112345679999999999999999999999999999999999999999999999999999999999999 999999 Q ss_pred hcccceeeeeecccCcc-eechHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccccceeccCCcccc Q lcl|NC_018838. 79 ESAGPFAQTTRSNTPTN-LLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSCGSNINGHDGPLW 157 (159) Q Consensus 79 ~tAGpf~qT~~~~~~r~-~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsCGa~l~g~~~plw 157 (159) +|||||+||+|+|++|+ |||||||+|||+||+.++++||||||||+||++|+|||+||+|||++|||||+|+++ |||| T Consensus 80 ~TaGPf~qT~DtRs~r~~mfwPSEItqLqklC~~~g~~~~AFsIDt~p~g~v~Hs~~Cs~~fGg~CSCGavl~~~-gplw 158 (165) T protein:vir:99 80 KTAGIYGQTVDTRQPRKAMFFPSEIDQLRKLCRPDDDNGGAFSIDLLPQETVTHAEICSIYFGGGCSCGAILTQG-LPLY 158 (165) T ss_pred cccccceeeeccccccccccChhhHHHHHHHhcCCCCCCcceeeecccCCCcccccccceeecCcccchhhhccC-Cccc Confidence 99999999999999875 999999999999999889999999999999999999999999999999999999966 7999 Q ss_pred cC Q lcl|NC_018838. 158 EI 159 (159) Q Consensus 158 e~ 159 (159) |- T Consensus 159 e~ 160 (165) T protein:vir:99 159 EK 160 (165) T ss_pred cc Confidence 99 No 3 >protein:vir:8189 Length: 151 # NCBI annotation: gp9 # Family: family:all:7267 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817982;genbank:gi:29566416;genbank:GeneID:2700970 Probab=100.00 E-value=1.2e-82 Score=469.66 Aligned_cols=147 Identities=35% Similarity=0.669 Sum_probs=143.1 Q ss_pred cccccchhhcccccCCchHHHHHHHHhHHHHHHHhccccC-CCCCchHHHHHHHHHHHHHhhhhcccchhhhhhccccee Q lcl|NC_018838. 7 MGIILKPEDIEPFADIPKDKLEAMIADVEAVAVSVAPCIA-KPDFKYRDAAKAILRRALLRWNDTGVSGQVQYESAGPFA 85 (159) Q Consensus 7 M~~~it~~Dl~pFa~I~e~~a~amI~da~A~A~~vAPCi~-~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~tAGpf~ 85 (159) |+++||++|| ||+.|+|+|+.+||+||+|||++|||||+ ||||+|+++||+|||||||||||+| ||++||+|||||+ T Consensus 1 m~~iik~eDL-~~~~i~e~~a~~mI~da~a~A~~vAPCi~~dp~f~~~~aAKaIlrgAiLRW~e~G-SGait~~taGp~~ 78 (151) T protein:vir:81 1 MTEIIKAADL-PDDIAANAMAAVWVDGANARASRVAPCLAADPSDDQLAEAKLILIGAVMRWSQAG-SGALQSQTMGPYG 78 (151) T ss_pred CccccccccC-CccccchhhHHHHhhcchhhhhhhcccccCCCCccchHHHHHHHHHhhhhhhccc-Cceeeeccccccc Confidence 9999999999 78889999999999999999999999999 8899999999999999999999999 9999999999999 Q ss_pred eeeecccCc-ceechHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccccceeccCCcccccC Q lcl|NC_018838. 86 QTTRSNTPT-NLLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSCGSNINGHDGPLWEI 159 (159) Q Consensus 86 qT~~~~~~r-~~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsCGa~l~g~~~plwe~ 159 (159) ||||||++| .|||||||+|||+||+ ++++||||||||+|++++ |||+||+|||++|||||+|||+ ||||- T Consensus 79 qT~DTRs~r~~~fwPSEI~qLqklC~-~~~~g~AFsIdt~p~~~~-Hs~~Cs~~fGg~CSCGa~l~g~--Pl~e~ 149 (151) T protein:vir:81 79 VTFDTRQRGGFNLWPSEITQLQDICK-NGAESKAFAVDTVACGNY-HSPICSVYFGGTCSCGAVLAGQ--PIYEQ 149 (151) T ss_pred cccccccCCCcccChhhHHHHHHHhc-cCCCCcceEEeecccCCc-cccchheeecCccccccccccC--ccccc Confidence 999999976 6999999999999999 688999999999999997 9999999999999999999999 99999 No 4 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=98.38 E-value=5.7e-09 Score=65.77 Aligned_cols=115 Identities=18% Similarity=0.094 Sum_probs=86.4 Q ss_pred cccccchhhccccc-CCch---HHHHHHHHhHHHHHHHhcccc-------CCCCCchHHHHHHHHHHHHHhh--hhcccc Q lcl|NC_018838. 7 MGIILKPEDIEPFA-DIPK---DKLEAMIADVEAVAVSVAPCI-------AKPDFKYRDAAKAILRRALLRW--NDTGVS 73 (159) Q Consensus 7 M~~~it~~Dl~pFa-~I~e---~~a~amI~da~A~A~~vAPCi-------~~pdf~~~~aAKaILRgAILRW--~d~G~S 73 (159) |..--|.+|++..- .+++ +.++.+++||-.+-+.-.|=. ..++.+++..+|.|.-.++.|= ++.+.. T Consensus 1 m~~fAtv~D~~~rwr~Lt~~E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V~~~~V~Ral~~~~~~~ 80 (131) T protein:vir:95 1 MENFATVEDLKKLWRALKFDEEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSVTVDVVARTLMTSTDQE 80 (131) T ss_pred CCccCCHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHHHHHHHHHHhcCCCCCC Confidence 88888999988552 4433 489999999999988777744 2455667788888888888885 343446 Q ss_pred hhhh-hhcccceeeeeecccCc-ce-echHHHHHHHHHccccccCCceeeeeeccCC Q lcl|NC_018838. 74 GQVQ-YESAGPFAQTTRSNTPT-NL-LWPSEIAALKKLCEGDGGAGKAFTITPTMNS 127 (159) Q Consensus 74 Gavt-~~tAGpf~qT~~~~~~r-~~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~ 127 (159) |..| |+|+|||++++....+. ++ |.++|.+.| +. +..|+|+||.-..+ T Consensus 81 G~tq~S~TaG~ys~S~t~~~p~g~lylt~~e~~~L----Gl--~~~r~~~i~~~~~~ 131 (131) T protein:vir:95 81 PMTQVAESALGYSFSGSYLVPGGGLFIKDSELKRL----GL--KKQRYGVIDIYGTD 131 (131) T ss_pred CceeeeeecccceeeeeeecCCCCceeChHHHHHh----CC--CCCceeEEeeccCC Confidence 7766 79999999999888765 45 888988877 33 24589999987766 No 5 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=98.29 E-value=9.1e-09 Score=64.67 Aligned_cols=123 Identities=14% Similarity=0.088 Sum_probs=80.4 Q ss_pred cccccchhhccccc-CCc---hHHHHHHHHhHHHHHHHhccccC-------CCCCchHHHHHHHHHHHHHhh---hhccc Q lcl|NC_018838. 7 MGIILKPEDIEPFA-DIP---KDKLEAMIADVEAVAVSVAPCIA-------KPDFKYRDAAKAILRRALLRW---NDTGV 72 (159) Q Consensus 7 M~~~it~~Dl~pFa-~I~---e~~a~amI~da~A~A~~vAPCi~-------~pdf~~~~aAKaILRgAILRW---~d~G~ 72 (159) |..--|++|++-.- .++ .+.++++|+||-++-+.-.|=.. .+..+.+.++|.|...++.|= ...+ T Consensus 1 m~~fATv~Dv~~rwr~Lt~dE~~ra~~LL~dAS~~iR~~~p~~g~~~~~~~~~~~~~~~~~k~V~~~mV~Ral~~~~d~- 79 (140) T protein:vir:97 1 MGNFATTDDVILLWRPLSVDELKRANALLKVVSDTLRMEADKVGKDLDKTMVDKPYFVNVIKSVTVDIVARTLMTSTQG- 79 (140) T ss_pred CCcCCCHHHHHHHhcCCCHhHHHHHHHHHHHHHHHHHHhhhhccCCcchhcccCccchhHHHHHHHHHHHHHhcCCCCC- Confidence 99999999998653 342 25899999999999887777332 222334555665555554442 3344 Q ss_pred chhhh-hhcccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCC Q lcl|NC_018838. 73 SGQVQ-YESAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGE 141 (159) Q Consensus 73 SGavt-~~tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~ 141 (159) .|..| ++|+|||++|+....+.| + |.++|.+.| +. +..|+|+||.-. -..|.+ | +|.. T Consensus 80 ~G~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L----Gl--~~~r~~~i~~~g--~~~~~~-~--~~~~ 140 (140) T protein:vir:97 80 EPMSQESQSALGYTWSGTYLVPGGGLFIKDNELKRL----GL--KKQRYGGIELYG--EIKRDN-D--YFDR 140 (140) T ss_pred CcceeeeeeccchhheeeeecCCCCceeChHHHHHh----CC--CCCceeeecccC--ccccCc-c--cccC Confidence 56655 799999999998887654 4 889998888 33 235899999843 322322 1 2222 No 6 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=98.25 E-value=5.3e-09 Score=65.97 Aligned_cols=127 Identities=20% Similarity=0.276 Sum_probs=79.5 Q ss_pred cccccchhhcccccC--Cc--h---HHHHHHHHhHHHHHHHhccccCCCCCchHHHHHHHHHHHHHhhhhcccchhhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFAD--IP--K---DKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAKAILRRALLRWNDTGVSGQVQYE 79 (159) Q Consensus 7 M~~~it~~Dl~pFa~--I~--e---~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~ 79 (159) |..-=|.+|++..-. ++ | .+++++|+||-.+.+.-.|=...++ +...|.|.-.++.|--=-+ .| .+|+ T Consensus 1 M~~fAtv~Dl~~rw~~~~~dee~~ra~~~~lL~dAS~~ir~~~p~~~~~~---~~~~~~V~~~~V~R~~~np-~G-~~s~ 75 (136) T protein:vir:98 1 MAAYATVEDYQARAAVTLPDGSPRRAQVEAYLDDASALMARHIPTGHTPD---PGTLRAICVAVVRRVMANP-GG-YRQR 75 (136) T ss_pred CCccCCHHHHHHHhccCCCCchhHHHHHHHHHHHHHHHHHHhCCCCCCCC---hhHHHHHHHHHHHHHhhCC-CC-cccc Confidence 999999999997743 32 2 3688999999999998766443333 4455555444444433233 34 4569 Q ss_pred cccceeeeeecccCcc-eechHHHHHHHHHccccccCCceeeeeecc-CCCCccchhhhhccCCcccccc Q lcl|NC_018838. 80 SAGPFAQTTRSNTPTN-LLWPSEIAALKKLCEGDGGAGKAFTITPTM-NSRFTHSDVCSTVWGEGCSCGS 147 (159) Q Consensus 80 tAGpf~qT~~~~~~r~-~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p-~~~~~Hs~~Cs~~~g~~CsCGa 147 (159) |+|+|++++... .+ .|.++|++.|.---+-.+...+||||++.+ ..+..-. -++|.-.- T Consensus 76 TaG~ys~s~t~~--G~Lylt~~E~~~Lg~~rqr~~~~d~a~si~~~~~~~~~~~d-------p~~~~~~~ 136 (136) T protein:vir:98 76 TIGQYAETLGED--GGLYLTEDEKGQLQPPDQTAPDADAAYSLDLDPGTRAWVDD-------PAGCGWPR 136 (136) T ss_pred cchhHHHhhhcC--CCcccChHHHHHhCCCCCcccccccceecccCCCcCCcCCC-------CCCCCCCC Confidence 999999999873 45 499999999954333334456799998553 1111111 12231111 No 7 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=98.20 E-value=2.2e-08 Score=62.58 Aligned_cols=112 Identities=21% Similarity=0.196 Sum_probs=75.2 Q ss_pred cccccchhhcccc-cCC---chHHHHHHHHhHHHHHHHhcc---------ccCCCCCchHHHHH----HHHHHHHHhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPF-ADI---PKDKLEAMIADVEAVAVSVAP---------CIAKPDFKYRDAAK----AILRRALLRWND 69 (159) Q Consensus 7 M~~~it~~Dl~pF-a~I---~e~~a~amI~da~A~A~~vAP---------Ci~~pdf~~~~aAK----aILRgAILRW~d 69 (159) |..--|++|++.= -.+ +.++++.+++||-++-+.-+| |.-.||.. +.++| ++.+.|++. + T Consensus 1 m~~fAtv~Dl~~r~r~L~~dE~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~-~~~~k~V~~~~V~Ral~~--~ 77 (132) T protein:vir:94 1 MNPFATVDDLTMLWRPLKGDEKERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYF-SSVVKSVTVDIVARTLMT--S 77 (132) T ss_pred CCCcCCHHHHHHHhccCChhHHHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccc-hhHHHHHHHHHHHHHhcC--C Confidence 8888888888732 123 337899999998887764444 33233332 33344 556666665 3 Q ss_pred cccchhhh-hhcccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCC Q lcl|NC_018838. 70 TGVSGQVQ-YESAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNS 127 (159) Q Consensus 70 ~G~SGavt-~~tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~ 127 (159) .+..|..| |+|+|||++++....+.| + |.++|.+.| +. +..|+|+||.-.++ T Consensus 78 ~~~~g~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L----Gl--~~~r~~~i~~~~~~ 132 (132) T protein:vir:94 78 TDQEPMTQTTESALGYSVSGSYLVPGGGLFIKNSELSRL----GL--KKQRFGVIDFYGND 132 (132) T ss_pred CCCCCceeeeeecccceeeeeeecCCCCceeChHHHHhh----CC--CCCceEEEeecCCC Confidence 23356666 799999999998887654 4 889988888 33 24689999987766 No 8 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=97.95 E-value=1.6e-07 Score=57.81 Aligned_cols=115 Identities=17% Similarity=0.089 Sum_probs=77.3 Q ss_pred cccccchhhccccc-CCch---HHHHHHHHhHHHHHHHhccccC-------CCCCc-hHHHHHHHHHHHHHhhh--hccc Q lcl|NC_018838. 7 MGIILKPEDIEPFA-DIPK---DKLEAMIADVEAVAVSVAPCIA-------KPDFK-YRDAAKAILRRALLRWN--DTGV 72 (159) Q Consensus 7 M~~~it~~Dl~pFa-~I~e---~~a~amI~da~A~A~~vAPCi~-------~pdf~-~~~aAKaILRgAILRW~--d~G~ 72 (159) |..--|.+|++-.- .+++ +.++++|.||-.+-+.-.|=.. -++.+ ....+|.|--.++.|== +.+. T Consensus 1 m~~fAtv~Dv~~r~r~L~~~E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~V~~~~V~Ral~~~~~~ 80 (132) T protein:vir:16 1 MNPFATVDDLTMLWRPLKGDEKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKSVTVDIVARTLMTSTDQ 80 (132) T ss_pred CCccCCHHHHHHHhcCCCHhHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHHHHHHHHHHHhcCCCCC Confidence 88888999988653 5544 4899999999888876555332 12222 23345655444444432 2233 Q ss_pred chhhh-hhcccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCC Q lcl|NC_018838. 73 SGQVQ-YESAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNS 127 (159) Q Consensus 73 SGavt-~~tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~ 127 (159) .|..| |+|+|||++++....+.| + |.++|.+.| +. +.+|+|+||.-.++ T Consensus 81 ~G~tq~S~TaG~ys~S~t~~~p~G~lylt~~e~~~L----G~--~~~r~~~i~~~~~~ 132 (132) T protein:vir:16 81 EPMTQTTESALGYSVSGSYLVPGGGLFIKNSELSRL----GL--KKQRFGVIDFYGND 132 (132) T ss_pred CCceeeeeeccchheeeeeecCCCcceeChHHHHhh----CC--CCCceEEEeecCCC Confidence 46666 799999999998887654 4 899999877 32 34699999987766 No 9 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=97.87 E-value=1.9e-07 Score=57.42 Aligned_cols=135 Identities=18% Similarity=0.234 Sum_probs=84.8 Q ss_pred cccccchhhccccc--CC---chHHHHHHHHhHHHHHHHhccccC--CCCCchHHHHHHHHHHHHHhhhhcccchhhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DI---PKDKLEAMIADVEAVAVSVAPCIA--KPDFKYRDAAKAILRRALLRWNDTGVSGQVQYE 79 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I---~e~~a~amI~da~A~A~~vAPCi~--~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~ 79 (159) |+ -=|.+|++.+- .+ .+..++.+++||.++-++-.|=|+ -+|.++.+.+|+|--.+++|-- ++-+| ++|+ T Consensus 1 ~a-fAtv~Dve~rw~r~LT~eE~~~ae~lL~dAs~~IR~~iP~La~~~~dp~~~a~v~~V~~~mV~R~~-rnpeG-~~S~ 77 (149) T protein:vir:78 1 MA-YAEPSDVVARLGRPLTDDEETQVETFLEDAEIEIRSRIPDLDDKAEDEDYLKRVIKVEASAVTRLI-RNPDG-YIGE 77 (149) T ss_pred CC-cCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhccccccccCCcchhhHHHHHHHHHHHHHh-cCCCC-eeee Confidence 22 23578888763 33 445699999999999999889877 4555555566777777777755 45466 5789 Q ss_pred cccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeec-c--C-CCCccchhhhhccCCcccccceeccCC Q lcl|NC_018838. 80 SAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPT-M--N-SRFTHSDVCSTVWGEGCSCGSNINGHD 153 (159) Q Consensus 80 tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~-p--~-~~~~Hs~~Cs~~~g~~CsCGa~l~g~~ 153 (159) |.|+|+.++....+.| + +-++|++.| +..++. |+|+|.+. | + |.|.. .-|+-|+ .--+. T Consensus 78 T~G~YS~slt~~np~G~LylT~~E~a~L----G~~r~~-G~~~i~p~~~~~~~~~~~~--~~~~~~~--------~~~~~ 142 (149) T protein:vir:78 78 TDGNYSYQLNWRLNTGAIEITDKEWAQL----GLSKNV-GVLNVRPKTPLERSGEYPA--FGSVEWQ--------VFQQS 142 (149) T ss_pred ecchhhhhhhccCCCCceeeCHHHHHhh----CCcccc-cceeecccCccccCCCCCc--ccceeee--------eeecc Confidence 9999999998887655 4 889999999 333333 69999864 3 2 22210 0011111 01122 Q ss_pred cccccC Q lcl|NC_018838. 154 GPLWEI 159 (159) Q Consensus 154 ~plwe~ 159 (159) .|||=- T Consensus 143 ~~~~~~ 148 (149) T protein:vir:78 143 SPLYWG 148 (149) T ss_pred Cccccc Confidence 233211 No 10 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=97.87 E-value=1.9e-07 Score=57.42 Aligned_cols=135 Identities=18% Similarity=0.234 Sum_probs=84.8 Q ss_pred cccccchhhccccc--CC---chHHHHHHHHhHHHHHHHhccccC--CCCCchHHHHHHHHHHHHHhhhhcccchhhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DI---PKDKLEAMIADVEAVAVSVAPCIA--KPDFKYRDAAKAILRRALLRWNDTGVSGQVQYE 79 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I---~e~~a~amI~da~A~A~~vAPCi~--~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~ 79 (159) |+ -=|.+|++.+- .+ .+..++.+++||.++-++-.|=|+ -+|.++.+.+|+|--.+++|-- ++-+| ++|+ T Consensus 1 ~a-fAtv~Dve~rw~r~LT~eE~~~ae~lL~dAs~~IR~~iP~La~~~~dp~~~a~v~~V~~~mV~R~~-rnpeG-~~S~ 77 (149) T protein:vir:78 1 MA-YAEPSDVVARLGRPLTDDEETQVETFLEDAEIEIRSRIPDLDDKAEDEDYLKRVIKVEASAVTRLI-RNPDG-YIGE 77 (149) T ss_pred CC-cCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhccccccccCCcchhhHHHHHHHHHHHHHh-cCCCC-eeee Confidence 22 23578888763 33 445699999999999999889877 4555555566777777777755 45466 5789 Q ss_pred cccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeec-c--C-CCCccchhhhhccCCcccccceeccCC Q lcl|NC_018838. 80 SAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPT-M--N-SRFTHSDVCSTVWGEGCSCGSNINGHD 153 (159) Q Consensus 80 tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~-p--~-~~~~Hs~~Cs~~~g~~CsCGa~l~g~~ 153 (159) |.|+|+.++....+.| + +-++|++.| +..++. |+|+|.+. | + |.|.. .-|+-|+ .--+. T Consensus 78 T~G~YS~slt~~np~G~LylT~~E~a~L----G~~r~~-G~~~i~p~~~~~~~~~~~~--~~~~~~~--------~~~~~ 142 (149) T protein:vir:78 78 TDGNYSYQLNWRLNTGAIEITDKEWAQL----GLSKNV-GVLNVRPKTPLERSGEYPA--FGSVEWQ--------VFQQS 142 (149) T ss_pred ecchhhhhhhccCCCCceeeCHHHHHhh----CCcccc-cceeecccCccccCCCCCc--ccceeee--------eeecc Confidence 9999999998887655 4 889999999 333333 69999864 3 2 22210 0011111 01122 Q ss_pred cccccC Q lcl|NC_018838. 154 GPLWEI 159 (159) Q Consensus 154 ~plwe~ 159 (159) .|||=- T Consensus 143 ~~~~~~ 148 (149) T protein:vir:78 143 SPLYWG 148 (149) T ss_pred Cccccc Confidence 233211 No 11 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=97.74 E-value=4.1e-07 Score=55.61 Aligned_cols=114 Identities=22% Similarity=0.253 Sum_probs=80.9 Q ss_pred cccccchhhccccc--CC---chHHHHHHHHhHHHHHHHhccccC--CCCCchHHHHHHHHHHHHHhhhhcccchhhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DI---PKDKLEAMIADVEAVAVSVAPCIA--KPDFKYRDAAKAILRRALLRWNDTGVSGQVQYE 79 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I---~e~~a~amI~da~A~A~~vAPCi~--~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~ 79 (159) |+ -=|.+|++.|- .+ ++..++.+++||+.+-+.--|=++ -+|..+.+..|+|.-.+++|-- +.-+| .+|+ T Consensus 1 ~~-~At~~Dv~ar~~r~LT~~E~~~ve~lL~dAs~~ir~r~P~l~~~a~d~~~~~~~~~V~~~~V~R~~-rnpeG-~~s~ 77 (123) T protein:vir:77 1 MP-YATASDVTSRWARQPTDEETALINVRLADVERMIKRRIPDLATKVTDPDYLEDLKQVEADAVLRLV-RNPEG-YLSE 77 (123) T ss_pred CC-cCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhccCcccccCCcchhHHHHHHHHHHHHHHh-hCCCC-ceec Confidence 22 23588888763 34 445689999999999999999877 4455555666777777777755 33366 5779 Q ss_pred cccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCCCC Q lcl|NC_018838. 80 SAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNSRF 129 (159) Q Consensus 80 tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~ 129 (159) |.|+|++++....+.| | +-++|++.|+- +..|+|+|.++|---. T Consensus 78 T~G~ys~sl~~a~~~g~Lylt~~E~~~Lg~------~~~~~~~i~p~~~~~~ 123 (123) T protein:vir:77 78 TDGNYTYMLRSDLASGKLEIFPEEWEILGY------RRSRMTVIVPNPVMPT 123 (123) T ss_pred ccchhhhhhcccCCCCcceeCHHHHHhhcC------CCCceeEEeeceecCC Confidence 9999999998776554 4 78999998852 2357999998873221 No 12 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=97.67 E-value=3.2e-07 Score=56.21 Aligned_cols=115 Identities=17% Similarity=0.194 Sum_probs=78.0 Q ss_pred cccccchhhccccc--CC---chHHHHHHHHhHHHHHHHhccccCC--CCCchHHHHHHHHHHHHHhhhhcccchhhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DI---PKDKLEAMIADVEAVAVSVAPCIAK--PDFKYRDAAKAILRRALLRWNDTGVSGQVQYE 79 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I---~e~~a~amI~da~A~A~~vAPCi~~--pdf~~~~aAKaILRgAILRW~d~G~SGavt~~ 79 (159) |. -=|++|++.|- .+ ++..++.+++||-.+-+..=|=+++ .+..+.+..|+|.-.+++|---.. +| .+|+ T Consensus 1 ~~-~At~~Dv~~rw~r~Lt~~E~~~ve~lL~dAs~~ir~r~P~l~~~~~~~~~~~~v~~V~a~~V~R~~rnP-~G-~~s~ 77 (124) T protein:vir:24 1 MA-YATADDVVTLWAKEPEPEVMALIERRLEQVERMIRRRIPDLDARVSSDIFRADLIDIEADAVLRLVRNP-EG-YLSE 77 (124) T ss_pred CC-CCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHhcCCCcchhcCCCCChhhHHHHHHHHHHHHhhCC-CC-ceec Confidence 22 22588888774 33 4456899999999999988887752 233455555666666666654334 66 5789 Q ss_pred cccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCCCC Q lcl|NC_018838. 80 SAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNSRF 129 (159) Q Consensus 80 tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~ 129 (159) |.|+|++++.+..+.| | +-++|++.|+ . ++..|+|+|.+++---- T Consensus 78 T~G~Ys~sl~~~~~~g~Lylt~~E~~~Lg----~-~r~~~~~~i~p~~~~~~ 124 (124) T protein:vir:24 78 TDGAYTYQLQADLSQGKLVILDEEWTTLG----V-NRLSRMSTLVPNIVMPT 124 (124) T ss_pred ccchhHHhhhhcccCCceeeCHHHHHhhC----c-ccccceeEeecceeeCC Confidence 9999999998876654 4 7899999885 2 23347999998762111 No 13 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=97.63 E-value=1.4e-06 Score=52.68 Aligned_cols=136 Identities=18% Similarity=0.226 Sum_probs=80.8 Q ss_pred cccccchhhccccc--CCchH------HHHHHHHhHHHHHHHhccccCCCCC-chHHHHHHHHHHHHHh-hhhcccchhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DIPKD------KLEAMIADVEAVAVSVAPCIAKPDF-KYRDAAKAILRRALLR-WNDTGVSGQV 76 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I~e~------~a~amI~da~A~A~~vAPCi~~pdf-~~~~aAKaILRgAILR-W~d~G~SGav 76 (159) |++--+.||++.|. .+|++ +|+++.+|+=..|+.+ -|..=|-. +-++.+|+|+-.|.-| |+.-. | + T Consensus 1 ~~alasvee~~trl~~~lp~~~~r~~a~a~~vLd~~S~~ar~~-~gr~W~~~~daP~~vr~ivL~aa~R~~~NP~--g-~ 76 (158) T protein:vir:99 1 MAALVSVEEFTTFLRVPLPEEGSEKYTQMEFLLTLASDWAREL-SCKPWLLPADAPVTARGIILAASRREWNNPK--R-V 76 (158) T ss_pred CcceeeHhhhhhhhcccCChhhhHHHHHHHHHHHHHHHHHHHh-cCccCCCCCcchhHHHHHHHHHHHHHHhcCC--c-e Confidence 99999999999995 47533 3444477754433333 24432322 2356677766555555 55544 3 6 Q ss_pred hhhcccceeeeeeccc-CcceechHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccccceeccCCcc Q lcl|NC_018838. 77 QYESAGPFAQTTRSNT-PTNLLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSCGSNINGHDGP 155 (159) Q Consensus 77 t~~tAGpf~qT~~~~~-~r~~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsCGa~l~g~~~p 155 (159) +|+++|||..++--.- +-++|-+.|++.|+++.+. .|+-|+++|+--+-..-.-.-.+++++ +--| T Consensus 77 ~~~~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s---~GG~~~~~ttR~d~~~~~~yv~v~~~G----------dpfP 143 (158) T protein:vir:99 77 SYVVKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRS---TGNWGVIETYRDDEEQLNGYLEVYPHG----------GLMP 143 (158) T ss_pred EEeeecchhhhcccccCCCcccCHHHHHHHHHhhcc---cCceeEEEeecCccccCCceecccCCC----------Cccc Confidence 6799999999995553 3369999999999999643 488999999763322211011122221 1112 Q ss_pred cccC Q lcl|NC_018838. 156 LWEI 159 (159) Q Consensus 156 lwe~ 159 (159) ||-- T Consensus 144 ~~~~ 147 (158) T protein:vir:99 144 VYHP 147 (158) T ss_pred ccCc Confidence 2111 No 14 >protein:vir:2345 Length: 125 # NCBI annotation: gp15 # Family: family:all:2817 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075282;genbank:gi:12657869;genbank:GeneID:920134 Probab=97.58 E-value=7.6e-07 Score=54.13 Aligned_cols=115 Identities=20% Similarity=0.239 Sum_probs=89.8 Q ss_pred cccccchhhccccc--CC---chHHHHHHHHhHHHHHHHhccccC---CCCCchHHHHHHHHHHHHHhhhhcccchhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DI---PKDKLEAMIADVEAVAVSVAPCIA---KPDFKYRDAAKAILRRALLRWNDTGVSGQVQY 78 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I---~e~~a~amI~da~A~A~~vAPCi~---~pdf~~~~aAKaILRgAILRW~d~G~SGavt~ 78 (159) |+---+.+|.+.|- .+ +++.++..|.||+.|-++.=|=|+ +.+..+....|+|.-.+++|-. ++-+| .+| T Consensus 1 ma~~A~~eDV~a~w~R~lt~eE~~~V~~~L~~ae~~irrriPdL~~r~~~~~~~~~~v~~V~a~~V~Rv~-rnPeG-y~s 78 (125) T protein:vir:23 1 MATLATHEDVTAFWARTPTAEEIVLINRRLAQAERMLLRAIPELLIKASSDPVFRAEVIDIEAEAVLRLV-RNHEG-YLS 78 (125) T ss_pred CCcccCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhcCChhhhhcCCCcchhhHHHHHHHHHHHHh-cCCCC-ccc Confidence 88888899988884 34 556788999999999999999886 5556666779999999999977 55367 777 Q ss_pred hcccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCCCC Q lcl|NC_018838. 79 ESAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNSRF 129 (159) Q Consensus 79 ~tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~ 129 (159) +|.|+|++++.++...| + .-|+|.+.|+- . ..|+|.|.++|---. T Consensus 79 eT~g~Yt~~l~~~~~~g~L~it~~E~a~Lg~-----~-~s~~~vi~p~~~~p~ 125 (125) T protein:vir:23 79 ETDGNYTYMLQAQDPNRKLEILPEEWEVLGI-----V-RSGLGILVPTVVLPS 125 (125) T ss_pred cccchhhhhhhccCCCCceeecHHHHHhhcc-----c-cccceEEeeceecCC Confidence 99999999998886555 4 78999999863 1 237999998873221 No 15 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=97.11 E-value=3.1e-06 Score=50.76 Aligned_cols=115 Identities=17% Similarity=0.233 Sum_probs=81.3 Q ss_pred cccccchhhccccc--CC---chHHHHHHHHhHHHHHHHhccccC---CCCCchHHHHHHHHHHHHHhhhhcccchhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DI---PKDKLEAMIADVEAVAVSVAPCIA---KPDFKYRDAAKAILRRALLRWNDTGVSGQVQY 78 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I---~e~~a~amI~da~A~A~~vAPCi~---~pdf~~~~aAKaILRgAILRW~d~G~SGavt~ 78 (159) |+ -=+++|.+.|- .+ ++..++..++||+.|-.+-=|=|+ ..|..+....++|.-.|++|-.=.- +| .+| T Consensus 1 ma-~A~~~Dv~~~w~r~lT~~E~~~v~~~L~~Ae~~Ir~riP~L~~r~~a~~~~~~~v~~Vea~aV~Rv~rNP-eG-y~s 77 (125) T protein:vir:10 1 MA-YANAQDVVTLWAKEPEPEVMELIERRLAQVERMIKRRIPNLDLKVAADATFQADLIDIEADAVLRLVRNP-EG-YIS 77 (125) T ss_pred CC-cCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHhCCChhhhhhcCCCccccHHHHHHHHHHHHhcCC-Cc-ccc Confidence 43 34688888884 34 445677889999999998888775 2233344447788888888865444 67 488 Q ss_pred hcccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCCCC Q lcl|NC_018838. 79 ESAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNSRF 129 (159) Q Consensus 79 ~tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~ 129 (159) +|-|+|++++.++-..| + +-|+|++.|+= .+.-|+|+|.+.+---- T Consensus 78 ~T~G~Ys~~l~~~~~~g~L~it~~Ew~~Lg~-----~r~s~~~~i~p~~~~~~ 125 (125) T protein:vir:10 78 ETDGAYTYQLQTDLSQGRLTILDDEWTTLGV-----NRLSRMSVIAPNIVMPT 125 (125) T ss_pred cccchhHHhhhcccccCceeeCHHHHHhhcc-----ccccceeeeecccccCC Confidence 99999999998776544 4 78999999862 23347999988762111 No 16 >protein:vir:4228 Length: 125 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2817 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039683;swissprot:sw:q05225;genbank:gi:9625449;uniprot:Q05225;genbank:GeneID:2942926 Probab=96.97 E-value=5.7e-06 Score=49.35 Aligned_cols=115 Identities=17% Similarity=0.205 Sum_probs=83.7 Q ss_pred cccccchhhccccc--CC---chHHHHHHHHhHHHHHHHhccccC---CCCCchHHHHHHHHHHHHHhhhhcccchhhhh Q lcl|NC_018838. 7 MGIILKPEDIEPFA--DI---PKDKLEAMIADVEAVAVSVAPCIA---KPDFKYRDAAKAILRRALLRWNDTGVSGQVQY 78 (159) Q Consensus 7 M~~~it~~Dl~pFa--~I---~e~~a~amI~da~A~A~~vAPCi~---~pdf~~~~aAKaILRgAILRW~d~G~SGavt~ 78 (159) |+ -=+++|.+.|- .+ .+..++..+.||+.|-.+.=|=|+ ..+..+.+..++|--.|++|-.=.- +| .+| T Consensus 1 m~-~A~~eDV~a~w~r~lt~~e~~~v~~~L~~Ae~~Ir~riPdL~~r~~~~~~~~~~v~~Vea~aV~Rv~RNp-eG-y~s 77 (125) T protein:vir:42 1 MA-YATAEDVVTLWAKEPEPEVMALIERRLQQIERMIKRRIPDLDVKAAASATFRADLIDIEADAVLRLVRNP-EG-YLS 77 (125) T ss_pred CC-cccHhHHHHHhCCCCChHHHHHHHHHHHHHHHHHHHhCCCchhhhcccCcchhhHHHHHHHHHHHHHhCC-Cc-ccc Confidence 43 34678877773 33 455678899999999998888775 4456667778888888888866554 67 677 Q ss_pred hcccceeeeeecccCcc-e-echHHHHHHHHHccccccCCceeeeeeccCCCC Q lcl|NC_018838. 79 ESAGPFAQTTRSNTPTN-L-LWPSEIAALKKLCEGDGGAGKAFTITPTMNSRF 129 (159) Q Consensus 79 ~tAGpf~qT~~~~~~r~-~-f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~ 129 (159) +|-|+|++++.++-..| + +-|+|.+.|+= .++-|+|+|.+++---- T Consensus 78 ~T~G~Ys~~l~~~~~~g~L~it~eEw~~L~p-----~~~~g~~~i~P~~~~~~ 125 (125) T protein:vir:42 78 ETDGAYTYQLQADLSQGKLTILDEEWEILGV-----NSQKRMAVIVPNVVMPT 125 (125) T ss_pred ccchhHHHhhhcccccCceeeCHHHHHhhCc-----cccccceeecccceeCC Confidence 99999999998875545 4 78999999862 22446999987762111 No 17 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=94.68 E-value=0.00011 Score=42.34 Aligned_cols=116 Identities=13% Similarity=0.100 Sum_probs=63.6 Q ss_pred ccccc---chhhcccc--cCC---chHHHHHHHHhHHHHHHHhccccCCCCCc---hHHHHHHHHHHHHHhhhhcccchh Q lcl|NC_018838. 7 MGIIL---KPEDIEPF--ADI---PKDKLEAMIADVEAVAVSVAPCIAKPDFK---YRDAAKAILRRALLRWNDTGVSGQ 75 (159) Q Consensus 7 M~~~i---t~~Dl~pF--a~I---~e~~a~amI~da~A~A~~vAPCi~~pdf~---~~~aAKaILRgAILRW~d~G~SGa 75 (159) |..-- |.+|+.-= ..+ ++..++.||++|=.+-+-.=+-=..||.. -...+-.|++.|+.|=+|.= +. T Consensus 1 ~~~~~alAtvdDv~~~lrr~Lt~dE~~~a~~Ll~eAsdlI~g~l~~~~vp~~~p~~v~rVvA~ivarAltr~~~~~--pe 78 (128) T protein:vir:25 1 MTECKALATSQDVKRALRRDLTEAEQTDLSELLAEATDLVVGYLHPYPVPTPTPGPIKRVVASMVAAVLTRPTQIL--PE 78 (128) T ss_pred CccchhccCHHHHHHHhcCCCCHHHHHHHHHHHhcchheeeeecCCCCCCCCCCchHHHHHHHHHHHHhhCCCccC--CC Confidence 33211 12222100 011 34456667776554443221111123322 34556678888888866643 45 Q ss_pred hhhhcccceeeeeecccCc-ceechHHHHHHHHHccccccCCceeeeeeccCCCC Q lcl|NC_018838. 76 VQYESAGPFAQTTRSNTPT-NLLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRF 129 (159) Q Consensus 76 vt~~tAGpf~qT~~~~~~r-~~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~ 129 (159) -|+.|||||+.+|.++... |+|.-+| +|+.=+.. +-+||+|...-.-++ T Consensus 79 ~~S~TAgpfs~~ft~~~~~~g~yLTaa---~k~~Lrp~--R~~~~sV~l~sery~ 128 (128) T protein:vir:25 79 TQSLTADGFGVTFTPGGNSPGPYLSAA---LKQRLRPY--RTGMVAVEMGSERYC 128 (128) T ss_pred ceeeecccccccccCCCCCCCceEcHH---HHhhcccc--cceeeEeecccccCC Confidence 5677999999999888764 6765543 33333432 336999988776555 No 18 >protein:vir:108221 Length: 150 # NCBI annotation: gp11 # Family: family:all:28004 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552340;genbank:gi:160700660;genbank:GeneID:5758941 Probab=87.81 E-value=0.0071 Score=32.35 Aligned_cols=127 Identities=14% Similarity=0.129 Sum_probs=67.9 Q ss_pred hhhcccccCC-------------chHHHHHHHHhHHHHHHHhccc-----cCCCCCchHHHHHHHHHHHHHhhhhcccch Q lcl|NC_018838. 13 PEDIEPFADI-------------PKDKLEAMIADVEAVAVSVAPC-----IAKPDFKYRDAAKAILRRALLRWNDTGVSG 74 (159) Q Consensus 13 ~~Dl~pFa~I-------------~e~~a~amI~da~A~A~~vAPC-----i~~pdf~~~~aAKaILRgAILRW~d~G~SG 74 (159) -+|..||+|. ++..|+.+.++|--+-++=-|- |.++|..-.-.+-+|+++|+++--|. +| T Consensus 1 ~ad~~pFadv~~lea~WrpLt~~E~~~Ae~LL~~As~~IR~~~Pa~a~a~l~~dd~~A~~Vs~~vVk~Am~~~~e~--~G 78 (150) T protein:vir:10 1 MADVTPFIDVSQFEAMFRPLGDGERLLAEVLLKAAAIRIRDRVAAAGRAPLEPDDAMAILVSFEVTRDAMPPIPEM--AG 78 (150) T ss_pred CCCCccccchhhhHhhhcccChhHHHHHHHHHHHHHHHHhhcccccCCCCCCCCcchhHHHHHHHHHHhccccccc--cc Confidence 4566677654 4566777887776666654333 44555556666788999999986653 67 Q ss_pred hhh-hhcccceeeeeecccCcc-eechHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccc---ccee Q lcl|NC_018838. 75 QVQ-YESAGPFAQTTRSNTPTN-LLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSC---GSNI 149 (159) Q Consensus 75 avt-~~tAGpf~qT~~~~~~r~-~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsC---Ga~l 149 (159) ..+ ++|+|||..+...+.+.+ ++|=+--++|-- |+-++---+ .-|-.-||..-.- .-++ T Consensus 79 ~ss~S~T~G~rses~T~snPag~L~ft~~~k~lLG-------------is~ta~P~~---~~~~~df~~~~~~~~~~~~~ 142 (150) T protein:vir:10 79 RTQYSITTDDRTEQATMATAAGLLDFNERHWSLLG-------------ISATAGPEY---GGMGGDFGQLGRANPYPIVI 142 (150) T ss_pred cchhhhccccccccccccchhhhhhhhHHHHHHhC-------------CCccCCccc---cCCCcchhhhcCCCCcceEe Confidence 887 789999999888887754 555443333322 222221000 0122333322000 0000 Q ss_pred ccCCcccccC Q lcl|NC_018838. 150 NGHDGPLWEI 159 (159) Q Consensus 150 ~g~~~plwe~ 159 (159) -.| .=|-- T Consensus 143 ~~~--~~~~~ 150 (150) T protein:vir:10 143 GSD--ADWLG 150 (150) T ss_pred cCC--ccccC Confidence 000 00000 No 19 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=72.20 E-value=0.12 Score=25.60 Aligned_cols=100 Identities=13% Similarity=0.241 Sum_probs=67.2 Q ss_pred ecccccchhhcccc----cCCchHHHHHHHHhHHHHHHHhccccCCCCCchHHHHHHHHHH-HHHhhhhcccchhhhhhc Q lcl|NC_018838. 6 LMGIILKPEDIEPF----ADIPKDKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAKAILRR-ALLRWNDTGVSGQVQYES 80 (159) Q Consensus 6 lM~~~it~~Dl~pF----a~I~e~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAKaILRg-AILRW~d~G~SGavt~~t 80 (159) .|.+.--.+.|.-+ ++.+.+..+.+|.|+.++.+.. +..+ ..+....-|+|. ||-|+|-.|..|. +|+| T Consensus 1 ~~~~~~~~e~ik~L~~~~d~~~DelL~~lieda~~~vl~y---~nr~--~ip~~l~~~v~evav~~fNR~G~EG~-tS~S 74 (105) T protein:vir:10 1 MLNVDQLTEIVSALSTRLENVNNALLTELVKESIAQVLDY---TGQK--KLVGSMDIYVKKLAVINYNRLGIEGE-TQRS 74 (105) T ss_pred CCchHHHHHHHHHHhccCCCchhHHHHHHHHHHHHHHHHH---cCCc--ccchhHHHHHHHHHHHHhcccCCccc-ceee Confidence 55555555555544 3467779999999999999875 3333 455666666665 7999999997775 5699 Q ss_pred ccceeeeeecccCcceechHHHHHHHHHccccccCCcee Q lcl|NC_018838. 81 AGPFAQTTRSNTPTNLLWPSEIAALKKLCEGDGGAGKAF 119 (159) Q Consensus 81 AGpf~qT~~~~~~r~~f~PsEI~~LQ~lC~~~~~~g~Af 119 (159) .|=.+.|+.++. |.||.+-=+-++.. +-++ | T Consensus 75 egGvS~sy~~~~------~~~~~~~l~~yR~~-~v~~-~ 105 (105) T protein:vir:10 75 EGGITNYLETGI------PKDIRQGLNSYRIA-KVKK-L 105 (105) T ss_pred cCCeeeeeeccC------cHHHHHHHHHHhhh-cccC-C Confidence 999999997754 46666544555542 2222 2 No 20 >protein:vir:79640 Length: 134 # NCBI annotation: gp38 # Family: family:all:5122 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285527;genbank:gi:148734510;genbank:GeneID:5219998 Probab=58.74 E-value=0.4 Score=22.78 Aligned_cols=112 Identities=21% Similarity=0.237 Sum_probs=65.5 Q ss_pred cccccchhhc----ccccCCchHHHHHHHHhHHHHHHHhccccCCCCCc-hHHHHHHHHHHHHHhh----hhccc----- Q lcl|NC_018838. 7 MGIILKPEDI----EPFADIPKDKLEAMIADVEAVAVSVAPCIAKPDFK-YRDAAKAILRRALLRW----NDTGV----- 72 (159) Q Consensus 7 M~~~it~~Dl----~pFa~I~e~~a~amI~da~A~A~~vAPCi~~pdf~-~~~aAKaILRgAILRW----~d~G~----- 72 (159) |+.+.+.+.. +.|.+.|.+..+..+++|.... | +..|. ..+.|-..+--=+|.- +-.|. T Consensus 1 m~d~~~ve~Fr~l~PeF~~vpde~l~~~~~~A~~~i-----~--~~~~g~~~~~al~lltAHLl~l~~~~~~~g~~~~~~ 73 (134) T protein:vir:79 1 MNDIEILEQIYKIAPAFKKVDPELIQAWIELAKDFV-----C--EKHFKDKYFRAVALYTLHLMTLDGAMKQESESVESY 73 (134) T ss_pred CchHHHHHHHHHhccccccCCHHHHHHHHHHhhhhh-----c--CCCCChHHHHHHHHHHHHHHhhcccccccccccccc Confidence 7776555532 4588899999999998876654 3 23333 2233333333334422 22221 Q ss_pred c-hhhhhhcccceeeeeecccCcc-eec------hHHHHHHHHHccccccCCceeeeeeccCCCCcc Q lcl|NC_018838. 73 S-GQVQYESAGPFAQTTRSNTPTN-LLW------PSEIAALKKLCEGDGGAGKAFTITPTMNSRFTH 131 (159) Q Consensus 73 S-Gavt~~tAGpf~qT~~~~~~r~-~f~------PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~H 131 (159) + +..++.+.|++++|++.-+..+ -+| =++-.+|.+.. ++-|.+-|--.+++.. T Consensus 74 ~grv~ssst~G~vSvS~a~ps~~~~~~Wl~~TpYGq~y~~L~k~~------~GGf~~~t~~~~~~~r 134 (134) T protein:vir:79 74 SHRIASFSLTGEFSQTFSKVSDDTSGNTLRQTPWGKMYEVLNKKK------GGGFGLTTAFHRRCSR 134 (134) T ss_pred cchhhhhhhhcceeeeccCcccchhHHHHhcCHHHHHHHHHHHhh------ccchHhhhhccccCCC Confidence 2 4444577999999997755433 233 46777888853 3467777766554422 No 21 >protein:vir:78106 Length: 236 # NCBI annotation: hypothetical protein # Family: family:all:29846 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294804;genbank:gi:149882825;genbank:GeneID:5309134 Probab=58.55 E-value=0.08 Score=26.59 Aligned_cols=127 Identities=23% Similarity=0.325 Sum_probs=63.8 Q ss_pred cccchhhcccccCCchHHHHHHHHhHHHHHHHhccccC--CCCCchHHHHHHHHHHHHHhhhhcccchhhhhhcccceee Q lcl|NC_018838. 9 IILKPEDIEPFADIPKDKLEAMIADVEAVAVSVAPCIA--KPDFKYRDAAKAILRRALLRWNDTGVSGQVQYESAGPFAQ 86 (159) Q Consensus 9 ~~it~~Dl~pFa~I~e~~a~amI~da~A~A~~vAPCi~--~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~tAGpf~q 86 (159) ..|+|+.|.. +...+ +.++.+|+-+||||+ ..|.+..+-|-+|||...-.---+| .--+ ++-|--+. T Consensus 1 mtikpdeigs----ddgaa----rrvlvlartiapcldslpedsdrrkdalailrnvyeevltrg-arnv--rsegvasa 69 (236) T protein:vir:78 1 MTIKPDEIGS----DDGAA----RRVLVLARTIAPCLDSLPEDSDRRKDALAILRNVYEEVLTRG-ARNV--RSEGVASA 69 (236) T ss_pred CccCcccccC----ccccc----eeeeehhhhhhhhhhhccccchhhhHHHHHHHHHHHHHHhhh-hhhh--hhccccce Confidence 3466666652 11111 356788899999999 5677788889999998865555555 2223 33343333 Q ss_pred eeecccCcceechHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccccceeccCC------------- Q lcl|NC_018838. 87 TTRSNTPTNLLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSCGSNINGHD------------- 153 (159) Q Consensus 87 T~~~~~~r~~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsCGa~l~g~~------------- 153 (159) .+.+.. ..-|--..-..|+.||+..---+ |.. .-+-+|-+. ..=--|....|+- T Consensus 70 rvsyev-gsaftdddraslralcgavppvq-asa-egtrqgpfp----------kerpvgprvdgdvlmhldsFVRlRA~ 136 (236) T protein:vir:78 70 RVSYEV-GSAFTDDDRASLRALCGAVPPVQ-ASA-EGTRQGPFP----------KERPVGPRVDGDVLMHLDSFVRQRAA 136 (236) T ss_pred eeeecc-ccccccchhhHHHHHhccCCccc-ccc-cccccCCCc----------cccCCCCccchhhhhhhHHHHHHhhh Confidence 332322 23466666678999998632211 111 111122111 0011233333331 Q ss_pred -----------cccccC Q lcl|NC_018838. 154 -----------GPLWEI 159 (159) Q Consensus 154 -----------~plwe~ 159 (159) -|=|-. T Consensus 137 RkpdPYNpaqt~eDWt~ 153 (236) T protein:vir:78 137 RAPDPYNPDSTVEDWTA 153 (236) T ss_pred cCCCCCCcccCCccccC Confidence 112221 No 22 >protein:vir:105776 Length: 133 # NCBI annotation: gp11 # Family: family:all:10997 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224149;genbank:gi:62362224;genbank:GeneID:3342529 Probab=52.21 E-value=0.44 Score=22.54 Aligned_cols=127 Identities=16% Similarity=0.172 Sum_probs=76.1 Q ss_pred ccchhhcccc-c----CCchHHHHHHHHhHHHHHHHhccccCCCCCchHHHHHHHHHHHHHhhhhcccchhhhhhcccc- Q lcl|NC_018838. 10 ILKPEDIEPF-A----DIPKDKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAKAILRRALLRWNDTGVSGQVQYESAGP- 83 (159) Q Consensus 10 ~it~~Dl~pF-a----~I~e~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAKaILRgAILRW~d~G~SGavt~~tAGp- 83 (159) -||.+|..+| + ++|+--.++++ .++..+-+||+. ..+ ...+|.|.-=|+-+-.....+=-|+||+|=- T Consensus 1 mIT~~qa~~~L~slG~svP~~iL~~~v----~q~nsi~~cLda-gY~-e~tq~LI~lya~~LlA~~~g~R~IsSQ~APSG 74 (133) T protein:vir:10 1 MITTEQAKEYLESVGITLPDFILQAIV----EQANSIQECLDA-HYP-PATALLIQSYLLGLMALGQGDRYISSQTAPNG 74 (133) T ss_pred CCCHHHHHHHHHhcCCcchHHHHHHHH----HHHhhHHHHHhC-CCC-HHHHHHHHHHHHHHHhhccCCceeecccCCcc Confidence 6999999999 3 35666555555 455778999995 443 3457888887777777777566788887622 Q ss_pred eeeeeecccCcceechHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccccceeccCC Q lcl|NC_018838. 84 FAQTTRSNTPTNLLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSCGSNINGHD 153 (159) Q Consensus 84 f~qT~~~~~~r~~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsCGa~l~g~~ 153 (159) =+|+|++.+. +.=|-.--.+|++|=. .|=+ -+.+|.+-..-..+=-+..-++|-| |||. T Consensus 75 ASrSF~Y~~~-~~~~~~l~~~L~~lD~----~gCt--~~Lip~d~~~~a~vG~f~vvggc~c----~~~~ 133 (133) T protein:vir:10 75 ASRSFRYQSF-ADRWKGALSLLRGADK----FRCA--NGLIPPDPTNTAFAGIWIGKGGCMC----NGDK 133 (133) T ss_pred ccccccccCC-CccHHHHHHHHHhhhh----cccc--ccccCCCccccccceeeeecccccc----CCCC Confidence 5678877652 3334444456666533 3322 2344433332222222334456777 4564 No 23 >protein:vir:107702 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003900;genbank:gi:45686316;genbank:GeneID:2773042 Probab=49.39 E-value=0.64 Score=21.65 Aligned_cols=110 Identities=21% Similarity=0.272 Sum_probs=64.1 Q ss_pred ccccc--c-hhh----cccccCCchHHHHHHHHhHHHHHHHhccccCCCCCc-hHHHHHHHHHHHHHhhhhc-------- Q lcl|NC_018838. 7 MGIIL--K-PED----IEPFADIPKDKLEAMIADVEAVAVSVAPCIAKPDFK-YRDAAKAILRRALLRWNDT-------- 70 (159) Q Consensus 7 M~~~i--t-~~D----l~pFa~I~e~~a~amI~da~A~A~~vAPCi~~pdf~-~~~aAKaILRgAILRW~d~-------- 70 (159) |+-.- + .|. -+.|++.|.+..++.|++|-... | ...|. ..+.|-..+--=+| |.|. T Consensus 1 ~~~~~~~~~ve~fR~l~PeF~dvPde~i~~~~d~A~~~v-----~--~~~~Gk~y~~al~lltAHLl-~l~~~~~~~~~~ 72 (136) T protein:vir:10 1 MNQETLIAVVEQMRKLVPALRKVPDETLYAWVEMAELFV-----C--QKTFKDAYVKALALYALHLA-FLDGALKGEDED 72 (136) T ss_pred CCchHHHHHHHHHHHhccccccCCHHHHHHHHHHHHHhh-----c--CCCChhHHHHHHHHHHHHHH-hccccccccccc Confidence 44332 1 121 23578889999999998875554 3 22333 22233334433344 4432 Q ss_pred ---ccchhhhhhcccceeeeeecccCcc-------eechHHHHHHHHHccccccCCceeeeeeccCCCCcc Q lcl|NC_018838. 71 ---GVSGQVQYESAGPFAQTTRSNTPTN-------LLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTH 131 (159) Q Consensus 71 ---G~SGavt~~tAGpf~qT~~~~~~r~-------~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~H 131 (159) ++.|.+++.+.|.++++++.-+..+ .=|=++-..|.|+++ +-|.+-|--.++| | T Consensus 73 ~~~~s~rv~ssat~GevSVS~a~~s~~~s~~WL~~TpyGq~y~aL~k~~~------gGf~l~t~~~~~c-~ 136 (136) T protein:vir:10 73 LESYSRRVTSFSLSGEFSQTFGEVTKNQSGDMMLSTPWGKMFEQLKARRR------GRFALMTGLRGGC-H 136 (136) T ss_pred ccccccceehheeccceeEeeccccCchhhHhhhcCHHHHHHHHHHhhcc------cchhhhhcccccC-C Confidence 2233444578999999997654322 134578888888644 4788888667777 7 No 24 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=45.42 E-value=0.77 Score=21.20 Aligned_cols=105 Identities=15% Similarity=0.100 Sum_probs=60.5 Q ss_pred ecccccchhhcccccCCchHHHHHHHHhHHHHHHHhccccCCCCCchHHHHHHHHHHHHHhhhhc------ccchhhhhh Q lcl|NC_018838. 6 LMGIILKPEDIEPFADIPKDKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAKAILRRALLRWNDT------GVSGQVQYE 79 (159) Q Consensus 6 lM~~~it~~Dl~pFa~I~e~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAKaILRgAILRW~d~------G~SGavt~~ 79 (159) ++.+.-=-++-+-|++.|++..+..+++|-.. +-|+-- -+..+.+...+--=+|.++.. +.+|.++|. T Consensus 1 m~t~~~Fr~~~PeF~~~pd~~i~~~l~~A~~~---l~~~~~---g~~~~~~~~L~~AH~l~l~~~~~~~~g~~~g~v~S~ 74 (119) T protein:vir:52 1 MPLTEDFLLRYTEFGKTDAKRIGLFLSDAQAE---VSKVQW---GKLYDRGVMALTAHLLKLSADAEISGGAANRNLASE 74 (119) T ss_pred CCcHHHHHHhhhhccCCCHHHHHHHHHHHHHh---hCCcCC---chHHHHHHHHHHHHHHHhhhhhhccccccccceeee Confidence 22221114456778899988888887776433 333221 122333333344444544432 345889999 Q ss_pred cccceeeeeecccC---cce-----echHHHHHHHHHccccccCCcee Q lcl|NC_018838. 80 SAGPFAQTTRSNTP---TNL-----LWPSEIAALKKLCEGDGGAGKAF 119 (159) Q Consensus 80 tAGpf~qT~~~~~~---r~~-----f~PsEI~~LQ~lC~~~~~~g~Af 119 (159) |.|..++++++... ..- =|=+|-.+|.++.+. -|... T Consensus 75 s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g~---Gg~Va 119 (119) T protein:vir:52 75 SAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGV---GVMVA 119 (119) T ss_pred eecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHhcC---CCcCC Confidence 99999999976632 122 245788999999885 22222 No 25 >protein:vir:104344 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398973;genbank:gi:81343957;genbank:GeneID:3778876 Probab=42.38 E-value=0.89 Score=20.87 Aligned_cols=112 Identities=17% Similarity=0.267 Sum_probs=58.9 Q ss_pred cccccc---hhhcccccCCchHHHHHHHHhHHHHHHHhccccCCCCCchHHHHHHHHHHHHH------hhhhcccchhhh Q lcl|NC_018838. 7 MGIILK---PEDIEPFADIPKDKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAKAILRRALL------RWNDTGVSGQVQ 77 (159) Q Consensus 7 M~~~it---~~Dl~pFa~I~e~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAKaILRgAIL------RW~d~G~SGavt 77 (159) |+-.|- --=-++|++.|.+..++.|+-|..-. |.+.=.-++ +.|-+..---|+ .=|+.| |.+- T Consensus 1 ~~~~~~e~~R~l~P~f~kvpdevI~~wielA~lfV-----c~~~~g~~~-~~AlaL~taHLm~~dga~k~en~~--~~t~ 72 (132) T protein:vir:10 1 MNDAILAFMRSLVPALKAVDDESINVWIDLARLYV-----CADKFGNDA-DRAVGLYALHLMLSDGAFKGENEG--LETY 72 (132) T ss_pred CchHHHHHHHHhcchhhcCChHHHHHHHHHHHHHH-----HhhcCchhH-HHHHHHHHHHHhhccccccccccc--hhhh Confidence 443221 11235788899999888887665443 655322222 222222211111 113333 3332 Q ss_pred hhc------ccceeeeeecccC-ccee----chHHHHHHHHHccccccCCceeeeeeccCCCCccchhhhhccCCcccc Q lcl|NC_018838. 78 YES------AGPFAQTTRSNTP-TNLL----WPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTHSDVCSTVWGEGCSC 145 (159) Q Consensus 78 ~~t------AGpf~qT~~~~~~-r~~f----~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~Hs~~Cs~~~g~~CsC 145 (159) |+. .|+|++|+++.+. .+-+ |=.=.++|.++ .|+.|.+-|--.+++ |-| T Consensus 73 S~rvaS~Sl~Ge~Sisf~~~sa~~s~L~~tp~Gkl~~~L~k~------~~GgfgL~t~~~~~~-------------cgc 132 (132) T protein:vir:10 73 SRRMASYSLSGEFSITYDNQSAIQGDLSSSSWGRMYKALLRK------KGGGFGLITSAAGGG-------------CGC 132 (132) T ss_pred hhhhhhhcccCceeeecccccccccccccCcHHHHHHHHHHh------ccCccccccccCcCC-------------CCC Confidence 333 5999999987653 2333 66677778773 446887766554333 555 No 26 >protein:vir:107756 Length: 147 # NCBI annotation: gp21 # Family: family:all:664 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024869;genbank:gi:48697511;genbank:GeneID:2948377 Probab=23.63 E-value=2.3 Score=18.62 Aligned_cols=119 Identities=14% Similarity=0.059 Sum_probs=64.4 Q ss_pred cccccchhhc----ccccC---CchHHHHHHHHhHHHHHHH--hccccCCCCCchHHHHHHHHHHHHHhh-----hhccc Q lcl|NC_018838. 7 MGIILKPEDI----EPFAD---IPKDKLEAMIADVEAVAVS--VAPCIAKPDFKYRDAAKAILRRALLRW-----NDTGV 72 (159) Q Consensus 7 M~~~it~~Dl----~pFa~---I~e~~a~amI~da~A~A~~--vAPCi~~pdf~~~~aAKaILRgAILRW-----~d~G~ 72 (159) |.+.|+++|. +-|+| .|++..+..+++|-..-.. -.||+ +.+. .+.+-..|--=+|.- ...|. T Consensus 1 m~v~fd~~~Fr~~fPeFad~~~~pd~~i~~~l~~A~~~l~~~~~~~~~-~g~~--~~~~l~Ll~AHll~l~~~~~~g~g~ 77 (147) T protein:vir:10 1 MDHTLDITKFRALFPEFNNDVKYPDALLEQWYAVAGEYLGLTDYACGL-NGNT--LDLALMQLTAHLMKSATILSSNKGA 77 (147) T ss_pred CceecCHHHHHHhcccccCCccCCHHHHHHHHHHHHHhhccccCCccc-Chhh--HHHHHHHHHHHHHHHHHhhccCCCc Confidence 9999998765 44653 5777777777766443221 13332 2222 222222222222222 23466 Q ss_pred chhhhhhcccceeeeeecccC-cc--eec-----hHHHHHHHHHccccccCCceeeeeeccCCC------Cccch Q lcl|NC_018838. 73 SGQVQYESAGPFAQTTRSNTP-TN--LLW-----PSEIAALKKLCEGDGGAGKAFTITPTMNSR------FTHSD 133 (159) Q Consensus 73 SGavt~~tAGpf~qT~~~~~~-r~--~f~-----PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~------~~Hs~ 133 (159) +|.|+|.|.|.-++++++... .+ -+| =+|--+|.+.++. +.+-+--.|... ....| T Consensus 78 ~G~v~Sas~G~VSVSy~~~~~~~~~~~w~~~T~YGq~y~~l~~~~~~-----Gg~vvgG~p~r~a~r~vgg~f~~ 147 (147) T protein:vir:10 78 PMVMTSATIDKVSISTLAPPIKNGWQYWLSTTPYGQMLWALLSMRSS-----GGFVYGGSPELSGYRRIGGVFKP 147 (147) T ss_pred ccceeeeeecceeeeeecCCCCCcchhhhhcCHHHHHHHHHHHhhCc-----cceecCCCCccccccccCceeCC Confidence 889999999999999987632 22 233 3688999999985 122222223111 11111 No 27 >protein:vir:103283 Length: 125 # NCBI annotation: hypothetical protein # Family: family:all:5122 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277463;genbank:gi:71834105;genbank:GeneID:3562394 Probab=22.99 E-value=2.4 Score=18.53 Aligned_cols=109 Identities=23% Similarity=0.255 Sum_probs=65.4 Q ss_pred CcceeecccccchhhcccccCCchHHHHHHHHhHHHHHHHhccccCCCCCchHHHHH------HHHHHHHHhhhhcc--c Q lcl|NC_018838. 1 MQGVVLMGIILKPEDIEPFADIPKDKLEAMIADVEAVAVSVAPCIAKPDFKYRDAAK------AILRRALLRWNDTG--V 72 (159) Q Consensus 1 ~~~~~lM~~~it~~Dl~pFa~I~e~~a~amI~da~A~A~~vAPCi~~pdf~~~~aAK------aILRgAILRW~d~G--~ 72 (159) |.-+ -++|++.|.+..++.|+.|-.-. |.+.-..++-.|.- .-+-+|.--=+|.+ . T Consensus 1 mR~l-----------~P~f~~vpdevi~~wid~A~lFV-----C~~~fg~~~~~Al~lytlHLm~~dga~k~e~~~~~~~ 64 (125) T protein:vir:10 1 MRTL-----------YPPLKSQPDDVLNAWIEVAKLFI-----CLDKFGDKQVQALAFYTLHLLSQDIALKTENDSSQTS 64 (125) T ss_pred Cccc-----------cchhhccCHHHHHHHHHHHHHHH-----HHhhhhhHHHHHHHHHHHHHHhccccccccccccccc Confidence 3322 25788899999999988765544 87744333332210 11223333333322 3 Q ss_pred chhhhhhc-ccceeeeeecccCc-------ceechHHHHHHHHHccccccCCceeeeeeccCCCCcc Q lcl|NC_018838. 73 SGQVQYES-AGPFAQTTRSNTPT-------NLLWPSEIAALKKLCEGDGGAGKAFTITPTMNSRFTH 131 (159) Q Consensus 73 SGavt~~t-AGpf~qT~~~~~~r-------~~f~PsEI~~LQ~lC~~~~~~g~Aftidt~p~~~~~H 131 (159) ||-+++.+ .|+|++|+++.+.- ..=|=.-.++|.++ .|+-|.+-|--.+++.. T Consensus 65 s~r~~s~slsGE~Sit~~~~s~d~s~~~L~~T~wGk~~~~L~k~------~~GgFaL~T~~~~~~cr 125 (125) T protein:vir:10 65 SERVKSYSLSGEYTISYDTSTAAASSSNLEESSWGKLYIDLMRL------KVGRWGLITSGGSRCCR 125 (125) T ss_pred ccceeeeeeccceEeecccccccccccccccCchHHHHHHHHHh------cCCceeeeccccccCCC Confidence 56777655 99999999776531 12466667788773 45689998877666533 Done!