Query         lcl|NC_019722.1_cdsid_YP_007112486.1 [gene=F861_gp32] [protein=hypothetical protein] [protein_id=YP_007112486.1] [location=7785..8333]
Match_columns 182
No_of_seqs    75 out of 80
Neff          6.4
Searched_HMMs 1612
Date          Thu Nov 7 16:40:14 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_10 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_10_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 protein:vir:10327 Length: 182  100.0 1.3E-73 7.8E-77  420.2  20.8  182    1-182     1-182 (182)
  2 protein:vir:96764 Length: 177  100.0 1.2E-62 7.5E-66  360.0  19.0  175    1-179     1-177 (177)
  3 protein:vir:1994 Length: 182 #  99.5 1.1E-15 6.6E-19  102.7  13.7  165    5-182     1-175 (182)
  4 protein:vir:4515 Length: 186 #  97.0 1.8E-05 1.1E-08   46.6   9.3  156    8-182     1-174 (186)
  5 protein:vir:488 Length: 187 #   96.9 1.9E-05 1.2E-08   46.4   9.2  156    8-182     1-174 (187)
  6 protein:vir:4461 Length: 186 #  96.8 3.1E-05 1.9E-08   45.3   9.5  156    8-182     1-174 (186)
  7 protein:vir:79247 Length: 157   96.8 0.00013 8.3E-08   41.8  13.0  141    1-147     1-157 (157)
  8 protein:vir:103883 Length: 159  96.7 0.00012 7.7E-08   42.0  11.8  140    1-152     3-159 (159)
  9 protein:vir:99226 Length: 157   96.6 0.00018 1.1E-07   41.1  12.6  143    1-147     1-157 (157)
 10 protein:vir:99874 Length: 154   96.4 0.00032   2E-07   39.8  12.5  135    1-144     1-154 (154)
 11 protein:vir:107857 Length: 154  84.9   0.056 3.5E-05   27.4  11.8  148    1-165     1-154 (154)
 12 protein:vir:79065 Length: 154   81.9   0.082 5.1E-05   26.5  12.2  148    1-165     1-154 (154)
 13 protein:vir:108220 Length: 133  73.5    0.17 0.00011   24.8  10.4  129    1-140     1-133 (133)
 14 protein:vir:5979 Length: 134 #  57.3    0.44 0.00027   22.6  11.2  131    1-164     1-134 (134)
 15 protein:vir:105468 Length: 135  50.5    0.61 0.00038   21.8  12.7  131    7-166     1-135 (135)
 16 protein:vir:96125 Length: 140   46.8    0.72 0.00045   21.4  11.1  131    1-144     3-140 (140)
 17 protein:vir:94061 Length: 175   31.4     1.5 0.00093   19.6  11.6  149    1-175     1-175 (175)
 18 protein:vir:1244 Length: 145 #  29.7 
1.6 0.001 19.4 11.7 136 1-168 1-145 (145) 19 protein:vir:94096 Length: 141 29.6 1.6 0.001 19.4 12.5 135 1-173 1-141 (141) 20 protein:vir:105892 Length: 141 29.6 1.6 0.001 19.4 12.5 135 1-173 1-141 (141) 21 protein:vir:96260 Length: 141 29.6 1.6 0.001 19.4 12.5 135 1-173 1-141 (141) 22 protein:vir:96894 Length: 140 24.6 2.2 0.0013 18.7 12.3 131 1-144 1-140 (140) No 1 >protein:vir:10327 Length: 182 # NCBI annotation: ORF29 # Family: family:all:1090 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758922;genbank:gi:27311196;genbank:GeneID:956141 Probab=100.00 E-value=1.3e-73 Score=420.24 Aligned_cols=182 Identities=100% Similarity=1.547 Sum_probs=180.5 Q ss_pred CCcccHHHHHHHHHHHHHHhcCcceeeecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCcCcH Q lcl|NC_019722. 1 MSQTTITEVHEAIKAKLRETFPKVTVDDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVKSLA 80 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~~~~ 80 (182) ||+++|++||+||+++||++||+|++|++||++.+++.+||||+||++|++++|++|||+++++||+||||+++++++++ T Consensus 1 mt~~~l~~lh~AI~~~Lk~~~p~l~~~~~y~~~~~~i~~PAv~vel~~~~~~~d~~tGq~~~~~~~~a~~vv~~~~~~~~ 80 (182) T protein:vir:10 1 MSQTTITEVHEAIKAKLRETFPKVTVDDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVKSLA 80 (182) T ss_pred CCcCCHHHHHHHHHHHHHHhcCCceeeecCccccCccccceeeeeeecCCcCCCCCCCcEEEEEEEEEEEEecccCCCch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCccccCCCCCccEEEEEEc Q lcl|NC_019722. 
81 LELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQEIYLAYA 160 (182) Q Consensus 81 ~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~w~~~~~~p~~v~lg~~ 160 (182) +++|+||++|++++++|+|||++++|++|+||+|+|++|+|+++|||.+|+|||+|+||||+++|++++++|++|||||+ T Consensus 81 ~~~~~lAa~l~~~v~~~~wGL~~~~v~~a~~i~a~p~~f~~~~~dgy~vW~VeW~Q~i~LG~s~w~~~g~~p~~v~lg~~ 160 (182) T protein:vir:10 81 LELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQEIYLAYA 160 (182) T ss_pred HHHHHHHHHHHHHHhcCcccCCccccCccceeeeccCccChhhcCceEEEEEEEEEEEeeCCcccCCCCCCCceEEEEEc Confidence 99999999999999999999999999999999999999999998999999999999999999999999999999999999 Q ss_pred CCCCCCCcchhhhhhhhcCCCC Q lcl|NC_019722. 161 PGGDVPPADEHEKAEYAGAPNS 182 (182) Q Consensus 161 P~~d~Gp~~e~dY~~~~~~p~~ 182 (182) |++||||+||+||.+++++||| T Consensus 161 P~~~~g~~~e~~y~~~~~~~~~ 182 (182) T protein:vir:10 161 PGGDVPPADEHEKAEYAGAPNS 182 (182) T ss_pred CCCCCCCCCcccchhccCCCCC Confidence 9999999999999999999999 No 2 >protein:vir:96764 Length: 177 # NCBI annotation: putative phage-related protein # Family: family:all:1090 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039825;genbank:gi:126010857;genbank:GeneID:5076274 Probab=100.00 E-value=1.2e-62 Score=360.05 Aligned_cols=175 Identities=22% Similarity=0.372 Sum_probs=165.0 Q ss_pred CCccc-HHHHHHHHHHHHHHhcCcceeeeccccc-ccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCcC Q lcl|NC_019722. 1 MSQTT-ITEVHEAIKAKLRETFPKVTVDDYNPEP-ELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVKS 78 (182) Q Consensus 1 Ms~~~-l~~l~~AI~~~l~~~~P~l~~v~~~~~~-~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~~ 78 (182) ||.+. .++||+||+++||++||++++|+.|++. 
+..+.+||||+||+++++++|+||||+++++||+||||+++++++ T Consensus 1 ~~~l~~~s~lh~AI~~~l~~~~P~l~tV~~y~~~~~~~~~tPAv~iel~~~~~~~d~g~G~~~~~~r~~a~vvv~~~~~~ 80 (177) T protein:vir:96 1 MVTLKQPSDLYDAIQAELESRLADEVTVASYADFGDVQVVDAMVLIEFEQTSPATRGHDGRYCHQYDITLHAVVGRQRQR 80 (177) T ss_pred CCccchhHHHHHHHHHHHHHhCccceeeccccccccccccCceeEEeeccCCcccCCCCCceEEEEEEEEEEEeCCCCCC Confidence 99943 3569999999999999999999999975 456789999999999999999999999999999999999999999 Q ss_pred cHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCccccCCCCCccEEEEE Q lcl|NC_019722. 79 LALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQEIYLA 158 (182) Q Consensus 79 ~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~w~~~~~~p~~v~lg 158 (182) +++++|+||++|+++|++|||||++++|++|+||+++||.|+|++ |||.+|+|||+|+||||+++|++++. +++||+| T Consensus 81 ~~l~a~~lAa~l~~~v~~~~wGLp~~~v~~~~~i~a~pd~f~p~l-dgy~vW~Vew~Q~i~LG~s~w~~~~~-~~~v~~g 158 (177) T protein:vir:96 81 AELEAINLAAAIERVTDENLWGLPYQQVDRPENIRSAPSMFKVGS-DGYDAWGVSFRQRIYLGASLLDDDPV-VREVWMV 158 (177) T ss_pred hHHHHHHHHHHHHHHHhcccccCCccccccceeeecccccccccc-CceeEEEEEEEEEEecCCCccCCCCC-CceEEEE Confidence 999999999999999999999999899999999999999999987 89999999999999999999997755 6999999 Q ss_pred EcCCCCCCCcchhhhhhhhcC Q lcl|NC_019722. 159 YAPGGDVPPADEHEKAEYAGA 179 (182) Q Consensus 159 ~~P~~d~Gp~~e~dY~~~~~~ 179 (182) ++|+ |||+||++|.+++++ T Consensus 159 ~~P~--~g~~~e~~Y~~~~~~ 177 (177) T protein:vir:96 159 TTPP--ADADDESQYERVPDA 177 (177) T ss_pred ecCC--CCCCCcccccCCCCC Confidence 9995 999999999999888 No 3 >protein:vir:1994 Length: 182 # NCBI annotation: Hypothetical protein # Family: family:all:1387 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050641;genbank:gi:9633528;genbank:GeneID:2636286 Probab=99.47 E-value=1.1e-15 Score=102.65 Aligned_cols=165 Identities=14% Similarity=0.044 Sum_probs=114.0 Q ss_pred cHHHHHHHHHHHHHHhc-Ccceeeecccccc-----cc--cccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcC Q lcl|NC_019722. 
5 TITEVHEAIKAKLRETF-PKVTVDDYNPEPE-----LS--VLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEV 76 (182) Q Consensus 5 ~l~~l~~AI~~~l~~~~-P~l~~v~~~~~~~-----~~--~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~ 76 (182) =|++.-+||.+.||+.+ |.|++|+.|+..- ++ ...|||++++.+.. .+..|..+..||.++|++..-+ T Consensus 1 mI~~iEdAi~~rl~~~~g~~v~~V~sy~Gefd~e~l~~~~~~~PAv~Va~~G~~----~~~~r~~~~~r~~v~V~a~~~~ 76 (182) T protein:vir:19 1 MLEETEAALLARVRELFGATLRQVEPLTGTWTNEDVHRLFLAPPSVFLAWMGCG----EGRTRREVESRWAFFVVAELLN 76 (182) T ss_pred ChHHHHHHHHHHHHHHhhhhhhhhccCCCCCChhhhhHhhhcCceeEEEecccc----CcCCceeeeeEEEEEEEecCCC Confidence 36688999999999998 9999999987533 22 36699999998754 4557888999999999886644 Q ss_pred cCcH--HHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCccccCCCCCccE Q lcl|NC_019722. 77 KSLA--LELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQE 154 (182) Q Consensus 77 ~~~~--~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~w~~~~~~p~~ 154 (182) .... .-+-.|..+|..+|.+|++|+ +++.+..++.+.++.+...+|+++|.|+|+|+..|. ++.+.+..+.=. T Consensus 77 g~~~~rvG~y~lv~~v~~lL~~q~~g~----~~~l~p~~vrnL~s~~~~~~gvsvyavef~~~~~lp-~~~d~~~l~df~ 151 (182) T protein:vir:19 77 GEPVNRPGIYQIVERLIAGVNGQTFGP----TTGMRLTQVRNLCDDNRINAGVVLYGVLFSGTTPLP-SVVDLDSLDDYE 151 (182) T ss_pred ChhhhhhhHHHHHHHHHHHHhccCCCC----ccccccceeeeeechhhhhCceEEEEEEeeccccCC-CcCCCCCCcchh Confidence 3322 334479999999999999995 677888899999999988899999999999998773 333333333322 Q ss_pred EEEEEcCCCCCCCcchhhhhhhhcCCCC Q lcl|NC_019722. 
155 IYLAYAPGGDVPPADEHEKAEYAGAPNS 182 (182) Q Consensus 155 v~lg~~P~~d~Gp~~e~dY~~~~~~p~~ 182 (182) .+-+--|.-|.-|.-|+- -.+|-| T Consensus 152 ~~~~~~~~pdg~p~~e~~----i~l~q~ 175 (182) T protein:vir:19 152 RHWQTWKFPDETPEFAAH----INVNQE 175 (182) T ss_pred hcceeecCCCcCcchhhc----ccCCCc Confidence 222222221111222211 112222 No 4 >protein:vir:4515 Length: 186 # NCBI annotation: unknown # Family: family:all:964 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599041;genbank:gi:19548999;genbank:GeneID:935225 Probab=96.96 E-value=1.8e-05 Score=46.62 Aligned_cols=156 Identities=10% Similarity=-0.061 Sum_probs=95.8 Q ss_pred HHHHHHHHHHHHhcCcce--eeecccc----cccccccceeeeeeecccccCCcCCC--ceEEEEEEEEEEEEcCc-CcC Q lcl|NC_019722. 8 EVHEAIKAKLRETFPKVT--VDDYNPE----PELSVLAPALLLELEEFPMGADVGDD--RYPAACRFSVHCVLGWE-VKS 78 (182) Q Consensus 8 ~l~~AI~~~l~~~~P~l~--~v~~~~~----~~~~~~~PAv~i~~~~~~~~~d~gtG--~~~~~~~~~a~ivv~~~-~~~ 78 (182) .-.++|+++||++.|.++ +...-++ ....+++||+++=..+...+.....| +..+..+|.+-+++... +.. T Consensus 1 Mkl~~Ii~RLra~vP~l~grV~gaad~a~l~~~~~lp~PaAyVip~~d~~~~~~sq~~~~Q~i~e~f~Vvl~vrn~~d~~ 80 (186) T protein:vir:45 1 MKLTPVIAALRARCPYFENRVAGAAQFKNLPEVGKLRLPAAYVVPGDDSPGENKSQTDYWQELKEGFSVVVILSNGRDER 80 (186) T ss_pred CChHHHHHHHHHhcchhhchhhhhhhhhhhHhhcCCCCceEEEEecccccCCCccccceeeeeeeEEEEEEEEeccCCCC Confidence 346789999999999987 3333222 23557899999988887766433334 34667788888888642 211 Q ss_pred ---cHH-HHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCcc-ccC---C-C Q lcl|NC_019722. 79 ---LAL-ELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESM-WNA---D-G 149 (182) Q Consensus 79 ---~~~-~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~-w~~---~-~ 149 (182) +.+ .++.+-.+|...|-| |. |+..+++-++..++-..+ .+|..+|..+|+=...||.+. |.. . 
- T Consensus 81 G~~aa~D~l~~lr~~v~~AL~G--W~-P~~~~~pi~~~gG~lvd~----~~g~l~y~~~F~~~~~l~~~~t~~~~~l~~l 153 (186) T protein:vir:45 81 GQFASYDVVDDVRQMLFKALLG--WN-PEACGNPITYDGGTLLDL----NRHELIYQFDFSVISELTEDDTRQQDELNSL 153 (186) T ss_pred CcccchhHHHHHHHHHHHHHhC--cc-cCCCCceEEEcCceEEee----cCcEEEEEEEEEEeeccCCCcccchHHHhCC Confidence 222 355555666555554 54 334577666666555544 368999999999999999763 211 1 1 Q ss_pred CCccEEEEEEcCCCCCCCcchhhhhhhhcCCCC Q lcl|NC_019722. 150 ITPQEIYLAYAPGGDVPPADEHEKAEYAGAPNS 182 (182) Q Consensus 150 ~~p~~v~lg~~P~~d~Gp~~e~dY~~~~~~p~~ 182 (182) ...++|.+-. ||..|.+-||- T Consensus 154 ~~l~~~~~~~------------d~~dp~~gpdg 174 (186) T protein:vir:45 154 DELRTLAIDV------------DYLDPGNGPDG 174 (186) T ss_pred CCcceEEEEE------------eeccCCCCCCC Confidence 2234555433 34444444554 No 5 >protein:vir:488 Length: 187 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543095;swissprot:trembl:q8w624;genbank:gi:18249907;uniprot:Q8W624;genbank:GeneID:929697 Probab=96.92 E-value=1.9e-05 Score=46.45 Aligned_cols=156 Identities=13% Similarity=-0.001 Sum_probs=96.3 Q ss_pred HHHHHHHHHHHHhcCcce--eeecccc----cccccccceeeeeeecccccCCcCCCce--EEEEEEEEEEEEcCc-CcC Q lcl|NC_019722. 8 EVHEAIKAKLRETFPKVT--VDDYNPE----PELSVLAPALLLELEEFPMGADVGDDRY--PAACRFSVHCVLGWE-VKS 78 (182) Q Consensus 8 ~l~~AI~~~l~~~~P~l~--~v~~~~~----~~~~~~~PAv~i~~~~~~~~~d~gtG~~--~~~~~~~a~ivv~~~-~~~ 78 (182) .-.+.|+++||++.|.++ +...-.+ ....+++||+++=..+...+...+.|.+ .+.-+|.+-+++... +.. 
T Consensus 1 Mkl~~Ii~rLra~vP~l~grV~gaad~aal~~~~~lp~PaAyVlp~~d~~~~~~sq~~~~Q~i~e~f~Vvl~vrn~~D~~ 80 (187) T protein:vir:48 1 MKLTTIIAALRERCPRFEDRVGGAAQFKAIPDAGKLRLPAAYVVPSDDAPGEQKSQTDYWQDLTEGFSVIVVLSNERDEK 80 (187) T ss_pred CchhHHHHHHHHhcchhhhhhhhhhhhhhhhhhcCCCCceEEEEeccccCCCCCCCcceeeeeeeEEEEEEEEeccCCCC Confidence 345789999999999987 3332222 2355789999998888777655555654 566778888887543 222 Q ss_pred ---cHH-HHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCcc-ccC----CC Q lcl|NC_019722. 79 ---LAL-ELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESM-WNA----DG 149 (182) Q Consensus 79 ---~~~-~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~-w~~----~~ 149 (182) +.+ .+..+-.+|...|-| |. |+..+.+-++..++-..+ .+|..+|..+|+=...||.+. |.. .- T Consensus 81 G~~~a~D~l~~lr~~v~~AL~G--W~-P~~~~~pi~~~gG~lvd~----~~g~l~y~~~F~~~~ql~~~~t~~~~~~~~l 153 (187) T protein:vir:48 81 GQWAAYDAVHDVRRELWKALLG--WM-PDPQGGEIVYAGGTLLDL----NRYELYYQFDFTAKYEITEEDTRQAEDVNAL 153 (187) T ss_pred CcchhhHHHHHHHHHHHHHHhC--cC-cCCCCceEEEcCceEeee----cCcEEEEEEEEEeecccCCCCCccHHHHhCC Confidence 122 344555565555554 54 335577666666665554 268999999999999999763 211 11 Q ss_pred CCccEEEEEEcCCCCCCCcchhhhhhhhcCCCC Q lcl|NC_019722. 150 ITPQEIYLAYAPGGDVPPADEHEKAEYAGAPNS 182 (182) Q Consensus 150 ~~p~~v~lg~~P~~d~Gp~~e~dY~~~~~~p~~ 182 (182) ...++|.+-.+ |..|.-=||+ T Consensus 154 ~~l~~~~~~~d------------~~dp~~~pd~ 174 (187) T protein:vir:48 154 PDLSLLSIDVD------------YIDPGTGPDG 174 (187) T ss_pred CCcceEEEEEe------------ccCCCCCCCC Confidence 22455655543 2222233444 No 6 >protein:vir:4461 Length: 186 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700384;genbank:gi:23505456;genbank:GeneID:955663 Probab=96.81 E-value=3.1e-05 Score=45.29 Aligned_cols=156 Identities=13% Similarity=0.019 Sum_probs=94.1 Q ss_pred HHHHHHHHHHHHhcCcce--eeecccc----cccccccceeeeeeecccccCCcCCCce--EEEEEEEEEEEEcCc-Cc- Q lcl|NC_019722. 
8 EVHEAIKAKLRETFPKVT--VDDYNPE----PELSVLAPALLLELEEFPMGADVGDDRY--PAACRFSVHCVLGWE-VK- 77 (182) Q Consensus 8 ~l~~AI~~~l~~~~P~l~--~v~~~~~----~~~~~~~PAv~i~~~~~~~~~d~gtG~~--~~~~~~~a~ivv~~~-~~- 77 (182) .-.++|.++||++.|.++ +...-.+ ....+++||+++=..+...+...++|.+ .+..+|++-+++... +. T Consensus 1 mkl~~Vi~RLra~vP~l~~rV~gaad~aai~~~~~lp~PaAyVip~~d~~g~~~s~g~~~Q~i~~~f~Vvl~vrn~~d~~ 80 (186) T protein:vir:44 1 MKLTPIIAALRSRCPRFENRVGGAAQFKAIPEAGKLRLPAAYVVPAEDVTGEQKSQTDYWQDLTEGFSVIVVLSNERDEK 80 (186) T ss_pred CChhHHHHHHHHhcchhhhhhhhhhhhhhhhhhcCCCCceEEEEeccccCCCCCcccceeEeeeeeEEEEEEEeccCCCC Confidence 235788999999999986 3322222 2345789999998888777654455644 667778888887432 21 Q ss_pred --CcHH-HHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCcc-ccC----CC Q lcl|NC_019722. 78 --SLAL-ELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESM-WNA----DG 149 (182) Q Consensus 78 --~~~~-~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~-w~~----~~ 149 (182) .+.+ .+..+-.++...|-| |. |+..+++-++..+.-..+ ++|..+|.-+|+=...||.+. |.. .- T Consensus 81 G~~aa~D~l~~lr~~v~~AL~G--W~-P~~~~~pi~~~gG~lvd~----~~g~l~y~~~F~~~~~l~~~~t~~~~~~~~l 153 (186) T protein:vir:44 81 GQWASYDAVHDVRQEIWKALLG--WE-PDSQVHEIQYAGGMLLDL----NRHELYYQFDFTVKYEITETDTRQQDDLDGL 153 (186) T ss_pred CCccchHHHHHHHHHHHHHHcC--cC-cCCCCceEEEcCceEEee----cCcEEEEEEEEEEeeccCCCCccchHHHhCC Confidence 1222 244555555555554 43 324577666766555544 368999999999999999764 311 11 Q ss_pred CCccEEEEEEcCCCCCCCcchhhhhhhhcCCCC Q lcl|NC_019722. 
150 ITPQEIYLAYAPGGDVPPADEHEKAEYAGAPNS 182 (182) Q Consensus 150 ~~p~~v~lg~~P~~d~Gp~~e~dY~~~~~~p~~ 182 (182) ...++|.+-.+ |..|.-=||+ T Consensus 154 ~~l~~~~~~~d------------~~~p~~g~dg 174 (186) T protein:vir:44 154 PDLKTLSIDVD------------FIEPGTGPDG 174 (186) T ss_pred CCcceEEEEEe------------cccCCCCCCC Confidence 22456665432 2222233444 No 7 >protein:vir:79247 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469166;genbank:gi:157835008;genbank:GeneID:5648828 Probab=96.80 E-value=0.00013 Score=41.81 Aligned_cols=141 Identities=13% Similarity=0.136 Sum_probs=85.5 Q ss_pred CCc-ccHHHHHHHHHHHHHHhcCcceeeecccc----cccccccceeeeeeecccccCC----cCCCc-eEEEEEEEEEE Q lcl|NC_019722. 1 MSQ-TTITEVHEAIKAKLRETFPKVTVDDYNPE----PELSVLAPALLLELEEFPMGAD----VGDDR-YPAACRFSVHC 70 (182) Q Consensus 1 Ms~-~~l~~l~~AI~~~l~~~~P~l~~v~~~~~----~~~~~~~PAv~i~~~~~~~~~d----~gtG~-~~~~~~~~a~i 70 (182) ||+ +|+-+...+|+++||++.|+++.|..-.+ ....-.+||+++=..+..++.. .++|+ ..++.+|.+=+ T Consensus 1 ~~~~~d~~a~~~~IierLka~v~~l~~V~~aadla~i~e~~q~tPaayVv~~gd~~~~~~~~~~~~~~~Q~vtq~f~Vvl 80 (157) T protein:vir:79 1 MSDPFDYLFLEPLLIERIRSEVPGLAIVSGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADYQGGRRAIQAIGQQWAVVL 80 (157) T ss_pred CCCchhhhhhhHHHHHHHHhhhhhhhhhccccchhhhhhhcCCCcEEEEEecccccCCCcccccCcceeeeeeeeEEEEE Confidence 999 79999999999999999999987754321 1122357999998888766433 23344 36888999988 Q ss_pred EEcCcCcC-----cHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeC-Ccc Q lcl|NC_019722. 71 VLGWEVKS-----LALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLG-ESM 144 (182) Q Consensus 71 vv~~~~~~-----~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG-~~~ 144 (182) ++...+.. ...++..+=.+|...|.| | -| +....|-...+.+ .++.-.+||....+.|+=.+..- -.. 
T Consensus 81 avrn~~~~~~~~a~~d~ag~ll~~v~~AL~G--W-~P-~~~~~pl~~~~~~--~~~~y~~gf~yypl~F~~~~~~~~~~~ 154 (157) T protein:vir:79 81 VVHYADSSNSGEGARREAGPLLGRLVKALTG--W-AP-AIDVAPLARSARQ--SPVTYASGYFYFPLVFTARFVYPRVKS 154 (157) T ss_pred EEeccccccccchhHHHHHHHHHHHHHHhcC--c-cc-cccCCceeeeecC--CcccccCCeEEEEEEEEEeeecccccc Confidence 88664432 112344444444444433 4 23 3344444443322 23334578988888888766431 112 Q ss_pred ccC Q lcl|NC_019722. 145 WNA 147 (182) Q Consensus 145 w~~ 147 (182) |.. T Consensus 155 ~~~ 157 (157) T protein:vir:79 155 WKP 157 (157) T ss_pred CCC Confidence 321 No 8 >protein:vir:103883 Length: 159 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938246;genbank:gi:38229151;genbank:GeneID:2648198 Probab=96.66 E-value=0.00012 Score=42.00 Aligned_cols=140 Identities=10% Similarity=0.102 Sum_probs=89.3 Q ss_pred CCc-ccHHHHHHHHHHHHHHhcCcceeeecccc---cc--cccccceeeeeeeccccc----CCcCCCc-eEEEEEEEEE Q lcl|NC_019722. 1 MSQ-TTITEVHEAIKAKLRETFPKVTVDDYNPE---PE--LSVLAPALLLELEEFPMG----ADVGDDR-YPAACRFSVH 69 (182) Q Consensus 1 Ms~-~~l~~l~~AI~~~l~~~~P~l~~v~~~~~---~~--~~~~~PAv~i~~~~~~~~----~d~gtG~-~~~~~~~~a~ 69 (182) |++ +|+-++.++|++.||++.|+++.|..--+ .. .+ .+||++|-..+..+. +..++|+ ..+..+|.+= T Consensus 3 ~~~~~n~lav~~~IieRLka~v~~lr~V~~aadla~i~el~q-~tPaayV~~~g~~~~~~~~~~~~~~~~q~v~q~w~Vv 81 (159) T protein:vir:10 3 TAEPFDYLFLETLLVERIRAEVPGLQDVSGVPDLATLDEQRQ-GSPCVYVVYLGDEIGTGASHQGGSRAIQTVTQHWAAV 81 (159) T ss_pred cccchhhhhhhHHHHHHHHhhhhHHHhhhcccchHHHHhhhC-CCcEEEEEecccccCCCcccccccceeeeeeeEEEEE Confidence 555 99999999999999999999987643321 11 13 579999999998763 3345565 4788899998 Q ss_pred EEEcCcCc----CcH-HHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCC-c Q lcl|NC_019722. 70 CVLGWEVK----SLA-LELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGE-S 143 (182) Q Consensus 70 ivv~~~~~----~~~-~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~-~ 143 (182) +++..... .+. 
.++..|-.++...|.| |- |++.. .| +.++ +...++.-.+||...-+.|+=.+.+-. . T Consensus 82 lavr~~~~q~~~~a~~d~aG~ll~~v~~AL~G--W~-P~~~~-~P-l~r~-~~~~~~~y~~gfayyPl~F~~~~~~~~~~ 155 (159) T protein:vir:10 82 LTLYYADAQGDGQGARREAGPLLGRLLKALTG--WV-PDQGV-TP-LARS-PQASPVSYSNGFFYFPLVFTANFVFPRLK 155 (159) T ss_pred EEEecccccCccchhhHHHHHHHHHHHHHhcC--cc-cCCcC-CC-eeec-ccCCCccccCCEEEeeeeEEeeeeccccc Confidence 88865332 222 3466777777777776 43 32333 34 3333 222234444789999999988776421 1 Q ss_pred cccCCCCCc Q lcl|NC_019722. 144 MWNADGITP 152 (182) Q Consensus 144 ~w~~~~~~p 152 (182) .|. | T Consensus 156 ~~~-----~ 159 (159) T protein:vir:10 156 SWK-----P 159 (159) T ss_pred cCC-----C Confidence 121 1 No 9 >protein:vir:99226 Length: 157 # NCBI annotation: hypothetical protein # Family: family:all:2406 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950461;genbank:gi:119953662;genbank:GeneID:4643086 Probab=96.64 E-value=0.00018 Score=41.08 Aligned_cols=143 Identities=13% Similarity=0.127 Sum_probs=85.6 Q ss_pred CCc-ccHHHHHHHHHHHHHHhcCcceeeecccc---c-ccccccceeeeeeecccccC----CcCCCc-eEEEEEEEEEE Q lcl|NC_019722. 1 MSQ-TTITEVHEAIKAKLRETFPKVTVDDYNPE---P-ELSVLAPALLLELEEFPMGA----DVGDDR-YPAACRFSVHC 70 (182) Q Consensus 1 Ms~-~~l~~l~~AI~~~l~~~~P~l~~v~~~~~---~-~~~~~~PAv~i~~~~~~~~~----d~gtG~-~~~~~~~~a~i 70 (182) ||+ +|+-++.++|+++||++.|+++.|..--+ . ...-.+|++++=..+..++. ..+.|. ..+..+|.+=+ T Consensus 1 ~~~~~d~~a~~~~IierLka~vp~l~~V~~aadla~i~~~~q~tPaayVi~~gd~~~~~~~~~~~~~~~Q~i~q~~~Vvl 80 (157) T protein:vir:99 1 MSDPFDYLFLEPLLIERIRSEVPGLAIVSGVPDLAALSEQDQPAPSVYVVYLGDEIGTGADHQGGRRAIQAIGQQWAVVL 80 (157) T ss_pred CCCchhhhhhhHHHHHHHHhhhhHHHhhhcccchHHHhhccCCCcEEEEEecccccCCCcccccccceeeeeeeeEEEEE Confidence 999 79999999999999999999887644321 1 12235699999988887643 223343 46888999988 Q ss_pred EEcCcCcC-cHHHHHHHHHHHHHHhccc--ccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEee-CCcccc Q lcl|NC_019722. 
71 VLGWEVKS-LALELWEFSAAVAQLIRKS--GVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYL-GESMWN 146 (182) Q Consensus 71 vv~~~~~~-~~~~a~~lAa~l~~ll~~q--~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~L-G~~~w~ 146 (182) ++...... -.-.+.+.|-.|..-|... .|. | +....|-...+.+ .++.-.+||....+.|+=.+.. --..|. T Consensus 81 avr~~~~~~~g~~a~d~ag~ll~~v~~AL~GW~-P-~~~~~pl~~~~~~--~~~~y~~gf~yypl~F~~~~~~~~~~~~~ 156 (157) T protein:vir:99 81 VVHYADSSNSGEGARREAGPLLGRLVKALTGWA-P-AIDVAPLARSARQ--SPVTYASGYFYFPLVFTARFVYPRVKSWK 156 (157) T ss_pred EEeccccccccchhHHHHHHHHHHHHHHhcCCc-C-cccCCceeeeecC--CcccccCceEEEEEEEEEeeeccccccCC Confidence 88754432 1233444444444444433 352 3 3333343332222 2233457899999999866543 112232 Q ss_pred C Q lcl|NC_019722. 147 A 147 (182) Q Consensus 147 ~ 147 (182) . T Consensus 157 ~ 157 (157) T protein:vir:99 157 P 157 (157) T ss_pred C Confidence 2 No 10 >protein:vir:99874 Length: 154 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164078;genbank:gi:56692610;genbank:GeneID:3192602 Probab=96.40 E-value=0.00032 Score=39.77 Aligned_cols=135 Identities=10% Similarity=0.033 Sum_probs=84.6 Q ss_pred CCc-----ccHHHHHHHHHHHHHHhcCcceeeeccc-c----cccccccceeeeeeecccccCC---cCCCc--eEEEEE Q lcl|NC_019722. 1 MSQ-----TTITEVHEAIKAKLRETFPKVTVDDYNP-E----PELSVLAPALLLELEEFPMGAD---VGDDR--YPAACR 65 (182) Q Consensus 1 Ms~-----~~l~~l~~AI~~~l~~~~P~l~~v~~~~-~----~~~~~~~PAv~i~~~~~~~~~d---~gtG~--~~~~~~ 65 (182) |.+ ++| .-|+++||++.|.++.++.-. + ....+++||+++=..+-..+.. .++|. 
..+..+ T Consensus 1 ~~~~~~~pfdl----~~Vi~RLra~~p~l~~V~gaadlAal~~~~~~p~PaAyVlp~~d~~~~~~~~~~~g~~~Q~i~~~ 76 (154) T protein:vir:99 1 MADGLCAPFDH----NLVIERLRDQVKVLKHVGGAAELGTITQLRDFRTPAAYVLLAQETLSPKPAGHAGGATRQMANVH 76 (154) T ss_pred CCCCccCCccc----HHHHHHHHHhCcchhhhhhhhhhhhhhhhcCCCCceEEEEecccccCCCCCCccccceeeeeeeE Confidence 655 665 678888999999887654321 1 2355789999997777543221 23452 466778 Q ss_pred EEEEEEEcCcCcCcHHHHHHHHHHHHHHhccc--ccCCCccc--cCccceeeeccCccchhhcCceEEEEEEEEEEEeeC Q lcl|NC_019722. 66 FSVHCVLGWEVKSLALELWEFSAAVAQLIRKS--GVWVKGGV--LTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLG 141 (182) Q Consensus 66 ~~a~ivv~~~~~~~~~~a~~lAa~l~~ll~~q--~wGL~~~~--v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG 141 (182) |++-+++........-.+.+..-.+...++.. .|. |+.. +.+-++..++-..+ ++|...|.-+|+=...|| T Consensus 77 f~Vvl~v~~~~d~~G~~a~d~l~~lr~~v~~AL~GW~-P~~~~G~~pi~~~gG~l~d~----~~g~l~y~~~F~~~~~lg 151 (154) T protein:vir:99 77 FAITVAVRNYRDNKGVTAADDLRPVLGDVRKALIGWT-PPGLAGARDCQLVQGQVVDY----DASVLIWTDLYQTQHAIG 151 (154) T ss_pred EEEEEEeeccCcccchhhHHHHHHHHHHHHHHHhCCC-CCcccCCceeeecCcceeec----cCcEEEEeeeeeeeeecC Confidence 88888887643333335555555555555544 353 2111 23345555444443 368999999999999999 Q ss_pred Ccc Q lcl|NC_019722. 142 ESM 144 (182) Q Consensus 142 ~~~ 144 (182) .+- T Consensus 152 r~~ 154 (154) T protein:vir:99 152 RTS 154 (154) T ss_pred CCC Confidence 988 No 11 >protein:vir:107857 Length: 154 # NCBI annotation: gp37 # Family: family:all:1532 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024710;genbank:gi:48696947;genbank:GeneID:2845945 Probab=84.95 E-value=0.056 Score=27.43 Aligned_cols=148 Identities=22% Similarity=0.220 Sum_probs=85.1 Q ss_pred CCcccHHHHHHHHHHHHHHhcCcceeeecccccccc----cccceeeeeeecccccCCcCCC--ceEEEEEEEEEEEEcC Q lcl|NC_019722. 1 MSQTTITEVHEAIKAKLRETFPKVTVDDYNPEPELS----VLAPALLLELEEFPMGADVGDD--RYPAACRFSVHCVLGW 74 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~v~~~~~~~~~----~~~PAv~i~~~~~~~~~d~gtG--~~~~~~~~~a~ivv~~ 74 (182) ||.. ....++|.++||+++|++.+. .+|+..+. 
-+.-|||+...+=.-+.--.+| .-.-.+++.+=|++.. T Consensus 1 m~~t--~~ii~aiv~rL~~~lP~~~ve-~fP~~p~ey~l~h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi~r~ 77 (154) T protein:vir:10 1 MATT--LEMVDAIVARLRVKLPALVTE-YFPERPDEYRLNHAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIVLRQ 77 (154) T ss_pred Cchh--HHHHHHHHHHHHHhCCcceEe-eCCCChhHcCCCCCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEEeec Confidence 9995 378899999999999998876 44443222 2557899988765544322223 2345677888888765 Q ss_pred cCcCcHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCccccCCCCCccE Q lcl|NC_019722. 75 EVKSLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQE 154 (182) Q Consensus 75 ~~~~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~w~~~~~~p~~ 154 (182) -..+ -.|.++--++...|-| |-+| .|.+ +....+.|.-+. +|...|.+.+.=+..+=+.-=..++..-++ T Consensus 78 l~g~--~gal~~LD~vR~aL~G--f~pp--dc~~---~~lv~d~f~ge~-~G~W~Y~l~~at~t~~Ve~~~~~d~pll~~ 147 (154) T protein:vir:10 78 LNGR--GGAIDVLDHVRTALVG--FRPP--DCKK---LAAVSDKFLGES-AGLWQYVIEFSAGAVIVEDAEPNDGPLLTQ 147 (154) T ss_pred cCCc--chhhHHHHHHHHHHhc--cccC--CCce---eehhhhcccccc-cceeeeeeeeccchhhhhccCCCCCceeee Confidence 4332 2344444444444433 4444 3653 778888886655 789888888876443222111222222222 Q ss_pred EEEEEcCCCCC Q lcl|NC_019722. 155 IYLAYAPGGDV 165 (182) Q Consensus 155 v~lg~~P~~d~ 165 (182) | .|.-+ + T Consensus 148 v--~yee~--~ 154 (154) T protein:vir:10 148 V--TYEEE--S 154 (154) T ss_pred e--eeccc--C Confidence 2 23322 1 No 12 >protein:vir:79065 Length: 154 # NCBI annotation: gp11 # Family: family:all:1532 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111211;genbank:gi:134288825;genbank:GeneID:4960739 Probab=81.86 E-value=0.082 Score=26.54 Aligned_cols=148 Identities=20% Similarity=0.215 Sum_probs=84.0 Q ss_pred CCcccHHHHHHHHHHHHHHhcCcceeeecccccccc----cccceeeeeeecccccCCcCCC--ceEEEEEEEEEEEEcC Q lcl|NC_019722. 
1 MSQTTITEVHEAIKAKLRETFPKVTVDDYNPEPELS----VLAPALLLELEEFPMGADVGDD--RYPAACRFSVHCVLGW 74 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~v~~~~~~~~~----~~~PAv~i~~~~~~~~~d~gtG--~~~~~~~~~a~ivv~~ 74 (182) ||.. ....++|.++||+++|++.+. .+|+..+. -+.-|||+...+=.-+.--.+| .-.-.+++.+=|++.. T Consensus 1 m~~t--~~ii~~iv~rL~~~lP~~~ve-~fP~~p~ey~l~h~~GAvLV~Y~GS~f~~~~~~~~i~Q~R~~~~~vTVi~r~ 77 (154) T protein:vir:79 1 MATT--LEMVDSVVARLRVKLPALVTE-YFPERPDEYRLNHAIGALLVSYPGSQYDTTVDTDMVVQPRRVKFAVAIVLRQ 77 (154) T ss_pred Cchh--HHHHHHHHHHHHHhCCcceEe-eCCCChhHcCCCCCceeEEEEecCcccCCcccCCceeeeeEEEEEEEEEeec Confidence 9995 378899999999999998876 44443222 2557899988765544322223 2245567888887765 Q ss_pred cCcCcHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCccccCCCCCccE Q lcl|NC_019722. 75 EVKSLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQE 154 (182) Q Consensus 75 ~~~~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~w~~~~~~p~~ 154 (182) -..+ -.|.++--++...|-| |-+| .|.+ +....+.|.-+. +|...|.+.+.=+..+=+.-=..++..-++ T Consensus 78 l~g~--~gal~~LD~vR~aL~G--f~pp--dc~~---~~lv~d~f~ge~-~G~W~Y~l~~at~t~~Ve~~e~~d~pll~~ 147 (154) T protein:vir:79 78 LNGR--GGAIDVLDHVRTALVG--FRPP--DCKK---LAAVSDKFLGES-AGLWQYVIEFSAGAVIVEDAEPNDGPLLTQ 147 (154) T ss_pred cCCc--chhhHHHHHHHHHHhc--cccC--CCce---eehhhhcccccc-cceeeeeeeeccchhhhccCCCCCCceeee Confidence 4332 2344444444444433 4444 3653 777888886655 789888888866443222111111221222 Q ss_pred EEEEEcCCCCC Q lcl|NC_019722. 
155 IYLAYAPGGDV 165 (182) Q Consensus 155 v~lg~~P~~d~ 165 (182) | .|--+ + T Consensus 148 v--~yee~--~ 154 (154) T protein:vir:79 148 V--TYEEE--S 154 (154) T ss_pred E--eeeec--C Confidence 2 22221 1 No 13 >protein:vir:108220 Length: 133 # NCBI annotation: gp14 # Family: family:all:6424 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552343;genbank:gi:160700663;genbank:GeneID:5758940 Probab=73.48 E-value=0.17 Score=24.81 Aligned_cols=129 Identities=10% Similarity=0.054 Sum_probs=76.4 Q ss_pred CCcccHH-HHHHHHHHHHHHhcC-cceeeecccccccc-cccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 1 MSQTTIT-EVHEAIKAKLRETFP-KVTVDDYNPEPELS-VLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l~-~l~~AI~~~l~~~~P-~l~~v~~~~~~~~~-~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) |+...+- +--.+|++.|++.+| .+..+.--|+..+. -..|++.++=++- .=...+.-....||-+-... T Consensus 1 m~~~Rvp~D~~~~Ik~~L~~~l~a~v~~~~~lPddW~~~s~~P~vvV~dDgg-------pv~wpv~t~~~IRvtv~a~g- 72 (133) T protein:vir:10 1 MSDVRVVGDPVPPVKAYLAAFWGARVRIADEVPDDWHVETDVPLIVVDDDGG-------PIDWPVKSDPLVRCGIYANG- 72 (133) T ss_pred CCCcccCCCChHHHHHHHHhhccccceeeeecCCCccccCCceEEEEecCCC-------ccccceeccceEEEEEeecC- Confidence 9985543 467899999999999 56667777776554 2568877764332 22244444455555543322 Q ss_pred CcHHHHHHHHHHHHHHhc-ccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEee Q lcl|NC_019722. 78 SLALELWEFSAAVAQLIR-KSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYL 140 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~-~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~L 140 (182) +-+||+|+.+...+|- ...=|.. 
--+++..++--++|...--.--+|.|=.=.=+|.+.+ T Consensus 73 --r~~Ar~l~~~~~g~LLa~~i~Gva-~ii~~g~glL~aRD~~tgg~iAsfTV~A~~rt~~~~~ 133 (133) T protein:vir:10 73 --KQTAKNLRRITMGALLAEPIPGIA-HIQRTGIGYVDARDPDTGADIASFTVTATVRTEVITV 133 (133) T ss_pred --ChhHHHHHHHHHHHHhcCCCCcee-EEcCCCceEEecCCCCCCceEEEEEEEeeeeeeEeeC Confidence 2378888887555444 4432443 2367777676666654321112455555566788876 No 14 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=57.34 E-value=0.44 Score=22.56 Aligned_cols=131 Identities=10% Similarity=0.114 Sum_probs=69.9 Q ss_pred CCcccH-HHHHHHHHHHHHHhcCccee--eecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 1 MSQTTI-TEVHEAIKAKLRETFPKVTV--DDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l-~~l~~AI~~~l~~~~P~l~~--v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) ||--+- .+|..||.+.|++ -|.|.. +..|+........|-|.+.=++..+.+ ..++ .....++..+|... . . T Consensus 1 m~~~s~~~aLq~Ai~~~L~a-d~~l~alvg~I~D~~P~~~~~PYV~lG~~~~~d~~-~~~~-~g~~~~~ti~Vws~-~-g 75 (134) T protein:vir:59 1 MTWKLASRALQKATVENLES-YQPLMEMVNQVTESPGKDDPYPYVVIGDQSSTPFE-TKSS-FGENITMDFHVWGG-T-T 75 (134) T ss_pred CCccchhHHHHHHHHHHhhc-ChhHHHhhhhhhcCCCCCCCCCEEEeCCceeeecC-CCcc-cceEEEEEEEEEEC-C-C Confidence 999763 5899999999996 555432 356666555556776666545444432 2222 24455666777643 2 2 Q ss_pred CcHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCccccCCCCCccEEEE Q lcl|NC_019722. 78 SLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQEIYL 157 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~w~~~~~~p~~v~l 157 (182) ..++..+++++...|+++..-|++-. ..-|.++..|.+...... 
..=+.|-| T Consensus 76 --~~ea~~ia~av~~aL~~~~L~l~~~~---------------------lv~l~~~~~~~~rd~dg~-----~~hg~l~f 127 (134) T protein:vir:59 76 --RAEAQDISSRVLEALTYKPLMFEGFT---------------------FVAKKLVLAQVITDTDGV-----TKHGIIKV 127 (134) T ss_pred --hHHHHHHHHHHHHHhcCCCcccCCce---------------------EEEeEEeeeeEEecCCCc-----eEEEEEEE Confidence 35789999999999998866654211 122333333333221100 00111222 Q ss_pred EEcCCCC Q lcl|NC_019722. 158 AYAPGGD 164 (182) Q Consensus 158 g~~P~~d 164 (182) =+-=|+| T Consensus 128 ra~ve~~ 134 (134) T protein:vir:59 128 RFTINNN 134 (134) T ss_pred EEEEecC Confidence 1111211 No 15 >protein:vir:105468 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:1535 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529878;genbank:gi:90592618;genbank:GeneID:3974532 Probab=50.54 E-value=0.61 Score=21.77 Aligned_cols=131 Identities=11% Similarity=0.099 Sum_probs=82.1 Q ss_pred HHHHHHHHHHHHHhcCcceeeecc-cccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCcCcHHHHHH Q lcl|NC_019722. 7 TEVHEAIKAKLRETFPKVTVDDYN-PEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVKSLALELWE 85 (182) Q Consensus 7 ~~l~~AI~~~l~~~~P~l~~v~~~-~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~~~~~~a~~ 85 (182) .++.+||..+|++.||..+ .| ++...++..||.||.+......+.. .+|+--..+|..+-.-+.. +...+..+ T Consensus 1 ~~ii~~I~~~L~~~fpd~~---IY~e~i~Qg~~~PcFFI~~l~~~~~~~~-~~ry~r~~~fdI~Yfp~~~--~~~~e~~~ 74 (135) T protein:vir:10 1 MTIVERIAKRISEIFPDVT---IYSEKQKSGFQVPSFYISKIMTVTKSRF-FDIQDRSLSYSITYFANPD--RPNADMEE 74 (135) T ss_pred ChhHHHHHHHHHHhcCcee---eecccccCCCcCCeeEEEEecCCccccc-cceEEEEeeEEEEEeecCC--CchhhHHH Confidence 4789999999999999844 55 4456679999999999998776654 6688876666666555433 34567778 Q ss_pred HHHHHHHHh---cccccCCCccccCccceeeeccCccchhhcCceEEEEEEEEEEEeeCCccccCCCCCccEEEEEEcCC Q lcl|NC_019722. 
86 FSAAVAQLI---RKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWNQTLYLGESMWNADGITPQEIYLAYAPG 162 (182) Q Consensus 86 lAa~l~~ll---~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~Q~v~LG~~~w~~~~~~p~~v~lg~~P~ 162 (182) .|..|...+ .+--+|+.. .+..-++ ||.....|..+..++.-++ ..+=.++-. + T Consensus 75 vae~L~~~le~i~~~~~~~~~-------------~~~i~~~-D~VLhf~~~~~~~~~k~~~-----~~~M~~l~~----~ 131 (135) T protein:vir:10 75 VEQKLLNNFTRLDDYATVRNR-------------ETTINQD-DETLVMSFDLRLEMYPVQD-----GGKLERIEF----N 131 (135) T ss_pred HHHHHHHhhhhcCceeEEeCC-------------ceEEEee-cCeEEEEEEEEEEEeecCC-----cchhhhhhh----c Confidence 888886665 232344321 1111122 7788899999888887432 111112110 1 Q ss_pred CCCC Q lcl|NC_019722. 163 GDVP 166 (182) Q Consensus 163 ~d~G 166 (182) +++- T Consensus 132 ~~ik 135 (135) T protein:vir:10 132 GGIQ 135 (135) T ss_pred cCcC Confidence 0111 No 16 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=46.82 E-value=0.72 Score=21.36 Aligned_cols=131 Identities=14% Similarity=0.066 Sum_probs=64.4 Q ss_pred CCcccHHHHHHHHHHHHHHhcCccee---eecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 1 MSQTTITEVHEAIKAKLRETFPKVTV---DDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~---v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) ||. ..+|..||.+.|++ .|.|.. +..|+......+-|-|.+.=.+..+... ........++.++|.... T Consensus 3 msa--~~aLq~Ai~~~L~a-d~~l~alvggrVyD~~P~~~~~PYV~lG~~~~~~~~~--~~~~g~~~~~tl~Vws~~--- 74 (140) T protein:vir:96 3 VTA--EPLLYNKIMNNLIE-NPITDKLVGGRVFDCVQKDVVYPYIVVGESNVTESER--SPGMREIIAITFHVYSQY--- 74 (140) T ss_pred cch--hHHHHHHHHHHhcc-ChhHHhhcCcccccCCccCCCCCEEEeCCceeeecCC--CcccceEEEEEEEEEEcC--- Confidence 665 45899999999996 665532 2456554555566766554444433322 223344556677776532 Q ss_pred CcHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCccchhhcCceEE-EEEEEEEEEe---eCCcc Q lcl|NC_019722. 
78 SLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSFRKDTQQGYDS-RVVTWNQTLY---LGESM 144 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~~~~~~dgy~v-W~VeW~Q~v~---LG~~~ 144 (182) .-..++..+|.++...|+ +.-.|++-.+ .+++-......++- ||+.. =.|+|.=.|. +-+.. T Consensus 75 ~g~~ea~~ia~ai~~aL~-~~l~l~~~~l---v~l~~~~~~~~rd~-dg~t~hgvl~~ra~ve~~~~~~~~ 140 (140) T protein:vir:96 75 ENGAEARELLKYLNYACR-LNINFKDYEL---EWIKKDNSQVFTDI-DQYTKHGVLRLLYKVRHKTLQERV 140 (140) T ss_pred CCHHHHHHHHHHHHHHhc-CCccCCCceE---EEEEEeeeEEeecC-CCceEEEEEEEEEEEeecccccCC Confidence 234578999999999996 4333322111 11111111111111 22211 1133333331 22222 No 17 >protein:vir:94061 Length: 175 # NCBI annotation: hypothetical protein # Family: family:all:3176 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453620;genbank:gi:84662656;genbank:GeneID:5142571 Probab=31.37 E-value=1.5 Score=19.61 Aligned_cols=149 Identities=11% Similarity=0.021 Sum_probs=69.9 Q ss_pred CCc----ccHHHHHHHHHHHHHHhcC----cceeeeccccccccc-ccceeeeeeecc------cccCCcCCCceEEEEE Q lcl|NC_019722. 1 MSQ----TTITEVHEAIKAKLRETFP----KVTVDDYNPEPELSV-LAPALLLELEEF------PMGADVGDDRYPAACR 65 (182) Q Consensus 1 Ms~----~~l~~l~~AI~~~l~~~~P----~l~~v~~~~~~~~~~-~~PAv~i~~~~~------~~~~d~gtG~~~~~~~ 65 (182) ||. ++-+++++|+.+-|.+-|| +.++...+-- ..++ .-+-|++....- ...=++++|.....-. T Consensus 1 m~avtl~~Te~di~~alr~fL~~lf~lp~~~~eVi~g~qN-~~p~P~g~fi~mt~l~~~~lsT~~~~Y~~~~g~~~~~~~ 79 (175) T protein:vir:94 1 MTAATLTPTEDAVFDAMFGFLAKVLDLPDDTQAIIKGFQN-LSSTPTGSCVVVSPGMMTRQDFGSRLYDPGLSKVVIEAH 79 (175) T ss_pred CcceeccccHHHHHHHHHHHHHHHcCCCCCCceEEEeccC-CCCccCCCEEEEecccccccccceeeecccccceeeeee Confidence 988 4446799999999999998 4555544322 1111 112222222111 1112467777777776 Q ss_pred EEEEEEEcCcCcCcHHHHHHHHHHHHHHhcccccCCC-------ccccCccceeeeccCccchhhcCceEEEEEEEE--- Q lcl|NC_019722. 
66 FSVHCVLGWEVKSLALELWEFSAAVAQLIRKSGVWVK-------GGVLTKPEGLEVYPGSFRKDTQQGYDSRVVTWN--- 135 (182) Q Consensus 66 ~~a~ivv~~~~~~~~~~a~~lAa~l~~ll~~q~wGL~-------~~~v~~a~~v~a~p~~~~~~~~dgy~vW~VeW~--- 135 (182) .+..|=++.--++ |.+.|..++.+.|+. ||-. +=-+++|+.+.--++.. +=-.-|+++.. T Consensus 80 ~q~~~QvD~YG~~----A~d~A~~~~tl~Rs~-~a~~~~~~~~~PLYad~p~qlp~iN~e~-----QyE~Rwt~~~~lQ~ 149 (175) T protein:vir:94 80 LTYSYQVDCYGPL----APTWASVISVAWKSM-WGVDNTAPAFAPLYADAPQQLNIVNSEG-----QFEQRFMVRLFGQV 149 (175) T ss_pred eEEEEEEEeecCC----hHHHHHHHHHHhcCh-hHhhhhhcccccccCcCccccCccCccc-----cccceEEEEEEEEe Confidence 7777766665444 455666666666665 4421 01123333322221111 11234776643 Q ss_pred -EEEeeCCccccCCCCCccEEEEEEcCCCCCCCcchhhhhh Q lcl|NC_019722. 136 -QTLYLGESMWNADGITPQEIYLAYAPGGDVPPADEHEKAE 175 (182) Q Consensus 136 -Q~v~LG~~~w~~~~~~p~~v~lg~~P~~d~Gp~~e~dY~~ 175 (182) +.+-.=.+-++...+. .+ +.++-.+ T Consensus 150 Np~vt~pq~F~d~v~v~--~~-------------~~d~~~P 175 (175) T protein:vir:94 150 NQRVALPQDFFDSVQLT--SL-------------NIADLLP 175 (175) T ss_pred eeEEeeehhhcccCCcc--ee-------------eceecCC Confidence 3333444444333221 11 1111111 No 18 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=29.68 E-value=1.6 Score=19.41 Aligned_cols=136 Identities=8% Similarity=0.025 Sum_probs=64.4 Q ss_pred CCcccHHHHHHHHHHHHHHhcCccee---eecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 1 MSQTTITEVHEAIKAKLRETFPKVTV---DDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~---v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) |+----.+|++||.+.|++ -|.|.. ...|+...+...-|-|.+.=....+.. ..+ ......++..+|.-.. 
T Consensus 1 M~~s~~~aLq~ai~~~L~a-d~~l~~lvg~~vyD~~P~~~~~PyV~lG~~~~~~~~-t~~-~~~~~~~lti~Vws~~--- 74 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKS-NPIIQKQLGGRVFDCVQKDAVYPYIVVGETNVTNKE-TTT-SMVEDVGITLHVYSQA--- 74 (145) T ss_pred CcccHHHHHHHHHHHHhhc-ChhHHHhcCcccccCCccCCCCCEEEeccceeeecC-CCc-ccceEEEEEEEEEEcC--- Confidence 8865456899999999985 554321 235655555556776665444443332 122 3455667777777532 Q ss_pred CcHHHHHHHHHHHHHHhcccccCCCccccCcccee--eeccCccchhhcCceEEE-EEEEEEEEe---eCCccccCCCCC Q lcl|NC_019722. 78 SLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGL--EVYPGSFRKDTQQGYDSR-VVTWNQTLY---LGESMWNADGIT 151 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v--~a~p~~~~~~~~dgy~vW-~VeW~Q~v~---LG~~~w~~~~~~ 151 (182) .-..++..++.++...|++. |. .+.-.++ +-.......+ .||...- .|++.=.+. +-++.- T Consensus 75 ~gr~ea~~ia~ai~~aL~~~---l~---l~~~~lv~l~~~~~~~~rd-~d~~~~hgvl~~ra~i~~~~~~~~~~------ 141 (145) T protein:vir:12 75 RNRDEASQIIQFLGFVLNNE---IE---IDYYSFIKSRIDTQEVITD-IDQYTKHGIIRLVFKYRHNTLQRSVT------ 141 (145) T ss_pred ccHHHHHHHHHHHHHHhccc---cC---CCCceEEEEEEeeEEEEec-CCCceEEEEEEEEEEEEeCCcccccc------ Confidence 23468899999999888752 32 1111111 1100000011 1221100 122222221 111111 Q ss_pred ccEEEEEEcCCCCCCCc Q lcl|NC_019722. 152 PQEIYLAYAPGGDVPPA 168 (182) Q Consensus 152 p~~v~lg~~P~~d~Gp~ 168 (182) .|.| T Consensus 142 -------------~~~~ 145 (145) T protein:vir:12 142 -------------NGAG 145 (145) T ss_pred -------------cCCC Confidence 1221 No 19 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=29.59 E-value=1.6 Score=19.40 Aligned_cols=135 Identities=10% Similarity=0.058 Sum_probs=66.8 Q ss_pred CCcccHHHHHHHHHHHHHHhcCccee---eecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 
1 MSQTTITEVHEAIKAKLRETFPKVTV---DDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~---v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) ||=---.+|..||.+.|++ -|.|.. +..|+......+.|.|.+.=.+..+.++ .+......++..+|..... T Consensus 1 Msms~~~aLQ~Ai~~~L~a-daal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~--~~~~g~~~~~ti~Vws~~~-- 75 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLIS-DPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNES--SATMRETVGIVIHVYSQFA-- 75 (141) T ss_pred CccchhHHHHHHHHHHhhc-ChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCC--CcccceEEEEEEEEEEcCC-- Confidence 7722235899999999996 555432 3456655555567766665555444332 2234456677777775432 Q ss_pred CcHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCcc--chhhcCceEEE-EEEEEEEEeeCCccccCCCCCccE Q lcl|NC_019722. 78 SLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSF--RKDTQQGYDSR-VVTWNQTLYLGESMWNADGITPQE 154 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~--~~~~~dgy~vW-~VeW~Q~v~LG~~~w~~~~~~p~~ 154 (182) -..++..+|.++...|+.. |. ++.-..+....... .++ .||.... .|+|.=.+ T Consensus 76 -g~~eak~ia~av~~AL~~~---l~---l~~~~lv~l~~~~~~~~rd-~dg~t~hgvl~~ra~v---------------- 131 (141) T protein:vir:94 76 -TQYEAKLILSAIGYVLNRP---IE---IDNYEFQFSRIDSQAVFPD-IDRFTKHGTIRLLFKY---------------- 131 (141) T ss_pred -CHHHHHHHHHHHHHHhccc---cc---CCCceEEEEEEeeeeeeec-CCCceEEEEEEEEEEE---------------- Confidence 3467889999999999742 32 22222221111111 111 1222111 12222222 Q ss_pred EEEEEcCCCCCCCcchhhh Q lcl|NC_019722. 155 IYLAYAPGGDVPPADEHEK 173 (182) Q Consensus 155 v~lg~~P~~d~Gp~~e~dY 173 (182) .|. 
+-+|.-| T Consensus 132 -------~~~--~~~~~~~ 141 (141) T protein:vir:94 132 -------RHK--KKNEGVY 141 (141) T ss_pred -------Eec--cccccCC Confidence 111 1233333 No 20 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=29.59 E-value=1.6 Score=19.40 Aligned_cols=135 Identities=10% Similarity=0.058 Sum_probs=66.8 Q ss_pred CCcccHHHHHHHHHHHHHHhcCccee---eecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 1 MSQTTITEVHEAIKAKLRETFPKVTV---DDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~---v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) ||=---.+|..||.+.|++ -|.|.. +..|+......+.|.|.+.=.+..+.++ .+......++..+|..... T Consensus 1 Msms~~~aLQ~Ai~~~L~a-daal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~--~~~~g~~~~~ti~Vws~~~-- 75 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLIS-DPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNES--SATMRETVGIVIHVYSQFA-- 75 (141) T ss_pred CccchhHHHHHHHHHHhhc-ChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCC--CcccceEEEEEEEEEEcCC-- Confidence 7722235899999999996 555432 3456655555567766665555444332 2234456677777775432 Q ss_pred CcHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCcc--chhhcCceEEE-EEEEEEEEeeCCccccCCCCCccE Q lcl|NC_019722. 78 SLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSF--RKDTQQGYDSR-VVTWNQTLYLGESMWNADGITPQE 154 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~--~~~~~dgy~vW-~VeW~Q~v~LG~~~w~~~~~~p~~ 154 (182) -..++..+|.++...|+.. |. ++.-..+....... .++ .||.... 
.|+|.=.+ T Consensus 76 -g~~eak~ia~av~~AL~~~---l~---l~~~~lv~l~~~~~~~~rd-~dg~t~hgvl~~ra~v---------------- 131 (141) T protein:vir:10 76 -TQYEAKLILSAIGYVLNRP---IE---IDNYEFQFSRIDSQAVFPD-IDRFTKHGTIRLLFKY---------------- 131 (141) T ss_pred -CHHHHHHHHHHHHHHhccc---cc---CCCceEEEEEEeeeeeeec-CCCceEEEEEEEEEEE---------------- Confidence 3467889999999999742 32 22222221111111 111 1222111 12222222 Q ss_pred EEEEEcCCCCCCCcchhhh Q lcl|NC_019722. 155 IYLAYAPGGDVPPADEHEK 173 (182) Q Consensus 155 v~lg~~P~~d~Gp~~e~dY 173 (182) .|. +-+|.-| T Consensus 132 -------~~~--~~~~~~~ 141 (141) T protein:vir:10 132 -------RHK--KKNEGVY 141 (141) T ss_pred -------Eec--cccccCC Confidence 111 1233333 No 21 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=29.59 E-value=1.6 Score=19.40 Aligned_cols=135 Identities=10% Similarity=0.058 Sum_probs=66.8 Q ss_pred CCcccHHHHHHHHHHHHHHhcCccee---eecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 1 MSQTTITEVHEAIKAKLRETFPKVTV---DDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~---v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) ||=---.+|..||.+.|++ -|.|.. +..|+......+.|.|.+.=.+..+.++ .+......++..+|..... T Consensus 1 Msms~~~aLQ~Ai~~~L~a-daal~alvg~rI~D~~P~~~~~PYv~lG~~~~~~~~~--~~~~g~~~~~ti~Vws~~~-- 75 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLIS-DPNINKLVDDRVFDVVQDDAVYPYIVVGESNVTNNES--SATMRETVGIVIHVYSQFA-- 75 (141) T ss_pred CccchhHHHHHHHHHHhhc-ChhhHhhcCCccccCCccCCCCCEEEeCCceeeecCC--CcccceEEEEEEEEEEcCC-- Confidence 7722235899999999996 555432 3456655555567766665555444332 2234456677777775432 Q ss_pred CcHHHHHHHHHHHHHHhcccccCCCccccCccceeeeccCcc--chhhcCceEEE-EEEEEEEEeeCCccccCCCCCccE Q lcl|NC_019722. 
78 SLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLEVYPGSF--RKDTQQGYDSR-VVTWNQTLYLGESMWNADGITPQE 154 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~a~p~~~--~~~~~dgy~vW-~VeW~Q~v~LG~~~w~~~~~~p~~ 154 (182) -..++..+|.++...|+.. |. ++.-..+....... .++ .||.... .|+|.=.+ T Consensus 76 -g~~eak~ia~av~~AL~~~---l~---l~~~~lv~l~~~~~~~~rd-~dg~t~hgvl~~ra~v---------------- 131 (141) T protein:vir:96 76 -TQYEAKLILSAIGYVLNRP---IE---IDNYEFQFSRIDSQAVFPD-IDRFTKHGTIRLLFKY---------------- 131 (141) T ss_pred -CHHHHHHHHHHHHHHhccc---cc---CCCceEEEEEEeeeeeeec-CCCceEEEEEEEEEEE---------------- Confidence 3467889999999999742 32 22222221111111 111 1222111 12222222 Q ss_pred EEEEEcCCCCCCCcchhhh Q lcl|NC_019722. 155 IYLAYAPGGDVPPADEHEK 173 (182) Q Consensus 155 v~lg~~P~~d~Gp~~e~dY 173 (182) .|. +-+|.-| T Consensus 132 -------~~~--~~~~~~~ 141 (141) T protein:vir:96 132 -------RHK--KKNEGVY 141 (141) T ss_pred -------Eec--cccccCC Confidence 111 1233333 No 22 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=24.57 E-value=2.2 Score=18.75 Aligned_cols=131 Identities=11% Similarity=0.094 Sum_probs=67.5 Q ss_pred CCcccHHHHHHHHHHHHHHhcCccee---eecccccccccccceeeeeeecccccCCcCCCceEEEEEEEEEEEEcCcCc Q lcl|NC_019722. 1 MSQTTITEVHEAIKAKLRETFPKVTV---DDYNPEPELSVLAPALLLELEEFPMGADVGDDRYPAACRFSVHCVLGWEVK 77 (182) Q Consensus 1 Ms~~~l~~l~~AI~~~l~~~~P~l~~---v~~~~~~~~~~~~PAv~i~~~~~~~~~d~gtG~~~~~~~~~a~ivv~~~~~ 77 (182) ||=---.+|+.||.+.|++ .|.|.. +..|+........|-|.+.=.+..+.+. .++ .....++..+|.... 
T Consensus 1 Msms~~~aLq~Ai~a~L~a-da~l~alvg~~VyD~~P~~~~~Pyv~lG~~~~~~~~~-~~~-~g~~~~~~i~Vws~~--- 74 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKA-SPIINKFVGDRVFDVVQEDAVYPYIVVGESNVTNNES-STM-MRETVGIVIHVYSQF--- 74 (140) T ss_pred CCccHHHHHHHHHHHHhhc-ChhHHHhcCCccccCCccCCCCCEEEecCceeeecCC-Ccc-cceEEEEEEEEEEcC--- Confidence 7732346899999999986 566543 2356555555566766665455544332 222 344667777777532 Q ss_pred CcHHHHHHHHHHHHHHhcccccCCCccccCccceee--eccCccchhhcCceEE-EEEEEEEEEe---eCCcc Q lcl|NC_019722. 78 SLALELWEFSAAVAQLIRKSGVWVKGGVLTKPEGLE--VYPGSFRKDTQQGYDS-RVVTWNQTLY---LGESM 144 (182) Q Consensus 78 ~~~~~a~~lAa~l~~ll~~q~wGL~~~~v~~a~~v~--a~p~~~~~~~~dgy~v-W~VeW~Q~v~---LG~~~ 144 (182) .-..++..++.++...|+ +.--|++ -..+. -......++- ||... =.|++.=.+- +||-. T Consensus 75 ~g~~ea~~ia~av~~AL~-~~l~l~~-----~~lv~l~~~~~~~~rd~-dg~~~hgvl~~r~~v~~~~~~~~~ 140 (140) T protein:vir:96 75 ATQYEAKQIISAIGYVLN-RPIDIEN-----YEFQFSRIDSQSVFPDI-DRFTKHGTIRLLFKYRHIKKGEGV 140 (140) T ss_pred CCHHHHHHHHHHHHHHhC-CCccCCC-----CeEEEEEEeeeEEEecC-CCceEEEEEEEEEEEEeeccccCC Confidence 234678999999999886 4232321 11111 1111111111 22211 1244444442 44433 Done!