Query lcl|NC_011023.1_cdsid_YP_001994839.1 [gene=22] [protein=gp22] [protein_id=YP_001994839.1] [location=15579..15947] Match_columns 122 No_of_seqs 11 out of 13 Neff 2.7 Searched_HMMs 1612 Date Thu Nov 7 13:43:25 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_21 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_21_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:104090 Length: 122 100.0 1.2E-74 7.6E-78 425.8 11.6 122 1-122 1-122 (122) 2 protein:vir:4229 Length: 122 # 100.0 2.2E-74 1.3E-77 424.4 11.7 122 1-122 1-122 (122) 3 protein:vir:2434 Length: 122 # 100.0 1.6E-73 9.8E-77 419.7 11.5 122 1-122 1-122 (122) 4 protein:vir:7775 Length: 122 # 100.0 1.9E-73 1.2E-76 419.3 11.5 122 1-122 1-122 (122) 5 protein:vir:2346 Length: 126 # 100.0 5.5E-66 3.4E-69 378.4 11.8 119 1-122 1-126 (126) 6 protein:vir:78288 Length: 120 100.0 3.9E-65 2.4E-68 373.7 12.1 119 1-122 1-120 (120) 7 protein:vir:78529 Length: 101 100.0 5.5E-56 3.4E-59 323.5 7.7 101 19-122 1-101 (101) 8 protein:vir:9577 Length: 112 # 97.1 6E-06 3.7E-09 49.2 7.8 105 1-122 1-111 (112) 9 protein:vir:9762 Length: 112 # 96.9 1.5E-05 9.5E-09 47.0 8.2 105 1-122 1-111 (112) 10 protein:vir:1641 Length: 115 # 96.1 0.00011 6.7E-08 42.3 8.4 105 1-122 4-114 (115) 11 protein:vir:94767 Length: 104 89.4 0.0062 3.8E-06 32.7 6.6 98 13-122 1-103 (104) 12 protein:vir:81216 Length: 118 83.4 0.054 3.4E-05 27.5 8.5 108 1-122 1-116 (118) 13 protein:vir:98924 Length: 113 82.7 0.047 2.9E-05 27.9 7.8 108 1-120 3-113 (113) 14 protein:vir:44 Length: 120 # N 79.3 0.11 6.6E-05 25.9 8.9 109 1-120 10-120 (120) 15 protein:vir:81177 Length: 109 76.5 0.13 8.4E-05 25.4 9.0 102 4-122 1-102 (109) 16 protein:vir:80934 Length: 120 75.7 0.14 8.9E-05 25.2 8.9 109 1-120 10-120 (120) 17 protein:vir:5258 Length: 123 # 71.4 0.19 0.00012 24.6 7.7 102 1-120 1-123 (123) 18 protein:vir:1582 Length: 117 # 67.0 0.17 0.00011 24.8 6.5 94 1-122 2-98 (117) 19 protein:vir:102856 Length: 107 66.7 0.26 0.00016 23.8 9.2 101 4-122 1-102 (107) 20 protein:vir:105006 Length: 107 66.7 0.26 0.00016 23.8 9.2 101 4-122 1-102 (107) 21 protein:vir:107606 Length: 107 66.7 0.26 0.00016 23.8 9.2 101 4-122 1-102 (107) 22 protein:vir:102084 Length: 107 66.7 0.26 0.00016 23.8 9.2 101 4-122 1-102 (107) 23 protein:vir:4459 Length: 134 # 64.3 0.3 0.00019 23.4 7.4 105 1-122 11-118 (134) 24 protein:vir:100244 Length: 109 63.4 0.32 0.0002 23.3 9.1 102 3-122 1-108 (109) 25 protein:vir:80342 Length: 108 60.2 0.38 0.00023 22.9 9.0 104 4-122 1-108 (108) 26 protein:vir:95261 Length: 133 59.7 0.39 0.00024 22.8 7.9 116 1-122 1-122 (133) 27 protein:vir:79686 Length: 118 56.1 0.33 0.00021 23.2 6.1 94 1-122 2-99 (118) 28 protein:vir:100134 Length: 109 46.6 0.73 0.00045 21.3 10.3 107 3-122 1-108 (109) 29 protein:vir:1436 Length: 108 # 39.1 1 0.00064 20.5 9.0 105 4-122 1-107 (108) 30 protein:vir:3872 Length: 146 # 37.0 1.1 0.00071 20.3 9.5 107 1-122 34-142 (146) 31 protein:vir:107716 Length: 132 35.0 1.3 0.00078 20.0 6.3 106 1-122 1-125 (132) 32 protein:vir:96107 Length: 133 33.1 1.4 0.00085 19.8 7.0 111 1-119 1-133 (133) 33 protein:vir:4343 Length: 118 # 30.7 1.6 0.00097 19.5 7.9 104 4-122 1-113 (118) 34 protein:vir:1890 Length: 110 # 28.3 1.8 0.0011 19.2 9.3 105 4-122 1-106 (110) 35 protein:vir:99571 Length: 131 24.5 2.2 0.0013 18.7 7.9 114 1-122 1-125 (131) 36 protein:vir:4789 Length: 123 # 23.3 2.3 0.0014 18.6 7.2 97 1-122 7-106 (123) 37 protein:vir:2506 Length: 116 # 20.9 2.7 0.0017 18.2 5.9 105 3-122 1-109 (116) No 1 >protein:vir:104090 Length: 122 # NCBI annotation: gp22 # Family: family:all:2818 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655601;genbank:gi:109392472;genbank:GeneID:4156958 Probab=100.00 E-value=1.2e-74 Score=425.79 Aligned_cols=122 Identities=93% Similarity=1.399 Sum_probs=122.0 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCC Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKE 80 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~ 80 (122) |||||+|++|++|||||||+++|.||||+|+||++|||+.||+||++||||||||||||||||+|||||+|||||||+++ T Consensus 1 mslld~g~~yd~V~VYPe~~v~D~dGNt~t~Ps~tgI~~~aR~qV~~qsgTsarraeqdn~gf~te~vyrmRfprsf~~e 80 (122) T protein:vir:10 1 MSLLDTGARYQNVIVYPEEMVIDSDGNKRTKPSKTGIPALARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKE 80 (122) T ss_pred CccccCCCCccceEecCceEEEecCCCceecCCcCCccceeeeeeecCCCCcccccccccCCcccceeEEEecccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 81 HGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 81 ~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) ||||||||+|||+|+|||+||||++|||||||||.+|||||| T Consensus 81 hGiLgaqS~veW~G~RwalfG~~~~y~sS~r~a~~~YtvkRf 122 (122) T protein:vir:10 81 HGILGAQSQIEWRGQRWALFGDATVYDSSPALSRVDYTIKRF 122 (122) T ss_pred cccccccceeeecceEEeeecccccccCCcceeeeeEEEEeC Confidence 999999999999999999999999999999999999999999 No 2 >protein:vir:4229 Length: 122 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2818 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039684;swissprot:sw:q05226;genbank:gi:9625450;uniprot:Q05226;genbank:GeneID:2942924 Probab=100.00 E-value=2.2e-74 Score=424.45 Aligned_cols=122 Identities=92% Similarity=1.370 Sum_probs=122.0 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCC Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKE 80 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~ 80 (122) |||||+|++|++|||||||+++|.||||+++||++|||++|||||++|||||+|||||||+||++|+||+|||||+|+++ T Consensus 1 mslld~g~~y~~viVYPee~~~D~DGNt~t~PS~~GIp~~Ar~Qv~~qsgTsarraE~d~~G~~~erVy~mr~prsf~~e 80 (122) T protein:vir:42 1 MSLLDTGARYQTCIVYPEEMVIDSDGNKRTRPSNTGIPAIARFQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKE 80 (122) T ss_pred CccccCCCCccceEEcCceEEEeCCCCceecCCCCCcceeeeEEecCCccccccccccCCCCcchheeeeeecCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 81 HGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 81 ~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) ||+|||||+|||+||||+|||||++|||||||||++|||||| T Consensus 81 hg~L~aqs~ieW~G~RW~l~Gdp~~y~ss~~tar~~ytvkr~ 122 (122) T protein:vir:42 81 HGILGAQSQIEWRDQRWALFGDATVYDSSPALARVDYTIKRY 122 (122) T ss_pred cccccccceeeECCeEEEEecccccccCCcceEEEEEEEEeC Confidence 999999999999999999999999999999999999999999 No 3 >protein:vir:2434 Length: 122 # NCBI annotation: gp20 # Family: family:all:2818 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046836;genbank:gi:9630404;genbank:GeneID:1261606 Probab=100.00 E-value=1.6e-73 Score=419.71 Aligned_cols=122 Identities=88% Similarity=1.322 Sum_probs=122.0 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCC Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKE 80 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~ 80 (122) |||||+|++||+|||||||+++|.||||+++||++|||++|||||++|||||+|||||||+||++|+||+|||||+|+++ T Consensus 1 mslld~~~~yd~viVYPe~~~~D~DGNt~t~PS~~GI~~~Ar~Qv~~qsgTsarraE~d~~G~~~erVy~~r~pr~f~~e 80 (122) T protein:vir:24 1 MSLLDTGARYQPVLVYPEELVIDADGNKKTQPSKTPIQAIARFQVANQSGTSARRAEQDNGGFTTEKVYRMRFPRSFTKE 80 (122) T ss_pred CccccCCCCccceEEcCceEEEeCCCCceecCCCCCcceeeeEEecCcccccccccccCCCCcchheeeeeecccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 81 HGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 81 ~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) ||+|||||+|||+||||+|||||++|||||||||++|||||| T Consensus 81 hg~L~aqs~veW~G~RW~l~Gdp~~y~ss~rtar~~ytv~r~ 122 (122) T protein:vir:24 81 HGILGAQTQIEWKGQRWALFGDATEYDSSPALARVDYTIKRF 122 (122) T ss_pred cccccccceeeecceEEEEecchhccCCCcceEEEEEEEEeC Confidence 999999999999999999999999999999999999999999 No 4 >protein:vir:7775 Length: 122 # NCBI annotation: gp20 # Family: family:all:2818 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817609;genbank:gi:29566039;genbank:GeneID:1259233 Probab=100.00 E-value=1.9e-73 Score=419.26 Aligned_cols=122 Identities=72% Similarity=1.174 Sum_probs=122.0 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCC Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKE 80 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~ 80 (122) |||||+|++||+|||||||+++|.||||+++||++|||++|||||++|||||+|||||||+||++|+||+|||||+|+++ T Consensus 1 mslld~~~~yd~viVYPe~~~~D~DGNt~t~PS~~GI~~~Ar~Qv~~qsgTsarraE~d~~G~~~erVy~~r~pr~f~~e 80 (122) T protein:vir:77 1 MSLLDGGPAYEDVIVYPEEVVTDEDGNTQTRPSKTGIPAKARFQVQGQSGTSARRAEQDNEGFESEKVYRMRFPRSWDAE 80 (122) T ss_pred CccccCCCCccceEEcCceEEEeCCCCceecCCCCCcceeeeEEecCcccccccccccCCCCcchheeeeeecccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 81 HGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 81 ~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) ||+|||||+|||+||||+|||||++|||||||||++|||||| T Consensus 81 hg~L~aqs~veW~G~RW~l~Gdp~~y~ss~rtar~~ytv~r~ 122 (122) T protein:vir:77 81 HGVLGAQSEIEWRGVRWALFGDVNFYNSSRRTARIDYTVKRY 122 (122) T ss_pred cccccccceeeecceEEEEecchhccCCCcceEEEEEEEEeC Confidence 999999999999999999999999999999999999999999 No 5 >protein:vir:2346 Length: 126 # NCBI annotation: gp16 # Family: family:all:2818 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075283;genbank:gi:12657870;genbank:GeneID:920135 Probab=100.00 E-value=5.5e-66 Score=378.37 Aligned_cols=119 Identities=53% Similarity=0.886 Sum_probs=117.6 Q ss_pred CcccccCCCc-------CcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEe Q lcl|NC_011023. 1 MSLLDTGARY-------QPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRF 73 (122) Q Consensus 1 MSLLD~g~~~-------e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~ 73 (122) |||||+|++| ++|||||||+++|.|||++++||++|||++|||||++|||||+|||||||+||++||||+||| T Consensus 1 mslld~g~~y~~p~d~~~~v~VYPe~~v~D~dGNt~~~Ps~~gI~~~ArfQv~~qSgTsarRaE~d~~G~~te~V~~~r~ 80 (126) T protein:vir:23 1 MSLLDRGGTYGSPEDGFDPVTVYPEVTRKDRLGNTLVGPSLTGIETVARFQVQGQSGTSARRAEMDDIGDMTEQVYTMRL 80 (126) T ss_pred CccccCCcccCCCcccCcceEecceEEEeeCCCCeeecCCCCCceeEEEEEecCccccccchhhccCCCCCCceEEEEEe Confidence 9999999999 889999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 74 PRSFTKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 74 ~Rs~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) ||+|+++ |+|||+|||+|+||+|||||+.|+|||+|+|++|+|||| T Consensus 81 ~r~f~~e---L~a~sqveW~G~RW~l~Gdp~~y~ssr~t~~~~ytv~R~ 126 (126) T protein:vir:23 81 PRSFTTE---LKSGSEVVWRGERWGVYGEPRRYKGSRRIAHLEYTVRRF 126 (126) T ss_pred ecCCchh---hccceeeeecceEEEeecChhhcCCCCceEEEEEEEEeC Confidence 9999886 899999999999999999999999999999999999999 No 6 >protein:vir:78288 Length: 120 # NCBI annotation: gp17 # Family: family:all:2818 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491669;genbank:gi:157786493;genbank:GeneID:5625766 Probab=100.00 E-value=3.9e-65 Score=373.72 Aligned_cols=119 Identities=55% Similarity=0.910 Sum_probs=117.3 Q ss_pred Cc-ccccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCC Q lcl|NC_011023. 1 MS-LLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTK 79 (122) Q Consensus 1 MS-LLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~ 79 (122) || |||+|++||+|||||||+++|.|||++++||++|||++|||||++|||||+|||||||+||++||||+|||||+|++ T Consensus 1 ~~~lld~~~~~~~v~VYPe~~~~D~dGNt~~~Ps~~gV~~~ArfQv~~qSgTs~rRaE~d~~G~~te~Vy~~rl~r~f~~ 80 (120) T protein:vir:78 1 MSGLLDDGANYEPVTVYPEVTRKDRLGNTLVGPSATGVETVARFQVQNQSGTSSRRAEMDDIGDMTEQVYTMRLPRSFTT 80 (120) T ss_pred CcccccCCCCcCceEEcceEEEeecCCCeeecCCCCCceeEEEEEEcCccccccchhcccCCCCCCceEEEEEeecCCch Confidence 76 99999999999999999999999999999999999999999999999999999999999999999999999999988 Q ss_pred CCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 80 EHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 80 ~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) + |+|||+|||+|+||+|||||+.|||||+|||.+|+|||| T Consensus 81 e---L~a~s~veW~G~RW~~~Gdp~~y~ssrrta~~~ytv~R~ 120 (120) T protein:vir:78 81 E---LKSGSEVVWRGERWGVYGDPRRYNGSRRTARLEYVVRRF 120 (120) T ss_pred h---hccceeeeecceEEEeecChhhcCCCCceEEEEEEEEeC Confidence 6 899999999999999999999999999999999999999 No 7 >protein:vir:78529 Length: 101 # NCBI annotation: gp17 # Family: family:all:2818 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491588;genbank:gi:157786411;genbank:GeneID:5625681 Probab=100.00 E-value=5.5e-56 Score=323.54 Aligned_cols=101 Identities=50% Similarity=0.828 Sum_probs=100.2 Q ss_pred eEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCCccCcceEEEECCeEEE Q lcl|NC_011023. 19 EMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHGILGAQSQIEWRGQRWA 98 (122) Q Consensus 19 e~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g~lgaqS~veW~G~rw~ 98 (122) ++.+|.||||+++||++|||+.|||||+||||||||||||||+||+|||||+|||||||+++ ||+||+|||+|+||| T Consensus 1 ~~~~D~~GNt~t~PS~tgi~t~ARfQV~~QSGTSaRRaE~Dn~G~~tE~VY~mRfpRsf~~E---L~~~sev~W~G~RW~ 77 (101) T protein:vir:78 1 MTRKDRLGNTLVGPSATGVETVARFQVQNQSGTSSRRAEMDDIGDMTEQVYTMRLPRSFTTE---LKSGSEVVWRGERWG 77 (101) T ss_pred CccccccCccccccccCCccceeeeeecCCCCcchhhhhccccCcccceeEEEecCcchhhh---ccCCceeEeeeeeee Confidence 88999999999999999999999999999999999999999999999999999999999998 899999999999999 Q ss_pred EecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 99 LFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 99 vfGd~~~y~sS~rTah~~ytirR~ 122 (122) +||||++|||||||||.+|+|||| T Consensus 78 lfGd~~~Yn~Srr~a~i~YtvkRf 101 (101) T protein:vir:78 78 VYGDPRRYNGSRRTARLEYVVRRF 101 (101) T ss_pred eecchhhccCCcceeEEeEEEeeC Confidence 999999999999999999999999 No 8 >protein:vir:9577 Length: 112 # NCBI annotation: gp43 # Family: family:all:1270 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862882;genbank:gi:32469474;genbank:GeneID:1461319 Probab=97.10 E-value=6e-06 Score=49.21 Aligned_cols=105 Identities=14% Similarity=0.180 Sum_probs=73.1 Q ss_pred CcccccCCCcCcEEEe-eeeEEEecCCCCccCCCccCcee-eEEEEecCcccCcccccccccCCCCC--CceEEEEeeec Q lcl|NC_011023. 1 MSLLDTGARYQPVTVY-PEEMVIDGDGNKRTRPSKVGIPA-IARLQVANQSGTSARRAEQDNEGFET--EKVYRMRFPRS 76 (122) Q Consensus 1 MSLLD~g~~~e~v~VY-Pee~~~D~dGNt~t~Ps~~Gvp~-~AriQv~~qsgTsarr~eqd~eG~~s--eqvy~~r~~Rs 76 (122) ||+|. | +.|+|+ +-....|.-||...-..+ +++ ..-++| +|++ +++++==++ .-.|++.|||. T Consensus 1 m~~i~-G---etVtvi~~~~tG~D~~G~p~~e~~~--e~V~nVLV~P----~s~~---d~~~~~~p~G~~v~~tla~PK~ 67 (112) T protein:vir:95 1 MGRIK-G---ITVTLIGKTKTGKDDFGHPIYENTE--IQVDNVLVVP----ASTE---DVTNQLNLTGKKASYTLGIPKG 67 (112) T ss_pred Ccccc-c---eeEEEecceeccccCCCCCeeeccc--eecCceEeCC----CChh---hcccccCcceeEEEEEEecCCC Confidence 99997 3 899999 778899999997765544 222 233333 2233 222221122 24699999999 Q ss_pred cCCCCCccCcceEEEECCeEEEEecceeeeCCCCccee--eeEEEEeC Q lcl|NC_011023. 77 FTKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALAR--VDYQIKRF 122 (122) Q Consensus 77 ~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah--~~ytirR~ 122 (122) +... =..|.|.-+|+.|-++|||..|.+.---.. +.-++-|| T Consensus 68 ~~~~----l~g~~V~~~G~~~~vvG~P~~~~~~~~P~~WN~~V~ver~ 111 (112) T protein:vir:95 68 DQNE----WKDREVRFFGRKWRTIGIPLEGIEAMMPLDWNKKVMVEAY 111 (112) T ss_pred CCCc----ccCcEEEEeCcEEEEecCCccccCCCCCCccCCeEEEEEc Confidence 9864 478999999999999999999988654322 26777788 No 9 >protein:vir:9762 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:1270 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795524;genbank:gi:28876280;genbank:GeneID:1257821 Probab=96.86 E-value=1.5e-05 Score=46.99 Aligned_cols=105 Identities=18% Similarity=0.244 Sum_probs=73.8 Q ss_pred CcccccCCCcCcEEEe-eeeEEEecCCCCccCCCccCcee-eEEEEecCcccCcccccccccCCCCCC--ceEEEEeeec Q lcl|NC_011023. 1 MSLLDTGARYQPVTVY-PEEMVIDGDGNKRTRPSKVGIPA-IARLQVANQSGTSARRAEQDNEGFETE--KVYRMRFPRS 76 (122) Q Consensus 1 MSLLD~g~~~e~v~VY-Pee~~~D~dGNt~t~Ps~~Gvp~-~AriQv~~qsgTsarr~eqd~eG~~se--qvy~~r~~Rs 76 (122) ||+|. | |.|+|+ |-....|.-||....=.. +++ ..=+++ +|+ ++.+++-=++. -.|++.|||. T Consensus 1 m~~ik-G---etVtvi~~~~tG~D~~g~p~~~~~~--e~V~nVLV~P----~s~---~d~~~~~~p~G~~v~~tl~fPK~ 67 (112) T protein:vir:97 1 MGKLR-G---ITITLIDKVTIDIDPFGNPIKKDKE--ISVDNVLVSP----ATS---DDITSQLSLSGKKAVYTLAIPKG 67 (112) T ss_pred Ccccc-c---eeEEEeccccccccCCCCceecccc--eecCcEEeCC----CCh---hhcccccCcCceEEEEEEecCCC Confidence 99997 3 999999 778899999998766432 333 222233 222 23444433443 3699999999 Q ss_pred cCCCCCccCcceEEEECCeEEEEecceeeeCCCCccee--eeEEEEeC Q lcl|NC_011023. 77 FTKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALAR--VDYQIKRF 122 (122) Q Consensus 77 ~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah--~~ytirR~ 122 (122) ++.. =..|.|.-+|+.|-++|||..|.+.---.. +.-++-|| T Consensus 68 ~~~~----lrg~~V~~~G~~~~vvG~P~~~~~~~~P~~WN~~V~Ver~ 111 (112) T protein:vir:97 68 DNHD----WGDKEVRFFGEKWRTVGLALEGIEELIPLEWNKKVMVERY 111 (112) T ss_pred CCCc----ccCcEEEEeCCeeEEecCCccccCCCCCCccCCeEEEEEc Confidence 9874 467899999999999999999987654322 26677788 No 10 >protein:vir:1641 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:1270 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695062;genbank:gi:23455753;genbank:GeneID:955487 Probab=96.07 E-value=0.00011 Score=42.32 Aligned_cols=105 Identities=17% Similarity=0.234 Sum_probs=71.9 Q ss_pred CcccccCCCcCcEEEeeee-EEEecCCCCccCCCccCcee-eEEEEecCcccCcccccccccCCCCC--CceEEEEeeec Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEE-MVIDGDGNKRTRPSKVGIPA-IARLQVANQSGTSARRAEQDNEGFET--EKVYRMRFPRS 76 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee-~~~D~dGNt~t~Ps~~Gvp~-~AriQv~~qsgTsarr~eqd~eG~~s--eqvy~~r~~Rs 76 (122) ||+|. | +.|+|+=.. ...|.-||....=.. +++ ..-++| +|++ +++++==++ .-.|++.|||. T Consensus 4 m~~ik-G---etVtvi~~~~tG~D~~g~pi~~~~~--e~V~nVLV~P----~s~~---d~~~~~~p~G~~v~~tla~PK~ 70 (115) T protein:vir:16 4 MGMIK-G---IAVTLIDKVETGKDPFGNPIYEDKE--IVVNNVLVSP----TSSD---DIVNQLTLTGKKAIYTLAIPKK 70 (115) T ss_pred ecccC-c---eeEEEecceecccCCCCCCcccccc--eEcCceeeCC----CChh---hcccccCcceeEEEEEEecCCC Confidence 99997 3 889888665 558999998776422 333 333444 2233 222221122 24699999999 Q ss_pred cCCCCCccCcceEEEECCeEEEEecceeeeCCCCcce--eeeEEEEeC Q lcl|NC_011023. 77 FTKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALA--RVDYQIKRF 122 (122) Q Consensus 77 ~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTa--h~~ytirR~ 122 (122) +... =..|.|.-+|+.|-++|||..|.+.---. -+.-++-|| T Consensus 71 ~~~~----lrg~~V~~~G~~~~vvGdP~~~~~~~~P~~WN~~V~ver~ 114 (115) T protein:vir:16 71 DTHD----WENKKVRFFGKTWRTFGEPLEGIEGLIPLDWNKKVTVEHY 114 (115) T ss_pred CCCc----ccCceEEEeCceeEEecCCCCcccccCCCccCCeEEEEEe Confidence 8774 47899999999999999999998864322 237788888 No 11 >protein:vir:94767 Length: 104 # NCBI annotation: unknown # Family: family:all:1270 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996709;genbank:gi:45597424;genbank:GeneID:2769039 Probab=89.35 E-value=0.0062 Score=32.70 Aligned_cols=98 Identities=15% Similarity=0.210 Sum_probs=63.7 Q ss_pred EEEeee-eEEEecCCCCccCCCccCcee-eEEEEecCcccCcccc-cccccCCCCCCceEEEEeeeccCCCCCccCcceE Q lcl|NC_011023. 13 VTVYPE-EMVIDGDGNKRTRPSKVGIPA-IARLQVANQSGTSARR-AEQDNEGFETEKVYRMRFPRSFTKEHGILGAQSQ 89 (122) Q Consensus 13 v~VYPe-e~~~D~dGNt~t~Ps~~Gvp~-~AriQv~~qsgTsarr-~eqd~eG~~seqvy~~r~~Rs~~~~~g~lgaqS~ 89 (122) |+|+.. ....|.-||....=.. +++ ..-+++ +|++.- .+..-+|- .-.|++.|||..... =..|. T Consensus 1 Vtl~~~~~~G~D~~g~pi~~~~~--e~V~nVLV~P----~s~~d~~~~~~p~G~--~v~~tla~PK~~~~~----l~g~~ 68 (104) T protein:vir:94 1 MTLIDKVETGKDPFGNPIYEDKE--IVVNNVLVSP----TSSDDIVNQLTLTGK--KAIYTLAIPKKDTHD----WENKK 68 (104) T ss_pred CEeccceecCcCCCCCCcccccc--eEcCceeeCC----CChhhcccccCcCce--EEEEEEecCCCCCCc----ccCce Confidence 666655 4557888887665322 333 333344 233322 12222332 346999999998774 47899 Q ss_pred EEECCeEEEEecceeeeCCCCccee--eeEEEEeC Q lcl|NC_011023. 90 IEWRGQRWALFGDATVYDSSPALAR--VDYQIKRF 122 (122) Q Consensus 90 veW~G~rw~vfGd~~~y~sS~rTah--~~ytirR~ 122 (122) |.-+|+.|-++|||..|.+.---.. +.-++-|| T Consensus 69 V~~~G~~~~vvGdP~~~~~~~~P~~WN~~V~ver~ 103 (104) T protein:vir:94 69 VRFFGKTWRTFGEPLEGIEELIPLDWNKKVTVEHY 103 (104) T ss_pred EEEeCcEEEEecCCccccCCcCCcccCCeEEEEEe Confidence 9999999999999999988654332 37788899 No 12 >protein:vir:81216 Length: 118 # NCBI annotation: gp9 # Family: family:all:10295 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456739;genbank:gi:157168382;uniprot:Q9MBJ6;genbank:GeneID:5580339 Probab=83.45 E-value=0.054 Score=27.53 Aligned_cols=108 Identities=10% Similarity=0.142 Sum_probs=72.9 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCc----cCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeec Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKR----TRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRS 76 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~----t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs 76 (122) |++.=.+ -..||-|- -.-|.+||.+ .-|.+--|+..+.|||.+++= .+ --.+..|=+.+|.=- T Consensus 1 m~~~F~~---~v~ilRa~-~~~~~yg~d~~~dw~~pv~ipV~~~vSvQPv~StE-------~~--~~r~~vVt~w~l~~P 67 (118) T protein:vir:81 1 MTVIFVN---AVTVLRAR-EVGSVYSSEKTLTWDDPVRIDVPFLVSVQPRGSTE-------GG--TDRPTVVSAWWMCTP 67 (118) T ss_pred Ceeeeee---eEEEecCC-ccccccCCCcccccCCceeeeccCcceeeecCccc-------cC--CCCceeeeeeEeecC Confidence 9987644 34445554 6778888866 456666676789999976332 21 114555656653211 Q ss_pred cCCCCCccCcceEEEEC-CeEEEEecceeeeCCCCcc---eeeeEEEEeC Q lcl|NC_011023. 77 FTKEHGILGAQSQIEWR-GQRWALFGDATVYDSSPAL---ARVDYQIKRF 122 (122) Q Consensus 77 ~~~~~g~lgaqS~veW~-G~rw~vfGd~~~y~sS~rT---ah~~ytirR~ 122 (122) -.+..+ |-|-..|+-. |.-+-+.|+|-++-++-+| .|.++.++.- T Consensus 68 pg~d~~-Lra~DRVr~a~G~~~eV~G~P~~wp~P~~t~~v~Hvea~Levv 116 (118) T protein:vir:81 68 PGTDLD-LRPEDRVELATGLQLEVVGQPLRWPDPVNQDQVHHVEANLEVV 116 (118) T ss_pred CCCCcC-CCccceeeeccccEEEEecCcccccCccccccccceeEEEEEe Confidence 111222 6899999985 9999999999998777666 5889999999 No 13 >protein:vir:98924 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:2747 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164421;genbank:gi:56694911;genbank:GeneID:3197315 Probab=82.71 E-value=0.047 Score=27.89 Aligned_cols=108 Identities=15% Similarity=0.049 Sum_probs=68.3 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCcee-eEEEEe-cCcccCcccccccccCCCCCCceEEEEeeeccC Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPA-IARLQV-ANQSGTSARRAEQDNEGFETEKVYRMRFPRSFT 78 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~-~AriQv-~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~ 78 (122) |-.++...-.+.+++- +-.-+|++|....-. ++.+ ++|||+ ..-|||...||...| -|--+ ++.-+. T Consensus 3 m~~ipk~~l~~sit~k-~~~~~dd~g~~~y~~---pv~I~nvrv~~~~~ysgt~n~rq~~~n------aviF~-ya~~S~ 71 (113) T protein:vir:98 3 MPKPPIDFLVDSFMYK-EYMGENSWSEPEYAR---PVLISNCRIDRGAEYTSTTSGRQLLYN------AVVFC-YEGMTT 71 (113) T ss_pred cCcCChhhccceEEEE-EecccCCCCCcccCC---cEeecceEecccceeeccCCCceeeee------eEEEE-ecccCc Confidence 4444544444556554 556688899875433 5555 999999 778999999999887 23222 555553 Q ss_pred CCCCccCcceEEEECCeEEEEecceeeeCC-CCcceeeeEEEE Q lcl|NC_011023. 79 KEHGILGAQSQIEWRGQRWALFGDATVYDS-SPALARVDYQIK 120 (122) Q Consensus 79 ~~~g~lgaqS~veW~G~rw~vfGd~~~y~s-S~rTah~~ytir 120 (122) . ...+..+|.|-|+|.-+-|-.=-..|.- |.+.-|-+-++. T Consensus 72 p-~~~~~~~skivfdG~eytI~~i~~~~e~~sn~v~~yELEVi 113 (113) T protein:vir:98 72 P-LPQFKAQSVLHFDGRDHVITKVIPNHEAYSKTLYSYELEVV 113 (113) T ss_pred c-ceEecCCCeEEeCCcceEEeeeccCcCCCCCceeEEEEEEC Confidence 3 2456788888888887766654444433 444444455555 No 14 >protein:vir:44 Length: 120 # NCBI annotation: gp9 # Family: family:all:2747 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463470;swissprot:trembl:q9t1b4;genbank:gi:16798792;uniprot:Q9T1B4;genbank:GeneID:922373 Probab=79.29 E-value=0.11 Score=25.93 Aligned_cols=109 Identities=9% Similarity=0.076 Sum_probs=69.3 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCcee-eEEEEecCc-ccCcccccccccCCCCCCceEEEEeeeccC Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPA-IARLQVANQ-SGTSARRAEQDNEGFETEKVYRMRFPRSFT 78 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~-~AriQv~~q-sgTsarr~eqd~eG~~seqvy~~r~~Rs~~ 78 (122) |-.++..--.+.+++- +..-.|++|....-. +|++ ++|||+... ||++.+||...| --+|. ++.-+. T Consensus 10 m~~iPk~~l~~sit~k-~~~~~d~~g~~~y~~---pv~I~nvRvd~~~~ysg~~n~rq~~~n-----aviFi--~a~~S~ 78 (120) T protein:vir:44 10 APPLPLDWLIHNISYE-AYKEEDRHNQVVYEK---GIEIEHVRVDFSKSNQIAGLSDSDRYD-----AVIFI--DAVNSM 78 (120) T ss_pred cCCcChhhccceEEEE-EecCCCCCCCCcccC---ceeccCeEEecceeeecCCCCceeeee-----eEEEE--ecccCC Confidence 3344444444666654 333345544443433 5666 899998554 688899998886 33333 566665 Q ss_pred CCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEE Q lcl|NC_011023. 79 KEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIK 120 (122) Q Consensus 79 ~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytir 120 (122) .+..+...+|.|.|+|..+-|-+=-..|.-|-..-|-+-++- T Consensus 79 p~~~~~~~gskI~f~G~eytI~~i~~~~~~sn~vh~yEleVi 120 (120) T protein:vir:44 79 NVPSDFVSRSRIFFSGKAYKIVKVIPCYATSNSVHHWEIEVI 120 (120) T ss_pred ccceecCcCCEEEeCCceEEEEeeeeccCCCCceEEEEEEeC Confidence 555556789999999999998886555655644444466666 No 15 >protein:vir:81177 Length: 109 # NCBI annotation: putative head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285814;genbank:gi:148747735;genbank:GeneID:5247220 Probab=76.50 E-value=0.13 Score=25.36 Aligned_cols=102 Identities=14% Similarity=0.162 Sum_probs=61.7 Q ss_pred cccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCCc Q lcl|NC_011023. 4 LDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHGI 83 (122) Q Consensus 4 LD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g~ 83 (122) ++.|.-+..|.++=.+...|++|+........+ .+-|.|.+. ||. ... .+.+-.++.+++++... +.- T Consensus 1 M~~g~L~~rI~i~~~~~~~d~~G~~~~~w~~~~-~~wA~v~~~--s~~--e~~--~a~~~~~~~~~~f~iR~-----~~~ 68 (109) T protein:vir:81 1 MNPGQFRHKITLMKLVTTQDEIGNTIEEWQPVR-TCWAAIKTV--NGR--EYF--AAASVQAERTYRFIIRY-----TPG 68 (109) T ss_pred CCccccCccEEEEeeeeeeCCCCCeecceeeEE-EEEEEEEec--Cch--hee--eccceeeeeeEEEEEEe-----CCC Confidence 778888899999988999999999887776665 367777774 332 111 22333444444443321 112 Q ss_pred cCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 84 LGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 84 lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) |-+.-.|.|+|+.|.|-+=... ...+.-.+|.=- T Consensus 69 i~~~~ri~~~g~~y~I~~v~~~-----~~~~~~l~i~~~ 102 (109) T protein:vir:81 69 INETMKIDYQGRLFDIQSVLND-----DEGKKTLTIIAT 102 (109) T ss_pred CCcccEEEECCeEEEEEeecCC-----ccCCcEEEEEEE Confidence 5667799999999999881111 112211122111 No 16 >protein:vir:80934 Length: 120 # NCBI annotation: gp9 # Family: family:all:2747 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468395;genbank:gi:157324969;genbank:GeneID:5601367 Probab=75.72 E-value=0.14 Score=25.21 Aligned_cols=109 Identities=7% Similarity=0.045 Sum_probs=69.0 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCcee-eEEEEecCc-ccCcccccccccCCCCCCceEEEEeeeccC Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPA-IARLQVANQ-SGTSARRAEQDNEGFETEKVYRMRFPRSFT 78 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~-~AriQv~~q-sgTsarr~eqd~eG~~seqvy~~r~~Rs~~ 78 (122) |-.++..--.+.+++- +-.-.|++|....-. +|++ +.|||+... ||++.+||...| --+|. ++.-+. T Consensus 10 m~~iPk~~l~~sit~k-~~~~~d~~g~~~y~~---pv~I~nvRvd~~~~ysg~~n~rq~~~n-----aviFi--~a~~S~ 78 (120) T protein:vir:80 10 APPLPLDWLIHNISYE-AYKEEGRHNQVVYEK---GFEIEHVRVDFSKSNQIAGLSDSDRYD-----AVIFI--DAVNSM 78 (120) T ss_pred cCCcChhhccceEEEE-EecCCCCCCCccccC---ceeccCeEEecceeeecCCCCceeeee-----eEEEE--ecccCC Confidence 4444444444666654 333344444433333 5666 899998554 688999998886 33333 565555 Q ss_pred CCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEE Q lcl|NC_011023. 79 KEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIK 120 (122) Q Consensus 79 ~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytir 120 (122) ........+|.|.|+|..+-|-+=-..|.-|-..-|-+-++- T Consensus 79 p~~~~~~~gskI~f~G~eytI~~i~~~~~~s~~vh~yEleVi 120 (120) T protein:vir:80 79 NVPDDFISRSRIFFSGKAYKIVKVIPCYATSENVHHWEIEVI 120 (120) T ss_pred ccceecccCCEEEeCCceEEEEEeeeccCCCCceeEEEEEeC Confidence 555456789999999999998886555656644545566666 No 17 >protein:vir:5258 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4880 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852763;genbank:gi:31544038;uniprot:Q776V7;genbank:GeneID:2777139 Probab=71.36 E-value=0.19 Score=24.59 Aligned_cols=102 Identities=18% Similarity=0.232 Sum_probs=58.7 Q ss_pred CcccccCCCc------CcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCC---CCCCceEEE Q lcl|NC_011023. 1 MSLLDTGARY------QPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEG---FETEKVYRM 71 (122) Q Consensus 1 MSLLD~g~~~------e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG---~~seqvy~~ 71 (122) |+|||--+.- +.++|.=.+.--.+|| - +-+-+..|+.|-|||.+.+- -+.-.|| ..+=++|+. T Consensus 1 m~~ldvs~v~ldpdF~~titv~R~~g~~~~~g-~--~~~t~~~t~~avVqP~~~~d-----lq~LpeG~ri~~sIkI~Tq 72 (123) T protein:vir:52 1 MSLINQSGRFLNSRFRQQITVQKQSGSHSASG-F--DVRYEKQQITAIVIPTSPND-----VLLLPEGERYLPSIKVYTQ 72 (123) T ss_pred CCcccccccccCcccCceEEEEccCccEeCCc-c--ccccccceEEEEEeeCChhh-----cccccccccccceEEEEec Confidence 9999954321 3456654443223333 2 33346888999999965222 2222233 456677776 Q ss_pred EeeeccCCCCCccCcceEEEECCeEEEEec------------ceeeeCCCCcceeeeEEEE Q lcl|NC_011023. 72 RFPRSFTKEHGILGAQSQIEWRGQRWALFG------------DATVYDSSPALARVDYQIK 120 (122) Q Consensus 72 r~~Rs~~~~~g~lgaqS~veW~G~rw~vfG------------d~~~y~sS~rTah~~ytir 120 (122) . .|...-.|-|+|++|=|.= =..+|.++-+-+-..+++- T Consensus 73 ~----------~L~vGD~vlw~G~~YrVi~~~d~s~YGYy~~i~~~~~~t~~~~~~~f~~t 123 (123) T protein:vir:52 73 Q----------QLNIGDLVDYRGQTYKIKTAANWGDYGYYNNIGVRHSQTAKVDSTGFTVT 123 (123) T ss_pred c----------ccccccEEEeCCcEEEEEEcCCccccceecceeecccccCccccccceeC Confidence 2 2333468999999987752 3456666555544445555 No 18 >protein:vir:1582 Length: 117 # NCBI annotation: minor capsid protein # Family: family:all:2747 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695164;swissprot:trembl:o03932;genbank:gi:23455805;uniprot:O03932;genbank:GeneID:955538 Probab=66.97 E-value=0.17 Score=24.80 Aligned_cols=94 Identities=16% Similarity=0.123 Sum_probs=60.9 Q ss_pred CcccccCCCcCcEEEee-eeEEEecCCCCccCCCccCcee-eEEEEec-CcccCcccccccccCCCCCCceEEEEeeecc Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYP-EEMVIDGDGNKRTRPSKVGIPA-IARLQVA-NQSGTSARRAEQDNEGFETEKVYRMRFPRSF 77 (122) Q Consensus 1 MSLLD~g~~~e~v~VYP-ee~~~D~dGNt~t~Ps~~Gvp~-~AriQv~-~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~ 77 (122) |-.+|..--.+.+++=- +-.-+|++|....-- +|++ ++|||.. .-|||...||...| -|--+ |+..+ T Consensus 2 m~~ipk~~l~~sitlk~~~~~~~d~yg~~~y~~---pi~I~nvrvd~~t~ysgt~n~Rq~~~n------avif~-y~~~s 71 (117) T protein:vir:15 2 MMKPPKWMCQQTITLTLTDPTKTDEWGQLLTGE---PVTIEHCVVQPQTIYSGSNNDRTIVAN------AVVFV-YAGIS 71 (117) T ss_pred CCccchhhccceEEEEEeccCCcCCCCCeeecC---ceeeeeeEecccceecccCCCCeEEec------eEEEE-ecccC Confidence 77777776667777643 435588888877633 4655 8888875 34899999999998 33333 45544 Q ss_pred CCCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 78 TKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 78 ~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) .-+ ..+..||. |+...|+| .+|||.++ T Consensus 72 ~P~-~~~~~~~~-----------g~ki~f~G------~eYtI~~i 98 (117) T protein:vir:15 72 NPL-LTVTKNNV-----------GSKLVFEG------EEYTVQKI 98 (117) T ss_pred Ccc-eEEecccc-----------cceeeeCC------eeEEeeee Confidence 321 22344432 66777777 27888887 No 19 >protein:vir:102856 Length: 107 # NCBI annotation: head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338139;genbank:gi:77020229;genbank:GeneID:3703765 Probab=66.73 E-value=0.26 Score=23.76 Aligned_cols=101 Identities=8% Similarity=0.071 Sum_probs=61.3 Q ss_pred cccCCCcCcEEEe-eeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCC Q lcl|NC_011023. 4 LDTGARYQPVTVY-PEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHG 82 (122) Q Consensus 4 LD~g~~~e~v~VY-Pee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g 82 (122) +|-|.-++.|++. |.+...|..|+....-... .++-|.|.+. ||.-. ....+-.++.++++..-.. + T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~-~~~wA~v~~~--sg~e~----~~a~~~~~~~t~~i~iR~~-~---- 68 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDA-FTVWGSFVYL--KGRKY----FEAAAANSEVQGETEIRNR-D---- 68 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEee-EEEEEEEEec--Cchhh----eeccceeeeeeEEEEEEec-C---- Confidence 7888888999998 8888889999876643333 4578888874 33211 1122234444444432221 1 Q ss_pred ccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 83 ILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 83 ~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) -|-+.-.|.|+|+.|-|.+ + .+.. .|..+.|-.- T Consensus 69 ~I~~~~ri~~~g~~y~I~~-v--~~~~---~~~~~l~~~~ 102 (107) T protein:vir:10 69 DVSADMKIKYKNVIYDIVS-V--IPTQ---DHTLLIMWKR 102 (107) T ss_pred CCCcccEEEECCeEEEEEe-e--cCCC---CCcEEEEEEE Confidence 2578889999999999987 2 2221 2333444333 No 20 >protein:vir:105006 Length: 107 # NCBI annotation: putative head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459971;genbank:gi:85701386;genbank:GeneID:3882147 Probab=66.73 E-value=0.26 Score=23.76 Aligned_cols=101 Identities=8% Similarity=0.071 Sum_probs=61.3 Q ss_pred cccCCCcCcEEEe-eeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCC Q lcl|NC_011023. 4 LDTGARYQPVTVY-PEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHG 82 (122) Q Consensus 4 LD~g~~~e~v~VY-Pee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g 82 (122) +|-|.-++.|++. |.+...|..|+....-... .++-|.|.+. ||.-. ....+-.++.++++..-.. + T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~-~~~wA~v~~~--sg~e~----~~a~~~~~~~t~~i~iR~~-~---- 68 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDA-FTVWGSFVYL--KGRKY----FEAAAANSEVQGETEIRNR-D---- 68 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEee-EEEEEEEEec--Cchhh----eeccceeeeeeEEEEEEec-C---- Confidence 7888888999998 8888889999876643333 4578888874 33211 1122234444444432221 1 Q ss_pred ccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 83 ILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 83 ~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) -|-+.-.|.|+|+.|-|.+ + .+.. .|..+.|-.- T Consensus 69 ~I~~~~ri~~~g~~y~I~~-v--~~~~---~~~~~l~~~~ 102 (107) T protein:vir:10 69 DVSADMKIKYKNVIYDIVS-V--IPTQ---DHTLLIMWKR 102 (107) T ss_pred CCCcccEEEECCeEEEEEe-e--cCCC---CCcEEEEEEE Confidence 2578889999999999987 2 2221 2333444333 No 21 >protein:vir:107606 Length: 107 # NCBI annotation: head-tail adaptor protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338190;genbank:gi:77020176;genbank:GeneID:3703737 Probab=66.73 E-value=0.26 Score=23.76 Aligned_cols=101 Identities=8% Similarity=0.071 Sum_probs=61.3 Q ss_pred cccCCCcCcEEEe-eeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCC Q lcl|NC_011023. 4 LDTGARYQPVTVY-PEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHG 82 (122) Q Consensus 4 LD~g~~~e~v~VY-Pee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g 82 (122) +|-|.-++.|++. |.+...|..|+....-... .++-|.|.+. ||.-. ....+-.++.++++..-.. + T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~-~~~wA~v~~~--sg~e~----~~a~~~~~~~t~~i~iR~~-~---- 68 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDA-FTVWGSFVYL--KGRKY----FEAAAANSEVQGETEIRNR-D---- 68 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEee-EEEEEEEEec--Cchhh----eeccceeeeeeEEEEEEec-C---- Confidence 7888888999998 8888889999876643333 4578888874 33211 1122234444444432221 1 Q ss_pred ccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 83 ILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 83 ~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) -|-+.-.|.|+|+.|-|.+ + .+.. .|..+.|-.- T Consensus 69 ~I~~~~ri~~~g~~y~I~~-v--~~~~---~~~~~l~~~~ 102 (107) T protein:vir:10 69 DVSADMKIKYKNVIYDIVS-V--IPTQ---DHTLLIMWKR 102 (107) T ss_pred CCCcccEEEECCeEEEEEe-e--cCCC---CCcEEEEEEE Confidence 2578889999999999987 2 2221 2333444333 No 22 >protein:vir:102084 Length: 107 # NCBI annotation: head-tail adaptor # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512317;genbank:gi:89152486;genbank:GeneID:3953077 Probab=66.73 E-value=0.26 Score=23.76 Aligned_cols=101 Identities=8% Similarity=0.071 Sum_probs=61.3 Q ss_pred cccCCCcCcEEEe-eeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCC Q lcl|NC_011023. 4 LDTGARYQPVTVY-PEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHG 82 (122) Q Consensus 4 LD~g~~~e~v~VY-Pee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g 82 (122) +|-|.-++.|++. |.+...|..|+....-... .++-|.|.+. ||.-. ....+-.++.++++..-.. + T Consensus 1 M~~G~L~~rI~i~~~~~~~~d~~G~~~~~w~~~-~~~wA~v~~~--sg~e~----~~a~~~~~~~t~~i~iR~~-~---- 68 (107) T protein:vir:10 1 MNPAKLDKRLTFQVKDENAKGPDGDPIDGYKDA-FTVWGSFVYL--KGRKY----FEAAAANSEVQGETEIRNR-D---- 68 (107) T ss_pred CCccccCccEEEEeceeeccCCCCccccceEee-EEEEEEEEec--Cchhh----eeccceeeeeeEEEEEEec-C---- Confidence 7888888999998 8888889999876643333 4578888874 33211 1122234444444432221 1 Q ss_pred ccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 83 ILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 83 ~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) -|-+.-.|.|+|+.|-|.+ + .+.. .|..+.|-.- T Consensus 69 ~I~~~~ri~~~g~~y~I~~-v--~~~~---~~~~~l~~~~ 102 (107) T protein:vir:10 69 DVSADMKIKYKNVIYDIVS-V--IPTQ---DHTLLIMWKR 102 (107) T ss_pred CCCcccEEEECCeEEEEEe-e--cCCC---CCcEEEEEEE Confidence 2578889999999999987 2 2221 2333444333 No 23 >protein:vir:4459 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700382;genbank:gi:23505454;genbank:GeneID:955661 Probab=64.27 E-value=0.3 Score=23.43 Aligned_cols=105 Identities=13% Similarity=0.092 Sum_probs=60.9 Q ss_pred Cccc-ccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCce--EEEEeeecc Q lcl|NC_011023. 1 MSLL-DTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKV--YRMRFPRSF 77 (122) Q Consensus 1 MSLL-D~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqv--y~~r~~Rs~ 77 (122) |.++ |-|.-++.|++.=-+.+.|..|+.......++ .+.|.|-+ .||.-. ..+.+-.++.. +++||.. T Consensus 11 ~~~~M~aG~L~~RI~i~~~~~~~D~~G~~~~~w~~~~-~vwA~v~~--~sg~E~----~~a~~~~~~~t~~i~IR~~~-- 81 (134) T protein:vir:44 11 TYLLPDPGELDQRIVIRRRVDVPADDFGVTPTYPEQI-RTWAKKAQ--PGAAAY----QGSVQIENRVTHYFTIRFRR-- 81 (134) T ss_pred eEeccCccccCccEEEEeeeeeeCCCCCeecceEeeE-EEEEEEEe--cCchhe----eeccceeeeeeEEEEEEeCC-- Confidence 7666 88888899999888889999999766554443 44666655 344211 22333344444 4555422 Q ss_pred CCCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 78 TKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 78 ~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) -|-+.-.|.|+|+.|-|.+ +...++..+.. .-.-+-= T Consensus 82 -----~It~~~RI~~~g~~y~I~~-I~~~~~~~~~L--~i~c~ev 118 (134) T protein:vir:44 82 -----GITADHEVLHDDISYRVKR-VRDLNGKRRFL--LIECEAL 118 (134) T ss_pred -----CCCcccEEEECCeEEEEEE-ecCCCcCCcEE--EEEEEEe Confidence 2567789999999999987 21112221111 0000111 No 24 >protein:vir:100244 Length: 109 # NCBI annotation: gp73 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355409;genbank:gi:77864699;genbank:GeneID:3725966 Probab=63.42 E-value=0.32 Score=23.32 Aligned_cols=102 Identities=10% Similarity=0.151 Sum_probs=62.4 Q ss_pred ccccCCCcCcEEEeeeeEEEecCCCCcc-CCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCC Q lcl|NC_011023. 3 LLDTGARYQPVTVYPEEMVIDGDGNKRT-RPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEH 81 (122) Q Consensus 3 LLD~g~~~e~v~VYPee~~~D~dGNt~t-~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~ 81 (122) +++-|.-++.|+++=.+...|.+|+..+ .....+ ++-|.|.+. ||. ...+.....-+....+++|+.. T Consensus 1 mm~~g~L~~rI~i~~~~~~~d~~G~~~~~~w~~~~-~~wA~i~~~--~g~--e~~~a~~~~~~~~~~i~iR~~~------ 69 (109) T protein:vir:10 1 MLRSSDLTEFIVIERKGGRTNENGEPLPDDWVTHD-EVWASVRFV--SGK--EHVISGAVRSSAIASIRIRFRE------ 69 (109) T ss_pred CCCccccCccEEEEeeeeccCCCCCeeccceeeEE-EEEEEEEec--Cch--heeeccceeeeeeEEEEEEecC------ Confidence 6677888899999999999999999643 233333 578888885 332 1122223333444555666432 Q ss_pred CccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEE-----eC Q lcl|NC_011023. 82 GILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIK-----RF 122 (122) Q Consensus 82 g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytir-----R~ 122 (122) -|-+.-.|.|+|+.|-|.+ +.. ++. +.-.+|. ++ T Consensus 70 -~I~~~~ri~~~g~~y~I~~-v~~-~~~----~~~l~i~c~egv~~ 108 (109) T protein:vir:10 70 -DIDSEMRIRYGDQLYDIVA-VLP-NRR----KGSLDLPVKVGEKY 108 (109) T ss_pred -CCCcccEEEECCeEEEEEe-ecc-CCC----CcEEEEEEEeeecc Confidence 1567789999999999997 221 222 2222332 11 No 25 >protein:vir:80342 Length: 108 # NCBI annotation: gp9, phage head-tail adaptor, putative # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111088;genbank:gi:134288641;genbank:GeneID:4960589 Probab=60.22 E-value=0.38 Score=22.91 Aligned_cols=104 Identities=10% Similarity=0.093 Sum_probs=60.6 Q ss_pred cccCCCcCcEEEeeeeEEEecCCCCc--cCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCC Q lcl|NC_011023. 4 LDTGARYQPVTVYPEEMVIDGDGNKR--TRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEH 81 (122) Q Consensus 4 LD~g~~~e~v~VYPee~~~D~dGNt~--t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~ 81 (122) ++-|.-++.|+++=.+...|+.||.. .|... -.+-|.|.+. ||.-. ...+..+-.....+++|+... T Consensus 1 M~~G~L~~rI~i~~~~~~~d~~G~~~~~~w~~~--~~~wA~v~~~--~~~e~--~~a~~~~~~~~~~i~iR~~~~----- 69 (108) T protein:vir:80 1 MKTGKLKERIVIERPSGETNENDEPIPGAWIVH--ARPWADVLFL--NGKEH--VISGAVRGATIASMRIRYRAG----- 69 (108) T ss_pred CCccccCccEEEEeeeeccCCCCCeeccceeeE--EEEEEEEEec--Cchhe--eeccceeeeeeEEEEEEecCC----- Confidence 78888889999999899999999843 34422 2356777764 33211 122223334445556664322 Q ss_pred CccCcceEEEECCeEEEEecceeeeCCCCcceee--eEEEEeC Q lcl|NC_011023. 82 GILGAQSQIEWRGQRWALFGDATVYDSSPALARV--DYQIKRF 122 (122) Q Consensus 82 g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~--~ytirR~ 122 (122) |-+.-.|.|+|+.|-|.+ +.. .+.++-... .-.++.- T Consensus 70 --I~~~~Ri~~~g~~y~I~~-v~~-~~~~~~l~i~~~e~v~~~ 108 (108) T protein:vir:80 70 --IDEQMRVRYDGRLYDITA-VLP-ARKRGYLDLSVKVGEKYV 108 (108) T ss_pred --CCcccEEEECCeEEEEEe-ecc-CCCCCEEEEEEEeeeecC Confidence 466789999999999997 332 222221111 1122222 No 26 >protein:vir:95261 Length: 133 # NCBI annotation: Phage hypothetical protein # Family: family:all:31736 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944894;genbank:gi:38707834;genbank:GeneID:2744047 Probab=59.69 E-value=0.39 Score=22.84 Aligned_cols=116 Identities=17% Similarity=0.136 Sum_probs=64.4 Q ss_pred CcccccCCCcCcEEEee-ee-EEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCC---CCceEEEEeee Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYP-EE-MVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFE---TEKVYRMRFPR 75 (122) Q Consensus 1 MSLLD~g~~~e~v~VYP-ee-~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~---seqvy~~r~~R 75 (122) |-||-+- --++|=+ .| -+...+|.-+..-..+-+|+.|.|||..-|--.-..++.--||=- .=++|+-.-=+ T Consensus 1 M~~~~rh---s~~~~R~~seg~Y~~~~GrWV~g~~~v~~~i~asIQP~~~ss~~~~q~~~lpeGrrit~avrIYTda~L~ 77 (133) T protein:vir:95 1 MRLLNRH---SFVVKRKVSEDGYYNDDGDWVASQDIVEVNCKGNIQPYIKGSVKNGTQIALPEGIRLTDTRILYTTYKLR 77 (133) T ss_pred CCccccc---eeEEEEeecCCceEccCCcccCCCCccceeeeeeecccccccccccchhcccCCeeeeeEEEEEeeeeee Confidence 9999864 2334433 11 222334555544444557889999996533111111101122311 12334433323 Q ss_pred ccCCCCCccCcce-EEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 76 SFTKEHGILGAQS-QIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 76 s~~~~~g~lgaqS-~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) ..+-.. ..++ .|.|+|.+|=+|.-..--+|=-++.|-+|-+-|= T Consensus 78 vage~~---~~~gDvvl~dg~eYev~~r~~w~~Gv~~isHyrY~aVR~ 122 (133) T protein:vir:95 78 TSDDVE---WNESDIVMIDGHEYEVFMTMDWSQQLSHTSHYEYIIIRR 122 (133) T ss_pred eecccc---cCCCcEEEEcCCceEEEEecchhhccccCCceeEEEEee Confidence 332211 1333 7999999999999887777878999998876665 No 27 >protein:vir:79686 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:2747 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285885;genbank:gi:148750842;genbank:GeneID:5220385 Probab=56.14 E-value=0.33 Score=23.21 Aligned_cols=94 Identities=15% Similarity=0.105 Sum_probs=58.7 Q ss_pred CcccccCCCcCcEEEee--eeEEEecCCCCccCCCccCcee-eEEEEec-CcccCcccccccccCCCCCCceEEEEeeec Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYP--EEMVIDGDGNKRTRPSKVGIPA-IARLQVA-NQSGTSARRAEQDNEGFETEKVYRMRFPRS 76 (122) Q Consensus 1 MSLLD~g~~~e~v~VYP--ee~~~D~dGNt~t~Ps~~Gvp~-~AriQv~-~qsgTsarr~eqd~eG~~seqvy~~r~~Rs 76 (122) |-.+|..--.+.+++=. |..-+|++|....-- ++++ ++|||.. .-|||...||...| -|--+ |+.. T Consensus 2 m~~ipk~~l~~sit~k~~~~~~~~D~yg~~~y~~---p~~I~nvrvd~~t~ySgt~n~rq~~~n------avif~-y~~~ 71 (118) T protein:vir:79 2 KLPIPYQMAVSTVHLKLTDQSAKKDRYGRTVPTW---EGDITKCVVNMQTTYSGTNNDRQIVAN------GLIVM-YAGY 71 (118) T ss_pred CCccchhhccceEEEEEeccccCcCCCCCeeccC---CeeeeeeEecccceecccCCCCeEEec------eEEEE-eccc Confidence 77777666666666653 334668888775433 4555 8899875 34899999999998 33333 4444 Q ss_pred cCCCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 77 FTKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 77 ~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) +.-+ ..+..|| | |+..+|+|- +|||.++ T Consensus 72 s~p~-~~~~~~s--------~---g~kivfdG~------eYtI~~i 99 (118) T protein:vir:79 72 SNPI-PTLTKEN--------L---GSKLTYQGL------DYTVTSL 99 (118) T ss_pred Cccc-cEEeccc--------c---ccceeeCCe------eEEeeee Confidence 4322 1123333 3 667777763 7888877 No 28 >protein:vir:100134 Length: 109 # NCBI annotation: gp8 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945038;genbank:gi:38707898;genbank:GeneID:2744181 Probab=46.64 E-value=0.73 Score=21.34 Aligned_cols=107 Identities=11% Similarity=0.137 Sum_probs=60.2 Q ss_pred ccccCCCcCcEEEeeeeEEEecCCCCccC-CCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCC Q lcl|NC_011023. 3 LLDTGARYQPVTVYPEEMVIDGDGNKRTR-PSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEH 81 (122) Q Consensus 3 LLD~g~~~e~v~VYPee~~~D~dGNt~t~-Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~ 81 (122) +++.|.-++.|+++=.+.+.|+.|+.... .... -.+.|.|.+. ||.-. .+ +.+=.++.++++.+... . T Consensus 1 mm~~G~L~~rI~i~~~~~~~d~~G~~~~~~w~~~-~~~wA~v~~~--s~~e~---~~-a~~~~~~~~~~~~iR~~----~ 69 (109) T protein:vir:10 1 MLKAGELTERITIEKRGGGVNENGEPLPGDWVEH-ASVWANVRFL--SGKEY---VV-SGAIHSSAIASMRIRFR----R 69 (109) T ss_pred CCCccccCccEEEEeeeeeeCCCCCeeccceEEE-EEEEEEEEec--Cchhe---ee-ccceeeeeEEEEEEEeC----C Confidence 78889889999999999999999985432 2222 2456777664 34222 21 22223344444433321 1 Q ss_pred CccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 82 GILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 82 g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) + |-+.-.|.|+|+.|-|-+-....+...-....... .+| T Consensus 70 ~-I~~~~ri~~~g~~y~I~~v~~d~~~~~~~l~~~~~-e~~ 108 (109) T protein:vir:10 70 D-VDSEMRIRHDGRLYDIAAVLPNRRQGYVDLSVKVG-EKY 108 (109) T ss_pred C-CCcccEEEECCeEEEEeecCCCCCCCeEEEEEEEE-Eee Confidence 2 46677999999999999843222211111111111 233 No 29 >protein:vir:1436 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536365;genbank:gi:17975170;genbank:GeneID:929148 Probab=39.14 E-value=1 Score=20.51 Aligned_cols=105 Identities=12% Similarity=0.143 Sum_probs=58.9 Q ss_pred cccCCCcCcEEEeeeeEEEecCCCCc-cCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCC Q lcl|NC_011023. 4 LDTGARYQPVTVYPEEMVIDGDGNKR-TRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHG 82 (122) Q Consensus 4 LD~g~~~e~v~VYPee~~~D~dGNt~-t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g 82 (122) ++-|.-++.|+++=.+...|..|+.. +.... =-.+-|.|.+.. |. .....+..+-+....+++|+.. + T Consensus 1 M~~G~L~~rI~i~~~~~~~d~~G~~~~~~w~~-~~~~wA~i~~~~--g~--e~~~a~~~~~~~t~~i~iR~~~------~ 69 (108) T protein:vir:14 1 MEAGKLKERIVIERPSGETNENDEPIPGAWVV-HARPWADVRFLN--GK--EHVISGAVRGATVASMRIRYRA------G 69 (108) T ss_pred CCccccCccEEEEeeeeccCCCCCeeccceee-EEEEEEEEEecC--ch--heeeccceeeeeeEEEEEEecC------C Confidence 77888889999998899999999854 32222 234677777743 31 1122222333344445555421 2 Q ss_pred ccCcceEEEECCeEEEEecceeeeCCCCcceee-eEEEEeC Q lcl|NC_011023. 83 ILGAQSQIEWRGQRWALFGDATVYDSSPALARV-DYQIKRF 122 (122) Q Consensus 83 ~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~-~ytirR~ 122 (122) |.+.-.|.|+|+.|-|.+ +...+....+.-. +..+ ++ T Consensus 70 -I~~~~ri~~~g~~y~I~~-v~~~~~~~~l~i~~~~~v-~~ 107 (108) T protein:vir:14 70 -IGDQMRIRYDGRLYDITA-VLPARKRGYLDLSVKVGE-KY 107 (108) T ss_pred -CCcccEEEECCeEEEEEe-eccCCCCCEEEEEEEeee-ec Confidence 566779999999999996 4333221111000 1011 11 No 30 >protein:vir:3872 Length: 146 # NCBI annotation: putative head-tail joining protein # Family: family:all:28619 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680489;swissprot:trembl:p94213;genbank:gi:22296529;uniprot:P94213;genbank:GeneID:951708 Probab=36.99 E-value=1.1 Score=20.27 Aligned_cols=107 Identities=16% Similarity=0.188 Sum_probs=62.7 Q ss_pred Cccc-ccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCC-CCceEEEEeeeccC Q lcl|NC_011023. 1 MSLL-DTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFE-TEKVYRMRFPRSFT 78 (122) Q Consensus 1 MSLL-D~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~-seqvy~~r~~Rs~~ 78 (122) |.++ +-|.-++-|++.=.....|..|+....+-.+ --+=|.|.+ ++|.= --+.+.+-|.. .--++++|+ |. T Consensus 34 mp~~M~~gkLn~RItfqk~~~~~d~~g~~~~~w~~v-~tvWA~V~~--~~grE-~~~~~a~~~~~e~ti~F~IRY-~~-- 106 (146) T protein:vir:38 34 MVILMRINRMTERIAFVSYESKKVNGVPVDGVIVKH-MTVWAEVPK--VPIRE-ANDPQTKLGTRKDSPTFLVRF-LT-- 106 (146) T ss_pred ceeeeccccCCccEEEEEeeeeecCCCcCCCcceee-eEEEEeeec--cchhh-hHhhhhhhhhhcceeEEEEEe-cC-- Confidence 8874 9999999999997777778777755443111 113566665 33311 01113333342 224588996 11 Q ss_pred CCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 79 KEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 79 ~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) .-+ |..--.|.|+|+.|-|.+ =.|...|.+|+...= T Consensus 107 -~~~-I~~~mRI~y~gk~YeI~~------I~pd~~~k~~~~I~a 142 (146) T protein:vir:38 107 -AEE-IQPTWRIQWRGNEYQITG------LDPDYERRDLTTITA 142 (146) T ss_pred -Ccc-CCcccEEEECCeEEEEee------eCCccccCcEEEEEE Confidence 111 233449999999999988 234455666655444 No 31 >protein:vir:107716 Length: 132 # NCBI annotation: gp19 # Family: family:all:7161 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024867;genbank:gi:48697509;genbank:GeneID:2948332 Probab=34.97 E-value=1.3 Score=20.04 Aligned_cols=106 Identities=22% Similarity=0.353 Sum_probs=60.0 Q ss_pred Cc-----ccccCCC---cCcEEEee-eeEEEecCCCCccCCCccCceee-EEEEecCcccCcccccccccCCCCCCceEE Q lcl|NC_011023. 1 MS-----LLDTGAR---YQPVTVYP-EEMVIDGDGNKRTRPSKVGIPAI-ARLQVANQSGTSARRAEQDNEGFETEKVYR 70 (122) Q Consensus 1 MS-----LLD~g~~---~e~v~VYP-ee~~~D~dGNt~t~Ps~~Gvp~~-AriQv~~qsgTsarr~eqd~eG~~seqvy~ 70 (122) |+ ||---+. .++|.-|= +....|.-||-.+-= ..++|++ +-+|... ++.--+-|-+..|+|+ T Consensus 1 m~iPG~NLl~~A~~vI~~q~V~y~rf~~Rt~n~~gq~i~~y-~~p~~i~~gS~Q~V~-------~~~v~~~GLd~~~~Yv 72 (132) T protein:vir:10 1 MSVPGLNLLAMALGLIASETVEYFAETGRTKQPNGVFIASY-ASPVPIEECSVQAVD-------RSKYTDLGLDFQKTYV 72 (132) T ss_pred CcccchhHHHHHhhhhccccchhhcccccccccccceeeee-cCCcccccceeeecC-------hhhheecccceeeeee Confidence 32 2211110 13333332 234556666655432 2268886 8888744 5556778999999999 Q ss_pred EEe-eec-cCC-CCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEE------EeC Q lcl|NC_011023. 71 MRF-PRS-FTK-EHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQI------KRF 122 (122) Q Consensus 71 ~r~-~Rs-~~~-~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~yti------rR~ 122 (122) +=| |.. +.. ++| .|.-++.|+|.||=|.|+..-|- .+..-. =-+ T Consensus 73 ~lf~s~~~i~~iqRg--~agD~liwnGrr~~v~g~~dW~~------QDGW~~~lcv~~G~~ 125 (132) T protein:vir:10 73 TWFVPNQAFTTIKRG--KAGDVLEWNGGRYQMNGGIDWTG------QDSWGTATCVLIGPA 125 (132) T ss_pred eEeecchhhhhcccC--CCCCEEEECCeEEEecccceeee------eccceEEEEEEecCc Confidence 988 554 333 333 46678999999999999766542 111000 000 No 32 >protein:vir:96107 Length: 133 # NCBI annotation: conserved hypothetical protein ORF026 # Family: family:all:7161 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294443;genbank:gi:149408340;genbank:GeneID:5237226 Probab=33.15 E-value=1.4 Score=19.82 Aligned_cols=111 Identities=18% Similarity=0.128 Sum_probs=59.4 Q ss_pred Cc-----ccccCCC---cCcEEEee-eeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEE Q lcl|NC_011023. 1 MS-----LLDTGAR---YQPVTVYP-EEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRM 71 (122) Q Consensus 1 MS-----LLD~g~~---~e~v~VYP-ee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~ 71 (122) |+ ||---+. .++|.-|= +....|.-||-.+-= ..++|+.+-+|... ++.--+-|-+..|+|++ T Consensus 1 m~iPG~NLl~~A~~VI~~Q~V~y~rf~~Rt~n~~gq~i~~y-~~p~~i~gS~Q~V~-------~~~v~~~GLd~~~~Yv~ 72 (133) T protein:vir:96 1 MVIPGANLLRMAFSVIGTQLVQYRKFEQRTKNSQAQYVSVF-GEPFQLAASIQRVR-------RDQYVQFNLEFQRNYVM 72 (133) T ss_pred CcccchhHHHHHhhhhccccchhhcccccccccccceeeee-cCCccceeeEEecC-------hhheeecCcceeeeeeE Confidence 21 1111100 13333332 234456666655431 22788889999744 55567789999999998 Q ss_pred EeeeccCCCCCccCcceEEEECCeEEEEecceeeeC------------CCCcceee-eEEE Q lcl|NC_011023. 72 RFPRSFTKEHGILGAQSQIEWRGQRWALFGDATVYD------------SSPALARV-DYQI 119 (122) Q Consensus 72 r~~Rs~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~------------sS~rTah~-~yti 119 (122) =|+=.--..--.=.|.-.+.|+|.||=|.|+..-|- |+---|.. ..|. T Consensus 73 lf~s~~i~~iqRg~agD~liwnGrr~~v~g~~dW~~QDGW~~~lcv~~G~~~ga~~~~~~~ 133 (133) T protein:vir:96 73 IFANFEMVDLDRDLAGDQFIWTGRVFQLESQGSWFYQDGWGVCLAVDIGTAKLAEDGTLTF 133 (133) T ss_pred eecCcceeecccCCCCCEEEECCeEEEecccceeeeeccceEEEEEeecCCCCcCCceecC Confidence 764221111111135678999999999999876552 11111111 1111 No 33 >protein:vir:4343 Length: 118 # NCBI annotation: Orf10 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061506;genbank:gi:9635594;genbank:GeneID:1262867 Probab=30.65 E-value=1.6 Score=19.53 Aligned_cols=104 Identities=11% Similarity=0.089 Sum_probs=57.6 Q ss_pred cccCCCcCcEEEeeeeEEEecCCCCccCC-CccC----ceeeEEEEecCcccCcccccccccCCCCCCc--eEEEEeeec Q lcl|NC_011023. 4 LDTGARYQPVTVYPEEMVIDGDGNKRTRP-SKVG----IPAIARLQVANQSGTSARRAEQDNEGFETEK--VYRMRFPRS 76 (122) Q Consensus 4 LD~g~~~e~v~VYPee~~~D~dGNt~t~P-s~~G----vp~~AriQv~~qsgTsarr~eqd~eG~~seq--vy~~r~~Rs 76 (122) ++-|.-+..|++.=-+.+.|++||..+.. -.++ -.+.|.|.+. ||.-. ..+..-.++. .+++|+. + T Consensus 1 M~~G~l~~rI~i~~~~~~~d~~~G~~~~~w~~~~~~~~~~~WA~v~~~--sg~e~----~~a~~~~~~~~~~f~iRy~-~ 73 (118) T protein:vir:43 1 MLAYRMRHRIQFQRQVHTQDPDTGEETTTWETVLFSGHADLPAEVLTG--PGREL----IAADATQAETTARINCRWF-P 73 (118) T ss_pred CCccccCccEEEEeeeeecCCCCCcccCceeeeeecccceEEEEEEec--Cccce----eecccchheeeEEEEEEec-c Confidence 88899899999998888899876643322 1222 3567888773 34222 1222223444 4455542 1 Q ss_pred cCCCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEE--eC Q lcl|NC_011023. 77 FTKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIK--RF 122 (122) Q Consensus 77 ~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytir--R~ 122 (122) .. .+ |-+.-.|.|+|+.|-|.+-+... +..+ -.+|. .= T Consensus 74 ~~--~~-It~~~Ri~~~g~~y~I~~v~~~~-~~~~----~l~i~~~e~ 113 (118) T protein:vir:43 74 VE--RL-ELYTWRVLWDGRVYNITSAETDV-TARR----EWRLRCSDG 113 (118) T ss_pred cc--cC-CCcccEEEECCeEEEEEecCCcc-cCCe----EEEEEEEEe Confidence 11 12 35567999999999998854322 2211 11111 11 No 34 >protein:vir:1890 Length: 110 # NCBI annotation: gp9 # Family: family:all:116 # ACLAME annotation(s): phi:0000012 - phage head tail joining # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037670;genbank:gi:9634128;genbank:GeneID:1262503 Probab=28.28 E-value=1.8 Score=19.23 Aligned_cols=105 Identities=13% Similarity=0.080 Sum_probs=58.4 Q ss_pred cccCCCcCcEEEeeeeEEEecC-CCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEEEeeeccCCCCC Q lcl|NC_011023. 4 LDTGARYQPVTVYPEEMVIDGD-GNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEHG 82 (122) Q Consensus 4 LD~g~~~e~v~VYPee~~~D~d-GNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~g 82 (122) ++-|.-+.-|+++=.+.+.|.. |+........+ .+-|-|.+. ||. .....+.+.-.....+++|+.. T Consensus 1 M~~G~L~~rI~i~~~~~~~d~~~G~~~~~~~~~~-~~wA~v~~~--~~~--e~~~a~~~~~~~~~~~~iR~~~------- 68 (110) T protein:vir:18 1 MQAGKLRHRITLQEPVKVQNPTTGAVINTWRDVA-TVRAEVSPL--SAR--EFIAAQASQGEITTRIVIRYRA------- 68 (110) T ss_pred CCccccCccEEEEeeeeeecCCCCccccceeeeE-EEEEEEEec--Cch--heeecceeeeeeeEEEEEEecC------- Confidence 6788888999999888889976 66654443333 456766664 332 1112222223334455566432 Q ss_pred ccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 83 ILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 83 ~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) -|.+--.|.|+|+.|-|.+=....++..+-. +-.-+.= T Consensus 69 ~I~~~~ri~~~g~~y~I~~v~~d~~~~~~~l--~i~~~e~ 106 (110) T protein:vir:18 69 GVTRKHRILFRGAVYNIHGVLPDPKSGREYL--TLPCSEG 106 (110) T ss_pred CCCcccEEEECCeEEEEEeccCCcccCCeEE--EEEEEEe Confidence 2466779999999999988322222222211 1111111 No 35 >protein:vir:99571 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:7161 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039798;genbank:gi:126011048;genbank:GeneID:4818266 Probab=24.53 E-value=2.2 Score=18.74 Aligned_cols=114 Identities=18% Similarity=0.146 Sum_probs=61.8 Q ss_pred Cc-----ccccCCC---cCcEEEee-eeEEEecCCCCccCCCccCceeeEEEEecCcccCcccccccccCCCCCCceEEE Q lcl|NC_011023. 1 MS-----LLDTGAR---YQPVTVYP-EEMVIDGDGNKRTRPSKVGIPAIARLQVANQSGTSARRAEQDNEGFETEKVYRM 71 (122) Q Consensus 1 MS-----LLD~g~~---~e~v~VYP-ee~~~D~dGNt~t~Ps~~Gvp~~AriQv~~qsgTsarr~eqd~eG~~seqvy~~ 71 (122) |+ ||---+. .++|.-|= +....|.-||-.+-= ..++|+.+-+|... ++.--+-|-+..++|++ T Consensus 1 m~iPG~NLl~~A~~VI~~Q~V~y~rf~~Rt~n~~gq~i~~y-~~p~~i~gS~Q~V~-------~~~v~~~GLd~~~~Yv~ 72 (131) T protein:vir:99 1 MIVPGSNLFMQAASVIALTPVPYLRFTQRVLNPARQWITTY-AAAVDVPMSVQRVP-------RNKYVQFGLEFQRNYVR 72 (131) T ss_pred CcccchHHHHHHhhhhccccchhhcccccccccccceeeee-cCCccceeeEEecC-------hhheeecCcceeeeEEE Confidence 21 1111110 13333332 234456666654431 22788899999854 55567789999999998 Q ss_pred EeeeccCCCCCccCcceEEEECCeEEEEecceeeeC--CCCcceeeeEEEEeC Q lcl|NC_011023. 72 RFPRSFTKEHGILGAQSQIEWRGQRWALFGDATVYD--SSPALARVDYQIKRF 122 (122) Q Consensus 72 r~~Rs~~~~~g~lgaqS~veW~G~rw~vfGd~~~y~--sS~rTah~~ytirR~ 122 (122) =|+=.--..--.=.|.-.+.|+|.||=|.|+..-|. |=.-..-.+.-||-= T Consensus 73 lfts~~i~~iqRg~agD~liwnGrr~~v~g~~dW~~QDGW~~~lcv~~Gi~~~ 125 (131) T protein:vir:99 73 LFAPIEMVDLDRDCGGDMIIWHGRQHKIESQNTWYLQDGWAMSLAVDLGIRSD 125 (131) T ss_pred EeecCcceecccCCCCCEEEECCeEEEecccceeeeeccceEEEEEEeecccC Confidence 654221110111146678999999999999876652 111112224444433 No 36 >protein:vir:4789 Length: 123 # NCBI annotation: putative minor capsid protein 2 # Family: family:all:1526 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150169;swissprot:trembl:q94m42;genbank:gi:15088780;uniprot:Q94M42;genbank:GeneID:955991 Probab=23.26 E-value=2.3 Score=18.57 Aligned_cols=97 Identities=15% Similarity=0.188 Sum_probs=56.0 Q ss_pred CcccccCCCcCcEEEeeeeEEEecCCCCccCCCccCcee-eEEEEe-cCcccCcccccccccC-CCCCCceEEEEeeecc Q lcl|NC_011023. 1 MSLLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPA-IARLQV-ANQSGTSARRAEQDNE-GFETEKVYRMRFPRSF 77 (122) Q Consensus 1 MSLLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~-~AriQv-~~qsgTsarr~eqd~e-G~~seqvy~~r~~Rs~ 77 (122) |-..|.-.-++. +.|=+..-+|++|....-- ++-+ ++|||. ..-||+.-.|+.-+|- -|.-+.|--+ |+--+ T Consensus 7 ~p~ipK~~l~ds-it~k~~~~kddyg~~~y~e---pvtI~nvr~dr~t~ysG~~N~r~~taN~~~~~k~aVifl-Y~~~s 81 (123) T protein:vir:47 7 LKGIDKRLLKDV-LTIKKVADKNDYGDEVYSE---PLTIKNVRFDRSVGGSGNRNSKTGTGNSKSRQKQGVIYL-YPSLS 81 (123) T ss_pred cCcCChhhccee-EEEEEecCCCCcCCceecc---ceEeeeeEEeeccccCCcccCcceecccccccCceEEEE-ecccc Confidence 555555443344 4455777778887765322 3333 788887 4567877788877773 4444444434 33222 Q ss_pred CCCCCccCcceEEEECCeEEEEecceeeeCCCCcceeeeEEEEeC Q lcl|NC_011023. 78 TKEHGILGAQSQIEWRGQRWALFGDATVYDSSPALARVDYQIKRF 122 (122) Q Consensus 78 ~~~~g~lgaqS~veW~G~rw~vfGd~~~y~sS~rTah~~ytirR~ 122 (122) .+.-.-.|.|.. .++| +.+|||.++ T Consensus 82 -------~p~~d~~~~~~k-v~dg------------~~EYtI~kI 106 (123) T protein:vir:47 82 -------FVTVDNSWMGAK-VNDG------------IGDYTINGF 106 (123) T ss_pred -------ccceeccccceE-EEcC------------CccEEecce Confidence 233456677644 4444 347889888 No 37 >protein:vir:2506 Length: 116 # NCBI annotation: hypothetical protein # Family: family:all:7208 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569747;genbank:gi:18496897;genbank:GeneID:932259 Probab=20.91 E-value=2.7 Score=18.23 Aligned_cols=105 Identities=24% Similarity=0.319 Sum_probs=51.0 Q ss_pred ccccCCCcCcEEEeeeeEEEecCCCCccCCCccCceeeEEEEe-cCcccCcccccccccCCCCCCceEEEEeeeccCCCC Q lcl|NC_011023. 3 LLDTGARYQPVTVYPEEMVIDGDGNKRTRPSKVGIPAIARLQV-ANQSGTSARRAEQDNEGFETEKVYRMRFPRSFTKEH 81 (122) Q Consensus 3 LLD~g~~~e~v~VYPee~~~D~dGNt~t~Ps~~Gvp~~AriQv-~~qsgTsarr~eqd~eG~~seqvy~~r~~Rs~~~~~ 81 (122) .|-.- .|.+-+---.+.+..-|.+.+-|----- ..--+.+ .+|+|+.|--+.+ -- .-|+|.-| + T Consensus 1 ~~P~P--~eV~H~tr~KvG~n~aGQa~~EP~~R~R-~V~~~~p~~ne~~~aAala~r----~v--tE~tM~T~------~ 65 (116) T protein:vir:25 1 MFPTP--HKVVHVDRVKVGENAMGQAITEPRTRTR-WVTSLRPRVNESGTAAALADR----VI--TEYTMATP------E 65 (116) T ss_pred CCCCC--eeeeeeeeeeecCCcccccccCCccCcc-cccccccccccccchhhhcCc----ee--eeeeeecc------c Confidence 11100 1111111111233444554443310000 0011222 2456655522221 11 12667433 3 Q ss_pred CccCcceEE-EECCeEEEEecceeeeCCCCcceeeeE--EEEeC Q lcl|NC_011023. 82 GILGAQSQI-EWRGQRWALFGDATVYDSSPALARVDY--QIKRF 122 (122) Q Consensus 82 g~lgaqS~v-eW~G~rw~vfGd~~~y~sS~rTah~~y--tirR~ 122 (122) +-.-+.++| .|+|.+.-+-||+.+||+-|---.-.| ++||- T Consensus 66 ~DW~~~d~V~~w~GR~FkV~G~V~DYNlGPF~F~PGY~V~LRrV 109 (116) T protein:vir:25 66 SDWTHGDQVTDARGRKFKVHGDVEDYNLGPFGFTPGYRVTLRRV 109 (116) T ss_pred CCCCcccceecccCcEEEecCCccccccCCCCCCCCeeeeeeec Confidence 334556666 599999999999999999986655544 67888 Done!