Query lcl|NC_017981.1_cdsid_YP_006383622.1 [gene=DIBBI_gp15] [protein=hypothetical protein] [protein_id=YP_006383622.1] [location=11587..12132] Match_columns 181 No_of_seqs 19 out of 22 Neff 4.3 Searched_HMMs 1612 Date Thu Nov 7 13:36:37 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_15 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_15_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:101569 Length: 178 99.6 4.2E-17 2.6E-20 110.4 13.4 171 1-177 1-178 (178) 2 protein:vir:106763 Length: 178 99.6 6.7E-17 4.1E-20 109.3 13.8 168 1-177 1-178 (178) 3 protein:vir:80105 Length: 162 99.6 1.2E-17 7.4E-21 113.4 9.4 154 1-169 1-162 (162) 4 protein:vir:78612 Length: 178 99.5 1.3E-16 8.3E-20 107.6 13.7 166 1-177 1-178 (178) 5 protein:vir:3637 Length: 178 # 99.5 3.4E-16 2.1E-19 105.4 13.2 171 1-173 1-178 (178) 6 protein:vir:94061 Length: 175 99.5 1.1E-15 6.6E-19 102.7 12.7 166 1-175 1-175 (175) 7 protein:vir:5259 Length: 213 # 99.2 3E-14 1.9E-17 94.7 7.7 158 1-169 47-213 (213) 8 protein:vir:99607 Length: 186 97.7 4E-06 2.5E-09 50.2 12.9 163 1-172 1-186 (186) 9 protein:vir:107715 Length: 189 97.2 3.8E-05 2.4E-08 44.8 12.8 161 1-170 1-189 (189) 10 protein:vir:95262 Length: 199 96.2 0.00047 2.9E-07 38.8 12.5 171 1-181 1-198 (199) 11 protein:vir:96106 Length: 183 96.1 0.00067 4.1E-07 38.0 13.1 160 6-167 1-183 (183) 12 protein:vir:81066 Length: 118 93.4 0.0042 2.6E-06 33.6 10.1 113 6-150 1-118 (118) 13 protein:vir:97070 Length: 118 93.4 0.0051 3.2E-06 33.1 10.5 113 6-150 1-118 (118) 14 protein:vir:10368 Length: 118 92.3 0.0098 6.1E-06 31.6 10.5 108 6-150 1-118 (118) 15 protein:vir:1244 Length: 145 # 85.5 0.052 3.3E-05 27.6 9.8 139 1-168 1-145 (145) 16 protein:vir:95765 Length: 127 81.9 0.081 5E-05 26.6 9.5 124 6-152 1-127 (127) 17 protein:vir:1438 Length: 115 # 78.3 0.066 4.1E-05 27.1 7.1 115 1-147 1-115 (115) 18 protein:vir:5979 Length: 134 # 74.3 0.075 4.7E-05 26.7 6.3 128 1-150 3-134 (134) 19 protein:vir:99537 Length: 125 73.9 0.16 0.0001 24.9 8.4 121 6-148 1-125 (125) 20 protein:vir:100116 Length: 115 73.6 0.095 5.9E-05 26.2 6.6 115 1-147 1-115 (115) 21 protein:vir:97325 Length: 145 70.4 0.21 0.00013 24.3 10.0 139 1-168 1-145 (145) 22 protein:vir:94794 Length: 145 69.2 0.23 0.00014 24.1 9.8 139 1-168 1-145 (145) 23 protein:vir:95961 Length: 145 69.2 0.23 0.00014 24.1 9.8 139 1-168 1-145 (145) 24 protein:vir:95111 Length: 145 63.9 0.31 0.00019 23.4 9.9 139 1-168 1-145 (145) 25 protein:vir:94096 Length: 141 63.8 0.31 0.00019 23.4 9.6 135 1-157 1-141 (141) 26 protein:vir:105892 Length: 141 63.8 0.31 0.00019 23.4 9.6 135 1-157 1-141 (141) 27 protein:vir:96260 Length: 141 63.8 0.31 0.00019 23.4 9.6 135 1-157 1-141 (141) 28 protein:vir:93736 Length: 145 63.0 0.33 0.0002 23.3 9.8 137 1-168 1-145 (145) 29 protein:vir:94488 Length: 145 63.0 0.33 0.0002 23.3 9.8 137 1-168 1-145 (145) 30 protein:vir:97421 Length: 145 63.0 0.33 0.0002 23.3 9.8 137 1-168 1-145 (145) 31 protein:vir:94768 Length: 111 52.8 0.55 0.00034 22.0 8.2 107 15-146 1-111 (111) 32 protein:vir:93602 Length: 114 52.1 0.56 0.00035 21.9 9.3 107 15-147 1-114 (114) 33 protein:vir:195 Length: 115 # 50.4 0.61 0.00038 21.8 9.2 107 15-147 1-115 (115) 34 protein:vir:1643 Length: 111 # 45.0 0.79 0.00049 21.2 8.1 108 15-146 1-111 (111) 35 protein:vir:3618 Length: 129 # 44.1 0.82 0.00051 21.1 8.1 126 1-148 1-129 (129) 36 protein:vir:80371 Length: 115 43.9 0.83 0.00051 21.0 6.4 102 28-147 1-115 (115) 37 protein:vir:100242 Length: 114 43.2 0.86 0.00053 21.0 6.6 113 1-147 1-114 (114) 38 protein:vir:9764 Length: 111 # 42.9 0.87 0.00054 20.9 8.3 108 15-146 1-111 (111) 39 protein:vir:9579 Length: 111 # 40.5 0.97 0.0006 20.7 9.0 111 15-146 1-111 (111) 40 protein:vir:96125 Length: 140 39.8 1 0.00062 20.6 9.6 131 1-161 3-140 (140) 41 protein:vir:94921 Length: 125 39.5 1 0.00063 20.5 9.3 118 6-149 1-125 (125) 42 protein:vir:96894 Length: 140 34.6 1.3 0.00079 20.0 9.1 134 1-161 1-140 (140) 43 protein:vir:96485 Length: 128 34.2 1.3 0.00081 19.9 8.4 120 5-148 1-128 (128) 44 protein:vir:105337 Length: 145 31.5 1.5 0.00093 19.6 9.4 139 1-168 1-145 (145) 45 protein:vir:107096 Length: 145 30.9 1.5 0.00096 19.6 9.4 139 1-168 1-145 (145) 46 protein:vir:4348 Length: 121 # 26.9 1.9 0.0012 19.1 9.8 114 8-150 1-121 (121) 47 protein:vir:2741 Length: 128 # 22.0 2.5 0.0016 18.4 8.2 124 5-148 1-128 (128) No 1 >protein:vir:101569 Length: 178 # NCBI annotation: gp05 # Family: family:all:3176 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958109;genbank:gi:41057655;genbank:GeneID:2716826 Probab=99.57 E-value=4.2e-17 Score=110.39 Aligned_cols=171 Identities=15% Similarity=0.105 Sum_probs=137.7 Q ss_pred CCcc-cc-ChhhhhHHHHHHHHHHHHHHhhCCccEeecCCC-CCCCCceEEEEEcccCCcc--ccccCCCCCcCceeeee Q lcl|NC_017981. 1 MAEQ-ET-TIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQN-HNISPPYATLHVIARRDVG--NPEFGPVNDDGIQQIHQ 75 (181) Q Consensus 1 ~~~~-~~-t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg-~~P~~PYatl~vr~~~~~~--~~~~g~vdd~G~q~V~~ 75 (181) |..+ -. .-|+-|++.+-+-.-+++.++-+.-||+.+||+ |-|.++|+.|..-.+...+ ...|++ .+|...+.+ T Consensus 1 m~~~vtl~~Te~di~~alr~fL~~lf~p~~~~eVi~gqqN~~p~P~g~fiimt~l~~~~lsT~~~~Y~~--~~~~~~~~~ 78 (178) T protein:vir:10 1 MTNPVTLRPSEDEVFDTLWEWVTSLFDPASAAQIAKADQNATSTLYGTYALIRPGVREALNQTIRSYDA--TAQTVANEL 78 (178) T ss_pred CCCcccccccHHHHHHHHHHHHHHHcCCCCCceEEeecccCCCcCCCCEEEEecccccccccceeecCC--cchheeeee Confidence 5443 11 225667766665555555566678899999998 4788899999764433232 223433 466888999 Q ss_pred eeeEEEEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeecccc Q lcl|NC_017981. 76 VIEGTLSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDNV 155 (181) Q Consensus 76 h~eatvelq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~V 155 (181) |.+-.|.|.|||.+|.+.+..++.-+|.+-.++++++.+|+.+-++.-+.+|+.+++++||+|-++++.+.|+.+.+-.. T Consensus 79 ~~~~~~QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~p~qlp~iN~e~QyE~Rwt~~~~lQ~Np~vt~pq 158 (178) T protein:vir:10 79 HTGYWYQVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGEGQYEQRYMVKLHAQVNQVATAPQ 158 (178) T ss_pred eeEEEEEEEeecCChhHHHHHHHHHhcChhHHHHHhcCccccccCCCccccCccCccccccceEEEEEEEEeeeEEeehh Confidence 99999999999999999999999999999999999988899999999999999999999999999999999999999999 Q ss_pred CceeeeEEecCCc--cccceecee Q lcl|NC_017981. 156 GLIEHVKLTGEID--EVPVSFDID 177 (181) Q Consensus 156 g~Ie~Vevtg~~~--~~p~~~~i~ 177 (181) .-+.+|.||-.+. =+|+ + T Consensus 159 dFfd~~~i~~~~~~di~~~----~ 178 (178) T protein:vir:10 159 LFFTEAPATTATPVDIVPL----D 178 (178) T ss_pred hhccccccccccccceecc----C Confidence 9999999977552 2343 2 No 2 >protein:vir:106763 Length: 178 # NCBI annotation: gp05 # Family: family:all:3176 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944313;genbank:gi:38638612;genbank:GeneID:2657394 Probab=99.56 E-value=6.7e-17 Score=109.27 Aligned_cols=168 Identities=15% Similarity=0.165 Sum_probs=132.7 Q ss_pred CCcc---ccChhhhhHHHHHHHHHHHHHHhh----CCccEeecCCC-CCCCCceEEEEEcccCCcc--ccccCCCCCcCc Q lcl|NC_017981. 1 MAEQ---ETTIDQFVPDEVEAAAYRVLSPLL----PEMMLCYEGQN-HNISPPYATLHVIARRDVG--NPEFGPVNDDGI 70 (181) Q Consensus 1 ~~~~---~~t~~~~i~~l~~~~a~~~ls~ll----~~pvI~AdqNg-~~P~~PYatl~vr~~~~~~--~~~~g~vdd~G~ 70 (181) |..+ .+| |+-|++.+ .+.|..|+ ++.||+.+||+ |-|.++|+.|..-.+...+ ...|++ .+|+ T Consensus 1 m~~~vtl~~T-e~di~~al----r~fL~~lf~~~~~~eVi~gqqN~~p~P~g~fiimt~l~~~~lsT~~~~Y~~--~~~~ 73 (178) T protein:vir:10 1 MTNPVTLRPS-EDEVFDTL----WGWVTSLFDPALASQIAKADQNATSTLYGTYALIRPGVREALNQTIRTYDA--TAGT 73 (178) T ss_pred CCCccccccc-HHHHHHHH----HHHHHHhcCcccCceEEEeccCCCCccCCCEEEEecccccccccceeeccC--Ccce Confidence 5443 222 44555544 44555554 45699999998 4788999999764333222 223443 4589 Q ss_pred eeeeeeeeEEEEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeee Q lcl|NC_017981. 71 QQIHQVIEGTLSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQR 150 (181) Q Consensus 71 q~V~~h~eatvelq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~ 150 (181) +.+.+|.+-+|.|.|||.+|-+.+..++.-+|.+-.++++++.+|+.+=++.-+.+|+.+++++||+|-++++.+.|+.+ T Consensus 74 ~~~~~~~~~~~QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~P~q~p~iN~e~QyE~Rwt~~~~lQ~Np~ 153 (178) T protein:vir:10 74 VSNELHTGYWYQVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGEGQYEQRYMVKLHAQVNQV 153 (178) T ss_pred eeeeeeeEEEEEEEeecCChhHHHHHHHHHhcChhHHHHHhcCcccceecCCcccCCccCccccccceEEEEEEEEeeeE Confidence 99999999999999999999999999999999999999999877889999999999999999999999999999999999 Q ss_pred eccccCceeeeEEecCCccccceecee Q lcl|NC_017981. 151 FTDNVGLIEHVKLTGEIDEVPVSFDID 177 (181) Q Consensus 151 ~~d~Vg~Ie~Vevtg~~~~~p~~~~i~ 177 (181) .+-...-+.+|.||-.+ +|++ ++.+ T Consensus 154 vt~pqdFfd~v~i~~~~-~vd~-~~~~ 178 (178) T protein:vir:10 154 ATAPQQFFTEVPATTAT-PVDI-VPLD 178 (178) T ss_pred Eeehhhhcccccccccc-ccce-eccC Confidence 99999999999997765 2222 2223 No 3 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=99.56 E-value=1.2e-17 Score=113.37 Aligned_cols=154 Identities=11% Similarity=0.084 Sum_probs=111.9 Q ss_pred CCccccChhhhhHHH--HHHHHHHHHHHh-hCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeee Q lcl|NC_017981. 1 MAEQETTIDQFVPDE--VEAAAYRVLSPL-LPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVI 77 (181) Q Consensus 1 ~~~~~~t~~~~i~~l--~~~~a~~~ls~l-l~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ 77 (181) |.. .++-||- .=.+.-.-|-.+ ..+.||.+++.|++|..||+|+.|.+ ++.+.+ +...=++.- T Consensus 1 ~~~-----~~~~~~~~~lv~~ii~~i~~~~~gl~vI~~~~~g~~p~yPF~TY~v~~-------pyi~~~--~~~~~~e~~ 66 (162) T protein:vir:80 1 MPN-----DTAGYDYGKLVKTLINAVNELSGGLQLIESSSGGEQPEYPFCQYTITS-------PYIAIS--PDIVEGEQF 66 (162) T ss_pred CCC-----ccccccHHHHHHHHHHHHHhhhcceeEEEccCCCCCCCCCeEEEEEec-------CccccC--CcccCCcce Confidence 321 2222321 111111111223 24799999999999999999998753 223322 222334566 Q ss_pred eEEEEEEEechHH---HHHHHHHHHHhCChhHHHHHHH-cCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeecc Q lcl|NC_017981. 78 EGTLSVKVFGGAA---RRHLDNLRSRTKKMSSRDIMTR-ERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTD 153 (181) Q Consensus 78 eatvelq~fG~~A---~~~ld~L~~~lk~ps~~~~l~~-~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d 153 (181) ++++++.|+-.++ ++++++|..-++++.....+++ +||++.|++.++|.++| ...+||+|..+|++|||..++.+ T Consensus 67 ~~~isi~~~S~~~~eAl~la~~l~~~f~~~~~~~~~~~~~gIvvvdv~~~~~R~~~-~~~~yerR~GFD~~~Rv~r~~e~ 145 (162) T protein:vir:80 67 EIVISLTWRALSGHQALNLANITNKYFRSQKGRFFMQENGGIVVVSVQNSGLRDTF-ISIEYERSAGIDLRLRVVDSYSS 145 (162) T ss_pred EEEEEEEEEeCCHHHHHHHHHHHHHHhhcCCceeeeeecCcEEEEecCCCccceeE-eeeeeeeeecceEEEEEeecccc Confidence 7999999998765 5666689999999999888865 69999999999999999 67789999999999999999999 Q ss_pred ccCceeeeEEe-cCCcc Q lcl|NC_017981. 154 NVGLIEHVKLT-GEIDE 169 (181) Q Consensus 154 ~Vg~Ie~Vevt-g~~~~ 169 (181) +...||+++++ +++|- T Consensus 146 ~~~tIe~i~~~~~~~g~ 162 (162) T protein:vir:80 146 EIQEIDNISFTNENLGG 162 (162) T ss_pred ccceeeeecccCcccCC Confidence 99999999872 33333 No 4 >protein:vir:78612 Length: 178 # NCBI annotation: BcepNY3gp04 # Family: family:all:3176 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294841;genbank:gi:149882904;genbank:GeneID:5291083 Probab=99.54 E-value=1.3e-16 Score=107.62 Aligned_cols=166 Identities=16% Similarity=0.171 Sum_probs=132.5 Q ss_pred CCcc---ccChhhhhHHHHHHHHHHHHHHhhC----CccEeecCCC-CCCCCceEEEEEcccCCcc--ccccCCCCCcCc Q lcl|NC_017981. 1 MAEQ---ETTIDQFVPDEVEAAAYRVLSPLLP----EMMLCYEGQN-HNISPPYATLHVIARRDVG--NPEFGPVNDDGI 70 (181) Q Consensus 1 ~~~~---~~t~~~~i~~l~~~~a~~~ls~ll~----~pvI~AdqNg-~~P~~PYatl~vr~~~~~~--~~~~g~vdd~G~ 70 (181) |..+ .+| |+-|++.+ .+.|..|++ +.||+.+||+ |-|.++|+.|..-.+...+ ...|++ .+|+ T Consensus 1 m~~~vtl~~T-e~di~~al----r~fL~~lf~~~~~~eVi~gqqN~~p~P~g~fiimt~l~~~~lsT~~~~Y~~--~~~~ 73 (178) T protein:vir:78 1 MTNPVTLRPS-EDEVFDTL----WGWVTSLFDPALASQIAKADQNATSTLYGTYALIRPGVREALNQTIRTYDA--TAGT 73 (178) T ss_pred CCCccccccc-HHHHHHHH----HHHHHHhcCcccCceEEEeccCCCCccCCCEEEEecccccccccceeeccC--ccce Confidence 5443 222 44555543 445555554 5699999998 4788999999764333222 223443 4589 Q ss_pred eeeeeeeeEEEEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeee Q lcl|NC_017981. 71 QQIHQVIEGTLSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQR 150 (181) Q Consensus 71 q~V~~h~eatvelq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~ 150 (181) +.+.+|.+-+|.|.|||.+|-+.+..++.-+|.+-.++++++.+|+.+=++.-+.+|+.+++.+||+|-++++.+.|+.+ T Consensus 74 ~~~~~~~~~~~QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~P~q~p~iN~e~QyE~Rwt~~~~lQ~Np~ 153 (178) T protein:vir:78 74 VSNELHTGYWYQVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGENQFEQRYMVKLHAQVNQV 153 (178) T ss_pred eeeeeeeEEEEEEEEecCChhHHHHHHHHHhcChhHHHHHhcCcccceecCCcccCCccCccccccceEEEEEEEEeeeE Confidence 99999999999999999999999999999999999999999877889999999999999999999999999999999999 Q ss_pred eccccCceeeeEEecCCc--cccceecee Q lcl|NC_017981. 151 FTDNVGLIEHVKLTGEID--EVPVSFDID 177 (181) Q Consensus 151 ~~d~Vg~Ie~Vevtg~~~--~~p~~~~i~ 177 (181) .+-...-+.+|.||-.+. =+|+ + T Consensus 154 vt~pqdFfd~v~v~~~~~~di~~~----~ 178 (178) T protein:vir:78 154 ATAPQQFFTEVPATTATPVDIVPL----D 178 (178) T ss_pred Eeehhhhccccccceecccceecc----C Confidence 999999999999977552 2343 2 No 5 >protein:vir:3637 Length: 178 # NCBI annotation: gp05 # Family: family:all:3176 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705632;genbank:gi:23752317;genbank:gi:47835036;genbank:GeneID:955731 Probab=99.50 E-value=3.4e-16 Score=105.37 Aligned_cols=171 Identities=15% Similarity=0.104 Sum_probs=136.5 Q ss_pred CCcc-cc-ChhhhhHHHHHHHHHHHHHHhhCCccEeecCCC-CCCCCceEEEEEcccCCcc--ccccCCCCCcCceeeee Q lcl|NC_017981. 1 MAEQ-ET-TIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQN-HNISPPYATLHVIARRDVG--NPEFGPVNDDGIQQIHQ 75 (181) Q Consensus 1 ~~~~-~~-t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg-~~P~~PYatl~vr~~~~~~--~~~~g~vdd~G~q~V~~ 75 (181) |..+ -. .-|+-|++.+-+-.-+++.++-+.-||+.+||+ |-|.++|+.|..-.+...+ ...|++ .+|...+.+ T Consensus 1 m~~~vtl~~Te~di~~alr~fL~~lf~p~~~~eVi~gqqN~~p~P~g~fiimt~l~~~~lsT~~~~Y~~--~~~~~~~~~ 78 (178) T protein:vir:36 1 MTNPVTLRPSEDEVFDTLWEWVTSLFDPASAAQIAKADQNATSTLYGTYALIRPGVREALNQTIRSYDA--TAQTVANEL 78 (178) T ss_pred CCCcccccccHHHHHHHHHHHHHHHcCCCCCceEEeecccCCCcCCCCEEEEecccccccccceeecCC--cchheeeee Confidence 5444 11 225667766665555566566678899999998 4788899999764433232 223433 466888999 Q ss_pred eeeEEEEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeecccc Q lcl|NC_017981. 76 VIEGTLSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDNV 155 (181) Q Consensus 76 h~eatvelq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~V 155 (181) |.+-.|.|.|||.+|.+.+..++.-+|.+-.++++++.+|+.+-++.-+.+|+.+++++||+|-++++.+.|+.+.+-.. T Consensus 79 ~~~~~~QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~p~qlp~iN~e~QyE~Rwt~~~~lQ~Np~vt~pq 158 (178) T protein:vir:36 79 HTGYWYQVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGEGQYEQRYMVKLHAQVNQVETAPQ 158 (178) T ss_pred eeEEEEEEEeecCChhHHHHHHHHHhcChhHHHHHhcCccccccCCCccccCccCccccccceEEEEEEEEeeeEEeehh Confidence 99999999999999999999999999999999999988899999999999999999999999999999999999999999 Q ss_pred CceeeeEEecCC--ccccce Q lcl|NC_017981. 156 GLIEHVKLTGEI--DEVPVS 173 (181) Q Consensus 156 g~Ie~Vevtg~~--~~~p~~ 173 (181) .-+.++.+|-.. +-+|+. T Consensus 159 dFfd~~~~~~~~~vd~~~~~ 178 (178) T protein:vir:36 159 LFFTEAPATTATPVDIVPLD 178 (178) T ss_pred hhcccCccceeeecccccCC Confidence 999999876444 234443 No 6 >protein:vir:94061 Length: 175 # NCBI annotation: hypothetical protein # Family: family:all:3176 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453620;genbank:gi:84662656;genbank:GeneID:5142571 Probab=99.46 E-value=1.1e-15 Score=102.66 Aligned_cols=166 Identities=16% Similarity=0.081 Sum_probs=130.7 Q ss_pred CCccccCh-hhhhHHHHHHHHHHHHHHhhC-----CccEeecCCC-CCCCCceEEEEEcccCCc--cccccCCCCCcCce Q lcl|NC_017981. 1 MAEQETTI-DQFVPDEVEAAAYRVLSPLLP-----EMMLCYEGQN-HNISPPYATLHVIARRDV--GNPEFGPVNDDGIQ 71 (181) Q Consensus 1 ~~~~~~t~-~~~i~~l~~~~a~~~ls~ll~-----~pvI~AdqNg-~~P~~PYatl~vr~~~~~--~~~~~g~vdd~G~q 71 (181) |...-.+| |+-|++. ..+.|..|++ .-||+.+||+ |-|.++|+.|-.-.+... ....|++ ++|++ T Consensus 1 m~avtl~~Te~di~~a----lr~fL~~lf~lp~~~~eVi~g~qN~~p~P~g~fi~mt~l~~~~lsT~~~~Y~~--~~g~~ 74 (175) T protein:vir:94 1 MTAATLTPTEDAVFDA----MFGFLAKVLDLPDDTQAIIKGFQNLSSTPTGSCVVVSPGMMTRQDFGSRLYDP--GLSKV 74 (175) T ss_pred CcceeccccHHHHHHH----HHHHHHHHcCCCCCCceEEEeccCCCCccCCCEEEEecccccccccceeeecc--cccce Confidence 77766666 5556654 4456666665 4699999997 478899999976433222 2223433 66899 Q ss_pred eeeeeeeEEEEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeee Q lcl|NC_017981. 72 QIHQVIEGTLSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRF 151 (181) Q Consensus 72 ~V~~h~eatvelq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~ 151 (181) .+.+|.+-+|.|.|||.+|.+.+..++.-||.+-.+++.. ..|+.+-++.-+.+|+.+++++||+|-++++.+.|+.+. T Consensus 75 ~~~~~~q~~~QvD~YG~~A~d~A~~~~tl~Rs~~a~~~~~-~~~~PLYad~p~qlp~iN~e~QyE~Rwt~~~~lQ~Np~v 153 (175) T protein:vir:94 75 VIEAHLTYSYQVDCYGPLAPTWASVISVAWKSMWGVDNTA-PAFAPLYADAPQQLNIVNSEGQFEQRFMVRLFGQVNQRV 153 (175) T ss_pred eeeeeeEEEEEEEeecCChHHHHHHHHHHhcChhHhhhhh-cccccccCcCccccCccCccccccceEEEEEEEEeeeEE Confidence 9999999999999999999999999999999997777553 359999999999999999999999999999999999999 Q ss_pred ccccCceeeeEEecCCccccceec Q lcl|NC_017981. 152 TDNVGLIEHVKLTGEIDEVPVSFD 175 (181) Q Consensus 152 ~d~Vg~Ie~Vevtg~~~~~p~~~~ 175 (181) +-...-+++|.|+-.-.+ ..+| T Consensus 154 t~pq~F~d~v~v~~~~~d--~~~P 175 (175) T protein:vir:94 154 ALPQDFFDSVQLTSLNIA--DLLP 175 (175) T ss_pred eeehhhcccCCcceeece--ecCC Confidence 999999999987433321 1222 No 7 >protein:vir:5259 Length: 213 # NCBI annotation: hypothetical protein # Family: family:all:4879 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852764;genbank:gi:31544039;uniprot:Q7Y5T6;genbank:GeneID:2753558 Probab=99.23 E-value=3e-14 Score=94.72 Aligned_cols=158 Identities=20% Similarity=0.194 Sum_probs=121.6 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhC---CccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCcee-eeee Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLP---EMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQ-IHQV 76 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~---~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~-V~~h 76 (181) |+ -||+..+=. ...-.++..||. --||-+|+. -.|-+||+|...-.+-..+-.++.- ||.|- +.-. T Consensus 47 M~--TTTvS~lD~----~~LRq~ir~Ll~LPeg~Vid~~~d-a~p~~pFITV~~l~ss~lG~a~reF---dg~rEvit~S 116 (213) T protein:vir:52 47 MD--TTTISGFDI----VRLRKLIQQALQLPDGVVIGGWLP-ENPLSAFITVDVLMSSETGIARRDF---DGKRERITMS 116 (213) T ss_pred hh--hhccccccH----HHHHHHHHHHHhCCcceecCCcCC-CCCCCCeEEEeecccchhhhhhhhc---cCchhhhhhh Confidence 33 244433322 233344444433 568999988 4466799999776555455333333 35554 6668 Q ss_pred eeEEEEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccccC Q lcl|NC_017981. 77 IEGTLSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDNVG 156 (181) Q Consensus 77 ~eatvelq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~Vg 156 (181) ++-+++++.||.+||.+|.+|+.-|.....-.+|+..|.+++...+|+|+|+. ....||+||.+|+.+-|.-...-.|- T Consensus 117 ~et~vs~tafGtnAy~ll~kl~a~L~ss~al~~LK~l~aGlVr~S~v~nLsa~-iggg~e~RArfdltfsH~HrVet~l~ 195 (213) T protein:vir:52 117 MQNTVSFSCFGTNAMAQCYKLKAILQSSVILQALKTMNVGIVSFSDVRNLTAT-IGSDYEERGQFDAVFSHHHIVDTPLD 195 (213) T ss_pred hccEEEEEEeChhHHHHHHHHHHHHhHHHHHHHHHHhccceeeecccccccee-cCCCchhheeeeeeeeeeeeeccchH Confidence 88999999999999999999999999999999999999999999999999999 55569999999999999999988888 Q ss_pred ceeeeE-----EecCCcc Q lcl|NC_017981. 157 LIEHVK-----LTGEIDE 169 (181) Q Consensus 157 ~Ie~Ve-----vtg~~~~ 169 (181) .+.+|+ +|-.|++ T Consensus 196 ~~~~v~~rt~hit~~~~~ 213 (213) T protein:vir:52 196 PIKRVEQRTNHLTQQIGE 213 (213) T ss_pred HHHHhhhhhhhHHHhhcC Confidence 888887 4666666 No 8 >protein:vir:99607 Length: 186 # NCBI annotation: hypothetical protein # Family: family:all:7160 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039797;genbank:gi:126011047;genbank:GeneID:4818302 Probab=97.66 E-value=4e-06 Score=50.16 Aligned_cols=163 Identities=12% Similarity=0.124 Sum_probs=122.1 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHh----hCCccEeecCCCC--CCCCceEEEEEcccCCccccccCCC-CCcCceee Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPL----LPEMMLCYEGQNH--NISPPYATLHVIARRDVGNPEFGPV-NDDGIQQI 73 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~l----l~~pvI~AdqNg~--~P~~PYatl~vr~~~~~~~~~~g~v-dd~G~q~V 73 (181) |-+. +|+.+....+++.| ++-+|+-++|=-+ ++..|-+-..+-..+..++.+.+.- |++|...+ T Consensus 1 M~dn---------eL~~~ir~~L~a~L~sqg~~~~Vv~~fQPtqQG~~~~~~~Ff~~~~~~~~G~~~q~r~y~~~~~~~~ 71 (186) T protein:vir:99 1 MTDL---------ELIDTVVPYIEAAIASQGWPFLVVQKDQPTQQGVPSIGTVFFELLFNIEYGSPATENQYNAASNQFD 71 (186) T ss_pred CCch---------hHHHHHHHHHHHHHHhcCCCceeeccccCcccCccCCCeEEEEeccCCcccCChhhcccccccccce Confidence 5443 34444444455444 7789998887533 7788888887766666666666543 56666554 Q ss_pred ---eeeeeEEEEEEEec------hH---HHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeE Q lcl|NC_017981. 74 ---HQVIEGTLSVKVFG------GA---ARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAIL 141 (181) Q Consensus 74 ---~~h~eatvelq~fG------~~---A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~l 141 (181) +++.+.|..+|.|= .+ |-|.+.-.+.-++--...++|.+.|++|...+.|.+-++++|..+||.+=.+ T Consensus 72 ~ietQl~e~TyQiqalv~~d~~~~~~~TA~Dv~~~vR~i~qS~~fi~~l~~q~vgI~RaseIr~~~f~nD~~~~E~~PsF 151 (186) T protein:vir:99 72 EIETQLVITTIQISALIPQDPKNIDQPTANDVLQYVKRYVASRNIARELVKKKVNIYRVSEVRAQWFEDDNHQFSNHPNF 151 (186) T ss_pred eehheeeeeeEEEEEeeccCccchhhhhHHHHHHHHHHHHhhHHHHHHHHhcCcceeccccccccccccCCCCCccCcee Confidence 45888999999994 22 3456666677777778889999999999999999999999999999999999 Q ss_pred EEEEEEeeeeccccCceeeeE---E-ecCCccccc Q lcl|NC_017981. 142 DMGFRFTQRFTDNVGLIEHVK---L-TGEIDEVPV 172 (181) Q Consensus 142 el~irY~~~~~d~Vg~Ie~Ve---v-tg~~~~~p~ 172 (181) ||.+-|..++.-+|..|.+|. | -=+-|.+|+ T Consensus 152 di~vTh~~~it~~t~av~~~~p~~~~~~~~~~~~~ 186 (186) T protein:vir:99 152 DLEVTHSGKVTFSVAAVDRCDPLEVPPYGSGTFPV 186 (186) T ss_pred EEEEEeeeeEeechhhhhhcccccCCCCCCCcCCC Confidence 999999999999999999997 3 112234555 No 9 >protein:vir:107715 Length: 189 # NCBI annotation: gp18 # Family: family:all:7160 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024866;genbank:gi:48697508;genbank:GeneID:2948331 Probab=97.16 E-value=3.8e-05 Score=44.82 Aligned_cols=161 Identities=13% Similarity=0.042 Sum_probs=112.1 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhC----------CccEeecCC--CCCCCCceEEEEEcccCCccccccC----C Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLP----------EMMLCYEGQ--NHNISPPYATLHVIARRDVGNPEFG----P 64 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~----------~pvI~AdqN--g~~P~~PYatl~vr~~~~~~~~~~g----~ 64 (181) |-+ .+-+--+.+++..++-+.... .||..|+|= --+|..|-+..... .+.++=+-| . T Consensus 1 M~d-----n~l~~~ir~~L~a~L~~~g~~~~~t~~~p~~~~V~~afQPtqQG~~~~~~iff~~i--~~~~~G~p~r~~~~ 73 (189) T protein:vir:10 1 MDE-----NQLLIVLIAMLDAGLAAYAANPLNTRSLPDGITSQRAFQPRQEGAPEGPAIIINHT--LTRQIGFPKRFSKQ 73 (189) T ss_pred Cch-----hhHHHHHHHHHHHHHHHhcccchhhccccccceeeecccccccccccCCeEEEeec--cccCcccccccccc Confidence 443 333333444444444433333 688999776 23677775555321 122222333 3 Q ss_pred CCCcCceeee---eeeeEEEEEEEec----h--H---HHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhh Q lcl|NC_017981. 65 VNDDGIQQIH---QVIEGTLSVKVFG----G--A---ARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSE 132 (181) Q Consensus 65 vdd~G~q~V~---~h~eatvelq~fG----~--~---A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea 132 (181) .|++|.+.++ ++.+.|..+|.|= . + |-|.+.-.+.-++--...++|.+.|++|...+-+.+-++++|. T Consensus 74 ~~~~~~~~~~ietQ~~e~TyQiqalv~~d~~~~~~~TA~Dv~~~vR~i~qS~~f~~~l~~q~vgI~RaseIr~~~f~nD~ 153 (189) T protein:vir:10 74 LNPPDGDWIKVQGQRYESTYHIEALIPQSPAEPNAMTESDVLNVARFILQSDEAVAFLTPQNIGLLRVTELRSNYIIDDK 153 (189) T ss_pred cccccchhhhhhheeeeeeeEEEEeeccCccchhhhhHHHHHHHHHHHHhhHHHHHHHHhcCcceeccccccccccccCC Confidence 5777777755 6888999999887 2 2 2455556667777778889999999999999999999999999 Q ss_pred hhheeeeeEEEEEEEeeeeccccCceeeeEEecCCccc Q lcl|NC_017981. 133 IYAEPSAILDMGFRFTQRFTDNVGLIEHVKLTGEIDEV 170 (181) Q Consensus 133 ~~yE~RA~lel~irY~~~~~d~Vg~Ie~Vevtg~~~~~ 170 (181) .+||.+=.+|+.+-|..+++-+|..|..++- .+-.| T Consensus 154 ~~~E~~PsFd~~vTh~~~~t~~t~~v~~~~~--~~hRi 189 (189) T protein:vir:10 154 EQNENVPFFEITFSHRIEFSATVPLIDVFKT--VFNRV 189 (189) T ss_pred CCCccCceeEEEEEecceeecccceeeeecc--ccccC Confidence 9999999999999999999999999988761 11111 No 10 >protein:vir:95262 Length: 199 # NCBI annotation: Phage hypothetical protein # Family: family:all:31737 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944895;genbank:gi:38707835;genbank:GeneID:2744048 Probab=96.21 E-value=0.00047 Score=38.82 Aligned_cols=171 Identities=18% Similarity=0.213 Sum_probs=111.9 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhC-----Cc-cEeecC------CCCCCCCceEEEEEcccCCccccccCCCC-- Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLP-----EM-MLCYEG------QNHNISPPYATLHVIARRDVGNPEFGPVN-- 66 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~-----~p-vI~Adq------Ng~~P~~PYatl~vr~~~~~~~~~~g~vd-- 66 (181) |.-.---+++ =+-.++-+++..+|. +| ||.+|+ -|-.|..||++. ++....- .||-+- T Consensus 1 mql~~~~~e~----g~v~~~v~~ig~~la~dkn~rp~vi~~~psdnsndkglkpd~pfitv--~~~~~~t--pyg~~ld~ 72 (199) T protein:vir:95 1 MQLETAELEK----GLVRTLVDVIGHRLARDKNNRPNVIRAYPSDNSNDKGLKPDQPFITV--YCQDAAT--PYGWVLDK 72 (199) T ss_pred CcccHHHHHh----hHHHHHHHHHHHHhhhccCCCCcEEEecCCCCCccCCCCCCCceEEE--EEecccC--chhhhhhh Confidence 3211111222 222345566666665 33 788876 355899999998 4433222 455443 Q ss_pred --CcCceeeeeeeeEEEEEEEechHHHHHHHHHHHHhCChhHHHHH-HHcCeeeecCCcc-ccchhhhhhhhheeeeeEE Q lcl|NC_017981. 67 --DDGIQQIHQVIEGTLSVKVFGGAARRHLDNLRSRTKKMSSRDIM-TRERFIIYATEQV-LDVNITRSEIYAEPSAILD 142 (181) Q Consensus 67 --d~G~q~V~~h~eatvelq~fG~~A~~~ld~L~~~lk~ps~~~~l-~~~giai~d~g~V-qdlt~L~ea~~yE~RA~le 142 (181) ++|..-.+-.-.-.+-+.+-|++|+-+.-++++||.+.|.++++ +.-|-+++|.|++ ||-+-| +. -||..+-+- T Consensus 73 fve~~~~cyri~f~i~v~it~~gk~~~si~~~~kqrle~ss~r~~~~e~tg~~vldtg~~p~~~~~l-~t-d~e~s~plv 150 (199) T protein:vir:95 73 FVEDDVVCYRIAFQIPVLITVNGKGAHSIMLELKQRLEMSSVRDLILEETGATVLDTGAIPNDYTYL-NT-DFENSAPLV 150 (199) T ss_pred hhhcCceEEEeeeeeEEEEEeeCCcchhhHHHHHhhhhHHHHHHHHhhhcCceeeeccCCCcccchh-cc-cCCCCCcee Confidence 44555555555567778889999999999999999999999986 5578999999998 677777 43 599999988 Q ss_pred EEEEEeeeecc-ccCceeeeEEecCCc--------cccceeceeeecC Q lcl|NC_017981. 143 MGFRFTQRFTD-NVGLIEHVKLTGEID--------EVPVSFDIDILTP 181 (181) Q Consensus 143 l~irY~~~~~d-~Vg~Ie~Vevtg~~~--------~~p~~~~i~~~t~ 181 (181) +++--.+-..| +-|+||+|-+.|.+. ++-..||.+---- T Consensus 151 ~~l~~~svl~d~~~~iiervi~dg~l~y~egq~~~~~~ihldvdskgv 198 (199) T protein:vir:95 151 VTLVKNSVLKDERGSIIERVIVDGELVYEEGQEPPEYTIHLDVDSKGV 198 (199) T ss_pred eeeeehheeeecccceeeeeeeccEEEecCCCCCceEEEEEeccccCC Confidence 88877766655 568999999877653 2222222211100 No 11 >protein:vir:96106 Length: 183 # NCBI annotation: hypothetical protein ORF027 # Family: family:all:7160 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294444;genbank:gi:149408341;genbank:GeneID:5237225 Probab=96.15 E-value=0.00067 Score=37.99 Aligned_cols=160 Identities=11% Similarity=0.031 Sum_probs=110.9 Q ss_pred cChhhhhHHHHHHHHHHHHHHhh-----CCccEeecCCC--CCCCCceEEEEEcccCCccccccCC---CCCcCcee--- Q lcl|NC_017981. 6 TTIDQFVPDEVEAAAYRVLSPLL-----PEMMLCYEGQN--HNISPPYATLHVIARRDVGNPEFGP---VNDDGIQQ--- 72 (181) Q Consensus 6 ~t~~~~i~~l~~~~a~~~ls~ll-----~~pvI~AdqNg--~~P~~PYatl~vr~~~~~~~~~~g~---vdd~G~q~--- 72 (181) .|=.+-|-.++=+....+|++|. +.+|+-++|=- -+|..|-+....--....| +-+. -|..+.+. T Consensus 1 M~D~eLi~~v~~~i~~~~l~~l~~~~~~d~~V~q~fQPtqQG~~~~~~i~f~~i~~~~~G--w~~r~~~yd~~~~~~~~i 78 (183) T protein:vir:96 1 MFDGELIEKLVVELTSAMTSAKETLQFPDFEVVQKAQPTQQGTSTKPAIFFQKLFDIPRG--WPATDWYLDNAARKYVEI 78 (183) T ss_pred CCchhHHHHHHHHHHHHHHHHHHhccCCCeeEEecccccccccccCCeEEEEecCCCCCC--Ccccccccccccchhhhh Confidence 33333333333345555666553 57888888863 3788886666322222122 2222 34444333 Q ss_pred eeeeeeEEEEEEEec----hH-----HHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEE Q lcl|NC_017981. 73 IHQVIEGTLSVKVFG----GA-----ARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDM 143 (181) Q Consensus 73 V~~h~eatvelq~fG----~~-----A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel 143 (181) =+++.+.|..+|.|= .+ |-|.+.-.+.-++--...++|.+.|++|...+.+.+-++++|..+||.+=.+|| T Consensus 79 etQl~e~TyQiqalv~~d~~~~~~~TA~Dv~~~vR~i~qS~~f~~~l~~q~vgI~RaseIr~~~f~nD~~~~E~~PsFd~ 158 (183) T protein:vir:96 79 TRQHVETTFQISSLHWQNPELDHVVTAADIANYVRAYFQARSTIQRVKELDFLILRVSHISNEAFENDNHQFEFHPSFDM 158 (183) T ss_pred hheeeeeeEEEEEeeccCccchhhhhHHHHHHHHHHHHhhHHHHHHHHhcCcceeccccccccccccCCCCCCcCceeEE Confidence 346888999999987 22 245556666777777888999999999999999999999999999999999999 Q ss_pred EEEEeeeeccccCceeeeE-EecCC Q lcl|NC_017981. 144 GFRFTQRFTDNVGLIEHVK-LTGEI 167 (181) Q Consensus 144 ~irY~~~~~d~Vg~Ie~Ve-vtg~~ 167 (181) .+-|..++.-+|..+.+.. |-=+| T Consensus 159 ~vTh~~~it~~t~av~~~~~~~~~~ 183 (183) T protein:vir:96 159 VVTYNQYIRLHENAAYSADGVLIGI 183 (183) T ss_pred EEEeeeeEeecchhhhhcceeeeeC Confidence 9999999999999988765 21122 No 12 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=93.43 E-value=0.0042 Score=33.63 Aligned_cols=113 Identities=6% Similarity=0.043 Sum_probs=56.4 Q ss_pred cChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCC-CCCCceEEEEEcccCCccccccCC-CCCcCceeeeeeeeEEEEE Q lcl|NC_017981. 6 TTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNH-NISPPYATLHVIARRDVGNPEFGP-VNDDGIQQIHQVIEGTLSV 83 (181) Q Consensus 6 ~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~-~P~~PYatl~vr~~~~~~~~~~g~-vdd~G~q~V~~h~eatvel 83 (181) .|+++.+..++ +++.+.+|=+ +.=-. .|..||+++..-+.. .....-|. .|. +..+++| T Consensus 1 Ms~e~~l~a~L--------~~~~~~Rvyp-~~aP~~~~~~Pyiv~q~vsg~-p~~~l~G~~~~~---------~~~rvQI 61 (118) T protein:vir:81 1 MSYGRVLKDLL--------DPVFSGRVYA-DIPPDSPPLDAYAIYQRVGGV-PVYWQEGGMPEK---------VNARVQI 61 (118) T ss_pred CchHHHHHHHH--------HhhcCCcccc-ccCCCCCccCceEEEEecCCc-ccccccCCCCCc---------cceeEEE Confidence 66666665554 4554433322 11111 244699999655533 22222222 111 2367999 Q ss_pred EEech---HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeee Q lcl|NC_017981. 84 KVFGG---AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQR 150 (181) Q Consensus 84 q~fG~---~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~ 150 (181) ++|+. +|++..++++..+.-... +.. +.+-....|..-==.|+.+||.++|.++ T Consensus 62 dvyA~t~~~A~~l~~av~~al~~~~~--------~~~-----~~~~~d~ye~dt~l~r~~~Df~iw~~~~ 118 (118) T protein:vir:81 62 QIWSRSKQEAYLATVQVLRLVSEAPD--------MQV-----LSQPIDDYVREIKLYGSRVDVSMWYPIT 118 (118) T ss_pred EEeeCCHHHHHHHHHHHHHHhhhccc--------eee-----ccCCccccccccCceeEEEEEEEEecCC Confidence 99985 456666666655532110 111 1110011111111258999999999999 No 13 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=93.38 E-value=0.0051 Score=33.14 Aligned_cols=113 Identities=8% Similarity=0.060 Sum_probs=56.7 Q ss_pred cChhhhhHHHHHHHHHHHHHHhhCCccEe--ecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEE Q lcl|NC_017981. 6 TTIDQFVPDEVEAAAYRVLSPLLPEMMLC--YEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSV 83 (181) Q Consensus 6 ~t~~~~i~~l~~~~a~~~ls~ll~~pvI~--AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvel 83 (181) .+++.- .+.+|+++.+.+|=+ |-++ .|..||+++..-+.. .....-|...+ -+...++| T Consensus 1 M~~e~~--------l~a~L~~~~~~Rvyp~~aP~~--~~~~Pyiv~q~vsg~-p~~~ldG~~~~--------~~~~rvQI 61 (118) T protein:vir:97 1 MSYGRM--------LKDLLDPVFSGRVYADIPPDS--PPLDAYAIYQRVGGV-PVYWKEGGMPD--------KVNARVQV 61 (118) T ss_pred CchHHH--------HHHHHhhhcCCccccccCCCC--CCcCCEEEEEecCCc-ccccccCCCCC--------ccceeEEE Confidence 444444 455566666654433 1111 244699999655543 22222232111 12367999 Q ss_pred EEech---HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeee Q lcl|NC_017981. 84 KVFGG---AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQR 150 (181) Q Consensus 84 q~fG~---~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~ 150 (181) ++|+. +|++..++++..+.-.. .....+.+ ..+.|..-==.|+.+||.++|+++ T Consensus 62 dvyA~t~~~A~~l~~av~~al~~~~--------~~~~~~~~-----~~~ye~dt~lyr~~~Df~iw~~~~ 118 (118) T protein:vir:97 62 QIWSRSKQEAYLATVQVLRIVSEAN--------DMQVLSQP-----IDDYVRELKLYGSRVDISMWYNLT 118 (118) T ss_pred EEeeCCHHHHHHHHHHHHHHhhccc--------ccccccCC-----cccccccCCceEEEEEEEEEeecC Confidence 99985 45666666655553211 01111100 011111111258999999999999 No 14 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=92.26 E-value=0.0098 Score=31.60 Aligned_cols=108 Identities=7% Similarity=0.072 Sum_probs=56.0 Q ss_pred cChhhhhHHHHHHHHHHHHHHhhCCccEe--ecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEE Q lcl|NC_017981. 6 TTIDQFVPDEVEAAAYRVLSPLLPEMMLC--YEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSV 83 (181) Q Consensus 6 ~t~~~~i~~l~~~~a~~~ls~ll~~pvI~--AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvel 83 (181) .|+++.+..+ |+++.+.+|=+ |-++ .|..||+++..-+.. .....-|...+ -+...++| T Consensus 1 Ms~e~~l~a~--------L~~~~~~RVyp~~aP~~--~~~~Pyiv~q~vsg~-p~~~l~G~~~~--------~~~~rvQI 61 (118) T protein:vir:10 1 MSYGRVLKDL--------LDPVFSGRVYADIPPDS--PPLDAYAIYQRVGGV-PVYWQEGGMPE--------KVNARVQI 61 (118) T ss_pred CchHHHHHHH--------HhhhcCCccccccCCCC--CCcCCEEEEEecCCc-ccccccCCCCc--------cceeEEEE Confidence 5555555544 55555533322 1111 245699999655543 22222222111 12367999 Q ss_pred EEech---HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhhe-----eeeeEEEEEEEeee Q lcl|NC_017981. 84 KVFGG---AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAE-----PSAILDMGFRFTQR 150 (181) Q Consensus 84 q~fG~---~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE-----~RA~lel~irY~~~ 150 (181) +||+. +|++..++++..+.-.. .+..+..+ ..-|| .|+.+||.++|+++ T Consensus 62 dvyA~t~~~A~~l~~av~~al~~~~--------~~~~~~~~----------~d~ye~dt~l~r~~~Df~vw~~~~ 118 (118) T protein:vir:10 62 QIWSRSKQEAYLATVQVLRLVSEAN--------DMQVLSQP----------IDDYVREIKLYGSRVDISMWYNLT 118 (118) T ss_pred EEeeCCHHHHHHHHHHHHHHhhhcc--------cceeccCC----------CccccccCCceEEEEEEEEeeecC Confidence 99985 45555666655553210 11111110 11233 49999999999999 No 15 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=85.48 E-value=0.052 Score=27.60 Aligned_cols=139 Identities=9% Similarity=0.130 Sum_probs=68.9 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.++||+...-+-+ -|+.+++.++ .|.-...++.||+++--....+. +.-.. .-.+. T Consensus 1 M~~s~~~aLq~ai~~~L~ad~--~l~~lvg~~v--yD~~P~~~~~PyV~lG~~~~~~~-----~t~~~-------~~~~~ 64 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNP--IIQKQLGGRV--FDCVQKDAVYPYIVVGETNVTNK-----ETTTS-------MVEDV 64 (145) T ss_pred CcccHHHHHHHHHHHHhhcCh--hHHHhcCccc--ccCCccCCCCCEEEeccceeeec-----CCCcc-------cceEE Confidence 44 333334444444332211 2456666553 23333357899999822111111 11011 23466 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.+|+ .++.++++.+...|..+ +.-.|..+.+.. ....-.+++..--..++++.|++++...-. T Consensus 65 ~lti~Vws~~~gr~ea~~ia~ai~~aL~~~-----l~l~~~~lv~l~-~~~~~~~rd~d~~~~hgvl~~ra~i~~~~~-- 136 (145) T protein:vir:12 65 GITLHVYSQARNRDEASQIIQFLGFVLNNE-----IEIDYYSFIKSR-IDTQEVITDIDQYTKHGIIRLVFKYRHNTL-- 136 (145) T ss_pred EEEEEEEEcCccHHHHHHHHHHHHHHhccc-----cCCCCceEEEEE-EeeEEEEecCCCceEEEEEEEEEEEEeCCc-- Confidence 77888887 45577777787777543 333344443211 112223333333467898888888765432 Q ss_pred cCceeeeEEecCCc Q lcl|NC_017981. 155 VGLIEHVKLTGEID 168 (181) Q Consensus 155 Vg~Ie~Vevtg~~~ 168 (181) +=+||.+.+ T Consensus 137 -----~~~~~~~~~ 145 (145) T protein:vir:12 137 -----QRSVTNGAG 145 (145) T ss_pred -----ccccccCCC Confidence 223555555 No 16 >protein:vir:95765 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950594;genbank:gi:119953789;genbank:GeneID:5076835 Probab=81.95 E-value=0.081 Score=26.56 Aligned_cols=124 Identities=12% Similarity=0.144 Sum_probs=69.9 Q ss_pred cChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEEE Q lcl|NC_017981. 6 TTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKV 85 (181) Q Consensus 6 ~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~ 85 (181) .||++.+||.+ +- ++. +..++.=.=+. ...+.||..|.=.... +.....+.+ +.++.|.+ T Consensus 1 m~P~qeLfd~~----f~-~~~-~Gy~vYD~lP~-~~v~YPFVvig~~~~~-----------~~~tKt~~G--~i~l~i~V 60 (127) T protein:vir:95 1 MTPNHALFRRL----FA-ISN-IRVDTYDFLPD-AKSAYPFVYIGENNGS-----------DIPNKDLLG--RLRQTVHL 60 (127) T ss_pred CchhHHHHHHH----HH-HHh-cCCccccccCc-CCCCcCEEEEeeeeec-----------ccccceeee--EEEEEEEe Confidence 89999999955 32 444 46666654443 2468999999421111 111233444 56777899 Q ss_pred echHH-HHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhhe--eeeeEEEEEEEeeeec Q lcl|NC_017981. 86 FGGAA-RRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAE--PSAILDMGFRFTQRFT 152 (181) Q Consensus 86 fG~~A-~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE--~RA~lel~irY~~~~~ 152 (181) ||..- |-...++..++..-... .-+-.+.++ .-+-++...+.+..--. .||+|+|.++|+-+-- T Consensus 61 Wg~~~~R~~vs~i~~~i~~~~~~-~~~~~~y~~--~~~~s~~qil~Dtstnt~L~Hgil~l~f~f~~~~~ 127 (127) T protein:vir:95 61 YGLRTDRANLDDISAYLESEVKR-AHDGYDYHL--YHVETSKQIIPDNTDVQPLLHIVLDFTFDYTKKEN 127 (127) T ss_pred ecCchhhhhHHHHHHHHHHHhhh-hcccceeEE--EEecceeEEecccCCcceeEEEEEEEEEEeeccCC Confidence 98765 55555666555432211 122222222 22222333444433323 6899999999998755 No 17 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=78.27 E-value=0.066 Score=27.06 Aligned_cols=115 Identities=12% Similarity=0.049 Sum_probs=52.4 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEE Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGT 80 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eat 80 (181) |+ =..++..|+++.+..|=+ +.--..++.||+++..-+.. ....--|+ -++...+ T Consensus 1 ~~--------------~~~i~~aL~~l~~~RVyp-~~aP~~~~~Pyiv~q~vsg~-p~~~L~G~---------~~~~~~~ 55 (115) T protein:vir:14 1 MS--------------VIVIRDALQGIGGAKGYL-GVAPAKAPAPYFVVTRVHGA-LDMALAGL---------TGGRSGS 55 (115) T ss_pred Ce--------------eEeeehhhcccccccccc-ccCCCCCCCCEEEEEeecCc-ccccccCC---------CCCcceE Confidence 22 123344455555544432 22222445699999654432 22222222 2345688 Q ss_pred EEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEE Q lcl|NC_017981. 81 LSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRF 147 (181) Q Consensus 81 velq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY 147 (181) ++|++|+... +.+++|+...+ +.+..... ....+.+.+.+-+.|..-==.|+.+||.++| T Consensus 56 vQIDvyA~t~-~~A~~l~~~v~-----~~~~~~~~-~~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:14 56 YQIDCYAPTF-TDADRLADLAV-----DRAMSVQD-RFSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred EEEEEeeCCH-HHHHHHHHHHH-----HHHhcCcc-ceeeeeecCCCCCCcccccceeeEEEEEEeC Confidence 9999997532 22333333321 11222111 1122222222222221111259999999999 No 18 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=74.35 E-value=0.075 Score=26.74 Aligned_cols=128 Identities=11% Similarity=0.126 Sum_probs=59.6 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEE Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGT 80 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eat 80 (181) ++.-+.-.|.||+.-.-+- --|..|++. |+ |.....++.||+++--...++. +.-...| .+.+ T Consensus 3 ~~s~~~aLq~Ai~~~L~ad--~~l~alvg~--I~-D~~P~~~~~PYV~lG~~~~~d~-----~~~~~~g-------~~~~ 65 (134) T protein:vir:59 3 WKLASRALQKATVENLESY--QPLMEMVNQ--VT-ESPGKDDPYPYVVIGDQSSTPF-----ETKSSFG-------ENIT 65 (134) T ss_pred ccchhHHHHHHHHHHhhcC--hhHHHhhhh--hh-cCCCCCCCCCEEEeCCceeeec-----CCCcccc-------eEEE Confidence 3334566677776544332 224456652 44 5555677899999922111111 1111222 2344 Q ss_pred EEEEEech----HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeee Q lcl|NC_017981. 81 LSVKVFGG----AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQR 150 (181) Q Consensus 81 velq~fG~----~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~ 150 (181) +.|.++.. +|.+++..+...|...... +....++..... ..-.+++......++++.|+++.... T Consensus 66 ~ti~Vws~~g~~ea~~ia~av~~aL~~~~L~--l~~~~lv~l~~~---~~~~~rd~dg~~~hg~l~fra~ve~~ 134 (134) T protein:vir:59 66 MDFHVWGGTTRAEAQDISSRVLEALTYKPLM--FEGFTFVAKKLV---LAQVITDTDGVTKHGIIKVRFTINNN 134 (134) T ss_pred EEEEEEECCChHHHHHHHHHHHHHhcCCCcc--cCCceEEEeEEe---eeeEEecCCCceEEEEEEEEEEEecC Confidence 55555543 4556666666665422211 111112222222 22223334334556777777766554 No 19 >protein:vir:99537 Length: 125 # NCBI annotation: putative protein # Family: family:all:504 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958542;genbank:gi:41179324;genbank:GeneID:2717175 Probab=73.92 E-value=0.16 Score=24.88 Aligned_cols=121 Identities=10% Similarity=0.057 Sum_probs=66.2 Q ss_pred cChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEEE Q lcl|NC_017981. 6 TTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKV 85 (181) Q Consensus 6 ~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~ 85 (181) .||++.+||.+=...- -|+.||+=.=|.. ..+.||+.|.=....+. . + .-+--.+.++.|.+ T Consensus 1 m~P~q~Lfd~~f~~~~-----~lG~~vyD~lP~~-~v~YPFVvig~~~~~~~---~----t-----Kt~~~g~i~lti~V 62 (125) T protein:vir:99 1 MNPYEELFKTVIEYCK-----KTGYPTFDYLPDE-SQGYPFIMVGDQINNDI---Y----A-----KDFVTGTSNLTIHV 62 (125) T ss_pred CchhHHHHHHHHHHHH-----hcCCceeeecCCC-CCCcCEEEEeeeeecCC---C----C-----ccccceEEEEEEEE Confidence 8999999997643332 3688877765643 45899999932111110 0 0 11112356788889 Q ss_pred echHH-HHHHHHHHHHhCChhHHHHHHHcCeeee-cCCccccchhhhhhhhhe--eeeeEEEEEEEe Q lcl|NC_017981. 86 FGGAA-RRHLDNLRSRTKKMSSRDIMTRERFIIY-ATEQVLDVNITRSEIYAE--PSAILDMGFRFT 148 (181) Q Consensus 86 fG~~A-~~~ld~L~~~lk~ps~~~~l~~~giai~-d~g~Vqdlt~L~ea~~yE--~RA~lel~irY~ 148 (181) ||... +-.++++..++.. .......-.|..|. +..++| .+.+..-.. .++++.|.++.- T Consensus 63 Wg~~~~R~~v~~i~~~i~~-~~~~~~~t~~y~~~~~~~~~q---ii~D~s~~t~L~Hg~l~l~F~ir 125 (125) T protein:vir:99 63 FAEYNYRAEVATIMEQIQQ-LIPKFITTNHYLFGLTGSSSN---ILGETADSIQLQHGRLILDFNLR 125 (125) T ss_pred eeCcccchhHHHHHHHHHH-HhccceeccCcEEEeeeeeEE---EeecCCCCceeeEEEEEEEEeeC Confidence 98765 4444555555533 22333344555552 222333 344444445 577776665554 No 20 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=73.57 E-value=0.095 Score=26.18 Aligned_cols=115 Identities=11% Similarity=0.040 Sum_probs=50.8 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEE Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGT 80 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eat 80 (181) |+ =.+++..|+++-+.+|=+ +---..++.||+++..-+.. ....--|+ -++...+ T Consensus 1 ~~--------------~~~i~~aL~~l~~~RVyp-~~aP~~~~~Pyiv~q~vsg~-p~~~L~G~---------~~~~~~~ 55 (115) T protein:vir:10 1 MS--------------VIVIRDALQGIGGAKGYL-GVAPEKAPAPYFVVTRVHGA-LDMALAGL---------TGGRSGS 55 (115) T ss_pred Ce--------------eEEeehhhcccCCceeec-ccCCCCCCCCEEEEEeecCc-cccccCCC---------CCCcceE Confidence 22 112334444444433321 22112445699999654432 22222232 2345688 Q ss_pred EEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEE Q lcl|NC_017981. 81 LSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRF 147 (181) Q Consensus 81 velq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY 147 (181) ++|++|+... +.+++|+...++ ++..... ....+.+.+.+-+.|..-==.|+.+||.++| T Consensus 56 vQIDvyA~t~-~~A~~l~~~v~~-----~~~~~~~-~~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:10 56 YQIDCYAPTF-TDADRLADLAVD-----RAMSVQD-RFSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred EEEEEeeCCH-HHHHHHHHHHHH-----HHhcCcc-ceeEeeecCCCCCCcccccceeeEEEEEEeC Confidence 9999997532 333333333221 1222111 1111222222222221111259999999999 No 21 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=70.36 E-value=0.21 Score=24.30 Aligned_cols=139 Identities=9% Similarity=0.117 Sum_probs=72.4 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-+ |+ |.....++.||+++--...++. +.-.. .-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~--~l~alvggr-V~-D~~P~~a~~PYv~lG~~~~~d~-----~~~~~-------~g~~~ 64 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNL--IIRKQLDGR-VF-DCVQKDAVYPYIVVGETNVTNK-----ETTTS-------MVEDV 64 (145) T ss_pred CcchHhHHHHHHHHHHhhcCh--hHHHhhcCc-ee-cCCccCCCCCEEEeCcceeeec-----CCCcc-------cceEE Confidence 66 455666677766544322 245556654 33 6666677899999922111111 11011 12355 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.+|. .+|.+++..++..|..+ +.-.|..+.+. ++...-.+++..-...++++.|+++++-.-. T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~aL~~~-----l~l~~~~lv~l-~~~~~~~~rd~dg~~~hgvl~fra~ve~~~~-- 136 (145) T protein:vir:97 65 GITLHVYSQARNRDEASQIIQFLGFVLNNE-----IEIDYYSFIKS-RIDTQEVITDIDQYTKHGIIRLVFKYRHNTL-- 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccc-----cCCCCCeEEEe-EEeeeeEeecCCCceEEEEEEEEEEEecCce-- Confidence 66676665 46677888888777543 22233333322 2222233344444467888888887764311 Q ss_pred cCceeeeEEecCCc Q lcl|NC_017981. 155 VGLIEHVKLTGEID 168 (181) Q Consensus 155 Vg~Ie~Vevtg~~~ 168 (181) +=.||.+.+ T Consensus 137 -----~~~~~~~~~ 145 (145) T protein:vir:97 137 -----QRSVTNGAG 145 (145) T ss_pred -----ecccccCCC Confidence 112565665 No 22 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=69.23 E-value=0.23 Score=24.13 Aligned_cols=139 Identities=11% Similarity=0.102 Sum_probs=74.8 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-.| + |.....++.||+++--...++. +. +- ..-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada--~l~alvggrV-~-D~~P~~~~~PYv~lG~~~~~d~--------~~--~~--~~g~~~ 64 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNP--IIQKQLDGRV-F-DCVQKDAVYPYIVVGETNVTNK--------ET--TT--SMVEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCH--hHHHhhcccc-c-cCCcCCCCCCEEEecCceeeec--------CC--Cc--ccceEE Confidence 55 455556666665543322 2556666553 3 5555677899999922111111 11 00 123456 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++. .+|.+++..++..|..+.. +...-++.....+.+.+ ++..-...++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~aL~~~l~---l~~~~lv~l~~~~~~~~---rd~dg~~~hgvl~fra~ve~~~~-- 136 (145) T protein:vir:94 65 GITLHVYSQARNRDEASQIIQFLGFVLNNEIE---IDYYSFIKSRIDTQEVI---TDIDQYTKHGIIRLVFKYRHNTL-- 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccccC---CCCCeEEEeEEeeeeEe---ecCCCceEEEEEEEEEEEEeccc-- Confidence 67777774 5678888888888854321 12222333333333332 33334467899999888875432 Q ss_pred cCceeeeEEecCCc Q lcl|NC_017981. 155 VGLIEHVKLTGEID 168 (181) Q Consensus 155 Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.+ T Consensus 137 -----~~~~~~~~~ 145 (145) T protein:vir:94 137 -----QRSVTNGAG 145 (145) T ss_pred -----ccccccCCC Confidence 334666665 No 23 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=69.21 E-value=0.23 Score=24.12 Aligned_cols=139 Identities=10% Similarity=0.100 Sum_probs=74.8 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-.| + |.....++.||+++--...++. +. +- ..-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada--~l~alvggrV-~-D~~P~~~~~PYv~lG~~~~~d~--------~~--~~--~~g~~~ 64 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNP--IIQKQLDGRV-F-DCVQKDAVYPYIVVGETNVTNK--------ET--TT--SMVEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCH--hHHHhhcccc-c-cCCcCCCCCCEEEecCceeeec--------CC--Cc--ccceEE Confidence 55 455556666665543322 2556666553 3 5555677899999922111111 11 00 123456 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++. .+|.+++..++..|..+.. +...-++.....+.+.+ ++..-...++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~aL~~~l~---l~~~~lv~l~~~~~~~~---rd~dg~~~hgvl~fra~ve~~~~-- 136 (145) T protein:vir:95 65 GITLHVYSQARNRDEASQIIQFLGFVLNNEIE---IDYYSFIKSRIDTQEVI---TDIDQYTKHGVIRLVFKYRHNTL-- 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccccC---CCCCeEEEeEEeeeeEe---ecCCCceEEEEEEEEEEEEeccc-- Confidence 67777774 5678888888888854321 12222333333333332 33334467899999888875432 Q ss_pred cCceeeeEEecCCc Q lcl|NC_017981. 155 VGLIEHVKLTGEID 168 (181) Q Consensus 155 Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.+ T Consensus 137 -----~~~~~~~~~ 145 (145) T protein:vir:95 137 -----QRSVTNGAG 145 (145) T ss_pred -----ccccccCCC Confidence 334666665 No 24 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=63.95 E-value=0.31 Score=23.38 Aligned_cols=139 Identities=11% Similarity=0.094 Sum_probs=75.2 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-+ |+ |.....++.||+++--...++. +. . -..-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada--~l~alvggr-V~-D~~P~~a~~PYV~lG~~~~~~~--------~~--~--~~~g~~~ 64 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNS--IIQKQLDGR-VF-DCVQKDAVYPYIVVGETNVTNK--------ET--T--TSMVEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh--hHHHhhcCc-ee-cCCcCCCCCCEEEecCceeeec--------CC--C--cccceEE Confidence 65 456667777776554422 255666654 43 5566677899999922211111 11 0 0123456 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++. .+|.+++..++..|..+.. +...-++.....+.+. +++.-....++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~aL~~~l~---l~~~~lv~l~~~~~~~---~rd~dg~~~hgvl~~ra~ve~~~~-- 136 (145) T protein:vir:95 65 GITLHVYSQARNRDEASQIIQFLGFVLNNEIE---IDYYSFIKSRIDTQEV---ITDIDRYTKHGIIRLVFKYRHNTL-- 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccccC---CCCCeEEEeEEeeeeE---eecCCCceEEEEEEEEEEEEeccc-- Confidence 67777774 5667888888888754321 1122223333333332 233333467899988888775432 Q ss_pred cCceeeeEEecCCc Q lcl|NC_017981. 155 VGLIEHVKLTGEID 168 (181) Q Consensus 155 Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.+ T Consensus 137 -----~~~~~~~~~ 145 (145) T protein:vir:95 137 -----QRSVTNGAG 145 (145) T ss_pred -----ccccccCCC Confidence 334666665 No 25 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=63.75 E-value=0.31 Score=23.36 Aligned_cols=135 Identities=7% Similarity=0.060 Sum_probs=69.1 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-..+-+ -|..|++-+ |+ |.....+..||+++--...++. ++ . . ..-.+. T Consensus 1 Msms~~~aLQ~Ai~~~L~ada--al~alvg~r-I~-D~~P~~~~~PYv~lG~~~~~~~--------~~--~-~-~~g~~~ 64 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDP--NINKLVDDR-VF-DVVQDDAVYPYIVVGESNVTNN--------ES--S-A-TMRETV 64 (141) T ss_pred CccchhHHHHHHHHHHhhcCh--hhHhhcCCc-cc-cCCccCCCCCEEEeCCceeeec--------CC--C-c-ccceEE Confidence 54 345556666665443322 245556655 33 5566677889999922111111 11 1 0 112345 Q ss_pred EEEEEEe-----chHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVF-----GGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~f-----G~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++ ..+|.+++..++..|..| +.-.|..+.+. ++...-.+++.-....++++.|+++..-.-... T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~AL~~~-----l~l~~~~lv~l-~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~ 138 (141) T protein:vir:94 65 GIVIHVYSQFATQYEAKLILSAIGYVLNRP-----IEIDNYEFQFS-RIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNE 138 (141) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccc-----ccCCCceEEEE-EEeeeeeeecCCCceEEEEEEEEEEEEeccccc Confidence 6666676 235688888888888544 22233333322 111222334443456788888888766544332 Q ss_pred cCc Q lcl|NC_017981. 155 VGL 157 (181) Q Consensus 155 Vg~ 157 (181) -=+ T Consensus 139 ~~~ 141 (141) T protein:vir:94 139 GVY 141 (141) T ss_pred cCC Confidence 222 No 26 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=63.75 E-value=0.31 Score=23.36 Aligned_cols=135 Identities=7% Similarity=0.060 Sum_probs=69.1 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-..+-+ -|..|++-+ |+ |.....+..||+++--...++. ++ . . ..-.+. T Consensus 1 Msms~~~aLQ~Ai~~~L~ada--al~alvg~r-I~-D~~P~~~~~PYv~lG~~~~~~~--------~~--~-~-~~g~~~ 64 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDP--NINKLVDDR-VF-DVVQDDAVYPYIVVGESNVTNN--------ES--S-A-TMRETV 64 (141) T ss_pred CccchhHHHHHHHHHHhhcCh--hhHhhcCCc-cc-cCCccCCCCCEEEeCCceeeec--------CC--C-c-ccceEE Confidence 54 345556666665443322 245556655 33 5566677889999922111111 11 1 0 112345 Q ss_pred EEEEEEe-----chHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVF-----GGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~f-----G~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++ ..+|.+++..++..|..| +.-.|..+.+. ++...-.+++.-....++++.|+++..-.-... T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~AL~~~-----l~l~~~~lv~l-~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~ 138 (141) T protein:vir:10 65 GIVIHVYSQFATQYEAKLILSAIGYVLNRP-----IEIDNYEFQFS-RIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNE 138 (141) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccc-----ccCCCceEEEE-EEeeeeeeecCCCceEEEEEEEEEEEEeccccc Confidence 6666676 235688888888888544 22233333322 111222334443456788888888766544332 Q ss_pred cCc Q lcl|NC_017981. 155 VGL 157 (181) Q Consensus 155 Vg~ 157 (181) -=+ T Consensus 139 ~~~ 141 (141) T protein:vir:10 139 GVY 141 (141) T ss_pred cCC Confidence 222 No 27 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=63.75 E-value=0.31 Score=23.36 Aligned_cols=135 Identities=7% Similarity=0.060 Sum_probs=69.1 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-..+-+ -|..|++-+ |+ |.....+..||+++--...++. ++ . . ..-.+. T Consensus 1 Msms~~~aLQ~Ai~~~L~ada--al~alvg~r-I~-D~~P~~~~~PYv~lG~~~~~~~--------~~--~-~-~~g~~~ 64 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDP--NINKLVDDR-VF-DVVQDDAVYPYIVVGESNVTNN--------ES--S-A-TMRETV 64 (141) T ss_pred CccchhHHHHHHHHHHhhcCh--hhHhhcCCc-cc-cCCccCCCCCEEEeCCceeeec--------CC--C-c-ccceEE Confidence 54 345556666665443322 245556655 33 5566677889999922111111 11 1 0 112345 Q ss_pred EEEEEEe-----chHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVF-----GGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~f-----G~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++ ..+|.+++..++..|..| +.-.|..+.+. ++...-.+++.-....++++.|+++..-.-... T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~AL~~~-----l~l~~~~lv~l-~~~~~~~~rd~dg~t~hgvl~~ra~v~~~~~~~ 138 (141) T protein:vir:96 65 GIVIHVYSQFATQYEAKLILSAIGYVLNRP-----IEIDNYEFQFS-RIDSQAVFPDIDRFTKHGTIRLLFKYRHKKKNE 138 (141) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccc-----ccCCCceEEEE-EEeeeeeeecCCCceEEEEEEEEEEEEeccccc Confidence 6666676 235688888888888544 22233333322 111222334443456788888888766544332 Q ss_pred cCc Q lcl|NC_017981. 155 VGL 157 (181) Q Consensus 155 Vg~ 157 (181) -=+ T Consensus 139 ~~~ 141 (141) T protein:vir:96 139 GVY 141 (141) T ss_pred cCC Confidence 222 No 28 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=62.96 E-value=0.33 Score=23.26 Aligned_cols=137 Identities=9% Similarity=0.107 Sum_probs=72.8 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-+ |+ |.....++.||+++--...++.+ .-.. .-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada--~l~alvggr-I~-D~~P~~a~~PYV~lG~~~~~d~~-----~~~~-------~g~~~ 64 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNL--IIQKQLDGR-VF-DCVQKDAVYPYIVVGETNVTNKE-----TTTS-------MVEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh--hHHHhhcCc-ee-cCCcCCCCCCEEEeCCceeeecC-----CCcc-------cceEE Confidence 65 456667777776554322 255666654 43 55566778999999221111111 0011 23355 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeee--ecCCccccchhhhhhhhheeeeeEEEEEEEeeeec Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFII--YATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFT 152 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai--~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~ 152 (181) ++.|.++. .+|.+++..++..|..+ +.-.|..+ ....+.+ .+++..-.-.++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~aL~~~-----l~l~~~~lv~l~~~~~~---~~rd~dg~~~hgvl~fra~ve~~~~ 136 (145) T protein:vir:93 65 GITLHVYSQARNRDEASQIIQFLGFVLNNE-----IEIDYYSFIKSRIDTQE---VITDIDQYTKHGIIRLVFKYRHNTL 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccc-----cCCCCCeEEEeEEeeee---EeecCCcceEEEEEEEEEEEEeccc Confidence 66677775 45677777777777543 22223332 2222222 2233333456788888888765432 Q ss_pred cccCceeeeEEecCCc Q lcl|NC_017981. 153 DNVGLIEHVKLTGEID 168 (181) Q Consensus 153 d~Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.+ T Consensus 137 -------~~~~~~~~~ 145 (145) T protein:vir:93 137 -------QRSVTNGAG 145 (145) T ss_pred -------ccccccCCC Confidence 334666665 No 29 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=62.96 E-value=0.33 Score=23.26 Aligned_cols=137 Identities=9% Similarity=0.107 Sum_probs=72.8 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-+ |+ |.....++.||+++--...++.+ .-.. .-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada--~l~alvggr-I~-D~~P~~a~~PYV~lG~~~~~d~~-----~~~~-------~g~~~ 64 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNL--IIQKQLDGR-VF-DCVQKDAVYPYIVVGETNVTNKE-----TTTS-------MVEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh--hHHHhhcCc-ee-cCCcCCCCCCEEEeCCceeeecC-----CCcc-------cceEE Confidence 65 456667777776554322 255666654 43 55566778999999221111111 0011 23355 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeee--ecCCccccchhhhhhhhheeeeeEEEEEEEeeeec Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFII--YATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFT 152 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai--~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~ 152 (181) ++.|.++. .+|.+++..++..|..+ +.-.|..+ ....+.+ .+++..-.-.++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~aL~~~-----l~l~~~~lv~l~~~~~~---~~rd~dg~~~hgvl~fra~ve~~~~ 136 (145) T protein:vir:94 65 GITLHVYSQARNRDEASQIIQFLGFVLNNE-----IEIDYYSFIKSRIDTQE---VITDIDQYTKHGIIRLVFKYRHNTL 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccc-----cCCCCCeEEEeEEeeee---EeecCCcceEEEEEEEEEEEEeccc Confidence 66677775 45677777777777543 22223332 2222222 2233333456788888888765432 Q ss_pred cccCceeeeEEecCCc Q lcl|NC_017981. 153 DNVGLIEHVKLTGEID 168 (181) Q Consensus 153 d~Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.+ T Consensus 137 -------~~~~~~~~~ 145 (145) T protein:vir:94 137 -------QRSVTNGAG 145 (145) T ss_pred -------ccccccCCC Confidence 334666665 No 30 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=62.96 E-value=0.33 Score=23.26 Aligned_cols=137 Identities=9% Similarity=0.107 Sum_probs=72.8 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-+ |+ |.....++.||+++--...++.+ .-.. .-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada--~l~alvggr-I~-D~~P~~a~~PYV~lG~~~~~d~~-----~~~~-------~g~~~ 64 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNL--IIQKQLDGR-VF-DCVQKDAVYPYIVVGETNVTNKE-----TTTS-------MVEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh--hHHHhhcCc-ee-cCCcCCCCCCEEEeCCceeeecC-----CCcc-------cceEE Confidence 65 456667777776554322 255666654 43 55566778999999221111111 0011 23355 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeee--ecCCccccchhhhhhhhheeeeeEEEEEEEeeeec Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFII--YATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFT 152 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai--~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~ 152 (181) ++.|.++. .+|.+++..++..|..+ +.-.|..+ ....+.+ .+++..-.-.++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~eak~ia~av~~aL~~~-----l~l~~~~lv~l~~~~~~---~~rd~dg~~~hgvl~fra~ve~~~~ 136 (145) T protein:vir:97 65 GITLHVYSQARNRDEASQIIQFLGFVLNNE-----IEIDYYSFIKSRIDTQE---VITDIDQYTKHGIIRLVFKYRHNTL 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhccc-----cCCCCCeEEEeEEeeee---EeecCCcceEEEEEEEEEEEEeccc Confidence 66677775 45677777777777543 22223332 2222222 2233333456788888888765432 Q ss_pred cccCceeeeEEecCCc Q lcl|NC_017981. 153 DNVGLIEHVKLTGEID 168 (181) Q Consensus 153 d~Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.+ T Consensus 137 -------~~~~~~~~~ 145 (145) T protein:vir:97 137 -------QRSVTNGAG 145 (145) T ss_pred -------ccccccCCC Confidence 334666665 No 31 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=52.79 E-value=0.55 Score=22.03 Aligned_cols=107 Identities=8% Similarity=0.132 Sum_probs=66.9 Q ss_pred HHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEE-EcccCCccccccCCCCCcCceeeeeeeeEEEEEEEechHHHHH Q lcl|NC_017981. 15 EVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLH-VIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKVFGGAARRH 93 (181) Q Consensus 15 l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~-vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~fG~~A~~~ 93 (181) .||.+.-+.|..=|+.|+-+--+-+ .+++|++++ .-..+ ++ -+..+++-+|+||..-+++ T Consensus 1 miE~~v~~~L~~~l~vpv~~e~p~~--~p~~FV~vErtGG~~----------~~-------~~~~~~lAVQ~~~~S~~eA 61 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSVSSFLEKKGE--MPLSYVLFEKTGSSK----------SN-------HLLSSTFAFQSYAPSMYEA 61 (111) T ss_pred ChHHhHHHHHhhcCCcceEeecCCC--CCCceEEEEecCCcc----------cc-------ccccceEEEEecchhHHHH Confidence 7888888888888888865554333 256788872 11111 11 1245899999999876544 Q ss_pred H---HHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEE Q lcl|NC_017981. 94 L---DNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFR 146 (181) Q Consensus 94 l---d~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~ir 146 (181) + ++|+..+..- -+-.+|+=++.+..-|.|-- +..+|=.+++++|..= T Consensus 62 a~La~~v~~~~~~l-----~~~~~i~~v~~~s~Ynf~d~-~tk~~RYQav~~i~~~ 111 (111) T protein:vir:94 62 AKLNEQLKEVVERL-----IELNEISNVSLNSDYNFTDT-ETKEYRYQAVFDINHY 111 (111) T ss_pred HHHHHHHHHHHhhc-----ccccccceeecCCCcccCCC-cCCCceEEEEEEEeeC Confidence 3 3444444222 22334555666666777655 5557888888888765 No 32 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=52.10 E-value=0.56 Score=21.95 Aligned_cols=107 Identities=16% Similarity=0.159 Sum_probs=52.6 Q ss_pred HHHHHHHHHHHHhhCCccEee---cCCCC-CCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEEEech-- Q lcl|NC_017981. 15 EVEAAAYRVLSPLLPEMMLCY---EGQNH-NISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKVFGG-- 88 (181) Q Consensus 15 l~~~~a~~~ls~ll~~pvI~A---dqNg~-~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~fG~-- 88 (181) ..|.-.|.+|+++.+..|-.- .-.++ .-..||+++..-+.. +.+.--|+ .+...+++|++|+. T Consensus 1 M~e~~i~~lL~~~~~gRvyp~~~P~~~~~~~~~~Pyiv~q~vsg~-p~~~l~gp----------~~~~~~vQIDvyA~t~ 69 (114) T protein:vir:93 1 MTEADLYPHLAHLAGGQVYPYVVPLLDGRPSVALPWVVFSLISSV-SADVMGGQ----------AESSVSVQIDVYAGTV 69 (114) T ss_pred CchHHHHHHHHhhcCcccccccCCcccCcCCccCceEEEEeccCc-ccccccCc----------cccceEEEEEeeeCCH Confidence 445667788888877654311 11111 123699999665543 22222221 23357999999964 Q ss_pred -HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEE Q lcl|NC_017981. 89 -AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRF 147 (181) Q Consensus 89 -~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY 147 (181) +|+...++++..+. .+. .+.....++- --+.. =.|+.+||.+-- T Consensus 70 ~~A~~l~~~v~~Al~---------~~~--~~~~~~~~~y--e~dt~--lyR~~~d~~v~~ 114 (114) T protein:vir:93 70 TQARQIRQDAREAIM---------LLA--PGSVSEMQDY--IPENR--CYRATLEFQVTV 114 (114) T ss_pred HHHHHHHHHHHHHHh---------hcC--cEeecCCCcc--ccccc--ceeeEEEEEEeC Confidence 45555555554442 211 1222222221 01111 136666665543 No 33 >protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540 Probab=50.41 E-value=0.61 Score=21.76 Aligned_cols=107 Identities=17% Similarity=0.167 Sum_probs=54.4 Q ss_pred HHHHHHHHHHHHhhCCccE--eecCCCC---CCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEEEec-- Q lcl|NC_017981. 15 EVEAAAYRVLSPLLPEMML--CYEGQNH---NISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKVFG-- 87 (181) Q Consensus 15 l~~~~a~~~ls~ll~~pvI--~AdqNg~---~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~fG-- 87 (181) ..|.-.+.+|+++.+-+|- .|=+++- .=..||+.+..-+.. ..+.--|+ .....+++|.||. T Consensus 1 M~e~~i~~lL~~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~-p~~~L~G~----------~~~~~~vQIDvyA~t 69 (115) T protein:vir:19 1 MNEDNIYALLSPLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDV-SADVLCGQ----------AESRVSVQVDVYSTS 69 (115) T ss_pred CchhHHHHHHhhhcCcccceeeccCCCCCCccccCCeEEEEeccCc-ccccccCC----------CccceEEEEEEeeCC Confidence 4456667777877774432 2344442 225799999766543 32222332 1134689999995 Q ss_pred -hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEE Q lcl|NC_017981. 88 -GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRF 147 (181) Q Consensus 88 -~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY 147 (181) ++|++..++++..+. .+.. +...+. ...|.--==.|+.+||.|-- T Consensus 70 ~~~A~~l~~~i~~Al~---------~~~p--~~~~~~----~~ye~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 70 IAESRSLRDLVLASLE---------PLTP--TEVVKI----PGYEPDYRLYRATLDFKVTP 115 (115) T ss_pred hHHHHHHHHHHHHHhh---------hcCC--EEecCC----CCcccchhceeeEEEEEecC Confidence 455655555555542 1111 111111 11111111247888877755 No 34 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=44.96 E-value=0.79 Score=21.15 Aligned_cols=108 Identities=8% Similarity=0.101 Sum_probs=65.2 Q ss_pred HHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEEEechHHHHHH Q lcl|NC_017981. 15 EVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKVFGGAARRHL 94 (181) Q Consensus 15 l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~fG~~A~~~l 94 (181) .||.+.-+.|..-|+.|+-.--+-+ + +..|++++ |+-+ .... -+..+++-+|+||..-.+++ T Consensus 1 miE~~i~~~L~~~l~Vpv~~e~p~~-~-P~~FV~vE----rtGG---------~~~~---~~~~~~lAVq~w~~S~~eAa 62 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSVSSFLEKKGE-M-PLSYILFE----KTGS---------SKSN---HLLSSTFAFQSYAPSMYEAA 62 (111) T ss_pred ChHHhHHHHHhhcCCceeEeecCCC-C-CCceEEEE----ecCC---------cccc---ccccceEEEEecchhHHHHH Confidence 7888888888888888865544332 2 56788772 2111 1111 12458999999998765443 Q ss_pred ---HHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEE Q lcl|NC_017981. 95 ---DNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFR 146 (181) Q Consensus 95 ---d~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~ir 146 (181) ++++..+.+- -+-.+|+=++....-|.|-- +..+|=.++.+||..= T Consensus 63 ~La~~v~~~l~~l-----~~~~~I~av~~~s~ynf~d~-~tk~~RYQav~~i~~~ 111 (111) T protein:vir:16 63 KLNEQLKEVVERL-----IELNEISNVSLNSDYNFTDT-ETKEYRYQAVFDINHY 111 (111) T ss_pred HHHHHHHHHHhhc-----cccccceeeecCCCCcCCCC-CCCCceEEEEEEEeeC Confidence 4555444222 22234544555555666554 4557788888888765 No 35 >protein:vir:3618 Length: 129 # NCBI annotation: ORF41 # Family: family:all:504 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112704;genbank:gi:13786572;genbank:GeneID:921070 Probab=44.05 E-value=0.82 Score=21.05 Aligned_cols=126 Identities=17% Similarity=0.202 Sum_probs=63.0 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEE Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGT 80 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eat 80 (181) |- .||++.+||.+=+.. .. |+.||.=.=|.. ..+.||..+.=....+. +--++ --.+.+ T Consensus 1 mm---ksp~qeL~d~~f~~l----~~-lG~~vyD~lP~~-~v~YPfV~ig~~~~~~~-----~tKt~-------~~g~v~ 59 (129) T protein:vir:36 1 MI---KTRDQSIFDELFKRI----QA-LGYTVYDYKPMN-EVGYPFVELENTQTIHE-----ANKTD-------IKGTVS 59 (129) T ss_pred CC---cChhHHHHHHHHHHH----Hh-cCCeeeeccCCC-CCCcCEEEeeeeeecCC-----ccccc-------cccEEE Confidence 32 268888877543333 22 577765543432 35899999932211111 11111 123467 Q ss_pred EEEEEechHH-HHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhh-hh-eeeeeEEEEEEEe Q lcl|NC_017981. 81 LSVKVFGGAA-RRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEI-YA-EPSAILDMGFRFT 148 (181) Q Consensus 81 velq~fG~~A-~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~-~y-E~RA~lel~irY~ 148 (181) +.|.+||... +-.+.++..++..-.... ..-.|..|.-..+-+++-.+.+.. += =.+++++|.+.+- T Consensus 60 ltihVW~~~~~R~~v~~i~~~i~~~~~~~-~~t~~y~~~~~~~~~~~q~~~D~st~~~L~Hgii~l~f~~r 129 (129) T protein:vir:36 60 LSLSVWGLQKKRKEVSDMASNIFNQALNI-SATDGYSWALNSQASTIQMLDDTTTNTPLKRALINLEFRLR 129 (129) T ss_pred EEEEEEeCCcCchhHHHHHHHHHHHhccc-ccCCCeEEEEEeeeeeEEEeccCCCCceeeEEEEEEEEEeC Confidence 7888887655 444444444443333222 233677765444444454454432 00 1578766666555 No 36 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=43.91 E-value=0.83 Score=21.04 Aligned_cols=102 Identities=12% Similarity=0.078 Sum_probs=54.3 Q ss_pred hCCccEeecC---------CCCCCC---CceEEE-EEcccCCccccccCCCCCcCceeeeeeeeEEEEEEEechHHHHHH Q lcl|NC_017981. 28 LPEMMLCYEG---------QNHNIS---PPYATL-HVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKVFGGAARRHL 94 (181) Q Consensus 28 l~~pvI~Adq---------Ng~~P~---~PYatl-~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~fG~~A~~~l 94 (181) |..-||++-= -|-.|+ .||+++ ++.+.++.. --|+. +-..++.+|.+|+. .+.-+ T Consensus 1 ~~~~vir~al~~i~~~~~~~~vAp~~~~~pyivy~rvsga~e~~--L~G~a---------g~~~~~~QID~yA~-T~~ea 68 (115) T protein:vir:80 1 MSVIVVRDALQGIGGAKGYLGVAPEKAPARYFVVTRVHGALDMA--LAGPT---------GGRSGSYQIDCYAP-TFTDA 68 (115) T ss_pred CeeeeeechhhhccccccceeeccccCcCCeEEEeecCCCcccc--ccCCC---------CCceeEEEEeeecC-CHHHH Confidence 3334554321 122444 799998 566655443 33332 23458889999984 23333 Q ss_pred HHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEE Q lcl|NC_017981. 95 DNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRF 147 (181) Q Consensus 95 d~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY 147 (181) ++|+.+.+- ++..+- -.+..|-+.++|-+-++---=-|+.+||.+.| T Consensus 69 ~~La~~v~d-----~~~~~~-~~~~vg~l~e~pd~Ye~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 69 DRLADLAVD-----RAMSVQ-DRFSVGGVDELPDDYSADTGLFRVSLELSVEF 115 (115) T ss_pred HHHHHHHHH-----hhhCCc-cccceecccCCCcccccccceEEEEEEEEEeC Confidence 344443332 222100 02345666777666333333359999999999 No 37 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=43.16 E-value=0.86 Score=20.95 Aligned_cols=113 Identities=12% Similarity=0.097 Sum_probs=50.1 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEE-EcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLH-VIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~-vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |+ +..+++-|..|=+..+.+ .=-.+.-+.||+++. +.+. ....--|+. ++..+ T Consensus 1 ~~--------------~~~i~~~l~~~~g~~~~~-~~aP~~~~~Py~vy~rvsg~--p~~tL~G~~---------g~~~~ 54 (114) T protein:vir:10 1 MS--------------ALTIRDAIGIVGGAKGYV-SVASSAAQSPYYVVSRVSGT--RDMALGGAT---------GGKSG 54 (114) T ss_pred Cc--------------eeeeehhhcccccccccC-CCCCCCCCCceEEEEeccCc--ccccccCCC---------CcceE Confidence 22 122222222222222222 111234567999994 4332 233334442 24567 Q ss_pred EEEEEEechHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEE Q lcl|NC_017981. 80 TLSVKVFGGAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRF 147 (181) Q Consensus 80 tvelq~fG~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY 147 (181) .++|+||+.- +.-+++|+.+.+.-.. .... |..+-|.|.+-+-|..-==.|..+||.|+| T Consensus 55 r~QiD~yA~T-~~eA~~La~~~~~~l~----~~~~---f~~~~l~~~~d~ye~dT~l~Rvsld~si~f 114 (114) T protein:vir:10 55 MFQIDVYAKT-YTEADSLADQIIDRVE----STGM---FSVGGVSDLPDDYSSDTGVFRVSLEISVQF 114 (114) T ss_pred EEEEEeeeCC-HHHHHHHHHHHHhhcc----cccC---eeeeccccCCCCCCcccCceEEEEEEEEeC Confidence 7899999853 3334444433221111 1111 222334444333221111258999999999 No 38 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=42.89 E-value=0.87 Score=20.93 Aligned_cols=108 Identities=9% Similarity=0.134 Sum_probs=67.5 Q ss_pred HHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEEEechHHHHHH Q lcl|NC_017981. 15 EVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKVFGGAARRHL 94 (181) Q Consensus 15 l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~fG~~A~~~l 94 (181) .||...-..|..-|+.|+-+--+-+ +|. +|++++ |+-++ .+ . .-..+++-+|+||..-++++ T Consensus 1 mIE~~i~~yL~~~l~vpv~~e~p~~-~P~-~FV~vE----kTGG~------~~---~---~~~~a~lAvQsyg~S~~~AA 62 (111) T protein:vir:97 1 MIEVIIKKYLDEHLDVPSFFEHQKD-EPA-RFIILE----KTSGA------KQ---N---HLLSSTFAFQSYAESLYEAA 62 (111) T ss_pred ChhhhhhHHHhhhcCceEEEeecCC-CCC-ceEEEE----eeCCc------cc---c---ccccceEEEEecchhHHHHH Confidence 7888888889888999998877766 455 898883 21110 01 0 01348999999998655444 Q ss_pred ---HHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEE Q lcl|NC_017981. 95 ---DNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFR 146 (181) Q Consensus 95 ---d~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~ir 146 (181) ++++..+.+-. +-..|+=++...-=|.|-. +..+|=.+|+.||.+= T Consensus 63 ~La~~V~~a~~~l~-----~l~~i~~v~lns~Ynf~d~-~tk~yRYQa~~di~~~ 111 (111) T protein:vir:97 63 LLNDKVKQVIEQLD-----VLPQVSGVHLNADYNFTDT-ATKRYRYQAVFDINHY 111 (111) T ss_pred HHHHHHHHHhhhhc-----cCccceeeeecccccCCCC-CCCCccEEEEEEEeeC Confidence 34444443222 2223433444444566655 5567888899998875 No 39 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=40.54 E-value=0.97 Score=20.66 Aligned_cols=111 Identities=9% Similarity=0.123 Sum_probs=70.1 Q ss_pred HHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEEEechHHHHHH Q lcl|NC_017981. 15 EVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVKVFGGAARRHL 94 (181) Q Consensus 15 l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq~fG~~A~~~l 94 (181) .||.+.-+.|..=|+.|+-+--+- ++| +.|++++ |+-+ ..++ -+..+++-+||||..-.++. T Consensus 1 miE~~v~~~L~~~l~vpv~~~vp~-~~P-~~FV~vE----rtGG-----~~~~-------~~~~p~laVq~wg~S~~~Aa 62 (111) T protein:vir:95 1 MIEIIINKYLDGHLDVPSFFEHEA-EAP-DSFVIIQ----KTGG-----KERN-------HSGSATFAFQSYAPTMQKAA 62 (111) T ss_pred ChHHhHHHHhhhhcCeeEEeecCC-CCC-CceEEEE----eeCC-----cccc-------ccccceEEEEeccccHHHHH Confidence 778888888877788888665544 445 6898883 2111 1011 12458999999998655544 Q ss_pred HHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEE Q lcl|NC_017981. 95 DNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFR 146 (181) Q Consensus 95 d~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~ir 146 (181) + |..+++. .....-...+|+-++.+...|.|-- +..+|=.++.++|.+- T Consensus 63 ~-La~~v~~-a~~~l~~~~~i~~v~~~s~ynf~d~-~tk~~RYQ~~~~i~~~ 111 (111) T protein:vir:95 63 E-LNVKVKS-AVKGLIELDSICGVHLNSDYNFTDT-ETKQYRYQAVFDINYF 111 (111) T ss_pred H-HHHHHHH-HHhhhhccccccccccCCccccCCC-CCCCceEEEEEEEEeC Confidence 3 2333331 1222234445777788888888766 5567888888888875 No 40 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=39.76 E-value=1 Score=20.58 Aligned_cols=131 Identities=9% Similarity=0.113 Sum_probs=67.3 Q ss_pred CCccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEE Q lcl|NC_017981. 1 MAEQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGT 80 (181) Q Consensus 1 ~~~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eat 80 (181) |.- +.-.|.||+.-.-+-+ -|..|++-+ |+ |.....++.||+++--...+ |++.+ -..-.+.+ T Consensus 3 msa-~~aLq~Ai~~~L~ad~--~l~alvggr-Vy-D~~P~~~~~PYV~lG~~~~~----------~~~~~--~~~g~~~~ 65 (140) T protein:vir:96 3 VTA-EPLLYNKIMNNLIENP--ITDKLVGGR-VF-DCVQKDVVYPYIVVGESNVT----------ESERS--PGMREIIA 65 (140) T ss_pred cch-hHHHHHHHHHHhccCh--hHHhhcCcc-cc-cCCccCCCCCEEEeCCceee----------ecCCC--cccceEEE Confidence 322 1235566655443322 245556544 33 54444567999999221111 11101 12233567 Q ss_pred EEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeee--cCCccccchhhhhhhhheeeeeEEEEEEEeeeecc Q lcl|NC_017981. 81 LSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIY--ATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTD 153 (181) Q Consensus 81 velq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~--d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d 153 (181) +.|.+++ .++.+++..++..|..+ +.-.|..+. ...+.+ .+++.-....++++.|++++...-.. T Consensus 66 ~tl~Vws~~~g~~ea~~ia~ai~~aL~~~-----l~l~~~~lv~l~~~~~~---~~rd~dg~t~hgvl~~ra~ve~~~~~ 137 (140) T protein:vir:96 66 ITFHVYSQYENGAEARELLKYLNYACRLN-----INFKDYELEWIKKDNSQ---VFTDIDQYTKHGVLRLLYKVRHKTLQ 137 (140) T ss_pred EEEEEEEcCCCHHHHHHHHHHHHHHhcCC-----ccCCCceEEEEEEeeeE---EeecCCCceEEEEEEEEEEEeecccc Confidence 7788884 35577888888888432 222344333 333332 33444345678999999888765433 Q ss_pred ccCceeee Q lcl|NC_017981. 154 NVGLIEHV 161 (181) Q Consensus 154 ~Vg~Ie~V 161 (181) |+| T Consensus 138 -----~~~ 140 (140) T protein:vir:96 138 -----ERV 140 (140) T ss_pred -----cCC Confidence 233 No 41 >protein:vir:94921 Length: 125 # NCBI annotation: possible peptidoglycan binding protein # Family: family:all:5248 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239283;genbank:gi:66392065;genbank:GeneID:5076566 Probab=39.49 E-value=1 Score=20.55 Aligned_cols=118 Identities=13% Similarity=0.025 Sum_probs=69.4 Q ss_pred cChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCC-CCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEE Q lcl|NC_017981. 6 TTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNI-SPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVK 84 (181) Q Consensus 6 ~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P-~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq 84 (181) .|.+++--++....+.. .-+.| ..++|.|.| ..+|+-+.+..- .....+.| + . .-++.+.+.|| T Consensus 1 Mt~~q~r~~I~~r~~a~----~~~~~--I~~~N~pp~~~~~W~Rlti~~g-~~~~a~iG------~-~-~~~rtGli~iq 65 (125) T protein:vir:94 1 MSYFQEKLDIENYFKAN----WPDTP--IFYENRTANSTGTWVRLTIQNG-DAFQASNG------E-V-SYRHPGVVFVQ 65 (125) T ss_pred CCHHHHHHHHHHHHHhC----CCccc--eeeCCCCCCCCCceEEEEeccC-cccccccC------C-c-eeeeeeEEEEE Confidence 56555544444443322 12345 345666654 678888887753 24445555 1 1 23667999999 Q ss_pred Eech------HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEee Q lcl|NC_017981. 85 VFGG------AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQ 149 (181) Q Consensus 85 ~fG~------~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~ 149 (181) +|.+ .+++.+|+++.-.+..+. |=.++...+++++.. +..+| | ..|.+.+|--+ T Consensus 66 iF~p~~~G~~~~~~~ad~~~~~f~~~~~-------g~i~f~~~~~~~~g~--~~gwy-Q-~Nv~I~f~~~~ 125 (125) T protein:vir:94 66 IFTKKEVGSGEALKLADKVDALFRSKTL-------GNIQFKVPQVQKVPS--TTEWY-Q-VNVSTEFYRGS 125 (125) T ss_pred eeecCCcChHHHHHHHHHHHHHHccCCC-------CceEEeeceecCCCC--CCCEE-E-EEEEEeeecCC Confidence 9964 347778877777666544 455888889998874 34332 2 44444444333 No 42 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=34.65 E-value=1.3 Score=20.00 Aligned_cols=134 Identities=7% Similarity=0.110 Sum_probs=68.2 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-..+-+ -|..|++-+| .|.....++.||+++--....+.+ .-.. .-.+. T Consensus 1 Msms~~~aLq~Ai~a~L~ada--~l~alvg~~V--yD~~P~~~~~Pyv~lG~~~~~~~~-----~~~~-------~g~~~ 64 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASP--IINKFVGDRV--FDVVQEDAVYPYIVVGESNVTNNE-----SSTM-------MRETV 64 (140) T ss_pred CCccHHHHHHHHHHHHhhcCh--hHHHhcCCcc--ccCCccCCCCCEEEecCceeeecC-----CCcc-------cceEE Confidence 54 445556666665443322 2455666553 355556778999999221111111 0011 12355 Q ss_pred EEEEEEech-----HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFGG-----AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG~-----~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++.. +|.+++..++..|..+ +.-.|..+.+. ++...-.+++.--.-.++++.|+++..-.-... T Consensus 65 ~~~i~Vws~~~g~~ea~~ia~av~~AL~~~-----l~l~~~~lv~l-~~~~~~~~rd~dg~~~hgvl~~r~~v~~~~~~~ 138 (140) T protein:vir:96 65 GIVIHVYSQFATQYEAKQIISAIGYVLNRP-----IDIENYEFQFS-RIDSQSVFPDIDRFTKHGTIRLLFKYRHIKKGE 138 (140) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhCCC-----ccCCCCeEEEE-EEeeeEEEecCCCceEEEEEEEEEEEEeecccc Confidence 677777763 4577777787777432 22234444331 111222333333335688888888877654332 Q ss_pred cCceeee Q lcl|NC_017981. 155 VGLIEHV 161 (181) Q Consensus 155 Vg~Ie~V 161 (181) .| T Consensus 139 -----~~ 140 (140) T protein:vir:96 139 -----GV 140 (140) T ss_pred -----CC Confidence 22 No 43 >protein:vir:96485 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238497;genbank:gi:66391773;genbank:GeneID:5176907 Probab=34.21 E-value=1.3 Score=19.95 Aligned_cols=120 Identities=14% Similarity=0.037 Sum_probs=62.0 Q ss_pred ccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEE Q lcl|NC_017981. 5 ETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVK 84 (181) Q Consensus 5 ~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq 84 (181) =.||++.+||.+=.. |.. ++.+|.-.=|. ...+.||..+.=....+. ..+ ++ --.+.++.|. T Consensus 1 m~sp~qeL~d~~f~~----l~~-~g~~vyd~lP~-~~v~YPfV~ig~~~~~~~---~tK--t~-------~~g~v~ltih 62 (128) T protein:vir:96 1 MKQPDQLLHDEMYRI----SSG-LGYDTYTYLPP-EGAAYPFVVMGETMVLPQ---STK--SH-------LIGRLSSTVH 62 (128) T ss_pred CCCHHHHHHHHHHHH----HHh-cCCeeecccCC-CCCCCCEEEEeeeeecCC---ccc--cc-------cccEEEEEEE Confidence 457888777754332 222 46666654454 246899999932111111 111 11 1235677888 Q ss_pred EechHH-----HHHHHHHHHHhCChhHHHHHHHcCeeee---cCCccccchhhhhhhhheeeeeEEEEEEEe Q lcl|NC_017981. 85 VFGGAA-----RRHLDNLRSRTKKMSSRDIMTRERFIIY---ATEQVLDVNITRSEIYAEPSAILDMGFRFT 148 (181) Q Consensus 85 ~fG~~A-----~~~ld~L~~~lk~ps~~~~l~~~giai~---d~g~Vqdlt~L~ea~~yE~RA~lel~irY~ 148 (181) +||... -+++.++...+.. ...-.|..|. ...+.|-++--. --+.=.+++++|.++|- T Consensus 63 VW~~~~~R~~v~~i~~~i~~~l~~-----~~~t~~y~~~~~~~~~~~qii~D~s-t~~~l~Hgil~l~f~~~ 128 (128) T protein:vir:96 63 VWGRVDDRKTLSDMAGQLMSSFFT-----IKNIDGMQFSAEVNESSIDSNRDNS-TDEVLYHFIIYTYFKFI 128 (128) T ss_pred EEECCCCchhHHHHHHHHHHHhhh-----hhccCCeEEEEEEeeeeEEEeeecC-CCceeeEEEEEEEEEeC Confidence 887654 4444444444322 1344455552 223333222211 11345689999999998 No 44 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=31.52 E-value=1.5 Score=19.63 Aligned_cols=139 Identities=9% Similarity=0.128 Sum_probs=69.4 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-.| .|.....++.||+++--...++. +.... .-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~--al~alvg~rV--yD~~P~~a~~PyV~lG~~~~~~~-----~~~~~-------~g~~~ 64 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNP--IVSKQLGGRV--FDCVQKDAVYPYIVVGETNVTNK-----ETTTS-------MFEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh--hHHHhhcccc--ccCCccCCCCCEEEeCcceeeec-----CCCcc-------cceEE Confidence 55 455566777765544322 2455555443 44455567899999922111111 11011 12355 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++. .++.+++..++..|..+ +.-.|..+.+.. +..--.+++......++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~ea~~ia~av~~aL~a~-----l~l~~~~lv~l~-~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~-- 136 (145) T protein:vir:10 65 GVTLHVYSQARNRDEASQIIQYLGFVLNSE-----IEINNYSFIKSR-IDTQEVITDIDQYTKHGIIRLIFKYRHNTL-- 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhCCC-----cCCCCCeEEEEE-EeeeeEeecCCCceEEEEEEEEEEEeeccc-- Confidence 66777773 34577777777777433 222233332221 112223334333467888888887764432 Q ss_pred cCceeeeEEecCCc Q lcl|NC_017981. 155 VGLIEHVKLTGEID 168 (181) Q Consensus 155 Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.. T Consensus 137 -----~~~~~~~~~ 145 (145) T protein:vir:10 137 -----QRSVTNGAE 145 (145) T ss_pred -----cccccccCC Confidence 233555552 No 45 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=30.86 E-value=1.5 Score=19.55 Aligned_cols=139 Identities=8% Similarity=0.125 Sum_probs=69.3 Q ss_pred CC-ccccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeE Q lcl|NC_017981. 1 MA-EQETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEG 79 (181) Q Consensus 1 ~~-~~~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~ea 79 (181) |. .-+.-.+.||+.-.-+-+ -|..|++-.| .|.....++.||+++--...++. +.... .-.+. T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~--al~alvg~rV--yD~~P~~a~~PyV~lG~~~~~~~-----~~~~~-------~g~~~ 64 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNP--IIKKQLGGRV--FDCVQKDAVYPYIVVGETNVTNK-----ETTTS-------MFEDV 64 (145) T ss_pred CchhHHHHHHHHHHHHhhcCh--hHHHhhcccc--ccCCccCCCCCEEEeCcceeeec-----CCCcc-------cceEE Confidence 55 455566677765543322 2455555443 44455567899999922111111 11011 12355 Q ss_pred EEEEEEec-----hHHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeeeeccc Q lcl|NC_017981. 80 TLSVKVFG-----GAARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQRFTDN 154 (181) Q Consensus 80 tvelq~fG-----~~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~~~d~ 154 (181) ++.|.++. .++.+++..++..|..+ +.-.|..+.+.. +..--.+++......++++.|+++.+-.-. T Consensus 65 ~~ti~Vws~~~g~~ea~~ia~av~~aL~a~-----l~l~~~~lv~l~-~~~~~~~rd~dg~~~hgvl~~ra~ve~~~~-- 136 (145) T protein:vir:10 65 GVTLHVYSQARNRDEASQIIQYLGFVLNSE-----IEINNYSFIKSR-IDTQEVITDIDQYTKHGIIRLIFKYRHNTL-- 136 (145) T ss_pred EEEEEEEEcCCCHHHHHHHHHHHHHHhCCC-----cCCCCCeEEEEE-EeeeeEeecCCCceEEEEEEEEEEEeeccc-- Confidence 66777773 34577777777777433 222233332221 112223334333467888888887764432 Q ss_pred cCceeeeEEecCCc Q lcl|NC_017981. 155 VGLIEHVKLTGEID 168 (181) Q Consensus 155 Vg~Ie~Vevtg~~~ 168 (181) +-.||.+.. T Consensus 137 -----~~~~~~~~~ 145 (145) T protein:vir:10 137 -----QRSVTNGAE 145 (145) T ss_pred -----cccccccCC Confidence 233555552 No 46 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=26.94 E-value=1.9 Score=19.06 Aligned_cols=114 Identities=16% Similarity=0.163 Sum_probs=52.6 Q ss_pred hhhhhHHHHHHHHHHHHHHhhCC-ccEeecC--CCC-CCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEE Q lcl|NC_017981. 8 IDQFVPDEVEAAAYRVLSPLLPE-MMLCYEG--QNH-NISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSV 83 (181) Q Consensus 8 ~~~~i~~l~~~~a~~~ls~ll~~-pvI~Adq--Ng~-~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvel 83 (181) .+..||.+..+-+. |..||+. | .+.++ -.| ..+.||+++..-+.. ....--|+ ..+..++++| T Consensus 1 m~~~i~~~l~~d~~--v~allg~~~-~Rvyp~~~aP~~~~~Pyiv~q~vsg~-p~~~l~g~---------~~~~~~~vQI 67 (121) T protein:vir:43 1 MYPPIFKVCSSSPA--VTAILGASP-LRMYQFGLAPQLVVKPYATWQTISGS-PENYLWGR---------PDADGFTIQV 67 (121) T ss_pred CChHHHHHHhhChh--hhhhhcCCC-ceeeccCCCCCCCcCCeEEEEEecCc-ccceecCC---------CCcceeEEEE Confidence 56678888776544 4455552 2 22222 112 345799999654433 22111222 2345678999 Q ss_pred EEech---HHHHHHHHHHHHhCChhHHHHHHHcCeeeecCCccccchhhhhhhhheeeeeEEEEEEEeee Q lcl|NC_017981. 84 KVFGG---AARRHLDNLRSRTKKMSSRDIMTRERFIIYATEQVLDVNITRSEIYAEPSAILDMGFRFTQR 150 (181) Q Consensus 84 q~fG~---~A~~~ld~L~~~lk~ps~~~~l~~~giai~d~g~Vqdlt~L~ea~~yE~RA~lel~irY~~~ 150 (181) .||+. +|.+..++++..+ +....++...+ ++ ..-+.. =.|..+|+. +...+ T Consensus 68 DvyA~t~~~A~~l~~av~~Al---------~~~~~~~~~~~--~~--ye~dT~--lyR~s~Dv~-w~~~r 121 (121) T protein:vir:43 68 DIFSATAAEARDAAKAIRDAI---------ELSAYVVRWGG--ES--VDPDTK--TYRVSFDVD-WIVQR 121 (121) T ss_pred EeeeCCHHHHHHHHHHHHHHh---------hhcCCcccCCC--CC--Cccccc--ceeeeeEEE-EeecC Confidence 99965 3455555555443 33333222111 00 000111 134455544 22222 No 47 >protein:vir:2741 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695114;genbank:gi:23455883;genbank:GeneID:955650 Probab=21.99 E-value=2.5 Score=18.39 Aligned_cols=124 Identities=14% Similarity=0.066 Sum_probs=60.5 Q ss_pred ccChhhhhHHHHHHHHHHHHHHhhCCccEeecCCCCCCCCceEEEEEcccCCccccccCCCCCcCceeeeeeeeEEEEEE Q lcl|NC_017981. 5 ETTIDQFVPDEVEAAAYRVLSPLLPEMMLCYEGQNHNISPPYATLHVIARRDVGNPEFGPVNDDGIQQIHQVIEGTLSVK 84 (181) Q Consensus 5 ~~t~~~~i~~l~~~~a~~~ls~ll~~pvI~AdqNg~~P~~PYatl~vr~~~~~~~~~~g~vdd~G~q~V~~h~eatvelq 84 (181) =.||++++|+.+=+..- . ++.||.=.=+.. ..+.||.+|.=.... ..+.-++ --.+.++.|. T Consensus 1 M~sp~qeL~~~lf~~l~----~-~g~~vyD~lP~~-~~~YPfV~ig~~~~~-----~~~tkt~-------~~g~~~l~i~ 62 (128) T protein:vir:27 1 MKQPDQLLHDEMYRISC----E-LGYNTYTYLPPD-DAAYPFVVMGETMVL-----PQSTKSH-------LIGRLSSTVH 62 (128) T ss_pred CCCHHHHHHHHHHHHHH----h-cCCceeccCCCC-CCCcCEEEeccceec-----CCccccc-------cccEEEEEEE Confidence 45788888776544332 2 355655433322 357899999221111 1111111 2235678888 Q ss_pred EechHH-HHHHHHHHHHhCChhHHHHHHHcCeeee---cCCccccchhhhhhhhheeeeeEEEEEEEe Q lcl|NC_017981. 85 VFGGAA-RRHLDNLRSRTKKMSSRDIMTRERFIIY---ATEQVLDVNITRSEIYAEPSAILDMGFRFT 148 (181) Q Consensus 85 ~fG~~A-~~~ld~L~~~lk~ps~~~~l~~~giai~---d~g~Vqdlt~L~ea~~yE~RA~lel~irY~ 148 (181) +||... +-.+.++..++..-... ...-.|.-|. +....|-++-- +.-..=.+++++|.+++- T Consensus 63 vW~~~~~R~~v~~i~~~i~~~~~~-~~~t~~y~~~~~~~~~~~qil~Dt-st~~~l~Hgii~l~f~~~ 128 (128) T protein:vir:27 63 VWGHVDDRKTLSDMAGQLMSSFFA-IKKIGGKQFSAEVNESSIDSNRDN-STDEVLYHFIIYTYFKFI 128 (128) T ss_pred EEECCcchhHHHHHHHHHHHHhcc-ccccCCeeEEEEeecceEEEeeec-CCCceeeEEEEEEEEEeC Confidence 898654 44444444444322211 1222343332 22222322211 111235689999999998 Done!