WO2016020528A1 - New hydroxynitrile lyases - Google Patents
New hydroxynitrile lyases Download PDFInfo
- Publication number
- WO2016020528A1 WO2016020528A1 PCT/EP2015/068262 EP2015068262W WO2016020528A1 WO 2016020528 A1 WO2016020528 A1 WO 2016020528A1 EP 2015068262 W EP2015068262 W EP 2015068262W WO 2016020528 A1 WO2016020528 A1 WO 2016020528A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- hnl
- protein
- recombinant
- seq
- cyanohydrin
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/002—Nitriles (-CN)
- C12P13/004—Cyanohydrins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
Definitions
- the present invention relates to novel hydroxynitrile Iyases (HNL), which are capable to catalyze the asymmetric cyanohydrin reaction and wherein the HNLs have a tertiary protein structure comprising a dimer of 7 anti-parallel beta-strands, 2 alpha- helices and a large cavity in each monomer (Allergen BV1 -like type fold).
- HNL novel hydroxynitrile Iyases
- the invention further relates to the use of hydroxynitrile Iyases for producing enantiopure cyanohydrin compounds.
- HNLs Hydroxynitrile Iyases
- cyanogenic pathway a defense mechanism widespread in the plant kingdom. More importantly, they are valuable tools in biocatalysis, due to their ability to synthesize chiral a-cyanohydrins by a C-C bond forming condensation reaction. A chiral center is formed, the carbon chain is prolonged by one carbon atom and an additional versatile functional group - the nitrile - is introduced to the molecule.
- Enantiopure cyanohydrins are versatile building blocks and intermediates that serve as starting material for many enzymatic and chemical follow-up reactions, which find application in pharmaceutical, agrochemical and cosmetic industries (e.g.
- the enzymes need to fulfill several criteria: (i) availability of sufficient quantities of proteins with constant quality and batch-to-batch reproducibility at low cost, (ii) broad substrate range, (iii) high stability under acidic pH and high solvent stability and (iv) activity at low temperatures because the unselective chemical background reaction is significantly suppressed at low pH ( ⁇ 4.5), low temperature and in the absence of water.
- PhaWUL Phlebodium aureum
- novel HNL enzymes which are capable to catalyze the asymmetric cyanohydrin reaction. It is a further objective of the present invention to provide novel HNL enzymes comprising the following properties:
- molecular weight 20 ⁇ 5 kDa when measured by SDS-polyacrylamide electrophoresis; and tertiary protein structure: dimer of 7 anti-parallel beta-strands, 2 alpha-helices and a large cavity in each monomer (Allergen BV1 -like type fold).
- the method comprises the steps of providing an aldehyde or ketone compound and converting the compound to the corresponding asymmetric cyanohydrin compound in the presence of a cyanide donor and an HNL enzyme.
- the invention relates in particular to a selective Z7 HNL, which can stereoselectively catalyze the asymmetric cyanohydrin addition.
- Fig 1 Silver-stained BN-PA gel and related filter of an in-gel HNL activity assay. Extracted and QFF-purified D. teyermanii p rote i n s : Raw protein extract (Extr), Flow through (FT), Active fractions (A9-B2). Substrate: racemic mandelonitrile - 100 mM citrate buffer, pH 4.5.
- Fig. 2 SDS-PA gel of £>/HNL1 expression in E. co//(pMS and pET system) and P. pastoris).
- S soluble fraction
- P insoluble fraction
- T4.F1 , T4.B7 and B1.G4 three different P. pastoris clones expressing /HNL1.
- Marker Page Ruler, pre-stained protein ladder (Thermo Scientific). The arrow indicates the protein of interest.
- Fig. 4 Protein sequence of DMHL2
- Fig. 7 Primers for cloning £>/HNL1 -4 in the pEHISTEV vector.
- Fig. 8 Size exclusion chromatogram.
- Solid line Gel Filtration Standard.
- X-axis elution volume, y-axis UV absorption at 280 nm. Peak eluting at 7.7 ml: thyroglobulin (bovine) 670 kDa; at 12.77 ml: ⁇ -globulin (bovine) 158 kDa; at 14.38 ml: ovalbumin (chicken) 44 kDa; myoglobin (horse) 17 kDa; at 20.23 ml: vitamin B12 1.35 kDa.
- Fig. 9a Ribbon diagram of birch pollen allergen BET V 1 L (PDB Code 1 FM4);
- Fig. 9b Ribbon diagram of £ ⁇ 4HNL1 soaked with benzaldehyde.
- Fig. 10 Representation of the active site of 3 ⁇ 4HNL1 and its bound substrate benzaldehyde. Residue numbering refers to the sequence including HIS-tag, TEV cleavage site and spacer residues.
- the invention relates to a protein with hydroxynitrile lyase (HNL) activity comprising the following properties:
- tertiary protein structure dimer of 7 anti-parallel beta-strands, 2 alpha-helices and a large cavity in each monomer (Allergen BV1 -like type fold);
- the protein fold of the protein is related to the major Birch pollen allergen Bet v 1.
- the protein fold is characterized by 7 anti-parallel beta-strands, 2 alpha-helices and a large cavity.
- the tertiary structure of the allergen BV1 can be retrieved from the protein data bank server under the pdb code 1 FM4 and is depicted in Fig. 9 (top ribbon diagram)
- amino acid sequence shows similarity to the SRPBCC superfamily of proteins.
- SRPBCC means
- SRPBCC START/RHO alpha C/PITP/Bet v1/CoxG/CalC
- steroidogenic acute regulatory protein StAR
- STARD1 - STARD15 C-terminal catalytic domains of the alpha oxygenase subunit of Rieske-type non-heme iron aromatic ring-hydroxylating oxygenases (RHOs alpha C)
- RHOs alpha C Rieske-type non-heme iron aromatic ring-hydroxylating oxygenases
- PITPs phosphatidylinositol transfer proteins
- Bet v 1 the major pollen allergen of white birch, Betula verrucosa
- CoxG CalC
- Other members of this superfamily include PYR/PYL/RCAR plant proteins, the aromatase/cyclase (ARO/CYC) domains of proteins such as
- Streptococcus mutans Smu.440 and related proteins are related proteins.
- a further aspect of the invention is an isolated polypeptide, wherein said polypeptide is derived from a plant, in particular from Polypodiades plants, in particular from the Davalliaceae family, and has at least activity of a hydroxynitrile lyase.
- Hydroxynitnle lyase activity could be determined as described in Example 1 1.
- the protein has a hydroxynitnle lyase activity of at least 10 U/mg, preferably 100 U/mg, preferably 300 U/mg, preferably of at least 500 U/mg enzyme.
- a further aspect of the invention is the isolated polypeptide as described herein, wherein the plants are of genus Davallia, in particular of species Da va Ilia teyermanii.
- a further aspect of the invention is the HNL as described herein, wherein said HNL is a recombinant HNL.
- a further aspect of the invention is the recombinant hydroxynitrile lyase (HNL) as described above, comprising SEQ ID NO: 1 or a variant amino acid sequence having at least 35% sequence identity with SEQ ID NO:1.
- a further aspect of the invention is the recombinant hydroxynitrile lyase (HNL) as described above, comprising SEQ ID NO: 1 or a variant amino acid sequence having at least 50% sequence identity with SEQ ID NO: 1.
- a further aspect of the invention is the recombinant HNL as described above, wherein said variant amino acid sequence having at least 75%, preferably at least 90 % identity with SEQ ID NO: 1.
- the invention relates to the recombinant HNL as described above, wherein the amino acid sequence of the HNL is at least 35%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98% identical to SEQ ID NO: 1.
- a further aspect of the invention is the recombinant HNL as described above having the SEQ ID NO: 1 (0/HNL1 ), SEQ ID NO:2 (£3 ⁇ 4HNL2), SEQ ID NO:3 (£>/HNL3), or SEQ ID NO:4 (£3 ⁇ 4HNL4).
- the invention encompasses proteins, in particular fusion proteins or truncations of the protein or parts of the protein with HNL activity, which comprise at least one polypeptide of the invention.
- a further aspect of the invention is an isolated polynucleic acid molecule encoding a recombinant HNL as described above.
- the present invention also relates to nucleic acids coding for the HNL as described above, which are complementary to such nucleic acid sequences which hybridize with coding nucleic acids under stringent conditions.
- a further aspect of the invention is a method to obtain further members of the BetV1 family with HNL activity.
- the method comprises the use of the information from SEQ ID NO: 1 , 2, 3 or 4 or the tertiary structure of these proteins as query to search for related sequences in sequence databases, structure databases or sequences obtained by DNA sequencing projects.
- SEQ ID NO: 1 , 2, 3 and 4 can be used to produce primers for the identification and cloning of directly homologous forms using PGR.
- probes to investigate further naturally occurring functional variants of the / /HNL genes and thus the corresponding coding enzyme variants can be used.
- SEQ ID NO: 1 or from allelic or naturally occurring functional variants thereof, e.g. via PGR using a deficient DNA polymerase a bank of artificially produced functional enzyme variants can be produced.
- standard methods can be used to introduce individual point mutations into the DNA sequence which lead to amino acid exchanges; this means that the protein's properties such as substrate specificity can be changed, or the protein sequence can be changed without changing the fold and activity of the protein.
- sequences coding for the protein can also be used in form of synthetic DNA constructs which are optimized with respect to their codons, DNA/RNA stability, facilitate translation or in order not to hybridize with sequences of SEQ ID NO: 1 , 2, 3 or 4.
- the coding DNA sequences can be cloned into routine vectors and, after transforming host cells with said vectors, can be expressed in cell culture.
- suitable expression vectors are pET-26b(+) and pEHISTEV for E. coli, and also expression vectors of other prokaryotic single cell organisms can be used.
- suitable vectors for P. pastor is ar the pPpT4 S and pPpB1 shuttle vectors.
- the gene may be integrated into the genome of the host cell. Also vectors of other eukaryotic single cell organisms can be used.
- the expression vectors of the invention may contain further functional sequence regions, such as a replication start point, operators, termination signals, coding sequences for antibiotic resistance, tags which facilitate purification (for example a His-tag or a Strep-tag) or other peptide sequences which are produced by the fusion proteins.
- a shuttle vector may contain functional sequence regions as described above and further homologous sequences for correct integration into the genomes.
- a further aspect of the invention is a vector comprising an isolated DNA molecule as described above.
- the vector comprises all regulatory elements necessary for efficient transfection as well as for efficient expression of proteins.
- Such vectors are well known in the art and any suitable vector can be selected for this purpose.
- a further aspect of the present invention relates to a recombinant host cell which is transfected with a vector as described above.
- the host cell is a non-human cell, in particular an E. col i O P. pastoris ce ⁇ .
- Transfection of cells and cultivation of recombinant cells can be performed according to well-known methods in the art.
- Such a recombinant cell as well as any therefrom descendant cell comprises the vector.
- a cell line is provided which expresses the inventive HNL protein either continuously or upon activation depending on the vector.
- host cells can be transformed using the usual methods, such as for example the heat shock method or electroporation.
- a further aspect of the present invention is constituted by expression systems which comprise host cells or host cell cultures which are transformed with the vector systems described above.
- Preferred hosts are single-cell prokaryotic organisms, in particular E coli.
- eukaryotic genes it may be advantageous to use eukaryotic expression systems in order, for example, to introduce post- translational modifications or to facilitate folding of enzymes from eukaryotes.
- Particularly suitable eukaryotic host cells are yeast cells, in particular P. pastoris cells.
- a further aspect of the invention is a recombinant non-human cell obtained by introducing a vector as described herein.
- the expression systems can be cultivated using standard protocols which are known to the skilled person. After expression of a protein with HNL activity of the invention, it is purified, for example by centrifuging and/or using chromatography. In order to carry out catalytic reactions, the purified enzymes and also the raw extracts or centrifugation residues or fractions can be used directly.
- a further aspect of the invention is a culture obtained by culturing the recombinant cell as described herein.
- a further aspect of the invention is a recombinant HNL recovered from the culture as described herein.
- the invention relates to an HNL recovered from the culture as described herein.
- the protein can be isolated by disrupting the cells and recovering the protein from the supernatant.
- the whole culture broth including cells and supernatant culture media
- the culture supernatant can be used as active catalyst preparation without any further isolation or purification or the culture supernatant can be used as active catalyst.
- the invention relates to a method for producing an HNL, comprising recovering the HNL from a culture as described herein. Said isolation or purification from the cell culture can be performed by methods known in the art.
- lyophilization, precipitation, affinity tagging or co-precipitation, affinity chromatography, anion-exchange chromatography and size exclusion chromatography can be used to isolate said protein.
- a further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein an aldehyde or ketone compound is converted to the corresponding cyanohydrin compound in the presence of a cyanide donor and an HNL as described herein.
- a further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein an aldehyde or ketone compound is converted to the corresponding cyanohydrin compound in the presence of a cyanide donor and a recombinant HNL as described herein.
- a further aspect of the invention is the method for producing enantiopure cyanohydrin compounds, wherein a compound of general formula I is reacted with a compound of general formula II in the presence of an HNL to yield an asymmetric cyanohydrin compound III
- R 1 and R 2 are independently from one another H, Ci-2oalkyl, C2-2oalkenyl, C2-2oalkynyl, Ca-iocycloalkyl, C 4- 2ocycloalkylalkyl, Ce-uaryl, C7-2oarylalkyl, 3-14 membered heterocycioaikyi, 4-20 membered heterocycloalkylalkyl, 5-20 membered heteroaryl or 6-20 membered heteroarylalkyl, optionally substituted by one or more R a ;
- R 3 is H, an alkali metal, C(CH 3 )2OH , or Ci-2oalkyl; and
- each R a is independently H, halogen, -CF 3 , -OR b , -NR b R b , -(CH 2 ) n COOR b ,
- each R b is independently H or optionally substituted Ci-2oalkyl, C2-2oalkenyl, or
- n 0, 1 , 2 or 3.
- a further aspect of the invention is the method for producing enantiopure cyanohydrin compounds, wherein a recombinant HNL is used.
- alkyl group if not stated otherwise, denotes a linear or branched
- Ci-2oalkyl preferably a linear or branched chain of one to twenty carbon atoms.
- alkenyl group if not stated otherwise, denotes a partially unsaturated linear or branched C2-2oalkenyl, preferably a linear or branched chain of two to twenty carbon atoms that contains at least one double bond.
- alkynyl group if not stated otherwise, denotes a partially unsaturated linear or branched C2-2oalkynyl, preferably a linear or branched chain of two to twenty carbon atoms that contains at least one triple bond.
- a cycloalkyi group denotes a monocyclic non-aromatic hydrocarbon ring containing three to ten carbon atoms, preferably four to six carbon atoms, or a bicyclic non-aromatic hydrocarbon ring system containing seven to ten carbon atoms, preferably eight to ten carbon atoms, wherein the cycloalkyi group optionally comprises one or more double bonds.
- a heterocycioaikyi group denotes a monocyclic non-aromatic hydrocarbon ring containing three to fourteen carbon atoms, preferably four to eight carbon atoms, or a bicyclic non-aromatic hydrocarbon ring system containing seven to fourteen carbon atoms, preferably eight to ten carbon atoms, wherein in the heterocycioaikyi group one or more of the carbon atoms of the hydrocarbon ring or ring system is replaced by a group selected from the group comprising -N-, -O-,
- An aryl group preferably denotes a mono- or bicyclic, preferably monocyclic aromatic hydrocarbon group having six to fourteen carbon atoms; the aryl group is preferably phenyl.
- a heteroaryl group denotes an aromatic 5- or a 6- membered monocyclic hydrocarbon group wherein at least one of the carbon atoms is replaced by a heteroatom like O, N, and/or S, and wherein the aromatic monocyclic 5- or 6- membered cyclic hydrocarbon group is optionally fused to a further monocyclic 5- to 7-membered, preferably 5- or 6-membered, aromatic or nonaromatic hydrocarbon ring, wherein in the further monocyclic aromatic or nonaromatic hydrocarbon ring one or more, preferably one or two carbon atoms may be replaced by a heteroatom like O, N, and/or S.
- a halogen group is chlorine, bromine, fluorine or iodine.
- the method according to the invention can be carried out in a mono- or biphasic system or in an emulsion.
- the monophasic reaction solution comprises an aqueous or an organic solvent.
- aqueous solutions are for example water, a hydroxynitrile lyase containing solution, or a buffer solution.
- buffer solutions are phosphate buffer, citrate buffer, acetate buffer, borate buffer, MES, HEPES, Tris buffer, or mixtures thereof.
- the pH of these solutions can be between pH 1 and 8, preferably from 2 to 5.
- Appropriate organic solutions can be slightly water-miscible or water immiscible aliphatic or aromatic hydrocarbons, which are optionally halogenated, alcohols, ethers or esters or mixture thereof or the substrate itself.
- Suitable organic solutions are for example, but not limited to ethyl acetate, butyl acetate, methyl tert- butyl ether, diisopropyl ether, dibutyl ether, carbon tetrachloride, benzene, toluene, cyclohexane, hexadecane, hexane, heptane, chloroform, xylene, pentanol, hexanol, octanol and dodecanol, benzaldehyde, or mixtures thereof.
- neoteric solvents which refers to ionic liquids and supercritical fluids that have remarkable new properties. These neoteric solvents are characterized by physical and chemical properties that can be finely tuned for a range of applications by varying the chemical constituents in the case of ionic liquids and by varying physical parameters in the case of supercritical fluids.
- the advantages of conducting bioconversions in aqueous - organic solvent two-liquid phase systems are well known in the art.
- the biphasic system consists of two phases mutually not miscible, e.g. an aqueous and an organic phase.
- a further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the reaction is carried out in a mono- or biphasic system or in an emulsion.
- the invention relates to a method for producing enantiopure cyanohydrin compounds, wherein the biphasic system comprises aqueous and organic solution as described herein.
- the conversion reaction moreover takes place at temperatures of from -10°C to +50°C, preferably at 0°C to 25°C, more preferred at 0°C to 10°C.
- a further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the recombinant HNL is a purified enzyme or is contained in cleared lysate.
- a further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the cyanohydrin compound is obtained with at least 50%, preferably with at least 60%, more preferred with at least 75% enantiomeric excess (e.e.).
- a further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the asymmetric cyanohydrin compound is obtained with at least 50%, preferably with at least 75%, more preferred with at least 90% enantiomeric excess (e.e.).
- a further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the cyanohydrin compound is obtained with a conversion rate of at least 15%, preferably with at least 20%, more preferred with at least 50%.
- the invention relates to a method for producing enantiopure cyanohydrin compounds, wherein the used Z7 HNL is a purified enzyme, or contained in whole cells or in the fermentation broth or in a cleared lysate or in immobilized form, for example on a support such as Celite®, Avicel, etc. or as cross-linked enzyme crystals or as cross-linked enzyme aggregates among other immobilization methods.
- a support such as Celite®, Avicel, etc. or as cross-linked enzyme crystals or as cross-linked enzyme aggregates among other immobilization methods.
- the active site of the HNL comprises an arginine, an aspartate, a serine and three tyrosine residues.
- amino acid residues corresponding to R69, D85, S87, Y101 , Y1 17 and Y161 of SEQ ID NO: 1 - 4 contribute to the HNL activity of the novel HNL enzymes.
- Alignment refers to the process or result of matching up the nucleotide or amino acid residues of two or more biological sequences to achieve maximal levels of identity and, in the case of amino acid sequences, conservation, for the purpose of assessing the degree of similarity and the possibility of homology.
- conserved substitution refers to a change at a specific position of an amino acid or, less commonly, DNA sequence that preserves the physico-chemical properties of the original residue or achieves a positive score in the governing scoring matrix.
- Identity or sequence identity refers to the extent to which two (nucleotide or amino acid) sequences have the same residues at the same positions in an alignment, often expressed as a percentage identity of the amino acid residues, based on the total number of amino acid residues of the amino acid sequence. If the two sequences, which are being compared, have a different number of amino acid residues, the percentage is based on the number of the shorter amino acid sequence.
- Homology refers to similarity attributed to descent from a common ancestor. Homologous biological components (genes, proteins, structures) are called homologs. Often the term homolog is used interchangeably with the term variant, meaning rather a certain degree of identity between two sequences without implying a common ancestor. Therefore within the meaning of the invention, the term variant of a sequence means a sequence with a certain degree of identity to the sequence, if not otherwise stated.
- Similarity refers to the extent to which nucleotide or protein sequences are related. Similarity between two sequences can be expressed as percent sequence identity and/or percent positive substitutions.
- sequence identity, similarity and positives values provided herein refer to the value obtained using the BLAST suite of programs using default parameters, a description is found in Altschul et al., J. Mol. Biol., 1990, 215,
- sequence can mean a nucleotide sequence, which constitutes a polynucleotide, or an amino acid sequence, which constitutes a peptide.
- Heterologous nucleic acid sequence or "nucleic acid sequence heterologous to a host” means a nucleic acid sequence which encodes e. g. an expression product such as a polypeptide that is foreign to the host ("heterologous expression” or
- heterologous product e.g. a nucleic acid sequence originating from a donor different from the host or a chemically synthesized nucleic acid sequence which encodes e. g. an expression product such as a polypeptide that is foreign to the host.
- heterologous protein means a protein that is foreign to the host.
- Form to the host can mean, that the nucleic acid or protein originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition or genomic locus by deliberate human intervention.
- a heterologous nucleic acid sequence as referred herein encompasses also nucleic acids, which are codon optimized for the host according to the codon usage of the host.
- a cell which contains or shall contain a vector or an isolated nucleic acid sequence, and supports the replication and/or expression of the vector or of the nucleic acid. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell.
- Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, plant, amphibian, or mammalian cells.
- a “vector”, “vector expressible in a host” or “expression vector” are used interchangeably and is a polynucleic acid construct that is used in transfection or transformation of a host cell and into which a polynucleotide can be inserted, it is generated recombinantly or synthetically, with a series of specified polynucleic acid elements that permit transcription of a particular nucleic acid sequence in a host cell.
- this vector includes a transcriptional unit comprising a particular nucleic acid sequence to be transcribed operably linked to a promoter.
- Vectors are often replicons.
- a vector expressible in a host can be e. g. an autonomously or self-replicating plasmid, a cosmid, a phage, a virus or a retrovirus.
- the term "introduced” in the context of inserting a nucleic acid into a cell means “transfection” or “transformation” or “transduction” and includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
- Transduction is usually used for the process of injection of foreign DNA by a virus into the host bacterium.
- Transfection is the process of deliberately introducing nucleic acids into cells.
- the term is often used for the introduction of foreign DNA into eukaryotic cells, notably for non-viral methods.
- Competence is the ability of a cell to take up extracellular DNA from its environment.
- the terms “transformation”, “transformed” or “introducing a nucleic acid into a host cell” are used interchangeably and denote any process wherein an extracellular nucleic acid like a vector, with or without accompanying material, enters a host cell.
- the term “cell transformed” or “transformed cell” means the cell or its progeny into which the extracellular nucleic acid has been introduced and thus harbors the extracellular nucleic acid.
- the nucleic acid might be introduced into the cell so that the nucleic acid is replicable either as a chromosomal integrant or as an extra chromosomal element. Transformation of appropriate host cells with e. g.
- an expression vector can be accomplished by well-known methods such as microinjection, electroporation, particle bombardment or by chemical methods such as Calcium phosphate-mediated transformation, described e. g. in Maniatis et al. , 1982, Molecular Cloning, A laboratory Manual, Cold Spring Harbor Laboratory, in Ausubel et al. , 1994, Current protocols in molecular biology, John Wiley and Sons, or in "Molecular Cloning: A laboratory Manual, Third edition (3 volume set)", Cold Spring Harbor Laboratory Press, 2001.
- nucleic acid or nucleic acid sequence or polynucleotide might be DNA, RNA, or DNA/RNA hybrid. Unless otherwise indicated, the terms include reference to the specified sequence as well as the complementary sequence thereof. In case the nucleic acid sequence is located on a vector it is usually DNA.
- DNA which is referred to herein can be any polydeoxynuclotide sequence, including, e.g.
- double-stranded DNA single-stranded DNA, double-stranded DNA wherein one or both strands are composed of two or more fragments, double-stranded DNA wherein one or both strands have an uninterrupted phosphodiester backbone, DNA containing one or more single-stranded portion(s) and one or more double-stranded portion(s), double- stranded DNA wherein the DNA strands are fully complementary, double-stranded DNA wherein the DNA strands are only partially complementary, circular DNA, covalently- closed DNA, linear DNA, covalently cross-linked DNA, cDNA, chemically synthesized DNA, semi-synthetic DNA, biosynthetic DNA, naturally-isolated DNA, enzyme-digested DNA, sheared DNA, labeled DNA, such as radio labeled DNA and fluorochrome-labeled DNA, DNA containing one or more non-naturally occurring species of nucleic acid.
- DNA sequences can be synthesized by standard chemical techniques, for example, the phosphotriester
- RNA which is referred to herein can be e.g. single-stranded RNA, cRNA, double stranded RNA, double stranded RNA wherein one or both strands are composed of two or more fragments, double-stranded RNA wherein one or both strands have an uninterrupted phosphodiester backbone, RNA containing one or more single-stranded portion(s) and one or more double-stranded portion(s), double- stranded RNA wherein the RNA strands are fully complementary, double-stranded RNA wherein the RNA strands are only partially complementary, covalently cross- linked RNA, enzyme digested RNA, sheared RNA, mRNA, chemically-synthesized RNA, semi-synthetic RNA, biosynthetic RNA, naturally isolated RNA, labeled RNA, such as radiolabeled RNA and fluorochrome-labeled RNA, RNA containing one or more non-naturally occurring
- variants or “variants of a sequence” is meant a nucleic acid sequence that vary from the reference sequence by nucleic acid substitutions, preferably by conservative nucleic acid substitution, whereby one or more nucleic acids are substituted by another with same characteristics. Variants encompass as well degenerated sequences, sequences with deletions and insertions, as long as such modified sequences exhibit the same function (functionally equivalent) as the reference sequence.
- polypeptide refers to a polymer of amino acid residues connected to the other by peptide bonds between the alpha-amino and carboxy groups of adjacent residues.
- the terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
- isolated and purified nucleic acid sequence refers to the state in which the nucleic acid sequence will be free or substantially free of material with which they are naturally associated such as other nucleic acids with which they are found in their natural environment, or the environment in which they are prepared (e. g. cell culture) when such preparation is by recombinant technology practiced in vitro or in vivo.
- LB broth and LB medium are used interchangeable.
- the "origin of replication" (also called the replication origin) is a particular sequence in a genome at which replication is initiated. This can either involve the replication of DNA in living organisms such as prokaryotes and eukaryotes, or that of DNA or RNA in viruses, such as double-stranded RNA viruses. DNA replication may proceed from this point bidirectionally or unidirectionally.
- Promoter refers to a nucleic acid sequence that regulates expression of a transcriptional unit.
- a “promoter region” is a regulatory region capable of binding RNA polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence. Within the promoter region will be found a transcription initiation site (conveniently defined by mapping with nuclease S1), as well as protein binding domains (consensus sequences) responsible for the binding of RNA
- polymerase such as the putative -35 region and the Pribnow box.
- Signal sequence refers to a nucleic acid sequence which encodes a short amino acid sequence (i.e., signal peptide) present at the NH2-terminus of certain proteins that are normally exported by cells to non- cytoplasmic locations (e.g., secretion) or to be membrane components. Signal peptides direct the transport of proteins from the cytoplasm to non-cytoplasmic locations.
- 'Translation initiation region is a signal region which promotes translation initiation and which functions as the ribosome binding site such as the Shine Dalgarno sequence.
- transcription terminator and “transcription termination region” are used interchangeably and indicate a section of genetic sequence that marks the end of gene or operon on genomic DNA for transcription. It causes RNA polymerase to terminate transcription.
- the transcription termination region is usually part of a transcriptional unit and increases the stability of the mRNA.
- [001 1 1] 'Transcriptional unit refers to a nucleic acid sequence that is normally transcribed into a single RNA molecule.
- the transcriptional unit might contain one gene (monocistronic) or two (dicistronic) or more genes (polycistronic) that code for functionally related polypeptide molecules.
- a nucleic acid sequence is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence.
- DNA for a signal sequence is operably linked to DNA for a protein if it is expressed as a preprotein that participates in the secretion of the protein;
- a promoter is operably linked to a coding sequence if it affects the transcription of the sequence;
- a translation initiation region such as a ribosome binding site is operably linked to a nucleic acid sequence encoding e. g. a polypeptide if it is positioned so as to facilitate translation of the polypeptide.
- Linking can be accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice.
- the assay detects the cyanogenesis reaction. It was performed on microtiter plate format. A final volume of 150 L contained 10 - 30 pL of sample and 20 pL of racemic mandelonitrile (30 - 90 mM stock). The reaction was performed in 100 mM sodium citrate buffer at pH 4.5 and gaseous cyanide was detected through a Feigl- Anger test paper (Krammer et al., 2007) by the development of blue spots.
- RNA was isolated from Da va Ilia teyermanii f o 11 o w i n g the protocols provided by the SpectrumTM Plant Total RNA Kit (Sigma Aldrich). Quality assessment to ensure RNA integrity was performed with an Agilent 2100 Bioanalyzer (Agilent Technologies) and agarose gel electrophoresis (1 % agarose gel, running conditions: 80V, 40 min). The transcriptome sequence was obtained via a commercial service from Microsynth, by De Novo Next generation sequencing of a normalized
- transcriptome library (Roche sequencing system). The resulting transcriptome data were deposited on an in-house server for analysis with a WEB based tool developed for enzyme identification from transcriptome data.
- a suitable activity screening procedure had to be developed: a sandwich assembly was used, where the polyacrylamide gel was in direct contact with a filter paper soaked with substrate (racemic mandelonitrile 4 pL/mL) and reaction buffer (sodium citrate, 100 mM, pH 4.0). On top of the gel, a Feigl-Anger test paper was placed, separated from the gel by a permeable nylon tissue (Krammer et al., 2007). BN-PAGE gels were subsequently stained by silver staining (Blum, et al., 1987). A sharp 20-kDa-band was observed in the elution fractions, which corresponded to the HNL activity signal (Fig. 1 ).
- One of the six targets displayed HNL activity, determined as described above, and was termed £ ⁇ 4HNL1.
- Neither the gene nor the protein sequence showed similarities higher than 31 % when subjected to a blastp search in NCBI (Basic Local Alignment Search Tool of NCBI, http://blast.ncbi.nlm.nih.gov/Blast.cgi, max target sequences: 100; expect threshold: 10; word size: 3; max matches in a query range: 0; matrix: BLOSUM62; gap costs: existence 1 1 , extension 1 ; computational adjustments; conditional compositional score matrix adjustment; database: non-redundant protein sequences (nr)).
- NCBI Basic Local Alignment Search Tool of NCBI, http://blast.ncbi.nlm.nih.gov/Blast.cgi, max target sequences: 100; expect threshold: 10; word size: 3; max matches in a query range: 0; matrix: BLOSUM62; gap costs: existence 1 1 , extension 1
- the genes coding for proteins with HNL activity from Da va Ilia teyermanii ⁇ not contain any known signal sequences like those of other HNLs from plant species. In other species, the HNLs are separated from the substrate for example by transport to the extra- or intercellular space. DtHHL isoenzymes appear to be intracellular in the natural host and also if the protein is expressed employing its natural unmodified protein sequence in P. pastoris, it remains intracellular. However, it is known in the art from many other examples that natural intracellular proteins can be expressed as secreted proteins by adding a secretion signal sequence to the N-terminal end of the protein sequence or by fusion to other carrier proteins.
- the amino acid sequence of 3 ⁇ 4HNL1 was employed as a template for the identification of similar sequences within the D. teyermanii tra n scri ptom e (Protein- nucleotide BLAST with standard parameters). Three nucleotide sequences were identified as putative Z3 ⁇ 4HNL1 isoenzymes. D. teyermanii g D N A wa s isolated with the PowerPlant ® Pro DNA Isolation Kit (MO BIO Laboratories, Inc.) according to the provided manual.
- Putative isoenzyme sequences were amplified by PGR with 100 ng of gDNA as template (98°C 30 sec initial denaturation, 98°C 7 sec denaturation, 58°C 15 sec annealing, 72°C 30 sec extension, 72°C final elongation). PGR products were isolated from a 1 % agarose gel after electrophoresis and sequenced (LGC Genomics). Identified sequences were confirmed or corrected according to the sequencing results.
- PGR products were cloned into the pJET1.2 vector with the CloneJETTM PGR Cloning Kit (Life Technologies) according to the provided manual.
- E. col HOP 10 F' competent cells were transformed with a standard method. Clones were selected on LB agar with ampicillin (100 mg/L), screened by colony PGR and confirmed by sequencing (LGC Genomics).
- E. co// ' GeneArt® Gene synthesis, Life Technologies.
- the gene was cloned into the pET-26(b)+ vector using standard techniques (restriction sites: Nddi HindW) and confirmed by sequencing (LGC Genomics).
- E. co// BL21 -Gold (DE3) competent cells were transformed with a standard method. Clones were selected on LB agar plates with kanamycin (50 mg/L). HNL expression was performed according to the following protocol: A single colony of E.
- a synthetic gene was ordered after codon optimization for expression in P. pastoris (GeneArt® Gene synthesis, Life Technologies). The gene was cloned into the pPpT4 S and pPpB1 shuttle vectors, under the control of the AOX1 promoter
- E. co//TOP10 F' competent cells were transformed with a standard method. Clones were selected on LB agar plates with Zeocin (25 mg/L). Plasmid DNA was isolated with the GeneJET Plasmid Miniprep Kit (Life Technologies) according to the provided protocol and the sequence of interest was confirmed by sequencing (LGC Genomics). The DNA preparation was linearized with SmA
- CBS7435MutS was performed as described by Lin-Cereghino et al. (2005). Clones were selected on YPD agar plates with Zeocin (100 or 600 mg/L for low and high copy number integration, respectively). Selected P. pas ton ' s CBS 7435 MutS clones were screened for the best Z3 ⁇ 4HNL1 expression in micro-scale cultivation (96-deep well plates).
- the expression was performed according to the following protocol: cells from single colony were transferred to 250 ⁇ _ in a buffered minimal dextrose medium BMD1 % (1 % glucose; (1.34 g/L yeast nitrogen base with ammonium sulfate and without amino acids, 200 mM potassium phosphate buffer at pH 6, 0.4 mg/L biotin, 2 g/L glucose)) at 28°C, 320 rpm and 80% humidity.
- BMD1 % (1 % glucose; (1.34 g/L yeast nitrogen base with ammonium sulfate and without amino acids, 200 mM potassium phosphate buffer at pH 6, 0.4 mg/L biotin, 2 g/L glucose)
- a first induction was performed with 250 pL BMM2 (0.5% final concentration of methanol; (1.34 g/L yeast nitrogen base with ammonium sulfate and without amino acids, 200 mM potassium phosphate buffer at pH 6, 0.4 mg/L biotin, 1 % v/v methanol)) after 60 h of cultivation. Further inductions were performed with 50 pL BMM10 (1.34 g/L yeast nitrogen base with ammonium sulfate and without amino acids, 200 mM potassium phosphate buffer at pH 6, 0.4 mg/L biotin, 5% v/v methanol (0.5% final concentration of methanol)) 10, 24 and 48 h after the first induction. The cultivation was stopped 72 h after the first induction.
- the cells were harvested by centrifugation, disrupted with Y-PERTM Plus Dialyzable yeast protein extraction reagent (Life Technologies) and screened for HNL activity.
- Small scale cultivation of the best P. pastor is CBS 7435 MutS clones expressing £ ⁇ 4HNL1 was performed in shake flasks. A single colony was inoculated in 180 mL (or 50 mL) BMD1 % for 60 h at 28°C, 120 rpm. Induction of protein expression was done by addition of 20 mL (or 5.5 mL) BMM10 and further addition of 2 mL (or 0.5 mL) methanol 10 h, 24 h and 48 h after the first induction. 72 h after the first induction, the cells were pelleted by centrifugation and disrupted as described in Example 7. Cell free lysate was assayed for HNL activity (as described in Example 10).
- DM L isoenzymes 1 -4 were cloned into the pEHISTEV vector (Restriction sites: NcdIHindW) and the constructs confirmed by sequencing (LGC Genomics).
- E. coli BL21 Star (DE3) competent cells were transformed with a standard method. Clones were selected on LB agar plates with kanamycin (50 mg/L). His-HNL expression was performed according to the protocol described above.
- E. coir Cell pellets were re-suspended in 25 ml_ of 50 mM potassium phosphate buffer, pH 6.0, and disrupted by sonication (80% duty cycle, 7 output, 6 min). Samples were kept on ice during cells disruption.
- P. pastoris Cell pellets were disrupted with Y-PER Plus Dialyzable Yeast Protein Extraction Reagent (Life Technologies).
- the protein amounts were quantified by use of the Bradford reagent (Roti ® - Quant - Carl Roth), or the PierceTM BCA Protein Assay Kit (Life Technologies).
- Protein purification was performed by affinity chromatography with a HisTrap FF 5 mL column (GE Healthchare) using an AKTA purifier system.
- E. coli lysate was prepared in 20 mM sodium phosphate, 0.5 M NaCI, 10 mM imidazole, pH 7.4 as described above. Clear lysate (filtered through a 0.45 ⁇ syringe filter) was loaded on the column, previously equilibrated with 20mM sodium phosphate, 0.5 M NaCI, 10 mM imidazole at pH 7.4 (start buffer). Unbound material was washed with 10 column volumes of start buffer.
- Superdex 200 10/300 GL column (catalog # 17-5175-01 , GE Healthcare; Bed volume: 24 ml_; Matrix: Spherical composite of cross-linked agarose and d extra n) at 4°C on an ⁇ Avant 25 (catalog # 28-9308-42, GE Healthcare) with a flow rate of 0.1 mL/min.
- the column was pre-equilibrated with 150 mM NaCI, 10 mM Tris-HCI at pH 8. The absorbance of the eluent was monitored at 280 nm (A280).
- a Gel-Filtration-Standard (catalog #151 -1901 , BioRad) was used in 1 : 10 dilution.
- the apparent molecular mass of native 7 HNL1 was estimated according to the elution profile of the protein as depicted in Fig. 9 to be approximately 40 kDa, indicating that the active enzyme forms a dimer. This is in contrast to the active form of the Phlebodium aureum HNL, whose native size was estimated to be 168 ⁇ 40kDa (Wajant et al., 1995).
- a spectrophotometric assay to detect the HNL reaction in cyanogenesis direction was performed on microtiter plate format.
- a final volume of 200 ⁇ _ contained 20 ⁇ _ of protein sample and 130 ⁇ _ of 50 mM sodium citrate phosphate buffer, pH 5.0.
- Blank reactions were carried out in which 50 mM sodium phosphate buffer pH 6.5 was used instead of protein sample.
- the reaction was initiated by the addition of 50 ⁇ _ of substrate solution (60 mM racemic mandelonitrile in 3 mM sodium citrate-phosphate buffer, pH 3.5).
- the reaction was measured spectrophotometrically following the formation of benzaldehyde at 280 nm for 10 minutes at 25°C in a Synergy Mx plate reader (BioTek). For calculations, the blank values were subtracted.
- the values given in the tables below refer to the average of three (technical triplicates) to nine (three biological triplicates in technical triplicate) single measurements and their respective standard deviations.
- citrate- phosphate buffer was used (pH 2.5 - 5.0).
- Example 11 Hydroxynitrile lyase assay for determination of cyanogenic activity with any cyanohydnn
- the reaction mix consisted of: 140 ⁇ _ of citrate phosphate buffer pH 5.0, 10 ⁇ _ of DMNL (0.001 -0.01 mg) and 10 ⁇ _ of cyanohydrin stock solution (300 mM cyanohydrin in 0.1 M citric acid). The reaction was performed at room temperature for 5-10 min.
- reaction was stopped by the addition of 10 ⁇ _ of mix I [100 mM AAchlorosuccinimide, succinimide 10-fold excess (w/w) in water] followed by incubation at room temperature and 1 ,000 rpm for 5 min.
- Racemic mandelonitrile was used as the positive control.
- Z7 HNL1 prefers the natural substrate mandelonitrile, but also aliphatic substrates such as acetone cyanohydrin are converted.
- HNL activity was measured as described above.
- R- Mandelonitrile was used as the substrate.
- 1 Unit (U) was defined as the amount of enzyme that produces 1 ⁇ of benzaldehyde per min under the assay conditions.
- Each reaction was carried out in biological duplicate and technical triplicate and the blank was subtracted.
- the specific activities were 562 U/mg for Z2/HNL1 , 746 U/mg for 0 HNL2, 607 U/mg for DMML3 and 861 U/mg for / /HNL4, respectively (with a confidence interval of 20%). This is significantly higher (4 to 6 times) than the specific activities of any other recombinant HNL (cleavage of the natural substrate
- enzyme solution (20 ⁇ _, 0.01 mg/mL) was added to 50 mM sodium citrate- phosphate buffer pH 2.5, 3.0, 3.5, 4.0, 4.5 5.0, 5.5, 6.0, 6.5 and 7.0 and 50 mM potassium chloride buffer pH 2.0 and 2.5.
- the reaction was started by addition of racemic mandelonitrile (15 mM final concentration).
- HNL activity was measured as described in Example 10.
- blank reactions were performed at each pH in which sodium phosphate buffer pH 6.5 was used instead of 9/HNL1 solution. Each reaction was carried out in biological triplicate and technical triplicate and the blank was subtracted.
- DtHNL isoenzymes were incubated in 50 mM phosphate citrate buffer (final concentration of 1 mg/mL) at the pH values as specified in Table 2 at 5-8°C.
- HNL activity at different temperature was measured spectrophotometrically in quartz cuvettes (1 mL). Therefore 100 ⁇ _ of enzyme solution (0.001 mg/mL final concentration) was added to 700 pL of 50 mM citrate phosphate buffer, pH 5.0. The reaction was started by the addition of 200 pL 60 mM racemic mandelonitrile (12 mM final concentration). In parallel, blank reactions were performed at each temperature in which sodium phosphate buffer pH 6.5 was used instead of £ ⁇ 4HNL solution. Enzymatic activity was measured from 10-50°C at 280 nm in a spectrophotometer (Cary Series Agilent Technologies). Each reaction was carried out in biological triplicate and technical triplicate and the blank was subtracted.
- Table 6 shows that the activity of 3 ⁇ 4HNL isoenzymes increases with temperature.
- the highest activity of Z3 ⁇ 4HNL1 and 7 HNL3 was observed at 40°C, of DMHL2 at 35°C, and of Z3 ⁇ 4HNL4 at 45°C.
- the chemical background reaction decomposition of racemic mandelonitrile to benzaldehyde and HCN
- the reactions in this system should preferably be carried out at low temperatures, e.g. below 10°C.
- Z2 HNL1 (1 mg/mL) was incubated in sodium phosphate buffer at pH 6.5 at different temperatures as specified in Table 7. At the time-points specified in Table 7, an enzyme aliquot was diluted to 0.01 mg/mL and HNL activity was measured at 25°C as described herein. Each reaction was carried out in biological duplicate (Z7 HNL1 ) or triplicate ( 3 ⁇ 4HNL2, 3, 4) and technical triplicate and the blank was subtracted. The same experiments were carried out for 3 ⁇ 4HNL2 (Table 8), 7 HNL3 (Table 9) and Z7 HNL4 (Table 10). [00149] Z2/HNL1 (Table 7) and DMHL2 (Table 8) are not stable at 40°C and loose activity already after 2 hours. Surprisingly, DMNL3 and Z2 HNL4 are more stable at 40°C. After 48 hours, DMNL3 and Z7 HNL4 show 65% and 50% of their original activity (Table 9 and Table 10).
- the target protein was tested for its ability to catalyze the cyanohydrin synthesis from benzaldehyde (or 2-CI-benzaldehyde, 3-Ph-propionaldehyde, cinnamaldehyde, 2- furylaldehyde, acetophenone) and HCN.
- the reactions were performed in a two-phase system with methyl tertiary-butyl ether in a total volume of 1.5 ml_.
- the HCN-solution ( ⁇ 2 M) was prepared as described before by Okrob et al. , 201 1. All reactions involving HCN were performed in a ventilated hood with gas trap and equipped with a calibrated HCN detector. A stock solution was prepared
- Table 12 summarizes the results of Z7 HNL1 catalyzed synthesis of cyanohydrins.
- Z2 HNL1 is more active than /3 ⁇ 4HNL in terms of mandelonitrile synthesis.
- 0.096 mg/mL of commercial /3 ⁇ 4HNL (Sigma) preparation is used in the same reaction setup, 96.4% of conversion with 98.1 % e.e. is obtained (0.096 mg/mL /3 ⁇ 4HNL and 0.056 mg 7 HNL1 correspond to 5 U/mL of enzyme determined in cyanogenesis direction).
- Enzyme concentration is based on the buffer phase of which 500 ⁇ _ were emulsified with 1 ,000 ⁇ _ of MTBE/HCN. Conversion is defined as the disappearance of substrate (corrected by the internal standard). Non-enzymatic transformation is defined as the consumption of the substrate detected in the blank reaction that did not contain enzyme (racemic cyanohydrin is produced). It needs to be noted that the non- enzymatic background reaction cannot be considered as a blank because it does not occur to the same extent if enzyme is present that consumes the substrates. However, the value gives an explanation to the low e.e. of the enzyme catalyzed product. Hence, the enzyme may still be perfectly stereoselective for the particular substrate but too slow at the respective conditions.
- the £ ⁇ 4HNL1 is superior to 3 ⁇ 4HNL (Sigma Aldrich) in synthesis direction (benzaldehyde to mandelonitrile).
- the known HNLs typically prefer either aromatic substrates (e.g. /3 ⁇ 4HNL) or aliphatic substrates (e.g. LLMUL).
- DM L accepts both types of substrates: it is highly active for the cleavage of mandelonitrile and it also cleaves the aliphatic cyanohydrins
- a single colony of P. pastoris CBS 7435 MutS £>/HNL1 opt B1.G4 was incubated overnight in BMGY medium (10 g/L yeast extract, 20 g/L peptone, 700 ml_ dH2O; 100 mL/L 10x yeast nitrogen base, 100 mL/L 200 mM potassium phosphate buffer pH 6.0, 100 mL/L 10% w/v glycerol, 2 mL/L 500x biotin) at 28°C in shake flask. Cultivation was performed in 5 L BIOSTAT® CT fermenter. First, a glycerol batch phase (glycerol 4%) was performed over-night.
- BMGY medium 10 g/L yeast extract, 20 g/L peptone, 700 ml_ dH2O; 100 mL/L 10x yeast nitrogen base, 100 mL/L 200 mM potassium phosphate buffer pH 6.0, 100 mL/L 10% w/v gly
- the supernatant was discarded and the cell pellet was dried in a 60°C incubator for 3 days. After drying, the tubes were weighed again and the cell dry weight was calculated. At the termination of the fermentation, the entire cell culture was harvested (ca. 1.5 kg of total wet cell weight).
- pEHisTEV £>/HNL1 was incubated overnight in LB medium with 50 mg/L kanamycin at 37°C. After 16 h, the cell pellet was collected by centrifugation and washed two times with minimal media (M9 salts, 2% glucose, 2 mM MgSO 4 , 0.01 mg/mL thiamine, 0.01 mg/mL FeCb). The main-culture (400 mL minimal media with kanamycin (50 mg/L) and selenomethionine (SeMet (50mg/L)) was inoculated with an aliquot of the pre-culture to a final ODeoo of 0.25.
- the culture was incubated at 37°C and 150 rpm until the ODeoo reached 0.5. Then, 0.5 mM IPTG was added and the culture was incubated at 25°C and 150 rpm for 38 h. Cell free extract was prepared as described above.
- SeMet/3 ⁇ 4HNL1 was purified by affinity chromatography (NiSepharose 6 Fast Flow resin). Elution was performed with 20 mM sodium phosphate, 0.5 M NaCI, 300 mM imidazole, pH 7.4. Fractions containing SeMetZ? HNL1 were combined and desalted (PD10 Desalting columns, GE Healthcare LifeScience). Protein was stored at -20°C in 50 mM potassium phosphate buffer pH 6.0 prior to crystallization.
- SeMetZ?/HNL1 (3 mg/mL in 10mM Tris-HCI pH8) was used to grow crystals in 0.2 M sodium thiocyanate, 20% (w/v) polyethylene glycol 3350. A 1 :1 ratio of protein and screening solutions was used. Typically, crystals appeared after 2-3 days. After supplementation of 30% glycerol, the crystals were flash-cooled in liquid-nitrogen.
- the SeMet Z2 HNL1 datasets were collected at 100k from single crystals on the synchrotron beamline ID29 (EMBL, Grenoble, France). The data sets were processed and scaled using the XDS program package (Kabsch, 2010).
- the AutoSol Program (McCoy et al., 2007, Terwilliger et al., 2009) and the AutoBuild Program (Terwilliger et al., 2008) from the PHENIX software suit (Adams et al., 2010) were used to define the selenium heavy metal-atom sites using a SeMet Z7/HNL1 SAD data set, as well as to build an initial model. The resulting model was again completed manually in Coot and refined with PHENIX.
- native crystals were also grown by mixing 1 pL protein sample (4 mg/mL in 10mM Tris-HCL pH 8) with 0.5 pL reservoir solution (0.1 M 2-(4-(2- hydroxyethyl)-1 -piperazinyl) ethanesulfonic acid pH 7.5 and 10% (w/v) polyethylene glycol; JSCG condition B4) using the same method as previously described. Typically, crystals appeared after 2-3 days.
- the crystal structure revealed a dimer of 7 anti-parallel beta-strands, 2 alpha- helices and a large cavity in each monomer.
- the residues important for substrate binding and catalysis were identified from the structure containing the substrate benzaldehyde and the structure containing the product mandelonitrile.
- the substrate showed interactions with the residues arginine 94, aspartic acid 1 10 and serine 1 12, tyrosine 126 and tyrosine 142 (Fig. 10).
- This numbering refers to the sequence that starts with the HIS-tag and the TEV cleavage site, including some spacer residues.
- the active site residues are arginine R69, aspartate D85, serine S87 and three tyrosines Y101 , Y1 17 and Y161 in case the numbering refers to the native £ ⁇ 4HNL1 sequence.
- a new hydroxynitrile lyase from fern from the plant to the sequence, ACIB
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to novel hydroxynitrile lyases (HNL) which have a molecular weight of about 20 kDa and a protein fold of the allergen BV1 -like type and to methods of producing them. The invention also relates to the use hydroxynitrile lyases for producing enantiopure cyanohydrin compounds.
Description
New hydroxynitrile iyases
Field of the Invention
[0001] The present invention relates to novel hydroxynitrile Iyases (HNL), which are capable to catalyze the asymmetric cyanohydrin reaction and wherein the HNLs have a tertiary protein structure comprising a dimer of 7 anti-parallel beta-strands, 2 alpha- helices and a large cavity in each monomer (Allergen BV1 -like type fold). The invention further relates to the use of hydroxynitrile Iyases for producing enantiopure cyanohydrin compounds.
Background Art
[0002] Hydroxynitrile Iyases (HNLs) take part in the cyanogenic pathway, a defense mechanism widespread in the plant kingdom. More importantly, they are valuable tools in biocatalysis, due to their ability to synthesize chiral a-cyanohydrins by a C-C bond forming condensation reaction. A chiral center is formed, the carbon chain is prolonged by one carbon atom and an additional versatile functional group - the nitrile - is introduced to the molecule. Enantiopure cyanohydrins are versatile building blocks and intermediates that serve as starting material for many enzymatic and chemical follow-up reactions, which find application in pharmaceutical, agrochemical and cosmetic industries (e.g. Gruber-Khadjawi, M., Fechter, M.H., Griengl, 2012; Lanfranchi et al., 2013; Winkler, Glieder, & Steiner, 2012). To meet the requirements for industrial application, the enzymes need to fulfill several criteria: (i) availability of sufficient quantities of proteins with constant quality and batch-to-batch reproducibility at low cost, (ii) broad substrate range, (iii) high stability under acidic pH and high solvent stability and (iv) activity at low temperatures because the unselective chemical background reaction is significantly suppressed at low pH (< 4.5), low temperature and in the absence of water.
[0003] A number of (R)- and (S)- selective HNLs have been identified so far (see Tables in Dadashipour & Asano, 201 1 ; Winkler et al., 2012). Currently, the (/^-selective hydroxynitrile Iyases from P run us amygdalus ( aHNL), P run us mume ( Pm N L ) ,
Eriobotrya japonica £/HNL, Arabidobsis thaliana (-4 /HNL), Linum usitatissimum (LuHNL) and the bacterial GMNL from Granulicella tundricola can be heterologously expressed. Whereas ,4 /HNL, LuHNL and (S/HNL can be more or less successfully expressed in E. coli, the glycosylated and disulfide bridge containing enzymes from Rosaceae, e.g. Pa NL, PmHUL, and £/HNL, can only be expressed
with reasonable yield in Pichia pas ton's as soluble and active protein. However, the main disadvantage of -4/HNL is its low stability at pH below 5.4. LLMHL displays a very narrow substrate spectrum for only aliphatic aldehydes and methylketones as {R)- selective enzyme. Compared to other HNLs, QHNL is significantly less active
(Hussain et al., 2012). Thus, there is still a need for highly active and stable HNLs, which are easily expressed as recombinant proteins.
[0004] One member of the (/^-selective HNL group is the hydroxynitrile lyase from Phlebodium aureum (PhaHNL), isolated by Wajant et al., 1995. The protein was purified, and a first biochemical characterization was performed. PhaWUL was described as a multimer of 20 kDa subunits. It was described as a new enzyme, not related to other known HNLs. Its specific activity was determined to be exceptionally high (19,000 U/mg), which revealed a great potential of this enzyme as biocatalyst (Wajant et al., 1995). However, the protein sequence was never determined and no further studies were reported.
[0005] Inspired by the results of Wajant et al., ferns were the first target of choice for the discovery of new HNL sequences. Since the HNLs known to date are a perfect example of convergent evolution belonging to four different structural folds to date and sharing no sequence identity between the different subgroups, basic sequence comparison tools (such as BLAST) are not a suitable tool to identify distinct and unique HNL enzymes as the algorithms are based on the homology and similarity between sequences.
[0006] Therefore, a unique strategy was developed to visualize HNL enzymes
(Lanfranchi et al., 2013, Fig. 1 ). By combining this strategy with transcriptome and mass spectrometry (MS) data, a new protein family with HNL activity was identified. Summary of invention
[0007] It is the objective of the present invention to provide novel HNL enzymes which are capable to catalyze the asymmetric cyanohydrin reaction. It is a further objective of the present invention to provide novel HNL enzymes comprising the following properties:
action: catalyzing stereoselective asymmetric cyanohydrin formation;
molecular weight: 20±5 kDa when measured by SDS-polyacrylamide electrophoresis; and
tertiary protein structure: dimer of 7 anti-parallel beta-strands, 2 alpha-helices and a large cavity in each monomer (Allergen BV1 -like type fold).
[0008] It is a further objective of the present invention to provide a method for the recombinant production of the novel HNL enzyme in prokaryotic and eukaryotic expression systems.
[0009] It is a further objective of the present invention to provide a method for producing asymmetric cyanohydrin compounds utilizing the novel HNL enzyme. The method comprises the steps of providing an aldehyde or ketone compound and converting the compound to the corresponding asymmetric cyanohydrin compound in the presence of a cyanide donor and an HNL enzyme. The invention relates in particular to a selective Z7 HNL, which can stereoselectively catalyze the asymmetric cyanohydrin addition.
Brief description of drawings
[0010] Fig 1 : Silver-stained BN-PA gel and related filter of an in-gel HNL activity assay. Extracted and QFF-purified D. teyermanii p rote i n s : Raw protein extract (Extr), Flow through (FT), Active fractions (A9-B2). Substrate: racemic mandelonitrile - 100 mM citrate buffer, pH 4.5.
[001 1] Fig. 2: SDS-PA gel of £>/HNL1 expression in E. co//(pMS and pET system) and P. pastoris). S: soluble fraction, P: insoluble fraction; T4.F1 , T4.B7 and B1.G4: three different P. pastoris clones expressing /HNL1. Marker: Page Ruler, pre-stained protein ladder (Thermo Scientific). The arrow indicates the protein of interest.
[0012] Fig. 3: Protein sequence of Z7 HNL1
[0013] Fig. 4: Protein sequence of DMHL2
[0014] Fig. 5: Protein sequence of Z7 HNL3
[0015] Fig. 6: Protein sequence of Z¾HNL4
[0016] Fig. 7: Primers for cloning £>/HNL1 -4 in the pEHISTEV vector.
[0017] Fig. 8: Size exclusion chromatogram. Solid line: Gel Filtration Standard. X-axis: elution volume, y-axis UV absorption at 280 nm. Peak eluting at 7.7 ml: thyroglobulin (bovine) 670 kDa; at 12.77 ml: γ-globulin (bovine) 158 kDa; at 14.38 ml: ovalbumin (chicken) 44 kDa; myoglobin (horse) 17 kDa; at 20.23 ml: vitamin B12 1.35 kDa.
Dotted line: Peak eluting at 15.36 ml: DMM
[0018] Fig. 9a: Ribbon diagram of birch pollen allergen BET V 1 L (PDB Code 1 FM4); Fig. 9b: Ribbon diagram of £¾HNL1 soaked with benzaldehyde.
[0019] Fig. 10: Representation of the active site of ¾HNL1 and its bound substrate benzaldehyde. Residue numbering refers to the sequence including HIS-tag, TEV cleavage site and spacer residues.
Description of embodiments
[0020] In a first aspect, the invention relates to a protein with hydroxynitrile lyase (HNL) activity comprising the following properties:
action: catalyzing stereoselective cyanohydrin formation;
tertiary protein structure: dimer of 7 anti-parallel beta-strands, 2 alpha-helices and a large cavity in each monomer (Allergen BV1 -like type fold); and
molecular weight: 20±5 kDa when measured by SDS-polyacrylamide electrophoresis [0021] The protein fold of the protein is related to the major Birch pollen allergen Bet v 1. The protein fold is characterized by 7 anti-parallel beta-strands, 2 alpha-helices and a large cavity. The tertiary structure of the allergen BV1 can be retrieved from the protein data bank server under the pdb code 1 FM4 and is depicted in Fig. 9 (top ribbon diagram)
[0022] In another aspect, the amino acid sequence shows similarity to the SRPBCC superfamily of proteins. SRPBCC means
START/RHO alpha C/PITP/Bet v1/CoxG/CalC (SRPBCC) ligand-binding domain. SRPBCC domains have a deep hydrophobic ligand-binding pocket and they bind diverse ligands. Included in this superfamily are the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domains of mammalian STARD1 - STARD15, and the C-terminal catalytic domains of the alpha oxygenase subunit of Rieske-type non-heme iron aromatic ring-hydroxylating oxygenases (RHOs alpha C), as well as the SRPBCC domains of phosphatidylinositol transfer proteins (PITPs), Bet v 1 (the major pollen allergen of white birch, Betula verrucosa), CoxG, CalC, and related proteins. Other members of this superfamily include PYR/PYL/RCAR plant proteins, the aromatase/cyclase (ARO/CYC) domains of proteins such as
Streptomyces glaucescens tetracenomycin, and the SRPBCC domains of
Streptococcus mutans Smu.440 and related proteins.
[0023] A further aspect of the invention is an isolated polypeptide, wherein said polypeptide is derived from a plant, in particular from Polypodiades plants, in particular from the Davalliaceae family, and has at least activity of a hydroxynitrile lyase.
[0024] Hydroxynitnle lyase activity could be determined as described in Example 1 1.
The protein has a hydroxynitnle lyase activity of at least 10 U/mg, preferably 100 U/mg, preferably 300 U/mg, preferably of at least 500 U/mg enzyme.
[0025] A further aspect of the invention is the isolated polypeptide as described herein, wherein the plants are of genus Davallia, in particular of species Da va Ilia teyermanii.
[0026] A further aspect of the invention is the HNL as described herein, wherein said HNL is a recombinant HNL.
[0027] A further aspect of the invention is the recombinant hydroxynitrile lyase (HNL) as described above, comprising SEQ ID NO: 1 or a variant amino acid sequence having at least 35% sequence identity with SEQ ID NO:1.
[0028] A further aspect of the invention is the recombinant hydroxynitrile lyase (HNL) as described above, comprising SEQ ID NO: 1 or a variant amino acid sequence having at least 50% sequence identity with SEQ ID NO: 1.
[0029] A further aspect of the invention is the recombinant HNL as described above, wherein said variant amino acid sequence having at least 75%, preferably at least 90 % identity with SEQ ID NO: 1.
[0030] In a further aspect, the invention relates to the recombinant HNL as described above, wherein the amino acid sequence of the HNL is at least 35%, at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, or at least 98% identical to SEQ ID NO: 1.
[0031] A further aspect of the invention is the recombinant HNL as described above having the SEQ ID NO: 1 (0/HNL1 ), SEQ ID NO:2 (£¾HNL2), SEQ ID NO:3 (£>/HNL3), or SEQ ID NO:4 (£¾HNL4).
[0032] Further, the invention encompasses proteins, in particular fusion proteins or truncations of the protein or parts of the protein with HNL activity, which comprise at least one polypeptide of the invention.
[0033] A further aspect of the invention is an isolated polynucleic acid molecule encoding a recombinant HNL as described above.
[0034] The present invention also relates to nucleic acids coding for the HNL as described above, which are complementary to such nucleic acid sequences which hybridize with coding nucleic acids under stringent conditions.
[0035] A further aspect of the invention is a method to obtain further members of the BetV1 family with HNL activity. The method comprises the use of the information from SEQ ID NO: 1 , 2, 3 or 4 or the tertiary structure of these proteins as query to search for related sequences in sequence databases, structure databases or sequences obtained by DNA sequencing projects.
[0036] The information from SEQ ID NO: 1 , 2, 3 and 4 can be used to produce primers for the identification and cloning of directly homologous forms using PGR. Moreover, because of the sequence information, probes to investigate further naturally occurring functional variants of the / /HNL genes and thus the corresponding coding enzyme variants can be used. Starting from SEQ ID NO: 1 or from allelic or naturally occurring functional variants thereof, e.g. via PGR using a deficient DNA polymerase, a bank of artificially produced functional enzyme variants can be produced. Similarly, standard methods can be used to introduce individual point mutations into the DNA sequence which lead to amino acid exchanges; this means that the protein's properties such as substrate specificity can be changed, or the protein sequence can be changed without changing the fold and activity of the protein.
[0037] The sequences coding for the protein can also be used in form of synthetic DNA constructs which are optimized with respect to their codons, DNA/RNA stability, facilitate translation or in order not to hybridize with sequences of SEQ ID NO: 1 , 2, 3 or 4.
[0038] The coding DNA sequences can be cloned into routine vectors and, after transforming host cells with said vectors, can be expressed in cell culture. Examples of suitable expression vectors are pET-26b(+) and pEHISTEV for E. coli, and also expression vectors of other prokaryotic single cell organisms can be used. Examples of suitable vectors for P. pastor is ar the pPpT4 S and pPpB1 shuttle vectors. The gene may be integrated into the genome of the host cell. Also vectors of other eukaryotic single cell organisms can be used.
[0039] The expression vectors of the invention may contain further functional sequence regions, such as a replication start point, operators, termination signals, coding sequences for antibiotic resistance, tags which facilitate purification (for
example a His-tag or a Strep-tag) or other peptide sequences which are produced by the fusion proteins.
[0040] A shuttle vector may contain functional sequence regions as described above and further homologous sequences for correct integration into the genomes.
[0041] A further aspect of the invention is a vector comprising an isolated DNA molecule as described above.
[0042] The vector comprises all regulatory elements necessary for efficient transfection as well as for efficient expression of proteins. Such vectors are well known in the art and any suitable vector can be selected for this purpose.
[0043] A further aspect of the present invention relates to a recombinant host cell which is transfected with a vector as described above. Preferably, the host cell is a non-human cell, in particular an E. col i O P. pastoris ce\\. Transfection of cells and cultivation of recombinant cells can be performed according to well-known methods in the art. Such a recombinant cell as well as any therefrom descendant cell comprises the vector. Thereby a cell line is provided which expresses the inventive HNL protein either continuously or upon activation depending on the vector.
[0044] With the vectors as described herein, host cells can be transformed using the usual methods, such as for example the heat shock method or electroporation.
[0045] A further aspect of the present invention is constituted by expression systems which comprise host cells or host cell cultures which are transformed with the vector systems described above. Preferred hosts are single-cell prokaryotic organisms, in particular E coli. In case of expression of eukaryotic genes, it may be advantageous to use eukaryotic expression systems in order, for example, to introduce post- translational modifications or to facilitate folding of enzymes from eukaryotes.
Particularly suitable eukaryotic host cells are yeast cells, in particular P. pastoris cells.
[0046] Thus, a further aspect of the invention is a recombinant non-human cell obtained by introducing a vector as described herein.
[0047] The expression systems can be cultivated using standard protocols which are known to the skilled person. After expression of a protein with HNL activity of the invention, it is purified, for example by centrifuging and/or using chromatography. In order to carry out catalytic reactions, the purified enzymes and also the raw extracts or centrifugation residues or fractions can be used directly.
[0048] A further aspect of the invention is a culture obtained by culturing the recombinant cell as described herein.
[0049] A further aspect of the invention is a recombinant HNL recovered from the culture as described herein.
[0050] In a further aspect, the invention relates to an HNL recovered from the culture as described herein. Specifically, the protein can be isolated by disrupting the cells and recovering the protein from the supernatant. Alternatively, the whole culture broth including cells and supernatant (culture media) can be used as active catalyst preparation without any further isolation or purification or the culture supernatant can be used as active catalyst.
[0051] In a further aspect, the invention relates to a method for producing an HNL, comprising recovering the HNL from a culture as described herein. Said isolation or purification from the cell culture can be performed by methods known in the art.
Specifically, lyophilization, precipitation, affinity tagging or co-precipitation, affinity chromatography, anion-exchange chromatography and size exclusion chromatography can be used to isolate said protein.
[0052] A further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein an aldehyde or ketone compound is converted to the corresponding cyanohydrin compound in the presence of a cyanide donor and an HNL as described herein.
[0053] A further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein an aldehyde or ketone compound is converted to the corresponding cyanohydrin compound in the presence of a cyanide donor and a recombinant HNL as described herein.
[0054] A further aspect of the invention is the method for producing enantiopure cyanohydrin compounds, wherein a compound of general formula I is reacted with a compound of general formula II in the presence of an HNL to yield an asymmetric cyanohydrin compound III
(I) (III)
wherein
R1 and R2 are independently from one another H, Ci-2oalkyl, C2-2oalkenyl, C2-2oalkynyl, Ca-iocycloalkyl, C4-2ocycloalkylalkyl, Ce-uaryl, C7-2oarylalkyl, 3-14 membered heterocycioaikyi, 4-20 membered heterocycloalkylalkyl, 5-20 membered heteroaryl or 6-20 membered heteroarylalkyl, optionally substituted by one or more Ra; R3 is H, an alkali metal, C(CH3)2OH , or Ci-2oalkyl; and
each Ra is independently H, halogen, -CF3, -ORb, -NRbRb, -(CH2)nCOORb,
-(CH2)nC(=O)R , -(CH2)nCONR R , Ci-2oalkyl, C2-2oalkenyl,or C2-2oalkynyl; and each Rb is independently H or optionally substituted Ci-2oalkyl, C2-2oalkenyl, or
C2-2oalkynyl; and
n is 0, 1 , 2 or 3.
[0055] A further aspect of the invention is the method for producing enantiopure cyanohydrin compounds, wherein a recombinant HNL is used.
[0056] An alkyl group, if not stated otherwise, denotes a linear or branched
Ci-2oalkyl, preferably a linear or branched chain of one to twenty carbon atoms.
[0057] An alkenyl group, if not stated otherwise, denotes a partially unsaturated linear or branched C2-2oalkenyl, preferably a linear or branched chain of two to twenty carbon atoms that contains at least one double bond.
[0058] An alkynyl group, if not stated otherwise, denotes a partially unsaturated linear or branched C2-2oalkynyl, preferably a linear or branched chain of two to twenty carbon atoms that contains at least one triple bond.
[0059] A cycloalkyi group denotes a monocyclic non-aromatic hydrocarbon ring containing three to ten carbon atoms, preferably four to six carbon atoms, or a bicyclic non-aromatic hydrocarbon ring system containing seven to ten carbon atoms, preferably eight to ten carbon atoms, wherein the cycloalkyi group optionally comprises one or more double bonds.
[0060] A heterocycioaikyi group denotes a monocyclic non-aromatic hydrocarbon ring containing three to fourteen carbon atoms, preferably four to eight carbon atoms, or a bicyclic non-aromatic hydrocarbon ring system containing seven to fourteen carbon atoms, preferably eight to ten carbon atoms, wherein in the heterocycioaikyi group one or more of the carbon atoms of the hydrocarbon ring or ring system is replaced by a group selected from the group comprising -N-, -O-,
-S-, -S(O)-, -3(0)2-, -Si- and -P-; wherein the heterocycioaikyi group optionally comprises one or more double or triple bonds.
[0061] An aryl group preferably denotes a mono- or bicyclic, preferably monocyclic aromatic hydrocarbon group having six to fourteen carbon atoms; the aryl group is preferably phenyl.
[0062] A heteroaryl group denotes an aromatic 5- or a 6- membered monocyclic hydrocarbon group wherein at least one of the carbon atoms is replaced by a heteroatom like O, N, and/or S, and wherein the aromatic monocyclic 5- or 6- membered cyclic hydrocarbon group is optionally fused to a further monocyclic 5- to 7-membered, preferably 5- or 6-membered, aromatic or nonaromatic hydrocarbon ring, wherein in the further monocyclic aromatic or nonaromatic hydrocarbon ring one or more, preferably one or two carbon atoms may be replaced by a heteroatom like O, N, and/or S.
[0063] A halogen group is chlorine, bromine, fluorine or iodine.
[0064] The method according to the invention can be carried out in a mono- or biphasic system or in an emulsion.
[0065] The monophasic reaction solution comprises an aqueous or an organic solvent.
[0066] Appropriate aqueous solutions are for example water, a hydroxynitrile lyase containing solution, or a buffer solution. Examples for buffer solutions are phosphate buffer, citrate buffer, acetate buffer, borate buffer, MES, HEPES, Tris buffer, or mixtures thereof. The pH of these solutions can be between pH 1 and 8, preferably from 2 to 5.
[0067] Appropriate organic solutions can be slightly water-miscible or water immiscible aliphatic or aromatic hydrocarbons, which are optionally halogenated, alcohols, ethers or esters or mixture thereof or the substrate itself. Suitable organic solutions are for example, but not limited to ethyl acetate, butyl acetate, methyl tert- butyl ether, diisopropyl ether, dibutyl ether, carbon tetrachloride, benzene, toluene, cyclohexane, hexadecane, hexane, heptane, chloroform, xylene, pentanol, hexanol, octanol and dodecanol, benzaldehyde, or mixtures thereof. Also applicable are neoteric solvents, which refers to ionic liquids and supercritical fluids that have remarkable new properties. These neoteric solvents are characterized by physical and chemical properties that can be finely tuned for a range of applications by varying the chemical constituents in the case of ionic liquids and by varying physical parameters in the case of supercritical fluids.
[0068] The advantages of conducting bioconversions in aqueous - organic solvent two-liquid phase systems are well known in the art. The biphasic system consists of two phases mutually not miscible, e.g. an aqueous and an organic phase.
[0069] A further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the reaction is carried out in a mono- or biphasic system or in an emulsion.
[0070] Thus, in a further aspect the invention relates to a method for producing enantiopure cyanohydrin compounds, wherein the biphasic system comprises aqueous and organic solution as described herein.
[0071 ] The conversion reaction moreover takes place at temperatures of from -10°C to +50°C, preferably at 0°C to 25°C, more preferred at 0°C to 10°C.
[0072] The choice of applicable electrophiles ranges from aromatic to heteroaromatic and aliphatic aldehydes or ketones. Depending on the substrate and reaction systems, conversions up to 98.5% or enantiomeric excess up to 99.5% could be obtained by the inventive method.
[0073] A further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the recombinant HNL is a purified enzyme or is contained in cleared lysate.
[0074] A further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the cyanohydrin compound is obtained with at least 50%, preferably with at least 60%, more preferred with at least 75% enantiomeric excess (e.e.).
[0075] A further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the asymmetric cyanohydrin compound is obtained with at least 50%, preferably with at least 75%, more preferred with at least 90% enantiomeric excess (e.e.).
[0076] A further aspect of the invention is a method for producing enantiopure cyanohydrin compounds, wherein the cyanohydrin compound is obtained with a conversion rate of at least 15%, preferably with at least 20%, more preferred with at least 50%.
[0077] In another aspect, the invention relates to a method for producing enantiopure cyanohydrin compounds, wherein the used Z7 HNL is a purified enzyme, or contained in whole cells or in the fermentation broth or in a cleared lysate or in immobilized form,
for example on a support such as Celite®, Avicel, etc. or as cross-linked enzyme crystals or as cross-linked enzyme aggregates among other immobilization methods.
[0078] In a further aspect of the invention the active site of the HNL comprises an arginine, an aspartate, a serine and three tyrosine residues.
[0079] In another embodiment, amino acid residues corresponding to R69, D85, S87, Y101 , Y1 17 and Y161 of SEQ ID NO: 1 - 4 contribute to the HNL activity of the novel HNL enzymes.
[0080] The terms "alignment, "conserved substitution", "identity", "homology" and "similarity" are used as defined herein.
[0081] Alignment refers to the process or result of matching up the nucleotide or amino acid residues of two or more biological sequences to achieve maximal levels of identity and, in the case of amino acid sequences, conservation, for the purpose of assessing the degree of similarity and the possibility of homology.
[0082] Conserved substitution refers to a change at a specific position of an amino acid or, less commonly, DNA sequence that preserves the physico-chemical properties of the original residue or achieves a positive score in the governing scoring matrix.
[0083] Identity or sequence identity refers to the extent to which two (nucleotide or amino acid) sequences have the same residues at the same positions in an alignment, often expressed as a percentage identity of the amino acid residues, based on the total number of amino acid residues of the amino acid sequence. If the two sequences, which are being compared, have a different number of amino acid residues, the percentage is based on the number of the shorter amino acid sequence.
[0084] Homology refers to similarity attributed to descent from a common ancestor. Homologous biological components (genes, proteins, structures) are called homologs. Often the term homolog is used interchangeably with the term variant, meaning rather a certain degree of identity between two sequences without implying a common ancestor. Therefore within the meaning of the invention, the term variant of a sequence means a sequence with a certain degree of identity to the sequence, if not otherwise stated.
[0085] Similarity refers to the extent to which nucleotide or protein sequences are related. Similarity between two sequences can be expressed as percent sequence identity and/or percent positive substitutions.
[0086] Unless otherwise stated, sequence identity, similarity and positives values
provided herein refer to the value obtained using the BLAST suite of programs using default parameters, a description is found in Altschul et al., J. Mol. Biol., 1990, 215,
403-410 and in "The NCBI Handbook" (Internet), editors J. McEntyre and J. Ostell,
Bethesda (MD): National Center for Biotechnology Information (US); 2002-, and using the program version BLAST 2.2.27+, if not otherwise stated.
[0087] The term "sequence" can mean a nucleotide sequence, which constitutes a polynucleotide, or an amino acid sequence, which constitutes a peptide.
[0088] "Heterologous nucleic acid sequence" or "nucleic acid sequence heterologous to a host" means a nucleic acid sequence which encodes e. g. an expression product such as a polypeptide that is foreign to the host ("heterologous expression" or
"heterologous product") e.g. a nucleic acid sequence originating from a donor different from the host or a chemically synthesized nucleic acid sequence which encodes e. g. an expression product such as a polypeptide that is foreign to the host. In a similar way, "heterologous protein" means a protein that is foreign to the host.
[0089] "Foreign to the host" can mean, that the nucleic acid or protein originates from a foreign species, or, if from the same species, is substantially modified from its native form in composition or genomic locus by deliberate human intervention.
[0090] A heterologous nucleic acid sequence as referred herein encompasses also nucleic acids, which are codon optimized for the host according to the codon usage of the host.
[0091] The terms "host", "host cell" and "recombinant host cell" are used
interchangeably herein to indicate a cell, which contains or shall contain a vector or an isolated nucleic acid sequence, and supports the replication and/or expression of the vector or of the nucleic acid. It is understood that such terms refer not only to the particular subject cell but also to the progeny or potential progeny of such a cell.
Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
[0092] Host cells may be prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, plant, amphibian, or mammalian cells.
[0093] A "vector", "vector expressible in a host" or "expression vector" are used interchangeably and is a polynucleic acid construct that is used in transfection or transformation of a host cell and into which a polynucleotide can be inserted, it is
generated recombinantly or synthetically, with a series of specified polynucleic acid elements that permit transcription of a particular nucleic acid sequence in a host cell. Typically, this vector includes a transcriptional unit comprising a particular nucleic acid sequence to be transcribed operably linked to a promoter. Vectors are often replicons. A vector expressible in a host can be e. g. an autonomously or self-replicating plasmid, a cosmid, a phage, a virus or a retrovirus.
[0094] The term "introduced" in the context of inserting a nucleic acid into a cell, means "transfection" or "transformation" or "transduction" and includes reference to the incorporation of a nucleic acid into a eukaryotic or prokaryotic cell where the nucleic acid may be incorporated into the genome of the cell (e.g., chromosome, plasmid, plastid or mitochondrial DNA), converted into an autonomous replicon, or transiently expressed (e.g., transfected mRNA).
[0095] "Transduction" is usually used for the process of injection of foreign DNA by a virus into the host bacterium.
[0096] "Transfection" is the process of deliberately introducing nucleic acids into cells. The term is often used for the introduction of foreign DNA into eukaryotic cells, notably for non-viral methods.
[0097] In microbiology, genetics, cell biology and molecular biology, "competence" is the ability of a cell to take up extracellular DNA from its environment.
[0098] Within the meaning of the invention and if not otherwise stated, the terms "transformation", "transformed" or "introducing a nucleic acid into a host cell" are used interchangeably and denote any process wherein an extracellular nucleic acid like a vector, with or without accompanying material, enters a host cell. The term "cell transformed" or "transformed cell" means the cell or its progeny into which the extracellular nucleic acid has been introduced and thus harbors the extracellular nucleic acid. The nucleic acid might be introduced into the cell so that the nucleic acid is replicable either as a chromosomal integrant or as an extra chromosomal element. Transformation of appropriate host cells with e. g. an expression vector can be accomplished by well-known methods such as microinjection, electroporation, particle bombardment or by chemical methods such as Calcium phosphate-mediated transformation, described e. g. in Maniatis et al. , 1982, Molecular Cloning, A laboratory Manual, Cold Spring Harbor Laboratory, in Ausubel et al. , 1994, Current protocols in molecular biology, John Wiley and Sons, or in "Molecular Cloning: A laboratory
Manual, Third edition (3 volume set)", Cold Spring Harbor Laboratory Press, 2001.
[0099] "Nucleic acid", "nucleic acid sequence" or "polynucleotide" are used
interchangeably and together with "isolated and purified nucleic acid or nucleic acid sequence or polynucleotide", as referred in the present invention, might be DNA, RNA, or DNA/RNA hybrid. Unless otherwise indicated, the terms include reference to the specified sequence as well as the complementary sequence thereof. In case the nucleic acid sequence is located on a vector it is usually DNA. DNA which is referred to herein can be any polydeoxynuclotide sequence, including, e.g. double-stranded DNA, single-stranded DNA, double-stranded DNA wherein one or both strands are composed of two or more fragments, double-stranded DNA wherein one or both strands have an uninterrupted phosphodiester backbone, DNA containing one or more single-stranded portion(s) and one or more double-stranded portion(s), double- stranded DNA wherein the DNA strands are fully complementary, double-stranded DNA wherein the DNA strands are only partially complementary, circular DNA, covalently- closed DNA, linear DNA, covalently cross-linked DNA, cDNA, chemically synthesized DNA, semi-synthetic DNA, biosynthetic DNA, naturally-isolated DNA, enzyme-digested DNA, sheared DNA, labeled DNA, such as radio labeled DNA and fluorochrome-labeled DNA, DNA containing one or more non-naturally occurring species of nucleic acid. DNA sequences can be synthesized by standard chemical techniques, for example, the phosphotriester method or via automated synthesis methods and PGR methods. The purified and isolated DNA sequence may also be produced by enzymatic techniques.
[00100] RNA which is referred to herein can be e.g. single-stranded RNA, cRNA, double stranded RNA, double stranded RNA wherein one or both strands are composed of two or more fragments, double-stranded RNA wherein one or both strands have an uninterrupted phosphodiester backbone, RNA containing one or more single-stranded portion(s) and one or more double-stranded portion(s), double- stranded RNA wherein the RNA strands are fully complementary, double-stranded RNA wherein the RNA strands are only partially complementary, covalently cross- linked RNA, enzyme digested RNA, sheared RNA, mRNA, chemically-synthesized RNA, semi-synthetic RNA, biosynthetic RNA, naturally isolated RNA, labeled RNA, such as radiolabeled RNA and fluorochrome-labeled RNA, RNA containing one or more non-naturally occurring species of nucleic acid.
[00101] Moreover, DNAs or RNAs comprising unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two examples, are
polynucleotides as the term is used herein.
[00102] With "variants" or "variants of a sequence" is meant a nucleic acid sequence that vary from the reference sequence by nucleic acid substitutions, preferably by conservative nucleic acid substitution, whereby one or more nucleic acids are substituted by another with same characteristics. Variants encompass as well degenerated sequences, sequences with deletions and insertions, as long as such modified sequences exhibit the same function (functionally equivalent) as the reference sequence.
[00103] The terms "polypeptide", "peptide", "protein", "polypeptidic" and "peptidic" are used interchangeably herein to refer to a polymer of amino acid residues connected to the other by peptide bonds between the alpha-amino and carboxy groups of adjacent residues. The terms apply to amino acid polymers in which one or more amino acid residue is an artificial chemical analogue of a corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
[00104] The term "isolated and purified nucleic acid sequence" refers to the state in which the nucleic acid sequence will be free or substantially free of material with which they are naturally associated such as other nucleic acids with which they are found in their natural environment, or the environment in which they are prepared (e. g. cell culture) when such preparation is by recombinant technology practiced in vitro or in vivo.
[00105] As cell culture system continuous or discontinuous culture such as batch culture or fed batch culture can be applied in culture tubes, shake flasks or bacterial fermenters. Host cells are usually cultured in conventional media as known in the art such as complex media like LB broth, "nutrient yeast broth medium", minimal media or a glycerol containing medium as described by Kortz et al., J. Biotechnol., 1995, 39, 59- 65 , or a mineral salt media as described by Kulla et al., Arch. Microbiol., 1983, 135, 1 - 7. LB broth and LB medium are used interchangeable.
[00106] The "origin of replication" (also called the replication origin) is a particular sequence in a genome at which replication is initiated. This can either involve the replication of DNA in living organisms such as prokaryotes and eukaryotes, or that of DNA or RNA in viruses, such as double-stranded RNA viruses. DNA replication may
proceed from this point bidirectionally or unidirectionally.
[00107] "Promoter" as used herein refers to a nucleic acid sequence that regulates expression of a transcriptional unit. A "promoter region" is a regulatory region capable of binding RNA polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence. Within the promoter region will be found a transcription initiation site (conveniently defined by mapping with nuclease S1), as well as protein binding domains (consensus sequences) responsible for the binding of RNA
polymerase such as the putative -35 region and the Pribnow box.
[00108] "Signal sequence" or "signal peptide sequence" refers to a nucleic acid sequence which encodes a short amino acid sequence (i.e., signal peptide) present at the NH2-terminus of certain proteins that are normally exported by cells to non- cytoplasmic locations (e.g., secretion) or to be membrane components. Signal peptides direct the transport of proteins from the cytoplasm to non-cytoplasmic locations.
[00109] 'Translation initiation region" is a signal region which promotes translation initiation and which functions as the ribosome binding site such as the Shine Dalgarno sequence.
[001 10] 'Terminator", "transcription terminator" and "transcription termination region" are used interchangeably and indicate a section of genetic sequence that marks the end of gene or operon on genomic DNA for transcription. It causes RNA polymerase to terminate transcription. The transcription termination region is usually part of a transcriptional unit and increases the stability of the mRNA.
[001 1 1] 'Transcriptional unit" as used herein refers to a nucleic acid sequence that is normally transcribed into a single RNA molecule. The transcriptional unit might contain one gene (monocistronic) or two (dicistronic) or more genes (polycistronic) that code for functionally related polypeptide molecules.
[001 12] A nucleic acid sequence is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a signal sequence is operably linked to DNA for a protein if it is expressed as a preprotein that participates in the secretion of the protein; a promoter is operably linked to a coding sequence if it affects the transcription of the sequence; or a translation initiation region such as a ribosome binding site is operably linked to a nucleic acid sequence encoding e. g. a polypeptide if it is positioned so as to facilitate translation of
the polypeptide. Linking can be accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice.
Examples
[001 13] The Examples which follow are set forth to aid in the understanding of the invention but are not intended to, and should not be construed to limit the scope of the invention in any way. The Examples do not include detailed descriptions of
conventional methods, e.g., cloning, transfection, and basic aspects of methods for overexpressing proteins in microbial host cells. Such methods are well known to those of ordinary skill in the art.
Example 1 - Rapid qualitative hydroxynitrile lyase assay
[001 14] The assay detects the cyanogenesis reaction. It was performed on microtiter plate format. A final volume of 150 L contained 10 - 30 pL of sample and 20 pL of racemic mandelonitrile (30 - 90 mM stock). The reaction was performed in 100 mM sodium citrate buffer at pH 4.5 and gaseous cyanide was detected through a Feigl- Anger test paper (Krammer et al., 2007) by the development of blue spots.
Example 2 - £¾HNL1 Identification
[001 15] The total RNA was isolated from Da va Ilia teyermanii f o 11 o w i n g the protocols provided by the Spectrum™ Plant Total RNA Kit (Sigma Aldrich). Quality assessment to ensure RNA integrity was performed with an Agilent 2100 Bioanalyzer (Agilent Technologies) and agarose gel electrophoresis (1 % agarose gel, running conditions: 80V, 40 min). The transcriptome sequence was obtained via a commercial service from Microsynth, by De Novo Next generation sequencing of a normalized
transcriptome library (Roche sequencing system). The resulting transcriptome data were deposited on an in-house server for analysis with a WEB based tool developed for enzyme identification from transcriptome data.
[001 16] Proteins were isolated from Da va Ilia teyermanii leaves using the P-PER™ Plant Protein Extraction Reagent (Life Technologies) according to the manufacturers protocol. PD-10 desalting columns (GE Healthcare) were used for buffer exchange to 50 mM sodium phosphate buffer, pH 5.7.
[001 17] The partial purification of proteins with HNL activity was carried out by using anion exchange chromatography (HiTrap QFF 1 mL column, from HiTrap I EX
Selection Kit, GE Healthcare), previously equilibrated with 20 mM sodium phosphate
buffer, pH 5.7. Elution was performed by applying a gradient from 0 to 1 M NaCI. The fractions showing HNL activity (between 100 and 200 mM NaCI) were employed for Blue-native (BN) PAGE (Polyacrylamide gel electrophoresis; NativePAGE™ No vex® 4-16% Bis-Tris Protein gels, Life Technologies). All buffers were prepared according to the provided manual. The electrophoresis was performed at 4°C in two steps: 60 min at 150 V, and 90 min at 250 V. After electrophoresis, the gel was equilibrated in the reaction buffer (sodium citrate buffer, 100 mM, pH 4.0) for 30 minutes at 4°C.
[001 18] A suitable activity screening procedure had to be developed: a sandwich assembly was used, where the polyacrylamide gel was in direct contact with a filter paper soaked with substrate (racemic mandelonitrile 4 pL/mL) and reaction buffer (sodium citrate, 100 mM, pH 4.0). On top of the gel, a Feigl-Anger test paper was placed, separated from the gel by a permeable nylon tissue (Krammer et al., 2007). BN-PAGE gels were subsequently stained by silver staining (Blum, et al., 1987). A sharp 20-kDa-band was observed in the elution fractions, which corresponded to the HNL activity signal (Fig. 1 ).
[001 19] Bands corresponding to the location of the dark blue spots and the surrounding area were excised from the gel and analyzed by LC-MS/MS.
[00120] Excised protein bands were tryptically digested. Peptide extracts were dissolved in 0.1 % formic acid and separated by nano-RP-HPLC. The sample was ionized in the nanospray source equipped with nanospray tips. It was analyzed in a Thermo LTQ-FT mass spectrometer operated in positive ion mode, applying alternating full scan MS (m/z 400 to 2000) in the ion cyclotron and MS/MS by collision induced dissociation of the five most intense peaks in the ion trap with dynamic exclusion enabled.
[00121] The LC-MS/MS data were analyzed by searching the D. teyermanii
transcriptome database with Proteome Discoverer 1.3 and Mascot 2.3. Hits were subjected to BLAST alignment against NCBI non-redundant public protein database.
[00122] From the MS/transcriptome analysis, six genes were cloned, expressed and investigated for HNL activity.
Example 3 - Cloning and expression of putative HNL proteins
[00123] Synthetic genes were ordered after codon optimization for expression in E. co// (Gene Art® Gene synthesis, Life Technologies). The genes were cloned into the pMS470 vector using standard techniques (restriction sites: NddlHindW). E. coli
TOP 10 F' competent cells were transformed with a standard method. Clones were selected on LB agar plates with the antibiotic ampicillin (100 mg/L). Proteins were expressed in liquid LB media culture with ampicillin (100 mg/L) at 25°C for 20 h after addition of 1 mM of IPTG. Cell free extracts were subjected to the rapid qualitative hydroxynitrile lyase assay as described herein and one of the target sequences displayed HNL activity.
[00124] One of the six targets displayed HNL activity, determined as described above, and was termed £¾HNL1. Neither the gene nor the protein sequence showed similarities higher than 31 % when subjected to a blastp search in NCBI (Basic Local Alignment Search Tool of NCBI, http://blast.ncbi.nlm.nih.gov/Blast.cgi, max target sequences: 100; expect threshold: 10; word size: 3; max matches in a query range: 0; matrix: BLOSUM62; gap costs: existence 1 1 , extension 1 ; computational adjustments; conditional compositional score matrix adjustment; database: non-redundant protein sequences (nr)). The most similar proteins to ¾HNL1 on basis of its amino acid sequence belong to the SRPBCC superfamily of proteins.
[00125] The genes coding for proteins with HNL activity from Da va Ilia teyermanii άο not contain any known signal sequences like those of other HNLs from plant species. In other species, the HNLs are separated from the substrate for example by transport to the extra- or intercellular space. DtHHL isoenzymes appear to be intracellular in the natural host and also if the protein is expressed employing its natural unmodified protein sequence in P. pastoris, it remains intracellular. However, it is known in the art from many other examples that natural intracellular proteins can be expressed as secreted proteins by adding a secretion signal sequence to the N-terminal end of the protein sequence or by fusion to other carrier proteins.
Example 4 - Identification of £¾HNL isoenzymes 2, 3, and 4
[00126] The amino acid sequence of ¾HNL1 was employed as a template for the identification of similar sequences within the D. teyermanii tra n scri ptom e (Protein- nucleotide BLAST with standard parameters). Three nucleotide sequences were identified as putative Z¾HNL1 isoenzymes. D. teyermanii g D N A wa s isolated with the PowerPlant®Pro DNA Isolation Kit (MO BIO Laboratories, Inc.) according to the provided manual. Putative isoenzyme sequences were amplified by PGR with 100 ng of gDNA as template (98°C 30 sec initial denaturation, 98°C 7 sec denaturation, 58°C 15 sec annealing, 72°C 30 sec extension, 72°C final elongation). PGR products were
isolated from a 1 % agarose gel after electrophoresis and sequenced (LGC Genomics). Identified sequences were confirmed or corrected according to the sequencing results.
[00127] PGR products were cloned into the pJET1.2 vector with the CloneJET™ PGR Cloning Kit (Life Technologies) according to the provided manual. E. col HOP 10 F' competent cells were transformed with a standard method. Clones were selected on LB agar with ampicillin (100 mg/L), screened by colony PGR and confirmed by sequencing (LGC Genomics).
Example 5 - £¾HNL1 expression in E. a?// and in P. pastoris
[00128] A synthetic gene was ordered after codon optimization for expression in E. co//' (GeneArt® Gene synthesis, Life Technologies). The gene was cloned into the pET-26(b)+ vector using standard techniques (restriction sites: Nddi HindW) and confirmed by sequencing (LGC Genomics). E. co// BL21 -Gold (DE3) competent cells were transformed with a standard method. Clones were selected on LB agar plates with kanamycin (50 mg/L). HNL expression was performed according to the following protocol: A single colony of E. co//BL21 - Gold (DE3), transformed with pET- 26(b)+ Z2/HNL1 opt, was incubated overnight in LB medium (Luria broth medium (also called LB-Lennox) (20 g/L: 10 g/L Tryptone, 5 g/L Yeast extract, 5 g/L NaCI), Carl ROTH, 76185 Karlsruhe, Germany, www.carlroth.com) with 50 mg/L kanamycin at 37°C. After 16 h, a main-culture was started by inoculation of 400 mL LB medium with kanamycin (50 mg/L) with an aliquot of the pre-culture to a final ODeoo of 0.1. The culture was incubated at 37°C and 150 rpm until the ODeoo reached 0.7. Then, 0.5 mM IPTG was added and the culture was incubated at 25°C and 150 rpm for 20 h
[00129] A synthetic gene was ordered after codon optimization for expression in P. pastoris (GeneArt® Gene synthesis, Life Technologies). The gene was cloned into the pPpT4 S and pPpB1 shuttle vectors, under the control of the AOX1 promoter
(restriction sites: EcoRM NoA). E. co//TOP10 F' competent cells were transformed with a standard method. Clones were selected on LB agar plates with Zeocin (25 mg/L). Plasmid DNA was isolated with the GeneJET Plasmid Miniprep Kit (Life Technologies) according to the provided protocol and the sequence of interest was confirmed by sequencing (LGC Genomics). The DNA preparation was linearized with SmA
(pPpT4 £>/HNL1 ) or BgA\ (pPpB1 £>/HNL1 ). Transformation of P. pastoris
CBS7435MutS was performed as described by Lin-Cereghino et al. (2005). Clones were selected on YPD agar plates with Zeocin (100 or 600 mg/L for low and high copy
number integration, respectively). Selected P. pas ton's CBS 7435 MutS clones were screened for the best Z¾HNL1 expression in micro-scale cultivation (96-deep well plates). The expression was performed according to the following protocol: cells from single colony were transferred to 250 μΐ_ in a buffered minimal dextrose medium BMD1 % (1 % glucose; (1.34 g/L yeast nitrogen base with ammonium sulfate and without amino acids, 200 mM potassium phosphate buffer at pH 6, 0.4 mg/L biotin, 2 g/L glucose)) at 28°C, 320 rpm and 80% humidity. A first induction was performed with 250 pL BMM2 (0.5% final concentration of methanol; (1.34 g/L yeast nitrogen base with ammonium sulfate and without amino acids, 200 mM potassium phosphate buffer at pH 6, 0.4 mg/L biotin, 1 % v/v methanol)) after 60 h of cultivation. Further inductions were performed with 50 pL BMM10 (1.34 g/L yeast nitrogen base with ammonium sulfate and without amino acids, 200 mM potassium phosphate buffer at pH 6, 0.4 mg/L biotin, 5% v/v methanol (0.5% final concentration of methanol)) 10, 24 and 48 h after the first induction. The cultivation was stopped 72 h after the first induction. The cells were harvested by centrifugation, disrupted with Y-PER™ Plus Dialyzable yeast protein extraction reagent (Life Technologies) and screened for HNL activity. Small scale cultivation of the best P. pastor is CBS 7435 MutS clones expressing £¾HNL1 was performed in shake flasks. A single colony was inoculated in 180 mL (or 50 mL) BMD1 % for 60 h at 28°C, 120 rpm. Induction of protein expression was done by addition of 20 mL (or 5.5 mL) BMM10 and further addition of 2 mL (or 0.5 mL) methanol 10 h, 24 h and 48 h after the first induction. 72 h after the first induction, the cells were pelleted by centrifugation and disrupted as described in Example 7. Cell free lysate was assayed for HNL activity (as described in Example 10).
[00130] The level of Z¾HNL1 expression in E. a?// was rather low using the pMS470 vector system (Fig. 2). Switching to the pET-26(b)+ expression vector in E. coli,
£¾HNL1 production significantly increased. The protein was present in the soluble fraction (MW 20 kDa) only (Fig. 2). The protein expression level further increased, when P. pas ton's was used as a host system (Fig. 2). Both E. coli and P. pastoris showed an adequate Z7 HNL1 production for further experiments.
Example 6 - Z¾HNL cloning and expression in E co//BL21 Star (DE3)
[00131] In order to facilitate protein purification, DM L isoenzymes 1 -4 were cloned into the pEHISTEV vector (Restriction sites: NcdIHindW) and the constructs confirmed by sequencing (LGC Genomics). E. coli BL21 Star (DE3) competent cells were
transformed with a standard method. Clones were selected on LB agar plates with kanamycin (50 mg/L). His-HNL expression was performed according to the protocol described above.
Example 7 - Preparation of cell free extract, protein quantification and SDS-PAGE
[00132] E. coir. Cell pellets were re-suspended in 25 ml_ of 50 mM potassium phosphate buffer, pH 6.0, and disrupted by sonication (80% duty cycle, 7 output, 6 min). Samples were kept on ice during cells disruption. P. pastoris: Cell pellets were disrupted with Y-PER Plus Dialyzable Yeast Protein Extraction Reagent (Life
Technologies) at room temperature or mechanical disruption using glass beads alternating cycles of 1 min vortexing and 1 min incubation in ice (50 mM potassium phosphate buffer, pH 6.0).
[00133] The protein amounts were quantified by use of the Bradford reagent (Roti®- Quant - Carl Roth), or the Pierce™ BCA Protein Assay Kit (Life Technologies).
[00134] Precast NuPAGE® Novex 4-12%Bis-Tris Gels (Life Technologies) were employed for SDS- PAGE. 15
of each sample were loaded on the gel after denaturation in LDS Sample Buffer (Life Technologies). Running buffer: NuPAGE® MES SDS Running Buffer (Life Technologies). Running conditions: 200 V, 45 min. Gels were stained with the Simply Blue SafeStain (Life Technologies).
Example 8 - Z¾HNL purification
[00135] Protein purification was performed by affinity chromatography with a HisTrap FF 5 mL column (GE Healthchare) using an AKTA purifier system. E. coli lysate was prepared in 20 mM sodium phosphate, 0.5 M NaCI, 10 mM imidazole, pH 7.4 as described above. Clear lysate (filtered through a 0.45 μηη syringe filter) was loaded on the column, previously equilibrated with 20mM sodium phosphate, 0.5 M NaCI, 10 mM imidazole at pH 7.4 (start buffer). Unbound material was washed with 10 column volumes of start buffer. Elution was performed as follows: a gradient from 0 to 100% elution buffer (20 mM sodium phosphate, 0.5 M NaCI, 500 mM imidazole, pH 7.4) was used for 15 column volumes. A final washing step was done with 100% elution buffer for 5 column volumes. Fractions containing 7/HNL were combined and desalted (HiPrep 26/10 Desalting GE Healthcare). Purified protein fractions were stored in 50 mM sodium phosphate pH 6.5, at -80°C.
[Example 9 - £¾HNL1 gel filtration
[00136] Z¾HNL1 (0.5 ml_; 1 mg/mL in 10mM Tris-HCI pH8) was loaded onto a
Superdex 200 10/300 GL column (catalog # 17-5175-01 , GE Healthcare; Bed volume: 24 ml_; Matrix: Spherical composite of cross-linked agarose and d extra n) at 4°C on an ΑΚΤΑ Avant 25 (catalog # 28-9308-42, GE Healthcare) with a flow rate of 0.1 mL/min. The column was pre-equilibrated with 150 mM NaCI, 10 mM Tris-HCI at pH 8. The absorbance of the eluent was monitored at 280 nm (A280). A Gel-Filtration-Standard (catalog #151 -1901 , BioRad) was used in 1 : 10 dilution. The apparent molecular mass of native 7 HNL1 was estimated according to the elution profile of the protein as depicted in Fig. 9 to be approximately 40 kDa, indicating that the active enzyme forms a dimer. This is in contrast to the active form of the Phlebodium aureum HNL, whose native size was estimated to be 168±40kDa (Wajant et al., 1995).
Example 10 - Hydroxynitrile lyase assay for determination of initial rate activity
[00137] A spectrophotometric assay to detect the HNL reaction in cyanogenesis direction was performed on microtiter plate format. A final volume of 200 μΙ_ contained 20 μΙ_ of protein sample and 130 μΙ_ of 50 mM sodium citrate phosphate buffer, pH 5.0. Blank reactions were carried out in which 50 mM sodium phosphate buffer pH 6.5 was used instead of protein sample. The reaction was initiated by the addition of 50 μΙ_ of substrate solution (60 mM racemic mandelonitrile in 3 mM sodium citrate-phosphate buffer, pH 3.5). The reaction was measured spectrophotometrically following the formation of benzaldehyde at 280 nm for 10 minutes at 25°C in a Synergy Mx plate reader (BioTek). For calculations, the blank values were subtracted. The values given in the tables below refer to the average of three (technical triplicates) to nine (three biological triplicates in technical triplicate) single measurements and their respective standard deviations.
[00138] For the determination of the Z7 HNL1 activity at different pH values, citrate- phosphate buffer was used (pH 2.5 - 5.0).
[00139] Determination of the optimal reaction temperature was performed in cuvettes. A final volume of 1 ml_ contained 100 μΙ_ of sample and 700 μΙ_ of 50 mM citrate phosphate buffer, pH 5.0. The reaction was initiated by the addition of 200 μΙ_ of substrate solution (60 mM racemic mandelonitrile in 3 mM citrate phosphate buffer, pH 3.5). Enzymatic activity was measured from 10 to 50°C at 280 nm in a
spectrophotometer (Cary Series Agilent Technologies). For calculations, the blank
values were subtracted. The values given in the tables below refer to the average of three (technical triplicates) to nine (three biological triplicates in technical triplicate) single measurements and their respective standard deviations.
.Example 11 - Hydroxynitrile lyase assay for determination of cyanogenic activity with any cyanohydnn
[00140] Enzymatic activity with unnatural substrates was measured
spectrophotometrically on microtiter plate format (Andexer, Guterl, Pohl, & Eggert, 2006) in a Synergy Mx plate reader (BioTek). The reaction mix consisted of: 140 μΙ_ of citrate phosphate buffer pH 5.0, 10 μΙ_ of DMNL (0.001 -0.01 mg) and 10 μΙ_ of cyanohydrin stock solution (300 mM cyanohydrin in 0.1 M citric acid). The reaction was performed at room temperature for 5-10 min. The reaction was stopped by the addition of 10 μΙ_ of mix I [100 mM AAchlorosuccinimide, succinimide 10-fold excess (w/w) in water] followed by incubation at room temperature and 1 ,000 rpm for 5 min.
Afterwards, 30 μΙ_ of mix II (barbituric acid 125 mM, isonicotinic acid 65 mM in 0.2 M NaOH) were added and the absorbance was measured at 600 nm for 20 min in a Synergy Mx plate reader (BioTek).
Example 12 -Activity of i¾HNL1 for aliphatic substrates
[00141] The ability of ¾HNL1 to accept aliphatic substrates was measured.
Therefore, a spectrophotometric assay was performed as described herein. Racemic mandelonitrile was used as the positive control. Z7 HNL1 prefers the natural substrate mandelonitrile, but also aliphatic substrates such as acetone cyanohydrin are converted.
Example 13 - Specific activities of DMHL isoenzymes 1-4
[00142] The specific activity of DMHL isoenzymes 1 -4 was determined
spectrophotometrically. Therefore, HNL activity was measured as described above. R- Mandelonitrile was used as the substrate. 1 Unit (U) was defined as the amount of enzyme that produces 1 μηηοΙ of benzaldehyde per min under the assay conditions. Each reaction was carried out in biological duplicate and technical triplicate and the blank was subtracted. The specific activities were 562 U/mg for Z2/HNL1 , 746 U/mg for 0 HNL2, 607 U/mg for DMML3 and 861 U/mg for / /HNL4, respectively (with a confidence interval of 20%). This is significantly higher (4 to 6 times) than the specific activities of any other recombinant HNL (cleavage of the natural substrate
mandelonitrile, Dadashipour & Asano, 201 1 ).
[Example 14 - pH dependence of Z¾HNL isoenzymes 1-4
[00143] ¾HNL activity at different pH was measured spectrophotometrically.
Therefore, enzyme solution (20 μΙ_, 0.01 mg/mL) was added to 50 mM sodium citrate- phosphate buffer pH 2.5, 3.0, 3.5, 4.0, 4.5 5.0, 5.5, 6.0, 6.5 and 7.0 and 50 mM potassium chloride buffer pH 2.0 and 2.5. The reaction was started by addition of racemic mandelonitrile (15 mM final concentration). HNL activity was measured as described in Example 10. In parallel, blank reactions were performed at each pH in which sodium phosphate buffer pH 6.5 was used instead of 9/HNL1 solution. Each reaction was carried out in biological triplicate and technical triplicate and the blank was subtracted. Since HNL reactions are preferably carried out at acidic conditions due to the chemical background reaction at pH > 6, values from pH 5.0 to pH 2.5are more suitable for our purposes. At pH 2.5, the £¾HNL1 enzyme shows 44±1 % of its activity (Table 1 ). These characteristics can overcome the problem of background reaction, common for cyanohydrins at higher pH. Similar results were recorded for £¾HNL2, 3 and 4 as well.
[00144] As used herein, relative [%] means that the values in entry 9 (£¾HNL1 ) and entry 8 (Z7 HNL2-4) were set to 100% and all other % values were normalized to the respective value of each column. Standard deviation was calculated from two different independent experiments. Entries 1 and 2 report results obtained in potassium chloride buffers, while entries 3-12 refer to enzymatic activity in sodium citrate-phosphate buffers.
Table 1
11 6.5 61 4 65 8 75 4 73 4 0.2477
12 7.0 26 12 25 11 28 4 26 14 0.6215
Example 15 - pH stability of £¾HNL isoenzymes 1-4
[00145] DtHNL isoenzymes were incubated in 50 mM phosphate citrate buffer (final concentration of 1 mg/mL) at the pH values as specified in Table 2 at 5-8°C.
Furthermore, 1 mg/mL protein was incubated in 50 mM sodium phosphate buffer pH 6.5. At the time-points specified in Table 2, an aliquot of enzyme was diluted to 0.01 mg/mL. HNL activity was measured as described in Example 10. In parallel, blank reactions were performed at each pH by addition of each storage buffer instead of the protein sample. Each reaction was carried out in biological duplicate and technical triplicate and blank was subtracted. Surprisingly, Z¾HNL1 and 3 are still active after 72 h at pH 2.5 (Table 2, Table 4) whereas the activity of isoenzymes 2 and 4 decreased significantly at this pH (Table 3, Table 5).
[00146] The stability of the Z7 HNL isoenzymes at different pH values is distinct, which opens the possibility to choose the most suitable isoenzymes for particular applications at different pH values.
Table 2
Z¾HNL1
pH 2.5 pH 4.0 pH 5.0 pH 6.5
Time of
Example Relative [%] Relative [%] Relative [%] Relative [%] incubation [h]
1 0 100±31 100+0 100±31 100±31
2 2 1 12±17 92±18 98±17 106±19
3 4 142±12 1 1 1 ±18 133±26 151 ±37
4 8 125±2 92±9 1 13+ 15 1 13±37
5 24 94±2 76±12 1 3+ 16 1 1 1 ±24
6 48 56±31 38±18 64±17 64±20
7 72 63±16 28±5 91 ±2 85±17
Table 3
£¾HNL2
pH 2.5 pH 4.0 pH 5.0 pH 6.5
Time of
Example Relative [%] Relative [%] Relative [%] Relative [%] incubation [h]
1 0 100±13 100±13 100±13 100±13
2 2 1 12±24 101 +1 1 121 ±23 1 19±22
3 4 127±36 120±18 122±27 123±26
4 8 1 10+19 98±20 95±19 96±12
5 24 83±19 96±16 95±7 101 ±7
6 48 64±15 103±18 91 ±25 1 10±17
7 72 28±1 1 102±19 104±14 104±1 1
Table 4
£¾HNL3
pH 2.5 pH 4.0 pH 5.0 pH 6.5
Time of
Example Relative [%] Relative [%] Relative [%] Relative [%] incubation [h]
1 0 100±4 100±4 100±4 100+4
2 4 91 ±1 95±13 104±27 1 18±2
3 8 88±1 1 101 ±5 1 12±25 1 13±3
4 24 80±0 86±6 100±28 78±3
5 48 71 ±1 103±2 1 14±30 86±2
6 72 60±1 1 84±7 107±38 n.d.
Table 5
£¾HNL4
pH 2.5 pH 4.0 pH 5.0 pH 6.5
Time of
Example Relative [%] Relative [%] Relative [%] Relative [%] incubation [h]
1 0 100±5 100±5 100±5 100±5
2 2 94±12 95±6 95±9 1 12±6
3 4 101 ±9 102±4 101 ±2 1 16±5
4 8 88±9 78±13 85±1 100±13
5 24 53±2 88±10 86±14 105±6
6 48 29±0 82±4 82±10 94+4
7 72 7±2 64±1 64±3 74±2
Example 16 - Temperature dependence of £¾HNL isoenzymes 1-4
[00147] ? HNL activity at different temperature was measured spectrophotometrically in quartz cuvettes (1 mL). Therefore 100 μΙ_ of enzyme solution (0.001 mg/mL final concentration) was added to 700 pL of 50 mM citrate phosphate buffer, pH 5.0. The reaction was started by the addition of 200 pL 60 mM racemic mandelonitrile (12 mM
final concentration). In parallel, blank reactions were performed at each temperature in which sodium phosphate buffer pH 6.5 was used instead of £¾HNL solution. Enzymatic activity was measured from 10-50°C at 280 nm in a spectrophotometer (Cary Series Agilent Technologies). Each reaction was carried out in biological triplicate and technical triplicate and the blank was subtracted. Table 6 shows that the activity of ¾HNL isoenzymes increases with temperature. The highest activity of Z¾HNL1 and 7 HNL3 was observed at 40°C, of DMHL2 at 35°C, and of Z¾HNL4 at 45°C. However, beyond 30°C, also the chemical background reaction (decomposition of racemic mandelonitrile to benzaldehyde and HCN) reaches increasing levels. Since the chemical background reaction is non-specific, it negatively influences the e.e. of the reaction products. Hence, the reactions in this system should preferably be carried out at low temperatures, e.g. below 10°C.
Table 6
Example 17 - Temperature stability of £¾HNL isoenzymes 1-4
[00148] Z2 HNL1 (1 mg/mL) was incubated in sodium phosphate buffer at pH 6.5 at different temperatures as specified in Table 7. At the time-points specified in Table 7, an enzyme aliquot was diluted to 0.01 mg/mL and HNL activity was measured at 25°C as described herein. Each reaction was carried out in biological duplicate (Z7 HNL1 ) or triplicate ( ¾HNL2, 3, 4) and technical triplicate and the blank was subtracted. The same experiments were carried out for ¾HNL2 (Table 8), 7 HNL3 (Table 9) and Z7 HNL4 (Table 10).
[00149] Z2/HNL1 (Table 7) and DMHL2 (Table 8) are not stable at 40°C and loose activity already after 2 hours. Surprisingly, DMNL3 and Z2 HNL4 are more stable at 40°C. After 48 hours, DMNL3 and Z7 HNL4 show 65% and 50% of their original activity (Table 9 and Table 10).
Table 7
Table 8
Table 9
£¾HNL3
8°C 20°C 30°C 40°C
Time of incubation Relative Relative Relative Relative
Example SD SD SD SD
[h] [%] [%] [%] [%]
1 0 100 23 100 25 100 30 100 26
2 4 108 18 93 23 96 29 94 32
3 8 99 3 91 16 96 5 95 15
4 24 88 34 98 46 123 50 106 87
5 48 95 30 107 25 96 39 68 24
6 72 94 25 1 10 12 53 2 67 18
Table 10
¾HNL4
8°C 20°C 30°C 40°C
Time of incubation Relative Relative Relative Relative
Example SD SD SD SD
[h] [%] [%] [%] [%]
1 0 100 18 100 18 100 18 100 18
2 4 96 26 93 20 107 30 98 5
3 8 87 23 82 19 94 23 79 5
4 24 77 27 69 20 105 21 53 21
5 48 73 24 85 32 1 1 1 27 51 15
6 72 64 26 97 37 102 30 41 8
Example 18 - Use of £¾HNL1 for the synthesis of cyanohydrins
[00150] The target protein was tested for its ability to catalyze the cyanohydrin synthesis from benzaldehyde (or 2-CI-benzaldehyde, 3-Ph-propionaldehyde, cinnamaldehyde, 2- furylaldehyde, acetophenone) and HCN. The reactions were performed in a two-phase system with methyl tertiary-butyl ether in a total volume of 1.5 ml_. The HCN-solution (~ 2 M) was prepared as described before by Okrob et al. , 201 1. All reactions involving HCN were performed in a ventilated hood with gas trap and equipped with a calibrated HCN detector. A stock solution was prepared
containing MTBE-HCN, 0.5 M aldehyde substrate and an internal standard (2% triisopropylbenzene, v/v final concentration). From this mixture, a sample was withdrawn before starting the reactions and used as a blank value, representing the initial amount of substrate and internal standard. Protein samples were acidified with 50 μΙ_ of 1 M sodium acetate buffer to pH 4.0. The reactions were started by adding 1 ml_ of the reaction solution to 0.5 ml_ of enzyme solution and performed at 5°C 1 ,000 rpm and additional magnetic stirring. After 2 and 24 h, 50 μΙ_ of samples were taken (organic phase), mixed with 1 ml_ of derivatization solution (85% dichloromethane, 10% acetic acid anhydride and 5% pyridine v/v) and incubated at room temperature for 1 h. The samples were analyzed by gas chromatography on an Agilent 6890N
(G1530N) equipped with a FID PAL system for vials and microtiter plate autosampler, using a CP-Chirasil-DEX CB column (Varian CP-7503, 25.0 m, 320 μηη, 0.25 pm) and
molecular hydrogen as carrier gas. The analytical parameters and retention times are given in Table 1 1.
Table 1 1
[00151] Table 12 summarizes the results of Z7 HNL1 catalyzed synthesis of cyanohydrins.
[00152] Z2 HNL1 is more active than /¾HNL in terms of mandelonitrile synthesis. When 0.096 mg/mL of commercial /¾HNL (Sigma) preparation is used in the same reaction setup, 96.4% of conversion with 98.1 % e.e. is obtained (0.096 mg/mL /¾HNL and 0.056 mg 7 HNL1 correspond to 5 U/mL of enzyme determined in cyanogenesis direction).
[00153] -9/HNL1 is more enantioselective than any other reported HNL on 2-chloro- benzaldehyde [/¾HNL5 mutantA1 1 1 G e.e. 96.5%; ¾HNL1 wildtype e.e. 98.2% (Glieder et al., 2003)]. This is surprising, because other HNLs had to be subjected to protein engineering to reach good e.es and already the DM L wild-type sequence is better than the best ¾HNL mutants.
Table 12
enzyme non enzymatic conversion e.e. time
Substrate Product cone. transformation
% % (h)
(mg/mL) %
benzaldehyde (R)-mandelonitrile 0.056 97.9 99.5 24 22.7
2-chlorobenz- (R)-2-chloro-
2 98.5 98.2 2 6.4 aldehyde mandelonitrile
(R)-hydrocinnam-
3- aldehyde 2 97.0 50.4 24 80.0 phenylpropanal
cyanohydrin
(R)-cinnam- cinnamaidehyde aldehyde 6 98.2 92.6 24 0
cyanohydrin
(S)-furylaldeyde
2-furylaldehyde 6 95.3 99.3 0.5 10.5
cyanohydrin
acetophenone
acetophenone 6 23.5 >99 6 0
cyanohydrine
[00154] Enzyme concentration is based on the buffer phase of which 500 μΙ_ were emulsified with 1 ,000 μΙ_ of MTBE/HCN. Conversion is defined as the disappearance of substrate (corrected by the internal standard). Non-enzymatic transformation is defined as the consumption of the substrate detected in the blank reaction that did not contain enzyme (racemic cyanohydrin is produced). It needs to be noted that the non- enzymatic background reaction cannot be considered as a blank because it does not occur to the same extent if enzyme is present that consumes the substrates. However, the value gives an explanation to the low e.e. of the enzyme catalyzed product. Hence, the enzyme may still be perfectly stereoselective for the particular substrate but too slow at the respective conditions.
When using the same Units of enzyme (Units defined as μητιοΙ of mandelonitrile cleaved per minute - in cyanogenesis direction), the £¾HNL1 is superior to ¾HNL (Sigma Aldrich) in synthesis direction (benzaldehyde to mandelonitrile). The known HNLs typically prefer either aromatic substrates (e.g. /¾HNL) or aliphatic substrates (e.g. LLMUL). DM L accepts both types of substrates: it is highly active for the cleavage of mandelonitrile and it also cleaves the aliphatic cyanohydrins
acetonecyanohydrin and the cyanohydrin of hydroxypivaldehyde, albeit with lower activity. £¾HNL1 is at least active in synthesis direction of benzaldehyde, 2-CI- benzaldehyde, 3-Ph-propanal, cinnamaidehyde, furylaldehyde and acetophenone. Example 19 - 5 L scale fermentation of P. pastorissimn expressing ¾HNL1
[00155] A single colony of P. pastoris CBS 7435 MutS £>/HNL1 opt B1.G4 was incubated overnight in BMGY medium (10 g/L yeast extract, 20 g/L peptone, 700 ml_ dH2O; 100 mL/L 10x yeast nitrogen base, 100 mL/L 200 mM potassium phosphate buffer pH 6.0, 100 mL/L 10% w/v glycerol, 2 mL/L 500x biotin) at 28°C in shake flask. Cultivation was performed in 5 L BIOSTAT® CT fermenter. First, a glycerol batch phase (glycerol 4%) was performed over-night. When all the glycerol was consumed from the batch growth phase, a glycerol fed batch was initiated in order to achieve high
cell densities. Feed batch was terminated after consumption of about 1 kg of glycerol (ca. 24 h). Methanol was introduced gradually. During the methanol fed-batch phase, the feed rate was kept at limiting conditions. Induction was performed for 4 days until ca. 800 g of methanol were consumed. After initiation of the feed batch phase, culture samples were taken at different time points. Protein content, Z7 HNL1 specific activity, wet cell weight and cell dry weight were determined. For the determination of the cell dry weight, 200 μΙ_ of culture was harvested in a pre-weighed 1.5 mL tube. The supernatant was discarded and the cell pellet was dried in a 60°C incubator for 3 days. After drying, the tubes were weighed again and the cell dry weight was calculated. At the termination of the fermentation, the entire cell culture was harvested (ca. 1.5 kg of total wet cell weight).
[00156] Parts of the biomass (approximately 950g WCW) was disrupted with Micro DeBEE high pressure homogenizer. A total of approximately 3000kU of £¾HNL1 was obtained, which corresponds to approximately 12g of protein. Protein lysate was stored in 50 mM sodium phosphate buffer pH 6.5 at - 20°C. The fermentation is summarized in Table 13. Alternatively, parts of the same protein lysate were lyophilized and the specific activity was stable for at least three weeks both at -20°C and 4°C.
Table 13 P. pastoris CBS 7435 MutS Z7 HNL1 B1.G4. OD: optical density, WCW: wet cell weight, DCW: dry cell weight
Example 20 - Crystallization of Selenomethionine £¾HNL1
[00157] A single colony of E. co//BL21 Star (DE3), transformed with
pEHisTEV £>/HNL1 was incubated overnight in LB medium with 50 mg/L kanamycin at 37°C. After 16 h, the cell pellet was collected by centrifugation and washed two times with minimal media (M9 salts, 2% glucose, 2 mM MgSO4, 0.01 mg/mL thiamine, 0.01 mg/mL FeCb). The main-culture (400 mL minimal media with kanamycin (50 mg/L) and selenomethionine (SeMet (50mg/L)) was inoculated with an aliquot of the pre-culture to a final ODeoo of 0.25. The culture was incubated at 37°C and 150 rpm until the ODeoo
reached 0.5. Then, 0.5 mM IPTG was added and the culture was incubated at 25°C and 150 rpm for 38 h. Cell free extract was prepared as described above.
SeMet/¾HNL1 was purified by affinity chromatography (NiSepharose 6 Fast Flow resin). Elution was performed with 20 mM sodium phosphate, 0.5 M NaCI, 300 mM imidazole, pH 7.4. Fractions containing SeMetZ? HNL1 were combined and desalted (PD10 Desalting columns, GE Healthcare LifeScience). Protein was stored at -20°C in 50 mM potassium phosphate buffer pH 6.0 prior to crystallization.
[00158] Screening for crystallization conditions was performed using Morpheus Screen MD 1 -46, JCSG+ MD1 -37 (Molecular Dimensions) and Index HT HR2-144 (Hampton Research) by the sitting drop vapor-diffusion method in 96-well plates at 16°C.
[00159] SeMetZ?/HNL1 (3 mg/mL in 10mM Tris-HCI pH8) was used to grow crystals in 0.2 M sodium thiocyanate, 20% (w/v) polyethylene glycol 3350. A 1 :1 ratio of protein and screening solutions was used. Typically, crystals appeared after 2-3 days. After supplementation of 30% glycerol, the crystals were flash-cooled in liquid-nitrogen. The SeMet Z2 HNL1 datasets were collected at 100k from single crystals on the synchrotron beamline ID29 (EMBL, Grenoble, France). The data sets were processed and scaled using the XDS program package (Kabsch, 2010). The AutoSol Program (McCoy et al., 2007, Terwilliger et al., 2009) and the AutoBuild Program (Terwilliger et al., 2008) from the PHENIX software suit (Adams et al., 2010) were used to define the selenium heavy metal-atom sites using a SeMet Z7/HNL1 SAD data set, as well as to build an initial model. The resulting model was again completed manually in Coot and refined with PHENIX.
Example 21 - Crystallization of ¾HNL1 and soaking experiments
[00160] Native crystals of Z7 HNL1 were grown by mixing 0.5 μΙ_ protein sample (4 mg/mL in 10 mM Tris-HCI pH 8) with 1 L reservoir solution (0.9 M NaN03;
Na2HPO4; (NH4)2S04 mix, 0.1 M Tris-Bicine buffer pH 8.5 and 30% (w/v) polyethylene glycol monomethyl ether 550 & polyethylene glycol 20k; Morpheus condition C9) by the sitting drop vapor-diffusion method in Crystal Clear Duo crystallization frames at 16°C. Additionally, native crystals were also grown by mixing 1 pL protein sample (4 mg/mL in 10mM Tris-HCL pH 8) with 0.5 pL reservoir solution (0.1 M 2-(4-(2- hydroxyethyl)-1 -piperazinyl) ethanesulfonic acid pH 7.5 and 10% (w/v) polyethylene
glycol; JSCG condition B4) using the same method as previously described. Typically, crystals appeared after 2-3 days.
[00161] Soaking experiments were performed with the native Z¾HNL1 crystals (grown as described above).
[00162] The Crystal Clear Duo crystallization frames containing mature crystals were opened and neat (R)-mandelonitrile (MXN) (Sigma Aldrich) or benzoic acid - BEZ (Sigma Aldrich) was added with a small CryoLoop. After an incubation period of 30 s, 1 min, 5 min and 15 min crystals were harvested, flash-cooled in liquid nitrogen and used for diffraction data collection. The data sets with the soaked Z7 HNL1 crystals were collected at 100k from single crystals on the synchrotron beamline BM14U (EMBL, Grenoble, France). The datasets were processed and scaled using the program iMosflm (Battye et al., 201 1 ) and SCALA (Evans, 2006). For all soakings, Molecular Replacement was performed with Phaser-MR (McCoy et al., 2007). The previously obtained SeMet £¾HNL1 structure was used as a model for the Molecular Replacement with the data sets from the soaked ? HNL1 crystals. The resulting model was again completed manually in Coot and refined with PHENIX.
[00163] The unit cell, the assigned space groups and data statistics of the best data sets are shown in Table 14.
Table 14: Data-collection and processing statistics
[00164] The crystal structure revealed a dimer of 7 anti-parallel beta-strands, 2 alpha- helices and a large cavity in each monomer.
[00165] The data was subjected to a search against the RCSB Protein Data Bank. The highest similarity of the 7 HNL1 structure was found with the crystal structure of
the birch pollen allergen BET V 1 L (PDB Code 1 FM4). As typical for this type of protein fold, in each monomer a large cavity is formed between the beta-sheets and the alpha- helices (Fig 9).
[00166] The residues important for substrate binding and catalysis were identified from the structure containing the substrate benzaldehyde and the structure containing the product mandelonitrile. The substrate showed interactions with the residues arginine 94, aspartic acid 1 10 and serine 1 12, tyrosine 126 and tyrosine 142 (Fig. 10). This numbering refers to the sequence that starts with the HIS-tag and the TEV cleavage site, including some spacer residues. The active site residues are arginine R69, aspartate D85, serine S87 and three tyrosines Y101 , Y1 17 and Y161 in case the numbering refers to the native £¾HNL1 sequence.
Acknowledgements
[00167] The work leading to this invention has received funding from the European Union's Seventh Framework Program (FP7/2007-2013) under grant agreement n° 289646.
References
Adams, P. D. et al. (2010). Acta Cryst. D66, 213-221.
Battye, T. G. G., Kontogiannis, L, Johnson, O., Powell, H. R. & Leslie, A. G. W.
(201 1 ). Acta Cryst. D67, 271 -281.
Andexer, J., Guterl, J.-K., Pohl, M., & Eggert, T. (2006). A high-throughput screening assay for hydroxynitrile lyase activity. Chemical Communications (Cambridge,
England), 9(40), 4201-3.
Blum, H., Beier, H., & Gross, H. J. (1987). Improved silver staining of plant proteins,
RNA and DNA in polyacrylamide gels. Electrophoresis, 8(2), 93-99.
Dadashipour, M., & Asano, Y. (201 1 ). Hydroxynitrile Lyases: Insights into
Biochemistry, Discovery, and Engineering. ACS Catalysis, 1 (9), 1 121-1 149. Evans, P. (2006). Acta Cryst. D62, 72-82.
E. Lanfranchi, K. Steiner, B. Darnhofer, R. Birner-GrCinberger, A. Glieder, M. Winkler:
A new hydroxynitrile lyase from fern: from the plant to the sequence, ACIB
Science Days 2013, 10.9.2013 - 12.9.2013, Graz
E. Lanfranchi, K. Steiner, B. Darnhofer, R. Birner-Grunberger, A. Glieder, M. Winkler:
A new hydroxynitrile lyase from fern: from the plant to the sequence, Biotrans
2013, 21.7.2013 - 25.7.2013, Manchester
Glieder, A., Weis, R., Skranc, W., Poechlauer, P., Dreveny, !., Majer, S., Wubbolts M., Schwab H., Gruber, K. (2003). Comprehensive Step-by-Step Engineering of an (R)-Hydroxynitrile Lyase for Large-Scale Asymmetric Synthesis.
Angewandte Chemie International Edition, 42(39), 4815-4818.
Gruber-Khadjawi, M., Fechter, M.H., Griengl, H. (2012). Enzyme Catalysis in Organic Synthesis. In K. Drauz, H. Groger, & O. May (Eds.), Enzyme Catalysis in Organic Synthesis. Weinheim, Germany: Wiley-VCH Verlag GmbH & Co. KGaA.
Hussain, Z., Wiedner, R., Steiner, K., Hajek, T., Avi, M., Hecher, B., ... Schwab, H.
(2012). Characterization of two bacterial hydroxynitrile lyases with high similarity to cupin superfamily proteins. Applied and Environmental
Microbiology, 78(6), 2053-5.
Kabsch, W. (2010). Acta Cryst. D66, 125-132.
Krammer B., Rumbold K., Tschemmernegg M., Pochlauer P., Schwab H. (2007). A novel screening assay for hydroxynitrile lyases suitable for high-throughput screening. J Biotechnol. Mar 30;129( ): 151 -61.
Lanfranchi, E., Steiner, K., Glieder, A., Hajnal, I., Sheldon, R. A., Pelt, S. van, &
Winkler, M. (2013). Recent Developments in Hydroxynitrile Lyases for
Industrial Biotechnology. Recent Patents on Biotechnology, 7(3), 197-206.
Lin-Cereghino J., Wong W.W., Xiong S., Giang W., Luong L.T., Vu J., Johnson S.D., Lin-Cereghino G.P. (2005). Condensed protocol for competent cell preparation and transformation of the methylotrophic yeast Pichia pastoris. Biotechniques. Jan;38(1 ):44, 46, 48.
McCoy, A. J., Grosse-Kunstleve, R. W., Adams, P. D., Winn, M. D., Storoni, L. C. & Read, R. J. (2007). J. Appl. Cryst. 40, 658-674.
Terwilliger, T. C, Adams, P. D., Read, R. J., McCoy, A. J., Moriarty, N. W., Grosse- Kunstleve, R. W., Afonine, P. V., Zwart, P. H. & Hung, L.-W. (2009). Acta Cryst. D65, 582-601.
Terwilliger, T. C, Grosse-Kunstleve, R. W., Afonine, P. V., Moriarty, N. W., Zwart, P.
H., Hung, L.-W., Read, R. J. & Adams, P. D. (2008). Acta Cryst. D64, 61 -69.
Wajant, H., Forster, S., Selmar, D., Effenberger, F., & Pfizenmaier, K. (1995).
Purification and Characterization of a Novel (R)-Mandelonitrile Lyase from the Fern Phlebodium aureum. Plant Physiology, 109(4), 1231-1238.
Winkler, M., Glieder, A., & Steiner, K. (2012). C-X Bond Formation:Hydroxynitrile
Lyases: From Nature to Application. In E. M. Carreira & H. Yamamoto (Eds.), Comprehensive Chirality (Vol. 7, pp. 350-371 ).
Claims
1. A protein having hydroxynitrile lyase activity (HNL) comprising the following
properties:
a. action: catalyzing an asymmetric cyanohydrin reaction;
b. tertiary protein structure: dimer of 7 anti-parallel beta-strands, 2 alpha-helices and a large cavity in each monomer (Allergen BV1 -like type fold)
c. molecular weight: 20±5 kDa when measured by SDS-polyacrylamide
electrophoresis.
2. The protein according to claim 1 comprising active site residues R69, D85, S87, Y101 , Y1 17 and Y161 according to the numbering of SEQ ID NO: 1.
3. The protein according to claim 1 or 2, wherein said protein is derived from a plant, in particular from Polypodiades plants, in particular from the Davalliaceae family, and has hydroxynitrile lyase activity.
4. The protein according to any one of claims 1 to 3, wherein said protein is a
recombinant hydroxynitrile lyase (HNL).
5. The recombinant HNL according to claim 3, comprising SEQ ID NO: 1 or a variant amino acid sequence having at least 35% sequence identity with SEQ ID NO: 1.
6. The recombinant HNL according to claim 3 or 4, wherein said variant amino acid sequence has at least 50%, preferably at least 70%, or preferably at least 85 % identity with SEQ ID NO: 1.
7. The recombinant HNL according any one of claims 3 to 5 having the SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, or SEQ ID NO:4.
8. An isolated polynucleic acid molecule encoding a recombinant HNL according to any one of claims 3 to 6.
9. A vector comprising an isolated DNA molecule of claim 7.
10. A recombinant HNL recovered from a culture obtained by cuituring recombinant non-human cell obtained by introducing the vector of claim 8.
1 1. A method for producing enantiopure cyanohydrin compounds, wherein an
aldehyde or ketone compound is converted to the corresponding cyanohydrin compound in the presence of a cyanide donor and a protein having hydroxynitrile lyase activity (HNL) according to claim 1 or 2.
12. The method according to claim 10, wherein said protein is a recombinant HNL
according to any one of claims 3 to 6 or a cell or a culture containing such HNL.
13. The method according to claim 10 or 1 1 , wherein a compound of general formula I is reacted with a compound of general formula II in the presence of a HNL to yield an asymmetric cyanohydrin compound III
(Hi)
wherein
R1 and R2 are independently from one another H, Ci-2oalkyl, C2-2oalkenyl,
C2-2oalkynyl, C3-iocycloalkyl, C4-2ocycloalkylalkyl, C6-i4aryl, C7-2oarylalkyl, 3-14 membered heterocycloalkyl, 4-20 membered heterocycloalkylalkyl, 5-20 membered heteroaryl or 6-20 membered heteroarylalkyl, optionally substituted by one or more Ra;
R3 is H, an alkali metal, C(CH3)2OH, or Ci-2oalkyl; and
each Ra is independently H, halogen, -CF3, -OR , -NRbRb, -(CH2)nCOORb,
-(CH2)nC(=O)R , -(CH2)nCONRbRb, Ci-2oalkyl, C2-2oalkenyl,or C2-2oalkynyl; and each Rb is independently H or optionally substituted Ci-2oalkyl, C2-2oalkenyl, or
C2-2oalkynyl; and
n is 0, 1 , 2 or 3.
14. The method according to any one of claims 10 to 12, wherein the reaction is
carried out in a mono- or biphasic system or in an emulsion.
15. The method according to any one of claims 1 1 to 13, wherein the recombinant HNL is a purified enzyme or contained in cleared lysate or a culture supernatant or a whole cell containing the HNL or a culture broth (containing cells and media).
16. The method according to any one of claims 10 to 14, wherein the cyanohydrin compound is obtained with at least 50%, preferably with at least 60%, more preferred with at least 75% enantiomeric excess (e.e.).
17. The method according to any one of claims 10 to 15, wherein the R enantiomer of the cyanohydrin compound is obtained with at least 50%, preferably with at least 60%, more preferred with at least 75% enantiomeric excess (e.e.).
The method according to any one of claims 10 to 16, wherein the reaction is started in the presence of buffer with a pH of from 2 to 6.5, preferably between 2 to 4.5 and most preferably between 2.5 to 4.5.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP14180247 | 2014-08-07 | ||
EP14180247.0 | 2014-08-07 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016020528A1 true WO2016020528A1 (en) | 2016-02-11 |
Family
ID=51298597
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2015/068262 WO2016020528A1 (en) | 2014-08-07 | 2015-08-07 | New hydroxynitrile lyases |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2016020528A1 (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1160329A2 (en) * | 2000-06-02 | 2001-12-05 | Nippon Shokubai Co., Ltd. | An enzyme reaction method and a method for enzymatically producing an optically active cyanohydrin |
WO2012143374A1 (en) * | 2011-04-18 | 2012-10-26 | Teknologian Tutkimuskeskus Vtt | Hypoallergen |
-
2015
- 2015-08-07 WO PCT/EP2015/068262 patent/WO2016020528A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1160329A2 (en) * | 2000-06-02 | 2001-12-05 | Nippon Shokubai Co., Ltd. | An enzyme reaction method and a method for enzymatically producing an optically active cyanohydrin |
WO2012143374A1 (en) * | 2011-04-18 | 2012-10-26 | Teknologian Tutkimuskeskus Vtt | Hypoallergen |
Non-Patent Citations (3)
Title |
---|
DATABASE UniProt [online] 3 April 2013 (2013-04-03), "SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblPlants:GSMUA_Achr6P14810_001};", XP002735333, retrieved from EBI accession no. UNIPROT:M0T729 Database accession no. M0T729 * |
MARKOVIC-HOUSLEY Z ET AL: "Crystal Structure of a Hypoallergenic Isoform of the Major Birch Pollen Allergen Bet v 1 and its Likely Biological Function as a Plant Steroid Carrier", JOURNAL OF MOLECULAR BIOLOGY, ACADEMIC PRESS, UNITED KINGDOM, vol. 325, no. 1, 3 January 2003 (2003-01-03), pages 123 - 133, XP027091999, ISSN: 0022-2836, [retrieved on 20030103], DOI: 10.1016/S0022-2836(02)01197-X * |
RADAUER CHRISTIAN ET AL: "The Bet v 1 fold: an ancient, versatile scaffold for binding of large, hydrophobic ligands", BMC EVOLUTIONARY BIOLOGY, BIOMED CENTRAL LTD., LONDON, GB, vol. 8, no. 1, 15 October 2008 (2008-10-15), pages 286, XP021042040, ISSN: 1471-2148, DOI: 10.1186/1471-2148-8-286 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zirpel et al. | Production of Δ9-tetrahydrocannabinolic acid from cannabigerolic acid by whole cells of Pichia (Komagataella) pastoris expressing Δ9-tetrahydrocannabinolic acid synthase from Cannabis sativa L. | |
JP6494522B2 (en) | Production of ergothioneine by metabolic engineering | |
CN111936629A (en) | Process for producing vanillin | |
US20200399665A1 (en) | Processes for the production of hydroxycinnamic acids using polypeptides having tyrosine ammonia lyase activity | |
JP6562950B2 (en) | Dreamenol synthase and method for producing dreammenol | |
WO2012109534A2 (en) | Cells and methods for producing isobutyric acid | |
CN112481224A (en) | Baeyer-Villiger monooxygenase and application thereof | |
US20170211105A1 (en) | Biosynthetic production of carnosine and beta-alanine | |
Ruwe et al. | Identification and functional characterization of small alarmone synthetases in Corynebacterium glutamicum | |
US10801049B2 (en) | Genetically engineered microorganisms and processes for the production of cannabinoids from a carbon source precursor | |
Xu et al. | Genome hunting of carbonyl reductases from Candida glabrata for efficient preparation of chiral secondary alcohols | |
CN106102454B (en) | Method for biosynthetic production of pterostilbene using O-methyltransferase | |
Zhao et al. | In vitro biosynthesis of isobutyraldehyde through the establishment of a one-step self-assembly-based immobilization strategy | |
Roiban et al. | Development of an Enzymatic Process for the Production of (R)-2-Butyl-2-ethyloxirane | |
US20170211104A1 (en) | Biosynthetic production of acetaminophen, p-aminophenol, and p-aminobenzoic acid | |
US20170211103A1 (en) | Biosynthetic production of choline, ethanolamine, phosphoethanolamine, and phosphocholine | |
Du et al. | Promotion of the asymmetric reduction of prochiral ketone with recombinant E. coli through strengthening intracellular NADPH Supply By Modifying EMP and introducing NAD kinase | |
CN112175919A (en) | Lactone hydrolase mutant and application thereof | |
CN114760980A (en) | Peroxidase activity against 10-acetyl-3, 7-dihydroxyphenoxazines | |
Zhao et al. | Development of cyclic AMP receptor protein-based artificial transcription factor for intensifying gene expression | |
US10036047B2 (en) | Methods for hydroxylating phenylpropanoids | |
US11492607B2 (en) | Highly active S-cyanohydrin lyase and application thereof | |
WO2016020528A1 (en) | New hydroxynitrile lyases | |
García-Franco et al. | Engineering styrene biosynthesis: designing a functional trans-cinnamic acid decarboxylase in Pseudomonas | |
Li et al. | Improved stress tolerance of recombinant Escherichia coli strain expressing nitrile hydratase by combining Global Regulator IrrE |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15750367 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15750367 Country of ref document: EP Kind code of ref document: A1 |