WO2024188791A1

WO2024188791A1 - Lipase with improved stereoselectivity and lipase-based chiral resolution methods

Info

Publication number: WO2024188791A1
Application number: PCT/EP2024/055979
Authority: WO
Inventors: Elia CALDERINI; Alba HERNANDEZ MARTIN; Oliver Kensch; Florian Richter; Christian Pitzler; Wahed Ahmed Moradi; Andreas REMBIAK; Mark James Ford; Anton Lishchynskyi; Markus SPELBERG; Nadine ZUMBRÄGEL; Fabian BASSO; Florian ERVER
Original assignee: Bayer Aktiengesellschaft
Priority date: 2023-03-14
Filing date: 2024-03-07
Publication date: 2024-09-19

Abstract

The present invention relates to proteins having improved lipase activity, nucleic acid molecules encoding respective proteins having improved lipase activity and methods for chiral resolution of secondary alcohols.

Description

Lipase with improved stereoselectivity and lipase-based chiral resolution methods

Technical field

The present invention relates to proteins with improved lipase activity, nucleic acid molecules encoding said proteins as well as enzymatic methods for the chiral resolution of secondary alcohols.

Background

Enantiomerically enriched or pure alcohols are important compounds for the production of agrochemical or pharmaceutical compounds. Thus, the absolute configuration of the stereocenters of chiral alcohols is crucial for the synthesis of the corresponding active agents. In the production of a desired target molecule, generation of correct chirality is often a challenge.

Various methods for preparing enantiomerically enriched alcohols are known. On the one hand, chiral transition metal catalysts can be used, which, however, are characterized by high costs (G.R. Cook, Transition Metal-Mediated Kinetic Resolution, Current Organic Chemistry, 2000, 4, 869-885; Hirama et al. J. Org. Chem. 1988, 53, 708). Besides metal catalysis, an organocatalytic acyl transfer kinetic resolution is described for bulky ester substrates (Deng et al. J. Org. Chem. 2015, 80, 6, 3159). Also known is the use of enzymes, for example lipases, as biocatalysts for the production of chiral compounds. Kirchner et al. (J. Am. Chem. Soc. (1985), 107, 7072-7076) report two lipases that act as highly stereoselective, practical catalysts in nearly anhydrous organic solvents. Under such “unnatural” conditions the enzymes can asymmetrically catalyze reactions of esterification and transesterification which are not feasible in aqueous solutions because of the domination of hydrolysis. As a result, a number of optically active alcohols, carboxylic acids, and esters have been prepared on a gram scale. Faber and Riva (1992; Synthesis 1992(10), 895-910), and Nascimento et al. (2003, Tetrahedron Asymmetry 14, 311- 311) describe enzymatic resolution reactions of secondary alcohols by transesterification. In addition, Patil et al. (J. Org. Chem. 2008, 73, 4476 - 4483) and Reddy et. al (Synth. Communications, 2003, 33, 3717 - 3726) showed the use of esterases for the enantioselective ester hydrolysis in buffered aqueous systems, however in the presence of DMSO and only moderate yields.

US 4,732,853 A describes a method of making chiral epoxy alcohols by means of enantioselective hydrolysis with lipases.

EP 0716712 Bl describes the lipase-catalyzed acylation of alcohols with diketenes, especially for the production of enantioselective acylated alcohols from racemic alcohols. WO2012146935A1 discloses modified lipase variants, as well as polynucleotides and recombinant expression vectors encoding the lipase variant polypeptides, as well as methods for producing such lipase variants in selected bacterial and fungal host cells. The specified lipase variants have increased enzyme specificity or enhanced trans-selectivity. Further described are methods of their use for reducing or eliminating trans-fatty acids from substrates.

Although several improvements of lipases have been achieved so far, several limitations arising during the asymmetric synthesis of secondary alcohols or resolution of racemic alcohols still have to be overcome, such as unfavorable equilibrium, substrate and product inhibition, poor thermostability, insufficient substrate specificity and in particular low enantioselectivity of the lipase. In addition, downstream processing and isolation of the target chiral alcohols are challenging within large scale applications while importance of atom economy is largely increased on industrial scale.

Thus, there is a need for further improvement of lipases, in particular with respect to the production of enantiomerically enriched or pure products with increased atom economy, as well as for further process improvements.

Summary

The present invention solves the above-described problems at least partially by providing novel lipases with superior enantiomer selectivity as well as providing novel methods for the chiral resolution of industrially relevant building blocks.

The lipases described herein have certain advantages over known wild-type and other already known lipases. In particular, the modified or variant lipases described herein have the advantage that they can produce enantiomerically enriched or enantiomerically nearly pure or pure compounds better than respective wild- type lipases.

A first aspect of the present invention relates to a protein having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80%, preferably 85%, more preferably 90%, furthermore preferably 92%, even more preferably 95% with the amino acid sequence shown under SEQ ID No. 1 , characterized in that the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is different from L, preferably the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is different from F, preferably the amino acid at position 51 is N, or M; iii. the amino acid at position 52 is different from V, preferably the amino acid at position 52 is L; iv. the amino acid at position 53 is different from T, preferably the amino acid at position 53 is S, P, I, E, or A; v. the amino acid at position 54 is different from D, preferably the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is different from A, preferably the amino acid at position 55 is R, M, D, Y, S, or I; vii. the amino acid at position 109 is different from G, preferably the amino acid at position

109 is H or F; viii. the amino acid at position 110 is different from M, preferably the amino acid at position

110 is T or V; ix. the amino acid at position 111 is different from A, preferably the amino acid at position

111 is T or S; x. the amino acid at position 117 is different from Y, preferably the amino acid at position 117 is F or S; xi. the amino acid at position 121 is different from Y, preferably the amino acid at position

121 is V; xii. the amino acid at position 122 is different from K, preferably the amino acid at position

122 is Q, A, Y, R, or V; xiii. the amino acid at position 153 is different from H, preferably the amino acid at position 153 is N, Y, D, E, or C; xiv. the amino acid at position 160 is different from T, preferably the amino acid at position 160 is E, C, D, P, I, Q, K, M, S, F, A, or N; xv. the amino acid at position 179 is different from D, preferably the amino acid at position 179 is C; xvi. the amino acid at position 181 is different from A, preferably the amino acid at position 181 is Q; xvii. the amino acid at position 184 is different from A, preferably the amino acid at position 184 is G or T; xviii. the amino acid at position 211 is different from Y, preferably the amino acid at position

211 is E; xix. the amino acid at position 212 is different from A, preferably the amino acid at position

212 is S or P; xx. the amino acid at position 216 is different from Y, preferably the amino acid at position 216 is K or A; xxi. the amino acid at position 234 is different from S, preferably the amino acid at position

234 is K, T or G; xxii. the amino acid at position 235 is different from S, preferably the amino acid at position

235 is V or M; xxiii. the amino acid at position 236 is different from K, preferably the amino acid at position 236 is T; xxiv. the amino acid at position 238 is different from R, preferably the amino acid at position 238 is A, K, D, E, or Q; xxv. the amino acid at position 240 is different from Y, preferably the amino acid at position 240 is F; xxvi. the amino acid at position 289 is different from D, preferably the amino acid at position 289 is S or G; xxvii. the amino acid at position 291 is different from G, preferably the amino acid at position 291 is E or W; xxviii. the amino acid at position 317 is different from N, preferably the amino acid at position 317 is T; xxix. the amino acid at position 320 is different from N, preferably the amino acid at position

320 is E or G; xxx. the amino acid at position 321 is different from L, preferably the amino acid at position

321 is F.

SEQ ID NO.l refers to a reference protein sequence with lipase activity.

The meaning of amino acid abbreviations A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y is derivable herein below in Table 2 under the paragraph sub-titled “Description of the Sequences”.

An “amino acid corresponding to position x” in a first amino acid sequence (e.g. position 44 in SEQ ID No. 1) means herein that an amino acid of a second amino acid sequence, when compared with the first amino acid sequence, appears at position x of the first amino acid sequence in a pairwise sequence alignment of the first amino acid sequence with the second amino acid sequence in case the numbering of the amino acids of the second amino acid sequence differs from the amino acid numbering of the first amino acid sequence.

In the context of the present invention, the term “identity” in respect to sequence identity or sequences being identical to is to be understood as meaning the number of identical amino acids or nucleotides shared over the entire sequence length by a first nucleic or amino acid sequence with another (second) nucleic or amino acid sequence, respectively, expressed in percent.

“Sequence identity” can be determined by alignment of two amino acid or two nucleotide sequences using global or local alignment algorithms comprised for example in known software like GAP or BESTFIT or the Emboss program “Needle”. This software uses the Needleman and Wunsch global alignment algorithm for aligning two sequences, over their entire length, maximizing the number of matches and minimizing the number of gaps. Generally, the default parameters are used, with a gap creation penalty = 10 and gap extension penalty = 0.5 (both for nucleotide and protein alignments). For nucleotides the default scoring matrix used is DNAFULL and for proteins the default scoring matrix is Blosum62 (Henikoff & Henikoff, 1992, PNAS 89, 10915-10919). Sequence alignments and scores for percentage sequence identity may for example be determined using software, such as EMBOSS, accessible at world wide web site of the EBI (ebi.ac.uk/Tools/emboss/). Alternatively, sequence similarity or identity may be determined by searching against databases (e.g. EMBL, GenBank) by using commonly known algorithms and output formats such as FASTA, BLAST, etc., but preferably hits should be retrieved and aligned pairwise to finally determine sequence identity.

If sequences to be compared with one another are of different length, the identity is to be determined by determining the identity in percent of the number of amino acids or nucleotides, respectively, which the shorter sequence shares with the longer sequence. Preferably, the identity is determined using the known and publicly available computer program ClustalW (Thompson et al., Nucleic Acids Research 22 (1994), 4673-4680). ClustalW is made publicly available by Julie Thompson (Thompson© EMBL- Heidelberg.DE) and Toby Gibson (Gibson@EMBL-Heidelberg.DE), European Molecular Biology Laboratory, Meyerhofstrasse 1, D 69117 Heidelberg, Germany. ClustalW can also be downloaded from various Internet pages, inter alia from IGBMC (Institut de Genetique et de Biologie Moleculaire et Cellulaire, B.P.163, 67404 Illkirch Cedex, France; ftp://ftp-igbmc.u-strasbg.fr/pub/) and from EBI (ftp://ftp.ebi.ac.uk/pub/software/) and all mirrored Internet pages of the EBI (European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK).

Preferably, use is made of the ClustalW computer program of version 1.8 or Clustal 2 to determine the identity between proteins described in the context of the present invention and other proteins. Here, the parameters have to be set as follows: KTUPLE=1, T0PDIAG=5, WIND0W=5, PAIRGAP=3, GAPOPEN=10, GAPEXTEND=0.05, GAPDIST=8, MAXDIV=40, MATRIX=GONNET, ENDGAPS(OFF), NOPGAP, NOHGAP.

Preferably, use is made of the ClustalW computer program of version 1.8 or Clustal 2 to determine the identity for example between the nucleotide sequence of the nucleic acid molecules described in the context of the present invention and the nucleotide sequence of other nucleic acid molecules. Here, the parameters have to be set as follows:

KTUPLE=2, TOPDIAGS=4, PAIRGAP=5, DNAM ATRIXJUB, GAPOPEN=10, GAPEXT=5, MAXDIV=40, TRANSITIONS: unweighted.

“Identity” furthermore means that there is a functional and/or structural equivalence between the nucleic acid molecules in question or the proteins encoded by them. Functional equivalence means that the nucleic acid molecule sequences or the amino acid sequences encode a protein having the activity of a lipase. The nucleic acid molecules which are homologous to the molecules described above and represent derivatives of these molecules are generally variants of these molecules which represent modifications having the same biological function or catalyzing the same reaction, i.e. coding for a protein having the activity of a lipase. They may be either naturally occurring variants, for example sequences from other species, or mutations, where these mutations may have occurred in a natural manner or were introduced by targeted mutagenesis. Furthermore, the variants may be synthetically produced sequences. The allelic variants may be either naturally occurring variants or synthetically produced variants or variants generated by recombinant DNA techniques. However, concerning the present invention it is decisive that those variants encode proteins having lipase-activity and comprise the amino acid substitutions (replacements), deletions or insertions described herein concerning the proteins according to the invention.

A special type of derivatives are, for example, nucleic acid molecules which differ from the nucleic acid molecules described in the context of the present invention as a result of the degeneracy of the genetic code.

According to the NC-IUBMB (Nomenclature Committee of the International Union of Biochemistry and Molecular Biology) lipases belong to the class of hydrolases (EC 3). Hydrolase is a class of enzymes that commonly perform as biochemical catalysts that use water to break a chemical bond, which typically results in dividing a larger molecule into smaller molecules. Under unnatural, anhydrous, conditions these enzymes can also catalyze reactions of esterification, e.g. acetylation, and transesterification. The group of hydrolases comprises enzymes acting on ester bonds (EC 3.1) encompassing carboxylic ester hydrolases (EC 3.1.1) and as a subgroup lipases (EC 3.1.1.3). Lipases have been identified from plants, mammals and microorganisms including e.g. Pseudomonas, Vibrio, Acinetobacter, Burkholderia, Chromobacterium, cutinase from Fusarium solani (FSC), Candida antarctica A (CalA), Rhizopus oryzae (ROL), Thermomyces lanuginosus (TLL), Rhizomucor miehei (RML), Aspergillus Niger, Fusarium heterosporum, Fusarium oxysporum or Fusarium culmorum.

If a protein has the activity of a lipase, this can be detected with methods known and described in the art.

It is not decisive which method is used for detecting if a protein according to the invention has the activity of a lipase. Preferably, in connection with the present invention, the method is described in the “Example”- section.

In a further embodiment of the invention, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least one of the following modifications: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R; viii. the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.

Preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least the mutation G109H, i.e. the amino acid at position 109 is H.

Lipase variant proteins according to the invention may exhibit further amino acid modifications (amino acid substitutions, deletions or insertions) compared to the amino acid sequences described herein above in respect to the amino acid sequence shown under SEQ ID No. 1.

In a further embodiment of the invention, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five of the following modifications selected from: i. the amino acid at position 57 is different from N, preferably the amino acid is P; ii. the amino acid at position 109 is different from G, preferably the amino acid is H; iii. the amino acid at position 122 is different from K, preferably the amino acid is R; iv. the amino acid at position 212 is different from A, preferably the amino acid is P; v. the amino acid at position 234 is different from S, preferably the amino acid is K; vi. the amino acid at position 289 is different from D, preferably the amino acid is G.

Preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 212 is P; iv. the amino acid at position 234 is K; and v. the amino acid at position 289 is G.

Further preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 122 is R; iv. the amino acid at position 212 is P; v. the amino acid at position 234 is K; and vi. the amino acid at position 289 is G.

In a further embodiment of the invention, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.

Preferred embodiments of the invention are proteins according to the invention encoding lipases having the amino acid sequences shown under SEQ ID Nos. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239.

One further embodiment of the invention concerns nucleic acid molecules encoding a protein according to the invention.

Nucleic acid molecules according to the invention can be any kind of nucleic acid, as long as the nucleic acid encodes a protein according to the invention. The nucleic acids can be ribonucleic nucleic acid molecules (e.g. RNA, mRNA) or deoxyribonucleic nucleic acid molecules (DNA, including genomic DNA which may or may not comprise introns and coding DNA).

Of particular interest for the invention are nucleic acid molecules encoding proteins having the activity of a lipase comprising the amino acid sequences shown under SEQ ID Nos. 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80,

82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126,

128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168,

170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210,

212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240.

The invention therefore also concerns nucleic acid molecules encoding a protein having the activity of a lipase selected from the group consisting of a) nucleic acid molecules comprising the nucleic acid sequences shown under SEQ ID Nos. 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240; b) nucleic acid molecules having at least 60%, preferably 70%, more preferably 80%, furthermore preferably 90%, even more preferably 95%, even furthermore preferably 96%, particular preferably 97%, most preferably 98% or especially preferably 99% identity with the nucleic acid sequences shown under a).

In the context of the present invention, the term “hybridizing with” means hybridization under conventional hybridization conditions, preferably under stringent conditions, as described, for example, in Sambrook et al. (Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. ISBN: 0879695773) or Ausubel et al. (Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929). With particular preference, “hybridization” means a hybridization under the following conditions: hybridization buffer:

2xSSC; lOxDenhardt solution (Fikoll 400+PEG+BSA; ratio 1:1:1); 0.1% SDS; 5 mM EDTA; 50 mM Na2HPO4; 250 pg/ml of herring sperm DNA; 50 pg/ml of tRNA; or

25 M sodium phosphate buffer pH 7.2; 1 mM EDTA; 7% SDS hybridization temperature: T = 65 to 68 °C wash buffer: O.lxSSC; 0.1% SDS wash temperature: T = 65 to 68 °C.

Nucleic acid molecules which hybridize with nucleic acid molecules coding for a protein having the activity of a lipase may originate from any organism; accordingly, they may originate from bacteria, fungi, animals, humans, plants or viruses.

Nucleic acid molecules which hybridize with nucleic acid molecules coding for a protein having the activity of a lipase preferably originate from microorganisms, more preferably from fungi or bacteria, most preferably from bacteria. Nucleic acid molecules which hybridize with the molecules mentioned may be isolated, for example, from genomic or from cDNA libraries. Such nucleic acid molecules can be identified and isolated using the nucleic acid molecules described herein or they can be identified and isolated using parts of these molecules or the reverse complements of these molecules, for example by hybridization according to standard methods (see, for example, Sambrook et al., Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. ISBN: 0879695773; Ausubel et al., Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929) or by amplification using PCR.

The fragments used as hybridization samples may also be synthetic fragments or oligonucleotides prepared using the customary synthesis techniques, whose sequence is essentially identical to the nucleic acid molecule described in the context of the present invention. Once genes which hybridize with the nucleic acid sequences described in the context of the present invention are identified and isolated, the sequence should be determined and the properties of the proteins coded for by this sequence should be analyzed to determine whether they are proteins having the activity of a lipase. Methods of how to determine whether a protein has the activity of a protein having the activity of a lipase are known to the person skilled in the art.

The molecules hybridizing with the nucleic acid molecules described in the context of the present invention comprise in particular fragments, derivatives and allelic variants of the nucleic acid molecules mentioned. In the context of the present invention, the term “derivative” means that the sequences of these molecules differ in one or more positions from the sequences of the nucleic acid molecules described above and are highly identical to these sequences. The differences to the nucleic acid molecules described above may, for example, be due to deletion, addition, substitution, insertion or recombination.

The meaning of nucleotide abbreviations a, c, g, t, and those of abbreviations for degenerate nucleotides r, y, s, w, k, m, b, d, h, v, n is derivable herein below from Table 1 under the paragraph sub-titled “Description of the Sequences”. Which amino acids are encoded by codons comprising degenerate nucleotides is derivable herein below from Table 3 under the paragraph sub-titled “Description of the Sequences”.

Furthermore, the invention relates to recombinant nucleic acid molecules comprising a nucleic acid molecule according to the invention.

In conjunction with the present invention, the term „recombinant nucleic acid molecule" is to be understood to mean a nucleic acid molecule, which contains additional sequences in addition to nucleic acid molecules according to the invention, which do not naturally occur in the combination in which they occur in recombinant nucleic acids according to the invention. Here, the abovementioned additional sequences can be any sequences, preferably they are functional or regulatory sequences (promoters, termination signals, enhancers, ribosome binding sites (rbs), leader sequences enhancing transcription, translation or RNA stability, subcellular targeting sequences etc.), particularly preferably they are functional or regulatory sequences that are active in microorganisms, and especially particularly preferably they are regulatory sequences that are active in fungi, in particular yeasts or in bacteria. Methods for the creation of recombinant nucleic acid molecules according to the invention are known to the person skilled in the art and include genetic methods such as bonding nucleic acid molecules by way of ligation, genetic recombination, or new synthesis of nucleic acid molecules. Those methods are described e.g. in Sambrok et al. (Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbour Laboratory Press, Cold Spring Harbour, NY. ISBN: 0879695773) or Ausubel et al. (Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929).

In a further embodiment, the recombinant nucleic acid molecules according to the invention comprise a nucleic acid molecule according to the invention which is linked with regulatory sequences, which initiate transcription in prokaryotic or eukaryotic cells.

“Regulatory sequences, which initiate transcription” in a cell are also known as promoters.

Information concerning regulatory sequences and plasmids are well known to a person skilled in the art and are described e.g. by the Registry of Standard Biological Parts supported by The International Genetically Engineered Machine (iGEM) Foundation (One Kendall Square, Suite B6104, Cambridge, MA 02139, USA) in the world wide web (http://parts.igem.org/Catalog).

Regulatory sequences which initiate transcription in prokaryotic organisms, e.g. E. coli, and in eukaryotic organisms are sufficiently described in literature, in particular such for expression in yeast are described, e.g. Saccharomyces cerevisiae. An overview of various systems for expression for proteins in various host organisms can be found, for example, in Methods in Enzymology 153 (1987), 383-516 and in Bitter et al. (Methods in Enzymology 153 (1987), 516-544) or in Gomes et al. (2016, Advances in Animal and Veterinary Sciences, 4(4), 346) and Baghban et al. (2018, Current Pharmaceutical Biotechnology, 19(6)). Common yeast promoters are pAOXl, pHIS4, pGAL, pScADH2 (Baghban et al., 2018, see above). Common bacterial promoters are T5, T7, rhamnose-inducible, arabinose-inducible, PhoA, artificial trc (trp-lac) promoter as described by Marschall et al. (2017, Appl Microbiol Biotechnol 101, 501-512) and Tegel et al. (2011, FEBS Journal 278, 729-739).

A further embodiment of recombinant nucleic acid molecules of the present invention are vectors or plasmids, which comprise the nucleic acid molecules according to the invention.

“Vectors” are commonly understood in the field of molecular biology and herein to represent a nucleic acid sequence or a vehicle comprising a nucleic acid sequence used to transfer genetic material (DNA or RNA) into a target cell. Vectors can be plasmids, e.g. T-DNA or binary vectors for generating transgenic plants, expression vectors for expression of nucleic acid sequences in a host cell, shuttle vectors which are eligible to propagate in different hosts, or vectors can be virus particles or bacteriophages having been modified to deliver foreign genetic material into a host.

“Plasmids” are commonly understood in the field of molecular biology and herein to represent an autonomously self-replicating, often circular DNA molecule which is when present in a host cell separated from the chromosomal DNA.

Nucleic acid molecules according to the invention, recombinant nucleic acid molecules according to the invention, vectors or plasmids according to the invention can be used for production of proteins according to the invention, e.g. by expressing the nucleic acid molecules according to the invention in host cells.

Another embodiment of the invention concerns hosts or host cells comprising or expressing a nucleic acid molecule according to invention or comprising proteins according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention.

Nucleic acid molecules according to the invention encoding a protein having the activity of a lipase can be expressed in host cells for e.g. their multiplication or for production of proteins according to the invention. For expression in host cells, nucleic acid molecules according to the invention can be comprised on vectors or plasmids or they can be stably integrated into the genome of a respective host cell. The nucleic acid molecules according to the invention can also be comprised by vectors which support their introduction into host cells.

A further embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case comprising a protein according to the invention.

Another embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case expressing a protein according to the invention.

Another embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case expressing a protein, wherein the protein has the activity of a lipase.

“Expressing a nucleic acid molecule” shall be understood herein to mean that in case the nucleic acid molecule is RNA or mRNA the nucleic acid molecule is translated into a protein, preferably translated into a protein having the activity of a lipase or in case of the nucleic acid molecule is DNA or cDNA it is transcribed (and in case of genomic DNA containing introns is processed) into mRNA, preferably into a mRNA encoding a protein having the activity of a lipase and subsequently translated into a protein, preferably translated into a protein having the activity of a lipase.

Transcription of a given nucleic acid molecule in a host can be demonstrated by methods known to a person skilled in the art, for example, by detection of specific transcripts (mRNA) of foreign nucleic acid molecules by Northern blot analysis or RT-PCR.

Whether hosts or host cells comprise a given protein or comprise a protein which is derived from expressing a nucleic acid molecule can be determined by methods known to a person skilled in the art, for example, by immunological methods, such as Western blot analysis, ELISA (Enzyme Linked Immuno Sorbent Assay) or RIA (Radio Immune Assay). The person skilled in the art is familiar with methods for preparing antibodies which react specifically with a certain protein, i.e. which bind specifically to a certain protein (see, for example, Lottspeich and Zorbas (eds.), 1998, Bioanalytik, Spektrum akad, Verlag, Heidelberg, Berlin, ISBN 3-8274-0041-4). Some companies (Thermo Fisher Scientific, 168 Third Avenue, Waltham, MA USA 0245; GenScript, 60 Centennial Ave., Piscataway, NJ 08854, USA) offer the preparation of such antibodies as an order service.

Furthermore, a person skilled in the art can test if a host or host cell comprises a protein according to the invention by detecting (additional) activity of proteins having the activity of a lipase in a respective host cell. Preferably activity of proteins having additional activity of a lipase in a respective host cell is detected by comparing the activities of lipases of a host cell according to the invention with the respective activity of host cell not comprising a protein according to the invention.

Testing if a protein has the activity of a lipase can be done by methods known in the art.

Host or host cells according to the invention can be produced by a person skilled in the art by known methods for genetically modifying or transforming organisms.

A further subject of the present invention therefore is a host or host cell according to the invention, particularly a prokaryotic or eukaryotic host or host cell, which is genetically modified (or transformed) with a nucleic acid molecule according to the invention or with a recombinant nucleic acid molecule according to the invention or with a vector according to the invention or a plasmid according to the invention. Preferably the genetically modified (transformed) host or host cell according to the invention expresses a protein having the activity of a lipase, more preferably, the genetically modified (transformed) host or host cell according to the invention expresses a protein according to the invention.

“Genetically modified with a nucleic acid molecule” or “transformed with a nucleic acid molecule” shall be understood herein to mean that a nucleic acid molecule is or was introduced into a host or host cell by technical and/or non-naturally occurring means, preferably by technical methods in the field of molecular biology, biotechnology or genetic modification.

Descendants, offspring or progeny of hosts or host cells according to the invention are also an embodiment of the invention, preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention or comprise a protein according to the invention, more preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention and, in each case express a protein, wherein the protein has the activity of a lipase, even more preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention and, in each case express a protein, wherein the protein has the activity of a lipase according to the invention.

The host or host cell according to the invention can be a host or host cell from any prokaryotic or eucaryotic organism. The hosts or host cells can be bacteria or bacteria cells (e.g. E. coli, bacteria of the genus Bacillus, in particular Bacillus subtilis, Agrobacterium, particularly Agrobacterium tumefaciens or Agrobacterium rhizogenes, Pseudomonas, particularly Pseudomonas fluorescens, Streptomyces spp, Rhodococcus spp, in particular Rhodococcus rhodochrous, Vibrio natrigens, Corynebacterium, particularly Corynebacterium glutamicum) or fungi or fungal cells (e.g. Agaricus, in particular Agaricus bisporus, Aspergillus, Trichoderma or yeasts, particularly .S’, cerevisiae, Pichia ssp. like P. pastoris), as well as plants or plant cells or they can be animals or animal cells.

Preferred host cells according to the invention are cells of microorganisms. Within the framework of the present patent application, this is understood to include all bacteria and all protists (e.g. fungi, particularly yeasts and algae), as they are defined in Schlegel "General Microbiology " (Georg Thieme Publishing House (1985), 1-2), for example.

In respect to microorganisms, the hosts or host cells according to the invention are preferably bacteria/bacteria cells or yeast/yeast cells, most preferably they are bacteria/bacteria cells. Concerning bacteria/bacteria cells, the hosts or host cells according to the invention are preferably Bacillus species/ Bacillus species cells or Escherichia coli! Escherichia coli cells cells most preferably Escherichia coli/ Escherichia coli cells.

Alternatively, Pseudomonas, particularly Pseudomonas fluorescens, Streptomyces spp, Rhodococcus spp, in particular Rhodococcus rhodochrous, Vibrio spp, particularly Vibrio natrigens, Corynebacterium, particularly Corynebacterium glutamicum or others can be hosts or host cells according to the invention. A preferred embodiment of the invention concerns hosts or host cells according to the invention comprising a nucleic acid molecule according to the invention, wherein the nucleic acid molecule according to the invention is characterized in that the codons of said nucleic acid molecule are changed such that they are adapted to the frequency of use of the codons of the host or a host cell, respectively.

Host cells according to the invention can be used for production of proteins according to the invention. Proteins according to the invention can be used in methods for production of enantiomerically enriched or nearly enantiomerically pure secondary alcohols.

A second aspect of the present invention relates to a method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I), comprising contacting the substrate with a variant protein having the activity of a lipase according to the first aspect of the present invention or with a lipase according to SEQ ID No. 1 ,

wherein R¹ and R² are independently from each other selected from a substituted or unsubstituted (n)-alkyl, iso-alkyl, unsubstituted aryl, alkyl-substituted or aryl-substituted aryl.

Preferably, the substrate is contacted with a variant protein having the activity of a lipase according to the first aspect of the present invention.

R¹ and and R² are preferably independently from each other chosen to be a linear or branched Ci-io residue.

Further preferably, R¹ is selected from methyl, iso-butyl, tert-butyl, and iso-propyl; and/or R² is methyl.

Even further preferably, R¹ is selected from methyl, iso-butyl, tert-butyl, and iso-propyl; and R² is methyl.

In a further embodiment of the invention, the method comprises hydrolysing a racemic substrate of formula (II- 1) in an enantiomerically selective manner to separate an enantiomerically enriched or pure compound of formula (1-1),

The method preferably comprises isolating the compound (I) or (1-1), and/or (III) or (III-l) after hydrolysis. The isolation of compounds (1-1) can be done by any means which are known to those skilled in the art and is preferably done via extraction and/or direct distillation.

The lipase variants or protein variants according to the invention show improved selectivity and/or improved specific activity regarding the stereoselective hydrolysis of methyl-3-hydroxy-2-methylene- butanoate (formula II- 1) and are better adapted to isolate enantiomerically enriched or nearly pure substrate methyl (3S)-3-hydroxy-2-methylene-butanoate (formula 1-1) compared to a wild-type lipase. Compounds of formula (I)

represent an important building block in the synthesis of complex agrochemical compounds. In particular, methyl (3S)-3-hydroxy-2-methylene-butanoate of formula (1-1)

(1-1 ) which is the S-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, is an important intermediate in the respective synthesis of agrochemical compounds described in WO 2018/228985. The lipase variants according to the present invention selectively hydrolyse the R-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, allowing the separation of the S-enantiomer of said racemic methyl 3 -hydroxy-2-methylene-butanoate . In a further embodiment of the invention, the method is conducted in an aqueous solution.

Preferably, the method is conducted in a mixture or biphasic liquid system of water and an organic solvent, such as methyl-tert-butylether, toluene, 2-methyltetrahydrofuran, methyl isobutyl ketone, cyclohexane, cyclopentyl methyl ether, chlorobenzene, tert-amylmethylether, ethyl acetate, isopropyl acetate. Even further preferably, the organic solvent is methyl-tert-butylether (MTBE).

The ratio of organic solvent to water is further preferably in the range from 1 : 1 to 6: 1 , even more preferably 2:1 to 4:1.

Surprinsingly, it was found that a significant reduction of the aqueous phase, substituted partially by an organic solvent, not only allows for higher concentration of the final product, thus increasing the productivity, but also allows for the reduction of the enzyme / lipase load.

Furthermore, the presence of an organic solvent, such as MTBE, promotes an efficient downstream processing of the reaction products, i.e. isolation of compounds of the formula (I) or (1-1).

In a further embodiment of the invention, the method is carried out at a temperature of between 20 and 60 °C, preferably between 30 and 55 °C. Further preferably, the method is carried out at a temperature of between 35 and 50 °C.

In a further embodiment of the invention, the method is carried out for at least 1 h, preferably at least 2 h, further preferably at least 3 h.

In a further embodiment of the invention, the method is carried out for at most 40 h, preferably at most 30 h, further preferably at most 20 h.

In a further embodiment of the invention, the method is carried out at a pH of between 7 and 8.5, preferably between 7.2 and 8.0, further preferably between 7.4 and 7.6. The pH may be adjusted by addition of organic or inorganic bases, preferably by the addition of inorganic bases, such as hydrogen carbonates or carbonates, hydrogen phosphates and hydroxides.

At a pH above 8.5 the enzyme is deactivated; further, there is partial hydrolysis of the ester of formula (I). Below pH 7, the reaction rate decreased and finally the reaction stops. Further below, at a pH of 4.5 or less, the enzyme is deactived.

The pH may be controlled by addition of a base, such as NaOH, KOH, K2CO3, KHCO3, Na2CO3, NaHCO i. Preferred bases are carbonates or hydrogencarbonates. The base can be added as solid or aqueous solution, diluted or saturated. The base may be added to the reaction before the substrate is dosed, or the base is dosed into the reaction in parallel to the substrate. The substrate and/or base can be dosed or added in one portion. The lipase of SEQ ID No. 1 or the protein variant according to the first aspect of the present invention may be provided as purified enzyme or in form of spray-dried or freeze-dried biomass or as broth. Preferably, the protein variant is provided as broth, i.e. as cell culture, or the broth is centrifugated and the cell pellet is freeze-dried to obtain lyophilized cells or biomass.

While lyophilized biomass is easy to handle in terms of independent timing for performing the claimed method, the use of the broth/cell culture directly to perform the claimed method is cost and time efficient.

For downstream processing, the reaction is preferably conducted in a mixture of water and a water- immiscible solvent. In case of conducting the reaction in a biphasic mixture comprising water and an immiscible organic solvent, the phases are separated and the aqueous phase is eventually back-extracted. The biomass can be separated by known technical means (e.g. centrifuge, filtration or decantation) before distillation or the distillation can take place without separation of the biomass.

It is to be understood in the context of the present invention that all embodiments concerning the method according to the second aspect can be combined with all different lipase variants according to the first aspect of the invention, and in particular with the preferred lipase variants. I.e. the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may be carried out by means of a protein having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80%, preferably 85%, more preferably 90%, furthermore preferably 92%, even more preferably 95% with the amino acid sequence shown under SEQ ID No. 1 , characterized in that the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is different from L, preferably the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is different from F, preferably the amino acid at position 51 is N, or M; iii. the amino acid at position 52 is different from V, preferably the amino acid at position 52 is L; iv. the amino acid at position 53 is different from T, preferably the amino acid at position 53 is S, P, I, E, or A; v. the amino acid at position 54 is different from D, preferably the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is different from A, preferably the amino acid at position 55 is R, M, D, Y, S, or I; vii. the amino acid at position 109 is different from G, preferably the amino acid at position 109 is H or F; viii. the amino acid at position 110 is different from M, preferably the amino acid at position 110 is T or V; ix. the amino acid at position 111 is different from A, preferably the amino acid at position 111 is T or S; x. the amino acid at position 117 is different from Y, preferably the amino acid at position 117 is F or S; xi. the amino acid at position 121 is different from Y, preferably the amino acid at position

235 is V or M; xxiii. the amino acid at position 236 is different from K, preferably the amino acid at position

236 is T; xxiv. the amino acid at position 238 is different from R, preferably the amino acid at position 238 is A, K, D, E, or Q; xxv. the amino acid at position 240 is different from Y, preferably the amino acid at position 240 is F; xxvi. the amino acid at position 289 is different from D, preferably the amino acid at position 289 is S or G; xxvii. the amino acid at position 291 is different from G, preferably the amino acid at position 291 is E or W; xxviii. the amino acid at position 317 is different from N, preferably the amino acid at position 317 is T; xxix. the amino acid at position 320 is different from N, preferably the amino acid at position

321 is F.

The amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by at least one of the following modifications: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R; viii. the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.

Preferably, the amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) differs from the amino acid sequence of SEQ ID NO. 1 by at least the mutation G109H, i.e. the amino acid at position 109 is H.

The amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five of the following modifications selected from: i. the amino acid at position 57 is different from N, preferably the amino acid is P; ii. the amino acid at position 109 is different from G, preferably the amino acid is H; iii. the amino acid at position 122 is different from K, preferably the amino acid is R; iv. the amino acid at position 212 is different from A, preferably the amino acid is P; v. the amino acid at position 234 is different from S, preferably the amino acid is K; vi. the amino acid at position 289 is different from D, preferably the amino acid is G.

The amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by the following modifications: the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.

Preferred proteins according to the invention encoding lipases that may be used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) are shown under SEQ ID Nos. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69,

71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119,

121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161,

163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203,

205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239.

A further embodiment of the invention is the use of a protein according to the first aspect of the invention for the stereoselective hydrolysis of racemic methyl-3-hydroxy-2-methylene-butanoate to isolate methyl (3S)-3-hydroxy-2-methylene-butanoate.

“Enantiomerically enriched” means herein that one of two enantiomers is present in a composition in higher amounts than the other enantiomer, preferably at least 60% of one enantiomer is present in the composition, more preferably at least 65% of one enantiomer is present in the composition, further more preferably at least 70% of one enantiomer is present in the composition, even more preferably at least 75% of one enantiomer is present in the composition, even further more preferably at least 80% of one enantiomer is present in the composition, particular preferably at least 85% of one enantiomer is present in the composition, most preferably at least 90% of one enantiomer is present in the composition or especially preferably at least 94% of one enantiomer is present in the composition.

“Enantiomerically nearly pure” means herein that one of two enantiomers is present in a composition in amounts of at least 95.0%, preferably one of two enantiomers is present in a composition in amounts of at least 95.5%, more preferably one of two enantiomers is present in a composition in amounts of at least 96.0%, further more preferably one of two enantiomers is present in a composition in amounts of at least 96.5%, even more preferably one of two enantiomers is present in a composition in amounts of at least 97.0%, even further more preferably one of two enantiomers is present in a composition in amounts of at least 98.0%, particular preferably one of two enantiomers is present in a composition in amounts of at least 98.5%, most preferably one of two enantiomers is present in a composition in amounts of at least 99.0%, or especially preferably one of two enantiomers is present in a composition in amounts of at least 99.5%.

A third aspect of the present invention relates to a method for the enantiomeric enrichment of compounds of formula (I)

comprising reacting compounds of formula (II) with compounds of formula (IV)

and in particular

with R¹ and R² being defined as above, and R³ being a substutituted or unsubstituted (n) -Alkyl or isoalkyl, in the presence of a lipase under anhydrous conditions.

“Anhydrous conditions” refer to a reaction system comprising at most 3 % water in the liquid phase. The reaction preferably takes place in an organic solvent, such as hexane or n-heptane, or without any additional solvent. R³ can preferably be a linear Ci-i6 residue, in particular methyl or unsubstituted and saturated Cs-Cn residue. Compound (IV) can be in particular vinyl acetate or vinyl laurate.

The lipase is preferably immobilized on a solid carrier during the action. Immobilization leads to increased enzyme stability in non-aqueous solutions.

The reaction is preferably carried out at a temperature of between 20 and 40°C, preferably 25 to 35°C. Such temperature range allows for optimal enzymatic reacticity.

The reaction is preferably carried out for at least 1 h, further preferably at least 3 h, even further preferably at least 5 h.

The lipase is preferably recycled after the reaction has taken place. This allows the method to be designed in a cost-efficient manner. The lipase can thus be used multiple times for the reaction according to the third aspect or a further reaction.

The compound of formula (I) is preferably isolated by means of distillation directly from the reaction mixture or from the supernatant, obtained after decanting the reaction mixture. The lipase preferably remains in the remaining reaction mixture and can be reused for another reaction.

The lipase is preferably CALB lipase. CALB is a non-specific lipase originating from Candida antarctica B and was first described in 1994 (Uppenberg J, Patkar S, Bergfors T, Jones TA (1994) J Mol Biol 235(2):790-792).

The lipase is furthermore preferably immobilized on a hydrophobic carrier, such as acrylic resin. A commercial version of CALB is, for example, Novozym® 435, which can be used for carrying out the reaction.

Detailed description

The polypeptides (enzymes, i.e. lipases) as well as the processes according to the present invention allow for an efficient enantiomeric enrichment of compounds of the formula (I)

It was found that introduction of certain amino acid modifications into protein variants according to the invention improves the activity of the lipase, in particular in respect to its substrate specificity, meaning that these further modified lipase variants are better adapted to produce enantiomerically enriched or nearly pure products compared to known lipase variants. This in particular refers to the substrate (1-1).

Enantionerically pure or at least enriched methyl (3S)-3-hydroxy-2-methylene-butanoate

(1-1 ) which is the S-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, is an important intermediate in the respective synthesis of the compounds described in WO 2018/228985. The lipase variants of the present invention catalyse entioselectively the hydrolysis of the racemic methyl 3-hydroxy- 2-methylene-butanoate into (3R)-3-hydroxy-2-methylene- butanoic acid, leaving methyl (3S)-3-hydroxy-

2-methylene-butanoate unhydrolyzed.

The terms used herein are known to those skilled in the art. Otherwise, the following definitions are used:

For the purposes of the present invention, the term “alkyl” includes saturated hydrocarbon residues which can be branched or straight-chain and unsubstituted or at least monosubstituted. Examples of suitable alkyl residues, which can be unsubstituted or mono- or polysubstituted, are methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, 2-butyl, tert-butyl, n-pentyl, 2-pentyl, 3-pentyl, isopentyl, neo-pentyl, n-hexyl, 2-hexyl,

3-hexyl, n-heptyl, n-octyl, -C(H)(C2H5)2, -C(H)(n- C3H7)2 and -CH2-CH2-C(H)(CH3)-(CH2)3-CH3.

Unless defined otherwise elsewhere, the term “aryl” either alone or in combination with other terms, is to be understood as a mono- or polycyclic, preferably a mono- or bicyclic, aromatic hydrocarbon radical with preferably 6, 10 or 14 carbon atoms. An aryl radical can be unsubstituted or monosubstituted or polysubstituted, either with identical or different substituents.

Description of the Sequences

Throughout the application, nucleotide and amino acid abbreviations are used according to the following IUPAC codes:

Table 1

For discrimination between amino acids and nucleotides, capitalized nucleotide code abbreviations given in above Table are written herein in lower case.

Table 2

Codon usage follows herein the so called “general genetic code” according to the following Table, wherein “f ’ is to be substituted by “u” in ribonucleic acid (RNA) sequences. Table 3

The sequence listing associated with this application is filed in electronic format and hereby incorporated by reference into the specification in its entirety. “PRT” stands for “protein” and “NUC” for “nucleic acid”. Table 4a: Lipase variants according to the first aspect of the present invention (SEQ ID No. 7-169) and further lipases (SEQ ID No. 1-6)

Table 4b: Further lipase variants according to the present invention (protein SEQ ID No. 171-239). The variants described contain the backbone mutations N57P, K122R, A212P, S234K, D289G and G109H (SEQ ID No. 171, 172), unless (i) the mutation “R122K” is indicated in addition, which means that the K122R mutation has been back- mutated such that the position 122 carries an K again, or (ii) the mutation “R122Q” is indicated in addition, which means that the K122R mutation has been further mutated such that the position 122 carries a Q.

Examples

The following examples explain the present invention, but without restricting it.

Starting materials and protocols

Analytical grade chemicals and ready-to-use kits were sourced and used from common suppliers such as Sigma Aldrich, Acros Organics, Fisher Scientific, Qiagen or Stratagene. Novozym® 435 is bought from Sigma Aldrich.

Example 1

Cloning of lipase variants:

Nucleic acid sequences of SEQ ID No. 1 or their variants were cloned into an expression vector based on pKA81. The genetic elements were incorporated into the vector by means known in the art. For expression of the different lipase variants, the vectors were introduced into electrocompetent Escherichia coli (E. coli) W3110 cells.

Variant library:

Nucleotides were substituted in the wild-type sequence or sequences derived therefrom to allow for amino acid substitution. This exchange can be achieved by various molecular biological methods. One method for replacing nucleic acids is site-directed mutagenesis, which can result in mutations at one or more sites in the amino acid sequence. Methods for site-directed mutagenesis are state of the art and are described in the literature (e.g. Directed Mutagenesis: A Practical Approach, 1991, Edited by M.J. McPHERSON, IRL PRESS) and can be purchased as ready-made kits (e.g. the QUIKCHANGE™ lightening mutagenesis kit from Qiagen or Stratagene). After inserting mutations into the gene sequence and culturing in a suitable E. coli cloning strain, the plasmids obtained were transformed into Escherichia coli W3110.

Transformed cells were tested in appropriate biotransformation reactions in order to determine product yield and selectivity. Appropriate biotransformation reactions are described below, see Example 2. Sequence verification was performed as known in the art.

Glycerol stocks of the E. coli cultures transformed with the respective expression plasmids were prepared by adding one volume of 40% glycerol solution to one volume of E. coli culture.

For isolation of single bacterial colonies, appropriate dilutions of E. coli cultures were plated onto LB- Agar plates containing suitable concentrations of kanamycin and incubated at 37°C until single colonies were obtained.

Example 2

Cultivation:

To prepare precultures, sterilized 2 mL 96-well deep-well plates (Eppendorf, Hamburg, Germany) were used and mixed with 590 pL TB medium (50 pg/ml kanamycin) and 10 pL of the glycerol stock of the respective E. coli strain filled. Alternatively, 590 pL of TB medium (50 pg/ml kanamycin) were inoculated with cell material from a colony from an agar plate. The precultures were incubated at 37° C and 250 rpm for 17 h.

Sterile 2 mL 96-well deep-wells with 510 pL TB medium (50 mg/L kanamycin) were used to produce main cultures. The expression cultures were inoculated with 30 pl of preculture and incubated at 37° C. and 250 rpm. After 4 h incubation, the enzyme expression was induced by adding 60 pL IPTG (10 rnM IPTG diluted in expression medium, supplemented with 50 mg/L kanamycin). The expression culture plates were then incubated for 20 h at 28° C and 250 rpm.

Cells were harvested by centrifugation for 15 min at 4°C and 2500 x g. The culture supernatant was discarded, and the cell pellet was suspended in 200 pL PBS. The cells were then lyophilized for 24 h and stored at 4°C until use.

Biotransformation, chiral resolution and analytics:

The chiral resolution of methyl (3S)-3-hydroxy-2-methylene-butanoate was conducted in microtiter plates with the lyophilized supernatants. For the chiral resolution, 100 pl racemic methyl-3-hydroxy-2- methylene-butanoate, 105 pl MTBE (Methyl-tert-butylether) and 45 pl ultrapure water (with 416.6 g/L KHCO3) were used in each well. Each plate was sealed and incubated for 6 h at 45 °C in a shaker at 300 rpm. The reaction was then stopped by adding 22.5 pl of 20% H2SO4 solution. For further extraction, 1 ml MTBE was added per well in the deep well plates. The plates were shaken at room temperature for 10 min and then centrifuged at 2500xg for 10 min. 10 pl of the organic phase were then transferred to 96- well PCR plates containing 100 pl MTBE and analyzed by HPLC.

Analytical HPLC method

The samples were analyzed using HPLC with the following settings:

Instrument: Agilent Technologies 1290 Infinity II; Column: Lux cellulose-2, 100x4.6 mm, 3 pm; Eluent A: premixed heptane (+0.05% formic acid); eluent B: ethanol; flow: isocratic (90% eluent A/10% eluent B), flow rate: 0.8 mL/min; temperature: 25 °C; sample injection volume: 1 pL; Detection: absorption at 210 nm.

Racemic methyl-3-hydroxy-2-methylene-butanoate and racemic 3-hydroxy-2-methylene-butanoic acid (prepared as for the biotransformation samples) were used as a reference substance and as a standard for quantification. Appropriate dilutions of the standards were used in order to be able to quantify the substrate used and the resulting product using standard lines. The samples were analyzed for substrate conversion and product formation. The evaluation and comparison of individual samples with one another was carried out by determining the substrate and product ee[%].

Example 3: Enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene-butanoate to (3R)-3- hydroxy-2-methylene-butanoic acid

The cultivation, biotransformation and HPLC analysis were carried out as described in example 2. During cultivation, glycerol cultures were used for inoculation. The activity and selectivity results of enzyme variants with individual point mutations can be found below in Tables 5 and 6. The activity and selectivity results of enzyme variants with combined mutations can be found in Tables 7 and 8. The activity and selectivity results of enzyme variants with additional combined mutations can be found in Tables 9 and 10.

A high selectivity and a high activity of the respective enzyme variant are decisive for an efficient conversion of racemic methyl 3-hydroxy-2-methylene-butanoate to (3R)-3-hydroxy-2-methylene- butanoic acid in order to generate an almost enantiomerically pure product in high yield.

The enzyme selectivity ee [%] 3-hydroxy-2-methylene-butanoate is defined by the difference of the mole fraction of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene- butanoic acid divided by the sum of the mole fraction of (3R)-3-hydroxy-2-methylene-butanoic acid and (3 S) -3 -hydroxy-2-methylene-butanoic acid.

(3R)3hydroxy2methylenebutanoic acid — (3S)3hydroxy2methylenebutanoic acid

- * 100

(3R)3hydroxy2methylenebutanoic acid + (3S)3hydroxy2methylenebutanoic acid The enzyme activity ee [%] for the conversion of the mole fraction of the substrate methyl (3R)3- hydroxy-2-methylene-butanoate is described by the difference between the mole fraction of methyl-(3S)3- hydroxy-2-methylene-butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate divided by the sum of methyl-(3S)3-hydroxy-2-methylene-butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate. methyl (3 R)3hydroxy2methylenebutanoate — methyl(3S)3hydroxy2methylenebutanoate

- * 100 methyl (3 R)3hydroxy2methylenebutanoate + methyl(3S)3hydroxy2methylenebutanoate

The lipase of SEQ ID No. 1 shows a selectivity of 100 % and an activity of 100%.

Table 5: Lipase variants showing a relative improvement in enzyme activity compared to the lipase of SEQ ID No. 1. The relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.l) in percent. The substrate ee[%] of the lipase of SEQ ID No. 1 is 9.8.

Table 6: Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 1. The relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent. The product ee[%] of the reference lipase is 75.8.

Table 7: Lipase variants showing a relative improvement in enzyme activity compared to the reference lipase of SEQ ID No. 1. The relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.1) in percent.

Table 8: Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 1. The relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent.

Example 4:

Variants based on the backbone mutations N57P, K122R, A212P, S234K, D289G and G109H have been supplemented with further mutations in the sequence as described in the examples above and tested for enzyme activity and selectivity as described in Example 3.

Table 9: Lipase variants showing a relative improvement in enzyme activity compared to the reference lipase of SEQ ID No. 171. The relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.171) in percent. All lipase variants in the table carry in addition the mutations N57P, K122R, A212P, S234K, D289G, G109H.

Table 10: Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 171. The relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 171 in percent. All lipase variants in the table carry in addition the mutations N57P, K122R, A212P, S234K, D289G, G109H.

Example 5: Enzymatic hydrolysis of different esters

Next to a methyl ester of 3-hydroxy-2-methylene-butanoate (a), also the enzymatic hydrolysis of the respective tert-butyl ester (b), iso-butyl-ester (c), and iso-propyl ester (d) were tested under suitable conditions (1 mL scale, 5 g/L ester compound, 5 g/L lyophilizate of enzyme of SEQ ID No. 1, 100 mM KPi buffer pH 8):

All esters (a) to (d) allow for an efficient enantiomeric resolution, wherein the (R)-Ester is preferably hydrolysed:

* compared to timepoint-zero control

Example 6: Alternative solvents

Several different solvents have been tested for the chiral resolution of methyl 3-hydroxy-2-methylene- butanoate. Reaction conditions were: 100 mL total volume, 300 g/L racemic methyl 3-hydroxy-2- BCSmethylene-butanoate, 20 g/L spray dried lipase of SEQ ID No. 1, 70% solvent, 0.54 eq KHCO3, pH 8.5 (titration with 40% w/v K2CO3), 45 °C, 6 h. In particular MTBE, CPME, MIBK and toluene allowed for a high enantiomeric excess of the hydrolysed product (3R)-3-hydroxy-2-methylene-butanoic acid (ee).

Example 7: Further examples on enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene- butanoate to (3R)-3-hydroxy-2-methylene-butanoic acid

Next to the lipase of SEQ ID No.1 , three further related variants have been tested, namely variants with mutations Q295V and K298A (SEQ ID No. 3); as well as with mutations V33I and I254S (SEQ ID No. 5); as well as with mutations T188S, I254S, P302L and Q304E (SEQ ID No. 241).

The cultivation, biotransformation and HPLC analysis were carried out as described in example 2. As described in example 3, the enzyme selectivity ee [%] 3-hydroxy-2-methylene-butanoate is defined by the difference of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene-butanoic acid divided by the sum of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene- butanoic acid. The enzyme activity ee [%] for the conversion of the substrate methyl (3R)3-hydroxy-2- methylene-butanoate is described by the difference between methyl-(3S)3-hydroxy-2-methylene- butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate divided by the sum of methyl-(3S)3- hydroxy-2-methylene-butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate. The reference lipase of SEQ ID No. 1 shows a selectivity of 100 % and an activity of 100%

Table 11: Comparative lipase variants showing a similar enzyme activity profile compared to the reference lipase of SEQ ID No. 1. The relative difference in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.1) in percent.

Table 12: Comparative lipase variants showing a similar enzyme selectivity compared to the reference lipase of SEQ ID No. 1. The relative difference in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2-methylene- butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent.

As shown in the Tables 11 und 12, the comparative lipases worsen the activity of the lipase, and do not improve the selectivity of the discussed substrate.

Example 8: Enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene-butanoate to (3R)-3- hydroxy-2-methylene-butanoic acid

8.0 g spray dried lipase SEQ ID No. 1 were suspended in 50 g water and diluted with 100 mL MTBE. After warming to 45 °C a mixture of 120 g racemic methyl 3-hydroxy-2-methylene-butanoate (99% purity) in 100 g MTBE were dosed into the suspension over 3 h. After full dosage, the mixture was further stirred at 45 °C over night. During dosage and additional stirring time, the pH was kept at 7.5 by the parallel addition of 40% aqueous potassium carbonate solution. Afterwards, the biomass was separated by centrifugation and the phases were separated. After extraction of the aqueous phase with 2x 125g MTBE each, the combined organic extracts were concentrated under reduced pressure at 45 °C yielding methyl (3S)-3-hydroxy-2-methylene-butanoate with 91% purity, 96% ee and and corrected yield of 40%. Example 9: Enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene-butanoate to (3R)-3- hydroxy-2-methylene-butanoic acid

120 g water were warmed to 45 °C and 2.0 g spray dried lipase SEQ ID No. 1 were added, followed by 280 mL MTBE. At 45 °C, 120 g racemic methyl 3-hydroxy-2-methylene-butanoate (99% purity) were dosed into the suspension over 2 h. After full dosage, the mixture was further stirred at 45 °C for 8,5h. During dosage and additional stirring time, the pH was kept at 7.5 by the parallel addition of 40% aqueous potassium carbonate solution. Afterwards, the lower aqueous phase was separated and back-extracted once with 100 ml MTBE. The extract was combined with the upper organic phase and the mixture was distilled at 33-55 °C and reduced pressure down to 10 mbar. After addition of 50 g high boiling Marlotherm, the residue was further distilled at 3-10 mbar up to 150 °C yielding 58.2 g (92% purity, 96% ee, 45% yield) methyl (3S)-3-hydroxy-2-methylene-butanoate with 91% purity, 96% ee and and corrected yield of 40%.

Example 10: Enantioselective acylation of racemic methyl-3-hydroxy-2-methylene-butanoate

A suspension of 1400 g racemic methyl 3-hydroxy-2-methylene-butanoate [10.62 mol, 98.7 % purity] and 105 g Novozyme 435 is heated to 25 °C internal temperature. 1373 g vinyl dodecanoate (5.95 mol, 98 %) is added in 3h using a dosing pump under 50 mbar vacuum. Subsequently, the reaction mixture heated to 35 °C internal temperature and 50 mbar for additional 8h. Acetaldehyde is distilled out under vacuum. Afterwards the suspension is allowed to proceed for additional 8h at 35 °C and 50 mbar. The reaction mixture is heated to 115 °C jacket temperature under vacuum to distill methyl (3S)-3-hydroxy-2- methylene-butanoate out of the suspension. In that respect, the vacuum is gradually reduced to 3 mbar and the jacket temperature is increased to 135 °C. The product is analyzed by using chiral HPLC standard method. A chemical purity of >99 % is and an enantiomeric excess of >98 % ee is achieved. The isolated yield of methyl (3S)-3-hydroxy-2-methylene-butanoate is 41%.

Claims

Claims:

1. A protein variant having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80% identity with the amino acid sequence shown under SEQ ID No. 1 , characterized in that the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is N, or M; iii. the amino acid at position 52 is L; iv. the amino acid at position 53 is S, P, I, E, or A; v. the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is R, M, D, Y, S, I; vii. the amino acid at position 109 is H or F; viii. the amino acid at position 110 is T or V; ix. the amino acid at position 111 is S; x. the amino acid at position 117 is F or S; xi. the amino acid at position 121 is V; xii. the amino acid at position 122 is Q, A, Y, R, or V; xiii. the amino acid at position 153 is Y, D, E, or C; xiv. the amino acid at position 160 is E, C, D, P, I, Q, K, M, S, F, A, or N; xv. the amino acid at position 179 is C; xvi. the amino acid at position 181 is Q; xvii. the amino acid at position 184 is G or T; xviii. the amino acid at position 211 is E; xix. the amino acid at position 212 is S; xx. the amino acid at position 216 is K or A; xxi. the amino acid at position 234 is K, T or G; xxii. the amino acid at position 235 is V or M; xxiii. the amino acid at position 236 is T; xxiv. the amino acid at position 238 is A, K, D, E, or Q; xxv. the amino acid at position 240 is F; xxvi. the amino acid at position 289 is S or G; xxvii. the amino acid at position 291 is E or W; xxviii. the amino acid at position 317 is T; xxix. the amino acid at position 320 is E or G; xxx. the amino acid at position 321 is F.

2. Protein variant according to claim 1 , characterized in that the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by at least one of the following mutations: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R; viii. the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.

3. Protein variant according to claim 1 or 2, characterized in that the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by at least the following mutation: the amino acid at position 109 is H.

4. Protein variant according to any one of the preceding claims, wherein the protein variant comprises at least two, preferably at least three of the recited amino acid substitutions.

5. Protein variant according to any one of the preceding claims, wherein the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five, and most preferably all of the following modifications selected from: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 122 is R; iv. the amino acid at position 212 is P; v. the amino acid at position 234 is K; vi. the amino acid at position 289 is G.

6. Protein variant according to any one of the preceding claims, wherein the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; ii. and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.

7. Protein variant according to any one of the preceding claims, wherein the protein variant carries an amino acid sequence of one of SEQ ID No. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239.

8. Nucleic acid molecule encoding a protein having the activity of a lipase according to any one of the preceding claims.

9. Nucleic acid molecule according to claim 8 encoding a protein having the activity of a lipase selected from the group consisting of a) nucleic acid molecules comprising the nucleic acid sequences shown under of SEQ ID No.

8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102,

104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138,

140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174,

176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210,

212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240; b) nucleic acid molecules having at least 60%, preferably 70%, more preferably 80%, further more preferably 90%, even more preferably 95%, even further more preferably 96%, particular preferably 97%, most preferably 98% or especially preferably 99% identity with the nucleic acid sequences shown under a).

10. A recombinant nucleic acid molecule comprising a nucleic acid molecule according to claim 8 or 9.

11. The recombinant nucleic acid molecule according to claim 10, wherein the recombinant nucleic acid molecule is a vector or a plasmid.

12. A host cell comprising a protein according to any one of claims 1 to 7 or a nucleic acid molecule according to claim 8 or 9 or a recombinant nucleic acid molecule according to claim 10 or 11.

13. Use of a protein according to any one of claims 1 to 7 for the stereoselective hydrolysis of racemic methyl-3-hydroxy-2-methylene-butanoate to (3R)-3-hydroxy-2-methylene- butanoic acid.

14. Method for hydrolysing a substrate of formula (II) in an enantiomerically selective manner to a compound of formula (III) comprising contacting the substrate with a protein variant according to any one of claims 1 to 7,

wherein R¹ and R² are independently selected from a substutituted or unsubstituted (n)-alkyl, isoalkyl, aryl, alkyl-substituted or aryl-substituted aryl.

15. Method according to claim 14 for hydrolysing a substrate of formula (II- 1) in an enantiomerically selective manner to a compound fo formula (III-l),

further comprising separating the compound of formula (1-1).

16. Method according to any of claims 14 or 15, wherein the method is carried out in a biphasic system of water and an organic solvent, preferably selected from methyl-tert-butylether, toluene,

2-methyltetrahydrofuran, methyl isobutyl ketone, cyclohexane, cyclopentyl methyl ether, chlorobenzene, tert-amylmethylether, ethyl acetate, and isopropyl acetate.