WO2024188791A1 - Lipase with improved stereoselectivity and lipase-based chiral resolution methods - Google Patents
Lipase with improved stereoselectivity and lipase-based chiral resolution methods Download PDFInfo
- Publication number
- WO2024188791A1 WO2024188791A1 PCT/EP2024/055979 EP2024055979W WO2024188791A1 WO 2024188791 A1 WO2024188791 A1 WO 2024188791A1 EP 2024055979 W EP2024055979 W EP 2024055979W WO 2024188791 A1 WO2024188791 A1 WO 2024188791A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- amino acid
- lipase
- nucleic acid
- protein
- seq
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 56
- 108090001060 Lipase Proteins 0.000 title claims description 122
- 102000004882 Lipase Human genes 0.000 title claims description 121
- 239000004367 Lipase Substances 0.000 title claims description 117
- 235000019421 lipase Nutrition 0.000 title claims description 117
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 103
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 98
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 96
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 92
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 92
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 325
- 230000000694 effects Effects 0.000 claims description 52
- BZLVMXJERCGZMT-UHFFFAOYSA-N Methyl tert-butyl ether Chemical compound COC(C)(C)C BZLVMXJERCGZMT-UHFFFAOYSA-N 0.000 claims description 30
- 239000000758 substrate Substances 0.000 claims description 30
- 150000001875 compounds Chemical class 0.000 claims description 28
- 230000035772 mutation Effects 0.000 claims description 23
- VLCAYQIMSMPEBW-UHFFFAOYSA-N methyl 3-hydroxy-2-methylidenebutanoate Chemical compound COC(=O)C(=C)C(C)O VLCAYQIMSMPEBW-UHFFFAOYSA-N 0.000 claims description 20
- 239000013598 vector Substances 0.000 claims description 19
- 230000004048 modification Effects 0.000 claims description 18
- 238000012986 modification Methods 0.000 claims description 18
- 239000013612 plasmid Substances 0.000 claims description 16
- 229910052727 yttrium Inorganic materials 0.000 claims description 15
- 229910052717 sulfur Inorganic materials 0.000 claims description 14
- DDQIGAOHYCZMEQ-SCSAIBSYSA-N (3r)-3-hydroxy-2-methylidenebutanoic acid Chemical compound C[C@@H](O)C(=C)C(O)=O DDQIGAOHYCZMEQ-SCSAIBSYSA-N 0.000 claims description 13
- 230000003301 hydrolyzing effect Effects 0.000 claims description 10
- YXFVVABEGXRONW-UHFFFAOYSA-N Toluene Chemical compound CC1=CC=CC=C1 YXFVVABEGXRONW-UHFFFAOYSA-N 0.000 claims description 9
- 239000003960 organic solvent Substances 0.000 claims description 9
- 229910052698 phosphorus Inorganic materials 0.000 claims description 9
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 9
- 230000007062 hydrolysis Effects 0.000 claims description 8
- 238000006460 hydrolysis reaction Methods 0.000 claims description 8
- 229910052740 iodine Inorganic materials 0.000 claims description 8
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 claims description 6
- 238000006467 substitution reaction Methods 0.000 claims description 5
- MVPPADPHJFYWMZ-UHFFFAOYSA-N chlorobenzene Chemical compound ClC1=CC=CC=C1 MVPPADPHJFYWMZ-UHFFFAOYSA-N 0.000 claims description 4
- 230000000707 stereoselective effect Effects 0.000 claims description 4
- NTIZESTWPVYFNL-UHFFFAOYSA-N Methyl isobutyl ketone Chemical compound CC(C)CC(C)=O NTIZESTWPVYFNL-UHFFFAOYSA-N 0.000 claims description 3
- 125000003118 aryl group Chemical group 0.000 claims description 3
- 229910052739 hydrogen Inorganic materials 0.000 claims description 3
- SKTCDJAMAYNROS-UHFFFAOYSA-N methoxycyclopentane Chemical compound COC1CCCC1 SKTCDJAMAYNROS-UHFFFAOYSA-N 0.000 claims description 3
- JWUJQDFVADABEY-UHFFFAOYSA-N 2-methyltetrahydrofuran Chemical compound CC1CCCO1 JWUJQDFVADABEY-UHFFFAOYSA-N 0.000 claims description 2
- XDTMQSROBMDMFD-UHFFFAOYSA-N Cyclohexane Chemical compound C1CCCCC1 XDTMQSROBMDMFD-UHFFFAOYSA-N 0.000 claims description 2
- UIHCLUNTQKBZGK-UHFFFAOYSA-N Methyl isobutyl ketone Natural products CCC(C)C(C)=O UIHCLUNTQKBZGK-UHFFFAOYSA-N 0.000 claims description 2
- 230000002051 biphasic effect Effects 0.000 claims description 2
- JMMWKPVZQRWMSS-UHFFFAOYSA-N isopropanol acetate Natural products CC(C)OC(C)=O JMMWKPVZQRWMSS-UHFFFAOYSA-N 0.000 claims description 2
- 229940011051 isopropyl acetate Drugs 0.000 claims description 2
- GWYFCOCPABKNJV-UHFFFAOYSA-N isovaleric acid Chemical compound CC(C)CC(O)=O GWYFCOCPABKNJV-UHFFFAOYSA-N 0.000 claims description 2
- HVZJRWJGKQPSFL-UHFFFAOYSA-N tert-Amyl methyl ether Chemical compound CCC(C)(C)OC HVZJRWJGKQPSFL-UHFFFAOYSA-N 0.000 claims description 2
- 235000019626 lipase activity Nutrition 0.000 abstract description 5
- 150000003333 secondary alcohols Chemical class 0.000 abstract description 5
- 210000004027 cell Anatomy 0.000 description 55
- 150000001413 amino acids Chemical group 0.000 description 45
- 102000004190 Enzymes Human genes 0.000 description 40
- 108090000790 Enzymes Proteins 0.000 description 40
- 238000006243 chemical reaction Methods 0.000 description 27
- DDQIGAOHYCZMEQ-UHFFFAOYSA-N beta-Oxy-alpha-methylen-buttersaeure Natural products CC(O)C(=C)C(O)=O DDQIGAOHYCZMEQ-UHFFFAOYSA-N 0.000 description 26
- 239000000203 mixture Substances 0.000 description 24
- 230000006872 improvement Effects 0.000 description 15
- 241000894006 Bacteria Species 0.000 description 14
- VLCAYQIMSMPEBW-YFKPBYRVSA-N methyl (3s)-3-hydroxy-2-methylidenebutanoate Chemical compound COC(=O)C(=C)[C@H](C)O VLCAYQIMSMPEBW-YFKPBYRVSA-N 0.000 description 14
- 241000588724 Escherichia coli Species 0.000 description 13
- -1 cerevisiae Species 0.000 description 13
- 239000002773 nucleotide Substances 0.000 description 13
- 125000003729 nucleotide group Chemical group 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 239000000047 product Substances 0.000 description 11
- HBAQYPYDRFILMT-UHFFFAOYSA-N 8-[3-(1-cyclopropylpyrazol-4-yl)-1H-pyrazolo[4,3-d]pyrimidin-5-yl]-3-methyl-3,8-diazabicyclo[3.2.1]octan-2-one Chemical class C1(CC1)N1N=CC(=C1)C1=NNC2=C1N=C(N=C2)N1C2C(N(CC1CC2)C)=O HBAQYPYDRFILMT-UHFFFAOYSA-N 0.000 description 10
- 150000001298 alcohols Chemical class 0.000 description 10
- 238000004519 manufacturing process Methods 0.000 description 10
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 10
- 230000015572 biosynthetic process Effects 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 108020004414 DNA Proteins 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 230000001105 regulatory effect Effects 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- 239000002028 Biomass Substances 0.000 description 6
- 230000036983 biotransformation Effects 0.000 description 6
- 230000007071 enzymatic hydrolysis Effects 0.000 description 6
- 238000006047 enzymatic hydrolysis reaction Methods 0.000 description 6
- 230000001965 increasing effect Effects 0.000 description 6
- VLCAYQIMSMPEBW-RXMQYKEDSA-N methyl (3r)-3-hydroxy-2-methylidenebutanoate Chemical compound COC(=O)C(=C)[C@@H](C)O VLCAYQIMSMPEBW-RXMQYKEDSA-N 0.000 description 6
- 229920002477 rna polymer Polymers 0.000 description 6
- 241000233866 Fungi Species 0.000 description 5
- IMNFDUFMRHMDMM-UHFFFAOYSA-N N-Heptane Chemical compound CCCCCCC IMNFDUFMRHMDMM-UHFFFAOYSA-N 0.000 description 5
- 150000002148 esters Chemical class 0.000 description 5
- 238000004128 high performance liquid chromatography Methods 0.000 description 5
- 229930027917 kanamycin Natural products 0.000 description 5
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 5
- 229960000318 kanamycin Drugs 0.000 description 5
- 229930182823 kanamycin A Natural products 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 244000005700 microbiome Species 0.000 description 5
- TYJJADVDDVDEDZ-UHFFFAOYSA-M potassium hydrogencarbonate Chemical compound [K+].OC([O-])=O TYJJADVDDVDEDZ-UHFFFAOYSA-M 0.000 description 5
- 239000011541 reaction mixture Substances 0.000 description 5
- 239000002904 solvent Substances 0.000 description 5
- 239000000725 suspension Substances 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 241000607598 Vibrio Species 0.000 description 4
- 239000008346 aqueous phase Substances 0.000 description 4
- 238000004821 distillation Methods 0.000 description 4
- 239000003480 eluent Substances 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 4
- BWHMMNNQKKPAPP-UHFFFAOYSA-L potassium carbonate Chemical compound [K+].[K+].[O-]C([O-])=O BWHMMNNQKKPAPP-UHFFFAOYSA-L 0.000 description 4
- 125000000999 tert-butyl group Chemical group [H]C([H])([H])C(*)(C([H])([H])[H])C([H])([H])[H] 0.000 description 4
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 102100021851 Calbindin Human genes 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 101000898082 Homo sapiens Calbindin Proteins 0.000 description 3
- 102000004157 Hydrolases Human genes 0.000 description 3
- 108090000604 Hydrolases Proteins 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- KWYUFKZDYYNOTN-UHFFFAOYSA-M Potassium hydroxide Chemical compound [OH-].[K+] KWYUFKZDYYNOTN-UHFFFAOYSA-M 0.000 description 3
- 241000589516 Pseudomonas Species 0.000 description 3
- 101001021643 Pseudozyma antarctica Lipase B Proteins 0.000 description 3
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 3
- 239000003905 agrochemical Substances 0.000 description 3
- 239000007864 aqueous solution Substances 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 239000003054 catalyst Substances 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 3
- 238000004590 computer program Methods 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000011143 downstream manufacturing Methods 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 125000000959 isobutyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])* 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 238000002703 mutagenesis Methods 0.000 description 3
- 231100000350 mutagenesis Toxicity 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 229910052757 nitrogen Inorganic materials 0.000 description 3
- 239000011736 potassium bicarbonate Substances 0.000 description 3
- 229910000028 potassium bicarbonate Inorganic materials 0.000 description 3
- 235000015497 potassium bicarbonate Nutrition 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 239000007921 spray Substances 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000005809 transesterification reaction Methods 0.000 description 3
- IKHGUXGNUITLKF-UHFFFAOYSA-N Acetaldehyde Chemical compound CC=O IKHGUXGNUITLKF-UHFFFAOYSA-N 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- 241000186226 Corynebacterium glutamicum Species 0.000 description 2
- 102000053602 DNA Human genes 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 238000002965 ELISA Methods 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 241001661345 Moesziomyces antarcticus Species 0.000 description 2
- 241000589540 Pseudomonas fluorescens Species 0.000 description 2
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 2
- 241000187693 Rhodococcus rhodochrous Species 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000010933 acylation Effects 0.000 description 2
- 238000005917 acylation reaction Methods 0.000 description 2
- 239000008272 agar Substances 0.000 description 2
- 125000000217 alkyl group Chemical group 0.000 description 2
- 125000004429 atom Chemical group 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- BVKZGUZCCUSVTD-UHFFFAOYSA-N carbonic acid Chemical class OC(O)=O BVKZGUZCCUSVTD-UHFFFAOYSA-N 0.000 description 2
- 150000004649 carbonic acid derivatives Chemical class 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 230000032050 esterification Effects 0.000 description 2
- 238000005886 esterification reaction Methods 0.000 description 2
- GLVVKKSPKXTQRB-UHFFFAOYSA-N ethenyl dodecanoate Chemical compound CCCCCCCCCCCC(=O)OC=C GLVVKKSPKXTQRB-UHFFFAOYSA-N 0.000 description 2
- 239000000284 extract Substances 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 150000007529 inorganic bases Chemical class 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 125000002950 monocyclic group Chemical group 0.000 description 2
- 239000012074 organic phase Substances 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- 229920001184 polypeptide Polymers 0.000 description 2
- 229910000027 potassium carbonate Inorganic materials 0.000 description 2
- 235000015320 potassium carbonate Nutrition 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 102000004196 processed proteins & peptides Human genes 0.000 description 2
- 108090000765 processed proteins & peptides Proteins 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 229920006395 saturated elastomer Polymers 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 238000003756 stirring Methods 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 229910052723 transition metal Inorganic materials 0.000 description 2
- 150000003624 transition metals Chemical class 0.000 description 2
- WROUWQQRXUBECT-UHFFFAOYSA-N 2-ethylacrylic acid Chemical compound CCC(=C)C(O)=O WROUWQQRXUBECT-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 241000589291 Acinetobacter Species 0.000 description 1
- 239000004925 Acrylic resin Substances 0.000 description 1
- 229920000178 Acrylic resin Polymers 0.000 description 1
- 241000222518 Agaricus Species 0.000 description 1
- 244000251953 Agaricus brunnescens Species 0.000 description 1
- 235000001674 Agaricus brunnescens Nutrition 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000228245 Aspergillus niger Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 238000009631 Broth culture Methods 0.000 description 1
- 241001453380 Burkholderia Species 0.000 description 1
- 102000004308 Carboxylic Ester Hydrolases Human genes 0.000 description 1
- 108090000863 Carboxylic Ester Hydrolases Proteins 0.000 description 1
- 241000588881 Chromobacterium Species 0.000 description 1
- 108020004638 Circular DNA Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 102100031375 Endothelial lipase Human genes 0.000 description 1
- 239000004593 Epoxy Substances 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 241000223194 Fusarium culmorum Species 0.000 description 1
- 241000146406 Fusarium heterosporum Species 0.000 description 1
- 241000223221 Fusarium oxysporum Species 0.000 description 1
- 241000427940 Fusarium solani Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108010084311 Novozyme 435 Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical class OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 101001064468 Pseudozyma aphidis (strain ATCC 32657 / CBS 517.83 / DSM 70725 / JCM 10318 / NBRC 10182 / NRRL Y-7954 / St-0401) Lipase A Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000235403 Rhizomucor miehei Species 0.000 description 1
- 240000005384 Rhizopus oryzae Species 0.000 description 1
- 235000013752 Rhizopus oryzae Nutrition 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 1
- 241000223258 Thermomyces lanuginosus Species 0.000 description 1
- 241000223259 Trichoderma Species 0.000 description 1
- XTXRWKRVRITETP-UHFFFAOYSA-N Vinyl acetate Chemical compound CC(=O)OC=C XTXRWKRVRITETP-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 150000005840 aryl radicals Chemical class 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 238000011914 asymmetric synthesis Methods 0.000 description 1
- 125000002619 bicyclic group Chemical group 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 239000012455 biphasic mixture Substances 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- PBAYDYUZOSNJGU-UHFFFAOYSA-N chelidonic acid Natural products OC(=O)C1=CC(=O)C=C(C(O)=O)O1 PBAYDYUZOSNJGU-UHFFFAOYSA-N 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000004296 chiral HPLC Methods 0.000 description 1
- CRQQGFGUEAVUIL-UHFFFAOYSA-N chlorothalonil Chemical compound ClC1=C(Cl)C(C#N)=C(Cl)C(C#N)=C1Cl CRQQGFGUEAVUIL-UHFFFAOYSA-N 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 108010005400 cutinase Proteins 0.000 description 1
- 238000010908 decantation Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 235000019800 disodium phosphate Nutrition 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000010931 ester hydrolysis Methods 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000012239 gene modification Methods 0.000 description 1
- 230000005017 genetic modification Effects 0.000 description 1
- 235000013617 genetically modified food Nutrition 0.000 description 1
- 229940094991 herring sperm dna Drugs 0.000 description 1
- 150000002430 hydrocarbons Chemical group 0.000 description 1
- 239000012051 hydrophobic carrier Substances 0.000 description 1
- 150000004679 hydroxides Chemical class 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 125000001972 isopentyl group Chemical group [H]C([H])([H])C([H])(C([H])([H])[H])C([H])([H])C([H])([H])* 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 150000004702 methyl esters Chemical class 0.000 description 1
- 125000004108 n-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000003136 n-heptyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001280 n-hexyl group Chemical group C(CCCCC)* 0.000 description 1
- 125000000740 n-pentyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 125000001971 neopentyl group Chemical group [H]C([*])([H])C(C([H])([H])[H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 239000012457 nonaqueous media Substances 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 238000006362 organocatalysis Methods 0.000 description 1
- 238000002888 pairwise sequence alignment Methods 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 125000003538 pentan-3-yl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])C([H])([H])[H] 0.000 description 1
- 125000003367 polycyclic group Chemical group 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 235000011118 potassium hydroxide Nutrition 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 239000013558 reference substance Substances 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 102200009943 rs116840808 Human genes 0.000 description 1
- 229930195734 saturated hydrocarbon Natural products 0.000 description 1
- 125000002914 sec-butyl group Chemical group [H]C([H])([H])C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 235000017550 sodium carbonate Nutrition 0.000 description 1
- 235000011121 sodium hydroxide Nutrition 0.000 description 1
- 239000012064 sodium phosphate buffer Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000006104 solid solution Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 235000011149 sulphuric acid Nutrition 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 235000010692 trans-unsaturated fatty acids Nutrition 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000010792 warming Methods 0.000 description 1
- 239000011534 wash buffer Substances 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/18—Carboxylic ester hydrolases (3.1.1)
- C12N9/20—Triglyceride splitting, e.g. by means of lipase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P41/00—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture
- C12P41/003—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture by ester formation, lactone formation or the inverse reactions
- C12P41/005—Processes using enzymes or microorganisms to separate optical isomers from a racemic mixture by ester formation, lactone formation or the inverse reactions by esterification of carboxylic acid groups in the enantiomers or the inverse reaction
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/40—Preparation of oxygen-containing organic compounds containing a carboxyl group including Peroxycarboxylic acids
- C12P7/42—Hydroxy-carboxylic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y301/00—Hydrolases acting on ester bonds (3.1)
- C12Y301/01—Carboxylic ester hydrolases (3.1.1)
- C12Y301/01003—Triacylglycerol lipase (3.1.1.3)
Definitions
- the present invention relates to proteins with improved lipase activity, nucleic acid molecules encoding said proteins as well as enzymatic methods for the chiral resolution of secondary alcohols.
- Enantiomerically enriched or pure alcohols are important compounds for the production of agrochemical or pharmaceutical compounds.
- the absolute configuration of the stereocenters of chiral alcohols is crucial for the synthesis of the corresponding active agents.
- generation of correct chirality is often a challenge.
- chiral transition metal catalysts can be used, which, however, are characterized by high costs (G.R. Cook, Transition Metal-Mediated Kinetic Resolution, Current Organic Chemistry, 2000, 4, 869-885; Hirama et al. J. Org. Chem. 1988, 53, 708).
- G.R. Cook Transition Metal-Mediated Kinetic Resolution, Current Organic Chemistry, 2000, 4, 869-885; Hirama et al. J. Org. Chem. 1988, 53, 708
- an organocatalytic acyl transfer kinetic resolution is described for bulky ester substrates (Deng et al. J. Org. Chem. 2015, 80, 6, 3159).
- enzymes for example lipases, as biocatalysts for the production of chiral compounds. Kirchner et al. (J. Am.
- EP 0716712 Bl describes the lipase-catalyzed acylation of alcohols with diketenes, especially for the production of enantioselective acylated alcohols from racemic alcohols.
- WO2012146935A1 discloses modified lipase variants, as well as polynucleotides and recombinant expression vectors encoding the lipase variant polypeptides, as well as methods for producing such lipase variants in selected bacterial and fungal host cells.
- the specified lipase variants have increased enzyme specificity or enhanced trans-selectivity. Further described are methods of their use for reducing or eliminating trans-fatty acids from substrates.
- the present invention solves the above-described problems at least partially by providing novel lipases with superior enantiomer selectivity as well as providing novel methods for the chiral resolution of industrially relevant building blocks.
- the lipases described herein have certain advantages over known wild-type and other already known lipases.
- the modified or variant lipases described herein have the advantage that they can produce enantiomerically enriched or enantiomerically nearly pure or pure compounds better than respective wild- type lipases.
- a first aspect of the present invention relates to a protein having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80%, preferably 85%, more preferably 90%, furthermore preferably 92%, even more preferably 95% with the amino acid sequence shown under SEQ ID No. 1 , characterized in that the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is different from L, preferably the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is different from F, preferably the amino acid at position 51 is N, or M; iii.
- the amino acid at position 52 is different from V, preferably the amino acid at position 52 is L; iv. the amino acid at position 53 is different from T, preferably the amino acid at position 53 is S, P, I, E, or A; v. the amino acid at position 54 is different from D, preferably the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is different from A, preferably the amino acid at position 55 is R, M, D, Y, S, or I; vii. the amino acid at position 109 is different from G, preferably the amino acid at position
- the amino acid at position 110 is different from M, preferably the amino acid at position
- the amino acid at position 111 is different from A, preferably the amino acid at position
- the amino acid at position 111 is T or S; x. the amino acid at position 117 is different from Y, preferably the amino acid at position 117 is F or S; xi. the amino acid at position 121 is different from Y, preferably the amino acid at position
- the amino acid at position 122 is different from K, preferably the amino acid at position
- the amino acid at position 122 is Q, A, Y, R, or V; xiii. the amino acid at position 153 is different from H, preferably the amino acid at position 153 is N, Y, D, E, or C; xiv. the amino acid at position 160 is different from T, preferably the amino acid at position 160 is E, C, D, P, I, Q, K, M, S, F, A, or N; xv. the amino acid at position 179 is different from D, preferably the amino acid at position 179 is C; xvi. the amino acid at position 181 is different from A, preferably the amino acid at position 181 is Q; xvii. the amino acid at position 184 is different from A, preferably the amino acid at position 184 is G or T; xviii. the amino acid at position 211 is different from Y, preferably the amino acid at position
- the amino acid at position 212 is different from A, preferably the amino acid at position
- the amino acid at position 212 is S or P; xx. the amino acid at position 216 is different from Y, preferably the amino acid at position 216 is K or A; xxi. the amino acid at position 234 is different from S, preferably the amino acid at position
- the amino acid at position 235 is different from S, preferably the amino acid at position
- the amino acid at position 236 is different from K, preferably the amino acid at position 236 is T;
- xxiv. the amino acid at position 238 is different from R, preferably the amino acid at position 238 is A, K, D, E, or Q;
- xxv. the amino acid at position 240 is different from Y, preferably the amino acid at position 240 is F;
- the amino acid at position 289 is different from D, preferably the amino acid at position 289 is S or G;
- the amino acid at position 291 is different from G, preferably the amino acid at position 291 is E or W;
- xxviii. the amino acid at position 317 is different from N, preferably the amino acid at position 317 is T;
- the amino acid at position 321 is different from L, preferably the amino acid at position
- SEQ ID NO.l refers to a reference protein sequence with lipase activity.
- amino acid corresponding to position x in a first amino acid sequence means herein that an amino acid of a second amino acid sequence, when compared with the first amino acid sequence, appears at position x of the first amino acid sequence in a pairwise sequence alignment of the first amino acid sequence with the second amino acid sequence in case the numbering of the amino acids of the second amino acid sequence differs from the amino acid numbering of the first amino acid sequence.
- identity in respect to sequence identity or sequences being identical to is to be understood as meaning the number of identical amino acids or nucleotides shared over the entire sequence length by a first nucleic or amino acid sequence with another (second) nucleic or amino acid sequence, respectively, expressed in percent.
- Sequence alignments and scores for percentage sequence identity may for example be determined using software, such as EMBOSS, accessible at world wide web site of the EBI (ebi.ac.uk/Tools/emboss/).
- sequence similarity or identity may be determined by searching against databases (e.g. EMBL, GenBank) by using commonly known algorithms and output formats such as FASTA, BLAST, etc., but preferably hits should be retrieved and aligned pairwise to finally determine sequence identity.
- the identity is to be determined by determining the identity in percent of the number of amino acids or nucleotides, respectively, which the shorter sequence shares with the longer sequence.
- the identity is determined using the known and publicly available computer program ClustalW (Thompson et al., Nucleic Acids Research 22 (1994), 4673-4680).
- ClustalW is made publicly available by Julie Thompson (Thompson ⁇ EMBL- Heidelberg.DE) and Toby Gibson (Gibson@EMBL-Heidelberg.DE), European Molecular Biology Laboratory, Meyerhofstrasse 1, D 69117 Heidelberg, Germany.
- ClustalW can also be downloaded from various Internet pages, inter alia from IGBMC (Institut de Genetique et de Biologie Moleisme et Cellulaire, B.P.163, 67404 Illkirch Cedex, France; ftp://ftp-igbmc.u-strasbg.fr/pub/) and from EBI (ftp://ftp.ebi.ac.uk/pub/software/) and all mirrored Internet pages of the EBI (European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK).
- ClustalW computer program of version 1.8 or Clustal 2 determines the identity between proteins described in the context of the present invention and other proteins.
- ClustalW computer program of version 1.8 or Clustal 2 determines the identity for example between the nucleotide sequence of the nucleic acid molecules described in the context of the present invention and the nucleotide sequence of other nucleic acid molecules.
- the parameters have to be set as follows:
- “Identity” furthermore means that there is a functional and/or structural equivalence between the nucleic acid molecules in question or the proteins encoded by them.
- Functional equivalence means that the nucleic acid molecule sequences or the amino acid sequences encode a protein having the activity of a lipase.
- the nucleic acid molecules which are homologous to the molecules described above and represent derivatives of these molecules are generally variants of these molecules which represent modifications having the same biological function or catalyzing the same reaction, i.e. coding for a protein having the activity of a lipase. They may be either naturally occurring variants, for example sequences from other species, or mutations, where these mutations may have occurred in a natural manner or were introduced by targeted mutagenesis.
- variants may be synthetically produced sequences.
- allelic variants may be either naturally occurring variants or synthetically produced variants or variants generated by recombinant DNA techniques.
- those variants encode proteins having lipase-activity and comprise the amino acid substitutions (replacements), deletions or insertions described herein concerning the proteins according to the invention.
- a special type of derivatives are, for example, nucleic acid molecules which differ from the nucleic acid molecules described in the context of the present invention as a result of the degeneracy of the genetic code.
- hydrolase is a class of enzymes that commonly perform as biochemical catalysts that use water to break a chemical bond, which typically results in dividing a larger molecule into smaller molecules. Under unnatural, anhydrous, conditions these enzymes can also catalyze reactions of esterification, e.g. acetylation, and transesterification.
- the group of hydrolases comprises enzymes acting on ester bonds (EC 3.1) encompassing carboxylic ester hydrolases (EC 3.1.1) and as a subgroup lipases (EC 3.1.1.3).
- Lipases have been identified from plants, mammals and microorganisms including e.g. Pseudomonas, Vibrio, Acinetobacter, Burkholderia, Chromobacterium, cutinase from Fusarium solani (FSC), Candida antarctica A (CalA), Rhizopus oryzae (ROL), Thermomyces lanuginosus (TLL), Rhizomucor miehei (RML), Aspergillus Niger, Fusarium heterosporum, Fusarium oxysporum or Fusarium culmorum.
- Pseudomonas Vibrio, Acinetobacter, Burkholderia, Chromobacterium, cutinase from Fusarium solani (FSC), Candida antarctica A (CalA), Rhizopus oryzae (ROL), Thermomyces lanuginosus (TLL), Rhizomucor miehei (RML), Aspergillus Niger
- a protein has the activity of a lipase, this can be detected with methods known and described in the art.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least one of the following modifications: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R; viii. the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least the mutation G109H, i.e. the amino acid at position 109 is H.
- Lipase variant proteins according to the invention may exhibit further amino acid modifications (amino acid substitutions, deletions or insertions) compared to the amino acid sequences described herein above in respect to the amino acid sequence shown under SEQ ID No. 1.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five of the following modifications selected from: i. the amino acid at position 57 is different from N, preferably the amino acid is P; ii. the amino acid at position 109 is different from G, preferably the amino acid is H; iii. the amino acid at position 122 is different from K, preferably the amino acid is R; iv. the amino acid at position 212 is different from A, preferably the amino acid is P; v. the amino acid at position 234 is different from S, preferably the amino acid is K; vi. the amino acid at position 289 is different from D, preferably the amino acid is G.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 212 is P; iv. the amino acid at position 234 is K; and v. the amino acid at position 289 is G.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 122 is R; iv. the amino acid at position 212 is P; v. the amino acid at position 234 is K; and vi. the amino acid at position 289 is G.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.
- Preferred embodiments of the invention are proteins according to the invention encoding lipases having the amino acid sequences shown under SEQ ID Nos. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187
- One further embodiment of the invention concerns nucleic acid molecules encoding a protein according to the invention.
- Nucleic acid molecules according to the invention can be any kind of nucleic acid, as long as the nucleic acid encodes a protein according to the invention.
- the nucleic acids can be ribonucleic nucleic acid molecules (e.g. RNA, mRNA) or deoxyribonucleic nucleic acid molecules (DNA, including genomic DNA which may or may not comprise introns and coding DNA).
- the invention therefore also concerns nucleic acid molecules encoding a protein having the activity of a lipase selected from the group consisting of a) nucleic acid molecules comprising the nucleic acid sequences shown under SEQ ID Nos. 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172,
- hybridizing with means hybridization under conventional hybridization conditions, preferably under stringent conditions, as described, for example, in Sambrook et al. (Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. ISBN: 0879695773) or Ausubel et al. (Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929).
- “hybridization” means a hybridization under the following conditions: hybridization buffer:
- 2xSSC 2xSSC; lOxDenhardt solution (Fikoll 400+PEG+BSA; ratio 1:1:1); 0.1% SDS; 5 mM EDTA; 50 mM Na2HPO4; 250 pg/ml of herring sperm DNA; 50 pg/ml of tRNA; or
- Nucleic acid molecules which hybridize with nucleic acid molecules coding for a protein having the activity of a lipase may originate from any organism; accordingly, they may originate from bacteria, fungi, animals, humans, plants or viruses.
- Nucleic acid molecules which hybridize with nucleic acid molecules coding for a protein having the activity of a lipase preferably originate from microorganisms, more preferably from fungi or bacteria, most preferably from bacteria.
- Nucleic acid molecules which hybridize with the molecules mentioned may be isolated, for example, from genomic or from cDNA libraries. Such nucleic acid molecules can be identified and isolated using the nucleic acid molecules described herein or they can be identified and isolated using parts of these molecules or the reverse complements of these molecules, for example by hybridization according to standard methods (see, for example, Sambrook et al., Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. ISBN: 0879695773; Ausubel et al., Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929) or by amplification using PCR.
- the fragments used as hybridization samples may also be synthetic fragments or oligonucleotides prepared using the customary synthesis techniques, whose sequence is essentially identical to the nucleic acid molecule described in the context of the present invention.
- the molecules hybridizing with the nucleic acid molecules described in the context of the present invention comprise in particular fragments, derivatives and allelic variants of the nucleic acid molecules mentioned.
- the term “derivative” means that the sequences of these molecules differ in one or more positions from the sequences of the nucleic acid molecules described above and are highly identical to these sequences.
- the differences to the nucleic acid molecules described above may, for example, be due to deletion, addition, substitution, insertion or recombination.
- nucleotide abbreviations a, c, g, t, and those of abbreviations for degenerate nucleotides r, y, s, w, k, m, b, d, h, v, n is derivable herein below from Table 1 under the paragraph sub-titled “Description of the Sequences”. Which amino acids are encoded by codons comprising degenerate nucleotides is derivable herein below from Table 3 under the paragraph sub-titled “Description of the Sequences”.
- the invention relates to recombinant nucleic acid molecules comprising a nucleic acid molecule according to the invention.
- the term “heldrecombinant nucleic acid molecule” is to be understood to mean a nucleic acid molecule, which contains additional sequences in addition to nucleic acid molecules according to the invention, which do not naturally occur in the combination in which they occur in recombinant nucleic acids according to the invention.
- the abovementioned additional sequences can be any sequences, preferably they are functional or regulatory sequences (promoters, termination signals, enhancers, ribosome binding sites (rbs), leader sequences enhancing transcription, translation or RNA stability, subcellular targeting sequences etc.), particularly preferably they are functional or regulatory sequences that are active in microorganisms, and especially particularly preferably they are regulatory sequences that are active in fungi, in particular yeasts or in bacteria.
- Methods for the creation of recombinant nucleic acid molecules according to the invention are known to the person skilled in the art and include genetic methods such as bonding nucleic acid molecules by way of ligation, genetic recombination, or new synthesis of nucleic acid molecules.
- the recombinant nucleic acid molecules according to the invention comprise a nucleic acid molecule according to the invention which is linked with regulatory sequences, which initiate transcription in prokaryotic or eukaryotic cells.
- Regulatory sequences, which initiate transcription” in a cell are also known as promoters.
- Regulatory sequences which initiate transcription in prokaryotic organisms e.g. E. coli, and in eukaryotic organisms are sufficiently described in literature, in particular such for expression in yeast are described, e.g. Saccharomyces cerevisiae.
- An overview of various systems for expression for proteins in various host organisms can be found, for example, in Methods in Enzymology 153 (1987), 383-516 and in Bitter et al. (Methods in Enzymology 153 (1987), 516-544) or in Gomes et al. (2016, Advances in Animal and Veterinary Sciences, 4(4), 346) and Baghban et al. (2018, Current Pharmaceutical Biotechnology, 19(6)).
- Common yeast promoters are pAOXl, pHIS4, pGAL, pScADH2 (Baghban et al., 2018, see above).
- Common bacterial promoters are T5, T7, rhamnose-inducible, arabinose-inducible, PhoA, artificial trc (trp-lac) promoter as described by Marschall et al. (2017, Appl Microbiol Biotechnol 101, 501-512) and Tegel et al. (2011, FEBS Journal 278, 729-739).
- a further embodiment of recombinant nucleic acid molecules of the present invention are vectors or plasmids, which comprise the nucleic acid molecules according to the invention.
- Vectors are commonly understood in the field of molecular biology and herein to represent a nucleic acid sequence or a vehicle comprising a nucleic acid sequence used to transfer genetic material (DNA or RNA) into a target cell.
- Vectors can be plasmids, e.g. T-DNA or binary vectors for generating transgenic plants, expression vectors for expression of nucleic acid sequences in a host cell, shuttle vectors which are eligible to propagate in different hosts, or vectors can be virus particles or bacteriophages having been modified to deliver foreign genetic material into a host.
- “Plasmids” are commonly understood in the field of molecular biology and herein to represent an autonomously self-replicating, often circular DNA molecule which is when present in a host cell separated from the chromosomal DNA.
- Nucleic acid molecules according to the invention, recombinant nucleic acid molecules according to the invention, vectors or plasmids according to the invention can be used for production of proteins according to the invention, e.g. by expressing the nucleic acid molecules according to the invention in host cells.
- Another embodiment of the invention concerns hosts or host cells comprising or expressing a nucleic acid molecule according to invention or comprising proteins according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention.
- nucleic acid molecules according to the invention encoding a protein having the activity of a lipase can be expressed in host cells for e.g. their multiplication or for production of proteins according to the invention.
- nucleic acid molecules according to the invention can be comprised on vectors or plasmids or they can be stably integrated into the genome of a respective host cell.
- the nucleic acid molecules according to the invention can also be comprised by vectors which support their introduction into host cells.
- a further embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case comprising a protein according to the invention.
- Another embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case expressing a protein according to the invention.
- Another embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case expressing a protein, wherein the protein has the activity of a lipase.
- “Expressing a nucleic acid molecule” shall be understood herein to mean that in case the nucleic acid molecule is RNA or mRNA the nucleic acid molecule is translated into a protein, preferably translated into a protein having the activity of a lipase or in case of the nucleic acid molecule is DNA or cDNA it is transcribed (and in case of genomic DNA containing introns is processed) into mRNA, preferably into a mRNA encoding a protein having the activity of a lipase and subsequently translated into a protein, preferably translated into a protein having the activity of a lipase.
- Transcription of a given nucleic acid molecule in a host can be demonstrated by methods known to a person skilled in the art, for example, by detection of specific transcripts (mRNA) of foreign nucleic acid molecules by Northern blot analysis or RT-PCR.
- mRNA specific transcripts
- hosts or host cells comprise a given protein or comprise a protein which is derived from expressing a nucleic acid molecule can be determined by methods known to a person skilled in the art, for example, by immunological methods, such as Western blot analysis, ELISA (Enzyme Linked Immuno Sorbent Assay) or RIA (Radio Immune Assay).
- immunological methods such as Western blot analysis, ELISA (Enzyme Linked Immuno Sorbent Assay) or RIA (Radio Immune Assay).
- the person skilled in the art is familiar with methods for preparing antibodies which react specifically with a certain protein, i.e. which bind specifically to a certain protein (see, for example, Lottspeich and Zorbas (eds.), 1998, Bioanalytik, Spektrum akad, Verlag, Heidelberg, Berlin, ISBN 3-8274-0041-4).
- Some companies (Thermo Fisher Scientific, 168 Third Avenue, Waltham, MA USA 0245; GenScript, 60 Centennial Ave., Pi
- a person skilled in the art can test if a host or host cell comprises a protein according to the invention by detecting (additional) activity of proteins having the activity of a lipase in a respective host cell.
- activity of proteins having additional activity of a lipase in a respective host cell is detected by comparing the activities of lipases of a host cell according to the invention with the respective activity of host cell not comprising a protein according to the invention.
- Testing if a protein has the activity of a lipase can be done by methods known in the art.
- Host or host cells according to the invention can be produced by a person skilled in the art by known methods for genetically modifying or transforming organisms.
- a further subject of the present invention therefore is a host or host cell according to the invention, particularly a prokaryotic or eukaryotic host or host cell, which is genetically modified (or transformed) with a nucleic acid molecule according to the invention or with a recombinant nucleic acid molecule according to the invention or with a vector according to the invention or a plasmid according to the invention.
- the genetically modified (transformed) host or host cell according to the invention expresses a protein having the activity of a lipase, more preferably, the genetically modified (transformed) host or host cell according to the invention expresses a protein according to the invention.
- Genetically modified with a nucleic acid molecule or “transformed with a nucleic acid molecule” shall be understood herein to mean that a nucleic acid molecule is or was introduced into a host or host cell by technical and/or non-naturally occurring means, preferably by technical methods in the field of molecular biology, biotechnology or genetic modification.
- Descendants, offspring or progeny of hosts or host cells according to the invention are also an embodiment of the invention, preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention or comprise a protein according to the invention, more preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention and, in each case express a protein, wherein the protein has the activity of a lipase, even more preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid
- the host or host cell according to the invention can be a host or host cell from any prokaryotic or eucaryotic organism.
- the hosts or host cells can be bacteria or bacteria cells (e.g. E. coli, bacteria of the genus Bacillus, in particular Bacillus subtilis, Agrobacterium, particularly Agrobacterium tumefaciens or Agrobacterium rhizogenes, Pseudomonas, particularly Pseudomonas fluorescens, Streptomyces spp, Rhodococcus spp, in particular Rhodococcus rhodochrous, Vibrio natrigens, Corynebacterium, particularly Corynebacterium glutamicum) or fungi or fungal cells (e.g.
- Agaricus in particular Agaricus bisporus, Aspergillus, Trichoderma or yeasts, particularly .S’, cerevisiae, Pichia ssp. like P. pastoris), as well as plants or plant cells or they can be animals or animal cells.
- Preferred host cells according to the invention are cells of microorganisms.
- this is understood to include all bacteria and all protists (e.g. fungi, particularly yeasts and algae), as they are defined in Schlegel "General Microbiology” (Georg Thieme Publishing House (1985), 1-2), for example.
- the hosts or host cells according to the invention are preferably bacteria/bacteria cells or yeast/yeast cells, most preferably they are bacteria/bacteria cells.
- the hosts or host cells according to the invention are preferably Bacillus species/ Bacillus species cells or Escherichia coli! Escherichia coli cells cells most preferably Escherichia coli/ Escherichia coli cells.
- Pseudomonas particularly Pseudomonas fluorescens, Streptomyces spp, Rhodococcus spp, in particular Rhodococcus rhodochrous
- Vibrio spp particularly Vibrio natrigens
- Corynebacterium particularly Corynebacterium glutamicum or others
- hosts or host cells according to the invention can be hosts or host cells according to the invention.
- a preferred embodiment of the invention concerns hosts or host cells according to the invention comprising a nucleic acid molecule according to the invention, wherein the nucleic acid molecule according to the invention is characterized in that the codons of said nucleic acid molecule are changed such that they are adapted to the frequency of use of the codons of the host or a host cell, respectively.
- Host cells according to the invention can be used for production of proteins according to the invention.
- Proteins according to the invention can be used in methods for production of enantiomerically enriched or nearly enantiomerically pure secondary alcohols.
- a second aspect of the present invention relates to a method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I), comprising contacting the substrate with a variant protein having the activity of a lipase according to the first aspect of the present invention or with a lipase according to SEQ ID No. 1 , wherein R 1 and R 2 are independently from each other selected from a substituted or unsubstituted (n)-alkyl, iso-alkyl, unsubstituted aryl, alkyl-substituted or aryl-substituted aryl.
- the substrate is contacted with a variant protein having the activity of a lipase according to the first aspect of the present invention.
- R 1 and and R 2 are preferably independently from each other chosen to be a linear or branched Ci-io residue.
- R 1 is selected from methyl, iso-butyl, tert-butyl, and iso-propyl; and/or R 2 is methyl.
- R 1 is selected from methyl, iso-butyl, tert-butyl, and iso-propyl; and R 2 is methyl.
- the method comprises hydrolysing a racemic substrate of formula (II- 1) in an enantiomerically selective manner to separate an enantiomerically enriched or pure compound of formula (1-1),
- the method preferably comprises isolating the compound (I) or (1-1), and/or (III) or (III-l) after hydrolysis.
- the isolation of compounds (1-1) can be done by any means which are known to those skilled in the art and is preferably done via extraction and/or direct distillation.
- the lipase variants or protein variants according to the invention show improved selectivity and/or improved specific activity regarding the stereoselective hydrolysis of methyl-3-hydroxy-2-methylene- butanoate (formula II- 1) and are better adapted to isolate enantiomerically enriched or nearly pure substrate methyl (3S)-3-hydroxy-2-methylene-butanoate (formula 1-1) compared to a wild-type lipase.
- Compounds of formula (I) represent an important building block in the synthesis of complex agrochemical compounds.
- (1-1 ) which is the S-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, is an important intermediate in the respective synthesis of agrochemical compounds described in WO 2018/228985.
- the lipase variants according to the present invention selectively hydrolyse the R-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, allowing the separation of the S-enantiomer of said racemic methyl 3 -hydroxy-2-methylene-butanoate .
- the method is conducted in an aqueous solution.
- the method is conducted in a mixture or biphasic liquid system of water and an organic solvent, such as methyl-tert-butylether, toluene, 2-methyltetrahydrofuran, methyl isobutyl ketone, cyclohexane, cyclopentyl methyl ether, chlorobenzene, tert-amylmethylether, ethyl acetate, isopropyl acetate.
- the organic solvent is methyl-tert-butylether (MTBE).
- the ratio of organic solvent to water is further preferably in the range from 1 : 1 to 6: 1 , even more preferably 2:1 to 4:1.
- an organic solvent such as MTBE, promotes an efficient downstream processing of the reaction products, i.e. isolation of compounds of the formula (I) or (1-1).
- the method is carried out at a temperature of between 20 and 60 °C, preferably between 30 and 55 °C. Further preferably, the method is carried out at a temperature of between 35 and 50 °C.
- the method is carried out for at least 1 h, preferably at least 2 h, further preferably at least 3 h.
- the method is carried out for at most 40 h, preferably at most 30 h, further preferably at most 20 h.
- the method is carried out at a pH of between 7 and 8.5, preferably between 7.2 and 8.0, further preferably between 7.4 and 7.6.
- the pH may be adjusted by addition of organic or inorganic bases, preferably by the addition of inorganic bases, such as hydrogen carbonates or carbonates, hydrogen phosphates and hydroxides.
- the enzyme is deactivated; further, there is partial hydrolysis of the ester of formula (I). Below pH 7, the reaction rate decreased and finally the reaction stops. Further below, at a pH of 4.5 or less, the enzyme is deactived.
- the pH may be controlled by addition of a base, such as NaOH, KOH, K2CO3, KHCO3, Na2CO3, NaHCO i.
- bases are carbonates or hydrogencarbonates.
- the base can be added as solid or aqueous solution, diluted or saturated.
- the base may be added to the reaction before the substrate is dosed, or the base is dosed into the reaction in parallel to the substrate.
- the substrate and/or base can be dosed or added in one portion.
- the lipase of SEQ ID No. 1 or the protein variant according to the first aspect of the present invention may be provided as purified enzyme or in form of spray-dried or freeze-dried biomass or as broth.
- the protein variant is provided as broth, i.e. as cell culture, or the broth is centrifugated and the cell pellet is freeze-dried to obtain lyophilized cells or biomass.
- the reaction is preferably conducted in a mixture of water and a water- immiscible solvent.
- the phases are separated and the aqueous phase is eventually back-extracted.
- the biomass can be separated by known technical means (e.g. centrifuge, filtration or decantation) before distillation or the distillation can take place without separation of the biomass.
- the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may be carried out by means of a protein having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80%, preferably 85%, more preferably 90%, furthermore preferably 92%, even more preferably 95% with the amino acid sequence shown under SEQ ID No.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is different from L, preferably the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is different from F, preferably the amino acid at position 51 is N, or M; iii. the amino acid at position 52 is different from V, preferably the amino acid at position 52 is L; iv. the amino acid at position 53 is different from T, preferably the amino acid at position 53 is S, P, I, E, or A; v.
- the amino acid at position 54 is different from D, preferably the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is different from A, preferably the amino acid at position 55 is R, M, D, Y, S, or I; vii. the amino acid at position 109 is different from G, preferably the amino acid at position 109 is H or F; viii. the amino acid at position 110 is different from M, preferably the amino acid at position 110 is T or V; ix. the amino acid at position 111 is different from A, preferably the amino acid at position 111 is T or S; x. the amino acid at position 117 is different from Y, preferably the amino acid at position 117 is F or S; xi. the amino acid at position 121 is different from Y, preferably the amino acid at position
- the amino acid at position 122 is different from K, preferably the amino acid at position
- the amino acid at position 122 is Q, A, Y, R, or V; xiii. the amino acid at position 153 is different from H, preferably the amino acid at position 153 is N, Y, D, E, or C; xiv. the amino acid at position 160 is different from T, preferably the amino acid at position 160 is E, C, D, P, I, Q, K, M, S, F, A, or N; xv. the amino acid at position 179 is different from D, preferably the amino acid at position 179 is C; xvi. the amino acid at position 181 is different from A, preferably the amino acid at position 181 is Q; xvii. the amino acid at position 184 is different from A, preferably the amino acid at position 184 is G or T; xviii. the amino acid at position 211 is different from Y, preferably the amino acid at position
- the amino acid at position 212 is different from A, preferably the amino acid at position
- the amino acid at position 212 is S or P; xx. the amino acid at position 216 is different from Y, preferably the amino acid at position 216 is K or A; xxi. the amino acid at position 234 is different from S, preferably the amino acid at position
- the amino acid at position 235 is different from S, preferably the amino acid at position
- the amino acid at position 236 is different from K, preferably the amino acid at position
- the amino acid at position 236 is T; xxiv. the amino acid at position 238 is different from R, preferably the amino acid at position 238 is A, K, D, E, or Q; xxv. the amino acid at position 240 is different from Y, preferably the amino acid at position 240 is F; xxvi. the amino acid at position 289 is different from D, preferably the amino acid at position 289 is S or G; xxvii. the amino acid at position 291 is different from G, preferably the amino acid at position 291 is E or W; xxviii. the amino acid at position 317 is different from N, preferably the amino acid at position 317 is T; xxix. the amino acid at position 320 is different from N, preferably the amino acid at position
- the amino acid at position 321 is different from L, preferably the amino acid at position
- the amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by at least one of the following modifications: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R; viii.
- the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.
- the amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) differs from the amino acid sequence of SEQ ID NO. 1 by at least the mutation G109H, i.e. the amino acid at position 109 is H.
- the amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five of the following modifications selected from: i. the amino acid at position 57 is different from N, preferably the amino acid is P; ii. the amino acid at position 109 is different from G, preferably the amino acid is H; iii. the amino acid at position 122 is different from K, preferably the amino acid is R; iv.
- the amino acid at position 212 is different from A, preferably the amino acid is P; v. the amino acid at position 234 is different from S, preferably the amino acid is K; vi. the amino acid at position 289 is different from D, preferably the amino acid is G.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 212 is P; iv. the amino acid at position 234 is K; and v. the amino acid at position 289 is G.
- the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 122 is R; iv. the amino acid at position 212 is P; v. the amino acid at position 234 is K; and vi. the amino acid at position 289 is G.
- the amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO.
- the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.
- Preferred proteins according to the invention encoding lipases that may be used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) are shown under SEQ ID Nos. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69,
- a further embodiment of the invention is the use of a protein according to the first aspect of the invention for the stereoselective hydrolysis of racemic methyl-3-hydroxy-2-methylene-butanoate to isolate methyl (3S)-3-hydroxy-2-methylene-butanoate.
- Enantiomerically enriched means herein that one of two enantiomers is present in a composition in higher amounts than the other enantiomer, preferably at least 60% of one enantiomer is present in the composition, more preferably at least 65% of one enantiomer is present in the composition, further more preferably at least 70% of one enantiomer is present in the composition, even more preferably at least 75% of one enantiomer is present in the composition, even further more preferably at least 80% of one enantiomer is present in the composition, particular preferably at least 85% of one enantiomer is present in the composition, most preferably at least 90% of one enantiomer is present in the composition or especially preferably at least 94% of one enantiomer is present in the composition.
- Enantiomerically nearly pure means herein that one of two enantiomers is present in a composition in amounts of at least 95.0%, preferably one of two enantiomers is present in a composition in amounts of at least 95.5%, more preferably one of two enantiomers is present in a composition in amounts of at least 96.0%, further more preferably one of two enantiomers is present in a composition in amounts of at least 96.5%, even more preferably one of two enantiomers is present in a composition in amounts of at least 97.0%, even further more preferably one of two enantiomers is present in a composition in amounts of at least 98.0%, particular preferably one of two enantiomers is present in a composition in amounts of at least 98.5%, most preferably one of two enantiomers is present in a composition in amounts of at least 99.0%, or especially preferably one of two enantiomers is present in a composition in amounts of at least 99.5%.
- a third aspect of the present invention relates to a method for the enantiomeric enrichment of compounds of formula (I) comprising reacting compounds of formula (II) with compounds of formula (IV) and in particular with R 1 and R 2 being defined as above, and R 3 being a substutituted or unsubstituted (n) -Alkyl or isoalkyl, in the presence of a lipase under anhydrous conditions.
- “Anhydrous conditions” refer to a reaction system comprising at most 3 % water in the liquid phase.
- the reaction preferably takes place in an organic solvent, such as hexane or n-heptane, or without any additional solvent.
- R 3 can preferably be a linear Ci-i6 residue, in particular methyl or unsubstituted and saturated Cs-Cn residue.
- Compound (IV) can be in particular vinyl acetate or vinyl laurate.
- the lipase is preferably immobilized on a solid carrier during the action. Immobilization leads to increased enzyme stability in non-aqueous solutions.
- the reaction is preferably carried out at a temperature of between 20 and 40°C, preferably 25 to 35°C. Such temperature range allows for optimal enzymatic reacticity.
- the reaction is preferably carried out for at least 1 h, further preferably at least 3 h, even further preferably at least 5 h.
- the lipase is preferably recycled after the reaction has taken place. This allows the method to be designed in a cost-efficient manner.
- the lipase can thus be used multiple times for the reaction according to the third aspect or a further reaction.
- the compound of formula (I) is preferably isolated by means of distillation directly from the reaction mixture or from the supernatant, obtained after decanting the reaction mixture.
- the lipase preferably remains in the remaining reaction mixture and can be reused for another reaction.
- the lipase is preferably CALB lipase.
- CALB is a non-specific lipase originating from Candida antarctica B and was first described in 1994 (Uppenberg J, Patkar S, Bergfors T, Jones TA (1994) J Mol Biol 235(2):790-792).
- the lipase is furthermore preferably immobilized on a hydrophobic carrier, such as acrylic resin.
- a hydrophobic carrier such as acrylic resin.
- a commercial version of CALB is, for example, Novozym® 435, which can be used for carrying out the reaction.
- polypeptides enzymes, i.e. lipases
- processes according to the present invention allow for an efficient enantiomeric enrichment of compounds of the formula (I)
- alkyl includes saturated hydrocarbon residues which can be branched or straight-chain and unsubstituted or at least monosubstituted.
- suitable alkyl residues which can be unsubstituted or mono- or polysubstituted, are methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, 2-butyl, tert-butyl, n-pentyl, 2-pentyl, 3-pentyl, isopentyl, neo-pentyl, n-hexyl, 2-hexyl,
- aryl either alone or in combination with other terms, is to be understood as a mono- or polycyclic, preferably a mono- or bicyclic, aromatic hydrocarbon radical with preferably 6, 10 or 14 carbon atoms.
- An aryl radical can be unsubstituted or monosubstituted or polysubstituted, either with identical or different substituents.
- Codon usage follows herein the so called “general genetic code” according to the following Table, wherein “f ’ is to be substituted by “u” in ribonucleic acid (RNA) sequences.
- Table 3
- Table 4b Further lipase variants according to the present invention (protein SEQ ID No. 171-239).
- the variants described contain the backbone mutations N57P, K122R, A212P, S234K, D289G and G109H (SEQ ID No. 171, 172), unless (i) the mutation “R122K” is indicated in addition, which means that the K122R mutation has been back- mutated such that the position 122 carries an K again, or (ii) the mutation “R122Q” is indicated in addition, which means that the K122R mutation has been further mutated such that the position 122 carries a Q.
- Analytical grade chemicals and ready-to-use kits were sourced and used from common suppliers such as Sigma Aldrich, Acros Organics, Fisher Scientific, Qiagen or Stratagene. Novozym® 435 is bought from Sigma Aldrich.
- Nucleic acid sequences of SEQ ID No. 1 or their variants were cloned into an expression vector based on pKA81.
- the genetic elements were incorporated into the vector by means known in the art.
- the vectors were introduced into electrocompetent Escherichia coli (E. coli) W3110 cells.
- Nucleotides were substituted in the wild-type sequence or sequences derived therefrom to allow for amino acid substitution. This exchange can be achieved by various molecular biological methods.
- One method for replacing nucleic acids is site-directed mutagenesis, which can result in mutations at one or more sites in the amino acid sequence. Methods for site-directed mutagenesis are state of the art and are described in the literature (e.g. Directed Mutagenesis: A Practical Approach, 1991, Edited by M.J. McPHERSON, IRL PRESS) and can be purchased as ready-made kits (e.g. the QUIKCHANGETM lightening mutagenesis kit from Qiagen or Stratagene). After inserting mutations into the gene sequence and culturing in a suitable E. coli cloning strain, the plasmids obtained were transformed into Escherichia coli W3110.
- Transformed cells were tested in appropriate biotransformation reactions in order to determine product yield and selectivity. Appropriate biotransformation reactions are described below, see Example 2. Sequence verification was performed as known in the art.
- Glycerol stocks of the E. coli cultures transformed with the respective expression plasmids were prepared by adding one volume of 40% glycerol solution to one volume of E. coli culture.
- E. coli cultures were plated onto LB- Agar plates containing suitable concentrations of kanamycin and incubated at 37°C until single colonies were obtained.
- precultures sterilized 2 mL 96-well deep-well plates (Eppendorf, Hamburg, Germany) were used and mixed with 590 pL TB medium (50 pg/ml kanamycin) and 10 pL of the glycerol stock of the respective E. coli strain filled.
- 590 pL of TB medium 50 pg/ml kanamycin
- 590 pL of TB medium 50 pg/ml kanamycin
- the precultures were incubated at 37° C and 250 rpm for 17 h.
- Cells were harvested by centrifugation for 15 min at 4°C and 2500 x g. The culture supernatant was discarded, and the cell pellet was suspended in 200 pL PBS. The cells were then lyophilized for 24 h and stored at 4°C until use.
- the chiral resolution of methyl (3S)-3-hydroxy-2-methylene-butanoate was conducted in microtiter plates with the lyophilized supernatants.
- 100 pl racemic methyl-3-hydroxy-2- methylene-butanoate, 105 pl MTBE (Methyl-tert-butylether) and 45 pl ultrapure water (with 416.6 g/L KHCO3) were used in each well.
- Each plate was sealed and incubated for 6 h at 45 °C in a shaker at 300 rpm.
- the reaction was then stopped by adding 22.5 pl of 20% H2SO4 solution.
- 1 ml MTBE was added per well in the deep well plates.
- the plates were shaken at room temperature for 10 min and then centrifuged at 2500xg for 10 min. 10 pl of the organic phase were then transferred to 96- well PCR plates containing 100 pl MTBE and analyzed by HPLC.
- Instrument Agilent Technologies 1290 Infinity II; Column: Lux cellulose-2, 100x4.6 mm, 3 pm; Eluent A: premixed heptane (+0.05% formic acid); eluent B: ethanol; flow: isocratic (90% eluent A/10% eluent B), flow rate: 0.8 mL/min; temperature: 25 °C; sample injection volume: 1 pL; Detection: absorption at 210 nm.
- Racemic methyl-3-hydroxy-2-methylene-butanoate and racemic 3-hydroxy-2-methylene-butanoic acid (prepared as for the biotransformation samples) were used as a reference substance and as a standard for quantification. Appropriate dilutions of the standards were used in order to be able to quantify the substrate used and the resulting product using standard lines. The samples were analyzed for substrate conversion and product formation. The evaluation and comparison of individual samples with one another was carried out by determining the substrate and product ee[%].
- a high selectivity and a high activity of the respective enzyme variant are decisive for an efficient conversion of racemic methyl 3-hydroxy-2-methylene-butanoate to (3R)-3-hydroxy-2-methylene- butanoic acid in order to generate an almost enantiomerically pure product in high yield.
- the enzyme selectivity ee [%] 3-hydroxy-2-methylene-butanoate is defined by the difference of the mole fraction of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene- butanoic acid divided by the sum of the mole fraction of (3R)-3-hydroxy-2-methylene-butanoic acid and (3 S) -3 -hydroxy-2-methylene-butanoic acid.
- the lipase of SEQ ID No. 1 shows a selectivity of 100 % and an activity of 100%.
- Table 5 Lipase variants showing a relative improvement in enzyme activity compared to the lipase of SEQ ID No. 1.
- the relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.l) in percent.
- the substrate ee[%] of the lipase of SEQ ID No. 1 is 9.8.
- Table 6 Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 1.
- the relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent.
- the product ee[%] of the reference lipase is 75.8.
- Table 7 Lipase variants showing a relative improvement in enzyme activity compared to the reference lipase of SEQ ID No. 1.
- the relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.1) in percent.
- Table 8 Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 1.
- the relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 171 in percent. All lipase variants in the table carry in addition the mutations N57P, K122R, A212P, S234K, D289G, G109H.
- Example 5 Enzymatic hydrolysis of different esters
- esters (a) to (d) allow for an efficient enantiomeric resolution, wherein the (R)-Ester is preferably hydrolysed:
- Example 7 Further examples on enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene- butanoate to (3R)-3-hydroxy-2-methylene-butanoic acid
- the cultivation, biotransformation and HPLC analysis were carried out as described in example 2.
- the enzyme selectivity ee [%] 3-hydroxy-2-methylene-butanoate is defined by the difference of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene-butanoic acid divided by the sum of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene- butanoic acid.
- the enzyme activity ee [%] for the conversion of the substrate methyl (3R)3-hydroxy-2- methylene-butanoate is described by the difference between methyl-(3S)3-hydroxy-2-methylene- butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate divided by the sum of methyl-(3S)3- hydroxy-2-methylene-butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate.
- the reference lipase of SEQ ID No. 1 shows a selectivity of 100 % and an activity of 100%
- Table 11 Comparative lipase variants showing a similar enzyme activity profile compared to the reference lipase of SEQ ID No. 1.
- the relative difference in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.1) in percent.
- Table 12 Comparative lipase variants showing a similar enzyme selectivity compared to the reference lipase of SEQ ID No. 1.
- the relative difference in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2-methylene- butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent.
- the comparative lipases worsen the activity of the lipase, and do not improve the selectivity of the discussed substrate.
- Example 8 Enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene-butanoate to (3R)-3- hydroxy-2-methylene-butanoic acid
- a suspension of 1400 g racemic methyl 3-hydroxy-2-methylene-butanoate [10.62 mol, 98.7 % purity] and 105 g Novozyme 435 is heated to 25 °C internal temperature. 1373 g vinyl dodecanoate (5.95 mol, 98 %) is added in 3h using a dosing pump under 50 mbar vacuum. Subsequently, the reaction mixture heated to 35 °C internal temperature and 50 mbar for additional 8h. Acetaldehyde is distilled out under vacuum. Afterwards the suspension is allowed to proceed for additional 8h at 35 °C and 50 mbar.
- the reaction mixture is heated to 115 °C jacket temperature under vacuum to distill methyl (3S)-3-hydroxy-2- methylene-butanoate out of the suspension. In that respect, the vacuum is gradually reduced to 3 mbar and the jacket temperature is increased to 135 °C.
- the product is analyzed by using chiral HPLC standard method. A chemical purity of >99 % is and an enantiomeric excess of >98 % ee is achieved.
- the isolated yield of methyl (3S)-3-hydroxy-2-methylene-butanoate is 41%.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Analytical Chemistry (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention relates to proteins having improved lipase activity, nucleic acid molecules encoding respective proteins having improved lipase activity and methods for chiral resolution of secondary alcohols.
Description
Lipase with improved stereoselectivity and lipase-based chiral resolution methods
Technical field
The present invention relates to proteins with improved lipase activity, nucleic acid molecules encoding said proteins as well as enzymatic methods for the chiral resolution of secondary alcohols.
Background
Enantiomerically enriched or pure alcohols are important compounds for the production of agrochemical or pharmaceutical compounds. Thus, the absolute configuration of the stereocenters of chiral alcohols is crucial for the synthesis of the corresponding active agents. In the production of a desired target molecule, generation of correct chirality is often a challenge.
Various methods for preparing enantiomerically enriched alcohols are known. On the one hand, chiral transition metal catalysts can be used, which, however, are characterized by high costs (G.R. Cook, Transition Metal-Mediated Kinetic Resolution, Current Organic Chemistry, 2000, 4, 869-885; Hirama et al. J. Org. Chem. 1988, 53, 708). Besides metal catalysis, an organocatalytic acyl transfer kinetic resolution is described for bulky ester substrates (Deng et al. J. Org. Chem. 2015, 80, 6, 3159). Also known is the use of enzymes, for example lipases, as biocatalysts for the production of chiral compounds. Kirchner et al. (J. Am. Chem. Soc. (1985), 107, 7072-7076) report two lipases that act as highly stereoselective, practical catalysts in nearly anhydrous organic solvents. Under such “unnatural” conditions the enzymes can asymmetrically catalyze reactions of esterification and transesterification which are not feasible in aqueous solutions because of the domination of hydrolysis. As a result, a number of optically active alcohols, carboxylic acids, and esters have been prepared on a gram scale. Faber and Riva (1992; Synthesis 1992(10), 895-910), and Nascimento et al. (2003, Tetrahedron Asymmetry 14, 311- 311) describe enzymatic resolution reactions of secondary alcohols by transesterification. In addition, Patil et al. (J. Org. Chem. 2008, 73, 4476 - 4483) and Reddy et. al (Synth. Communications, 2003, 33, 3717 - 3726) showed the use of esterases for the enantioselective ester hydrolysis in buffered aqueous systems, however in the presence of DMSO and only moderate yields.
US 4,732,853 A describes a method of making chiral epoxy alcohols by means of enantioselective hydrolysis with lipases.
EP 0716712 Bl describes the lipase-catalyzed acylation of alcohols with diketenes, especially for the production of enantioselective acylated alcohols from racemic alcohols.
WO2012146935A1 discloses modified lipase variants, as well as polynucleotides and recombinant expression vectors encoding the lipase variant polypeptides, as well as methods for producing such lipase variants in selected bacterial and fungal host cells. The specified lipase variants have increased enzyme specificity or enhanced trans-selectivity. Further described are methods of their use for reducing or eliminating trans-fatty acids from substrates.
Although several improvements of lipases have been achieved so far, several limitations arising during the asymmetric synthesis of secondary alcohols or resolution of racemic alcohols still have to be overcome, such as unfavorable equilibrium, substrate and product inhibition, poor thermostability, insufficient substrate specificity and in particular low enantioselectivity of the lipase. In addition, downstream processing and isolation of the target chiral alcohols are challenging within large scale applications while importance of atom economy is largely increased on industrial scale.
Thus, there is a need for further improvement of lipases, in particular with respect to the production of enantiomerically enriched or pure products with increased atom economy, as well as for further process improvements.
Summary
The present invention solves the above-described problems at least partially by providing novel lipases with superior enantiomer selectivity as well as providing novel methods for the chiral resolution of industrially relevant building blocks.
The lipases described herein have certain advantages over known wild-type and other already known lipases. In particular, the modified or variant lipases described herein have the advantage that they can produce enantiomerically enriched or enantiomerically nearly pure or pure compounds better than respective wild- type lipases.
A first aspect of the present invention relates to a protein having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80%, preferably 85%, more preferably 90%, furthermore preferably 92%, even more preferably 95% with the amino acid sequence shown under SEQ ID No. 1 , characterized in that the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is different from L, preferably the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is different from F, preferably the amino acid at position 51 is N, or M; iii. the amino acid at position 52 is different from V, preferably the amino acid at position 52 is L;
iv. the amino acid at position 53 is different from T, preferably the amino acid at position 53 is S, P, I, E, or A; v. the amino acid at position 54 is different from D, preferably the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is different from A, preferably the amino acid at position 55 is R, M, D, Y, S, or I; vii. the amino acid at position 109 is different from G, preferably the amino acid at position
109 is H or F; viii. the amino acid at position 110 is different from M, preferably the amino acid at position
110 is T or V; ix. the amino acid at position 111 is different from A, preferably the amino acid at position
111 is T or S; x. the amino acid at position 117 is different from Y, preferably the amino acid at position 117 is F or S; xi. the amino acid at position 121 is different from Y, preferably the amino acid at position
121 is V; xii. the amino acid at position 122 is different from K, preferably the amino acid at position
122 is Q, A, Y, R, or V; xiii. the amino acid at position 153 is different from H, preferably the amino acid at position 153 is N, Y, D, E, or C; xiv. the amino acid at position 160 is different from T, preferably the amino acid at position 160 is E, C, D, P, I, Q, K, M, S, F, A, or N; xv. the amino acid at position 179 is different from D, preferably the amino acid at position 179 is C; xvi. the amino acid at position 181 is different from A, preferably the amino acid at position 181 is Q; xvii. the amino acid at position 184 is different from A, preferably the amino acid at position 184 is G or T; xviii. the amino acid at position 211 is different from Y, preferably the amino acid at position
211 is E; xix. the amino acid at position 212 is different from A, preferably the amino acid at position
212 is S or P; xx. the amino acid at position 216 is different from Y, preferably the amino acid at position 216 is K or A; xxi. the amino acid at position 234 is different from S, preferably the amino acid at position
234 is K, T or G; xxii. the amino acid at position 235 is different from S, preferably the amino acid at position
235 is V or M;
xxiii. the amino acid at position 236 is different from K, preferably the amino acid at position 236 is T; xxiv. the amino acid at position 238 is different from R, preferably the amino acid at position 238 is A, K, D, E, or Q; xxv. the amino acid at position 240 is different from Y, preferably the amino acid at position 240 is F; xxvi. the amino acid at position 289 is different from D, preferably the amino acid at position 289 is S or G; xxvii. the amino acid at position 291 is different from G, preferably the amino acid at position 291 is E or W; xxviii. the amino acid at position 317 is different from N, preferably the amino acid at position 317 is T; xxix. the amino acid at position 320 is different from N, preferably the amino acid at position
320 is E or G; xxx. the amino acid at position 321 is different from L, preferably the amino acid at position
321 is F.
SEQ ID NO.l refers to a reference protein sequence with lipase activity.
The meaning of amino acid abbreviations A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, Y is derivable herein below in Table 2 under the paragraph sub-titled “Description of the Sequences”.
An “amino acid corresponding to position x” in a first amino acid sequence (e.g. position 44 in SEQ ID No. 1) means herein that an amino acid of a second amino acid sequence, when compared with the first amino acid sequence, appears at position x of the first amino acid sequence in a pairwise sequence alignment of the first amino acid sequence with the second amino acid sequence in case the numbering of the amino acids of the second amino acid sequence differs from the amino acid numbering of the first amino acid sequence.
In the context of the present invention, the term “identity” in respect to sequence identity or sequences being identical to is to be understood as meaning the number of identical amino acids or nucleotides shared over the entire sequence length by a first nucleic or amino acid sequence with another (second) nucleic or amino acid sequence, respectively, expressed in percent.
“Sequence identity” can be determined by alignment of two amino acid or two nucleotide sequences using global or local alignment algorithms comprised for example in known software like GAP or BESTFIT or the Emboss program “Needle”. This software uses the Needleman and Wunsch global alignment algorithm for aligning two sequences, over their entire length, maximizing the number of matches and minimizing the number of gaps. Generally, the default parameters are used, with a gap creation penalty = 10 and gap extension penalty = 0.5 (both for nucleotide and protein alignments). For nucleotides the
default scoring matrix used is DNAFULL and for proteins the default scoring matrix is Blosum62 (Henikoff & Henikoff, 1992, PNAS 89, 10915-10919). Sequence alignments and scores for percentage sequence identity may for example be determined using software, such as EMBOSS, accessible at world wide web site of the EBI (ebi.ac.uk/Tools/emboss/). Alternatively, sequence similarity or identity may be determined by searching against databases (e.g. EMBL, GenBank) by using commonly known algorithms and output formats such as FASTA, BLAST, etc., but preferably hits should be retrieved and aligned pairwise to finally determine sequence identity.
If sequences to be compared with one another are of different length, the identity is to be determined by determining the identity in percent of the number of amino acids or nucleotides, respectively, which the shorter sequence shares with the longer sequence. Preferably, the identity is determined using the known and publicly available computer program ClustalW (Thompson et al., Nucleic Acids Research 22 (1994), 4673-4680). ClustalW is made publicly available by Julie Thompson (Thompson© EMBL- Heidelberg.DE) and Toby Gibson (Gibson@EMBL-Heidelberg.DE), European Molecular Biology Laboratory, Meyerhofstrasse 1, D 69117 Heidelberg, Germany. ClustalW can also be downloaded from various Internet pages, inter alia from IGBMC (Institut de Genetique et de Biologie Moleculaire et Cellulaire, B.P.163, 67404 Illkirch Cedex, France; ftp://ftp-igbmc.u-strasbg.fr/pub/) and from EBI (ftp://ftp.ebi.ac.uk/pub/software/) and all mirrored Internet pages of the EBI (European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK).
Preferably, use is made of the ClustalW computer program of version 1.8 or Clustal 2 to determine the identity between proteins described in the context of the present invention and other proteins. Here, the parameters have to be set as follows: KTUPLE=1, T0PDIAG=5, WIND0W=5, PAIRGAP=3, GAPOPEN=10, GAPEXTEND=0.05, GAPDIST=8, MAXDIV=40, MATRIX=GONNET, ENDGAPS(OFF), NOPGAP, NOHGAP.
Preferably, use is made of the ClustalW computer program of version 1.8 or Clustal 2 to determine the identity for example between the nucleotide sequence of the nucleic acid molecules described in the context of the present invention and the nucleotide sequence of other nucleic acid molecules. Here, the parameters have to be set as follows:
KTUPLE=2, TOPDIAGS=4, PAIRGAP=5, DNAM ATRIXJUB, GAPOPEN=10, GAPEXT=5, MAXDIV=40, TRANSITIONS: unweighted.
“Identity” furthermore means that there is a functional and/or structural equivalence between the nucleic acid molecules in question or the proteins encoded by them. Functional equivalence means that the nucleic acid molecule sequences or the amino acid sequences encode a protein having the activity of a lipase. The nucleic acid molecules which are homologous to the molecules described above and represent derivatives of these molecules are generally variants of these molecules which represent modifications having the same biological function or catalyzing the same reaction, i.e. coding for a protein having the activity of a
lipase. They may be either naturally occurring variants, for example sequences from other species, or mutations, where these mutations may have occurred in a natural manner or were introduced by targeted mutagenesis. Furthermore, the variants may be synthetically produced sequences. The allelic variants may be either naturally occurring variants or synthetically produced variants or variants generated by recombinant DNA techniques. However, concerning the present invention it is decisive that those variants encode proteins having lipase-activity and comprise the amino acid substitutions (replacements), deletions or insertions described herein concerning the proteins according to the invention.
A special type of derivatives are, for example, nucleic acid molecules which differ from the nucleic acid molecules described in the context of the present invention as a result of the degeneracy of the genetic code.
According to the NC-IUBMB (Nomenclature Committee of the International Union of Biochemistry and Molecular Biology) lipases belong to the class of hydrolases (EC 3). Hydrolase is a class of enzymes that commonly perform as biochemical catalysts that use water to break a chemical bond, which typically results in dividing a larger molecule into smaller molecules. Under unnatural, anhydrous, conditions these enzymes can also catalyze reactions of esterification, e.g. acetylation, and transesterification. The group of hydrolases comprises enzymes acting on ester bonds (EC 3.1) encompassing carboxylic ester hydrolases (EC 3.1.1) and as a subgroup lipases (EC 3.1.1.3). Lipases have been identified from plants, mammals and microorganisms including e.g. Pseudomonas, Vibrio, Acinetobacter, Burkholderia, Chromobacterium, cutinase from Fusarium solani (FSC), Candida antarctica A (CalA), Rhizopus oryzae (ROL), Thermomyces lanuginosus (TLL), Rhizomucor miehei (RML), Aspergillus Niger, Fusarium heterosporum, Fusarium oxysporum or Fusarium culmorum.
If a protein has the activity of a lipase, this can be detected with methods known and described in the art.
It is not decisive which method is used for detecting if a protein according to the invention has the activity of a lipase. Preferably, in connection with the present invention, the method is described in the “Example”- section.
In a further embodiment of the invention, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least one of the following modifications: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R;
viii. the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.
Preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least the mutation G109H, i.e. the amino acid at position 109 is H.
Lipase variant proteins according to the invention may exhibit further amino acid modifications (amino acid substitutions, deletions or insertions) compared to the amino acid sequences described herein above in respect to the amino acid sequence shown under SEQ ID No. 1.
In a further embodiment of the invention, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five of the following modifications selected from: i. the amino acid at position 57 is different from N, preferably the amino acid is P; ii. the amino acid at position 109 is different from G, preferably the amino acid is H; iii. the amino acid at position 122 is different from K, preferably the amino acid is R; iv. the amino acid at position 212 is different from A, preferably the amino acid is P; v. the amino acid at position 234 is different from S, preferably the amino acid is K; vi. the amino acid at position 289 is different from D, preferably the amino acid is G.
Preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 212 is P; iv. the amino acid at position 234 is K; and v. the amino acid at position 289 is G.
Further preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 122 is R; iv. the amino acid at position 212 is P; v. the amino acid at position 234 is K; and
vi. the amino acid at position 289 is G.
In a further embodiment of the invention, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.
Preferred embodiments of the invention are proteins according to the invention encoding lipases having the amino acid sequences shown under SEQ ID Nos. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239.
One further embodiment of the invention concerns nucleic acid molecules encoding a protein according to the invention.
Nucleic acid molecules according to the invention can be any kind of nucleic acid, as long as the nucleic acid encodes a protein according to the invention. The nucleic acids can be ribonucleic nucleic acid molecules (e.g. RNA, mRNA) or deoxyribonucleic nucleic acid molecules (DNA, including genomic DNA which may or may not comprise introns and coding DNA).
Of particular interest for the invention are nucleic acid molecules encoding proteins having the activity of a lipase comprising the amino acid sequences shown under SEQ ID Nos. 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80,
82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126,
128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168,
170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210,
212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240.
The invention therefore also concerns nucleic acid molecules encoding a protein having the activity of a lipase selected from the group consisting of
a) nucleic acid molecules comprising the nucleic acid sequences shown under SEQ ID Nos. 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102, 104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138, 140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174, 176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210, 212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240; b) nucleic acid molecules having at least 60%, preferably 70%, more preferably 80%, furthermore preferably 90%, even more preferably 95%, even furthermore preferably 96%, particular preferably 97%, most preferably 98% or especially preferably 99% identity with the nucleic acid sequences shown under a).
In the context of the present invention, the term “hybridizing with” means hybridization under conventional hybridization conditions, preferably under stringent conditions, as described, for example, in Sambrook et al. (Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. ISBN: 0879695773) or Ausubel et al. (Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929). With particular preference, “hybridization” means a hybridization under the following conditions: hybridization buffer:
2xSSC; lOxDenhardt solution (Fikoll 400+PEG+BSA; ratio 1:1:1); 0.1% SDS; 5 mM EDTA; 50 mM Na2HPO4; 250 pg/ml of herring sperm DNA; 50 pg/ml of tRNA; or
25 M sodium phosphate buffer pH 7.2; 1 mM EDTA; 7% SDS hybridization temperature: T = 65 to 68 °C wash buffer: O.lxSSC; 0.1% SDS wash temperature: T = 65 to 68 °C.
Nucleic acid molecules which hybridize with nucleic acid molecules coding for a protein having the activity of a lipase may originate from any organism; accordingly, they may originate from bacteria, fungi, animals, humans, plants or viruses.
Nucleic acid molecules which hybridize with nucleic acid molecules coding for a protein having the activity of a lipase preferably originate from microorganisms, more preferably from fungi or bacteria, most preferably from bacteria.
Nucleic acid molecules which hybridize with the molecules mentioned may be isolated, for example, from genomic or from cDNA libraries. Such nucleic acid molecules can be identified and isolated using the nucleic acid molecules described herein or they can be identified and isolated using parts of these molecules or the reverse complements of these molecules, for example by hybridization according to standard methods (see, for example, Sambrook et al., Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. ISBN: 0879695773; Ausubel et al., Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929) or by amplification using PCR.
The fragments used as hybridization samples may also be synthetic fragments or oligonucleotides prepared using the customary synthesis techniques, whose sequence is essentially identical to the nucleic acid molecule described in the context of the present invention. Once genes which hybridize with the nucleic acid sequences described in the context of the present invention are identified and isolated, the sequence should be determined and the properties of the proteins coded for by this sequence should be analyzed to determine whether they are proteins having the activity of a lipase. Methods of how to determine whether a protein has the activity of a protein having the activity of a lipase are known to the person skilled in the art.
The molecules hybridizing with the nucleic acid molecules described in the context of the present invention comprise in particular fragments, derivatives and allelic variants of the nucleic acid molecules mentioned. In the context of the present invention, the term “derivative” means that the sequences of these molecules differ in one or more positions from the sequences of the nucleic acid molecules described above and are highly identical to these sequences. The differences to the nucleic acid molecules described above may, for example, be due to deletion, addition, substitution, insertion or recombination.
The meaning of nucleotide abbreviations a, c, g, t, and those of abbreviations for degenerate nucleotides r, y, s, w, k, m, b, d, h, v, n is derivable herein below from Table 1 under the paragraph sub-titled “Description of the Sequences”. Which amino acids are encoded by codons comprising degenerate nucleotides is derivable herein below from Table 3 under the paragraph sub-titled “Description of the Sequences”.
Furthermore, the invention relates to recombinant nucleic acid molecules comprising a nucleic acid molecule according to the invention.
In conjunction with the present invention, the term „recombinant nucleic acid molecule" is to be understood to mean a nucleic acid molecule, which contains additional sequences in addition to nucleic acid molecules according to the invention, which do not naturally occur in the combination in which they occur in recombinant nucleic acids according to the invention. Here, the abovementioned additional sequences can be any sequences, preferably they are functional or regulatory sequences (promoters, termination signals, enhancers, ribosome binding sites (rbs), leader sequences enhancing transcription,
translation or RNA stability, subcellular targeting sequences etc.), particularly preferably they are functional or regulatory sequences that are active in microorganisms, and especially particularly preferably they are regulatory sequences that are active in fungi, in particular yeasts or in bacteria. Methods for the creation of recombinant nucleic acid molecules according to the invention are known to the person skilled in the art and include genetic methods such as bonding nucleic acid molecules by way of ligation, genetic recombination, or new synthesis of nucleic acid molecules. Those methods are described e.g. in Sambrok et al. (Molecular Cloning, A Laboratory Manual, 3rd edition (2001) Cold Spring Harbour Laboratory Press, Cold Spring Harbour, NY. ISBN: 0879695773) or Ausubel et al. (Short Protocols in Molecular Biology, John Wiley & Sons; 5th edition (2002), ISBN: 0471250929).
In a further embodiment, the recombinant nucleic acid molecules according to the invention comprise a nucleic acid molecule according to the invention which is linked with regulatory sequences, which initiate transcription in prokaryotic or eukaryotic cells.
“Regulatory sequences, which initiate transcription” in a cell are also known as promoters.
Information concerning regulatory sequences and plasmids are well known to a person skilled in the art and are described e.g. by the Registry of Standard Biological Parts supported by The International Genetically Engineered Machine (iGEM) Foundation (One Kendall Square, Suite B6104, Cambridge, MA 02139, USA) in the world wide web (http://parts.igem.org/Catalog).
Regulatory sequences which initiate transcription in prokaryotic organisms, e.g. E. coli, and in eukaryotic organisms are sufficiently described in literature, in particular such for expression in yeast are described, e.g. Saccharomyces cerevisiae. An overview of various systems for expression for proteins in various host organisms can be found, for example, in Methods in Enzymology 153 (1987), 383-516 and in Bitter et al. (Methods in Enzymology 153 (1987), 516-544) or in Gomes et al. (2016, Advances in Animal and Veterinary Sciences, 4(4), 346) and Baghban et al. (2018, Current Pharmaceutical Biotechnology, 19(6)). Common yeast promoters are pAOXl, pHIS4, pGAL, pScADH2 (Baghban et al., 2018, see above). Common bacterial promoters are T5, T7, rhamnose-inducible, arabinose-inducible, PhoA, artificial trc (trp-lac) promoter as described by Marschall et al. (2017, Appl Microbiol Biotechnol 101, 501-512) and Tegel et al. (2011, FEBS Journal 278, 729-739).
A further embodiment of recombinant nucleic acid molecules of the present invention are vectors or plasmids, which comprise the nucleic acid molecules according to the invention.
“Vectors” are commonly understood in the field of molecular biology and herein to represent a nucleic acid sequence or a vehicle comprising a nucleic acid sequence used to transfer genetic material (DNA or RNA) into a target cell. Vectors can be plasmids, e.g. T-DNA or binary vectors for generating transgenic plants, expression vectors for expression of nucleic acid sequences in a host cell, shuttle vectors which are
eligible to propagate in different hosts, or vectors can be virus particles or bacteriophages having been modified to deliver foreign genetic material into a host.
“Plasmids” are commonly understood in the field of molecular biology and herein to represent an autonomously self-replicating, often circular DNA molecule which is when present in a host cell separated from the chromosomal DNA.
Nucleic acid molecules according to the invention, recombinant nucleic acid molecules according to the invention, vectors or plasmids according to the invention can be used for production of proteins according to the invention, e.g. by expressing the nucleic acid molecules according to the invention in host cells.
Another embodiment of the invention concerns hosts or host cells comprising or expressing a nucleic acid molecule according to invention or comprising proteins according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention.
Nucleic acid molecules according to the invention encoding a protein having the activity of a lipase can be expressed in host cells for e.g. their multiplication or for production of proteins according to the invention. For expression in host cells, nucleic acid molecules according to the invention can be comprised on vectors or plasmids or they can be stably integrated into the genome of a respective host cell. The nucleic acid molecules according to the invention can also be comprised by vectors which support their introduction into host cells.
A further embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case comprising a protein according to the invention.
Another embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case expressing a protein according to the invention.
Another embodiment of the present invention concerns a host or host cell according to the invention comprising a nucleic acid molecule according to the invention or comprising a recombinant nucleic acid molecule according to the invention or comprising a vector according to the invention or comprising a plasmid according to the invention and, in each case expressing a protein, wherein the protein has the activity of a lipase.
“Expressing a nucleic acid molecule” shall be understood herein to mean that in case the nucleic acid molecule is RNA or mRNA the nucleic acid molecule is translated into a protein, preferably translated
into a protein having the activity of a lipase or in case of the nucleic acid molecule is DNA or cDNA it is transcribed (and in case of genomic DNA containing introns is processed) into mRNA, preferably into a mRNA encoding a protein having the activity of a lipase and subsequently translated into a protein, preferably translated into a protein having the activity of a lipase.
Transcription of a given nucleic acid molecule in a host can be demonstrated by methods known to a person skilled in the art, for example, by detection of specific transcripts (mRNA) of foreign nucleic acid molecules by Northern blot analysis or RT-PCR.
Whether hosts or host cells comprise a given protein or comprise a protein which is derived from expressing a nucleic acid molecule can be determined by methods known to a person skilled in the art, for example, by immunological methods, such as Western blot analysis, ELISA (Enzyme Linked Immuno Sorbent Assay) or RIA (Radio Immune Assay). The person skilled in the art is familiar with methods for preparing antibodies which react specifically with a certain protein, i.e. which bind specifically to a certain protein (see, for example, Lottspeich and Zorbas (eds.), 1998, Bioanalytik, Spektrum akad, Verlag, Heidelberg, Berlin, ISBN 3-8274-0041-4). Some companies (Thermo Fisher Scientific, 168 Third Avenue, Waltham, MA USA 0245; GenScript, 60 Centennial Ave., Piscataway, NJ 08854, USA) offer the preparation of such antibodies as an order service.
Furthermore, a person skilled in the art can test if a host or host cell comprises a protein according to the invention by detecting (additional) activity of proteins having the activity of a lipase in a respective host cell. Preferably activity of proteins having additional activity of a lipase in a respective host cell is detected by comparing the activities of lipases of a host cell according to the invention with the respective activity of host cell not comprising a protein according to the invention.
Testing if a protein has the activity of a lipase can be done by methods known in the art.
Host or host cells according to the invention can be produced by a person skilled in the art by known methods for genetically modifying or transforming organisms.
A further subject of the present invention therefore is a host or host cell according to the invention, particularly a prokaryotic or eukaryotic host or host cell, which is genetically modified (or transformed) with a nucleic acid molecule according to the invention or with a recombinant nucleic acid molecule according to the invention or with a vector according to the invention or a plasmid according to the invention. Preferably the genetically modified (transformed) host or host cell according to the invention expresses a protein having the activity of a lipase, more preferably, the genetically modified (transformed) host or host cell according to the invention expresses a protein according to the invention.
“Genetically modified with a nucleic acid molecule” or “transformed with a nucleic acid molecule” shall be understood herein to mean that a nucleic acid molecule is or was introduced into a host or host cell by
technical and/or non-naturally occurring means, preferably by technical methods in the field of molecular biology, biotechnology or genetic modification.
Descendants, offspring or progeny of hosts or host cells according to the invention are also an embodiment of the invention, preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention or comprise a protein according to the invention, more preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention and, in each case express a protein, wherein the protein has the activity of a lipase, even more preferably these descendants, offspring or progeny comprise a nucleic acid molecule according to the invention or comprise a recombinant nucleic acid molecule according to the invention or comprise a vector according to the invention or comprise a plasmid according to the invention and, in each case express a protein, wherein the protein has the activity of a lipase according to the invention.
The host or host cell according to the invention can be a host or host cell from any prokaryotic or eucaryotic organism. The hosts or host cells can be bacteria or bacteria cells (e.g. E. coli, bacteria of the genus Bacillus, in particular Bacillus subtilis, Agrobacterium, particularly Agrobacterium tumefaciens or Agrobacterium rhizogenes, Pseudomonas, particularly Pseudomonas fluorescens, Streptomyces spp, Rhodococcus spp, in particular Rhodococcus rhodochrous, Vibrio natrigens, Corynebacterium, particularly Corynebacterium glutamicum) or fungi or fungal cells (e.g. Agaricus, in particular Agaricus bisporus, Aspergillus, Trichoderma or yeasts, particularly .S’, cerevisiae, Pichia ssp. like P. pastoris), as well as plants or plant cells or they can be animals or animal cells.
Preferred host cells according to the invention are cells of microorganisms. Within the framework of the present patent application, this is understood to include all bacteria and all protists (e.g. fungi, particularly yeasts and algae), as they are defined in Schlegel "General Microbiology " (Georg Thieme Publishing House (1985), 1-2), for example.
In respect to microorganisms, the hosts or host cells according to the invention are preferably bacteria/bacteria cells or yeast/yeast cells, most preferably they are bacteria/bacteria cells. Concerning bacteria/bacteria cells, the hosts or host cells according to the invention are preferably Bacillus species/ Bacillus species cells or Escherichia coli! Escherichia coli cells cells most preferably Escherichia coli/ Escherichia coli cells.
Alternatively, Pseudomonas, particularly Pseudomonas fluorescens, Streptomyces spp, Rhodococcus spp, in particular Rhodococcus rhodochrous, Vibrio spp, particularly Vibrio natrigens, Corynebacterium, particularly Corynebacterium glutamicum or others can be hosts or host cells according to the invention.
A preferred embodiment of the invention concerns hosts or host cells according to the invention comprising a nucleic acid molecule according to the invention, wherein the nucleic acid molecule according to the invention is characterized in that the codons of said nucleic acid molecule are changed such that they are adapted to the frequency of use of the codons of the host or a host cell, respectively.
Host cells according to the invention can be used for production of proteins according to the invention. Proteins according to the invention can be used in methods for production of enantiomerically enriched or nearly enantiomerically pure secondary alcohols.
A second aspect of the present invention relates to a method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I), comprising contacting the substrate with a variant protein having the activity of a lipase according to the first aspect of the present invention or with a lipase according to SEQ ID No. 1 ,
wherein R1 and R2 are independently from each other selected from a substituted or unsubstituted (n)-alkyl, iso-alkyl, unsubstituted aryl, alkyl-substituted or aryl-substituted aryl.
Preferably, the substrate is contacted with a variant protein having the activity of a lipase according to the first aspect of the present invention.
R1 and and R2 are preferably independently from each other chosen to be a linear or branched Ci-io residue.
Further preferably, R1 is selected from methyl, iso-butyl, tert-butyl, and iso-propyl; and/or R2 is methyl.
Even further preferably, R1 is selected from methyl, iso-butyl, tert-butyl, and iso-propyl; and R2 is methyl.
In a further embodiment of the invention, the method comprises hydrolysing a racemic substrate of formula (II- 1) in an enantiomerically selective manner to separate an enantiomerically enriched or pure compound of formula (1-1),
The method preferably comprises isolating the compound (I) or (1-1), and/or (III) or (III-l) after hydrolysis. The isolation of compounds (1-1) can be done by any means which are known to those skilled in the art and is preferably done via extraction and/or direct distillation.
The lipase variants or protein variants according to the invention show improved selectivity and/or improved specific activity regarding the stereoselective hydrolysis of methyl-3-hydroxy-2-methylene- butanoate (formula II- 1) and are better adapted to isolate enantiomerically enriched or nearly pure substrate methyl (3S)-3-hydroxy-2-methylene-butanoate (formula 1-1) compared to a wild-type lipase. Compounds of formula (I)
represent an important building block in the synthesis of complex agrochemical compounds. In particular, methyl (3S)-3-hydroxy-2-methylene-butanoate of formula (1-1)
(1-1 ) which is the S-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, is an important intermediate in the respective synthesis of agrochemical compounds described in WO 2018/228985. The lipase variants according to the present invention selectively hydrolyse the R-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, allowing the separation of the S-enantiomer of said racemic methyl 3 -hydroxy-2-methylene-butanoate .
In a further embodiment of the invention, the method is conducted in an aqueous solution.
Preferably, the method is conducted in a mixture or biphasic liquid system of water and an organic solvent, such as methyl-tert-butylether, toluene, 2-methyltetrahydrofuran, methyl isobutyl ketone, cyclohexane, cyclopentyl methyl ether, chlorobenzene, tert-amylmethylether, ethyl acetate, isopropyl acetate. Even further preferably, the organic solvent is methyl-tert-butylether (MTBE).
The ratio of organic solvent to water is further preferably in the range from 1 : 1 to 6: 1 , even more preferably 2:1 to 4:1.
Surprinsingly, it was found that a significant reduction of the aqueous phase, substituted partially by an organic solvent, not only allows for higher concentration of the final product, thus increasing the productivity, but also allows for the reduction of the enzyme / lipase load.
Furthermore, the presence of an organic solvent, such as MTBE, promotes an efficient downstream processing of the reaction products, i.e. isolation of compounds of the formula (I) or (1-1).
In a further embodiment of the invention, the method is carried out at a temperature of between 20 and 60 °C, preferably between 30 and 55 °C. Further preferably, the method is carried out at a temperature of between 35 and 50 °C.
In a further embodiment of the invention, the method is carried out for at least 1 h, preferably at least 2 h, further preferably at least 3 h.
In a further embodiment of the invention, the method is carried out for at most 40 h, preferably at most 30 h, further preferably at most 20 h.
In a further embodiment of the invention, the method is carried out at a pH of between 7 and 8.5, preferably between 7.2 and 8.0, further preferably between 7.4 and 7.6. The pH may be adjusted by addition of organic or inorganic bases, preferably by the addition of inorganic bases, such as hydrogen carbonates or carbonates, hydrogen phosphates and hydroxides.
At a pH above 8.5 the enzyme is deactivated; further, there is partial hydrolysis of the ester of formula (I). Below pH 7, the reaction rate decreased and finally the reaction stops. Further below, at a pH of 4.5 or less, the enzyme is deactived.
The pH may be controlled by addition of a base, such as NaOH, KOH, K2CO3, KHCO3, Na2CO3, NaHCO i. Preferred bases are carbonates or hydrogencarbonates. The base can be added as solid or aqueous solution, diluted or saturated. The base may be added to the reaction before the substrate is dosed, or the base is dosed into the reaction in parallel to the substrate. The substrate and/or base can be dosed or added in one portion.
The lipase of SEQ ID No. 1 or the protein variant according to the first aspect of the present invention may be provided as purified enzyme or in form of spray-dried or freeze-dried biomass or as broth. Preferably, the protein variant is provided as broth, i.e. as cell culture, or the broth is centrifugated and the cell pellet is freeze-dried to obtain lyophilized cells or biomass.
While lyophilized biomass is easy to handle in terms of independent timing for performing the claimed method, the use of the broth/cell culture directly to perform the claimed method is cost and time efficient.
For downstream processing, the reaction is preferably conducted in a mixture of water and a water- immiscible solvent. In case of conducting the reaction in a biphasic mixture comprising water and an immiscible organic solvent, the phases are separated and the aqueous phase is eventually back-extracted. The biomass can be separated by known technical means (e.g. centrifuge, filtration or decantation) before distillation or the distillation can take place without separation of the biomass.
It is to be understood in the context of the present invention that all embodiments concerning the method according to the second aspect can be combined with all different lipase variants according to the first aspect of the invention, and in particular with the preferred lipase variants. I.e. the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may be carried out by means of a protein having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80%, preferably 85%, more preferably 90%, furthermore preferably 92%, even more preferably 95% with the amino acid sequence shown under SEQ ID No. 1 , characterized in that the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is different from L, preferably the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is different from F, preferably the amino acid at position 51 is N, or M; iii. the amino acid at position 52 is different from V, preferably the amino acid at position 52 is L; iv. the amino acid at position 53 is different from T, preferably the amino acid at position 53 is S, P, I, E, or A; v. the amino acid at position 54 is different from D, preferably the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is different from A, preferably the amino acid at position 55 is R, M, D, Y, S, or I; vii. the amino acid at position 109 is different from G, preferably the amino acid at position 109 is H or F;
viii. the amino acid at position 110 is different from M, preferably the amino acid at position 110 is T or V; ix. the amino acid at position 111 is different from A, preferably the amino acid at position 111 is T or S; x. the amino acid at position 117 is different from Y, preferably the amino acid at position 117 is F or S; xi. the amino acid at position 121 is different from Y, preferably the amino acid at position
121 is V; xii. the amino acid at position 122 is different from K, preferably the amino acid at position
122 is Q, A, Y, R, or V; xiii. the amino acid at position 153 is different from H, preferably the amino acid at position 153 is N, Y, D, E, or C; xiv. the amino acid at position 160 is different from T, preferably the amino acid at position 160 is E, C, D, P, I, Q, K, M, S, F, A, or N; xv. the amino acid at position 179 is different from D, preferably the amino acid at position 179 is C; xvi. the amino acid at position 181 is different from A, preferably the amino acid at position 181 is Q; xvii. the amino acid at position 184 is different from A, preferably the amino acid at position 184 is G or T; xviii. the amino acid at position 211 is different from Y, preferably the amino acid at position
211 is E; xix. the amino acid at position 212 is different from A, preferably the amino acid at position
212 is S or P; xx. the amino acid at position 216 is different from Y, preferably the amino acid at position 216 is K or A; xxi. the amino acid at position 234 is different from S, preferably the amino acid at position
234 is K, T or G; xxii. the amino acid at position 235 is different from S, preferably the amino acid at position
235 is V or M; xxiii. the amino acid at position 236 is different from K, preferably the amino acid at position
236 is T; xxiv. the amino acid at position 238 is different from R, preferably the amino acid at position 238 is A, K, D, E, or Q; xxv. the amino acid at position 240 is different from Y, preferably the amino acid at position 240 is F; xxvi. the amino acid at position 289 is different from D, preferably the amino acid at position 289 is S or G;
xxvii. the amino acid at position 291 is different from G, preferably the amino acid at position 291 is E or W; xxviii. the amino acid at position 317 is different from N, preferably the amino acid at position 317 is T; xxix. the amino acid at position 320 is different from N, preferably the amino acid at position
320 is E or G; xxx. the amino acid at position 321 is different from L, preferably the amino acid at position
321 is F.
The amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by at least one of the following modifications: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R; viii. the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.
Preferably, the amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) differs from the amino acid sequence of SEQ ID NO. 1 by at least the mutation G109H, i.e. the amino acid at position 109 is H.
The amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five of the following modifications selected from: i. the amino acid at position 57 is different from N, preferably the amino acid is P; ii. the amino acid at position 109 is different from G, preferably the amino acid is H;
iii. the amino acid at position 122 is different from K, preferably the amino acid is R; iv. the amino acid at position 212 is different from A, preferably the amino acid is P; v. the amino acid at position 234 is different from S, preferably the amino acid is K; vi. the amino acid at position 289 is different from D, preferably the amino acid is G.
Preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 212 is P; iv. the amino acid at position 234 is K; and v. the amino acid at position 289 is G.
Further preferably, the amino acid sequence of the variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 122 is R; iv. the amino acid at position 212 is P; v. the amino acid at position 234 is K; and vi. the amino acid at position 289 is G.
The amino acid sequence of the variant protein lipase used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate enantiomerically enriched or pure compounds of formula (I) may differ from the amino acid sequence of SEQ ID NO. 1 by the following modifications: the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.
Preferred proteins according to the invention encoding lipases that may be used in the method for hydrolysing a racemic substrate of formula (II) in an enantiomerically selective manner to separate
enantiomerically enriched or pure compounds of formula (I) are shown under SEQ ID Nos. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69,
71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119,
121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161,
163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203,
205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239.
A further embodiment of the invention is the use of a protein according to the first aspect of the invention for the stereoselective hydrolysis of racemic methyl-3-hydroxy-2-methylene-butanoate to isolate methyl (3S)-3-hydroxy-2-methylene-butanoate.
“Enantiomerically enriched” means herein that one of two enantiomers is present in a composition in higher amounts than the other enantiomer, preferably at least 60% of one enantiomer is present in the composition, more preferably at least 65% of one enantiomer is present in the composition, further more preferably at least 70% of one enantiomer is present in the composition, even more preferably at least 75% of one enantiomer is present in the composition, even further more preferably at least 80% of one enantiomer is present in the composition, particular preferably at least 85% of one enantiomer is present in the composition, most preferably at least 90% of one enantiomer is present in the composition or especially preferably at least 94% of one enantiomer is present in the composition.
“Enantiomerically nearly pure” means herein that one of two enantiomers is present in a composition in amounts of at least 95.0%, preferably one of two enantiomers is present in a composition in amounts of at least 95.5%, more preferably one of two enantiomers is present in a composition in amounts of at least 96.0%, further more preferably one of two enantiomers is present in a composition in amounts of at least 96.5%, even more preferably one of two enantiomers is present in a composition in amounts of at least 97.0%, even further more preferably one of two enantiomers is present in a composition in amounts of at least 98.0%, particular preferably one of two enantiomers is present in a composition in amounts of at least 98.5%, most preferably one of two enantiomers is present in a composition in amounts of at least 99.0%, or especially preferably one of two enantiomers is present in a composition in amounts of at least 99.5%.
A third aspect of the present invention relates to a method for the enantiomeric enrichment of compounds of formula (I)
comprising reacting compounds of formula (II) with compounds of formula (IV)
and in particular
with R1 and R2 being defined as above, and R3 being a substutituted or unsubstituted (n) -Alkyl or isoalkyl, in the presence of a lipase under anhydrous conditions.
“Anhydrous conditions” refer to a reaction system comprising at most 3 % water in the liquid phase. The reaction preferably takes place in an organic solvent, such as hexane or n-heptane, or without any additional solvent.
R3 can preferably be a linear Ci-i6 residue, in particular methyl or unsubstituted and saturated Cs-Cn residue. Compound (IV) can be in particular vinyl acetate or vinyl laurate.
The lipase is preferably immobilized on a solid carrier during the action. Immobilization leads to increased enzyme stability in non-aqueous solutions.
The reaction is preferably carried out at a temperature of between 20 and 40°C, preferably 25 to 35°C. Such temperature range allows for optimal enzymatic reacticity.
The reaction is preferably carried out for at least 1 h, further preferably at least 3 h, even further preferably at least 5 h.
The lipase is preferably recycled after the reaction has taken place. This allows the method to be designed in a cost-efficient manner. The lipase can thus be used multiple times for the reaction according to the third aspect or a further reaction.
The compound of formula (I) is preferably isolated by means of distillation directly from the reaction mixture or from the supernatant, obtained after decanting the reaction mixture. The lipase preferably remains in the remaining reaction mixture and can be reused for another reaction.
The lipase is preferably CALB lipase. CALB is a non-specific lipase originating from Candida antarctica B and was first described in 1994 (Uppenberg J, Patkar S, Bergfors T, Jones TA (1994) J Mol Biol 235(2):790-792).
The lipase is furthermore preferably immobilized on a hydrophobic carrier, such as acrylic resin. A commercial version of CALB is, for example, Novozym® 435, which can be used for carrying out the reaction.
Detailed description
The polypeptides (enzymes, i.e. lipases) as well as the processes according to the present invention allow for an efficient enantiomeric enrichment of compounds of the formula (I)
It was found that introduction of certain amino acid modifications into protein variants according to the invention improves the activity of the lipase, in particular in respect to its substrate specificity, meaning that these further modified lipase variants are better adapted to produce enantiomerically enriched or
nearly pure products compared to known lipase variants. This in particular refers to the substrate (1-1).
(1-1 ) which is the S-enantiomer of racemic methyl 3-hydroxy-2-methylene-butanoate, is an important intermediate in the respective synthesis of the compounds described in WO 2018/228985. The lipase variants of the present invention catalyse entioselectively the hydrolysis of the racemic methyl 3-hydroxy- 2-methylene-butanoate into (3R)-3-hydroxy-2-methylene- butanoic acid, leaving methyl (3S)-3-hydroxy-
2-methylene-butanoate unhydrolyzed.
The terms used herein are known to those skilled in the art. Otherwise, the following definitions are used:
For the purposes of the present invention, the term “alkyl” includes saturated hydrocarbon residues which can be branched or straight-chain and unsubstituted or at least monosubstituted. Examples of suitable alkyl residues, which can be unsubstituted or mono- or polysubstituted, are methyl, ethyl, n-propyl, isopropyl, n-butyl, isobutyl, 2-butyl, tert-butyl, n-pentyl, 2-pentyl, 3-pentyl, isopentyl, neo-pentyl, n-hexyl, 2-hexyl,
3-hexyl, n-heptyl, n-octyl, -C(H)(C2H5)2, -C(H)(n- C3H7)2 and -CH2-CH2-C(H)(CH3)-(CH2)3-CH3.
Unless defined otherwise elsewhere, the term “aryl” either alone or in combination with other terms, is to be understood as a mono- or polycyclic, preferably a mono- or bicyclic, aromatic hydrocarbon radical with preferably 6, 10 or 14 carbon atoms. An aryl radical can be unsubstituted or monosubstituted or polysubstituted, either with identical or different substituents.
Description of the Sequences
Throughout the application, nucleotide and amino acid abbreviations are used according to the following IUPAC codes:
For discrimination between amino acids and nucleotides, capitalized nucleotide code abbreviations given in above Table are written herein in lower case.
Codon usage follows herein the so called “general genetic code” according to the following Table, wherein “f ’ is to be substituted by “u” in ribonucleic acid (RNA) sequences.
Table 3
The sequence listing associated with this application is filed in electronic format and hereby incorporated by reference into the specification in its entirety. “PRT” stands for “protein” and “NUC” for “nucleic acid”. Table 4a: Lipase variants according to the first aspect of the present invention (SEQ ID No. 7-169) and further lipases (SEQ ID No. 1-6)
Table 4b: Further lipase variants according to the present invention (protein SEQ ID No. 171-239). The variants described contain the backbone mutations N57P, K122R, A212P, S234K, D289G and G109H (SEQ ID No. 171, 172), unless (i) the mutation “R122K” is indicated in addition, which means that the K122R mutation has been back- mutated such that the position 122 carries an K again, or (ii) the mutation “R122Q” is indicated in addition, which means that the K122R mutation has been further mutated such that the position 122 carries a Q.
Examples
The following examples explain the present invention, but without restricting it.
Starting materials and protocols
Analytical grade chemicals and ready-to-use kits were sourced and used from common suppliers such as Sigma Aldrich, Acros Organics, Fisher Scientific, Qiagen or Stratagene. Novozym® 435 is bought from Sigma Aldrich.
Example 1
Cloning of lipase variants:
Nucleic acid sequences of SEQ ID No. 1 or their variants were cloned into an expression vector based on pKA81. The genetic elements were incorporated into the vector by means known in the art. For expression of the different lipase variants, the vectors were introduced into electrocompetent Escherichia coli (E. coli) W3110 cells.
Variant library:
Nucleotides were substituted in the wild-type sequence or sequences derived therefrom to allow for amino acid substitution. This exchange can be achieved by various molecular biological methods. One method for replacing nucleic acids is site-directed mutagenesis, which can result in mutations at one or more sites in the amino acid sequence. Methods for site-directed mutagenesis are state of the art and are described in the literature (e.g. Directed Mutagenesis: A Practical Approach, 1991, Edited by M.J. McPHERSON, IRL PRESS) and can be purchased as ready-made kits (e.g. the QUIKCHANGE™ lightening mutagenesis kit
from Qiagen or Stratagene). After inserting mutations into the gene sequence and culturing in a suitable E. coli cloning strain, the plasmids obtained were transformed into Escherichia coli W3110.
Transformed cells were tested in appropriate biotransformation reactions in order to determine product yield and selectivity. Appropriate biotransformation reactions are described below, see Example 2. Sequence verification was performed as known in the art.
Glycerol stocks of the E. coli cultures transformed with the respective expression plasmids were prepared by adding one volume of 40% glycerol solution to one volume of E. coli culture.
For isolation of single bacterial colonies, appropriate dilutions of E. coli cultures were plated onto LB- Agar plates containing suitable concentrations of kanamycin and incubated at 37°C until single colonies were obtained.
Example 2
Cultivation:
To prepare precultures, sterilized 2 mL 96-well deep-well plates (Eppendorf, Hamburg, Germany) were used and mixed with 590 pL TB medium (50 pg/ml kanamycin) and 10 pL of the glycerol stock of the respective E. coli strain filled. Alternatively, 590 pL of TB medium (50 pg/ml kanamycin) were inoculated with cell material from a colony from an agar plate. The precultures were incubated at 37° C and 250 rpm for 17 h.
Sterile 2 mL 96-well deep-wells with 510 pL TB medium (50 mg/L kanamycin) were used to produce main cultures. The expression cultures were inoculated with 30 pl of preculture and incubated at 37° C. and 250 rpm. After 4 h incubation, the enzyme expression was induced by adding 60 pL IPTG (10 rnM IPTG diluted in expression medium, supplemented with 50 mg/L kanamycin). The expression culture plates were then incubated for 20 h at 28° C and 250 rpm.
Cells were harvested by centrifugation for 15 min at 4°C and 2500 x g. The culture supernatant was discarded, and the cell pellet was suspended in 200 pL PBS. The cells were then lyophilized for 24 h and stored at 4°C until use.
Biotransformation, chiral resolution and analytics:
The chiral resolution of methyl (3S)-3-hydroxy-2-methylene-butanoate was conducted in microtiter plates with the lyophilized supernatants. For the chiral resolution, 100 pl racemic methyl-3-hydroxy-2- methylene-butanoate, 105 pl MTBE (Methyl-tert-butylether) and 45 pl ultrapure water (with 416.6 g/L KHCO3) were used in each well. Each plate was sealed and incubated for 6 h at 45 °C in a shaker at 300 rpm. The reaction was then stopped by adding 22.5 pl of 20% H2SO4 solution. For further extraction, 1 ml MTBE was added per well in the deep well plates. The plates were shaken at room temperature for 10
min and then centrifuged at 2500xg for 10 min. 10 pl of the organic phase were then transferred to 96- well PCR plates containing 100 pl MTBE and analyzed by HPLC.
Analytical HPLC method
The samples were analyzed using HPLC with the following settings:
Instrument: Agilent Technologies 1290 Infinity II; Column: Lux cellulose-2, 100x4.6 mm, 3 pm; Eluent A: premixed heptane (+0.05% formic acid); eluent B: ethanol; flow: isocratic (90% eluent A/10% eluent B), flow rate: 0.8 mL/min; temperature: 25 °C; sample injection volume: 1 pL; Detection: absorption at 210 nm.
Racemic methyl-3-hydroxy-2-methylene-butanoate and racemic 3-hydroxy-2-methylene-butanoic acid (prepared as for the biotransformation samples) were used as a reference substance and as a standard for quantification. Appropriate dilutions of the standards were used in order to be able to quantify the substrate used and the resulting product using standard lines. The samples were analyzed for substrate conversion and product formation. The evaluation and comparison of individual samples with one another was carried out by determining the substrate and product ee[%].
Example 3: Enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene-butanoate to (3R)-3- hydroxy-2-methylene-butanoic acid
The cultivation, biotransformation and HPLC analysis were carried out as described in example 2. During cultivation, glycerol cultures were used for inoculation. The activity and selectivity results of enzyme variants with individual point mutations can be found below in Tables 5 and 6. The activity and selectivity results of enzyme variants with combined mutations can be found in Tables 7 and 8. The activity and selectivity results of enzyme variants with additional combined mutations can be found in Tables 9 and 10.
A high selectivity and a high activity of the respective enzyme variant are decisive for an efficient conversion of racemic methyl 3-hydroxy-2-methylene-butanoate to (3R)-3-hydroxy-2-methylene- butanoic acid in order to generate an almost enantiomerically pure product in high yield.
The enzyme selectivity ee [%] 3-hydroxy-2-methylene-butanoate is defined by the difference of the mole fraction of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene- butanoic acid divided by the sum of the mole fraction of (3R)-3-hydroxy-2-methylene-butanoic acid and (3 S) -3 -hydroxy-2-methylene-butanoic acid.
(3R)3hydroxy2methylenebutanoic acid — (3S)3hydroxy2methylenebutanoic acid
- * 100
(3R)3hydroxy2methylenebutanoic acid + (3S)3hydroxy2methylenebutanoic acid
The enzyme activity ee [%] for the conversion of the mole fraction of the substrate methyl (3R)3- hydroxy-2-methylene-butanoate is described by the difference between the mole fraction of methyl-(3S)3- hydroxy-2-methylene-butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate divided by the sum of methyl-(3S)3-hydroxy-2-methylene-butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate. methyl (3 R)3hydroxy2methylenebutanoate — methyl(3S)3hydroxy2methylenebutanoate
- * 100 methyl (3 R)3hydroxy2methylenebutanoate + methyl(3S)3hydroxy2methylenebutanoate
The lipase of SEQ ID No. 1 shows a selectivity of 100 % and an activity of 100%.
Table 5: Lipase variants showing a relative improvement in enzyme activity compared to the lipase of SEQ ID No. 1. The relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.l) in percent. The substrate ee[%] of the lipase of SEQ ID No. 1 is 9.8.
Table 6: Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 1. The relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent. The product ee[%] of the reference lipase is 75.8.
Table 7: Lipase variants showing a relative improvement in enzyme activity compared to the reference lipase of SEQ ID No. 1. The relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.1) in percent.
Table 8: Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 1. The relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent.
Example 4:
Variants based on the backbone mutations N57P, K122R, A212P, S234K, D289G and G109H have been supplemented with further mutations in the sequence as described in the examples above and tested for enzyme activity and selectivity as described in Example 3.
Table 9: Lipase variants showing a relative improvement in enzyme activity compared to the reference lipase of SEQ ID No. 171. The relative improvement in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.171) in percent. All lipase variants in the table carry in addition the mutations N57P, K122R, A212P, S234K, D289G, G109H.
Table 10: Lipase variants showing a relative improvement in enzyme selectivity compared to the reference lipase of SEQ ID No. 171. The relative improvement in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2- methylene-butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 171 in percent. All lipase variants in the table carry in addition the mutations N57P, K122R, A212P, S234K, D289G, G109H.
Example 5: Enzymatic hydrolysis of different esters
Next to a methyl ester of 3-hydroxy-2-methylene-butanoate (a), also the enzymatic hydrolysis of the respective tert-butyl ester (b), iso-butyl-ester (c), and iso-propyl ester (d) were tested under suitable conditions (1 mL scale, 5 g/L ester compound, 5 g/L lyophilizate of enzyme of SEQ ID No. 1, 100 mM KPi buffer pH 8):
All esters (a) to (d) allow for an efficient enantiomeric resolution, wherein the (R)-Ester is preferably hydrolysed:
* compared to timepoint-zero control
Example 6: Alternative solvents
Several different solvents have been tested for the chiral resolution of methyl 3-hydroxy-2-methylene- butanoate. Reaction conditions were: 100 mL total volume, 300 g/L racemic methyl 3-hydroxy-2- BCSmethylene-butanoate, 20 g/L spray dried lipase of SEQ ID No. 1, 70% solvent, 0.54 eq KHCO3, pH 8.5 (titration with 40% w/v K2CO3), 45 °C, 6 h. In particular MTBE, CPME, MIBK and toluene allowed for a high enantiomeric excess of the hydrolysed product (3R)-3-hydroxy-2-methylene-butanoic acid (ee).
Example 7: Further examples on enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene- butanoate to (3R)-3-hydroxy-2-methylene-butanoic acid
Next to the lipase of SEQ ID No.1 , three further related variants have been tested, namely variants with mutations Q295V and K298A (SEQ ID No. 3); as well as with mutations V33I and I254S (SEQ ID No. 5); as well as with mutations T188S, I254S, P302L and Q304E (SEQ ID No. 241).
The cultivation, biotransformation and HPLC analysis were carried out as described in example 2. As described in example 3, the enzyme selectivity ee [%] 3-hydroxy-2-methylene-butanoate is defined by the difference of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene-butanoic acid divided by the sum of (3R)-3-hydroxy-2-methylene-butanoic acid and (3S)-3-hydroxy-2-methylene- butanoic acid. The enzyme activity ee [%] for the conversion of the substrate methyl (3R)3-hydroxy-2- methylene-butanoate is described by the difference between methyl-(3S)3-hydroxy-2-methylene- butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate divided by the sum of methyl-(3S)3- hydroxy-2-methylene-butanoate and methyl-(3R)3-hydroxy-2-methylene-butanoate. The reference lipase of SEQ ID No. 1 shows a selectivity of 100 % and an activity of 100%
Table 11: Comparative lipase variants showing a similar enzyme activity profile compared to the reference lipase of SEQ ID No. 1. The relative difference in enzyme activity is defined as the quotient of the enantiomeric excess ee[%] of the respective variant and the enantiomeric excess ee[%] of the reference lipase (SEQ ID No.1) in percent.
Table 12: Comparative lipase variants showing a similar enzyme selectivity compared to the reference lipase of SEQ ID No. 1. The relative difference in enzyme selectivity is defined as the quotient of ee[%] 3-hydroxy-2-methylene- butanoic acid of the respective variant and the ee[%] of 3-hydroxy-2-methylene-butanoic acid of the reference lipase of SEQ ID No. 1 in percent.
As shown in the Tables 11 und 12, the comparative lipases worsen the activity of the lipase, and do not improve the selectivity of the discussed substrate.
Example 8: Enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene-butanoate to (3R)-3- hydroxy-2-methylene-butanoic acid
8.0 g spray dried lipase SEQ ID No. 1 were suspended in 50 g water and diluted with 100 mL MTBE. After warming to 45 °C a mixture of 120 g racemic methyl 3-hydroxy-2-methylene-butanoate (99% purity) in 100 g MTBE were dosed into the suspension over 3 h. After full dosage, the mixture was further stirred at 45 °C over night. During dosage and additional stirring time, the pH was kept at 7.5 by the parallel addition of 40% aqueous potassium carbonate solution. Afterwards, the biomass was separated by centrifugation and the phases were separated. After extraction of the aqueous phase with 2x 125g MTBE each, the combined organic extracts were concentrated under reduced pressure at 45 °C yielding methyl (3S)-3-hydroxy-2-methylene-butanoate with 91% purity, 96% ee and and corrected yield of 40%.
Example 9: Enzymatic hydrolysis of racemic Methyl 3-hydroxy-2-methylene-butanoate to (3R)-3- hydroxy-2-methylene-butanoic acid
120 g water were warmed to 45 °C and 2.0 g spray dried lipase SEQ ID No. 1 were added, followed by 280 mL MTBE. At 45 °C, 120 g racemic methyl 3-hydroxy-2-methylene-butanoate (99% purity) were dosed into the suspension over 2 h. After full dosage, the mixture was further stirred at 45 °C for 8,5h. During dosage and additional stirring time, the pH was kept at 7.5 by the parallel addition of 40% aqueous potassium carbonate solution. Afterwards, the lower aqueous phase was separated and back-extracted once with 100 ml MTBE. The extract was combined with the upper organic phase and the mixture was distilled at 33-55 °C and reduced pressure down to 10 mbar. After addition of 50 g high boiling Marlotherm, the residue was further distilled at 3-10 mbar up to 150 °C yielding 58.2 g (92% purity, 96% ee, 45% yield) methyl (3S)-3-hydroxy-2-methylene-butanoate with 91% purity, 96% ee and and corrected yield of 40%.
Example 10: Enantioselective acylation of racemic methyl-3-hydroxy-2-methylene-butanoate
A suspension of 1400 g racemic methyl 3-hydroxy-2-methylene-butanoate [10.62 mol, 98.7 % purity] and 105 g Novozyme 435 is heated to 25 °C internal temperature. 1373 g vinyl dodecanoate (5.95 mol, 98 %) is added in 3h using a dosing pump under 50 mbar vacuum. Subsequently, the reaction mixture heated to 35 °C internal temperature and 50 mbar for additional 8h. Acetaldehyde is distilled out under vacuum. Afterwards the suspension is allowed to proceed for additional 8h at 35 °C and 50 mbar. The reaction mixture is heated to 115 °C jacket temperature under vacuum to distill methyl (3S)-3-hydroxy-2- methylene-butanoate out of the suspension. In that respect, the vacuum is gradually reduced to 3 mbar and the jacket temperature is increased to 135 °C. The product is analyzed by using chiral HPLC standard method. A chemical purity of >99 % is and an enantiomeric excess of >98 % ee is achieved. The isolated yield of methyl (3S)-3-hydroxy-2-methylene-butanoate is 41%.
Claims
1. A protein variant having the activity of a lipase wherein the protein is encoded by an amino acid sequence having at least 80% identity with the amino acid sequence shown under SEQ ID No. 1 , characterized in that the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 in at least one of the following positions: i. the amino acid at position 44 is M, W or Y; ii. the amino acid at position 51 is N, or M; iii. the amino acid at position 52 is L; iv. the amino acid at position 53 is S, P, I, E, or A; v. the amino acid at position 54 is Q, M, F, G, E, L, T or P; vi. the amino acid at position 55 is R, M, D, Y, S, I; vii. the amino acid at position 109 is H or F; viii. the amino acid at position 110 is T or V; ix. the amino acid at position 111 is S; x. the amino acid at position 117 is F or S; xi. the amino acid at position 121 is V; xii. the amino acid at position 122 is Q, A, Y, R, or V; xiii. the amino acid at position 153 is Y, D, E, or C; xiv. the amino acid at position 160 is E, C, D, P, I, Q, K, M, S, F, A, or N; xv. the amino acid at position 179 is C; xvi. the amino acid at position 181 is Q; xvii. the amino acid at position 184 is G or T; xviii. the amino acid at position 211 is E; xix. the amino acid at position 212 is S; xx. the amino acid at position 216 is K or A; xxi. the amino acid at position 234 is K, T or G; xxii. the amino acid at position 235 is V or M; xxiii. the amino acid at position 236 is T; xxiv. the amino acid at position 238 is A, K, D, E, or Q; xxv. the amino acid at position 240 is F; xxvi. the amino acid at position 289 is S or G; xxvii. the amino acid at position 291 is E or W; xxviii. the amino acid at position 317 is T; xxix. the amino acid at position 320 is E or G; xxx. the amino acid at position 321 is F.
2. Protein variant according to claim 1 , characterized in that the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by at least one of the following mutations: i. the amino acid at position 44 is W or Y; ii. the amino acid at position 54 is F; iii. the amino acid at position 55 is R; iv. the amino acid at position 109 is H; v. the amino acid at position 110 is T or V; vi. the amino acid at position 117 is F; vii. the amino acid at position 122 is Q or R; viii. the amino acid at position 160 is E; ix. the amino acid at position 216 is K; x. the amino acid at position 236 is T; xi. the amino acid at position 238 is K or E; xii. the amino acid at position 240 is F.
3. Protein variant according to claim 1 or 2, characterized in that the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by at least the following mutation: the amino acid at position 109 is H.
4. Protein variant according to any one of the preceding claims, wherein the protein variant comprises at least two, preferably at least three of the recited amino acid substitutions.
5. Protein variant according to any one of the preceding claims, wherein the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by at least two, further preferably at least three, even further preferably at least four, in particular preferably at least five, and most preferably all of the following modifications selected from: i. the amino acid at position 57 is P; ii. the amino acid at position 109 is H; iii. the amino acid at position 122 is R; iv. the amino acid at position 212 is P; v. the amino acid at position 234 is K; vi. the amino acid at position 289 is G.
6. Protein variant according to any one of the preceding claims, wherein the amino acid sequence of the protein variant differs from the amino acid sequence of SEQ ID NO. 1 by the following modifications: i. the amino acid at position 57 is P; the amino acid at position 109 is H; the amino acid at position 122 is R or K or Q, preferably R; the amino acid at position 212 is P; the amino acid at position 234 is K; the amino acid at position 289 is G; ii. and additionally at least one of the following modifications: the amino acid at position 44 is W or Y; the amino acid at position 54 is F, the amino acid at position 55 is R, the amino acid at position 110 is V or T; the amino acid at position 111 is T, the amino acid at position 117 is F, the amino acid at position 160 is E, the amino acid at position 216 is K, the amino acid at position 236 is T, the amino acid at position 238 is K or E, the amino acid at position 240 is F.
7. Protein variant according to any one of the preceding claims, wherein the protein variant carries an amino acid sequence of one of SEQ ID No. 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, 95, 97, 99, 101, 103, 105, 107, 109, 111, 113, 115, 117, 119, 121, 123, 125, 127, 129, 131, 133, 135, 137, 139, 141, 143, 145, 147, 149, 151, 153, 155, 157, 159, 161, 163, 165, 167, 169, 171, 173, 175, 177, 179, 181, 183, 185, 187, 189, 191, 193, 195, 197, 199, 201, 203, 205, 207, 209, 211, 213, 215, 217, 219, 221, 223, 225, 227, 229, 231, 233, 235, 237, 239.
8. Nucleic acid molecule encoding a protein having the activity of a lipase according to any one of the preceding claims.
9. Nucleic acid molecule according to claim 8 encoding a protein having the activity of a lipase selected from the group consisting of a) nucleic acid molecules comprising the nucleic acid sequences shown under of SEQ ID No.
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, 94, 96, 98, 100, 102,
104, 106, 108, 110, 112, 114, 116, 118, 120, 122, 124, 126, 128, 130, 132, 134, 136, 138,
140, 142, 144, 146, 148, 150, 152, 154, 156, 158, 160, 162, 164, 166, 168, 170, 172, 174,
176, 178, 180, 182, 184, 186, 188, 190, 192, 194, 196, 198, 200, 202, 204, 206, 208, 210,
212, 214, 216, 218, 220, 222, 224, 226, 228, 230, 232, 234, 236, 238, 240; b) nucleic acid molecules having at least 60%, preferably 70%, more preferably 80%, further more preferably 90%, even more preferably 95%, even further more preferably 96%, particular preferably 97%, most preferably 98% or especially preferably 99% identity with the nucleic acid sequences shown under a).
10. A recombinant nucleic acid molecule comprising a nucleic acid molecule according to claim 8 or 9.
11. The recombinant nucleic acid molecule according to claim 10, wherein the recombinant nucleic acid molecule is a vector or a plasmid.
12. A host cell comprising a protein according to any one of claims 1 to 7 or a nucleic acid molecule according to claim 8 or 9 or a recombinant nucleic acid molecule according to claim 10 or 11.
13. Use of a protein according to any one of claims 1 to 7 for the stereoselective hydrolysis of racemic methyl-3-hydroxy-2-methylene-butanoate to (3R)-3-hydroxy-2-methylene- butanoic acid.
14. Method for hydrolysing a substrate of formula (II) in an enantiomerically selective manner to a compound of formula (III) comprising contacting the substrate with a protein variant according to any one of claims 1 to 7,
wherein R1 and R2 are independently selected from a substutituted or unsubstituted (n)-alkyl, isoalkyl, aryl, alkyl-substituted or aryl-substituted aryl.
15. Method according to claim 14 for hydrolysing a substrate of formula (II- 1) in an enantiomerically selective manner to a compound fo formula (III-l),
16. Method according to any of claims 14 or 15, wherein the method is carried out in a biphasic system of water and an organic solvent, preferably selected from methyl-tert-butylether, toluene,
2-methyltetrahydrofuran, methyl isobutyl ketone, cyclohexane, cyclopentyl methyl ether, chlorobenzene, tert-amylmethylether, ethyl acetate, and isopropyl acetate.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP23161854.7 | 2023-03-14 | ||
EP23161854 | 2023-03-14 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024188791A1 true WO2024188791A1 (en) | 2024-09-19 |
Family
ID=85640926
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2024/055979 WO2024188791A1 (en) | 2023-03-14 | 2024-03-07 | Lipase with improved stereoselectivity and lipase-based chiral resolution methods |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024188791A1 (en) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4732853A (en) | 1984-11-21 | 1988-03-22 | President And Fellows Of Harvard College | Method of making chiral epoxy alcohols |
EP0716712B1 (en) | 1993-08-31 | 1998-11-04 | Basf Aktiengesellschaft | Lipase-catalysed acylation of alcohols with diketenes |
WO2008065060A2 (en) * | 2006-11-28 | 2008-06-05 | Novozymes A/S | Lipolytic enzyme variants |
WO2012146935A1 (en) | 2011-04-29 | 2012-11-01 | Frito-Lay Trading Company, Gmbh | Lipase variants having increased enzyme specificity or enhanced trans-selectivity, and methods of use |
WO2018228985A1 (en) | 2017-06-13 | 2018-12-20 | Bayer Aktiengesellschaft | Herbicidally active 3-phenylisoxazoline-5-carboxamides of tetrahydro and dihydrofuran carboxylic acids and esters |
-
2024
- 2024-03-07 WO PCT/EP2024/055979 patent/WO2024188791A1/en unknown
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4732853A (en) | 1984-11-21 | 1988-03-22 | President And Fellows Of Harvard College | Method of making chiral epoxy alcohols |
EP0716712B1 (en) | 1993-08-31 | 1998-11-04 | Basf Aktiengesellschaft | Lipase-catalysed acylation of alcohols with diketenes |
WO2008065060A2 (en) * | 2006-11-28 | 2008-06-05 | Novozymes A/S | Lipolytic enzyme variants |
WO2012146935A1 (en) | 2011-04-29 | 2012-11-01 | Frito-Lay Trading Company, Gmbh | Lipase variants having increased enzyme specificity or enhanced trans-selectivity, and methods of use |
WO2018228985A1 (en) | 2017-06-13 | 2018-12-20 | Bayer Aktiengesellschaft | Herbicidally active 3-phenylisoxazoline-5-carboxamides of tetrahydro and dihydrofuran carboxylic acids and esters |
Non-Patent Citations (23)
Title |
---|
"Bioanalytik, Spektrum akad", 1998, VERLAG, HEIDELBERG |
AUSUBEL ET AL.: "Short Protocols in Molecular Biology", 2002, JOHN WILEY & SONS |
BAGHBAN ET AL., CURRENT PHARMACEUTICAL BIOTECHNOLOGY, vol. 19, no. 6, 2018 |
BITTER, METHODS IN ENZYMOLOGY, vol. 153, 1987, pages 516 - 544 |
DATABASE UniProt [online] 7 October 2020 (2020-10-07), BATINOVIC S.: "Full=Alpha/beta hydrolase fold domain-containing protein {ECO:0000313 / EMBL: QHG09650.1}", XP093072114, Database accession no. A0A6P1KFL2 * |
DENG ET AL., J. ORG. CHEM., vol. 80, no. 6, 2015, pages 3159 |
FABERRIVA, SYNTHESIS, vol. 1992, no. 10, 1992, pages 895 - 910 |
G.R. COOK: "Transition Metal-Mediated Kinetic Resolution", CURRENT ORGANIC CHEMISTRY, vol. 4, 2000, pages 869 - 885 |
GOMES ET AL., ADVANCES IN ANIMAL AND VETERINARY SCIENCES, vol. 4, no. 4, 2016, pages 346 |
HENIKOFFHENIKOFF, PNAS, vol. 89, 1992, pages 10915 - 10919 |
HIRAMA ET AL., J. ORG. CHEM., vol. 53, 1988, pages 708 |
JOHANNA LÖFGREN ET AL: "Transesterification of a Tertiary Alcohol by Engineered Candida antarctica Lipase A", CHEMBIOCHEM, JOHN WILEY & SONS, INC, HOBOKEN, USA, vol. 20, no. 11, 15 April 2019 (2019-04-15), pages 1438 - 1443, XP072200581, ISSN: 1439-4227, DOI: 10.1002/CBIC.201800792 * |
KIRCHNER ET AL., J. AM. CHEM. SOC., vol. 107, 1985, pages 7072 - 7076 |
MARSCHALL ET AL., APPL MICROBIOL BIOTECHNOL, vol. 101, 2017, pages 501 - 512 |
NASCIMENTO ET AL., TETRAHEDRON ASYMMETRY, vol. 14, 2003, pages 311 - 311 |
PATEL RAMESH N. ET AL: "Enzymatic resolution of racemic secondary alcohols by lipase B from Candida antarctica", JOURNAL OF THE AMERICAN OIL CHEMISTS SOCIETY, vol. 77, no. 10, 1 October 2000 (2000-10-01), DE, pages 1015 - 1019, XP093072302, ISSN: 0003-021X, DOI: 10.1007/s11746-000-0161-y * |
PATIL ET AL., J. ORG. CHEM., vol. 73, 2008, pages 4476 - 4483 |
REDDY, SYNTH. COMMUNICATIONS, vol. 33, 2003, pages 3717 - 3726 |
SAMBROOK ET AL.: "Molecular Cloning, A Laboratory Manual", 2001, COLD SPRING HARBOUR LABORATORY PRESS |
SCHLEGEL: "Thermo Fisher Scientific", vol. 1-2, 1985, GEORG THIEME PUBLISHING HOUSE |
TEGEL ET AL., FEBS JOURNAL, vol. 278, 2011, pages 729 - 739 |
THOMPSON ET AL., NUCLEIC ACIDS RESEARCH, vol. 22, 1994, pages 4673 - 4680 |
UPPENBERG JPATKAR SBERGFORS TJONES TA, J MOL BIOL, vol. 235, no. 2, 1994, pages 790 - 792 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
de María et al. | Biotechnological applications of Candida antarctica lipase A: State-of-the-art | |
Zheng et al. | Enzymatic production of l‐menthol by a high substrate concentration tolerable esterase from newly isolated Bacillus subtilis ECU0554 | |
Zhang et al. | Efficient biocatalytic synthesis of chiral chemicals | |
CN110396507B (en) | L-pantolactone dehydrogenase from Cneubacter physcomitrella | |
US9970043B2 (en) | Process for producing optically active 2-alkyl-1,1,3-trialkoxycarbonylpropane | |
Huang et al. | Functional characterization of a marine Bacillus esterase and its utilization in the stereo-selective production of D-methyl lactate | |
JP5919192B2 (en) | Preparation of β-amino acids | |
CN103820417B (en) | A kind of ester hydrolase, encoding gene, carrier, engineering bacteria and application thereof | |
CN106754818B (en) | Heat-resistant esterase mutant and preparation method and application thereof | |
CN109652470B (en) | Application of lipase in resolution of (R, S) -methyl mandelate | |
WO2023088077A1 (en) | Biocatalysts and methods for the synthesis of pregabalin intermediates | |
WO2024188791A1 (en) | Lipase with improved stereoselectivity and lipase-based chiral resolution methods | |
CN113502305B (en) | Method for synthesizing (R) -isobutyl glutarate monoamide by utilizing recombinant imidinase | |
JP5398943B2 (en) | Butinol I esterase | |
JPWO2019189724A1 (en) | A novel hydrolase and a method for producing (1S, 2S) -1-alkoxycarbonyl-2-vinylcyclopropanecarboxylic acid using it. | |
CN109943618B (en) | Application of recombinant lipase in resolution of (R, S) -alpha-ethyl-2-oxo-1-pyrrolidine methyl acetate | |
JP4170624B2 (en) | Method for producing L-menthol | |
Grulich et al. | Production, enrichment and immobilization of a metagenome-derived epoxide hydrolase | |
WO2024041972A1 (en) | Nucleic acids encoding improved lipase proteins | |
Liang et al. | Functional characterization of a novel microbial esterase identified from the Indian Ocean and its use in the stereoselective preparation of (R)-methyl mandelate | |
KR102703236B1 (en) | Enantioselectivity Improved Candida antarctica Lipase B Variants | |
CN103820521B (en) | Living things catalysis Dynamic Kinetic Resolution prepares the method for R-o-chloromandelic acid methyl ester | |
JP2015195771A (en) | Protein having leucine acid production activity and use thereof | |
TW202413319A (en) | Process for preparing (1r,2s)-2,6-dimethyl-1-indanamine by dynamic-kinetic stereoisomer cleavage | |
WO2011009849A2 (en) | Method for preparing optically active hydroxy acid esters |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 24709397 Country of ref document: EP Kind code of ref document: A1 |