WO2024047057A1 - Means and methods to produce triterpene saponins in eukaryotic cells - Google Patents
Means and methods to produce triterpene saponins in eukaryotic cells Download PDFInfo
- Publication number
- WO2024047057A1 WO2024047057A1 PCT/EP2023/073700 EP2023073700W WO2024047057A1 WO 2024047057 A1 WO2024047057 A1 WO 2024047057A1 EP 2023073700 W EP2023073700 W EP 2023073700W WO 2024047057 A1 WO2024047057 A1 WO 2024047057A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- sop450
- yeast
- plant
- nucleotide sequence
- Prior art date
Links
- 210000003527 eukaryotic cell Anatomy 0.000 title claims abstract description 18
- 238000000034 method Methods 0.000 title claims description 48
- 229930182493 triterpene saponin Natural products 0.000 title description 5
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 116
- 240000004808 Saccharomyces cerevisiae Species 0.000 claims abstract description 61
- MQUFAARYGOUYEV-UWEXFCAOSA-N Quillaic acid Natural products CC1(C)CC[C@@]2([C@H](O)C[C@]3(C)C(=CC[C@H]4[C@@]5(C)CC[C@H](O)[C@](C)(C=O)[C@H]5CC[C@@]34C)[C@H]2C1)C(=O)O MQUFAARYGOUYEV-UWEXFCAOSA-N 0.000 claims abstract description 28
- MQUFAARYGOUYEV-UAWZMHPWSA-N quillaic acid Chemical compound C1C[C@H](O)[C@@](C)(C=O)[C@@H]2CC[C@@]3(C)[C@]4(C)C[C@@H](O)[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C MQUFAARYGOUYEV-UAWZMHPWSA-N 0.000 claims abstract description 28
- 241000196324 Embryophyta Species 0.000 claims description 98
- 239000002773 nucleotide Substances 0.000 claims description 55
- 125000003729 nucleotide group Chemical group 0.000 claims description 55
- 210000004027 cell Anatomy 0.000 claims description 51
- 230000008488 polyadenylation Effects 0.000 claims description 18
- 210000005253 yeast cell Anatomy 0.000 claims description 16
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical group C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 14
- 230000005030 transcription termination Effects 0.000 claims description 13
- 102100029077 3-hydroxy-3-methylglutaryl-coenzyme A reductase Human genes 0.000 claims description 11
- 241000219287 Saponaria Species 0.000 claims description 10
- 101710158485 3-hydroxy-3-methylglutaryl-coenzyme A reductase Proteins 0.000 claims description 8
- 241001316290 Gypsophila Species 0.000 claims description 8
- 108010045510 NADPH-Ferrihemoprotein Reductase Proteins 0.000 claims description 8
- 102100023897 NADPH-cytochrome P450 reductase Human genes 0.000 claims description 7
- 238000004519 manufacturing process Methods 0.000 abstract description 17
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 54
- 230000014509 gene expression Effects 0.000 description 52
- 240000003946 Saponaria officinalis Species 0.000 description 50
- 102000004316 Oxidoreductases Human genes 0.000 description 40
- 108090000854 Oxidoreductases Proteins 0.000 description 40
- 102000004190 Enzymes Human genes 0.000 description 37
- 108090000790 Enzymes Proteins 0.000 description 37
- 150000007523 nucleic acids Chemical group 0.000 description 33
- 230000009466 transformation Effects 0.000 description 27
- 239000013598 vector Substances 0.000 description 25
- 108020004707 nucleic acids Proteins 0.000 description 21
- 102000039446 nucleic acids Human genes 0.000 description 21
- 239000003550 marker Substances 0.000 description 19
- 229930182490 saponin Natural products 0.000 description 19
- 150000007949 saponins Chemical class 0.000 description 19
- 235000017709 saponins Nutrition 0.000 description 19
- 230000009261 transgenic effect Effects 0.000 description 19
- 108091026890 Coding region Proteins 0.000 description 17
- PAIBKVQNJKUVCE-UHFFFAOYSA-N Gypsogeninsaeure Natural products C1CC(O)C(C)(C(O)=O)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C PAIBKVQNJKUVCE-UHFFFAOYSA-N 0.000 description 17
- 239000002609 medium Substances 0.000 description 17
- 239000000047 product Substances 0.000 description 17
- 108020004414 DNA Proteins 0.000 description 16
- 239000008188 pellet Substances 0.000 description 15
- 150000003648 triterpenes Chemical class 0.000 description 15
- 210000001519 tissue Anatomy 0.000 description 14
- 230000002018 overexpression Effects 0.000 description 13
- 230000003647 oxidation Effects 0.000 description 13
- 238000007254 oxidation reaction Methods 0.000 description 13
- 150000001875 compounds Chemical class 0.000 description 12
- 239000007788 liquid Substances 0.000 description 12
- JKLISIRFYWXLQG-UHFFFAOYSA-N Epioleonolsaeure Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4CCC3C21C JKLISIRFYWXLQG-UHFFFAOYSA-N 0.000 description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 description 11
- YBRJHZPWOMJYKQ-UHFFFAOYSA-N Oleanolic acid Natural products CC1(C)CC2C3=CCC4C5(C)CCC(O)C(C)(C)C5CCC4(C)C3(C)CCC2(C1)C(=O)O YBRJHZPWOMJYKQ-UHFFFAOYSA-N 0.000 description 11
- PSZDOEIIIJFCFE-UHFFFAOYSA-N Oleanolic alcohol Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(CO)CCC(C)(C)CC5C4=CCC3C21C PSZDOEIIIJFCFE-UHFFFAOYSA-N 0.000 description 11
- MIJYXULNPSFWEK-UHFFFAOYSA-N Oleanolinsaeure Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C MIJYXULNPSFWEK-UHFFFAOYSA-N 0.000 description 11
- PSZDOEIIIJFCFE-OSQDELBUSA-N erythrodiol Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(CO)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C PSZDOEIIIJFCFE-OSQDELBUSA-N 0.000 description 11
- HTZRWCSRPTWJCT-UHFFFAOYSA-N erythrodiol Natural products CC1(C)CCC2(CO)CCC3C(CCC4C3(C)CCC5C(C)(C)C(O)CCC45C)C2C1 HTZRWCSRPTWJCT-UHFFFAOYSA-N 0.000 description 11
- PAIBKVQNJKUVCE-HDHBZPKHSA-N gypsogenic acid Natural products CC1(C)CC[C@@]2(CC[C@]3(C)C(=CC[C@@H]4[C@@]5(C)CC[C@H](O)[C@@](C)([C@H]5CC[C@@]34C)C(O)=O)[C@H]2C1)C(O)=O PAIBKVQNJKUVCE-HDHBZPKHSA-N 0.000 description 11
- 229940100243 oleanolic acid Drugs 0.000 description 11
- HZLWUYJLOIAQFC-UHFFFAOYSA-N prosapogenin PS-A Natural products C12CC(C)(C)CCC2(C(O)=O)CCC(C2(CCC3C4(C)C)C)(C)C1=CCC2C3(C)CCC4OC1OCC(O)C(O)C1O HZLWUYJLOIAQFC-UHFFFAOYSA-N 0.000 description 11
- MIJYXULNPSFWEK-GTOFXWBISA-N 3beta-hydroxyolean-12-en-28-oic acid Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C MIJYXULNPSFWEK-GTOFXWBISA-N 0.000 description 10
- 101150053185 P450 gene Proteins 0.000 description 10
- 108700019146 Transgenes Proteins 0.000 description 10
- 238000004458 analytical method Methods 0.000 description 10
- 230000001939 inductive effect Effects 0.000 description 10
- 240000005625 Gypsophila elegans Species 0.000 description 9
- PAIBKVQNJKUVCE-JUENUIDLSA-N gypsogenic acid Chemical compound C1C[C@H](O)[C@@](C)(C(O)=O)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C PAIBKVQNJKUVCE-JUENUIDLSA-N 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 8
- 238000012546 transfer Methods 0.000 description 8
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 7
- 241000219828 Medicago truncatula Species 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 239000002207 metabolite Substances 0.000 description 7
- 102000004169 proteins and genes Human genes 0.000 description 7
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- QMHCWDVPABYZMC-UHFFFAOYSA-N 3-Hydroxy-23-oxo-olean-12-en-28-saeure Natural products C1CC(O)C(C)(C=O)C2CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C QMHCWDVPABYZMC-UHFFFAOYSA-N 0.000 description 6
- QMHCWDVPABYZMC-MYPRUECHSA-N 3beta-hydroxy-23-oxoolean-12-en-28-oic acid Chemical compound C1C[C@H](O)[C@@](C)(C=O)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C QMHCWDVPABYZMC-MYPRUECHSA-N 0.000 description 6
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 6
- YKOPWPOFWMYZJZ-FMMUPTMQSA-N Echinocystic acid Natural products C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)C[C@H](O)[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C YKOPWPOFWMYZJZ-FMMUPTMQSA-N 0.000 description 6
- YKOPWPOFWMYZJZ-UHFFFAOYSA-N Echinocystsaeure Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CC(O)C5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C YKOPWPOFWMYZJZ-UHFFFAOYSA-N 0.000 description 6
- GCGBHJLBFAPRDB-KCVAUKQGSA-N Scutellaric acid Natural products CC1(C)CC[C@@]2(CC[C@@]3(C)[C@@H]4CC[C@H]5[C@@](C)(CO)[C@H](O)CC[C@]5(C)[C@H]4CC=C3[C@@H]2C1)C(=O)O GCGBHJLBFAPRDB-KCVAUKQGSA-N 0.000 description 6
- 229910052799 carbon Inorganic materials 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 230000006870 function Effects 0.000 description 6
- QMHCWDVPABYZMC-RFVOPWELSA-N gypsogenin Natural products CC1(C)CC[C@@]2(CC[C@]3(C)C(=CC[C@@H]4[C@@]5(C)CC[C@H](O)[C@](C)(C=O)[C@H]5CC[C@@]34C)[C@H]2C1)C(=O)O QMHCWDVPABYZMC-RFVOPWELSA-N 0.000 description 6
- MCHWKJRTMPIHRA-UHFFFAOYSA-N n-(pyrrolidin-2-ylmethyl)aniline Chemical compound C1CCNC1CNC1=CC=CC=C1 MCHWKJRTMPIHRA-UHFFFAOYSA-N 0.000 description 6
- 235000000346 sugar Nutrition 0.000 description 6
- TWCMVXMQHSVIOJ-UHFFFAOYSA-N Aglycone of yadanzioside D Natural products COC(=O)C12OCC34C(CC5C(=CC(O)C(O)C5(C)C3C(O)C1O)C)OC(=O)C(OC(=O)C)C24 TWCMVXMQHSVIOJ-UHFFFAOYSA-N 0.000 description 5
- PLMKQQMDOMTZGG-UHFFFAOYSA-N Astrantiagenin E-methylester Natural products CC12CCC(O)C(C)(CO)C1CCC1(C)C2CC=C2C3CC(C)(C)CCC3(C(=O)OC)CCC21C PLMKQQMDOMTZGG-UHFFFAOYSA-N 0.000 description 5
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 5
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 5
- 238000000137 annealing Methods 0.000 description 5
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 5
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 5
- 238000004925 denaturation Methods 0.000 description 5
- 230000036425 denaturation Effects 0.000 description 5
- PFOARMALXZGCHY-UHFFFAOYSA-N homoegonol Natural products C1=C(OC)C(OC)=CC=C1C1=CC2=CC(CCCO)=CC(OC)=C2O1 PFOARMALXZGCHY-UHFFFAOYSA-N 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 210000002706 plastid Anatomy 0.000 description 5
- 239000002243 precursor Substances 0.000 description 5
- 230000001105 regulatory effect Effects 0.000 description 5
- 241000219194 Arabidopsis Species 0.000 description 4
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 4
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 4
- RTZKZFJDLAIYFH-UHFFFAOYSA-N Diethyl ether Chemical compound CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 4
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 4
- 150000001299 aldehydes Chemical class 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 230000001851 biosynthetic effect Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 230000009977 dual effect Effects 0.000 description 4
- 239000005090 green fluorescent protein Substances 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- 238000004895 liquid chromatography mass spectrometry Methods 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- 238000011426 transformation method Methods 0.000 description 4
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 244000303040 Glycyrrhiza glabra Species 0.000 description 3
- 235000006200 Glycyrrhiza glabra Nutrition 0.000 description 3
- 108090000895 Hydroxymethylglutaryl CoA Reductases Proteins 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 3
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 230000004186 co-expression Effects 0.000 description 3
- 244000038559 crop plants Species 0.000 description 3
- 238000010195 expression analysis Methods 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 229930182830 galactose Natural products 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 239000004009 herbicide Substances 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 239000000543 intermediate Substances 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 230000001590 oxidative effect Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 239000000725 suspension Substances 0.000 description 3
- 230000002103 transcriptional effect Effects 0.000 description 3
- 230000001052 transient effect Effects 0.000 description 3
- 238000011282 treatment Methods 0.000 description 3
- 230000000007 visual effect Effects 0.000 description 3
- WQZGKKKJIJFFOK-SVZMEOIVSA-N (+)-Galactose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-SVZMEOIVSA-N 0.000 description 2
- VLFUANNVMXKBPF-ZAPOICBTSA-N (1s,2r,4as,6ar,6as,6br,8ar,10s,12ar,14bs)-10-hydroxy-1,2,6a,6b,9,9,12a-heptamethyl-2,3,4,5,6,6a,7,8,8a,10,11,12,13,14b-tetradecahydro-1h-picene-4a-carbaldehyde Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C=O)CC[C@@H](C)[C@H](C)[C@H]5C4=CC[C@@H]3[C@]21C VLFUANNVMXKBPF-ZAPOICBTSA-N 0.000 description 2
- GXWUEMSASMVWKO-GNLHUFSQSA-N (4as,6ar,6as,6br,10s,12ar,14br)-10-[(2s,3r,4s,5s)-4,5-dihydroxy-3-[(2r,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyoxan-2-yl]oxy-2,2,6a,6b,9,9,12a-heptamethyl-1,3,4,5,6,6a,7,8,8a,10,11,12,13,14b-tetradecahydropicene-4a-carboxylic acid Chemical compound O([C@@H]1[C@@H](O)[C@@H](O)CO[C@H]1O[C@H]1CC[C@]2(C)[C@H]3CC=C4[C@@]([C@@]3(CCC2C1(C)C)C)(C)CC[C@]1(CCC(C[C@@H]14)(C)C)C(O)=O)[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O GXWUEMSASMVWKO-GNLHUFSQSA-N 0.000 description 2
- VCNKUCWWHVTTBY-UHFFFAOYSA-N 18alpha-Oleanane Natural products C1CCC(C)(C)C2CCC3(C)C4(C)CCC5(C)CCC(C)(C)CC5C4CCC3C21C VCNKUCWWHVTTBY-UHFFFAOYSA-N 0.000 description 2
- 108010064634 2,3-oxidosqualene-beta-amyrin-cyclase Proteins 0.000 description 2
- 241000589158 Agrobacterium Species 0.000 description 2
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- VLFUANNVMXKBPF-UHFFFAOYSA-N Coniferin 2'-O-beta-D-apiofuranoside Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C=O)CCC(C)C(C)C5C4=CCC3C21C VLFUANNVMXKBPF-UHFFFAOYSA-N 0.000 description 2
- 229920000858 Cyclodextrin Polymers 0.000 description 2
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 2
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 2
- AEMOLEFTQBMNLQ-DTEWXJGMSA-N D-Galacturonic acid Natural products O[C@@H]1O[C@H](C(O)=O)[C@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-DTEWXJGMSA-N 0.000 description 2
- 108050002220 Green fluorescent protein, GFP Proteins 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 2
- 240000007594 Oryza sativa Species 0.000 description 2
- 235000007164 Oryza sativa Nutrition 0.000 description 2
- 101000587791 Panax ginseng Squalene monooxygenase SE2 Proteins 0.000 description 2
- 241000235648 Pichia Species 0.000 description 2
- 235000009001 Quillaja saponaria Nutrition 0.000 description 2
- 241001454523 Quillaja saponaria Species 0.000 description 2
- 102000018120 Recombinases Human genes 0.000 description 2
- 108010091086 Recombinases Proteins 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- 108090000992 Transferases Proteins 0.000 description 2
- 102000004357 Transferases Human genes 0.000 description 2
- 102000008579 Transposases Human genes 0.000 description 2
- 108010020764 Transposases Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- IAJILQKETJEXLJ-RSJOWCBRSA-N aldehydo-D-galacturonic acid Chemical compound O=C[C@H](O)[C@@H](O)[C@@H](O)[C@H](O)C(O)=O IAJILQKETJEXLJ-RSJOWCBRSA-N 0.000 description 2
- AEMOLEFTQBMNLQ-WAXACMCWSA-N alpha-D-glucuronic acid Chemical compound O[C@H]1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-WAXACMCWSA-N 0.000 description 2
- FSLPMRQHCOLESF-UHFFFAOYSA-N alpha-amyrenol Natural products C1CC(O)C(C)(C)C2CCC3(C)C4(C)CCC5(C)CCC(C)C(C)C5C4=CCC3C21C FSLPMRQHCOLESF-UHFFFAOYSA-N 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- AEMOLEFTQBMNLQ-UHFFFAOYSA-N beta-D-galactopyranuronic acid Natural products OC1OC(C(O)=O)C(O)C(O)C1O AEMOLEFTQBMNLQ-UHFFFAOYSA-N 0.000 description 2
- JFSHUTJDVKUMTJ-QHPUVITPSA-N beta-amyrin Chemical compound C1C[C@H](O)C(C)(C)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C JFSHUTJDVKUMTJ-QHPUVITPSA-N 0.000 description 2
- QQFMRPIKDLHLKB-UHFFFAOYSA-N beta-amyrin Natural products CC1C2C3=CCC4C5(C)CCC(O)C(C)(C)C5CCC4(C)C3(C)CCC2(C)CCC1(C)C QQFMRPIKDLHLKB-UHFFFAOYSA-N 0.000 description 2
- PDNLMONKODEGSE-UHFFFAOYSA-N beta-amyrin acetate Natural products CC(=O)OC1CCC2(C)C(CCC3(C)C4(C)CCC5(C)CCC(C)(C)CC5C4=CCC23C)C1(C)C PDNLMONKODEGSE-UHFFFAOYSA-N 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 238000009395 breeding Methods 0.000 description 2
- 230000001488 breeding effect Effects 0.000 description 2
- 108010040093 cellulose synthase Proteins 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- -1 electroporation Substances 0.000 description 2
- 210000002257 embryonic structure Anatomy 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000010230 functional analysis Methods 0.000 description 2
- 238000002290 gas chromatography-mass spectrometry Methods 0.000 description 2
- 229930182478 glucoside Natural products 0.000 description 2
- 150000008131 glucosides Chemical class 0.000 description 2
- 229930182470 glycoside Natural products 0.000 description 2
- 150000002338 glycosides Chemical class 0.000 description 2
- SIOMFBXUIJKTMF-UHFFFAOYSA-N hypoglauterpenic acid Natural products C1CC(O)C(C)(C)C2=CCC3(C)C4(C)CCC5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C SIOMFBXUIJKTMF-UHFFFAOYSA-N 0.000 description 2
- 230000008595 infiltration Effects 0.000 description 2
- 238000001764 infiltration Methods 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 230000000442 meristematic effect Effects 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 238000007884 metabolite profiling Methods 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- BPAWXSVOAOLSRP-UHFFFAOYSA-N oleanane Natural products CCCCCCCCCCCCCCCC(=O)OC1CCC2(C)C(CCC3(C)C2CC=C4C5CC(C)(C)CCC5(C)C(O)CC34C)C1(C)C BPAWXSVOAOLSRP-UHFFFAOYSA-N 0.000 description 2
- QGPIUZIWMRUUCS-UHFFFAOYSA-N oleanolic aldehyde Natural products C1CC(=O)C(C)(C)C2CCC3(C)C4(C)CCC5(C=O)CCC(C)(C)CC5C4=CCC3C21C QGPIUZIWMRUUCS-UHFFFAOYSA-N 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 230000003389 potentiating effect Effects 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 210000001938 protoplast Anatomy 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 235000009566 rice Nutrition 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 238000005382 thermal cycling Methods 0.000 description 2
- 230000010474 transient expression Effects 0.000 description 2
- 239000011782 vitamin Substances 0.000 description 2
- 229930003231 vitamin Natural products 0.000 description 2
- 229940088594 vitamin Drugs 0.000 description 2
- 235000013343 vitamin Nutrition 0.000 description 2
- GEWDNTWNSAZUDX-WQMVXFAESA-N (-)-methyl jasmonate Chemical compound CC\C=C/C[C@@H]1[C@@H](CC(=O)OC)CCC1=O GEWDNTWNSAZUDX-WQMVXFAESA-N 0.000 description 1
- VRYALKFFQXWPIH-PBXRRBTRSA-N (3r,4s,5r)-3,4,5,6-tetrahydroxyhexanal Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)CC=O VRYALKFFQXWPIH-PBXRRBTRSA-N 0.000 description 1
- XBZYWSMVVKYHQN-MYPRUECHSA-N (4as,6as,6br,8ar,9r,10s,12ar,12br,14bs)-10-hydroxy-2,2,6a,6b,9,12a-hexamethyl-9-[(sulfooxy)methyl]-1,2,3,4,4a,5,6,6a,6b,7,8,8a,9,10,11,12,12a,12b,13,14b-icosahydropicene-4a-carboxylic acid Chemical compound C1C[C@H](O)[C@@](C)(COS(O)(=O)=O)[C@@H]2CC[C@@]3(C)[C@]4(C)CC[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C XBZYWSMVVKYHQN-MYPRUECHSA-N 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- QYIMSPSDBYKPPY-RSKUXYSASA-N (S)-2,3-epoxysqualene Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C=C(/C)CC\C=C(/C)CC[C@@H]1OC1(C)C QYIMSPSDBYKPPY-RSKUXYSASA-N 0.000 description 1
- FABOBEOYNMHSHB-UHFFFAOYSA-N 15R,16-dihydroxy-3-oxoisopimar-9(11)-ene Natural products C1CC(O)C(C)(CO)C2CCC3(C)C4(C)CC(O)C5(C(O)=O)CCC(C)(C)CC5C4=CCC3C21C FABOBEOYNMHSHB-UHFFFAOYSA-N 0.000 description 1
- ZBMRKNMTMPPMMK-UHFFFAOYSA-N 2-amino-4-[hydroxy(methyl)phosphoryl]butanoic acid;azane Chemical compound [NH4+].CP(O)(=O)CCC(N)C([O-])=O ZBMRKNMTMPPMMK-UHFFFAOYSA-N 0.000 description 1
- CAAMSDWKXXPUJR-UHFFFAOYSA-N 3,5-dihydro-4H-imidazol-4-one Chemical compound O=C1CNC=N1 CAAMSDWKXXPUJR-UHFFFAOYSA-N 0.000 description 1
- 101710152516 3-hydroxy-3-methylglutaryl-coenzyme A reductase 1 Proteins 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 240000002234 Allium sativum Species 0.000 description 1
- 241001605719 Appias drusilla Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 241001167018 Aroa Species 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 241000972773 Aulopiformes Species 0.000 description 1
- 102000018720 Basic Helix-Loop-Helix Transcription Factors Human genes 0.000 description 1
- 108010027344 Basic Helix-Loop-Helix Transcription Factors Proteins 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- PIGTXFOGKFOFTO-PPEDVFHSSA-N CC1(C)CC[C@@]2([C@H](O)C[C@]3(C)C(=CC[C@@H]4[C@@]5(C)CCC(O[C@@H]6O[C@@H]([C@@H](O)[C@H](O)[C@H]6O)C(O)=O)[C@@](C)(C=O)[C@@H]5CC[C@@]34C)[C@@H]2C1)C(O)=O Chemical compound CC1(C)CC[C@@]2([C@H](O)C[C@]3(C)C(=CC[C@@H]4[C@@]5(C)CCC(O[C@@H]6O[C@@H]([C@@H](O)[C@H](O)[C@H]6O)C(O)=O)[C@@](C)(C=O)[C@@H]5CC[C@@]34C)[C@@H]2C1)C(O)=O PIGTXFOGKFOFTO-PPEDVFHSSA-N 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- FABOBEOYNMHSHB-UAWZMHPWSA-N Caulophyllogenin Chemical compound C1C[C@H](O)[C@@](C)(CO)[C@@H]2CC[C@@]3(C)[C@]4(C)C[C@@H](O)[C@@]5(C(O)=O)CCC(C)(C)C[C@H]5C4=CC[C@@H]3[C@]21C FABOBEOYNMHSHB-UAWZMHPWSA-N 0.000 description 1
- CSJDAUVLWGFTHJ-UHFFFAOYSA-N Caulophyllogenin Natural products CC(O)C(O)C(O)C(OC1OC(C(O)C(O)C1O)C(=O)O)C=O CSJDAUVLWGFTHJ-UHFFFAOYSA-N 0.000 description 1
- 101710095468 Cyclase Proteins 0.000 description 1
- 102000018832 Cytochromes Human genes 0.000 description 1
- 108010052832 Cytochromes Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- IGXWBGJHJZYPQS-SSDOTTSWSA-N D-Luciferin Chemical compound OC(=O)[C@H]1CSC(C=2SC3=CC=C(O)C=C3N=2)=N1 IGXWBGJHJZYPQS-SSDOTTSWSA-N 0.000 description 1
- SHZGCJCMOBCMKK-SVZMEOIVSA-N D-fucopyranose Chemical compound C[C@H]1OC(O)[C@H](O)[C@@H](O)[C@H]1O SHZGCJCMOBCMKK-SVZMEOIVSA-N 0.000 description 1
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- CYCGRDQQIOGCKX-UHFFFAOYSA-N Dehydro-luciferin Natural products OC(=O)C1=CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 CYCGRDQQIOGCKX-UHFFFAOYSA-N 0.000 description 1
- 108700033398 EC 1.1.1.34 Proteins 0.000 description 1
- 101150107463 ERG7 gene Proteins 0.000 description 1
- 101100491986 Emericella nidulans (strain FGSC A4 / ATCC 38163 / CBS 112.46 / NRRL 194 / M139) aromA gene Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- BJGNCJDXODQBOB-UHFFFAOYSA-N Fivefly Luciferin Natural products OC(=O)C1CSC(C=2SC3=CC(O)=CC=C3N=2)=N1 BJGNCJDXODQBOB-UHFFFAOYSA-N 0.000 description 1
- 102000002464 Galactosidases Human genes 0.000 description 1
- 108010093031 Galactosidases Proteins 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- SHZGCJCMOBCMKK-JFNONXLTSA-N L-rhamnopyranose Chemical compound C[C@@H]1OC(O)[C@H](O)[C@H](O)[C@H]1O SHZGCJCMOBCMKK-JFNONXLTSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- DDWFXDSYGUXRAY-UHFFFAOYSA-N Luciferin Natural products CCc1c(C)c(CC2NC(=O)C(=C2C=C)C)[nH]c1Cc3[nH]c4C(=C5/NC(CC(=O)O)C(C)C5CC(=O)O)CC(=O)c4c3C DDWFXDSYGUXRAY-UHFFFAOYSA-N 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- QYIMSPSDBYKPPY-UHFFFAOYSA-N OS Natural products CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC1OC1(C)C QYIMSPSDBYKPPY-UHFFFAOYSA-N 0.000 description 1
- 240000004371 Panax ginseng Species 0.000 description 1
- 235000002789 Panax ginseng Nutrition 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 101100004939 Platycodon grandiflorus CYP716A141 gene Proteins 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 108700008625 Reporter Genes Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- 108091081021 Sense strand Proteins 0.000 description 1
- 102000005782 Squalene Monooxygenase Human genes 0.000 description 1
- 108020003891 Squalene monooxygenase Proteins 0.000 description 1
- 229940100389 Sulfonylurea Drugs 0.000 description 1
- 240000004429 Tephrosia purpurea Species 0.000 description 1
- 235000017517 Tephrosia purpurea Nutrition 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- 241000223259 Trichoderma Species 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 description 1
- HDYANYHVCAPMJV-UHFFFAOYSA-N Uridine diphospho-D-glucuronic acid Natural products O1C(N2C(NC(=O)C=C2)=O)C(O)C(O)C1COP(O)(=O)OP(O)(=O)OC1OC(C(O)=O)C(O)C(O)C1O HDYANYHVCAPMJV-UHFFFAOYSA-N 0.000 description 1
- 241001464837 Viridiplantae Species 0.000 description 1
- 108700040099 Xylose isomerases Proteins 0.000 description 1
- 241000235013 Yarrowia Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 239000002671 adjuvant Substances 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 125000003158 alcohol group Chemical group 0.000 description 1
- IAJILQKETJEXLJ-QTBDOELSSA-N aldehydo-D-glucuronic acid Chemical compound O=C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C(O)=O IAJILQKETJEXLJ-QTBDOELSSA-N 0.000 description 1
- PNNNRSAQSRJVSB-BXKVDMCESA-N aldehydo-L-rhamnose Chemical compound C[C@H](O)[C@H](O)[C@@H](O)[C@@H](O)C=O PNNNRSAQSRJVSB-BXKVDMCESA-N 0.000 description 1
- PMMURAAUARKVCB-UHFFFAOYSA-N alpha-D-ara-dHexp Natural products OCC1OC(O)CC(O)C1O PMMURAAUARKVCB-UHFFFAOYSA-N 0.000 description 1
- 150000001413 amino acids Chemical group 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000000433 anti-nutritional effect Effects 0.000 description 1
- 101150037081 aroA gene Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229930189065 blasticidin Natural products 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical group 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- 229940097362 cyclodextrins Drugs 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- WQLVFSAGQJTQCK-UHFFFAOYSA-N diosgenin Natural products CC1C(C2(CCC3C4(C)CCC(O)CC4=CCC3C2C2)C)C2OC11CCC(C)CO1 WQLVFSAGQJTQCK-UHFFFAOYSA-N 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000013020 embryo development Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 238000007306 functionalization reaction Methods 0.000 description 1
- 235000004611 garlic Nutrition 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000011223 gene expression profiling Methods 0.000 description 1
- 235000008434 ginseng Nutrition 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 229940097043 glucuronic acid Drugs 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 230000007407 health benefit Effects 0.000 description 1
- 241000411851 herbal medicine Species 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 230000003054 hormonal effect Effects 0.000 description 1
- 125000004356 hydroxy functional group Chemical group O* 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- ZNJFBWYDHIGLCU-HWKXXFMVSA-N jasmonic acid Chemical group CC\C=C/C[C@@H]1[C@@H](CC(O)=O)CCC1=O ZNJFBWYDHIGLCU-HWKXXFMVSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- GEWDNTWNSAZUDX-UHFFFAOYSA-N methyl 7-epi-jasmonate Natural products CCC=CCC1C(CC(=O)OC)CCC1=O GEWDNTWNSAZUDX-UHFFFAOYSA-N 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000003750 molluscacide Substances 0.000 description 1
- 230000002013 molluscicidal effect Effects 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 108010058731 nopaline synthase Proteins 0.000 description 1
- 125000000955 oleanolic acid group Chemical group 0.000 description 1
- 229920001542 oligosaccharide Polymers 0.000 description 1
- 230000005305 organ development Effects 0.000 description 1
- 239000012074 organic phase Substances 0.000 description 1
- 230000000865 phosphorylative effect Effects 0.000 description 1
- 238000013081 phylogenetic analysis Methods 0.000 description 1
- 230000008121 plant development Effects 0.000 description 1
- 229930000223 plant secondary metabolite Natural products 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920001184 polypeptide Polymers 0.000 description 1
- 231100000683 possible toxicity Toxicity 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 235000019515 salmon Nutrition 0.000 description 1
- NWMIYTWHUDFRPL-UHFFFAOYSA-N sapogenin Natural products COC(=O)C1(CO)C(O)CCC2(C)C1CCC3(C)C2CC=C4C5C(C)(O)C(C)CCC5(CCC34C)C(=O)O NWMIYTWHUDFRPL-UHFFFAOYSA-N 0.000 description 1
- 125000003198 secondary alcohol group Chemical group 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 239000000344 soap Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 229960000268 spectinomycin Drugs 0.000 description 1
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 1
- 238000005507 spraying Methods 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 230000000707 stereoselective effect Effects 0.000 description 1
- 230000001954 sterilising effect Effects 0.000 description 1
- 238000004659 sterilization and disinfection Methods 0.000 description 1
- 230000003637 steroidlike Effects 0.000 description 1
- 150000003431 steroids Chemical class 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- YROXIXLRRCOBKF-UHFFFAOYSA-N sulfonylurea Chemical class OC(=N)N=S(=O)=O YROXIXLRRCOBKF-UHFFFAOYSA-N 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- OFVLGDICTFRJMM-WESIUVDSSA-N tetracycline Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(O)=C(C(N)=O)C(=O)[C@@]4(O)C(O)=C3C(=O)C2=C1O OFVLGDICTFRJMM-WESIUVDSSA-N 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 229930195700 triterpenoid aglycone Natural products 0.000 description 1
- 150000008130 triterpenoid saponins Chemical class 0.000 description 1
- HDYANYHVCAPMJV-USQUEEHTSA-N udp-glucuronic acid Chemical compound O([P@](O)(=O)O[P@](O)(=O)OC[C@H]1[C@@H]([C@H]([C@@H](O1)N1C(NC(=O)C=C1)=O)O)O)[C@H]1O[C@@H](C(O)=O)[C@H](O)[C@@H](O)[C@@H]1O HDYANYHVCAPMJV-USQUEEHTSA-N 0.000 description 1
- 238000001946 ultra-performance liquid chromatography-mass spectrometry Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/14—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with reduced flavin or flavoprotein as one donor, and incorporation of one atom of oxygen (1.14.14)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y504/00—Intramolecular transferases (5.4)
- C12Y504/99—Intramolecular transferases (5.4) transferring other groups (5.4.99)
- C12Y504/99039—Beta-amyrin synthase (5.4.99.39)
Definitions
- the present invention relates to the field of plant secondary metabolites, more specifically to the field of saponins, even more specifically to the field of quillaic acid and derivatives thereof.
- the present invention provides chimeric genes and expression vectors for producing quillaic acid in native and heterologous host cells, such as plants and yeasts.
- Saponins glycosides widely distributed in the plant kingdom, include a diverse group of compounds characterized by their structure containing a steroidal or triterpenoid aglycone and one or more sugar chains. Their structural diversity is reflected in their physicochemical and biological properties, which are exploited in a number of traditional (as soaps, fish poison, and molluscicides) and industrial applications. Recent research has established saponins as the active components in many herbal medicines and highlighted their contributions to the health benefits of foods such as soybeans and garlic. Saponins are glycosides containing one or more sugar chains on a triterpene or steroid aglycone backbone also called a sapogenin.
- Monodesmosidic saponins have a single sugar chain, normally attached at C-3.
- Bidesmosidic saponins have two sugar chains, often with one attached through an ether linkage at C-3 and one attached through an ester linkage at C-28 (triterpene saponins) or an ether linkage at C-26 (furastanol saponins).
- the most common monosaccharides include: D-glucose (Glc), D-galactose (Gal), D-glucuronic acid (GlcA), D-galacturonic acid (GalA), L-rhamnose (Rha), L-arabinose (Ara), D-xylose (Xyl), and D-fucose (Fuc).
- Glc D-glucose
- Gal D-galactose
- GlcA D-glucuronic acid
- GalA D-galacturonic acid
- L-rhamnose Rha
- L-arabinose Ara
- D-xylose Xyl
- D-fucose Fuc
- the nature of the aglycone and the functional groups on the aglycone backbone and number and nature of the sugars can vary greatly resulting in a very diverse group of compounds.
- Quillaic acid is the major aglycone of the widely studied saponins of the Chilean indigenous tree Quillaja saponaria.
- Several glucoside modified variants of quillaic acid are known.
- One is the saponin QS-21 which is known as a potent adjuvant for CTL induction, and induces Thl cytokines (see Kensil CR et al (1995) in Powel, MF, Newman, MJ (Eds), Vaccine Design, p. 525-541).
- Another glucoside modified variant is sapofectosid (also known as SO1861) which is known as a transfection reagent (see Sama S et al (2017) Int. J.
- WO2019/122259 describes the isolation of nucleic acid sequences encoding biosynthetic enzymes from Quillaja saponaria which when expressed in heterologous hosts can produce precursors of quillaic acid.
- the instant invention has isolated alternative enzymes from Saponaria officinalis which can synthesize quillaic acid in heterologous eukaryotic hosts such as yeasts.
- Figure 1 Expression trends for selected transcripts.
- the left and right panels depict data as scaled expression values, and transcript abundances as Counts Per Million (CPM)
- Figure 3 GC-MS analysis revealed that all strains, except the negative control, accumulated a peak at 15.48 corresponding to p-amyrin - results are shown for strain 1, 2, and 3.
- Figure 4 recombinant yeast strains 4, 5 and 6 expressed a single P450 (either SoP450_1944, SoP450_2010 or SoP450_1049 respectively) together with tHMGRl, GgBAS and MtMTRl.
- Recombinant yeast strain 4 (SoP450_1944) was found to accumulate erythrodiol (17.27 min), the C-28 alcohol of p- amyrin in the pellet.
- Figure 5 recombinant yeast strains 4, 5 and 6 expressed a single P450 (either SoP450_1944, SoP450_2010 or SoP450_1049 respectively) together with tHMGRl, GgBAS and MtMTRl.
- Recombinant yeast strain 4 (SoP450_1944) was found to accumulate oleanolic acid (18.37 min) and all its intermediates derived from C-28 p-amyrin oxidation such as erythrodiol (17.25 min) and oleanolic aldehyde (18.83 min).
- Figure 6 Recombinant yeast strain 6 (expressing SoP450_1049) didn't produce any additional peak in the pellet (see Fig. 4), whereas in the medium erythrodiol (17.25 min), oleanolic acid (18.37 min) and echinocystic acid (19.00 min) were detected. The latter compound corresponds to 16-hydroxy-oleanolic acid and therefore leads to the identification of SoP450_1049 as an enzyme with a dual function as a C- 28/C-16 oxidase.
- Figure 7 When the two enzymes SoP450_1944 and SoP450_2010 were combined in recombinant yeast strain 7 , no additional products were detected in the cell pellet, though the levels of erythrodiol (17.27 min) and 23-hydroxy-p-amyrin (16.66 min) were notably reduced.
- Figure 8 Analysis of liquid medium of recombinant yeast strain 7 revealed that, besides the products already detected in yeast strains 4 (SoP450_1944) and 5 (SoP450_2010) expressing each of these two P450s individually, gypsogenin (22.53 min) and gypsogenic acid (23.44 min) were present as additional products. Both of these compounds result from a C-23 oxidation of oleanolic acid: gypsogenin is characterized by aldehyde function at position 23 while the same carbon is further oxidized to carboxy group in gypsogenic acid.
- Figure 11 No additional products were detected in the pellet of recombinant yeast strain 9 co-expressing SoP450_2010 and SoP450_1049, as compared to strains 5 (SoP450_2010) and 6 (SoP450_1049) expressing each enzyme individually.
- Figure 12 No additional products were detected in the medium of yeast strain 9 co-expressing SoP450_2010 and SoP450_1049, as compared to strains 5 (SoP450_2010) and 6 (SoP450_1049) expressing each enzyme individually.
- Figure 14 No additional peaks were identified in recombinant strain 16 by concomitant expression of SoP450_6085 and SoP450_1049, as compared to strain 6 expressing only SoP450_6085.
- Figure 15 The most unexpected result was obtained by metabolite profiling of recombinant yeast strain 19 where the dual specificity C-28/C-16 oxidase (SoP450_1049) was co-expressed with both C-23 oxidases (SoP450_2010 and SoP450_6085).
- This recombinant yeast strain beside all the products derived from C-28 and C-16 oxidation of p-amyrin (such as erythrodiol, oleanolic and echinocystic acid), was also found to synthesize 16-hydroxy-gypsogenic acid (similarly to recombinant yeast strain 10) but differently it also accumulated quillaic acid and traces of gypsogenic acid.
- Figure 16 content of sapofectosid in S. officinalis hairy root lines generated in this invention
- Figure 17 Schematic representation of the triterpene aglycone biosynthetic pathway in S. officinalis
- Figure 18 content of selected saponins in G. elegans hairy root lines generated in this invention. Normalized intensities have been obtained by dividing the peak are of the compound of interest by the peak are of internal standard.
- biosynthetic enzymes in Saponaria officinalis which enzymes when expressed as chimeric genes in an eukaryotic host, lead to the production of quillaic acid.
- quillaic acid can be produced in the yeast Saccharomyces cerevisiae by expression of the identified chimeric genes.
- the isolated biosynthetic enzymes comprise an unusual combination of a dual specificity C-16/C-28 oxidase and two different C-23 oxidases.
- the present invention provides in a first embodiment a eukaryotic cell comprising at least one of the following chimeric genes: a. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and transcription termination and polyadenylation signals, b. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and transcription termination and polyadenylation signals, c.
- a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and transcription termination and polyadenylation signals d.
- a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and transcription termination and polyadenylation signals a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and transcription termination and polyadenylation signals.
- the eukaryotic cell further comprises a chimeric gene encoding an N- terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase.
- Non-limiting examples of N-terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductases are depicted in the nucleotide sequences SEQ ID NO: 13 and 19.
- the eukaryotic cell further comprises a chimeric gene encoding a NADPH- cytochrome P450 reductase.
- a NADPH-cytochrome P450 oxidase is depicted in the nucleotide sequence SEQ ID NO: 15.
- eukaryotic cells can be higher or low eukaryotic cells.
- Higher eukaryotic cells comprise plant cells and animal cells.
- Lower eukaryotic cells comprise yeast and fungal cells.
- Yeast can for example be from the genus Saccharomyces, Pichia, Yarrowia, Hansenula or Kluyveromyces.
- Fungal cells can for example be from the genus Aspergillus or Trichoderma.
- SEQ ID NO: 2 is a beta-amyrin synthase isolated from Saponaria officinalis.
- SEQ ID NO: 6 is a combined C-28/C-16 oxidase of beta-amyrin isolated from Saponaria officinalis.
- SEQ ID NO: 8 and 10 are two different C-23 oxidases of beta-amyrin isolated from Saponaria officinalis.
- the invention provides a chimeric gene construct comprising the following operably linked DNA elements: a) a promoter region which is active in plant or yeast cells, b) a DNA region encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and c) a 3' end region comprising transcription termination and polyadenylation signals functioning in cells of a plant or yeast.
- the invention provides a chimeric gene construct comprising the following operably linked DNA elements: a) a promoter region which is active in eukaryotic cells such as plant or yeast cells, b) a DNA region encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and c) a 3' end region comprising transcription termination and polyadenylation signals functioning in cells of a plant or yeast.
- the invention provides a chimeric gene construct comprising the following operably linked DNA elements: a) a promoter region which is active in yeast or plant cells, b) a DNA region encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and c) a 3' end region comprising transcription termination and polyadenylation signals functioning in cells of yeast or plants.
- the invention provides a yeast cell which yeast cell comprises the following chimeric genes: a. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and a terminator sequence, b. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and a terminator sequence, c.
- a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and a terminator sequence
- a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and a terminator sequence.
- the yeast cell further comprises a chimeric gene encoding an amino-terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase.
- the yeast cell further comprises a chimeric gene encoding an amino-terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase and a chimeric gene encoding a NADPH-cytochrome P450 reductase.
- the invention provides a plant cell comprising a chimeric gene comprising a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and a terminator sequence, or said plant cell comprises a chimeric gene comprising a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and a terminator sequence.
- said plant cell is from the genus Saponaria or Gypsophila.
- promoter is meant a sequence of nucleotides from which transcription may be initiated of DNA operably linked downstream (i.e. in the 3' direction on the sense strand of double-stranded DNA).
- the promoter is an inducible promoter.
- inducible as applied to a promoter is well understood by those skilled in the art. In essence, expression under the control of an inducible promoter is "switched on” or increased in response to an applied stimulus. The nature of the stimulus varies between promoters. Some inducible promoters cause little or undetectable levels of expression (or no expression) in the absence of the appropriate stimulus. Other inducible promoters cause detectable constitutive expression in the absence of the stimulus. Whatever the level of expression is in the absence of the stimulus, expression from any inducible promoter is increased in the presence of the correct stimulus.
- a nucleic acid according to the invention may be placed under the control of an externally (inducible) gene promoter to place expression under the control of the user.
- An advantage of introduction of a heterologous gene into a plant or yeast cell is the ability to place expression of the gene under the control of a promoter of choice, in order to be able to influence gene expression, and therefore quillaic or sapofectosid, according to preference.
- Suitable promoters which operate in plants include the Cauliflower Mosaic Virus 35S (CaMV 35S). Other examples are disclosed at pg. 120 of Lindsey & Jones (1989)"Plant Biotechnology in Agriculture” Pub. OU Press, Milton Keynes, UK.
- the promoter may be selected to include one or more sequence motifs or elements conferring developmental and/or tissue-specific regulatory control of expression.
- Inducible plant promoters include the ethanol induced promoter of Caddick et al (1998) Nature Biotechnology 16: 177-180.
- Suitable yeast promoters are for example the galactose inducible promoter from Saccharomyces cerevisiae or the methanol-inducible promoters from Pichia pastoris.
- the invention provides a method to produce quillaic acid in yeast comprising introducing the following chimeric genes: a. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and a polyadenylation and termination sequence, b. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and a polyadenylation and termination sequence, c.
- a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and a polyadenylation and termination sequence
- a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and a polyadenylation and termination sequence, cultivating said transformed yeast and isolating quillaic acid from said cultivated yeast.
- the method to produce quillaic acid in yeast further comprises the introduction of a chimeric gene encoding an amino-terminal truncated feedback-insensitive 3-hydroxy- 3-methylglutaryl coenzyme A reductase.
- the method to produce quillaic acid in yeast further comprises the introduction of a chimeric gene encoding a NADPH-cytochrome P450 reductase.
- the invention provides a method to produce sapofectosid in Saponaria or Gypsophila species comprising introducing the chimeric gene: a. a plant-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and transcription termination and polyadenylation signals, or the chimeric gene b.
- a plant-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and transcription termination and polyadenylation signals, by transformation into hairy roots of a Saponaria or Gypsophila species, cultivating said transformed hairy roots and isolating sapofectosid from said cultivated transformed hairy roots.
- operably linked refers to a functional linkage between the promoter sequence and the gene of interest or a homologue thereof as defined herein above.
- a “chimeric gene” or “chimeric construct” is a recombinant nucleic acid sequence in which a promoter or regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid sequence that codes for an mRNA, such that the regulatory nucleic acid sequence is able to regulate transcription or expression of the associated nucleic acid coding sequence and a terminator sequence.
- the regulatory nucleic acid sequence of the chimeric gene is not normally operatively linked to the associated nucleic acid sequence as found in nature.
- terminal encompasses a control sequence which is a DNA sequence at the end of a transcriptional unit which signals 3' processing and polyadenylation of a primary transcript and termination of transcription.
- the terminator can be derived from the natural gene, from a variety of other plant or yeast genes, or from T-DNA.
- the terminator to be added may be derived from, for example, the nopaline synthase or octopine synthase genes, or alternatively from another plant gene.
- “Selectable marker”, “selectable marker gene” or “reporter gene” includes any gene that confers a phenotype on a cell in which it is expressed to facilitate the identification and/or selection of cells that are transfected or transformed with a nucleic acid construct of the invention. These marker genes enable the identification of a successful transfer of the nucleic acid molecules via a series of different principles. Suitable markers may be selected from markers that confer antibiotic or herbicide resistance, that introduce a new metabolic trait or that allow visual selection.
- selectable marker genes include genes conferring resistance to antibiotics (such as nptll that phosphorylates neomycin and kanamycin, or hpt, phosphorylating hygromycin, or genes conferring resistance to, for example, bleomycin, streptomycin, tetracyclin, chloramphenicol, ampicillin, gentamycin, geneticin (G418), spectinomycin or blasticidin), to herbicides (for example bar which provides resistance to Basta*; aroA or gox providing resistance against glyphosate, or the genes conferring resistance to, for example, imidazolinone, phosphinothricin or sulfonylurea), or genes that provide a metabolic trait (such as manA that allows plants to use mannose as sole carbon source or xylose isomerase for the utilisation of xylose, or antinutritive markers such as the resistance to 2-deoxyglucose).
- antibiotics such as nptll that phosphorylates
- Visual marker genes results in the formation of colour (for example p-glucuronidase, GUS or p- galactosidase with its coloured substrates, for example X-Gal), luminescence (such as the luciferin/luciferase system) or fluorescence (Green Fluorescent Protein, GFP, and derivatives thereof).
- colour for example p-glucuronidase, GUS or p- galactosidase with its coloured substrates, for example X-Gal
- luminescence such as the luciferin/luciferase system
- fluorescence Green Fluorescent Protein
- nucleic acid molecules encoding a selectable marker can be introduced into a host cell on the same vector that comprises the sequence encoding the polypeptides of the invention or used in the methods of the invention, or else in a separate vector. Cells which have been stably transfected with the introduced nucleic acid can be identified for example by selection (for example, cells which have integrated the selectable marker survive whereas the other cells die).
- the process according to the invention for introducing the nucleic acids advantageously employs techniques which enable the removal or excision of these marker genes.
- One such a method is what is known as co-transformation.
- the co- transformation method employs two vectors simultaneously for the transformation, one vector bearing the nucleic acid according to the invention and a second bearing the marker gene(s).
- a large proportion of transformants receives or, in the case of plants, comprises (up to 40% or more of the transformants), both vectors.
- the transformants usually receive only a part of the vector, i.e.
- the marker genes can subsequently be removed from the transformed plant by performing crosses.
- marker genes integrated into a transposon are used for the transformation together with desired nucleic acid (known as the Ac/Ds technology).
- the transformants can be crossed with a transposase source or the transformants are transformed with a nucleic acid construct conferring expression of a transposase, transiently or stable.
- the transposon jumps out of the genome of the host cell once transformation has taken place successfully and is lost.
- the transposon jumps to a different location. In these cases the marker gene must be eliminated by performing crosses.
- Cre/lox Crel is a recombinase that removes the sequences located between the loxP sequences. If the marker gene is integrated between the loxP sequences, it is removed once transformation has taken place successfully, by expression of the recombinase.
- Crel is a recombinase that removes the sequences located between the loxP sequences. If the marker gene is integrated between the loxP sequences, it is removed once transformation has taken place successfully, by expression of the recombinase.
- Further recombination systems are the HIN/HIX, FLP/FRT and REP/STB system (Tribble et al., J. Biol.
- transgenic means with regard to, for example, a nucleic acid sequence, an expression cassette, gene construct or a vector comprising the nucleic acid sequence or an organism transformed with the nucleic acid sequences, expression cassettes or vectors according to the invention.
- transgenic plant for the purposes of the invention is thus understood as meaning, as above, that the nucleic acids used in the method of the invention are not present in, or originating from, the genome of said plant, or are present in the genome of said plant but not at their natural locus in the genome of said plant, it being possible for the nucleic acids to be expressed homologously or heterologously.
- transgenic also means that, while the nucleic acids according to the invention or used in the inventive method are at their natural position in the genome of a plant, the sequence has been modified with regard to the natural sequence, and/or that the regulatory sequences of the natural sequences have been modified.
- Transgenic is preferably understood as meaning the expression of the nucleic acids according to the invention at an unnatural locus in the genome, i.e. homologous or, heterologous expression of the nucleic acids takes place.
- Preferred transgenic plants are mentioned herein.
- sequence identity of two related nucleotide or amino acid sequences, expressed as a percentage, refers to the number of positions in the two optimally aligned sequences which have identical residues (xlOO) divided by the number of positions compared.
- a gap i.e., a position in an alignment where a residue is present in one sequence but not in the other is regarded as a position with non-identical residues.
- the alignment of the two sequences is performed by the Needleman and Wunsch algorithm (Needleman and Wunsch (1970) J Mol Biol.
- sequence alignment can be conveniently performed using standard software program such as GAP which is part of the Wisconsin Package Version 10.1 (Genetics Computer Group, Madison, Wisconsin, USA) using the default scoring matrix with a gap creation penalty of 50 and a gap extension penalty of 3. Sequences are indicated as "essentially similar" when such sequence have a sequence identity of at least about 75%, particularly at least about 80 %, more particularly at least about 85%, quite particularly about 90%, especially about 95%, more especially about 100%, quite especially are identical.
- expression means the transcription of a specific gene or specific genes or specific genetic construct.
- expression in particular means the transcription of a gene or genes or genetic construct into structural RNA (rRNA, tRNA) or mRNA with or without subsequent translation of the latter into a protein. The process includes transcription of DNA and processing of the resulting mRNA product.
- introduction or “transformation” as referred to herein encompass the transfer of an exogenous polynucleotide into a host cell, irrespective of the method used for transfer.
- Plant tissue capable of subsequent clonal propagation may be transformed with a genetic construct of the present invention and a whole plant regenerated there from.
- the particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular species being transformed.
- tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), hairy roots and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem).
- the polynucleotide may be transiently or stably introduced into a host cell and may be maintained non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the host genome.
- the resulting transformed plant cell may then be used to regenerate a transformed plant in a manner known to persons skilled in the art.
- Transformation of plant species is now a fairly routine technique.
- any of several transformation methods may be used to introduce the gene of interest into a suitable ancestor cell.
- the methods described for the transformation and regeneration of plants from plant tissues or plant cells may be utilized for transient or for stable transformation. Transformation methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA directly into the plant, particle gun bombardment, transformation using viruses or pollen and microprojection. Methods may be selected from the calcium/polyethylene glycol method for protoplasts (Krens, F.A. et al., (1982) Nature 296, 72-74; Negrutiu I et al.
- Transgenic plants including transgenic crop plants, are preferably produced via Agrobacterium-mediated transformation.
- An advantageous transformation method is the transformation in planta.
- agrobacteria to act on plant seeds or to inoculate the plant meristem with agrobacteria. It has proved particularly expedient in accordance with the invention to allow a suspension of transformed agrobacteria to act on the intact plant or at least on the flower primordia. The plant is subsequently grown on until the seeds of the treated plant are obtained (Clough and Bent, Plant J. (1998) 16, 735-743).
- Methods for Agrobacterium-mediated transformation of rice include well known methods for rice transformation, such as those described in any of the following: European patent application EP1198985, Aldemita and Hodges (Planta 199: 612-617, 1996); Chan et al.
- nucleic acids or the construct to be expressed is preferably cloned into a vector, which is suitable for transforming Agrobacterium tumefaciens, for example pBinl9 (Bevan et al (1984) Nucl. Acids Res. 12-8711).
- Agrobacteria transformed by such a vector can then be used in known manner for the transformation of plants, such as plants used as a model, like Arabidopsis (Arabidopsis thaliana is within the scope of the present invention not considered as a crop plant), or crop plants such as, by way of example, tobacco plants, for example by immersing bruised leaves or chopped leaves in an agrobacterial solution and then culturing them in suitable media.
- plants used as a model like Arabidopsis (Arabidopsis thaliana is within the scope of the present invention not considered as a crop plant), or crop plants such as, by way of example, tobacco plants, for example by immersing bruised leaves or chopped leaves in an agrobacterial solution and then culturing them in suitable media.
- the transformation of plants by means of Agro bacterium tumefaciens is described, for example, by Hofgen and Willmitzer in Nucl. Acid Res. (1988) 16, 9877 or is known inter alia from F.F. White
- the transformation of the chloroplast genome is generally achieved by a process which has been schematically displayed in Klaus et al., 2004 [Nature Biotechnology 22 (2), 225-229], Briefly the sequences to be transformed are cloned together with a selectable marker gene between flanking sequences homologous to the chloroplast genome. These homologous flanking sequences direct site specific integration into the plastome. Plastidal transformation has been described for many different plant species and an overview is given in Bock (2001) Transgenic plastids in basic research and plant biotechnology. J Mol Biol. 2001 Sep 21; 312 (3):425-38 or Maliga, P (2003) Progress towards commercialization of plastid transformation technology. Trends Biotechnol. 21 , 20-28. Further biotechnological progress has recently been reported in form of marker free plastid transformants, which can be produced by a transient co-integrated maker gene (Klaus et al., 2004, Nature Biotechnology 22(2), 225-229).
- the genetically modified plant cells can be regenerated via all methods with which the skilled worker is familiar. Suitable methods can be found in the abovementioned publications by S.D. Kung and R. Wu, Potrykus or Hofgen and Willmitzer.
- plant cells or cell groupings are selected for the presence of one or more markers which are encoded by plant-expressible genes co-transferred with the gene of interest, following which the transformed material is regenerated into a whole plant.
- the plant material obtained in the transformation is, as a rule, subjected to selective conditions so that transformed plants can be distinguished from untransformed plants.
- the seeds obtained in the above-described manner can be planted and, after an initial growing period, subjected to a suitable selection by spraying.
- a further possibility consists in growing the seeds, if appropriate after sterilization, on agar plates using a suitable selection agent so that only the transformed seeds can grow into plants.
- the transformed plants are screened for the presence of a selectable marker such as the ones described above.
- putatively transformed plants may also be evaluated, for instance using Southern analysis, for the presence of the gene of interest, copy number and/or genomic organisation.
- expression levels of the newly introduced DNA may be monitored using Northern and/or Western analysis, both techniques being well known to persons having ordinary skill in the art.
- the generated transformed plants may be propagated by a variety of means, such as by clonal propagation or classical breeding techniques.
- a first generation (or Tl) transformed plant may be selfed and homozygous second-generation (or T2) transformants selected, and the T2 plants may then further be propagated through classical breeding techniques.
- the generated transformed organisms may take a variety of forms. For example, they may be chimeras of transformed cells and nontransformed cells; clonal transformants (e.g., all cells transformed to contain the expression cassette); grafts of transformed and untransformed tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion).
- plant as used herein encompasses whole monocotyledonous and dicotyledonous plants, ancestors and progeny of the plants and plant parts, including seeds, shoots, stems, leaves, roots (including tubers), flowers, and tissues and organs, wherein each of the aforementioned comprise the gene/nucleic acid of interest.
- plant also encompasses monocotyledonous and dicotyledonous plant cells, suspension cultures, callus tissue, hairy roots, embryos, meristematic regions, gametophytes, pollen and microspores, again wherein each of the aforementioned comprises the gene/nucleic acid of interest.
- expression cassette refers to any recombinant expression system for the purpose of expressing a nucleic acid sequence of the invention in vitro or in vivo, constitutively or inducibly, in any cell, including, in addition to plant cells, prokaryotic, yeast, fungal, insect or mammalian cells.
- the term includes linear and circular expression systems.
- the term includes all vectors.
- the cassettes can remain episomal or integrate into the host cell genome.
- the expression cassettes can have the ability to selfreplicate or not (i.e., drive only transient expression in a cell).
- the term includes recombinant expression cassettes that contain only the minimum elements needed for transcription of the recombinant nucleic acid.
- S. officinalis transcriptome datasets produced in-house and comprising different organs, sampled in response to hormonal treatments at multiple time points. More specifically, leaves, stems and roots from hydroponically grown S. officinalis plants were sampled in triplicate at six and twenty-four hours after mock or 50 pM Methyl Jasmonate treatment. S. officinalis seeds were sourced from the seed company Jelitto (https://www.ielitto.com/, Schwarmstedt, Germany).
- RNA sequencing datasets were used to draft an S. officinalis reference transcriptome as well as to conduct gene expression analysis. To this end, RNA was extracted from the sampled S. officinalis tissues using the ReliaPrep RNA miniprep Systems (PromegaTM), following manufacturer's instructions for fibrous tissues. RNA was single-end sequenced via Illumina HiSeq 6000 with single-end read lengths of 100 bp for gene expression profiling. RNA was also used as template for cDNA synthesis using qScript cDNA SuperMix (QuantaBioTM) and for the cloning of candidate genes. The S.
- transcripts levels were estimated by mapping single-end reads on the S. officinalis transcriptome using Salmon [2] implemented on a Galaxy pipeline.
- transcripts that were lowly expressed or with zero expression variance across samples were removed and the remaining genes were clustered using Self Organizing Map (SOM) implemented and visualized in R using the Kohonen-package [3] to assemble clusters of transcripts characterized by similar expression trends.
- SOM Self Organizing Map
- SoBAS_1421 a candidate p-amyrin synthase that was therefore selected as the main bait for co-expression analysis.
- BAS is the key enzyme that commits linear triterpene precursors towards biosynthesis of oleanane-type pentacyclic triterpenes via cyclization of 2,3- oxidosqualene.
- SoBAS_1421 full coding sequence and predicted protein are reported as SEQ ID NOs: 1 and 2.
- P450s cytochrome P450s
- SoBAS_1421 Four full length-length transcripts corresponding to cytochrome P450s (P450s) were identified among the genes co-expressed with SoBAS_1421.
- P450s proteins belong to a broad superfamily of enzymes catalyzing oxidation of various substrates.
- Phylogenetic analysis was performed to infer the function of the four candidate P450s from S. officinalis based on the comparison of protein sequences with P450s from other plant species reported to be involved in the oxidation of triterpenes.
- SoP450_1944 The first P450 (hereafter referred to as SoP450_1944, see SEQ ID NO: 3 for the coding sequence) showed phylogenetic relations with plant P450s belonging to the CYP716 family that had been characterized as P-amyrin C-28 oxidases, suggesting that SoP450_1944 may catalyze the same reaction in S. officinalis cells.
- the full coding sequence and predicted protein sequences for SoP450_1944 are reported as SEQ ID NOs: 3 and 4.
- SoP450_1049 The second P450 (hereafter referred to as SoP450_1049, see SEQ ID NO: 7 for the coding sequence) showed sequence similarities with CYP716A141 from Panax ginseng, the latter being the only example reported so far of an enzyme capable of oxidizing the C-28 of position of -amyrin and the C-16 p position of oleanolic acid.
- SoP450_2010 The third P450 identified (hereafter referred to as SoP450_2010, see SEQ ID NO: 5 for the coding sequence) was found to be neighboring P450s involved in oxidation of position C23 of pentacyclic triterpenes.
- the full nucleotide and predicted protein sequences of SoP450_2010 are reported as SEQ ID NOs: 7 and 8.
- SoP450_6085 the fourth P450 identified (hereafter referred to as SoP450_6085, see SEQ ID NO: 9 for the coding sequence) was found to be neighboring P450s involved in oxidation of position C23 of pentacyclic triterpenes.
- SEQ ID NOs: 9 and 10 The full nucleotide and predicted protein sequences of SoP450_6085 are reported as SEQ ID NOs: 9 and 10.
- UDP-glycosyltransferase (UGT) encoding gene SoUGT_2488 was identified in S. officinalis by sequence homology with previously reported UGTs from other plant species [4] as a gene encoding for an enzyme involved in transferring the first UDP-sugar (UDP-Glucuronic Acid) on to the C-3 position of quillaic acid. Therefore it may play a role in committing flux of aglycones towards the biosynthesis of highly-glycosylated oleanane-type saponins.
- SotHMGR (SEQ ID NOs: 19 and 20) was instead identified and cloned as truncated feedback-insensitive version of 3-hydroxy-3-methylglutaryl coenzyme A reductase (tHMGR), an enzyme known to catalyze the rate-limiting step in the biosynthesis of triterpene precursors [5], Lastly the gene SoSQE2 (SEQ ID Nos: 21 and 22) was identified as squalene epoxidase, an enzyme known to catalyze the stereospecific conversion of squalene to 2,3(S)- oxidosqualene, a key precursor in triterpenoid saponins. Therefore, co-expression of SoSQE2 together with other triterpene biosynthetic enzymes can boost production of precursors and ultimately triterpenoids yield as previously reported (Dong L. et al (2016) Metab Eng. 49:1-12).
- Example 2 Cloning candidate genes from S. officinalis and generation of expression constructs for functional analysis
- Each of the seven target genes described above was amplified by PCR using specific primer sets (see Table 1).
- Each of the primers included attB adapter sequences at the 5' end to allow directional cloning in appropriate Gateway® vectors (indicated in red in Table 1).
- PCR thermal cycling involved initial denaturation at 98°C (30 sec) followed by 3 cycles of denaturation (98°C, 10 sec) annealing (61°C, 20 sec), extension (72°C, 1 min 10 sec), followed by 30 cycles using the same conditions but increasing the annealing temperature to 70° C, with a final extension at 72° C (5 min).
- the amplification protocol was slightly modified involving initial denaturation at 98°C (30 sec) followed by 10 cycles of denaturation (98°C, 10 sec), annealing (52°C, 20 sec), and extension (72°C, 1 min 30 sec), then followed by 25 cycles using the same conditions but increasing the annealing temperature to 60° C, with a final extension at 72° C (5 min).
- the same conditions were used to amplify SoUGT_2488 and SotHMGR.
- thermal cycling was set up to include 98°C (30 sec) followed by 35 cycles of denaturation (98°C, 10 sec), annealing (60°C, 20 sec), extension (72°C, 40 sec), and a final extension at 72° C (5 min).
- SoBAS_1421 was cloned into pESC (uracil selection). This vector was modified to be used in gateway recombination systems, as described previously [6], The same plasmid also contained a feedback-insensitive truncated version of 3-hydroxy-3-methylglutaryl coenzyme A reductase 1 (tHMGRl) from Medicago truncatula.
- SoP450_1944, SoP450_2010 and SoP450_1049 were cloned into the Gateway-compatible yeast vectors pAG424GAL (tryptophan selection), pAG425GAL (leucine selection) and pAG427GAL (methionine selection), respectively.
- SoP450_6085 was cloned in all three different vectors (pAG424GAL, pAG425GAL and pAG427GAL) to allow combination with each of the others P450s.
- a partner reductase enzyme known as Cytochrome P450 reductase (MtMTRl - MTR_3gl00160, SEQ. ID NOs: 15 and 16) from M. truncatula was cloned into pAG423GAL (histidine selection). All vectors contain galactose-inducible promoters driving the expression of inserted genes.
- yeast strain used in this study was derived from BY4272 (genotype: MATa; his3Al; leu2A0; ura3A0; lys2A0; trplAO; metl5A0; PAHl-Ob; Perg7::PMET3- ERG7), which contains five auxotrophic selection markers (-URA/-HIS/-LEU/-MET/-TRP) and therefore allows expression of genes from up to five plasmids.
- the transformed yeast strains were selected on solid synthetic yeast media with appropriate supplements. Selected yeast strains were cultured in synthetic liquid medium with Galactose as the only carbon source and incubated for 7 days at 30°C. Methyl-p-cyclodextrin were added to the liquid medium at day two and day four during cultivation to reach a final concentration of 10 mM.
- Methyl-P- cyclodextrins are cyclic oligosaccharides that are able to sequester apolar triterpenes from yeast cells into the liquid medium thus avoiding possible toxicity and feedback inhibition by pathway intermediates, and thereby increase their production yield [7]
- Strains were pelleted by centrifugation and metabolites were extracted, separately for cell pellets and liquid culture medium, by liquid-liquid separation using n- hexane and ethyl acetate. Organic phases were collected, lyophilized by vacuum and analyzed by GC- MS.
- SoBAS 1421 is a monofunctional P-amyrin synthase from S. officinalis
- GC-MS analysis revealed that all strains, except the negative control, accumulated a peak at 15.48 corresponding to p-amyrin (Fig. 3 - results are shown for strain 1, 2, and 3). It is worth to mention that except strain 3 (carrying SoBAS_1421) and strain 11 (the negative control carrying empty vectors), all other strains carried GgBAS, a reported p-amyrin synthase from Glycyrrhiza glabra (SEQ ID NOs: 11 and 12).
- Strain 4 expressed a single P450 (either SoP450_1944, SoP450_2010 or SoP450_1049 respectively) together with tHMGRl, GgBAS and MtMTRl.
- Strain 4 (SoP450_1944) was found to accumulate erythrodiol (17.27 min), the C-28 alcohol of p-amyrin in the pellet (Fig. 4), whereas the medium (Fig. 5) contained oleanolic acid (18.37 min) and all its intermediates derived from C-28 p-amyrin oxidation such as erythrodiol (17.25 min) and oleanolic aldehyde (18.83 min).
- SoP450_1944 corresponds to a p-amyrin C-28 oxidase.
- strain 5 In both pellet and medium (Fig. 4 and 5) of strain 5 (SoP450_2010) , only a peak corresponding to 23-hydroxy-p-amyrin (16.66 min) was detected, confirming this enzyme as a C-23 oxidase.
- strain 13 expressing SoP450_6085 was found to produce only p-amyrin showing that this C-23 oxidase (SoP450_6085) is not able to function on such compound suggesting that perhaps it accepts only substrates with a higher degree of oxidation (Fig. 5).
- gypsogenin is characterized by aldehyde function at position 23 while the same carbon is further oxidized to carboxy group in gypsogenic acid.
- SoP450_2010 is able to accept oleanolic acid as a substrate and confirm its activity to be specifically oxidizing the carbon at position 23.
- strains 14 (Fig. 13) expressing the second candidate C-23 oxidase (SoP450_6085) together with the C-28 oxidase (SoP450_1944) did not produce any additional compound from those produced by strain 4 expressing SoP450_1944 solely.
- SoP450_1944 and SoP450_1049 in strain 8 did not lead to accumulation of any additional products, either in the pellet or liquid medium (Fig. 9 and 10), as compared to the yeast strains expressing the same enzymes individually. This is due to the fact that SoP450_1049 already appeared to have both C-28 and C-16 oxidase activity.
- Yeast strain 18 differed from yeast strain 10 by substitution of SoP450_2010 with another C-23 oxidase, SoP450_6085. Interestingly, and unexpectedly, this combination did not yield any new compound as compared to combination 8 (expressing SoP450_1944 and SoP450_1049) or combination 14 (expressing SoP450_1944 and SoP450_6085). Likewise, strain 17, expressing C-28 oxidase SoP450_1944 together with both C-23 oxidases (SoP450_2010 and SoP450_6085) yielded the same metabolites as strain 7 carrying SoP450_1944 and SoP450_2010 only.
- strain 17 unexpectedly only gypsogenin was detected thus demonstrating that concomitant expression of both C-23 oxidases would promote that oxidation of the C-23 position stops at the aldehyde moiety (gypsogenin) reducing the production of the more oxidized carboxy group (gypsogenic acid).
- the first C-23 oxidase (SoP450_2010) is capable of oxidizing the C-23 position into an alcohol moiety producing caulophyllogenin that is readily converted by the second C-23 enzyme (SoP450_6085) into quillaic acid (Fig. 15).
- the metabolite extract from yeast strain 19 was additionally analyzed by UPLC-MS analysis using a high resolution protocol that allows to distinguish more clearly between quillaic acid and gypsogenic acid (Fig. 15, second panel), confirming that the peak produced by yeast strain 19 unambiguously matches the quillaic acid standard.
- Example 4 Overexpression of newly discovered enzymes in transgenic S. officinalis hairy root lines
- the transgenes selected for ectopic overexpression in S. officinalis included three of the newly discovered S. officinalis genes involved in triterpene production, namely SoBAS_1421, SoP450_1049 and SoUGT_2488.
- the coding sequence of each gene was cloned into the Gateway-compatible binary vector pK7WG2D [8], For each transgene four independent hairy root lines were cultured, analyzed and compared with four independent control lines (ectopically expressing the metabolically non-active green fluorescent protein GFP).
- Example 5 Overexpression of newly discovered enzymes in transgenic Gypsophila elegans hairy root lines
- Transgenic G.elegans hairy root lines were established using Agrobacterium rhizogenes infection of G.elegans seedlings, following a previously established protocol (Pollier J. et al (2019) Plant J. 99:637- 654). These G. elegans hairy roots were cultivated for a span of two months in liquid Murashige and Skoog medium supplemented with vitamins and 1.5% sucrose.
- the chosen transgenes for ectopic overexpression in G. elegans included each of the newly identified S. officinalis P450 genes linked to triterpene production. This involved both individual (namely SoP450_1944, SoP450_6085, SoP450_2010, SoP450_1049) and collective overexpression in a binary vector equipped with seven distinct transcriptional cassettes. These cassettes encompassed the aforementioned P450s as well as SoBAS_1421, SoSQE2, and eGFP, serving as a visual transformation marker. For this purpose, the Golden Gibson assembly method was employed for binary vector construction, enabling the incorporation of multiple transcriptional cassettes, as previously described (Aesaert S et al. 2022) Front. Plant Sci 13:883847).
- the Gateway-compatible binary vector pK7WG2D was utilized, as outlined in Example 4 for S. officinalis hairy roots .
- three separate hairy root lines were cultivated, analyzed, and compared against three distinct control lines (expressing solely the metabolically inactive green fluorescent protein, GFP).
- SEQ ID 21 SoSQE2 Saponaria officinalis, Squalene Epoxidase 2 coding sequence (1557 bps)
- Goossens A Combinatorial biosynthesis of sapogenins and saponins in Saccharomyces cerevisiae using a C-16alpha hydroxylase from Bupleurum falcatum. Proc Natl Acad Sci U S A 2014, 111:1634-1639.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Nutrition Science (AREA)
- Cell Biology (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The present invention provides novel chimeric genes for the production of quillaic acid and derivatives thereof in eukaryotic cells such as yeasts and plants.
Description
MEANS AND METHODS TO PRODUCE TRITERPENE SAPONINS IN EUKARYOTIC CELLS
Field of the invention
The present invention relates to the field of plant secondary metabolites, more specifically to the field of saponins, even more specifically to the field of quillaic acid and derivatives thereof. The present invention provides chimeric genes and expression vectors for producing quillaic acid in native and heterologous host cells, such as plants and yeasts.
Introduction to the invention
Saponins, glycosides widely distributed in the plant kingdom, include a diverse group of compounds characterized by their structure containing a steroidal or triterpenoid aglycone and one or more sugar chains. Their structural diversity is reflected in their physicochemical and biological properties, which are exploited in a number of traditional (as soaps, fish poison, and molluscicides) and industrial applications. Recent research has established saponins as the active components in many herbal medicines and highlighted their contributions to the health benefits of foods such as soybeans and garlic. Saponins are glycosides containing one or more sugar chains on a triterpene or steroid aglycone backbone also called a sapogenin. They are categorized according to the number of sugar chains in their structure as mono, bi-, or tridesmosidic. Monodesmosidic saponins have a single sugar chain, normally attached at C-3. Bidesmosidic saponins have two sugar chains, often with one attached through an ether linkage at C-3 and one attached through an ester linkage at C-28 (triterpene saponins) or an ether linkage at C-26 (furastanol saponins). The most common monosaccharides include: D-glucose (Glc), D-galactose (Gal), D-glucuronic acid (GlcA), D-galacturonic acid (GalA), L-rhamnose (Rha), L-arabinose (Ara), D-xylose (Xyl), and D-fucose (Fuc). The nature of the aglycone and the functional groups on the aglycone backbone and number and nature of the sugars can vary greatly resulting in a very diverse group of compounds.
Quillaic acid is the major aglycone of the widely studied saponins of the Chilean indigenous tree Quillaja saponaria. Several glucoside modified variants of quillaic acid are known. One is the saponin QS-21 which is known as a potent adjuvant for CTL induction, and induces Thl cytokines (see Kensil CR et al (1995) in Powel, MF, Newman, MJ (Eds), Vaccine Design, p. 525-541). Another glucoside modified variant is sapofectosid (also known as SO1861) which is known as a transfection reagent (see Sama S et al (2017) Int. J. of Pharmaceutics 534, 195-205) and a potent endosomal escape molecule. Quillaja saponins and the extensive functionalization of this triterpenoid provide unique opportunities for structural modification and pose a challenge from the standpoint of selectivity in regard to one or the other secondary alcohol group, the aldehyde, and the carboxylic acid functions. The availability of quillaic acid
and its modified variants is limited and chemical synthesis is cumbersome and expensive. It would be an advantage to be able to produce quillaic acid and modified versions thereof in eukaryotic organisms such as for example yeasts and plants. WO2019/122259 describes the isolation of nucleic acid sequences encoding biosynthetic enzymes from Quillaja saponaria which when expressed in heterologous hosts can produce precursors of quillaic acid. The instant invention has isolated alternative enzymes from Saponaria officinalis which can synthesize quillaic acid in heterologous eukaryotic hosts such as yeasts. Interestingly two of the isolated nucleic acids, from the instant invention, each of them when overexpressed as a chimeric gene in hairy roots of Saponaria lead to the production of sapofectosid (SO1861).
Legends to the figures
Figure 1: Expression trends for selected transcripts. The left and right panels depict data as scaled expression values, and transcript abundances as Counts Per Million (CPM)
Figure 2: Agarose gel electrophoresis showing DNA bands corresponding to amplified target genes from cDNA. Target genes are reported within the grey boxed below the corresponding gel bands. J = Jasmonate, 6 = six hours after treatment, L = leaves, R = roots, P = plantlets.
Figure 3: GC-MS analysis revealed that all strains, except the negative control, accumulated a peak at 15.48 corresponding to p-amyrin - results are shown for strain 1, 2, and 3.
Figure 4: recombinant yeast strains 4, 5 and 6 expressed a single P450 (either SoP450_1944, SoP450_2010 or SoP450_1049 respectively) together with tHMGRl, GgBAS and MtMTRl. Recombinant yeast strain 4 (SoP450_1944) was found to accumulate erythrodiol (17.27 min), the C-28 alcohol of p- amyrin in the pellet.
Figure 5: recombinant yeast strains 4, 5 and 6 expressed a single P450 (either SoP450_1944, SoP450_2010 or SoP450_1049 respectively) together with tHMGRl, GgBAS and MtMTRl. Recombinant yeast strain 4 (SoP450_1944) was found to accumulate oleanolic acid (18.37 min) and all its intermediates derived from C-28 p-amyrin oxidation such as erythrodiol (17.25 min) and oleanolic aldehyde (18.83 min).
Figure 6: Recombinant yeast strain 6 (expressing SoP450_1049) didn't produce any additional peak in the pellet (see Fig. 4), whereas in the medium erythrodiol (17.25 min), oleanolic acid (18.37 min) and echinocystic acid (19.00 min) were detected. The latter compound corresponds to 16-hydroxy-oleanolic acid and therefore leads to the identification of SoP450_1049 as an enzyme with a dual function as a C- 28/C-16 oxidase.
Figure 7: When the two enzymes SoP450_1944 and SoP450_2010 were combined in recombinant yeast strain 7 , no additional products were detected in the cell pellet, though the levels of erythrodiol (17.27 min) and 23-hydroxy-p-amyrin (16.66 min) were notably reduced.
Figure 8: Analysis of liquid medium of recombinant yeast strain 7 revealed that, besides the products already detected in yeast strains 4 (SoP450_1944) and 5 (SoP450_2010) expressing each of these two P450s individually, gypsogenin (22.53 min) and gypsogenic acid (23.44 min) were present as additional products. Both of these compounds result from a C-23 oxidation of oleanolic acid: gypsogenin is characterized by aldehyde function at position 23 while the same carbon is further oxidized to carboxy group in gypsogenic acid.
Figure 9: Combination of SoP450_1944 and SoP450_1049 in strain 8 did not lead to accumulation of any additional products in the pellet as compared to the yeast strains expressing the same enzymes individually. This is because SoP450_1049 already appeared to have both C-28 and C-16 oxidase activity.
Figure 10: Combination of SoP450_1944 and SoP450_1049 in strain 8 did not lead to accumulation of any additional products in liquid medium as compared to the yeast strains expressing the same enzymes individually. This is because SoP450_1049 already appeared to have both C-28 and C-16 oxidase activity.
Figure 11: No additional products were detected in the pellet of recombinant yeast strain 9 co-expressing SoP450_2010 and SoP450_1049, as compared to strains 5 (SoP450_2010) and 6 (SoP450_1049) expressing each enzyme individually.
Figure 12: No additional products were detected in the medium of yeast strain 9 co-expressing SoP450_2010 and SoP450_1049, as compared to strains 5 (SoP450_2010) and 6 (SoP450_1049) expressing each enzyme individually.
Figure 13: When the three P450 enzymes SoP450_1944, SoP450_1049 and SoP450_2010 were coexpressed together, in recombinant yeast strain 10, no new peaks appeared in the cell pellet although P-amyrin (15.38 min) was reduced, thus suggesting further conversion. This was confirmed by analysis of the yeast medium for the same strain, which showed a new peak (22.67 min) detected only in strain 10.
Figure 14: No additional peaks were identified in recombinant strain 16 by concomitant expression of SoP450_6085 and SoP450_1049, as compared to strain 6 expressing only SoP450_6085.
Figure 15: The most unexpected result was obtained by metabolite profiling of recombinant yeast strain 19 where the dual specificity C-28/C-16 oxidase (SoP450_1049) was co-expressed with both C-23 oxidases (SoP450_2010 and SoP450_6085). This recombinant yeast strain, beside all the products
derived from C-28 and C-16 oxidation of p-amyrin (such as erythrodiol, oleanolic and echinocystic acid), was also found to synthesize 16-hydroxy-gypsogenic acid (similarly to recombinant yeast strain 10) but differently it also accumulated quillaic acid and traces of gypsogenic acid.
Figure 16: content of sapofectosid in S. officinalis hairy root lines generated in this invention
Figure 17: Schematic representation of the triterpene aglycone biosynthetic pathway in S. officinalis
Figure 18: content of selected saponins in G. elegans hairy root lines generated in this invention. Normalized intensities have been obtained by dividing the peak are of the compound of interest by the peak are of internal standard.
Detailed description of the invention
To facilitate the understanding of this invention a number of terms are defined below. Terms defined herein (unless otherwise specified) have meanings as commonly understood by a person of ordinary skill in the areas relevant to the present invention. As used in this specification and its appended claims, terms such as "a", "an" and "the" are not intended to refer to only a singular entity but include the general class of which a specific example may be used for illustration, unless the context dictates otherwise. The terminology herein is used to describe specific embodiments of the invention, but their usage does not delimit the invention, except as outlined in the claims.
In the present invention we have identified biosynthetic enzymes in Saponaria officinalis, which enzymes when expressed as chimeric genes in an eukaryotic host, lead to the production of quillaic acid. We have surprisingly shown that for the first time quillaic acid can be produced in the yeast Saccharomyces cerevisiae by expression of the identified chimeric genes. Even more surprisingly the isolated biosynthetic enzymes comprise an unusual combination of a dual specificity C-16/C-28 oxidase and two different C-23 oxidases. Even more surprisingly the overexpression of only a chimeric gene encoding the isolated beta-amyrin synthase or only a chimeric gene encoding the C-28/C-16 oxidase in Saponaria hairy roots results in an increase of SO1861 (or sapofectosid) levels in said hairy roots.
Accordingly, the present invention provides in a first embodiment a eukaryotic cell comprising at least one of the following chimeric genes: a. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and transcription termination and polyadenylation signals,
b. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and transcription termination and polyadenylation signals, c. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and transcription termination and polyadenylation signals, d. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and transcription termination and polyadenylation signals.
In yet another embodiment the eukaryotic cell further comprises a chimeric gene encoding an N- terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase.
Non-limiting examples of N-terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductases are depicted in the nucleotide sequences SEQ ID NO: 13 and 19.
In yet another embodiment the eukaryotic cell further comprises a chimeric gene encoding a NADPH- cytochrome P450 reductase. A non-limiting example of a NADPH-cytochrome P450 oxidase is depicted in the nucleotide sequence SEQ ID NO: 15.
It is understood that eukaryotic cells can be higher or low eukaryotic cells. Higher eukaryotic cells comprise plant cells and animal cells. Lower eukaryotic cells comprise yeast and fungal cells. Yeast can for example be from the genus Saccharomyces, Pichia, Yarrowia, Hansenula or Kluyveromyces. Fungal cells can for example be from the genus Aspergillus or Trichoderma.
SEQ ID NO: 2 is a beta-amyrin synthase isolated from Saponaria officinalis.
SEQ ID NO: 6 is a combined C-28/C-16 oxidase of beta-amyrin isolated from Saponaria officinalis.
SEQ ID NO: 8 and 10 are two different C-23 oxidases of beta-amyrin isolated from Saponaria officinalis.
In another embodiment the invention provides a chimeric gene construct comprising the following operably linked DNA elements: a) a promoter region which is active in plant or yeast cells, b) a DNA region encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and c) a 3' end region comprising transcription termination and polyadenylation signals functioning in cells of a plant or yeast.
In another embodiment the invention provides a chimeric gene construct comprising the following operably linked DNA elements: a) a promoter region which is active in eukaryotic cells such as plant or yeast cells, b) a DNA region encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with
95% identity over the total length of SEQ ID NO: 8 and c) a 3' end region comprising transcription termination and polyadenylation signals functioning in cells of a plant or yeast.
In another embodiment the invention provides a chimeric gene construct comprising the following operably linked DNA elements: a) a promoter region which is active in yeast or plant cells, b) a DNA region encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and c) a 3' end region comprising transcription termination and polyadenylation signals functioning in cells of yeast or plants.
In another embodiment the invention provides a yeast cell which yeast cell comprises the following chimeric genes: a. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and a terminator sequence, b. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and a terminator sequence, c. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and a terminator sequence, d. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and a terminator sequence.
In yet another embodiment the yeast cell further comprises a chimeric gene encoding an amino-terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase.
In yet another embodiment the yeast cell further comprises a chimeric gene encoding an amino-terminal truncated feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase and a chimeric gene encoding a NADPH-cytochrome P450 reductase.
In yet another embodiment the invention provides a plant cell comprising a chimeric gene comprising a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and a terminator sequence, or said plant cell comprises a chimeric gene comprising a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and a terminator sequence.
In a particular embodiment said plant cell is from the genus Saponaria or Gypsophila.
By "promoter" is meant a sequence of nucleotides from which transcription may be initiated of DNA operably linked downstream (i.e. in the 3' direction on the sense strand of double-stranded DNA).
In a particular embodiment, the promoter is an inducible promoter.
The term "inducible" as applied to a promoter is well understood by those skilled in the art. In essence, expression under the control of an inducible promoter is "switched on" or increased in response to an applied stimulus. The nature of the stimulus varies between promoters. Some inducible promoters cause little or undetectable levels of expression (or no expression) in the absence of the appropriate stimulus. Other inducible promoters cause detectable constitutive expression in the absence of the stimulus. Whatever the level of expression is in the absence of the stimulus, expression from any inducible promoter is increased in the presence of the correct stimulus.
Thus, a nucleic acid according to the invention may be placed under the control of an externally (inducible) gene promoter to place expression under the control of the user. An advantage of introduction of a heterologous gene into a plant or yeast cell, is the ability to place expression of the gene under the control of a promoter of choice, in order to be able to influence gene expression, and therefore quillaic or sapofectosid, according to preference.
Suitable promoters which operate in plants include the Cauliflower Mosaic Virus 35S (CaMV 35S). Other examples are disclosed at pg. 120 of Lindsey & Jones (1989)"Plant Biotechnology in Agriculture" Pub. OU Press, Milton Keynes, UK. The promoter may be selected to include one or more sequence motifs or elements conferring developmental and/or tissue-specific regulatory control of expression. Inducible plant promoters include the ethanol induced promoter of Caddick et al (1998) Nature Biotechnology 16: 177-180. Suitable yeast promoters are for example the galactose inducible promoter from Saccharomyces cerevisiae or the methanol-inducible promoters from Pichia pastoris.
In yet another embodiment the invention provides a method to produce quillaic acid in yeast comprising introducing the following chimeric genes: a. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and a polyadenylation and termination sequence,
b. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and a polyadenylation and termination sequence, c. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and a polyadenylation and termination sequence, d. a yeast-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and a polyadenylation and termination sequence, cultivating said transformed yeast and isolating quillaic acid from said cultivated yeast.
In yet another embodiment the method to produce quillaic acid in yeast further comprises the introduction of a chimeric gene encoding an amino-terminal truncated feedback-insensitive 3-hydroxy- 3-methylglutaryl coenzyme A reductase.
In yet another embodiment the method to produce quillaic acid in yeast further comprises the introduction of a chimeric gene encoding a NADPH-cytochrome P450 reductase.
In yet another embodiment the invention provides a method to produce sapofectosid in Saponaria or Gypsophila species comprising introducing the chimeric gene: a. a plant-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and transcription termination and polyadenylation signals, or the chimeric gene b. a plant-expressible promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and transcription termination and polyadenylation signals, by transformation into hairy roots of a Saponaria or Gypsophila species, cultivating said transformed hairy roots and isolating sapofectosid from said cultivated transformed hairy roots.
The term "operably linked" as used herein refers to a functional linkage between the promoter sequence and the gene of interest or a homologue thereof as defined herein above.
A "chimeric gene" or "chimeric construct" is a recombinant nucleic acid sequence in which a promoter or regulatory nucleic acid sequence is operatively linked to, or associated with, a nucleic acid sequence that codes for an mRNA, such that the regulatory nucleic acid sequence is able to regulate transcription or expression of the associated nucleic acid coding sequence and a terminator sequence. The regulatory
nucleic acid sequence of the chimeric gene is not normally operatively linked to the associated nucleic acid sequence as found in nature.
The term "terminator" encompasses a control sequence which is a DNA sequence at the end of a transcriptional unit which signals 3' processing and polyadenylation of a primary transcript and termination of transcription. The terminator can be derived from the natural gene, from a variety of other plant or yeast genes, or from T-DNA. For plant terminators, the terminator to be added may be derived from, for example, the nopaline synthase or octopine synthase genes, or alternatively from another plant gene.
"Selectable marker", "selectable marker gene" or "reporter gene" includes any gene that confers a phenotype on a cell in which it is expressed to facilitate the identification and/or selection of cells that are transfected or transformed with a nucleic acid construct of the invention. These marker genes enable the identification of a successful transfer of the nucleic acid molecules via a series of different principles. Suitable markers may be selected from markers that confer antibiotic or herbicide resistance, that introduce a new metabolic trait or that allow visual selection. Examples of selectable marker genes include genes conferring resistance to antibiotics (such as nptll that phosphorylates neomycin and kanamycin, or hpt, phosphorylating hygromycin, or genes conferring resistance to, for example, bleomycin, streptomycin, tetracyclin, chloramphenicol, ampicillin, gentamycin, geneticin (G418), spectinomycin or blasticidin), to herbicides (for example bar which provides resistance to Basta*; aroA or gox providing resistance against glyphosate, or the genes conferring resistance to, for example, imidazolinone, phosphinothricin or sulfonylurea), or genes that provide a metabolic trait (such as manA that allows plants to use mannose as sole carbon source or xylose isomerase for the utilisation of xylose, or antinutritive markers such as the resistance to 2-deoxyglucose). Expression of visual marker genes results in the formation of colour (for example p-glucuronidase, GUS or p- galactosidase with its coloured substrates, for example X-Gal), luminescence (such as the luciferin/luciferase system) or fluorescence (Green Fluorescent Protein, GFP, and derivatives thereof). This list represents only a small number of possible markers. The skilled worker is familiar with such markers. Different markers are preferred, depending on the organism and the selection method.
It is known that upon stable or transient integration of nucleic acids into plant cells, only a minority of the cells takes up the foreign DNA and, if desired, integrates it into its genome, depending on the expression vector used and the transfection technique used. To identify and select these integrants, a gene coding for a selectable marker (such as the ones described above) is usually introduced into the host cells together with the gene of interest. These markers can for example be used in mutants in which
these genes are not functional by, for example, deletion by conventional methods. Furthermore, nucleic acid molecules encoding a selectable marker can be introduced into a host cell on the same vector that comprises the sequence encoding the polypeptides of the invention or used in the methods of the invention, or else in a separate vector. Cells which have been stably transfected with the introduced nucleic acid can be identified for example by selection (for example, cells which have integrated the selectable marker survive whereas the other cells die).
Since the marker genes, particularly genes for resistance to antibiotics and herbicides, are no longer required or are undesired in the transgenic host cell once the nucleic acids have been introduced successfully, the process according to the invention for introducing the nucleic acids advantageously employs techniques which enable the removal or excision of these marker genes. One such a method is what is known as co-transformation. The co- transformation method employs two vectors simultaneously for the transformation, one vector bearing the nucleic acid according to the invention and a second bearing the marker gene(s). A large proportion of transformants receives or, in the case of plants, comprises (up to 40% or more of the transformants), both vectors. In case of transformation with Agrobacteria, the transformants usually receive only a part of the vector, i.e. the sequence flanked by the T-DNA, which usually represents the expression cassette. The marker genes can subsequently be removed from the transformed plant by performing crosses. In another method, marker genes integrated into a transposon are used for the transformation together with desired nucleic acid (known as the Ac/Ds technology). The transformants can be crossed with a transposase source or the transformants are transformed with a nucleic acid construct conferring expression of a transposase, transiently or stable. In some cases (approx. 10%), the transposon jumps out of the genome of the host cell once transformation has taken place successfully and is lost. In a further number of cases, the transposon jumps to a different location. In these cases the marker gene must be eliminated by performing crosses. In microbiology, techniques were developed which make possible, or facilitate, the detection of such events. A further advantageous method relies on what is known as recombination systems; whose advantage is that elimination by crossing can be dispensed with. The best-known system of this type is what is known as the Cre/lox system. Crel is a recombinase that removes the sequences located between the loxP sequences. If the marker gene is integrated between the loxP sequences, it is removed once transformation has taken place successfully, by expression of the recombinase. Further recombination systems are the HIN/HIX, FLP/FRT and REP/STB system (Tribble et al., J. Biol. Chem., 275, 2000: 22255-22267; Velmurugan et al., J. Cell Biol., 149, 2000: 553-566). A site-specific integration into the plant genome of the nucleic acid sequences according to the invention is possible.
For the purposes of the invention, "transgenic", "transgene" or "recombinant" means with regard to, for example, a nucleic acid sequence, an expression cassette, gene construct or a vector comprising the nucleic acid sequence or an organism transformed with the nucleic acid sequences, expression cassettes or vectors according to the invention.
A transgenic plant for the purposes of the invention is thus understood as meaning, as above, that the nucleic acids used in the method of the invention are not present in, or originating from, the genome of said plant, or are present in the genome of said plant but not at their natural locus in the genome of said plant, it being possible for the nucleic acids to be expressed homologously or heterologously. However, as mentioned, transgenic also means that, while the nucleic acids according to the invention or used in the inventive method are at their natural position in the genome of a plant, the sequence has been modified with regard to the natural sequence, and/or that the regulatory sequences of the natural sequences have been modified. Transgenic is preferably understood as meaning the expression of the nucleic acids according to the invention at an unnatural locus in the genome, i.e. homologous or, heterologous expression of the nucleic acids takes place. Preferred transgenic plants are mentioned herein.
For the purpose of this invention related or orthologous genes of the genes as described herein before can be isolated from the (publicly) available sequence databases. The "sequence identity" of two related nucleotide or amino acid sequences, expressed as a percentage, refers to the number of positions in the two optimally aligned sequences which have identical residues (xlOO) divided by the number of positions compared. A gap, i.e., a position in an alignment where a residue is present in one sequence but not in the other is regarded as a position with non-identical residues. The alignment of the two sequences is performed by the Needleman and Wunsch algorithm (Needleman and Wunsch (1970) J Mol Biol. 48: 443-453) The computer-assisted sequence alignment above, can be conveniently performed using standard software program such as GAP which is part of the Wisconsin Package Version 10.1 (Genetics Computer Group, Madison, Wisconsin, USA) using the default scoring matrix with a gap creation penalty of 50 and a gap extension penalty of 3. Sequences are indicated as "essentially similar" when such sequence have a sequence identity of at least about 75%, particularly at least about 80 %, more particularly at least about 85%, quite particularly about 90%, especially about 95%, more especially about 100%, quite especially are identical.
The term "expression" or "gene expression" means the transcription of a specific gene or specific genes or specific genetic construct. The term "expression" or "gene expression" in particular means the transcription of a gene or genes or genetic construct into structural RNA (rRNA, tRNA) or mRNA with or
without subsequent translation of the latter into a protein. The process includes transcription of DNA and processing of the resulting mRNA product.
The term "introduction" or "transformation" as referred to herein encompass the transfer of an exogenous polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may be transformed with a genetic construct of the present invention and a whole plant regenerated there from. The particular tissue chosen will vary depending on the clonal propagation systems available for, and best suited to, the particular species being transformed. Exemplary tissue targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), hairy roots and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The polynucleotide may be transiently or stably introduced into a host cell and may be maintained non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the host genome. The resulting transformed plant cell may then be used to regenerate a transformed plant in a manner known to persons skilled in the art.
The transfer of foreign genes into the genome of a plant is called transformation. Transformation of plant species is now a fairly routine technique. Advantageously, any of several transformation methods may be used to introduce the gene of interest into a suitable ancestor cell. The methods described for the transformation and regeneration of plants from plant tissues or plant cells may be utilized for transient or for stable transformation. Transformation methods include the use of liposomes, electroporation, chemicals that increase free DNA uptake, injection of the DNA directly into the plant, particle gun bombardment, transformation using viruses or pollen and microprojection. Methods may be selected from the calcium/polyethylene glycol method for protoplasts (Krens, F.A. et al., (1982) Nature 296, 72-74; Negrutiu I et al. (1987) Plant Mol Biol 8: 363- 373); electroporation of protoplasts (Shillito R.D. et al. (1985) Bio/Technol 3, 1099-1 102); microinjection into plant material (Crossway A et al., (1986) Mol. Gen Genet 202: 179-185); DNA or RNA-coated particle bombardment (Klein TM et al., (1987) Nature 327: 70) infection with (non-integrative) viruses and the like. Transgenic plants, including transgenic crop plants, are preferably produced via Agrobacterium-mediated transformation. An advantageous transformation method is the transformation in planta. To this end, it is possible, for example, to allow the agrobacteria to act on plant seeds or to inoculate the plant meristem with agrobacteria. It has proved particularly expedient in accordance with the invention to allow a suspension of transformed agrobacteria to act on the intact plant or at least on the flower primordia. The plant is subsequently grown on until the seeds of the treated plant are obtained (Clough and Bent, Plant J. (1998) 16, 735-743). Methods for Agrobacterium-mediated transformation of rice include well known methods
for rice transformation, such as those described in any of the following: European patent application EP1198985, Aldemita and Hodges (Planta 199: 612-617, 1996); Chan et al. (Plant Mol Biol 22 (3): 491 - 506, 1993), Hiei et al. (Plant J 6 (2): 271 -282, 1994), which disclosures are incorporated by reference herein as if fully set forth. In the case of corn transformation, the preferred method is as described in either Ishida et al. (Nat. Biotechnol 14(6): 745-50, 1996) or Frame et al. (Plant Physiol 129(1): 13-22, 2002), which disclosures are incorporated by reference herein as if fully set forth. Said methods are further described by way of example in B. Jenes et al., Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1 , Engineering and Utilization, eds. S.D. Kung and R. Wu, Academic Press (1993) 128-143 and in Potrykus Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991) 205-225). The nucleic acids or the construct to be expressed is preferably cloned into a vector, which is suitable for transforming Agrobacterium tumefaciens, for example pBinl9 (Bevan et al (1984) Nucl. Acids Res. 12-8711). Agrobacteria transformed by such a vector can then be used in known manner for the transformation of plants, such as plants used as a model, like Arabidopsis (Arabidopsis thaliana is within the scope of the present invention not considered as a crop plant), or crop plants such as, by way of example, tobacco plants, for example by immersing bruised leaves or chopped leaves in an agrobacterial solution and then culturing them in suitable media. The transformation of plants by means of Agro bacterium tumefaciens is described, for example, by Hofgen and Willmitzer in Nucl. Acid Res. (1988) 16, 9877 or is known inter alia from F.F. White, Vectors for Gene Transfer in Higher Plants; in Transgenic Plants, Vol. 1 , Engineering and Utilization, eds. S.D. Kung and R. Wu, Academic Press, 1993, pp. 15-38.
In addition to the transformation of somatic cells, which then have to be regenerated into intact plants, it is also possible to transform the cells of plant meristems and in particular those cells which develop into gametes. In this case, the transformed gametes follow the natural plant development, giving rise to transgenic plants. Thus, for example, seeds of Arabidopsis are treated with agrobacteria and seeds are obtained from the developing plants of which a certain proportion is transformed and thus transgenic [Feldman, KA and Marks MD (1987). Mol Gen Genet 208:1 -9; Feldmann K (1992). In: C Koncz, N-H Chua and J Shell, eds, Methods in Arabidopsis Research. Word Scientific, Singapore, pp. 274-289], Alternative methods are based on the repeated removal of the inflorescences and incubation of the excision site in the center of the rosette with transformed agrobacteria, whereby transformed seeds can likewise be obtained at a later point in time (Chang (1994). Plant J. 5: 551 -558; Katavic (1994). Mol Gen Genet, 245: 363-370). However, an especially effective method is the vacuum infiltration method with its modifications such as the "floral dip" method. In the case of vacuum infiltration of Arabidopsis, intact plants under reduced pressure are treated with an agrobacterial suspension [Bechthold, N (1993). CR Acad Sci Paris Life Sci, 316: 1 194-1 199], while in the case of the "floral dip" method the developing floral tissue is incubated briefly with a surfactant-treated agrobacterial suspension [Clough, SJ and Bent AF
(1998) The Plant J. 16, 735-743], A certain proportion of transgenic seeds are harvested in both cases, and these seeds can be distinguished from non-transgenic seeds by growing under the above-described selective conditions. In addition the stable transformation of plastids is of advantages because plastids are inherited maternally is most crops reducing or eliminating the risk of transgene flow through pollen. The transformation of the chloroplast genome is generally achieved by a process which has been schematically displayed in Klaus et al., 2004 [Nature Biotechnology 22 (2), 225-229], Briefly the sequences to be transformed are cloned together with a selectable marker gene between flanking sequences homologous to the chloroplast genome. These homologous flanking sequences direct site specific integration into the plastome. Plastidal transformation has been described for many different plant species and an overview is given in Bock (2001) Transgenic plastids in basic research and plant biotechnology. J Mol Biol. 2001 Sep 21; 312 (3):425-38 or Maliga, P (2003) Progress towards commercialization of plastid transformation technology. Trends Biotechnol. 21 , 20-28. Further biotechnological progress has recently been reported in form of marker free plastid transformants, which can be produced by a transient co-integrated maker gene (Klaus et al., 2004, Nature Biotechnology 22(2), 225-229).
The genetically modified plant cells can be regenerated via all methods with which the skilled worker is familiar. Suitable methods can be found in the abovementioned publications by S.D. Kung and R. Wu, Potrykus or Hofgen and Willmitzer.
Generally after transformation, plant cells or cell groupings are selected for the presence of one or more markers which are encoded by plant-expressible genes co-transferred with the gene of interest, following which the transformed material is regenerated into a whole plant. To select transformed plants, the plant material obtained in the transformation is, as a rule, subjected to selective conditions so that transformed plants can be distinguished from untransformed plants. For example, the seeds obtained in the above-described manner can be planted and, after an initial growing period, subjected to a suitable selection by spraying. A further possibility consists in growing the seeds, if appropriate after sterilization, on agar plates using a suitable selection agent so that only the transformed seeds can grow into plants. Alternatively, the transformed plants are screened for the presence of a selectable marker such as the ones described above.
Following DNA transfer and regeneration, putatively transformed plants may also be evaluated, for instance using Southern analysis, for the presence of the gene of interest, copy number and/or genomic organisation. Alternatively or additionally, expression levels of the newly introduced DNA may be
monitored using Northern and/or Western analysis, both techniques being well known to persons having ordinary skill in the art.
The generated transformed plants may be propagated by a variety of means, such as by clonal propagation or classical breeding techniques. For example, a first generation (or Tl) transformed plant may be selfed and homozygous second-generation (or T2) transformants selected, and the T2 plants may then further be propagated through classical breeding techniques. The generated transformed organisms may take a variety of forms. For example, they may be chimeras of transformed cells and nontransformed cells; clonal transformants (e.g., all cells transformed to contain the expression cassette); grafts of transformed and untransformed tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion).
The term "plant" as used herein encompasses whole monocotyledonous and dicotyledonous plants, ancestors and progeny of the plants and plant parts, including seeds, shoots, stems, leaves, roots (including tubers), flowers, and tissues and organs, wherein each of the aforementioned comprise the gene/nucleic acid of interest. The term "plant" also encompasses monocotyledonous and dicotyledonous plant cells, suspension cultures, callus tissue, hairy roots, embryos, meristematic regions, gametophytes, pollen and microspores, again wherein each of the aforementioned comprises the gene/nucleic acid of interest.
The term "expression cassette" refers to any recombinant expression system for the purpose of expressing a nucleic acid sequence of the invention in vitro or in vivo, constitutively or inducibly, in any cell, including, in addition to plant cells, prokaryotic, yeast, fungal, insect or mammalian cells. The term includes linear and circular expression systems. The term includes all vectors. The cassettes can remain episomal or integrate into the host cell genome. The expression cassettes can have the ability to selfreplicate or not (i.e., drive only transient expression in a cell). The term includes recombinant expression cassettes that contain only the minimum elements needed for transcription of the recombinant nucleic acid.
The following non-limiting Examples describe methods and means according to the invention. Unless stated otherwise in the Examples, all techniques are carried out according to protocols standard in the art. The following examples are included to illustrate embodiments of the invention. Those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while
the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.
Examples and materials and methods
Example 1: Identification of novel enzymes involved in saponin biosynthesis in Saponaria officinalis
To identify the genes encoding the enzymes involved in the biosynthesis of triterpene saponins in Saponaria officinalis (soapwort), co-expression analysis was conducted on S. officinalis transcriptome datasets produced in-house and comprising different organs, sampled in response to hormonal treatments at multiple time points. More specifically, leaves, stems and roots from hydroponically grown S. officinalis plants were sampled in triplicate at six and twenty-four hours after mock or 50 pM Methyl Jasmonate treatment. S. officinalis seeds were sourced from the seed company Jelitto (https://www.ielitto.com/, Schwarmstedt, Germany).
RNA sequencing datasets were used to draft an S. officinalis reference transcriptome as well as to conduct gene expression analysis. To this end, RNA was extracted from the sampled S. officinalis tissues using the ReliaPrep RNA miniprep Systems (Promega™), following manufacturer's instructions for fibrous tissues. RNA was single-end sequenced via Illumina HiSeq 6000 with single-end read lengths of 100 bp for gene expression profiling. RNA was also used as template for cDNA synthesis using qScript cDNA SuperMix (QuantaBio™) and for the cloning of candidate genes. The S. officinalis draft transcriptome was produced using public datasets available through the 1KP project [1] and derived from HiSeq paired-end sequencing of flowers, fruits, stems and leaves that were therefore assembled by CLC genomics workbench using default parameters. Transcripts levels were estimated by mapping single-end reads on the S. officinalis transcriptome using Salmon [2] implemented on a Galaxy pipeline. Next, transcripts that were lowly expressed or with zero expression variance across samples were removed and the remaining genes were clustered using Self Organizing Map (SOM) implemented and visualized in R using the Kohonen-package [3] to assemble clusters of transcripts characterized by similar expression trends. The expression trends and transcript abundances for the genes considered in this study that will be discussed below are reported in Figure 1.
1.1 P-amyrin synthase
Gene functional annotation on the S. officinalis transcriptome allowed identification, by sequence homology with previously characterized enzymes from other plants, of a candidate p-amyrin synthase (BAS, hereafter referred to as SoBAS_1421, see SEQ ID NO: 1 for the coding sequence) that was therefore selected as the main bait for co-expression analysis. BAS is the key enzyme that commits linear triterpene precursors towards biosynthesis of oleanane-type pentacyclic triterpenes via cyclization of 2,3-
oxidosqualene. The SoBAS_1421 full coding sequence and predicted protein are reported as SEQ ID NOs: 1 and 2.
1.2 P-amyrin oxidases
Four full length-length transcripts corresponding to cytochrome P450s (P450s) were identified among the genes co-expressed with SoBAS_1421. P450s proteins belong to a broad superfamily of enzymes catalyzing oxidation of various substrates. Phylogenetic analysis was performed to infer the function of the four candidate P450s from S. officinalis based on the comparison of protein sequences with P450s from other plant species reported to be involved in the oxidation of triterpenes.
The first P450 (hereafter referred to as SoP450_1944, see SEQ ID NO: 3 for the coding sequence) showed phylogenetic relations with plant P450s belonging to the CYP716 family that had been characterized as P-amyrin C-28 oxidases, suggesting that SoP450_1944 may catalyze the same reaction in S. officinalis cells. The full coding sequence and predicted protein sequences for SoP450_1944 are reported as SEQ ID NOs: 3 and 4.
The second P450 (hereafter referred to as SoP450_1049, see SEQ ID NO: 7 for the coding sequence) showed sequence similarities with CYP716A141 from Panax ginseng, the latter being the only example reported so far of an enzyme capable of oxidizing the C-28 of position of -amyrin and the C-16 p position of oleanolic acid. The full coding sequence and predicted sequences for SoP450_1049 are reported as SEQ ID NOs: 5 and 6.
The third P450 identified (hereafter referred to as SoP450_2010, see SEQ ID NO: 5 for the coding sequence) was found to be neighboring P450s involved in oxidation of position C23 of pentacyclic triterpenes. The full nucleotide and predicted protein sequences of SoP450_2010 are reported as SEQ ID NOs: 7 and 8.
Also the fourth P450 identified (hereafter referred to as SoP450_6085, see SEQ ID NO: 9 for the coding sequence) was found to be neighboring P450s involved in oxidation of position C23 of pentacyclic triterpenes. The full nucleotide and predicted protein sequences of SoP450_6085 are reported as SEQ ID NOs: 9 and 10.
1.3 Accessory enzymes
The UDP-glycosyltransferase (UGT) encoding gene SoUGT_2488 (SEQ ID NOs: 17 and 18) was identified in S. officinalis by sequence homology with previously reported UGTs from other plant species [4] as a gene encoding for an enzyme involved in transferring the first UDP-sugar (UDP-Glucuronic Acid) on to the C-3 position of quillaic acid. Therefore it may play a role in committing flux of aglycones towards the
biosynthesis of highly-glycosylated oleanane-type saponins. SotHMGR (SEQ ID NOs: 19 and 20) was instead identified and cloned as truncated feedback-insensitive version of 3-hydroxy-3-methylglutaryl coenzyme A reductase (tHMGR), an enzyme known to catalyze the rate-limiting step in the biosynthesis of triterpene precursors [5], Lastly the gene SoSQE2 (SEQ ID Nos: 21 and 22) was identified as squalene epoxidase, an enzyme known to catalyze the stereospecific conversion of squalene to 2,3(S)- oxidosqualene, a key precursor in triterpenoid saponins. Therefore, co-expression of SoSQE2 together with other triterpene biosynthetic enzymes can boost production of precursors and ultimately triterpenoids yield as previously reported (Dong L. et al (2018) Metab Eng. 49:1-12).
Example 2: Cloning candidate genes from S. officinalis and generation of expression constructs for functional analysis
Each of the seven target genes described above was amplified by PCR using specific primer sets (see Table 1). Each of the primers included attB adapter sequences at the 5' end to allow directional cloning in appropriate Gateway® vectors (indicated in red in Table 1).
Two PCR reactions were performed on each gene using either leaf or root cDNA as a template. PCR reactions were set up in a total volume of 25 pl using Q5 polymerase (New England Biolabs) according to the manufacturer's instructions. For amplification of SoBAS_1421 and SoP450_1049, PCR thermal cycling involved initial denaturation at 98°C (30 sec) followed by 3 cycles of denaturation (98°C, 10 sec) annealing (61°C, 20 sec), extension (72°C, 1 min 10 sec), followed by 30 cycles using the same conditions but increasing the annealing temperature to 70° C, with a final extension at 72° C (5 min). To amplify
SoP450_2010 and SoP450_1944, the amplification protocol was slightly modified involving initial denaturation at 98°C (30 sec) followed by 10 cycles of denaturation (98°C, 10 sec), annealing (52°C, 20 sec), and extension (72°C, 1 min 30 sec), then followed by 25 cycles using the same conditions but increasing the annealing temperature to 60° C, with a final extension at 72° C (5 min). The same conditions were used to amplify SoUGT_2488 and SotHMGR. ForSoP450_6085 thermal cycling was set up to include 98°C (30 sec) followed by 35 cycles of denaturation (98°C, 10 sec), annealing (60°C, 20 sec), extension (72°C, 40 sec), and a final extension at 72° C (5 min).
All genes were successfully amplified from both mock-treated leaf and root cDNA except for SoP450_6085 and SotHMGR, which were cloned from mock-treated root or whole-plantlet cDNA and cDNA derived from JA-treated roots, respectively (Fig. 2). For each gene, the PCR product obtained from roots was purified and recombined into Gateway entry vector pDONR221 according to the manufacturer's protocols. The resulting plasmids were transformed into E.coli DH5a, extracted and sequenced to verify presence of correct inserts. Afterwards, each gene in entry clones was further recombined into the appropriate destination vectors via LR cloning.
Example 3: Functional analysis of candidate enzymes by expression in yeast
For heterologous expression in yeast, five different expression vectors were selected that (i) allow inducible transgene expression upon switching from glucose to galactose as the carbon source in the culture medium and (ii) carry different auxotrophic markers. Therefore, SoBAS_1421 was cloned into pESC (uracil selection). This vector was modified to be used in gateway recombination systems, as described previously [6], The same plasmid also contained a feedback-insensitive truncated version of 3-hydroxy-3-methylglutaryl coenzyme A reductase 1 (tHMGRl) from Medicago truncatula. tH MGR genes have been extensively reported to increase triterpene production upon transient or stable expression in heterologous as well as homologous systems [5], SoP450_1944, SoP450_2010 and SoP450_1049 were cloned into the Gateway-compatible yeast vectors pAG424GAL (tryptophan selection), pAG425GAL (leucine selection) and pAG427GAL (methionine selection), respectively. SoP450_6085 was cloned in all three different vectors (pAG424GAL, pAG425GAL and pAG427GAL) to allow combination with each of the others P450s. Finally, in order to increase the catalytic efficiency of the P450s, a partner reductase enzyme known as Cytochrome P450 reductase (MtMTRl - MTR_3gl00160, SEQ. ID NOs: 15 and 16) from M. truncatula was cloned into pAG423GAL (histidine selection). All vectors contain galactose-inducible promoters driving the expression of inserted genes.
All these plasmids were transformed in S. cerevisiae to assess the function of each gene individually or in different combinations, as described in table 2. The yeast strain used in this study was derived from BY4272 (genotype: MATa; his3Al; leu2A0; ura3A0; lys2A0; trplAO; metl5A0; PAHl-Ob; Perg7::PMET3-
ERG7), which contains five auxotrophic selection markers (-URA/-HIS/-LEU/-MET/-TRP) and therefore allows expression of genes from up to five plasmids.
The transformed yeast strains were selected on solid synthetic yeast media with appropriate supplements. Selected yeast strains were cultured in synthetic liquid medium with Galactose as the only carbon source and incubated for 7 days at 30°C. Methyl-p-cyclodextrin were added to the liquid medium at day two and day four during cultivation to reach a final concentration of 10 mM. Methyl-P- cyclodextrins are cyclic oligosaccharides that are able to sequester apolar triterpenes from yeast cells into the liquid medium thus avoiding possible toxicity and feedback inhibition by pathway intermediates, and thereby increase their production yield [7], Strains were pelleted by centrifugation and metabolites were extracted, separately for cell pellets and liquid culture medium, by liquid-liquid separation using n- hexane and ethyl acetate. Organic phases were collected, lyophilized by vacuum and analyzed by GC- MS.
SoBAS 1421 is a monofunctional P-amyrin synthase from S. officinalis
GC-MS analysis revealed that all strains, except the negative control, accumulated a peak at 15.48 corresponding to p-amyrin (Fig. 3 - results are shown for strain 1, 2, and 3). It is worth to mention that except strain 3 (carrying SoBAS_1421) and strain 11 (the negative control carrying empty vectors), all other strains carried GgBAS, a reported p-amyrin synthase from Glycyrrhiza glabra (SEQ ID NOs: 11 and 12). Besides p-amyrin, no side products were detected in strain 3 (expressing SoBAS_1421) thus confirming that SoBAS_1421 is a cyclase encoding for p-amyrin synthase, p-amyrin was predominantly present in the yeast pellet and at a lesser extent in the growth medium.
Discovery of C-28, C-28/C-16 and C-23 oxidases
Strain 4, 5 and 6 expressed a single P450 (either SoP450_1944, SoP450_2010 or SoP450_1049 respectively) together with tHMGRl, GgBAS and MtMTRl. Strain 4 (SoP450_1944) was found to accumulate erythrodiol (17.27 min), the C-28 alcohol of p-amyrin in the pellet (Fig. 4), whereas the medium (Fig. 5) contained oleanolic acid (18.37 min) and all its intermediates derived from C-28 p-amyrin oxidation such as erythrodiol (17.25 min) and oleanolic aldehyde (18.83 min). This confirmed that SoP450_1944 corresponds to a p-amyrin C-28 oxidase. In both pellet and medium (Fig. 4 and 5) of strain 5 (SoP450_2010) , only a peak corresponding to 23-hydroxy-p-amyrin (16.66 min) was detected, confirming this enzyme as a C-23 oxidase. Differently, strain 13 (expressing SoP450_6085) was found to produce only p-amyrin showing that this C-23 oxidase (SoP450_6085) is not able to function on such compound suggesting that perhaps it accepts only substrates with a higher degree of oxidation (Fig. 5). Strain 6 (expressing SoP450_1049) didn't produce any additional peak in the pellet (Fig. 4), whereas in the medium (Fig. 6) erythrodiol (17.25 min), oleanolic acid (18.37 min) and echinocystic acid (19.00 min) were detected. The latter compound corresponds to 16-hydroxy-oleanolic acid and therefore leads to the identification of SoP450_1049 as an enzyme with a dual function as a C-28/C-16 oxidase.
Concomitant expression of C28-oxidase and C-23 oxidase
When the two enzymes SoP450_1944 and SoP450_2010 were combined in strain 7 , no additional products were detected in the cell pellet (Fig. 7), though the levels of erythrodiol (17.27 min) and 23- hydroxy-p-amyrin (16.66 min) were notably reduced. Analysis of liquid medium of strain 7 (Fig. 8) revealed that, besides the products already detected in yeast strains 4 (SoP450_1944) and 5 (SoP450_2010) expressing each of these two P450s individual ly, gypsogenin (22.53 min) and gypsogenic acid (23.44 min) were present as additional products. Both of these compounds result from a C-23 oxidation of oleanolic acid: gypsogenin is characterized by aldehyde function at position 23 while the same carbon is further oxidized to carboxy group in gypsogenic acid.
These results show that SoP450_2010 is able to accept oleanolic acid as a substrate and confirm its activity to be specifically oxidizing the carbon at position 23. Unexpectedly, strains 14 (Fig. 13) expressing the second candidate C-23 oxidase (SoP450_6085) together with the C-28 oxidase (SoP450_1944) did not produce any additional compound from those produced by strain 4 expressing SoP450_1944 solely.
Concomitant expression of C28-oxidase and C-28/C-16 oxidase
Combination of SoP450_1944 and SoP450_1049 in strain 8 did not lead to accumulation of any additional products, either in the pellet or liquid medium (Fig. 9 and 10), as compared to the yeast strains expressing the same enzymes individually. This is due to the fact that SoP450_1049 already appeared to have both C-28 and C-16 oxidase activity.
Concomitant expression of C23-oxidase and C-28/C-16 oxidase
No additional products were detected, either in the pellet (Fig. 11) or the medium (Fig. 12) of yeast strain 9 co-expressing SoP450_2010 and SoP450_1049, as compared to strains 5 (SoP450_2010) and 6 (SoP450_1049) expressing each enzyme individually. However, levels of hydroxy 23-hydroxy- p-amyrin (16.66 min), erythrodiol (17.25 min), oleanolic acid (18.37 min) and echinocystic acid (19.00 min) were found to be notably reduced. Similarly no additional peaks were identified in strain 16 (Fig. 14) by concomitant expression of SoP450_6085 and SoP450_1049, as compared to strain 6 expressing solely the latter enzyme.
Concomitant expression of three P450s enzymes
Next, all possible combinations involving three of the four P450s discussed so far were expressed in yeast thus generating strains 10, 17, 18 and 19 (see Table 1).
When the three P450 enzymes SoP450_1944, SoP450_1049 and SoP450_2010 were expressed together, in yeast strain 10, no new peaks appeared in the cell pellet (Fig. 12) although -amyrin (15.38 min) was
reduced, thus suggesting further conversion. This was confirmed by analysis of the yeast medium (Fig. 13) for the same strain, which showed a new peak (22.67 min) detected only in strain 10. Based on the observed enzyme activities of all three P450s, this peak would correspond to oxidation of 3-OH, 16-OH, 23-COOH and 28-COOH on the p-amyrin backbone, leading to a compound also called 16-hydroxy- gypsogenic acid. This was further confirmed after purification of the compound and structure elucidation by Nuclear Resonance Mass Spectrometry (data not shown).
Yeast strain 18 differed from yeast strain 10 by substitution of SoP450_2010 with another C-23 oxidase, SoP450_6085. Interestingly, and unexpectedly, this combination did not yield any new compound as compared to combination 8 (expressing SoP450_1944 and SoP450_1049) or combination 14 (expressing SoP450_1944 and SoP450_6085). Likewise, strain 17, expressing C-28 oxidase SoP450_1944 together with both C-23 oxidases (SoP450_2010 and SoP450_6085) yielded the same metabolites as strain 7 carrying SoP450_1944 and SoP450_2010 only. However, differently from strain 7 that yielded gypsogenic acid, in strain 17 unexpectedly only gypsogenin was detected thus demonstrating that concomitant expression of both C-23 oxidases would promote that oxidation of the C-23 position stops at the aldehyde moiety (gypsogenin) reducing the production of the more oxidized carboxy group (gypsogenic acid).
Production of Quillaic Acid in yeast requires an unusual combination of C-28/16 and two C-23 oxidases
The most unexpected result was obtained by metabolite profiling of strain 19 where the promiscuous C- 28/C-16 oxidase (SoP450_1049) was co-expressed with both C-23 oxidases (SoP450_2010 and SoP450_6085). This strain, beside all the products derived from C-28 and C-16 oxidation of p-amyrin (such as erythrodiol, oleanolic and echinocystic acid), was also found to synthesize 16-hydroxy- gypsogenic acid (similarly to strain 10) but differently it also accumulated quillaic acid and traces of gypsogenic acid (Fig. 15). This is to our knowledge not only the first time that quillaic acid has been heterologously produced in yeast, but it additionally also involves an unusual combination of enzymes with partially overlapping functions. Surprisingly, the quillaic acid pathway seems to require two C-23 oxidases that feed on each other products. For instance with our experiments we demonstrated that the first enzyme SoP450_1049 is sufficient for production of echinocystic acid (the C-28/C-16 oxidized form of -amyrin). Afterwards, the first C-23 oxidase (SoP450_2010) is capable of oxidizing the C-23 position into an alcohol moiety producing caulophyllogenin that is readily converted by the second C-23 enzyme (SoP450_6085) into quillaic acid (Fig. 15). As a further confirmation, the metabolite extract from yeast strain 19 was additionally analyzed by UPLC-MS analysis using a high resolution protocol that allows to distinguish more clearly between quillaic acid and gypsogenic acid (Fig. 15, second panel), confirming that the peak produced by yeast strain 19 unambiguously matches the quillaic acid standard.
Example 4: Overexpression of newly discovered enzymes in transgenic S. officinalis hairy root lines
For this example, we generated different constructs for ectopic overexpression of the candidate genes in planta, more specifically in transformed hairy roots of S. officinalis, and assess the effect thereof on the production levels of a specific oleanane type saponin, named sapofectosid (i.e. also designated as SO1861) [11], Transgenic S. officinalis hairy root lines were generated via Agrobacterium rhizogenes infection of S. officinalis adult leaves adapting a previously described protocol [12], S. officinalis hairy roots were grown for two months on liquid Murashige and Skoog medium including vitamins supplied with 1.5 % sucrose. The hairy roots were harvested and ground in liquid nitrogen into a fine powder. Total metabolite extraction and subsequent LC-MS analysis was conducted as previously reported [13],
The transgenes selected for ectopic overexpression in S. officinalis included three of the newly discovered S. officinalis genes involved in triterpene production, namely SoBAS_1421, SoP450_1049 and SoUGT_2488. The coding sequence of each gene was cloned into the Gateway-compatible binary vector pK7WG2D [8], For each transgene four independent hairy root lines were cultured, analyzed and compared with four independent control lines (ectopically expressing the metabolically non-active green fluorescent protein GFP).
LC-MS profiling data of metabolite extracts from control hairy roots, expressing only GFP, was compared to those of hairy roots expressing the transgenes of interest. This analysis revealed that overexpression of either SoBAS_1421 or SoP450_1049 caused a substantial increase in SO1861 levels (Fig. 16). Conversely, overexpression of SoUGT_2488, encoding the putative UGT that catalyzes the transfer of glucuronic acid to the C-3 position of quillaic acid did not affect the SO1861 levels. Together these data illustrate that (i) overexpression of a single pathway enzyme can promote SO1861 production, thus demonstrating the utility of single chimeric genes to engineer S. officinalis saponin biosynthesis their and (ii) that not all of the committed pathway steps are rate-limiting, thus demonstrating that the utility of a particular single chimeric gene as an engineering tool is unpredictable. A schematic representation of the biosynthetic pathway of quillaic acid in S.officinalis is reported in figure 17.
Example 5: Overexpression of newly discovered enzymes in transgenic Gypsophila elegans hairy root lines
For this example, diverse constructs were generated to achieve the ectopic overexpression of the candidate genes in planta, specifically within the transformed hairy roots of Gypsophila elegans, and to evaluate their impact on the production levels of particular oleanane type saponins sharing structural and biochemical features with the designated SO1861.
Transgenic G.elegans hairy root lines were established using Agrobacterium rhizogenes infection of G.elegans seedlings, following a previously established protocol (Pollier J. et al (2019) Plant J. 99:637- 654). These G. elegans hairy roots were cultivated for a span of two months in liquid Murashige and Skoog medium supplemented with vitamins and 1.5% sucrose. Afterward, the hairy roots were collected, frozen in liquid nitrogen, and pulverized into fine powder. The comprehensive extraction of metabolites and subsequent LC-MS analysis were carried out in accordance with previously documented procedures (Mertens J. et al (2016) Plant Physiol. 170:194-210).
The chosen transgenes for ectopic overexpression in G. elegans included each of the newly identified S. officinalis P450 genes linked to triterpene production. This involved both individual (namely SoP450_1944, SoP450_6085, SoP450_2010, SoP450_1049) and collective overexpression in a binary vector equipped with seven distinct transcriptional cassettes. These cassettes encompassed the aforementioned P450s as well as SoBAS_1421, SoSQE2, and eGFP, serving as a visual transformation marker. For this purpose, the Golden Gibson assembly method was employed for binary vector construction, enabling the incorporation of multiple transcriptional cassettes, as previously described (Aesaert S et al. 2022) Front. Plant Sci 13:883847). For the expression of individual transgenes, the Gateway-compatible binary vector pK7WG2D was utilized, as outlined in Example 4 for S. officinalis hairy roots . For each vector, three separate hairy root lines were cultivated, analyzed, and compared against three distinct control lines (expressing solely the metabolically inactive green fluorescent protein, GFP).
LC-MS profiling data of metabolite extracts from control hairy roots, expressing only GFP, was compared to those of hairy roots expressing the transgenes of interest. The effect of transgene expression was evaluated on the content of seven different saponins that were reported as representative of Gypsophila triterpenoid profile (Voutquenne-Nazabadioko L et al (2013) Phytochemistry 90:114-127) sharing substantial structural similarities with Sapofectosid. The outcome of this analysis revealed that the sole overexpression of SoP450_1049 (as illustrated in Fig. 18 - Gl_1049) or the concurrent expression of SoSQE2, SoBAS_1421, and all four Saponaria P450 genes as disclosed in this application (Fig. 18 - Gl_ALL_P450s) yielded a substantial increase in the production of a particular saponin, specifically C4_1567.681 (see Fig. 18). The latter saponin is a closely related structure to sapofectosid. While the simultaneous expression of multiple saponin pathway genes in the G. elegans hairy root line Gl_ALL_P450s appeared to enhance the production of C4_1567.681, the variations observed in comparison to hairy root lines expressing solely SoP450_1049 did not demonstrate statistical significance. Thus, the overexpression of a single gene in Gypsophila elegans is sufficient to increase the production of the sapofectosid homologue.
Sequence listing
SEQ ID NO: 1 - S. officinalis P-amyrin synthase, SoBAS_1421 coding sequence (2283 bps)
SEQ ID NO: 2 - S. officinalis SoBAS_1421 translated nucleotide sequence (760 AA)
SEQ ID NO: 3 - S. officinalis SoP450_1944, C-28 oxidase coding sequence(1458 bps)
SEQ ID NO: 4 - S. officinalis SoP450_1944 translated nucleotide sequence(485 AA)
SEQ ID NO: 5 - S. officinalis SoP450_2010, C-23 oxidase coding sequence (1568 bps)
SEQ ID NO: 6 - S. officinalis SoP450_2010, C-23 oxidase translated nucleotide sequence(522 AA)
SEQ ID NO: 7 - S. officinalis SoP450_1049, C-28/C-16 oxidase coding sequence (1509 bps)
SEQ ID NO: 8 - S. officinalis SoP450_1049, C-28/C-16 oxidase translated nucleotide sequence (502 AA)
SEQ ID NO: 9 - S. officinalis SoP450_6085, C-23 oxidase coding sequence (1374 bps)
SEQ ID NO: 10 - S. officinalis SoP450_6085, C-23 translated nucleotide sequence (457 AA)
SEQ ID NO: 11 - Glycyrrhiza glabra GgBAS, -amyrin synthase coding sequence (2298 bps)
SEQ ID NO: 12 - Glycyrrhiza glabra GgBAS, p-amyrin synthase translate nucleotide sequence (765 AA)
SEQ ID NO: 13 - Medicago truncatula MTtHMGRl, (truncated HMG-CoA reductase MTR_5g026500) (1374 bps)
SEQ ID NO: 14 - Medicago truncatula MtHMGRl (truncated HMG-CoA reductase MTR_5g026500) (457 AA)
SEQ ID NO: 15 - Medicago truncatula MtMTRl, NADPH-cytochrome P450 reductase (Medtr3gl00160) (2079 bps)
SEQ ID NO: 16 - Medicago truncatula MtMTRl, NADPH-cytochrome P450 reductase (Medtr3gl00160) (692 AA)
SEQ ID NO: 17 Saponaria officinalis SoUGT_2488, Nucleotide-diphospho-sugar transferases - Cellulose synthase (2097 bps)
SEQ ID NO: 18 Saponaria officinalis SoUGT_2488, Nucleotide-diphospho-sugar transferases - Cellulose synthase (698 AA)
SEQ ID NO: 19 Saponaria officinalis SotHMGR, hydroxymethylglutaryl-CoA reductase (NADPH) (1443 bps)
SEQ ID NO: 20 Saponaria officinalis SotHMGR, hydroxymethylglutaryl-CoA reductase (480 AA)
SEQ ID 21: SoSQE2 Saponaria officinalis, Squalene Epoxidase 2 coding sequence (1557 bps)
SEQ ID 22: SoSQE2 Saponaria officinalis, Squalene Epoxidase 2 translated nucleotide sequence (518 AA)
References
1. One Thousand Plant Transcriptomes I: One thousand plant transcriptomes and the phylogenomics of green plants. Nature 2019, 574:679-685.
2. Patro R, Duggal G, Love Ml, Irizarry RA, Kingsford C: Salmon provides fast and bias-aware quantification of transcript expression. Nat Methods 2017, 14:417-419.
3. Wehrens R, Buydens LMC: Self- and Super-organizing Maps in R: The kohonen Package. Journal of
Statistical Software 2007, 21.
4. Jozwiak A, Sonawane PD, Panda S, Garagounis C, Papadopoulou KK, Abebie B, Massalha H, Almekias-
Siegl E, Scherf T, Aharoni A: Plant terpenoid metabolism co-opts a component of the cell wall biosynthesis machinery. Nat Chem Biol 2020, 16:740-748.
5. Holmberg N, Harker M, Wallace AD, Clayton JC, Gibbard CL, Safford R: Co-expression of N-terminal truncated 3-hydroxy-3-methylglutaryl CoA reductase and C24-sterol methyltransferase type 1 in transgenic tobacco enhances carbon flux towards end-product sterols. Plant J 2003, 36:12-20.
6. Miettinen K, Poll ier J, Buyst D, Arendt P, Csuk R, Sommerwerk S, Moses T, Mertens J, Sonawane PD,
Pauwels L, et al.: The ancient CYP716 family is a major contributor to the diversification of eudicot triterpenoid biosynthesis. Nat Commun 2017, 8:14153.
7. Moses T, Pollier J, Almagro L, Buyst D, Van Montagu M, Pedreno MA, Martins JC, Thevelein JM,
Goossens A: Combinatorial biosynthesis of sapogenins and saponins in Saccharomyces cerevisiae using a C-16alpha hydroxylase from Bupleurum falcatum. Proc Natl Acad Sci U S A 2014, 111:1634-1639.
8. Karimi M, Inze D, Depicker A: GATEWAY vectors for Agrobacterium-mediated plant transformation.
Trends Plant Sci 2002, 7:193-195.
9. Lakatos L, Szittya G, Silhavy D, Burgyan J: Molecular mechanism of RNA silencing suppression mediated by pl9 protein of tombusviruses. EMBOJ 2004, 23:876-884.
10. Moses T, Pollier J, Shen Q, Soetaert S, Reed J, Erffelinck ML, Van Nieuwerburgh FC, Vanden Bossche
R, Osbourn A, Thevelein JM, et al.: OSC2 and CYP716A14v2 catalyze the biosynthesis of triterpenoids for the cuticle of aerial organs of Artemisia annua. Plant Cell 2015, 27:286-301.
11. Sama S, Jerz G, Schmieder P, Woith E, Melzig MF, Weng A: Sapofectosid - Ensuring non-toxic and effective DNA and RNA delivery. IntJ Pharm 2017, 534:195-205.
12. Ron M, Kajala K, Pauluzzi G, Wang D, Reynoso MA, Zumstein K, Garcha J, Winte S, Masson H, Inagaki
S, et al.: Hairy root transformation using Agrobacterium rhizogenes as a tool for exploring cell type-specific gene expression and function using tomato as a model. Plant Physiol 2014,
166:455-469.
tens J, Pollier J, Vanden Bossche R, Lopez-Vidriero I, Franco-Zorrilla JM, Goossens A: The bHLH Transcription Factors TSAR1 and TSAR2 Regulate Triterpene Saponin Biosynthesis in Medicago truncatula. Plant Physiol 2016, 170:194-210.
Claims
1. A eukaryotic cell comprising at least one of the following chimeric genes: a. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 2 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 2 and transcription termination and polyadenylation signals, b. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 6 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 6 and transcription termination and polyadenylation signals, c. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 8 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 8 and transcription termination and polyadenylation signals, d. a promoter operably linked to a nucleotide sequence encoding SEQ ID NO: 10 or a nucleotide sequence encoding a sequence with 95% identity over the total length of SEQ ID NO: 10 and transcription termination and polyadenylation signals.
2. A eukaryotic cell according to claim 1 further comprising a chimeric gene encoding a feedbackinsensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase.
3. A eukaryotic cell according to claims 1 or 2 further comprising a chimeric gene encoding a NADPH-cytochrome P450 reductase.
4. A eukaryotic cell according to claim 1 which is a yeast cell and wherein said yeast cell comprises the chimeric genes a), b), c) and d) as cited in claim 1.
5. A eukaryotic cell according to claim 2 which is a yeast cell and wherein said yeast cell comprises the chimeric genes a), b), c) and d) as cited in claim 1.
6. A eukaryotic cell according to claim 3 which is a yeast cell and wherein said yeast cell comprises the chimeric genes a), b), c) and d) as cited in claim 1.
7. A eukaryotic cell according to claims 1 or 2 which is a plant cell.
8. A eukaryotic cell according to claim 7 which is a plant cell from the genus Saponaria or Gypsophila.
9. A plant cell from the genus Saponaria or Gypsophila which comprises a chimeric gene selected from a) or b) as cited in claim 1.
10. A method to produce quillaic acid in yeast comprising introducing the chimeric genes a), b), c) and d) as cited in claim 1 in said yeast, fermenting said transformed yeast and isolating quillaic acid from said fermented yeast.
11. A method according to claim 10 wherein the yeast cell further comprises a chimeric gene encoding a feedback-insensitive 3-hydroxy-3-methylglutaryl coenzyme A reductase.
A method according to claim 11 wherein the yeast cell further comprises a chimeric gene encoding a NADPH-cytochrome P450 reductase. A method to produce sapofectosid in a Saponaria or Gypsophila species comprising introducing the chimeric gene a) or b) as cited in claim 1 into hairy roots of a Saponaria or Gypsophila species, growing said transformed hairy roots and isolating sapofectosid from said grown transformed hairy roots.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP22193481 | 2022-09-01 | ||
EP22193481.3 | 2022-09-01 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2024047057A1 true WO2024047057A1 (en) | 2024-03-07 |
Family
ID=83692828
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2023/073700 WO2024047057A1 (en) | 2022-09-01 | 2023-08-29 | Means and methods to produce triterpene saponins in eukaryotic cells |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2024047057A1 (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1198985A1 (en) | 1999-07-22 | 2002-04-24 | Japan as represented by Dir. Gen. of National Inst. of Agrobiological Resources,Ministry of Agriculture, Forestry and Fisherie | Method for superrapid transformation of monocotyledon |
WO2013167751A1 (en) * | 2012-05-11 | 2013-11-14 | Vib Vzw | Triterpenoid sapogenin production in plant and microbial cultures |
WO2019122259A1 (en) | 2017-12-21 | 2019-06-27 | Plant Bioscience Limited | Metabolic engineering |
WO2020049572A1 (en) * | 2018-09-06 | 2020-03-12 | Yeda Research And Development Co. Ltd. | Cellulose-synthase-like enzymes and uses thereof |
WO2020263524A1 (en) * | 2019-06-25 | 2020-12-30 | The Regents Of The University Of California | Triterpene production |
WO2022136563A2 (en) * | 2020-12-24 | 2022-06-30 | Plant Bioscience Limited | Methods and compositions |
-
2023
- 2023-08-29 WO PCT/EP2023/073700 patent/WO2024047057A1/en unknown
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1198985A1 (en) | 1999-07-22 | 2002-04-24 | Japan as represented by Dir. Gen. of National Inst. of Agrobiological Resources,Ministry of Agriculture, Forestry and Fisherie | Method for superrapid transformation of monocotyledon |
WO2013167751A1 (en) * | 2012-05-11 | 2013-11-14 | Vib Vzw | Triterpenoid sapogenin production in plant and microbial cultures |
WO2019122259A1 (en) | 2017-12-21 | 2019-06-27 | Plant Bioscience Limited | Metabolic engineering |
WO2020049572A1 (en) * | 2018-09-06 | 2020-03-12 | Yeda Research And Development Co. Ltd. | Cellulose-synthase-like enzymes and uses thereof |
WO2020263524A1 (en) * | 2019-06-25 | 2020-12-30 | The Regents Of The University Of California | Triterpene production |
WO2022136563A2 (en) * | 2020-12-24 | 2022-06-30 | Plant Bioscience Limited | Methods and compositions |
Non-Patent Citations (50)
Title |
---|
"One Thousand Plant Transcriptomes I: One thousand plant transcriptomes and the phylogenomics of green plants", NATURE, vol. 574, 2019, pages 679 - 685 |
AESAERT 5 ET AL., FRONT. PLANT SCI, vol. 13, 2022, pages 883847 |
ALDEMITAHODGES, PLANTA, vol. 199, 1996, pages 612 - 617 |
B. JENES ET AL.: "Engineering and Utilization", vol. 1, 1993, ACADEMIC PRESS, article "Vectors for Gene Transfer in Higher Plants; in Transgenic Plants", pages: 128 - 143 |
BECHTHOLD, N, CR ACAD SCI PARIS LIFE SCI, vol. 316, 1993, pages 1194 - 1199 |
BEVAN ET AL., NUCL. ACIDS RES, 1984, pages 12 - 8711 |
BOCK: "Transgenic plastids in basic research and plant biotechnology", J MOL BIOL., vol. 312, no. 3, 21 September 2001 (2001-09-21), pages 425 - 38, XP002206098, DOI: 10.1006/jmbi.2001.4960 |
CADDICK ET AL., NATURE BIOTECHNOLOGY, vol. 16, 1998, pages 177 - 180 |
CHAN ET AL., PLANT MOL BIOL, vol. 22, no. 3, 1993, pages 491 - 506 |
CLOUGH, SJBENT AF, THE PLANT J., vol. 16, 1998, pages 735 - 743 |
CLOUGHBENT, PLANT J., vol. 16, 1998, pages 735 - 743 |
CROSSWAY A ET AL., MOL. GEN GENET, vol. 202, 1986, pages 179 - 185 |
D. MEESAPYODSUK ET AL: "Saponin Biosynthesis in Saponaria vaccaria. cDNAs Encoding beta-Amyrin Synthase and a Triterpene Carboxylic Acid Glucosyltransferase", PLANT PHYSIOLOGY, vol. 143, no. 2, 22 December 2006 (2006-12-22), Rockville, Md, USA, pages 959 - 969, XP055559076, ISSN: 0032-0889, DOI: 10.1104/pp.106.088484 * |
DA SILVA MAGEDANS YVE VERÔNICA ET AL: "Production of plant bioactive triterpenoid saponins: from metabolites to genes and back", PHYTOCHEMISTRY REVIEWS, vol. 20, no. 2, 21 September 2020 (2020-09-21), pages 461 - 482, XP037418359, ISSN: 1568-7767, DOI: 10.1007/S11101-020-09722-4 * |
DONG L ET AL., METAB ENG., vol. 49, 2018, pages 1 - 12 |
FELDMAN, KAMARKS MD, MOL GEN GENET, vol. 208, 1987, pages 1 - 9 |
FELDMANN K: "Methods in Arabidopsis Research", 1992, WORD SCIENTIFIC, pages: 274 - 289 |
FRAME ET AL., PLANT PHYSIOL, vol. 129, no. 1, 2002, pages 13 - 22 |
HIEI ET AL., PLANT J, vol. 5, no. 2, 1994, pages 551 - 558 |
HOFGENWILLMITZER, NUCL. ACID RES., vol. 16, 1988, pages 9877 |
HOLMBERG NHARKER MWALLACE ADCLAYTON JCGIBBARD CLSAFFORD R: "Co-expression of N-terminal truncated 3-hydroxy-3-methylglutaryl CoA reductase and C24-sterol methyltransferase type 1 in transgenic tobacco enhances carbon flux towards end-product sterols", PLANT J, vol. 36, 2003, pages 12 - 20 |
ISHIDA ET AL., NAT. BIOTECHNOL, vol. 14, no. 6, 1996, pages 745 - 50 |
JOZWIAK ASONAWANE PDPANDA SGARAGOUNIS CPAPADOPOULOU KKABEBIE BMASSALHA HALMEKIAS-SIEGL ESCHERF TAHARONI A: "Plant terpenoid metabolism co-opts a component of the cell wall biosynthesis machinery", NOT CHEM BIOL, vol. 16, 2020, pages 740 - 748, XP037175980, DOI: 10.1038/s41589-020-0541-x |
KARIMI M, INZE D, DEPICKER A: "GATEWAY vectors for Agrobacterium-mediated plant transformation. ", TRENDS PLANT SCI, vol. 7, 2002, pages 193 - 195, XP002714178, DOI: 10.1016/S1360-1385(02)02251-3 |
KATAVIC, MOL GEN GENET, vol. 245, 1994, pages 363 - 370 |
KENSIL CR ET AL.: "Vaccine Design", 1995, pages: 525 - 541 |
KLAUS ET AL., NATURE BIOTECHNOLOGY, vol. 22, no. 2, 2004, pages 225 - 229 |
KLEIN TM ET AL., NATURE, vol. 327, 1987, pages 70 |
KRENS, F.A ET AL., NATURE, vol. 296, 1982, pages 72 - 74 |
LAKATOS LSZITTYA GSILHAVY DBURGYAN J: "Molecular mechanism of RNA silencing suppression mediated by p19 protein of tombusviruses", EMBOJ, vol. 23, 2004, pages 876 - 884 |
MALIGA, P: "Progress towards commercialization of plastid transformation technology", TRENDS BIOTECHNOL, vol. 21, 2003, pages 20 - 28, XP004397633, DOI: 10.1016/S0167-7799(02)00007-0 |
MERTENS J. ET AL., PLANT PHYSIOL., vol. 170, 2016, pages 194 - 210 |
MERTENS JPOLLIER JVANDEN BOSSCHE RLOPEZ-VIDRIERO IFRANCO-ZORRILLA JMGOOSSENS A: "The bHLH Transcription Factors TSAR1 and TSAR2 Regulate Triterpene Saponin Biosynthesis in Medicago truncatula", PLANT PHYSIOL, vol. 170, 2016, pages 194 - 210 |
MIETTINEN KPOLLIER JBUYST DARENDT PCSUK RSOMMERWERK SMOSES TMERTENS JSONAWANE PDPAUWELS L ET AL.: "The ancient CYP716 family is a major contributor to the diversification of eudicot triterpenoid biosynthesis", NOT COMMUN, vol. 8, 2017, pages 14153 |
MOSES TPOLLIER JALMAGRO LBUYST DVAN MONTAGU MPEDRENO MAMARTINS JCTHEVELEIN JMGOOSSENS A: "Combinatorial biosynthesis of sapogenins and saponins in Saccharomyces cerevisiae using a C-16alpha hydroxylase from Bupleurum falcatum", PROC NATL ACAD SCI U 5 A, vol. 111, 2014, pages 1634 - 1639, XP055560681, DOI: 10.1073/pnas.1323369111 |
MOSES TPOLLIER JSHEN QSOETAERT SREED JERFFELINCK MLVAN NIEUWERBURGH FCVANDEN BOSSCHE ROSBOURN ATHEVELEIN JM ET AL.: "OSC2 and CYP716A14v2 catalyze the biosynthesis of triterpenoids for the cuticle of aerial organs of Artemisia annua", PLANT CELL, vol. 27, 2015, pages 286 - 301 |
NEEDLEMANWUNSCH, J MOL BIOL., vol. 48, 1970, pages 443 - 453 |
NEGRUTIU I ET AL., PLANT MOL BIOL, vol. 8, 1987, pages 363 - 373 |
PATRO RDUGGAL GLOVE MLIRIZARRY RAKINGSFORD C: "Salmon provides fast and bias-aware quantification of transcript expression", NOT METHODS, vol. 14, 2017, pages 417 - 419 |
POLLIER J. ET AL., PLANT J, vol. 99, 2019, pages 637 - 654 |
POTRYKUS, ANNU. REV. PLANT PHYSIOL. PLANT MOLEC. BIOL, vol. 42, 1991, pages 205 - 225 |
RON MKAJALA KPAULUZZI GWANG DREYNOSO MAZUMSTEIN KGARCHA JWINTE SMASSON HINAGAKI S ET AL.: "Hairy root transformation using Agrobacterium rhizogenes as a tool for exploring cell type-specific gene expression and function using tomato as a model", PLANT PHYSIOL, vol. 166, 2014, pages 455 - 469, XP055384283, DOI: 10.1104/pp.114.239392 |
SAMA S ET AL., INT. J. OF PHARMACEUTICS, vol. 534, 2017, pages 195 - 205 |
SAMA SJERZ GSCHMIEDER PWOITH EMELZIG MFWENG A: "Sapofectosid - Ensuring non-toxic and effective DNA and RNA delivery", INTJ PHARM, vol. 534, 2017, pages 195 - 205, XP085280601, DOI: 10.1016/j.ijpharm.2017.10.016 |
SHILLITO R.D ET AL., BIO/TECHNOL, vol. 3, 1985, pages 1099 - 1102 |
TRIBBLE ET AL., J. BIOL. CHEM., vol. 275, 2000, pages 22255 - 22267 |
VELMURUGAN ET AL., J. CELL BIOL., vol. 149, 2000, pages 553 - 566 |
VOUTQUENNE-NAZABADIOKO L ET AL., PHYTOCHEMISTRY, vol. 90, 2013, pages 114 - 127 |
WEHRENS RBUYDENS LMC: "Self- and Super-organizing Maps in R: The kohonen Package", JOURNAL OF STATISTICAL SOFTWARE, 2007, pages 21 |
YU HANWEN ET AL: "Transcriptome analysis identifies putative genes involved in triterpenoid biosynthesis in Platycodon grandiflorus", PLANTA, SPRINGER BERLIN HEIDELBERG, BERLIN/HEIDELBERG, vol. 254, no. 2, 21 July 2021 (2021-07-21), XP037514866, ISSN: 0032-0935, [retrieved on 20210721], DOI: 10.1007/S00425-021-03677-2 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20140148622A1 (en) | Engineering Plants to Produce Farnesene and Other Terpenoids | |
KR20150085846A (en) | Tal-Mediated Transfer DNA Insertion | |
EP0977866B1 (en) | Fatty acid modifying enzymes from developing seeds of vernonia galamenensis | |
JP2021510495A (en) | Metabolic engineering | |
US20240124899A1 (en) | Methods for production of novel diterpene scaffolds | |
US20020076775A1 (en) | WRKY transcription factors and methods of use | |
US7943753B2 (en) | Auxin transport proteins | |
KR101465692B1 (en) | Method for producing transgenic plant with regulated anthocyanin biosynthesis using AtbHLH113 gene and the plant thereof | |
CN104829699B (en) | A kind of and plant adversity resistance related protein Gshdz4 and its encoding gene and application | |
EP1849868A2 (en) | Plant defensins | |
WO2004067723A2 (en) | Floral development genes | |
WO2024047057A1 (en) | Means and methods to produce triterpene saponins in eukaryotic cells | |
CN114539373B (en) | IbPIF1 related to sweet potato stem nematode resistance as well as encoding gene and application thereof | |
US6864077B1 (en) | Membrane-bound desaturases | |
DE10047286B4 (en) | Isomalt producing transgenic plant | |
CN110317826B (en) | Application of substance for regulating content or activity of PvGRF9 in regulation of plant stem growth and development | |
CN112538107B (en) | Triterpene related protein, encoding gene thereof and application of triterpene related protein in improving content of triterpene compounds in plants | |
Mishra et al. | The multifaceted roles of R2R3 transcription factor HlMYB7 in the regulation of flavonoid and bitter acids biosynthesis, development and biotic stress tolerance in hop (Humulus lupulus L.) | |
EP1177307B1 (en) | Disease resistance factors | |
CN111285927B (en) | Plant stress tolerance related protein SiWRKY78 and coding gene and application thereof | |
US20220042030A1 (en) | A method to improve the agronomic characteristics of plants | |
CN111205355B (en) | Plant stress tolerance related protein SiWRKY76 and coding gene and application thereof | |
US20040010822A1 (en) | Hydroperoxyde lyases | |
WO2016120657A1 (en) | Polynucleotide associated with ergosterol biosynthesis and uses thereof | |
Do | Genetic Engineering of Sorghum and Switchgrass for Improved Biofuel Production |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 23762454 Country of ref document: EP Kind code of ref document: A1 |