CN114410604B - Epoxide hydrolase, its encoding gene, and use thereof - Google Patents
Epoxide hydrolase, its encoding gene, and use thereof
- Publication number
- CN114410604B (application CN202011172801.8A)
- Authority
- CN
- China
- Prior art keywords
- astd
- cyclooxygenase
- leu
- ala
- compound
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 24
- 102000005486 Epoxide hydrolase Human genes 0.000 title claims description 11
- 108020002908 Epoxide hydrolase Proteins 0.000 title claims description 11
- 108090000459 Prostaglandin-endoperoxide synthases Proteins 0.000 claims abstract description 45
- 102000004005 Prostaglandin-endoperoxide synthases Human genes 0.000 claims abstract description 43
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 36
- 238000006462 rearrangement reaction Methods 0.000 claims abstract description 18
- 238000010475 Pinacol rearrangement reaction Methods 0.000 claims abstract description 10
- 210000005056 cell body Anatomy 0.000 claims abstract description 6
- GPXPJKFETRLRAS-AHUKKWBBSA-N Asteltoxin Chemical compound C(/[C@H]1O[C@H]2O[C@@H]([C@]([C@@]2(C)[C@H]1O)(C)O)CC)=C\C=C\C=C\C=1OC(=O)C=C(OC)C=1C GPXPJKFETRLRAS-AHUKKWBBSA-N 0.000 claims description 39
- GPXPJKFETRLRAS-UHFFFAOYSA-N Asteltoxin Natural products OC1C2(C)C(O)(C)C(CC)OC2OC1C=CC=CC=CC=1OC(=O)C=C(OC)C=1C GPXPJKFETRLRAS-UHFFFAOYSA-N 0.000 claims description 39
- 241000894006 Bacteria Species 0.000 claims description 14
- 230000007062 hydrolysis Effects 0.000 claims description 14
- 238000006460 hydrolysis reaction Methods 0.000 claims description 14
- 150000002373 hemiacetals Chemical class 0.000 claims description 9
- 230000003197 catalytic effect Effects 0.000 claims description 8
- 238000004321 preservation Methods 0.000 claims description 8
- 241001443610 Aschersonia Species 0.000 claims description 5
- 239000013604 expression vector Substances 0.000 claims description 5
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- 238000003259 recombinant expression Methods 0.000 claims description 3
- 150000002118 epoxides Chemical class 0.000 claims 2
- 125000003275 alpha amino acid group Chemical group 0.000 claims 1
- 230000008707 rearrangement Effects 0.000 abstract description 20
- 102000004190 Enzymes Human genes 0.000 abstract description 19
- 108090000790 Enzymes Proteins 0.000 abstract description 19
- 210000004027 cell Anatomy 0.000 abstract description 11
- 241000233866 Fungi Species 0.000 abstract description 10
- 229930014626 natural product Natural products 0.000 abstract description 10
- 239000011942 biocatalyst Substances 0.000 abstract description 5
- 239000003814 drug Substances 0.000 abstract description 5
- 229940079593 drug Drugs 0.000 abstract description 4
- 238000002360 preparation method Methods 0.000 abstract description 4
- 102000004169 proteins and genes Human genes 0.000 abstract description 4
- 238000002864 sequence alignment Methods 0.000 abstract description 3
- 230000002349 favourable effect Effects 0.000 abstract description 2
- 239000000047 product Substances 0.000 description 44
- 150000001875 compounds Chemical class 0.000 description 42
- 238000000855 fermentation Methods 0.000 description 37
- 230000004151 fermentation Effects 0.000 description 37
- 229940125904 compound 1 Drugs 0.000 description 26
- 239000004593 Epoxy Substances 0.000 description 23
- 241000223651 Aureobasidium Species 0.000 description 19
- 239000000543 intermediate Substances 0.000 description 15
- 101150081385 astD gene Proteins 0.000 description 14
- 229940088598 enzyme Drugs 0.000 description 14
- 108020004414 DNA Proteins 0.000 description 12
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 12
- 231100000678 Mycotoxin Toxicity 0.000 description 12
- 239000002636 mycotoxin Substances 0.000 description 12
- 239000012071 phase Substances 0.000 description 12
- 239000001965 potato dextrose agar Substances 0.000 description 12
- 108090000604 Hydrolases Proteins 0.000 description 11
- 241000972090 Calcarisporium arbuscula Species 0.000 description 10
- 102000004157 Hydrolases Human genes 0.000 description 10
- 150000001413 amino acids Chemical group 0.000 description 10
- 239000002609 medium Substances 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 230000006870 function Effects 0.000 description 9
- 108091008053 gene clusters Proteins 0.000 description 9
- 238000004128 high performance liquid chromatography Methods 0.000 description 9
- 238000000034 method Methods 0.000 description 9
- 150000002924 oxiranes Chemical class 0.000 description 9
- 150000004291 polyenes Chemical class 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- ZPSJGADGUYYRKE-UHFFFAOYSA-N 2H-pyran-2-one Chemical compound O=C1C=CC=CO1 ZPSJGADGUYYRKE-UHFFFAOYSA-N 0.000 description 8
- YMWUJEATGCHHMB-UHFFFAOYSA-N Dichloromethane Chemical compound ClCCl YMWUJEATGCHHMB-UHFFFAOYSA-N 0.000 description 8
- 239000000126 substance Substances 0.000 description 8
- 239000000758 substrate Substances 0.000 description 8
- 241001607706 Aspergillus stellatus Species 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 239000001963 growth medium Substances 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- 230000007246 mechanism Effects 0.000 description 7
- 238000009629 microbiological culture Methods 0.000 description 7
- 241000223678 Aureobasidium pullulans Species 0.000 description 6
- 101100111059 Calcarisporium arbuscula aurD gene Proteins 0.000 description 6
- 241000223250 Metarhizium anisopliae Species 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 244000005700 microbiome Species 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 241000589158 Agrobacterium Species 0.000 description 5
- 241001132374 Asta Species 0.000 description 5
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 5
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 5
- 229910052799 carbon Inorganic materials 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 238000009472 formulation Methods 0.000 description 5
- 230000006698 induction Effects 0.000 description 5
- 230000002906 microbiologic effect Effects 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 238000012795 verification Methods 0.000 description 5
- 229920001817 Agar Polymers 0.000 description 4
- 241000196324 Embryophyta Species 0.000 description 4
- 239000008272 agar Substances 0.000 description 4
- 101150026435 astA gene Proteins 0.000 description 4
- 238000010276 construction Methods 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- BRZYSWJRSDMWLG-CAXSIQPQSA-N geneticin Chemical compound O1C[C@@](O)(C)[C@H](NC)[C@@H](O)[C@H]1O[C@@H]1[C@@H](O)[C@H](O[C@@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](C(C)O)O2)N)[C@@H](N)C[C@H]1N BRZYSWJRSDMWLG-CAXSIQPQSA-N 0.000 description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 4
- 238000013508 migration Methods 0.000 description 4
- 239000007787 solid Substances 0.000 description 4
- IHKOXYNAMXNNDK-UHFFFAOYSA-N 2,3,3a,4,5,6a-hexahydrofuro[2,3-b]furan Chemical compound C1COC2OCCC21 IHKOXYNAMXNNDK-UHFFFAOYSA-N 0.000 description 3
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 3
- 206010006187 Breast cancer Diseases 0.000 description 3
- 208000026310 Breast neoplasm Diseases 0.000 description 3
- XEKOWRVHYACXOJ-UHFFFAOYSA-N Ethyl acetate Chemical compound CCOC(C)=O XEKOWRVHYACXOJ-UHFFFAOYSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000922174 Metarhizium robertsii Species 0.000 description 3
- 241001661277 Moelleriella libera Species 0.000 description 3
- 241000235395 Mucor Species 0.000 description 3
- 101100217185 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) aruC gene Proteins 0.000 description 3
- 101150083159 astB gene Proteins 0.000 description 3
- 101150024707 astC gene Proteins 0.000 description 3
- 230000001588 bifunctional effect Effects 0.000 description 3
- 230000001851 biosynthetic effect Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 229940126214 compound 3 Drugs 0.000 description 3
- 230000002538 fungal effect Effects 0.000 description 3
- 238000010353 genetic engineering Methods 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 238000000655 nuclear magnetic resonance spectrum Methods 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 230000001954 sterilising effect Effects 0.000 description 3
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 2
- -1 4-bromo-2, 6-di-tert-butylbenzene oxide Chemical compound 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 2
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 2
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 2
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 2
- 241000228245 Aspergillus niger Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 229920000298 Cellophane Polymers 0.000 description 2
- 241000654838 Exosporium Species 0.000 description 2
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 2
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 2
- 102000016397 Methyltransferase Human genes 0.000 description 2
- 108060004795 Methyltransferase Proteins 0.000 description 2
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 2
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 2
- 238000005481 NMR spectroscopy Methods 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 2
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 2
- 241000187693 Rhodococcus rhodochrous Species 0.000 description 2
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 2
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 2
- WYURNTSHIVDZCO-UHFFFAOYSA-N Tetrahydrofuran Chemical compound C1CCOC1 WYURNTSHIVDZCO-UHFFFAOYSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 2
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 2
- LYPKCSYAKLTBHJ-ILWGZMRPSA-N Tyr-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N)C(=O)O LYPKCSYAKLTBHJ-ILWGZMRPSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 229930184953 aurovertin Natural products 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 229940041514 candida albicans extract Drugs 0.000 description 2
- 239000003054 catalyst Substances 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 229940125782 compound 2 Drugs 0.000 description 2
- 238000005100 correlation spectroscopy Methods 0.000 description 2
- 239000000287 crude extract Substances 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 238000009795 derivation Methods 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 230000007247 enzymatic mechanism Effects 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 235000019253 formic acid Nutrition 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 238000003919 heteronuclear multiple bond coherence Methods 0.000 description 2
- 238000005570 heteronuclear single quantum coherence Methods 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 239000012074 organic phase Substances 0.000 description 2
- AICOOMRHRUFYCM-ZRRPKQBOSA-N oxazine, 1 Chemical compound C([C@@H]1[C@H](C(C[C@]2(C)[C@@H]([C@H](C)N(C)C)[C@H](O)C[C@]21C)=O)CC1=CC2)C[C@H]1[C@@]1(C)[C@H]2N=C(C(C)C)OC1 AICOOMRHRUFYCM-ZRRPKQBOSA-N 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012019 product validation Methods 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000002390 rotary evaporation Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 239000000741 silica gel Substances 0.000 description 2
- 229910002027 silica gel Inorganic materials 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 239000012137 tryptone Substances 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 239000012138 yeast extract Substances 0.000 description 2
- 238000004791 1D NOESY Methods 0.000 description 1
- PAWQVTBBRAZDMG-UHFFFAOYSA-N 2-(3-bromo-2-fluorophenyl)acetic acid Chemical compound OC(=O)CC1=CC=CC(Br)=C1F PAWQVTBBRAZDMG-UHFFFAOYSA-N 0.000 description 1
- SXGZJKUKBWWHRA-UHFFFAOYSA-N 2-(N-morpholiniumyl)ethanesulfonate Chemical compound [O-]S(=O)(=O)CC[NH+]1CCOCC1 SXGZJKUKBWWHRA-UHFFFAOYSA-N 0.000 description 1
- 238000005084 2D-nuclear magnetic resonance Methods 0.000 description 1
- LPMTVTDWWATVQO-UHFFFAOYSA-N 3-cyclooctyldioxocane Chemical group C1CCCCCCC1C1OOCCCCC1 LPMTVTDWWATVQO-UHFFFAOYSA-N 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- 244000020998 Acacia farnesiana Species 0.000 description 1
- 241001019659 Acremonium <Plectosphaerellaceae> Species 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 1
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 1
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- 241001465318 Aspergillus terreus Species 0.000 description 1
- JLSVDPQAIKFBTO-OMCRQDLASA-N Citreoviridin Chemical compound COC1=CC(=O)OC(\C=C\C=C\C=C\C(\C)=C\[C@@]2(C)[C@@H]([C@@](C)(O)[C@@H](C)O2)O)=C1C JLSVDPQAIKFBTO-OMCRQDLASA-N 0.000 description 1
- 241000223208 Curvularia Species 0.000 description 1
- 229920000832 Cutin Polymers 0.000 description 1
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 1
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- OEIDWQHTRYEYGG-QEJZJMRPSA-N Gln-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N OEIDWQHTRYEYGG-QEJZJMRPSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 1
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- UFHFLCQGNIYNRP-UHFFFAOYSA-N Hydrogen Chemical compound [H][H] UFHFLCQGNIYNRP-UHFFFAOYSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/80—Vectors or expression systems specially adapted for eukaryotic hosts for fungi
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/02—Oxygen as only ring hetero atoms
- C12P17/04—Oxygen as only ring hetero atoms containing a five-membered hetero ring, e.g. griseofulvin, vitamin C
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y303/00—Hydrolases acting on ether bonds (3.3)
- C12Y303/02—Ether hydrolases (3.3.2)
- C12Y303/02003—Epoxide hydrolase (3.3.2.3)
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Mycology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
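The invention discloses epoxide hydrolases, their encoding genes, and their applications. An epoxide hydrolase, AstD, that catalyzes a semipinacol rearrangement was identified from the filamentous fungus Emericella variecolor (异冠裸胞壳), and a homologous protein, MrvD, with the same catalytic function was found by homologous sequence alignment against the NCBI database. Genetically engineered filamentous fungi that highly express AstD or MrvD were then constructed and the asteltoxin biosynthetic pathway was reconstituted. The cultured engineered strains carry out an efficient semipinacol rearrangement inside the cell and are used for large-scale preparation of the natural product asteltoxin T1. The invention facilitates directed evolution toward an efficient, broad-spectrum semipinacol rearrangement, lays a foundation for developing new biocatalysts and for the efficient, green biosynthesis of chirally specific drugs, and has important application prospects.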
Description
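Technical Field
The invention belongs to the field of biotechnology and specifically relates to a novel epoxide hydrolase, its encoding gene, and its applications.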
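Background
The semipinacol rearrangement efficiently achieves the asymmetric synthesis of structurally complex compounds with high stereoselectivity and is therefore a very important class of reactions in organic synthesis. In a semipinacol rearrangement, the nucleophilic attack of the neighboring migrating group on the carbon of the epoxide (oxirane) ring is highly regioselective and stereoselective, and a chiral quaternary carbon center is often formed during the rearrangement. The rearrangement usually yields a carbonyl compound bearing an α- or β-hydroxyl group, which can undergo a further tandem cyclization to give structurally diverse, chirally specific compounds. Depending on the electrophilic carbon and the structure of the substrate, four main types of semipinacol rearrangement are used to synthesize structurally complex natural products. Among them, the type III semipinacol rearrangement takes 2,3-epoxy vicinal-diol polyene α-pyrone derivatives as substrates and mediates a 2,3-migration from C1 to C3 of the substrate, generating a stereospecific 2-quaternary aldol that can serve as an excellent precursor for natural product synthesis. However, only a few Lewis acid catalysts have been developed in chemical synthesis for the type III semipinacol rearrangement, and only one chemical catalyst, 4-bromo-2,6-di-tert-butylphenoxide, is currently used in natural product synthesis, which greatly limits the application of the reaction. Enzymes, as biocatalysts, offer high catalytic efficiency, strong stereo- and regioselectivity, mild reaction conditions, and environmental friendliness, so enzymatic catalysis of the type III semipinacol rearrangement has great potential in the biosynthesis of natural products. To date, however, no enzyme that catalyzes a type III semipinacol rearrangement in natural product biosynthesis has been reported.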
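Epoxide hydrolases (EH, EC 3.3.2.3) are highly efficient biocatalysts that are widely used in the synthesis of chiral drug intermediates. They catalyze the asymmetric synthesis of chiral compounds of high optical purity with high catalytic efficiency and strong regio- and stereoselectivity. Their substrate spectrum covers many epoxides, which makes them very important biocatalysts in chiral drug synthesis and powerful biosynthetic elements.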
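Epoxide hydrolases occur widely in plants, insects, mammals, and microorganisms, and perform different physiological functions in different organisms. Mammalian EHs were discovered first and are the most thoroughly studied; they mainly take part in the catabolism of endogenous or exogenous toxic substances. Plant EHs are abundant and have been found successively in common plants such as soybean, Arabidopsis thaliana, pineapple, potato, mung bean, and tobacco, where they mainly take part in cutin synthesis and are associated with certain stress responses (such as drought stress). Microbial genomes encode several types of EH (soluble EH, leukotriene A4 hydrolase, microsomal EH, and others), which play important roles in catabolizing specific carbon sources in the natural environment and in degrading environmental pollutants. Because microorganisms are highly diverse, grow quickly, are easy to culture, and give high enzyme yields, they remain the main source of EHs. The search for new EH sources in microorganisms has been an international research focus in recent years, and there are many reports, both domestic and international, on the screening of EH-producing microorganisms and on the purification, gene cloning, and expression of their enzymes (Tetrahedron: Asymmetry 1998, 9: 459-466). Epoxide hydrolase preparations from Aspergillus niger, Agrobacterium radiobacter, and Rhodococcus rhodochrous are commercially available. Even so, the range of available enzyme preparations is still too small and their substrate scope is limited, so new epoxide hydrolases with potential application value remain to be developed.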
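Summary of the Invention
In view of this, the invention provides an epoxide hydrolase, its encoding gene, and its applications. The epoxide hydrolase is a bifunctional enzyme that has both epoxide hydrolysis and semipinacol rearrangement activity, and includes AstD or MrvD. In the biosynthesis of asteltoxin, a mycotoxin with anti-breast-cancer activity, the epoxide hydrolase efficiently catalyzes hydrolysis of an epoxide intermediate and concomitantly catalyzes a semipinacol rearrangement, giving the hemiacetal product asteltoxin T1.
To achieve the above purpose, the invention is realized through the following technical solutions: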
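A first aspect of the invention provides the epoxide hydrolase AstD or MrvD, wherein the amino acid sequence of AstD is shown in SEQ ID NO.5 and the amino acid sequence of MrvD is shown in SEQ ID NO.7.
A second aspect of the invention provides the gene astD or mrvD encoding the above epoxide hydrolase, wherein the nucleotide sequence of astD is shown in SEQ ID NO.4 and the nucleotide sequence of mrvD is shown in SEQ ID NO.6.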
本发明的第三个方面在于提供含有所述环氧水解酶基因astD或mrvD的重组表达载体。
就本发明的上述目的,所述重组表达载体为pFGL-aurAp-astD或pFGL-aurAp-mrvD质粒。
本发明的第四个方面在于提供含有高表达所述环氧水解酶AstD或MrvD的基因工程菌。
就本发明的上述目的,所述的基因工程菌为齿梗孢霉(Calcarisporiumarbuscula)。
就本发明的上述目的,所述的基因工程菌为齿梗孢霉AstD高表达菌株,菌株编号为M100,分类命名为齿梗孢霉菌(Calcarisporium arbuscula),保藏于中国微生物菌种保藏管理委员会普通微生物中心(简称CGMCC),保藏编号为CGMCC NO.20273,保藏日期为2020年9月21日。
就本发明的上述目的,所述的基因工程菌为齿梗孢霉MrvD高表达菌株,菌株编号为M101,分类命名为齿梗孢霉(Calcarisporium arbuscula),保藏于中国微生物菌种保藏管理委员会普通微生物中心(简称CGMCC),保藏编号为CGMCC NO.20272,保藏日期为2020年9月21日。
本发明的第五个方面在于提供上述环氧水解酶AstD或MrvD的应用,包括:上述环氧水解酶AstD或MrvD在催化半频哪醇重排反应中的应用,或者,上述环氧水解酶AstD或MrvD在asteltoxin的生物合成中的应用。
就本发明的上述应用,所述环氧水解酶AstD或MrvD在细胞体内高效催化环氧化物中间体水解,并伴随催化半频哪醇重排反应,得到半缩醛产物asteltoxin T1(化合物8)。
本发明的第六个方面在于提供上述基因工程菌的应用,包括:上述基因工程菌在催化半频哪醇重排反应中的应用,或者,上述基因工程菌在asteltoxin的生物合成中的应用。
就本发明的上述应用,所述基因工程菌中高表达的环氧水解酶AstD或MrvD在细胞体内高效催化环氧化物中间体水解,并伴随催化半频哪醇重排反应,得到半缩醛产物asteltoxin T1(化合物8)。
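The epoxide hydrolase AstD of the invention comes from the filamentous fungus Emericella variecolor. A draft genome sequence of E. variecolor was obtained by genome sequencing, and bioinformatic analysis identified the biosynthetic gene cluster of asteltoxin (compound 1), which contains the epoxide hydrolase AstD. The amino acid sequence of AstD and the sequence of its encoding gene were determined by molecular biology techniques. The gene and amino acid sequences of a second epoxide hydrolase, MrvD, were found by homologous amino acid sequence alignment against the NCBI database. The epoxide hydrolase genes astD and mrvD were each ligated into the filamentous fungal expression vector pFGL-aurAp and transformed into the Calcarisporium arbuscula ΔaurD deletion strain, giving a C. arbuscula AstD high-expression strain and a C. arbuscula MrvD high-expression strain. Large amounts of the hemiacetal product asteltoxin T1 (compound 8) were found in the fermentation products of these engineered strains. Further analysis indicated that, in the heterologous host, the enzyme efficiently catalyzes hydrolysis of the epoxide intermediate to give the epoxy vicinal-diol polyene α-pyrone substrate and concomitantly catalyzes the semi-pinacol rearrangement of that substrate to give large amounts of asteltoxin T1 (compound 8). That is, the epoxide hydrolases AstD and MrvD are bifunctional, combining epoxide hydrolysis and semi-pinacol rearrangement activity. The enzymes of the invention can be used for semi-pinacol rearrangements in biomedicine, biochemical engineering, and related fields to obtain structurally complex compounds, and have great application value.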
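Compared with the prior art, the invention has the following beneficial technical effects:
The epoxide hydrolase AstD or MrvD of the invention is bifunctional, combining epoxide hydrolysis and semi-pinacol rearrangement activity. In the biosynthesis of asteltoxin, a mycotoxin with anti-breast-cancer activity, it efficiently catalyzes hydrolysis of the epoxide intermediate with a concomitant semi-pinacol rearrangement, giving the hemiacetal product asteltoxin T1 (compound 8).
Based on the epoxide hydrolase AstD or MrvD of the invention, genetically engineered filamentous fungi that highly express AstD or MrvD can be constructed to carry out the semi-pinacol rearrangement efficiently in vivo and to prepare the natural product asteltoxin T1 (compound 8) in large quantities.
The invention facilitates directed evolution toward efficient, broad-spectrum semi-pinacol rearrangement, lays a foundation for developing new biocatalysts and for the efficient, green biosynthesis of chirally specific drugs, and has important application prospects.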
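Biological deposit information:
Calcarisporium arbuscula AstD high-expression strain, strain number M100, taxonomically named Calcarisporium arbuscula, deposited at the China General Microbiological Culture Collection Center (CGMCC), address: Institute of Microbiology, Chinese Academy of Sciences, No. 3, Yard 1, Beichen West Road, Chaoyang District, Beijing; accession number CGMCC No. 20273; date of deposit September 21, 2020.
Calcarisporium arbuscula MrvD high-expression strain, strain number M101, taxonomically named Calcarisporium arbuscula, deposited at the China General Microbiological Culture Collection Center (CGMCC), address: Institute of Microbiology, Chinese Academy of Sciences, No. 3, Yard 1, Beichen West Road, Chaoyang District, Beijing; accession number CGMCC No. 20272; date of deposit September 21, 2020.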
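Brief Description of the Drawings
Figure 1 shows the biosynthetic gene cluster of asteltoxin (compound 1) identified in Emericella variecolor by homology analysis.
Figure 2 compares the HPLC results for the fermentation products of the different Calcarisporium arbuscula high-expression strains.
Figure 3 is the 1H NMR spectrum of asteltoxin T1 (compound 8) of the invention.
Figure 4 is the 13C NMR spectrum of asteltoxin T1 (compound 8) of the invention.
Figure 5 is the DEPT135 NMR spectrum of asteltoxin T1 (compound 8) of the invention.
Figure 6 is the HSQC NMR spectrum of asteltoxin T1 (compound 8) of the invention.
Figure 7 is the COSY NMR spectrum of asteltoxin T1 (compound 8) of the invention.
Figure 8 is the HMBC NMR spectrum of asteltoxin T1 (compound 8) of the invention.
Figure 9 is the NOESY NMR spectrum of asteltoxin T1 (compound 8) of the invention.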
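Detailed Description of the Embodiments
After extensive and in-depth research, the inventors took the mycotoxin asteltoxin (compound 1, which has anti-breast-cancer activity) from the filamentous fungus Emericella variecolor NHL 2881 as the target molecule and studied its biosynthesis by combining microbiology, molecular biology, bioinformatics, biochemistry, and organic chemistry. They identified the biosynthetic gene cluster of asteltoxin (compound 1) for the first time and were also the first to elucidate its biosynthetic pathway. Through the study of the biosynthetic mechanism of asteltoxin (compound 1), they revealed the enzymatic mechanism by which the unique 2,8-dioxabicyclo[3.3.0]octane (or bis-tetrahydrofuran) structure of compound 1 is formed, isolated and identified the bifunctional enzyme AstD, which has both epoxide hydrolysis and semi-pinacol rearrangement activity, and found a second bifunctional enzyme, MrvD, by homologous amino acid sequence alignment. On this basis, genetic engineering was used to obtain a Calcarisporium arbuscula AstD high-expression strain and a C. arbuscula MrvD high-expression strain. The fermentation products of these engineered strains contain large amounts of the hemiacetal product asteltoxin T1 (compound 8), so the strains can be used for the large-scale synthesis of the natural mycotoxin asteltoxin (compound 1).
Where specific techniques or conditions are not indicated, the techniques or conditions described in the literature of the field or in the product instructions are followed. Reagents and instruments for which no manufacturer is indicated are conventional, commercially available products.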
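Plasmids, strains, and media:
For the construction of the BamHI-digested fungal high-expression plasmid pFGL-aurAp used in the invention, see the preparation of plasmid pFLG-aurAp in the published text of Chinese invention patent application No. 201810129024.5 (publication No. CN108265074A). The pFLG-aurAp of that document is in fact pFGL-aurAp, which was obtained by inserting the geneticin resistance gene neoR and the promoter aurAp from Calcarisporium arbuscula into the Agrobacterium expression vector pFGL815N.
Spores of wild-type Emericella variecolor NHL 2881 used in the invention were purchased from the CBS culture collection, the Netherlands (No. CBS 668.82).
For the construction of the filamentous fungus Calcarisporium arbuscula epoxide hydrolase deletion strain (C. arbuscula ΔaurD) used in the invention, see: Xu-Ming Mao et al., Efficient Biosynthesis of Fungal Polyketides Containing the Dioxabicyclo-octane Ring System, J. Am. Chem. Soc. 2015, 137, 11904-11907.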
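Potato dextrose agar (PDA) medium (1 L): PDB (Potato Dextrose Broth) 24 g, agar powder 20 g; sterilize at 121 °C for 20 minutes.
Solid LB medium (1 L): tryptone 10 g, yeast extract 5 g, sodium chloride 10 g, agar powder 20 g; sterilize at 121 °C for 20 minutes.
Liquid LB medium (1 L): tryptone 10 g, yeast extract 5 g, sodium chloride 10 g, distilled water 1 L; sterilize at 121 °C for 20 minutes.
Liquid induction medium (1 L): glucose 1.8 g, glycerol 5 mL, 2-(N-morpholino)ethanesulfonic acid 8.53 g (purchased from Shanghai Yeasen Biotech), magnesium sulfate heptahydrate 0.6 g, sodium chloride 0.3 g, calcium chloride dihydrate 0.01 g, ferrous sulfate 0.001 g, ammonium nitrate 0.5 g, potassium phosphate buffer (pH 4.8) 0.8 mL, trace elements 5 mL (containing zinc sulfate heptahydrate 0.1 g/L, copper sulfate pentahydrate 0.1 g/L, boric acid 0.1 g/L, manganese sulfate monohydrate 0.1 g/L, and sodium molybdate dihydrate 0.1 g/L); adjust to pH 5.5.
Solid induction medium (1 L): 1 L liquid induction medium, 2 g agar powder.
Resistant PDA medium (1 L): 1 L PDA medium, 0.3 mg cefotaxime sodium, and 0.1 g geneticin (both purchased from Sigma).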
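The recipes above are all stated per litre, so smaller batches need a proportional scale-down. The following is a minimal sketch of that arithmetic for the PDA recipe; the dictionary and function names are illustrative and do not come from the patent.

```python
# Per-litre PDA recipe as stated in the text (grams per 1 L of medium).
PDA_PER_LITRE_G = {"PDB": 24.0, "agar": 20.0}


def scale_recipe(per_litre_g, volume_ml):
    """Return the grams of each component needed for `volume_ml` of medium."""
    factor = volume_ml / 1000.0
    return {name: grams * factor for name, grams in per_litre_g.items()}


if __name__ == "__main__":
    # Example: amounts for a 250 mL batch of PDA.
    for name, grams in scale_recipe(PDA_PER_LITRE_G, 250).items():
        print(f"{name}: {grams:g} g")
```

The same helper could be applied to the other per-litre recipes by supplying their component tables; liquid components given in mL scale in the same way.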
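Embodiments of the invention are described in detail below with reference to the drawings and specific examples. These examples only illustrate the invention and should not be taken to limit its scope.
1. Identification of the asteltoxin biosynthetic gene cluster
Spores of wild-type Emericella variecolor were spread evenly on potato dextrose agar (PDA) medium and cultured at 30 °C for 5 days. Fresh spores were collected and sent to Zhejiang Tianke High-Tech Development Co., Ltd. (Zhejiang Institute of Microbiology) for genomic DNA extraction, sequencing, and draft genome assembly.
The genome was annotated with bioinformatic tools (https://fungismash.secondarymetabolites.org/) and its secondary metabolite gene clusters were analyzed. Local BLAST was used for homology analysis against the biosynthetic gene clusters of the structurally similar natural products aurovertin (from Calcarisporium arbuscula) and citreoviridin (from Aspergillus terreus) to search for candidate asteltoxin gene clusters. This led to a preliminary assignment of the biosynthetic gene cluster of asteltoxin (compound 1) and the related homologous genes within it (Figure 1).
As Figure 1 shows, the biosynthetic gene cluster of asteltoxin (compound 1) comprises astA, astB, astC, and astD, where astA encodes the polyketide synthase AstA, astB encodes the methyltransferase AstB, astC encodes the monooxygenase AstC, and astD encodes the α/β epoxide hydrolase AstD. The nucleotide sequences of astA, astB, astC, and astD are shown in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, and SEQ ID NO:4, respectively.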
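2. Functional prediction of the epoxide hydrolase AstD
Analysis of the molecular structure of asteltoxin (compound 1) suggested that its 2,8-dioxabicyclo[3.3.0]octane (or bis-tetrahydrofuran) moiety is formed by repeated epoxidation of the terminal polyene, followed by epoxide hydrolysis with a concomitant 2,3-migration from C1 to C3. In other words, the biosynthetic pathway of asteltoxin (compound 1) was predicted to include a semi-pinacol rearrangement.
It was further predicted that the semi-pinacol rearrangement occurs after the epoxide has been hydrolyzed by an epoxide hydrolase to the epoxy vicinal-diol polyene α-pyrone. Taking into account the spatial distribution of the enzymes, their substrate selectivity, and other factors, the semi-pinacol rearrangement in asteltoxin (compound 1) biosynthesis was predicted to be mediated most probably by the epoxide hydrolase AstD. The amino acid sequence of the α/β epoxide hydrolase AstD is shown in SEQ ID NO:5 and contains 398 amino acids.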
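As a quick consistency check between the listed gene and protein lengths, the sketch below compares each nucleotide length with the length of an intron-free open reading frame (three nucleotides per residue plus one stop codon). The lengths are the `<211>` values in the sequence listing; the function name is illustrative. A non-zero surplus for astD would be consistent with the listed sequence containing a short intron, but that reading is an interpretation and is not stated in the source.

```python
def cds_surplus_nt(nucleotide_len, protein_len):
    """Nucleotides in excess of an intron-free ORF (codons plus one stop codon)."""
    expected_orf = 3 * (protein_len + 1)
    return nucleotide_len - expected_orf


# Lengths taken from the <211> fields of the sequence listing.
ASTD_NT, ASTD_AA = 1266, 398  # SEQ ID NO:4 / SEQ ID NO:5
MRVD_NT, MRVD_AA = 1155, 384  # SEQ ID NO:6 / SEQ ID NO:7

if __name__ == "__main__":
    print("astD surplus (nt):", cds_surplus_nt(ASTD_NT, ASTD_AA))
    print("mrvD surplus (nt):", cds_surplus_nt(MRVD_NT, MRVD_AA))
```

With these values the astD listing is 69 nt longer than an intron-free reading frame for a 398-residue protein, whereas the mrvD listing matches its 384-residue protein exactly.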
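3. Identification of homologs of astD
NCBI BLAST (https://blast.ncbi.nlm.nih.gov/Blast.cgi) was used to search published gene databases for homologs of AstD, the epoxide hydrolase of asteltoxin biosynthesis. Metarhizium robertsii and Metarhizium anisopliae were each found to carry one gene whose protein sequence is 67% homologous to AstD, and further analysis showed that the two genes encode identical protein sequences.
The astD homolog in Metarhizium robertsii is mrvD. The nucleotide sequence of mrvD is shown in SEQ ID NO:6; the epoxide hydrolase MrvD that it encodes has NCBI accession number XP_007824469.1, and its amino acid sequence is shown in SEQ ID NO:7 and contains 384 amino acids.
The astD homolog mrvD from Metarhizium robertsii was synthesized in full by chemical synthesis at Shanghai Generay Biotech Co., Ltd.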
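To give a feel for the similarity between the two enzymes, the sketch below counts identical positions in the first 32 residues of AstD and MrvD, transcribed in one-letter code from SEQ ID NO:5 and SEQ ID NO:7. It is an ungapped, position-by-position comparison of a short N-terminal window, not a BLAST alignment, so it is not expected to reproduce the 67% figure reported for the full-length proteins. The variable and function names are illustrative.

```python
# Residues 1-32 of AstD (SEQ ID NO:5) and MrvD (SEQ ID NO:7), one-letter code.
ASTD_N32 = "MPQSAKYVLPAYGALALYALVDFSYRNGYIPM"
MRVD_N32 = "MAQLTKYILPIYGLLGLYSLGYFSYRNGYIDM"


def count_identities(a, b):
    """Count positions at which two equal-length sequences carry the same residue."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length for an ungapped comparison")
    return sum(x == y for x, y in zip(a, b))


if __name__ == "__main__":
    identical = count_identities(ASTD_N32, MRVD_N32)
    print(f"{identical}/{len(ASTD_N32)} identical positions "
          f"({100 * identical / len(ASTD_N32):.1f}%)")
```

A full-length comparison would need a gapped alignment, because the two proteins differ in length (398 versus 384 residues).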
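4. Proposed mechanism of asteltoxin biosynthesis
The biosynthetic pathway of asteltoxin (compound 1) was then proposed. To explain more clearly the enzymatic mechanism by which the unique 2,8-dioxabicyclo[3.3.0]octane (or bis-tetrahydrofuran) structure of compound 1 is formed (that is, the semi-pinacol rearrangement mediated by the epoxide hydrolase AstD), the proposed biosynthetic pathway of asteltoxin (compound 1) is compared with the published biosynthetic pathway of aurovertin, and the comparison is shown in the scheme below.
As shown in the scheme above, the proposed biosynthetic pathway of asteltoxin (compound 1) is as follows:
A series of precursors is converted by the polyketide synthase AstA into a polyene α-pyrone (compound 5), which is methylated by the methyltransferase AstB to give the methylated polyene α-pyrone (compound 6); the monooxygenase AstC and hydrolysis by the α/β epoxide hydrolase AstD then give the epoxy vicinal-diol polyene α-pyrone intermediate;
In the synthesis of compound 2 (aurovertin E), compound 3 (aurovertin B), and compound 4 (aurovertin D), the epoxide hydrolase AurD mediates attack of the 3-hydroxyl group on the carbocation at position 6 to give the tetrahydrofuran intermediate (compound 7). In the synthesis of asteltoxin (compound 1), by contrast, the epoxy vicinal-diol polyene α-pyrone substrate undergoes a 2,3-migration from C1 to C3 (type III semi-pinacol rearrangement) and gives the hemiacetal intermediate asteltoxin T1 (compound 8);
Subsequently, as with aurovertin, asteltoxin T1 (compound 8) is further oxygenated and hydrolyzed to give the final product asteltoxin (compound 1).
In the aurovertin biosynthetic gene cluster, the gene aurD encodes the epoxide hydrolase AurD; its nucleotide sequence is shown in SEQ ID NO:8.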
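5. Verification of the asteltoxin biosynthetic mechanism
The epoxide hydrolases AstD and MrvD were heterologously expressed in Calcarisporium arbuscula, which gave large amounts of asteltoxin T1, the natural product formed by the semi-pinacol rearrangement. This verified that AstD and MrvD catalyze the semi-pinacol rearrangement, showed that an enzyme catalyzing this rearrangement can produce asteltoxin T1 (compound 8) in a filamentous fungus, and thereby verified the biosynthetic mechanism proposed above for asteltoxin (compound 1). Details are as follows:
5.1 Construction of plasmids for high expression of AstD or MrvD in Calcarisporium arbuscula
Spores of wild-type Emericella variecolor were spread on solid LB medium containing 5% rice and 2% agar (purchased from Sangon Biotech, Shanghai) and grown at 30 °C for about 5 days. Mycelia were harvested and total RNA was extracted by grinding in liquid nitrogen in combination with the TRIzol Universal RNA extraction kit (purchased from TransGen Biotech, Beijing). The RNA was then reverse-transcribed into cDNA with a cDNA synthesis kit (purchased from TaKaRa, Dalian).
With wild-type Emericella variecolor cDNA as the template, astD was amplified by PCR with the first primer pair (SEQ ID NO:9 and SEQ ID NO:10). With the synthesized homolog mrvD as the template, mrvD was amplified by PCR with the second primer pair (SEQ ID NO:11 and SEQ ID NO:12). In parallel, with Calcarisporium arbuscula cDNA as the template, aurD was amplified by PCR with the third primer pair (SEQ ID NO:13 and SEQ ID NO:14).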
Table 1. Primers used for PCR amplification
SEQ ID NO | Sequence |
---|---|
9 | CGACCACCTAACAACATGCCTCAATCCGCAAAATAC |
10 | GTCATCCTTGTAATCGAGCTGTCTTTCCTTCTCC |
11 | CGACCACCTAACAACATGGCTCAATTAACAAAATAC |
12 | GTCATCCTTGTAATCCTTTGTCTTTTGTTTAGGTCGC |
13 | CGACCACCTAACAACATGCCTCAATCCACGAAATAC |
14 | GTCATCCTTGTAATCTTTTGTCTTTTCTTTAGCACG |
The amplified fragments were inserted by seamless cloning into the BamHI-digested fungal high-expression plasmid pFGL-aurAp, giving the three plasmids pFGL-aurAp-astD, pFGL-aurAp-mrvD, and pFGL-aurAp-aurD.
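The primers in Table 1 can be inspected programmatically. The sketch below finds the sequence shared by all forward primers and by all reverse primers, then checks that the gene-specific part of the astD reverse primer (SEQ ID NO:10) anneals directly upstream of the astD stop codon, using the last 36 nt of SEQ ID NO:4. That the shared 5' tails serve as overlaps with the vector for seamless cloning is an assumption made here, not a statement in the source; the names used are illustrative.

```python
import os

# Primers from Table 1, keyed by SEQ ID NO.
FORWARD = {
    9: "CGACCACCTAACAACATGCCTCAATCCGCAAAATAC",
    11: "CGACCACCTAACAACATGGCTCAATTAACAAAATAC",
    13: "CGACCACCTAACAACATGCCTCAATCCACGAAATAC",
}
REVERSE = {
    10: "GTCATCCTTGTAATCGAGCTGTCTTTCCTTCTCC",
    12: "GTCATCCTTGTAATCCTTTGTCTTTTGTTTAGGTCGC",
    14: "GTCATCCTTGTAATCTTTTGTCTTTTCTTTAGCACG",
}

# Last 36 nt of astD (SEQ ID NO:4), ending in the stop codon TAG.
ASTD_3PRIME = "gcagcgttggagaaggagaaggaaagacagctctag".upper()


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


# os.path.commonprefix compares strings character by character.
fwd_shared = os.path.commonprefix(list(FORWARD.values()))
rev_shared = os.path.commonprefix(list(REVERSE.values()))

# Gene-specific part of the astD reverse primer, read on the coding strand.
astd_rev_site = reverse_complement(REVERSE[10][len(rev_shared):])

if __name__ == "__main__":
    print("shared forward prefix:", fwd_shared)
    print("shared reverse prefix:", rev_shared)
    print("astD reverse primer binds just before stop codon:",
          ASTD_3PRIME.endswith(astd_rev_site + "TAG"))
```

Note that the shared forward prefix extends through the ATG start codon, which all three genes have in common, so the tail proper is the 15 nt that precede it.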
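5.2 Construction of Calcarisporium arbuscula AstD or MrvD high-expression strains
The filamentous fungus Calcarisporium arbuscula epoxide hydrolase deletion strain (C. arbuscula ΔaurD) was transformed by means of Agrobacterium EHA105 (purchased from Shanghai Weidi Biotechnology):
First, the constructed plasmid pFGL-aurAp-astD was electroporated into Agrobacterium EHA105, and the resulting Agrobacterium transformants were inoculated into liquid LB medium containing 50 μg/mL kanamycin and cultured overnight. The Agrobacterium culture was then transferred to liquid induction medium and grown to OD600 ≈ 0.6-0.8. The induced Agrobacterium was mixed with spores of the C. arbuscula epoxide hydrolase deletion strain (C. arbuscula ΔaurD) at a cell-number ratio of 100:1, spread on solid induction medium overlaid with cellophane, and cultured at 25 °C for 2 days. Finally, the mixed culture was transferred together with the cellophane onto resistant PDA medium and cultured at 25 °C for 7 days until primary transformants appeared. Primary transformants were picked onto fresh resistant PDA medium and cultured at 25 °C; after about 5 days, positive transformant No. 1 (strain No. 1) was picked for verification of its fermentation products.
The same procedure was carried out with plasmid pFGL-aurAp-mrvD in place of pFGL-aurAp-astD, and positive transformant No. 2 (strain No. 2) was picked for verification of its fermentation products.
Likewise, the same procedure was carried out with plasmid pFGL-aurAp-aurD in place of pFGL-aurAp-astD, and positive transformant No. 3 (strain No. 3) was picked for verification of its fermentation products.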
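5.3 Verification of the fermentation products of the Calcarisporium arbuscula AstD or MrvD high-expression strains
The fermentation products of strain No. 1 selected in step 5.2 (the C. arbuscula AstD high-expression strain, labeled iii, ΔaurD-astD) were verified by HPLC.
The fermentation products of strain No. 2 selected in step 5.2 (the C. arbuscula MrvD high-expression strain, labeled iv, ΔaurD-mrvD) were verified by HPLC.
The fermentation products of strain No. 3 selected in step 5.2 (the C. arbuscula epoxide hydrolase aurD deletion strain highly expressing aurD, labeled v, ΔaurD-aurD) were verified by HPLC.
For comparison, HPLC was also used to analyze the fermentation products of wild-type C. arbuscula (labeled i, wild type; fermentation conditions: PDA medium, 25 °C, 5 days) and of the C. arbuscula epoxide hydrolase deletion strain (labeled ii, ΔaurD; fermentation conditions: PDA medium, 25 °C, 5 days).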
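Verification (detection) of the fermentation products by HPLC was carried out as follows:
The strain or fermentation product to be tested was extracted with ethyl acetate containing 10% methanol. After centrifugation, the upper organic phase was collected and concentrated to dryness by centrifugal evaporation to give a crude extract, which was dissolved in 100 μL of HPLC-grade methanol, passed through an organic-solvent filter membrane, and analyzed by HPLC.
Detection conditions: column: XDB-C18, 5 μm, 4.6 × 150 mm; mobile phase A: water + 1‰ formic acid; mobile phase B: acetonitrile + 1‰ formic acid; mobile-phase volume ratio, 0-30 min: B:A = 70:30 to 0:100; flow rate: 1 mL/min; detection: full-wavelength scan, 220-500 nm.
The HPLC results are shown in Figure 2. For ease of comparison, Figure 2 also marks the characteristic peaks of compounds 1, 2, 3, 4, and 8 and includes the HPLC trace of an asteltoxin (compound 1) standard (labeled vi, asteltoxin standard).
In Figure 2, the fermentation products of wild-type C. arbuscula (i, wild type) include large amounts of compounds 4 and 3 and a small amount of compound 2; those of the C. arbuscula epoxide hydrolase deletion strain (ii, ΔaurD) include compound 4 and large amounts of nonspecific hydrolysis products; those of strain No. 1 include a large amount of the intermediate compound 8 and a small amount of compound 1; those of strain No. 2 likewise include a large amount of the intermediate compound 8 and a small amount of compound 1; and those of strain No. 3 include a large amount of compound 4.
Figure 2 thus shows that, in the C. arbuscula epoxide hydrolase deletion strain, high expression of AurD restores the synthesis of compound 4, whereas high expression of AstD or MrvD gives a large amount of the intermediate compound 8 and a small amount of compound 1.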
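Strain No. 1 (the C. arbuscula AstD high-expression strain), whose fermentation products had been verified, was scaled up and deposited as Calcarisporium arbuscula M100 (strain number M100, taxonomically named Calcarisporium arbuscula). It was deposited on September 21, 2020 at the China General Microbiological Culture Collection Center, address: Institute of Microbiology, Chinese Academy of Sciences, No. 3, Yard 1, Beichen West Road, Chaoyang District, Beijing, under accession number CGMCC No. 20273.
Strain No. 2 (the C. arbuscula MrvD high-expression strain), whose fermentation products had been verified, was scaled up and deposited as Calcarisporium arbuscula M101 (strain number M101, taxonomically named Calcarisporium arbuscula). It was deposited on September 21, 2020 at the China General Microbiological Culture Collection Center, address: Institute of Microbiology, Chinese Academy of Sciences, No. 3, Yard 1, Beichen West Road, Chaoyang District, Beijing, under accession number CGMCC No. 20272.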
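5.4 Purification and structural characterization of the fermentation products of the Calcarisporium arbuscula AstD or MrvD high-expression strains
The C. arbuscula epoxide hydrolase high-expression strains whose fermentation products had been verified, namely the AstD high-expression strain and the MrvD high-expression strain, were each inoculated into 1 L of potato dextrose broth (PDB) and fermented statically at room temperature in 20 cm culture dishes for about 2 weeks. After fermentation, the two products were collected separately as fermentation product No. 4 (from the AstD high-expression strain) and fermentation product No. 5 (from the MrvD high-expression strain).
Fermentation product No. 4 was extracted three times by soaking in 200 mL of dichloromethane, and rotary evaporation gave about 550 mg of extract. The extract was then fractionated on a medium/low-pressure preparative chromatograph (purchased from CombiFlash, USA) under the following conditions: column: 18 g prepacked silica gel column; mobile phase A: dichloromethane; mobile phase B: methanol; mobile-phase volume ratio, 0-20 min: B:A = 95:5 to 0:100; flow rate: 20 mL/min; collection: peaks with UV signal at 385 nm. Rotary evaporation of the collected fractions gave about 80 mg of crude product. Finally, 40 mg was loaded on a 1 mm preparative silica gel plate and developed in a developing tank, protected from light, for about 1.5 hours with dichloromethane:methanol = 20:1. The yellow band was collected, and rotary evaporation gave about 8 mg of purified fermentation product, compound 8.
The chemical structure of the fermentation product compound 8, including the absolute configuration of its chiral centers, was determined from the one-dimensional 1H (Figure 3), 13C (Figure 4), and DEPT135 (Figure 5) NMR spectra and the two-dimensional HSQC (Figure 6), COSY (Figure 7), HMBC (Figure 8), and NOESY (Figure 9) spectra, with reference to the chemical structure and chirality of compound 1. The structure is that shown for compound 8 in the biosynthetic scheme proposed in "4. Proposed mechanism of asteltoxin biosynthesis" above. This verified that compound 8 is a hemiacetal product formed from the bis-epoxide intermediate by epoxide ring opening to the epoxy vicinal-diol polyene α-pyrone intermediate, followed by semi-pinacol rearrangement and aldol condensation. It also shows that the biosynthetic pathway of asteltoxin (compound 1) is as proposed in "4. Proposed mechanism of asteltoxin biosynthesis" above.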
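In the above examples, high expression of AurD in the C. arbuscula epoxide hydrolase deletion strain restored the synthesis of compound 4, indicating that AurD only catalyzes epoxide hydrolysis and does not catalyze the semi-pinacol rearrangement. High expression of AstD in the same deletion strain efficiently catalyzed the semi-pinacol rearrangement and gave a large amount of compound 8, indicating that AstD is bifunctional, catalyzing both epoxide hydrolysis and the semi-pinacol rearrangement.
Fermentation product No. 5 was purified and characterized by the same methods as fermentation product No. 4 and led to the same conclusion. MrvD therefore has the same function as AstD: both catalyze epoxide hydrolysis and the semi-pinacol rearrangement. High expression of MrvD in the C. arbuscula epoxide hydrolase deletion strain also efficiently catalyzes the semi-pinacol rearrangement to give a large amount of compound 8, which can be used in the biosynthesis of asteltoxin (compound 1).
In addition, the successful expression of AstD and MrvD in C. arbuscula shows that the enzymes can be expressed in a heterologous host with efficient catalytic function, and that they have some potential for industrial application.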
Sequence Listing
<110> Zhejiang University
<120> Epoxide hydrolase, gene encoding same, and use thereof
<160> 14
<170> SIPOSequenceListing 1.0
<210> 1
<211> 8192
<212> DNA
<213> Emericella variecolor
<400> 1
atggctcccg aacctatcgc tatcattgga actggctgta gactccctgg ctcatcctca 60
tcgccgtcac gcctctggga gctgctcagc aagccaaagg atgtcgcgtc caagcctcca 120
gcggatcgat tcaacattga cggattctac catccgaatc caaccaaccc gctgacgctc 180
aatgtcaaag agtcgtactt cctcaacgat aacgtacggc agttcgacgc atccttcttc 240
aacattgctg ccaacgaggc caccagcctg gacccgcagc agcggatgct cctcgaaact 300
gtgtacgagt ccctcgaggc cgcaggactt cgcatggagg cccttcgcgg gtcctctaca 360
ggtgtctttt gcggagccat gtgtgccgac tgggaagctc tgttagccct ggacaaggct 420
gttcctgaat acgtgagcac atgcccgtac tgccctcacg tatctatttt cttttccctt 480
tcatgagtgc tgatttgaag aaatcaatga aaaaaggcca tctctggcgt agcacgcaac 540
aacctagcaa accggatctc gtactttttc gactggaacg gcccatcaat gaccattgac 600
accgcctgtt cctcaagcct ggtggctctt caccagggaa tctctgcgct ccaacgcggc 660
gagtgctcac ttgttgcctc cgttggcgtc aacctgatcc tggcaccaac tttgtatttt 720
gcggcgtcga atcttcaaat gctctcaccc gaagcccgtg gtcggatgtg ggatcagaac 780
gccaacgggt acgtccgtgg ggaaggagtg gcttctgtca tcttgaagcg actcagtgat 840
gctgttgcag atggtgatcc gatagagtgt gtgatacggg cttctggcgt gaatcaggta 900
cgttagcaag ctggaccagc ccgaacaacg agtctgatat ctgtaggatg cgcgcactct 960
ggggttgact atgccttcgg gcaaggctca ggagagcttg attcgctcga cctatgctct 1020
tgctggactg gacgtcaatc ggccagagga tcgaccgcaa tatttcgaag ctcatggcag 1080
taagtgcgag aagactggcc cgagaagatt atactgacgg atcatgtgac agcgggcacc 1140
caggccggcg attatcagga agcttcgggc atcttcaaca ccttcttctc gacaccatct 1200
cttgatgaca atgttctaca tgtcggatca atcaagacag tcctcggtcg tcagttgtcc 1260
ctttattgct ggcaatacat atctgataat ctgacacgcg aatctcagac agcgaaggat 1320
gcgctgggtt ggccggacta ctcaaggcat ccttgtgcat ccgaaatgga aaaatcccgc 1380
ctaatctgca ctttgagaaa ctgaatccca agctggagcc ctattccttg aagctcaagg 1440
tccccaccga gctgagggat tggccaacac tacctccggg tgttccgcgg cgcgtttccg 1500
tcaactcatt gtaaggagcc ctgctctgat agccccatag gaacaaccaa ataattgaca 1560
catttgcagt gggttcggag gtacaaactc tcatgcagtt cttgagagct acgaaccgca 1620
agcgaatggg ctcgtgaaac ccctcaacaa tagggttacc accgctggcc cagttacgat 1680
gccgtttgct ttttcggccg cttctgaaag aacacttggg gcagtactcg gaagctacga 1740
gcactacata acagagaatt ctcacatcga tccactagac ctatcttggt ccttaatgca 1800
gaagcgctca gcgctaaagt atcgcgtggc cttgtgcgca gccacagcag atgagctcaa 1860
gataaagatc aacgatgagc ttgcccttcg aaaagctaat tcctcttcaa cagtggtatc 1920
acgaagtgac tccgagaaaa aacttgttct gggtatattc accggccagg gtgcgcaatg 1980
gccagagatg ggtgttgatt tgatcaacac cttcccacaa gctcgagggt ggtttgagga 2040
aatgcaaaaa tcgctagatg agttgcctgg tggccaacgc gcagatttct ctcttcttga 2100
tgaattggct gctccaaagg cgtcctcaag gatccaggag gctgccgtcg ctcagcccct 2160
atgcacggct gtgcagattg tccttgtaaa tgtcctctat acattgggga tatcatttga 2220
cgcagtcatc ggtcattcct ccggagaggt tgctgccgcg tatgctgcgg gagttctcaa 2280
cgcacatgat gccattcgta ttgcatatct gcgcggaaag gtaggctcct ggttcctctg 2340
cgattcagaa ttaaaaagac taaccaacat acaggtggcc cgtatggctg cgggctcaaa 2400
tggcgaacgc ggagggatgt tagcagcagg cctgtctttt gacggcgcga tagccttttg 2460
cgagcaaccc cagtacctgg gtcgaatcag cgtagccgcg tgcaattctc catctagcgt 2520
gaccttgtct ggagatgcag atgcaatccg cgaagcagag caggatctga agggccagga 2580
caaatttgct cgcattgttc tcgtggacac cgcataccac tcccatcaca tggaaccttg 2640
tgcagagcca tatctccgtg ccatggaggg atgtaacatc caagttgggg agccgacttc 2700
cactcgatgg tactccaccg tttatggcgg gaaggaagtc aaccgctcac cctatgtagc 2760
aaaggatctg gtcagcgggt actggaagga caacatgcgc cagccagtcc tgtttcacca 2820
agcgctgatg gcagcagttg ggaatttcgc tcctggactg atcgttgaag tgggccccca 2880
tcccgcactg aaaggtcccg ttctccaggg catttctgaa ggattgaagg cggcatcatc 2940
tacagcgatt ccatatattg gcaccctgcg tcgtgggtta accgggactg tggcagttgc 3000
tgagacggcc ggctcgcttt ggacatatct cggctccgac gagattgaca tttcacgcta 3060
tatatcgctg agcggcgcac acaggaagct caaattcatc gaaaatttgc cacactatcc 3120
tttcgatcac agccagtcct actggactga aacacgacgc tcgaaagcct atctgcaccg 3180
gacaccgcgc aatgaactcc ttggggatct cagcgaggag aatgctgagg gcgagtggcg 3240
ctggaggaat ttcttaatcc caagcaatat ggagtatctc gaggggcatc aaatccaagc 3300
tcagacaatt tttcccgcga cgggatacat tgcgatggca ttcgaagctg ctgcaaagat 3360
tgcagatgga aaatccattc gcttcatgca ggtcaacgac ttaatcatta accaagctat 3420
cgccttctcg gaagatagca aaggcgttga gattctctat cgggtctatc aaataagtac 3480
cgaaggtggt tttactcgcg caacgttcag ctgccatgca gatattggag gcaatttgaa 3540
gtcctgcgcg tcgggtcact tgctcatgag ctggggtgag atggaagcac atattttgcc 3600
ccccaagcct gcccctatgt ctggcatgtc aagcgtggac gttgatgagt tctattcctc 3660
tctgggcaag ttaggatacg gctatagcgg ccttttccgg ggtattacct ctctgaagcg 3720
caagctcaac atgtccacgg gccacctcaa taatgtgaag gatgtgtctg ttcttcttca 3780
cccttctacc atggattgcg gattgcagtg cttacttggt gcggttgcgg cgccggggga 3840
tggggagtta tcacgcctcc agatcccgac acgaattcga acggccacta tgaatccaag 3900
attctgcgaa aatatcggag gttgtttccc aggagagtcg ctcacctttg acgccaccgt 3960
cacgagtgca aaccctgatg gtgtgtcggg agacatcagc ttgttcactc aacaggggca 4020
gggggtcatt cagttcgaag gcgtccaggt atcaccgctc atgaagccca gcgccaaaga 4080
agatagaccg atgttctcag aaattgcctg gggaaatctt gagcctgatg cggagccccg 4140
ggagactctg cctgtgaagt tctggcctgg caaactagat gacccacaac atatatgctt 4200
cctcattatg aaggacatcc ttccccagct gactacagag gatagagaaa gacttgaggg 4260
gcatcgaaag gacattgtgg cttggtttga ccatgttgta ggtcttactt cctcggggca 4320
gtatgacctc tgcaagaaag agtgggatgc tgaggatcca catgaggtgc tgcccattct 4380
gttgaaggat gcacagccta tcattgtgga gatgacagcg aatataagga gatactttgc 4440
ttcgttcctg cgaaatgaga tctctatgat tgagatttat cgggagaaca acctgctgac 4500
ccggttctac aaagtcgagc aagagctcga gtatatgagc ctccgagtgg gcgacatggc 4560
tgaacagctc gcattcagat acccacgcat gaagattctc gagataggag ctggaactgc 4620
ctcagccact cgtgctgttc ttgaccgtat tgggcaatac ttccattctt atacgtttac 4680
cgatatttcc gccggctttt tcgaagacgc ggaagcgaca tttgccgccg aacacggtga 4740
caaaatggta tataaagtcc ttgatatcga gcaggacccc atagagcaag ggtttgaggg 4800
cggcgcgtat gacctagtca tagctgcaaa tgtccttcat gcaacgaaac atctcgagat 4860
aacaatgacc aatgtacgcc gtctcctcaa accaggcggc catctcatcg ccctcgaaat 4920
cacaaacgaa cagatcctac aagatgccct gctattctgc gcgtttgaag gctggtggct 4980
aggccgtcat catgataaca gaccttgggg tccgaagatc tccgtggcca agtgggagca 5040
tctcctcagg aacacgggct tcagcggcgt cgataccata ctccctcaac cggggaaacc 5100
gcagtacctg ttctggggat actcaacttt cgttactcag gcggttgacg accggatcgc 5160
acgaattcgg cagccgttga ttccttccaa ttcttctact gcaagtgcaa ggcgcgacaa 5220
tcttgtgatc attggtggcg caacagagac aacatcacgg ctcatgcctg ctctgcagaa 5280
gctgctcgcc ccattcttcc acaccgtcat tgaggaatct agtttcgact cgctgtttga 5340
ccatgctgag gagtcctcgg ctgcaacagt ggttctatgt ctcgcagaca tggataaccc 5400
atgtttccaa gacctcaatg aagttaaaat gcgtgctctg aaacggatac ttgaaagtgc 5460
tcgacgactc ttatgggtta caactgggtc cgagtccgaa aatctgtact tgagtatgag 5520
caaagggttc ctcagctgca tcgggtacga gtatagagac tcgctgcacc agtacctcaa 5580
catcctggag ccaaaggacg tcaacgcgga gatcatatcc acgaccctga tgcgtatggt 5640
tcattccgag tgcaccaacg actacagcca cccgaatact gtggacagcg tcgagctgga 5700
actgcgcttt gcgaacaaca aaatgcagat tccgcgcatc atggaggcca cgtctctgaa 5760
ccgacgttat gcagccacga ggcgggccgt gcataagctg gtcggcctgt cgcattcttc 5820
cgtcaatcta tgttgggctg gagatcatgc agagctggta cagcagagtg ataagagtgc 5880
cgatggtggt ttttacggca agggctcgac tagcggcggt ggaaacttca atgccaaaaa 5940
gcagcaacag cagcagcata accctatgac tcaaatccga gttcagtaca gcacctcgac 6000
ggctttgaaa attggcaggg ccgggttttt ggctctcgtg ctgggtacgg atgaggcgac 6060
gagaacacga gtcgtagcct ttgcggatac gatcgcgtcg agggtgtcag tgccctcggc 6120
ctggtgctcg gagctcccgc gcggaatcac aacagaaatg gaagaggcgg aattcctgcg 6180
agccctggca tgtgcactac tggcgaagag ccttgttcag caggcaaccc caaatacagg 6240
tctgttggta cacggtgcca gcgagcctct caaacattct atctggtgcc aggctgttgc 6300
caaaggtaag aaggaaagaa gaaaaagaga tacggctttt cccttcatcc cagatttctt 6360
ctcaaataac acactgtcct gcccctgtgc aaatcgcgcg ttactaacag cttgcaggcg 6420
tccaaccgca ttttagcaca agcgatgtct caaaaacaag tcaagattcg agctcgcaaa 6480
tcctgcacct gcaagagaga agcacagcac aagcgcttgc tcgagttcta cctagcaatc 6540
tctccgttgt ggcagacttg agcagtacaa gcatgcacaa cggcatccta acccagctca 6600
gcaatgtcat ccaccccagc gttgcgcgag aaaccaaaga aacgctctcg gcacgttcgc 6660
cgttcctcgt ggcgggattc cgccacgaca ccgtgacgaa ggtgttcaat gcagcgtgcg 6720
tggttgcggc acaggtcatg ctgttactac aaagtgcgca ctgcgtgggc ccaacgagct 6780
tcaaagccgc cccctcgcca ggcgtaagcg tgacgaaaat taacgaggtt gtcgggcgca 6840
gtgatggtga tgtgaatgtt acgggaggac cggatggcac tgagatcctg gactgggcgc 6900
aggccgccgc actgcccgtt cgtgtttcgc ctgcgagttc gcaagttacg cttcgtgggg 6960
accggactta tctccttgtt ggtatggcgg gggatctggg ccagtctgtc tgtcactgga 7020
tgattacgcg aggagctcgc aatatcgtgc tggctagtag gacgccgaac gttgatgaga 7080
agtggattga tgagatggtt gcattgcggg ctcgggtgag ggtcgagtct atgtaagttt 7140
ttgtccctat caaggaagca cgactacgta aatacgcatg cagtacgtga ctgacaaaac 7200
aaaccccgga caacagggac gtcacaaacc gagaatcaat cctccgtgtc gacagtgcta 7260
ttcggcaatg tctacccccc gtcggaggcg tcgtaaacgg ggcaatggtt ctccgcgacc 7320
aaatgttcag cgaaatttcg ctagaaaaca tcctcgatac ctacaagccc aaggtcgacg 7380
gcagccggct cctcgaagag atctacgggc agggagcaga ccaagacctc gacttcttca 7440
tctgctttgg ctccgcaacg gccatcctcg gcaacgttgg gcagtcctcc tacggggccg 7500
caacaaacta catgcgcagc ctgatccagc gccgccgcga gcgcggcctc gtcggcagca 7560
tcatccaccc cgccgaggtg cgcggcgtcg ggtacatctc gcggaggggc agcgaactca 7620
tgcgccgtat ggccgatctc gtaggcacgc acattgtctc ggagaaagac ctgcacgaga 7680
ccttcgcgga ggccatccta gccgggaacc cggcgctggg tcgcaacccc gaagtcatca 7740
gcgggctcac gcagcatgac ccagagtccg agcccgacat catctggtac accaaccccc 7800
aaacctggcc gctagccaac tatcacttgc gctcgcactc gacgcacggg ggcgaggccg 7860
gtggcgcgca agtgcccgtt cagaaacagc tagaggccac cgagaacatg gacgaagccg 7920
ccgaaatcgt gctatccgcg tttatcgcga agatcattca gaagctgcat ctctcggaga 7980
acgtctctgt tactgcggac agtcggttga cggagctggg cgcggatagt ctggtggctg 8040
ttgacttgcg gacttggttc cttaaggagc tgcaggttga gattccggtg ctgcagatcc 8100
agagcggtgc gtctattggg gagttggcag gtagtgtcac tgcaaagtta ccagggagct 8160
tgattccgaa tgtcaaggct tcatctgatt ga 8192
<210> 2
<211> 707
<212> DNA
<213> 异冠裸胞壳(Emericella variecolor )
<400> 2
atgaccatcg aatccagaga caactactac aaccccctcg tgctatgggt ctacgatttc 60
ttcgtgcaag tcctcaccaa cacattctgg tggcgctgct ccacaagatc tatcctagtc 120
cccttcttca tagccaacac cagcgcacga cacctagaag tcggcgccgg aaccggctac 180
ttcctgcgcc agaaagtcga cgacgaacag cgccgcacct catctaccgt atccacaacc 240
aactggcctg aaaagcttac cctcgtcgac ttccacagcc agtgcatgcg caaggccgcc 300
agccggatcg cacacgatat ctccagggag cccgagtgcg tggttgctaa tatcctggag 360
ccgctgccgc tgaagcgcgg tgagcaattt gatagcattg cgctcatgta tgtcttgcat 420
tgtatccctg tgccgccggc tgtgaagggg cgtgttgttg agaatttaaa ggagtttctg 480
gcggatgatg gggtgttctt tgggtcgacg gttttgggga agggggtgcg gcataatttg 540
attgggatgt ttcttatgtg gctatataat tatattggga tgtttgggaa ttgggatgat 600
gggcgggagg agattctgaa gcccattagg gagaactttg aggaggtcga gagcgaggtt 660
gttgggacgg tgttgatgtg gagggcgatg aagccgagac ggtggta 707
<210> 3
<211> 1574
<212> DNA
<213> Emericella variecolor
<400> 3
atgtcagacc acacacaggc agcgcagaga gccccattca gggttatcat tgtcggaggc 60
tctataacgg gcctgactct agcacactgc cttgaacgag cgggaatcga ctatgttgtg 120
cttgaaaaac acgtcgatat atttgccgaa ccggggattt ctatcggcct tatgcccaac 180
ggttctcgaa tcctagaaca attggggatc tatgccgatg tggacaagct ctatcaaccg 240
atcaagaaga tctatcagtg cttcccagat ggattctgcc ttgaaactga cagccctgtc 300
aacatagtca aacggtgagc cctcaagcct tgatcgcact ataggttggc tcaagctgat 360
ggtctgatcc atgtagattc ggcttgccgt tctgtgtcat cgagcgacag ctatttctcg 420
atgtcctctt tgctaagttc aaggacaagt cgcgcattca cctgggcatt aaagtcacgg 480
aagtctgcca taccaactca ggagtctctg tccagactgg gaacggcacc acctacaccg 540
gtgatcttgt cgtcggtgca gacggtgtcc acagcatcgt gcgctcagag atgtggcgga 600
ttgggaacgc cgagcaacgt ggattcatca gcgacaagga caagtccgga atggcagcag 660
agtttgggtg cgtgtttggg gtttgcaagg ctcccccagg gcaagatcgc tgggagcaca 720
tcgtccgata caatcacgat ttctgcttcc tgttcttccc tgccacgggg acggacatct 780
tctttaacgt gatctatcga ttgaagcgaa agtcgcacta ccccaatatt ccgcgcttta 840
gtgaggcaga ggcaatccag gtatgcgagt ctgttgctga tttccctgtt tggaacgggg 900
tcaaatttgg ggatctttgg gcgcagcgaa cgagggttgt cctggtccct ttggaggagt 960
atatgtataa gaattggcat tatcgccgca tagtctgtat tggggacagc gttagcaagg 1020
tgggaatccc catactcaca tttggagaaa caagaagctc actccatctc acagatgacc 1080
cccaatctcg gacaaggcgc taatacagca atggaatgcg cagcggtgct cacaaaccgt 1140
cttcgatcat tgctggctga tacttatccc gacaagccgc ttgaacaaag tctgaacggc 1200
atgttggaag acttcaacca gaagcagttc aagcgccttc ttagtgtcca cggcgacgct 1260
caattcgtca ccaggctaga ggcactggac gggtggtcac tgcgagtctt tgcacgccat 1320
gtcatgaaat atctcgggga tttgctcgtt ggaaatctct cccggattgt ggcggcgggc 1380
tctgtgttgg atttcgtgcc cttgactgtt cgttccggta aggactggcc gccttgtccc 1440
ttgcagaatt cctggggaat tgcagaatcg cttaacttct tttgggcttg tacctgggct 1500
ttgtgttttg ctttggttgt cgtttatgtg aggcagaagg aaggaggact agggctagga 1560
acttctttct ggta 1574
<210> 4
<211> 1266
<212> DNA
<213> Emericella variecolor
<400> 4
atgcctcaat ccgcaaaata cgtcctaccg gcatacggcg cacttgcact ctacgcgctg 60
gtcgatttct cttacagaaa cggatatatc cctatggtca tggagaaagg ccagatatgg 120
ctgaatcaac cctcgagcga tccgaggaaa cgggcccaga caacgggaat cgcatcgatt 180
gacgagacgc tggctaccat gtttacgttt tactggcctg tgcttgatgg aacgtttcct 240
gctctcagtg tgatgttcac gaactatttt gggactatta ctctgtctct ggttctgtat 300
tcgcttgagt ccttgaggaa gggcaatagg acgtcttctg cgtgagtttc tacatgcagg 360
atcccaaagt tgatccttct ctctctttct ctctctgaca ctgtccatag ctctttcttt 420
tacagtccaa cactctgggg agtgattggg gtcatggtga cacttgctgt ttccatgccc 480
tggtatctca cggcccatct gttcatctct gctacagcaa ccaagcctac agcagagaat 540
ctctctatcc caatccacca gctcaaggcg cttatcgtca atgtcgtctt cgggctcatg 600
atgccttgtc ttctagtggc gttaccggag agaattacct ggtcactatt cacaagacag 660
tccgcaatcg cagtatggca gctctggcca ttctggagca cggccgtaca ttaccttgcc 720
aacctgttca tcccagctga acgcagcaga atcagcatta aaggccgaga acaaggccaa 780
caaccaggcg caacgcttac aaattggcag cgcacgcgat ccgccttccg agccgtctac 840
ggcctgacat ttgcagtcgc cgccataacg cacattgcga catggacgat ctcgctgacg 900
gttgcatcgg gtctgcgcga cgctctgaat cctcagactg ctgctgctct acatccgtct 960
atcatatttc taaacacagc gccctggtcg tctgtgcaga ctgactctgt cggcgagggg 1020
acattgtggt taatccagtg ggatcaggcg attgcggcgg gggcgatgtg gttgtggagt 1080
ttgcaactgt atcggactgc gcatgtaacg catggaaggg caattgacct gatgcatttc 1140
gtgctcaaaa cagttgcttt ttgcatggtt gctgggttca cgggcgctgc ggtggagttg 1200
ctctgggaaa gggaggaaat ggtcctcgag gcagcgttgg agaaggagaa ggaaagacag 1260
ctctag 1266
<210> 5
<211> 398
<212> PRT
<213> Emericella variecolor
<400> 5
Met Pro Gln Ser Ala Lys Tyr Val Leu Pro Ala Tyr Gly Ala Leu Ala
1 5 10 15
Leu Tyr Ala Leu Val Asp Phe Ser Tyr Arg Asn Gly Tyr Ile Pro Met
20 25 30
Val Met Glu Lys Gly Gln Ile Trp Leu Asn Gln Pro Ser Ser Asp Pro
35 40 45
Arg Lys Arg Ala Gln Thr Thr Gly Ile Ala Ser Ile Asp Glu Thr Leu
50 55 60
Ala Thr Met Phe Thr Phe Tyr Trp Pro Val Leu Asp Gly Thr Phe Pro
65 70 75 80
Ala Leu Ser Val Met Phe Thr Asn Tyr Phe Gly Thr Ile Thr Leu Ser
85 90 95
Leu Val Leu Tyr Ser Leu Glu Ser Leu Arg Lys Gly Asn Arg Thr Ser
100 105 110
Ser Ala Ser Phe Phe Tyr Ser Pro Thr Leu Trp Gly Val Ile Gly Val
115 120 125
Met Val Thr Leu Ala Val Ser Met Pro Trp Tyr Leu Thr Ala His Leu
130 135 140
Phe Ile Ser Ala Thr Ala Thr Lys Pro Thr Ala Glu Asn Leu Ser Ile
145 150 155 160
Pro Ile His Gln Leu Lys Ala Leu Ile Val Asn Val Val Phe Gly Leu
165 170 175
Met Met Pro Cys Leu Leu Val Ala Leu Pro Glu Arg Ile Thr Trp Ser
180 185 190
Leu Phe Thr Arg Gln Ser Ala Ile Ala Val Trp Gln Leu Trp Pro Phe
195 200 205
Trp Ser Thr Ala Val His Tyr Leu Ala Asn Leu Phe Ile Pro Ala Glu
210 215 220
Arg Ser Arg Ile Ser Ile Lys Gly Arg Glu Gln Gly Gln Gln Pro Gly
225 230 235 240
Ala Thr Leu Thr Asn Trp Gln Arg Thr Arg Ser Ala Phe Arg Ala Val
245 250 255
Tyr Gly Leu Thr Phe Ala Val Ala Ala Ile Thr His Ile Ala Thr Trp
260 265 270
Thr Ile Ser Leu Thr Val Ala Ser Gly Leu Arg Asp Ala Leu Asn Pro
275 280 285
Gln Thr Ala Ala Ala Leu His Pro Ser Ile Ile Phe Leu Asn Thr Ala
290 295 300
Pro Trp Ser Ser Val Gln Thr Asp Ser Val Gly Glu Gly Thr Leu Trp
305 310 315 320
Leu Ile Gln Trp Asp Gln Ala Ile Ala Ala Gly Ala Met Trp Leu Trp
325 330 335
Ser Leu Gln Leu Tyr Arg Thr Ala His Val Thr His Gly Arg Ala Ile
340 345 350
Asp Leu Met His Phe Val Leu Lys Thr Val Ala Phe Cys Met Val Ala
355 360 365
Gly Phe Thr Gly Ala Ala Val Glu Leu Leu Trp Glu Arg Glu Glu Met
370 375 380
Val Leu Glu Ala Ala Leu Glu Lys Glu Lys Glu Arg Gln Leu
385 390 395
<210> 6
<211> 1155
<212> DNA
<213> Metarhizium robertsii
<400> 6
atggctcaat taacaaaata catcttgcca atctatggcc ttttagggct ttacagcctt 60
gggtacttct cttaccgcaa tggctacatc gacatggtct tggaaaagcg cgaagcgtgg 120
ctttcctcgc cctcgacgga cccgaggaaa cggccacaca cgactggcat tcaatcgctc 180
gacgaaactt tggccaccat gtttgtcttt tactggcccg tccttgacgg cagcttcccc 240
gggctaagcc tcatgttttc caattatttc ggaaccatca cattgtccct tgtcttggtt 300
tctttggagt ccctaaggaa aggcaacagg acgtcgtcat cttcaacctt cttcagtccc 360
acactatggg gaatgattgg catcatggtg acactggcga tttcgatacc gtggtacctc 420
acagtacacc tcctcatttc cagcaccgcg tctcacccta ccattgaaaa cttgtccatc 480
ccgatacatg aattgcgagc tctgattttc aatgttgtgt ttggactcgt tttgccttgt 540
ctggcagtgg ctctgccaga gagcataacc gggctgctct ttaccagaca gtcagcaatc 600
gcgctgtggc agttgtggcc gttttggagc actgcagtgc attttattgc aaagcagttt 660
atccccgtcg cagagcgtga cgcggatccg agggcccaat gggaaaagac taggacagca 720
tttcgtcttg tctacgggtt aacatttgcg gtagcagcca tcacacatgt ttcaacgtgg 780
gccatctcac tgactgctgc gtatgctctg ccaaacctct tgaccgtcga gacagtctct 840
gctctccacc cgaccaacgt cttcttgaac acttggccgt ggttgcccat caagacggac 900
tcgattggtc aagggacgct ctggctaata caatgggatc aggtttttgc ggcaggcgcc 960
atgtactggt ggagtcttga tctatatcga gctgcacacg caacccaggg caagaagatg 1020
gattggggtt gcttggcact caaatcgatg gcattttgtg ttgtatccgg ctttacgggt 1080
gctgcggtgg agttgctttg ggaaagagag gagatggtta tggaggctgg gcgacctaaa 1140
caaaagacaa agtag 1155
<210> 7
<211> 384
<212> PRT
<213> Metarhizium robertsii
<400> 7
Met Ala Gln Leu Thr Lys Tyr Ile Leu Pro Ile Tyr Gly Leu Leu Gly
1 5 10 15
Leu Tyr Ser Leu Gly Tyr Phe Ser Tyr Arg Asn Gly Tyr Ile Asp Met
20 25 30
Val Leu Glu Lys Arg Glu Ala Trp Leu Ser Ser Pro Ser Thr Asp Pro
35 40 45
Arg Lys Arg Pro His Thr Thr Gly Ile Gln Ser Leu Asp Glu Thr Leu
50 55 60
Ala Thr Met Phe Val Phe Tyr Trp Pro Val Leu Asp Gly Ser Phe Pro
65 70 75 80
Gly Leu Ser Leu Met Phe Ser Asn Tyr Phe Gly Thr Ile Thr Leu Ser
85 90 95
Leu Val Leu Val Ser Leu Glu Ser Leu Arg Lys Gly Asn Arg Thr Ser
100 105 110
Ser Ser Ser Thr Phe Phe Ser Pro Thr Leu Trp Gly Met Ile Gly Ile
115 120 125
Met Val Thr Leu Ala Ile Ser Ile Pro Trp Tyr Leu Thr Val His Leu
130 135 140
Leu Ile Ser Ser Thr Ala Ser His Pro Thr Ile Glu Asn Leu Ser Ile
145 150 155 160
Pro Ile His Glu Leu Arg Ala Leu Ile Phe Asn Val Val Phe Gly Leu
165 170 175
Val Leu Pro Cys Leu Ala Val Ala Leu Pro Glu Ser Ile Thr Gly Leu
180 185 190
Leu Phe Thr Arg Gln Ser Ala Ile Ala Leu Trp Gln Leu Trp Pro Phe
195 200 205
Trp Ser Thr Ala Val His Phe Ile Ala Lys Gln Phe Ile Pro Val Ala
210 215 220
Glu Arg Asp Ala Asp Pro Arg Ala Gln Trp Glu Lys Thr Arg Thr Ala
225 230 235 240
Phe Arg Leu Val Tyr Gly Leu Thr Phe Ala Val Ala Ala Ile Thr His
245 250 255
Val Ser Thr Trp Ala Ile Ser Leu Thr Ala Ala Tyr Ala Leu Pro Asn
260 265 270
Leu Leu Thr Val Glu Thr Val Ser Ala Leu His Pro Thr Asn Val Phe
275 280 285
Leu Asn Thr Trp Pro Trp Leu Pro Ile Lys Thr Asp Ser Ile Gly Gln
290 295 300
Gly Thr Leu Trp Leu Ile Gln Trp Asp Gln Val Phe Ala Ala Gly Ala
305 310 315 320
Met Tyr Trp Trp Ser Leu Asp Leu Tyr Arg Ala Ala His Ala Thr Gln
325 330 335
Gly Lys Lys Met Asp Trp Gly Cys Leu Ala Leu Lys Ser Met Ala Phe
340 345 350
Cys Val Val Ser Gly Phe Thr Gly Ala Ala Val Glu Leu Leu Trp Glu
355 360 365
Arg Glu Glu Met Val Met Glu Ala Gly Arg Pro Lys Gln Lys Thr Lys
370 375 380
<210> 8
<211> 1152
<212> DNA
<213> Calcarisporium arbuscula
<400> 8
atgcctcaat ccacgaaata catcttgcca gtctatggca ttttggcgct ctatagcctt 60
gggtattttt cctatcgaaa cggctatgtc aacatcgtct tggaagagcg ccaggcatgg 120
ctcgacatgc ccccgggaga tccaacaaag gttgcgcaac cgactggtat tgcatctctc 180
gacgaaacct tggctgccat gttcgtcttt tattggccag tcctcgacgg gagctttccc 240
ggcttgagcc tcatgttttg caattatctc ggagcgttgc ccttgtgctt ggtgttgatg 300
accttggagt ccctaaggaa gggaaacaga agttcatttt cattctttta cagcccaacg 360
ttttggggaa tgattgcagt catgatgaca ttggccgttt cgataccctg gtacctcacc 420
atacatctgt tgatttctac caccgcgtct caccctacca ttgagaacat gtcgattcca 480
ctggccgaat tgaaagctct gattgtcaat atcgtcgttg gactcgtatt gcctagtcta 540
ttagtggccc tgccagagac aataactcag acgctgttca cgagacaaac agcgattacg 600
ctgtggcagc tgtggccatt ctggagcact gcagtgcatt ttattgcaag gaagtttata 660
tcggctactg agcgcggtgc cgactcaaga gctcaatgga caagggtcag gagtgcattc 720
cgttccgtct atggtctgac atttgcagct gcagccatcg cacacattgc aacatggtca 780
atctccctaa ccgccgccta tgctctgccg gacgctatga gtgccgaaac cgtctcttca 840
ctccatccgc aaaccgtctt tgtcaatact tggccctggc tgcctgtcac gactgactct 900
gtgggtgaag ggactctctg gttgctacaa tgggataagt ttgttggggt tggtgccatt 960
tactggtgga gcctcgatct atacagagcc gcacatacgg ctcaacgcaa gaaaatcaac 1020
tggtattatt ttgcgctcaa aacagtggcg ttttgcttag tatctgggtt caccggtgct 1080
acgatagagt tgctttggga gagggaagaa atgattatgg aggccgggcg tgctaaagaa 1140
aagacaaaat ga 1152
<210> 9
<211> 36
<212> DNA
<213> Artificial Sequence
<400> 9
cgaccaccta acaacatgcc tcaatccgca aaatac 36
<210> 10
<211> 34
<212> DNA
<213> Artificial Sequence
<400> 10
gtcatccttg taatcgagct gtctttcctt ctcc 34
<210> 11
<211> 36
<212> DNA
<213> Artificial Sequence
<400> 11
cgaccaccta acaacatggc tcaattaaca aaatac 36
<210> 12
<211> 37
<212> DNA
<213> Artificial Sequence
<400> 12
gtcatccttg taatcctttg tcttttgttt aggtcgc 37
<210> 13
<211> 36
<212> DNA
<213> Artificial Sequence
<400> 13
cgaccaccta acaacatgcc tcaatccacg aaatac 36
<210> 14
<211> 36
<212> DNA
<213> Artificial Sequence
<400> 14
gtcatccttg taatcttttg tcttttcttt agcacg 36
Claims (9)
1.一种环氧水解酶AstD,其氨基酸序列如SEQ ID NO. 5所示。
2.一种编码如权利要求1所述的环氧水解酶的基因astD,其核苷酸序列如SEQ ID NO.4所示。
3.一种含有如权利要求2所述的环氧水解酶基因astD的重组表达载体。
4.一种含有高表达如权利要求1所述的环氧水解酶AstD的基因工程菌。
5.如权利要求4所述的基因工程菌,其特征在于,其为齿梗孢霉AstD高表达菌株,保藏编号为CGMCC NO.20273。
6.如权利要求1所述的环氧水解酶AstD的应用,其特征在于,包括:如权利要求1所述的环氧水解酶AstD在催化半频哪醇重排反应中的应用,或者,如权利要求1所述的环氧水解酶AstD在asteltoxin的生物合成中的应用。
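7. The use of claim 6, characterized in that the epoxide hydrolase AstD catalyzes, inside the cell, hydrolysis of an epoxide intermediate together with a concomitant semipinacol rearrangement reaction, giving the hemiacetal product asteltoxin T1, as shown in the following formula: [structural formula not reproduced].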
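8. Use of the genetically engineered strain of claim 4 or 5, characterized in that it comprises: use of the genetically engineered strain of claim 4 or 5 in catalyzing a semipinacol rearrangement reaction, or use of the genetically engineered strain of claim 4 or 5 in the biosynthesis of asteltoxin.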
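9. The use of claim 8, characterized in that the epoxide hydrolase AstD expressed at a high level in the genetically engineered strain catalyzes, inside the cell, hydrolysis of an epoxide intermediate together with a concomitant semipinacol rearrangement reaction, giving the hemiacetal product asteltoxin T1, as shown in the following formula: [structural formula not reproduced].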
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011172801.8A CN114410604B (zh) | 2020-10-28 | 2020-10-28 | Epoxide hydrolase, its encoding gene, and application thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011172801.8A CN114410604B (zh) | 2020-10-28 | 2020-10-28 | Epoxide hydrolase, its encoding gene, and application thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114410604A (zh) | 2022-04-29 |
CN114410604B (zh) | 2023-11-14 |
Family
ID=81260193
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011172801.8A Active CN114410604B (zh) | Epoxide hydrolase, its encoding gene, and application thereof | 2020-10-28 | 2020-10-28 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114410604B (zh) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
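CN104884629A (zh) * | 2012-10-15 | 2015-09-02 | Genomatica, Inc. | Microorganisms and methods for production of specific-length fatty alcohols and related compounds |
CN108265074A (zh) * | 2018-02-08 | 2018-07-10 | Zhejiang University | Method for constructing a highly efficient genetic system for an endophytic fungus and application thereof |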
Non-Patent Citations (1)
Title |
---|
Efficient Biosynthesis of Fungal Polyketides Containing the Dioxabicyclo-octane Ring System; Xu-Ming Mao; J Am Chem Soc; Vol. 137, No. 37; pp. 11904-11907 * |
Also Published As
Publication number | Publication date |
---|---|
CN114410604A (zh) | 2022-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
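CN115197172B (zh) | Sesterterpene compounds, their biosynthetic gene cluster, and synthesis method | |
CN112143764B (zh) | Method for preparing a brivaracetam intermediate compound by enzymatic catalysis | |
CN113308443B (zh) | Monascus monooxygenase mutant and application thereof | |
WO2008046328A1 (fr) | Strain producing a levorotatory lactonohydrolase and its use for the production of chiral oxyacid | |
CN108929884B (zh) | Method for heterologous biosynthesis of ganoderic acid by synthetic biology | |
CN114591938B (zh) | Carboxylase mutant, preparation method and application thereof | |
CN114410604B (zh) | Epoxide hydrolase, its encoding gene, and application thereof | |
CN114381441B (zh) | Enzyme-catalyzed synthesis of chiral amino alcohol compounds | |
CN108374017B (zh) | Novel styrene epoxidase and its function | |
CN115433721B (zh) | Carbonyl reductase mutant and application thereof | |
CN114891707B (zh) | Recombinant strain and method for producing bilirubin by whole-cell catalysis using the same | |
CN112795494B (zh) | Genetically engineered strain, construction method and use thereof | |
CN114854714A (zh) | Kidney bean-derived epoxide hydrolase mutant, gene, vector, engineered strain, preparation method and application | |
CN116716270A (zh) | Glycosyltransferase and its application in the biocatalytic synthesis of salidroside | |
CN113444737A (zh) | Cytochrome P450 enzyme and its application in the synthesis of Ganoderma triterpenoids | |
CN118063531B (zh) | Preparation and application of the macrolide compounds PA-46101s C-E | |
CN116144637B (zh) | Chaetomium olivaceum histone deacetylase, its encoding gene, and application thereof | |
CN116064495B (zh) | ThDP-dependent decarboxylase and application thereof | |
CN107723308A (zh) | Biosynthesis method and gene cluster for the compound balanol | |
CN118272331B (zh) | Ene-reductase mutant and its application in the synthesis of (R)-citronellal | |
CN115851684B (zh) | Nitrilase and its application in methionine synthesis | |
CN111378675A (zh) | Biosynthetic genes for eremophilane sesquiterpenes in Catharanthus roseus and their application | |
CN113025546B (zh) | Method for producing tyrosol from L-tyrosine by multi-enzyme cascade conversion | |
CN102174531A (zh) | Biosynthetic gene cluster of yatakemycin | |
CN109486789B (zh) | Kidney bean epoxide hydrolase mutant with improved stereoselectivity |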
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |