WO2020081958A2 - Compositions and methods for identifying mutations of genes of multi-gene systems having improved function - Google Patents
Compositions and methods for identifying mutations of genes of multi-gene systems having improved function Download PDFInfo
- Publication number
- WO2020081958A2 WO2020081958A2 PCT/US2019/056977 US2019056977W WO2020081958A2 WO 2020081958 A2 WO2020081958 A2 WO 2020081958A2 US 2019056977 W US2019056977 W US 2019056977W WO 2020081958 A2 WO2020081958 A2 WO 2020081958A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- coli
- lysine
- engineered
- genes
- mutations
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/08—Lysine; Diaminopimelic acid; Threonine; Valine
Definitions
- Embodiments of the present disclosure relate to engineering microorganisms for increasing production of and/or increasing tolerance to a target molecule having an assayable endpoint.
- compositions and methods disclosed herein concern genetically modifying microorganisms through manipulating pathway flux of an amino acid to increase amino acid production and/or tolerance compared to microorganisms not genetically modified.
- genetic modifications of these microorganisms can be engineered through methods of deep scanning mutagenesis strategies applied to one or more pathways related to molecular flux of a target molecule.
- Some embodiments concern genetically modifying a microorganism such as bacteria or yeast.
- modified bacteria can be of the Enterohacteriaceae family.
- compositions and methods concern modifying Escherichia coli (“E. coli”).
- E. coli are genetically modified to positively modify amino acid flux relative to wild type.
- Yet other embodiments disclosed herein relate to use of engineered E. coli for increased production and/or tolerance of an amino acid (e.g. lysine, arginine, leucine etc.).
- Evolution has selected for efficient and robust metabolic and regulatory networks that prevent unnecessary metabolite biosynthesis and optimally distribute resources to maximize overall cellular fitness. Complexity of these networks in combination with limited approaches to understand their structure and function has limited the ability to re-program cellular networks in effort to modify these systems for a range of applications. Current approaches to re-program cellular networks are directed to modifying single genes of complex pathways but as a consequence of modifying the single genes unwanted modifications to the genes or other genes can be created, limiting the ability to identify changes necessary to achieve a particular endpoint.
- Amino acids have many useful applications. Amino acid metabolism is fundamental to all domains of life that includes highly involved pathways with extensive kinetic and regulatory features. Amino acid metabolism is an ideal model for assessing modifications to pathways affecting amino acid flux by having a measureable endpoint, increased amino acid production and/or tolerance. Some uses for amino acids for example, the amino acid lysine, is useful for supplementing animal feedstock as a nutritional supplement, used in pharmaceuticals, and cosmetics, among others. Lysine can be industrially produced by microbial fermentation, but, there are limits to its efficiency, scalability, tolerance and production.
- Microbial overproducers of lysine have traditionally been identified via“adaptive evolution”, namely, adaptation of the microbes in the presence of antimetabolites (such as the analog S-(2-aminoethyl)-L-cysteine (AEC)) but the underlying genetic basis for the overproduction phenotype is relatively unknown.
- AEC analog S-(2-aminoethyl)-L-cysteine
- Embodiments of the present disclosure relate to applying, for example, deep scanning technologies in order to introduce and assay for mutations directed to altering one or more pathways related to molecular flux of a target molecule in an organism instead of targeting or selecting for single gene changes.
- microorganisms can be engineered using these deep scanning technologies for increasing production of and/or increasing tolerance to a target molecule having a measurable endpoint such as an amino acid.
- methods disclosed herein can be used to screen tens of thousands of mutations introduced to one or more genes affecting one or more biosynthetic pathways of a target molecule to exploit mechanism(s) responsible for producing the target molecule.
- compositions and methods disclosed herein concern genetically modifying microorganisms to increase amino acid production and/or tolerance compared to microorganisms that are not genetically modified.
- genetic modifications to a microorganism are engineered through applications of deep scanning mutagenesis strategies applied to one or more pathways related to molecular flux of a target amino acid.
- Some embodiments concern genetically modifying bacteria of the
- compositions and methods concern modi tying Escherichia coli. (“E. coli”).
- E. coli are genetically modified to positively affect amino acid flux relative to wild type (e.g . lysine) to increase tolerance and/or increase production of the amino acid by the genetically modified E. coli.
- wild type e.g . lysine
- Yet other embodiments disclosed herein relate to use of engineered E. coli for production of lysine.
- compositions and methods disclosed herein concern genetically modifying bacteria to increase amino acid production and/or tolerance compared to bacteria that are not genetically modified. Some embodiments concern genetically modifying bacteria of the Enterobacteriaceae family. In yet other embodiments, compositions and methods concern modifying Escherichia coli. (“E. coli”). In certain embodiments, E. coli are genetically modified to increase lysine production, increase lysine tolerance, and/or modify lysine homeostasis relative to their wild type. Yet other embodiments relate to use of these engineered organisms for over production or increased tolerance to produced lysine.
- E. coli Escherichia coli
- Certain embodiments relate to introducing genetic mutations in genes of pathways related to amino acid production, amino acid tolerance, amino acid metabolism, and/or amino acid homeostasis in E. coli.
- one or more genes of these pathways are modified to increase tolerance of the engineered E. coli to lysine and/or to induce over-production of lysine by the engineered E. coli.
- one or more genes of the engineered E. coli are modified in order to enhance lysine homeostasis.
- one or more genes of the engineered E. coli are modified in order to enhance amino acid metabolism (e.g. lysine).
- genetic modifications to certain genes can lead to modifications of genes contributing to all around amino acid metabolism and tolerance.
- production and tolerance of the amino acid lysine can be altered in a microorganism.
- lysine production, lysine tolerance, lysine metabolism, and/or lysine homeostasis in for example, during 1) lysine biosynthesis, 2) lysine degradation, 3) lysine regulation, and/or 4) lysine transport can be altered in an engineered microorganism contemplated herein.
- coli can be effected through deletions or insertions into the E. coli genes.
- these modifications can include genes that encode particular proteins affecting pathways related to lysine production, lysine tolerance, lysine metabolism, and/or lysine homeostasis, for example proteins involved in lysine biosynthesis, lysine degradation, lysine regulation, and lysine transport or export.
- genetic modifications in the engineered E. coli can be mutations to a binding site of one or more polypeptides involved in lysine biosynthesis and/or tolerance.
- binding sites can include a substrate binding site, a co-factor binding site, a DNA binding site, an allosteric factor binding site.
- the one or more genetic and/or pathway modifications to the engineered E. coli lead to an assayable trait.
- an assayable trait can be with respect to an engineered microorganism having altered lysine metabolism, a decrease in uptake of S-(2-aminoethyl)-L-cysteine (AEC) by the engineered microorganism (e.g. E. coli ) demonstrating effective lysine flux manipulation for selection purposes.
- AEC S-(2-aminoethyl)-L-cysteine
- production, metabolism, and/or homeostasis in E. coli can be enhanced by introducing mutations such as site-directed mutations or targeted mutations that affect the binding region of targeted genes.
- some mutations can include introducing a single mutation or multiple mutations up to mutating all regions of a gene to alter a binding region for example, introducing a single nucleotide polymorphism (SNP) into the gene in one to all sites or nucleotides that affect binding affinity of the gene for a particular molecule.
- SNP single nucleotide polymorphism
- mutations can be introduced or selected for; for example, selecting for a SNP in one or more of genes that encode proteins affecting lysine production, metabolism, and/or homeostasis, including lysine biosynthesis, lysine degradation, lysine regulation, and lysine transport or efflux.
- genetic modifications for creating an engineered E. coli for modulating lysine metabolism can include, but are not limited to, mutating one or more dapF , lysP, lysR, lysC , serC, dapD, cadA, argT, dapE, dapA, lysA, lysS, argP, argD, asd, lysU, cadB, dapB, and IdcC.
- genetic modifications for creating an engineered E can include, but are not limited to, mutating one or more dapF , lysP, lysR, lysC , serC, dapD, cadA, argT, dapE, dapA, lysA, lysS, argP, argD, asd, lysU, cadB, dapB, and IdcC.
- coli for modulating lysine metabolism can include, but are not limited to, mutating dapF , lysP, lysR , lysC, and lysS, or combinations thereof.
- the engineered E. coli has increased tolerance and/or production of lysine compared to a wild type.
- targeted genes for modification can include, but are not limited to, one or more of dapF , lysP, and lysR , genes can be modified.
- introducing one or more SNP(s) introduced to a targeted gene of a microorganism can include, but are not limited to; one or more of dapF G210D, dap I ⁇ ' M260Y, lysP T33F, lysP Q219I, and lysR S36R in order to modulate lysine biosynthesis and/or tolerance in the engineered microorganism; for example, bacteria ( e.g . E. coli).
- promoters are targeted to increase expression of one or more genes in E. coli in order to affect lysine production, tolerance, metabolism, and/or homeostasis.
- vectors can be designed for transfection of E. coli in order to increase lysine production, tolerance, metabolism, and/or homeostasis.
- a vector can include at least a regulated promoter, an editing cassette having a selectable marker, and an associated spacer.
- the selectable marker can include tracking a marker that indicates one or more modifications to one or more of dapF , lysP, lysR, lysC , serC, dapD, cadA, argT, dapE, dapA, lysA, lysS, argP, argD, asd, lysU, cadB, dapB, and IdcC in order to allow selection of these modified genes.
- constructs of use for enhanced lysine production, metabolism, and/or homeostasis in E. coli can include swapping promoter regions in order to upregulate or down regulate targeted genes of a bacteria to modify lysine biosynthesis and tolerance in the bacteria.
- methods for targeting bacterial (e.g. E. coli ) pathways associated with one or more amino acid (e.g. lysine) production and/or tolerance using genetic manipulation in order to obtain engineered bacteria can include, but are not limited to, using a Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-based approach. This type of approach provides for reprogramming of gene transcription translation and other effects to elicit particular targeted cellular phenotypes in the bacteria.
- these methods can include subsequently producing an engineered bacteria (e.g. E. coli ) by introducing into the bacteria (e.g.
- E. coli a vector that encodes one or more mutated genes identified by deep scanning mutagenesis of dapF , lysP, lysR , lysC, serC , dapD, cadA, argT, dapE, dapA, lysA, lysS, argP, argD, asd, lysU, cadB, dapB, and IdcC ; producing bacteria expressing the vector.
- the bacteria can be an engineered E. coli having increased lysine tolerance.
- the bacteria can be engineered A. coli having increased lysine production.
- the bacteria can be an engineered E. coli having both increased lysine production and increased lysine tolerance.
- methods of making engineered bacteria or other organisms can concern manipulation of genes involved in the aspartate pathway in a microorganism to make one or more amino acid as a product from the oxaloacetate/aspartate family.
- amino acids contemplated in this family can include, but are not limited to, lysine, asparagine, methionine, threonine, and/or isoleucine. It is understood by those of skill in the art that aspartate can be converted into lysine, asparagine, methionine and threonine. Threonine can be converted to isoleucine.
- Aspartate pathway uses L-aspartic acid as the precursor for the biosynthesis of one fourth of the building block amino acids.
- engineered microorganisms contemplated herein concern microorganisms capable of having increased production and or tolerance to one or more of lysine, arginine, proline, glutamic acid, leucine, methionine, phenylalanine, tryptophan, tyrosine, valine, isoleucine, and/or histidine.
- the following agents can be used for selection and/or detection of a corresponding amino acid contemplated herein S-(2-Aminoethyl)-L-cysteine, canavaninin, Azetidine-2-carboxylic acid, Beta-N-Methylaminoalanine (BMAA), 5 -hydroxyl eucine, ethionine, selenomethionine,, o- tyrosine, 7-azatryptophan, 3,4-Dihydroxyphenylalanine (DOPA), 4-hydroxyvaline, O- methylthreonine and/or 2-thiazolealanine or other chemical of use to assay for the production of one or more amino acids contemplated herein.
- S-(2-Aminoethyl)-L-cysteine canavaninin
- Azetidine-2-carboxylic acid Beta-N-Methylaminoalanine (BMAA)
- BMAA Beta-N-Methylamino
- methods for making engineered bacteria can include introducing into the bacteria (e.g. E. coli ) a first vector having a polynucleotide encoding a nuclease-deactivated CRISPR-associated (Cas) protein; and a second vector of one of at least one short guide RNA (sgRNA) molecule of a CRISPR-associated (Cas) protein binding site and further including a targeting RNA sequence directed to a target polynucleotide.
- sgRNA short guide RNA
- the targeting RNA sequence is directed to a target polynucleotide including, but not limited to, one or more of dapF , lysP, lysR, lysC , serC, dapD, cadA, argT, dapE, dap A, lysA, lysS, argP, argD, asd, lysU, cadB, dapB, and IdcC or other gene related to an amino acid synthesis pathway.
- methods provide for engineered bacteria expressing the second vector, having an increased lysine tolerance and/or increased lysine production.
- amino acids e.g. lysine
- methods disclosed herein include recovering the amino acid from media of engineered bacteria such as an E. coli.
- methods can include harvesting the engineered bacteria such as an E. coli and recovering intracellularly produced amino acid.
- engineered bacteria (e.g . E. coli ) disclosed herein can be used for technological or commercial applications.
- engineered bacteria (e.g. E. coli ) disclosed herein can be used for increasing production of and tolerance for an amino acid (e.g. lysine) by the engineered bacteria compared to a wild-type bacteria (e.g. E. coli).
- a 5%, or a 10%, or a 20% or, a 30%, or a 40%, or a 50%, or a 60%, or a 70 %, or an 80% or a 90% or more increase in production and/or tolerance of the amino acid (e.g. lysine) can be produced in the engineered bacteria.
- FIG. 1 illustrates an overview of lysine metabolism in exemplary bacteria (e.g. E. coli ) of some embodiments disclosed herein.
- FIG. 2A illustrates library coverage assessed through exemplary deep sequencing before exposure to Cas9 of some embodiments disclosed herein.
- FIG. 2B illustrates library coverage assessed through exemplary deep sequencing, after exposure to Cas9 of some embodiments disclosed herein.
- FIG. 3A illustrates an exemplary enrichment map of variants across E. coli targeted genes related to lysine production, metabolism, and/or homeostasis, including 1) lysine biosynthesis, 2) lysine degradation, 3) lysine regulation, and 4) lysine transport of some embodiments disclosed herein.
- FIG. 3B illustrates an exemplary map of the number of enriched mutations in genes classified in each of the four exemplary categories mentioned above with increasing concentration of a selection agent of some embodiments disclosed herein.
- FIG. 3C illustrates exemplary enrichment scores for each gene represented in FIG. 4B of some embodiments disclosed herein.
- FIG. 4 illustrates the fraction of engineered E. coli lysP mutants across increasing selective pressures compared to other mutants of some embodiments disclosed herein.
- FIG. 5A illustrates growth of an exemplary engineered E. coli lysP T33F mutant compared to wild-type E. coli transformed with a non-target gRNA of some embodiments disclosed herein.
- FIG. 5B illustrates growth of an exemplary engineered E. coli lysP Q219I mutant compared to wild-type E. coli cells transformed with a non-target gRNA of some embodiments disclosed herein.
- FIG. 6 illustrates enrichment of exemplary synonymous mutations observed for LysP, LysR and DapF in engineered E. coli of some embodiments disclosed herein.
- FIG. 7 illustrates an exemplary illustration of mutations conferring selection tolerance in engineered E. coli of some embodiments disclosed herein.
- FIG. 8A illustrates an exemplary quantification of intracellular lysine levels in wild type . coli and an engineered . coli lysR S36R mutant of some embodiments disclosed herein.
- FIG. 8B illustrates differential gene expression for the lysR and lysA genes in a wild type E. coli compared to an engineered E. coli having a genetic mutation of some embodiments disclosed herein.
- FIG. 9 illustrates growth of an exemplary engineered E. coli mutant compared to wild type E. coli cells transformed with a non-target gRNA of some embodiments disclosed herein.
- FIG. 10A illustrates growth of an exemplary engineered E. coli mutant compared to wild type E. coli transformed with a non-target gRNA of some embodiments disclosed herein.
- FIG. 10B illustrates quantification of intracellular lysine concentration in wild type E. coli cells and exemplary engineered E. coli mutant of some embodiments disclosed herein.
- FIG. 11 illustrates an exemplary gel demonstrating expression and purification of an engineered E. coli variant compared to a positive control microorganism of some embodiments disclosed herein.
- FIG. 12A illustrates a negative control for a DapF kinetic experiments, comparing before and after DapF exposure of some embodiments disclosed herein.
- FIG. 12B illustrates a positive control for a DapF kinetic experiment, illustrating comparing before and after wild-type DapF exposure of some embodiments disclosed herein.
- FIG. 13A illustrates a DapF assay of kinetics of wild type and engineered E. coli of some embodiments disclosed herein.
- FIG. 13B illustrates differential gene expression for target genes of an engineered E. coli compared to wild type if coli of some embodiments disclosed herein.
- FIG. 14 illustrates an exemplary vector used for a selected mutant using CREATE of some embodiments disclosed herein.
- FIG. 15 represents an exemplary table illustrating genes related to lysine synthesis in a bacteria and targeted sites in an exemplary library of some embodiments disclosed herein.
- FIG. 16 represents an exemplary table illustrating amino acids and an exemplary analog thereof for selecting or detecting the presence of its respective amino acid genes related to lysine synthesis in a bacteria and targeted sites in an exemplary library of some embodiments disclosed herein.
- FIG. 17 represents a schematic of a workflow strategy to map trajectories of agent resistance in a microorganism using CREATE of some embodiments disclosed herein.
- FIGS. 18A-18E represent 18 A) model structure of LysR, with the HTH DNA- binding domains; 18B) an enlargement of a mutation illustrating its proximity to the DNA phosphate backbone; 18C) a substitution mutation 18D) an example of absolute quantification of intracellular amino acid levels ( e.g . lysine) in wild-type and the reconstructed mutant; and 18E) differential gene expression quantified via QPCR for exemplary genes on wild-type and mutant backgrounds of some embodiments disclosed herein.
- 18B an enlargement of a mutation illustrating its proximity to the DNA phosphate backbone
- 18C a substitution mutation
- 18D an example of absolute quantification of intracellular amino acid levels (e.g . lysine) in wild-type and the reconstructed mutant
- 18E differential gene expression quantified via QPCR for exemplary genes on wild-type and mutant backgrounds of some embodiments disclosed herein.
- FIGS. 19A-19C is a table that represents an exemplary targeted system of amino acid synthesis of targeted sites in a library of various sizes of a target protein of some embodiments disclosed herein.
- FIGS. 20A-20E represents 5 tables in 20A-20E of a list of parameters of various mutants of some embodiments disclosed herein.
- FIGS. 21A-21D represent 21A) a comparison mapping technique of adaptive evolution to deep scanning mutagenesis; 21B) single nucleotide polymorphism (SNP) categories; 21C) a plot of mutants found after adaptation and 21D) mapping of enriched mutations using selective pressure of some embodiments disclosed herein.
- SNP single nucleotide polymorphism
- FIG. 22 represents exemplary growth curves of an amino acid library (e.g. lysine) (black) compared to two different negative controls under increasing selective pressures.
- DSB double-strand break
- n 3 for each curve. Positive results were observed related to the amino acid library under increasing selective pressures.
- “modulation” and“manipulation” of a gene can mean an increase, a decrease, upregulation, downregulation, an induction, a change in encoded activity, a change in binding, a change in stability or the like, of one or more of targeted genes or gene clusters.
- primers used for sequencing and sample preparation per conventional techniques can include sequencing primers and amplification primers.
- plasmids and oligomers used per conventional techniques can include synthesized oligomers, oligomer cassettes. Appendix A, Appendix B, Appendix C, Appendix D, Appendix E, Appendix F and Appendix G and the figures and sequence listing of the provisional application are incorporated herein in their entirety for all purposes.
- amino acid metabolism consists of highly evolved pathways with extensive kinetic and regulatory features. Evolution has selected for efficient and robust metabolic and regulatory networks that prevent unnecessary metabolite biosynthesis and optimally distribute resources to maximize overall cellular fitness.
- Network and pathway engineering strategies have relied primarily upon coarse approaches for modulating function (e.g. promoter swaps or complete gene knockouts) at a limited number of loci.
- ALE adaptive laboratory evolution
- ALE can lead to a larger number of unintended passenger mutations and limited mechanistic understanding of the improved phenotype.
- both strategies massively under sample the combinatorial space of interest.
- network and pathway engineering would benefit from improved approaches capable of generating a broad range of targeted mutations that can be mapped with high resolution to the pathway-network level function, mirroring deep scanning mutagenesis strategies that have revolutionized protein engineering.
- This capability would provide for entirely new paradigms to engineer complex multigenic phenotypes to optimize function through transcription, translation, stability, and kinetics among others that encompass the breadth of what is found in nature.
- sequence to function mapping at a pathway scale has been designed and used.
- compositions and methods disclosed herein relate to amino acid metabolism and manipulations thereof.
- Amino acids include large industrial product markets - lysine, for example, is used in the animal feedstock, pharmaceutical and cosmetics industries, having a multi-billion dollar market.
- Lysine overproducers were traditionally identified via adaptation in the presence of antimetabolites such as the analog S-(2- aminoethyl)-L-cysteine (AEC).
- AEC analog S-(2- aminoethyl)-L-cysteine
- Derepression of lysine biosynthesis has been previously implicated as a mechanism of resistance to AEC, however the complexity of this phenotype has also implicated other mechanisms such as improper discrimination by the lysl-tRNA synthetase machinery.
- recent systems-based approaches are being used to elucidate the biochemical and regulatory mechanisms of lysine overproduction, current strategies rely on individually constructing and testing single sequence-to-activity hypotheses, requiring substantial investment in time and resources.
- one tool as used in certain methods disclosed herein to overcome limited abilities to predict the phenotypic consequences of mutations in single proteins is to introduce every possible mutation and couple that to a genotype-phenotype assay platform; for example, deep scanning mutagenesis.
- a genotype-phenotype assay platform for example, deep scanning mutagenesis.
- tens of thousands of single and multiple mutations can be investigated in the coding sequence of a target protein to report a local fitness landscape for this protein, using for example, fluorescence as a proxy. Expanding this concept to a repertoire of proteins connected to one another through a phenotype of interest permits parallel investigation of pathways and networks on a system scale. This requires, however, the ability to individually measure genotype-phenotype relationships for each of the designed mutants across all targeted proteins.
- CRISPR EnAbled Trackable genome Engineering CREATE
- CREATE leverages array-based oligo technologies to synthesize and clone hundreds of thousands of cassettes containing a genome-targeting gRNA covalently linked to a dsDNA repair cassette encoding a designed mutation.
- these methods have not been applied to amino acid synthesis pathways to optimize production and tolerance to amino acids.
- CRISPR/Cas e.g.
- frequency of each designed mutant can be tracked by high-throughput sequencing using the CREATE plasmid as a barcode, uniquely combining these two technologies.
- proteins associated with a metabolic pathway can be interrogated in parallel at single nucleotide resolution, validating deep scanning mutagenesis at a pathway- focused scale.
- amino acid metabolism pathways were targeted in order to optimize production and tolerance of a target amino acid through analysis of its production pathway.
- the amino acid, lysine was analyzed through identification of critical modifications to lysine stasis in a microorganism.
- lysine metabolism as an amino acid example in bacteria was used as an exemplary example.
- the bacterium used for analysis and validation was Escherichia coli.
- a saturated mutagenesis library was constructed in binding pockets of key proteins involved in four main categories that affect lysine
- compositions and methods for assessing a pathway provides valuable framework for directed engineering of complex multigenic phenotypes for use in commercial purposes.
- directed engineering of targeted amino acids disclosed herein can be used to generate engineered bacteria for the production of target agents (e.g. amino acids).
- compositions and methods disclosed herein concern genetically modifying bacteria to increase amino acid (e.g. lysine) production and/or amino acid (e.g. lysine) tolerance compared to bacteria that are not genetically modified.
- Some embodiments concern genetically modifying bacteria of the Enierobacleriaceae family.
- compositions and methods concern modifying Escherichia coli . (“E. coli”).
- E. coli are genetically modified to increase lysine production, increase lysine tolerance, and/or modify lysine homeostasis relative to their wild type. Yet other embodiments, relate to use of these engineered organisms for over production or increased tolerance to produced lysine.
- Certain embodiments relate to introducing genetic mutations in one or more genes of pathways related to lysine production, lysine tolerance, lysine metabolism, and/or lysine homeostasis in E. coli.
- one or more genes of these pathways are modified to increase tolerance of the engineered E. coli to lysine and/or to increase production of lysine by the engineered E. coli.
- one or more genes of the engineered E. coli are modified in order to enhance lysine homeostasis.
- Microorganisms such as E. coli, produce lysine through highly evolved pathways with extensive kinetic and regulatory features.
- Certain pathways involved in lysine production include: 1) lysine biosynthesis, 2) lysine degradation, 3) lysine regulation, and 4) lysine transport. These pathways include multiple categories of genes and gene regions that can affect lysine production and tolerance in the E. coli. Selective mutations or manipulations to one or more genes within these pathways can modulate lysine production, metabolism, tolerance, and/or homeostasis in the bacteria.
- engineered E. coli contain one or more mutations in a single gene or one or more mutations in multiple genes, which form part of one or more pathways for lysine production, metabolism, tolerance, and/or homeostasis.
- These one or more genes targeted genes code for proteins with a variety of cellular functions including, but not limited to transcription, repression, and/or regulation of lysine biosynthesis.
- engineered E. coli disclosed herein can contain mutations to a single gene identified using deep scanning methodologies disclosed herein. In certain embodiment, these methodologies can identify a single mutation having significant effect on lysine biosynthesis beyond dominant selection winners as well as identifying mechanisms for altering lysine pathway flux that would have been difficult to predict by known methods given the number of mutations being evaluated.
- engineered E. coli can contain one or more mutations to multiple genes related to lysine biosynthesis and tolerance.
- one or more mutations can include genes unrelated to a lysine regulatory, transport, or biosynthesis pathway identified by deep scanning methodologies.
- binding sites can include a substrate binding site, a co- factor binding site, a DNA binding site, and/or an allosteric factor binding site.
- the one or more modifications to the engineered E. coli lead to a decrease in uptake of S-(2-aminoethyl)-L-cysteine (AEC) by the engineered E. coli.
- AEC S-(2-aminoethyl)-L-cysteine
- amino acid (e.g. lysine) production, metabolism, and/or homeostasis in E. coli can be enhanced by introducing mutations such as site-directed mutations or targeted mutations that affect the binding region of the genes identified by methods disclosed herein.
- some mutations can include introducing a single mutation or change in a gene to alter a binding region for example, introducing a single nucleotide polymorphism (SNP) into the gene that affects binding affinity of the gene for a particular molecule.
- SNP single nucleotide polymorphism
- mutations can be introduced or selected for; for example, selecting for a one or more SNP in one or more of genes that encode proteins affecting lysine production, metabolism, and/or homeostasis, including lysine biosynthesis, lysine degradation, lysine regulation, and lysine transport or efflux.
- targeted genes for lysine production or tolerance in a microorganism can include, but are not limited to, one or more mutations of dapF , lysP, lysR , lysC, serC, dapD, cadA, argT, dapE, dap A, lysA, lysS, argP, argD, asd, lysU, cadB, dapB, and IdcC genes or synonymous gene thereof.
- introducing one or more SNP(s) to a target gene can include introducing one or more SNPs of dapF G210D, dapF M260Y, lysP T33F, and lysP Q219I m order to modulate lysine biosynthesis and/or tolerance in the engineered E. coli.
- modifications to E. coli can be to the gene encoding the protein LysP.
- synonymous mutations to the gene encoding lysP can alter expression or stability of LysP. For example in E. coli, lysine uptake is mediated by three different transporter systems: ArgT, CadB, and LysP.
- ArgT codes for a periplasmic binding protein specific to lysine, arginine and ornithine, interacting with the ABC transporter coded by the hisJQMP operon.
- CadB is part of the Cad system, which plays a role in pH homeostasis under acidic conditions. This transporter imports lysine and excretes the decarboxylated product cadaverine in conditions of low external pH and exogenous lysine.
- LysP is a specific transporter for lysine, but also has a regulatory role in activating the Cad system through transmembrane interactions with CadC.
- modifications to E. coli can be performed by introducing mutations to the gene encoding dapF.
- the DapF gene encodes an epimerase catalyzing the penultimate step in the lysine biosynthetic pathway, a conversion of LL-diaminopimelate (LL-DAP) to ffieso-diaminopimelate (meso- DAP).
- modifications to E. coli can be to one or more genes encoding lysR, lysC , serC and dapD.
- modifications to E. coli can be to the gene encoding lysR .
- a mutation in lysR can include modifying amino acid position 36 or synonymous position in lysR to be an Arginine instead of a Serine (e.g. fysR_S36K) (See for example, SEQ ID NO: 1).
- a mutation to the gene encoding lysP can include modifying amino acid position 33 in lysP to be a Threonine instead of a Phenylalanine (e.g. lysP _ T33F) (See for example, SEQ ID NO: 2).
- a mutation to the gene encoding lysP can include modifying amino acid at position 33 in lysP to be an Isoleucine instead of a Glutamine (e.g. lysP Q 2191) (See for example, SEQ ID NO: 3).
- a mutation to the gene encoding dapF can include modifying amino acid at position 210 in dapF to an Aspartic Acid instead of a Glycine (e.g. dapF GHQ ) (See for example, SEQ ID NO: 4).
- a mutation to the gene encoding dapF can include modifying amino acid at position 260 in dapF is a Methionine instead of a Tyrosine (e.g. dapF A260Y) (See for example, SEQ ID NO: 5).
- the engineered A. coli disclosed herein can have a decrease in the uptake of the lysine analog S-(2-aminoethyl)-L-cysteine (“AEC”) when AEC is added to culture media demonstrating efficacy of selective mutations in the E. coli.
- AEC is an antimetabolite, an analog to lysine that competes with canonical lysine for binding to the lysil- tRNA synthetase (LysRS), leading to protein misfolding and reduced cell growth. Additionally, AEC blocks lysine biosynthesis by interacting with riboswitches, inhibiting bacterial growth in the absence of an external lysine source.
- the engineered E. coli disclosed herein can have increased tolerance to AEC as compared to control E. coli when AEC is added to the bacterial culture media.
- AEC can be used to select bacterial mutants of use for increased production or tolerance to lysine for commercial purposes.
- mutations in regulatory genes of E. coli for example, in the lysine-regulated riboswitch controlling expression of the aspartokinase lysC affect uptake of AEC and confirm modification of lysine flux.
- diaminopimelic acid is bound to the regulator and the regulator lysR activates the last enzymatic step in lysine biosynthesis ( lysA ).
- the LysR family of transcription regulators is ubiquitous in bacteria and includes a conserved N-terminal helix-turn-helix (HTH) DNA-binding domain and a less conserved C- terminal co-inducer binding domain.
- a lysR mutant (7ys7?_S36R) substitution lies on the DNA- binding (HTH) domain.
- HTH DNA binding domain can be manipulated to increase lysine production using compositions and methods disclosed herein.
- a lysR mutant can lead to an increase in intracellular lysine production by the engineered A. coli.
- modifications to genes or gene regions, as disclosed herein can lead to increased tolerance of the engineered A. coli to the presence of higher concentrations of intracellular lysine.
- modifications to genes or gene pathways, as disclosed herein can lead to increased lysine production of the engineered E. coli.
- modifications to genes or gene regions, as disclosed herein can lead to increased lysine homeostasis of the engineered E. coli.
- modifications to genes or gene regions, as disclosed herein can lead to increased lysine metabolism of the engineered E. coli.
- modulation of genes or genetic regions contemplated herein in E. coli can allow the engineered E.
- engineered A. coli having a modified tolerance to the presence of high intracellular lysine levels can be used to produce larger quantities of lysine for example, for industrial applicability, reducing production costs and scaling up production, for example.
- modifications to increase lysine production, tolerance, metabolism, and/or homeostasis in bacteria can be the result of increased copy number of one or more gene or genetic regions.
- modifications to increase lysine production and/or tolerance can be the result of upregulation or down regulation; of expression of one or more genes in bacteria, or can result from a decreased copy number of one or more such gene
- modifications of the bacterial genes can include point mutations either selected for or introduced, resulting in one or more SNPs within a gene affecting the production of lysine and/or tolerance of lysine in the engineered bacteria.
- manipulated E. coli phenotypes contemplated herein can be used for technological or commercial applications.
- one or more genes can be manipulated in E. coli in order for an engineered E. coli to tolerate high lysine concentrations for example, lysine levels typically toxic to a wild-type E. coli.
- Certain embodiments disclosed herein concern introducing one or more mutations to one or more genes or gene regions disclosed herein in E. coli in order to modulate lysine production, metabolism, and/or homeostasis in the E. coli.
- engineered bacteria e.g. E. coli
- a 5%, or a 10%, or a 20%, or a 30%, or a 40%, or a 50%, or a 60%, or a 70 %, or an 80% or a 90% or more increase in production and/or tolerance of a target amino acid such as lysine, arginine, methionine or other target amino acid can be obtained.
- a commensurate level of improved amino acid homeostasis is observed in the engineered E. coli compared to unmodified control E. coli.
- Adaptive mutations can be a central driver of evolution; however, the abundance and relative contribution of these mutations to resulting cellular phenotypes are poorly understood, even in well-studied organisms.
- Network and pathway engineering strategies have relied primarily upon coarse approaches for modulating function (e.g. promoter swaps or complete gene knockouts) at a limited number of loci.
- adaptive laboratory evolution or “directed evolution” approaches have been employed, which while producing more refined adjustments for manipulating pathway flux can lead to a larger number of unintended passenger mutations and limited mechanistic understanding of the improved phenotype.
- a fundamental limitation to directed evolution or targeted selection of a particular phenotype is the inability to effectively manipulate complex phenotypes in a laboratory setting, where the relevant combinatorial mutational space is often much larger than can be searched on laboratory time scales or budgets. Further, off-target mutations can decrease overall fitness of an organism and lead to“dead-end” phenotypes, preventing further improvement of an evolved strain.
- methods disclosed herein are able to overcome limited abilities and errors of directed evolution in predicting the phenotypic consequences of mutations in single proteins.
- methods include introduction of every possible mutation to a target gene or genetic pathway and combining these mutations to a genotype-phenotype assay platform. In certain embodiments, this permits expansion of deep scanning concepts to a repertoire of proteins connected to one another through a phenotype of interest, allowing parallel investigation of pathways and/or networks on a system scale when partnered with individual measurements of genotype-phenotype relationships for each mutant across all targeted proteins.
- probable protein targets with no known functional sites can be scanned and mutations of these regions can divulge roles previously unidentified due to lack of ability to pinpoint activity/function relationships.
- One aspect of the present disclosure provides methods for mapping multiple loss-of- function or gain-of-function mutations in one or more transporter protein(s) related to lysine biosynthesis, identifying these one or more transporter protein(s) as an important resistance route to modifying amino acid flux.
- exemplary lysine transporter proteins of the one or more transporters include but are not limited to lysP, argT, and cadB.
- certain synonymous mutations in the one or more transporter protein(s) can disrupt transporter function, for example by, impacting transcription or translation rate or affecting proper folding or function of the one or more transporter(s) in the membrane.
- synonymous mutations in the one or more transporter protein(s) can enhance transporter function, such that the transporter protein(s) functions are enhanced.
- Another aspect of the present disclosure provides methods for mapping multiple loss-of-function or gain-of-function mutations in one or more regulatory protein(s), identifying these one or more regulatory proteins as contributory molecules to lysine regulation and/or tolerance.
- exemplary regulatory proteins can include, but are not limited to, argP and lysR.
- one or more mutations in the one or more regulatory protein(s) can disrupt regulatory function.
- one or more mutations in one or more regulatory protein(s) can enhance regulatory function, for example, when regulatory protein(s) improve efficiency in lysine production, tolerance metabolism, and/or homeostasis.
- proteins contributing to lysine degradation can include, but are not limited to, IdcC, cadA, cadB, cadC and combinations thereof.
- proteins contributing to lysine degradation can include, but are not limited to, IdcC, cadA, cadB, cadC and combinations thereof.
- several synonymous mutations in one or more degradation-related protein(s) can also disrupt amino acid degradation function.
- several synonymous mutations in one or more degradation-related protein(s) can enhance transport function.
- methods are provided for mapping multiple loss-of-function and/or gain-of-function mutations in one or more genes encoding amino acid (e.g. lysine) biosynthesis protein(s), identifying these genes for encoding these proteins as important target for this biosynthesis.
- amino acid e.g. lysine
- exemplary proteins contributing to the amino acid lysine biosynthesis can include, but are not limited to, dapA, dapB, dapE, lysU, lysS, asd, dapF, argD, lysA, lysR , lysC, serC , and dapD or synonymous genes thereof
- mutations in one or more biosynthesis protein(s) can disrupt regulatory function, impacting transcription or translation rate or binding of related signaling proteins responsible for regulation.
- methods using mapping techniques can permit assess to identifying phenotypic consequences of one or more mutations under conditions of stringent selective pressures.
- a library redesign including, for example different sets of mutations targeting each site, can allow high resolution mapping of effects of silent substitutions on wild-type protein function.
- One aspect of the present disclosure provides for methods for deeply mapping, through the use of barcodes, to quantify beyond a typical selection winner.
- this mapping can be used to identify mutations of one or more genes for directing pathway optimization.
- parallel integration of genes can be leveraged to uncover regulatory interactions on a systems scale.
- mutations identified by these deep scanning methods resulting in amino acid (e.g. lysine) overproduction in a microorganism can be used to create industrial strains having these traits in order to increase production to manufacturing scale.
- deep scanning mutagenesis strategies can be used to profile genetic mutations in a microorganism from a single gene to entire metabolic pathway(s).
- deep scanning mutagenesis can be used to elucidate multiple routes having one or more mutated genes that affect amino acid (e.g. lysine) production, metabolism, and/or homeostasis leading to identification of contribution of individual genes within particular pathways to modify amino acid production. This process permits development of engineered microorganisms (e.g. E. coli) capable of increased amino acid (e.g. lysine) production and/or homeostasis.
- engineered microorganisms e.g. E. coli
- one approach for engineering complex phenotypes can be to use genome engineering tools.
- methods can include the use of trackable, precision genome editing referred to as Clustered regularly interspersed short palindromic repeats (CRISPR).
- CRISPR systems exist in many bacterial genomes and have been found to play an important role in adaptive bacteria immunity.
- Genome engineering as detailed herein can use CREATE, (CRISPR enabled trackable genome engineering), a CRISPR-based technology that involves synthesizing constructs which contain an editing cassette and CRISPR-RNA sequentially.
- CREATE methods achieve highly efficient editing/mutating using a single vector that encodes both an editing cassette and a guide RNA (gRNA).
- CREATE allows parallel mapping of mutations in a multiplex scale.
- CREATE leverages array-based oligo technologies to synthesize and clone hundreds of thousands of cassettes containing a genome-targeting gRNA covalently linked to a dsDNA repair cassette encoding a designed mutation.
- a CREATE editing cassette can introduce a silent protospacer adjacent motif (PAM).
- PAM mutation(s) can be an insertion, deletion or substitution of one or more nucleotides that mutates the sequence of the PAM such that the mutated PAM (PAM mutation) is not recognized by the CRISPR system.
- a cell including a PAM mutation can be said to be "immune" to CRISPR- mediated killing.
- methods for mapping genotype-phenotype relationships on a multiplex protein scale can include identifying a target multigenic system; applying full codon mutagenesis to genes of the target multigenic system to create a mutant library; transfecting the mutant library into a host; applying CrispR/Cas genomic engineering to the genes of the multigene system to create a mutant library; and using deep-scanning mutagenisis to analyze the mutant library.
- the target multigenic system comprises a pathway.
- pathway can include a synthesis pathway or a regulatory pathway or a transport pathway or a storage pathway or similar.
- a synthesis pathway can include an amino acid synthesis pathway.
- a pathway contemplated herein can be a pathway that produces mapable and/or assayable product.
- the pathway can produce an assayable end-product.
- combination methods disclosed herein can identify one or more mutant having improved characteristics compared to the wild-type gene(s). wherein the target multigenic system is optimized based on the identified one or more mutants.
- compositions contemplated herein include a host cell, a saturated mutant library of a targeted multi-gene system and CRISPR/Cas plasmid construct system.
- methods for demonstrating construction and mapping of libraries can include about tens of thousands of mutations in four E. coli regulatory pathways that increase lysine production, tolerance, metabolism, and/or homeostasis. These methods can enable identification of specific mutants conferring increased amino acid production, tolerance, metabolism, and/or homeostasis in microorganisms (e.g. E. coli).
- engineered E. coli can be created that produce amino acids (e.g. lysine) at concentrations of about 40%, to about 50% to about 60% or greater than wild-type microorganisms.
- engineered bacteria e.g. E. coli
- amino acid (e.g. lysine) flux as identified by the methods disclosed above can be used to create a mutation library exposed to an amino acid analog.
- a lysine analog S-(2-aminoethyl)-L-cysteine (“AEC”) can be used for analysis of lysine flux in bacteria.
- AEC S-(2-aminoethyl)-L-cysteine
- This antimetabolite is an analog to lysine and competes with canonical lysine for binding to the lysil-tRNA synthetase (LysRS), leading to protein misfolding and reduced cell growth.
- AEC blocks lysine biosynthesis by interacting with riboswitches, inhibiting bacterial growth in the absence of an external lysine source.
- whole genome sequencing can be performed under a selective AEC concentration to identify contribution of individual genes within particular pathways related to lysine flux. Combination methods disclosed herein allow for development of engineered microorganisms that produce target molecules by simple to complex pathways to manipulate production of the target molecules (e.g. amino acids).
- gene summaries can be then mapped to one or more categories that can affect molecular production; for example of amino acids or other bacterial molecules.
- gene summaries can be then mapped to one or more categories that can affect molecular production; for example, lysine production, metabolism, and/or homeostasis including lysine biosynthesis, lysine degradation, lysine regulation, or lysine transport.
- deep mapping analysis leads to generation of a comprehensive map of evolutionary trajectories resulting from selective (e.g. AEC) resistance that can identify combinations of one or more mutations that can affect amino acid (e.g. lysine) production, metabolism, and/or homeostasis.
- the use of trackable barcodes for each mutant can enable unique characterization of each mutant, beyond those having dominant characteristics, enabling the identification and contribution of all mutations, even minor contributory effects of silent mutations.
- methods disclosed herein enable creation of comprehensive map(s) which can highlight which pathway features can be optimized and which specific mutations can lead to phenotypic improvement in amino acid production, metabolism, and/or homeostasis (e.g. lysine, arginine, methionine etc.).
- engineered E. coli of certain embodiments disclosed herein can be used for technological applications.
- one or more genes can be mutated in E. coli in order for the engineered E. coli to tolerate increased lysine concentrations compared to a wild-type E. coli.
- engineered E. coli as disclosed herein can have increased lysine production compared to a control E. coli.
- a 10%, or a 20%, or a 30%, or a 40%, or a 50%, or a 60%, or a 70 %, or an 80% or a 90% or more increase in production of lysine can be engineered.
- a commensurate level of improved lysine tolerance or homeostasis can be observed in the modified E. coli compared to unmodified control E. coli. USE OF ENGINEERED E. COLI
- engineered microorganisms as described herein can be used in the industrial or commercial production of amino acids or similar target molecules.
- the production of intracellular lysine in the engineered E. coli can include production of about 800 mM and to about 1400 mM or more per engineered E. coli cell.
- Other embodiments disclosed herein concern increasing intracellular production of lysine to concentrations of about 800 pM, of about 1000 pM, of about 1200 pM, or of about 1400 pM or more.
- Other embodiments disclosed herein concern increasing lysine expression in engineered E. coli as calculated as expression fold change compared to wild type E. coli as between about 1.5-10 fold as compare to wild type E.
- engineered E. coli can increase export of lysine to surrounding media for harvesting and reuse of the engineered organisms.
- kits are contemplated of use to transport or house engineered microorganisms (e.g . E. coli ) having modified amino acid flux (e.g. lysine) regarding production, tolerance metabolism, and/or homeostasis.
- engineered microorganisms e.g . E. coli
- modified amino acid flux e.g. lysine
- kits can include components for culturing and growing engineered microorganisms to produce the amino acids or similar molecule (e.g. lysine).
- genes encoding target regulatory proteins were selected from four categories that affect lysine flux: lysine transport (3 genes), lysine regulation (2 genes), lysine biosynthesis (12 genes), and lysine degradation (2 decarboxylation genes) (FIG. 1). Based on the observations above, 16,300 mutations targeting four primary routes that affect lysine flux were designed: lysine biosynthesis (12 genes), lysine degradation (2 decarboxylation genes), lysine transport (3 genes), and regulation of genes in such pathways (2 genes) (FIG. 1).
- the exemplary library was exposed to the lysine analog S-(2-aminoethyl)-L-cysteine or AEC.
- This analog competes with canonical lysine for binding to the lysyl-tRNA synthetase (LysRS), leading to protein misfolding and reduced growth.
- LysRS lysyl-tRNA synthetase
- AEC blocks lysine biosynthesis by interacting with riboswitches, inhibiting bacterial growth in the absence of an external lysine source (FIG. 7).
- designer mutations were investigated that influence lysine regulation and overproduction allowing lysine to outcompete AEC and thereby restore cell growth.
- Sequencing of the plasmid cassettes (herein referenced as barcodes) before and after growth in the presence of AEC allows parallel tracking of each designed mutant in the library, permitting highly parallel mapping to be performed and to assess their contribution to tolerance and their inference to lysine flux (FIG. 17, workflow illustration).
- the lysine deep scanning mutagenesis library exhibited enhanced growth when compared to wild-type cells transformed with either a non-targeting gRNA or a gRNA targeting the unrelated loci galK (double-stranded break control or DSB) across a range of AEC concentrations (data not shown).
- a non-targeting gRNA or a gRNA targeting the unrelated loci galK double-stranded break control or DSB
- DSB double-stranded break control
- lysine uptake was analyzed. Lysine uptake is mediated by three different transporter systems in E. coli (FIG. 1). ArgT codes for a periplasmic binding protein specific to lysine, arginine and ornithine, interacting with the ABC transporter coded by the hisJQMP operon. CadB is part of the Cad system, which plays a role in pH homeostasis under acidic conditions. This transporter imports lysine and excretes the decarboxylated product cadaverine in conditions of low external pH and presence of exogenous lysine.
- LysP is a specific transporter for lysine, but also has a regulatory role in activating the Cad system through transmembrane interactions with CadC. Mutations in lysP were identified as the most highly enriched, including the dominant selection winner (data not shown). No enrichment for lysP mutations were observed when cells were grown in the absence of AEC (data not shown). This correlates with previous findings that identified lysP mutations in AEC resistant strains
- DapF mutations ranked as the most enriched non-lysP mutant under 100 mM AEC and the second most under 1,000 mM AEC, although no strong enrichment was observed under 10,000 pM AEC.
- both G210D and M260Y substitutions which lie close to the protein catalytic site (data not shown), suggesting an effect on catalytic activity.
- both mutants grew similarly to wild-type cells in the absence of AEC, but displayed distinct phenotypes when put under selective pressure.
- DapF G210D mutants had high growth rates up to 10,000 pM AEC (data not shown), confirming the barcode enrichment previously observed.
- DapF M260Y grew similarly to wild-type cells in the presence of AEC (data not shown). The DapF G210D mutant was regrown and independently tested and the same phenotype having superior growth in the presence of AEC was demonstrated.
- DapF mutant DapF variants were purified and their kinetics measured in vitro (data not shown). Surprisingly, both DapF mutants are kinetically impaired relative to the wild-type variant (data not shown).
- qPCR profiling of the entire biosynthetic pathway revealed one gene with statistically significant increase in gene expression, the diaminopimelate decarboxylase lysA (data not shown). LysA is responsible for the last enzymatic step in lysine biosynthesis, and it’s known to be repressed by lysine and induced by diaminopimelic acid through the regulator LysR.
- plasmid barcodes are used as a proxy for identifying genomic edits, lack of correlation introduces noise that can lead to false positives in the enrichment scores.
- plasmid-genome correlation should be strong for real hits with strong enrichment, and weaker for non-enriched variants.
- the regulator category was analyzed and investigated regarding a weakly enriched mutation in LysR, as well as a strongly enriched mutation in ArgP.
- Regulatory mutations are well known to confer AEC resistance, mainly in the lysine- regulated riboswitch controlling expression of the aspartokinase lysC (data not shown).
- the regulator LysR which upon binding to diaminopimelic acid activates the last enzymatic step in lysine biosynthesis (lysA, FIG. 1), exhibited few weakly enriched mutations in this exemplary library (data not shown).
- the LysR family of transcription regulators is ubiquitous in bacteria and comprises a conserved N-terminal helix-tum-helix (HTH) DNA-binding domain and a less conserved C- terminal co-inducer binding domain.
- the LysR S36R mutation lies on the DNA-binding (HTH) domain.
- HTH DNA-binding domain
- mutants do not display any alteration in intracellular lysine levels (FIG. 18B).
- strains harboring the S36R mutation grew slower than wild-type cells transformed with a non-targeting gRNA (data not shown).
- the ArgP regulator displayed much stronger enrichment scores for a E246Q substitution (FIG. 18C), with a p-value of 1.6 x 10-6 at 100 mM AEC, 8.1 x 10-8 at 1,000 pM AEC, and 1.59 x 10-5 at 10,000 pM AEC.
- ArgP which also belongs to the LysR family of transcriptional regulators, can bind to lysine in order to inhibit transcription of several genes in the biosynthetic lysine pathway (FIG. 1), acting as one of the main negative feedback mechanisms.
- the E246Q substitution lies on the C-terminal co-inducer binding domain, although the apparent role for this residue is unclear.
- Exemplary methods herein demonstrated expansion of deep scanning mutagenesis strategies from a single gene to an entire metabolic pathway.
- multiple routes of AEC resistance were identified, encompassing mutations in transporters, regulators and biosynthetic genes.
- This technology should accelerate the ability to investigate, understand and control, complex multigenic phenotypes, providing knowledge that will contribute to the forward engineering of these traits.
- Genome editing and individual mutant validation was performed in a wild-type Escherichia coli str. K-12 substr. MG1655 strain.
- a custom pSIM5-Cas9 dual-vector was built by cloning the araC-pBAD-Cas9 fragment from pX2-Cas9 vector (Addgene #85811) into the temperature sensitive pSIM5 plasmid containing the lambda red genes.
- This pSIM5-Cas9 dual vector was transformed into E. coli MG1655 prior to the library introduction.
- the editing cassettes containing the homology arm and genome-targeting gRNA were cloned in the same backbone previously used for CREATE .
- the cassette design included the following features: a library-specific 18 nt priming site for subpooling, a 12 nt variant-specific priming site (not used in this study), a 118 nt homology arm encoding the specific genomic edit and a synonymous PAM mutation in close proximity, the constitutive promoter J23119 (35 nt), a 3 bp spacing sequence (ATC), the 20 nt spacer region required for Cas9 targeting, followed by 24 nt of the 5’ end of the canonical S. pyogenes gRNA.
- the full list of cassette sequences can be provided but is not shown.
- the designed library was synthesized as 230-mers by Agilent Technologies in a custom array and delivered pooled as lyophilized single-stranded DNA.
- the oligo pool was subjected to an Alexa Fluor 488-label ed strand extension reaction and purified in a 6% SDS-PAGE gel to remove indels introduced in the synthesis process.
- the lysine library was amplified as a single subpool using predefined library-specific priming sites included in the cassette design. The amplification was optimized to minimize overamplification in an effort to reduce product crossover.
- the PCR reaction was performed using Phusion High-Fidelity PCR Master Mix (New England BioLabs) and the following reaction conditions: 98°C for 60 seconds, followed by 8 cycles of 98 o C30s/68 ° C30s/72 ° C90s, followed by 10 cycles of 98 ° C30s/72 ° C90s and then a final extension at 72°C for 3 minutes.
- the library product was purified from 1% agarose gels using the QIAquick Gel Extraction Kit (QIAGEN).
- the amplified library was cloned using Gibson Assembly Hi-Fi l-Step Kit (SGI- DNA), with 300 ng of the linearized backbone and 30 ng of the library insert.
- the cloning reaction was dialyzed and then transformed via electroporation into E. cloni 10GF' ELITE Electrocompetent Cells (Lucigen), in a single electroporation using a 0.2 cm gap cuvette (GenePulser, Bio Rad). Cloning efficiency was estimated by counting colonies in LB agar plates. Overall, >60X coverage (total CFETs/number of library variants) were achieved at the cloning stage.
- the library was grown in LB media to saturation and plasmid was extracted using the QIAprep Spin Miniprep Kit (QIAGEN).
- a non-targeting control containing a plasmid with a gRNA that does not target the E. coli genome
- a double-stranded break control containing a plasmid with a CREATE cassette designed to introduce a stop codon at the unrelated gene galK.
- custom Illumina compatible primers were used to barcode each selection using Phusion High-Fidelity PCR Master Mix (New England BioLabs), 300 ng of the plasmid prep, 3% DMSO, and the following cycling conditions: 98°C for 30 seconds, 20 cycles of 98°Cios/68 Ci5s/72°C20s, followed by a final extension of 72°C for 5 minutes.
- PCR products were purified from 1% agarose gels using the QIAquick Gel Extraction Kit (QIAGEN), pooled together in equimolar amounts, and sequenced using an Illumina MiSeq 2x150 paired end reads run.
- the average of enrichment scores for all synonymous mutations included in the library was calculated (average m of wild-type enrichment).
- Bootstrap analysis (resampled with replacement 20,000 times) was performed to obtain a 95% confidence interval for the wild-type enrichment average m.
- Variants were considered as significantly enriched if their weighted enrichment scores were at least m ⁇ 2*s (i.e. p-value ⁇ 0.05 assuming a normal distribution of synonymous mutations enrichment scores), with s being the standard deviation.
- the p-value of their respective enrichment scores was calculated using the probability density function of all mutants under the specific selective pressure.
- Selected genomic pockets were PCR amplified with primers that included the Nextera adapter sequences as overhangs (Forward primer: SEQ ID NO: 20, 5’ - T C GT C GGC AGC GT C AG AT GT GT AT A AG AG AC AG - [locus- specific sequence] - 3’; Reverse primer: SEQ ID NO: 21, 5’ -
- the frozen cell pellets were extracted in ice cold lysis buffer, a 5:3 :2 ratio of MeOH:ACN:H20, containing amino acid standard mix at a final concentration of 1 mM (MSK-A2-1.2 standard amino acid mix, purchased from Cambridge Isotope Laboratories, Inc. - Tewksbury, MA). Samples were vortexed for 30 minutes at 4°C with lmm glass beads. Insoluble proteins and lipids were pelleted by centrifugation at 4°C for 10 minutes at l2,000g. Supernatants were collected and analyzed using a Thermo Vanquish UHPLC coupled online to a Thermo Q Exactive mass spectrometer. UHPLC-MS methods and data analysis approaches were performed as described previously. The intracellular concentration of wild-type control samples was normalized to 1, and the experimental samples are reported as fold-change relative to these wild-type levels.
- the dapF variants were PCR amplified from boiled cells that contained the desired mutation (wild type E. coli MG1655 for the wild type dapF sequence; reconstructed dapF mutants for the G210D and M260Y variants).
- the PCR products were then cloned and sequence verified into a custom made pET-3 backbone, containing the histidine tag (6x) on either the 5’ or 3’ end of the genes to test for optimal expression.
- Corynebacterium glutamicum DAP Dehydrogenase was synthesized from Eurofms Genomics and also cloned in the pET- based vector. Expression was done in a E.
- Proteins were purified using the Ni-NTA Spin Kit (QIAGEN), following the protocol for purification of tagged proteins under native conditions. Purified samples were run on a denaturing PAGE gel (Mini-PROTEAN TGX Stain-Free Precast Gels, Bio-Rad) to confirm purity and quantified using the Thermo Fisher Scientific Pierce 660 nm Protein Assay Reagent. Purified proteins were used fresh for the kinetic assay (never frozen).
- Enzymatic activity of the DapF variants was determined in vitro using a modified DAP epimerase-DAP dehydrogenase coupled spectrophotometric assay (Cox et al , 2002). Briefly, 100 mM Tris (pH 7.8), 0.1 mM diaminopimelic acid (racemic mixture), 0.44 mM NADP+ and 1 mM DTT was added to a cuvette and incubated at 37°C for 10 minutes to equilibrate the temperature. Then, 1.8 mM DAP Dehydrogenase was added and the absorbance was recorded at 340 nm until it reached a plateau (i.e.
- Wild type E. coli MG1655 and the analyzed reconstructed mutants were grown under the same conditions as described for absolute intracellular lysine quantification.
- OD 6 oo 0.5
- 1 mL of the culture was treated with RNAprotect Bacteria Reagent (QIAGEN) to stabilize the RNA and the resulting pellet frozen at -80°C.
- Total RNA was then extracted using the RNeasy Mini Kit (QIAGEN) with an on-column DNAse digestion.
- cDNA was synthesized using the Superscript IV First-Strand Synthesis System (Invitrogen).
- Adaptive evolution and whole genome sequencing [00144] The adaptive evolution experiments were performed with wild-type A. coli MG1655 (without any plasmids) in 30 mL of the same minimal media used for selections, containing 1000 mM AEC. Cells were grown at 37°C under 200 rpm in two different regimes: (1) growth for 48 hours (single-batch) since the inoculation; (2) growth for 5 days, with passages to new media every 24 hours (100 pL was transferred in each passage). Additionally, wild-type A. coli MG1655 cells were also grown for 48 hours in minimal media without any AEC present (parent strain genome). Next, the final cultures were streaked to agar plates of the same selective media and single colonies were processed for whole genome sequencing. To do so, genomic DNA was extracted using the Wizard Genomic DNA Purification Kit (Promega), libraries were prepared using the Nextera XT DNA Library Prep Kit (Illumina) and sequenced on an Illumina MiSeq 2x150 paired end reads run.
Landscapes
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Chemical & Material Sciences (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- General Chemical & Material Sciences (AREA)
- Physics & Mathematics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201862747479P | 2018-10-18 | 2018-10-18 | |
US62/747,479 | 2018-10-18 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2020081958A2 true WO2020081958A2 (en) | 2020-04-23 |
WO2020081958A3 WO2020081958A3 (en) | 2020-07-16 |
Family
ID=70283136
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2019/056977 WO2020081958A2 (en) | 2018-10-18 | 2019-10-18 | Compositions and methods for identifying mutations of genes of multi-gene systems having improved function |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2020081958A2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113061602A (en) * | 2021-02-26 | 2021-07-02 | 未米生物科技(江苏)有限公司 | High-flux promoter variation creating method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
HU223706B1 (en) * | 1994-12-09 | 2004-12-28 | Ajinomoto Co., Inc. | Novel lysine decarboxylase gene and process for producing l-lysine |
-
2019
- 2019-10-18 WO PCT/US2019/056977 patent/WO2020081958A2/en active Application Filing
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113061602A (en) * | 2021-02-26 | 2021-07-02 | 未米生物科技(江苏)有限公司 | High-flux promoter variation creating method |
Also Published As
Publication number | Publication date |
---|---|
WO2020081958A3 (en) | 2020-07-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Bassalo et al. | Deep scanning lysine metabolism in Escherichia coli | |
Wang et al. | Integrated whole-genome and transcriptome sequence analysis reveals the genetic characteristics of a riboflavin-overproducing Bacillus subtilis | |
US12031138B2 (en) | Recombinant microorganisms capable of carbon fixation | |
Pechter et al. | Essential genome of the metabolically versatile alphaproteobacterium Rhodopseudomonas palustris | |
Brutinel et al. | Anomalies of the anaerobic tricarboxylic acid cycle in S hewanella oneidensis revealed by T n‐seq | |
Horinouchi et al. | Improvement of isopropanol tolerance of Escherichia coli using adaptive laboratory evolution and omics technologies | |
CN112912496B (en) | Novel mutations that increase the DNA cleavage activity of CPF1 of the genus amino acid coccus | |
CN107429275B (en) | Genetically modified microorganisms with improved tolerance to L-serine | |
Lennen et al. | Combinatorial strategies for improving multiple-stress resistance in industrially relevant Escherichia coli strains | |
CN109055289B (en) | Recombinant escherichia coli for high yield of L-methionine and application thereof | |
US8735132B2 (en) | Mutations and genetic targets for enhanced L-tyrosine production | |
Auger et al. | Global expression profile of Bacillus subtilis grown in the presence of sulfate or methionine | |
Nærdal et al. | L-lysine production by Bacillus methanolicus: genome-based mutational analysis and L-lysine secretion engineering | |
CN114381416B (en) | Recombinant escherichia coli strain for high yield of 5-aminolevulinic acid and application thereof | |
Stella et al. | Biosensor-based growth-coupling and spatial separation as an evolution strategy to improve small molecule production of Corynebacterium glutamicum | |
Zuchowski et al. | Discovery of novel amino acid production traits by evolution of synthetic co-cultures | |
CN105400801B (en) | Release thrA gene mutation bodies and its application of feedback inhibition | |
Csonka et al. | Biosynthesis of proline | |
WO2020081958A2 (en) | Compositions and methods for identifying mutations of genes of multi-gene systems having improved function | |
Cai et al. | Engineering of the DNA replication and repair machinery to develop binary mutators for rapid genome evolution of Corynebacterium glutamicum | |
KR102269939B1 (en) | Thermococcus onnuienus WTF-156T having a mutation in the formic acid transporter and hydrogen production method using the same | |
Masachis et al. | FASTBAC-Seq: functional analysis of toxin–antitoxin systems in bacteria by deep sequencing | |
JP7447810B2 (en) | Method for producing tripeptide γ-GLU-VAL-GLY using Enterobacteriaceae | |
Wang et al. | Pooled CRISPR interference screens enable high-throughput functional genomics study and elucidate new rules for guide RNA library design in Escherichia coli | |
Hücker | RIBOseq-based discovery of non-annotated genes in Escherichia coli O157: H7 Sakai and their functional characterization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19873456 Country of ref document: EP Kind code of ref document: A2 |
|
ENP | Entry into the national phase |
Ref document number: 2021521258 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 19873456 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: JP |