Plant Biotechnol Rep
DOI 10.1007/s11816-017-0440-0
Online ISSN 1863-5474
Print ISSN 1863-5466
ORIGINAL ARTICLE
Effect of codon optimization on the enhancement of the b-carotene
contents in rice endosperm
Ye Sol Jeong1 • Hyung-Keun Ku1 • Jae Kwang Kim2 • Min Kyoung You1
Sun-Hyung Lim3 • Ju-Kon Kim4 • Sun-Hwa Ha1
•
Received: 21 May 2017 / Accepted: 25 May 2017
Ó Korean Society for Plant Biotechnology and Springer Japan 2017
Abstract A b-carotene is the most well-known dietary
source as provitamin A carotenoids. Among b-caroteneproducing Golden Rice varieties, PAC (Psy:2A:CrtI) rice
has been previously developed using a bicistronic recombinant gene that linked the Capsicum Psy and Pantoea CrtI
genes by a viral 2A sequence. To enhance b-carotene
content by improving this PAC gene, its codon was optimized for rice plants (Oryza sativa L.) by minimizing the
codon bias between the transgene donor and the host rice
and was then artificially synthesized as stPAC
(stPsy:2A:stCrtI) gene. The GC content (58.7 from 50.9%)
and codon adaptation index (0.85 from 0.77) of the stPAC
gene were increased relative to the original PAC gene with
76% DNA identity. Among 67 T1 seeds of stPAC transformants showing positive correlations between transgene
copy numbers (up to three) and carotenoid contents, three
stPAC lines with a single intact copy were chosen to
minimize unintended insertional effects and compared to
the representative line of the PAC transgene with respect to
their codon optimization effects. Translation levels were
stably increased in all three stPAC lines (3.0-, 2.5-, 2.9fold). Moreover, a greater intensity of the yellow color of
stPAC seeds was correlated with enhanced levels of bcarotene (4-fold, 2.37 lg/g) as well as total carotenoid
(2.9-fold, 3.50 lg/g) relative to PAC seeds, suggesting a bbranch preference for the stPAC gene. As a result, the
codon optimization of the transgene might be an effective
tool in genetic engineering for crop improvement as proven
at the enhanced levels of translation and carotenoid
production.
Ye Sol Jeong and Hyung-Keun Ku contributed equally to this study.
Keywords b-carotene Codon optimization
Recombinant gene Rice endosperm Synthetic gene
Electronic supplementary material The online version of this
article (doi:10.1007/s11816-017-0440-0) contains supplementary
material, which is available to authorized users.
Introduction
& Sun-Hwa Ha
sunhwa@khu.ac.kr
1
Department of Genetic Engineering and Graduate School of
Biotechnology, College of Life Sciences, Kyung Hee
University, Yongin 17104, Korea
2
Division of Life Sciences and Bio-Resource and
Environmental Center, Incheon National University,
Incheon 22012, Korea
3
National Academy of Agricultural Science, Rural
Development Administration, Jeonju 54874, Korea
4
Graduate School of International Agricultural Technology
and Crop Biotechnology Institute/GreenBio Science and
Technology, Seoul National University, Pyeongchang 25354,
Korea
The development of genetically modified rice that produces
b-carotene seeks to address malnutrition-associated vitamin A deficiency (VAD) problems, such as immune disorders and blindness, in developing countries (Sommer
et al. 2002; Ye et al. 2000). As b-carotene is a precursor of
vitamin A, consumption of b-carotene-producing rice as a
staple food could be an effective way to alleviate VAD
(Tang et al. 2009). By introducing two genes of phytoene
synthase (Psy) and carotene desaturase (CrtI) as separated
two cassettes into the rice genome, Golden Rice (GR) 1
and 2 successfully produced the carotenoids (1.6 lg/g) de
novo and then largely enhanced the amounts of those
(37 lg/g) up to 23 times, respectively (Paine et al. 2005;
123
Plant Biotechnol Rep
Ye et al. 2000). In 2010, by introducing the single
recombinant gene (PAC, Psy:2A:CrtI) as one cassette, a
new type of GR (PAC rice) produced the carotenoids as a
result of bicistronic expression of two genes (Ha et al.
2010). The effectiveness of the PAC gene was further
proven to produce the 62-fold higher levels of total carotenoids relative to wild-type when expressed in a seedspecific manner in soybeans (Kim et al. 2012). Hence, it
proved the practical use of viral 2A technology to transfer a
smaller transgene into plant genomes to express Psy and
CrtI simultaneously in a coordinated manner. Although this
PAC gene could greatly increase the contents of total
carotenoids in soybean (146 lg/g), the limited yield of
carotenoids in rice endosperm (1.30 lg/g) has been a barrier to practical use as biofortified rice plants (Ha et al.
2010; Kim et al. 2012).
In plants, metabolic engineering for specialized
metabolites and molecular farming for high-value proteins
with industrial importance require a sufficient quantity of
protein production through the efficient expression of
transgenes. In these fields of plant biotechnology, codon
modification has been widely used to optimize the
expression of heterologous transgenes to match the codon
usage of the host plants, including rice and tobacco plants
(Al-Babili et al. 2006; Kim et al. 2016; Li et al. 2016; Liu
et al. 2011; Misztal et al. 2004). Each living organism has
a preferred codon usage and its codon bias plays an
important role as a determinant of the translation efficiency as the subset of optimal codons corresponds to the
most abundant tRNAs (Ikemura 1981; Kanaya et al.
1999). Thus, one of the typical approaches for codon
optimization is to replace rare codons in a transgene into
abundant ones of the host to elevate translational efficiency (Gustafsson et al. 2004; Tuller et al. 2010). In
addition, codon optimization also matches the GC content
of nucleotides suitable for the host and alters unstable RNA sequences derived from different species, thus
increasing both transcriptional efficiency and transcript
stability (Jackson et al. 2014; Li et al. 2015).
In this study, we adopted a codon optimization strategy
by altering the heterologous codons of transgenes to ricepreferred ones to increase gene expression and to improve
b-carotene content in rice endosperm. The DNA sequences
of two original genes, Capsicum Psy (Psy) and Pantoea
CrtI (CrtI), constituting the PAC recombinant gene, were
artificially modified based on a rice codon usage into the
synthetic PAC gene (stPAC, stPsy:2A:stCrtI) including the
synthetic Psy (stPsy) and the synthetic CrtI (stCrtI) gene.
The effectiveness of codon optimization in stPAC rice in
comparison to PAC rice was evaluated by analyzing
transgene expression, color phenotype, and carotenoid
content in transgenic rice endosperm.
123
Materials and methods
Codon optimization and gene synthesis
Based on the native PSY protein sequence (419 aa) from
Korean red pepper (Capsicum annuum), and CRTI (493 aa)
from Pantoea ananatis (formerly Erwinia uredovora), the
nucleotide sequences of Psy and CrtI were modified to
derive the optimized sequences of stPsy and stCrtI with
referred codon usage for Oryza sativa using Leto software
(Entelechon, Regensberg, Germany) and a rice codon
usage table (http://www.kazusa.or.jp/codon/). The 2A
peptide sequence for bicistronic expression of two genes
and a transit peptide sequence (Tp) for chloroplast targeting
of CRTI were derived from a previously used PAC gene
(Ha et al. 2010). The final DNA sequence of stPAC
(2,952 bp), which was a sequential assembly of stPsy, 2A,
Tp, and stCrtI, was artificially synthesized using a customized service (Entelechon). To facilitate the Gateway
cloning procedure, the stPAC gene was inserted into the
pCR4-TOPO vector with attL1 and attL2 (Invitrogen,
Waltham, MA).
Vector construction and rice transformation
For rice transformation, the stPAC gene in the pCR4-TOPO
vector was directly recombined into the pMJ103 destination vector with an endosperm-specific rice globulin promoter (Myungji University, Yongin, Korea) using a
Gateway cloning strategy (Karimi et al. 2002), yielding the
pstPAC construct (Fig. 1). Gateway cloning was performed
using GatewayÒ LR ClonaseÒ II Enzyme Mix in accordance with the manufacturer’s instructions (Invitrogen).
The pstPAC in Escherichia coli strain (DH5a) was introduced into an Agrobacterium tumefaciens strain (LBA4404
harboring the pSB1 vector) by tri-parental mating with a
conjugated helper strain (HB101 harboring the pRK2013
plasmid). Embryogenic calli that proliferated from the
mature seeds of japonica-type Korean rice (Oryza sativa
L., cv. Hwayoung) were co-cultivated with a prepared
Agrobacterium strain possessing the pstPAC vector and
regenerated into transgenic rice plants via shooting and
rooting procedures on selection medium containing phosphinothricin (4 mg/L) and cefotaxime (500 mg/L) (Park
et al. 2015).
Growth conditions and visual inspection of seed
color
Putative transgenic plants (T0) were acclimatized in a
cultivation room and grown in a greenhouse at 28 °C until
maturity. Seeds of transgenic plants were harvested
Plant Biotechnol Rep
Fig. 1 Schematic representation of the PAC and stPAC constructs for
rice transformation. Each pPAC and pstPAC vector contains the
recombinant gene, Psy:2A:CrtI (PAC) and stPsy:2A:stCrtI (stPAC),
respectively. The remaining components in each vector construct are
exactly the same and include the endosperm-specific rice globulin
promoter (P-Glb), and the 30 region of the potato proteinase inhibitor
II gene (T-PinII). The Bar in the big arrow represents a cassette used
to express the Bar gene under a 35S promoter (P-35S) with a 30 region
from the nopaline synthase gene (T-Nos). The primer positions used
for qRT-PCR (used in Figs. 4a, 5 and listed in Table S1) to detect
transcripts of 2A-Tp (2A-F/Tp-R), stPsy (stPsy-F/stPsy-R), and stCrtI
(stCrtI-F/stCrtI-R) are indicated. The 50 -matrix attachment region
(Mar) from the chicken lysozyme gene flanked at both ends of
T-DNA was used as a probe in Southern blot analyses. BR right
border, BL left border, 2A rice codon-optimized DNA sequence
encoding 2A peptide from foot-and-mouth disease virus, Tp transit
peptide of rice rubisco small subunit, RI EcoRI restriction enzyme
sites
40 days after flowering (DAF) in the green house (T1
generation) and in the field (T2 and T3 generation). To
harvest PAC and stPAC seeds at the same generation (T3),
T2 plants from PAC line 4 and stPAC line 21, 25, 102 were
grown in the same field. The seed colors of the nontransgenic and transgenic rice plants were visually
inspected after dehusking (TR-200 Electromotion rice
husker; Kett, Tokyo, Japan) and polishing (Pearlest polisher; Kett).
(Nos-M) labeled with the fluorescent reporter dye 6-carboxy-fluoroscein (6-FAM) to detect the Nos terminator in a
Bar gene expression cassette in accordance with the manufacturer’s protocols on a CFX ConnectTM Real-Time
System (Bio-Rad, Richmond, CA). An a-tubulin assay
(Assay ID: Os03643486_s1; Applied Biosystems) using the
fluorescent reporter dye VIC was performed as an internal
control (Song et al. 2016). The PCR reaction conditions
were set as follows: 10 min enzyme activation at 95 °C,
followed by 40 cycles of 15 s at 95 °C, 1 min at 60 °C.
Data were analyzed to calculate the T-DNA copy number
using Bio-Rad CFX manager software (Bio-Rad) using the
homozygous transgenic rice line, PAC 4, which was
assigned a value of 1 as a single-copy reference (Ha et al.
2010). The sequences of primers for genomic DNA analysis are provided in the supporting information (Table S1).
Genomic DNA analysis
Using a modified cetyltriammonium bromide-based protocol (Allen et al. 2006), young leaf tissues from the T0
generation were used to extract genomic DNAs to estimate
the copy number of the transgene in rice plants. Southern
blot analysis was performed using five micrograms of DNA
digested with EcoRI, fractionated in a 1% agarose gel, and
transferred onto a Hybond-N? nylon membrane (Amersham Biosciences, Piscataway, NJ). The membrane was
hybridized with a Mar gene probe that was generated with
Mar-F/Mar-R using a PCR DIG Probe Synthesis Kit
(Roche Inc., Mannheim, Germany), and signals were
developed using a DIG High Prime DNA Labeling and
Detection Starter Kit II (Roche) according to the manufacturer’s instructions. The membrane was exposed to
Lumi-Film Chemiluminescent Detection film (Roche) and
the bands were imaged with LAS 4000 software (Fujifilm,
Tokyo, Japan).
Taqman PCR analysis was carried out with Taqman
Gene Expression Master Mix (Applied Biosystems, Foster
City, CA) using a primer set (Nos-F/Nos-R) and a probe
RNA analysis
Total RNA was extracted from 100 mg of mature seeds at
40 DAF after imbibition in water and trituration in a mortar
with liquid nitrogen using PureLinkÒ Plant RNA Reagent
(Invitrogen) according to the manufacturer’s instructions.
The RNAs were treated with DNase I (Qiagen, Hilden,
Germany) to remove any remaining genomic DNA and
their quality and concentration were checked via agarose
gel electrophoresis and a NanoDrop Spectrophotometer
ND-2000 (Nano-Drop Technologies, Wilmington, DE).
cDNAs were simultaneously synthesized and amplified
from 1 lg of total RNA using AccuPowerÒ RT Premix
(Bioneer, Daejeon, Korea) according to manufacturer’s
instructions. Quantitative real-time (qRT) PCR was carried
123
Plant Biotechnol Rep
out using a CFX ConnectTM Real-Time System (Bio-Rad).
Each reaction was composed of 100 ng of cDNA, 0.6 lM
of gene-specific primers, 10 lL SYBR Green Real-time
PCR master mix (Bio-Rad) in a total volume of 20 lL. The
PCR conditions were as follows: 3 min at 95 °C, followed
by 40 cycles of 15 s at 95 °C, 30 s at 60 °C. The primers
used to amplify target genes were as follows: F2A-RTp
(2A-F/Tp-R), stPsy (stPsy-F/stPsy-R), and stCrtI (stCrtI-F/
stCrtI-R). The rice ubiquitin 5 gene (OsUbi5, Os01g22490)
was used as an internal control using two OsUbi5 primer
sets (OsUbi-QA/OsUbi-QR, and OsUbi-QF/OsUbi-QR)
(Jain et al. 2006). Each qRT-PCR reaction was carried out
in triplicate on three individual plants. Information on the
primers used for qRT-PCR is listed in the supporting
information (Table S1).
Protein analysis
Protein from 100 mg of rice seed powder, prepared by
soaking rice in water and grinding it under liquid nitrogen,
was extracted using buffer containing 0.025 M Tris–HCl
(pH 6.8), 4% SDS, 4 M Urea, and 5% 2-mercaptoethanol.
The supernatants were collected from extracts after centrifugation at 12,0009g for 45 min at 4 °C, and protein
concentrations were measured using a Quant-iTTM Protein
Assay Kit (Invitrogen) according to the manufacture’s
protocol. A total of 80 lg of protein was separated by
electrophoresis on an 8% SDS/polyacrylamide gel and
transferred onto a PVDF membrane (Whatman, Kent, UK)
using a Trans-Blot SD semi-Dry electrophoretic transfer
cell (Bio-Rad). Western blot analyses were performed
using primary anti-CRTI antibody (1:2500) raised against
the Pantoea CRTI protein (kindly provided by Dr. Peter
Beyer group at University of Freiburg in Germany). Bound
antibody was detected using an anti-rabbit IgG (Fc) alkaline phosphatase-conjugated secondary antibody (1:5000)
and a BCIP/NBT color development system (Promega,
Madison, WI). The bands were imaged using LAS 4000
software (Fujifilm) and densitometric analysis was carried
out using Image QuantTL software (GE Healthcare, Piscataway, NJ).
Carotenoid extraction and quantitation
Mature seeds at 40 DAF were dehusked after harvesting
and drying and were ground into a fine powder using a
pestle for the extraction and quantitation of carotenoids by
high-performance liquid chromatography (HPLC). In brief,
1.0 g of the homogenized sample was prepared for carotenoid extraction with 3 mL of ethanol containing 0.1%
ascorbic acid (w/v) and saponified with potassium
hydroxide (120 lL, 80%, w/v). After adding b-Apo-80 carotenal (0.1 mL of 25 lg/mL) as an internal standard, the
123
carotenoids were extracted twice with 1.5 mL of hexane
followed by centrifugation at 1,2009g for layer separation.
Nitrogen-dried extract aliquots were re-dissolved in 50:50
(v/v) dichloromethane/methanol before HPLC analysis.
Carotenoid separation was performed on an YMC ODS
C-30 column (3 lm, 4.6 mm 9 250 mm; YMC Europe,
Germany) using an HPLC system (Agilent 1100 Series;
Agilent, Santa Clara, CA). Chromatograms were generated
at 472 and 450 nm for lycopene and the other four compounds of a-carotene, b-carotene, lutein, and zeaxanthin,
respectively. All details, solvent information and elution
conditions, were as described previously (Kim et al. 2010).
Carotenoid standards for lycopene (w,w-carotene) a-carotene (b,e-carotene), 13Z-b-carotene, (all-E)-b-carotene,
9Z-b-carotene, lutein (b,e-carotene-diol), and zeaxanthin
(b,b-carotene-diol) were purchased from CaroteNature
(Lupsingen, Switzerland). The amount of b-carotene (b,bcarotene) was calculated as the sum of 13Z-b-carotene,
(all-E)-b-carotene, and 9Z-b-carotene. With respect to peak
area ratios of the analyte to internal standard, calibration
curves were drawn by plotting different concentrations of
each carotenoid standard.
Results
Codon-optimization and artificial synthesis
of the stPAC gene based on the PAC gene for rice
To enhance the levels of carotenoids via gene modification
in rice endosperm, the DNA sequences of two carotenoid
biosynthetic genes originating from heterologous sources,
C. annum for Psy and Pantoea ananatis for CrtI, were
optimized for rice plants using the O. sativa codon usage
database (http://www.kazusa.or.jp/codon/) (Nakamura
et al. 2000). To minimize codon bias, codons less frequently used were replaced with more frequently used ones
based on the same protein sequences of the PAC recombinant gene. The codon adaptation index (CAI) calculated
using a CAI calculator (http://ppuigbo.me/programs/CAI
cal/) (Puigbo et al. 2008) resulted in codon alterations of
64.0% (268/419 aa) and 65.1% (321/493 aa) and DNA
similarities of 74.1 and 75.3% between original and modified genes encoding PSY and CRTI, respectively. Additionally, the CAIs of the two modified genes encoding PSY
and CRTI were increased to 0.84 and 0.85 from 0.73 and
0.7, while the GC contents were also increased to 57.8 and
57.9% from the 45.2 and 53.1% found in the original
genes, respectively. These percentages are similar to the
average GC content of O. sativa genes (about 55%) (Li
et al. 2015). A new version of a recombinant gene, the
stPAC gene, including two codon-optimized genes linked
with the same 2A and Tp sequences derived from the
Plant Biotechnol Rep
original PAC recombinant gene, was artificially synthesized. Consequently, the overall GC content and CAI of the
stPAC gene were increased from 50.9 to 58.7% and from
0.77 to 0.85, respectively, compared to the PAC gene. The
stPAC gene consisting of a single open-reading frame for
the expression of two genes was specifically expressed in
rice endosperm under the control of the rice globulin promoter using the same rice transformation vector previously
used to express the PAC gene (Ha et al. 2010). The regions
between the left and right T-DNA borders in the corresponding pPAC and pstPAC vectors are depicted
schematically in Fig. 1.
Effect of copy number of a stPAC transgene
on carotenoid content in rice endosperm
Using co-cultivation with an Agrobacterium strain harboring the pstPAC gene, a total of 67 stPAC transgenic rice
lines were obtained and the copy number of T-DNA was
analyzed by Southern blotting and Taqman PCR analyses
using leaf tissues from the T0 generation. The copy numbers in these stPAC lines were one (24 lines), two (24
lines), three (8 lines), and more than four (11 lines). Their
T1 seeds were harvested and the yellow phenotype was
observed and compared to the white phenotype of nontransgenic rice seeds. Additionally, HPLC analysis
revealed the levels of carotenoid resulting from de novo
synthesis from the introduced stPAC gene in the transgenic
rice endosperm. An average of 2.43 lg/g of total carotenoids was measured, with a range of a minimum of
0.43 lg/g and a maximum of 4.45 lg/g (Fig. 2). The
average content of carotenoids was enhanced as the copy
number of the transgene increased, resulting in 2.17 (one
Fig. 2 Levels of carotenoid content in T1 stPAC seeds. Total
amounts of carotenoid in 67 stPAC seeds carrying one (24 lines),
two (24 lines), three (8 lines), and more than four copies (11 lines) of
the stPAC gene were determined by HPLC analysis at the T1
generation. Individual and average carotenoid contents of stPAC
endosperm with their corresponding copy numbers are indicated by an
open circle (s) and dash (–), respectively
copy), 2.32 (two copies), and 2.93 lg/g (three copies) of
total carotenoids. Interestingly, insertion of over four genes
(an average 2.91 lg/g of four to nine copies) resulted in no
additional gene dosage effect. This implies a positive
correlation between carotenoid contents resulting from
stPAC gene expression and the copy number of the stPAC
gene up to three in rice endosperm, suggesting that transgenes over three could have more chances to cause negative effects on insertional position. Also, it suggests that the
good performance of carotenoid production could be generated in single-copy transgenic plants if sufficient numbers of transformants were obtained and analyzed for the
target trait.
Effect on protein expression from the stPAC
transgene in rice endosperm
Representative lines with the highest levels of carotenoid
were selected from among the 67 transgenic plants based
on the copy number of T-DNA integration (Fig. 2).
Specifically, three lines with one copy (No. 21, 25, 102),
three lines with two copies (No. 78, 85, 88), four lines with
three copies (No. 6, 8, 49, 73), and five lines with over four
copies (18, 95, 99, 117, 123) were selected. Their T1 seeds
were analyzed by Western blotting with an anti-CRTI
antibody to detect the downstream protein of the bicistronic
construct in stPAC, as well as PAC rice plant (Fig. 3). The
levels of expression of the CRTI protein in 15 stPAC plants
were compared relative to a T5 seed of the PAC line 4 that
was assigned a value of one in a previous study, which
confirmed equivalent levels of CRTI expression among
four PAC lines with a single copy transgene (Ha et al.
2010). All transgenic rice lines for PAC and stPAC displayed two signal bands for CRTI including a CRTI protein
of 55 kDa (cleaved protein) and a fused PSY:2A:CRTI of
110 kDa (uncleaved protein product), in addition to a nonspecific product (NSP) band around 60 kDa was a crossreacting product in all samples including non-transgenic
(NT) rice (Fig. 3). The stPAC lines exhibited variable
levels of CRTI protein expression, but most of them,
except for stPAC line 78, showed higher levels of expression than the PAC line 4. The average fold intensities of
CRTI expression in the stPAC endosperm were increased
to 3.3 (3, 4, and 3 in lines 21, 25, 102 with one copy), 3.7
(1, 5, and 5 in lines 78, 85, 88 with two copies), 6.3 (8, 8, 3,
and 6 in lines 6, 8, 49, 73 with three copies), and 5.8 (3, 7,
6, 8, and 5 in lines 18, 95, 99, 117, 123 with more than four
copies) relative to a value of 1 in PAC 4 endosperm
(Fig. 3). These results confirmed that codon-optimization
of stPAC from PAC for rice reliably enhances expression
of protein in endosperm. Also, the higher expression of
protein may be caused by the integration of more than a
single copy of the stPAC, as demonstrated by the positive
123
Plant Biotechnol Rep
Fig. 3 Levels of CRTI protein in T1 stPAC seeds. Western blot
analysis was performed using anti-CRTI antibody raised against
Pantoea CRTI protein in 15 stPAC seeds chosen from among 67 lines
at the same T1 generation based on high carotenoid contents.
Uncleaved (PSY:2A:CRTI) and cleaved (CRTI) products are indicated
by solid arrows and a non-specific product (NSP) is indicated by dotted
arrows. NSP was used as a loading control to determine band
intensities calculated using Image QuantTL software. Intensity fold
was calculated using CRTI band of PAC 4 line as a reference 1. NT
designates non-transgenic plants used as a negative control
correlation between expression levels and gene copies up
to three copies demonstrated in Fig. 2. Furthermore, to
compare the effectiveness of codon optimization of stPAC
from the PAC gene, three lines with a single copy of the
transgene showing stable levels of protein production were
selected and further developed to a homozygous
generation.
around 60 kDa, was observed. stPAC 21, 25, and 102
displayed 3.0-, 2.5-, and 2.9-fold greater intensities than
that observed in PAC 4 line when normalized using the
NSP signal. This indicates that the translation efficiency of
the modified stPAC gene was significantly improved on
average by a factor of 2.8-fold relative to the original PAC
gene (Fig. 4b).
Despite these similar levels of CRTI protein among the
three stPAC lines, different levels of the 2A-Tp transcripts
were examined in the regions of stPsy and stCrtI by qRTPCR (Fig. 5a, b). Both amounts of stPsy and stCrtI transcripts were resulted in the same pattern of the highest
expression in stPAC line 102 and relatively lower expression in stPAC 21 and 25 lines as detected with that of 2ATp (Fig. 4a). Hence, these results demonstrate that the
codon optimization might enhance translation efficiency as
estimated by quantification of Western bands (Fig. 4b)
irrespective of diverse transcript levels (Figs. 4a, 5a, b). It
still implies that the codon optimization can enhance the
stability of transcripts.
Comparison of transgene expression in the codonoptimized stPAC relative to PAC transgenic rice
endosperm
To define the effect on transgene expression resulting from
codon optimization of stPAC based on the PAC gene, the
stPAC 21, 25 and 102 lines with single intact copies, as
confirmed by Taqman PCR (Fig. S1a) and Southern blot
(Fig. S1b) analyses, were selected since high carotenoid
content and stable protein expression among these lines
had been analyzed (Figs. 2, 3). To ensure the comparability
of stPAC relative to PAC lines, they were grown in the
same field during the same year as PAC 4 with the single
copy of the transgene as a reference line (Ha et al. 2010).
Additionally, their seed ages were also matched to a coeval
generation at T3 by transplanting the same T2 plants. The
transcriptional efficiencies of the PAC and stPAC recombinant genes were first compared by qRT-PCR using
identical nucleotide sequences as primers in the 2A and Tp
region (Fig. 4a). Transcript levels of 2A-Tp were 1.4- and
2.8-fold higher in stPAC lines 25 and 102, but were similar
or decreased by 0.9-fold in stPAC line 21, than transcript
levels in the PAC 4 line. This suggests that transcriptional
efficiency tended to be enhanced by codon optimization,
but we were unable to confirm this here.
Meanwhile, translational efficiency between PAC and
stPAC recombinant genes was defined at the same T3 seed
generation by immunoblotting with anti-CRTI antibody
(Fig. 4b). The same pattern of signals as in Fig. 3,
including two CRTI bands of 55 kDa (CRTI protein alone)
and 110 kDa (fusion protein with PSY) and one NSP band
123
Comparison of carotenoid production in a codonoptimized stPAC relative to PAC transgenic rice
endosperm
Using golden color inspection as the most convenient tool
to visualize the carotenoid contents in rice endosperm,
color intensity among PAC and stPAC lines was compared
at the same T3 seed generation (Fig. 6). Unlike the white
endosperm of non-transgenic (NT) rice, a homogeneous
yellow color was observed in every transgenic line, with
more intense coloring observed among the three stPAC
lines relative to the PAC endosperm, indicating the effect
of codon optimization on enhanced carotenoid content. The
total content and composition of the carotenoids were
measured in the same T3 seeds of PAC and stPAC lines by
HPLC analysis (Fig. 7; Table S2). The total average
amount of carotenoid in the three stPAC lines was 3.5 lg/g
(an average of 2.5, 3.4, and 4.6 lg/g), which was
Plant Biotechnol Rep
Fig. 4 Comparison of transgene expression between PAC and stPAC
transformants in T3 seed generation. a qRT-PCR was performed to
quantify the relative expression levels in the common 2A-Tp region
between PAC and stPAC transcripts. The same amounts of seed RNA
were normalized with the OsUbi 5 gene. Error bars represent the
Fig. 5 Comparison of transgene expression among stPAC transformants in T3 seed generation. qRT-PCR was performed to quantify the
relative expression levels of the stPsy and stCrtI transcripts in three
stPAC lines. The same amounts of seed RNA were normalized with
the OsUbi 5 gene. Error bars represent the standard deviations among
triplicate experimental reactions. NT designates non-transgenic plants
used as a negative control
Fig. 6 Comparison of seed color phenotypes between PAC and
stPAC transformants in T3 seed generation. Photographs showing the
color phenotypes of PAC and stPAC transgenic rice seeds along with
non-transgenic (NT) rice seeds after dehusking followed by polishing
approximately 2.9-time higher than that measured in the
PAC line (1.22 lg/g) in Fig. 7a. Interestingly, the b/a ratio
of carotenoids in the stPAC lines was 2.7 (an average of
2.3, 2.9, and 2.7), which was far higher than the 1.2 in PAC
4 line and 0.6 in the NT plant (Fig. 7b). This was supported
by a big increase in b-carotenoids, including a 4-fold
increase in b-carotene (to 2.4 from 0.6 lg/g) and a 2.3-fold
increase in zeaxanthin (to 0.16 from 0.07 lg/g) in stPAC
lines relative to the PAC 4 line (Fig. 7c) with the altered
proportions of b-carotene (68 and 48%) and similarly
standard deviations among triplicate experimental reactions. NT
designates non-transgenic plants used as a negative control.
b Western blot analysis using anti-CRTI antibody to determine the
abundance of the CRTI protein were performed as described in Fig. 3
maintaining of zeaxanthin (around 5–6%) between stPAC
and PAC 4 lines. As a-carotenoids, a-carotene and lutein
also showed a2.1- and 1.5-fold increases in stPAC lines
(0.39 and 0.56 lg/g) relative to the PAC 4 line (0.19 and
0.37 lg/g), but no degree of increase was higher than those
of the b-carotene (Fig. 7d). The compositional proportions
of lutein and a-carotene would rather be decreased in the
stPAC (16 and 11%) relative to the PAC 4 (30 and 16%),
respectively. Enhanced production of b-carotene component indicates that a codon-optimized stPAC gene had an
effect on the enhanced levels of total carotenoids with bbranch preference relative to the PAC gene in rice
endosperm.
Discussion
Codon modification is one tool used to generate the optimum DNA sequence for introduction of heterologous
transgenes, which match up with the host conditions,
including the codon frequency known as codon bias
(Gustafsson et al. 2004; Ikemura 1981; Kanaya et al. 1999;
Tuller et al. 2010), GC content (Jackson et al. 2014; Li
et al. 2015), and the removal of mRNA destabilizing
sequences (Jackson et al. 2014). However, the effectiveness of the gene modified by codon optimization has rarely
been reported in comparison with an original unmodified
gene, in spite of lots of reports of the development of
genetically modified organisms for the production of useful
proteins and metabolites using foreign genes (Kim et al.
2016; Li et al. 2016; Liu et al. 2011; Misztal et al. 2004).
Among several reports that have adopted the use of synthetic genes in cyanobacteria, Brassica and the rice callus
system for the production of isoprene and carotenoids (Bai
et al. 2014; Fujisawa et al. 2009; Lindberg et al. 2010),
only the DNA sequence of the isoprene synthase gene from
kudzu plants was successfully altered for use in
123
Plant Biotechnol Rep
Fig. 7 Comparison of
carotenoid content and
composition between PAC and
stPAC transformants in T3 seed
generation. HPLC analysis
determined the total amount of
carotenoid, b/a ratio of
carotenoid, and the content of bcarotenoids (b-carotene and
zeaxanthin) and a-carotenoids
(a-carotene and lutein). Data are
expressed as mean (lg/g dry
weight) ±SE from three
independent experiments using
unpolished mature seeds.
Numerical results of these
figures refer to Table S2 in
detail
cyanobacteria and resulted in an improvement in protein
expression by a factor of about 10 (Lindberg et al. 2010).
In this study, we investigated the effect of codon optimization of two genes from Capsicum plants and Pantoea
bacteria for use in rice plants, along with analyzing the
effect of transgene copy number. The characteristics of the
DNA sequence in our final stPAC gene (2,952 bp) were
altered by increases in the GC content (58.7 from 50.9%)
and CAI (0.85 from 0.77), resulting in 76% DNA similarity
based on a PAC gene (2,952 bp). Through the molecular and
metabolite analyses of the enough number of transgenic
plants (Figs. 1, 2), three chosen stPAC lines showed an
average 2.8-fold increase in CRTI translation (Fig. 4b) and
more intense yellow color in seeds (Fig. 6) with an average
2.9-fold increase in total carotenoid contents (Fig. 7a;
Table S2) relative to those of the PAC 4 line were observed.
Interestingly, using a similar approach, the synthetic CrtI
gene (Crt-Synth) has been previously examined if b-carotene
increased or not (Al-Babili et al. 2006). As a result, this
modified gene with 73.8% identity to the original CrtI
sequence did not increase the content of carotenoids
including b-carotene and CRTI protein abundance (Al-Babili et al. 2006), unlike our results supported by the immunoblot experiments using an anti-CRTI antibody exhibiting
enhanced translational efficiency relative to CrtI in PAC
(Figs. 3, 4b). This implies that our stCrtI in the stPAC gene
was more effectively modified than Crt-Synth. Moreover,
the accompanying optimization of codons for the gene
encoding PSY as the stPsy might have had a positive
influence resulting in increased carotenoid production.
123
To minimize the unintended insertional effects in
genetically engineered plants (Schnell et al. 2015), only
transgenic lines with a single intact copy of T-DNA were
selected and grown under the exact same conditions
between PAC and stPAC lines during the coeval T3 generation. In our study, we presented substantive evidence
based on numerical values of efficiency (2.8- and 2.9-fold
in production of protein and metabolite) by codon optimization, suggesting an effective tool for the adoption of
genetic engineering in the broad field of crop
biotechnology.
Acknowledgements This work was supported by grants from the
Next-Generation BioGreen 21 Program (PJ01128601 and
PJ01109401 to S.-H. Ha) funded by the Rural Development Administration. Our work was also supported by the Research Program
through the National Research Foundation of Korea funded by the
Ministry of Education, Science, and Technology (NRF2016R1A2B4013485 to S.-H. Ha).
References
Al-Babili S, Hoa TT, Schaub P (2006) Exploring the potential of the
bacterial carotene desaturase CrtI to increase the beta-carotene
content in Golden Rice. J Exp Bot 57:1007–1014
Allen GC, Flores-Vergara MA, Krasynanski S, Kumar S, Thompson
WF (2006) A modified protocol for rapid DNA isolation from
plant tissues using cetyltrimethylammonium bromide. Nat Protoc 1:2320–2325
Bai C, Rivera SM, Medina V, Alves R, Vilaprinyo E, Sorribas A,
Canela R, Capell T, Sandmann G, Christou P (2014) An in vitro
system for the rapid functional characterization of genes
Plant Biotechnol Rep
involved in carotenoid biosynthesis and accumulation. Plant J
77:464–475
Fujisawa M, Takita E, Harada H, Sakurai N, Suzuki H, Ohyama K,
Shibata D, Misawa N (2009) Pathway engineering of Brassica
napus seeds using multiple key enzyme genes involved in
ketocarotenoid formation. J Exp Bot 60:1319–1332
Gustafsson C, Govindarajan S, Minshull J (2004) Codon bias and
heterologous protein expression. Trends Biotechnol 22:346–353
Ha SH, Liang YS, Jung H, Ahn MJ, Suh SC, Kweon SJ, Kim DH,
Kim YM, Kim JK (2010) Application of two bicistronic systems
involving 2A and IRES sequences to the biosynthesis of
carotenoids in rice endosperm. Plant Biotechnol J 8:928–938
Ikemura T (1981) Correlation between the abundance of Escherichia
coli transfer RNAs and the occurrence of the respective codons
in its protein genes: a proposal for a synonymous codon choice
that is optimal for the E. coli translational system. J Mol Biol
151:389–409
Jackson MA, Sternes PR, Mudge SR, Graham MW, Birch RG (2014)
Design rules for efficient transgene expression in plants. Plant
Biotechnol J 12:925–933
Jain M, Nijhawan A, Tyagi AK, Khurana JP (2006) Validation of
housekeeping genes as internal control for studying gene
expression in rice by quantitative real-time PCR. Biochem
Biophys Res Commun 345:646–651
Kanaya S, Yamada Y, Kudo Y, Ikemura T (1999) Studies of codon
usage and tRNA genes of 18 unicellular organisms and
quantification of Bacillus subtilis tRNAs: gene expression level
and species-specific diversity of codon usage based on multivariate analysis. Gene 238:143–155
Karimi M, Inze D, Depicker A (2002) GATEWAY vectors for
Agrobacterium-mediated plant transformation. Trends Plant Sci
7:193–195
Kim JK, Lee SY, Chu SM, Lim SH, Suh SC, Lee YT, Cho HS, Ha SH
(2010) Variation and correlation analysis of flavonoids and
carotenoids in Korean pigmented rice (Oryza sativa L.) cultivars.
J Agric Food Chem 58:12804–12809
Kim MJ, Kim JK, Kim HJ, Pak JH, Lee JH, Kim DH, Choi HK, Jung
HW, Lee JD, Chung YS, Ha SH (2012) Genetic modification of
the soybean to enhance the beta-carotene content through seedspecific expression. PLoS One 7:e48287
Kim MY, Kim TG, Yang MS (2016) Production and immunogenicity
of Actinobacillus pleuropneumoniae ApxIIA protein in transgenic rice callus. Protein Expr Purif 132:116–123
Li J, Zhou J, Wu Y, Yang S, Tian D (2015) GC-content of
synonymous codons profoundly influences amino acid usage. G3
5:2027–2036
Li Y, Wang R, Hu Z, Li H, Lu S, Zhang J, Lin Y, Zhou F (2016)
Expression of a codon-optimized dsdA Gene in tobacco plastids
and rice nucleus confers D-serine tolerance. Front Plant Sci 7:640
Lindberg P, Park S, Melis A (2010) Engineering a platform for
photosynthetic isoprene production in cyanobacteria, using
Synechocystis as the model organism. Metab Eng 12:70–79
Liu F, Zhao YY, Zheng JG (2011) High-level expression of modified
gene encoding human adiponectin in transgenic rice. Biol Res
44:369–375
Misztal LH, Mostowska A, Skibinska M, Bajsa J, Musial WG,
Jarmolowski A (2004) Expression of modified Cry1Ac gene of
Bacillus thuringiensis in transgenic tobacco plants. Mol Biotechnol 26:17–26
Nakamura Y, Gojobori T, Ikemura T (2000) Codon usage tabulated
from international DNA sequence databases: status for the year
2000. Nucleic Acids Res 28:292
Paine JA, Shipton CA, Chaggar S, Howells RM, Kennedy MJ,
Vernon G, Wright SY, Hinchliffe E, Adams JL, Silverstone AL,
Drake R (2005) Improving the nutritional value of Golden Rice
through increased pro-vitamin A content. Nat Biotechnol
23:482–487
Park SH, Jin SJ, Choi YD, Kim JK (2015) Characterization of the rice
RbcS3 promoter and its transitpeptide for use in chloroplasttargeted expression. Plant Biotechnol Rep 9:395–403
Puigbo P, Bravo IG, Garcia-Vallve S (2008) CAIcal: a combined set
of tools to assess codon usage adaptation. Biol Direct 3:38
Schnell J, Steele M, Bean J, Neuspiel M, Girard C, Dormann N,
Pearson C, Savoie A, Bourbonniere L, Macdonald P (2015) A
comparative analysis of insertional effects in genetically engineered plants: considerations for pre-market assessments. Transgenic Res 24:1–17
Sommer A, Davidson FR, Accords Annecy (2002) Assessment and
control of vitamin A deficiency: the Annecy Accords. J Nutr
132:2845S–2850S
Song M, Lim S, Kim JK, Jung ES, John KM, You M, Ahn S, Lee CH,
Ha S (2016) In planta cleavage of carotenoids by Arabidopsis
carotenoid cleavage dioxygenase 4 in transgenic rice plants.
Plant Biotechnol Rep 10:291–300
Tang G, Qin J, Dolnikowski GG, Russell RM, Grusak MA (2009)
Golden Rice is an effective source of vitamin A. Am J Clin Nutr
89:1776–1783
Tuller T, Waldman YY, Kupiec M, Ruppin E (2010) Translation
efficiency is determined by both codon bias and folding energy.
Proc Natl Acad Sci USA 107:3645–3650
Ye X, Al-Babili S, Kloti A, Zhang J, Lucca P, Beyer P, Potrykus I
(2000) Engineering the provitamin A (beta-carotene) biosynthetic pathway into (carotenoid-free) rice endosperm. Science
287:303–305
123