WO2009124254A1 - Nucleotide analogs - Google Patents
Nucleotide analogs Download PDFInfo
- Publication number
- WO2009124254A1 WO2009124254A1 PCT/US2009/039475 US2009039475W WO2009124254A1 WO 2009124254 A1 WO2009124254 A1 WO 2009124254A1 US 2009039475 W US2009039475 W US 2009039475W WO 2009124254 A1 WO2009124254 A1 WO 2009124254A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nucleotide analog
- inhibitor
- group
- analog
- nucleotide
- Prior art date
Links
- 0 C*(NC(CSSCCC(NCC#Cc1c[n]([C@@]2OC(COP(O)(OP(O)(OP(O)(O)=O)=O)=O)=CC2)c2ncnc(N)c12)=O)C(NCCCCCC(NC(CC(O)=O)C(NC(CC(O)=O)C(O)=O)=O)=O)=O)=* Chemical compound C*(NC(CSSCCC(NCC#Cc1c[n]([C@@]2OC(COP(O)(OP(O)(OP(O)(O)=O)=O)=O)=CC2)c2ncnc(N)c12)=O)C(NCCCCCC(NC(CC(O)=O)C(NC(CC(O)=O)C(O)=O)=O)=O)=O)=* 0.000 description 9
- QGMVEQSLPCWBLM-UHFFFAOYSA-O CC[N+](c(cc1)c(C2(C)C)cc1S(O)(=O)=O)=C2/C=C/C=C/C=C(\C1(C)C)/N(CCCCCC(NC(CSSCCC(NCC#Cc2c[n](C(C3)OC(COP(O)(OP(O)(OP(O)(O)=O)=O)=O)C3O)c3ncnc(N)c23)=O)C(NCCCCCC(NCCN(C(C=CN2C(C3)OC(COP(O)(O)=O)C3O)=O)C2=O)=O)=O)=O)c(cc2)c1cc2S(O)(=O)=O Chemical compound CC[N+](c(cc1)c(C2(C)C)cc1S(O)(=O)=O)=C2/C=C/C=C/C=C(\C1(C)C)/N(CCCCCC(NC(CSSCCC(NCC#Cc2c[n](C(C3)OC(COP(O)(OP(O)(OP(O)(O)=O)=O)=O)C3O)c3ncnc(N)c23)=O)C(NCCCCCC(NCCN(C(C=CN2C(C3)OC(COP(O)(O)=O)C3O)=O)C2=O)=O)=O)=O)c(cc2)c1cc2S(O)(=O)=O QGMVEQSLPCWBLM-UHFFFAOYSA-O 0.000 description 1
- FTCIRCMHTIVOLZ-UHFFFAOYSA-N CC[O](C(C)(C)C)N Chemical compound CC[O](C(C)(C)C)N FTCIRCMHTIVOLZ-UHFFFAOYSA-N 0.000 description 1
- OSLAYNSYTXNMCR-UHFFFAOYSA-N O=C(CCS)NI Chemical compound O=C(CCS)NI OSLAYNSYTXNMCR-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H19/00—Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof
- C07H19/02—Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof sharing nitrogen
- C07H19/04—Heterocyclic radicals containing only nitrogen atoms as ring hetero atom
- C07H19/06—Pyrimidine radicals
- C07H19/073—Pyrimidine radicals with 2-deoxyribosyl as the saccharide radical
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07H—SUGARS; DERIVATIVES THEREOF; NUCLEOSIDES; NUCLEOTIDES; NUCLEIC ACIDS
- C07H19/00—Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof
- C07H19/02—Compounds containing a hetero ring sharing one ring hetero atom with a saccharide radical; Nucleosides; Mononucleotides; Anhydro-derivatives thereof sharing nitrogen
- C07H19/04—Heterocyclic radicals containing only nitrogen atoms as ring hetero atom
- C07H19/16—Purine radicals
- C07H19/173—Purine radicals with 2-deoxyribosyl as the saccharide radical
Definitions
- the invention relates to nucleotide analogs and methods for sequencing a nucleic acid using the nucleotide analogs.
- Sequencing-by-synthesis involves the template-dependent addition of nucleotides to a template/primer duplex.
- Traditional sequencing-by-synthesis is performed using dye- labeled terminators and gel electrophoresis (so-called "Sanger sequencing”). See, e.g., Sanger, F. and Coulson, A.R., 1975, J. MoI. Biol. 94: 441-448; Sanger, F. et al, 1977, Nature. 265(5596): 687-695; and Sanger, F. et al, 1977, Proc. Natl. Acad. ScL U.S.A. 75: 5463-5467.
- a challenge that has arisen in single molecule sequencing involves the ability to sequence through homopolymer regions (i.e., portions of the template that contain consecutive identical nucleotides). Often the number of bases present in a homopolymer region is important from the point of view of genetic function. Many polymerase enzymes used in sequencing-by- synthesis reactions are highly-processive and tend to add bases continuously in a homopolymer region. It is often difficult to resolve the number of nucleotides in a homopolymer due to the difficulty in distinguishing between the incorporation of one or two labeled nucleotides and the incorporation of a greater number of nucleotides.
- the invention provides nucleotide analogs and methods of using them to allow sequencing-by-synthesis to occur such that, on average, a single nucleotide is incorporated into the 3' end of a primer portion of a template/primer duplex per sequencing cycle.
- the invention is based, in part, on the discovery that nucleotide analogs having an attached inhibitory region with one or more charged groups provide good incorporation of a single nucleotide into the duplex without allowing a significant, or any, amount of second, third, etc. base incorporation.
- the invention generally provides nucleotide analogs and methods of using nucleotide analogs in sequencing. More particularly, the invention provides compounds, methods and compositions useful in introduction of a single base at a time in a template- dependent sequencing-by-synthesis reaction. The invention allows template-dependent sequencing-by-synthesis through all regions of a target nucleic acid, including homopolymer regions, and provides methods for the determination of the number of nucleotides present in a homopolymer region.
- the invention provides nucleotide analogs that comprise a nucleotide (or nucleotide analog), a detectable label, and an inhibitor group.
- the inhibitor prevents subsequent nucleotide incorporation into the same duplex.
- the nucleotide analog does not substantially hinder subsequent nucleotide (or nucleotide analog) incorporation.
- a method for sequencing a nucleic acid includes the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog comprising an inhibitor that is charged or capable of becoming charged, and a polymerase, under conditions that permit template-dependent incorporation of the analog into the primer; detecting incorporation of the analog; removing or neutralizing the inhibitor; and repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template.
- the invention in another aspect, relates to a nucleotide analog that includes: a nucleoside triphosphate; an inhibitor comprising (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more (i.e., a plurality of) singly charged groups or two or more groups capable of becoming singly charged; a detectable label; and a linker connecting the inhibitor and the label to the nucleoside triphosphate. It should be noted that in some embodiments, one or a single charged group may be sufficient to provide the desired inhibitory effect.
- the invention relates to nucleotide analogs of the formula:
- NTP is a nucleoside or nucleotide triphosphate or an analog thereof capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template.
- Inhibitor comprises a moiety that is charged or capable of becoming charged and that inhibits subsequent nucleotide incorporation once the first nucleotide is incorporated.
- Tether is a bond or a group linking the NTP to the Inhibitor group.
- the inhibitor is a non-steric inhibitor.
- the invention relates to nucleotide analogs of Formula II:
- NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP.
- L is a detectable label that facilitates the identification of the nucleotide analog.
- Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged.
- Ri and R 2 are independently a bond or a group, wherein at least one of Ri and R 2 comprises a cleavable bond, which upon cleavage results in de- association of NTP from both Label and Inhibitor.
- R 3 is a bond or group linking R 2 to the Inhibitor.
- R 4 is a bond or group linking R 2 to a Label.
- NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
- L is a detectable label that facilitates the identification of the nucleotide analog;
- Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
- Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
- R 2 is a tri-valent radical having the formula:
- each of R 2 ' and R 2 " is a bi-valent or tri-valent radical selected from:
- the invention relates to a method for sequencing a nucleic acid.
- the method includes: (a) anchoring a nucleic acid duplex, or portion thereof, to a surface, the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or II (as defined herein) in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and repeating the exposing, removing, and detecting steps at least once.
- the invention relates to a method for sequencing a nucleic acid, the method comprising the steps of: (a) exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog of the following Formula II:
- NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3 ' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP
- L is a detectable label that facilitates the identification of the nucleotide analog
- Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged
- Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor
- R 2 is a tri-valent radical having the formula:
- each of R 2 ' and R 2 " is a bi-valent or tri-valent radical selected from:
- the invention provides methods and nucleotide analogs for selectively inhibiting the catalytic function of a polymerase enzyme.
- nucleotide analogs comprise an inhibitory portion, such that the nucleotide analog is capable of being incorporated into a nucleic acid duplex but then inhibits subsequent nucleotide incorporation until the inhibitory portion is removed.
- the inhibitory portion of an analog of the invention preferably is a charged group.
- the charged group can take any appropriate form as long as it carries a charge.
- the charge group is selected from a phosphate, a carboxylic acid (or carboxylate), a sulfate, caproic acid (or a caproic acid derivative), a charged amino acid, -SO3, -SO 2 , and - NR W R V , where R w and R v independently is H, an alkyl or aryl group.
- the charged group can convey a negative or positive charge, but negative charged groups are preferred.
- the charge group contains multiple charged portions.
- the charge group can be a dipeptide, a di-phosphate, disulfate, or other multiples of charged moieties.
- amino acid inhibitors are preferably selected from aspartic acid, glutamic acid, arginine, lysine, and histidine.
- the invention provides charged inhibitors of subsequent base incorporation in a sequencing-by-synthesis reaction.
- subsequent base incorporation it is intended that a first nucleotide (or analog) is incorporated in a template-dependent manner, but second, third, etc. base incorporation is inhibited by the inhibitor group.
- inhibition occurs by positioning a charged group in proximity to the active site of a polymerase enzyme, thus disabling the ability of the polymerase to make subsequent incorporations.
- analogs of the invention interfere with magnesium present in the active site of the polymerase, resulting in a reduced ability of the active site to catalyze subsequent nucleotide incorporation.
- an analog of the invention comprises a nucleoside triphosphate, an inhibitor comprising a plurality of charged groups, a detectable label, and a linker connecting the charged groups and the label to the nucleoside triphosphate.
- Preferred inhibitors comprise a plurality of charged groups and may be selected from any charged group capable of conferring a charge in a local area.
- the inhibitor does not sterically inhibit a polymerase.
- the linker is cleavable. Multiple cleavable groups, such as enzymatically-cleavable group, such as disulfide bonds and the like.
- the invention provides methods and compositions that facilitate the addition of a single nucleotide to a template/primer duplex per reaction cycle (i.e., the addition of nucleotides and polymerase enzyme under conditions that result in template-dependent nucleotide incorporation into the primer).
- Analogs of the invention comprise a charged inhibitory group that, upon incorporation of a nucleotide in a template-dependent manner, prevents subsequent nucleotide incorporation until the inhibitory group is removed.
- an analog of the invention comprises a nucleotide triphosphate, a linker (or tether), a detectable label, and a charged inhibitory group, wherein the label and the inhibitory group are removable.
- the invention generally provides nucleotide analogs of the following Formula I:
- NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
- Inhibitor comprises a group that is charged or capable of becoming charged, e.g., under reaction conditions, and that inhibits a subsequent incorporation of a nucleotide (or analog thereof), and
- Tether is a bond or a group linking the NTP to the Inhibitor moiety.
- a group is considered capable of becoming charged if the group is capable of becoming electrically non-neutral, e.g., under reaction or buffer conditions. Examples of such groups include -COOH and -NR W R V , where R w and R v independently is H, an alkyl or aryl group.
- the inhibitor group can cause inhibition of subsequent nucleotide incorporation without steric hinderance. In other words, the inhibition is caused by chemical or charge interaction with the enzyme and not be a physical blocking of the enzyme.
- the charged inhibitor also provides steric inhibition of enzyme activity. However, in either case, the inhibitor group is charged.
- Natural NTPs include nucleoside triphosphates, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytidine triphosphate (CTP), thymidine triphosphate (TTP) and uridine triphosphate (UTP); and nucleotide triphosphates, deoxyadenosine triphosphate (dATP), deoxyguanosine triphosphate (dGTP), deoxycytidine triphosphate (dCTP), deoxythimidine triphosphate (dTTP) and deoxyuridine triphosphate (dUTP).
- NTPs useful in this invention include non-nature nucleosides and nucleotides, and analogs and derivatives thereof.
- the inhibitor may include a moiety that is negatively charged or capable of becoming a negatively charged. In other embodiments, the inhibitor group is positively charged or capable of becoming positively charged.
- the inhibitor is an amino acid or an amino acid analog.
- the Inhibitor may be a peptide of 2 to 20 units of amino acids or analogs, a peptide of 2 to 10 units of amino acids or analogs, a peptide of 3 to 7 units of amino acids or analogs, a peptide of 3 to 5 units of amino acids or analogs.
- the Inhibitor includes a group selected from the group consisting of GIu, Asp, Arg, His, and Lys, and a combination thereof (e.g., Arg, Arg- Arg, Asp, Asp- Asp, GIu, Glu-Glu, Asp-Glu-Asp, Asp-Asp-Glu or Asp Asp Asp Asp).
- Peptides or groups may be combinations of the same or different amino acids or analogs.
- the invention relates to an oligonucleotide with at least one nucleotide analog of the invention incorporated therein.
- the Tether comprises
- L is detectable label that facilitates the identification of the nucleotide analog after incorporation onto a template
- Ri and R 2 are independently a bond or a group, wherein at least one of Ri and R 2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
- R3 is a bond or group linking R 2 to the Inhibitor moiety; and R4 is a bond or group linking R 2 to a L.
- the present invention is directed to nucleotide analogs of Formula II:
- NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
- L is a detectable label to facilitate the identification of the nucleotide analog after incorporation onto the template
- Inhibitor is a moiety that substantially inhibits a subsequent incorporation of a nucleotide (or analog thereof).
- the Inhibitor moiety includes a nucleotide or nucleoside or analogs thereof, in other embodiments, the inhibitor is not a nucleotide or analog thereof;
- Ri and R 2 are independently a bond or a group, wherein at least one of Ri and R 2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both Label and Inhibitor;
- R 3 is a bond or group linking R 2 to the Inhibitor moiety; and R 4 is a bond or group linking R 2 to L.
- NTP is a compound having the following formula: wherein B is selected from the group consisting of purine or pyrimidine bases, as well as derivatives of purine and pyrimidine bases; R' is independently selected from the group consisting of-OH, -0-P(O)(OH) 2 , -O-C(O)-R X , -NHR y , and an -O-blocking agent, where R x and R y are alkyl groups; R" is independently selected from the group consisting of H and -OH.
- Non-limiting examples of representative purine and pyrimidine bases include adenine, cytosine, guanine, thymine, uracil, or hypoxanthine.
- Non-limiting examples of derivatives of purine and pyrimidine bases include naturally-occurring and synthetic derivatives of a base, including pyrazolo[3,4-d]pyrimidines, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-
- Base B 1 of the invention permits a nucleotide to be incorporated into a polynucleotide chain by a polymerase and forms base pairs with a base on an antiparallel nucleic acid strand.
- the term base pair encompasses not only the standard AT, AU or GC base pairs, but also base pairs formed between nucleotides and/or nucleotide analogs comprising non-standard or modified bases, wherein the arrangement of hydrogen bond donors and hydrogen bond acceptors permits hydrogen bonding between a nonstandard base and a standard base or between two complementary non-standard base structures.
- non-standard base pairing is the base pairing between the nucleotide analog inosine and adenine, cytosine or uracil, where two hydrogen bonds are formed.
- the Inhibitor may include a charged moiety (e.g., a negatively charged moiety, a positively charged moiety, or both) or a moiety that is capable of becoming charged.
- the Inhibitor can include two or more charged groups.
- the Inhibitor may have a charged group selected from the group consisting of -COOH, -PO 4 , -SO 4 , -SO3, -SO 2 , -NR W R V , where R w and R v independently is H, an alkyl or aryl group.
- the Inhibitor moiety does not comprise a -PO 4 group.
- the Inhibitor moiety does not comprise an aryl group.
- the Inhibitor does not include a nucleotide or nucleoside or analogs thereof.
- Inhibitor may be a compound having the following formula:
- each Ai and each A 2 is independently an amino acid moiety;
- Rs and R 9 independently is a H or an alkyl group;
- each of x and y is an integer from 0 to about 10.
- R 3 of a nucleotide analog of Formula II may include a group having the formula of
- R 5 is a H or an alkyl group
- p is an integer from 0 to about 10. In some embodiments, p is 5 or 6.
- R3 of a nucleotide analog of Formula II may include a group having the formula of
- k is an integer from about 1 to about 5. In some embodiments, k is an integer from about 2 to about 4. In some embodiments, k is 3.
- R3 of a nucleotide analog of Formula II may include a group having the formula of
- R , R are independently H or alkyl groups, and may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 5.
- R 3 of include a group having the formula of
- R3 of a nucleotide analog of Formula II may include a group having the formula of wherein R 1 , R 2 , R 3 , and R 4 are independently H or alkyl groups, and two or more of which may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 3.
- R3 of include a group having the formula of
- Ri of a nucleotide analog of Formula II may include a C-C triple bond, a S-S bond, or both a C-C triple bond and a S-S bond.
- Ri in the nucleotide analog of Formula II includes a group having the formula of
- R 6 is a H or an alkyl group; q and r independently is an integer from about 1 to about 10. [0040] In some embodiments, q is 1 or 2 and r is 1, 2 or 3. [0041] In some embodiments, R 2 is a tri-valent radical having the formula: wherein each Of R 2 ' and R 2 " is a bi-valent or tri-valent radical selected from:
- R 2 " is -(CH 2 ) x - or -(CH 2 -O) x -, where x is 2, 3, 4, 5, or 6.
- R 2 " is -(CH 2 -O) Z -(CH 2 ) y - or -(CH 2 ) Z -(CH 2 -O) y -, , where y+z is 2, 3, 4, 5, or 6.
- Advantages of these analogs include increased stability and enhanced level of inhibition, allowing more optimal spacing of the inhibitor moiety within/on the polymerase to increase effective inhibition.
- Exemplary compounds include:
- Atto647N is typically used in the form of a carboxylic acid or ester
- Atto647N (and other Atto dyes) is reacted with an amine group to for the above molecule - hence to amide moiety between the Atto647N and the back bone of the molecule.
- Atto dyes may be couple to the rest of the molecule by other linkages than an amide linkage, although amide is often preferred for convenience of preparation.
- the analogs of the invention may also be represented as follows, for example.
- the location of the charged moiety within the inhibitor group and/or the distance of the charged group to the NTP plays an important role in the effectiveness of inhibiting a subsequent nucleotide incorporation.
- the charged moiety of the inhibitor is from about 5 to about 60 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 40 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 35 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 30 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 20 bonds away from the NTP.
- the above compound (about 17X fold inhibition) exhibits an inhibiting effect that is much less than the following compound (about 7OX fold inhibition).
- the label may be any moiety that can be attached to or associated with, e.g., directly or via a linker or spacer, an oligonucleotide and that functions to provide a detectable signal, and/or to interact with a second label to modify the detectable signal provided by the first or second label, e.g. fluorescence resonance energy transfer (FRET).
- FRET fluorescence resonance energy transfer
- the label is an optically-detectable moiety (e.g., a fluorophore).
- types of optically-detectable labels include a fluorescent, chemiluminescence, or electrochemically luminescent label.
- fluorescent labels include, but are not limited to, 4-acetamido-4'-isothiocyanatostilbene-2,2'disulfonic acid; acridine and derivatives thereof such as acridine, acridine isothiocyanate; 5-(2'-aminoethyl)aminonaphthalene-l -sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5disulfonate; N-(4-anilino-l- naphthyl)maleimide; anthranilamide; BODIPY; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4- trifluoromethylcouluarin (Coumaran 15 1); cyanine dyes; cyanosine; 4',6-diaminidino
- each R x is independently selected from the group consisting of H, alkyl, and substituted alkyl.
- the above exemplary label moieties include any derivatives containing the chromophore of any of the labeling moieties exemplified or described herein, attached to the nucleotide analog by means of any suitable chemical linking group.
- the chromophore can be attached to the nucleotide analog via an alkyl chain bonded to the nucleotide analog by a functional group such as an amide, ester, ether, amine, thiol, disulfide, urea, urethane, carbonate, etc.
- the label is a fluorescent label such as cyanine-3 and cyanine-5.
- Labels other than fluorescent labels are contemplated as part of the invention, including other optically-detectable labels. Any appropriate detectable label can be used according to the invention, and numerous other labels are known to those skilled in the art.
- the invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein.
- the nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002; and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes.
- methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
- a target nucleic acid also referred to herein as template nucleic acid or template
- primer that is complementary to at least a portion of the target nucleic acid
- the invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein.
- the nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002; and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes.
- methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
- a target nucleic acid also referred to herein as template nucleic acid or template
- primer that is complementary to at least a portion of the target nucleic acid
- the invention in another aspect, relates to a method for sequencing a nucleic acid.
- the method includes: (a) anchoring a nucleic acid duplex to a surface, the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or Formula II in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and (e) repeating said exposing, removing, and detecting steps at least once.
- the method may further include cleaving L from the nucleotide analog after the detecting step.
- the invention in another aspect, relates to a method for inhibiting the catalytic function of a polymerase enzyme in a sequencing-by-synthesis reaction comprising introducing a nucleotide attached to an inhibitory group.
- the invention comprises attaching one or both members of a template/primer duplex to a surface, introducing a polymerase and a nucleotide analog comprising a charged inhibitor under conditions sufficient for template- dependent incorporation of the nucleotide and inhibition of subsequent incorporation.
- Such methods further comprise removing or neutralizing the inhibitor in order to facilitate further nucleotide incorporation.
- nucleotides of the invention can be detectably labeled to monitor incorporation.
- Target nucleic acids include deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA).
- Target nucleic acid molecules can be obtained from any cellular material obtained from an animal, plant, bacterium, virus, fungus, or any other cellular organism, or may be synthetic DNA.
- Target nucleic acids may be obtained directly from an organism or from a biological sample obtained from an organism, e.g., from blood, urine, cerebrospinal fluid, seminal fluid, saliva, sputum, stool and tissue. Any tissue or body fluid specimen may be used as a source for nucleic acid for use in the invention.
- Nucleic acid molecules may also be isolated from cultured cells, such as a primary cell culture or a cell line.
- the cells from which target nucleic acids are obtained can be infected with a virus or other intracellular pathogen.
- Nucleic acid molecules may also include those of animal (including human), wild type or engineered prokaryotic or eukaryotic cells, viruses or completely or partially synthetic RNAs or DNAs.
- a sample can also be total RNA extracted from a biological specimen, a cDNA library, or genomic DNA.
- Nucleic acid typically is fragmented to produce suitable fragments for analysis.
- nucleic acid from a biological sample is fragmented by sonication.
- Test samples can be obtained as described in U.S. Patent Application 2002/0190663 Al, published October 9, 2003, herein incorporated by reference in its entirety for all purposes.
- nucleic acid can be extracted from a biological sample by a variety of techniques such as those described by Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281 (1982).
- target nucleic acid molecules can be from about 5 bases to about 20 kb, about 30 kb, or even about 40 kb or more.
- Nucleic acid molecules may be single- stranded, double-stranded, or double-stranded with single-stranded regions (for example, stem- and loop-structures)
- Single molecule sequencing includes a template nucleic acid molecule/primer duplex that is immobilized on a surface such that the duplex and/or the nucleotides (or nucleotide analogs) added to the immobilized primer are individually optically resolvable.
- the primer, template and/or nucleotide analogs are detectably labeled such that the position of an individual duplex molecule is individually optically resolvable.
- Either the primer or the template is immobilized to a solid support.
- the primer and template can be hybridized to each other and optionally covalently cross-linked prior to or after attachment of either the template or the primer to the solid support.
- methods for facilitating the incorporation of a nucleotide analog as an extension of a primer include exposing a target nucleic acid/primer duplex to one or more nucleotide analogs disclosed herein and a polymerase under conditions suitable to extend the primer in a template dependent manner.
- the primer is sufficiently complementary to at least a portion of the target nucleic acid to hybridize to the target nucleic acid and allow template-dependent nucleotide polymerization.
- the primer extension process can be repeated to identify additional nucleotide analogs in the template.
- the sequence of the template is determined by compiling the detected nucleotides, thereby determining the complementary sequence of the target nucleic acid molecule.
- Any polymerase and/or polymerizing enzyme may be employed.
- a preferred polymerase is Klenow with reduced exonuclease activity.
- Nucleic acid polymerases generally useful in the invention include DNA polymerases, RNA polymerases, reverse transcriptases, and mutant or altered forms of any of the foregoing. DNA polymerases and their properties are described in detail in, among other places, DNA Replication 2nd edition, Komberg and Baker, W. H. Freeman, New York, N.Y. (1991).
- Known conventional DNA polymerases useful in the invention include, but are not limited to, Pyrococcus furiosus (Pfu) DNA polymerase (Lundberg et al., 1991, Gene, 108: 1, Stratagene), Pyrococcus woesei (Pwo) DNA polymerase (Hinnisdaels et al., 1996, Biotechniques, 20: 186-8, Boehringer Mannheim), Thermus thermophilus (Tth) DNA polymerase (Myers and Gelfand 1991, Biochemistry 30:7661), Bacillus stearothermophilus DNA polymerase (Stenesh and McGowan, 1977, Biochim Biophys Acta 475:32), Thermococcus litoralis (TIi) DNA polymerase (also referred to as VentTM DNA polymerase, Cariello et al., 1991, Polynucleotides Res, 19: 4 193, New England Biolabs), 9"Nm DNA polymerase (New England Biolabs), Stoff
- thermococcus sp Thermus aquaticus (T aq) DNA polymerase (Chien et al., 1976, J. Bacteoriol, 127: 1550), DNA polymerase, Pyrococcus kodakaraensis KOD DNA polymerase (Takagi et al., 1997, Appl. Environ. Microbiol. 63:4504), JDF-3 DNA polymerase (from thermococcus sp.
- DNA polymerases include, but are not limited to, ThermoSequenase ® , 9°NmTM, TherminatorTM, Taq, Tne, Tma, Pfu, TfI, Tth, TIi, Stoffel fragment, VentTM and Deep VentTM DNA polymerase, KOD DNA polymerase, Tgo, JDF-3, and mutants, variants and derivatives thereof.
- Reverse transcriptases useful in the invention include, but are not limited to, reverse transcriptases from HIV, HTLV-I, HTLV-11, FeLV, FIV, SIV, AMV, MMTV, MoMuLV and other retroviruses (see Levin, Cell 88:5-8 (1997); Verma, Biochim Biophys Acta. 473:1-38 (1977); Wu et al., CRC Crit Rev Biochem. 3:289-347(1975)).
- Unincorporated nucleotide analog molecules may be removed prior to or after detecting. Unincorporated nucleotide analog molecules may be removed by washing.
- a template/primer duplex is treated to remove the label and/or to cleave the molecular chain attaching the label to the nucleotide.
- nucleotide analog after removal of the label and portions of the molecular chain connecting the label to the nucleotide can be represented by:
- R is a N-containing group such as a primary amino group, a secondary amino group, a tertiary amino group, an amide group,
- R is a phosphodiester linkage connecting the nucleotide analog to a sugar of an adjacent nucleotide in the nucleic acid, or a phosphoryl group.
- z is an integer from about 1 to about 5. In some other embodiments, z is an integer from about 1 to about 3.
- the invention also provides for a method of removing a label from a labeled base, comprising(a) exposing a base of Formula I or Formula II:
- B 1 is a part of the NTP of a nucleotide analog in Formula I or Formula II, and n is an integer from about 1 to about 12.
- the reducing agent is tris (2-carboxyl ethyl) phosphine.
- the base is linked to a sugar selected from the group consisting of ribose, deoxyribose, and analogs thereof, where the base and sugar together may be present in a nucleotide in a nucleic acid.
- One embodiment of a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template , a polymerase capable of catalyzing nucleotide addition to the primer, and a labeled nucleotide analog disclosed herein under conditions to permit the polymerase to add the nucleotide analog to the primer.
- a method for sequencing may further include identifying or detecting the incorporated labeled nucleotide.
- a cleavable bond may then be cleaved, removing at least the label from the nucleotide analog.
- the exposing, detecting, and removing steps are repeated at least once. In certain embodiments, the exposing, detecting, and removing steps are repeated at least three, five, ten or even more times.
- the sequence of the template can be determined based upon the order of incorporation of the labeled nucleotides.
- a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template and a polymerase capable of catalyzing nucleotide addition to the primer.
- the polymerase is, for example, Klenow with reduced exonuclease activity.
- the polymerase adds a labeled nucleotide analog disclosed herein.
- the method may include identifying the incorporated labeled nucleotide. Once the labeled nucleotide is identified, the label and at least a portion of a molecular chain connecting the label to the nucleotide analog are removed and the remaining portion of the molecular chain includes a free hydroxyl group.
- the exposing, incorporating, identifying, and removing steps are repeated at least once, preferably multiple times depending on the application.
- the sequence of the template is determined based upon the order of incorporation of the labeled nucleotides.
- Removal of a label from a labeled nucleotide analog and/or cleavage of the molecular chain linking a nucleotide analog to a label may include contacting or exposing the labeled nucleotide with a reducing agent.
- Such reducing agents include, for example, dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP), tris(3-hydroxy -propyl) phosphine, tris(2-chloropropyl) phosphate (TCPP), 2-mercaptoethanol, 2-mercaptoethylarnine, cystein and ethylmaleimide.
- DTT dithiothreitol
- TCEP tris(2-carboxyethyl)phosphine
- TCPP tris(2-chloropropyl) phosphate
- 2-mercaptoethanol 2-mercaptoethylarnine
- cystein cystein
- ethylmaleimide Such contacting or exposing the reducing agent to a labeled nucleotide analog may occur at a range of pH values, for example at a pH of about 5 to about 10, or about 7 to about 9.
- the above-described methods for sequencing a nucleic acid template can further include a step of capping a molecular chain, for example, after the label has been removed.
- any optional 3' phosphate moiety can be removed enzymatically.
- an optional phosphate can be removed using alkaline phosphatase or T4 polynucleotide kinase.
- Suitable enzymes for removing optional phosphate include, any phosphatase, for example, alkaline phosphatase such as shrimp alkaline phosphatase, bacterial alkaline phosphatase, or calf intestinal alkaline phosphatase.
- any suitable detection method may be used to identify an incorporated nucleotide analog.
- exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence or chemiluminescence.
- Single-molecule fluorescence can be carried out using a conventional microscope equipped with total internal reflection (TIR) objective.
- TIR total internal reflection
- the detectable moiety associated with the extended primers can be detected on a substrate by scanning all or portions of each substrate simultaneously or serially, depending on the scanning method used.
- fluorescence labeling selected regions on a substrate may be serially scanned one-by-one or row-by-row using a fluorescence microscope apparatus, such as described in Fodor (U.S. Patent No.
- a phosphorimager device can be used (Johnston et al., Electrophoresis, 13566, 1990; Drmanac et al., Electrophoresis, 13:566, 1992; 1993).
- Other commercial suppliers of imaging instruments include General Scanning Inc., (Watertown, Mass. on the World Wide Web at genscan.com), Genix Technologies (Waterloo, Ontario, Canada; on the World Wide Web at confocal.com), and Applied Precision Inc. Such detection methods are particularly useful to achieve simultaneous scanning of multiple attached target nucleic acids.
- the present invention provides for detection of molecules ranging from a single nucleotide to a single target nucleic acid molecule.
- a number of methods are available for this purpose.
- Methods for visualizing single molecules within nucleic acids labeled with an intercalating dye include, for example, fluorescence microscopy. For example, the fluorescent spectrum and lifetime of a single molecule excited-state can be measured. Standard detectors such as a photomultiplier tube or avalanche photodiode can be used. Full field imaging with a two-stage image intensified CCD camera also can be used. Additionally, low noise cooled CCD can also be used to detect single fluorescent molecules.
- the detection system for the signal may depend upon the labeling moiety used.
- a combination of an optical fiber or charge coupled device (CCD) can be used in the detection step.
- CCD charge coupled device
- the substrate is itself transparent to the radiation used, it is possible to have an incident light beam pass through the substrate with the detector located opposite the substrate from the target nucleic acid.
- various forms of spectroscopy systems can be used.
- Various physical orientations for the detection system are available and discussion of design parameters is provided in the art.
- Optical setups include near-field scanning microscopy, far-field confocal microscopy, wide-field epi-illumination, but are not limited to, light scattering, dark field microscopy, photoconversion, single and/or multiphoton excitation, spectral wavelength discrimination, fluorophore identification, evanescent wave illumination, and total internal reflection fluorescence (TIRF) microscopy.
- TIRF total internal reflection fluorescence
- certain methods involve detection of laser-activated fluorescence using a microscope equipped with a camera.
- Suitable photon detection systems include, but are not limited to, photodiodes and intensified CCD cameras.
- an intensified charge couple device (ICCD) camera can be used.
- ICCD intensified charge couple device
- the use of an ICCD camera to image individual fluorescent dye molecules in a fluid near a surface provides numerous advantages. For example, with an ICCD optical setup, it is possible to acquire a sequence of images (movies) of fluorophores.
- TIRF microscopy uses totally internally reflected excitation light and is well known in the art. See, e g., the World Wide Web at nikoninstrurnents.jp/eng/page/products/tirf.aspx.
- detection is carried out using evanescent wave illumination and total internal reflection fluorescence microscopy.
- An evanescent light field can be set up at the surface, for example, to image fluorescently-labeled nucleic acid molecules.
- the excitation light beam penetrates only a short distance into the liquid.
- the optical field does not end abruptly at the reflective interface, but its intensity falls off exponentially with distance.
- This surface electromagnetic field called the "evanescent wave”
- the thin evanescent optical field at the interface provides low background and facilitates the detection of single molecules with high signal-to-noise ratio at visible wavelengths.
- the evanescent field also can image fluorescently-labeled nucleotides upon their incorporation into the attached target nucleic acid target molecule/primer complex in the presence of a polymerase. Total internal reflectance fluorescence microscopy is then used to visualize the attached target nucleic acid target molecule/primer complex and/or the incorporated nucleotides with single molecule resolution.
- Fluorescence resonance energy transfer can be used as a detection scheme. FRET in the context of sequencing is described generally in Braslavasky, et al., Proc. Nat'l Acad. ScL, 100: 3960-3964 (2003), incorporated by reference herein.
- a donor fluorophore is attached to the primer, polymerase, or template. Nucleotides added for incorporation into the primer comprise an acceptor fluorophore that is activated by the donor when the two are in proximity.
- Measured signals can be analyzed manually or preferably by appropriate computer methods to tabulate results.
- the signals of millions of analogs are read in parallel and then deconvoluted to ascertain a sequence.
- the substrates and reaction conditions can include appropriate controls for verifying the integrity of hybridization and extension conditions, and for providing standard curves for quantification, if desired.
- a control nucleic acid can be added to the sample. The absence of the expected extension product is an indication that there is a defect with the sample or assay components requiring correction.
- the described nucleotide analogs can be used to facilitate "four color" sequencing by synthesis if each base (A, C, G, T) is labeled with a dye emitting and/or absorbing at a different and resolvable wavelength.
- the sequencing procedure can be shortened from four separate addition cycles (i.e., one for each base) to the following: add A, C, G, T (each differently labeled) with polymerase and an appropriate reaction buffer, rinse, image the four resolvable dyes and record which base (if any) was incorporated, cleave and cap the nucleotides, and repeat.
- the described nucleotide analogs facilitate this kind of sequencing because of their ability to incorporate one and only one base at a time. Without that ability, if all four bases are added to the incorporation reaction at once multiple bases would be added to a given strand and the interactions between the proximate dyes would hinder the ability to resolve the sequence information correctly.
- the nucleotide analogs described herein can facilitate sequencing nucleic acids containing homopolymer sequences, using sequencing by synthesis methodology (e.g., using the methods of US 2007/0190546, herein incorporated by reference in its entirety for all purpose.
- sequencing by synthesis methodology e.g., using the methods of US 2007/0190546, herein incorporated by reference in its entirety for all purpose.
- nucleotide analog, and reaction buffer combination that allows for only a single nucleotide analog incorporation allows for each base in the homopolymer to be sequenced sequentially. After one base is incorporated into the homopolymer and detected, the portion of the analog that inhibits subsequent base incorporation and that contains the fluorescent label is removed, making incorporation of the next base in the homopolymer possible during the next addition cycle of the correct base.
- ⁇ - N-Fmoc-S-tert-butylthio-L-cysteine (1 g, 2.32 mmol) was dissolved in anhydrous acetonitrile and solution of dicyclohexylcarbodiimide (DCC) (573 mg, 2.78 mmol in CH 3 CN) was added followed by solution of NHS (345 mg, 3.01 mmol in CH 3 CN). After 1 hr. dicyclohexylurea was spun down and active ester used without purification in coupling with ⁇ - amino-hexanoic acid (304 mg, 2.32 mmol) dissolved in 50% aq. DMF.
- DCC dicyclohexylcarbodiimide
- DIPEA N,N'- Diisopropylethylamine
- dATP-AP3 and dCTP-AP3 were prepared by a modified procedure of Hobbs and Cocuzza: a) Pyrophosphate and tributylamine were added to the reaction mixture rather than vice versa; b) After pyrophosphate addition the reaction was quenched with 50 mM TEAB within 15 min.; c) DEAE-Sephadex chromatography was replaced by preparative HPLC.
- reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH 3 CN, 10 mL/min flow).
- HPLC Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH 3 CN, 10 mL/min flow).
- the NHS ester of the acid 32 was prepared by dissolving the acid 32 (3.0 ⁇ mol) in DMF (500.0 ⁇ L) and N,N,N',N'-Tetramethyl-O-(N-succinimidyl)uronium hexafluorophosphate (SbTMU) (4.3 mg, 12 ⁇ mol) in 100 ⁇ L DMF was added to the acid solution followed by the addition of DIPEA (80 ⁇ L). After stirring at RT for 1 hr., the reaction mixture was used immediately for peptide coupling without any purification.
- the peptide Arg- Arg-Arg-OH (14.5 mg, 30 ⁇ mol) was dissolved in 160 ⁇ L 0.5M phosphate buffer, and added to the freshly prepared NHS ester of the acid 32. The reaction mixture was stirred for 30 minutes and then the crude reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B MeCN, 10 mL/min flow).
- HPLC Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B MeCN, 10 mL/min flow).
- the NHS ester of the acid 45 was prepared by dissolving the acid 45 (4.0 ⁇ mol, 1 eqv.) in DMF (700.0 ⁇ L) and the SbTMU 5.93 mg, 16.5 ⁇ mol, in 200 ⁇ L DMF, 4.0 eqv.) was added, to the acid solution followed by the addition of DIPEA (103.0 ⁇ L). After stirring at RT for 1 hour, the reaction mixture was used immediately for peptide coupling without any purification. The peptide (Asp- Asp-Asp- Asp) was dissolved in DMFB 2 O (400.0 ⁇ L, 1 :1), basif ⁇ ed using DIPEA (50.0 ⁇ L).
- HPLC fractions containing the thiol 7 (0.34 ⁇ mol, 1 eqv.) were mixed with HPLC fractions containing dCTP-SPDP (0.41 ⁇ mol, 1.25 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH 3 CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH 3 CN, 5 mL/min flow).
- HPLC fractions containing thiol 47 were mixed with HPLC fractions containing dATP-SPDP (0.6 ⁇ mol, 1.2 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH 3 CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH 3 CN, 5 mL/min flow).
- Fmoc-Cys(S ⁇ Bu)-OH (2.0 g, 4.63 mmol, 1 eqv.) was dissolved in CH 3 CN (10 mL).
- DCC 1.2 g, 5.81 mmol, 1.26 eqv.
- NHS 0.1 g, 6.08 mmol, 1.31 eqv.
- DCU White precipitate
- 6-Aminohexanoic acid (0.60 g, 4.57 mmol, 1 eqv.) was dissolved in 1 :1 H 2 O:DMF (6 mL total). DIPEA (0.016 mL) was added to keep the pH about 8. NHS ester (4.63 mmol in 10 mL CH3CN, 1.01 eqv.) was added to the reaction mixture in 1 mL aliquots over aboutlO min. DIPEA (0.02 mL) was added after each aliquot to keep the reaction basic. After the first aliquot of NHS ester was added, the reaction became cloudy, and addition of extra H 2 O (0.2 mL) was needed to clear up the solution.
- HPLC fractions containing the thiol were mixed with HPLC fractions containing SPDP-dATP (5 ⁇ mol, 1 equiv). After the SPDP-dATP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH 3 CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH 3 CN, 10 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
- Atto647N-NHS ester (0.030 mL, 1.8 ⁇ mol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.5 ⁇ mol, 1 eqv.) in H 2 O (0.25 mL) in 10 ⁇ L aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
- HPLC fractions containing the thiol were mixed with HPLC fractions containing SPDP-dGTP (1.5 ⁇ mol, 1 eqv.). After the SPDP-dGTP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH 3 CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH 3 CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
- Atto647N-NHS ester (0.011 mL, 0.66 ⁇ mol, 0.06 M in anhydrous DMF, 2.5 eqv.) was added to a solution of amine (0.26 ⁇ mol, 1 equiv) in H2O (0.50 mL) in small aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
- HPLC fractions containing the thiol were mixed with SPDP-dCTP (1 ⁇ mol, 1 eqv.) in H 2 O (0.20 mL). After the SPDP-dCTP was consumed based on LCMS analysis (about 10 min.), the reaction was partially concentrated under reduced pressure to remove CH 3 CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 3% B/min., buffer A 0.05M TEAB, buffer B CH 3 CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
- Atto647N-NHS ester (0.012 mL, 0.72 ⁇ mol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.15 ⁇ mol, 1 eqv.) in H 2 O (0.20 mL) in 5 ⁇ L aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
- Atto647N-NHS ester (0.010 mL, 0.68 ⁇ mol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.19 ⁇ mol, 1 eqv.) in H 2 O (0.40 mL) in small aliquots.
- IM K 2 HPO 4 (0.40 mL) was also added to accelerate the reaction after there was little product formed within an hour. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine.
- HPLC fractions containing thiol 57 were mixed with HPLC fractions containing SPDP-dATP (20 ⁇ mol). After -15 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 58, which was used for the subsequent reaction without quantifying.
- HPLC fractions containing thiol 57 were mixed with HPLC fractions containing SPDP-dCTP (45 ⁇ mol). After -30 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 61, which was used for the subsequent reaction without quantifying.
- Acid 54 (0.14 g, 0.34 mmol) was dissolved in MeCN (0.6 mL). DCC (0.085 g, 0.41 mmol) was added, followed by NHS (0.051 g, 0.44 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within five minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 55 was then added to a solution of 6-aminocaproic acid (0.049 g, 0.37 mmol) in H 2 O (0.4 mL) and DMF (0.4 mL). DIPEA (0.05 mL) was added to keep the pH ⁇ 8.
- Acid 64 (0.12 g, 0.23 mmol) was dissolved in MeCN (0.6 mL). DCC (0.058 g, 0.28 mmol) was added, followed by NHS (0.035 g, 0.3 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within twenty minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 65 was then added to a solution of H-Asp-Asp-OH (0.075 g, 0.30 mmol) in 0.1 M K 2 HPO 4 (0.5 mL) and MeCN (0.5 mL).
- HPLC fractions containing thiol 67 were mixed with HPLC fractions containing SPDP-dUTP (50 ⁇ mol). After ⁇ 1 hr the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 68, which was used for the subsequent reaction without quantifying.
- Carbamate 68 (unqualified, -50 ⁇ mol) was treated with 20% piperidine/MeCN (2 mL) for 30 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was dissolved in 50 mM TEAB buffer ( ⁇ 3mL), causing formation of copious white precipitate (dibenzylfulvene). The mixture was transferred to eppendorf tubes and centrifuged to remove the precipitate.
- HPLC fractions containing thiol 73 were mixed with HPLC fractions containing SPDP-dGTP (58 ⁇ mol). After -30 minutes the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual ⁇ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 74, which was used for the subsequent reaction without quantifying.
- nucleotide analogs disclosed here include compounds which otherwise correspond thereto, and which have the same general properties thereof, wherein one or more simple variations of substituents or components are made which do not adversely affect the characteristics of the nucleotide analogs of interest.
- the components of the nucleotide analogs disclosed herein may be prepared by the methods illustrated in the general reaction schema as described herein or by modifications thereof, using readily available starting materials, reagents, and conventional synthesis procedures.
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Molecular Biology (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The invention provides for novel nucleotide analogs and methods of using the same, e.g., for sequencing nucleic acids.
Description
NUCLEOTIDE ANALOGS
Related Applications
[0001 ] This application claims priority to PCT International Application No. PCT/US08/59446 filed April 4, 2008 and to U.S. Application Serial No. 12/098,196 filed on April 4, 2008, and to U.S. Application Serial No. 12/244,698 filed October 2, 2008, the entire contents of each of which are expressly incorporated herein by reference for all purposes.
Field of the Invention
[0002] The invention relates to nucleotide analogs and methods for sequencing a nucleic acid using the nucleotide analogs.
Background
[0003] Sequencing-by-synthesis involves the template- dependent addition of nucleotides to a template/primer duplex. Traditional sequencing-by-synthesis is performed using dye- labeled terminators and gel electrophoresis (so-called "Sanger sequencing"). See, e.g., Sanger, F. and Coulson, A.R., 1975, J. MoI. Biol. 94: 441-448; Sanger, F. et al, 1977, Nature. 265(5596): 687-695; and Sanger, F. et al, 1977, Proc. Natl. Acad. ScL U.S.A. 75: 5463-5467. Recently, single molecule sequencing methods have been proposed that provide increased resolution, throughput, and speed at reduced cost. For example, a sequencing-by-synthesis method that results in sequence determination without consecutive base incorporation, has been proposed by Braslavsky, et al., Proc. Nat'lAcad. Sd., 100: 3960-3964 (2003). These methods do not rely on the user of terminator nucleotides as in Sanger sequencing. Instead, template/primer duplex is anchored directly, or indirectly (e.g., via a polymerase enzyme) to a surface and labeled nucleotides are added in a template-dependent manner.
[0004] A challenge that has arisen in single molecule sequencing involves the ability to sequence through homopolymer regions (i.e., portions of the template that contain consecutive identical nucleotides). Often the number of bases present in a homopolymer region is important from the point of view of genetic function. Many polymerase enzymes used in sequencing-by- synthesis reactions are highly-processive and tend to add bases continuously in a homopolymer region. It is often difficult to resolve the number of nucleotides in a homopolymer due to the
difficulty in distinguishing between the incorporation of one or two labeled nucleotides and the incorporation of a greater number of nucleotides.
[0005] A need therefore exists for nucleotide analogs that promote accurate base-over- base incorporation in sequencing-by-synthesis reactions.
Summary of the Invention
[0006] The invention provides nucleotide analogs and methods of using them to allow sequencing-by-synthesis to occur such that, on average, a single nucleotide is incorporated into the 3' end of a primer portion of a template/primer duplex per sequencing cycle. The invention is based, in part, on the discovery that nucleotide analogs having an attached inhibitory region with one or more charged groups provide good incorporation of a single nucleotide into the duplex without allowing a significant, or any, amount of second, third, etc. base incorporation.
[0007] The invention generally provides nucleotide analogs and methods of using nucleotide analogs in sequencing. More particularly, the invention provides compounds, methods and compositions useful in introduction of a single base at a time in a template- dependent sequencing-by-synthesis reaction. The invention allows template-dependent sequencing-by-synthesis through all regions of a target nucleic acid, including homopolymer regions, and provides methods for the determination of the number of nucleotides present in a homopolymer region.
[0008] The invention provides nucleotide analogs that comprise a nucleotide (or nucleotide analog), a detectable label, and an inhibitor group. Upon incorporation of the nucleotide, the inhibitor prevents subsequent nucleotide incorporation into the same duplex. However, upon removal of the detectable label and the inhibitor group, the nucleotide analog does not substantially hinder subsequent nucleotide (or nucleotide analog) incorporation.
[0009] In one aspect, A method for sequencing a nucleic acid. The method includes the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog comprising an inhibitor that is charged or capable of becoming charged, and a polymerase, under conditions that permit template-dependent incorporation of the analog into the primer; detecting incorporation of the analog; removing or neutralizing the inhibitor; and
repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template.
[0010] In another aspect, the invention relates to a nucleotide analog that includes: a nucleoside triphosphate; an inhibitor comprising (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more (i.e., a plurality of) singly charged groups or two or more groups capable of becoming singly charged; a detectable label; and a linker connecting the inhibitor and the label to the nucleoside triphosphate. It should be noted that in some embodiments, one or a single charged group may be sufficient to provide the desired inhibitory effect.
[0011] In yet another aspect, the invention relates to nucleotide analogs of the formula:
NTP is a nucleoside or nucleotide triphosphate or an analog thereof capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template. Inhibitor comprises a moiety that is charged or capable of becoming charged and that inhibits subsequent nucleotide incorporation once the first nucleotide is incorporated. Tether is a bond or a group linking the NTP to the Inhibitor group. In a preferred embodiment, the inhibitor is a non-steric inhibitor.
[0012] In yet another aspect, the invention relates to nucleotide analogs of Formula II:
NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of template- dependent incorporation into the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP. L is a detectable label that facilitates the identification of the nucleotide analog. Inhibitor comprises (a) one or more multiply charged groups or groups
capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged. Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de- association of NTP from both Label and Inhibitor. R3 is a bond or group linking R2 to the Inhibitor. R4 is a bond or group linking R2 to a Label.
[0013] In yet another aspect, the invention relates to nucleotide analogs of nucleotide analog of the following Formula II:
wherein NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP; L is a detectable label that facilitates the identification of the nucleotide analog; Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged; Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor; R2 is a tri-valent radical having the formula:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10; R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
[0014] In yet another aspect, the invention relates to a method for sequencing a nucleic acid. The method includes: (a) anchoring a nucleic acid duplex, or portion thereof, to a surface,
the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or II (as defined herein) in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and repeating the exposing, removing, and detecting steps at least once.
[0015] In yet another aspect, the invention relates to a method for sequencing a nucleic acid, the method comprising the steps of: (a) exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog of the following Formula II:
(b) detecting incorporation of the analog; (c) removing or neutralizing the inhibitor; and (d) repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template, wherein NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3 ' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP; L is a detectable label that facilitates the identification of the nucleotide analog; Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged; Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor; R2 is a tri-valent radical having the formula:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-,
and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10; R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
[0016] In yet another aspect, the invention provides methods and nucleotide analogs for selectively inhibiting the catalytic function of a polymerase enzyme. As such, nucleotide analogs comprise an inhibitory portion, such that the nucleotide analog is capable of being incorporated into a nucleic acid duplex but then inhibits subsequent nucleotide incorporation until the inhibitory portion is removed.
[0017] The inhibitory portion of an analog of the invention preferably is a charged group. The charged group can take any appropriate form as long as it carries a charge. Preferably, the charge group is selected from a phosphate, a carboxylic acid (or carboxylate), a sulfate, caproic acid (or a caproic acid derivative), a charged amino acid, -SO3, -SO2, and - NRWRV, where Rw and Rv independently is H, an alkyl or aryl group. The charged group can convey a negative or positive charge, but negative charged groups are preferred. In another preferred embodiment, the charge group contains multiple charged portions. For example, the charge group can be a dipeptide, a di-phosphate, disulfate, or other multiples of charged moieties. For example, amino acid inhibitors are preferably selected from aspartic acid, glutamic acid, arginine, lysine, and histidine.
[0018] The invention provides charged inhibitors of subsequent base incorporation in a sequencing-by-synthesis reaction. By subsequent base incorporation it is intended that a first nucleotide (or analog) is incorporated in a template-dependent manner, but second, third, etc. base incorporation is inhibited by the inhibitor group. In a preferred embodiment, inhibition occurs by positioning a charged group in proximity to the active site of a polymerase enzyme, thus disabling the ability of the polymerase to make subsequent incorporations. Without being limited to theory, analogs of the invention, interfere with magnesium present in the active site of the polymerase, resulting in a reduced ability of the active site to catalyze subsequent nucleotide incorporation.
[0019] In a preferred embodiment, an analog of the invention comprises a nucleoside triphosphate, an inhibitor comprising a plurality of charged groups, a detectable label, and a linker connecting the charged groups and the label to the nucleoside triphosphate. Preferred
inhibitors comprise a plurality of charged groups and may be selected from any charged group capable of conferring a charge in a local area. Preferably, the inhibitor does not sterically inhibit a polymerase. Also in a preferred embodiment, the linker is cleavable. Multiple cleavable groups, such as enzymatically-cleavable group, such as disulfide bonds and the like.
Detailed Description of the Invention
[0020] The invention provides methods and compositions that facilitate the addition of a single nucleotide to a template/primer duplex per reaction cycle (i.e., the addition of nucleotides and polymerase enzyme under conditions that result in template-dependent nucleotide incorporation into the primer). Analogs of the invention comprise a charged inhibitory group that, upon incorporation of a nucleotide in a template-dependent manner, prevents subsequent nucleotide incorporation until the inhibitory group is removed. Thus, an analog of the invention comprises a nucleotide triphosphate, a linker (or tether), a detectable label, and a charged inhibitory group, wherein the label and the inhibitory group are removable.
[0021] In one aspect, the invention generally provides nucleotide analogs of the following Formula I:
NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
Inhibitor comprises a group that is charged or capable of becoming charged, e.g., under reaction conditions, and that inhibits a subsequent incorporation of a nucleotide (or analog thereof), and
Tether is a bond or a group linking the NTP to the Inhibitor moiety. A group is considered capable of becoming charged if the group is capable of becoming electrically non-neutral, e.g., under reaction or buffer conditions. Examples of such groups include -COOH and -NRWRV, where Rw and Rv independently is H, an alkyl or aryl group.
[0022] In one embodiment, the inhibitor group can cause inhibition of subsequent nucleotide incorporation without steric hinderance. In other words, the inhibition is caused by chemical or charge interaction with the enzyme and not be a physical blocking of the enzyme. In another embodiment, the charged inhibitor also provides steric inhibition of enzyme activity. However, in either case, the inhibitor group is charged.
[0023] Natural NTPs include nucleoside triphosphates, adenosine triphosphate (ATP), guanosine triphosphate (GTP), cytidine triphosphate (CTP), thymidine triphosphate (TTP) and uridine triphosphate (UTP); and nucleotide triphosphates, deoxyadenosine triphosphate (dATP), deoxyguanosine triphosphate (dGTP), deoxycytidine triphosphate (dCTP), deoxythimidine triphosphate (dTTP) and deoxyuridine triphosphate (dUTP). NTPs useful in this invention include non-nature nucleosides and nucleotides, and analogs and derivatives thereof.
[0024] In some embodiments, the inhibitor may include a moiety that is negatively charged or capable of becoming a negatively charged. In other embodiments, the inhibitor group is positively charged or capable of becoming positively charged.
[0025] In some other embodiments, the inhibitor is an amino acid or an amino acid analog. The Inhibitor may be a peptide of 2 to 20 units of amino acids or analogs, a peptide of 2 to 10 units of amino acids or analogs, a peptide of 3 to 7 units of amino acids or analogs, a peptide of 3 to 5 units of amino acids or analogs. In some embodiments, the Inhibitor includes a group selected from the group consisting of GIu, Asp, Arg, His, and Lys, and a combination thereof (e.g., Arg, Arg- Arg, Asp, Asp- Asp, GIu, Glu-Glu, Asp-Glu-Asp, Asp-Asp-Glu or Asp Asp Asp Asp). Peptides or groups may be combinations of the same or different amino acids or analogs.
[0026] In one embodiment, the invention relates to an oligonucleotide with at least one nucleotide analog of the invention incorporated therein.
[0027] In some embodiments, the Tether comprises
wherein L is detectable label that facilitates the identification of the nucleotide analog after incorporation onto a template;
Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
[0028] In another aspect, the present invention is directed to nucleotide analogs of Formula II:
NTP is a nucleoside triphosphate or an analog thereof capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label to facilitate the identification of the nucleotide analog after incorporation onto the template;
Inhibitor is a moiety that substantially inhibits a subsequent incorporation of a nucleotide (or analog thereof). In some embodiments, the Inhibitor moiety includes a nucleotide or nucleoside or analogs thereof, in other embodiments, the inhibitor is not a nucleotide or analog thereof;
Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both Label and Inhibitor;
R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to L.
[0029] In some embodiments, NTP is a compound having the following formula:
wherein B is selected from the group consisting of purine or pyrimidine bases, as well as derivatives of purine and pyrimidine bases; R' is independently selected from the group consisting of-OH, -0-P(O)(OH)2, -O-C(O)-RX, -NHRy, and an -O-blocking agent, where Rx and Ry are alkyl groups; R" is independently selected from the group consisting of H and -OH.
[0030] Non-limiting examples of representative purine and pyrimidine bases include adenine, cytosine, guanine, thymine, uracil, or hypoxanthine. Non-limiting examples of derivatives of purine and pyrimidine bases include naturally-occurring and synthetic derivatives of a base, including pyrazolo[3,4-d]pyrimidines, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2-thiothymine and 2-thiocytosine, 5-propynyl uracil and cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8-halo (e.g., 8-bromo), 8-amino, 8-thiol, 8- thioalkyl, 8-hydroxyl and other 8-substituted adenines and guanines, 5-halo particularly 5- bromo, 5-trifluoromethyl and other 5-substituted uracils and cytosines, 7-methylguanine and 7- methyladenine, 8-azaguanine and 8-azaadenine, deazaguanine, 7-deazaguanine, 3-deazaguanine, deazaadenine, 7-deazaadenine, 3-deazaadenine, pyrazolo[3,4-d]pyrimidine, imidazo[l,5-a] 1,3,5 triazinones, 9-deazapurines, imidazo[4,5-d]pyrazines, thiazo Io [4,5-d]pyrimi dines, pyrazin-2- ones, 1 ,2,4-triazine, pyridazine; and 1,3,5 triazine.
[0031] Base B1 of the invention permits a nucleotide to be incorporated into a polynucleotide chain by a polymerase and forms base pairs with a base on an antiparallel nucleic acid strand. The term base pair encompasses not only the standard AT, AU or GC base pairs, but also base pairs formed between nucleotides and/or nucleotide analogs comprising non-standard or modified bases, wherein the arrangement of hydrogen bond donors and hydrogen bond acceptors permits hydrogen bonding between a nonstandard base and a standard base or between two complementary non-standard base structures. One example of such non-standard base
pairing is the base pairing between the nucleotide analog inosine and adenine, cytosine or uracil, where two hydrogen bonds are formed.
[0032] The Inhibitor may include a charged moiety (e.g., a negatively charged moiety, a positively charged moiety, or both) or a moiety that is capable of becoming charged. The Inhibitor can include two or more charged groups. The Inhibitor may have a charged group selected from the group consisting of -COOH, -PO4, -SO4, -SO3, -SO2, -NRWRV, where Rw and Rv independently is H, an alkyl or aryl group. In other embodiments, the Inhibitor moiety does not comprise a -PO4 group. In some other embodiments, the Inhibitor moiety does not comprise an aryl group. In certain other embodiments, the Inhibitor does not include a nucleotide or nucleoside or analogs thereof.
[0033] Inhibitor may be a compound having the following formula:
wherein each Ai and each A2 is independently an amino acid moiety; Rs and R9 independently is a H or an alkyl group; each of x and y is an integer from 0 to about 10. In some embodiments, R8 and R9 are H atoms and x = 1 and y = 2.
[0034] R3 of a nucleotide analog of Formula II may include a group having the formula of
wherein R5 is a H or an alkyl group; p is an integer from 0 to about 10. In some embodiments, p is 5 or 6.
[0035] In some embodiments, R3 of a nucleotide analog of Formula II may include a group having the formula of
wherein k is an integer from about 1 to about 5. In some embodiments, k is an integer from about 2 to about 4. In some embodiments, k is 3.
[0036] In some embodiments, R3 of a nucleotide analog of Formula II may include a group having the formula of
wherein R , R are independently H or alkyl groups, and may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 5. In some embodiments, R3 of include a group having the formula of
[0037] In some embodiments, R3 of a nucleotide analog of Formula II may include a group having the formula of
wherein R1, R2, R3, and R4 are independently H or alkyl groups, and two or more of which may together form one or more 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 3. In some embodiments, R3 of include a group having the formula of
[0038] Ri of a nucleotide analog of Formula II may include a C-C triple bond, a S-S bond, or both a C-C triple bond and a S-S bond.
[0039] In some embodiments, Ri in the nucleotide analog of Formula II includes a group having the formula of
wherein R6 is a H or an alkyl group; q and r independently is an integer from about 1 to about 10. [0040] In some embodiments, q is 1 or 2 and r is 1, 2 or 3. [0041] In some embodiments, R2 is a tri-valent radical having the formula:
wherein each Of R2' and R2" is a bi-valent or tri-valent radical selected from:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10.
[0042] In some detailed embodiments, R2" is -(CH2) x- or -(CH2-O) x-, where x is 2, 3, 4, 5, or 6. In some other detailed embodiments, R2" is -(CH2-O) Z-(CH2) y- or -(CH2) Z-(CH2-O)y-, , where y+z is 2, 3, 4, 5, or 6. Advantages of these analogs include increased stability and enhanced level of inhibition, allowing more optimal spacing of the inhibitor moiety within/on the polymerase to increase effective inhibition. Exemplary compounds include:
Table 1
[0043] As Atto647N is typically used in the form of a carboxylic acid or ester, Atto647N (and other Atto dyes) is reacted with an amine group to for the above molecule - hence to amide moiety between the Atto647N and the back bone of the molecule. It is noted here that Atto dyes may be couple to the rest of the molecule by other linkages than an amide linkage, although amide is often preferred for convenience of preparation. In this regard, the analogs of the invention may also be represented as follows, for example.
[0044] In some embodiments of the invention, the location of the charged moiety within the inhibitor group and/or the distance of the charged group to the NTP plays an important role in the effectiveness of inhibiting a subsequent nucleotide incorporation. In some embodiments, the charged moiety of the inhibitor is from about 5 to about 60 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 40 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from
about 10 to about 35 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 30 bonds away from the NTP. In some other embodiments, the charged moiety of the inhibitor is from about 10 to about 20 bonds away from the NTP.
[0045] For example, the above compound (about 17X fold inhibition) exhibits an inhibiting effect that is much less than the following compound (about 7OX fold inhibition).
[0046] The label (or "L") may be any moiety that can be attached to or associated with, e.g., directly or via a linker or spacer, an oligonucleotide and that functions to provide a detectable signal, and/or to interact with a second label to modify the detectable signal provided by the first or second label, e.g. fluorescence resonance energy transfer (FRET). In one embodiment, the label is an optically-detectable moiety (e.g., a fluorophore). Non-limiting examples of types of optically-detectable labels include a fluorescent, chemiluminescence, or electrochemically luminescent label. Examples of fluorescent labels include, but are not limited to, 4-acetamido-4'-isothiocyanatostilbene-2,2'disulfonic acid; acridine and derivatives thereof such as acridine, acridine isothiocyanate; 5-(2'-aminoethyl)aminonaphthalene-l -sulfonic acid (EDANS); 4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5disulfonate; N-(4-anilino-l- naphthyl)maleimide; anthranilamide; BODIPY; Brilliant Yellow; coumarin and derivatives; coumarin, 7-amino-4-methylcoumarin (AMC, Coumarin 120), 7-amino-4- trifluoromethylcouluarin (Coumaran 15 1); cyanine dyes; cyanosine; 4',6-diaminidino-2- phenylindole (DAPI); 5',5"-dibromopyrogallol-sulfonaphthalein (Bromopyrogallol Red); 7- diethylamino-3-(4'-isothiocyanatophenyl)-4-methylcoumarin; diethylenetriamine pentaacetate; 4,4'-diisothiocyanatodihydro-stilbene-2,2'-disulfonic acid; 4,4'-diisothiocyanatostilbene-2,2'-
disulfonic acid; 5-[dimethylaminolnaphthalene-l-sulfonyl chloride (DNS, dansylchloride); 4- dimethylarninophenylazophenyl-4'-isothiocyanate (DABITC); eosin and derivatives; eosin, eosin isothiocyanate, erythrosin and derivatives; erythrosin B, erythrosin, isothiocyanate; ethidium; fluorescein and derivatives; 5 -carboxy fluorescein (FAM), 5-(4,6-dichlorotriazin-2- yl)aminofluorescein (DTAF), 2',7'-dimethoxy-4'5'-dichloro-6-carboxyfluorescein, fluorescein, fluorescein isothiocyanate, QFITC, (XRITC); fluorescamine; IR144; IR1446; Malachite Green isothiocyanate; 4-methylumbelliferoneortho cresolphthalein; nitrotyrosine; pararosaniline; Phenol Red; B-phycoerythrin; o-phthaldialdehyde; pyrene and derivatives: pyrene, pyrene butyrate, succinimidyl 1 -pyrene; butyrate quantum dots; Reactive Red 4 (Cibacron™ Brilliant Red 3B-A) rhodamine and derivatives: 6-carboxy-X-rhodamine (ROX), 6-carboxyrhodamine (R6G), lissamine rhodamine B sulfonyl chloride rhodarnine (Rhod), rhodamine B, rhodamine 123, rhodamine X isothiocyanate, sulforhodamine B, sulforhodamine 101, sulfonyl chloride derivatives of sulforhodamine 101 (Texas Red); N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA); tetramethyl rhodamine; tetramethyl rhodamine isothiocyanate (TRITC); riboflavin; rosolic acid; terbium chelate derivatives; Cy3; Cy5; Cy5.5; Cy7; IRD 700; IRD 800; La Jolta Blue; phthalocyanine; naphthalocyanine; any of the fluorescent labels available from Atto-Tec, such as Atto 390, Atto 425, Atto 465, Atto 488, Atto 495, Atto 520, Atto 532, Atto 550, Atto 565, Atto 590, Atto 594, Atto 610, Atto 61 IX, Atto 620, Atto 633, Atto 635, Atto 637, Atto 647, Atto 647N, Atto 655, Atto 680, Atto 700, Atto 725, Atto 740, etc.; any of the fluorescent labels available from Dyomics such as DY-630, DY-631, DY-632, DY-633, DY-634, DY-635, DY- 636, Dy-647, Dy-648, DY-649, Dy-650, Dy-651, DY-652, etc.; any of the fluorescent labels available from Pierce such as DyLight 405, DyLight 488, DyLight 549, DyLight 633, DyLight 649, DyLight 680, DyLight 800, etc.; any of the fluorescent labels available from AnaSpec such as HiLyte FluoriM 4. S. S dyes, HiLytc F!uor1M 555 Jycs. HiLyte Fluor™ 647 dyes, HiLytc HuorIM 680 dyes. HiLytc Fluor1* 750 dyes, HiLytePlus™ 555 dyes, HiLytePlus fM (47 dyes, HiLytcPlu-,3 M 750 dyes, etc ; any of the fluorescent labels available from Denovo Biolables such as Oyster 500, Oyster 550 P, Oyster 550 D, Oyster 556, Oyster 645, Oyster 650 P, Oyster 650 D, Oyster 656, etc.; IRDye® 680, IRDye® 700, IRDye® 700DX, IRDye® 800, IRDye® 800 RS, IRDye® 800 CW, etc.; any of the fluorescent labels available from SETA Biomedicals such as Seta Kl -204, Seta K5-3212, Seta K8- 1342, Seta K8- 1352, Seta K8- 1357, Seta K8- 1407, Seta K8-1642, Seta K8- 1644, Seta K8- 1663, Seta K8- 1664, Seta K8- 1669, Seta K8-3002, Seta K4-
1082, SetaK8-1669, SetaK7-545, SetaK7-547, SetaK7-549, SetaK8-1252, SetaK8-1261, Seta K8-1262, Seta K8- 1320, Seta K8- 1344, Seta K8- 1367, Seta K8- 1377, Seta K8- 1382, SetaK8- 1446, SetaK8-1667, SetaK8-1752, SetaK8-1762, SetaK8-1767, SetaK8-1777, SetaK8-1782, etc.; Q Dots; and dyes having the following structures:
wherein each Rx is independently selected from the group consisting of H, alkyl, and substituted alkyl.
[0047] The above exemplary label moieties include any derivatives containing the chromophore of any of the labeling moieties exemplified or described herein, attached to the nucleotide analog by means of any suitable chemical linking group. For example, the chromophore can be attached to the nucleotide analog via an alkyl chain bonded to the nucleotide analog by a functional group such as an amide, ester, ether, amine, thiol, disulfide, urea, urethane, carbonate, etc. In one embodiment, the label is a fluorescent label such as cyanine-3 and cyanine-5.
[0048] Labels other than fluorescent labels are contemplated as part of the invention, including other optically-detectable labels. Any appropriate detectable label can be used according to the invention, and numerous other labels are known to those skilled in the art.
[0049] The invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein. The nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002;
and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes. In general, methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
[0050] The invention also relates to methods for nucleic acid sequence determination using the nucleotide analogs described herein. The nucleotide analogs of the invention are particularly suitable for use in single molecule sequencing techniques. Such techniques are described for example in U.S. Patent Application Serial Nos. 10/831,214 filed April 2004; 10/852,028 filed May 24,2004; 10/866,388 filed June 10,2005; 10/099,459 filed March 12,2002; and U.S. Published Application 2003/013880 published July 24, 2003, each of which is herein incorporated in its entirety for all purposes. In general, methods for nucleic acid sequence determination include exposing a target nucleic acid (also referred to herein as template nucleic acid or template) to a primer that is complementary to at least a portion of the target nucleic acid, under conditions suitable for hybridizing the primer to the target nucleic acid, forming a template/primer duplex.
[0051] In another aspect, the invention relates to a method for sequencing a nucleic acid. The method includes: (a) anchoring a nucleic acid duplex to a surface, the duplex comprising a template portion and a primer portion hybridized thereto; (b) exposing the duplex to nucleotide analog of Formula I or Formula II in the presence of a polymerase capable of catalyzing the addition of the nucleotide analog to the primer portion in a template- dependent manner; (c) removing unincorporated nucleotide analog and polymerase; (d) detecting incorporation of the nucleotide analog into the primer portion; and (e) repeating said exposing, removing, and detecting steps at least once. The method may further include cleaving L from the nucleotide analog after the detecting step.
[0052] In another aspect, the invention relates to a method for inhibiting the catalytic function of a polymerase enzyme in a sequencing-by-synthesis reaction comprising introducing a nucleotide attached to an inhibitory group. In one aspect, the invention comprises attaching one or both members of a template/primer duplex to a surface, introducing a polymerase and a
nucleotide analog comprising a charged inhibitor under conditions sufficient for template- dependent incorporation of the nucleotide and inhibition of subsequent incorporation. Such methods further comprise removing or neutralizing the inhibitor in order to facilitate further nucleotide incorporation. Finally, nucleotides of the invention can be detectably labeled to monitor incorporation.
[0053] Target nucleic acids include deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA). Target nucleic acid molecules can be obtained from any cellular material obtained from an animal, plant, bacterium, virus, fungus, or any other cellular organism, or may be synthetic DNA. Target nucleic acids may be obtained directly from an organism or from a biological sample obtained from an organism, e.g., from blood, urine, cerebrospinal fluid, seminal fluid, saliva, sputum, stool and tissue. Any tissue or body fluid specimen may be used as a source for nucleic acid for use in the invention. Nucleic acid molecules may also be isolated from cultured cells, such as a primary cell culture or a cell line. The cells from which target nucleic acids are obtained can be infected with a virus or other intracellular pathogen. Nucleic acid molecules may also include those of animal (including human), wild type or engineered prokaryotic or eukaryotic cells, viruses or completely or partially synthetic RNAs or DNAs. A sample can also be total RNA extracted from a biological specimen, a cDNA library, or genomic DNA.
[0054] Nucleic acid typically is fragmented to produce suitable fragments for analysis. In one embodiment, nucleic acid from a biological sample is fragmented by sonication. Test samples can be obtained as described in U.S. Patent Application 2002/0190663 Al, published October 9, 2003, herein incorporated by reference in its entirety for all purposes. Generally, nucleic acid can be extracted from a biological sample by a variety of techniques such as those described by Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp. 280-281 (1982). Generally, target nucleic acid molecules can be from about 5 bases to about 20 kb, about 30 kb, or even about 40 kb or more. Nucleic acid molecules may be single- stranded, double-stranded, or double-stranded with single-stranded regions (for example, stem- and loop-structures)
[0055] Single molecule sequencing includes a template nucleic acid molecule/primer duplex that is immobilized on a surface such that the duplex and/or the nucleotides (or nucleotide
analogs) added to the immobilized primer are individually optically resolvable. The primer, template and/or nucleotide analogs are detectably labeled such that the position of an individual duplex molecule is individually optically resolvable. Either the primer or the template is immobilized to a solid support. The primer and template can be hybridized to each other and optionally covalently cross-linked prior to or after attachment of either the template or the primer to the solid support.
[0056] In general, methods for facilitating the incorporation of a nucleotide analog as an extension of a primer include exposing a target nucleic acid/primer duplex to one or more nucleotide analogs disclosed herein and a polymerase under conditions suitable to extend the primer in a template dependent manner. Generally, the primer is sufficiently complementary to at least a portion of the target nucleic acid to hybridize to the target nucleic acid and allow template-dependent nucleotide polymerization. The primer extension process can be repeated to identify additional nucleotide analogs in the template. The sequence of the template is determined by compiling the detected nucleotides, thereby determining the complementary sequence of the target nucleic acid molecule.
[0057] Any polymerase and/or polymerizing enzyme may be employed. A preferred polymerase is Klenow with reduced exonuclease activity. Nucleic acid polymerases generally useful in the invention include DNA polymerases, RNA polymerases, reverse transcriptases, and mutant or altered forms of any of the foregoing. DNA polymerases and their properties are described in detail in, among other places, DNA Replication 2nd edition, Komberg and Baker, W. H. Freeman, New York, N.Y. (1991). Known conventional DNA polymerases useful in the invention include, but are not limited to, Pyrococcus furiosus (Pfu) DNA polymerase (Lundberg et al., 1991, Gene, 108: 1, Stratagene), Pyrococcus woesei (Pwo) DNA polymerase (Hinnisdaels et al., 1996, Biotechniques, 20: 186-8, Boehringer Mannheim), Thermus thermophilus (Tth) DNA polymerase (Myers and Gelfand 1991, Biochemistry 30:7661), Bacillus stearothermophilus DNA polymerase (Stenesh and McGowan, 1977, Biochim Biophys Acta 475:32), Thermococcus litoralis (TIi) DNA polymerase (also referred to as Vent™ DNA polymerase, Cariello et al., 1991, Polynucleotides Res, 19: 4 193, New England Biolabs), 9"Nm DNA polymerase (New England Biolabs), Stoffel fragment, Thermosequenase (Amersham Pharmacia Biotech UK), Therminator (New England Biolabs), Thermotoga maritima (Tma) DNA polymerase (Diaz and Sabino, 1998 Braz J Med. Res, 3 1 : 1239),
Thermus aquaticus (T aq) DNA polymerase (Chien et al., 1976, J. Bacteoriol, 127: 1550), DNA polymerase, Pyrococcus kodakaraensis KOD DNA polymerase (Takagi et al., 1997, Appl. Environ. Microbiol. 63:4504), JDF-3 DNA polymerase (from thermococcus sp. JDF-3, Patent application WO 0132887), Pyrococcus GB-D (PGB-D) DNA polymerase (also referred as Deep VentTMD NA polymerase, Juncosa-Ginesta et al., 1994, Biotechniques, 16:820, New England Biolabs), UITma DNA polymerase (from thermophile Thermotoga maritima; Diaz and Sabino, 1998 Braz J. Med. Res, 3 1 : 1239; PE Applied Biosystems), Tgo DNA polymerase (from thermococcus gorgonarius, Roche Molecular Biochemicals), E. coli DNA polymerase I (Lecomte and Doubleday, 1983, Polynucleotides Res. 11 :7505), T7 DNA polymerase (Nordstrom et al., 198 1, J Biol. Chem. 256:3 1 12), and archaeal DP1I/DP2 DNA polymerase II (Cann et al., 1998, Proc Natl Acad. Sci. USA 95: 14250-5).
[0058] Other DNA polymerases include, but are not limited to, ThermoSequenase®, 9°Nm™, Therminator™, Taq, Tne, Tma, Pfu, TfI, Tth, TIi, Stoffel fragment, Vent™ and Deep Vent™ DNA polymerase, KOD DNA polymerase, Tgo, JDF-3, and mutants, variants and derivatives thereof. Reverse transcriptases useful in the invention include, but are not limited to, reverse transcriptases from HIV, HTLV-I, HTLV-11, FeLV, FIV, SIV, AMV, MMTV, MoMuLV and other retroviruses (see Levin, Cell 88:5-8 (1997); Verma, Biochim Biophys Acta. 473:1-38 (1977); Wu et al., CRC Crit Rev Biochem. 3:289-347(1975)).
[0059] Unincorporated nucleotide analog molecules may be removed prior to or after detecting. Unincorporated nucleotide analog molecules may be removed by washing.
[0060] A template/primer duplex is treated to remove the label and/or to cleave the molecular chain attaching the label to the nucleotide. One may repeat the steps of exposing template/primer duplex to one or more nucleotide analogs and polymerase, detecting incorporated nucleotides, and then treating to (1) remove the label, (2) remove the label and at least a portion of the molecular chain associating the label to the nucleotide or (3) cleave the molecular chain thereby identifying additional bases in the template nucleic acid, The identified bases can be compiled to determine the sequence of the target nucleic acid. In some embodiments, at least some portions of the remaining molecular chain and/or label are not removed, for example, in the last round of primer extension.
[0061] In some embodiments, a nucleotide analog, after removal of the label and portions of the molecular chain connecting the label to the nucleotide can be represented by:
wherein B1, R', R", are as described herein, here R is a N-containing group such as a primary amino group, a secondary amino group, a tertiary amino group, an amide group,
η
; and z is an integer from about 1 to about 12. R is a phosphodiester linkage connecting the nucleotide analog to a sugar of an adjacent nucleotide in the nucleic acid, or a phosphoryl group. In some embodiments, z is an integer from about 1 to about 5. In some other embodiments, z is an integer from about 1 to about 3.
[0062] The invention also provides for a method of removing a label from a labeled base, comprising(a) exposing a base of Formula I or Formula II:
as described herein, to a reducing agent for a time sufficient to produce an unlabelled base of Formula III:
where B1 is a part of the NTP of a nucleotide analog in Formula I or Formula II, and n is an integer from about 1 to about 12. In some embodiments, the reducing agent is tris (2-carboxyl ethyl) phosphine. In other embodiments, the base is linked to a sugar selected from the group consisting of ribose, deoxyribose, and analogs thereof, where the base and sugar together may be present in a nucleotide in a nucleic acid.
[0063] One embodiment of a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template , a polymerase capable of catalyzing nucleotide addition to the primer, and a labeled nucleotide analog disclosed herein under conditions to permit the polymerase to add the nucleotide analog to the primer. A method for sequencing may further include identifying or detecting the incorporated labeled nucleotide. A cleavable bond may then be cleaved, removing at least the label from the nucleotide analog. The exposing, detecting, and removing steps are repeated at least once. In certain embodiments, the exposing, detecting, and removing steps are repeated at least three, five, ten or even more times. The sequence of the template can be determined based upon the order of incorporation of the labeled nucleotides.
[0064] In another embodiment, a method for sequencing a nucleic acid template includes exposing a nucleic acid template to a primer capable of hybridizing to the template and a polymerase capable of catalyzing nucleotide addition to the primer. The polymerase is, for example, Klenow with reduced exonuclease activity. The polymerase adds a labeled nucleotide analog disclosed herein. The method may include identifying the incorporated labeled nucleotide. Once the labeled nucleotide is identified, the label and at least a portion of a molecular chain connecting the label to the nucleotide analog are removed and the remaining portion of the molecular chain includes a free hydroxyl group. The exposing, incorporating, identifying, and removing steps are repeated at least once, preferably multiple times depending on the application. The sequence of the template is determined based upon the order of incorporation of the labeled nucleotides.
[0065] Removal of a label from a labeled nucleotide analog and/or cleavage of the molecular chain linking a nucleotide analog to a label may include contacting or exposing the labeled nucleotide with a reducing agent. Such reducing agents include, for example, dithiothreitol (DTT), tris(2-carboxyethyl)phosphine (TCEP), tris(3-hydroxy -propyl) phosphine, tris(2-chloropropyl) phosphate (TCPP), 2-mercaptoethanol, 2-mercaptoethylarnine, cystein and ethylmaleimide. Such contacting or exposing the reducing agent to a labeled nucleotide analog may occur at a range of pH values, for example at a pH of about 5 to about 10, or about 7 to about 9.
[0066] The above-described methods for sequencing a nucleic acid template can further include a step of capping a molecular chain, for example, after the label has been removed. After addition of the nucleotide analog to the primer, any optional 3' phosphate moiety can be removed enzymatically. In one embodiment, an optional phosphate can be removed using alkaline phosphatase or T4 polynucleotide kinase. Suitable enzymes for removing optional phosphate include, any phosphatase, for example, alkaline phosphatase such as shrimp alkaline phosphatase, bacterial alkaline phosphatase, or calf intestinal alkaline phosphatase.
[0067] Any suitable detection method may be used to identify an incorporated nucleotide analog. Thus, exemplary detection methods include radioactive detection, optical absorbance detection, e.g., UV-visible absorbance detection, optical emission detection, e.g., fluorescence or chemiluminescence. Single-molecule fluorescence can be carried out using a conventional microscope equipped with total internal reflection (TIR) objective. The detectable moiety associated with the extended primers can be detected on a substrate by scanning all or portions of each substrate simultaneously or serially, depending on the scanning method used. For fluorescence labeling, selected regions on a substrate may be serially scanned one-by-one or row-by-row using a fluorescence microscope apparatus, such as described in Fodor (U.S. Patent No. 5,445,934) and Mathies et al. (U.S. Patent No. 5,09 1,652). Devices capable of sensing fluorescence from a single molecule include scanning tunneling microscope (STM) and the atomic force microscope (AFM). Hybridization patterns may also be scanned using a CCD camera (e.g., Model TE/CCD512SF, Princeton Instruments, Trenton, NJ.) with suitable optics (Ploem, CCD (Chase-Completed-Device) in Fluorescent and Luminescent Probes for Biological Activity Mason, T.G. Ed., Academic Press, Landon, pp. 1-11 (1993), such as described in Yershov et al., Proc. Natl. Aca. Sci. 93:4913 (1996), or may be imaged by TV monitoring. For
radioactive signals, a phosphorimager device can be used (Johnston et al., Electrophoresis, 13566, 1990; Drmanac et al., Electrophoresis, 13:566, 1992; 1993). Other commercial suppliers of imaging instruments include General Scanning Inc., (Watertown, Mass. on the World Wide Web at genscan.com), Genix Technologies (Waterloo, Ontario, Canada; on the World Wide Web at confocal.com), and Applied Precision Inc. Such detection methods are particularly useful to achieve simultaneous scanning of multiple attached target nucleic acids.
[0068] The present invention provides for detection of molecules ranging from a single nucleotide to a single target nucleic acid molecule. A number of methods are available for this purpose. Methods for visualizing single molecules within nucleic acids labeled with an intercalating dye include, for example, fluorescence microscopy. For example, the fluorescent spectrum and lifetime of a single molecule excited-state can be measured. Standard detectors such as a photomultiplier tube or avalanche photodiode can be used. Full field imaging with a two-stage image intensified CCD camera also can be used. Additionally, low noise cooled CCD can also be used to detect single fluorescent molecules.
[0069] The detection system for the signal may depend upon the labeling moiety used. For optical signals, a combination of an optical fiber or charge coupled device (CCD) can be used in the detection step. In those circumstances where the substrate is itself transparent to the radiation used, it is possible to have an incident light beam pass through the substrate with the detector located opposite the substrate from the target nucleic acid. For electromagnetic labeling moieties, various forms of spectroscopy systems can be used. Various physical orientations for the detection system are available and discussion of design parameters is provided in the art.
[0070] A number of approaches can be used to detect incorporation of fluorescently labeled nucleotides into a single nucleic acid molecule. Optical setups include near-field scanning microscopy, far-field confocal microscopy, wide-field epi-illumination, but are not limited to, light scattering, dark field microscopy, photoconversion, single and/or multiphoton excitation, spectral wavelength discrimination, fluorophore identification, evanescent wave illumination, and total internal reflection fluorescence (TIRF) microscopy. In general, certain methods involve detection of laser-activated fluorescence using a microscope equipped with a camera. Suitable photon detection systems include, but are not limited to, photodiodes and intensified CCD cameras. For example, an intensified charge couple device (ICCD) camera can
be used. The use of an ICCD camera to image individual fluorescent dye molecules in a fluid near a surface provides numerous advantages. For example, with an ICCD optical setup, it is possible to acquire a sequence of images (movies) of fluorophores.
[0071] Some embodiments of the present invention use TIRF microscopy for two- dimensional imaging. TIRF microscopy uses totally internally reflected excitation light and is well known in the art. See, e g., the World Wide Web at nikoninstrurnents.jp/eng/page/products/tirf.aspx. In certain embodiments, detection is carried out using evanescent wave illumination and total internal reflection fluorescence microscopy. An evanescent light field can be set up at the surface, for example, to image fluorescently-labeled nucleic acid molecules. When a laser beam is totally reflected at the interface between a liquid and a solid substrate (e.g., a glass), the excitation light beam penetrates only a short distance into the liquid. The optical field does not end abruptly at the reflective interface, but its intensity falls off exponentially with distance. This surface electromagnetic field, called the "evanescent wave", can selectively excite fluorescent molecules in the liquid near the interface. The thin evanescent optical field at the interface provides low background and facilitates the detection of single molecules with high signal-to-noise ratio at visible wavelengths.
[0072] The evanescent field also can image fluorescently-labeled nucleotides upon their incorporation into the attached target nucleic acid target molecule/primer complex in the presence of a polymerase. Total internal reflectance fluorescence microscopy is then used to visualize the attached target nucleic acid target molecule/primer complex and/or the incorporated nucleotides with single molecule resolution.
[0073] Fluorescence resonance energy transfer (FRET) can be used as a detection scheme. FRET in the context of sequencing is described generally in Braslavasky, et al., Proc. Nat'l Acad. ScL, 100: 3960-3964 (2003), incorporated by reference herein. In an embodiment, a donor fluorophore is attached to the primer, polymerase, or template. Nucleotides added for incorporation into the primer comprise an acceptor fluorophore that is activated by the donor when the two are in proximity.
[0074] Measured signals can be analyzed manually or preferably by appropriate computer methods to tabulate results. Preferably, the signals of millions of analogs are read in parallel and then deconvoluted to ascertain a sequence. The substrates and reaction conditions
can include appropriate controls for verifying the integrity of hybridization and extension conditions, and for providing standard curves for quantification, if desired. For example, a control nucleic acid can be added to the sample. The absence of the expected extension product is an indication that there is a defect with the sample or assay components requiring correction.
[0075] As another example, the described nucleotide analogs can be used to facilitate "four color" sequencing by synthesis if each base (A, C, G, T) is labeled with a dye emitting and/or absorbing at a different and resolvable wavelength. The sequencing procedure can be shortened from four separate addition cycles (i.e., one for each base) to the following: add A, C, G, T (each differently labeled) with polymerase and an appropriate reaction buffer, rinse, image the four resolvable dyes and record which base (if any) was incorporated, cleave and cap the nucleotides, and repeat. The described nucleotide analogs facilitate this kind of sequencing because of their ability to incorporate one and only one base at a time. Without that ability, if all four bases are added to the incorporation reaction at once multiple bases would be added to a given strand and the interactions between the proximate dyes would hinder the ability to resolve the sequence information correctly.
[0076] For example, the nucleotide analogs described herein can facilitate sequencing nucleic acids containing homopolymer sequences, using sequencing by synthesis methodology (e.g., using the methods of US 2007/0190546, herein incorporated by reference in its entirety for all purpose. When the template sequence contains a homopolymer, using a polymerase, nucleotide analog, and reaction buffer combination that allows for only a single nucleotide analog incorporation allows for each base in the homopolymer to be sequenced sequentially. After one base is incorporated into the homopolymer and detected, the portion of the analog that inhibits subsequent base incorporation and that contains the fluorescent label is removed, making incorporation of the next base in the homopolymer possible during the next addition cycle of the correct base.
[0077] Reference to the following figures or schemes illustrating an exemplary reaction scheme and nucleotide analogs is intended in no way to limit the scope of this invention but is provided to illustrate how to prepare and use the compounds of the present invention.
Examples
Example 1. Caproic-Glu and Caproic-Glu
Synthetic Scheme I
3-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)-propionic acid 2, 5-dioxo- pyrrolidin-1-yl ester (2)
[0078] To a solution of Fmoc-Cys(S StBu)-OH (1, 2.15 g, 5.0 mmole,) dissolved in anhydrous CH2Cl2(SO mL) was added N-Ethyl-N'-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDAC, 1.146g, 6 mmole), the reaction mixture was stirred for 10 min. at room temperature (RT) and then added N-hydroxysuccinimide (NHS) (0.690 g, 6.0 mmole). To this reaction mixture was added catalytic amount of N,N'-dimethlyaminopyridine and stirred at RT until completion of reaction tested with TLC. The solvent was evaporated and the residue obtained was extracted with ethyl acetate (50 mLx2), washed with IM NaHCO3 (10 mL), followed by brine solution (20 mL) and dried over anhydrous Na2SO^ Evaporation of the solvent afforded 2 as a white crystalline solid. Yield. 2.5 g (95%).
6-[S-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)-propionylamino]-hexanoic acid (3)
[0079] To a solution of 6-Aminohexanoic acid (0.158.g, 1.2 mmole) dissolved in 0.1 M NaHCO3 (2.0 mL) was added the NHS ester 2 (0.68g, 1.3 mmole) in 4 mL of anhydrous THF. The reaction mixture was stirred at RT for 2 hr. The solvent was completely evaporated and the dried solid residue obtained was dissolved in CH3OHZCH2Cl2 mixture and purified by silica gel column chromatography using 10% CH3OH/CH2C12 and obtained 3 as a white solid on evaporation the solvent. Yield: 0.5 g (77%).
[0080] To a solution of (3, 500 mg, 0.92 mmole,) dissolved in anhydrous CH2C12/THF (1: 1)(5 mL) was added N-Ethyl-N'-(3-dimethylaminopropyl)carbodiimide hydrochloride (EDAC, 191 mg, 1.0 mmole), followed by NHS (115 mg, 1.0 mmole). To this reaction mixture was added catalytic amount of N,N'-dimethlyaminopyridine and stirred at RT until completion of reaction tested with TLC. The solvent was evaporated and the residue obtained was extracted with ethyl acetate (50 mLx2), washed with IM NaHCO3 (10 mL), followed by brine solution (10 mL) and dried over anhydrous Na2SCv Evaporation of the solvent afforded 4 as a white crystalline solid. Yield. 0.52g (88%).
2-{6-[3-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)-propionylamino]- hexanoylaminoj-pentanedioic acid (5)
[0081] To a stirred solution of Glutamic acid (20 mg, 0.14 mmole) in 0.2M NaHCO3 (0.5 mL) was added 6-[3-tert-Butyldisulfanyl-2-(9H-fluoren-9-ylmethoxycarbonylamino)- propionylamino]-hexanoic acid 2,5-dioxo-pyrrolidin-l-yl ester (4, 96 mg, 0.15 mmole) dissolved in (THF-DMF(1 : 1), 0.5 mL). The reaction mixture was stirred at RT for 10 min. and analyzed with LCMS which showed the product (5) peak with mass m/z: 671.95 [M-H]. The reaction was stirred at RT for overnight and purified by HPLC using Phenomenex Cl 8 preparative column, (250 x 21.00 mm, gradient: 2% CH3CN/50mM TEAB (triethylammonium bicarbonate), pH 8.4, 10 mL/min flow). Fractions containing the compound 5 were collected together and evaporated the solvent using rotary evaporator and dried. Yielded 5 as a white solid: 50 mg.
2-{6-[2-(9H-Fluoren-9-ylmethoxycarbonylamino)-S-mercapto-propionylamino]- hexanoylaminoj-pentanedioic acid (6)
[0082] A solution of (5) (10 mg, 0.015 mmole) in H2O-THF (1:1, 1.0 ml) was treated with tris(2-carboxyethyl)phosphine (TCEP, 0.10 mL, 0.5M in H2O). The reaction was stirred at RT for 4h until complete cleavage of disulphide bond (monitored by LCMS) and purified by HPLC using Phenomenex Cl 8 preparative column, (250 x 21.00 mm, gradient: 2% CH3CN/50mM TEAB, pH 8.4, 10 mL/min flow). Fractions containing the compound 6 were pooled and used immediately for the subsequent displacement reaction with dATP-SPDP (SPDP: N-succinimidyl 3-(2-pyridyl dithio) propionate) and dCTP-SPDP as described below. LCMS: m/s: 583.95[M-H].
Compound 7
[0083] The fractions containing compound 6 in 60% CH3CN/50 mM TEAB buffer (4.0 mg, in 4 mL) collected from HPLC were mixed with dATP-SPDP (3.6 μmole, ref. previous
patent) in 4 ml of 30% CH3N/50 mM TEAB buffer, pH 8.4 in a round bottom flask and stirred for 2h. The reaction solution was concentrated under reduced pressure, diluted with water and purified with HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, lOmL/min flow rate). Fractions containing the desired were pooled together and evaporated and dried. Yielded 7 (3.0 mg) as a white solid. LCMS: m/z: 2121.80 [M-2H], 606.05 [M/2-2H].
Compound 8
[0084] The compound 7 (2.0 mg) obtained was dissolved in anhydrous DMF (0.6 mL) added 60μl of piperidine. The reaction mixture was then stirred at RT for an hour. The complete cleavage of FMOC group was monitored by LCMS and the reaction mixture was purified by HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and evaporated and obtained 8 (1.0 μmole) as a colorless solid. LCMS: 990.95 [M/2-2H].
Compound 9
[0085] To a solution of 8 (0.5 μmole) in 0.5 mL of 50 mM K2HPO4 was added Cy5- NHS (1 mg, 1.2 μmole) dissolved in 20μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 8 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/5O mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 9a (0.36 μmole) as a blue solid. LCMS: 814.40 [M/2-2H].
[0086] Similarly a solution of 8 (0.5 μmole) in 0.5 mL of 50 mM K2HPO4 was added Atto 647N-NHS (2 mg, 2.5 μmole) dissolved in 40μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 8 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 2.0% CH3CN/5O mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 9b (0.3 μmole) as a blue solid. LCMS: 1595.2 [M-2H], 797.0 [M/2-2H].
Compound 10
[0087] The fractions containing compound 6 in 60% CH3CN/50 mM TEAB buffer (3.0 mg, in 3 mL) collected from HPLC were mixed with dCTP-SPDP (3.0 μmole, ref. previous patent) in 3 ml of 30% CH3N/50 mM TEAB buffer, pH 8.4 in a round bottom flask and stirred for 2 hr. The reaction solution was concentrated under reduced pressure, diluted with water and purified with HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and evaporated and dried. Yielded 10 (3.0 mg) as a white solid. LCMS: m/z: 1189.85 [M-2H], 594.8 [M/2-2H].
Compound 11
[0088] The compound 10 (2.0 mg) obtained was dissolved in anhydrous DMF (0.6 mL) added 60μl of piperidine. The reaction mixture was then stirred at RT for an hour. The complete cleavage of FMOC group was monitored by LCMS and the reaction mixture was purified by HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and evaporated and obtained 11 (1.2 μmole) as a colorless solid. LCMS: 967.90 [M/2-2H].
Compound 12
[0089] To a solution of 11 (0.6 μmole) in 0.5 mL of 50 mM K2HPO4 was added Cy5- NHS (1.5 mg, 1.6 μmole) dissolved in 30 μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 11 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 1.5% CH3CN/50 mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 12a (0.5 μmole) as a blue solid. LCMS: 814.40 [M/2-2H].
[0090] Similarly a solution of 11 (0.4 μmole) in 0.5 mL of 5OmM K2HPO4 was added Atto 647N-NHS (2 mg, 2.5 μmole) dissolved in 40μL of anhydrous DMF and stirred at RT until the complete disappearance of starting material 11 which was monitored by LCMS. Then the blue color reaction mixture was purified HPLC (Phenomenex Cl 8 column, 250 x 21.0 mm, gradient: 2.0% CH3CN/5O mM TEAB buffer, 10 mL/min flow rate). Fractions containing the desired were pooled together and lyophilized. Yielded 12b (0.35 μmole) as a blue solid. LCMS: 1595.2 [M-2H], 797.0 [M/2-2H].
Example 2 Caproic-Asp-Asp Synthetic Scheme II
C* cap-Asp-Asp: H
[0091] α- N-Fmoc-S-tert-butylthio-L-cysteine (1 g, 2.32 mmol) was dissolved in anhydrous acetonitrile and solution of dicyclohexylcarbodiimide (DCC) (573 mg, 2.78 mmol in CH3CN) was added followed by solution of NHS (345 mg, 3.01 mmol in CH3CN). After 1 hr. dicyclohexylurea was spun down and active ester used without purification in coupling with ε- amino-hexanoic acid (304 mg, 2.32 mmol) dissolved in 50% aq. DMF. N,N'- Diisopropylethylamine (DIPEA) was added to correct pH to 8.0. Upon completion reaction mixture was acidified to pH 3 and partitioned between water and dichloro methane (DCM). Organic layer was dried over anhydrous Na2SC^ and evaporated to give 1.33g of crude material. Purification using flash chromatography in DCM/methanol gave 745 mg of pure material (MW=544.75).
[0092] α- N-Fmoc-S-tert-butylthio-L-cyst-caproic acid (3, 77 mg, 141 μmols, CH3CN) was converted to NHS active ester using DCC (35 mg, 169 μmols, CH3CN) and NHS (21 mg, 183 μmols, ACN). After 1 hr. precipitate of dicyclohexylurea was removed by centrifugation and ester used without further purification in coupling with H-Asp-Asp-OH peptide (12 mg, 48 nmols) dissolved in 0.5M K2HPO4, pH of reaction mixture corrected to 7.5 with DIPEA. Progress of reaction was monitored by TLC (disappearance of ester) and by LC-MS (formation of product). Upon completion product was isolated by direct injection on preparative HPLC (Cl 8
column, 3 % CH3CN gradient in 50 mM TEAB, pH 8.6). Isolated product was lyophilized to give white powder (MW=774.9)
[0093] To free the thiol α- N-Fmoc-S-tert-butylthio-L-cyst-caproic-Asp-Asp-OH (15) was treated with 100 mM DTT in 0.1M K2HPO4 during 1 hr. at RT. Reaction was monitored by LC-MS and upon completion injected directly on preparative HPLC (Cl 8 column). Purification using 2% CH3CN gradient in 50 mM TEAB, pH 8.6 yielded product (MW=686.7) which was used immediately without evaporation in displacement reaction with SPDP modified nucleotide triphosphates.
[0094] dATP-AP3 and dCTP-AP3 were prepared by a modified procedure of Hobbs and Cocuzza: a) Pyrophosphate and tributylamine were added to the reaction mixture rather than vice versa; b) After pyrophosphate addition the reaction was quenched with 50 mM TEAB within 15 min.; c) DEAE-Sephadex chromatography was replaced by preparative HPLC.
[0095] SPDP modification of dATP-AP3 and dCTP-AP3 was accomplished using standard protocol: 2 μmols of dNTP-AP3 were dissolved in 250 μl of 0.1N NaHCO3 and 1.2 equivalent (eqv.) of freshly prepared 50 mM stock of SPDP in anhydrous DMF was added. Progress of modification was monitored using LC-MS. Product was isolated using preparative HPLC (C18 column ) with 1% CH3CN gradient in 50 mM TEAB, pH 8.6 gradient and used in displacement reaction with thiol without evaporation of HPLC solvents (MW=717.01 for dCTP- AP3-SPDP, MW=740.03 for dATP-AP3-SPDP).
[0096] Small aliquots of isolated thiol were added to freshly isolated dNTP-AP3-SPDP to obtain displacement product. Progress of reaction was monitored by LC-MS after every addition of thiol. Reaction was completed when all dNTP-AP3-SPDP was consumed at which point reaction mixture was concentrated and purified on preparative HPLC (C 18 column) using
1% gradient Of CH3CN in 50 mM TEAB, pH 8.6. Isolated product was lyophilized to give white powder (MW=1293.06 for cytidine-analog and MW= 1316.09 for adenosine-analog).
[0097] Removal of Fmoc-protecting group was accomplished using 20% piperidine in CH3CN (20 min., RT). Subsequently solvents were removed and crude reaction mixture purified on preparative HPLC (C 18 column) using 2% CH3CN gradient. Product was dried down and OD measured in water at 290 nm for cytidine analog (800 nmols, MW=1070.8) and 280 nm for adenosine analog (640 nmols, MW=1093.8).
[0098] Dye modified final products were prepared using following standard conditions: peptide modified dNTPs were re-dissolved in 20 mM K2HPO4 and dye -NHS dissolved in anhydrous DMF (5 mg in lOOμl) was added using initially 1.2 eqv. up to 4 eqv. to reach complete consumption of starting material. Progress of modification was monitored using LC- MS. Product was isolated using preparative HPLC (C 18 column) with 1% CH3CN gradient and 50 mM TEAB, pH 8.6. Desired fractions were combined, organic solvent removed under reduced pressure and products subjected to CH3OH repurification on C18 HPLC column (1%
CH3OH gradient). Final fractions were quantitated at 650 nm using εβso = 250000 M" cm" for
Cy5 dye and 150000 M -11Cm -"l1 for Atto 647N dye.
Example 3 Caproic-Arg-Arg-Arg
Synthetic Scheme III
Compound 32
[0099] Compound 31 (100 mg, 0.18 mmol) was dissolved in 0.8 ml DMF and added 0.2 mL piperidine and then kept at RT for 30 min. DMF was removed and the residue was purified with flash column using CH2Cl2: CH3OH (2:1). The purified amine (35 mg) was dissolved in 1 mL DMF and used directly for the next step without characterization. 3.5 mg of the purified amine in 0.1 mL DMF (10.8 μmol) was added 60 μL DMF and 40 μL DIPEA and then Cy5
Mono NHS Ester (6.63 μmol) in 100 μL anhydrous DMF was added into the solution. After 30 minutes, the reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired compound 32 were pooled and quantified; (3.0 μmol, 45 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 959.20 (M-H).
Compound 33
[00100] The NHS ester of the acid 32 was prepared by dissolving the acid 32 (3.0 μmol) in DMF (500.0 μL) and N,N,N',N'-Tetramethyl-O-(N-succinimidyl)uronium hexafluorophosphate (SbTMU) (4.3 mg, 12 μmol) in 100 μL DMF was added to the acid solution followed by the addition of DIPEA (80 μL). After stirring at RT for 1 hr., the reaction mixture was used immediately for peptide coupling without any purification. The peptide Arg- Arg-Arg-OH (14.5 mg, 30 μmol) was dissolved in 160 μL 0.5M phosphate buffer, and added to the freshly prepared NHS ester of the acid 32. The reaction mixture was stirred for 30 minutes and then the crude reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired compound 33 were pooled and quantified; (0.6 μmol, 20 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 713.45 [(M-2H)/2].
Compound 34
[00101] A solution of compound 33 (0.6 μmol) in 3 ml H2O was treated with TCEP (300 μL, IM solution) in an aluminum foil covered flask. After 30 minutes, the reaction mixture was purified with HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired thiol, analyzed with ESI-MS (negative ion mode): m/z = 669.90 [(M- 2H)/2], were pooled and immediately added dATP-SPDP (1 μmol in 1 mL H2O). After 15 minutes, LCMS analysis indicated that the completion of the reaction and the reaction mixture was then partially concentrated under reduced pressure to remove CH3CN, then purified with
HPLC (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min flow). Fractions containing the desired compound were pooled and concentrated and then purified again with HPLC using CH3OH and TEAB buffer. The fractions containing the desired compound 34 were pooled and lyophilized to yield compound 34 as a bright blue solid (0.37 μmol, 62 %, ε649 = 250000). ESI-MS (negative ion mode): m/z = 983.75 [(M-2H)/2].
Example 4 Cap- Asp-Asp- Asp-Asp
Compound 45
[00102] Cy5 Mono NHS Ester (100.0 μL, 6.63 μmol) in anhydrous DMF was added to a solution of amine 44 (13.26 μmol, 2 equiv) in DMF (100 μL) and DIPEA (20.0 μL) in an aluminum foil covered flask. After 30 minutes, the disappearance of the starting amine was determined by LCMS or HPLC. The reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired product were pooled and quantified; (4.0 μmol, 60.3 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 959.20 (M-H).
Compound 46
[00103] The NHS ester of the acid 45 was prepared by dissolving the acid 45 (4.0 μmol, 1 eqv.) in DMF (700.0 μL) and the SbTMU 5.93 mg, 16.5 μmol, in 200 μL DMF, 4.0 eqv.) was added, to the acid solution followed by the addition of DIPEA (103.0 μL). After stirring at RT for 1 hour, the reaction mixture was used immediately for peptide coupling without any purification. The peptide (Asp- Asp-Asp- Asp) was dissolved in DMFB2O (400.0 μL, 1 :1), basifϊed using DIPEA (50.0 μL). To this peptide solution was added freshly prepared NHS ester of the acid 45. The reaction mixture was stirred for 30 minutes and it was then analyzed by LCMS. The crude reaction mixture was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired were pooled and quantified; (3.0 μmol, 75.0 %, ε649 = 250000); ESI-MS (negative ion mode): m/z = 709.20 (1/2M-H).
Compound 47
[00104] A solution of compound 46 (1.0 μmol) in H2O was treated with TCEP (40.0 μL, 19.92 μmol, 0.5 M in H2O, 19.92 equiv) in an aluminum foil covered flask. After 30 minutes, the reaction mixture was analyzed by LCMS and was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 10 mL/min flow). Fractions containing the desired were pooled and used immediately for the subsequent displacement reaction without removing the solvent. ESI-MS (negative ion mode): m/z = 665.45 (1/2M-H).
Compound 48a
[00105] HPLC fractions containing the thiol 7 (0.34 μmol, 1 eqv.) were mixed with HPLC fractions containing dCTP-SPDP (0.41 μmol, 1.25 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH3CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.05 M TEAB, buffer B CH3CN, 5 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield compound 1 as a bright blue solid (0.17 μmol, 50 %, ε649 = 250000). The desired product was HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent. ESI-MS (negative ion mode): m/z = 968.35 (1/2M-H).
Compound 49a
[00106] HPLC fractions containing thiol 47 (0.5 μmol, 1 eqv.) were mixed with HPLC fractions containing dATP-SPDP (0.6 μmol, 1.2 eqv.) in an aluminum foil covered flask. After 15 min. LCMS analysis indicated that the completion of the reaction and it was then partially concentrated under reduced pressure to remove CH3CN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.00 mm 10 micron, gradient: 100% A for 5 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield
compound 49a as a bright blue solid (0.35 μmol, 70 %, ε649 = 250000). The desired was HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent. ESI-MS (negative ion mode): m/z = 980.10 (1/2M-H).
Example 5 Caproic-Asp Synthetic Scheme IV
NHS ester
[00107] Fmoc-Cys(SϊBu)-OH (2.0 g, 4.63 mmol, 1 eqv.) was dissolved in CH3CN (10 mL). DCC (1.2 g, 5.81 mmol, 1.26 eqv.) was added, followed by NHS (0.70 g, 6.08 mmol, 1.31 eqv.) and the reaction was stirred at RT for 1 hr. White precipitate (DCU) began forming within five min. The reaction mixture was transferred to Eppendorf tubes and centrifuged to remove the white precipitate. The supernatant was then used in subsequent reactions without further purification.
Acid
[00108] 6-Aminohexanoic acid (0.60 g, 4.57 mmol, 1 eqv.) was dissolved in 1 :1 H2O:DMF (6 mL total). DIPEA (0.016 mL) was added to keep the pH about 8. NHS ester (4.63 mmol in 10 mL CH3CN, 1.01 eqv.) was added to the reaction mixture in 1 mL aliquots over aboutlO min. DIPEA (0.02 mL) was added after each aliquot to keep the reaction basic. After the first aliquot of NHS ester was added, the reaction became cloudy, and addition of extra H2O (0.2 mL) was needed to clear up the solution. The reaction was stirred at RT for two hours, then quenched with 20 mL 10% HCl (aq.). The aqueous phase was extracted with CH2Cl2 (2 x 5OmL). The organic phase was dried over Na2Sθ4, filtered, and concentrated under reduced pressure to yield a brown oil. Purification by flash column chromatography (100% CH2Cl2 to 5% CH3OH/CH2C12) afforded the desired acid as a white foam (2.14 g, 86%).
NHS ester
[00109] The starting acid (0.99 g, 1.82 mmol, 1 eqv.) was dissolved in CH3CN (10 mL). DCC (0.46 g, 2.23 mmol, 1.23 eqv.) was added, followed by NHS (0.28 g, 2.43 mmol, 1.34 eqv.) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within 5 min. The reaction mixture was transferred to Eppendorf tubes and centrifuged to remove the white precipitate. The supernatant was then used in subsequent reactions without further purification.
Dimethyl ester
[00110] L-Aspartic acid dimethyl ester hydrochloride (0.2 g, 1.01 mmol, 2 eqv.) was dissolved in CH3CN (1 mL) and DIPEA (0.32 mL, 1.84 mmol, 4 eqv.). A solution of NHS ester (0.48 mmol, 1 eqv.) in CH3CN (2 mL) was added, and the reaction was stirred at RT for 12 hr. The reaction was diluted with EtOAc (25 mL), then washed with brine (1 x 30 mL) and sat. NH4CI (aq.) (1 x 30 mL). The organic phase was dried over Na2SC^, filtered, and concentrated under reduced pressure. Purification by flash column chromatography (100% CH2Cl2 to 2% CH3OH/CH2CI2) afforded the desired ester as a white foam (0.12 g, 36%).
Diacid
[00111] IM LiOH(aq) (0.18 mL, ~6 equiv) was added to a solution of dimethyl ester (0.02 g, 0.029 mmol, 1 eqv.) in THF (0.30 mL). The reaction was stirred at RT until the starting dimethyl ester was consumed based on LCMS analysis (about 15 min). The crude reaction was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 90% A for 3 min., then 5% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 10 mL/min. flow). Fractions containing the desired were pooled and concentrated to yield the desired diacid, which was used for subsequent reactions without quantifying.
[00112] Diacid (-29 μmol, 1 eqv.) was treated with TCEP (1.7 mL, 0.85 mmol, 0.5M in H2O, 29 eqv.). The reaction was stirred at RT until the starting material was consumed based on LCMS analysis (about 30 min.). The crude reaction was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 5% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 10 mL/min. flow). Fractions containing the desired were pooled and used for subsequent reactions without concentrating or quantifying.
Disulfide
[00113] HPLC fractions containing the thiol (about 10 μmol, 2 eqv.) were mixed with HPLC fractions containing SPDP-dATP (5 μmol, 1 equiv). After the SPDP-dATP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 10 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
Amine
[00114] The starting carbamate (~5 μmol, 1 eqv.) was treated with 20% piperidine in 1 : 1 DMF: CH3CN (2 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (-15 min). After removing the solvent under reduced pressure, the reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3OH, 10 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (1 μmol, 20%, £280 = 12700).
A * Caproic-Asp
[00115] Atto647N-NHS ester (0.030 mL, 1.8 μmol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.5 μmol, 1 eqv.) in H2O (0.25 mL) in 10 μL aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters
Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.086 μmol, 17%, ε64s = 150000).
Disulfide
[00116] HPLC fractions containing the thiol (-10 μmol, 6 eqv.) were mixed with HPLC fractions containing SPDP-dGTP (1.5 μmol, 1 eqv.). After the SPDP-dGTP was consumed based on LCMS analysis (about 10 min), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
[00117] The starting carbamate (~1.5 μmol, 1 eqv.) was treated with 20% piperidine in DMF (0.5 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (about 15 min). After removing the solvent under reduced pressure, the reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (0.26 μmol, 17%, £272 = 11900).
G* Caproic-Asp
[00118] Atto647N-NHS ester (0.011 mL, 0.66 μmol, 0.06 M in anhydrous DMF, 2.5 eqv.) was added to a solution of amine (0.26 μmol, 1 equiv) in H2O (0.50 mL) in small aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min., buffer A 0.05M TEAB, buffer B CH3 CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.076 μmol, 29%, ε64s = 150000).
Disulfide
[00119] HPLC fractions containing the thiol (about 5 μmol, 5 eqv.) were mixed with SPDP-dCTP (1 μmol, 1 eqv.) in H2O (0.20 mL). After the SPDP-dCTP was consumed based on LCMS analysis (about 10 min.), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 3% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
Amine
[00120] The starting carbamate (about 1 μmol, 1 eqv.) was treated with 20% piperidine in CH3CN (0.5 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (~15 min). After removing the solvent under reduced pressure, the reaction was HPLC
purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min, buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (0.15 μmol, 15%, £294 = 9300).
C* Caproic-Asp
[00121] Atto647N-NHS ester (0.012 mL, 0.72 μmol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.15 μmol, 1 eqv.) in H2O (0.20 mL) in 5 μL aliquots. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 2% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.030 μmol, 20%, ε645 = 150000).
Disulfide
[00122] HPLC fractions containing the thiol (~5 μmol, 2.5 equiv) were mixed with SPDP-dUTP (2 μmol, 1 eqv.) in H2O (0.13 mL). After the SPDP-dUTP was consumed based on LCMS analysis (-10 min), the reaction was partially concentrated under reduced pressure to remove CH3CN and then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized, then used for subsequent reactions without quantifying.
Amine
[00123] The starting carbamate (~1 μmol, 1 equiv) was treated with 20% piperidine in DMF (2 mL), and stirred at RT until the starting material was consumed based on LCMS analysis (about 15 min). After removing the solvent under reduced pressure, the reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex
Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (0.19 μmol, 19%, ε289 13000).
T* Caproic-Asp
[00124] Atto647N-NHS ester (0.010 mL, 0.68 μmol, 0.06M in anhydrous DMF, 3.6 eqv.) was added to a solution of amine (0.19 μmol, 1 eqv.) in H2O (0.40 mL) in small aliquots. IM K2HPO4 (0.40 mL) was also added to accelerate the reaction after there was little product formed within an hour. The reaction was monitored by LCMS to determine how much dye was needed to consume the starting amine. After disappearance of amine, the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 10.0 mm 10 micron, gradient: 100% A for 3 min., then 2% B/min., buffer A 0.05M TEAB, buffer B CH3CN, 5 mL/min. flow). Fractions containing the desired were pooled and concentrated, then HPLC purified a second time under the same conditions, using CH3OH instead Of CH3CN for buffer B. Fractions containing the desired were pooled and stored at -80 0C without removing the solvent (0.059 μmol, 31%, ε645 = 150000).
Example 6 Caproic-Asp- Asp- Asp- Asp (Alternative Routes)
Synthetic Scheme VI
Example 8 Synthesis of compounds in Table 1 Synthesis ofdCTP and dATP analogs:
[00125] A solution of N-α-Fmoc-L-glutamic acid α-t-butyl ester (1.02 g, 2.35 mmol) in anhydrous THF (12 mL) was cooled to 00C. Anhydrous NEt3 (0.4 mL, 2.87 mmol) was added, followed by ethyl chloroformate (0.3 mL, 3.1 mmol). After stirring under Ar for ~10 min, sodium borohydride (0.27 g, 7.1 mmol) was added in one portion. Methanol (23 mL) was then added slowly over ~10 min, causing vigorous gas evolution. The reaction was warmed to RT, then acidified with 10% HCl (10 mL). The organics were removed in vacuo. The residue was diluted with EtOAc (40 mL), then washed with brine (2 x 50 mL). The organic phase was dried over Na2SO4, filtered, and concentrated under reduced pressure to yield alcohol 52 as a white foam (0.97 g, 99%).
[00126] A solution of alcohol 52 (0.97 g, 2.35 mmol) in anhydrous CH2Cl2 (9 mL) and anhydrous NEt3 (0.7 mL, 5.0 mmol) was cooled to 00C. Methanesulfonyl chloride (0.28 mL, 3.6 mmol) was added dropwise over -15 min. After disappearance of starting material by TLC (-10 min), the reaction was washed with ice cold H2O (2 x 25 mL), dried over Na2SO4, filtered, and concentrated under reduced pressure. Potassium thioacetate (0.54 g, 4.7 mmol) was added to a solution of the resultant white foam in acetone (6 mL), and the dark brown reaction was stirred at RT for 12 hrs. The crude reaction was then purified by flash column chromatography (5% to 20% EtOAc/hexanes) to afford thioacetate 53 as a brown oil (0.57 g, 52%).
[00127] Trifluoroacetic acid (2 mL) was added to a solution of thioacetate 53 (0.29 g, 0.61 mmol) in CH2Cl2 (2 mL) and the reaction was stirred at RT for -30 min. The reaction was diluted with CH2Cl2 (20 mL), then washed with brine (2 x 20 mL). The organic phase was dried
over Na2SO4, filtered, and concentrated under reduced pressure to yield acid 54 as a brown oil (0.22 g, 88%).
[00128] Acid 54 (0.22 g, 0.53 mmol) was dissolved in MeCN (3 mL). DCC (0.12 g, 0.58 mmol) was added, followed by NHS (0.07 g, 0.63 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within five minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The dark brown supernatant containing NHS ester 55 was then added to a solution of H-Asp-Asp-OH (0.13 g, 0.52 mmol) in 0.25 M K2HPO4 (2.4 mL) and MeCN (1 mL). DIPEA (0.15 mL) was added to keep the pH ~ 8. The reaction was stirred at RT for two hours, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 3% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thioacetate 56 were pooled and lyophilized to yield the product as a white foam (0.25 g, 75%).
[00129] A solution of thioacetate 56 (0.023 g, mmol) in 50% MeCN/H2O (0.5 mL) was treated with 1 M NH2OH (0.5 mL, pH ~7). The reaction was stirred at RT for -10 min, then immediately HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min,
then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thiol 57 were used immediately for subsequent reactions without quantifying.
Synthesis ofdATP analog:
[00130] HPLC fractions containing thiol 57 (unqualified, -25 μmol) were mixed with HPLC fractions containing SPDP-dATP (20 μmol). After -15 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 58, which was used for the subsequent reaction without quantifying.
[00131] Carbamate 58 (unqualified, -16 μmol) was treated with 20% piperidine/MeCN (2 mL) and 20% piperidine/DMF (1 mL) for 15 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was then HPLC purified
(Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (7.5 μmol, 47%, ε289 = 12700).
[00132] Atto647N NHS ester (0.36 mL, 36 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 59 (17.6 μmol) in H2O (3 mL) and DMF (0.9 mL). After disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 60 as a bright blue solid (13 μmol, 74%, E649 = 150000).
Synthesis ofdCTP analog:
[00133] HPLC fractions containing thiol 57 (unqualified, -55 μmol) were mixed with HPLC fractions containing SPDP-dCTP (45 μmol). After -30 minutes the reaction was lyophilized, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 61, which was used for the subsequent reaction without quantifying.
[00134] Carbamate 61 (unqualified, -36 μmol) was treated with 20% piperidine/MeCN (3 mL) for 15 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amine 62 as a white foam (10.6 μmol, 29%, ε289 = 9300).
[00135] Atto647N NHS ester (0.475 mL, 47.5 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 62 (21 μmol) in H2O (3 mL) and DMF (0.7 mL). After
disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 63 as a bright blue solid (17.7 μmol, 84%, ε649 = 150000).
Synthesis ofdUTP analog:
[00136] Acid 54 (0.14 g, 0.34 mmol) was dissolved in MeCN (0.6 mL). DCC (0.085 g, 0.41 mmol) was added, followed by NHS (0.051 g, 0.44 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within five minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 55 was then added to a solution of 6-aminocaproic acid (0.049 g, 0.37 mmol) in H2O (0.4 mL) and DMF (0.4 mL). DIPEA (0.05 mL) was added to keep the pH ~ 8. The reaction was stirred at RT for 12 hours, then adjusted to pH 4 with 0.1 M HCl and extracted with CH2Cl2 (2 x 25 mL). The organic phase was dried over Na2SO^ filtered, and concentrated under reduced pressure. Purification by flash column chromatography (100% CH2Cl2 to 5% MeOH/CH2Cl2) afforded acid 64 (0.12 g, 69%).
[00137] Acid 64 (0.12 g, 0.23 mmol) was dissolved in MeCN (0.6 mL). DCC (0.058 g, 0.28 mmol) was added, followed by NHS (0.035 g, 0.3 mmol) and the reaction was stirred at RT for an hour. White precipitate (DCU) began forming within twenty minutes. The reaction mixture was transferred to eppendorf tubes and centrifuged to remove the white precipitate. The supernatant containing NHS ester 65 was then added to a solution of H-Asp-Asp-OH (0.075 g, 0.30 mmol) in 0.1 M K2HPO4 (0.5 mL) and MeCN (0.5 mL). DIPEA (0.095 mL) was added to keep the pH ~ 8. The reaction was stirred at RT for 12 hours, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thioacetate 66 were pooled and lyophilized to yield the product as a white foam (0.1 g, 55%).
[00138] A solution of thioacetate 66 (0.1 g, 0.13 mmol) in 50% MeCN/H2O (2 mL) was treated with 1 M NH2OH (2 mL, pH ~7). The reaction was stirred at RT for -10 min, then immediately HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min,
then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thiol 67 were used immediately for subsequent reactions without quantifying.
Synthesis ofdUTP analog:
[00139] HPLC fractions containing thiol 67 (unqualified, -55 μmol) were mixed with HPLC fractions containing SPDP-dUTP (50 μmol). After ~1 hr the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex C18 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 68, which was used for the subsequent reaction without quantifying.
[00140] Carbamate 68 (unqualified, -50 μmol) was treated with 20% piperidine/MeCN (2 mL) for 30 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was dissolved in 50 mM TEAB buffer (~3mL), causing formation of copious white precipitate (dibenzylfulvene). The mixture was transferred to eppendorf tubes and centrifuged to remove the precipitate. The supernatant was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 5 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amine 69 as a white foam (39 μmol, 78%, ε288 = 13000).
[00141] Atto647N NHS ester (0.24 mL, 24 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 69 (20 μmol) in H2O (0.5 mL) and DMF (0.1 mL). DIPEA (4 μL, 20 μmol) was added to basify the reaction. After disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 5 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 70 as a bright blue solid (20 μmol, 99%, ε649 = 150000).
Synthesis of dGTP analog:
[00142] SPDP (2.1 mL, 0.10 mmol, 0.05 M in DMF) was added to a solution of H-Pro- Lys(Fmoc)-Pro-Asp-Asp-OH (0.054 g, 0.068 mmol) in 0.1 M K2HPO4 (3 mL) and the reaction was stirred at RT until disappearance of the peptide by LCMS. The crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield peptide 72, which was used without quantifying.
[00143] A solution of disulfide 72 (-60 μmol) in H2O (5 mL) was treated with TCEP (1.44 mL, 0.72 mmol, 0.5 M in H2O). After -15 minutes the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative
column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 2% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired thiol 73 were used immediately for the subsequent reaction without concentrating or quantifying.
Synthesis ofdGTP analog:
[00144] HPLC fractions containing thiol 73 (unqualified, -50 μmol) were mixed with HPLC fractions containing SPDP-dGTP (58 μmol). After -30 minutes the reaction was concentrated to remove MeCN, then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield 74, which was used for the subsequent reaction without quantifying.
[00145] Carbamate 74 (unqualified, -40 μmol) was treated with 20% piperidine/MeCN (4 mL) for 15 minutes to remove the Fmoc protecting group. The solvent was removed under reduced pressure, and the residue was then HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 21.2 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield the product as a white foam (19.1 μmol, 48%, ε289 = 11900).
[00146] Atto647N NHS ester (0.30 mL, 30 μmol, 0.1 M in anhydrous DMF) was added to a solution of amine 75 (19.1 μmol) in H2O (3 mL) and DMF (0.6 mL). After disappearance of amine by LCMS the crude reaction was HPLC purified (Waters Delta 600 pump and 2487 Dual λ Absorbance Detector, Phenomenex Cl 8 preparative column, 250 x 15.00 mm 10 micron, gradient: 100% A for 3 min, then 1% B/min, buffer A 0.1 M TEAB, buffer B MeCN, 10 mL/min flow). Fractions containing the desired were pooled and lyophilized to yield amide 76 as a bright blue solid (19 μmol, 99%, ε649 = 150000).
[00147] The schemes above and variations thereof may be utilized for syntheses of derivatives and analogs of the exemplary nucleotide analogs shown above, for example, those having additional amino groups at the Inhibitor end and/or compounds of different linking groups.
[00148] While specific embodiments of the subject invention have been discussed, the above specification is illustrative and not restrictive. Many variations of the invention will
become apparent to those skilled in the art upon review of this specification. Contemplated equivalents of the nucleotide analogs disclosed here include compounds which otherwise correspond thereto, and which have the same general properties thereof, wherein one or more simple variations of substituents or components are made which do not adversely affect the characteristics of the nucleotide analogs of interest. In general, the components of the nucleotide analogs disclosed herein may be prepared by the methods illustrated in the general reaction schema as described herein or by modifications thereof, using readily available starting materials, reagents, and conventional synthesis procedures.
Equivalents
[00149] The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting on the invention described herein. Scope of the invention is thus indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.
Incorporation by Reference
[00150] The entire disclosure of each of the publications and patent documents referred to herein is incorporated by reference in its entirety for all purposes to the same extent as if each individual publication or patent document were so individually denoted.
Claims
1. A nucleotide analog of the following Formula II:
NTP is a nucleoside or nucleotide triphosphate, or an analog thereof, capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label that facilitates the identification of the nucleotide analog;
Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R2 is a tri-valent radical having the formula:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol, (C1-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10; R3 is a bond or group linking R2 to the Inhibitor moiety; and R4 is a bond or group linking R2 to a L.
2. The nucleotide analog of Claim 1, wherein the Inhibitor comprises a charged group selected from the group consisting of -COOH and -PO4.
3. The nucleotide analog of Claim 1, wherein the Inhibitor comprises at least two -COOH groups.
4. The nucleotide analog of Claim 1 , wherein the Inhibitor does not comprise a -PO4 group.
5. The nucleotide analog of Claim 1, wherein the Inhibitor does not comprise an aryl group.
6. The nucleotide analog of Claim 1, wherein the Inhibitor comprises a group selected from the group consisting of GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr, Pro and Lys.
7. The nucleotide analog of Claim 1 , wherein Ri comprises a C-C triple bond or a trans C-C double bond.
8. The nucleotide analog of Claim 1, wherein Ri comprises a S-S bond.
9. The nucleotide analog of Claim 1, wherein Ri comprises a C-C triple bond and a S-S bond.
10. The nucleotide analog of Claim 1, wherein Ri comprises
11. The nucleotide analog of Claim 10, wherein q is 1 or 2 and r is 1 , 2 or 3.
12. The nucleotide analog of Claim 1, wherein NTP is selected from dATP, dGTP, dCTP, dTTP, dUTP, ATP, GTP, CTP, TTP, UTP or an analog thereof.
13. The nucleotide analog of Claim 1, wherein the L is an optically-detectable moiety.
14. The nucleotide analog of Claim 13, wherein the optically-detectable moiety comprises a fluorophore.
15. The nucleotide analog of Claim 14, wherein the fluorophore is Cy5 or ATTO 647N.
16. The nucleotide analog of Claim 1, wherein R3 comprises
wherein R is H or alkyl groups, and may together form 3, 4, 5, or 6-member rings.
17. The nucleotide analog of Claim 16, wherein R3 comprises
wherein R is GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr, Pro or Lys, or a peptide of two or more amino acids comprising an amino acid selected from the group consisting of GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr, Pro and Lys.
19. A method for sequencing a nucleic acid, the method comprising the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog of the following Formula II:
detecting incorporation of the analog; removing or neutralizing the inhibitor; and repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template,
wherein
NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label that facilitates the identification of the nucleotide analog;
Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
Ri comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R2 is a tri-valent radical having the formula:
-(CH2) X-, -(CH2-O) x-, -(CH2-O) Z-(CH2) y-, -(CH2) Z-(CH2-O) y-, and the same substituted with one or more groups selected from hydroxyl, halogen, amino, thiol,
(Ci-C6) alkyl, wherein x, y and z are each integers with x and y+z are each from 2 to 10;
R3 is a bond or group linking R2 to the Inhibitor moiety; and
R4 is a bond or group linking R2 to a L.
20. The method of Claim 19, wherein the inhibitor is selected from the group consisting of one or more carboxylic acid, one or more phosphate, one or more amino acid, one or more peptide, one or more sulfate, one or more caproic acid, and any combination thereof.
21. A method for sequencing a nucleic acid, the method comprising the steps of: exposing a nucleic acid duplex comprising a template portion and a primer portion to a nucleotide analog comprising an inhibitor that is charged or capable of becoming charged, and a polymerase, under conditions that permit template-dependent incorporation of the analog into the primer; detecting incorporation of the analog; removing or neutralizing the inhibitor; and repeating the exposing, detecting, and removing steps at least once, thereby to determine the sequence of the template.
22. The method of Claim 21 , wherein the template portion and/or the primer portion is directly or indirectly anchored to a support.
23. The method of Claim 21 , wherein the detecting step comprises detecting individual analogs.
24. The method of Claim 21 further comprising the step of removing unincorporated analog.
25. The method of Claim 21 , wherein the inhibitor is selected from the group consisting of one or more carboxylic acid, one or more phosphate, one or more amino acid, one or more peptide, one or more sulfate, one or more caproic acid, and any combination thereof.
26. The method of Claim 25, wherein the amino acid is a negatively charged amino acid.
27. The method of Claim 26, wherein the amino acid is selected from the group consisting of aspartic acid, glutamic acid, histidine, lysine, and arginine.
28. The method of Claim 25, wherein the peptide is from about 2 to about 10 amino acids in length.
29. The method of Claim 21, wherein the inhibitor comprises multiple charged groups.
30. The method of Claim 21 , wherein the inhibitor is negatively charged.
31. The method of Claim 21 , wherein the inhibitor is positively charged.
32. The method of Claim 21, wherein the inhibitor does not cause steric inhibition of the polymerase.
33. A nucleotide analog, comprising a nucleoside triphosphate; an inhibitor comprising (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged; a detectable label; and a linker connecting the inhibitor and the label to the nucleoside triphosphate.
34. The analog of Claim 33, wherein the linker is cleavable.
35. The analog of Claim 34, wherein after the linker is cleaved, the residual analog has the structure of:
B1 is selected from the group consisting of purine bases, pyrimidine bases, and derivatives of purine and pyrimidine bases;
R is a N-containing group;
R' is independently selected from the group consisting of -OH, -0-P(O)(OH)2, -0-C(O)- Rx, -NHRy, and an -O-blocking agent, wherein Rx and Ry are alkyl groups;
R" is independently selected from the group consisting of H and -OH; R7 is a phosphodiester or a phosphoryl group; and z is an integer from about 1 to about 5.
36. The analog of Claim 33, wherein the charged groups consist of between about 2 to about 10 charged groups.
37. The analog of Claim 33, wherein the charged groups are selected from any combination of one or more carboxylic acid, one or more phosphate, one or more amino acid, one or more peptide, one or more sulfate, and one or more caproic acid.
38. The analog of Claim 33, wherein the label is optically detectable.
39. The analog of Claim 38, wherein the label is a fluorescent label.
40. The analog of Claim 33, wherein the inhibitor is not a steric inhibitor of a polymerase enzyme.
41. The analog of Claim 33, wherein the nucleoside triphosphate is selected from ATP, GTP, CTP, TTP, UTP, dATP, dGTP, dCTP, dTTP, dUTP, or an analog of any of the foregoing.
NTP is a nucleoside or nucleotide triphosphate or an analog of either capable of incorporating onto the 3' end of a polynucleotide strand hybridized to a template presenting the complement of the NTP;
L is a detectable label that facilitates the identification of the nucleotide analog;
Inhibitor comprises (a) one or more multiply charged groups or groups capable of becoming multiply charged, or (b) two or more singly charged groups or two or more groups capable of becoming singly charged;
Ri and R2 are independently a bond or a group, wherein at least one of Ri and R2 comprises a cleavable bond, which upon cleavage results in de-association of NTP from both L and Inhibitor;
R3 is a bond or group linking R2 to the Inhibitor moiety; and
R4 is a bond or group linking R2 to a L.
43. The nucleotide analog of Claim 42, wherein the Inhibitor does not comprise a nucleotide or nucleoside or analogs thereof.
44. The nucleotide analog of Claim 42, wherein the Inhibitor comprises a negatively charged group or a group capable of becoming negatively charged.
45. The nucleotide analog of Claim 42, wherein the Inhibitor comprises a positively charged group or a group capable of becoming positively charged.
46. The nucleotide analog of Claim 42, wherein the Inhibitor comprises two or more charged groups.
47. The nucleotide analog of Claim 42, wherein the Inhibitor comprises a charged group selected from the group consisting of -COOH, -PO4, -SO4, -SO3, -SO2, -NRWRV, where Rw and Rv independently is H, an alkyl or aryl group.
48. The nucleotide analog of Claim 42, wherein the Inhibitor comprises
49. The nucleotide analog of Claim 48, wherein Rs and R9 are H atoms and x = 1 and y = 2.
50. The nucleotide analog of Claim 42, wherein the Inhibitor does not comprise a -PO4 group.
51. The nucleotide analog of Claim 42, wherein the Inhibitor does not comprise an aryl group.
52. The nucleotide analog of Claim 42, wherein the Inhibitor comprises an amino acid group or an amino acid analog group.
53. The nucleotide analog of Claim 52, wherein the Inhibitor comprises a peptide of 2 to 20 units of amino acids or analogs.
54. The nucleotide analog of Claim 52, wherein the Inhibitor comprises a group selected from the group consisting of GIu, Asp, Arg, His, Thr, Trp, GIn, Tyr and Lys.
55. The nucleotide analog of Claim 42, wherein R3 comprises
56. The nucleotide analog of Claim 55, wherein p is 5 or 6.
57. The nucleotide analog of Claim 42, wherein Ri comprises a C-C triple bond.
58. The nucleotide analog of Claim 42, wherein Ri comprises a S-S bond.
59. The nucleotide analog of Claim 42, wherein Ri comprises a C-C triple bond and a S-S bond.
60. The nucleotide analog of Claim 42, wherein Ri comprises
61. The nucleotide analog of Claim 60, wherein q is 1 or 2 and r is 1, 2 or 3.
62. The nucleotide analog of Claim 42, wherein NTP is selected from dATP, dGTP, dCTP, dTTP, dUTP, ATP, GTP, CTP, TTP, UTP or an analog thereof.
63. The nucleotide analog of Claim 42, wherein the L is an optically-detectable moiety.
64. The nucleotide analog of Claim 63, wherein the optically-detectable moiety comprises a fluorophore.
65. The nucleotide analog of Claim 64, wherein the fluorophore is Cy5 or ATTO 647N.
66. The nucleotide analog of Claim 42, wherein the group of the Inhibitor that is charged or capable of becoming charged is from about 5 to about 60 bonds away from the NTP.
67. The nucleotide analog of Claim 56, wherein p is 5 and the detectable label comprises ATTO 647N.
68. The nucleotide analog of Claim 42, wherein R3 comprises
69. The nucleotide analog of Claim 68, wherein the Inhibitor comprises a -COOH group.
70. The nucleotide analog of Claim 69, wherein the Inhibitor comprises two or more -COOH groups.
71. The nucleotide analog of Claim 42, wherein R3 comprises
wherein R . 1 , n R2 are independently H or alkyl groups, and may together form 3, 4, 5, or 6-member rings, and j is an integer from about 1 to about 5.
72. The nucleotide analog of Claim 42, wherein R3 comprises
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/098,196 US8071755B2 (en) | 2004-05-25 | 2008-04-04 | Nucleotide analogs |
PCT/US2008/059446 WO2009123642A1 (en) | 2008-04-04 | 2008-04-04 | Nucleotide analogs |
USPCT/US2008/059446 | 2008-04-04 | ||
US12/098,196 | 2008-04-04 | ||
US12/244,698 US8114973B2 (en) | 2004-05-25 | 2008-10-02 | Nucleotide analogs |
US12/244,698 | 2008-10-02 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2009124254A1 true WO2009124254A1 (en) | 2009-10-08 |
Family
ID=40886121
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2009/039475 WO2009124254A1 (en) | 2008-04-04 | 2009-04-03 | Nucleotide analogs |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2009124254A1 (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102268054A (en) * | 2011-04-12 | 2011-12-07 | 宁辉 | Beta-cytidine-5 '-triphosphoarginine derivative ester and its preparation method and use |
EP2796552A3 (en) * | 2013-04-02 | 2015-03-04 | Molecular Assembly, LLC | Methods and apparatus for synthesizing nucleic acids |
US9279149B2 (en) | 2013-04-02 | 2016-03-08 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acids |
WO2017015538A1 (en) | 2015-07-22 | 2017-01-26 | Purdue Research Foundation | Modified glucagon molecules |
US9771613B2 (en) | 2013-04-02 | 2017-09-26 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acid |
WO2019105421A1 (en) * | 2017-11-30 | 2019-06-06 | 深圳市瀚海基因生物科技有限公司 | Nucleoside analogue, preparation method and application |
WO2019231617A1 (en) * | 2018-05-29 | 2019-12-05 | Elitechgroup, Inc. | Carborhodamine compounds and methods of preparation thereof |
US10683536B2 (en) | 2013-04-02 | 2020-06-16 | Molecular Assemblies, Inc. | Reusable initiators for synthesizing nucleic acids |
US11331643B2 (en) | 2013-04-02 | 2022-05-17 | Molecular Assemblies, Inc. | Reusable initiators for synthesizing nucleic acids |
US11384377B2 (en) | 2013-04-02 | 2022-07-12 | Molecular Assemblies, Inc. | Reusable initiators for synthesizing nucleic acids |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005044836A2 (en) * | 2003-11-05 | 2005-05-19 | Genovoxx Gmbh | Macromolecular nucleotide compounds and methods for using the same |
WO2007062160A2 (en) * | 2005-11-22 | 2007-05-31 | Helicos Biosciences Corporation | Methods and compositions for sequencing a nucleic acid |
US20070128614A1 (en) * | 2005-12-06 | 2007-06-07 | Liu David R | Nucleotide analogs |
WO2008137661A1 (en) * | 2007-05-03 | 2008-11-13 | Helicos Biosciences Corporation | Methods and compositions for sequencing a nucleic acid |
WO2008144315A1 (en) * | 2007-05-14 | 2008-11-27 | Helicos Biosciences Corporation | Methods and compositions for sequencing a nucleic acid |
US20090061437A1 (en) * | 2004-05-25 | 2009-03-05 | Helicos Biosciences Corporation | Nucleotide Analogs |
-
2009
- 2009-04-03 WO PCT/US2009/039475 patent/WO2009124254A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005044836A2 (en) * | 2003-11-05 | 2005-05-19 | Genovoxx Gmbh | Macromolecular nucleotide compounds and methods for using the same |
US20090061437A1 (en) * | 2004-05-25 | 2009-03-05 | Helicos Biosciences Corporation | Nucleotide Analogs |
WO2007062160A2 (en) * | 2005-11-22 | 2007-05-31 | Helicos Biosciences Corporation | Methods and compositions for sequencing a nucleic acid |
US20070128614A1 (en) * | 2005-12-06 | 2007-06-07 | Liu David R | Nucleotide analogs |
WO2008137661A1 (en) * | 2007-05-03 | 2008-11-13 | Helicos Biosciences Corporation | Methods and compositions for sequencing a nucleic acid |
WO2008144315A1 (en) * | 2007-05-14 | 2008-11-27 | Helicos Biosciences Corporation | Methods and compositions for sequencing a nucleic acid |
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102268054A (en) * | 2011-04-12 | 2011-12-07 | 宁辉 | Beta-cytidine-5 '-triphosphoarginine derivative ester and its preparation method and use |
CN102268054B (en) * | 2011-04-12 | 2012-11-21 | 宁辉 | Beta-cytidine-5 '-triphosphoarginine derivative ester and its preparation method and use |
US9695470B2 (en) | 2013-04-02 | 2017-07-04 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acids |
US9279149B2 (en) | 2013-04-02 | 2016-03-08 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acids |
EP3115462A1 (en) * | 2013-04-02 | 2017-01-11 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acids |
US11331643B2 (en) | 2013-04-02 | 2022-05-17 | Molecular Assemblies, Inc. | Reusable initiators for synthesizing nucleic acids |
US9771613B2 (en) | 2013-04-02 | 2017-09-26 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acid |
US10041110B2 (en) | 2013-04-02 | 2018-08-07 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acids |
EP3425053A1 (en) * | 2013-04-02 | 2019-01-09 | Molecular Assemblies, Inc. | Methods and apparatus for synthesizing nucleic acid |
EP2796552A3 (en) * | 2013-04-02 | 2015-03-04 | Molecular Assembly, LLC | Methods and apparatus for synthesizing nucleic acids |
US10683536B2 (en) | 2013-04-02 | 2020-06-16 | Molecular Assemblies, Inc. | Reusable initiators for synthesizing nucleic acids |
US11384377B2 (en) | 2013-04-02 | 2022-07-12 | Molecular Assemblies, Inc. | Reusable initiators for synthesizing nucleic acids |
WO2017015538A1 (en) | 2015-07-22 | 2017-01-26 | Purdue Research Foundation | Modified glucagon molecules |
US11472857B2 (en) | 2015-07-22 | 2022-10-18 | Purdue Research Foundation | Modified glucagon molecules |
US10954283B2 (en) | 2015-07-22 | 2021-03-23 | Purdue Research Foundation | Modified glucagon molecules |
WO2019105421A1 (en) * | 2017-11-30 | 2019-06-06 | 深圳市瀚海基因生物科技有限公司 | Nucleoside analogue, preparation method and application |
CN111741967A (en) * | 2017-11-30 | 2020-10-02 | 深圳市真迈生物科技有限公司 | Nucleoside analogue, preparation method and application |
US11512106B2 (en) | 2017-11-30 | 2022-11-29 | Genemind Biosciences Company Limited | Nucleoside analogue, preparation method and application |
US11155713B2 (en) | 2018-05-29 | 2021-10-26 | Elitechgroup, Inc. | Carborhodamine compounds and methods of preparation thereof |
WO2019231617A1 (en) * | 2018-05-29 | 2019-12-05 | Elitechgroup, Inc. | Carborhodamine compounds and methods of preparation thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8071755B2 (en) | Nucleotide analogs | |
US8114973B2 (en) | Nucleotide analogs | |
WO2009124254A1 (en) | Nucleotide analogs | |
EP1497304B1 (en) | Dual-labeled nucleotides | |
EP2057175A2 (en) | Nucleotide analogs | |
US7994304B2 (en) | Methods and compositions for sequencing a nucleic acid | |
EP1141409B2 (en) | A kit and methods for nucleic acid sequencing of single molecules by polymerase synthesis | |
EP1973926A2 (en) | Methods and compositions for sequencing a nucleic acid | |
EP3091026B1 (en) | Disulfide-linked reversible terminators | |
US7476734B2 (en) | Nucleotide analogs | |
WO2008137661A1 (en) | Methods and compositions for sequencing a nucleic acid | |
WO2008016906A2 (en) | 3'-phosphate-labeled nucleotide analogs and their use for sequencing a nucleic acid | |
AU2006297104A1 (en) | Labeled nucleotide analogs and uses therefor | |
US20230348968A1 (en) | Short pendant arm linkers for nucleotides in sequencing applications | |
WO2009123642A1 (en) | Nucleotide analogs | |
WO2008016907A1 (en) | Nucleotide analogs | |
NZ759350B2 (en) | Short pendant arm linkers for nucleotides in sequencing applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09727687 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 09727687 Country of ref document: EP Kind code of ref document: A1 |