US20230132387A9 - Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore - Google Patents
Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore Download PDFInfo
- Publication number
- US20230132387A9 US20230132387A9 US17/397,320 US202117397320A US2023132387A9 US 20230132387 A9 US20230132387 A9 US 20230132387A9 US 202117397320 A US202117397320 A US 202117397320A US 2023132387 A9 US2023132387 A9 US 2023132387A9
- Authority
- US
- United States
- Prior art keywords
- polynucleotide
- pore
- target rna
- seq
- rna
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 102000040430 polynucleotide Human genes 0.000 title claims abstract description 328
- 108091033319 polynucleotide Proteins 0.000 title claims abstract description 328
- 239000002157 polynucleotide Substances 0.000 title claims abstract description 328
- 239000011148 porous material Substances 0.000 title claims abstract description 212
- 238000000034 method Methods 0.000 title claims abstract description 159
- 229920002477 rna polymer Polymers 0.000 title claims abstract description 157
- 230000000295 complement effect Effects 0.000 title claims abstract description 130
- 108060004795 Methyltransferase Proteins 0.000 claims description 130
- 239000012528 membrane Substances 0.000 claims description 75
- 108020004999 messenger RNA Proteins 0.000 claims description 71
- 239000002299 complementary DNA Substances 0.000 claims description 55
- 201000010099 disease Diseases 0.000 claims description 52
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 52
- 150000001413 amino acids Chemical class 0.000 claims description 41
- 238000005259 measurement Methods 0.000 claims description 37
- 239000002679 microRNA Substances 0.000 claims description 33
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 30
- 102000014914 Carrier Proteins Human genes 0.000 claims description 28
- 108091008324 binding proteins Proteins 0.000 claims description 26
- 102000035160 transmembrane proteins Human genes 0.000 claims description 21
- 108091005703 transmembrane proteins Proteins 0.000 claims description 21
- GYOZYWVXFNDGLU-XLPZGREQSA-N dTMP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 GYOZYWVXFNDGLU-XLPZGREQSA-N 0.000 claims description 20
- NCMVOABPESMRCP-SHYZEUOFSA-N 2'-deoxycytosine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)C1 NCMVOABPESMRCP-SHYZEUOFSA-N 0.000 claims description 14
- LTFMZDNNPPEQNG-KVQBGUIXSA-N 2'-deoxyguanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 LTFMZDNNPPEQNG-KVQBGUIXSA-N 0.000 claims description 14
- 238000003752 polymerase chain reaction Methods 0.000 claims description 11
- 108700011259 MicroRNAs Proteins 0.000 claims description 10
- 230000002441 reversible effect Effects 0.000 claims description 8
- 238000003757 reverse transcription PCR Methods 0.000 claims description 6
- 239000007787 solid Substances 0.000 claims description 6
- 101710203389 Outer membrane porin F Proteins 0.000 claims description 4
- 101710203388 Outer membrane porin G Proteins 0.000 claims description 4
- 230000000694 effects Effects 0.000 claims description 4
- 230000003287 optical effect Effects 0.000 claims description 4
- 108010006464 Hemolysin Proteins Proteins 0.000 claims description 3
- 108010013381 Porins Proteins 0.000 claims description 3
- 239000003228 hemolysin Substances 0.000 claims description 3
- 108060002716 Exonuclease Proteins 0.000 claims description 2
- 108010014603 Leukocidins Proteins 0.000 claims description 2
- 102000004895 Lipoproteins Human genes 0.000 claims description 2
- 108090001030 Lipoproteins Proteins 0.000 claims description 2
- 241000187480 Mycobacterium smegmatis Species 0.000 claims description 2
- 241000588653 Neisseria Species 0.000 claims description 2
- 101710183280 Topoisomerase Proteins 0.000 claims description 2
- 108010073429 Type V Secretion Systems Proteins 0.000 claims description 2
- 102000013165 exonuclease Human genes 0.000 claims description 2
- 238000002847 impedance measurement Methods 0.000 claims description 2
- 108010014203 outer membrane phospholipase A Proteins 0.000 claims description 2
- KHWCHTKSEGGWEX-UHFFFAOYSA-N deoxyadenylic acid Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(O)=O)O1 KHWCHTKSEGGWEX-UHFFFAOYSA-N 0.000 claims 2
- LTFMZDNNPPEQNG-UHFFFAOYSA-N deoxyguanylic acid Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1CC(O)C(COP(O)(O)=O)O1 LTFMZDNNPPEQNG-UHFFFAOYSA-N 0.000 claims 2
- 230000005669 field effect Effects 0.000 claims 1
- 102000007739 porin activity proteins Human genes 0.000 claims 1
- 125000003729 nucleotide group Chemical group 0.000 description 91
- 239000002773 nucleotide Substances 0.000 description 90
- 125000006850 spacer group Chemical group 0.000 description 90
- 239000000523 sample Substances 0.000 description 51
- 102000053602 DNA Human genes 0.000 description 43
- 108020004414 DNA Proteins 0.000 description 43
- 235000001014 amino acid Nutrition 0.000 description 43
- 108090000623 proteins and genes Proteins 0.000 description 39
- 102000004190 Enzymes Human genes 0.000 description 33
- 108090000790 Enzymes Proteins 0.000 description 33
- 239000000178 monomer Substances 0.000 description 33
- 235000018102 proteins Nutrition 0.000 description 33
- 102000004169 proteins and genes Human genes 0.000 description 33
- 239000000232 Lipid Bilayer Substances 0.000 description 29
- 210000004027 cell Anatomy 0.000 description 29
- 230000008878 coupling Effects 0.000 description 29
- 238000010168 coupling process Methods 0.000 description 29
- 238000005859 coupling reaction Methods 0.000 description 29
- 239000010410 layer Substances 0.000 description 26
- 108091070501 miRNA Proteins 0.000 description 25
- 230000035772 mutation Effects 0.000 description 24
- 102000004196 processed proteins & peptides Human genes 0.000 description 24
- 108090000765 processed proteins & peptides Proteins 0.000 description 24
- 125000003275 alpha amino acid group Chemical group 0.000 description 23
- 229920001184 polypeptide Polymers 0.000 description 22
- 230000004048 modification Effects 0.000 description 21
- 238000012986 modification Methods 0.000 description 21
- 108091028664 Ribonucleotide Proteins 0.000 description 20
- 239000002336 ribonucleotide Substances 0.000 description 20
- 125000002652 ribonucleotide group Chemical group 0.000 description 20
- 102100034343 Integrase Human genes 0.000 description 16
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 16
- 102000039446 nucleic acids Human genes 0.000 description 16
- 108020004707 nucleic acids Proteins 0.000 description 16
- 150000007523 nucleic acids Chemical class 0.000 description 16
- 150000003839 salts Chemical class 0.000 description 16
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 15
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 15
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 14
- -1 Nucleotide sugars Chemical class 0.000 description 14
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 14
- 239000012634 fragment Substances 0.000 description 14
- 230000002209 hydrophobic effect Effects 0.000 description 14
- 230000007935 neutral effect Effects 0.000 description 14
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 13
- 239000000872 buffer Substances 0.000 description 13
- 238000006467 substitution reaction Methods 0.000 description 13
- 235000000346 sugar Nutrition 0.000 description 13
- KHWCHTKSEGGWEX-RRKCRQDMSA-N 2'-deoxyadenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(O)=O)O1 KHWCHTKSEGGWEX-RRKCRQDMSA-N 0.000 description 12
- 125000003636 chemical group Chemical group 0.000 description 12
- 239000013598 vector Substances 0.000 description 12
- 208000035657 Abasia Diseases 0.000 description 11
- 238000012512 characterization method Methods 0.000 description 11
- 239000000463 material Substances 0.000 description 11
- 229920000642 polymer Polymers 0.000 description 11
- 102200026914 rs730882246 Human genes 0.000 description 11
- 102000012410 DNA Ligases Human genes 0.000 description 10
- 108010061982 DNA Ligases Proteins 0.000 description 10
- 206010028980 Neoplasm Diseases 0.000 description 10
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 10
- 101710092462 Alpha-hemolysin Proteins 0.000 description 9
- 108091006146 Channels Proteins 0.000 description 9
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical class O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 9
- DJJCXFVJDGTHFX-UHFFFAOYSA-N Uridinemonophosphate Natural products OC1C(O)C(COP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-UHFFFAOYSA-N 0.000 description 9
- 238000007792 addition Methods 0.000 description 9
- 229920001400 block copolymer Polymers 0.000 description 9
- IERHLVCPSMICTF-XVFCMESISA-N cytidine 5'-monophosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 IERHLVCPSMICTF-XVFCMESISA-N 0.000 description 9
- IERHLVCPSMICTF-UHFFFAOYSA-N cytidine monophosphate Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(COP(O)(O)=O)O1 IERHLVCPSMICTF-UHFFFAOYSA-N 0.000 description 9
- 239000005547 deoxyribonucleotide Substances 0.000 description 9
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 9
- 235000013928 guanylic acid Nutrition 0.000 description 9
- 229920001223 polyethylene glycol Polymers 0.000 description 9
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical group CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 9
- DJJCXFVJDGTHFX-XVFCMESISA-N uridine 5'-monophosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 DJJCXFVJDGTHFX-XVFCMESISA-N 0.000 description 9
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 8
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 8
- 108091093037 Peptide nucleic acid Proteins 0.000 description 8
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 8
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 8
- LWIHDJKSTIGBAC-UHFFFAOYSA-K tripotassium phosphate Chemical compound [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 8
- 239000002202 Polyethylene glycol Substances 0.000 description 7
- 239000002585 base Substances 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 235000020958 biotin Nutrition 0.000 description 7
- 239000011616 biotin Chemical group 0.000 description 7
- 229960002685 biotin Drugs 0.000 description 7
- 235000012000 cholesterol Nutrition 0.000 description 7
- 229940107161 cholesterol Drugs 0.000 description 7
- 235000018417 cysteine Nutrition 0.000 description 7
- JSRLJPSBLDHEIO-SHYZEUOFSA-N dUMP Chemical compound O1[C@H](COP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 JSRLJPSBLDHEIO-SHYZEUOFSA-N 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 230000003993 interaction Effects 0.000 description 7
- 239000000203 mixture Substances 0.000 description 7
- 239000000126 substance Substances 0.000 description 7
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 6
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 6
- 125000001931 aliphatic group Chemical group 0.000 description 6
- 125000003118 aryl group Chemical group 0.000 description 6
- 230000000903 blocking effect Effects 0.000 description 6
- 239000003153 chemical reaction reagent Substances 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 6
- 150000002632 lipids Chemical group 0.000 description 6
- 239000001103 potassium chloride Substances 0.000 description 6
- 235000011164 potassium chloride Nutrition 0.000 description 6
- 102220289580 rs33916541 Human genes 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 5
- 108091093094 Glycol nucleic acid Proteins 0.000 description 5
- 206010033128 Ovarian cancer Diseases 0.000 description 5
- 108091046915 Threose nucleic acid Proteins 0.000 description 5
- 239000007864 aqueous solution Substances 0.000 description 5
- 210000004899 c-terminal region Anatomy 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 5
- 230000003247 decreasing effect Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 238000009396 hybridization Methods 0.000 description 5
- 239000000276 potassium ferrocyanide Substances 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 239000011780 sodium chloride Substances 0.000 description 5
- XOGGUFAVLNCTRS-UHFFFAOYSA-N tetrapotassium;iron(2+);hexacyanide Chemical compound [K+].[K+].[K+].[K+].[Fe+2].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-] XOGGUFAVLNCTRS-UHFFFAOYSA-N 0.000 description 5
- 150000003573 thiols Chemical class 0.000 description 5
- 230000001052 transient effect Effects 0.000 description 5
- 230000005945 translocation Effects 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 5
- 229930024421 Adenine Natural products 0.000 description 4
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 4
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 4
- IVOMOUWHDPKRLL-KQYNXXCUSA-N Cyclic adenosine monophosphate Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=CN=C2N)=C2N=C1 IVOMOUWHDPKRLL-KQYNXXCUSA-N 0.000 description 4
- 150000008574 D-amino acids Chemical class 0.000 description 4
- 206010013801 Duchenne Muscular Dystrophy Diseases 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- 201000001087 Miyoshi muscular dystrophy Diseases 0.000 description 4
- 208000009376 Miyoshi myopathy Diseases 0.000 description 4
- 206010061535 Ovarian neoplasm Diseases 0.000 description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 description 4
- 108091093078 Pyrimidine dimer Proteins 0.000 description 4
- 108010090804 Streptavidin Proteins 0.000 description 4
- 229960000643 adenine Drugs 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 239000012491 analyte Substances 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 230000004888 barrier function Effects 0.000 description 4
- 239000012472 biological sample Substances 0.000 description 4
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 4
- 201000011510 cancer Diseases 0.000 description 4
- ZOOGRGPOEVQQDX-KHLHZJAASA-N cyclic guanosine monophosphate Chemical compound C([C@H]1O2)O[P@](O)(=O)O[C@@H]1[C@H](O)[C@H]2N1C(N=C(NC2=O)N)=C2N=C1 ZOOGRGPOEVQQDX-KHLHZJAASA-N 0.000 description 4
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 4
- 229940104302 cytosine Drugs 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- 201000001441 melanoma Diseases 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 4
- 229910000160 potassium phosphate Inorganic materials 0.000 description 4
- 235000011009 potassium phosphates Nutrition 0.000 description 4
- 239000013635 pyrimidine dimer Substances 0.000 description 4
- 238000010839 reverse transcription Methods 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 229940113082 thymine Drugs 0.000 description 4
- 230000014616 translation Effects 0.000 description 4
- 229940035893 uracil Drugs 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- UDMBCSSLTHHNCD-UHFFFAOYSA-N Coenzym Q(11) Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(O)=O)C(O)C1O UDMBCSSLTHHNCD-UHFFFAOYSA-N 0.000 description 3
- 229920000858 Cyclodextrin Polymers 0.000 description 3
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 3
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 3
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 3
- LNQVTSROQXJCDD-UHFFFAOYSA-N adenosine monophosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(CO)C(OP(O)(O)=O)C1O LNQVTSROQXJCDD-UHFFFAOYSA-N 0.000 description 3
- 210000001124 body fluid Anatomy 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 208000006990 cholangiocarcinoma Diseases 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 208000005017 glioblastoma Diseases 0.000 description 3
- IPCSVZSSVZVIGE-UHFFFAOYSA-M hexadecanoate Chemical compound CCCCCCCCCCCCCCCC([O-])=O IPCSVZSSVZVIGE-UHFFFAOYSA-M 0.000 description 3
- 238000002169 hydrotherapy Methods 0.000 description 3
- 229910052751 metal Inorganic materials 0.000 description 3
- 239000002184 metal Substances 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 102200160490 rs1800299 Human genes 0.000 description 3
- 102200037599 rs749038326 Human genes 0.000 description 3
- 102220143003 rs753997345 Human genes 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 239000003053 toxin Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- BYGOPQKDHGXNCD-UHFFFAOYSA-N tripotassium;iron(3+);hexacyanide Chemical compound [K+].[K+].[K+].[Fe+3].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-] BYGOPQKDHGXNCD-UHFFFAOYSA-N 0.000 description 3
- YKBGVTZYEHREMT-KVQBGUIXSA-N 2'-deoxyguanosine Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](CO)O1 YKBGVTZYEHREMT-KVQBGUIXSA-N 0.000 description 2
- MXHRCPNRJAMMIM-SHYZEUOFSA-N 2'-deoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-SHYZEUOFSA-N 0.000 description 2
- FZWGECJQACGGTI-UHFFFAOYSA-N 2-amino-7-methyl-1,7-dihydro-6H-purin-6-one Chemical compound NC1=NC(O)=C2N(C)C=NC2=N1 FZWGECJQACGGTI-UHFFFAOYSA-N 0.000 description 2
- 150000005019 2-aminopurines Chemical class 0.000 description 2
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 2
- FSASIHFSFGAIJM-UHFFFAOYSA-N 3-methyladenine Chemical compound CN1C=NC(N)=C2N=CN=C12 FSASIHFSFGAIJM-UHFFFAOYSA-N 0.000 description 2
- XTWYTFMLZFPYCI-KQYNXXCUSA-N 5'-adenylphosphoric acid Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XTWYTFMLZFPYCI-KQYNXXCUSA-N 0.000 description 2
- NFEXJLMYXXIWPI-JXOAFFINSA-N 5-Hydroxymethylcytidine Chemical class C1=C(CO)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 NFEXJLMYXXIWPI-JXOAFFINSA-N 0.000 description 2
- WOVKYSAHUYNSMH-RRKCRQDMSA-N 5-bromodeoxyuridine Chemical class C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-RRKCRQDMSA-N 0.000 description 2
- ZAYHVCMSTBRABG-JXOAFFINSA-N 5-methylcytidine Chemical class O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZAYHVCMSTBRABG-JXOAFFINSA-N 0.000 description 2
- NJQONZSFUKNYOY-JXOAFFINSA-N 5-methylcytidine 5'-monophosphate Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 NJQONZSFUKNYOY-JXOAFFINSA-N 0.000 description 2
- 208000036762 Acute promyelocytic leukaemia Diseases 0.000 description 2
- 102000002735 Acyl-CoA Dehydrogenase Human genes 0.000 description 2
- 108010001058 Acyl-CoA Dehydrogenase Proteins 0.000 description 2
- XTWYTFMLZFPYCI-UHFFFAOYSA-N Adenosine diphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O XTWYTFMLZFPYCI-UHFFFAOYSA-N 0.000 description 2
- 208000001446 Anaplastic Thyroid Carcinoma Diseases 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 2
- 201000006935 Becker muscular dystrophy Diseases 0.000 description 2
- 208000008439 Biliary Liver Cirrhosis Diseases 0.000 description 2
- 208000033222 Biliary cirrhosis primary Diseases 0.000 description 2
- ZWIADYZPOWUWEW-XVFCMESISA-N CDP Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O1 ZWIADYZPOWUWEW-XVFCMESISA-N 0.000 description 2
- 208000024172 Cardiovascular disease Diseases 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 206010009900 Colitis ulcerative Diseases 0.000 description 2
- 206010009944 Colon cancer Diseases 0.000 description 2
- PCDQPRRSZKQHHS-CCXZUQQUSA-N Cytarabine Triphosphate Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 PCDQPRRSZKQHHS-CCXZUQQUSA-N 0.000 description 2
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical compound OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 2
- 101150077975 DDT gene Proteins 0.000 description 2
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 2
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 2
- AHCYMLUZIRLXAA-SHYZEUOFSA-N Deoxyuridine 5'-triphosphate Chemical compound O1[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 AHCYMLUZIRLXAA-SHYZEUOFSA-N 0.000 description 2
- 201000010374 Down Syndrome Diseases 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 208000032027 Essential Thrombocythemia Diseases 0.000 description 2
- 208000037149 Facioscapulohumeral dystrophy Diseases 0.000 description 2
- 208000004463 Follicular Adenocarcinoma Diseases 0.000 description 2
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 2
- QGWNDRXFNXRZMB-UUOKFMHZSA-N GDP Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O QGWNDRXFNXRZMB-UUOKFMHZSA-N 0.000 description 2
- 201000010915 Glioblastoma multiforme Diseases 0.000 description 2
- XKMLYUALXHKNFT-UUOKFMHZSA-N Guanosine-5'-triphosphate Chemical compound C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XKMLYUALXHKNFT-UUOKFMHZSA-N 0.000 description 2
- 208000023105 Huntington disease Diseases 0.000 description 2
- 150000008575 L-amino acids Chemical class 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 2
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 208000031422 Lymphocytic Chronic B-Cell Leukemia Diseases 0.000 description 2
- 206010025323 Lymphomas Diseases 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 241000124008 Mammalia Species 0.000 description 2
- 208000025205 Mantle-Cell Lymphoma Diseases 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- 208000034965 Nemaline Myopathies Diseases 0.000 description 2
- 206010029260 Neuroblastoma Diseases 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- 101710116435 Outer membrane protein Proteins 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 206010033701 Papillary thyroid cancer Diseases 0.000 description 2
- 102000017033 Porins Human genes 0.000 description 2
- 208000012654 Primary biliary cholangitis Diseases 0.000 description 2
- 208000033826 Promyelocytic Acute Leukemia Diseases 0.000 description 2
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 2
- 208000007014 Retinitis pigmentosa Diseases 0.000 description 2
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 2
- 206010040047 Sepsis Diseases 0.000 description 2
- 208000000102 Squamous Cell Carcinoma of Head and Neck Diseases 0.000 description 2
- 208000005718 Stomach Neoplasms Diseases 0.000 description 2
- RZCIEJXAILMSQK-JXOAFFINSA-N TTP Chemical compound O=C1NC(=O)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 RZCIEJXAILMSQK-JXOAFFINSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- 201000006704 Ulcerative Colitis Diseases 0.000 description 2
- 108010072685 Uracil-DNA Glycosidase Proteins 0.000 description 2
- 102000006943 Uracil-DNA Glycosidase Human genes 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 description 2
- 206010046798 Uterine leiomyoma Diseases 0.000 description 2
- 208000016025 Waldenstroem macroglobulinemia Diseases 0.000 description 2
- 208000033559 Waldenström macroglobulinemia Diseases 0.000 description 2
- KPUOHXMVCZBWQC-JXOAFFINSA-N [(2r,3s,4r,5r)-5-[4-amino-5-(hydroxymethyl)-2-oxopyrimidin-1-yl]-3,4-dihydroxyoxolan-2-yl]methyl dihydrogen phosphate Chemical compound C1=C(CO)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(O)=O)O1 KPUOHXMVCZBWQC-JXOAFFINSA-N 0.000 description 2
- BZDVTEPMYMHZCR-JGVFFNPUSA-N [(2s,5r)-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methyl phosphono hydrogen phosphate Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)CC1 BZDVTEPMYMHZCR-JGVFFNPUSA-N 0.000 description 2
- ASJWEHCPLGMOJE-LJMGSBPFSA-N ac1l3rvh Chemical class N1C(=O)NC(=O)[C@@]2(C)[C@@]3(C)C(=O)NC(=O)N[C@H]3[C@H]21 ASJWEHCPLGMOJE-LJMGSBPFSA-N 0.000 description 2
- 208000017733 acquired polycythemia vera Diseases 0.000 description 2
- 230000010933 acylation Effects 0.000 description 2
- 238000005917 acylation reaction Methods 0.000 description 2
- 150000001299 aldehydes Chemical class 0.000 description 2
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- CKLJMWTZIZZHCS-REOHCLBHSA-N aspartic acid group Chemical group N[C@@H](CC(=O)O)C(=O)O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 2
- 208000029560 autism spectrum disease Diseases 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000010839 body fluid Substances 0.000 description 2
- 150000001768 cations Chemical class 0.000 description 2
- 239000002800 charge carrier Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 150000003841 chloride salts Chemical class 0.000 description 2
- UHZZMRAGKVHANO-UHFFFAOYSA-M chlormequat chloride Chemical compound [Cl-].C[N+](C)(C)CCCl UHZZMRAGKVHANO-UHFFFAOYSA-M 0.000 description 2
- 208000032852 chronic lymphocytic leukemia Diseases 0.000 description 2
- 229920001577 copolymer Polymers 0.000 description 2
- 208000029078 coronary artery disease Diseases 0.000 description 2
- 208000030381 cutaneous melanoma Diseases 0.000 description 2
- DAEAPNUQQAICNR-RRKCRQDMSA-K dADP(3-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP([O-])(=O)OP([O-])([O-])=O)O1 DAEAPNUQQAICNR-RRKCRQDMSA-K 0.000 description 2
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 2
- FTDHDKPUHBLBTL-SHYZEUOFSA-K dCDP(3-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 FTDHDKPUHBLBTL-SHYZEUOFSA-K 0.000 description 2
- RGWHQCVHVJXOKC-SHYZEUOFSA-N dCTP Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO[P@](O)(=O)O[P@](O)(=O)OP(O)(O)=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-N 0.000 description 2
- CIKGWCTVFSRMJU-KVQBGUIXSA-N dGDP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O1 CIKGWCTVFSRMJU-KVQBGUIXSA-N 0.000 description 2
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 2
- UJLXYODCHAELLY-XLPZGREQSA-N dTDP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 UJLXYODCHAELLY-XLPZGREQSA-N 0.000 description 2
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 2
- QHWZTVCCBMIIKE-SHYZEUOFSA-N dUDP Chemical compound O1[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(=O)NC(=O)C=C1 QHWZTVCCBMIIKE-SHYZEUOFSA-N 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 101150102279 ddc gene Proteins 0.000 description 2
- 201000001981 dermatomyositis Diseases 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 239000001177 diphosphate Substances 0.000 description 2
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 2
- 235000011180 diphosphates Nutrition 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 208000008570 facioscapulohumeral muscular dystrophy Diseases 0.000 description 2
- 150000002190 fatty acyls Chemical group 0.000 description 2
- 201000003444 follicular lymphoma Diseases 0.000 description 2
- 206010017758 gastric cancer Diseases 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 229910021389 graphene Inorganic materials 0.000 description 2
- QGWNDRXFNXRZMB-UHFFFAOYSA-N guanidine diphosphate Natural products C1=2NC(N)=NC(=O)C=2N=CN1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O QGWNDRXFNXRZMB-UHFFFAOYSA-N 0.000 description 2
- 201000000459 head and neck squamous cell carcinoma Diseases 0.000 description 2
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 2
- 231100000844 hepatocellular carcinoma Toxicity 0.000 description 2
- 125000000623 heterocyclic group Chemical group 0.000 description 2
- IPCSVZSSVZVIGE-UHFFFAOYSA-N hexadecanoic acid Chemical compound CCCCCCCCCCCCCCCC(O)=O IPCSVZSSVZVIGE-UHFFFAOYSA-N 0.000 description 2
- ACCCMOQWYVYDOT-UHFFFAOYSA-N hexane-1,1-diol Chemical group CCCCCC(O)O ACCCMOQWYVYDOT-UHFFFAOYSA-N 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 201000008319 inclusion body myositis Diseases 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 201000007450 intrahepatic cholangiocarcinoma Diseases 0.000 description 2
- 201000010260 leiomyoma Diseases 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 238000004811 liquid chromatography Methods 0.000 description 2
- 235000018977 lysine Nutrition 0.000 description 2
- 125000003588 lysine group Chemical class [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 150000002678 macrocyclic compounds Chemical class 0.000 description 2
- 229920002521 macromolecule Polymers 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- SJFKGZZCMREBQH-UHFFFAOYSA-N methyl ethanimidate Chemical compound COC(C)=N SJFKGZZCMREBQH-UHFFFAOYSA-N 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 150000004712 monophosphates Chemical class 0.000 description 2
- 208000008338 non-alcoholic fatty liver disease Diseases 0.000 description 2
- 208000002154 non-small cell lung carcinoma Diseases 0.000 description 2
- 239000002777 nucleoside Substances 0.000 description 2
- 201000008129 pancreatic ductal adenocarcinoma Diseases 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 150000003904 phospholipids Chemical class 0.000 description 2
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 2
- 210000002381 plasma Anatomy 0.000 description 2
- 208000037244 polycythemia vera Diseases 0.000 description 2
- 208000005987 polymyositis Diseases 0.000 description 2
- 150000003212 purines Chemical class 0.000 description 2
- 150000003230 pyrimidines Chemical class 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 238000003259 recombinant expression Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000006722 reduction reaction Methods 0.000 description 2
- 238000005932 reductive alkylation reaction Methods 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 210000003296 saliva Anatomy 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 239000003579 shift reagent Substances 0.000 description 2
- 239000002356 single layer Substances 0.000 description 2
- 201000003708 skin melanoma Diseases 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 201000011549 stomach cancer Diseases 0.000 description 2
- 229920001059 synthetic polymer Polymers 0.000 description 2
- 201000008440 thyroid gland anaplastic carcinoma Diseases 0.000 description 2
- 208000030901 thyroid gland follicular carcinoma Diseases 0.000 description 2
- 208000030045 thyroid gland papillary carcinoma Diseases 0.000 description 2
- 208000019179 thyroid gland undifferentiated (anaplastic) carcinoma Diseases 0.000 description 2
- 231100000765 toxin Toxicity 0.000 description 2
- 108700012359 toxins Proteins 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- MQAYPFVXSPHGJM-UHFFFAOYSA-M trimethyl(phenyl)azanium;chloride Chemical compound [Cl-].C[N+](C)(C)C1=CC=CC=C1 MQAYPFVXSPHGJM-UHFFFAOYSA-M 0.000 description 2
- 239000001226 triphosphate Substances 0.000 description 2
- 235000011178 triphosphate Nutrition 0.000 description 2
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 2
- 208000029729 tumor suppressor gene on chromosome 11 Diseases 0.000 description 2
- 210000002700 urine Anatomy 0.000 description 2
- 201000007954 uterine fibroid Diseases 0.000 description 2
- AUTOLBMXDDTRRT-JGVFFNPUSA-N (4R,5S)-dethiobiotin Chemical compound C[C@@H]1NC(=O)N[C@@H]1CCCCCC(O)=O AUTOLBMXDDTRRT-JGVFFNPUSA-N 0.000 description 1
- UKGJZDSUJSPAJL-YPUOHESYSA-N (e)-n-[(1r)-1-[3,5-difluoro-4-(methanesulfonamido)phenyl]ethyl]-3-[2-propyl-6-(trifluoromethyl)pyridin-3-yl]prop-2-enamide Chemical compound CCCC1=NC(C(F)(F)F)=CC=C1\C=C\C(=O)N[C@H](C)C1=CC(F)=C(NS(C)(=O)=O)C(F)=C1 UKGJZDSUJSPAJL-YPUOHESYSA-N 0.000 description 1
- KILNVBDSWZSGLL-KXQOOQHDSA-N 1,2-dihexadecanoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCCCCCCCCC KILNVBDSWZSGLL-KXQOOQHDSA-N 0.000 description 1
- WWJWZQKUDYKLTK-UHFFFAOYSA-N 1,n6-ethenoadenine Chemical compound C1=NC2=NC=N[C]2C2=NC=CN21 WWJWZQKUDYKLTK-UHFFFAOYSA-N 0.000 description 1
- BMQZYMYBQZGEEY-UHFFFAOYSA-M 1-ethyl-3-methylimidazolium chloride Chemical compound [Cl-].CCN1C=C[N+](C)=C1 BMQZYMYBQZGEEY-UHFFFAOYSA-M 0.000 description 1
- PHNGFPPXDJJADG-RRKCRQDMSA-N 2'-deoxyinosine-5'-monophosphate Chemical compound O1[C@H](COP(O)(O)=O)[C@@H](O)C[C@@H]1N1C(N=CNC2=O)=C2N=C1 PHNGFPPXDJJADG-RRKCRQDMSA-N 0.000 description 1
- CKTSBUTUHBMZGZ-SHYZEUOFSA-N 2'‐deoxycytidine Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 CKTSBUTUHBMZGZ-SHYZEUOFSA-N 0.000 description 1
- MHKBMNACOMRIAW-UHFFFAOYSA-N 2,3-dinitrophenol Chemical class OC1=CC=CC([N+]([O-])=O)=C1[N+]([O-])=O MHKBMNACOMRIAW-UHFFFAOYSA-N 0.000 description 1
- PIINGYXNCHTJTF-UHFFFAOYSA-N 2-(2-azaniumylethylamino)acetate Chemical group NCCNCC(O)=O PIINGYXNCHTJTF-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical group OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical group Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- HCGYMSSYSAKGPK-UHFFFAOYSA-N 2-nitro-1h-indole Chemical class C1=CC=C2NC([N+](=O)[O-])=CC2=C1 HCGYMSSYSAKGPK-UHFFFAOYSA-N 0.000 description 1
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- KIUMMUBSPKGMOY-UHFFFAOYSA-N 3,3'-Dithiobis(6-nitrobenzoic acid) Chemical group C1=C([N+]([O-])=O)C(C(=O)O)=CC(SSC=2C=C(C(=CC=2)[N+]([O-])=O)C(O)=O)=C1 KIUMMUBSPKGMOY-UHFFFAOYSA-N 0.000 description 1
- OUCMTIKCFRCBHK-UHFFFAOYSA-N 3,3-dibenzylcyclooctyne Chemical group C1CCCCC#CC1(CC=1C=CC=CC=1)CC1=CC=CC=C1 OUCMTIKCFRCBHK-UHFFFAOYSA-N 0.000 description 1
- 108010034927 3-methyladenine-DNA glycosylase Proteins 0.000 description 1
- OZFPSOBLQZPIAV-UHFFFAOYSA-N 5-nitro-1h-indole Chemical class [O-][N+](=O)C1=CC=C2NC=CC2=C1 OZFPSOBLQZPIAV-UHFFFAOYSA-N 0.000 description 1
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 1
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 1
- 208000010507 Adenocarcinoma of Lung Diseases 0.000 description 1
- 208000003200 Adenoma Diseases 0.000 description 1
- 206010001233 Adenoma benign Diseases 0.000 description 1
- 208000022309 Alcoholic Liver disease Diseases 0.000 description 1
- 208000024827 Alzheimer disease Diseases 0.000 description 1
- 208000019901 Anxiety disease Diseases 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 102000014461 Ataxins Human genes 0.000 description 1
- 108010078286 Ataxins Proteins 0.000 description 1
- 206010003658 Atrial Fibrillation Diseases 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 208000003950 B-cell lymphoma Diseases 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 241000193738 Bacillus anthracis Species 0.000 description 1
- 101100031699 Bacillus subtilis (strain 168) bglP gene Proteins 0.000 description 1
- 206010061692 Benign muscle neoplasm Diseases 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 206010005003 Bladder cancer Diseases 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 208000003174 Brain Neoplasms Diseases 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 208000011691 Burkitt lymphomas Diseases 0.000 description 1
- 102220484866 C-type lectin domain family 4 member A_W21A_mutation Human genes 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 102220548139 Calpain-2 catalytic subunit_D22E_mutation Human genes 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 206010007572 Cardiac hypertrophy Diseases 0.000 description 1
- 208000006029 Cardiomegaly Diseases 0.000 description 1
- 208000031229 Cardiomyopathies Diseases 0.000 description 1
- 206010008025 Cerebellar ataxia Diseases 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 206010008642 Cholesteatoma Diseases 0.000 description 1
- 208000006332 Choriocarcinoma Diseases 0.000 description 1
- 208000000668 Chronic Pancreatitis Diseases 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 208000030808 Clear cell renal carcinoma Diseases 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 1
- 108020004635 Complementary DNA Proteins 0.000 description 1
- 208000002330 Congenital Heart Defects Diseases 0.000 description 1
- 206010056370 Congestive cardiomyopathy Diseases 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 208000012609 Cowden disease Diseases 0.000 description 1
- 201000002847 Cowden syndrome Diseases 0.000 description 1
- 108010069514 Cyclic Peptides Proteins 0.000 description 1
- 102000001189 Cyclic Peptides Human genes 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 102100039128 DNA-3-methyladenine glycosylase Human genes 0.000 description 1
- 206010011878 Deafness Diseases 0.000 description 1
- 206010012289 Dementia Diseases 0.000 description 1
- CKTSBUTUHBMZGZ-UHFFFAOYSA-N Deoxycytidine Natural products O=C1N=C(N)C=CN1C1OC(CO)C(O)C1 CKTSBUTUHBMZGZ-UHFFFAOYSA-N 0.000 description 1
- 206010012438 Dermatitis atopic Diseases 0.000 description 1
- 208000007342 Diabetic Nephropathies Diseases 0.000 description 1
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 1
- 201000010046 Dilated cardiomyopathy Diseases 0.000 description 1
- 101100310856 Drosophila melanogaster spri gene Proteins 0.000 description 1
- 206010014733 Endometrial cancer Diseases 0.000 description 1
- 206010014759 Endometrial neoplasm Diseases 0.000 description 1
- 201000009273 Endometriosis Diseases 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 208000000461 Esophageal Neoplasms Diseases 0.000 description 1
- 208000001730 Familial dysautonomia Diseases 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 206010016654 Fibrosis Diseases 0.000 description 1
- 201000011240 Frontotemporal dementia Diseases 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 208000034826 Genetic Predisposition to Disease Diseases 0.000 description 1
- 208000032320 Germ cell tumor of testis Diseases 0.000 description 1
- 208000032612 Glial tumor Diseases 0.000 description 1
- 206010018338 Glioma Diseases 0.000 description 1
- 208000022461 Glomerular disease Diseases 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- 241000219146 Gossypium Species 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 208000002927 Hamartoma Diseases 0.000 description 1
- 206010019280 Heart failures Diseases 0.000 description 1
- 102100022536 Helicase POLQ-like Human genes 0.000 description 1
- 208000005176 Hepatitis C Diseases 0.000 description 1
- 208000017604 Hodgkin disease Diseases 0.000 description 1
- 208000021519 Hodgkin lymphoma Diseases 0.000 description 1
- 208000010747 Hodgkins lymphoma Diseases 0.000 description 1
- 101000899334 Homo sapiens Helicase POLQ-like Proteins 0.000 description 1
- 240000005979 Hordeum vulgare Species 0.000 description 1
- 235000007340 Hordeum vulgare Nutrition 0.000 description 1
- 208000025500 Hutchinson-Gilford progeria syndrome Diseases 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical class ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- 208000035150 Hypercholesterolemia Diseases 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- 206010021042 Hypopharyngeal cancer Diseases 0.000 description 1
- 206010056305 Hypopharyngeal neoplasm Diseases 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- 208000008839 Kidney Neoplasms Diseases 0.000 description 1
- 208000007666 Klatskin Tumor Diseases 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 1
- 240000004322 Lens culinaris Species 0.000 description 1
- 235000014647 Lens culinaris subsp culinaris Nutrition 0.000 description 1
- 201000009342 Limb-girdle muscular dystrophy Diseases 0.000 description 1
- 206010024612 Lipoma Diseases 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 1
- 208000030289 Lymphoproliferative disease Diseases 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 244000070406 Malus silvestris Species 0.000 description 1
- 208000000172 Medulloblastoma Diseases 0.000 description 1
- 208000008770 Multiple Hamartoma Syndrome Diseases 0.000 description 1
- 208000034578 Multiple myelomas Diseases 0.000 description 1
- 240000005561 Musa balbisiana Species 0.000 description 1
- 201000003793 Myelodysplastic syndrome Diseases 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- 208000014767 Myeloproliferative disease Diseases 0.000 description 1
- 201000007224 Myeloproliferative neoplasm Diseases 0.000 description 1
- 201000004458 Myoma Diseases 0.000 description 1
- 206010068871 Myotonic dystrophy Diseases 0.000 description 1
- OKIZCWYLBDKLSU-UHFFFAOYSA-M N,N,N-Trimethylmethanaminium chloride Chemical compound [Cl-].C[N+](C)(C)C OKIZCWYLBDKLSU-UHFFFAOYSA-M 0.000 description 1
- 208000002454 Nasopharyngeal Carcinoma Diseases 0.000 description 1
- 206010061306 Nasopharyngeal cancer Diseases 0.000 description 1
- 206010029379 Neutrophilia Diseases 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 1
- OZBFLQITCMCIOY-FOUAGVGXSA-N OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO Chemical compound OC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](CO)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1CO OZBFLQITCMCIOY-FOUAGVGXSA-N 0.000 description 1
- 208000008589 Obesity Diseases 0.000 description 1
- 206010030155 Oesophageal carcinoma Diseases 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 208000007571 Ovarian Epithelial Carcinoma Diseases 0.000 description 1
- 235000021314 Palmitic acid Nutrition 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- 206010033649 Pancreatitis chronic Diseases 0.000 description 1
- 208000018737 Parkinson disease Diseases 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 208000023668 Pharyngeal disease Diseases 0.000 description 1
- 244000046052 Phaseolus vulgaris Species 0.000 description 1
- 235000010627 Phaseolus vulgaris Nutrition 0.000 description 1
- 208000007913 Pituitary Neoplasms Diseases 0.000 description 1
- 201000005746 Pituitary adenoma Diseases 0.000 description 1
- 206010061538 Pituitary tumour benign Diseases 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 201000010769 Prader-Willi syndrome Diseases 0.000 description 1
- 208000024777 Prion disease Diseases 0.000 description 1
- 208000007932 Progeria Diseases 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 241000910071 Pyrobaculum filamentous virus 1 Species 0.000 description 1
- 238000010802 RNA extraction kit Methods 0.000 description 1
- 101710086015 RNA ligase Proteins 0.000 description 1
- 102220637688 Ras-related protein Rab-33A_D91G_mutation Human genes 0.000 description 1
- 206010038389 Renal cancer Diseases 0.000 description 1
- 208000006265 Renal cell carcinoma Diseases 0.000 description 1
- 201000000582 Retinoblastoma Diseases 0.000 description 1
- 201000001638 Riley-Day syndrome Diseases 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 208000009359 Sezary Syndrome Diseases 0.000 description 1
- 208000021388 Sezary disease Diseases 0.000 description 1
- 229910052581 Si3N4 Inorganic materials 0.000 description 1
- 101710126859 Single-stranded DNA-binding protein Proteins 0.000 description 1
- 208000013738 Sleep Initiation and Maintenance disease Diseases 0.000 description 1
- 206010041067 Small cell lung cancer Diseases 0.000 description 1
- 240000003768 Solanum lycopersicum Species 0.000 description 1
- 244000061456 Solanum tuberosum Species 0.000 description 1
- 235000002595 Solanum tuberosum Nutrition 0.000 description 1
- 208000009415 Spinocerebellar Ataxias Diseases 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 229930182558 Sterol Natural products 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 208000000389 T-cell leukemia Diseases 0.000 description 1
- 208000028530 T-cell lymphoblastic leukemia/lymphoma Diseases 0.000 description 1
- 208000034799 Tauopathies Diseases 0.000 description 1
- 229920006362 Teflon® Polymers 0.000 description 1
- 208000002903 Thalassemia Diseases 0.000 description 1
- 235000009470 Theobroma cacao Nutrition 0.000 description 1
- 244000299461 Theobroma cacao Species 0.000 description 1
- 208000024770 Thyroid neoplasm Diseases 0.000 description 1
- 208000000323 Tourette Syndrome Diseases 0.000 description 1
- 208000016620 Tourette disease Diseases 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 208000007097 Urinary Bladder Neoplasms Diseases 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 201000005969 Uveal melanoma Diseases 0.000 description 1
- 241000219094 Vitaceae Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- KHIXTHDBCAQXRC-JXOAFFINSA-N [(2r,3s,4r,5r)-5-(4-amino-5-methyl-2-oxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphono hydrogen phosphate Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O1 KHIXTHDBCAQXRC-JXOAFFINSA-N 0.000 description 1
- JVCCCMWZAFGSPD-JXOAFFINSA-N [(2r,3s,4r,5r)-5-[4-amino-5-(hydroxymethyl)-2-oxopyrimidin-1-yl]-3,4-dihydroxyoxolan-2-yl]methyl phosphono hydrogen phosphate Chemical compound C1=C(CO)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O1 JVCCCMWZAFGSPD-JXOAFFINSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Chemical class Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- YIJVOACVHQZMKI-JXOAFFINSA-N [[(2r,3s,4r,5r)-5-(4-amino-5-methyl-2-oxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound O=C1N=C(N)C(C)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 YIJVOACVHQZMKI-JXOAFFINSA-N 0.000 description 1
- NAOOSVVUTFPWHI-JXOAFFINSA-N [[(2r,3s,4r,5r)-5-[4-amino-5-(hydroxymethyl)-2-oxopyrimidin-1-yl]-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound C1=C(CO)C(N)=NC(=O)N1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 NAOOSVVUTFPWHI-JXOAFFINSA-N 0.000 description 1
- PGAVKCOVUIYSFO-UHFFFAOYSA-N [[5-(2,4-dioxopyrimidin-1-yl)-3,4-dihydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate Chemical compound OC1C(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)OC1N1C(=O)NC(=O)C=C1 PGAVKCOVUIYSFO-UHFFFAOYSA-N 0.000 description 1
- 208000002223 abdominal aortic aneurysm Diseases 0.000 description 1
- 125000000641 acridinyl group Chemical class C1(=CC=CC2=NC3=CC=CC=C3C=C12)* 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 206010000891 acute myocardial infarction Diseases 0.000 description 1
- 238000013006 addition curing Methods 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- NLTUCYMLOPLUHL-KQYNXXCUSA-N adenosine 5'-[gamma-thio]triphosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=S)[C@@H](O)[C@H]1O NLTUCYMLOPLUHL-KQYNXXCUSA-N 0.000 description 1
- 208000020990 adrenal cortex carcinoma Diseases 0.000 description 1
- 208000007128 adrenocortical carcinoma Diseases 0.000 description 1
- 229910052783 alkali metal Inorganic materials 0.000 description 1
- 229910001514 alkali metal chloride Inorganic materials 0.000 description 1
- PNEYBMLMFCGWSK-UHFFFAOYSA-N aluminium oxide Inorganic materials [O-2].[O-2].[O-2].[Al+3].[Al+3] PNEYBMLMFCGWSK-UHFFFAOYSA-N 0.000 description 1
- 210000004381 amniotic fluid Anatomy 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 206010002026 amyotrophic lateral sclerosis Diseases 0.000 description 1
- 208000007474 aortic aneurysm Diseases 0.000 description 1
- 235000021016 apples Nutrition 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 206010003246 arthritis Diseases 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 238000000429 assembly Methods 0.000 description 1
- 230000000712 assembly Effects 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 125000004429 atom Chemical group 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 201000008937 atopic dermatitis Diseases 0.000 description 1
- 201000004562 autosomal dominant cerebellar ataxia Diseases 0.000 description 1
- 208000021033 autosomal dominant polycystic liver disease Diseases 0.000 description 1
- 235000021015 bananas Nutrition 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- VTJUKNSKBAOEHE-UHFFFAOYSA-N calixarene Chemical class COC(=O)COC1=C(CC=2C(=C(CC=3C(=C(C4)C=C(C=3)C(C)(C)C)OCC(=O)OC)C=C(C=2)C(C)(C)C)OCC(=O)OC)C=C(C(C)(C)C)C=C1CC1=C(OCC(=O)OC)C4=CC(C(C)(C)C)=C1 VTJUKNSKBAOEHE-UHFFFAOYSA-N 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 239000002041 carbon nanotube Substances 0.000 description 1
- 229910021393 carbon nanotube Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 230000002490 cerebral effect Effects 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 210000003467 cheek Anatomy 0.000 description 1
- 230000007882 cirrhosis Effects 0.000 description 1
- 208000019425 cirrhosis of liver Diseases 0.000 description 1
- 206010073251 clear cell renal cell carcinoma Diseases 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 230000001010 compromised effect Effects 0.000 description 1
- 208000028831 congenital heart disease Diseases 0.000 description 1
- 229910052593 corundum Inorganic materials 0.000 description 1
- 150000003983 crown ethers Chemical class 0.000 description 1
- 208000035250 cutaneous malignant susceptibility to 1 melanoma Diseases 0.000 description 1
- 229940097362 cyclodextrins Drugs 0.000 description 1
- 150000001945 cysteines Chemical class 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000003831 deregulation Effects 0.000 description 1
- MXHRCPNRJAMMIM-UHFFFAOYSA-N desoxyuridine Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 MXHRCPNRJAMMIM-UHFFFAOYSA-N 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 208000033679 diabetic kidney disease Diseases 0.000 description 1
- 238000003745 diagnosis Methods 0.000 description 1
- 235000014113 dietary fatty acids Nutrition 0.000 description 1
- 206010012818 diffuse large B-cell lymphoma Diseases 0.000 description 1
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 1
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 1
- 238000007598 dipping method Methods 0.000 description 1
- 208000016097 disease of metabolism Diseases 0.000 description 1
- 208000033420 disorder of pharynx Diseases 0.000 description 1
- 239000003651 drinking water Substances 0.000 description 1
- 235000020188 drinking water Nutrition 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 229920001971 elastomer Polymers 0.000 description 1
- 239000000806 elastomer Substances 0.000 description 1
- 201000003908 endometrial adenocarcinoma Diseases 0.000 description 1
- 208000016052 endometrial endometrioid adenocarcinoma Diseases 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 230000009144 enzymatic modification Effects 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 201000004101 esophageal cancer Diseases 0.000 description 1
- 208000007276 esophageal squamous cell carcinoma Diseases 0.000 description 1
- 201000006608 esophagus squamous cell carcinoma Diseases 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 229930195729 fatty acid Natural products 0.000 description 1
- 239000000194 fatty acid Substances 0.000 description 1
- 150000004665 fatty acids Chemical class 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 231100000852 glomerular disease Toxicity 0.000 description 1
- 206010061989 glomerulosclerosis Diseases 0.000 description 1
- 125000003827 glycol group Chemical group 0.000 description 1
- 235000021021 grapes Nutrition 0.000 description 1
- 201000010536 head and neck cancer Diseases 0.000 description 1
- 208000014829 head and neck neoplasm Diseases 0.000 description 1
- 230000010370 hearing loss Effects 0.000 description 1
- 231100000888 hearing loss Toxicity 0.000 description 1
- 208000016354 hearing loss disease Diseases 0.000 description 1
- 208000019622 heart disease Diseases 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 208000018060 hilar cholangiocarcinoma Diseases 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 201000006866 hypopharynx cancer Diseases 0.000 description 1
- 238000011065 in-situ storage Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 239000011147 inorganic material Substances 0.000 description 1
- 229920000592 inorganic polymer Polymers 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 206010022437 insomnia Diseases 0.000 description 1
- 239000011810 insulating material Substances 0.000 description 1
- 206010022498 insulinoma Diseases 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 239000002608 ionic liquid Substances 0.000 description 1
- 208000002551 irritable bowel syndrome Diseases 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 201000010982 kidney cancer Diseases 0.000 description 1
- 208000017169 kidney disease Diseases 0.000 description 1
- 238000009533 lab test Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 201000005264 laryngeal carcinoma Diseases 0.000 description 1
- 235000021374 legumes Nutrition 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000013554 lipid monolayer Substances 0.000 description 1
- 201000005249 lung adenocarcinoma Diseases 0.000 description 1
- 201000005202 lung cancer Diseases 0.000 description 1
- 201000003349 lung leiomyoma Diseases 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 208000006178 malignant mesothelioma Diseases 0.000 description 1
- 208000015486 malignant pancreatic neoplasm Diseases 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 206010027191 meningioma Diseases 0.000 description 1
- 208000030159 metabolic disease Diseases 0.000 description 1
- CXKWCBBOMKCUKX-UHFFFAOYSA-M methylene blue Chemical compound [Cl-].C1=CC(N(C)C)=CC2=[S+]C3=CC(N(C)C)=CC=C3N=C21 CXKWCBBOMKCUKX-UHFFFAOYSA-M 0.000 description 1
- 229960000907 methylthioninium chloride Drugs 0.000 description 1
- 238000004377 microelectronic Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 230000000877 morphologic effect Effects 0.000 description 1
- 210000003097 mucus Anatomy 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 201000005962 mycosis fungoides Diseases 0.000 description 1
- 206010028537 myelofibrosis Diseases 0.000 description 1
- 208000010125 myocardial infarction Diseases 0.000 description 1
- 208000037891 myocardial injury Diseases 0.000 description 1
- 201000011216 nasopharynx carcinoma Diseases 0.000 description 1
- 230000009826 neoplastic cell growth Effects 0.000 description 1
- 201000008383 nephritis Diseases 0.000 description 1
- 230000004770 neurodegeneration Effects 0.000 description 1
- 230000001293 nucleolytic effect Effects 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 125000003835 nucleoside group Chemical group 0.000 description 1
- 235000020824 obesity Nutrition 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 239000011368 organic material Substances 0.000 description 1
- 229920000620 organic polymer Polymers 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 238000010422 painting Methods 0.000 description 1
- 201000002528 pancreatic cancer Diseases 0.000 description 1
- 208000008443 pancreatic carcinoma Diseases 0.000 description 1
- 208000021255 pancreatic insulinoma Diseases 0.000 description 1
- 208000019906 panic disease Diseases 0.000 description 1
- 150000002972 pentoses Chemical class 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 208000021310 pituitary gland adenoma Diseases 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 208000030761 polycystic kidney disease Diseases 0.000 description 1
- 208000028589 polycystic liver disease Diseases 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 235000012015 potatoes Nutrition 0.000 description 1
- 208000003476 primary myelofibrosis Diseases 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 208000002815 pulmonary hypertension Diseases 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 201000009410 rhabdomyosarcoma Diseases 0.000 description 1
- 208000004124 rheumatic heart disease Diseases 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 235000009566 rice Nutrition 0.000 description 1
- 102220198221 rs1057519921 Human genes 0.000 description 1
- 102220278254 rs1554096653 Human genes 0.000 description 1
- 102220036433 rs35389822 Human genes 0.000 description 1
- 102200076325 rs5658 Human genes 0.000 description 1
- 102220188881 rs747642461 Human genes 0.000 description 1
- 102220158369 rs886047453 Human genes 0.000 description 1
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 description 1
- 201000000980 schizophrenia Diseases 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 239000013535 sea water Substances 0.000 description 1
- 208000007056 sickle cell anemia Diseases 0.000 description 1
- LIVNPJMFVYWSIS-UHFFFAOYSA-N silicon monoxide Inorganic materials [Si-]#[O+] LIVNPJMFVYWSIS-UHFFFAOYSA-N 0.000 description 1
- 229920002379 silicone rubber Polymers 0.000 description 1
- 239000004945 silicone rubber Substances 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 208000017520 skin disease Diseases 0.000 description 1
- 208000019116 sleep disease Diseases 0.000 description 1
- 238000010583 slow cooling Methods 0.000 description 1
- 208000000587 small cell lung carcinoma Diseases 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000012279 sodium borohydride Substances 0.000 description 1
- 229910000033 sodium borohydride Inorganic materials 0.000 description 1
- 208000002320 spinal muscular atrophy Diseases 0.000 description 1
- 206010041823 squamous cell carcinoma Diseases 0.000 description 1
- 150000003432 sterols Chemical class 0.000 description 1
- 235000003702 sterols Nutrition 0.000 description 1
- 208000003265 stomatitis Diseases 0.000 description 1
- 238000001847 surface plasmon resonance imaging Methods 0.000 description 1
- 235000019527 sweetened beverage Nutrition 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 208000001608 teratocarcinoma Diseases 0.000 description 1
- 208000002918 testicular germ cell tumor Diseases 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 201000002510 thyroid cancer Diseases 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 201000002743 tongue squamous cell carcinoma Diseases 0.000 description 1
- 229920000428 triblock copolymer Polymers 0.000 description 1
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 1
- 239000000107 tumor biomarker Substances 0.000 description 1
- 208000001072 type 2 diabetes mellitus Diseases 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 201000005112 urinary bladder cancer Diseases 0.000 description 1
- 208000019553 vascular disease Diseases 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 208000005925 vesicular stomatitis Diseases 0.000 description 1
- 229910001845 yogo sapphire Inorganic materials 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N27/00—Investigating or analysing materials by the use of electric, electrochemical, or magnetic means
- G01N27/26—Investigating or analysing materials by the use of electric, electrochemical, or magnetic means by investigating electrochemical variables; by using electrolysis or electrophoresis
- G01N27/416—Systems
- G01N27/4166—Systems measuring a particular property of an electrolyte
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/118—Prognosis of disease development
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/158—Expression markers
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/178—Oligonucleotides characterized by their use miRNA, siRNA or ncRNA
Definitions
- the invention relates to a new method of characterising a target ribonucleic acid (RNA) involving forming a complementary polynucleotide.
- the method uses a transmembrane pore.
- Nanopores Transmembrane pores have great potential as direct, electrical biosensors for polymers and a variety of small molecules.
- recent focus has been given to nanopores as a potential DNA sequencing technology.
- Nanopore detection of the nucleotide gives a current change of known signature and duration.
- Strand sequencing method a single polynucleotide strand is passed through the pore and the identity of the nucleotides are derived. Strand sequencing can involve the use of a nucleotide handling protein, such as a helicase, to control the movement of the polynucleotide through the pore.
- miRNAs are highly stable RNA oligomers, which can regulate protein production post-transcriptionally. They act by one of two mechanisms. In plants, miRNAs have been shown to act chiefly by directing the cleavage of messenger RNA, whereas in animals, gene regulation by miRNAs typically involves hybridisation of miRNAs to the 3′ UTRs of messenger RNAs, which hinders translation (Lee et al., Cell 75, 843-54 (1993); Wightman et al., Cell 75, 855-62 (1993); and Esquela-Kerscher et al., Cancer 6, 259-69 (2006)). miRNAs frequently bind to their targets with imperfect complementarity. They have been predicted to bind to as many as 200 gene targets each and to regulate more than a third of all human genes (Lewis et al., Cell 120, 15-20 (2005)).
- microRNAs The expression level of certain microRNAs is known to change in tumours, giving different tumour types characteristic patterns of microRNA expression (Rosenfeld, N. et al., Nature Biotechnology 26, 462-9 (2008)).
- miRNA profiles have been shown to be able to reveal the stage of tumour development with greater accuracy than messenger RNA profiles (Lu et al., Nature 435, 834-8 (2005) and Barshack et al., The International Journal of Biochemistry & Cell Biology 42, 1355-62 (2010)).
- the inventors have surprisingly demonstrated that it is possible to characterise a target RNA by forming a complementary polynucleotide from the target RNA and then characterising the complementary polynucleotide using a transmembrane pore.
- the invention therefore provides a method of characterising a target RNA, comprising:
- the invention also provides:
- FIG. 1 shows the sample preparation procedure outlined in Example 1 and 2.
- a sample of mRNA (shown in step A, labelled X) is annealed to a capture strand (shown in step B, labelled Y).
- the capture strand anneals to the mRNA at the polyA region (labelled 1, this region can vary in length depending on the mRNA).
- a reverse transcriptase enzyme forms the complementary cDNA strand (shown in step C as a dotted line) to the mRNA.
- the tether shown in step D, labelled Z) then anneals to the cDNA.
- FIG. 2 shows the nanopore system used in Example 2 and 3 to characterise cDNA.
- the cDNA/mRNA (cDNA labelled 1, mRNA labelled 2) are tethered to the bilayer (labelled 3) by a short strand of DNA with a 3′ cholesterol tether (labelled 4).
- the leader sequence of the cDNA allows the enzyme (labelled 5) to bind to the cDNA but the iSpC3 spacers (shown as a square and labelled 6) stall the enzyme on the DNA until the DNA enters the nanopore (labelled 7).
- the enzyme moves along the cDNA, controlling the movement through the nanopore.
- the mRNA dehybridises from the complementary cDNA as the enzyme moves along the cDNA.
- the direction of movement of the enzyme is indicated by the arrow labelled 8 and the direction of movement of the cDNA is indicated by the arrow labelled 9.
- a helicase T4 Dda E94C/A360C (SEQ
- SEQ ID NO: 1 shows the codon optimised polynucleotide sequence encoding the MS-B1 mutant MspA monomer. This mutant lacks the signal sequence and includes the following mutations: D90N, D91N, D93N, D118R, D134R and E139K.
- SEQ ID NO: 2 shows the amino acid sequence of the mature form of the MS-B1 mutant of the MspA monomer. This mutant lacks the signal sequence and includes the following mutations: D90N, D91N, D93N, D118R, D134R and E139K.
- SEQ ID NO: 3 shows the polynucleotide sequence encoding one monomer of ⁇ -hemolysin-E111N/K147N ( ⁇ -HL-NN; Stoddart et al., PNAS, 2009; 106(19): 7702-7707).
- SEQ ID NO: 4 shows the amino acid sequence of one monomer of ⁇ -HL-NN.
- SEQ ID Nos: 5 to 7 show the amino acid sequences of MspB, C and D.
- SEQ ID NO: 8 shows the polynucleotide sequence of the messenger RNA used in Examples 1 and 2.
- SEQ ID NO: 9 shows part of the polynucleotide sequence which makes up the primer used in Example 1.
- the 3′ end of SEQ ID NO: 9 is attached by four iSpC3 spacers to the 5′ end of SEQ ID NO: 10.
- SEQ ID NO: 10 shows part of the polynucleotide sequence which makes up the primer used in Example 1.
- the 5′ end of SEQ ID NO: 10 is attached by four iSpC3 spacers to the 3′ end of SEQ ID NO: 9.
- SEQ ID NO: 11 shows the polynucleotide sequence of the cDNA transcribed from SEQ ID NO: 8 which is attached at its 5′ end to the 3′ end of the primer sequence (SEQ ID NO: 10 which is attached by four iSpC3 spacers to the 3′ end of SEQ ID NO: 9).
- SEQ ID NO: 12 shows the polynucleotide sequence of the strand used to tether the cDNA/mRNA in Examples 2 and 3. Attached to the 3′ end of SEQ ID NO: 11 is six iSp18 spacers which are attached to two thymine residues and a 3′ cholesterol TEG.
- SEQ ID NO: 13 shows the amino acid sequence of T4 Dda helicase.
- a target RNA includes two or more target RNAs
- a complementary polynucleotide includes two or more such complementary polynucleotides
- reference to “a transmembrane protein pore” includes two or more such pores, and the like.
- the invention provides a method of characterising a target ribonucleic acid (RNA).
- RNA ribonucleic acid
- a complementary polynucleotide is formed from the target RNA and the complementary polynucleotide is characterised using a transmembrane pore. This allows characterisation of the target RNA.
- the target RNA is preferably not ligated to a non-RNA leader, such as a DNA leader.
- the method of the invention is straightforward and simple. Since the transmembrane pore is capable of detecting single molecule of the complementary polynucleotide, there is no need for amplification of the target RNA or complementary polynucleotide.
- the method typically does not comprise polymerase chain reaction (PCR) or reverse transcription PCR (RT-PCR). This considerably reduces the amount of workflow needed to characterise a target RNA. It also avoids any biases and artifacts introduced by PCR.
- PCR polymerase chain reaction
- RT-PCR reverse transcription PCR
- RNA is a macromolecule comprising two or more ribonucleotides.
- the target RNA may comprise any combination of any ribonucleotides.
- the ribonucleotides can be naturally occurring or artificial.
- One or more ribonucleotides in the target RNA can be oxidized or methylated.
- One or more ribonucleotides in the target RNA may be damaged.
- the target RNA may comprise a pyrimidine dimer, such as a uracil dimer. Such dimers are typically associated with damage by ultraviolet light and are the primary cause of skin melanomas.
- One or more ribonucleotides in the target RNA may be modified, for instance with a label or a tag. Suitable labels are described below.
- the target RNA may comprise one or more spacers.
- a ribonucleotide typically contains a nucleobase, a ribose sugar and at least one phosphate group.
- the nucleobase is typically heterocyclic. Nucleobases include, but are not limited to, purines and pyrimidines and more specifically adenine, guanine, thymine, uracil and cytosine.
- the nucleotide typically contains a monophosphate, diphosphate or triphosphate. Phosphates may be attached on the 5′ or 3′ side of a nucleotide.
- Ribonucleotides include, but are not limited to, adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), cytidine monophosphate (CMP), 5-methylcytidine monophosphate, 5-methylcytidine diphosphate, 5-methylcytidine triphosphate, 5-hydroxymethylcytidine monophosphate, 5-hydroxymethylcytidine diphosphate and 5-hydroxymethylcytidine triphosphate.
- the nucleotides are preferably selected from AMP, IMP, GMP, CMP and UMP.
- a ribonucleotide may be abasic (i.e. lack a nucleobase).
- a ribonucleotide may also lack a nucleobase and a sugar (i.e. is a C3 spacer).
- the ribonucleotides in the target RNA may be attached to each other in any manner.
- the ribonucleotides are typically attached by their sugar and phosphate groups as in nucleic acids.
- the ribonucleotides may be connected via their nucleobases as in pyrimidine dimers.
- the target RNA may be single stranded or double stranded.
- the target RNA is preferably messenger RNA (mRNA).
- mRNA messenger RNA
- the target mRNA may be an alternate splice variant. Altered amounts (or levels) of mRNA and/or alternate mRNA splice variants may be associated with diseases or conditions.
- the target RNA is preferably a microRNA (or miRNA).
- Suitable miRNAs for use in the invention are well known in the art.
- suitable miRNAs are stored on publically available databases (Jiang Q., Wang Y., Hao Y., Juan L., Teng M., Zhang X., Li M., Wang G., Liu Y., (2009) miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res.).
- the use of mRNAs and miRNAs to diagnose or prognose diseases or conditions are discussed in more detail below.
- the whole or only part of the target RNA may be characterised using this method.
- the target RNA can be any length.
- the RNA can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 ribonucleotides in length.
- the target RNA can be 1000 or more ribonucleotides, 5000 or more ribonucleotides in length or 100000 or more ribonucleotides in length.
- the target RNA is typically present in or derived from any suitable sample.
- the invention is typically carried out on a sample that is known to contain or suspected to contain the target RNA.
- the invention may be carried out on a sample to confirm the identity of one or more target RNAs whose presence in the sample is known or expected.
- the sample may be a biological sample.
- the invention may be carried out in vitro on a sample obtained from or extracted from any organism or microorganism.
- the organism or microorganism is typically archaeal, prokaryotic or eukaryotic and typically belongs to one of the five kingdoms: plantae, animalia, fungi, monera and protista.
- the target RNA is preferably eukaryotic.
- the target RNA may be derived from a eukaryotic cell or may be derived from a virus using a eukaryotic cell's transcription machinery.
- the invention may be carried out in vitro on a sample obtained from or extracted from any virus.
- the sample is preferably a fluid sample.
- the sample typically comprises a body fluid of the patient.
- the sample may be urine, lymph, saliva, mucus or amniotic fluid but is preferably blood, plasma or serum.
- the sample is human in origin, but alternatively it may be from another mammal animal such as from commercially farmed animals such as horses, cattle, sheep or pigs or may alternatively be pets such as cats or dogs.
- a sample of plant origin is typically obtained from a commercial crop, such as a cereal, legume, fruit or vegetable, for example wheat, barley, oats, canola, maize, soya, rice, bananas, apples, tomatoes, potatoes, grapes, tobacco, beans, lentils, sugar cane, cocoa or cotton.
- the sample may be a non-biological sample.
- the non-biological sample is preferably a fluid sample.
- Examples of a non-biological sample include surgical fluids, water such as drinking water, sea water or river water, and reagents for laboratory tests.
- the sample is typically processed prior to being assayed, for example by centrifugation or by passage through a membrane that filters out unwanted molecules or cells, such as red blood cells.
- the sample may be measured immediately upon being taken.
- the sample may also be typically stored prior to assay, preferably below ⁇ 70° C.
- the target RNA is typically extracted from the sample before it is used in the method of the invention. RNA extraction kits are commercially available from, for instance, New England Riolabs® and Invitrogen®.
- the target RNA is typically not amplified in the method of the invention.
- the method typically does not comprise making multiple copies of the target RNA.
- the complementary polynucleotide is typically not amplified in the method of the invention.
- the method typically does not comprise making multiple copies of the complementary polynucleotide.
- the method preferably does not comprise polymerase chain reaction (PCR) or reverse transcription PCR (RT-PCR).
- PCR polymerase chain reaction
- RT-PCR reverse transcription PCR
- the method of the invention comprises forming a complementary polynucleotide from the target RNA.
- the polynucleotide may be complementary to part of or all of the target RNA. If the polynucleotide is complementary to part of the target RNA, it is typically complementary to a sufficient amount of the target RNA that it may be characterised in accordance with the invention.
- the polynucleotide is typically complementary based on the pairing of its nucleobases, typically adenine (A), guanine (G), thymine (T) and cytosine (C), with their RNA base counterparts, typically uracil (U), cytosine (C), adenine (A) and guanine (G) respectively.
- nucleobases typically adenine (A), guanine (G), thymine (T) and cytosine (C)
- U uracil
- C adenine
- A adenine
- G guanine
- T thymine
- C cytosine
- a polynucleotide such as a nucleic acid
- the polynucleotide or nucleic acid may comprise any combination of any nucleotides. At least a portion of the polynucleotide is complementary to all of or part of the target RNA.
- the nucleotides can be naturally occurring or artificial.
- One or more nucleotides in the polynucleotide can be oxidized or methylated.
- One or more nucleotides in the polynucleotide may be damaged.
- the polynucleotide may comprise a pyrimidine dimer. Such dimers are typically associated with damage by ultraviolet light and are the primary cause of skin melanomas.
- One or more nucleotides in the polynucleotide may be modified, for instance with a label or a tag. Suitable labels are described below.
- the polynucleotide may comprise one or more spacers.
- a nucleotide typically contains a nucleobase, a sugar and at least one phosphate group.
- the nucleobase and sugar form a nucleoside.
- the nucleobase is typically heterocyclic. Nucleobases include, but are not limited to, purines and pyrimidines and more specifically adenine (A), guanine (G), thymine (T), uracil (U) and cytosine (C).
- A adenine
- G guanine
- T thymine
- U uracil
- C cytosine
- the sugar is typically a pentose sugar.
- Nucleotide sugars include, but are not limited to, ribose and deoxyribose.
- the sugar is preferably a deoxyribose.
- the polynucleotide preferably comprises the following nucleosides: deoxyadenosine (dA), deoxyuridine (dU) and/or thymidine (dT), deoxyguanosine (dG) and deoxycytidine (dC).
- the nucleotide in the polynucleotide is typically a ribonucleotide or deoxyribonucleotide.
- the nucleotide is preferably a deoxyribonucleotide.
- the nucleotide typically contains a monophosphate, diphosphate or triphosphate. Phosphates may be attached on the 5′ or 3′ side of a nucleotide.
- Nucleotides for use in the polynucleotides of the invention include, but are not limited to, adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), 5-methylcytidine monophosphate, 5-hydroxymethylcytidine monophosphate, cytidine monophosphate (CMP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyguanosine monophosphate (dGMP), deoxythymidine monophosphate (dTMP), deoxyuridine monophosphate (dUMP) and deoxycytidine monophosphate (dCMP).
- AMP adenosine monophosphate
- GFP guanosine monophosphate
- TMP thymidine monophosphate
- UMP uridine monophosphate
- the nucleotides are preferably selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP, dCMP and dUMP.
- the nucleotides are most preferably selected from dAMP, dTMP, dGMP, dCMP and dUMP.
- the polynucleotide preferably comprises the following nucleotides: dAMP, dUMP and/or dTMP, dGMP and dCMP.
- a nucleotide may be abasic (i.e. lack a nucleobase).
- a nucleotide may also lack a nucleobase and a sugar (i.e. is a C3 spacer).
- the nucleotides in the polynucleotide may be attached to each other in any manner.
- the nucleotides are typically attached by their sugar and phosphate groups as in nucleic acids.
- the nucleotides may be connected via their nucleobases as in pyrimidine dimers.
- the polynucleotide is typically single stranded.
- the polynucleotide can be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA).
- the polynucleotide may be any synthetic nucleic acid known in the art, such as peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA) or other synthetic polymers with nucleotide side chains.
- PNA peptide nucleic acid
- GMA glycerol nucleic acid
- TAA threose nucleic acid
- LNA locked nucleic acid
- the PNA backbone is composed of repeating N-(2-aminoethyl)-glycine units linked by peptide bonds.
- the GNA backbone is composed of repeating glycol units linked by phosphodiester bonds.
- the TNA backbone is composed of repeating threose sugars linked together by phosphodiester bonds.
- LNA is formed from ribonucleotides as discussed above having an extra bridge connecting the 2′ oxygen and 4′ carbon in the ribose moiety.
- the complementary polynucleotide is most preferably complementary deoxyribonucleic acid (cDNA).
- cDNA complementary deoxyribonucleic acid
- the complementary polynucleotide may be any length.
- the complementary polynucleotide is typically the same length as the target RNA.
- the complementary polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 deoxyribonucleotides in length.
- the complementary polynucleotide can be 1000 or more deoxyribonucleotides, 5000 or more deoxyribonucleotides in length or 100000 or more deoxyribonucleotides in length.
- the complementary polynucleotide may be formed from the target RNA using any known method. Enzymes which convert RNA to complementary nucleic acids such as those described above are known in the art.
- the method comprises reverse transcribing the target RNA to form a cDNA.
- Step (a) preferably comprising reverse transcribing the target RNA using a reverse transcriptase to form the cDNA.
- the reverse transcriptase may reverse transcribe all or part of the available target RNA.
- Reverse transcriptases are enzymes which are capable of catalysing the formation of cDNA from a RNA template. They are commercially available from, for instance, New England Biolabs® and Invitrogen®.
- the target RNA is typically contacted with the reverse transcriptase in the presence of a population of deoxyribonucleotides as defined above.
- the population typically comprises all of the deoxyribonucleotides needed to base pair with each of the ribonucleotides in the target RNA.
- the population of deoxyribonucleotides typically comprises dAMP, dTMP, dGMP and dCMP.
- Step (a) preferably comprises hybridising a primer to the target RNA and using the primer to form the complementary polynucleotide.
- the primer typically assists with conversion of the target RNA to the complementary polynucleotide.
- the double stranded region formed by hybridisation of the primer to the target RNA may provide a binding site for a reverse transcriptase.
- the reverse transcriptase may then reverse transcribe the remainder of the target RNA to form cDNA.
- the complementary polynucleotide, such as cDNA, produced in step (a) is typically attached to the primer.
- the primer may comprise a bridging moiety, such as a hairpin loop, as discussed below.
- Using a primer has various advantages. It avoids the need to amplify the target RNA using PCR. This reduces the amount of workflow that needs to be carried out and avoids any biases and artifacts introduced by PCR. Since the primer can be designed to bind at a specific end of the target RNA (see below), the complementary polynucleotide can be formed in a specific direction and the complementary polynucleotide can be moved through the pore is a known direction. This facilitates the chracterisation of the target RNA.
- the primer is typically a polynucleotide.
- the polynucleotide may be any of those discussed above.
- the primer preferably comprises a leader sequence and/or a region to which a polynucleotide binding protein is capable of binding.
- the leader sequence facilitates the method of the invention.
- the leader sequence is designed to preferentially thread into the transmembrane pore and thereby facilitate the movement of the complementary polynucleotide through the pore.
- the leader sequence is typically a polynucleotide, such as DNA or RNA, a modified polynucleotide (such as abasic DNA), PNA, LNA, PEG or a polypeptide.
- the leader is preferably a polynucleotide and is more preferably a single stranded polynucleotide.
- the leader sequence can be any of the polynucleotides discussed above.
- the single stranded leader sequence is most preferably a single strand of DNA.
- the leader sequence can be any length, but is typically 27 to 150 nucleotides in length, such as from 50 to 150
- the region to which a polynucleotide binding protein is capable of binding is typically a polynucleotide. It can be any of the polynucleotides discussed above.
- the region may correspond to the leader sequence. Alternatively, the region may be distinct from the leader sequence.
- the polynucleotide binding protein may help to control the movement of the complementary polynucleotide through the pore as discussed in more detail below.
- the target RNA is preferably eukaryotic.
- Eukaryotic RNA typically comprises polyA tail, i.e. a stretch of consecutive adenosine monophosphates.
- the polyA tail is typically at the 3′ end of the RNA.
- step (a) preferably comprises hybridising a primer to the polyA tail of the target RNA and using the primer to reverse transcribe the target RNA to form the complementary polynucleotide.
- the primer preferably comprises a polyT region, i.e. region containing only nucleotides based on thymine.
- the polyT region may contain IMP or dTMP.
- the polyT region may be any length, such as at least 10, at least 15, at least 20, at least 25 or more.
- the primer is preferably a polyT-VN primer, which comprises a polyT region and a VN anchor where V is dAMP, dCMP or dGMP and N is dAMP, dCMP, dGMP or dTMP.
- V is dAMP, dCMP or dGMP
- N is dAMP, dCMP, dGMP or dTMP.
- primers are commercially available, such as from New England Biolabs®.
- step (a) further comprises adding a polyA tail to the target RNA, for instance using a polyA polymerase and ATP.
- Step (a) may further comprise hybridising a primer to the added polyA tail as described above.
- the method of the invention also comprises (b) contacting the complementary polynucleotide with a transmembrane pore.
- the method also comprises (c) taking one or more measurements as the complementary polynucleotide moves with respect to the pore wherein the measurements are indicative of one or more characteristics of the complementary polynucleotide and thereby characterising the target RNA.
- Steps (b) and (c) are preferably carried out with a potential applied across the pore.
- the applied potential may be a voltage potential.
- the applied potential may be a chemical potential.
- An example of this is using a salt gradient across an amphiphilic layer. A salt gradient is disclosed in Holden et al., J Am Chem Soc. 2007 Jul. 11; 129(27):8650-5.
- the current passing through the pore as the polynucleotide moves with respect to the pore is used to determine the sequence of the complementary polynucleotide and hence the sequence of the target RNA. This is Strand Sequencing.
- step (a) preferably further comprises removing the target RNA, for instance by digesting the target RNA.
- Step (a) may further comprise contacting the target RNA with RNAse H. This enzyme specifically digests the RNA strand of RNA:DNA duplexes.
- a transmembrane pore is a structure that crosses the membrane to some degree. It permits hydrated ions driven by an applied potential to flow across or within the membrane.
- the transmembrane pore typically crosses the entire membrane so that hydrated ions may flow from one side of the membrane to the other side of the membrane.
- the transmembrane pore does not have to cross the membrane. It may be closed at one end.
- the pore may be a well in the membrane along which or into which hydrated ions may flow.
- transmembrane pore may be used in the invention.
- the pore may be biological or artificial.
- Suitable pores include, but are not limited to, protein pores, polynucleotide pores and solid state pores.
- Any membrane may be used in accordance with the invention. Suitable membranes are well-known in the art.
- the membrane is preferably an amphiphilic layer.
- An amphiphilic layer is a layer formed from amphiphilic molecules, such as phospholipids, which have both at least one hydrophilic portion and at least one lipophilic or hydrophobic portion.
- the amphiphilic molecules may be synthetic or naturally occurring. Non-naturally occurring amphiphiles and amphiphiles which form a monolayer are known in the art and include, for example, block copolymers (Gonzalez-Perez et al., Langmuir, 2009, 25, 10447-10450).
- Block copolymers are polymeric materials in which two or more monomer sub-units that are polymerized together to create a single polymer chain. Block copolymers typically have properties that are contributed by each monomer sub-unit. However, a block copolymer may have unique properties that polymers formed from the individual sub-units do not possess. Block copolymers can be engineered such that one of the monomer sub-units is hydrophobic (i.e. lipophilic), whilst the other sub-unit(s) are hydrophilic whilst in aqueous media. In this case, the block copolymer may possess amphiphilic properties and may form a structure that mimics a biological membrane.
- the block copolymer may be a diblock (consisting of two monomer sub-units), but may also be constructed from more than two monomer sub-units to form more complex arrangements that behave as amphipiles.
- the copolymer may be a triblock, tetrablock or pentablock copolymer.
- the amphiphilic layer may be a monolayer or a bilayer.
- the amphiphilic layer is typically a planar lipid bilayer or a supported bilayer.
- the amphiphilic layer is typically a lipid bilayer.
- Lipid bilayers are models of cell membranes and serve as excellent platforms for a range of experimental studies.
- lipid bilayers can be used for in vitro investigation of membrane proteins by single-channel recording.
- lipid bilayers can be used as biosensors to detect the presence of a range of substances.
- the lipid bilayer may be any lipid bilayer. Suitable lipid bilayers include, but are not limited to, a planar lipid bilayer, a supported bilayer or a liposome.
- the lipid bilayer is preferably a planar lipid bilayer. Suitable lipid bilayers are disclosed in International Application No.
- PCT/GB08/000563 (published as WO 2008/102121), International Application No. PCT/GB08/004127 (published as WO 2009/077734) and International Application No. PCT/GB2006/001057 (published as WO 2006/100484).
- Lipid bilayers are known in the art. Suitable methods are disclosed in the Examples. Lipid bilayers are commonly formed by the method of Montal and Mueller (Proc. Natl. Acad. Sci. USA., 1972; 69: 3561-3566), in which a lipid monolayer is carried on aqueous solution/air interface past either side of an aperture which is perpendicular to that interface.
- Montal & Mueller The method of Montal & Mueller is popular because it is a cost-effective and relatively straightforward method of forming good quality lipid bilayers that are suitable for protein pore insertion.
- Other common methods of bilayer formation include tip-dipping, painting bilayers and patch-clamping of liposome bilayers.
- the lipid bilayer is formed as described in International Application No. PCT/GB08/004127 (published as WO 2009/077734).
- the membrane is a solid state layer.
- a solid-state layer is not of biological origin.
- a solid state layer is not derived from or isolated from a biological environment such as an organism or cell, or a synthetically manufactured version of a biologically available structure.
- Solid state layers can be formed from both organic and inorganic materials including, but not limited to, microelectronic materials, insulating materials such as Si 3 N 4 , Al 2 O 3 , and SiO, organic and inorganic polymers such as polyamide, plastics such as Teflon® or elastomers such as two-component addition-cure silicone rubber, and glasses.
- the solid state layer may be formed from monatomic layers, such as graphene, or layers that are only a few atoms thick. Suitable graphene layers are disclosed in International Application No. PCT/US2008/010637 (published as WO 2009/035647).
- the method is typically carried out using (i) an artificial amphiphilic layer comprising a pore, (ii) an isolated, naturally-occurring lipid bilayer comprising a pore, or (iii) a cell having a pore inserted therein.
- the method is typically carried out using an artificial amphiphilic layer, such as an artificial lipid bilayer.
- the layer may comprise other transmembrane and/or intramembrane proteins as well as other molecules in addition to the pore. Suitable apparatus and conditions are discussed below.
- the method of the invention is typically carried out in vitro.
- the complementary polynucleotide is preferably coupled to the membrane. This may be done using any known method.
- the complementary polynucleotide is preferably coupled to the membrane comprising the transmembrane pore.
- the method may comprise coupling the complementary polynucleotide to the membrane comprising the transmembrane pore.
- the polynucleotide is preferably coupled to the membrane using one or more anchors.
- the polynucleotide may be coupled to the membrane using any known method.
- Each anchor comprises a group which couples (or binds) to the polynucleotide and a group which couples (or binds) to the membrane.
- Each anchor may covalently couple (or bind) to the polynucleotide and/or the membrane. If a Y adaptor and/or a hairpin loop adaptors are used, the polynucleotide is preferably coupled to the membrane using the adaptor(s).
- the polynucleotide may be coupled to the membrane using any number of anchors, such as 2, 3, 4 or more anchors.
- a polynucleotide may be coupled to the membrane using two anchors each of which separately couples (or binds) to both the polynucleotide and membrane.
- the one or more anchors may comprise the one or more helicases and/or the one or more molecular brakes discussed below.
- the complementary polynucleotide is preferably coupled to the membrane via a polypeptide present in the membrane or a hydrophobic anchor present in the membrane.
- the hydrophobic anchor is preferably a lipid, fatty acid, sterol, carbon nanotube or amino acid.
- the complementary polynucleotide may be coupled directly to the membrane. It may be coupled to the membrane using any of the ways disclosed in International Application Number No. PCT/GB2012/051191 (published as WO 2012/164270).
- the complementary polynucleotide is preferably coupled to the membrane via a linker.
- Preferred linkers include, but are not limited to, polymers, such as polynucleotides, polyethylene glycols (PEGs) and polypeptides. If a complementary polynucleotide is coupled directly to the membrane, then some data will be lost as the characterising run cannot continue to the end of the complementary polynucleotide due to the distance between the membrane and the pore and/or polynucleotide binding protein.
- the complementary polynucleotide can be processed to completion. If a linker is used, the linker may be attached to the complementary polynucleotide at any position. The linker is typically attached to the complementary polynucleotide at the tail polymer.
- the coupling may be stable or transient.
- the transient nature of the coupling is preferred. If a stable coupling molecule were attached directly to either the 5′ or 3′ end of a complementary polynucleotide, then some data will be lost as the characterising run cannot continue to the end of the complementary polynucleotide due to the distance between the membrane and the pore and/or polynucleotide binding protein. If the coupling is transient, then when the coupled end randomly becomes free of the membrane, then the complementary polynucleotide can be processed to completion. Chemical groups that form stable or transient links with the membrane are discussed in more detail below.
- the complementary polynucleotide may be transiently coupled to an amphiphilic layer, such as a lipid bilayer using cholesterol or a fatty acyl chain.
- an amphiphilic layer such as a lipid bilayer using cholesterol or a fatty acyl chain.
- Any fatty acyl chain having a length of from 6 to 30 carbon atoms, such as hexadecanoic acid, may be used.
- the complementary polynucleotide is coupled to an amphiphilic layer. Coupling of polynucleotides to synthetic lipid bilayers has been carried out previously with various different tethering strategies. These are summarised in Table 1 below.
- Complementary polynucleotides may be functionalized using a modified phosphoramidite in the synthesis reaction, which is easily compatible for the addition of reactive groups, such as thiol, cholesterol, lipid and biotin groups.
- reactive groups such as thiol, cholesterol, lipid and biotin groups.
- These different attachment chemistries give a suite of attachment options for complementary polynucleotides.
- Each different modification group tethers the complementary polynucleotide in a slightly different way and coupling is not always permanent so giving different dwell times for the complementary polynucleotide to the membrane. The advantages of transient coupling are discussed above.
- Coupling of complementary polynucleotides can also be achieved by a number of other means provided that a reactive group can be added to the complementary polynucleotide.
- a reactive group can be added to the complementary polynucleotide.
- a thiol group can be added to the 5′ of ssDNA using polynucleotide kinase and ATP ⁇ S (Grant, G. P. and P. Z. Qin (2007). “A facile method for attaching nitroxide spin labels at the 5′ terminus of nucleic acids.” Nucleic Acids Res 35(10): e77).
- the reactive group could be considered to be a short region in the polynucleotide complementary to one already coupled to the membrane, so that attachment can be achieved via hybridisation.
- the region could be part of the complementary polynucleotide or ligated to it. Ligation of short pieces of ssDNA have been reported using T4 RNA ligase I (Troutt, A. B., M. G. McHeyzer-Williams, et al. (1992). “Ligation-anchored PCR: a simple amplification technique with single-sided specificity.” Proc Natl Acad Sci USA 89(20): 9823-5).
- the coupling chemistry can be incorporated during the formation of the complementary polynucleotide from the target RNA.
- the complementary polynucleotide can be synthesized using a primer with a reactive group attached to it.
- the complementary polynucleotide is coupled to the membrane using a cholesterol-tagged polynucleotide which hybridises to the complementary polynucleotide or primer attached thereto.
- the transmembrane pore is preferably a transmembrane protein pore.
- a transmembrane protein pore is a polypeptide or a collection of polypeptides that permits hydrated ions, such as analyte, to flow from one side of a membrane to the other side of the membrane.
- the transmembrane protein pore is capable of forming a pore that permits hydrated ions driven by an applied potential to flow from one side of the membrane to the other.
- the transmembrane protein pore preferably permits analyte such as nucleotides to flow from one side of the membrane, such as a lipid bilayer, to the other.
- the transmembrane protein pore allows a polynucleotide or nucleic acid, such as DNA or RNA, to be moved through the pore.
- the transmembrane protein pore may be a monomer or an oligomer.
- the pore is preferably made up of several repeating subunits, such as 6, 7, 8 or 9 subunits.
- the pore is preferably a hexameric, heptameric, octameric or nonameric pore.
- the transmembrane protein pore typically comprises a barrel or channel through which the ions may flow.
- the subunits of the pore typically surround a central axis and contribute strands to a transmembrane ⁇ barrel or channel or a transmembrane ⁇ -helix bundle or channel.
- the barrel or channel of the transmembrane protein pore typically comprises amino acids that facilitate interaction with analyte, such as nucleotides, polynucleotides or nucleic acids. These amino acids are preferably located near a constriction of the barrel or channel.
- the transmembrane protein pore typically comprises one or more positively charged amino acids, such as arginine, lysine or histidine, or aromatic amino acids, such as tyrosine or tryptophan. These amino acids typically facilitate the interaction between the pore and nucleotides, polynucleotides or nucleic acids.
- Transmembrane protein pores for use in accordance with the invention can be derived from ⁇ -barrel pores or ⁇ -helix bundle pores.
- ⁇ -barrel pores comprise a barrel or channel that is formed from ⁇ -strands.
- Suitable ⁇ -barrel pores include, but are not limited to, ⁇ -toxins, such as ⁇ -hemolysin, anthrax toxin and leukocidins, and outer membrane proteins/porins of bacteria, such as Mycobacterium smegmatis porin (Msp), for example MspA, MspB, MspC or MspD, outer membrane porin F (OmpF), outer membrane porin G (OmpG), outer membrane phospholipase A and Neisseria autotransporter lipoprotein (NalP).
- Msp Mycobacterium smegmatis porin
- Msp Mycobacterium smegmatis porin
- Msp Mycobacterium sme
- ⁇ -helix bundle pores comprise a barrel or channel that is formed from ⁇ -helices.
- Suitable ⁇ -helix bundle pores include, but are not limited to, inner membrane proteins and a outer membrane proteins, such as WZA and ClyA toxin.
- the transmembrane pore may be derived from Msp or from ⁇ -hemolysin ( ⁇ -HL).
- the transmembrane protein pore is preferably derived from Msp, preferably from MspA. Such a pore will be oligomeric and typically comprises 7, 8, 9 or 10 monomers derived from Msp.
- the pore may be a homo-oligomeric pore derived from Msp comprising identical monomers. Alternatively, the pore may be a hetero-oligomeric pore derived from Msp comprising at least one monomer that differs from the others.
- the pore is derived from MspA or a homolog or paralog thereof.
- a monomer derived from Msp typically comprises the sequence shown in SEQ ID NO: 2 or a variant thereof.
- SEQ ID NO: 2 is the MS-(B1)8 mutant of the MspA monomer. It includes the following mutations: D90N, D91N, D93N, D118R, D134R and E139K.
- a variant of SEQ ID NO: 2 is a polypeptide that has an amino acid sequence which varies from that of SEQ ID NO: 2 and which retains its ability to form a pore. The ability of a variant to form a pore can be assayed using any method known in the art.
- the variant may be inserted into an amphiphilic layer along with other appropriate subunits and its ability to oligomerise to form a pore may be determined.
- Methods are known in the art for inserting subunits into membranes, such as amphiphilic layers.
- subunits may be suspended in a purified form in a solution containing a lipid bilayer such that it diffuses to the lipid bilayer and is inserted by binding to the lipid bilayer and assembling into a functional state.
- subunits may be directly inserted into the membrane using the “pick and place” method described in M. A. Holden, H. Bayley. J. Am. Chem. Soc. 2005, 127, 6502-6503 and International Application No. PCT/GB2006/001057 (published as WO 2006/100484).
- a variant will preferably be at least 50% homologous to that sequence based on amino acid identity. More preferably, the variant may be at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90% and more preferably at least 95%, 97% or 99% homologous based on amino acid identity to the amino acid sequence of SEQ ID NO: 2 over the entire sequence. There may be at least 80%, for example at least 85%, 90% or 95%, amino acid identity over a stretch of 100 or more, for example 125, 150, 175 or 200 or more, contiguous amino acids (“hard homology”).
- Standard methods in the art may be used to determine homology.
- the UWGCG Package provides the BESTFIT program which can be used to calculate homology, for example used on its default settings (Devereux et al (1984) Nucleic Acids Research 12, p 387-395).
- the PILEUP and BLAST algorithms can be used to calculate homology or line up sequences (such as identifying equivalent residues or corresponding sequences (typically on their default settings)), for example as described in Altschul S. F. (1993) J Mol Evol 36:290-300; Altschul, S. F et al (1990) J Mol Biol 215:403-10.
- Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/).
- SEQ ID NO: 2 is the MS-(B1)8 mutant of the MspA monomer.
- the variant may comprise any of the mutations in the MspB, C or D monomers compared with MspA.
- the mature forms of MspB, C and D are shown in SEQ ID NOs: 5 to 7.
- the variant may comprise the following substitution present in MspB: A138P.
- the variant may comprise one or more of the following substitutions present in MspC: A96G, N102E and A138P.
- the variant may comprise one or more of the following mutations present in MspD: Deletion of G1, L2V, E5Q, L8V, D13G, W21A, D22E, K47T, I49H, I68V, D91G, A96Q, N102D, S103T, V104I, S136K and G141A.
- the variant may comprise combinations of one or more of the mutations and substitutions from Msp B, C and D.
- the variant preferably comprises the mutation L88N.
- a variant of SEQ ID NO: 2 has the mutation L88N in addition to all the mutations of MS-(B1)8 and is called MS-(B2)8.
- the pore used in the invention is preferably MS-(B2)8.
- the further preferred variant comprises the mutations G75S/G77S/L88N/Q126R.
- the variant of SEQ ID NO: 2 has the mutations G75S/G77S/L88N/Q126R in addition to all the mutations of MS-(B1)8 and is called MS-(B2C)8.
- the pore used in the invention is preferably MS-(B2)8 or MS-(B2C)8.
- Amino acid substitutions may be made to the amino acid sequence of SEQ ID NO: 2 in addition to those discussed above, for example up to 1, 2, 3, 4, 5, 10, 20 or 30 substitutions.
- Conservative substitutions replace amino acids with other amino acids of similar chemical structure, similar chemical properties or similar side-chain volume.
- the amino acids introduced may have similar polarity, hydrophilicity, hydrophobicity, basicity, acidity, neutrality or charge to the amino acids they replace.
- the conservative substitution may introduce another amino acid that is aromatic or aliphatic in the place of a pre-existing aromatic or aliphatic amino acid.
- Conservative amino acid changes are well-known in the art and may be selected in accordance with the properties of the 20 main amino acids as defined in Table 2 below. Where amino acids have similar polarity, this can also be determined by reference to the hydropathy scale for amino acid side chains in Table 3.
- One or more amino acid residues of the amino acid sequence of SEQ ID NO: 2 may additionally be deleted from the polypeptides described above. Up to 1, 2, 3, 4, 5, 10, 20 or 30 residues may be deleted, or more.
- Variants may include fragments of SEQ ID NO: 2. Such fragments retain pore forming activity. Fragments may be at least 50, 100, 150 or 200 amino acids in length. Such fragments may be used to produce the pores. A fragment preferably comprises the pore forming domain of SEQ ID NO: 2. Fragments must include one of residues 88, 90, 91, 105, 118 and 134 of SEQ ID NO: 2. Typically, fragments include all of residues 88, 90, 91, 105, 118 and 134 of SEQ ID NO: 2.
- One or more amino acids may be alternatively or additionally added to the polypeptides described above.
- An extension may be provided at the amino terminal or carboxy terminal of the amino acid sequence of SEQ ID NO: 2 or polypeptide variant or fragment thereof.
- the extension may be quite short, for example from 1 to 10 amino acids in length. Alternatively, the extension may be longer, for example up to 50 or 100 amino acids.
- a carrier protein may be fused to an amino acid sequence according to the invention. Other fusion proteins are discussed in more detail below.
- a variant is a polypeptide that has an amino acid sequence which varies from that of SEQ ID NO: 2 and which retains its ability to form a pore.
- a variant typically contains the regions of SEQ ID NO: 2 that are responsible for pore formation. The pore forming ability of Msp, which contains a ⁇ -barrel, is provided by ⁇ -sheets in each subunit.
- a variant of SEQ ID NO: 2 typically comprises the regions in SEQ ID NO: 2 that form ⁇ -sheets.
- One or more modifications can be made to the regions of SEQ ID NO: 2 that form ⁇ -sheets as long as the resulting variant retains its ability to form a pore.
- a variant of SEQ ID NO: 2 preferably includes one or more modifications, such as substitutions, additions or deletions, within its ⁇ -helices and/or loop regions.
- the monomers derived from Msp may be modified to assist their identification or purification, for example by the addition of histidine residues (a hist tag), aspartic acid residues (an asp tag), a streptavidin tag or a flag tag, or by the addition of a signal sequence to promote their secretion from a cell where the polypeptide does not naturally contain such a sequence.
- An alternative to introducing a genetic tag is to chemically react a tag onto a native or engineered position on the pore. An example of this would be to react a gel-shift reagent to a cysteine engineered on the outside of the pore. This has been demonstrated as a method for separating hemolysin hetero-oligomers (Chem Biol. 1997 Jul.; 4(7):497-505).
- the monomer derived from Msp may be labelled with a revealing label.
- the revealing label may be any suitable label which allows the pore to be detected. Suitable labels are described below.
- the monomer derived from Msp may also be produced using D-amino acids.
- the monomer derived from Msp may comprise a mixture of L-amino acids and D-amino acids. This is conventional in the art for producing such proteins or peptides.
- the monomer derived from Msp contains one or more specific modifications to facilitate nucleotide discrimination.
- the monomer derived from Msp may also contain other non-specific modifications as long as they do not interfere with pore formation.
- a number of non-specific side chain modifications are known in the art and may be made to the side chains of the monomer derived from Msp. Such modifications include, for example, reductive alkylation of amino acids by reaction with an aldehyde followed by reduction with NaBIE, amidination with methylacetimidate or acylation with acetic anhydride.
- the monomer derived from Msp can be produced using standard methods known in the art.
- the monomer derived from Msp may be made synthetically or by recombinant means.
- the pore may be synthesized by in vitro translation and transcription (IVTT). Suitable methods for producing pores are discussed in International Application Nos. PCT/GB09/001690 (published as WO 2010/004273), PCT/GB09/001679 (published as WO 2010/004265) or PCT/GB10/000133 (published as WO 2010/086603). Methods for inserting pores into membranes are discussed.
- the transmembrane protein pore is also preferably derived from ⁇ -hemolysin ( ⁇ -HL).
- ⁇ -HL ⁇ -hemolysin
- the wild type ⁇ -HL pore is formed of seven identical monomers or subunits (i.e. it is heptameric).
- the sequence of one monomer or subunit of ⁇ -hemolysin-NN is shown in SEQ ID NO: 4.
- the transmembrane protein pore preferably comprises seven monomers each comprising the sequence shown in SEQ ID NO: 4 or a variant thereof.
- Residues 113 and 147 of SEQ ID NO: 4 form part of a constriction of the barrel or channel of ⁇ -HL.
- a pore comprising seven proteins or monomers each comprising the sequence shown in SEQ ID NO: 4 or a variant thereof are preferably used in the method of the invention.
- the seven proteins may be the same (homo-heptamer) or different (hetero-heptamer).
- a variant of SEQ ID NO: 4 is a protein that has an amino acid sequence which varies from that of SEQ ID NO: 4 and which retains its pore forming ability.
- the ability of a variant to form a pore can be assayed using any method known in the art.
- the variant may be inserted into an amphiphilic layer, such as a lipid bilayer, along with other appropriate subunits and its ability to oligomerise to form a pore may be determined. Methods are known in the art for inserting subunits into amphiphilic layers, such as lipid bilayers. Suitable methods are discussed above.
- the variant may include modifications that facilitate covalent attachment to or interaction with the helicase or construct.
- the variant preferably comprises one or more reactive cysteine residues that facilitate attachment to the helicase or construct.
- the variant may include a cysteine at one or more of positions 8, 9, 17, 18, 19, 44, 45, 50, 51, 237, 239 and 287 and/or on the amino or carboxy terminus of SEQ ID NO: 4.
- Preferred variants comprise a substitution of the residue at position 8, 9, 17, 237, 239 and 287 of SEQ ID NO: 4 with cysteine (ABC, T9C, N17C, K237C, S239C or E287C).
- the variant is preferably any one of the variants described in International Application No. PCT/GB09/001690 (published as WO 2010/004273), PCT/GB09/001679 (published as WO 2010/004265) or PCT/GB10/000133 (published as WO 2010/086603).
- the variant may also include modifications that facilitate any interaction with nucleotides.
- the variant may be a naturally occurring variant which is expressed naturally by an organism, for instance by a Staphylococcus bacterium.
- the variant may be expressed in vitro or recombinantly by a bacterium such as Escherichia coli .
- Variants also include non-naturally occurring variants produced by recombinant technology. Over the entire length of the amino acid sequence of SEQ ID NO: 4, a variant will preferably be at least 50% homologous to that sequence based on amino acid identity.
- the variant polypeptide may be at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90% and more preferably at least 95%, 97% or 99% homologous based on amino acid identity to the amino acid sequence of SEQ ID NO: 4 over the entire sequence.
- Amino acid substitutions may be made to the amino acid sequence of SEQ ID NO: 4 in addition to those discussed above, for example up to 1, 2, 3, 4, 5, 10, 20 or 30 substitutions. Conservative substitutions may be made as discussed above.
- One or more amino acid residues of the amino acid sequence of SEQ ID NO: 4 may additionally be deleted from the polypeptides described above. Up to 1, 2, 3, 4, 5, 10, 20 or 30 residues may be deleted, or more.
- Variants may be fragments of SEQ ID NO: 4. Such fragments retain pore-forming activity. Fragments may be at least 50, 100, 200 or 250 amino acids in length. A fragment preferably comprises the pore-forming domain of SEQ ID NO: 4. Fragments typically include residues 119, 121, 135. 113 and 139 of SEQ ID NO: 4.
- One or more amino acids may be alternatively or additionally added to the polypeptides described above.
- An extension may be provided at the amino terminus or carboxy terminus of the amino acid sequence of SEQ ID NO: 4 or a variant or fragment thereof.
- the extension may be quite short, for example from 1 to 10 amino acids in length. Alternatively, the extension may be longer, for example up to 50 or 100 amino acids.
- a carrier protein may be fused to a pore or variant.
- a variant of SEQ ID NO: 4 is a subunit that has an amino acid sequence which varies from that of SEQ ID NO: 4 and which retains its ability to form a pore.
- a variant typically contains the regions of SEQ ID NO: 4 that are responsible for pore formation.
- the pore forming ability of ⁇ -HL, which contains a ⁇ -barrel, is provided by ⁇ -strands in each subunit.
- a variant of SEQ ID NO: 4 typically comprises the regions in SEQ ID NO: 4 that form ⁇ -strands.
- the amino acids of SEQ ID NO: 4 that form ⁇ -strands are discussed above.
- One or more modifications can be made to the regions of SEQ ID NO: 4 that form ⁇ -strands as long as the resulting variant retains its ability to form a pore. Specific modifications that can be made to the ⁇ -strand regions of SEQ ID NO: 4 are discussed above.
- a variant of SEQ ID NO: 4 preferably includes one or more modifications, such as substitutions, additions or deletions, within its ⁇ -helices and/or loop regions. Amino acids that form ⁇ -helices and loops are discussed above.
- the variant may be modified to assist its identification or purification as discussed above.
- Pores derived from ⁇ -HL can be made as discussed above with reference to pores derived from Msp.
- the transmembrane protein pore is chemically modified.
- the pore can be chemically modified in any way and at any site.
- the transmembrane protein pore is preferably chemically modified by attachment of a molecule to one or more cysteines (cysteine linkage), attachment of a molecule to one or more lysines, attachment of a molecule to one or more non-natural amino acids, enzyme modification of an epitope or modification of a terminus. Suitable methods for carrying out such modifications are well-known in the art.
- the transmembrane protein pore may be chemically modified by the attachment of any molecule. For instance, the pore may be chemically modified by attachment of a dye or a fluorophore.
- any number of the monomers in the pore may be chemically modified.
- One or more, such as 2, 3, 4, 5, 6, 7, 8, 9 or 10, of the monomers is preferably chemically modified as discussed above.
- cysteine residues may be enhanced by modification of the adjacent residues. For instance, the basic groups of flanking arginine, histidine or lysine residues will change the pKa of the cysteines thiol group to that of the more reactive S ⁇ group.
- the reactivity of cysteine residues may be protected by thiol protective groups such as dTNB. These may be reacted with one or more cysteine residues of the pore before a linker is attached.
- the molecule (with which the pore is chemically modified) may be attached directly to the pore or attached via a linker as disclosed in International Application Nos. PCT/GB09/001690 (published as WO 2010/004273), PCT/GB09/001679 (published as WO 2010/004265) or PCT/GB10/000133 (published as WO 2010/086603).
- Step (b) preferably comprises contacting the complementary polynucleotide with a polynucleotide binding protein such that the protein controls the movement of the complementary polynucleotide through the pore.
- a polynucleotide binding protein may be used.
- the polynucleotide binding protein is preferably a polynucleotide handling enzyme.
- a polynucleotide handling enzyme is a polypeptide that is capable of interacting with and modifying at least one property of a polynucleotide. The enzyme may modify the polynucleotide by cleaving it to form individual nucleotides or shorter chains of nucleotides, such as di- or trinucleotides.
- the enzyme may modify the polynucleotide by orienting it or moving it to a specific position.
- the polynucleotide handling enzyme does not need to display enzymatic activity as long as it is capable of binding the target sequence and controlling its movement through the pore.
- the enzyme may be modified to remove its enzymatic activity or may be used under conditions which prevent it from acting as an enzyme. Such conditions are discussed in more detail below.
- the polynucleotide handling enzyme is preferably derived from a nucleolytic enzyme.
- the polynucleotide handling enzyme used in the construct of the enzyme is more preferably derived from a member of any of the Enzyme Classification (EC) groups 3.1.11, 3.1.13, 3.1.14, 3.1.15, 3.1.16, 3.1.21, 3.1.22, 3.1.25, 3.1.26, 3.1.27, 3.1.30 and 3.1.31.
- Preferred enzymes are polymerases, exonucleases, helicases and topoisomerases, such as gyrases.
- the enzyme may be any of those disclosed in International Application No. PCT/GB10/000133 (published as WO 2010/086603).
- the helicase may a Hel308 helicase, a RecD helicase, such as TraI helicase or a TrwC helicase, a XPD helicase or a Dda helicase.
- the helicase may be any of the helicases, modified helicases or helicase constructs disclosed in International Application Nos.
- PCT/GB2012/052579 (published as WO 2013/057495); PCT/GB2012/053274 (published as WO 2013/098562); PCT/GB2012/053273 (published as WO2013098561); PCT/GB2013/051925; PCT/GB2013/051924; PCT/GB2013/051928; and the UK Application being filed concurrently with this application (ONT IP 049).
- the method involves contacting the complementary polynucleotide with a helicase such that the helicase controls the movement of the complementary polynucleotide through the pore.
- a helicase Any helicase may be used in the method. Helicases may work in two modes with respect to the pore. First, the method is preferably carried out using a helicase such that it controls movement of the polynucleotide through the pore with the field resulting from the applied voltage.
- the 5′ end of the polynucleotide is first captured in the pore, and the enzyme controls movement of the polynucleotide into the pore such that the polynucleotide is passed through the pore with the field until it finally translocates through to the trans side of the bilayer.
- the method is preferably carried out such that a helicase enzyme controls movement of the polynucleotide through the pore against the field resulting from the applied voltage.
- the 3′ end of the polynucleotide is first captured in the pore, and the enzyme controls movement of the polynucleotide through the pore such that the polynucleotide is pulled out of the pore against the applied field until finally ejected back to the cis side of the bilayer.
- the polynucleotide binding protein may be covalently attached to the pore.
- the polynucleotide binding protein is preferably not covalently attached to the pore.
- the application of a voltage to the pore and helicase or construct may result in the formation of a sensor that is capable of characterising the complementary polynucleotide. This is discussed in more detail below.
- any of the proteins described herein may be modified to assist their identification or purification, for example by the addition of histidine residues (a his tag), aspartic acid residues (an asp tag), a streptavidin tag, a flag tag, a SUMO tag, a GST tag or a MBP tag, or by the addition of a signal sequence to promote their secretion from a cell where the polypeptide does not naturally contain such a sequence.
- An alternative to introducing a genetic tag is to chemically react a tag onto a native or engineered position on the helicase, pore or construct. An example of this would be to react a gel-shift reagent to a cysteine engineered on the outside of the pore. This has been demonstrated as a method for separating hemolysin hetero-oligomers (Chem Biol. 1997 Jul.; 4 (7):497-505).
- the target RNA, complementary polynucleotide, polynucleotide binding protein or pore may be labelled with a revealing label.
- the revealing label may be any suitable label which can be detected. Suitable labels include, but are not limited to, fluorescent molecules, radioisotopes, e.g. 125 I, 35 S, enzymes, antibodies, antigens, polynucleotides and ligands such as biotin.
- Proteins may be made synthetically or by recombinant means.
- proteins may be synthesized by in vitro translation and transcription (IVTT).
- IVTT in vitro translation and transcription
- the amino acid sequence of the protein may be modified to include non-naturally occurring amino acids or to increase the stability of the protein.
- When a protein is produced by synthetic means, such amino acids may be introduced during production. Proteins may also be altered following either synthetic or recombinant production.
- Proteins may also be produced using D-amino acids.
- the pore or polynucleotide binding protein may comprise a mixture of L-amino acids and D-amino acids. This is conventional in the art for producing such proteins or peptides.
- the proteins used in the invention may also contain other non-specific modifications as long as they do not interfere with the proteins' function.
- a number of non-specific side chain modifications are known in the art and may be made to the side chains of the protein(s). Such modifications include, for example, reductive alkylation of amino acids by reaction with an aldehyde followed by reduction with NaBH 4 , amidination with methylacetimidate or acylation with acetic anhydride.
- Polynucleotide sequences encoding a protein may be derived and replicated using standard methods in the art. Polynucleotide sequences encoding a protein may be expressed in a bacterial host cell using standard techniques in the art. The protein may be produced in a cell by in situ expression of the polypeptide from a recombinant expression vector. The expression vector optionally carries an inducible promoter to control the expression of the polypeptide. These methods are described in Sambrook, J. and Russell, D. (2001). Molecular Cloning: A Laboratory Manual, 3rd Edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
- the gene encoding the sequence of interest may be amplified using PCR involving specific primers.
- the amplified sequences may then be incorporated into a recombinant replicable vector such as a cloning vector.
- the vector may be used to replicate the polynucleotide in a compatible host cell.
- polynucleotide sequences may be made by introducing a polynucleotide encoding the sequence of interest into a replicable vector, introducing the vector into a compatible host cell, and growing the host cell under conditions which bring about replication of the vector.
- the vector may be recovered from the host cell. Suitable host cells for cloning of polynucleotides are known in the art and described in more detail below.
- the polynucleotide sequence may be cloned into a suitable expression vector.
- the polynucleotide sequence is typically operably linked to a control sequence which is capable of providing for the expression of the coding sequence by the host cell.
- Such expression vectors can be used to express a construct.
- operably linked refers to a juxtaposition wherein the components described are in a relationship permitting them to function in their intended manner.
- a control sequence “operably linked” to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences. Multiple copies of the same or different polynucleotide may be introduced into the vector.
- the expression vector may then be introduced into a suitable host cell.
- a construct can be produced by inserting a polynucleotide sequence encoding a construct into an expression vector, introducing the vector into a compatible bacterial host cell, and growing the host cell under conditions which bring about expression of the polynucleotide sequence.
- the vectors may be for example, plasmid, virus or phage vectors provided with an origin of replication, optionally a promoter for the expression of the said polynucleotide sequence and optionally a regulator of the promoter.
- the vectors may contain one or more selectable marker genes, for example an ampicillin resistance gene. Promoters and other expression regulation signals may be selected to be compatible with the host cell for which the expression vector is designed. A T7, trc, lac, ara or ⁇ L promoter is typically used.
- the host cell typically expresses the construct at a high level. Host cells transformed with a polynucleotide sequence will be chosen to be compatible with the expression vector used to transform the cell.
- the host cell is typically bacterial and preferably E. coli . Any cell with a ⁇ DE3 lysogen, for example Rosetta2(DE3)pLys, C41 (DE3), BL21 (DE3), JM109 (DE3), B834 (DE3), TUNER, Origami and Origami B, can express a vector comprising the T7 promoter.
- Proteins may be produced in large scale following purification by any protein liquid chromatography system from protein producing organisms or after recombinant expression.
- Typical protein liquid chromatography systems include FPLC, AKTA systems, the Bio-Cad system, the Bio-Rad BioLogic system and the Gilson HPLC system.
- the method of the invention involves measuring one or more characteristics of the target RNA.
- the method may involve measuring two, three, four or five or more characteristics of the target RNA.
- the one or more characteristics are preferably selected from (i) the length of the target RNA, (ii) the identity of the target RNA, (iii) the sequence of the target RNA, and (iv) the amount of the target RNA. Any combination of (i) to (iv) may be measured in accordance with the invention.
- the length of the RNA may be measured for example by forming a complementary polynucleotide of the same length and determining the number of interactions between the complementary polynucleotide and the pore or the duration of interaction between the complementary polynucleotide and the pore.
- the identity of the RNA may be measured in a number of ways.
- the identity of the RNA may be measured in conjunction with measurement of the sequence of the complementary polynucleotide or without measurement of the sequence of the complementary polynucleotide.
- the former is straightforward; the complementary polynucleotide is sequenced and the sequence of the target RNA is thereby identified (since it is complementary).
- the latter may be done in several ways. For instance, the presence of a particular motif in the complementary polynucleotide may be measured (without measuring the remaining sequence of the polynucleotide).
- the measurement of a particular electrical and/or optical signal in the method may identify the complementary polynucleotide and thereby identify the target RNA.
- the sequence of the complementary polynucleotide and hence the sequence of the target RNA can be determined as described previously. Suitable sequencing methods, particularly those using electrical measurements, are described in Stoddart D et al., Proc Natl Acad Sci, 12; 106(19):7702-7, Lieberman K R et al, J Am Chem Soc. 2010; 132(50):17961-72, and International Application WO 2000/28312.
- the amount of the target RNA may be measured in a variety of ways. For instance, since the target RNA is typically not amplified in the method of the invention, the amount of the target RNA may be measured by counting the number of complementary polynucleotides which interact with the transmembrane pore. The number of complementary polynucleotides (i.e. the number of instances of the complementary polynucleotide) typically corresponds to the number of the target RNA molecules (i.e. the number of instances of the target RNA).
- a variety of different types of measurements may be made. This includes without limitation: electrical measurements and optical measurements. Possible electrical measurements include: current measurements, impedance measurements, tunnelling measurements (Ivanov A P et al., Nano Lett. 2011 Jan. 12; 11(1):279-85), and FET measurements (International Application WO 2005/124888). Optical measurements may be combined with electrical measurements (Soni G V et al., Rev Sci Instrum. 2010 January; 81(1):014301). The measurement may be a transmembrane current measurement such as measurement of ionic current flowing through the pore.
- Electrical measurements may be made using standard single channel recording equipment as describe in Stoddart D et al., Proc Natl Acad Sci, 12; 106(19):7702-7, Lieberman K R et al, J Am Chem Soc. 2010; 132(50):17961-72, and International Application WO-2000/28312.
- electrical measurements may be made using a multi-channel system, for example as described in International Application WO-2009/077734 and International Application WO-2011/067559.
- the method comprises:
- the methods may be carried out using any apparatus that is suitable for investigating a membrane/pore system in which a pore is present in a membrane.
- the method may be carried out using any apparatus that is suitable for transmembrane pore sensing.
- the apparatus comprises a chamber comprising an aqueous solution and a barrier that separates the chamber into two sections.
- the barrier typically has an aperture in which the membrane containing the pore is formed.
- the barrier forms the membrane in which the pore is present.
- the methods may be carried out using the apparatus described in International Application No. PCT/GB08/000562 (WO 2008/102120).
- the methods may involve measuring the current passing through the pore as the polynucleotide moves with respect to the pore. Therefore the apparatus may also comprise an electrical circuit capable of applying a potential and measuring an electrical signal across the membrane and pore.
- the methods may be carried out using a patch clamp or a voltage clamp.
- the methods preferably involve the use of a voltage clamp.
- the methods of the invention may involve the measuring of a current passing through the pore as the polynucleotide moves with respect to the pore. Suitable conditions for measuring ionic currents through transmembrane protein pores are known in the art and disclosed in the Examples.
- the method is typically carried out with a voltage applied across the membrane and pore.
- the voltage used is typically from +2 V to ⁇ 2 V, typically ⁇ 400 mV to +400 mV.
- the voltage used is preferably in a range having a lower limit selected from ⁇ 400 mV, ⁇ 300 mV, ⁇ 200 mV, ⁇ 150 mV, ⁇ 100 mV, ⁇ 50 mV, ⁇ 20 mV and 0 mV and an upper limit independently selected from +10 mV, +20 mV, +50 mV, +100 mV, +150 mV, +200 mV, +300 mV and +400 mV.
- the voltage used is more preferably in the range 100 mV to 240 mV and most preferably in the range of 120 mV to 220 mV. It is possible to increase discrimination between different nucleotides by a pore by using an increased applied potential.
- the methods are typically carried out in the presence of any charge carriers, such as metal salts, for example alkali metal salt, halide salts, for example chloride salts, such as alkali metal chloride salt.
- Charge carriers may include ionic liquids or organic salts, for example tetramethyl ammonium chloride, trimethylphenyl ammonium chloride, phenyltrimethyl ammonium chloride, or 1-ethyl-3-methyl imidazolium chloride.
- the salt is present in the aqueous solution in the chamber. Potassium chloride (KCl), sodium chloride (NaCl), caesium chloride (CsCl) or a mixture of potassium ferrocyanide and potassium ferricyanide is typically used.
- the salt concentration may be at saturation.
- the salt concentration may be 3 M or lower and is typically from 0.1 to 2.5 M, from 0.3 to 1.9 M, from 0.5 to 1.8 M, from 0.7 to 1.7 M, from 0.9 to 1.6 M or from 1 M to 1.4 M.
- the salt concentration is preferably from 150 mM to 1 M. IIe1308, XPD, RecD and TraI helicases surprisingly work under high salt concentrations.
- the method is preferably carried out using a salt concentration of at least 0.3 M, such as at least 0.4 M, at least 0.5 M, at least 0.6 M, at least 0.8 M, at least 1.0 M, at least 1.5 M, at least 2.0 M, at least 2.5 M or at least 3.0 M.
- High salt concentrations provide a high signal to noise ratio and allow for currents indicative of the presence of a nucleotide to be identified against the background of normal current fluctuations.
- the methods are typically carried out in the presence of a buffer.
- the buffer is present in the aqueous solution in the chamber. Any buffer may be used in the method of the invention.
- the buffer is HEPES.
- Another suitable buffer is Tris-HCl buffer.
- the methods are typically carried out at a pH of from 4.0 to 12.0, from 4.5 to 10.0, from 5.0 to 9.0, from 5.5 to 8.8, from 6.0 to 8.7 or from 7.0 to 8.8 or 7.5 to 8.5.
- the pH used is preferably about 7.5.
- the methods may be carried out at from 0° C. to 100° C., from 15° C. to 95° C., from 16° C. to 90° C., from 17° C. to 85° C., from 18° C. to 80° C., 19° C. to 70° C., or from 20° C. to 60° C.
- the methods are typically carried out at room temperature.
- the methods are optionally carried out at a temperature that supports enzyme function, such as about 37° C.
- the method may be carried out in the presence of free nucleotides or free nucleotide analogues and/or an enzyme cofactor that facilitates the action of the helicase or construct.
- the method may also be carried out in the absence of free nucleotides or free nucleotide analogues and in the absence of an enzyme cofactor.
- the free nucleotides may be one or more of any of the individual nucleotides discussed above.
- the free nucleotides include, but are not limited to, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), guanosine monophosphate (GMP), guanosine diphosphate (GDP), guanosine triphosphate (GTP), thymidine monophosphate (TMP), thymidine diphosphate (TDP), thymidine triphosphate (TTP), uridine monophosphate (UMP), uridine diphosphate (UDP), uridine triphosphate (UTP), cytidine monophosphate (CMP), cytidine diphosphate (CDP), cytidine triphosphate (CTP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyadenosine diphosphate (dADP), deoxyaden
- the free nucleotides are preferably selected from AMP, TMP, GMP, CMP, UMP, dAMP, dIMP, dGMP or dCMP.
- the free nucleotides are preferably adenosine triphosphate (ATP).
- the enzyme cofactor is a factor that allows the helicase or construct to function.
- the enzyme cofactor is preferably a divalent metal cation.
- the divalent metal cation is preferably Mg 2+ , Mn 2+ , Ca 2+ or Co 2+ .
- the enzyme cofactor is most preferably Mg 2+ .
- the method comprises:
- the one or more helicases may be any of those discussed above.
- the one or more molecular brakes may be any compound or molecule which binds to the polynucleotide and slows the movement of the polynucleotide through the pore.
- the one or more molecular brakes preferably comprise one or more compounds which bind to the polynucleotide.
- the one or more compounds are preferably one or more macrocycles. Suitable macrocycles include, but are not limited to, cyclodextrins, calixarenes, cyclic peptides, crown ethers, cucurbiturils, pillararenes, derivatives thereof or a combination thereof.
- the cyclodextrin or derivative thereof may be any of those disclosed in Eliseev, A.
- the agent is more preferably heptakis-6-amino- ⁇ -cyclodextrin (am 7 - ⁇ CD), 6-monodeoxy-6-monoamino- ⁇ -cyclodextrin (am 1 - ⁇ CD) or heptakis-(6-deoxy-6-guanidino)-cyclodextrin (gu 7 - ⁇ CD).
- the one or more molecular brakes are preferably not one or more single stranded binding proteins (SSB).
- the one or more molecular brakes are more preferably not a single-stranded binding protein (SSB) comprising a carboxy-terminal (C-terminal) region which does not have a net negative charge or (ii) a modified SSB comprising one or more modifications in its C-terminal region which decreases the net negative charge of the C-terminal region.
- the one or more molecular brakes are most preferably not any of the SSBs disclosed in International Application No. PCT/GB2013/051924 (published as WO 2014/013259).
- the one or more molecular brakes are preferably one or more polynucleotide binding proteins.
- the polynucleotide binding protein may be any protein that is capable of binding to the polynucleotide and controlling its movement through the pore. It is straightforward in the art to determine whether or not a protein binds to a polynucleotide.
- the protein typically interacts with and modifies at least one property of the polynucleotide.
- the protein may modify the polynucleotide by cleaving it to form individual nucleotides or shorter chains of nucleotides, such as di- or trinucleotides.
- the moiety may modify the polynucleotide by orienting it or moving it to a specific position, i.e. controlling its movement.
- the polynucleotide binding protein is preferably derived from a polynucleotide handling enzyme.
- the one or more molecular brakes may be derived from any of the polynucleotide handling enzymes discussed above. Modified versions of Phi29 polymerase (SEQ ID NO: 8) which act as molecular brakes are disclosed in U.S. Pat. No. 5,576,204.
- the one or more molecular brakes are preferably derived from a helicase.
- any number of molecular brakes derived from a helicase may be used. For instance, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more helicases may be used as molecular brakes. If two or more helicases are be used as molecular brakes, the two or more helicases are typically the same helicase. The two or more helicases may be different helicases.
- the two or more helicases may be any combination of the helicases mentioned above.
- the two or more helicases may be two or more Dda helicases.
- the two or more helicases may be one or more Dda helicases and one or more TrwC helicases.
- the two or more helicases may be different variants of the same helicase.
- the two or more helicases are preferably attached to one another.
- the two or more helicases are more preferably covalently attached to one another.
- the helicases may be attached in any order and using any method.
- the one or more molecular brakes derived from helicases are preferably modified to reduce the size of an opening in the polynucleotide binding domain through which in at least one conformational state the polynucleotide can unbind from the helicase. This is disclosed in WO 2014/013260.
- the one or more molecular brakes are preferably (a) used in an inactive mode (i.e. are used in the absence of the necessary components to facilitate movement or are incapable of active movement), (b) used in an active mode where the one or more molecular brakes move in the opposite direction to the one or more helicases or (c) used in an active mode where the one or more molecular brakes move in the same direction as the one or more helicases and more slowly than the one or more helicases.
- the one or more molecular brakes are preferably (a) used in an inactive mode (i.e. are used in the absence of the necessary components to facilitate movement or are incapable of active movement) or (b) used in an active mode where the one or more molecular brakes move along the polynucleotide in the same direction as the polynucleotide through the pore.
- the one or more helicases and one or more molecular brakes may be attached to the polynucleotide at any positions so that they are brought together and both control the movement of the polynucleotide through the pore.
- the one or more helicases and one or more molecular brakes are at least one nucleotide apart, such as at least 5, at least 10, at least 50, at least 100, at least 500, at least 1000, at least 5000, at least 10,000, at least 50,000 nucleotides or more apart.
- the one or more helicases are preferably attached to the Y adaptor and the one or more molecular brakes are preferably attached to the hairpin loop adaptor.
- the one or more molecular brakes are preferably one or more helicases that are modified such that they bind the polynucleotide but do not function as a helicase.
- the one or more helicases attached to the Y adaptor are preferably stalled at a spacer as discussed in more detail below.
- the one or more molecular brakes attach to the hairpin loop adaptor are preferably not stalled at a spacer.
- the one or more helicases and the one or more molecular brakes are preferably brought together when the one or more helicases reach the hairpin loop.
- the one or more helicases may be attached to the Y adaptor before the Y adaptor is attached to the polynucleotide or after the Y adaptor is attached to the polynucleotide.
- the one or more molecular brakes may be attached to the hairpin loop adaptor before the hairpin loop adaptor is attached to the polynucleotide or after the hairpin loop adaptor is attached to the polynucleotide.
- the one or more helicases and the one or more molecular brakes are preferably not attached to one another.
- the one or more helicases and the one or more molecular brakes are more preferably not covalently attached to one another.
- the one or more helicases and the one or more molecular brakes are preferably not attached as described in International Application Nos. PCT/GB2013/051925 (published as WO 2014/013260); PCT/GB2013/051924 (published as WO 2014/013259) PCT/GB2013/051928 (published as WO 2014/013262) and PCT/GB2014/052736.
- the one or more helicases may be stalled at one or more spacers as discussed in International Application No. PCT/GB2014/050175. Any configuration of one or more helicases and one or more spacers disclosed in the International Application may be used in this invention.
- the one or more helicases are moved past the spacer by the pore as the polynucleotide moves through the pore. This is because the complementary polynucleotide (including the one or more spacers) moves through the pore and the one or more helicases remain on top of the pore.
- the one or more spacers are preferably part of the complementary polynucleotide, for instance they interrupt(s) the polynucleotide sequence.
- the one or more spacers are preferably not part of one or more blocking molecules, such as speed bumps, hybridised to the polynucleotide.
- spacers in the complementary polynucleotide there may be any number of spacers in the complementary polynucleotide, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more spacers. There are preferably two, four or six spacers in the polynucleotide. There may be one or more spacers in different regions of the polynucleotide, such as one or more spacers in the Y adaptor and/or hairpin loop adaptor.
- the one or more spacers each provides an energy barrier which the one or more helicases cannot overcome even in the active mode.
- the one or more spacers may stall the one or more helicases by reducing the traction of the helicase (for instance by removing the bases from the nucleotides in the polynucleotide) or physically blocking movement of the one or more helicases (for instance using a bulky chemical group).
- the one or more spacers may comprise any molecule or combination of molecules that stalls the one or more helicases.
- the one or more spacers may comprise any molecule or combination of molecules that prevents the one or more helicases from moving along the polynucleotide. It is straightforward to determine whether or not the one or more helicases are stalled at one or more spacers in the absence of a transmembrane pore and an applied potential. For instance, the ability of a helicase to move past a spacer and displace a complementary strand of DNA can be measured by PAGE.
- the one or more spacers typically comprise a linear molecule, such as a polymer.
- the one or more spacers typically have a different structure from the polynucleotide.
- the polynucleotide is DNA
- the one or more spacers are typically not DNA.
- the one or more spacers preferably comprise peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA) or a synthetic polymer with nucleotide side chains.
- the one or more spacers may comprise one or more nucleotides in the opposite direction from the polynucleotide.
- the one or more spacers may comprise one or more nucleotides in the 3′ to 5′ direction when the polynucleotide is in the 5′ to 3′ direction.
- the nucleotides may be any of those discussed above.
- the one or more spacers preferably comprises one or more nitroindoles, such as one or more 5-nitroindoles, one or more inosines, one or more acridines, one or more 2-aminopurines, one or more 2-6-diaminopurines, one or more 5-bromo-deoxyuridines, one or more inverted thymidines (inverted dTs), one or more inverted dideoxy-thymidines (ddTs), one or more dideoxy-cytidines (ddCs), one or more 5-methylcytidines, one or more 5-hydroxymethylcytidines, one or more 2′-O-Methyl RNA bases, one or more Iso-deoxycytidines (Iso-dCs), one or more Iso-deoxyguanosines (Iso-dGs), one or more iSpC3 groups (i.e.
- nucleotides which lack sugar and a base one or more photo-cleavable (PC) groups, one or more hexandiol groups, one or more spacer 9 (iSp9) groups, one or more spacer 18 (iSp18) groups, a polymer or one or more thiol connections.
- the one or more spacers may comprise any combination of these groups. Many of these groups are commercially available from IDT® (Integrated DNA Technologies®).
- the one or more spacers may contain any number of these groups. For instance, for 2-aminopurines, 2-6-diaminopurines, 5-bromo-deoxyuridines, inverted dTs, ddTs, ddCs, 5-methylcytidines, 5-hydroxymethylcytidines, 2′-O-Methyl RNA bases, Iso-dCs, Iso-dGs, iSpC3 groups, PC groups, hexandiol groups and thiol connections, the one or more spacers preferably comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more. The one or more spacers preferably comprise 2, 3, 4, 5, 6, 7, 8 or more iSp9 groups. The one or more spacers preferably comprise 2, 3, 4, 5 or 6 or more iSp18 groups. The most preferred spacer is four iSp18 groups.
- the polymer is preferably a polypeptide or a polyethylene glycol (PEG).
- the polypeptide preferably comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more amino acids.
- the PEG preferably comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more monomer units.
- the one or more spacers preferably comprise one or more abasic nucleotides (i.e. nucleotides lacking a nucleobase), such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more abasic nucleotides.
- the nucleobase can be replaced by —H (idSp) or —OH in the abasic nucleotide.
- Abasic spacers can be inserted into polynucleotides by removing the nucleobases from one or more adjacent nucleotides.
- polynucleotides may be modified to include 3-methyladenine, 7-methylguanine, 1,N6-ethenoadenine inosine or hypoxanthine and the nucleobases may be removed from these nucleotides using Human Alkyladenine DNA Glycosylase (hAAG).
- polynucleotides may be modified to include uracil and the nucleobases removed with Uracil-DNA Glycosylase (UDG).
- the one or more spacers do not comprise any abasic nucleotides.
- the one or more helicases may be stalled by (i.e. before) or on each linear molecule spacers. If linear molecule spacers are used, the polynucleotide is preferably provided with a double stranded region of polynucleotide adjacent to the end of each spacer past which the one or more helicases are to be moved.
- the double stranded region typically helps to stall the one or more helicases on the adjacent spacer.
- the presence of the double stranded region(s) is particularly preferred if the method is carried out at a salt concentration of about 100 mM or lower.
- Each double stranded region is typically at least 10, such as at least 12, nucleotides in length.
- a double stranded region may be formed by hybridising a shorter polynucleotide to a region adjacent to a spacer.
- the shorter polynucleotide is typically formed from the same nucleotides as the polynucleotide, but may be formed from different nucleotides. For instance, the shorter polynucleotide may be formed from LNA.
- the polynucleotide is preferably provided with a blocking molecule at the end of each spacer opposite to end past which the one or more helicases are to be moved.
- the helicase is stalled between a blocking molecule and a spacer. This can help to ensure that the one or more helicases remain stalled on each spacer. It may also help retain the one or more helicases on the polynucleotide in the case that it/they diffuse(s) off in solution.
- the blocking molecule may be any of the chemical groups discussed below which physically cause the one or more helicases to stall.
- the blocking molecule may be a double stranded region of polynucleotide.
- the one or more spacers preferably comprise one or more chemical groups which physically cause the one or more helicases to stall.
- the one or more chemical groups are preferably one or more pendant chemical groups.
- the one or more chemical groups may be attached to one or more nucleobases in the polynucleotide.
- the one or more chemical groups may be attached to the polynucleotide backbone. Any number of these chemical groups may be present, such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more.
- Suitable groups include, but are not limited to, fluorophores, streptavidin and/or biotin, cholesterol, methylene blue, dinitrophenols (DNPs), digoxigenin and/or anti-digoxigenin and dibenzylcyclooctyne groups.
- Different spacers in the polynucleotide may comprise different stalling molecules.
- one spacer may comprise one of the linear molecules discussed above and another spacer may comprise one or more chemical groups which physically cause the one or more helicases to stall.
- a spacer may comprise any of the linear molecules discussed above and one or more chemical groups which physically cause the one or more helicases to stall, such as one or more abasics and a fluorophore.
- Suitable spacers can be designed depending on the type of polynucleotide and the conditions under which the method of the invention is carried out. Most helicases bind and move along DNA and so may be stalled using anything that is not DNA. Suitable molecules are discussed above.
- the method of the invention is preferably carried out in the presence of free nucleotides and/or the presence of a helicase cofactor. This is discussed in more detail below.
- the one or more spacers are preferably capable of stalling the one or more helicases in the presence of free nucleotides and/or the presence of a helicase cofactor.
- one or more longer spacers are typically used to ensure that the one or more helicases are stalled on the polynucleotide before they are contacted with the transmembrane pore and a potential is applied.
- One or more shorter spacers may be used in the absence of free nucleotides and a helicase cofactor (such that the one or more helicases are in the inactive mode).
- the salt concentration also affects the ability of the one or more spacers to stall the one or more helicases.
- the one or more spacers are preferably capable of stalling the one or more helicases at a salt concentration of about 100 mM or lower. The higher the salt concentration used in the method of the invention, the shorter the one or more spacers that are typically used and vice versa.
- the method may concern moving two or more helicases past a spacer.
- the length of the spacer is typically increased to prevent the trailing helicase from pushing the leading helicase past the spacer in the absence of the pore and applied potential.
- the spacer lengths discussed above may be increased at least 1.5 fold, such 2 fold, 2.5 fold or 3 fold.
- the spacer lengths in the third column of Table 4 above may be increased 1.5 fold, 2 fold, 2.5 fold or 3 fold.
- the complementary polynucleotide is linked to the target RNA using a bridging moiety.
- step (a) preferably comprises hybridising a primer to the target RNA and using the primer to form the complementary polynucleotide.
- the primer preferably comprises a bridging moiety and the bridging moiety is preferably attached to the target RNA such that the complementary polynucleotide linked to the target RNA.
- Step (b) preferably comprises contacting the linked construct comprising the complementary polynucleotide and the target RNA with a transmembrane pore such that such that both the complementary polynucleotide and the target RNA move through the pore.
- the complementary polynucleotide is preferably contacted with the pore before the target RNA.
- Step c) preferably comprises taking one or more measurements as both the complementary polynucleotide and the target RNA move with respect to the pore wherein the measurements are indicative of one or more characteristics of the complementary polynucleotide and the target RNA and thereby characterising the target double stranded polynucleotide.
- Linking and interrogating both the complementary polynucleotide and the target RNA in this way increases the efficiency and accuracy of characterization.
- the bridging moiety is capable of linking the two strands of the target polynucleotide.
- the bridging moiety typically covalently links the two strands of the target polynucleotide.
- the bridging moiety can be anything that is capable of linking the two strands of the target polynucleotide, provided that the bridging moiety does not interfere with movement of the single stranded polynucleotide through the transmembrane pore.
- the bridging moiety may be linked to the target polynucleotide by any suitable means known in the art.
- the bridging moiety may be synthesized separately and chemically attached or enzymatically ligated to the target polynucleotide. Alternatively, the bridging moiety may be generated in the processing of the target polynucleotide.
- the bridging moiety is linked to the target polynucleotide at or near one end of the target polynucleotide.
- the bridging moiety is preferably linked to the target polynucleotide within 10 nucleotides of the end of the target polynucleotide
- Suitable bridging moieties include, but are not limited to a polymeric linker, a chemical linker, a polynucleotide or a polypeptide.
- the bridging moiety comprises DNA, RNA, modified DNA (such as abasic DNA), RNA, PNA, LNA or PEG.
- the bridging moiety is more preferably DNA or RNA.
- the bridging moiety is most preferably a hairpin loop or a hairpin loop adaptor. Suitable hairpin adaptors can be designed using methods known in the art.
- the hairpin loop may be any length.
- the hairpin loop is typically 110 or fewer nucleotides, such as 100 or fewer nucleotides, 90 or fewer nucleotides, 80 or fewer nucleotides, 70 or fewer nucleotides, 60 or fewer nucleotides, 50 or fewer nucleotides, 40 or fewer nucleotides, 30 or fewer nucleotides, 20 or fewer nucleotides or 10 or fewer nucleotides, in length.
- the hairpin loop is preferably from about 1 to 110, from 2 to 100, from 5 to 80 or from 6 to 50 nucleotides in length. Longer lengths of the hairpin loop, such as from 50 to 110 nucleotides, are preferred if the loop is involved in the differential selectability of the adaptor. Similarly, shorter lengths of the hairpin loop, such as from 1 to 5 nucleotides, are preferred if the loop is not involved in the selectable binding as discussed below.
- the hairpin adaptor may be ligated to either end of the complementary polynucleotide and/or target RNA, i.e. the 5′ or the 3′ end.
- the hairpin adaptor may be ligated using any method known in the art.
- the hairpin adaptor may be ligated using a ligase, such as T4 DNA ligase, E. coli DNA ligase, Taq DNA ligase, Tma DNA ligase and 9° N DNA ligase.
- the complementary polynucleotide and the target RNA may be separated as or before the linked construct is contacted with the pore in accordance with the invention. They may be separated as the polynucleotide movement through the pore is controlled by a polynucleotide binding protein, such as a helicase, or molecular brake.
- a polynucleotide binding protein such as a helicase, or molecular brake.
- the complementary polynucleotide and the target RNA may be separated using any method known in the art. For instance, they may be separated by a polynucleotide binding protein or using conditions which favour dehybridsation (examples of conditions which favour dehybridisation include, but are not limited to, high temperature, high pH and the addition of agents that can disrupt hydrogen bonding or base pairing, such as formamide and urea).
- the hairpin adaptor preferably comprises a selectable binding moiety.
- a selectable binding moiety is a moiety that can be selected on the basis of its binding properties.
- a selectable binding moiety is preferably a moiety that specifically binds to a surface.
- a selectable binding moiety specifically binds to a surface if it binds to the surface to a much greater degree than any other moiety used in the invention.
- the moiety binds to a surface to which no other moiety used in the invention binds.
- Suitable selective binding moieties are known in the art.
- Preferred selective binding moieties include, but are not limited to, biotin, a polynucleotide sequence, antibodies, antibody fragments, such as Fab and ScSv, antigens, polynucleotide binding proteins, poly histidine tails and GST tags.
- the most preferred selective binding moieties are biotin and a selectable polynucleotide sequence. Biotin specifically binds to a surface coated with avidins.
- Selectable polynucleotide sequences specifically bind (i.e. hybridise) to a surface coated with homologus sequences.
- selectable polynucleotide sequences specifically bind to a surface coated with polynucleotide binding proteins.
- the hairpin adaptor and/or the selectable binding moiety may comprise a region that can be cut, nicked, cleaved or hydrolysed. Such a region can be designed to allow the complementary polynucleotide and/or target RNA to be removed from the surface to which it is bound following purification or isolation. Suitable regions are known in the art. Suitable regions include, but are not limited to, an RNA region, a region comprising desthiobiotin and streptavidin, a disulphide bond and a photocleavable region.
- the linked construct preferably comprises a leader sequence at the opposite end from the bridging moiety, such as a hairpin loop or hairpin loop adaptor. Leader sequences are discussed in more detail below.
- the method preferably comprises attaching to the linked construct a leader sequence which preferentially threads into the pore.
- the leader sequence facilitates the method of the invention.
- the leader sequence is designed to preferentially thread into the pore of the invention and thereby facilitate the movement of polynucleotide through the pore.
- the leader sequence can also be used to link the construct to the one or more anchors as discussed above.
- the leader sequence may be linked to the complementary polynucleotide or the target RNA.
- the leader sequence typically comprises a polymer.
- the polymer is preferably negatively charged.
- the polymer is preferably a polynucleotide, such as DNA or RNA, a modified polynucleotide (such as abasic DNA), PNA, LNA, polyethylene glycol (PEG) or a polypeptide.
- the leader preferably comprises a polynucleotide and more preferably comprises a single stranded polynucleotide.
- the leader sequence can comprise any of the polynucleotides discussed above.
- the single stranded leader sequence most preferably comprises a single strand of DNA, such as a poly dT section.
- the leader sequence preferably comprises the one or more spacers.
- the leader sequence can be any length, but is typically 10 to 150 nucleotides in length, such as from 20 to 150 nucleotides in length.
- the length of the leader typically depends on the transmembrane pore used in the method.
- the leader sequence is preferably part of a Y adaptor as defined below.
- the method of the invention may involve double coupling of the complementary polynucleotide and target RNA.
- the method of the invention comprises:
- the double stranded polynucleotide is provided with a Y adaptor at one end and a hairpin loop adaptor at the other end.
- the Y adaptor and/or the hairpin adaptor are typically polynucleotide adaptors. They may be formed from any of the polynucleotides discussed above.
- the Y adaptor typically comprises (a) a double stranded region and (b) a single stranded region or a region that is not complementary at the other end.
- the Y adaptor may be described as having an overhang if it comprises a single stranded region.
- the presence of a non-complementary region in the Y adaptor gives the adaptor its Y shape since the two strands typically do not hybridise to each other unlike the double stranded portion.
- the Y adaptor comprises the one or more first anchors. Anchors are discussed in more detail above.
- the Y adaptor preferably comprises a leader sequence which preferentially threads into the pore. This is discussed above.
- the hairpin adaptor preferably comprises a selectable binding moiety as discussed above.
- the hairpin adaptor and/or the selectable binding moiety may comprise a region that can be cut, nicked, cleaved or hydrolysed as discussed above.
- the Y adaptor preferably comprises the one or more helicases and the hairpin loop adaptor preferably comprises the one or more molecular brakes.
- the Y adaptor and/or the hairpin adaptor may be ligated to the polynucleotide using any method known in the art.
- One or both of the adaptors may be ligated using a ligase, such as T4 DNA ligase, E. coli DNA ligase, Taq DNA ligase, Tma DNA ligase and 9° N DNA ligase.
- the adaptors may be added to the polynucleotide using the methods of the invention discussed below.
- step a) of the method comprises modifying the double stranded polynucleotide so that it comprises the Y adaptor at one end and the hairpin loop adaptor at the other end. Any manner of modification can be used. The methods of modification and characterisation may be combined in any way.
- the strength of coupling (or binding) of the bridging moiety (or hairpin) adaptor to the membrane is greater than the strength of coupling (or binding) of the Y adaptor to the membrane. This can be measured in any way.
- a suitable method for measuring the strength of coupling (or binding) is disclosed in the Examples of the UK Application No. 1406147.7.
- the strength of coupling (or binding) of the hairpin loop adaptor is preferably at least 1.5 times the strength of coupling (or binding) of the hairpin loop adaptor, such as at least twice, at least three times, at least four times, at least five or at least ten times the strength of coupling (or binding) of the anchor adaptor.
- the affinity constant (Kd) of the hairpin loop adaptor for the membrane is preferably at least 1.5 times the affinity constant of the Y adaptor, such as at least twice, at least three times, at least four times, at least five or at least ten times the strength of coupling of the Y adaptor.
- the hairpin loop adaptor couples (or binds) more strongly to the membrane than the Y adaptor.
- the hairpin loop adaptor may comprise more anchors than the Y adaptor.
- the hairpin loop adaptor may comprise 2, 3 or more second anchors whereas the Y adaptor may comprise one first anchor.
- the strength of coupling (or binding) of the one or more second anchors to the membrane may be greater than the strength of coupling (or binding) of the one or more first anchors to the membrane.
- the strength of coupling (or binding) of the one or more second anchors to the hairpin loop adaptor may be greater than the strength of coupling (or binding) of the one or more first anchors to the Y adaptor.
- the one or more first anchors and the one or more second anchors may be attached to their respective adaptors via hybridisation and the strength of hybridisation is greater in the one or more second anchors than in the one or more first anchors. Any combination of these embodiments may also be used in the invention. Strength of coupling (or binding) may be measure using known techniques in the art.
- the one or more second anchors preferably comprise one or more groups which couples(s) (or bind(s)) to the membrane with a greater strength than the one or more groups in the one or more first anchors which couple(s) (or bind(s)) to the membrane.
- the hairpin loop adaptor/one or more second anchors couple (or bind) to the membrane using cholesterol and the Y adaptor/one or more first anchors couple (or bind) to the membrane using palmitate.
- Cholesterol binds to triblock copolymer membranes and lipid membranes more strongly than palmitate.
- the hairpin loop adaptor/one or more second anchors couple (or bind) to the membrane using a mono-acyl species, such as palmitate, and the Y adaptor/one or more first anchors couple (or bind) to the membrane using a diacyl species, such as dipalmitoylphosphatidylcholine.
- a mono-acyl species such as palmitate
- the Y adaptor/one or more first anchors couple (or bind) to the membrane using a diacyl species, such as dipalmitoylphosphatidylcholine.
- mRNA is preferably used in the invention to diagnose or prognose a disease or condition.
- Some diseases or conditions are associated with an altered amount (or level) of mRNA.
- the mRNA may be normal or wild-type mRNA, i.e. not alternately spliced.
- the amount (or level) of the mRNA may be increased or decreased in the disease or condition compared with the amount (or level) in a patient without the disease or condition.
- diseases or conditions may be diagnosed or prognosed by determining the amount of the mRNA in a sample from the patient using a method of the invention.
- mRNA splicing defects Many genetic diseases or conditions are caused by mutations that cause alternate mRNA splicing, such as mRNA splicing defects. A number of diseases or conditions are associated with alternate mRNA splicing which are not attributed to overt mutations. The presence or absence of alternate splicing can be identified by determining the presence or absence of an alternately spliced mRNA in a sample from the patient using the method of the invention. In some instances, alternate mRNA splicing may be the normal function of a cell. In such instances, an increased or decreased amount (or level) of the alternately spliced mRNA compared with the normal amount (i.e. the amount in a patient without the disease or condition) may be used to diagnose or prognose the disease or condition.
- the invention provides a method of diagnosing or prognosing a disease or condition associated with an altered amount and/or alternate splicing of messenger RNA (mRNA) in a patient.
- the invention provides a method of determining whether or not a patient has or is at risk of developing a disease or condition associated with an altered amount and/or alternate splicing of messenger RNA (mRNA).
- the method comprises determining the amount and/or identity of the mRNA in a sample from the patient using a method of the invention.
- the disease or condition may be any of those discussed below.
- the disease or condition is preferably cystic fibrosis, familial dysautonomia, frontotemporal lobar dementia, amyotrophic lateral sclerosis, Hutchinson-Gilford progeria syndrome, medium-chain acyl-CoA dehydrogenase (MCAD) deficiency, myotonic dystrophy, Prader-Willi syndrome, spinal muscular atrophy, tauopathy, hypercholesterolemia or cancer.
- MCAD medium-chain acyl-CoA dehydrogenase
- the presence of an altered (i.e. increased or decreased) amount (or level) of the mRNA in the sample from the patient typically diagnoses or prognoses the disease or condition, i.e. indicates that the patient has or is at risk of developing the disease or condition.
- the absence of an altered (i.e. increased or decreased) amount (or level) of the mRNA in the sample from the patient typically indicates that the patient does not have or is not at risk of developing the disease or condition.
- the amount of mRNA can be determined as discussed above.
- the presence of the alternately spliced mRNA in the sample from the patient typically diagnoses or prognoses the disease or condition, i.e. indicates that the patient has or is at risk of developing the disease or condition.
- the absence of the alternately spliced mRNA in the sample from the patient typically indicates that the patient does not have or is not at risk of developing the disease or condition.
- the presence or absence of the alternately spliced mRNA can be determined by identifying RNA in the sample as discussed above.
- An increased or decreased amount (or level) of the alternately spliced mRNA in the sample from the patient typically diagnoses or prognoses the disease or condition, i.e. indicates that the patient has or is at risk of developing the disease or condition.
- No change in the amount of the alternately spliced mRNA in the sample from the patient typically indicates that the patient does not have or is not at risk of developing the disease or condition.
- the amount of the alternately spliced mRNA can be determined as discussed above.
- miRNA is preferably used in the invention to diagnose or prognose a disease or condition.
- the invention provides a method of diagnosing or prognosing a disease or condition associated with a miRNA.
- the invention provides a method of determining whether or not a patient has or is at risk of developing a disease or condition associated with a miRNA.
- the method comprises determining the presence or absence of the miRNA in a sample from the patient using a method of the invention.
- the disease or condition may be any of those discussed below.
- the presence of the miRNA in the sample from the patient typically indicates that the patient has or is at risk of developing the disease or condition.
- the absence of the miRNA in the sample from the patient typically indicates that the patient does not have or is not at risk of developing the disease or condition.
- the presence or absence of the miRNA can be determined by identifying any miRNAs in the sample as discussed above.
- the disease or condition is preferably cancer, coronary heart disease, cardiovascular disease or sepsis.
- the disease or condition is more preferably abdominal aortic aneurysm, acute lymphoblastic leukemia (ALL), acute myeloid leukemia (AML), acute myocardial infarction, acute promyelocytic leukemia (APL), adenoma, adrenocortical carcinoma, alcoholic liver disease, Alzheimer's disease, anaplastic thyroid carcinoma (ATC), anxiety disorder, asthma, astrocytoma, atopic dermatitis, autism spectrum disorder (ASD), B-cell chronic lymphocytic leukemia, B-cell lymphoma, Becker muscular dystrophy (BMD), bladder cancer, brain neoplasm, breast cancer, Burkitt lymphoma, cardiac hypertrophy, cardiomyopathy, cardiovascular disease, cerebellar neurodegeneration, cervical cancer, cholangiocarcinoma, cholesteatoma, choriocarcinoma, chronic lymphocytic
- the patient may be any of the mammals discussed above.
- the patient is preferably human.
- the patient is an individual.
- the sample may be any of those discussed above.
- the sample is typically from any tissue or bodily fluid.
- the sample typically comprises a body fluid and/or cells of the patient and may, for example, be obtained using a swab, such as a mouth swab.
- the sample may be, or be derived from, blood, urine, saliva, skin, cheek cell or hair root samples.
- the target RNA is typically extracted from the sample before it is used in the method of the invention.
- the method may concern diagnosis of the disease or condition in the patient, i.e. determining whether or not the patient has the disease or condition.
- the patient may be symptomatic.
- the method may concern prognosing the disease or condition in the patient, i.e. determining whether or not the patient is likely to develop the disease or condition.
- the patient can be asymptomatic.
- the patient can have a genetic predisposition to the disease or condition.
- the patient may have one or more family member(s) with the disease or condition.
- the invention also provides a kit for characterising a target RNA.
- the kit comprises (a) a transmembrane pore and (b) a reverse transcriptase enzyme and/or a reverse transcription primer. Any of the embodiments discussed above with reference to the method of the invention equally apply to the kits.
- the kit may further comprise the components of a membrane, such as the phospholipids needed to form an amphiphilic layer, such as a lipid bilayer.
- a membrane such as the phospholipids needed to form an amphiphilic layer, such as a lipid bilayer.
- the kit of the invention may additionally comprise one or more other reagents or instruments which enable any of the embodiments mentioned above to be carried out.
- reagents or instruments include one or more of the following: suitable buffer(s) (aqueous solutions), means to obtain a sample from a subject (such as a vessel or an instrument comprising a needle), a membrane as defined above or voltage or patch clamp apparatus.
- Reagents may be present in the kit in a dry state such that a fluid sample resuspends the reagents.
- the kit may also, optionally, comprise instructions to enable the kit to be used in the method of the invention or details regarding which patients the method may be used for.
- the kit typically comprises nucleotides.
- the kit preferably comprises dAMP, dTMP, dGMP and dCMP.
- the kit preferably does not comprise means to amplify and/or express polynucleotides.
- the invention also provides an apparatus for characterising target RNAs.
- the apparatus comprises (a) a plurality of pores and (b) a plurality of a plurality of reverse transcriptase enzymes and/or a plurality of reverse transcription primers.
- the apparatus preferably further comprises instructions for carrying out the method of the invention.
- the apparatus may be any conventional apparatus for polynucleotide analysis, such as an array or a chip. Any of the embodiments discussed above with reference to the methods of the invention are equally applicable to the apparatus of the invention.
- the apparatus is preferably set up to carry out the method of the invention.
- the apparatus preferably comprises:
- a sensor device that is capable of supporting the plurality of pores and being operable to perform RNA characterisation using the pores and the helicases or constructs;
- At least one reservoir for holding material for performing the characterisation.
- the apparatus preferably comprises:
- a sensor device that is capable of supporting the plurality of pores and being operable to perform RNA characterisation using the pores and the helicases or constructs;
- At least one port for delivery of the material for performing the characterisation.
- the apparatus preferably comprises:
- a sensor device that is capable of supporting the membrane and plurality of pores and being operable to perform RNA characterising using the pores
- At least one reservoir for holding material for performing the characterising
- a fluidics system configured to controllably supply material from the at least one reservoir to the sensor device
- the apparatus may be any of those described in International Application No. PCT/GB08/004127 (published as WO 2009/077734), PCT/GB10/000789 (published as WO 2010/122293), International Application No. PCT/GB10/002206 (published as WO 2011/067559) or International Application No. PCT/US99/25679 (published as WO 00/28312).
- This example describes the sample preparation procedure used to produce the cDNA which can then be characterised using the nanopore system.
- the steps of the procedure outlined in this example are steps A to C shown in FIG. 1 .
- cDNA SEQ ID NO: 11
- mRNA messenger RNA
- SuperScript II Reverse Transcriptase (Life Technologies) using an adaption of the standard SuperScript II protocol (the protocol that was followed is shown below).
- the mRNA (SEQ ID NO: 8), custom pT primer (SEQ ID NO: 9 attached at its 3′ end to four iSpC3 spacers which are attached to the 5′ end of SEQ ID NO: 10), dNTP mix and nuclease free water were mixed together and then heated to 65° C. for 5 minutes before being quick chilled on ice. 5 ⁇ SuperScript II buffer and DTT were then added to the mixture and the sample incubated at 42° C. for 2 minutes. Finally, Superscript II was added to the reaction mixture and then the sample was incubated at 42° C. for 50 mins and then at 70° C. for 15 minutes.
- the cDNA product (SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9) hybridised to the mRNA (SEQ ID NO: 8) was then purified using SPRI beads (Agencourt AMPure).
- This example describes the characterisation of cDNA (SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9) using a nanopore system. See FIG. 2 for a cartoon representation of the system.
- cDNA/mRNA (0.05 nM, SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9, where SEQ ID NO: 11 is hybridised to SEQ ID NO: 8) was hybridised to the tether (0.25 nM, SEQ ID NO: 12) by heating the sample at 40° C. in buffer (10 mM TRIS, 50 mM NaCl, pI17.5) for two minutes and then slow cooling to 30° C. over 15 minutes. This is shown as step D in FIG. 1 .
- the cDNA/mRNA sample was then pre-incubated with T4 Dda-E94C/A360C (125 nM, SEQ ID NO: 13 with mutations E94C/A360C) for an hour at room temperature in buffer (126.5 mM KCl, 25 mM NaCl, 25 mM potassium phosphate pH 7.5-8.0 and 5 mM Tris).
- buffer (1 mL, 600 mM KCl, 25 mM potassium phosphate, 75 mM Potassium Ferrocyanide (II), 25 mM Potassium ferricyanide (III), pH 8) was flowed through the system to remove any excess MspA nanopores (MS(B1-G75S/G77S/L88N/Q126R)8 MspA (SEQ ID NO: 2 with mutations G75S/G77S/L88N/Q126R).
- MgCl 2 (10 mM final concentration) and ATP (5 mM final concentration) were mixed together with buffer (600 mM KCl, 25 mM potassium phosphate, 75 mM Potassium Ferrocyanide (II), 25 mM Potassium ferricyanide (III), pH 8) and then added to the cDNA/mRNA (0.05 nM final concentration), T4 Dda-E94C/A360C (1 nM final concentration, SEQ ID NO: 13 with mutations E94C/A360C) pre-mix. The pre-mix was then added to the nanopore experimental system. Experiments were carried out for eighteen hours at an applied potential of +120 mV and helicase-controlled cDNA movement was monitored.
- the region labelled 1 corresponds to the capture tail
- the region labelled 2 corresponds to the iSpC3 spacers in the primer
- the region labelled 3 corresponds to the polyT primer for the reverse transcriptase
- the region labelled 4 corresponds to of the cDNA region.
- This example shows characterisation of cDNA which was transcribed from mRNA (SEQ ID NO: 8) as described in Example 1.
- This example describes the characterisation of cDNA using a nanopore system, where the cDNA was transcribed from mRNA found in yeast ( Saccharomyces cerevisiae ). See FIG. 2 for a cartoon representation of the system.
- Example 2 The materials and methods procedure described in Example 2 was repeated for the cDNA (0.05 nM) transcribed from yeast ( Saccharomyces cerevisiae ).
- FIGS. 4 and 5 show an example of a complete yeast cDNA trace.
- This example shows characterisation of cDNA which was transcribed from yeast ( Saccharomyces cerevisiae ) polyA+ mRNA as described in Example 1.
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Engineering & Computer Science (AREA)
- Analytical Chemistry (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Immunology (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- Physics & Mathematics (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Pathology (AREA)
- General Physics & Mathematics (AREA)
- Electrochemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Description
- The invention relates to a new method of characterising a target ribonucleic acid (RNA) involving forming a complementary polynucleotide. The method uses a transmembrane pore.
- There is currently a need for rapid and cheap polynucleotide (e.g. DNA or RNA) sequencing and identification technologies across a wide range of applications. Existing technologies are slow and expensive mainly because they rely on amplification techniques to produce large volumes of polynucleotide and require a high quantity of specialist fluorescent chemicals for signal detection.
- Transmembrane pores (nanopores) have great potential as direct, electrical biosensors for polymers and a variety of small molecules. In particular, recent focus has been given to nanopores as a potential DNA sequencing technology.
- When a potential is applied across a nanopore, there is a change in the current flow when an analyte, such as a nucleotide, resides transiently in the barrel for a certain period of time. Nanopore detection of the nucleotide gives a current change of known signature and duration. In the “strand sequencing method, a single polynucleotide strand is passed through the pore and the identity of the nucleotides are derived. Strand sequencing can involve the use of a nucleotide handling protein, such as a helicase, to control the movement of the polynucleotide through the pore.
- One group of RNAs which are difficult to detect in low concentrations are micro-ribonucleic acids (micro-RNA or miRNAs). miRNAs are highly stable RNA oligomers, which can regulate protein production post-transcriptionally. They act by one of two mechanisms. In plants, miRNAs have been shown to act chiefly by directing the cleavage of messenger RNA, whereas in animals, gene regulation by miRNAs typically involves hybridisation of miRNAs to the 3′ UTRs of messenger RNAs, which hinders translation (Lee et al.,
Cell 75, 843-54 (1993); Wightman et al., Cell 75, 855-62 (1993); and Esquela-Kerscher et al., Cancer 6, 259-69 (2006)). miRNAs frequently bind to their targets with imperfect complementarity. They have been predicted to bind to as many as 200 gene targets each and to regulate more than a third of all human genes (Lewis et al., Cell 120, 15-20 (2005)). - The expression level of certain microRNAs is known to change in tumours, giving different tumour types characteristic patterns of microRNA expression (Rosenfeld, N. et al., Nature Biotechnology 26, 462-9 (2008)). In addition, miRNA profiles have been shown to be able to reveal the stage of tumour development with greater accuracy than messenger RNA profiles (Lu et al., Nature 435, 834-8 (2005) and Barshack et al., The International Journal of Biochemistry & Cell Biology 42, 1355-62 (2010)). These findings, together with the high stability of miRNAs, and the ability to detect circulating miRNAs in serum and plasma (Wang et al., Biochemical and Biophysical Research Communications 394, 184-8 (2010); Gilad et al., PloS One 3, e3148 (2008); and Keller et al., Nature Methods 8, 841-3 (2011)), have led to a considerable amount of interest in the potential use of microRNAs as cancer biomarkers. For treatment to be effective, cancers need to be classified accurately and treated differently, but the efficacy of tumour morphology evaluation as a means of classification is compromised by the fact that many different types of cancer share morphological features miRNAs offer a potentially more reliable and less invasive solution.
- The inventors have surprisingly demonstrated that it is possible to characterise a target RNA by forming a complementary polynucleotide from the target RNA and then characterising the complementary polynucleotide using a transmembrane pore. The invention therefore provides a method of characterising a target RNA, comprising:
-
- (a) forming a complementary polynucleotide from the target RNA;
- (b) contacting the complementary polynucleotide with a transmembrane pore such that the complementary polynucleotide moves through the pore; and
- (c) taking one or more measurements as the complementary polynucleotide moves with respect to the pore wherein the measurements are indicative of one or more characteristics of the complementary polynucleotide and thereby characterising the target RNA.
- The invention also provides:
-
- a method of determining whether or not a patient has or is at risk of developing a disease or condition associated with an altered amount and/or alternate splicing of messenger RNA (mRNA), comprising determining the amount and/or identity of the mRNA in a sample from the patient using a method of the invention and thereby determining whether or not the patient has or is at risk of developing the disease or condition;
- a method of determining whether or not a patient has or is at risk of developing a disease or condition associated with a miRNA, comprising determining the presence or absence of the miRNA in a sample from the patient using a method of the invention and thereby determining whether or not a patient has or is at risk of developing the disease or condition;
- a kit for characterising a target RNA comprising (a) a transmembrane pore and (b) a reverse transcriptase enzyme and/or a reverse transcription primer; and
- an apparatus for characterising target RNAs in a sample, comprising (a) a plurality of transmembrane pores and (b) a plurality of reverse transcriptase enzymes and/or a plurality of reverse transcription primers.
-
FIG. 1 shows the sample preparation procedure outlined in Example 1 and 2. A sample of mRNA (shown in step A, labelled X) is annealed to a capture strand (shown in step B, labelled Y). The capture strand anneals to the mRNA at the polyA region (labelled 1, this region can vary in length depending on the mRNA). A reverse transcriptase enzyme forms the complementary cDNA strand (shown in step C as a dotted line) to the mRNA. The tether (shown in step D, labelled Z) then anneals to the cDNA. -
FIG. 2 shows the nanopore system used in Example 2 and 3 to characterise cDNA. The cDNA/mRNA (cDNA labelled 1, mRNA labelled 2) are tethered to the bilayer (labelled 3) by a short strand of DNA with a 3′ cholesterol tether (labelled 4). The leader sequence of the cDNA allows the enzyme (labelled 5) to bind to the cDNA but the iSpC3 spacers (shown as a square and labelled 6) stall the enzyme on the DNA until the DNA enters the nanopore (labelled 7). The enzyme moves along the cDNA, controlling the movement through the nanopore. The mRNA dehybridises from the complementary cDNA as the enzyme moves along the cDNA. The direction of movement of the enzyme is indicated by the arrow labelled 8 and the direction of movement of the cDNA is indicated by the arrow labelled 9. -
FIG. 3 shows an example current trace (y-axis label=Current (pA, 40 to 120), x-axis label=Time (s, 2460 to 2600)) of when a helicase (T4 Dda E94C/A360C (SEQ ID NO: 13 with mutations E94C/A360C)) controls the translocation of cDNA (0.05 nM, SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9, where SEQ ID NO: 11 is hybridised to SEQ ID NO: 8) through a nanopore (MS(B1-G75S/G77S/L88N/Q126R)8 MspA (SEQ ID NO: 2 with mutations G75S/G77S/L88N/Q126R)). A number of features in the electrical read out are identified as the helicase controls the cDNA movement through the nanopore (label 1=capture tail, 2=the iSpC3 spacers in the primer, 3=polyT primer for the reverse transcriptase and 4=region of cDNA). -
FIG. 4 shows an example current trace (y-axis label=Current (pA, 25 to 150), x-axis label=Time (s, 2300 to 2400)) of when a helicase (T4 Dda E94C/A360C (SEQ ID NO: 13 with mutations E94C/A360C)) controls the translocation of cDNA (0.05 nM) transcribed from yeast mRNA through a nanopore (MS(B1-G75S/G77S/L88N/Q126R)8 MspA (SEQ ID NO: 2 with mutations G75 S/G77 S/L88N/Q 126R)). -
FIG. 5 shows a zoomed in region of an example current trace (y-axis label=Current (pA, 50 to 250), x-axis label=Time (s, 1214 to 1224)) of when a helicase (T4 Dda E94C/A360C (SEQ ID NO: 13 with mutations E94C/A360C)) controls the translocation of cDNA (0.05 nM) transcribed from yeast mRNA through a nanopore (MS(B1-G75S/G77S/L88N/Q126R)8 MspA (SEQ ID NO: 2 with mutations G75S/G77S/L88N/Q126R)). A number of features in the electrical read out are identified as the helicase controls the cDNA movement through the nanopore (label 1=capture tail, 2=the iSpC3 spacers in the primer, 3=polyT primer for the reverse transcriptase and 4=region of cDNA). - SEQ ID NO: 1 shows the codon optimised polynucleotide sequence encoding the MS-B1 mutant MspA monomer. This mutant lacks the signal sequence and includes the following mutations: D90N, D91N, D93N, D118R, D134R and E139K.
- SEQ ID NO: 2 shows the amino acid sequence of the mature form of the MS-B1 mutant of the MspA monomer. This mutant lacks the signal sequence and includes the following mutations: D90N, D91N, D93N, D118R, D134R and E139K.
- SEQ ID NO: 3 shows the polynucleotide sequence encoding one monomer of α-hemolysin-E111N/K147N (α-HL-NN; Stoddart et al., PNAS, 2009; 106(19): 7702-7707).
- SEQ ID NO: 4 shows the amino acid sequence of one monomer of α-HL-NN.
- SEQ ID NOs: 5 to 7 show the amino acid sequences of MspB, C and D.
- SEQ ID NO: 8 shows the polynucleotide sequence of the messenger RNA used in Examples 1 and 2.
- SEQ ID NO: 9 shows part of the polynucleotide sequence which makes up the primer used in Example 1. The 3′ end of SEQ ID NO: 9 is attached by four iSpC3 spacers to the 5′ end of SEQ ID NO: 10.
- SEQ ID NO: 10 shows part of the polynucleotide sequence which makes up the primer used in Example 1. The 5′ end of SEQ ID NO: 10 is attached by four iSpC3 spacers to the 3′ end of SEQ ID NO: 9.
- SEQ ID NO: 11 shows the polynucleotide sequence of the cDNA transcribed from SEQ ID NO: 8 which is attached at its 5′ end to the 3′ end of the primer sequence (SEQ ID NO: 10 which is attached by four iSpC3 spacers to the 3′ end of SEQ ID NO: 9).
- SEQ ID NO: 12 shows the polynucleotide sequence of the strand used to tether the cDNA/mRNA in Examples 2 and 3. Attached to the 3′ end of SEQ ID NO: 11 is six iSp18 spacers which are attached to two thymine residues and a 3′ cholesterol TEG.
- SEQ ID NO: 13 shows the amino acid sequence of T4 Dda helicase.
- It is to be understood that different applications of the disclosed products and methods may be tailored to the specific needs in the art. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments of the invention only, and is not intended to be limiting.
- In addition as used in this specification and the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “a target RNA” includes two or more target RNAs, reference to “a complementary polynucleotide” includes two or more such complementary polynucleotides, reference to “a transmembrane protein pore” includes two or more such pores, and the like.
- All publications, patents and patent applications cited herein, whether supra or infra, are hereby incorporated by reference in their entirety.
- The invention provides a method of characterising a target ribonucleic acid (RNA). A complementary polynucleotide is formed from the target RNA and the complementary polynucleotide is characterised using a transmembrane pore. This allows characterisation of the target RNA. The target RNA is preferably not ligated to a non-RNA leader, such as a DNA leader.
- The method of the invention, and in particular the sample preparation involved, is straightforward and simple. Since the transmembrane pore is capable of detecting single molecule of the complementary polynucleotide, there is no need for amplification of the target RNA or complementary polynucleotide. The method typically does not comprise polymerase chain reaction (PCR) or reverse transcription PCR (RT-PCR). This considerably reduces the amount of workflow needed to characterise a target RNA. It also avoids any biases and artifacts introduced by PCR.
- RNA is a macromolecule comprising two or more ribonucleotides. The target RNA may comprise any combination of any ribonucleotides. The ribonucleotides can be naturally occurring or artificial. One or more ribonucleotides in the target RNA can be oxidized or methylated. One or more ribonucleotides in the target RNA may be damaged. For instance, the target RNA may comprise a pyrimidine dimer, such as a uracil dimer. Such dimers are typically associated with damage by ultraviolet light and are the primary cause of skin melanomas. One or more ribonucleotides in the target RNA may be modified, for instance with a label or a tag. Suitable labels are described below. The target RNA may comprise one or more spacers.
- A ribonucleotide typically contains a nucleobase, a ribose sugar and at least one phosphate group. The nucleobase is typically heterocyclic. Nucleobases include, but are not limited to, purines and pyrimidines and more specifically adenine, guanine, thymine, uracil and cytosine. The nucleotide typically contains a monophosphate, diphosphate or triphosphate. Phosphates may be attached on the 5′ or 3′ side of a nucleotide.
- Ribonucleotides include, but are not limited to, adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), cytidine monophosphate (CMP), 5-methylcytidine monophosphate, 5-methylcytidine diphosphate, 5-methylcytidine triphosphate, 5-hydroxymethylcytidine monophosphate, 5-hydroxymethylcytidine diphosphate and 5-hydroxymethylcytidine triphosphate. The nucleotides are preferably selected from AMP, IMP, GMP, CMP and UMP.
- A ribonucleotide may be abasic (i.e. lack a nucleobase). A ribonucleotide may also lack a nucleobase and a sugar (i.e. is a C3 spacer).
- The ribonucleotides in the target RNA may be attached to each other in any manner. The ribonucleotides are typically attached by their sugar and phosphate groups as in nucleic acids. The ribonucleotides may be connected via their nucleobases as in pyrimidine dimers.
- The target RNA may be single stranded or double stranded.
- The target RNA is preferably messenger RNA (mRNA). The target mRNA may be an alternate splice variant. Altered amounts (or levels) of mRNA and/or alternate mRNA splice variants may be associated with diseases or conditions.
- The target RNA is preferably a microRNA (or miRNA). Suitable miRNAs for use in the invention are well known in the art. For instance, suitable miRNAs are stored on publically available databases (Jiang Q., Wang Y., Hao Y., Juan L., Teng M., Zhang X., Li M., Wang G., Liu Y., (2009) miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res.). The use of mRNAs and miRNAs to diagnose or prognose diseases or conditions are discussed in more detail below.
- The whole or only part of the target RNA may be characterised using this method. The target RNA can be any length. For example, the RNA can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 ribonucleotides in length. The target RNA can be 1000 or more ribonucleotides, 5000 or more ribonucleotides in length or 100000 or more ribonucleotides in length.
- The target RNA is typically present in or derived from any suitable sample. The invention is typically carried out on a sample that is known to contain or suspected to contain the target RNA. Alternatively, the invention may be carried out on a sample to confirm the identity of one or more target RNAs whose presence in the sample is known or expected.
- The sample may be a biological sample. The invention may be carried out in vitro on a sample obtained from or extracted from any organism or microorganism. The organism or microorganism is typically archaeal, prokaryotic or eukaryotic and typically belongs to one of the five kingdoms: plantae, animalia, fungi, monera and protista. The target RNA is preferably eukaryotic. For instance, the target RNA may be derived from a eukaryotic cell or may be derived from a virus using a eukaryotic cell's transcription machinery. The invention may be carried out in vitro on a sample obtained from or extracted from any virus.
- The sample is preferably a fluid sample. The sample typically comprises a body fluid of the patient. The sample may be urine, lymph, saliva, mucus or amniotic fluid but is preferably blood, plasma or serum. Typically, the sample is human in origin, but alternatively it may be from another mammal animal such as from commercially farmed animals such as horses, cattle, sheep or pigs or may alternatively be pets such as cats or dogs. Alternatively a sample of plant origin is typically obtained from a commercial crop, such as a cereal, legume, fruit or vegetable, for example wheat, barley, oats, canola, maize, soya, rice, bananas, apples, tomatoes, potatoes, grapes, tobacco, beans, lentils, sugar cane, cocoa or cotton.
- The sample may be a non-biological sample. The non-biological sample is preferably a fluid sample. Examples of a non-biological sample include surgical fluids, water such as drinking water, sea water or river water, and reagents for laboratory tests.
- The sample is typically processed prior to being assayed, for example by centrifugation or by passage through a membrane that filters out unwanted molecules or cells, such as red blood cells. The sample may be measured immediately upon being taken. The sample may also be typically stored prior to assay, preferably below −70° C. The target RNA is typically extracted from the sample before it is used in the method of the invention. RNA extraction kits are commercially available from, for instance, New England Riolabs® and Invitrogen®.
- The target RNA is typically not amplified in the method of the invention. The method typically does not comprise making multiple copies of the target RNA.
- The complementary polynucleotide is typically not amplified in the method of the invention. The method typically does not comprise making multiple copies of the complementary polynucleotide.
- The method preferably does not comprise polymerase chain reaction (PCR) or reverse transcription PCR (RT-PCR).
- The method of the invention comprises forming a complementary polynucleotide from the target RNA. The polynucleotide may be complementary to part of or all of the target RNA. If the polynucleotide is complementary to part of the target RNA, it is typically complementary to a sufficient amount of the target RNA that it may be characterised in accordance with the invention.
- The polynucleotide is typically complementary based on the pairing of its nucleobases, typically adenine (A), guanine (G), thymine (T) and cytosine (C), with their RNA base counterparts, typically uracil (U), cytosine (C), adenine (A) and guanine (G) respectively.
- A polynucleotide, such as a nucleic acid, is a macromolecule comprising two or more nucleotides. The polynucleotide or nucleic acid may comprise any combination of any nucleotides. At least a portion of the polynucleotide is complementary to all of or part of the target RNA. The nucleotides can be naturally occurring or artificial. One or more nucleotides in the polynucleotide can be oxidized or methylated. One or more nucleotides in the polynucleotide may be damaged. For instance, the polynucleotide may comprise a pyrimidine dimer. Such dimers are typically associated with damage by ultraviolet light and are the primary cause of skin melanomas. One or more nucleotides in the polynucleotide may be modified, for instance with a label or a tag. Suitable labels are described below. The polynucleotide may comprise one or more spacers.
- A nucleotide typically contains a nucleobase, a sugar and at least one phosphate group. The nucleobase and sugar form a nucleoside.
- The nucleobase is typically heterocyclic. Nucleobases include, but are not limited to, purines and pyrimidines and more specifically adenine (A), guanine (G), thymine (T), uracil (U) and cytosine (C).
- The sugar is typically a pentose sugar. Nucleotide sugars include, but are not limited to, ribose and deoxyribose. The sugar is preferably a deoxyribose.
- The polynucleotide preferably comprises the following nucleosides: deoxyadenosine (dA), deoxyuridine (dU) and/or thymidine (dT), deoxyguanosine (dG) and deoxycytidine (dC).
- The nucleotide in the polynucleotide is typically a ribonucleotide or deoxyribonucleotide. The nucleotide is preferably a deoxyribonucleotide. The nucleotide typically contains a monophosphate, diphosphate or triphosphate. Phosphates may be attached on the 5′ or 3′ side of a nucleotide.
- Nucleotides for use in the polynucleotides of the invention include, but are not limited to, adenosine monophosphate (AMP), guanosine monophosphate (GMP), thymidine monophosphate (TMP), uridine monophosphate (UMP), 5-methylcytidine monophosphate, 5-hydroxymethylcytidine monophosphate, cytidine monophosphate (CMP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyguanosine monophosphate (dGMP), deoxythymidine monophosphate (dTMP), deoxyuridine monophosphate (dUMP) and deoxycytidine monophosphate (dCMP). The nucleotides are preferably selected from AMP, TMP, GMP, CMP, UMP, dAMP, dTMP, dGMP, dCMP and dUMP. The nucleotides are most preferably selected from dAMP, dTMP, dGMP, dCMP and dUMP. The polynucleotide preferably comprises the following nucleotides: dAMP, dUMP and/or dTMP, dGMP and dCMP.
- A nucleotide may be abasic (i.e. lack a nucleobase). A nucleotide may also lack a nucleobase and a sugar (i.e. is a C3 spacer).
- The nucleotides in the polynucleotide may be attached to each other in any manner. The nucleotides are typically attached by their sugar and phosphate groups as in nucleic acids. The nucleotides may be connected via their nucleobases as in pyrimidine dimers.
- The polynucleotide is typically single stranded. The polynucleotide can be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA). The polynucleotide may be any synthetic nucleic acid known in the art, such as peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA) or other synthetic polymers with nucleotide side chains. The PNA backbone is composed of repeating N-(2-aminoethyl)-glycine units linked by peptide bonds. The GNA backbone is composed of repeating glycol units linked by phosphodiester bonds. The TNA backbone is composed of repeating threose sugars linked together by phosphodiester bonds. LNA is formed from ribonucleotides as discussed above having an extra bridge connecting the 2′ oxygen and 4′ carbon in the ribose moiety.
- The complementary polynucleotide is most preferably complementary deoxyribonucleic acid (cDNA).
- The complementary polynucleotide may be any length. The complementary polynucleotide is typically the same length as the target RNA. For example, the complementary polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 deoxyribonucleotides in length. The complementary polynucleotide can be 1000 or more deoxyribonucleotides, 5000 or more deoxyribonucleotides in length or 100000 or more deoxyribonucleotides in length.
- The complementary polynucleotide may be formed from the target RNA using any known method. Enzymes which convert RNA to complementary nucleic acids such as those described above are known in the art.
- If the complementary polynucleotide is cDNA, the method comprises reverse transcribing the target RNA to form a cDNA. Step (a) preferably comprising reverse transcribing the target RNA using a reverse transcriptase to form the cDNA. The reverse transcriptase may reverse transcribe all or part of the available target RNA. Reverse transcriptases are enzymes which are capable of catalysing the formation of cDNA from a RNA template. They are commercially available from, for instance, New England Biolabs® and Invitrogen®. The target RNA is typically contacted with the reverse transcriptase in the presence of a population of deoxyribonucleotides as defined above. The population typically comprises all of the deoxyribonucleotides needed to base pair with each of the ribonucleotides in the target RNA. The population of deoxyribonucleotides typically comprises dAMP, dTMP, dGMP and dCMP.
- Primers
- Step (a) preferably comprises hybridising a primer to the target RNA and using the primer to form the complementary polynucleotide. The primer typically assists with conversion of the target RNA to the complementary polynucleotide. For instance, the double stranded region formed by hybridisation of the primer to the target RNA may provide a binding site for a reverse transcriptase. The reverse transcriptase may then reverse transcribe the remainder of the target RNA to form cDNA. The complementary polynucleotide, such as cDNA, produced in step (a) is typically attached to the primer. The primer may comprise a bridging moiety, such as a hairpin loop, as discussed below.
- Using a primer has various advantages. It avoids the need to amplify the target RNA using PCR. This reduces the amount of workflow that needs to be carried out and avoids any biases and artifacts introduced by PCR. Since the primer can be designed to bind at a specific end of the target RNA (see below), the complementary polynucleotide can be formed in a specific direction and the complementary polynucleotide can be moved through the pore is a known direction. This facilitates the chracterisation of the target RNA.
- The primer is typically a polynucleotide. The polynucleotide may be any of those discussed above.
- The primer preferably comprises a leader sequence and/or a region to which a polynucleotide binding protein is capable of binding. The leader sequence facilitates the method of the invention. The leader sequence is designed to preferentially thread into the transmembrane pore and thereby facilitate the movement of the complementary polynucleotide through the pore. The leader sequence is typically a polynucleotide, such as DNA or RNA, a modified polynucleotide (such as abasic DNA), PNA, LNA, PEG or a polypeptide. The leader is preferably a polynucleotide and is more preferably a single stranded polynucleotide. The leader sequence can be any of the polynucleotides discussed above. The single stranded leader sequence is most preferably a single strand of DNA. The leader sequence can be any length, but is typically 27 to 150 nucleotides in length, such as from 50 to 150 nucleotides in length.
- The region to which a polynucleotide binding protein is capable of binding is typically a polynucleotide. It can be any of the polynucleotides discussed above. The region may correspond to the leader sequence. Alternatively, the region may be distinct from the leader sequence. The polynucleotide binding protein may help to control the movement of the complementary polynucleotide through the pore as discussed in more detail below.
- As discussed above, the target RNA is preferably eukaryotic. Eukaryotic RNA typically comprises polyA tail, i.e. a stretch of consecutive adenosine monophosphates. The polyA tail is typically at the 3′ end of the RNA. In such embodiments, step (a) preferably comprises hybridising a primer to the polyA tail of the target RNA and using the primer to reverse transcribe the target RNA to form the complementary polynucleotide. The primer preferably comprises a polyT region, i.e. region containing only nucleotides based on thymine. The polyT region may contain IMP or dTMP. The polyT region may be any length, such as at least 10, at least 15, at least 20, at least 25 or more. The primer is preferably a polyT-VN primer, which comprises a polyT region and a VN anchor where V is dAMP, dCMP or dGMP and N is dAMP, dCMP, dGMP or dTMP. Such primers are commercially available, such as from New England Biolabs®.
- For non-eukaryotic target RNA, such as bacterial target RNA, step (a) further comprises adding a polyA tail to the target RNA, for instance using a polyA polymerase and ATP. Step (a) may further comprise hybridising a primer to the added polyA tail as described above.
- The method of the invention also comprises (b) contacting the complementary polynucleotide with a transmembrane pore. The method also comprises (c) taking one or more measurements as the complementary polynucleotide moves with respect to the pore wherein the measurements are indicative of one or more characteristics of the complementary polynucleotide and thereby characterising the target RNA.
- Steps (b) and (c) are preferably carried out with a potential applied across the pore. The applied potential may be a voltage potential. Alternatively, the applied potential may be a chemical potential. An example of this is using a salt gradient across an amphiphilic layer. A salt gradient is disclosed in Holden et al., J Am Chem Soc. 2007 Jul. 11; 129(27):8650-5. In some instances, the current passing through the pore as the polynucleotide moves with respect to the pore is used to determine the sequence of the complementary polynucleotide and hence the sequence of the target RNA. This is Strand Sequencing.
- The complementary polynucleotide may be contacted with the pore when it is fully or partially hybridized to the target RNA. Alternatively, the complementary polynucleotide may be contacted with the pore in the absence of the target RNA. In such embodiments, step (a) preferably further comprises removing the target RNA, for instance by digesting the target RNA. Step (a) may further comprise contacting the target RNA with RNAse H. This enzyme specifically digests the RNA strand of RNA:DNA duplexes.
- A transmembrane pore is a structure that crosses the membrane to some degree. It permits hydrated ions driven by an applied potential to flow across or within the membrane. The transmembrane pore typically crosses the entire membrane so that hydrated ions may flow from one side of the membrane to the other side of the membrane. However, the transmembrane pore does not have to cross the membrane. It may be closed at one end. For instance, the pore may be a well in the membrane along which or into which hydrated ions may flow.
- Any transmembrane pore may be used in the invention. The pore may be biological or artificial. Suitable pores include, but are not limited to, protein pores, polynucleotide pores and solid state pores.
- Any membrane may be used in accordance with the invention. Suitable membranes are well-known in the art. The membrane is preferably an amphiphilic layer. An amphiphilic layer is a layer formed from amphiphilic molecules, such as phospholipids, which have both at least one hydrophilic portion and at least one lipophilic or hydrophobic portion. The amphiphilic molecules may be synthetic or naturally occurring. Non-naturally occurring amphiphiles and amphiphiles which form a monolayer are known in the art and include, for example, block copolymers (Gonzalez-Perez et al., Langmuir, 2009, 25, 10447-10450). Block copolymers are polymeric materials in which two or more monomer sub-units that are polymerized together to create a single polymer chain. Block copolymers typically have properties that are contributed by each monomer sub-unit. However, a block copolymer may have unique properties that polymers formed from the individual sub-units do not possess. Block copolymers can be engineered such that one of the monomer sub-units is hydrophobic (i.e. lipophilic), whilst the other sub-unit(s) are hydrophilic whilst in aqueous media. In this case, the block copolymer may possess amphiphilic properties and may form a structure that mimics a biological membrane. The block copolymer may be a diblock (consisting of two monomer sub-units), but may also be constructed from more than two monomer sub-units to form more complex arrangements that behave as amphipiles. The copolymer may be a triblock, tetrablock or pentablock copolymer.
- The amphiphilic layer may be a monolayer or a bilayer. The amphiphilic layer is typically a planar lipid bilayer or a supported bilayer.
- The amphiphilic layer is typically a lipid bilayer. Lipid bilayers are models of cell membranes and serve as excellent platforms for a range of experimental studies. For example, lipid bilayers can be used for in vitro investigation of membrane proteins by single-channel recording. Alternatively, lipid bilayers can be used as biosensors to detect the presence of a range of substances. The lipid bilayer may be any lipid bilayer. Suitable lipid bilayers include, but are not limited to, a planar lipid bilayer, a supported bilayer or a liposome. The lipid bilayer is preferably a planar lipid bilayer. Suitable lipid bilayers are disclosed in International Application No. PCT/GB08/000563 (published as WO 2008/102121), International Application No. PCT/GB08/004127 (published as WO 2009/077734) and International Application No. PCT/GB2006/001057 (published as WO 2006/100484).
- Methods for forming lipid bilayers are known in the art. Suitable methods are disclosed in the Examples. Lipid bilayers are commonly formed by the method of Montal and Mueller (Proc. Natl. Acad. Sci. USA., 1972; 69: 3561-3566), in which a lipid monolayer is carried on aqueous solution/air interface past either side of an aperture which is perpendicular to that interface.
- The method of Montal & Mueller is popular because it is a cost-effective and relatively straightforward method of forming good quality lipid bilayers that are suitable for protein pore insertion. Other common methods of bilayer formation include tip-dipping, painting bilayers and patch-clamping of liposome bilayers.
- In a preferred embodiment, the lipid bilayer is formed as described in International Application No. PCT/GB08/004127 (published as WO 2009/077734).
- In another preferred embodiment, the membrane is a solid state layer. A solid-state layer is not of biological origin. In other words, a solid state layer is not derived from or isolated from a biological environment such as an organism or cell, or a synthetically manufactured version of a biologically available structure. Solid state layers can be formed from both organic and inorganic materials including, but not limited to, microelectronic materials, insulating materials such as Si3N4, Al2O3, and SiO, organic and inorganic polymers such as polyamide, plastics such as Teflon® or elastomers such as two-component addition-cure silicone rubber, and glasses. The solid state layer may be formed from monatomic layers, such as graphene, or layers that are only a few atoms thick. Suitable graphene layers are disclosed in International Application No. PCT/US2008/010637 (published as WO 2009/035647).
- The method is typically carried out using (i) an artificial amphiphilic layer comprising a pore, (ii) an isolated, naturally-occurring lipid bilayer comprising a pore, or (iii) a cell having a pore inserted therein. The method is typically carried out using an artificial amphiphilic layer, such as an artificial lipid bilayer. The layer may comprise other transmembrane and/or intramembrane proteins as well as other molecules in addition to the pore. Suitable apparatus and conditions are discussed below. The method of the invention is typically carried out in vitro. The complementary polynucleotide is preferably coupled to the membrane. This may be done using any known method. The complementary polynucleotide is preferably coupled to the membrane comprising the transmembrane pore. The method may comprise coupling the complementary polynucleotide to the membrane comprising the transmembrane pore. The polynucleotide is preferably coupled to the membrane using one or more anchors. The polynucleotide may be coupled to the membrane using any known method.
- Each anchor comprises a group which couples (or binds) to the polynucleotide and a group which couples (or binds) to the membrane. Each anchor may covalently couple (or bind) to the polynucleotide and/or the membrane. If a Y adaptor and/or a hairpin loop adaptors are used, the polynucleotide is preferably coupled to the membrane using the adaptor(s).
- The polynucleotide may be coupled to the membrane using any number of anchors, such as 2, 3, 4 or more anchors. For instance, a polynucleotide may be coupled to the membrane using two anchors each of which separately couples (or binds) to both the polynucleotide and membrane.
- The one or more anchors may comprise the one or more helicases and/or the one or more molecular brakes discussed below.
- If the membrane is an amphiphilic layer, such as a lipid bilayer (as discussed in detail above), the complementary polynucleotide is preferably coupled to the membrane via a polypeptide present in the membrane or a hydrophobic anchor present in the membrane. The hydrophobic anchor is preferably a lipid, fatty acid, sterol, carbon nanotube or amino acid.
- The complementary polynucleotide may be coupled directly to the membrane. It may be coupled to the membrane using any of the ways disclosed in International Application Number No. PCT/GB2012/051191 (published as WO 2012/164270). The complementary polynucleotide is preferably coupled to the membrane via a linker. Preferred linkers include, but are not limited to, polymers, such as polynucleotides, polyethylene glycols (PEGs) and polypeptides. If a complementary polynucleotide is coupled directly to the membrane, then some data will be lost as the characterising run cannot continue to the end of the complementary polynucleotide due to the distance between the membrane and the pore and/or polynucleotide binding protein. If a linker is used, then the complementary polynucleotide can be processed to completion. If a linker is used, the linker may be attached to the complementary polynucleotide at any position. The linker is typically attached to the complementary polynucleotide at the tail polymer.
- The coupling may be stable or transient. For certain applications, the transient nature of the coupling is preferred. If a stable coupling molecule were attached directly to either the 5′ or 3′ end of a complementary polynucleotide, then some data will be lost as the characterising run cannot continue to the end of the complementary polynucleotide due to the distance between the membrane and the pore and/or polynucleotide binding protein. If the coupling is transient, then when the coupled end randomly becomes free of the membrane, then the complementary polynucleotide can be processed to completion. Chemical groups that form stable or transient links with the membrane are discussed in more detail below. The complementary polynucleotide may be transiently coupled to an amphiphilic layer, such as a lipid bilayer using cholesterol or a fatty acyl chain. Any fatty acyl chain having a length of from 6 to 30 carbon atoms, such as hexadecanoic acid, may be used.
- In preferred embodiments, the complementary polynucleotide is coupled to an amphiphilic layer. Coupling of polynucleotides to synthetic lipid bilayers has been carried out previously with various different tethering strategies. These are summarised in Table 1 below.
-
TABLE 1 Attach- Type of ment cou- group pling Reference Thiol Stable Yoshina-Ishii, C. and S. G. Boxer (2003). “Arrays of mobile tethered vesicles on supported lipid bilayers.” J Am Chem Soc 125(13): 3696-7. Biotin Stable Nikolov, V., R. Lipowsky, et al. (2007). “Behavior of giant vesicles with anchored DNA molecules.” Biophys J 92(12): 4356-68 Choles- Tran- Pfeiffer, I. and F. Hook (2004). “Bivalent cholesterol- terol sient based coupling of oligonucletides to lipid membrane assemblies.” J Am Chem Soc 126(33): 10224-5 Lipid Stable van Lengerich, B., R. J. Rawle, et al. “Covalent attachment of lipid vesicles to a fluid-supported bilayer allows observation of DNA-mediated vesicle interactions.” Langmuir 26(11): 8666-72 - Complementary polynucleotides may be functionalized using a modified phosphoramidite in the synthesis reaction, which is easily compatible for the addition of reactive groups, such as thiol, cholesterol, lipid and biotin groups. These different attachment chemistries give a suite of attachment options for complementary polynucleotides. Each different modification group tethers the complementary polynucleotide in a slightly different way and coupling is not always permanent so giving different dwell times for the complementary polynucleotide to the membrane. The advantages of transient coupling are discussed above.
- Coupling of complementary polynucleotides can also be achieved by a number of other means provided that a reactive group can be added to the complementary polynucleotide. The addition of reactive groups to either end of DNA has been reported previously. A thiol group can be added to the 5′ of ssDNA using polynucleotide kinase and ATPγS (Grant, G. P. and P. Z. Qin (2007). “A facile method for attaching nitroxide spin labels at the 5′ terminus of nucleic acids.” Nucleic Acids Res 35(10): e77). A more diverse selection of chemical groups, such as biotin, thiols and fluorophores, can be added using terminal transferase to incorporate modified oligonucleotides to the 3′ of ssDNA (Kumar, A., P. Tchen, et al. (1988). “Nonradioactive labeling of synthetic oligonucleotide probes with terminal deoxynucleotidyl transferase.” Anal Biochem 169(2): 376-82).
- Alternatively, the reactive group could be considered to be a short region in the polynucleotide complementary to one already coupled to the membrane, so that attachment can be achieved via hybridisation. The region could be part of the complementary polynucleotide or ligated to it. Ligation of short pieces of ssDNA have been reported using T4 RNA ligase I (Troutt, A. B., M. G. McHeyzer-Williams, et al. (1992). “Ligation-anchored PCR: a simple amplification technique with single-sided specificity.” Proc Natl Acad Sci USA 89(20): 9823-5). The coupling chemistry can be incorporated during the formation of the complementary polynucleotide from the target RNA. For instance, the complementary polynucleotide can be synthesized using a primer with a reactive group attached to it.
- Most preferably, the complementary polynucleotide is coupled to the membrane using a cholesterol-tagged polynucleotide which hybridises to the complementary polynucleotide or primer attached thereto.
- The transmembrane pore is preferably a transmembrane protein pore. A transmembrane protein pore is a polypeptide or a collection of polypeptides that permits hydrated ions, such as analyte, to flow from one side of a membrane to the other side of the membrane. In the present invention, the transmembrane protein pore is capable of forming a pore that permits hydrated ions driven by an applied potential to flow from one side of the membrane to the other. The transmembrane protein pore preferably permits analyte such as nucleotides to flow from one side of the membrane, such as a lipid bilayer, to the other. The transmembrane protein pore allows a polynucleotide or nucleic acid, such as DNA or RNA, to be moved through the pore.
- The transmembrane protein pore may be a monomer or an oligomer. The pore is preferably made up of several repeating subunits, such as 6, 7, 8 or 9 subunits. The pore is preferably a hexameric, heptameric, octameric or nonameric pore.
- The transmembrane protein pore typically comprises a barrel or channel through which the ions may flow. The subunits of the pore typically surround a central axis and contribute strands to a transmembrane β barrel or channel or a transmembrane α-helix bundle or channel.
- The barrel or channel of the transmembrane protein pore typically comprises amino acids that facilitate interaction with analyte, such as nucleotides, polynucleotides or nucleic acids. These amino acids are preferably located near a constriction of the barrel or channel. The transmembrane protein pore typically comprises one or more positively charged amino acids, such as arginine, lysine or histidine, or aromatic amino acids, such as tyrosine or tryptophan. These amino acids typically facilitate the interaction between the pore and nucleotides, polynucleotides or nucleic acids.
- Transmembrane protein pores for use in accordance with the invention can be derived from β-barrel pores or α-helix bundle pores. β-barrel pores comprise a barrel or channel that is formed from β-strands. Suitable β-barrel pores include, but are not limited to, β-toxins, such as α-hemolysin, anthrax toxin and leukocidins, and outer membrane proteins/porins of bacteria, such as Mycobacterium smegmatis porin (Msp), for example MspA, MspB, MspC or MspD, outer membrane porin F (OmpF), outer membrane porin G (OmpG), outer membrane phospholipase A and Neisseria autotransporter lipoprotein (NalP). α-helix bundle pores comprise a barrel or channel that is formed from α-helices. Suitable α-helix bundle pores include, but are not limited to, inner membrane proteins and a outer membrane proteins, such as WZA and ClyA toxin. The transmembrane pore may be derived from Msp or from α-hemolysin (α-HL).
- The transmembrane protein pore is preferably derived from Msp, preferably from MspA. Such a pore will be oligomeric and typically comprises 7, 8, 9 or 10 monomers derived from Msp. The pore may be a homo-oligomeric pore derived from Msp comprising identical monomers. Alternatively, the pore may be a hetero-oligomeric pore derived from Msp comprising at least one monomer that differs from the others. Preferably the pore is derived from MspA or a homolog or paralog thereof.
- A monomer derived from Msp typically comprises the sequence shown in SEQ ID NO: 2 or a variant thereof. SEQ ID NO: 2 is the MS-(B1)8 mutant of the MspA monomer. It includes the following mutations: D90N, D91N, D93N, D118R, D134R and E139K. A variant of SEQ ID NO: 2 is a polypeptide that has an amino acid sequence which varies from that of SEQ ID NO: 2 and which retains its ability to form a pore. The ability of a variant to form a pore can be assayed using any method known in the art. For instance, the variant may be inserted into an amphiphilic layer along with other appropriate subunits and its ability to oligomerise to form a pore may be determined. Methods are known in the art for inserting subunits into membranes, such as amphiphilic layers. For example, subunits may be suspended in a purified form in a solution containing a lipid bilayer such that it diffuses to the lipid bilayer and is inserted by binding to the lipid bilayer and assembling into a functional state. Alternatively, subunits may be directly inserted into the membrane using the “pick and place” method described in M. A. Holden, H. Bayley. J. Am. Chem. Soc. 2005, 127, 6502-6503 and International Application No. PCT/GB2006/001057 (published as WO 2006/100484).
- Over the entire length of the amino acid sequence of SEQ ID NO: 2, a variant will preferably be at least 50% homologous to that sequence based on amino acid identity. More preferably, the variant may be at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90% and more preferably at least 95%, 97% or 99% homologous based on amino acid identity to the amino acid sequence of SEQ ID NO: 2 over the entire sequence. There may be at least 80%, for example at least 85%, 90% or 95%, amino acid identity over a stretch of 100 or more, for example 125, 150, 175 or 200 or more, contiguous amino acids (“hard homology”).
- Standard methods in the art may be used to determine homology. For example the UWGCG Package provides the BESTFIT program which can be used to calculate homology, for example used on its default settings (Devereux et al (1984)
Nucleic Acids Research 12, p 387-395). The PILEUP and BLAST algorithms can be used to calculate homology or line up sequences (such as identifying equivalent residues or corresponding sequences (typically on their default settings)), for example as described in Altschul S. F. (1993) J Mol Evol 36:290-300; Altschul, S. F et al (1990) J Mol Biol 215:403-10. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). - SEQ ID NO: 2 is the MS-(B1)8 mutant of the MspA monomer. The variant may comprise any of the mutations in the MspB, C or D monomers compared with MspA. The mature forms of MspB, C and D are shown in SEQ ID NOs: 5 to 7. In particular, the variant may comprise the following substitution present in MspB: A138P. The variant may comprise one or more of the following substitutions present in MspC: A96G, N102E and A138P. The variant may comprise one or more of the following mutations present in MspD: Deletion of G1, L2V, E5Q, L8V, D13G, W21A, D22E, K47T, I49H, I68V, D91G, A96Q, N102D, S103T, V104I, S136K and G141A. The variant may comprise combinations of one or more of the mutations and substitutions from Msp B, C and D. The variant preferably comprises the mutation L88N. A variant of SEQ ID NO: 2 has the mutation L88N in addition to all the mutations of MS-(B1)8 and is called MS-(B2)8. The pore used in the invention is preferably MS-(B2)8. The further preferred variant comprises the mutations G75S/G77S/L88N/Q126R. The variant of SEQ ID NO: 2 has the mutations G75S/G77S/L88N/Q126R in addition to all the mutations of MS-(B1)8 and is called MS-(B2C)8. The pore used in the invention is preferably MS-(B2)8 or MS-(B2C)8.
- Amino acid substitutions may be made to the amino acid sequence of SEQ ID NO: 2 in addition to those discussed above, for example up to 1, 2, 3, 4, 5, 10, 20 or 30 substitutions. Conservative substitutions replace amino acids with other amino acids of similar chemical structure, similar chemical properties or similar side-chain volume. The amino acids introduced may have similar polarity, hydrophilicity, hydrophobicity, basicity, acidity, neutrality or charge to the amino acids they replace. Alternatively, the conservative substitution may introduce another amino acid that is aromatic or aliphatic in the place of a pre-existing aromatic or aliphatic amino acid. Conservative amino acid changes are well-known in the art and may be selected in accordance with the properties of the 20 main amino acids as defined in Table 2 below. Where amino acids have similar polarity, this can also be determined by reference to the hydropathy scale for amino acid side chains in Table 3.
-
TABLE 2 Chemical properties of amino acids Ala aliphatic, hydrophobic, neutral Met hydrophobic, neutral Cys polar, hydrophobic, neutral Asn polar, hydrophilic, neutral Asp polar, hydrophilic, charged (−) Pro hydrophobic, neutral Glu polar, hydrophilic, charged (−) Gln polar, hydrophilic, neutral Phe aromatic, hydrophobic, neutral Arg polar, hydrophilic, charged (+) Gly aliphatic, neutral Ser polar, hydrophilic, neutral His aromatic, polar, hydrophilic, charged (+) Thr polar, hydrophilic, neutral Ile aliphatic, hydrophobic, neutral Val aliphatic, hydrophobic, neutral Lys polar, hydrophilic, charged(+) Trp aromatic, hydrophobic, neutral Leu aliphatic, hydrophobic, neutral Tyr aromatic, polar, hydrophobic -
TABLE 3 Hydropathy scale Side Chain Hydropathy Ile 4.5 Val 4.2 Leu 3.8 Phe 2.8 Cys 2.5 Met 1.9 Ala 1.8 Gly −0.4 Thr −0.7 Ser −0.8 Trp −0.9 Tyr −1.3 Pro −1.6 His −3.2 Glu −3.5 Gln −3.5 Asp −3.5 Asn −3.5 Lys −3.9 Arg −4.5 - One or more amino acid residues of the amino acid sequence of SEQ ID NO: 2 may additionally be deleted from the polypeptides described above. Up to 1, 2, 3, 4, 5, 10, 20 or 30 residues may be deleted, or more.
- Variants may include fragments of SEQ ID NO: 2. Such fragments retain pore forming activity. Fragments may be at least 50, 100, 150 or 200 amino acids in length. Such fragments may be used to produce the pores. A fragment preferably comprises the pore forming domain of SEQ ID NO: 2. Fragments must include one of residues 88, 90, 91, 105, 118 and 134 of SEQ ID NO: 2. Typically, fragments include all of residues 88, 90, 91, 105, 118 and 134 of SEQ ID NO: 2.
- One or more amino acids may be alternatively or additionally added to the polypeptides described above. An extension may be provided at the amino terminal or carboxy terminal of the amino acid sequence of SEQ ID NO: 2 or polypeptide variant or fragment thereof. The extension may be quite short, for example from 1 to 10 amino acids in length. Alternatively, the extension may be longer, for example up to 50 or 100 amino acids. A carrier protein may be fused to an amino acid sequence according to the invention. Other fusion proteins are discussed in more detail below.
- As discussed above, a variant is a polypeptide that has an amino acid sequence which varies from that of SEQ ID NO: 2 and which retains its ability to form a pore. A variant typically contains the regions of SEQ ID NO: 2 that are responsible for pore formation. The pore forming ability of Msp, which contains a β-barrel, is provided by β-sheets in each subunit. A variant of SEQ ID NO: 2 typically comprises the regions in SEQ ID NO: 2 that form β-sheets. One or more modifications can be made to the regions of SEQ ID NO: 2 that form β-sheets as long as the resulting variant retains its ability to form a pore. A variant of SEQ ID NO: 2 preferably includes one or more modifications, such as substitutions, additions or deletions, within its α-helices and/or loop regions.
- The monomers derived from Msp may be modified to assist their identification or purification, for example by the addition of histidine residues (a hist tag), aspartic acid residues (an asp tag), a streptavidin tag or a flag tag, or by the addition of a signal sequence to promote their secretion from a cell where the polypeptide does not naturally contain such a sequence. An alternative to introducing a genetic tag is to chemically react a tag onto a native or engineered position on the pore. An example of this would be to react a gel-shift reagent to a cysteine engineered on the outside of the pore. This has been demonstrated as a method for separating hemolysin hetero-oligomers (Chem Biol. 1997 Jul.; 4(7):497-505).
- The monomer derived from Msp may be labelled with a revealing label. The revealing label may be any suitable label which allows the pore to be detected. Suitable labels are described below.
- The monomer derived from Msp may also be produced using D-amino acids. For instance, the monomer derived from Msp may comprise a mixture of L-amino acids and D-amino acids. This is conventional in the art for producing such proteins or peptides.
- The monomer derived from Msp contains one or more specific modifications to facilitate nucleotide discrimination. The monomer derived from Msp may also contain other non-specific modifications as long as they do not interfere with pore formation. A number of non-specific side chain modifications are known in the art and may be made to the side chains of the monomer derived from Msp. Such modifications include, for example, reductive alkylation of amino acids by reaction with an aldehyde followed by reduction with NaBIE, amidination with methylacetimidate or acylation with acetic anhydride.
- The monomer derived from Msp can be produced using standard methods known in the art. The monomer derived from Msp may be made synthetically or by recombinant means. For example, the pore may be synthesized by in vitro translation and transcription (IVTT). Suitable methods for producing pores are discussed in International Application Nos. PCT/GB09/001690 (published as WO 2010/004273), PCT/GB09/001679 (published as WO 2010/004265) or PCT/GB10/000133 (published as WO 2010/086603). Methods for inserting pores into membranes are discussed.
- The transmembrane protein pore is also preferably derived from α-hemolysin (α-HL). The wild type α-HL pore is formed of seven identical monomers or subunits (i.e. it is heptameric). The sequence of one monomer or subunit of α-hemolysin-NN is shown in SEQ ID NO: 4. The transmembrane protein pore preferably comprises seven monomers each comprising the sequence shown in SEQ ID NO: 4 or a variant thereof.
Amino acids 1, 7 to 21, 31 to 34, 45 to 51, 63 to 66, 72, 92 to 97, 104 to 111, 124 to 136, 149 to 153, 160 to 164, 173 to 206, 210 to 213, 217, 218, 223 to 228, 236 to 242, 262 to 265, 272 to 274, 287 to 290 and 294 of SEQ ID NO: 4 form loop regions. Residues 113 and 147 of SEQ ID NO: 4 form part of a constriction of the barrel or channel of α-HL. - In such embodiments, a pore comprising seven proteins or monomers each comprising the sequence shown in SEQ ID NO: 4 or a variant thereof are preferably used in the method of the invention. The seven proteins may be the same (homo-heptamer) or different (hetero-heptamer).
- A variant of SEQ ID NO: 4 is a protein that has an amino acid sequence which varies from that of SEQ ID NO: 4 and which retains its pore forming ability. The ability of a variant to form a pore can be assayed using any method known in the art. For instance, the variant may be inserted into an amphiphilic layer, such as a lipid bilayer, along with other appropriate subunits and its ability to oligomerise to form a pore may be determined. Methods are known in the art for inserting subunits into amphiphilic layers, such as lipid bilayers. Suitable methods are discussed above.
- The variant may include modifications that facilitate covalent attachment to or interaction with the helicase or construct. The variant preferably comprises one or more reactive cysteine residues that facilitate attachment to the helicase or construct. For instance, the variant may include a cysteine at one or more of
positions 8, 9, 17, 18, 19, 44, 45, 50, 51, 237, 239 and 287 and/or on the amino or carboxy terminus of SEQ ID NO: 4. Preferred variants comprise a substitution of the residue at position 8, 9, 17, 237, 239 and 287 of SEQ ID NO: 4 with cysteine (ABC, T9C, N17C, K237C, S239C or E287C). The variant is preferably any one of the variants described in International Application No. PCT/GB09/001690 (published as WO 2010/004273), PCT/GB09/001679 (published as WO 2010/004265) or PCT/GB10/000133 (published as WO 2010/086603). - The variant may also include modifications that facilitate any interaction with nucleotides.
- The variant may be a naturally occurring variant which is expressed naturally by an organism, for instance by a Staphylococcus bacterium. Alternatively, the variant may be expressed in vitro or recombinantly by a bacterium such as Escherichia coli. Variants also include non-naturally occurring variants produced by recombinant technology. Over the entire length of the amino acid sequence of SEQ ID NO: 4, a variant will preferably be at least 50% homologous to that sequence based on amino acid identity. More preferably, the variant polypeptide may be at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90% and more preferably at least 95%, 97% or 99% homologous based on amino acid identity to the amino acid sequence of SEQ ID NO: 4 over the entire sequence. There may be at least 80%, for example at least 85%, 90% or 95%, amino acid identity over a stretch of 200 or more, for example 230, 250, 270 or 280 or more, contiguous amino acids (“hard homology”). Homology can be determined as discussed above.
- Amino acid substitutions may be made to the amino acid sequence of SEQ ID NO: 4 in addition to those discussed above, for example up to 1, 2, 3, 4, 5, 10, 20 or 30 substitutions. Conservative substitutions may be made as discussed above.
- One or more amino acid residues of the amino acid sequence of SEQ ID NO: 4 may additionally be deleted from the polypeptides described above. Up to 1, 2, 3, 4, 5, 10, 20 or 30 residues may be deleted, or more.
- Variants may be fragments of SEQ ID NO: 4. Such fragments retain pore-forming activity. Fragments may be at least 50, 100, 200 or 250 amino acids in length. A fragment preferably comprises the pore-forming domain of SEQ ID NO: 4. Fragments typically include residues 119, 121, 135. 113 and 139 of SEQ ID NO: 4.
- One or more amino acids may be alternatively or additionally added to the polypeptides described above. An extension may be provided at the amino terminus or carboxy terminus of the amino acid sequence of SEQ ID NO: 4 or a variant or fragment thereof. The extension may be quite short, for example from 1 to 10 amino acids in length. Alternatively, the extension may be longer, for example up to 50 or 100 amino acids. A carrier protein may be fused to a pore or variant.
- As discussed above, a variant of SEQ ID NO: 4 is a subunit that has an amino acid sequence which varies from that of SEQ ID NO: 4 and which retains its ability to form a pore. A variant typically contains the regions of SEQ ID NO: 4 that are responsible for pore formation. The pore forming ability of α-HL, which contains a β-barrel, is provided by β-strands in each subunit. A variant of SEQ ID NO: 4 typically comprises the regions in SEQ ID NO: 4 that form β-strands. The amino acids of SEQ ID NO: 4 that form β-strands are discussed above. One or more modifications can be made to the regions of SEQ ID NO: 4 that form β-strands as long as the resulting variant retains its ability to form a pore. Specific modifications that can be made to the β-strand regions of SEQ ID NO: 4 are discussed above.
- A variant of SEQ ID NO: 4 preferably includes one or more modifications, such as substitutions, additions or deletions, within its α-helices and/or loop regions. Amino acids that form α-helices and loops are discussed above.
- The variant may be modified to assist its identification or purification as discussed above.
- Pores derived from α-HL can be made as discussed above with reference to pores derived from Msp.
- In some embodiments, the transmembrane protein pore is chemically modified. The pore can be chemically modified in any way and at any site. The transmembrane protein pore is preferably chemically modified by attachment of a molecule to one or more cysteines (cysteine linkage), attachment of a molecule to one or more lysines, attachment of a molecule to one or more non-natural amino acids, enzyme modification of an epitope or modification of a terminus. Suitable methods for carrying out such modifications are well-known in the art. The transmembrane protein pore may be chemically modified by the attachment of any molecule. For instance, the pore may be chemically modified by attachment of a dye or a fluorophore.
- Any number of the monomers in the pore may be chemically modified. One or more, such as 2, 3, 4, 5, 6, 7, 8, 9 or 10, of the monomers is preferably chemically modified as discussed above.
- The reactivity of cysteine residues may be enhanced by modification of the adjacent residues. For instance, the basic groups of flanking arginine, histidine or lysine residues will change the pKa of the cysteines thiol group to that of the more reactive S− group. The reactivity of cysteine residues may be protected by thiol protective groups such as dTNB. These may be reacted with one or more cysteine residues of the pore before a linker is attached.
- The molecule (with which the pore is chemically modified) may be attached directly to the pore or attached via a linker as disclosed in International Application Nos. PCT/GB09/001690 (published as WO 2010/004273), PCT/GB09/001679 (published as WO 2010/004265) or PCT/GB10/000133 (published as WO 2010/086603).
- Step (b) preferably comprises contacting the complementary polynucleotide with a polynucleotide binding protein such that the protein controls the movement of the complementary polynucleotide through the pore. Any polynucleotide binding protein may be used. The polynucleotide binding protein is preferably a polynucleotide handling enzyme. A polynucleotide handling enzyme is a polypeptide that is capable of interacting with and modifying at least one property of a polynucleotide. The enzyme may modify the polynucleotide by cleaving it to form individual nucleotides or shorter chains of nucleotides, such as di- or trinucleotides. The enzyme may modify the polynucleotide by orienting it or moving it to a specific position. The polynucleotide handling enzyme does not need to display enzymatic activity as long as it is capable of binding the target sequence and controlling its movement through the pore. For instance, the enzyme may be modified to remove its enzymatic activity or may be used under conditions which prevent it from acting as an enzyme. Such conditions are discussed in more detail below.
- The polynucleotide handling enzyme is preferably derived from a nucleolytic enzyme. The polynucleotide handling enzyme used in the construct of the enzyme is more preferably derived from a member of any of the Enzyme Classification (EC) groups 3.1.11, 3.1.13, 3.1.14, 3.1.15, 3.1.16, 3.1.21, 3.1.22, 3.1.25, 3.1.26, 3.1.27, 3.1.30 and 3.1.31.
- Preferred enzymes are polymerases, exonucleases, helicases and topoisomerases, such as gyrases. The enzyme may be any of those disclosed in International Application No. PCT/GB10/000133 (published as WO 2010/086603). The helicase may a Hel308 helicase, a RecD helicase, such as TraI helicase or a TrwC helicase, a XPD helicase or a Dda helicase. The helicase may be any of the helicases, modified helicases or helicase constructs disclosed in International Application Nos. PCT/GB2012/052579 (published as WO 2013/057495); PCT/GB2012/053274 (published as WO 2013/098562); PCT/GB2012/053273 (published as WO2013098561); PCT/GB2013/051925; PCT/GB2013/051924; PCT/GB2013/051928; and the UK Application being filed concurrently with this application (ONT IP 049).
- In one embodiment, the method involves contacting the complementary polynucleotide with a helicase such that the helicase controls the movement of the complementary polynucleotide through the pore. Any helicase may be used in the method. Helicases may work in two modes with respect to the pore. First, the method is preferably carried out using a helicase such that it controls movement of the polynucleotide through the pore with the field resulting from the applied voltage. In this mode the 5′ end of the polynucleotide is first captured in the pore, and the enzyme controls movement of the polynucleotide into the pore such that the polynucleotide is passed through the pore with the field until it finally translocates through to the trans side of the bilayer. Alternatively, the method is preferably carried out such that a helicase enzyme controls movement of the polynucleotide through the pore against the field resulting from the applied voltage. In this mode the 3′ end of the polynucleotide is first captured in the pore, and the enzyme controls movement of the polynucleotide through the pore such that the polynucleotide is pulled out of the pore against the applied field until finally ejected back to the cis side of the bilayer.
- The polynucleotide binding protein may be covalently attached to the pore. The polynucleotide binding protein is preferably not covalently attached to the pore. The application of a voltage to the pore and helicase or construct may result in the formation of a sensor that is capable of characterising the complementary polynucleotide. This is discussed in more detail below.
- Any of the proteins described herein may be modified to assist their identification or purification, for example by the addition of histidine residues (a his tag), aspartic acid residues (an asp tag), a streptavidin tag, a flag tag, a SUMO tag, a GST tag or a MBP tag, or by the addition of a signal sequence to promote their secretion from a cell where the polypeptide does not naturally contain such a sequence. An alternative to introducing a genetic tag is to chemically react a tag onto a native or engineered position on the helicase, pore or construct. An example of this would be to react a gel-shift reagent to a cysteine engineered on the outside of the pore. This has been demonstrated as a method for separating hemolysin hetero-oligomers (Chem Biol. 1997 Jul.; 4 (7):497-505).
- The target RNA, complementary polynucleotide, polynucleotide binding protein or pore may be labelled with a revealing label. The revealing label may be any suitable label which can be detected. Suitable labels include, but are not limited to, fluorescent molecules, radioisotopes, e.g. 125I, 35S, enzymes, antibodies, antigens, polynucleotides and ligands such as biotin.
- Proteins may be made synthetically or by recombinant means. For example, proteins may be synthesized by in vitro translation and transcription (IVTT). The amino acid sequence of the protein may be modified to include non-naturally occurring amino acids or to increase the stability of the protein. When a protein is produced by synthetic means, such amino acids may be introduced during production. Proteins may also be altered following either synthetic or recombinant production.
- Proteins may also be produced using D-amino acids. For instance, the pore or polynucleotide binding protein may comprise a mixture of L-amino acids and D-amino acids. This is conventional in the art for producing such proteins or peptides.
- The proteins used in the invention may also contain other non-specific modifications as long as they do not interfere with the proteins' function. A number of non-specific side chain modifications are known in the art and may be made to the side chains of the protein(s). Such modifications include, for example, reductive alkylation of amino acids by reaction with an aldehyde followed by reduction with NaBH4, amidination with methylacetimidate or acylation with acetic anhydride.
- Polynucleotide sequences encoding a protein may be derived and replicated using standard methods in the art. Polynucleotide sequences encoding a protein may be expressed in a bacterial host cell using standard techniques in the art. The protein may be produced in a cell by in situ expression of the polypeptide from a recombinant expression vector. The expression vector optionally carries an inducible promoter to control the expression of the polypeptide. These methods are described in Sambrook, J. and Russell, D. (2001). Molecular Cloning: A Laboratory Manual, 3rd Edition. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
- The gene encoding the sequence of interest may be amplified using PCR involving specific primers. The amplified sequences may then be incorporated into a recombinant replicable vector such as a cloning vector. The vector may be used to replicate the polynucleotide in a compatible host cell. Thus polynucleotide sequences may be made by introducing a polynucleotide encoding the sequence of interest into a replicable vector, introducing the vector into a compatible host cell, and growing the host cell under conditions which bring about replication of the vector. The vector may be recovered from the host cell. Suitable host cells for cloning of polynucleotides are known in the art and described in more detail below.
- The polynucleotide sequence may be cloned into a suitable expression vector. In an expression vector, the polynucleotide sequence is typically operably linked to a control sequence which is capable of providing for the expression of the coding sequence by the host cell. Such expression vectors can be used to express a construct.
- The term “operably linked” refers to a juxtaposition wherein the components described are in a relationship permitting them to function in their intended manner. A control sequence “operably linked” to a coding sequence is ligated in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences. Multiple copies of the same or different polynucleotide may be introduced into the vector.
- The expression vector may then be introduced into a suitable host cell. Thus, a construct can be produced by inserting a polynucleotide sequence encoding a construct into an expression vector, introducing the vector into a compatible bacterial host cell, and growing the host cell under conditions which bring about expression of the polynucleotide sequence.
- The vectors may be for example, plasmid, virus or phage vectors provided with an origin of replication, optionally a promoter for the expression of the said polynucleotide sequence and optionally a regulator of the promoter. The vectors may contain one or more selectable marker genes, for example an ampicillin resistance gene. Promoters and other expression regulation signals may be selected to be compatible with the host cell for which the expression vector is designed. A T7, trc, lac, ara or λL promoter is typically used.
- The host cell typically expresses the construct at a high level. Host cells transformed with a polynucleotide sequence will be chosen to be compatible with the expression vector used to transform the cell. The host cell is typically bacterial and preferably E. coli. Any cell with a λ DE3 lysogen, for example Rosetta2(DE3)pLys, C41 (DE3), BL21 (DE3), JM109 (DE3), B834 (DE3), TUNER, Origami and Origami B, can express a vector comprising the T7 promoter.
- Proteins may be produced in large scale following purification by any protein liquid chromatography system from protein producing organisms or after recombinant expression. Typical protein liquid chromatography systems include FPLC, AKTA systems, the Bio-Cad system, the Bio-Rad BioLogic system and the Gilson HPLC system.
- The method of the invention involves measuring one or more characteristics of the target RNA. The method may involve measuring two, three, four or five or more characteristics of the target RNA. The one or more characteristics are preferably selected from (i) the length of the target RNA, (ii) the identity of the target RNA, (iii) the sequence of the target RNA, and (iv) the amount of the target RNA. Any combination of (i) to (iv) may be measured in accordance with the invention.
- For (i), the length of the RNA may be measured for example by forming a complementary polynucleotide of the same length and determining the number of interactions between the complementary polynucleotide and the pore or the duration of interaction between the complementary polynucleotide and the pore.
- For (ii), the identity of the RNA may be measured in a number of ways. The identity of the RNA may be measured in conjunction with measurement of the sequence of the complementary polynucleotide or without measurement of the sequence of the complementary polynucleotide. The former is straightforward; the complementary polynucleotide is sequenced and the sequence of the target RNA is thereby identified (since it is complementary). The latter may be done in several ways. For instance, the presence of a particular motif in the complementary polynucleotide may be measured (without measuring the remaining sequence of the polynucleotide). Alternatively, the measurement of a particular electrical and/or optical signal in the method may identify the complementary polynucleotide and thereby identify the target RNA.
- For (iii), the sequence of the complementary polynucleotide and hence the sequence of the target RNA can be determined as described previously. Suitable sequencing methods, particularly those using electrical measurements, are described in Stoddart D et al., Proc Natl Acad Sci, 12; 106(19):7702-7, Lieberman K R et al, J Am Chem Soc. 2010; 132(50):17961-72, and International Application WO 2000/28312.
- For (iv), the amount of the target RNA may be measured in a variety of ways. For instance, since the target RNA is typically not amplified in the method of the invention, the amount of the target RNA may be measured by counting the number of complementary polynucleotides which interact with the transmembrane pore. The number of complementary polynucleotides (i.e. the number of instances of the complementary polynucleotide) typically corresponds to the number of the target RNA molecules (i.e. the number of instances of the target RNA).
- A variety of different types of measurements may be made. This includes without limitation: electrical measurements and optical measurements. Possible electrical measurements include: current measurements, impedance measurements, tunnelling measurements (Ivanov A P et al., Nano Lett. 2011 Jan. 12; 11(1):279-85), and FET measurements (International Application WO 2005/124888). Optical measurements may be combined with electrical measurements (Soni G V et al., Rev Sci Instrum. 2010 January; 81(1):014301). The measurement may be a transmembrane current measurement such as measurement of ionic current flowing through the pore.
- Electrical measurements may be made using standard single channel recording equipment as describe in Stoddart D et al., Proc Natl Acad Sci, 12; 106(19):7702-7, Lieberman K R et al, J Am Chem Soc. 2010; 132(50):17961-72, and International Application WO-2000/28312. Alternatively, electrical measurements may be made using a multi-channel system, for example as described in International Application WO-2009/077734 and International Application WO-2011/067559.
- In a preferred embodiment, the method comprises:
-
- (a) contacting the complementary polynucleotide with a transmembrane pore and a polynucleotide binding protein such that the protein controls the movement of the polynucleotide through the pore; and
- (b) measuring the current passing through the pore as the polynucleotide moves with respect to the pore wherein the current is indicative of one or more characteristics of the complementary polynucleotide and thereby characterising the target RNA.
- The methods may be carried out using any apparatus that is suitable for investigating a membrane/pore system in which a pore is present in a membrane. The method may be carried out using any apparatus that is suitable for transmembrane pore sensing. For example, the apparatus comprises a chamber comprising an aqueous solution and a barrier that separates the chamber into two sections. The barrier typically has an aperture in which the membrane containing the pore is formed. Alternatively the barrier forms the membrane in which the pore is present.
- The methods may be carried out using the apparatus described in International Application No. PCT/GB08/000562 (WO 2008/102120).
- The methods may involve measuring the current passing through the pore as the polynucleotide moves with respect to the pore. Therefore the apparatus may also comprise an electrical circuit capable of applying a potential and measuring an electrical signal across the membrane and pore. The methods may be carried out using a patch clamp or a voltage clamp. The methods preferably involve the use of a voltage clamp.
- The methods of the invention may involve the measuring of a current passing through the pore as the polynucleotide moves with respect to the pore. Suitable conditions for measuring ionic currents through transmembrane protein pores are known in the art and disclosed in the Examples. The method is typically carried out with a voltage applied across the membrane and pore. The voltage used is typically from +2 V to −2 V, typically −400 mV to +400 mV. The voltage used is preferably in a range having a lower limit selected from −400 mV, −300 mV, −200 mV, −150 mV, −100 mV, −50 mV, −20 mV and 0 mV and an upper limit independently selected from +10 mV, +20 mV, +50 mV, +100 mV, +150 mV, +200 mV, +300 mV and +400 mV. The voltage used is more preferably in the
range 100 mV to 240 mV and most preferably in the range of 120 mV to 220 mV. It is possible to increase discrimination between different nucleotides by a pore by using an increased applied potential. - The methods are typically carried out in the presence of any charge carriers, such as metal salts, for example alkali metal salt, halide salts, for example chloride salts, such as alkali metal chloride salt. Charge carriers may include ionic liquids or organic salts, for example tetramethyl ammonium chloride, trimethylphenyl ammonium chloride, phenyltrimethyl ammonium chloride, or 1-ethyl-3-methyl imidazolium chloride. In the exemplary apparatus discussed above, the salt is present in the aqueous solution in the chamber. Potassium chloride (KCl), sodium chloride (NaCl), caesium chloride (CsCl) or a mixture of potassium ferrocyanide and potassium ferricyanide is typically used. KCl, NaCl and a mixture of potassium ferrocyanide and potassium ferricyanide are preferred. The salt concentration may be at saturation. The salt concentration may be 3 M or lower and is typically from 0.1 to 2.5 M, from 0.3 to 1.9 M, from 0.5 to 1.8 M, from 0.7 to 1.7 M, from 0.9 to 1.6 M or from 1 M to 1.4 M. The salt concentration is preferably from 150 mM to 1 M. IIe1308, XPD, RecD and TraI helicases surprisingly work under high salt concentrations. The method is preferably carried out using a salt concentration of at least 0.3 M, such as at least 0.4 M, at least 0.5 M, at least 0.6 M, at least 0.8 M, at least 1.0 M, at least 1.5 M, at least 2.0 M, at least 2.5 M or at least 3.0 M. High salt concentrations provide a high signal to noise ratio and allow for currents indicative of the presence of a nucleotide to be identified against the background of normal current fluctuations.
- The methods are typically carried out in the presence of a buffer. In the exemplary apparatus discussed above, the buffer is present in the aqueous solution in the chamber. Any buffer may be used in the method of the invention. Typically, the buffer is HEPES. Another suitable buffer is Tris-HCl buffer. The methods are typically carried out at a pH of from 4.0 to 12.0, from 4.5 to 10.0, from 5.0 to 9.0, from 5.5 to 8.8, from 6.0 to 8.7 or from 7.0 to 8.8 or 7.5 to 8.5. The pH used is preferably about 7.5.
- The methods may be carried out at from 0° C. to 100° C., from 15° C. to 95° C., from 16° C. to 90° C., from 17° C. to 85° C., from 18° C. to 80° C., 19° C. to 70° C., or from 20° C. to 60° C. The methods are typically carried out at room temperature. The methods are optionally carried out at a temperature that supports enzyme function, such as about 37° C.
- The method may be carried out in the presence of free nucleotides or free nucleotide analogues and/or an enzyme cofactor that facilitates the action of the helicase or construct. The method may also be carried out in the absence of free nucleotides or free nucleotide analogues and in the absence of an enzyme cofactor. The free nucleotides may be one or more of any of the individual nucleotides discussed above. The free nucleotides include, but are not limited to, adenosine monophosphate (AMP), adenosine diphosphate (ADP), adenosine triphosphate (ATP), guanosine monophosphate (GMP), guanosine diphosphate (GDP), guanosine triphosphate (GTP), thymidine monophosphate (TMP), thymidine diphosphate (TDP), thymidine triphosphate (TTP), uridine monophosphate (UMP), uridine diphosphate (UDP), uridine triphosphate (UTP), cytidine monophosphate (CMP), cytidine diphosphate (CDP), cytidine triphosphate (CTP), cyclic adenosine monophosphate (cAMP), cyclic guanosine monophosphate (cGMP), deoxyadenosine monophosphate (dAMP), deoxyadenosine diphosphate (dADP), deoxyadenosine triphosphate (dATP), deoxyguanosine monophosphate (dGMP), deoxyguanosine diphosphate (dGDP), deoxyguanosine triphosphate (dGTP), deoxythymidine monophosphate (dTMP), deoxythymidine diphosphate (dTDP), deoxythymidine triphosphate (dTTP), deoxyuridine monophosphate (dUMP), deoxyuridine diphosphate (dUDP), deoxyuridine triphosphate (dUTP), deoxycytidine monophosphate (dCMP), deoxycytidine diphosphate (dCDP) and deoxycytidine triphosphate (dCTP). The free nucleotides are preferably selected from AMP, TMP, GMP, CMP, UMP, dAMP, dIMP, dGMP or dCMP. The free nucleotides are preferably adenosine triphosphate (ATP). The enzyme cofactor is a factor that allows the helicase or construct to function. The enzyme cofactor is preferably a divalent metal cation. The divalent metal cation is preferably Mg2+, Mn2+, Ca2+ or Co2+. The enzyme cofactor is most preferably Mg2+.
- In a preferred embodiment, the method comprises:
-
- (a) providing the complementary polynucleotide with one or more helicases and one or more molecular brakes attached to the polynucleotide;
- (b) contacting the complementary polynucleotide with a transmembrane pore and applying a potential across the pore such that the one or more helicases and the one or more molecular brakes are brought together and both control the movement of the polynucleotide through the pore;
- (c) taking one or more measurements as the complementary polynucleotide moves with respect to the pore wherein the measurements are indicative of one or more characteristics of the polynucleotide and thereby characterising the polynucleotide.
- This type of method is discussed in detail in the International application PCT/GB2014/052737.
- The one or more helicases may be any of those discussed above. The one or more molecular brakes may be any compound or molecule which binds to the polynucleotide and slows the movement of the polynucleotide through the pore. The one or more molecular brakes preferably comprise one or more compounds which bind to the polynucleotide. The one or more compounds are preferably one or more macrocycles. Suitable macrocycles include, but are not limited to, cyclodextrins, calixarenes, cyclic peptides, crown ethers, cucurbiturils, pillararenes, derivatives thereof or a combination thereof. The cyclodextrin or derivative thereof may be any of those disclosed in Eliseev, A. V., and Schneider, H-J. (1994) 1 Am. Chem. Soc. 116, 6081-6088. The agent is more preferably heptakis-6-amino-β-cyclodextrin (am7-βCD), 6-monodeoxy-6-monoamino-β-cyclodextrin (am1-βCD) or heptakis-(6-deoxy-6-guanidino)-cyclodextrin (gu7-βCD).
- The one or more molecular brakes are preferably not one or more single stranded binding proteins (SSB). The one or more molecular brakes are more preferably not a single-stranded binding protein (SSB) comprising a carboxy-terminal (C-terminal) region which does not have a net negative charge or (ii) a modified SSB comprising one or more modifications in its C-terminal region which decreases the net negative charge of the C-terminal region. The one or more molecular brakes are most preferably not any of the SSBs disclosed in International Application No. PCT/GB2013/051924 (published as WO 2014/013259).
- The one or more molecular brakes are preferably one or more polynucleotide binding proteins. The polynucleotide binding protein may be any protein that is capable of binding to the polynucleotide and controlling its movement through the pore. It is straightforward in the art to determine whether or not a protein binds to a polynucleotide. The protein typically interacts with and modifies at least one property of the polynucleotide. The protein may modify the polynucleotide by cleaving it to form individual nucleotides or shorter chains of nucleotides, such as di- or trinucleotides. The moiety may modify the polynucleotide by orienting it or moving it to a specific position, i.e. controlling its movement.
- The polynucleotide binding protein is preferably derived from a polynucleotide handling enzyme. The one or more molecular brakes may be derived from any of the polynucleotide handling enzymes discussed above. Modified versions of Phi29 polymerase (SEQ ID NO: 8) which act as molecular brakes are disclosed in U.S. Pat. No. 5,576,204. The one or more molecular brakes are preferably derived from a helicase.
- Any number of molecular brakes derived from a helicase may be used. For instance, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more helicases may be used as molecular brakes. If two or more helicases are be used as molecular brakes, the two or more helicases are typically the same helicase. The two or more helicases may be different helicases.
- The two or more helicases may be any combination of the helicases mentioned above. The two or more helicases may be two or more Dda helicases. The two or more helicases may be one or more Dda helicases and one or more TrwC helicases. The two or more helicases may be different variants of the same helicase.
- The two or more helicases are preferably attached to one another. The two or more helicases are more preferably covalently attached to one another. The helicases may be attached in any order and using any method. The one or more molecular brakes derived from helicases are preferably modified to reduce the size of an opening in the polynucleotide binding domain through which in at least one conformational state the polynucleotide can unbind from the helicase. This is disclosed in WO 2014/013260.
- Preferred helicase constructs for use in the invention are described in International Application Nos. PCT/GB2013/051925 (published as WO 2014/013260); PCT/GB2013/051924 (published as WO 2014/013259) PCT/GB2013/051928 (published as WO 2014/013262) and PCT/GB2014/052736.
- If the one or more helicases are used in the active mode (i.e. when the one or more helicases are provided with all the necessary components to facilitate movement, e.g. ATP and Mg2+), the one or more molecular brakes are preferably (a) used in an inactive mode (i.e. are used in the absence of the necessary components to facilitate movement or are incapable of active movement), (b) used in an active mode where the one or more molecular brakes move in the opposite direction to the one or more helicases or (c) used in an active mode where the one or more molecular brakes move in the same direction as the one or more helicases and more slowly than the one or more helicases.
- If the one or more helicases are used in the inactive mode (i.e when the one or more helicases are not provided with all the necessary components to facilitate movement, e.g. ATP and Mg2+ or are incapable of active movement), the one or more molecular brakes are preferably (a) used in an inactive mode (i.e. are used in the absence of the necessary components to facilitate movement or are incapable of active movement) or (b) used in an active mode where the one or more molecular brakes move along the polynucleotide in the same direction as the polynucleotide through the pore.
- The one or more helicases and one or more molecular brakes may be attached to the polynucleotide at any positions so that they are brought together and both control the movement of the polynucleotide through the pore. The one or more helicases and one or more molecular brakes are at least one nucleotide apart, such as at least 5, at least 10, at least 50, at least 100, at least 500, at least 1000, at least 5000, at least 10,000, at least 50,000 nucleotides or more apart. If the method concerns characterising a double stranded polynucleotide provided with a Y adaptor at one end and a hairpin loop adaptor at the other end, the one or more helicases are preferably attached to the Y adaptor and the one or more molecular brakes are preferably attached to the hairpin loop adaptor. In this embodiment, the one or more molecular brakes are preferably one or more helicases that are modified such that they bind the polynucleotide but do not function as a helicase. The one or more helicases attached to the Y adaptor are preferably stalled at a spacer as discussed in more detail below. The one or more molecular brakes attach to the hairpin loop adaptor are preferably not stalled at a spacer. The one or more helicases and the one or more molecular brakes are preferably brought together when the one or more helicases reach the hairpin loop. The one or more helicases may be attached to the Y adaptor before the Y adaptor is attached to the polynucleotide or after the Y adaptor is attached to the polynucleotide. The one or more molecular brakes may be attached to the hairpin loop adaptor before the hairpin loop adaptor is attached to the polynucleotide or after the hairpin loop adaptor is attached to the polynucleotide.
- The one or more helicases and the one or more molecular brakes are preferably not attached to one another. The one or more helicases and the one or more molecular brakes are more preferably not covalently attached to one another. The one or more helicases and the one or more molecular brakes are preferably not attached as described in International Application Nos. PCT/GB2013/051925 (published as WO 2014/013260); PCT/GB2013/051924 (published as WO 2014/013259) PCT/GB2013/051928 (published as WO 2014/013262) and PCT/GB2014/052736.
- The one or more helicases may be stalled at one or more spacers as discussed in International Application No. PCT/GB2014/050175. Any configuration of one or more helicases and one or more spacers disclosed in the International Application may be used in this invention.
- When a part of the complementary polynucleotide enters the pore and moves through the pore along the field resulting from the applied potential, the one or more helicases are moved past the spacer by the pore as the polynucleotide moves through the pore. This is because the complementary polynucleotide (including the one or more spacers) moves through the pore and the one or more helicases remain on top of the pore.
- The one or more spacers are preferably part of the complementary polynucleotide, for instance they interrupt(s) the polynucleotide sequence. The one or more spacers are preferably not part of one or more blocking molecules, such as speed bumps, hybridised to the polynucleotide.
- There may be any number of spacers in the complementary polynucleotide, such as 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more spacers. There are preferably two, four or six spacers in the polynucleotide. There may be one or more spacers in different regions of the polynucleotide, such as one or more spacers in the Y adaptor and/or hairpin loop adaptor.
- The one or more spacers each provides an energy barrier which the one or more helicases cannot overcome even in the active mode. The one or more spacers may stall the one or more helicases by reducing the traction of the helicase (for instance by removing the bases from the nucleotides in the polynucleotide) or physically blocking movement of the one or more helicases (for instance using a bulky chemical group).
- The one or more spacers may comprise any molecule or combination of molecules that stalls the one or more helicases. The one or more spacers may comprise any molecule or combination of molecules that prevents the one or more helicases from moving along the polynucleotide. It is straightforward to determine whether or not the one or more helicases are stalled at one or more spacers in the absence of a transmembrane pore and an applied potential. For instance, the ability of a helicase to move past a spacer and displace a complementary strand of DNA can be measured by PAGE.
- The one or more spacers typically comprise a linear molecule, such as a polymer. The one or more spacers typically have a different structure from the polynucleotide. For instance, if the polynucleotide is DNA, the one or more spacers are typically not DNA. In particular, if the polynucleotide is deoxyribonucleic acid (DNA) or ribonucleic acid (RNA), the one or more spacers preferably comprise peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA) or a synthetic polymer with nucleotide side chains. The one or more spacers may comprise one or more nucleotides in the opposite direction from the polynucleotide. For instance, the one or more spacers may comprise one or more nucleotides in the 3′ to 5′ direction when the polynucleotide is in the 5′ to 3′ direction. The nucleotides may be any of those discussed above.
- The one or more spacers preferably comprises one or more nitroindoles, such as one or more 5-nitroindoles, one or more inosines, one or more acridines, one or more 2-aminopurines, one or more 2-6-diaminopurines, one or more 5-bromo-deoxyuridines, one or more inverted thymidines (inverted dTs), one or more inverted dideoxy-thymidines (ddTs), one or more dideoxy-cytidines (ddCs), one or more 5-methylcytidines, one or more 5-hydroxymethylcytidines, one or more 2′-O-Methyl RNA bases, one or more Iso-deoxycytidines (Iso-dCs), one or more Iso-deoxyguanosines (Iso-dGs), one or more iSpC3 groups (i.e. nucleotides which lack sugar and a base), one or more photo-cleavable (PC) groups, one or more hexandiol groups, one or more spacer 9 (iSp9) groups, one or more spacer 18 (iSp18) groups, a polymer or one or more thiol connections. The one or more spacers may comprise any combination of these groups. Many of these groups are commercially available from IDT® (Integrated DNA Technologies®).
- The one or more spacers may contain any number of these groups. For instance, for 2-aminopurines, 2-6-diaminopurines, 5-bromo-deoxyuridines, inverted dTs, ddTs, ddCs, 5-methylcytidines, 5-hydroxymethylcytidines, 2′-O-Methyl RNA bases, Iso-dCs, Iso-dGs, iSpC3 groups, PC groups, hexandiol groups and thiol connections, the one or more spacers preferably comprise 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more. The one or more spacers preferably comprise 2, 3, 4, 5, 6, 7, 8 or more iSp9 groups. The one or more spacers preferably comprise 2, 3, 4, 5 or 6 or more iSp18 groups. The most preferred spacer is four iSp18 groups.
- The polymer is preferably a polypeptide or a polyethylene glycol (PEG). The polypeptide preferably comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more amino acids. The PEG preferably comprises 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more monomer units.
- The one or more spacers preferably comprise one or more abasic nucleotides (i.e. nucleotides lacking a nucleobase), such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more abasic nucleotides. The nucleobase can be replaced by —H (idSp) or —OH in the abasic nucleotide. Abasic spacers can be inserted into polynucleotides by removing the nucleobases from one or more adjacent nucleotides. For instance, polynucleotides may be modified to include 3-methyladenine, 7-methylguanine, 1,N6-ethenoadenine inosine or hypoxanthine and the nucleobases may be removed from these nucleotides using Human Alkyladenine DNA Glycosylase (hAAG). Alternatively, polynucleotides may be modified to include uracil and the nucleobases removed with Uracil-DNA Glycosylase (UDG). In one embodiment, the one or more spacers do not comprise any abasic nucleotides.
- The one or more helicases may be stalled by (i.e. before) or on each linear molecule spacers. If linear molecule spacers are used, the polynucleotide is preferably provided with a double stranded region of polynucleotide adjacent to the end of each spacer past which the one or more helicases are to be moved. The double stranded region typically helps to stall the one or more helicases on the adjacent spacer. The presence of the double stranded region(s) is particularly preferred if the method is carried out at a salt concentration of about 100 mM or lower. Each double stranded region is typically at least 10, such as at least 12, nucleotides in length. If the polynucleotide used in the invention is single stranded, a double stranded region may formed by hybridising a shorter polynucleotide to a region adjacent to a spacer. The shorter polynucleotide is typically formed from the same nucleotides as the polynucleotide, but may be formed from different nucleotides. For instance, the shorter polynucleotide may be formed from LNA.
- If linear molecule spacers are used, the polynucleotide is preferably provided with a blocking molecule at the end of each spacer opposite to end past which the one or more helicases are to be moved. In other words, the helicase is stalled between a blocking molecule and a spacer. This can help to ensure that the one or more helicases remain stalled on each spacer. It may also help retain the one or more helicases on the polynucleotide in the case that it/they diffuse(s) off in solution. The blocking molecule may be any of the chemical groups discussed below which physically cause the one or more helicases to stall. The blocking molecule may be a double stranded region of polynucleotide.
- The one or more spacers preferably comprise one or more chemical groups which physically cause the one or more helicases to stall. The one or more chemical groups are preferably one or more pendant chemical groups. The one or more chemical groups may be attached to one or more nucleobases in the polynucleotide. The one or more chemical groups may be attached to the polynucleotide backbone. Any number of these chemical groups may be present, such as 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more. Suitable groups include, but are not limited to, fluorophores, streptavidin and/or biotin, cholesterol, methylene blue, dinitrophenols (DNPs), digoxigenin and/or anti-digoxigenin and dibenzylcyclooctyne groups.
- Different spacers in the polynucleotide may comprise different stalling molecules. For instance, one spacer may comprise one of the linear molecules discussed above and another spacer may comprise one or more chemical groups which physically cause the one or more helicases to stall. A spacer may comprise any of the linear molecules discussed above and one or more chemical groups which physically cause the one or more helicases to stall, such as one or more abasics and a fluorophore.
- Suitable spacers can be designed depending on the type of polynucleotide and the conditions under which the method of the invention is carried out. Most helicases bind and move along DNA and so may be stalled using anything that is not DNA. Suitable molecules are discussed above.
- The method of the invention is preferably carried out in the presence of free nucleotides and/or the presence of a helicase cofactor. This is discussed in more detail below. In the absence of the transmembrane pore and an applied potential, the one or more spacers are preferably capable of stalling the one or more helicases in the presence of free nucleotides and/or the presence of a helicase cofactor.
- If the method of the invention is carried out in the presence of free nucleotides and a helicase cofactor as discussed below (such that the one of more helicases are in the active mode), one or more longer spacers are typically used to ensure that the one or more helicases are stalled on the polynucleotide before they are contacted with the transmembrane pore and a potential is applied. One or more shorter spacers may be used in the absence of free nucleotides and a helicase cofactor (such that the one or more helicases are in the inactive mode).
- The salt concentration also affects the ability of the one or more spacers to stall the one or more helicases. In the absence of the transmembrane pore and an applied potential, the one or more spacers are preferably capable of stalling the one or more helicases at a salt concentration of about 100 mM or lower. The higher the salt concentration used in the method of the invention, the shorter the one or more spacers that are typically used and vice versa.
- Preferred combinations of features are shown in the Table 4 below.
-
Spacer length Spacer (i.e. Free Polynucle- compo- number nucle- Helicase otide sition* of*) Salt [ ] otides? cofactor? DNA iSpC3 4 1M Yes Yes DNA iSp18 4 100-1000 mM Yes Yes DNA iSp18 6 <100-1000 mM Yes Yes DNA iSp18 2 1M Yes Yes DNA iSpC3 12 <100-1000 mM Yes Yes DNA iSpC3 20 <100-1000 mM Yes Yes DNA iSp9 6 100-1000 mM Yes Yes DNA idSp 4 1M Yes Yes - The method may concern moving two or more helicases past a spacer. In such instances, the length of the spacer is typically increased to prevent the trailing helicase from pushing the leading helicase past the spacer in the absence of the pore and applied potential. If the method concerns moving two or more helicases past one or more spacers, the spacer lengths discussed above may be increased at least 1.5 fold, such 2 fold, 2.5 fold or 3 fold. For instance, if the method concerns moving two or more helicases past one or more spacers, the spacer lengths in the third column of Table 4 above may be increased 1.5 fold, 2 fold, 2.5 fold or 3 fold.
- In a preferred embodiment, the complementary polynucleotide is linked to the target RNA using a bridging moiety. As discussed above, step (a) preferably comprises hybridising a primer to the target RNA and using the primer to form the complementary polynucleotide. The primer preferably comprises a bridging moiety and the bridging moiety is preferably attached to the target RNA such that the complementary polynucleotide linked to the target RNA. Step (b) preferably comprises contacting the linked construct comprising the complementary polynucleotide and the target RNA with a transmembrane pore such that such that both the complementary polynucleotide and the target RNA move through the pore. The complementary polynucleotide is preferably contacted with the pore before the target RNA. Step c) preferably comprises taking one or more measurements as both the complementary polynucleotide and the target RNA move with respect to the pore wherein the measurements are indicative of one or more characteristics of the complementary polynucleotide and the target RNA and thereby characterising the target double stranded polynucleotide.
- Linking and interrogating both the complementary polynucleotide and the target RNA in this way increases the efficiency and accuracy of characterization.
- The bridging moiety is capable of linking the two strands of the target polynucleotide. The bridging moiety typically covalently links the two strands of the target polynucleotide. The bridging moiety can be anything that is capable of linking the two strands of the target polynucleotide, provided that the bridging moiety does not interfere with movement of the single stranded polynucleotide through the transmembrane pore.
- The bridging moiety may be linked to the target polynucleotide by any suitable means known in the art. The bridging moiety may be synthesized separately and chemically attached or enzymatically ligated to the target polynucleotide. Alternatively, the bridging moiety may be generated in the processing of the target polynucleotide.
- The bridging moiety is linked to the target polynucleotide at or near one end of the target polynucleotide. The bridging moiety is preferably linked to the target polynucleotide within 10 nucleotides of the end of the target polynucleotide
- Suitable bridging moieties include, but are not limited to a polymeric linker, a chemical linker, a polynucleotide or a polypeptide. Preferably, the bridging moiety comprises DNA, RNA, modified DNA (such as abasic DNA), RNA, PNA, LNA or PEG. The bridging moiety is more preferably DNA or RNA.
- The bridging moiety is most preferably a hairpin loop or a hairpin loop adaptor. Suitable hairpin adaptors can be designed using methods known in the art. The hairpin loop may be any length. The hairpin loop is typically 110 or fewer nucleotides, such as 100 or fewer nucleotides, 90 or fewer nucleotides, 80 or fewer nucleotides, 70 or fewer nucleotides, 60 or fewer nucleotides, 50 or fewer nucleotides, 40 or fewer nucleotides, 30 or fewer nucleotides, 20 or fewer nucleotides or 10 or fewer nucleotides, in length. The hairpin loop is preferably from about 1 to 110, from 2 to 100, from 5 to 80 or from 6 to 50 nucleotides in length. Longer lengths of the hairpin loop, such as from 50 to 110 nucleotides, are preferred if the loop is involved in the differential selectability of the adaptor. Similarly, shorter lengths of the hairpin loop, such as from 1 to 5 nucleotides, are preferred if the loop is not involved in the selectable binding as discussed below.
- The hairpin adaptor may be ligated to either end of the complementary polynucleotide and/or target RNA, i.e. the 5′ or the 3′ end. The hairpin adaptor may be ligated using any method known in the art. The hairpin adaptor may be ligated using a ligase, such as T4 DNA ligase, E. coli DNA ligase, Taq DNA ligase, Tma DNA ligase and 9° N DNA ligase.
- The complementary polynucleotide and the target RNA may be separated as or before the linked construct is contacted with the pore in accordance with the invention. They may be separated as the polynucleotide movement through the pore is controlled by a polynucleotide binding protein, such as a helicase, or molecular brake.
- The complementary polynucleotide and the target RNA may be separated using any method known in the art. For instance, they may be separated by a polynucleotide binding protein or using conditions which favour dehybridsation (examples of conditions which favour dehybridisation include, but are not limited to, high temperature, high pH and the addition of agents that can disrupt hydrogen bonding or base pairing, such as formamide and urea).
- The hairpin adaptor preferably comprises a selectable binding moiety. This allows the linked construct to be purified or isolated. A selectable binding moiety is a moiety that can be selected on the basis of its binding properties. Hence, a selectable binding moiety is preferably a moiety that specifically binds to a surface. A selectable binding moiety specifically binds to a surface if it binds to the surface to a much greater degree than any other moiety used in the invention. In preferred embodiments, the moiety binds to a surface to which no other moiety used in the invention binds.
- Suitable selective binding moieties are known in the art. Preferred selective binding moieties include, but are not limited to, biotin, a polynucleotide sequence, antibodies, antibody fragments, such as Fab and ScSv, antigens, polynucleotide binding proteins, poly histidine tails and GST tags. The most preferred selective binding moieties are biotin and a selectable polynucleotide sequence. Biotin specifically binds to a surface coated with avidins. Selectable polynucleotide sequences specifically bind (i.e. hybridise) to a surface coated with homologus sequences. Alternatively, selectable polynucleotide sequences specifically bind to a surface coated with polynucleotide binding proteins.
- The hairpin adaptor and/or the selectable binding moiety may comprise a region that can be cut, nicked, cleaved or hydrolysed. Such a region can be designed to allow the complementary polynucleotide and/or target RNA to be removed from the surface to which it is bound following purification or isolation. Suitable regions are known in the art. Suitable regions include, but are not limited to, an RNA region, a region comprising desthiobiotin and streptavidin, a disulphide bond and a photocleavable region.
- The linked construct preferably comprises a leader sequence at the opposite end from the bridging moiety, such as a hairpin loop or hairpin loop adaptor. Leader sequences are discussed in more detail below.
- Before the contacting step, the method preferably comprises attaching to the linked construct a leader sequence which preferentially threads into the pore. The leader sequence facilitates the method of the invention. The leader sequence is designed to preferentially thread into the pore of the invention and thereby facilitate the movement of polynucleotide through the pore. The leader sequence can also be used to link the construct to the one or more anchors as discussed above. The leader sequence may be linked to the complementary polynucleotide or the target RNA.
- The leader sequence typically comprises a polymer. The polymer is preferably negatively charged. The polymer is preferably a polynucleotide, such as DNA or RNA, a modified polynucleotide (such as abasic DNA), PNA, LNA, polyethylene glycol (PEG) or a polypeptide. The leader preferably comprises a polynucleotide and more preferably comprises a single stranded polynucleotide. The leader sequence can comprise any of the polynucleotides discussed above. The single stranded leader sequence most preferably comprises a single strand of DNA, such as a poly dT section. The leader sequence preferably comprises the one or more spacers.
- The leader sequence can be any length, but is typically 10 to 150 nucleotides in length, such as from 20 to 150 nucleotides in length. The length of the leader typically depends on the transmembrane pore used in the method.
- The leader sequence is preferably part of a Y adaptor as defined below.
- The method of the invention may involve double coupling of the complementary polynucleotide and target RNA. In a preferred embodiment, the method of the invention comprises:
-
- (a) providing the complementary polynucleotide and target RNA with a Y adaptor at one end and a bridging moiety adaptor, such as a hairpin loop adaptor, at the other end, wherein the Y adaptor comprises one or more first anchors for coupling the polynucleotide to the membrane, wherein the hairpin loop adaptor comprises one or more second anchors for coupling the polynucleotide to the membrane and wherein the strength of coupling of the hairpin loop adaptor to the membrane is greater than the strength of coupling of the Y adaptor to the membrane;
- (b) contacting the polynucleotide provided in step (a) with the pore the invention such that the polynucleotide moves through the pore; and
- (c) taking one or more measurements as the polynucleotide moves with respect to the pore, wherein the measurements are indicative of one or more characteristics of the polynucleotide, and thereby characterising the target polynucleotide.
- This type of method is discussed in detail in the UK Application No. 1406147.7.
- The double stranded polynucleotide is provided with a Y adaptor at one end and a hairpin loop adaptor at the other end. The Y adaptor and/or the hairpin adaptor are typically polynucleotide adaptors. They may be formed from any of the polynucleotides discussed above.
- The Y adaptor typically comprises (a) a double stranded region and (b) a single stranded region or a region that is not complementary at the other end. The Y adaptor may be described as having an overhang if it comprises a single stranded region. The presence of a non-complementary region in the Y adaptor gives the adaptor its Y shape since the two strands typically do not hybridise to each other unlike the double stranded portion. The Y adaptor comprises the one or more first anchors. Anchors are discussed in more detail above.
- The Y adaptor preferably comprises a leader sequence which preferentially threads into the pore. This is discussed above.
- The hairpin adaptor preferably comprises a selectable binding moiety as discussed above. The hairpin adaptor and/or the selectable binding moiety may comprise a region that can be cut, nicked, cleaved or hydrolysed as discussed above.
- If one or more helicases and one or more molecular brakes are used as discussed above, the Y adaptor preferably comprises the one or more helicases and the hairpin loop adaptor preferably comprises the one or more molecular brakes.
- The Y adaptor and/or the hairpin adaptor may be ligated to the polynucleotide using any method known in the art. One or both of the adaptors may be ligated using a ligase, such as T4 DNA ligase, E. coli DNA ligase, Taq DNA ligase, Tma DNA ligase and 9° N DNA ligase. Alternatively, the adaptors may be added to the polynucleotide using the methods of the invention discussed below.
- In a preferred embodiment, step a) of the method comprises modifying the double stranded polynucleotide so that it comprises the Y adaptor at one end and the hairpin loop adaptor at the other end. Any manner of modification can be used. The methods of modification and characterisation may be combined in any way.
- The strength of coupling (or binding) of the bridging moiety (or hairpin) adaptor to the membrane is greater than the strength of coupling (or binding) of the Y adaptor to the membrane. This can be measured in any way. A suitable method for measuring the strength of coupling (or binding) is disclosed in the Examples of the UK Application No. 1406147.7.
- The strength of coupling (or binding) of the hairpin loop adaptor is preferably at least 1.5 times the strength of coupling (or binding) of the hairpin loop adaptor, such as at least twice, at least three times, at least four times, at least five or at least ten times the strength of coupling (or binding) of the anchor adaptor. The affinity constant (Kd) of the hairpin loop adaptor for the membrane is preferably at least 1.5 times the affinity constant of the Y adaptor, such as at least twice, at least three times, at least four times, at least five or at least ten times the strength of coupling of the Y adaptor.
- There are several ways in which the hairpin loop adaptor couples (or binds) more strongly to the membrane than the Y adaptor. For instance, the hairpin loop adaptor may comprise more anchors than the Y adaptor. For instance, the hairpin loop adaptor may comprise 2, 3 or more second anchors whereas the Y adaptor may comprise one first anchor.
- The strength of coupling (or binding) of the one or more second anchors to the membrane may be greater than the strength of coupling (or binding) of the one or more first anchors to the membrane. The strength of coupling (or binding) of the one or more second anchors to the hairpin loop adaptor may be greater than the strength of coupling (or binding) of the one or more first anchors to the Y adaptor. The one or more first anchors and the one or more second anchors may be attached to their respective adaptors via hybridisation and the strength of hybridisation is greater in the one or more second anchors than in the one or more first anchors. Any combination of these embodiments may also be used in the invention. Strength of coupling (or binding) may be measure using known techniques in the art.
- The one or more second anchors preferably comprise one or more groups which couples(s) (or bind(s)) to the membrane with a greater strength than the one or more groups in the one or more first anchors which couple(s) (or bind(s)) to the membrane. In preferred embodiments, the hairpin loop adaptor/one or more second anchors couple (or bind) to the membrane using cholesterol and the Y adaptor/one or more first anchors couple (or bind) to the membrane using palmitate. Cholesterol binds to triblock copolymer membranes and lipid membranes more strongly than palmitate. In an alternative embodiment, the hairpin loop adaptor/one or more second anchors couple (or bind) to the membrane using a mono-acyl species, such as palmitate, and the Y adaptor/one or more first anchors couple (or bind) to the membrane using a diacyl species, such as dipalmitoylphosphatidylcholine.
- mRNA is preferably used in the invention to diagnose or prognose a disease or condition. Some diseases or conditions are associated with an altered amount (or level) of mRNA. The mRNA may be normal or wild-type mRNA, i.e. not alternately spliced. The amount (or level) of the mRNA may be increased or decreased in the disease or condition compared with the amount (or level) in a patient without the disease or condition. Such diseases or conditions may be diagnosed or prognosed by determining the amount of the mRNA in a sample from the patient using a method of the invention.
- Many genetic diseases or conditions are caused by mutations that cause alternate mRNA splicing, such as mRNA splicing defects. A number of diseases or conditions are associated with alternate mRNA splicing which are not attributed to overt mutations. The presence or absence of alternate splicing can be identified by determining the presence or absence of an alternately spliced mRNA in a sample from the patient using the method of the invention. In some instances, alternate mRNA splicing may be the normal function of a cell. In such instances, an increased or decreased amount (or level) of the alternately spliced mRNA compared with the normal amount (i.e. the amount in a patient without the disease or condition) may be used to diagnose or prognose the disease or condition.
- The invention provides a method of diagnosing or prognosing a disease or condition associated with an altered amount and/or alternate splicing of messenger RNA (mRNA) in a patient. The invention provides a method of determining whether or not a patient has or is at risk of developing a disease or condition associated with an altered amount and/or alternate splicing of messenger RNA (mRNA). In each instance, the method comprises determining the amount and/or identity of the mRNA in a sample from the patient using a method of the invention. The disease or condition may be any of those discussed below. The disease or condition is preferably cystic fibrosis, familial dysautonomia, frontotemporal lobar dementia, amyotrophic lateral sclerosis, Hutchinson-Gilford progeria syndrome, medium-chain acyl-CoA dehydrogenase (MCAD) deficiency, myotonic dystrophy, Prader-Willi syndrome, spinal muscular atrophy, tauopathy, hypercholesterolemia or cancer. These diseases, their causes and possible treatments are discussed in Tazi et al. (Biochimica et Biophysica Acta (BBA)—Molecular Basis of Disease, Volume 1792,
Issue 1, January 2009, Pages 14-26). - The presence of an altered (i.e. increased or decreased) amount (or level) of the mRNA in the sample from the patient typically diagnoses or prognoses the disease or condition, i.e. indicates that the patient has or is at risk of developing the disease or condition. The absence of an altered (i.e. increased or decreased) amount (or level) of the mRNA in the sample from the patient typically indicates that the patient does not have or is not at risk of developing the disease or condition. The amount of mRNA can be determined as discussed above.
- The presence of the alternately spliced mRNA in the sample from the patient typically diagnoses or prognoses the disease or condition, i.e. indicates that the patient has or is at risk of developing the disease or condition. The absence of the alternately spliced mRNA in the sample from the patient typically indicates that the patient does not have or is not at risk of developing the disease or condition. The presence or absence of the alternately spliced mRNA can be determined by identifying RNA in the sample as discussed above.
- An increased or decreased amount (or level) of the alternately spliced mRNA in the sample from the patient typically diagnoses or prognoses the disease or condition, i.e. indicates that the patient has or is at risk of developing the disease or condition. No change in the amount of the alternately spliced mRNA in the sample from the patient (compared with the amount or level in a patient without the disease or condition) typically indicates that the patient does not have or is not at risk of developing the disease or condition. The amount of the alternately spliced mRNA can be determined as discussed above.
- miRNA is preferably used in the invention to diagnose or prognose a disease or condition. The invention provides a method of diagnosing or prognosing a disease or condition associated with a miRNA. The invention provides a method of determining whether or not a patient has or is at risk of developing a disease or condition associated with a miRNA. The method comprises determining the presence or absence of the miRNA in a sample from the patient using a method of the invention. The disease or condition may be any of those discussed below.
- The presence of the miRNA in the sample from the patient typically indicates that the patient has or is at risk of developing the disease or condition. The absence of the miRNA in the sample from the patient typically indicates that the patient does not have or is not at risk of developing the disease or condition. The presence or absence of the miRNA can be determined by identifying any miRNAs in the sample as discussed above.
- The disease or condition is preferably cancer, coronary heart disease, cardiovascular disease or sepsis. The disease or condition is more preferably abdominal aortic aneurysm, acute lymphoblastic leukemia (ALL), acute myeloid leukemia (AML), acute myocardial infarction, acute promyelocytic leukemia (APL), adenoma, adrenocortical carcinoma, alcoholic liver disease, Alzheimer's disease, anaplastic thyroid carcinoma (ATC), anxiety disorder, asthma, astrocytoma, atopic dermatitis, autism spectrum disorder (ASD), B-cell chronic lymphocytic leukemia, B-cell lymphoma, Becker muscular dystrophy (BMD), bladder cancer, brain neoplasm, breast cancer, Burkitt lymphoma, cardiac hypertrophy, cardiomyopathy, cardiovascular disease, cerebellar neurodegeneration, cervical cancer, cholangiocarcinoma, cholesteatoma, choriocarcinoma, chronic lymphocytic leukemia, chronic myeloid leukemia, chronic pancreatitis, colon carcinoma, colorectal cancer, congenital heart disease, coronary artery disease, cowden syndrome, dermatomyositis (DM), diabetic nephropathy, diarrhea predominant irritable bowel syndrome, diffuse large B-cell lymphoma, dilated cardiomyopathy, down syndrome (DS), duchenne muscular dystrophy (DMD), endometrial cancer, endometrial endometrioid adenocarcinoma, endometriosis, epithelial ovarian cancer, esophageal cancer, esophagus squamous cell carcinoma, essential thrombocythemia (ET), facioscapulohumeral muscular dystrophy (FSHD), follicular lymphoma (FL), follicular thyroid carcinoma (FTC), frontotemporal dementia, gastric cancer (stomach cancer), glioblastoma, glioblastoma multiforme (GBM), glioma, glomerular disease, glomerulosclerosis, hamartoma, HBV-related cirrhosis, HCV infection, head and neck cancer, head and neck squamous cell carcinoma (HNSCC), hearing loss, heart disease, heart failure, hepatitis B, hepatitis C, hepatocellular carcinoma (HCC), hilar cholangiocarcinoma, Hodgkin's lymphoma, homozygous sickle cell disease (HbSS), Huntington's disease (HD), hypertension, hypopharyngeal cancer, inclusion body myositis (IBM), insulinoma, intrahepatic cholangiocarcinoma (ICC), kidney cancer, kidney disease, laryngeal carcinoma, late insomnia (sleep disease), leiomyoma of lung, leukemia, limb-girdle muscular dystrophies types 2A (LGMD2A), lipoma, lung adenocarcinoma, lung cancer, lymphoproliferative disease, malignant lymphoma, malignant melanoma, malignant mesothelioma (MM), mantle cell lymphoma (MCL), medulloblastoma, melanoma, meningioma, metabolic disease, miyoshi myopathy (MM), multiple myeloma (MM), multiple sclerosis, MYC-rearranged lymphoma, myelodysplastic syndrome, myeloproliferative disorder, myocardial infarction, myocardial injury, myoma, nasopharyngeal carcinoma (NPC), nemaline myopathy (NM), nephritis, neuroblastoma (NB), neutrophilia, Niemann-Pick type C (NPC) disease, non-alcoholic fatty liver disease (NAFLD), non-small cell lung cancer (NSCLC), obesity, oral carcinomaosteosarcoma ovarian cancer (OC), pancreatic cancer, pancreatic ductal adenocarcinoma (PDAC), pancreatic neoplasia, panic disease, papillary thyroid carcinoma (PTC), Parkinson's disease, PFV-1 infection, pharyngeal disease, pituitary adenoma, polycystic kidney disease, polycystic liver disease, polycythemia vera (PV), polymyositis (PM), primary biliary cirrhosis (PBC), primary myelofibrosis, prion disease, prostate cancer, psoriasic arthritis, psoriasis, pulmonary hypertension, recurrent ovarian cancer, renal cell carcinoma, renal clear cell carcinoma, retinitis pigmentosa (RP), retinoblastoma, rhabdomyosarcoma, rheumatic heart disease and atrial fibrillation, rheumatoid arthritis, sarcoma, schizophrenia, sepsis, serous ovarian cancer, Sezary syndrome, skin disease, small cell lung cancer, spinocerebellar ataxia, squamous carcinoma, T-cell leukemia, teratocarcinoma, testicular germ cell tumor, thalassemia, thyroid cancer, tongue squamous cell carcinoma, tourette's syndrome, type 2 diabetes, ulcerative colitis (UC), uterine leiomyoma (ULM), uveal melanoma, vascular disease, vesicular stomatitis or Waldenstrom macroglobulinemia (WM).
- The patient may be any of the mammals discussed above. The patient is preferably human. The patient is an individual.
- The sample may be any of those discussed above. The sample is typically from any tissue or bodily fluid. The sample typically comprises a body fluid and/or cells of the patient and may, for example, be obtained using a swab, such as a mouth swab. The sample may be, or be derived from, blood, urine, saliva, skin, cheek cell or hair root samples. The target RNA is typically extracted from the sample before it is used in the method of the invention.
- The method may concern diagnosis of the disease or condition in the patient, i.e. determining whether or not the patient has the disease or condition. The patient may be symptomatic.
- The method may concern prognosing the disease or condition in the patient, i.e. determining whether or not the patient is likely to develop the disease or condition. The patient can be asymptomatic. The patient can have a genetic predisposition to the disease or condition. The patient may have one or more family member(s) with the disease or condition.
- The invention also provides a kit for characterising a target RNA. The kit comprises (a) a transmembrane pore and (b) a reverse transcriptase enzyme and/or a reverse transcription primer. Any of the embodiments discussed above with reference to the method of the invention equally apply to the kits.
- The kit may further comprise the components of a membrane, such as the phospholipids needed to form an amphiphilic layer, such as a lipid bilayer.
- The kit of the invention may additionally comprise one or more other reagents or instruments which enable any of the embodiments mentioned above to be carried out. Such reagents or instruments include one or more of the following: suitable buffer(s) (aqueous solutions), means to obtain a sample from a subject (such as a vessel or an instrument comprising a needle), a membrane as defined above or voltage or patch clamp apparatus. Reagents may be present in the kit in a dry state such that a fluid sample resuspends the reagents. The kit may also, optionally, comprise instructions to enable the kit to be used in the method of the invention or details regarding which patients the method may be used for. The kit typically comprises nucleotides. The kit preferably comprises dAMP, dTMP, dGMP and dCMP. The kit preferably does not comprise means to amplify and/or express polynucleotides.
- The invention also provides an apparatus for characterising target RNAs. The apparatus comprises (a) a plurality of pores and (b) a plurality of a plurality of reverse transcriptase enzymes and/or a plurality of reverse transcription primers. The apparatus preferably further comprises instructions for carrying out the method of the invention. The apparatus may be any conventional apparatus for polynucleotide analysis, such as an array or a chip. Any of the embodiments discussed above with reference to the methods of the invention are equally applicable to the apparatus of the invention.
- The apparatus is preferably set up to carry out the method of the invention.
- The apparatus preferably comprises:
- a sensor device that is capable of supporting the plurality of pores and being operable to perform RNA characterisation using the pores and the helicases or constructs; and
- at least one reservoir for holding material for performing the characterisation.
- The apparatus preferably comprises:
- a sensor device that is capable of supporting the plurality of pores and being operable to perform RNA characterisation using the pores and the helicases or constructs; and
- at least one port for delivery of the material for performing the characterisation.
- The apparatus preferably comprises:
- a sensor device that is capable of supporting the membrane and plurality of pores and being operable to perform RNA characterising using the pores;
- at least one reservoir for holding material for performing the characterising;
- a fluidics system configured to controllably supply material from the at least one reservoir to the sensor device; and
- one or more containers for receiving respective samples, the fluidics system being configured to supply the samples selectively from one or more containers to the sensor device. The apparatus may be any of those described in International Application No. PCT/GB08/004127 (published as WO 2009/077734), PCT/GB10/000789 (published as WO 2010/122293), International Application No. PCT/GB10/002206 (published as WO 2011/067559) or International Application No. PCT/US99/25679 (published as WO 00/28312).
- The following Examples illustrates the invention.
- This example describes the sample preparation procedure used to produce the cDNA which can then be characterised using the nanopore system. The steps of the procedure outlined in this example are steps A to C shown in
FIG. 1 . - Complementary DNA (cDNA, SEQ ID NO: 11) was reverse transcribed from messenger RNA (SEQ ID NO: 8, mRNA) by SuperScript II Reverse Transcriptase (Life Technologies) using an adaption of the standard SuperScript II protocol (the protocol that was followed is shown below).
-
-
Stock Final Reagent Add Concentration Concentration mRNA (SEQ ID NO: 8) 3.5 ul 1 mg/ml Custom pT Primer (SEQ ID 1.1 ul 10 uM 550 nM NO: 9 attached at its 3′ end to four iSpC3 spacers which are attached to the 5′ end of SEQ ID NO: 10) dNTP Mix 1 ul 10 mM Ea 0.5 mM Ea 5x Superscript II Buffer 4 ul 5x 1x DTT 2 ul 0.1M 10 mM Superscript II 1 ul Nuclease Free H2O 7.4 ul Total 20 ul - The mRNA (SEQ ID NO: 8), custom pT primer (SEQ ID NO: 9 attached at its 3′ end to four iSpC3 spacers which are attached to the 5′ end of SEQ ID NO: 10), dNTP mix and nuclease free water were mixed together and then heated to 65° C. for 5 minutes before being quick chilled on ice. 5× SuperScript II buffer and DTT were then added to the mixture and the sample incubated at 42° C. for 2 minutes. Finally, Superscript II was added to the reaction mixture and then the sample was incubated at 42° C. for 50 mins and then at 70° C. for 15 minutes. The cDNA product (SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9) hybridised to the mRNA (SEQ ID NO: 8) was then purified using SPRI beads (Agencourt AMPure).
- Both the starting mRNA material (SEQ ID NO: 8) and the cDNA product (SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9, where SEQ ID NO: 11 is hybridised to SEQ ID NO: 8) were analysed for mass/vol and DNA-RNA hybrid length using a Nanodrop and Agilent BioAnalyzer (12 k Agilent Chip).
- This procedure was also used to produce cDNA by reverse transcribing mRNA from yeast (used in Example 3). In the above protocol mRNA (SEQ ID NO: 8) was replaced with Saccharomyces cerevisiae PolyA+ messenger RNA (1 mg/ml), which was purchased from Clontech and used as received.
- This example describes the characterisation of cDNA (SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9) using a nanopore system. See
FIG. 2 for a cartoon representation of the system. - Prior to setting up the experiment, cDNA/mRNA (0.05 nM, SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9, where SEQ ID NO: 11 is hybridised to SEQ ID NO: 8) was hybridised to the tether (0.25 nM, SEQ ID NO: 12) by heating the sample at 40° C. in buffer (10 mM TRIS, 50 mM NaCl, pI17.5) for two minutes and then slow cooling to 30° C. over 15 minutes. This is shown as step D in
FIG. 1 . The cDNA/mRNA sample was then pre-incubated with T4 Dda-E94C/A360C (125 nM, SEQ ID NO: 13 with mutations E94C/A360C) for an hour at room temperature in buffer (126.5 mM KCl, 25 mM NaCl, 25 mM potassium phosphate pH 7.5-8.0 and 5 mM Tris). - Electrical measurements were acquired at 25-30° C. from single MspA nanopores (MS(B1-G75S/G77S/L88N/Q126R)8 MspA (SEQ ID NO: 2 with mutations G75S/G77S/L88N/Q126R) inserted in block co-polymer in buffer (600 mM KCl, 25 mM potassium phosphate, 75 mM Potassium Ferrocyanide (II), 25 mM Potassium ferricyanide (III), pH 8). After achieving a single pore inserted in the block co-polymer, then buffer (1 mL, 600 mM KCl, 25 mM potassium phosphate, 75 mM Potassium Ferrocyanide (II), 25 mM Potassium ferricyanide (III), pH 8) was flowed through the system to remove any excess MspA nanopores (MS(B1-G75S/G77S/L88N/Q126R)8 MspA (SEQ ID NO: 2 with mutations G75S/G77S/L88N/Q126R). MgCl2 (10 mM final concentration) and ATP (5 mM final concentration) were mixed together with buffer (600 mM KCl, 25 mM potassium phosphate, 75 mM Potassium Ferrocyanide (II), 25 mM Potassium ferricyanide (III), pH 8) and then added to the cDNA/mRNA (0.05 nM final concentration), T4 Dda-E94C/A360C (1 nM final concentration, SEQ ID NO: 13 with mutations E94C/A360C) pre-mix. The pre-mix was then added to the nanopore experimental system. Experiments were carried out for eighteen hours at an applied potential of +120 mV and helicase-controlled cDNA movement was monitored.
- Helicase controlled DNA movement was observed for the cDNA (SEQ ID NO: 11 attached at its 5′ end to the 3′ end of SEQ ID NO: 10 which is attached by its 5′ end to four iSpC3 spacers which are attached to the 3′ end of SEQ ID NO: 9, where SEQ ID NO: 11 is hybridised to SEQ ID NO: 8). An example of a helicase-controlled DNA movement is shown in
FIG. 3 . A number of features in the electrical read out are identified as the helicase controls the cDNA movement through the nanopore. The region labelled 1 corresponds to the capture tail, the region labelled 2 corresponds to the iSpC3 spacers in the primer, the region labelled 3 corresponds to the polyT primer for the reverse transcriptase and the region labelled 4 corresponds to of the cDNA region. This example shows characterisation of cDNA which was transcribed from mRNA (SEQ ID NO: 8) as described in Example 1. - This example describes the characterisation of cDNA using a nanopore system, where the cDNA was transcribed from mRNA found in yeast (Saccharomyces cerevisiae). See
FIG. 2 for a cartoon representation of the system. - The materials and methods procedure described in Example 2 was repeated for the cDNA (0.05 nM) transcribed from yeast (Saccharomyces cerevisiae).
- Enzyme-controlled translocation of cDNA through the nanopore was observed for cDNA transcribed from yeast (Saccharomyces cerevisiae) polyA+mRNA. Examples of helicase-controlled translocation of cDNA from yeast are shown in
FIGS. 4 and 5 .FIG. 4 shows an example of a complete yeast cDNA trace.FIG. 5 shows the beginning of a yeast cDNA trace and identifies features in the electrical signal that reflect key sequences in the custom pT primer (labelled inFIGS. 5 as—1=capture tail, 2=the iSpC3 spacers in the primer, 3=polyT primer for the reverse transcriptase and 4=region of cDNA). This example shows characterisation of cDNA which was transcribed from yeast (Saccharomyces cerevisiae) polyA+ mRNA as described in Example 1.
Claims (22)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/397,320 US20230132387A9 (en) | 2013-10-18 | 2021-08-09 | Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB1318465.0A GB201318465D0 (en) | 2013-10-18 | 2013-10-18 | Method |
GB1318465.0 | 2013-10-18 | ||
PCT/GB2014/053121 WO2015056028A1 (en) | 2013-10-18 | 2014-10-17 | Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
US201615028637A | 2016-04-11 | 2016-04-11 | |
US17/397,320 US20230132387A9 (en) | 2013-10-18 | 2021-08-09 | Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/028,637 Continuation US11111532B2 (en) | 2013-10-18 | 2014-10-17 | Method of characterizing a target ribonucleic acid (RNA) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
PCT/GB2014/053121 Continuation WO2015056028A1 (en) | 2013-10-18 | 2014-10-17 | Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
Publications (2)
Publication Number | Publication Date |
---|---|
US20220098657A1 US20220098657A1 (en) | 2022-03-31 |
US20230132387A9 true US20230132387A9 (en) | 2023-04-27 |
Family
ID=49727005
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/028,637 Active 2036-01-05 US11111532B2 (en) | 2013-10-18 | 2014-10-17 | Method of characterizing a target ribonucleic acid (RNA) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
US17/397,320 Pending US20230132387A9 (en) | 2013-10-18 | 2021-08-09 | Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/028,637 Active 2036-01-05 US11111532B2 (en) | 2013-10-18 | 2014-10-17 | Method of characterizing a target ribonucleic acid (RNA) comprising forming a complementary polynucleotide which moves through a transmembrane pore |
Country Status (6)
Country | Link |
---|---|
US (2) | US11111532B2 (en) |
EP (2) | EP4006168A1 (en) |
AU (1) | AU2014335893B2 (en) |
CA (1) | CA2927728C (en) |
GB (1) | GB201318465D0 (en) |
WO (1) | WO2015056028A1 (en) |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BRPI1007214A8 (en) | 2009-01-30 | 2022-12-20 | Oxford Nanopore Tech Ltd | NUCLEIC ACID SEQUENCING ADAPTER, ADAPTER PAIR, KIT, NUCLEIC ACID CONSTRUCT, AND, METHODS FOR PREPARING AN ADAPTER, NUCLEIC ACID CONSTRUCT, AND DEQUENCY CONSTRUCT, AND FOR SEQUENCING DOUBLE-STRANDED NUCLEIC ACID. |
US9957560B2 (en) | 2011-07-25 | 2018-05-01 | Oxford Nanopore Technologies Ltd. | Hairpin loop method for double strand polynucleotide sequencing using transmembrane pores |
US9758823B2 (en) | 2011-10-21 | 2017-09-12 | Oxford Nanopore Technologies Limited | Enzyme method |
CA2864035C (en) | 2012-02-15 | 2021-05-18 | Oxford Nanopore Technologies Limited | A method for determining the presence of an analyte using an aptamer |
AU2013291763B2 (en) | 2012-07-19 | 2019-06-20 | Oxford Nanopore Technologies Limited | Modified helicases |
US11155860B2 (en) | 2012-07-19 | 2021-10-26 | Oxford Nanopore Technologies Ltd. | SSB method |
EP2895618B1 (en) | 2012-09-14 | 2017-07-26 | Oxford Nanopore Technologies Limited | Sample preparation method |
EP2917366B1 (en) | 2012-11-06 | 2017-08-02 | Oxford Nanopore Technologies Limited | Quadruplex method |
GB201314695D0 (en) | 2013-08-16 | 2013-10-02 | Oxford Nanopore Tech Ltd | Method |
GB201318465D0 (en) | 2013-10-18 | 2013-12-04 | Oxford Nanopore Tech Ltd | Method |
EP2964779B1 (en) | 2013-03-08 | 2018-08-29 | Oxford Nanopore Technologies Limited | Use of spacer elements in a nucleic acid to control movement of a helicase |
EP3575410A3 (en) | 2013-10-18 | 2020-03-04 | Oxford Nanopore Technologies Limited | Modified enzymes |
GB201403096D0 (en) | 2014-02-21 | 2014-04-09 | Oxford Nanopore Tech Ltd | Sample preparation method |
GB201417712D0 (en) | 2014-10-07 | 2014-11-19 | Oxford Nanopore Tech Ltd | Method |
GB201418159D0 (en) | 2014-10-14 | 2014-11-26 | Oxford Nanopore Tech Ltd | Method |
US10480026B2 (en) | 2014-10-17 | 2019-11-19 | Oxford Nanopore Technologies Ltd. | Method for nanopore RNA characterisation |
GB201609220D0 (en) | 2016-05-25 | 2016-07-06 | Oxford Nanopore Tech Ltd | Method |
GB201707140D0 (en) * | 2017-05-04 | 2017-06-21 | Oxford Nanopore Tech Ltd | Method |
WO2019133756A1 (en) | 2017-12-29 | 2019-07-04 | Clear Labs, Inc. | Automated priming and library loading device |
GB201807793D0 (en) | 2018-05-14 | 2018-06-27 | Oxford Nanopore Tech Ltd | Method |
JP2020031557A (en) * | 2018-08-28 | 2020-03-05 | 株式会社日立ハイテクノロジーズ | Biomolecule analyzer |
CN113994192B (en) | 2019-05-17 | 2023-12-05 | 上海芯像生物科技有限公司 | Analytical system for molecular detection and sensing |
US11926819B2 (en) | 2019-05-28 | 2024-03-12 | The Regents Of The University Of California | Methods of adding polymers to ribonucleic acids |
CN113711368A (en) | 2020-03-20 | 2021-11-26 | 上海芯像生物科技有限公司 | High throughput analysis system for molecular detection and sensing |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100159526A1 (en) * | 2007-08-17 | 2010-06-24 | Epicentre Technologies Corporation | Selective 5' ligation tagging of rna |
US20120196293A1 (en) * | 2011-01-28 | 2012-08-02 | Kaohsiung Medical University | Method and kit for in vitro diagnosis of atherosclerosis |
US20120267729A1 (en) * | 2011-04-22 | 2012-10-25 | International Business Machines Corporation | Self-sealed fluidic channels for nanopore array |
Family Cites Families (73)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IT995739B (en) | 1972-10-13 | 1975-11-20 | Schuler Gmbh L | MOBILE CHUCK FOR CUTTING SHEET TAPE ON THE MOVE |
US5198543A (en) | 1989-03-24 | 1993-03-30 | Consejo Superior Investigaciones Cientificas | PHI29 DNA polymerase |
KR100289793B1 (en) * | 1992-01-29 | 2001-12-01 | 테츠오 토미나가 | Polynucleotide Fixed Support |
US6267872B1 (en) | 1998-11-06 | 2001-07-31 | The Regents Of The University Of California | Miniature support for thin films containing single channels or nanopores and methods for using same |
US6426231B1 (en) | 1998-11-18 | 2002-07-30 | The Texas A&M University System | Analyte sensing mediated by adapter/carrier molecules |
DE60025739T2 (en) * | 1999-09-07 | 2006-08-31 | The Regents Of The University Of California, Oakland | METHOD FOR PROVIDING THE PRESENCE OF DOUBLE-STRONG DNA IN A SAMPLE |
EP1097993A3 (en) | 1999-11-05 | 2004-01-07 | National Institute of Advanced Industrial Science and Technology | Functional ribozyme chimeric molecules capable of sliding |
US20020197618A1 (en) | 2001-01-20 | 2002-12-26 | Sampson Jeffrey R. | Synthesis and amplification of unstructured nucleic acids for rapid sequencing |
US6706203B2 (en) | 2001-10-30 | 2004-03-16 | Agilent Technologies, Inc. | Adjustable nanopore, nanotome, and nanotweezer |
WO2004041843A2 (en) * | 2002-05-17 | 2004-05-21 | Board Of Regents, The University Of Texas System | Hop - a novel cardiac-restricted transcriptional factor potentially useful for cardiac regeneration and specification |
US7282328B2 (en) | 2002-09-20 | 2007-10-16 | New England Biolabs, Inc. | Helicase dependent amplification of nucleic acids |
CN101260429B (en) * | 2003-01-17 | 2011-03-30 | 香港中文大学 | Circulating mRNA as diagnostic markers of pregnancy related disorders |
US20050250094A1 (en) | 2003-05-30 | 2005-11-10 | Nanosphere, Inc. | Method for detecting analytes based on evanescent illumination and scatter-based detection of nanoparticle probe complexes |
US7347921B2 (en) | 2003-07-17 | 2008-03-25 | Agilent Technologies, Inc. | Apparatus and method for threading a biopolymer through a nanopore |
WO2006028508A2 (en) * | 2004-03-23 | 2006-03-16 | President And Fellows Of Harvard College | Methods and apparatus for characterizing polynucleotides |
WO2005124888A1 (en) | 2004-06-08 | 2005-12-29 | President And Fellows Of Harvard College | Suspended carbon nanotube field effect transistor |
GB0505971D0 (en) | 2005-03-23 | 2005-04-27 | Isis Innovation | Delivery of molecules to a lipid bilayer |
US9233846B2 (en) | 2005-10-14 | 2016-01-12 | The Regents Of The University Of California | Formation and encapsulation of molecular bilayer and monolayer membranes |
US20070190543A1 (en) | 2005-11-14 | 2007-08-16 | Applera Corporation | Coded Molecules for Detecting Target Analytes |
GB0523282D0 (en) | 2005-11-15 | 2005-12-21 | Isis Innovation | Methods using pores |
CA2642854A1 (en) | 2006-02-18 | 2007-08-30 | Michael Strathmann | Massively multiplexed sequencing |
WO2007117832A2 (en) | 2006-03-12 | 2007-10-18 | Applera Corporation | Methods of detecting target nucleic acids |
US20110118187A1 (en) | 2006-10-19 | 2011-05-19 | Duke University | Reversible platelet inhibition |
EP2126588A1 (en) | 2007-02-20 | 2009-12-02 | Oxford Nanopore Technologies Limited | Formation of lipid bilayers |
CA2684801C (en) | 2007-04-04 | 2017-10-10 | The Regents Of The University Of California | Compositions, devices, systems, and methods for using a nanopore |
EP2195648B1 (en) | 2007-09-12 | 2019-05-08 | President and Fellows of Harvard College | High-resolution molecular graphene sensor comprising an aperture in the graphene layer |
US20090191553A1 (en) | 2007-10-01 | 2009-07-30 | Applied Biosystems Inc. | Chase Ligation Sequencing |
US9315804B2 (en) | 2007-10-22 | 2016-04-19 | Caris Life Sciences Switzerland Holdings, GmbH | Method of selecting aptamers |
GB0724736D0 (en) | 2007-12-19 | 2008-01-30 | Oxford Nanolabs Ltd | Formation of layers of amphiphilic molecules |
AU2009258033A1 (en) | 2008-03-12 | 2009-12-17 | Syracuse University | Direct selection of structurally defined aptamers |
US8628940B2 (en) | 2008-09-24 | 2014-01-14 | Pacific Biosciences Of California, Inc. | Intermittent detection during analytical reactions |
CA2730068A1 (en) | 2008-07-07 | 2010-01-14 | Oxford Nanopore Technologies Limited | Base-detecting pore |
WO2010004265A1 (en) | 2008-07-07 | 2010-01-14 | Oxford Nanopore Technologies Limited | Enzyme-pore constructs |
US20100092960A1 (en) * | 2008-07-25 | 2010-04-15 | Pacific Biosciences Of California, Inc. | Helicase-assisted sequencing with molecular beacons |
WO2010047797A2 (en) | 2008-10-21 | 2010-04-29 | Research Foundation Of The City University Of New York | Single quantum-dot-based aptameric sensors |
US20110008311A2 (en) * | 2008-11-03 | 2011-01-13 | The Board Of Trustees Of The University Of Illinois | Animal Model and Use of 17Beta-Hydroxysteroid Dehydrogenase Type 7 in the Diagnosis of Anencephaly |
GB0820927D0 (en) | 2008-11-14 | 2008-12-24 | Isis Innovation | Method |
US9222082B2 (en) | 2009-01-30 | 2015-12-29 | Oxford Nanopore Technologies Limited | Hybridization linkers |
BRPI1007214A8 (en) | 2009-01-30 | 2022-12-20 | Oxford Nanopore Tech Ltd | NUCLEIC ACID SEQUENCING ADAPTER, ADAPTER PAIR, KIT, NUCLEIC ACID CONSTRUCT, AND, METHODS FOR PREPARING AN ADAPTER, NUCLEIC ACID CONSTRUCT, AND DEQUENCY CONSTRUCT, AND FOR SEQUENCING DOUBLE-STRANDED NUCLEIC ACID. |
GB0905140D0 (en) | 2009-03-25 | 2009-05-06 | Isis Innovation | Method |
US8986928B2 (en) | 2009-04-10 | 2015-03-24 | Pacific Biosciences Of California, Inc. | Nanopore sequencing devices and methods |
KR20110138286A (en) | 2009-04-20 | 2011-12-26 | 옥스포드 나노포어 테크놀로지즈 리미티드 | Lipid bilayer sensor array |
CA2781581C (en) | 2009-12-01 | 2018-03-06 | Oxford Nanopore Technologies Limited | Biochemical analysis instrument |
US9315857B2 (en) * | 2009-12-15 | 2016-04-19 | Cellular Research, Inc. | Digital counting of individual molecules by stochastic attachment of diverse label-tags |
WO2011103424A2 (en) | 2010-02-19 | 2011-08-25 | The Trustees Of The University Of Pennsylvania | High-resolution analysis devices and related methods |
US8557529B2 (en) | 2010-04-09 | 2013-10-15 | International Business Machines Corporation | Nanopore capture system |
JP6016792B2 (en) * | 2010-07-14 | 2016-10-26 | ザ キュレイターズ オブ ザ ユニバーシティ オブ ミズーリ | Single molecule detection of nanopore-promoting nucleic acid |
CN103392008B (en) | 2010-09-07 | 2017-10-20 | 加利福尼亚大学董事会 | Movement by continuation enzyme with the precision controlling DNA of a nucleotides in nano-pore |
GB2500360B (en) | 2010-12-22 | 2019-10-23 | Genia Tech Inc | Nanopore-based single DNA molecule characterization, identification and isolation using speed bumps |
US9751915B2 (en) | 2011-02-11 | 2017-09-05 | Oxford Nanopore Technologies Ltd. | Mutant pores |
WO2012113780A1 (en) | 2011-02-21 | 2012-08-30 | Qiagen Gmbh | Method for quantifying human dna |
WO2012129242A2 (en) | 2011-03-23 | 2012-09-27 | Pacific Biosciences Of California, Inc. | Isolation of polymerase-nucleic acid complexes and loading onto substrates |
EP3848706B1 (en) | 2011-05-27 | 2023-07-19 | Oxford Nanopore Technologies PLC | Coupling method |
US9957560B2 (en) | 2011-07-25 | 2018-05-01 | Oxford Nanopore Technologies Ltd. | Hairpin loop method for double strand polynucleotide sequencing using transmembrane pores |
WO2013016486A1 (en) * | 2011-07-27 | 2013-01-31 | The Board Of Trustees Of The University Of Illinois | Nanopore sensors for biomolecular characterization |
US9758823B2 (en) | 2011-10-21 | 2017-09-12 | Oxford Nanopore Technologies Limited | Enzyme method |
CN102430153A (en) | 2011-12-24 | 2012-05-02 | 中国人民解放军第三军医大学第二附属医院 | Crosslinking-technology-based bionic bone interface, and construction method and application thereof |
CN104136631B (en) | 2011-12-29 | 2017-03-01 | 牛津纳米孔技术公司 | Method using XPD unwindase characterising polynucleotides |
AU2012360244B2 (en) | 2011-12-29 | 2018-08-23 | Oxford Nanopore Technologies Limited | Enzyme method |
CA2864035C (en) | 2012-02-15 | 2021-05-18 | Oxford Nanopore Technologies Limited | A method for determining the presence of an analyte using an aptamer |
BR102012007246A2 (en) | 2012-03-30 | 2016-02-10 | Fundação Antonio Prudente | methods for predicting and classifying event outcomes |
AU2013246702B2 (en) | 2012-04-10 | 2017-06-15 | Oxford Nanopore Technologies Limited | Mutant lysenin pores |
US11155860B2 (en) | 2012-07-19 | 2021-10-26 | Oxford Nanopore Technologies Ltd. | SSB method |
AU2013291765C1 (en) | 2012-07-19 | 2019-08-08 | Oxford Nanopore Technologies Limited | Enzyme construct |
AU2013291763B2 (en) | 2012-07-19 | 2019-06-20 | Oxford Nanopore Technologies Limited | Modified helicases |
EP2895618B1 (en) | 2012-09-14 | 2017-07-26 | Oxford Nanopore Technologies Limited | Sample preparation method |
EP2917366B1 (en) | 2012-11-06 | 2017-08-02 | Oxford Nanopore Technologies Limited | Quadruplex method |
GB201318465D0 (en) | 2013-10-18 | 2013-12-04 | Oxford Nanopore Tech Ltd | Method |
EP2964779B1 (en) | 2013-03-08 | 2018-08-29 | Oxford Nanopore Technologies Limited | Use of spacer elements in a nucleic acid to control movement of a helicase |
EP3575410A3 (en) | 2013-10-18 | 2020-03-04 | Oxford Nanopore Technologies Limited | Modified enzymes |
GB201406151D0 (en) | 2014-04-04 | 2014-05-21 | Oxford Nanopore Tech Ltd | Method |
US9925679B2 (en) | 2014-05-19 | 2018-03-27 | I+D+M Creative, Llc | Devices and methods for assisting with slicing items |
US10480026B2 (en) | 2014-10-17 | 2019-11-19 | Oxford Nanopore Technologies Ltd. | Method for nanopore RNA characterisation |
-
2013
- 2013-10-18 GB GBGB1318465.0A patent/GB201318465D0/en not_active Ceased
-
2014
- 2014-10-17 EP EP21208188.9A patent/EP4006168A1/en active Pending
- 2014-10-17 CA CA2927728A patent/CA2927728C/en active Active
- 2014-10-17 EP EP14792535.8A patent/EP3058090B1/en active Active
- 2014-10-17 US US15/028,637 patent/US11111532B2/en active Active
- 2014-10-17 WO PCT/GB2014/053121 patent/WO2015056028A1/en active Application Filing
- 2014-10-17 AU AU2014335893A patent/AU2014335893B2/en active Active
-
2021
- 2021-08-09 US US17/397,320 patent/US20230132387A9/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100159526A1 (en) * | 2007-08-17 | 2010-06-24 | Epicentre Technologies Corporation | Selective 5' ligation tagging of rna |
US20120196293A1 (en) * | 2011-01-28 | 2012-08-02 | Kaohsiung Medical University | Method and kit for in vitro diagnosis of atherosclerosis |
US20120267729A1 (en) * | 2011-04-22 | 2012-10-25 | International Business Machines Corporation | Self-sealed fluidic channels for nanopore array |
Also Published As
Publication number | Publication date |
---|---|
WO2015056028A1 (en) | 2015-04-23 |
EP4006168A1 (en) | 2022-06-01 |
US11111532B2 (en) | 2021-09-07 |
EP3058090B1 (en) | 2021-12-22 |
US20160251710A1 (en) | 2016-09-01 |
GB201318465D0 (en) | 2013-12-04 |
AU2014335893A1 (en) | 2016-05-05 |
EP3058090A1 (en) | 2016-08-24 |
AU2014335893B2 (en) | 2020-12-10 |
CA2927728C (en) | 2023-02-21 |
CA2927728A1 (en) | 2015-04-23 |
US20220098657A1 (en) | 2022-03-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230132387A9 (en) | Method of characterizing a target ribonucleic acid (rna) comprising forming a complementary polynucleotide which moves through a transmembrane pore | |
US11021747B2 (en) | Method for nanopore RNA characterisation | |
US20230212665A1 (en) | Enzyme stalling method | |
US20200131549A1 (en) | Polynucleotide modification methods | |
EP2895618B1 (en) | Sample preparation method | |
WO2018060740A1 (en) | Method for nucleic acid detection by guiding through a nanopore | |
WO2014072703A1 (en) | Quadruplex method | |
EP3259281A1 (en) | Hetero-pores |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
AS | Assignment |
Owner name: OXFORD NANOPORE TECHNOLOGIES PLC, UNITED KINGDOM Free format text: CHANGE OF NAME;ASSIGNOR:OXFORD NANOPORE TECHNOLOGIES LIMITED;REEL/FRAME:058737/0664 Effective date: 20210924 Owner name: OXFORD NANOPORE TECHNOLOGIES LTD., UNITED KINGDOM Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BROWN, CLIVE GAVIN;TURNER, DANIEL JOHN;WHITE, JAMES;REEL/FRAME:058657/0259 Effective date: 20160518 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |