CN103233072B - High-flux mythelation detection technology for DNA (deoxyribonucleic acid) of complete genome - Google Patents
High-flux mythelation detection technology for DNA (deoxyribonucleic acid) of complete genome Download PDFInfo
- Publication number
- CN103233072B CN103233072B CN201310163085.0A CN201310163085A CN103233072B CN 103233072 B CN103233072 B CN 103233072B CN 201310163085 A CN201310163085 A CN 201310163085A CN 103233072 B CN103233072 B CN 103233072B
- Authority
- CN
- China
- Prior art keywords
- methylation
- genome
- dna
- sequencing
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 21
- 108020004414 DNA Proteins 0.000 title claims description 31
- 238000005516 engineering process Methods 0.000 title abstract description 12
- 102000053602 DNA Human genes 0.000 title description 4
- 230000011987 methylation Effects 0.000 claims abstract description 61
- 238000007069 methylation reaction Methods 0.000 claims abstract description 61
- 238000012163 sequencing technique Methods 0.000 claims abstract description 31
- 230000007067 DNA methylation Effects 0.000 claims abstract description 15
- 239000012634 fragment Substances 0.000 claims description 20
- 238000012408 PCR amplification Methods 0.000 claims description 13
- 108010042407 Endonucleases Proteins 0.000 claims description 12
- 102000004533 Endonucleases Human genes 0.000 claims description 12
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 4
- 101150008223 SLX1 gene Proteins 0.000 claims description 4
- 101100333298 Schizosaccharomyces pombe (strain 972 / ATCC 24843) eme1 gene Proteins 0.000 claims description 4
- 239000002773 nucleotide Substances 0.000 claims description 4
- 125000003729 nucleotide group Chemical group 0.000 claims description 4
- 230000003321 amplification Effects 0.000 claims description 3
- 230000027455 binding Effects 0.000 claims description 3
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 3
- 238000000034 method Methods 0.000 abstract description 14
- 230000004048 modification Effects 0.000 abstract description 9
- 238000012986 modification Methods 0.000 abstract description 9
- 230000001419 dependent effect Effects 0.000 abstract description 7
- 238000012165 high-throughput sequencing Methods 0.000 abstract description 7
- 230000008569 process Effects 0.000 abstract description 6
- 230000029087 digestion Effects 0.000 abstract description 4
- 108091008146 restriction endonucleases Proteins 0.000 abstract description 4
- 230000006862 enzymatic digestion Effects 0.000 abstract description 2
- 239000000047 product Substances 0.000 description 19
- 238000006243 chemical reaction Methods 0.000 description 16
- 235000020637 scallop Nutrition 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 7
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical group NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 7
- 241000237509 Patinopecten sp. Species 0.000 description 6
- 108090000623 proteins and genes Proteins 0.000 description 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 4
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 4
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 238000001369 bisulfite sequencing Methods 0.000 description 4
- 239000000872 buffer Substances 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- ZKHQWZAMYRWXGA-UHFFFAOYSA-N Adenosine triphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)C(O)C1O ZKHQWZAMYRWXGA-UHFFFAOYSA-N 0.000 description 3
- 241000237503 Pectinidae Species 0.000 description 3
- 238000000137 annealing Methods 0.000 description 3
- 229940104302 cytosine Drugs 0.000 description 3
- 238000001976 enzyme digestion Methods 0.000 description 3
- 239000002244 precipitate Substances 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 150000001413 amino acids Chemical class 0.000 description 2
- 235000021466 carotenoid Nutrition 0.000 description 2
- 150000001747 carotenoids Chemical class 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000003248 enzyme activator Substances 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 238000001114 immunoprecipitation Methods 0.000 description 2
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 2
- 210000003205 muscle Anatomy 0.000 description 2
- 238000007481 next generation sequencing Methods 0.000 description 2
- 229920002401 polyacrylamide Polymers 0.000 description 2
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 235000015170 shellfish Nutrition 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- LRSASMSXMSNRBT-UHFFFAOYSA-N 5-methylcytosine Chemical compound CC1=CNC(=O)N=C1N LRSASMSXMSNRBT-UHFFFAOYSA-N 0.000 description 1
- ZKHQWZAMYRWXGA-KQYNXXCUSA-J ATP(4-) Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)[C@H]1O ZKHQWZAMYRWXGA-KQYNXXCUSA-J 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 1
- 108091029430 CpG site Proteins 0.000 description 1
- 102100036279 DNA (cytosine-5)-methyltransferase 1 Human genes 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 101000931098 Homo sapiens DNA (cytosine-5)-methyltransferase 1 Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 108091036060 Linker DNA Proteins 0.000 description 1
- UPYKUZBSLRQECL-UKMVMLAPSA-N Lycopene Natural products CC(=C/C=C/C=C(C)/C=C/C=C(C)/C=C/C1C(=C)CCCC1(C)C)C=CC=C(/C)C=CC2C(=C)CCCC2(C)C UPYKUZBSLRQECL-UKMVMLAPSA-N 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 102000055027 Protein Methyltransferases Human genes 0.000 description 1
- 108700040121 Protein Methyltransferases Proteins 0.000 description 1
- -1 S-adenosylmethylthio Chemical group 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-N Sulfurous acid Chemical compound OS(O)=O LSNNMFCWUKXFEE-UHFFFAOYSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 150000001746 carotenes Chemical class 0.000 description 1
- 235000005473 carotenes Nutrition 0.000 description 1
- 230000003915 cell function Effects 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007405 data analysis Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000013020 embryo development Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000007608 epigenetic mechanism Effects 0.000 description 1
- 230000004049 epigenetic modification Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 230000011365 genetic imprinting Effects 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- VMGAPWLDMVPYIA-HIDZBRGKSA-N n'-amino-n-iminomethanimidamide Chemical compound N\N=C\N=N VMGAPWLDMVPYIA-HIDZBRGKSA-N 0.000 description 1
- 108020004707 nucleic acids Proteins 0.000 description 1
- 102000039446 nucleic acids Human genes 0.000 description 1
- 150000007523 nucleic acids Chemical class 0.000 description 1
- 230000001293 nucleolytic effect Effects 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 150000002989 phenols Chemical class 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- BOLDJAUMGUJJKM-LSDHHAIUSA-N renifolin D Natural products CC(=C)[C@@H]1Cc2c(O)c(O)ccc2[C@H]1CC(=O)c3ccc(O)cc3O BOLDJAUMGUJJKM-LSDHHAIUSA-N 0.000 description 1
- 230000033458 reproduction Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000008223 sterile water Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- NCYCYZXNIZJOKI-UHFFFAOYSA-N vitamin A aldehyde Natural products O=CC=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C NCYCYZXNIZJOKI-UHFFFAOYSA-N 0.000 description 1
Images
Landscapes
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
本发明的目的在于提供一种高通量全基因组DNA甲基化检测方法,用甲基化修饰依赖性限制性酶FspEI对基因组DNA进行酶切,基因组中CG 或 CHG 位点上的甲基化修饰均可以被 FspEI 识别,酶切后产生具有核心甲基化位点的等长标签,对标签文库进行高通量测序技术能够获得全基因组范围内的甲基化位点序列信息。本方法可以实现甲基化修饰位点的准确定位,通过酶切富集基因组甲基化位点,直接对甲基化标签序列进行测序和定量,可以有效降低测序费用。实验流程仅需两天,具有通量高、操作简便、成本低廉、可靠性好等优点,是一种优良的适用于非模式生物的高通量全基因组甲基化检测方法。
The purpose of the present invention is to provide a high-throughput genome-wide DNA methylation detection method, which uses the methylation modification-dependent restriction enzyme FspEI to digest the genomic DNA, and the methylation at the CG or CHG site in the genome All modifications can be recognized by FspEI, and after digestion, an equal-length tag with a core methylation site is generated. High-throughput sequencing technology for the tag library can obtain genome-wide methylation site sequence information. This method can realize accurate positioning of methylation modification sites, enrich genome methylation sites by enzymatic digestion, and directly sequence and quantify methylation tag sequences, which can effectively reduce sequencing costs. The experimental process only takes two days, and has the advantages of high throughput, simple operation, low cost, and good reliability. It is an excellent high-throughput genome-wide methylation detection method suitable for non-model organisms.
Description
技术领域 technical field
本发明属于DNA甲基化检测技术领域,具体涉及一种高通量全基因组DNA甲基化检测技术。The invention belongs to the technical field of DNA methylation detection, and in particular relates to a high-throughput genome-wide DNA methylation detection technology.
背景技术: Background technique:
DNA甲基化(DNA methylation)是真核生物基因组DNA的一种重要的表观遗传学修饰,即在DNA甲基化转移酶(DNA methyltransferase,DNMT)的作用下,将S-腺苷甲硫氨酸(SAM)的甲基集团共价结合到DNA分子的胞嘧啶上形成5-甲基胞嘧啶(5-mC)的过程。DNA甲基化在维持高等生物正常细胞功能、遗传印记、胚胎发育、衰老以及人类肿瘤的发生等生物学过程中起着重要作用。在无脊椎动物中,基因组DNA甲基化通过调控基因的表达模式,参与调节机体对环境的适应过程。因此,获得全基因组范围内所有胞嘧啶位点的甲基化数据,对于表观遗传学的时空特异性研究具有重要意义。DNA methylation is an important epigenetic modification of eukaryotic genomic DNA, that is, under the action of DNA methyltransferase (DNA methyltransferase, DNMT), S-adenosylmethylthio The process in which the methyl group of amino acid (SAM) is covalently bound to cytosine of DNA molecule to form 5-methylcytosine (5- mC ). DNA methylation plays an important role in the maintenance of normal cell function in higher organisms, genetic imprinting, embryonic development, aging and the occurrence of human tumors and other biological processes. In invertebrates, genomic DNA methylation is involved in regulating the adaptation process of the organism to the environment by regulating the expression pattern of genes. Therefore, obtaining the methylation data of all cytosine sites in the whole genome is of great significance for the study of spatiotemporal specificity of epigenetics.
随着甲基化研究的不断深入,已经有多种甲基化分析方法以满足不同甲基化研究的需要。在针对全基因范围DNA甲基化的研究中,检测的手段主要是基于芯片平台的全基因组甲基化位点的筛选和基于高通量测序平台的甲基化图谱分析。其中芯片技术在模式生物的甲基化研究中是相对完善成熟的检测工具,具有高覆盖度、方便快捷,性价比高等特点,如 Illumina推出的人类全基因组甲基化芯片Human Methylation HD 450包含45万个CpG位点,能够覆盖所有NCBI注释的基因。操作流程简单,能够进行高通量甲基化位点的准确检测。但针对遗传信息相对匮乏的非模式生物,其芯片造价昂贵,甲基化位点选择的灵活性不高,因而难以利用现有芯片平台对非模式生物进行高通量全基因组甲基化的研究。With the deepening of methylation research, there have been a variety of methylation analysis methods to meet the needs of different methylation research. In the research on DNA methylation in the whole gene range, the means of detection are mainly the screening of genome-wide methylation sites based on chip platform and the analysis of methylation map based on high-throughput sequencing platform. Among them, chip technology is a relatively complete and mature detection tool in the methylation research of model organisms. It has the characteristics of high coverage, convenience, and high cost performance. For example, the Human Methylation HD 450 chip launched by Illumina contains 450,000 CpG sites, able to cover all NCBI annotated genes. The operation procedure is simple, and it can accurately detect high-throughput methylation sites. However, for non-model organisms with relatively scarce genetic information, the cost of the chip is expensive, and the flexibility of methylation site selection is not high, so it is difficult to use the existing chip platform to conduct high-throughput genome-wide methylation research on non-model organisms .
随着新一代测序技术通量不断提高和成本的降低,目前有很多基于二代测序平台的全基因组甲基化检测方法以得到应用,包括有全基因组重亚硫酸盐测序(whole genome bisulfite sequencing,BS-seq)和简化的表达重亚硫酸盐测序 (reduced representation bisulfite sequencing, RRBS)、甲基化DNA 免疫共沉淀测序(methylated DNA immunoprecipitation sequencing, MeDIP-seq)、甲基化DNA 富集结合高通量测序(methylated DNA binding domainsequencing, MBD-Seq) 和甲基化敏感性限制酶测序(methylation-sensitive Restriction Enzyme sequencing,MRE-seq)等。这些技术都有其优缺点和适用范围,如基于亚硫氢酸盐处理的DNA甲基化检测方法, 尽管作为DNA甲基化检测的金标准,但操作复杂(重亚硫氢酸盐处理),测序所需的成本较高,不适用于贝类等基因组较大的物种。 MeDIP-Seq是对特异性抗体富集的基因组上的甲基化区域进行高通量测序从而获得全基因组范围的甲基化位点。但该技术需要大量DNA,并且抗体的价格昂贵。With the continuous improvement of next-generation sequencing technology throughput and cost reduction, there are currently many genome-wide methylation detection methods based on next-generation sequencing platforms that have been applied, including whole genome bisulfite sequencing (whole genome bisulfite sequencing, BS-seq) and simplified expression bisulfite sequencing (reduced representation bisulfite sequencing, RRBS), methylated DNA immunoprecipitation sequencing (methylated DNA immunoprecipitation sequencing, MeDIP-seq), methylated DNA enrichment combined with Qualcomm DNA sequencing (methylated DNA binding domain sequencing, MBD-Seq) and methylation-sensitive restriction enzyme sequencing (methylation-sensitive Restriction Enzyme sequencing, MRE-seq), etc. These technologies have their advantages, disadvantages and scope of application, such as the DNA methylation detection method based on sulfite treatment, although it is the gold standard for DNA methylation detection, the operation is complicated (bisulfite treatment) , the cost required for sequencing is high, and it is not suitable for species with large genomes such as shellfish. MeDIP-Seq is a high-throughput sequencing of methylated regions on the genome enriched by specific antibodies to obtain genome-wide methylated sites. But the technique requires large amounts of DNA, and antibodies are expensive.
发明内容 Contents of the invention
本发明的目的在于提供一种高通量全基因组DNA甲基化检测方法,即一种适用于非模式生物的,低成本、简单快速的高通量全基因组甲基化检测方法,以弥补现有技术的不足。The purpose of the present invention is to provide a high-throughput whole-genome DNA methylation detection method, that is, a low-cost, simple and fast high-throughput whole-genome DNA methylation detection method suitable for non-model organisms, to make up for the existing There are technical deficiencies.
本发明的全基因组DNA甲基化检测方法,包括如下的步骤:The whole genome DNA methylation detection method of the present invention comprises the following steps:
1)将基因组DNA用内切酶FspEI酶进行酶切,获得酶切片段;1) Genomic DNA is digested with endonuclease FspEI to obtain digested fragments;
2)将酶切片段的两端分别连接上接头,作为扩增引物的结合点;2) Connect the two ends of the enzyme-digested fragments to adapters as the binding points of the amplification primers;
3)将连接上接头的酶切片段用引物进行第一轮PCR扩增,从而富集接头连接正确的酶切片段;3) Perform the first round of PCR amplification of the enzyme-cut fragments connected with the adapters with primers, so as to enrich the enzyme-cut fragments with the correct adapter ligation;
4)将第一轮PCR扩增产物经凝胶纯化后用引物进行第二轮PCR扩增,引入Barcode来构建测序文库;4) The first round of PCR amplification products were gel-purified, and primers were used for the second round of PCR amplification, and Barcode was introduced to construct a sequencing library;
5)测序文库进行测序;将测序数据进行分析得到全基因组甲基化信息。5) The sequencing library is sequenced; the sequencing data is analyzed to obtain the genome-wide methylation information.
其中,步骤2)中的接头,为接头slx1和slx2,其中构成slx1的两个核苷酸片段,其序列分别为5′-ACACTCTTTCCCTACACGACGCTCTTCCGATCT-3′(SEQ ID NO:1)和3′-CGAGAAGGCTAGANNNNN-5′(SEQ ID NO:2),其中N为碱基A、T、G、C中的任一个;Wherein, the linkers in step 2) are linkers slx1 and slx2, wherein the two nucleotide fragments constituting slx1 have the sequences respectively 5′-ACACTCTTTCCCTACACGACGCTCTTCCGATCT-3′ (SEQ ID NO: 1) and 3′-CGAGAAGGCTAGANNNNN- 5' (SEQ ID NO: 2), wherein N is any one of the bases A, T, G, and C;
构成slx2的两个核苷酸片段,其序列分别为5′-GTGACTGGAGTTCAGACTwo nucleotide fragments constituting slx2, the sequences of which are 5′-GTGACTGGAGTTCAGAC
GTGTGCTCTTCCGATCT-3′(SEQ ID NO:3)和3′-CGAGAAGGCTAGANNNNN-5′ (SEQ ID NO:2)。GTGTGCTCTTCCGATCT-3' (SEQ ID NO: 3) and 3'-CGAGAAGGCTAGANNNNN-5' (SEQ ID NO: 2).
所述的步骤3)中的引物,为Slx-Primer 1和Slx-Primer 2,其核苷酸序列分别为:The primers in the step 3) are Slx-
Slx-Primer1:5'-AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTASlx-Primer1: 5′-AATGATACGGCGACCACCGAGATCTACACTCTTTTCCCTA
CACGAC-3’ (SEQ ID NO:4);CACGAC-3' (SEQ ID NO: 4);
Slx-Primer 2:5'-GTGACTGGAGTTCAGACGTGT-3’ (SEQ ID NO:5);Slx-Primer 2: 5'-GTGACTGGAGTTCAGACGTGT-3' (SEQ ID NO: 5);
所述的步骤4)中的引物为Slx-Primer 1和Slx-Index Primer,其核苷酸序列分别为:The primers in the step 4) are Slx-
Slx-Primer1:5'-AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTASlx-Primer1: 5′-AATGATACGGCGACCACCGAGATCTACACTCTTTTCCCTA
CACGAC-3’ (SEQ ID NO:4);CACGAC-3' (SEQ ID NO: 4);
Slx-Index Primer:Slx-Index Primer:
5'-CAAGCAGAAGACGGCATACGAGATNNNNNNGTGACTGGAGTTCAGACGTGT-3'(SEQ ID NO:6);其中N为碱基A、T、G、C中的任一个。5'-CAAGCAGAAGACGGCATACGAGATNNNNNNGTGACTGGAGTTCAGACGTGT-3' (SEQ ID NO: 6); wherein N is any one of bases A, T, G, and C.
本方法可以实现甲基化修饰位点的准确定位,通过酶切富集基因组甲基化位点,直接对甲基化标签序列进行测序和定量,可以有效降低测序费用。实验流程仅需两天,具有通量高、操作简便、成本低廉、可靠性好等优点,是一种优良的适用于非模式生物的高通量全基因组甲基化检测方法。This method can realize accurate positioning of methylation modification sites, enrich genome methylation sites by enzymatic digestion, and directly sequence and quantify methylation tag sequences, which can effectively reduce sequencing costs. The experimental process only takes two days, and has the advantages of high throughput, simple operation, low cost, and good reliability. It is an excellent high-throughput genome-wide methylation detection method suitable for non-model organisms.
附图说明 Description of drawings
图1:本发明的全基因组甲基化检测方法的流程及原理示意图。Figure 1: Schematic diagram of the process and principle of the genome-wide methylation detection method of the present invention.
具体实施方式: Detailed ways:
本发明开发了基于高通量测序平台的新型全基因组甲基化检测分析技术MethylRAD-Seq(Methylation-dependent restriction-site associated DNA sequencing),结合甲基修饰依赖性内切酶和高通量测序的特点,在不需要基因组背景信息的前提下可以大规模发掘全基因组范围内的甲基化位点,直接精确检测发生甲基化的胞嘧啶位点。该技术的原理主要是利用甲基化修饰依赖性限制性酶FspEI对基因组DNA进行酶切,基因组中CG 或 CHG 位点上的甲基化修饰均可以被FspEI 识别,酶切后产生具有核心甲基化位点的等长标签,对标签文库进行高通量测序技术能够获得全基因组范围内的甲基化位点序列信息。建库流程简便快速,可同时对多个样本进行全基因组DNA甲基化分析。而且,本发明所用的接头和引物可以高效的对基因组DNA进行操作,提高了效率。本发明的方法对于表观遗传学背景相对较少的非模式生物是一种成本较低、操作简便的高通量全基因组甲基化分析方法。The present invention has developed a new genome-wide methylation detection and analysis technology MethylRAD-Seq (Methylation-dependent restriction-site associated DNA sequencing) based on a high-throughput sequencing platform, which combines methyl modification-dependent endonucleases and high-throughput sequencing It is characterized by large-scale mining of methylation sites across the genome without the need for genomic background information, and direct and accurate detection of methylated cytosine sites. The principle of this technology is to use the methylation modification-dependent restriction enzyme FspEI to digest genomic DNA. The methylation modification at the CG or CHG site in the genome can be recognized by FspEI, and the DNA with core formazan is produced after digestion. The high-throughput sequencing technology of the tag library can obtain the sequence information of the methylation site in the whole genome. The library construction process is simple and fast, and the whole genome DNA methylation analysis can be performed on multiple samples at the same time. Moreover, the adapters and primers used in the present invention can efficiently operate on genomic DNA, improving the efficiency. The method of the present invention is a low-cost, easy-to-operate high-throughput genome-wide methylation analysis method for non-model organisms with relatively few epigenetic backgrounds.
对于本发明中所涉及的名词,定义如下:For the terms involved in the present invention, the definitions are as follows:
1、内切酶,又称为核酸内切酶(endonuclease)在核酸水解酶中,为可水解分子链内部磷酸二酯键生成寡核苷酸的酶;本发明所用到的FspEI内切酶是一种甲基化修饰依赖性内切酶,依赖于胞嘧啶位点的甲基化修饰从而对DNA分子产生切割作用。该酶购自NEB(New EnglandBiolabs)有限公司。1. Endonuclease, also known as endonuclease (endonuclease) in nucleolytic enzymes, is an enzyme that can hydrolyze the internal phosphodiester bond of the molecular chain to generate oligonucleotides; the FspEI endonuclease used in the present invention is A methylation-dependent endonuclease that cleaves DNA molecules depending on the methylation of cytosine sites. The enzyme was purchased from NEB (New England Biolabs) Co., Ltd.
2、接头:adaptor DNA ,是一段短的含酶切位点并能与钝性末端或粘性末端匹配的人工合成DNA片段,接头DNA常用于一钝性末端DNA与一粘性末端DNA的连接。有时连接到粘性末端的接头DNA是为了给未知DNA片段提供一段已知的序列,根据其设计引物,扩增未知的DNA片段。2. Adapter: adapter DNA is a short artificially synthesized DNA fragment that contains a restriction site and can be matched with a blunt end or a sticky end. Adapter DNA is often used to connect a blunt end DNA to a sticky end DNA. Sometimes the linker DNA ligated to the cohesive end is to provide a known sequence to the unknown DNA fragment, based on which primers are designed to amplify the unknown DNA fragment.
3、其中N为碱基A、T、G、C中的任一个;其中A、T、G、C代表组成DNA分子的四种脱氧核苷。3. N is any one of the bases A, T, G, and C; wherein A, T, G, and C represent the four deoxynucleosides that make up the DNA molecule.
4、Barcode即一段短的特征序列,对多个样本同时进行高通量测序时,对每条reads上带有的一段特定短序列(即barcode)测序能够准确识别样本来源。4. Barcode is a short characteristic sequence. When high-throughput sequencing is performed on multiple samples at the same time, sequencing a specific short sequence (ie barcode) on each reads can accurately identify the source of the sample.
本发明的方法,包括有如下的步骤:The method of the present invention comprises the following steps:
1)制备生物基因组DNA:提取生物的基因组DNA,4℃冰箱保存备用。1) Preparation of biological genome DNA: extract the genome DNA of the organism and store it in a refrigerator at 4°C for later use.
将提取的基因组DNA利用甲基化修饰依赖性内切酶FspEI酶切基因组,获得全基因组范围内的甲基化标签,其中酶切体系为20μl,包含300ng基因组DNA,4U的FspEI内切酶(NEB),1×NE Buffer4,1×EnzymeActivator Solution,1×BSA,37℃保温4小时。The extracted genomic DNA was cleaved with the methylation modification-dependent endonuclease FspEI to obtain methylation tags in the whole genome. The enzyme digestion system was 20 μl, containing 300ng of genomic DNA, and 4U of FspEI endonuclease ( NEB), 1×NE Buffer4, 1×Enzyme Activator Solution, 1×BSA, kept at 37°C for 4 hours.
2)设计有粘性末端的接头,连接标签2) Adapters designed with sticky ends, ligated tags
酶切反应产生的标签 5′末端都带有一个4碱基突出,设计3’端带4个兼并碱基的接头Slx-Adaptor 1, Slx-Adaptor 2,连接反应体系为20μl,包含10μl上步的酶切产物,800U T4 DNA连接酶(NEB),1×T4 Ligase Buffer,4μM Slex-Ad1,4μM Slx-Ad2,20mM三磷酸腺苷ATP,4℃连接16h。The tags produced by the enzyme digestion reaction have a 4-base overhang at the 5' end, and design adapters Slx-
其中接头的序列信息见表1。The sequence information of the linker is shown in Table 1.
3)进行第一轮PCR扩增,富集标签,其中PCR反应体系为20μL,包含7μl连接了接头的酶切片段作为反应模板,4μM Slx-Primer1引物3) Carry out the first round of PCR amplification to enrich the tags. The PCR reaction system is 20 μL, including 7 μl of enzyme-cut fragments connected with adapters as reaction templates, and 4 μM Slx-Primer1 primer
(5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGAC -3'),4μM Slx-Primer2引物(5'- GTGACTGGAGTTCAGACGTGT -3'),0.3mM dNTPs, 0.4U Phusion 超保真 DNA 聚合酶(NEB),1×HF buffer;反应条件为98℃变性5 s,60℃退火20 s,72℃延伸10 s,进行14-18个循环,最后72℃延伸 5min。PCR产物用8% 非变性聚丙烯酰胺琼凝胶电泳检测,扩增产物大小约为120bp, 切胶回收PCR产物。(5'- AATGATACGGCGACCACCGAGATTCACACTCTTTCCCTACACGAC -3'), 4μM Slx-Primer2 primer (5'- GTGACTGGAGTTCAGACGTGT -3'), 0.3mM dNTPs, 0.4U Phusion ultra-fidelity DNA polymerase (NEB), 1×HF buffer; the reaction conditions are Denaturation at 98°C for 5 s, annealing at 60°C for 20 s, extension at 72°C for 10 s, 14-18 cycles, and finally extension at 72°C for 5 min. The PCR product was detected by 8% non-denaturing polyacrylamide agar gel electrophoresis, the size of the amplified product was about 120bp, and the PCR product was recovered by cutting the gel.
4)Barcode特异性引物二轮PCR扩增4) Second-round PCR amplification with Barcode-specific primers
为了实现多个个体混合测序进行甲基化检测,可以通过对每个个体添加不同的Barcode来区分,利用PCR反应的不同引物引入不同的Barcode。PCR反应体系为20μL,包含25ng一轮PCR扩增纯化产物,4μM Slx-Primer1引物 5’-AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGAC -3' ,In order to realize the mixed sequencing of multiple individuals for methylation detection, it can be distinguished by adding different Barcodes to each individual, and using different primers in the PCR reaction to introduce different Barcodes. The PCR reaction system is 20 μL, including 25ng round PCR amplification purification product, 4μM Slx-Primer1 primer 5'-AATGATACGGCGACCACCGAGATTCACTCTTTCCCTACACGAC-3',
4μM Slx-Index Primer引物:4μM Slx-Index Primer primer:
5'- CAAGCAGAAGACGGCATACGAGATNNNNNNGTGACTGGAGTTCAGACGTGT-3',其中NNNNNN可根据不同的Barcode序列改变),0.3mM dNTPs, 0.4UPhusion 超保真 DNA 聚合酶(NEB),1× HF buffer;反应条件为98℃变性5 s,60℃退火20 s,72℃延伸10 s,进行5-7个循环,最后72℃延伸5min。平行扩增3管,PCR产物用8%非变性聚丙烯酰胺凝胶电泳检测,扩增产物大小约为150bp,利用QIAGEN PCR产物纯化试剂盒回收纯化PCR产物。利用Solexa Hiseq2000测序平台测序,此部分由测序公司完成。5'- CAAGCAGAAGACGGCATACGAGATNNNNNNGTGACTGGAGTTCAGACGTGT-3', where NNNNNN can be changed according to different Barcode sequences), 0.3mM dNTPs, 0.4UPhusion ultra-fidelity DNA polymerase (NEB), 1× HF buffer; the reaction conditions are denaturation at 98°C for 5 s, 60 Anneal at ℃ for 20 s, extend at 72°C for 10 s, perform 5-7 cycles, and finally extend at 72°C for 5 min. Three tubes were amplified in parallel, and the PCR product was detected by 8% non-denaturing polyacrylamide gel electrophoresis. The size of the amplified product was about 150 bp, and the PCR product was recovered and purified using the QIAGEN PCR Product Purification Kit. The Solexa Hiseq2000 sequencing platform was used for sequencing, and this part was completed by the sequencing company.
5)数据分析:5) Data analysis:
本实施例采用的方法如下:The method adopted in this embodiment is as follows:
1、Illunima/Solexa测序产生的结果文件为fastq格式,首先利用SolexaQA软件包对原始序列进行质量过滤,去除含有N的序列以及大于5个碱基的质量值小于10的reads;1. The result file generated by Illunima/Solexa sequencing is in fastq format. First, use the SolexaQA software package to perform quality filtering on the original sequence, and remove sequences containing N and reads with a quality value greater than 5 bases and less than 10;
2、利用CD-HIT软件对短序列进行聚类分析,获得测序文库中的甲基化标签种类以及该代表标签的丰度信息,甲基化位点的覆盖reads数目可以衡量该位点的甲基化水平;2. Use CD-HIT software to cluster and analyze short sequences to obtain the type of methylation tag in the sequencing library and the abundance information of the representative tag. The number of covered reads at the methylation site can measure the methylation of the site. Basic level;
3、利用SOAP软件将甲基化位点的序列比对基因组参考序列,可以获得该位点的基因组来源信息。3. Using SOAP software to compare the sequence of the methylation site with the genome reference sequence, the genome source information of the site can be obtained.
表1本发明中涉及的接头及引物序列表Linker and primer sequence list involved in the present invention in table 1
下面以虾夷扇贝为例通过实施例详细叙述本发明,对于本发明所用的试剂,本领域的技术人员可以根据本发明的技术方案,在现有试剂中进行选择,而不仅限于本发明具体实施例的限制。The present invention is described in detail below by taking the scallop as an example. For the reagents used in the present invention, those skilled in the art can select from the existing reagents according to the technical scheme of the present invention, and are not limited to the specific implementation of the present invention. Example limitations.
1)提取扇贝基因组DNA1) Extraction of scallop genome DNA
取II龄野生群体的虾夷扇贝和海大金贝各12只,每个个体闭壳肌约0.1克,加入500ulSTE裂解缓冲液(NaCl:100mM;EDTA:1mM,pH=8.0;Tris-HCl,10nM,pH=8.0),剪碎,再加入50μl 10%的SDS,以及5μl蛋白酶K(20mg/ml),56℃水浴消化,至组织碎块完全裂解,裂解液澄清。加入等体积饱和酚(250μl)以及氯仿/异戊醇(24:1)(250μl),抽提3次,取上清液,加入等体积氯仿/异戊醇(24:1)(500ul)抽提1次,取上清液,加入1/10体积NaAc(3M,pH 5.2)(50μl)和2倍体积-20℃保存无水乙醇(1000μl),缓慢摇匀;-20℃沉淀30min.12000rpm离心10min,核酸将沉淀于管底。70%乙醇(1000μl)洗涤沉淀并干燥至乙醇全部挥发,加入100μl无菌水以及少量(1-2μl)RNase A,4℃冰箱保存备用。Take 12 scallops and golden scallops from the wild population at age II, add about 0.1 g of adductor muscle to each individual, add 500 ul of STE lysis buffer (NaCl: 100 mM; EDTA: 1 mM, pH=8.0; Tris-HCl, 10nM, pH=8.0), cut into pieces, then added 50μl 10% SDS, and 5μl proteinase K (20mg/ml), digested in a water bath at 56°C until the tissue fragments were completely lysed and the lysate was clarified. Add an equal volume of saturated phenol (250 μl) and chloroform/isoamyl alcohol (24:1) (250 μl),
扇贝基因组DNA的消化Digestion of scallop genomic DNA
利用甲基化修饰依赖性内切酶FspEI酶切基因组,获得全基因组范围内的甲基化标签:酶切体系为20μl,包含300ng基因组DNA,4U的FspEI内切酶(NEB),1×NEBuffer4,1×Enzyme Activator Solution,1×BSA,37℃保温4小时。Use the methylation-dependent endonuclease FspEI to digest the genome to obtain methylation tags across the genome: the digestion system is 20 μl, containing 300ng genomic DNA, 4U of FspEI endonuclease (NEB), 1×NEBuffer4 , 1×Enzyme Activator Solution, 1×BSA, keep warm at 37°C for 4 hours.
2) 将酶切片段的两端分别连接上接头,作为扩增引物的结合点2) Connect the two ends of the enzyme-digested fragments to adapters as the binding points of the amplification primers
酶切反应产生的标签 5′末端都带有一个4碱基突出,设计3’端带4个兼并碱基的接头Slx-Ad1, Slx-Ad2,连接反应体系为20ul,包含10ul上步的酶切产物,800U T4 DNA连接酶(NEB),1×T4 LigaseBuffer,4uM Adaptor 1,4uM Adaptor 2,20mM三磷酸腺苷ATP,4℃连接16h。The tags produced by the enzyme digestion reaction have a 4-base overhang at the 5' end, and design adapters Slx-Ad1 and Slx-Ad2 with 4 merged bases at the 3' end, and the ligation reaction system is 20ul, including 10ul of the enzyme in the previous step Digested product, 800U T4 DNA ligase (NEB), 1×T4 LigaseBuffer,
3) 将连接上接头的酶切片段用引物进行第一轮PCR扩增,从而富集接头连接正确的酶切片段;3) Perform the first round of PCR amplification of the enzyme-cut fragments connected with the adapters with primers, so as to enrich the enzyme-cut fragments connected with the adapters correctly;
PCR反应体系为20μL,包含7ul反应模板,4μM Slx-Primer1引物(5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGAC -3'),4μM Slx-Primer2引物(5'- GTGACTGGAGTTCAGACGTGT -3'),0.3mMdNTPs, 0.4U Phusion 超保真 DNA 聚合酶(NEB),1× HFbuffer;反应条件为98℃变性5 s,60℃退火20 s,72℃延伸10 s,进行14-18个循环,最后72℃延伸 5min。PCR产物用8% 非变性聚丙烯酰胺琼凝胶电泳检测,扩增产物大小约为120bp, 切胶回收PCR产物。The PCR reaction system is 20μL, including 7ul reaction template, 4μM Slx-Primer1 primer (5'- AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGAC-3'), 4μM Slx-Primer2 primer (5'- GTGACTGGAGTTCAGACGTGT -3'), 0.3mMdNTPs, 0.4U Phusion ultra-fidelity DNA polymerase (NEB), 1× HFbuffer; the reaction conditions are denaturation at 98°C for 5 s, annealing at 60°C for 20 s, extension at 72°C for 10 s, 14-18 cycles, and finally extension at 72°C for 5 min. The PCR product was detected by 8% non-denaturing polyacrylamide agar gel electrophoresis, the size of the amplified product was about 120bp, and the PCR product was recovered by cutting the gel.
4) 将第一轮PCR扩增产物用引物进行第二轮PCR扩增,引入Barcode来构建测序文库;4) Perform the second round of PCR amplification of the first round of PCR amplification products with primers, and introduce Barcode to construct a sequencing library;
为了实现多个个体混合测序进行甲基化检测,可以通过对每个个体添加不同的Barcode来区分,利用PCR反应的不同引物引入不同的Barcode。PCR反应体系为20uL,包含25ng一轮PCR扩增纯化产物,4μM Slx-Primer1引物 5’-AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGAC -3' ,4μM Slx-Index Primer引物In order to realize the mixed sequencing of multiple individuals for methylation detection, it can be distinguished by adding different Barcodes to each individual, and using different primers in the PCR reaction to introduce different Barcodes. The PCR reaction system is 20uL, including 25ng round PCR amplification purification product, 4μM Slx-Primer1 primer 5'-AATGATACGGCGACCACCGAGATTCTACACTCTTTCCCTACACGAC-3', 4μM Slx-Index Primer primer
5'- CAAGCAGAAGACGGCATACGAGATNNNNNNGTGACTGGAGTTCAGACGTGT-3',其中NNNNNN可根据不同的Barcode序列改变,24只虾夷扇贝甲基化文库使用的Slx-Index Primer引物序列如表2所示,0.3mM dNTPs, 0.4UPhusion 超保真 DNA聚合酶(NEB),1× HF buffer;反应条件为98℃变性5 s,60℃退火20 s,72℃延伸10 s,进行5-7个循环,最后72℃延伸5min。平行扩增3管,PCR产物用8%非变性聚丙烯酰胺凝胶电泳检测,扩增产物大小约为150bp,利用QIAGEN PCR产物纯化试剂盒回收纯化PCR产物。利用Solexa Hiseq2000测序平台测序,此部分由测序公司完成。5'- CAAGCAGAAGACGGCATACGAGATNNNNNNGTGACTGGAGTTCAGACGTGT-3', where NNNNNN can be changed according to different Barcode sequences, the Slx-Index Primer primer sequence used in the 24 scallop methylation libraries is shown in Table 2, 0.3mM dNTPs, 0.4UPhusion ultra-fidelity DNA polymerase (NEB), 1× HF buffer; the reaction conditions are denaturation at 98°C for 5 s, annealing at 60°C for 20 s, extension at 72°C for 10 s, 5-7 cycles, and finally extension at 72°C for 5 min. Three tubes were amplified in parallel, and the PCR product was detected by 8% non-denaturing polyacrylamide gel electrophoresis. The size of the amplified product was about 150 bp, and the PCR product was recovered and purified using the QIAGEN PCR Product Purification Kit. The Solexa Hiseq2000 sequencing platform was used for sequencing, and this part was completed by the sequencing company.
表2 本发明中涉及的Slx-Index Primer引物序列表Table 2 Slx-Index Primer primer sequence list involved in the present invention
注:粗体表示barcode序列Note: Bold indicates the barcode sequence
5)测序文库利用Solexa Hiseq2000测序平台进行测序;将测序数据进行分析得到全基因组甲基化信息。5) The sequencing library was sequenced using the Solexa Hiseq2000 sequencing platform; the sequencing data was analyzed to obtain genome-wide methylation information.
1、Illumina/Solexa测序产生的结果文件为fastq格式,首先利用编译的perl脚本对24个文库的原始序列进行质量过滤,去除含有N的序列以及大于5个碱基的质量值小于10的reads,24个甲基化文库中得到的高质量reads数目范围在6,749,208—14,661,303,每个库中的高质量reads数目占原始reads的百分比均在99%以上。1. The result file generated by Illumina/Solexa sequencing is in the fastq format. First, use the compiled perl script to perform quality filtering on the original sequences of the 24 libraries, and remove sequences containing N and reads with a quality value greater than 5 bases and less than 10. The number of high-quality reads obtained in the 24 methylated libraries ranged from 6,749,208 to 14,661,303, and the number of high-quality reads in each library accounted for more than 99% of the original reads.
2、利用CD-HIT软件对高质量标签序列进行聚类分析,获得测序文库中的甲基化标签种类以及该代表标签的丰度,即该位点的甲基化水平信息。在野生型虾夷扇贝和一种突变型虾夷扇贝新品系“海大金贝”基因组范围内获得98,754个甲基化位点,将甲基化位点比对到基因组参考序列上并进行基因的注释分析表明,具有甲基化位点的基因涉及到多种生物学功能和代谢途径中,说明贝类的生长、繁殖和免疫等相关通路可能有表观遗传学机制的参与调控。2. Use CD-HIT software to perform cluster analysis on high-quality tag sequences, and obtain the type of methylation tag in the sequencing library and the abundance of the representative tag, that is, the methylation level information of the site. 98,754 methylation sites were obtained within the genome range of the wild-type scallop and a new mutant line of scallop "Haida Jinbei", and the methylation sites were compared to the genome reference sequence and genetically Annotation analysis of . showed that genes with methylation sites were involved in a variety of biological functions and metabolic pathways, indicating that the growth, reproduction and immunity of shellfish and other related pathways may be regulated by epigenetic mechanisms.
3、利用edgeR软件对野生型虾夷扇贝和“海大金贝”全基因组范围的位点进行甲基化水平差异分析,获得差异甲基化位点2452个,在“海大金贝”中有8个位点的甲基化水平与闭壳肌类胡萝卜素含量表现出较强的相关性,相关系数在0.90以上,将此类位点比对到基因组参考序列中,获得了若干与类胡萝卜素积累相关的候选基因。为了测试本发明技术的可靠性,设计了两组技术重复实验,两组平行对照的泊松系数均能达到0.99以上。验证了本发明技术的稳定和可靠性。3. Using the edgeR software to analyze the difference in methylation level between wild-type scallops and "Haida Jinbei" genome-wide loci, 2452 differentially methylated sites were obtained, among the "Haida Jinbei" The methylation level of 8 sites showed a strong correlation with the carotenoid content of adductor muscle, and the correlation coefficient was above 0.90. By comparing these sites to the genome reference sequence, several results related to carotenoids were obtained. Candidate genes associated with carotene accumulation. In order to test the reliability of the technology of the present invention, two groups of technical repeated experiments were designed, and the Poisson coefficients of the two groups of parallel controls could reach more than 0.99. The stability and reliability of the technology of the present invention are verified.
实验证明本发明技术既可以用于全基因组范围内甲基化位点的筛选,也可以用于不同细胞、组织或者样本间的甲基化位点修饰的差异分析,探讨DNA甲基化影响基因表达的调控机制。Experiments have proved that the technology of the present invention can be used not only for the screening of methylation sites in the whole genome, but also for the differential analysis of methylation site modifications among different cells, tissues or samples, and to explore the influence of DNA methylation on genes. Expression regulation mechanism.
本发明具有通量高、效率高、成本低的特点,适用于在全基因组范围分析检测甲基化位点,在非模式生物的全基因组甲基化检测中具有良好的应用潜能。The invention has the characteristics of high throughput, high efficiency and low cost, is suitable for analyzing and detecting methylation sites in the whole genome, and has good application potential in the whole genome methylation detection of non-model organisms.
Claims (1)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310163085.0A CN103233072B (en) | 2013-05-06 | 2013-05-06 | High-flux mythelation detection technology for DNA (deoxyribonucleic acid) of complete genome |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310163085.0A CN103233072B (en) | 2013-05-06 | 2013-05-06 | High-flux mythelation detection technology for DNA (deoxyribonucleic acid) of complete genome |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103233072A CN103233072A (en) | 2013-08-07 |
CN103233072B true CN103233072B (en) | 2014-07-02 |
Family
ID=48881136
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310163085.0A Expired - Fee Related CN103233072B (en) | 2013-05-06 | 2013-05-06 | High-flux mythelation detection technology for DNA (deoxyribonucleic acid) of complete genome |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103233072B (en) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103555846B (en) * | 2013-11-07 | 2015-12-30 | 中国海洋大学 | A kind of high-throughput low cost S NP classifying method based on liquid phase molecule Hybridization principle |
US10889852B2 (en) | 2014-10-20 | 2021-01-12 | Commonwealth Scientific And Industrial Research Organisation | Genome methylation analysis |
CN104357573B (en) * | 2014-11-19 | 2016-09-28 | 苏州圣苏新药开发有限公司 | The method of precise verification unknown gene methylation |
CN104561294B (en) * | 2014-12-26 | 2018-03-30 | 北京诺禾致源科技股份有限公司 | The construction method and sequence measurement of Genotyping sequencing library |
CN105986030A (en) * | 2016-02-03 | 2016-10-05 | 广州市基准医疗有限责任公司 | Methylated DNA detection method |
CN106192021B (en) * | 2016-08-02 | 2017-04-26 | 中国海洋大学 | Method for constructing series connection RAD [restriction-site-associated DNA (deoxyribonucleic acid)] tag sequencing libraries |
CN108319984B (en) * | 2018-02-06 | 2019-07-02 | 北京林业大学 | Construction method and prediction method of prediction model of leaf phenotypic characteristics and photosynthetic characteristics of woody plants based on DNA methylation level |
CN109136363A (en) * | 2018-08-31 | 2019-01-04 | 领航基因科技(杭州)有限公司 | A method of the non-integral multiple kit of detection fetal chromosomal and detection fetal methylation DNA copy number |
CN113166809B (en) * | 2018-12-29 | 2023-12-26 | 深圳华大生命科学研究院 | DNA methylation detection method, kit, device and application |
CN111627499B (en) * | 2020-05-27 | 2020-12-08 | 广州市基准医疗有限责任公司 | Methylation level vectorization representation and specific sequencing interval detection method and device |
CN113584600A (en) * | 2021-08-11 | 2021-11-02 | 翌圣生物科技(上海)股份有限公司 | Whole genome methylation single-stranded DNA library building method |
CN115831367B (en) * | 2022-12-21 | 2025-01-07 | 北京博奥医学检验所有限公司 | A risk prediction model for pregnancy complications and its application |
CN116515955B (en) * | 2023-06-20 | 2023-11-17 | 中国科学院海洋研究所 | Multi-gene targeting typing method |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
ATE191510T1 (en) * | 1991-09-24 | 2000-04-15 | Keygene Nv | SELECTIVE RESTRICTION FRAGMENT AMPLIFICATION: GENERAL METHOD FOR DNA FINGERPRINTING |
WO2009015863A2 (en) * | 2007-07-30 | 2009-02-05 | Roche Diagnostics Gmbh | Methods of detecting methylated dna at a specific locus |
CN101503733A (en) * | 2008-12-17 | 2009-08-12 | 上海人类基因组研究中心 | DNA cohesive end joint facilitating high throughput gene sequence label sequencing and use |
-
2013
- 2013-05-06 CN CN201310163085.0A patent/CN103233072B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN103233072A (en) | 2013-08-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103233072B (en) | High-flux mythelation detection technology for DNA (deoxyribonucleic acid) of complete genome | |
EP3737774B1 (en) | Method for analyzing nucleic acid | |
JP7300989B2 (en) | Methods and systems for analyzing nucleic acid molecules | |
AU2018266377B2 (en) | Universal short adapters for indexing of polynucleotide samples | |
CN106192021B (en) | Method for constructing series connection RAD [restriction-site-associated DNA (deoxyribonucleic acid)] tag sequencing libraries | |
Fan et al. | Highly parallel genomic assays | |
EP3329010B1 (en) | Nucleic acids and methods for detecting chromosomal abnormalities | |
US20140357497A1 (en) | Designing padlock probes for targeted genomic sequencing | |
CN105358714B (en) | Enrichment of DNA sequencing libraries from samples containing small amounts of target DNA | |
EP3207134B1 (en) | Contiguity preserving transposition | |
US10718015B2 (en) | Sequencing library, preparation method and use thereof | |
Hook et al. | Beyond assembly: the increasing flexibility of single-molecule sequencing technology | |
US20240076720A1 (en) | Methods for analyzing nucleic acids | |
WO2022199242A1 (en) | Set of barcode linkers and medium-flux multi-single-cell representative dna methylation library construction and sequencing method | |
JP2020536525A (en) | A method for concentrating the probe and the target region to which it is applied for high-throughput sequencing | |
US20200255824A1 (en) | Methods and Compositions for Preparing Nucleic Acid Sequencing Libraries | |
CN108166068A (en) | A kind of Novel DNA builds library kit and its application | |
CN104313172A (en) | Method for simultaneous genotyping of large number of samples | |
CN108166069A (en) | A kind of novel methylate banking process and its application | |
Good | Reduced representation methods for subgenomic enrichment and next-generation sequencing | |
WO2011063210A2 (en) | Methods of mapping genomic methylation patterns | |
CN108166067A (en) | A kind of Novel DNA banking process and its application | |
CN113166809A (en) | Method, kit, device and application for detecting DNA methylation | |
JP2023534882A (en) | Chimeric amplicon array sequencing | |
Tanić | Epigenome-wide association study (EWAS): Methods and applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20140702 Termination date: 20150506 |
|
EXPY | Termination of patent right or utility model |