US20230347311A1 - A versatile method for the detection of marker-free precision genome editing and genetic variation - Google Patents
- Publication number
- US20230347311A1 (application US17/850,186)
- Authority
- US
- United States
- Prior art keywords
- AcuI
- seq
- interest
- dna
- genomic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
- 238000000034 method Methods 0.000 title claims abstract description 62
- 238000010362 genome editing Methods 0.000 title claims abstract description 43
- 230000007614 genetic variation Effects 0.000 title abstract description 6
- 238000001514 detection method Methods 0.000 title description 74
- 108020004414 DNA Proteins 0.000 claims description 179
- 230000035772 mutation Effects 0.000 claims description 73
- 230000000295 complement effect Effects 0.000 claims description 63
- 108091034117 Oligonucleotide Proteins 0.000 claims description 56
- 108091093088 Amplicon Proteins 0.000 claims description 51
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 claims description 51
- 230000002441 reversible effect Effects 0.000 claims description 47
- 239000012634 fragment Substances 0.000 claims description 44
- 238000003753 real-time PCR Methods 0.000 claims description 37
- 238000011002 quantification Methods 0.000 claims description 30
- 231100000590 oncogenic Toxicity 0.000 claims description 28
- 230000002246 oncogenic effect Effects 0.000 claims description 28
- 108091008146 restriction endonucleases Proteins 0.000 claims description 28
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 27
- 206010028980 Neoplasm Diseases 0.000 claims description 23
- 238000012239 gene modification Methods 0.000 claims description 22
- 230000005017 genetic modification Effects 0.000 claims description 22
- 235000013617 genetically modified food Nutrition 0.000 claims description 22
- 201000011510 cancer Diseases 0.000 claims description 21
- 239000013610 patient sample Substances 0.000 claims description 17
- 230000001419 dependent effect Effects 0.000 claims description 16
- 230000002194 synthesizing effect Effects 0.000 claims description 15
- 239000000523 sample Substances 0.000 claims description 14
- 239000012472 biological sample Substances 0.000 claims description 13
- 238000003776 cleavage reaction Methods 0.000 claims description 12
- 230000007017 scission Effects 0.000 claims description 12
- 238000010171 animal model Methods 0.000 claims description 11
- 241001493065 dsRNA viruses Species 0.000 claims description 8
- 238000003757 reverse transcription PCR Methods 0.000 claims description 8
- 241001678559 COVID-19 virus Species 0.000 claims description 3
- 101150101095 Mmp12 gene Proteins 0.000 claims description 3
- 238000011144 upstream manufacturing Methods 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims 1
- 239000013615 primer Substances 0.000 description 93
- 210000004027 cell Anatomy 0.000 description 87
- 238000007481 next generation sequencing Methods 0.000 description 66
- 108700028369 Alleles Proteins 0.000 description 62
- 239000000047 product Substances 0.000 description 60
- 101150072950 BRCA1 gene Proteins 0.000 description 57
- 238000012360 testing method Methods 0.000 description 52
- 238000006243 chemical reaction Methods 0.000 description 47
- 238000002474 experimental method Methods 0.000 description 46
- 108091033409 CRISPR Proteins 0.000 description 44
- 102000036365 BRCA1 Human genes 0.000 description 43
- 108700020463 BRCA1 Proteins 0.000 description 42
- 102000052609 BRCA2 Human genes 0.000 description 40
- 108700020462 BRCA2 Proteins 0.000 description 40
- 101150008921 Brca2 gene Proteins 0.000 description 40
- 108091027544 Subgenomic mRNA Proteins 0.000 description 35
- 238000010354 CRISPR gene editing Methods 0.000 description 33
- 241000699666 Mus <mouse, genus> Species 0.000 description 32
- 230000001404 mediated effect Effects 0.000 description 31
- 241000699670 Mus sp. Species 0.000 description 28
- 238000007480 sanger sequencing Methods 0.000 description 23
- 238000003556 assay Methods 0.000 description 22
- 101001048956 Homo sapiens Homeobox protein EMX1 Proteins 0.000 description 21
- 102100023823 Homeobox protein EMX1 Human genes 0.000 description 19
- 101001120056 Homo sapiens Phosphatidylinositol 3-kinase regulatory subunit alpha Proteins 0.000 description 19
- 101150063416 add gene Proteins 0.000 description 19
- 230000029087 digestion Effects 0.000 description 19
- 101150009057 JAK2 gene Proteins 0.000 description 18
- 102100026169 Phosphatidylinositol 3-kinase regulatory subunit alpha Human genes 0.000 description 18
- 239000000203 mixture Substances 0.000 description 18
- 102100025064 Cellular tumor antigen p53 Human genes 0.000 description 17
- 108010078814 Tumor Suppressor Protein p53 Proteins 0.000 description 17
- 238000004458 analytical method Methods 0.000 description 17
- 230000002068 genetic effect Effects 0.000 description 17
- 238000003205 genotyping method Methods 0.000 description 17
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 14
- 101150059668 Bard1 gene Proteins 0.000 description 14
- 102100026816 DNA-dependent metalloprotease SPRTN Human genes 0.000 description 14
- 101000629403 Homo sapiens DNA-dependent metalloprotease SPRTN Proteins 0.000 description 14
- 101000615373 Homo sapiens SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A-like protein 1 Proteins 0.000 description 14
- 102000003960 Ligases Human genes 0.000 description 14
- 108090000364 Ligases Proteins 0.000 description 14
- 238000012408 PCR amplification Methods 0.000 description 14
- 239000006227 byproduct Substances 0.000 description 14
- 238000003780 insertion Methods 0.000 description 14
- 230000037431 insertion Effects 0.000 description 14
- 108090000623 proteins and genes Proteins 0.000 description 14
- 208000014697 Acute lymphocytic leukaemia Diseases 0.000 description 13
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 13
- 102100021248 SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A-like protein 1 Human genes 0.000 description 13
- 239000013612 plasmid Substances 0.000 description 13
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 13
- 102200087780 rs77375493 Human genes 0.000 description 13
- 241000700605 Viruses Species 0.000 description 12
- 239000003795 chemical substances by application Substances 0.000 description 12
- 239000013642 negative control Substances 0.000 description 12
- 230000001717 pathogenic effect Effects 0.000 description 12
- 102220009485 rs80357005 Human genes 0.000 description 12
- 238000012163 sequencing technique Methods 0.000 description 12
- 101000997832 Homo sapiens Tyrosine-protein kinase JAK2 Proteins 0.000 description 11
- 102100033444 Tyrosine-protein kinase JAK2 Human genes 0.000 description 11
- 239000000872 buffer Substances 0.000 description 11
- 239000000463 material Substances 0.000 description 11
- 238000010172 mouse model Methods 0.000 description 11
- 210000002220 organoid Anatomy 0.000 description 11
- 102000004190 Enzymes Human genes 0.000 description 10
- 108090000790 Enzymes Proteins 0.000 description 10
- 108010026653 Fanconi Anemia Complementation Group D2 protein Proteins 0.000 description 10
- 238000013461 design Methods 0.000 description 10
- 238000011161 development Methods 0.000 description 10
- 230000018109 developmental process Effects 0.000 description 10
- 238000003745 diagnosis Methods 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 239000000499 gel Substances 0.000 description 10
- 150000007523 nucleic acids Chemical group 0.000 description 10
- UHDGCWIWMRVCDJ-UHFFFAOYSA-N 1-beta-D-Xylofuranosyl-NH-Cytosine Natural products O=C1N=C(N)C=CN1C1C(O)C(O)C(CO)O1 UHDGCWIWMRVCDJ-UHFFFAOYSA-N 0.000 description 9
- UHDGCWIWMRVCDJ-PSQAKQOGSA-N Cytidine Natural products O=C1N=C(N)C=CN1[C@@H]1[C@@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-PSQAKQOGSA-N 0.000 description 9
- 102100028712 Cytosolic purine 5'-nucleotidase Human genes 0.000 description 9
- 102000013601 Fanconi Anemia Complementation Group D2 protein Human genes 0.000 description 9
- 101000915162 Homo sapiens Cytosolic purine 5'-nucleotidase Proteins 0.000 description 9
- 101000891326 Homo sapiens Treacle protein Proteins 0.000 description 9
- UHDGCWIWMRVCDJ-ZAKLUEHWSA-N cytidine Chemical compound O=C1N=C(N)C=CN1[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O1 UHDGCWIWMRVCDJ-ZAKLUEHWSA-N 0.000 description 9
- 201000007224 Myeloproliferative neoplasm Diseases 0.000 description 8
- 101150063858 Pik3ca gene Proteins 0.000 description 8
- 102100040421 Treacle protein Human genes 0.000 description 8
- 239000011324 bead Substances 0.000 description 8
- 230000001413 cellular effect Effects 0.000 description 8
- 239000013641 positive control Substances 0.000 description 8
- 101000831286 Homo sapiens Protein timeless homolog Proteins 0.000 description 7
- 101000702606 Homo sapiens Structure-specific endonuclease subunit SLX4 Proteins 0.000 description 7
- 102100024287 Protein timeless homolog Human genes 0.000 description 7
- 102100031003 Structure-specific endonuclease subunit SLX4 Human genes 0.000 description 7
- 238000000137 annealing Methods 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 210000001185 bone marrow Anatomy 0.000 description 7
- 201000010099 disease Diseases 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- 210000004185 liver Anatomy 0.000 description 7
- 102000039446 nucleic acids Human genes 0.000 description 7
- 108020004707 nucleic acids Proteins 0.000 description 7
- 229920002401 polyacrylamide Polymers 0.000 description 7
- 102220045540 rs397507975 Human genes 0.000 description 7
- 108020001019 DNA Primers Proteins 0.000 description 6
- 239000003155 DNA primer Substances 0.000 description 6
- 102100021601 Ephrin type-A receptor 8 Human genes 0.000 description 6
- 102100034552 Fanconi anemia group M protein Human genes 0.000 description 6
- 101000898676 Homo sapiens Ephrin type-A receptor 8 Proteins 0.000 description 6
- 101000848187 Homo sapiens Fanconi anemia group M protein Proteins 0.000 description 6
- 102100039087 Peptidyl-alpha-hydroxyglycine alpha-amidating lyase Human genes 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 230000008859 change Effects 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 210000004962 mammalian cell Anatomy 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 210000005259 peripheral blood Anatomy 0.000 description 6
- 239000011886 peripheral blood Substances 0.000 description 6
- 102220058171 rs273899698 Human genes 0.000 description 6
- 102220021067 rs397508902 Human genes 0.000 description 6
- 102220028199 rs398122680 Human genes 0.000 description 6
- 102200067151 rs55851803 Human genes 0.000 description 6
- 102220041646 rs587780664 Human genes 0.000 description 6
- 102220060981 rs747837583 Human genes 0.000 description 6
- 102220028155 rs80357102 Human genes 0.000 description 6
- 102220061878 rs80357460 Human genes 0.000 description 6
- 102220093522 rs80358869 Human genes 0.000 description 6
- 102200071461 rs80359104 Human genes 0.000 description 6
- 102220010133 rs80359143 Human genes 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- 206010006187 Breast cancer Diseases 0.000 description 5
- 208000026310 Breast neoplasm Diseases 0.000 description 5
- 102000012410 DNA Ligases Human genes 0.000 description 5
- 108010061982 DNA Ligases Proteins 0.000 description 5
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- 108020004485 Nonsense Codon Proteins 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 210000002798 bone marrow cell Anatomy 0.000 description 5
- 238000002512 chemotherapy Methods 0.000 description 5
- 230000005782 double-strand break Effects 0.000 description 5
- 238000011534 incubation Methods 0.000 description 5
- 230000037434 nonsense mutation Effects 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- 102220021442 rs397509029 Human genes 0.000 description 5
- 102220026152 rs587779360 Human genes 0.000 description 5
- 102220046186 rs587782713 Human genes 0.000 description 5
- 230000035945 sensitivity Effects 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 230000008685 targeting Effects 0.000 description 5
- 102000053602 DNA Human genes 0.000 description 4
- 238000007400 DNA extraction Methods 0.000 description 4
- 101100175482 Glycine max CG-3 gene Proteins 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- 210000004369 blood Anatomy 0.000 description 4
- 239000008280 blood Substances 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 101150042537 dld1 gene Proteins 0.000 description 4
- 210000003743 erythrocyte Anatomy 0.000 description 4
- 238000013401 experimental design Methods 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 230000002779 inactivation Effects 0.000 description 4
- 230000000968 intestinal effect Effects 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 238000006366 phosphorylation reaction Methods 0.000 description 4
- 102220198092 rs1057519866 Human genes 0.000 description 4
- 102220096390 rs373203204 Human genes 0.000 description 4
- 102220060761 rs786202354 Human genes 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- 108010042407 Endonucleases Proteins 0.000 description 3
- 102000004533 Endonucleases Human genes 0.000 description 3
- 108091092584 GDNA Proteins 0.000 description 3
- 241000581650 Ivesia Species 0.000 description 3
- 101100352296 Mus musculus Pik3ca gene Proteins 0.000 description 3
- 206010061535 Ovarian neoplasm Diseases 0.000 description 3
- 229920003356 PDX® Polymers 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229930189065 blasticidin Natural products 0.000 description 3
- 238000009795 derivation Methods 0.000 description 3
- 230000011559 double-strand break repair via nonhomologous end joining Effects 0.000 description 3
- 230000014509 gene expression Effects 0.000 description 3
- 238000002347 injection Methods 0.000 description 3
- 239000007924 injection Substances 0.000 description 3
- 239000002609 medium Substances 0.000 description 3
- 239000012264 purified product Substances 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 102220198110 rs1057519867 Human genes 0.000 description 3
- 102220012939 rs397516386 Human genes 0.000 description 3
- 102220010159 rs55933907 Human genes 0.000 description 3
- 102220018743 rs80358448 Human genes 0.000 description 3
- 238000013515 script Methods 0.000 description 3
- 210000002966 serum Anatomy 0.000 description 3
- 239000007790 solid phase Substances 0.000 description 3
- 230000004936 stimulating effect Effects 0.000 description 3
- 230000007704 transition Effects 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 108700010154 BRCA2 Genes Proteins 0.000 description 2
- 238000011814 C57BL/6N mouse Methods 0.000 description 2
- 206010068051 Chimerism Diseases 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 206010053138 Congenital aplastic anaemia Diseases 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- 201000004939 Fanconi anemia Diseases 0.000 description 2
- 241000282326 Felis catus Species 0.000 description 2
- 208000031448 Genomic Instability Diseases 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- 101710163270 Nuclease Proteins 0.000 description 2
- 206010033128 Ovarian cancer Diseases 0.000 description 2
- 101150046396 PIK3R1 gene Proteins 0.000 description 2
- MEFKEPWMEQBLKI-AIRLBKTGSA-N S-adenosyl-L-methioninate Chemical compound O[C@@H]1[C@H](O)[C@@H](C[S+](CC[C@H](N)C([O-])=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 MEFKEPWMEQBLKI-AIRLBKTGSA-N 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 239000008049 TAE buffer Substances 0.000 description 2
- HGEVZDLYZYVYHD-UHFFFAOYSA-N acetic acid;2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound CC(O)=O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O HGEVZDLYZYVYHD-UHFFFAOYSA-N 0.000 description 2
- 229960001570 ademetionine Drugs 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000004820 blood count Methods 0.000 description 2
- 210000000481 breast Anatomy 0.000 description 2
- 238000010804 cDNA synthesis Methods 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 239000013068 control sample Substances 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 239000010931 gold Substances 0.000 description 2
- 229910052737 gold Inorganic materials 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000011259 mixed solution Substances 0.000 description 2
- 238000002156 mixing Methods 0.000 description 2
- 238000001823 molecular biology technique Methods 0.000 description 2
- 238000011275 oncology therapy Methods 0.000 description 2
- 230000007918 pathogenicity Effects 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000003752 polymerase chain reaction Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 2
- 230000008439 repair process Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 208000007056 sickle cell anemia Diseases 0.000 description 2
- 210000000952 spleen Anatomy 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- 210000003462 vein Anatomy 0.000 description 2
- OSBLTNPMIGYQGY-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid;boric acid Chemical compound OB(O)O.OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O OSBLTNPMIGYQGY-UHFFFAOYSA-N 0.000 description 1
- FWBHETKCLVMNFS-UHFFFAOYSA-N 4',6-Diamino-2-phenylindol Chemical compound C1=CC(C(=N)N)=CC=C1C1=CC2=CC=C(C(N)=N)C=C2N1 FWBHETKCLVMNFS-UHFFFAOYSA-N 0.000 description 1
- 102000055025 Adenosine deaminases Human genes 0.000 description 1
- 108700040115 Adenosine deaminases Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 208000031873 Animal Disease Models Diseases 0.000 description 1
- 101700002522 BARD1 Proteins 0.000 description 1
- 108700040618 BRCA1 Genes Proteins 0.000 description 1
- 102100028048 BRCA1-associated RING domain protein 1 Human genes 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 238000011740 C57BL/6 mouse Methods 0.000 description 1
- 101150038243 CLOCK gene Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 102000005381 Cytidine Deaminase Human genes 0.000 description 1
- 108010031325 Cytidine deaminase Proteins 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 230000007018 DNA scission Effects 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 241000238557 Decapoda Species 0.000 description 1
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 1
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 1
- 102100040870 Glycine amidinotransferase, mitochondrial Human genes 0.000 description 1
- 108020005004 Guide RNA Proteins 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 101000934870 Homo sapiens Breast cancer type 1 susceptibility protein Proteins 0.000 description 1
- 101000934858 Homo sapiens Breast cancer type 2 susceptibility protein Proteins 0.000 description 1
- 101000893303 Homo sapiens Glycine amidinotransferase, mitochondrial Proteins 0.000 description 1
- 101100480807 Homo sapiens TCOF1 gene Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 201000007493 Kallmann syndrome Diseases 0.000 description 1
- 208000000916 Mandibulofacial dysostosis Diseases 0.000 description 1
- 241000204031 Mycoplasma Species 0.000 description 1
- 208000014767 Myeloproliferative disease Diseases 0.000 description 1
- 206010053142 Olfacto genital dysplasia Diseases 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- 102220615662 Ras-related protein Rab-11A_I44A_mutation Human genes 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 101150002956 SPRTN gene Proteins 0.000 description 1
- 239000008051 TBE buffer Substances 0.000 description 1
- 101150103534 TCOF1 gene Proteins 0.000 description 1
- 201000003199 Treacher Collins syndrome Diseases 0.000 description 1
- 101000909800 Xenopus laevis Probable N-acetyltransferase camello Proteins 0.000 description 1
- 150000001413 amino acids Chemical class 0.000 description 1
- 238000011558 animal model by disease Methods 0.000 description 1
- NZLIJGHPMIEDBX-UHFFFAOYSA-M azanium potassium hydrogen carbonate chloride Chemical compound [NH4+].[Cl-].[K+].OC([O-])=O NZLIJGHPMIEDBX-UHFFFAOYSA-M 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000007622 bioinformatic analysis Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 238000010322 bone marrow transplantation Methods 0.000 description 1
- 244000309466 calf Species 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 208000011654 childhood malignant neoplasm Diseases 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000008632 circadian clock Effects 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000009096 combination chemotherapy Methods 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000013399 early diagnosis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000000925 erythroid effect Effects 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 231100000221 frame shift mutation induction Toxicity 0.000 description 1
- 230000037433 frameshift Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 102000054767 gene variant Human genes 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 210000003783 haploid cell Anatomy 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000005534 hematocrit Methods 0.000 description 1
- 102000048580 human BRCA1 Human genes 0.000 description 1
- 102000047599 human BRCA2 Human genes 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012933 kinetic analysis Methods 0.000 description 1
- 230000003902 lesion Effects 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000003278 mimic effect Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 101150101144 nt5c2 gene Proteins 0.000 description 1
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000002611 ovarian Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 238000011338 personalized therapy Methods 0.000 description 1
- 108090000765 processed proteins & peptides Proteins 0.000 description 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 229950010131 puromycin Drugs 0.000 description 1
- 102000005912 ran GTP Binding Protein Human genes 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 102220022127 rs397509267 Human genes 0.000 description 1
- 102220020066 rs80359140 Human genes 0.000 description 1
- 238000013207 serial dilution Methods 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000003393 splenic effect Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 210000002303 tibia Anatomy 0.000 description 1
- 239000012096 transfection reagent Substances 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 210000000689 upper leg Anatomy 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 238000007482 whole exome sequencing Methods 0.000 description 1
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J19/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J19/0046—Sequential or parallel reactions, e.g. for the synthesis of polypeptides or polynucleotides; Apparatus and devices for combinatorial chemistry or for making molecular arrays
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6858—Allele-specific amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/686—Polymerase chain reaction [PCR]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/70—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
- C12Q1/701—Specific hybridization probes
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B40/00—Libraries per se, e.g. arrays, mixtures
- C40B40/04—Libraries containing only organic compounds
- C40B40/06—Libraries containing nucleotides or polynucleotides, or derivatives thereof
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B60/00—Apparatus specially adapted for use in combinatorial chemistry or with libraries
- C40B60/14—Apparatus specially adapted for use in combinatorial chemistry or with libraries for creating libraries
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/00279—Features relating to reactor vessels
- B01J2219/00306—Reactor vessels in a multiple arrangement
- B01J2219/00313—Reactor vessels in a multiple arrangement the reactor vessels being formed by arrays of wells in blocks
- B01J2219/00315—Microtiter plates
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/00351—Means for dispensing and evacuation of reagents
- B01J2219/00364—Pipettes
- B01J2219/00367—Pipettes capillary
- B01J2219/00369—Pipettes capillary in multiple or parallel arrangements
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/00351—Means for dispensing and evacuation of reagents
- B01J2219/00378—Piezoelectric or ink jet dispensers
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/00351—Means for dispensing and evacuation of reagents
- B01J2219/00387—Applications using probes
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/00497—Features relating to the solid phase supports
- B01J2219/00504—Pins
Definitions
- the present disclosure provides, inter alia, specially designed DNA adaptors and various methods and kits for carrying out and detecting marker-free precision genome editing and genetic variation using such adaptors.
- sequence listing is hereby incorporated by reference in its entirety pursuant to 37 C.F.R. § 1.52(e)(5).
- Precision genome editing allows the modeling and correction of desired genomic variants containing insertions or deletions of specific nucleotide sequences or changes in single DNA bases (Anzalone et al., 2019; Barbieri et al., 2017; Cong et al., 2013; Dow, 2015; Guo et al., 2018; Liu et al., 2018; Mali et al., 2013; Roy et al., 2018).
- Precision genome editing can be obtained by CRISPR-dependent homology-directed repair (HDR) of Cas9-induced DNA double-strand breaks (DSBs) (Jasin and Haber, 2016) or result from the use of alternative DSB-free methods, such as CRISPR-dependent base editing, which utilizes cytidine or adenosine deaminases fused to a nickase Cas9 (nCas9) mutant to generate base transitions (Gaudelli et al., 2017; Komor et al., 2016), and prime editing, which employs a reverse transcriptase-nCas9 fusion and a template prime editing guide RNA (pegRNA) to install into the genome a large variety of genomic changes, including transversions, transitions, small insertions and deletions (Anzalone et al., 2019).
- Genome editing has been facilitated by the development of accessible and cost-effective methods for the detection of small insertions and deletions (indels) resulting from the repair of Cas9-induced DSBs, such as the T7E1 and Surveyor nuclease assays (Mashal et al., 1995; Qiu et al., 2004; Ran et al., 2013).
- Because these methods do not determine the identity of DNA bases, they are ill-suited for the detection of genomic changes introduced by precision genome editing (Germini et al., 2018).
- Precision genome editing events can be detected by the addition of genomic markers by CRISPR-dependent HDR or prime editing, such as silent mutations that create or disrupt restriction sites, or selectable reporters encoding for antibiotic resistance or fluorescent proteins.
- The addition of genomic markers entails an elaborate experimental design that is unique for each targeted site, thus complicating the insertion of the desired genetic modifications.
- genomic markers can cause unintended perturbations of coding or non-coding genomic elements.
- marker-based detection methods are not compatible with CRISPR-dependent base editing strategies, which induce single DNA base changes (Rees and Liu, 2018).
- Although NGS-based detection strategies are highly sensitive (Clement et al., 2019; Lindsay et al., 2016; Pinello et al., 2016), they remain expensive and time-consuming, which limits their value for the development of mutant cell lines and animal models and for applications that require a rapid turnaround time, such as the identification of pathogenic variants in certain clinical settings. Therefore, a simple, efficient, inexpensive and rapid method that enables quantitative detection of genetic variants in complex biological systems is needed. This disclosure is directed to meeting these and other needs.
- DTECT Dinucleotide signaTurE CapTure
- DTECT is a rapid and versatile detection method that relies on the capture of targeted dinucleotide signatures resulting from the digestion of genomic DNA amplicons by the type IIS restriction enzyme Acul.
- DTECT enables the accurate quantification of marker-free precision genome editing events introduced by CRISPR-dependent homology-directed repair, base editing or prime editing in various biological systems, such as mammalian cell lines, organoids and tissues.
- DTECT allows the identification of oncogenic mutations in cancer mouse models, patient-derived xenografts and human cancer patient samples; it also allows the identification of genetic modifications incurred in various infectious diseases.
- DTECT enables the capture of signatures in nucleic acids from any organism including, e.g., viruses such as SARS-CoV-2.
- the ease, speed and cost efficiency by which DTECT identifies genomic signatures should facilitate the generation of marker-free cellular and animal models of human disease and expedite the detection of human pathogenic variants.
- one embodiment of the present disclosure is a DNA adaptor comprising: (a) one strand with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); and (b) one complementary strand with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246).
- Another embodiment of the present disclosure is a method of preparing a DNA adaptor disclosed herein, comprising: (a) synthesizing one constant oligonucleotide with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); (b) synthesizing one complementary oligonucleotide with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246); (c) mixing the constant and complementary oligonucleotides; and (d) annealing the mixture to obtain the DNA adaptor.
- Another embodiment of the present disclosure is a library of DNA adaptors prepared by methods disclosed herein, wherein the library comprises 16 DNA adaptors and each DNA adaptor has a different “NN”.
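The 16-member adaptor library described above can be enumerated programmatically. The following is a minimal sketch, not part of the disclosure: it assumes that the space inside each printed sequence is a line-wrap artifact, and it derives the complementary strand as the reverse complement of the constant strand (SEQ ID NO: 230) followed by the variable "NN". The names `reverse_complement` and `build_adaptor_library` are hypothetical.

```python
from itertools import product

# Constant strand (SEQ ID NO: 230), written 5'->3' with the stray space removed.
CONSTANT_STRAND = "CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def build_adaptor_library(constant: str = CONSTANT_STRAND) -> dict:
    """Map each 3' dinucleotide "NN" to its complementary-strand sequence.

    Each complementary strand is the reverse complement of the constant
    strand plus two extra 3' bases, which form the overhang after annealing.
    """
    base = reverse_complement(constant)
    return {a + b: base + a + b for a, b in product("ATGC", repeat=2)}


library = build_adaptor_library()
for dinucleotide, strand in sorted(library.items()):
    print(dinucleotide, strand)
```

Each of the 16 printed strands is 60 nt long and differs from the others only in its last two bases.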
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Type IIS restriction enzyme-tagging primer, comprising: (i) extracting genomic DNA from a biological sample of interest; (ii) synthesizing the Type IIS restriction enzyme-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Type IIS restriction enzyme-tagging primer and a reverse primer; and (iv) purifying a Type IIS restriction enzyme-tagged genomic amplicon; (b) digesting the Type IIS restriction enzyme-tagged genomic amplicon with the Type IIS restriction enzyme; (c) isolating the smaller DNA fragment containing a genomic signature of interest exposed in a 3′ single-stranded overhang; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment containing the 3
- a further embodiment of the present disclosure is a kit for detecting a genetic modification of interest, comprising a specially designed Type IIS restriction enzyme-tagging primer disclosed herein, and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (b) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (c) isolating the smaller DNA fragment containing a genomic signature of interest produced by Acul-digestion; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (
- An additional embodiment of the present disclosure is a kit for detecting a genetic modification, comprising a specially designed Acul-tagging primer and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
- Another embodiment of the present disclosure is a method for quantifying a genomic variant in a biological system, comprising the steps of: (a) obtaining a sample from the biological system; (b) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (c) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the Acul-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a
- Still another embodiment of the present disclosure is a method for identifying and quantifying an oncogenic mutation of interest in a biological sample, comprising the steps of: (a) obtaining a biological sample; (b) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (c) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the Acul-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing
- a further embodiment of the present disclosure is a process for marker-free detection of a precision genome editing event comprising carrying out Dinucleotide signaTurE CapTure (DTECT) on a nucleic acid sequence of interest.
- Still another embodiment of the present disclosure is a method for detecting a virus variant of interest, comprising the steps of: (a) obtaining a nucleic acid of the virus variant of interest from a biological sample; and (b) if the nucleic acid is DNA, carrying out Dinucleotide signaTurE CapTure (DTECT) to detect the variant of interest; or (c) if the nucleic acid is RNA, converting it to DNA by reverse transcription PCR (RT-PCR) and then carrying out DTECT to detect the variant of interest.
- FIGS. 1 A- 1 C show the identification of targeted dinucleotide signatures using DTECT.
- FIG. 1 A is a schematic representation of DTECT.
- the Acul-tagging primer (60 nt) is constituted of DNA sequences complementary to the genomic locus (purple) interrupted by a hairpin containing an Acul recognition site (green), and a non-complementary DNA sequence (blue).
- the locus-specific reverse primer (red) is located at a distance >100 bp from the targeted dinucleotide.
- the obtained PCR product is subsequently cleaved by the Acul restriction enzyme in a position adjacent to the targeted dinucleotide, resulting in the generation of two DNA fragments of 60 bp and >100 bp (Acul digestion, step III).
- the 60 bp fragment containing the exposed signature of the targeted dinucleotide is then isolated using SPRI beads with higher affinity towards >100 bp DNA products (Small fragment isolation, step IV).
- the 60 bp fragment is then ligated to DNA adaptors containing 3′-overhangs of two bases complementary (specific) or not (non-specific) to the dinucleotide signature (Adaptor ligation, step V).
- the ligated product is then subjected to PCR amplification for analytical or quantitative detection (Detection PCR, step VI). The approximate time required for each step is indicated.
- FIG. 1 B shows the schematics of the DTECT adaptor library.
- Control (green) and mutant (purple) dinucleotide signatures are detected using a library of 16 unique adaptors (middle panel).
- the library contains adaptors with dinucleotides complementary to the control (green) or mutant (purple) signature, as well as non-specific adaptors (blue) (right panel).
- FIG. 1 C shows the schematics of the positive and negative controls used in DTECT experiments to identify signatures of interest (e.g., mutant allele) in allele populations.
- the adaptor complementary to the WT dinucleotide signature (green)
- the adaptor complementary to the mutant signature of interest (purple)
- a non-specific adaptor (blue)
- the adaptor complementary to the WT dinucleotide signature (green)
- a non-specific adaptor (blue)
- the adaptor complementary to the mutant dinucleotide signature (purple) is used to detect the presence of the variant of interest and quantify its frequency.
- FIGS. 2 A- 2 K show the detection and quantification of dinucleotide signatures using DTECT.
- FIG. 2 A shows the design of Acul-tagging primers that allow the capture of two dinucleotide signatures (CC and TT; blue) on opposite DNA strands.
- FIG. 2 B shows the PCR amplification (22 cycles) of the Acul-digested DNA products containing the CC and TT signatures shown in FIG. 2 A , which have been captured using GG or AA adaptors.
- FIG. 2 C shows the PCR amplification (22 cycles) of DNA fragments captured as in FIG. 2 B with or without dephosphorylation of the Acul-digested products by the shrimp alkaline phosphatase (rSAP).
- FIG. 2 D shows the PCR amplification (22 cycles) of DNA fragments captured as in FIG. 2 B in the absence or presence of Acul, DNA adaptors (GG adaptor for signature CC; AA adaptor for signature TT) or T4 DNA ligase.
- FIG. 2 E shows the schematic representation of the Acul-tagging primer design for detecting four possible dinucleotide signatures (#1-4) containing the same targeted base (C:G, red) in the PIK3R1 gene.
- FIG. 2 F shows the detection of the four dinucleotide signatures shown in FIG. 2 E by DTECT (18 PCR cycles) using specific (green) and non-specific (blue) adaptors.
- FIG. 2 G shows the quantification by DTECT of the relative abundance of SMARCAL1, SPRTN and PIK3R1 WT (green) and STOP (purple) dinucleotide signatures in mixtures of WT and STOP alleles at predefined ratios.
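The text does not spell out how relative abundance is computed from the quantitative PCR readout. One common approach, assumed here purely for illustration, converts the threshold cycle (Ct) of each adaptor-specific reaction into a relative quantity and reports the mutant share of the total. The function name, the Ct inputs and the default amplification efficiency of 2.0 (perfect doubling per cycle) are all assumptions.

```python
def mutant_fraction(ct_wt: float, ct_mut: float, efficiency: float = 2.0) -> float:
    """Estimate the fraction of mutant (e.g., STOP) signatures from two Ct values.

    Assumes both captures amplify with the same efficiency, so that the
    relative quantity of each signature is efficiency ** (-Ct).
    """
    wt_quantity = efficiency ** (-ct_wt)
    mut_quantity = efficiency ** (-ct_mut)
    return mut_quantity / (wt_quantity + mut_quantity)


# Hypothetical Ct values: the mutant capture crosses threshold one cycle later.
print(mutant_fraction(ct_wt=20.0, ct_mut=21.0))
```

Under these assumptions, equal Ct values correspond to a 50:50 mixture, and each additional cycle of delay for the mutant capture halves its quantity relative to WT.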
- FIG. 2 H shows the representation of the Acul-tagging primers used to detect the WT and STOP alleles of the PIK3R1 gene.
- the targeted dinucleotides are shown in blue, the edited base is indicated with an asterisk and part of the Acul-tagging primer sequence is shown in purple.
- FIG. 2 I shows the PCR amplification (25 cycles) of WT and STOP PIK3R1 alleles (arrow) captured using DTECT from WT:STOP allele mixtures (i.e., 100:0 and 99:1).
- An adaptor (CG) specific for the WT allele is used as a positive control and a non-specific adaptor (TT) is used as a negative control.
- An adaptor that captures the STOP PIK3R1 allele (CA) serves as an additional negative control in the reaction containing only the WT allele. Background non-specific PCR products are indicated with an asterisk.
- FIG. 2 J shows the fold change variation in the frequency of capture of each of the 16 dinucleotide signatures relative to the mean dinucleotide capture frequency.
- Oligonucleotides containing distinct dinucleotide signatures are captured using specific adaptors.
- the fraction of captured material is then quantified by qPCR and normalized to the mean value obtained from the capture of all 16 dinucleotide signatures. Error bars indicate the s.d. of 4 independent experiments. Dots represent individual data points.
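The normalization described for FIG. 2 J amounts to dividing each signature's captured fraction by the mean across all signatures. A minimal sketch follows; the function name and the example values are hypothetical and are not data from the disclosure.

```python
from statistics import mean


def fold_change_vs_mean(captured: dict) -> dict:
    """Divide each signature's captured fraction by the mean over all signatures."""
    average = mean(captured.values())
    if average == 0:
        raise ValueError("mean capture frequency is zero; cannot normalize")
    return {signature: value / average for signature, value in captured.items()}


# Hypothetical qPCR-derived capture values for three of the 16 signatures.
example = {"AA": 2.0, "CG": 4.0, "TT": 6.0}
print(fold_change_vs_mean(example))
```

A value above 1 indicates a signature captured more efficiently than average, and a value below 1 indicates the opposite.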
- FIG. 2 K shows the fold change variation in the frequency of capture of dinucleotide signatures with 1 A/T + 1 C/G, 2 A/T or 2 C/G bases relative to the mean dinucleotide capture frequency, determined as described in FIG. 2 J .
- Error bars represent the s.d. of 8 mean values for dinucleotides with 1 A/T + 1 C/G and 4 mean values for dinucleotides with 2 A/T and 2 C/G, as determined in FIG. 2 J .
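The group sizes quoted for FIG. 2 K (8 dinucleotides with 1 A/T + 1 C/G, and 4 each with 2 A/T or 2 C/G) follow directly from enumerating all 16 dinucleotides. A short sketch, with hypothetical function and variable names, makes the partition explicit:

```python
from itertools import product


def composition_class(dinucleotide: str) -> str:
    """Classify a dinucleotide by how many of its bases are C or G."""
    gc_count = sum(base in "CG" for base in dinucleotide)
    return {0: "2 A/T", 1: "1 A/T + 1 C/G", 2: "2 C/G"}[gc_count]


groups = {}
for first, second in product("ACGT", repeat=2):
    signature = first + second
    groups.setdefault(composition_class(signature), []).append(signature)

for label, members in groups.items():
    print(label, len(members), members)
```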
- FIGS. 3 A- 3 E show the detection and quantification of precision genome editing by CRISPR-mediated HDR, base editing and prime editing using DTECT.
- FIG. 3 A shows the schematics of the protocol used to identify genomic changes introduced by CRISPR-dependent HDR, base editing or prime editing.
- In HDR experiments (blue), HEK293T cells were transfected with Cas9 and sgRNA targeting a gene of interest with or without donor DNA molecules.
- In base editing experiments (red), HEK293T cells were transfected with BE3 base editors with either control or base editing sgRNAs.
- Base editing experiments were also conducted in cells stably expressing FNLS-BE3.
- In prime editing experiments (grey), HEK293T cells were transfected with PE2 with or without pegRNA. Genomic DNA was then extracted from cell populations and subjected to DTECT using adaptors specific for WT (green) or edited (purple) variants.
- FIG. 3 B shows the identification by DTECT of WT and HDR-edited (R209fs*6) TP53 alleles (top), WT and base-edited (Q223*) FANCD2 alleles (middle), and WT and prime-edited (CTT_ins) HEK3 alleles (bottom).
- Adaptors specific for the WT (CT, CA, CG; green) or edited (TT, TA; purple) signatures were utilized in DTECT experiments. Captured samples were subjected to analytical (left; 21 cycles) or quantitative PCR (right).
- cells were transfected with Cas9, sgRNA and an ssODN specific for the TP53 locus with or without the HDR stimulatory factor i53.
- the ssODN was omitted in control reactions.
- cells were transfected with BE3 and sgRNA to induce Q223* in FANCD2.
- cells were transfected with PE2 and pegRNA to introduce a CTT insertion in the HEK3 locus.
- FIG. 3 D shows the schematic representation of the experiments conducted to measure the efficiency of precision genome editing in vivo using DTECT. Editing of the mouse liver was performed by hydrodynamic injection of the cytidine base editor (CBE) FNLS-BE3 and an sgRNA to introduce the Pik3ca E545K variant. DTECT (red) and NGS (green) were used to determine the efficiency of editing in the mouse liver sample.
- FIG. 3 E shows the quantification by DTECT (red) and NGS (green) of the Pik3ca E545K variant introduced by CRISPR-mediated base editing in the mouse liver, as shown in FIG. 3 D .
- Error bars indicate the s.d. of 2 independent experiments. Dots represent individual data points.
- FIGS. 4 A- 4 C show the identification of multiple genome editing events in a single locus or distinct loci by DTECT.
- FIG. 4 A shows the detection by PCR (21 cycles) of allelic mixtures induced by CRISPR-mediated base editing events occurring at a CC sequence (green) in the EMX1 gene.
- the sequences of the EMX1 alleles resulting from four possible C->T base transitions (CC, CT, TC, TT) induced by CRISPR-mediated base editing and the adaptors to capture them (GG, AG, GA, AA) are shown.
- HEK293T cells constitutively expressing the cytidine base editor (CBE) FNLS-BE3 were transfected with sgRNA targeting the EMX1 locus.
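The signature-to-adaptor pairs listed for FIG. 4 A (CC, CT, TC, TT captured by GG, AG, GA, AA) are consistent with each adaptor overhang being the reverse complement of the signature, both read 5′ to 3′. The sketch below encodes that reading; it is inferred from the listed pairs rather than stated as a rule in the text, and the function name is hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def adaptor_for_signature(signature: str) -> str:
    """Return the adaptor dinucleotide (5'->3') expected to capture a signature."""
    if len(signature) != 2 or set(signature) - set("ACGT"):
        raise ValueError(f"not a DNA dinucleotide: {signature!r}")
    # Complement each base, then reverse so the result also reads 5'->3'.
    return signature.translate(_COMPLEMENT)[::-1]


emx1_alleles = ["CC", "CT", "TC", "TT"]
adaptors = {allele: adaptor_for_signature(allele) for allele in emx1_alleles}
print(adaptors)
```

With this mapping, the unedited CC allele and the three possible C->T edited alleles each pair with a distinct adaptor, which is what allows the allelic mixture to be resolved in separate capture reactions.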
- FIG. 4 B shows the schematics of the experiments conducted to detect multiple simultaneously induced variants using DTECT.
- HEK293T cells constitutively expressing the base editor FNLS-BE3 were transfected with two sgRNAs to introduce simultaneously the BRCA1 E638K and the BRCA2 E2772K mutations by CRISPR-mediated base editing.
- FIG. 4 C shows the detection of multiple precision genome editing events introduced by CRISPR-mediated base editing in HEK293T cell populations, as illustrated in FIG. 4 B .
- WT and edited BRCA1 and BRCA2 alleles captured using adaptors specific for the WT (TG, AG; green) or edited (TA, AA; purple) alleles were subjected to analytical (left; 21 cycles) or quantitative PCR (right).
- FIGS. 5 A- 5 J show the DTECT-mediated identification of clinically relevant BRCA1/2 mutations generated by precision genome editing and genotyping of cell lines and animal models carrying BRCA1 or BARD1 mutations.
- FIG. 5 A shows the schematic representation of the human BRCA1 protein. BRCA1 domains and ClinVar BRCA1 mutations generated in this study are indicated.
- FIG. 5 B shows the quantification using DTECT (red) and NGS (green) of the editing efficiency by which 10 BRCA1 mutations are introduced into HEK293T cells by CRISPR-mediated base editing.
- Histograms show the mean frequency of the indicated variants estimated by DTECT and error bars represent the s.d. from 2 independent DTECT assays for the same Acul-tagged amplicon. n.d.: not determined, due to sequencing failure.
- FIG. 5 C shows the analytical detection of the indicated BRCA1 mutations in HEK293T cell populations by DTECT (21 PCR cycles) using adaptors specific for WT (green) or mutant (purple) alleles.
- FIG. 5 D shows the schematic representation of the human BRCA2 protein. BRCA2 domains and ClinVar BRCA2 mutations generated in this study are indicated.
- FIG. 5 E shows the quantification using DTECT (red) and NGS (green) of the editing efficiency by which 13 BRCA2 mutations are introduced into HEK293T cells by CRISPR-mediated base editing, as described in FIG. 5 B .
- FIG. 5 F shows the analytical detection of the indicated BRCA2 mutations in HEK293T cell populations by DTECT (21 PCR cycles) using adaptors specific for WT (green) or mutant (purple) alleles. Experiments were conducted as in FIG. 5 C .
- FIG. 5 G shows the genotyping by DTECT-based analytical PCR (18 cycles) of single clones carrying WT and/or BRCA1 E638K mutant alleles derived from the BRCA1 E638K mutant cell population shown in FIG. 5 C .
- WT #4, not edited
- heterozygous #1
- homozygous #2
- FIG. 5 H shows the Sanger sequencing of WT, heterozygous and homozygous mutant amplicons shown in FIG. 5 G .
- the targeted dinucleotide is indicated in green and part of the sequence of the Acul-tagging primer is indicated in purple.
- FIG. 5 I shows the genotyping by DTECT-based analytical PCR of Bard1 S563F (left) and Brca1 S1598F (right) knock-in mutant mice (Bard1, 18 PCR cycles; Brca1, 20 PCR cycles).
- gDNA for DTECT analysis was obtained from mouse tail samples. WT (Bard1 #8 and Brca1 #5), heterozygous (Bard1 #2 and Brca1 #2) and homozygous (Bard1 #3) mutant mice identified by DTECT are indicated. No homozygous Brca1 S1598F mutant mice were identified in the analyzed mouse litters due to sub-Mendelian birth ratios (Billing et al., 2018).
- FIG. 5 J shows the Sanger sequencing of WT, heterozygous and homozygous mutant amplicons shown in FIG. 5 I .
- FIGS. 6 A- 6 D show the detection of oncogenic signatures in human clinical samples using DTECT.
- FIG. 6 A shows the schematic representation of the experiments conducted on ALL patient-derived samples. Bone marrow samples from ALL patients were collected at diagnosis and after chemotherapy. PDXs were generated from the patient samples. The genomic DNA was recovered from the patient samples and PDX mouse models and subjected to analytical and quantitative detection of NT5C2 oncogenic mutations using DTECT.
- FIG. 6 B provides the heat map showing the detection of NT5C2 oncogenic mutations in patient samples and a control sample using DTECT.
- Bone marrow samples from 5 patients were collected; genomic DNA was prepared and tested for the presence of 3 frequent NT5C2 mutations responsible for relapse to chemotherapy.
- a non-patient-derived gDNA sample was utilized as a control to estimate the levels of non-specific background in the DTECT assay. Data are shown as fold change in the frequency of mutant signatures in the patient samples relative to the control sample.
- FIG. 6 C shows the graphical representation of the frequency of NT5C2 mutations determined by DTECT (red) and NGS (green) in the 5 human patient samples analyzed in FIG. 6 B .
- Error bars indicate the s.d. of 2 independent DTECT replicates.
- FIG. 6 D shows the analytical and quantitative detection of the NT5C2 R367Q mutation in PDX models generated from ALL tumors of patients #2, #4 and #5 at diagnosis and after chemotherapy relapse.
- WT and mutant variants were captured using adaptors specific for the WT (GA, green) or mutant (AA, purple) allele and subjected to analytical (left; 18 PCR cycles) and quantitative PCR (right).
- FIG. 7 shows the DTECT applications for the detection of precision genome editing and genetic variation. It shows the schematic representation of examples of targeted dinucleotide signatures generated by single base edits, small insertions and deletions that can be detected using DTECT. Examples of adaptors that can be used to detect the indicated genome editing events are shown on the right.
- FIGS. 8 A- 8 D show the features of type IIS restriction enzymes compatible with DTECT and schematic representation of the Acul digestion pattern.
- FIG. 8 A shows the representation of two key features of type IIS restriction enzymes compatible with DTECT: 1) Binding of a single recognition motif (green); 2) Cleavage of a targeted DNA sequence (blue) far from the recognition motif.
- FIG. 8 B shows the representation of the pattern of digestion of a type IIS enzyme, including the main digestion product and a cleavage byproduct due to slippage activity.
- FIG. 8 C shows the graphical representation of the number of type IIS enzymes as a function of the distance between their recognition motif and cleavage site.
- FIG. 8 D shows the pattern of cleavage of the type IIS enzyme Acul.
- Acul cleaves DNA products 14/16 bp away from its recognition site (green), leaving a 3′-overhang of 2 DNA bases (blue).
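- As a minimal illustrative sketch (not part of the disclosed protocol), the cleavage geometry described above can be modeled on a top-strand sequence. The sketch assumes that the bottom strand is cut 14 bases and the top strand 16 bases downstream of the 5′-CTGAAG-3′ motif, so that the exposed 2-base 3′-overhang corresponds to top-strand positions 15 and 16 after the motif. Slippage events are ignored, and the function names are hypothetical.

```python
MOTIF = "CTGAAG"  # Acul recognition motif, as given in the disclosure
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def exposed_signature(top_strand: str) -> str:
    """Return the 2-base 3' overhang left on the motif-containing fragment.

    Assumption: cuts occur 14 bases (bottom strand) and 16 bases (top strand)
    downstream of the first motif occurrence; slippage is not modeled.
    """
    start = top_strand.find(MOTIF)
    if start < 0:
        raise ValueError("no recognition motif found")
    motif_end = start + len(MOTIF)
    bottom_cut = motif_end + 14
    top_cut = motif_end + 16
    if top_cut > len(top_strand):
        raise ValueError("sequence too short downstream of the motif")
    return top_strand[bottom_cut:top_cut]


def matching_adaptor_overhang(signature: str) -> str:
    """Return the adaptor 'NN' overhang (5'->3') that base-pairs with the signature."""
    return "".join(COMPLEMENT[base] for base in reversed(signature))


# Hypothetical amplicon: motif, 14 spacer bases, then the targeted dinucleotide "TT".
amplicon = "GGG" + MOTIF + "A" * 14 + "TT" + "GGGG"
signature = exposed_signature(amplicon)
adaptor_nn = matching_adaptor_overhang(signature)
```

- In this sketch a TT signature pairs with an AA adaptor and a CC signature with a GG adaptor, consistent with the adaptor pairings described for FIGS. 9 A and 9 B .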
- FIGS. 9 A- 9 C show the Sanger sequencing reads of captured Acul-digested DNA fragments and validation of the adaptor library.
- FIGS. 9 A and 9 B show the Sanger sequencing reads of PCR amplicons of Acul-digested DNA products containing the TT ( FIG. 9 A ) and CC ( FIG. 9 B ) signatures shown in FIG. 2 B , which have been captured using AA or GG adaptors.
- The DNA sequences of PCR primers (red), genomic locus (purple), targeted dinucleotides (blue), Acul motif (green) and adaptors (brown) are shown.
- FIG. 9 C shows the PCR amplification (18 cycles) of captured Acul-digested DNA products by DTECT using specific (green) and non-specific (blue) DNA adaptors. Each of the 16 adaptors was tested for its ability to capture two independent dinucleotide signatures (#1 and #2).
- FIGS. 10 A- 10 F show the identification of WT and STOP alleles in mixed solutions and quantification of non-specific dinucleotide capture and ligation efficiency in DTECT assays.
- FIG. 10 A shows the schematics of the protocol used to identify and quantify WT and STOP alleles in mixed solutions, as shown in FIGS. 2 G- 2 I .
- Cells were transfected with the cytidine base editor (CBE) BE3 and an sgRNA to induce a STOP codon (sgSTOP) using iSTOP.
- WT and STOP alleles were then cloned and mixed at different WT:STOP ratios, as indicated in FIG. 2 G .
- DTECT was then used to capture WT and STOP signatures using adaptors specific for the WT (green) or STOP (purple) allele, as well as non-specific adaptors (blue). Captured material was then subjected to analytical or quantitative PCR.
- FIG. 10 B shows the Sanger sequencing reads of WT and STOP alleles of SPRTN, SMARCAL1 and PIK3R1.
- The targeted dinucleotide signature is shown in green and the edited cytidine base (C->T) is indicated by the blue arrow.
- FIG. 10 C shows the representation of the Acul-tagging primers used to detect the WT and STOP alleles of the SPRTN gene.
- The targeted dinucleotides are shown in blue, the edited base is indicated with an asterisk, the PAM sequence is shown in red and part of the Acul-tagging primer sequence is shown in purple.
- FIG. 10 D shows the PCR amplification (25 cycles) of WT and STOP SPRTN alleles (arrow) captured using DTECT from WT:STOP allele mixtures (i.e., 100:0 and 99:1).
- An adaptor (AG) specific for the STOP SPRTN allele is utilized in the capture reaction, along with an adaptor specific for the WT allele (GG; positive control) and a non-specific adaptor (TT; negative control). Background non-specific PCR products are indicated with an asterisk.
- FIG. 10 E shows the frequency of non-specific dinucleotide capture for each of the 16 adaptors used for DTECT.
- Adaptors containing the indicated dinucleotide sequences were utilized to capture Acul-digested DNA fragments with non-complementary dinucleotides and the frequency of non-specific dinucleotide capture was quantified by qPCR.
- Mean frequency of non-specific dinucleotide capture is shown for 2-6 independent DNA ligation reactions using DNA fragments with distinct non-complementary dinucleotides.
- Adaptors complementary to +1 and -1 Acul-dependent slippage events were excluded from the analysis.
- FIG. 10 F shows the time course experiment to measure the efficiency of the ligation of Acul-digested products to DNA adaptors.
- Acul-digested products from 3 independent targets (SMARCAL1, SPRTN and PIK3R1), DNA adaptors and T4 ligase were incubated for 5 min, 1 hour or 16 hours, and the captured material was quantified by qPCR.
- A sample without T4 ligase was used as a negative control.
- The percentage of captured material at the different time points was obtained by normalization to the amount of captured material upon a 16-hour ligation reaction. Error bars represent the s.d. of 2 independent experiments.
- FIGS. 11 A- 11 J show the detection of CRISPR-mediated HDR and base editing events by DTECT, NGS and RFLP assays.
- FIGS. 11 A- 11 D show the detection by analytical PCR (20 or 21 cycles) of WT and HDR-edited EMX1 ( FIG. 11 A ), JAK2 ( FIG. 11 B ), HBB ( FIG. 11 C ) and BRCA2 ( FIG. 11 D ) alleles captured using adaptors specific for the WT (green) or edited (purple) alleles.
- HEK293T cells were transfected with Cas9, sgRNA and an HDR donor (ssODN) with or without the HDR stimulatory factor i53. The ssODN was omitted in control reactions.
- ssODNs introduce a Pmel site in EMX1 and JAK2, a sickle cell anemia mutation in HBB (i.e., G6V), and a breast cancer-associated small tandem duplication in BRCA2 (dupAGAAGAT).
- FIG. 11 E shows the quantification of the efficiency of the insertion of the short tandem duplication dupAGAAGAT in the BRCA2 locus, as determined by NGS.
- The pie chart shows the distribution of NGS reads corresponding to HDR- and/or NHEJ-mediated repair events (HDR, red; NHEJ, blue; mixed HDR/NHEJ, green; unedited, brown) occurring at the BRCA2 locus in HEK293T cells transfected with Cas9/sgRNA and ssODN donor, with or without i53.
- The BRCA2 locus was amplified by PCR and subjected to NGS.
- The NGS reads were analyzed by CRISPResso.
- FIG. 11 F shows the RFLP assay to monitor the gain of a Pmel restriction site introduced by ssODN-mediated HDR in the EMX1 and JAK2 loci under the same experimental conditions shown in FIG. 11 A and FIG. 11 B .
- Digested (edited) and undigested (WT) DNA products are indicated by arrows.
- FIGS. 11 G- 11 H show the RFLP assays to monitor the loss of Ncol ( FIG. 11 G ) or Taqal ( FIG. 11 H ) restriction sites in the HBB and TP53 loci, respectively, resulting from the insertion of the G6V and R209fs*6 mutations under the same experimental conditions shown in FIG. 11 C and FIG. 3 B .
- Digested (WT) and undigested (edited) DNA products are indicated by arrows.
- FIG. 11 I shows the detection of WT and nonsense mutant TIMELESS, SLX4 and FANCM alleles by DTECT using adaptors specific for the WT (green) or edited (purple) signatures.
- Experiments were performed in cells transfected with the cytidine base editor BE3 and sgRNA to induce the indicated nonsense mutations, which were detected by analytical (left; 21 cycles) or quantitative PCR (right).
- FIG. 11 J shows the detection of WT and nonsense mutant TCOF1 alleles by DTECT (21 PCR cycles) using adaptors specific for the WT (GG, green) or edited (AG, purple) allele.
- FIGS. 12 A- 12 B show the comparative analysis of DTECT-, Sanger- and NGS-based estimations of the frequency of genetic variants generated by precision genome editing.
- FIG. 12 A shows the graphical representation of the frequency of mutations introduced by CRISPR-dependent HDR and base editing in human and mouse cells, and intestinal organoids.
- The FANCF, Pik3ca and Apc loci were edited in biological duplicate or triplicate using multiple base editors, and the resulting edited samples were previously described (Zafra et al., 2018).
- The BRCA1/2 loci were edited using BE3.
- The frequency values were determined by both DTECT (red) and NGS (green). NGS was conducted on standard PCR amplicons (FANCF, Pik3ca and Apc) or Acul-tagged amplicons (BRCA1/2) of the edited loci. Error bars represent the s.e.m. of 2-5 independent DTECT assays per edited sample. The same frequency values are plotted in the graphs shown in FIG. 3 C .
- FIG. 12 B shows the graphical representation of the correlation between technical duplicates obtained by DTECT (red), EditR (green) or ICE (blue). Each dot represents a distinct BRCA1/2 variant introduced in cells by precision genome editing.
- Technical duplicates of DTECT assays correspond to two independent ligation reactions for the same Acul-digested amplicon and Sanger-based technical duplicates correspond to two independent sequencing reactions for the same PCR amplicon.
- FIGS. 13 A- 13 C show the detection of base editing byproducts and clinically relevant BRCA1/2 mutations introduced by precision genome editing.
- FIG. 13 A shows the detection by analytical PCR (21 cycles) of allelic mixtures induced by CRISPR-mediated base editing events occurring at a CC sequence in the EMX1 gene, as shown in FIG. 4 A .
- HEK293T cells constitutively expressing the base editor FNLS-BE3 were transfected with a control sgRNA (top) or an sgRNA targeting the EMX1 locus (bottom). All possible 16 adaptors were used to capture EMX1 variants.
- Adaptors that capture the WT allele (GG) and +1 Acul slippage event (CG) are shown in green and orange.
- Adaptors that capture C->T base editing events (AA, AG, GA) and C->A and C->G base editing byproducts (AC, AT, CA, CG, GC) are also shown.
- FIGS. 13 B- 13 C show the analytical detection of the indicated BRCA1 ( FIG. 13 B ) and BRCA2 ( FIG. 13 C ) mutations in HEK293T cell populations by DTECT (21 PCR cycles) using adaptors specific for WT (green) or mutant (purple) alleles. Experiments were conducted as in FIGS. 5 C and 5 F .
- FIGS. 14 A- 14 B show the genotyping of mutant cellular clones and knock-in mice using DTECT.
- FIG. 14 A shows the genotyping by DTECT-based analytical PCR (20 cycles) of HEK293T clones (17) carrying WT and/or BRCA1 E638K mutant alleles or base editing byproducts derived by single cell dilution from the BRCA1 E638K cell population shown in FIG. 5 C .
- Heterozygous and homozygous mutant clones are indicated in blue and purple, respectively.
- WT clones are indicated in green and a clone with a base editing byproduct is indicated in orange.
- Clones #1, #2, #4 and control (CTL) are also shown in FIG. 5 G . Quantification of each BRCA1 variant by qPCR is also shown (bottom).
- HEK293T cells have 4 BRCA1 alleles. Error bars correspond to two independent experiments.
- FIG. 14 B shows the genotyping by DTECT-based analytical PCR of Bard1 S563F (top) and Brca1 S1598F (bottom) knock-in mutant mice (Bard1, 18 PCR cycles; Brca1, 20 PCR cycles).
- DTECT assays were conducted on gDNA isolated from mouse tail samples. Heterozygous and homozygous mutant mice are indicated in blue and purple, respectively, and WT mice are indicated in green. No homozygous Brca1 S1598F mutant mice were identified in the analyzed mouse litters due to sub-Mendelian birth ratios (Billing et al., 2018). Mice #1, #2, #3 and #8 (Bard1), and #1, #2, #5 (Brca1) are also shown in FIG. 5 I .
- FIGS. 15 A- 15 D show the detection of oncogenic mutations in a mouse model of myeloproliferative neoplasm and in ALL patients using DTECT.
- FIG. 15 A shows the schematics of the experiments conducted to detect the Jak2 V617F mutation in a mouse model of myeloproliferative neoplasm.
- Peripheral blood was collected from mice transplanted with a mixture of bone marrow cells either wild-type (WT) or carrying an inducible Jak2 V617F mutant allele (Mx1-Cre+;Jak2 V617F/+ ).
- FIG. 15 B shows the schematic representation of 4 Acul-induced dinucleotide signatures that enable the identification of Jak2 WT and V617F alleles.
- The G in red is replaced by a T in the Jak2 V617F mutant allele.
- FIG. 15 C shows the identification by DTECT-based analytical PCR (20 cycles) of the Jak2 V617F mutation in the blood of a mouse model of myeloproliferative neoplasm generated as described in FIG. 15 A .
- The Jak2 V617F mutation was identified using the 4 independent dinucleotide signatures shown in FIG. 15 B .
- gDNA samples from peripheral blood of WT mice were used as controls (#1 and #2) in this experiment. Sanger sequencing (bottom) was conducted to confirm the results obtained using DTECT.
- FIG. 15 D shows the analytical detection of the indicated NT5C2 mutations in ALL patient samples by PCR (20 cycles). The frequency of the indicated mutations in the same patient samples is shown in FIG. 6 B .
- FIGS. 16 A- 16 C show the analysis of ClinVar variants with proximal genomic Acul motifs compatible with DTECT.
- FIG. 16 A shows the bioinformatic analysis of ClinVar database variants (425,580) with (80,326; blue) or without (345,254; green) genomic Acul sites in close proximity (+/- 100 bp).
- Variants (green, right pie chart) with a single Acul motif located 35 bp to 100 bp away on the 3′- (29,848) or 5′- (29,291) side can be detected using DTECT, as illustrated in FIG. 16 C .
- Variants (red, right pie chart) with an Acul motif located <35 bp away (18,739) or with proximal Acul motifs on both sides (2,448) cannot be detected using DTECT.
- FIG. 16 B shows the percentage and number of ClinVar variants that can (95.02%, 404,393) or cannot (4.98%, 21,187) be detected using DTECT.
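- The variant counts reported for FIGS. 16 A and 16 B are mutually consistent, which can be checked with a short calculation. The following sketch uses only the counts stated above; the variable names are illustrative.

```python
# Counts as stated in the descriptions of FIGS. 16 A and 16 B.
total_variants = 425_580
no_proximal_site = 345_254      # no Acul site within +/- 100 bp
single_site_3prime = 29_848     # single motif 35-100 bp away, 3' side
single_site_5prime = 29_291     # single motif 35-100 bp away, 5' side
site_too_close = 18_739         # motif closer than 35 bp
sites_on_both_sides = 2_448     # proximal motifs on both sides

with_proximal_site = (
    single_site_3prime + single_site_5prime + site_too_close + sites_on_both_sides
)
detectable = no_proximal_site + single_site_3prime + single_site_5prime
not_detectable = site_too_close + sites_on_both_sides

pct_detectable = round(100 * detectable / total_variants, 2)
pct_not_detectable = round(100 * not_detectable / total_variants, 2)
```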
- FIG. 16 C shows the schematic representation of genomic loci with or without an Acul site in close proximity to the edited site.
- Detection of the edited site can be obtained by designing 2 Acul-tagging primers that anneal to the targeted locus between the genomic Acul site and the edited base(s). This approach allows the capture of two independent dinucleotide signatures for each targeted site with one proximal Acul site. Four independent dinucleotide signatures can be captured for targeted sites with no proximal Acul sites.
- FIGS. 17 A- 17 B show the detection of Acul slippage events by DTECT.
- FIG. 17 A shows the schematics of targeted dinucleotides (blue) and +1 (red) and -1 (orange) Acul slippage events (left). Detection of Acul slippage byproducts by DTECT (22 PCR cycles) using adaptors complementary to the targeted dinucleotide signatures (green) and to signatures generated by Acul +1 (red) or -1 (orange) slippage (right). A non-specific adaptor (blue) is used as a control.
- FIG. 17 B shows the schematic representation of DNA digestion products generated by precise Acul cleavage (green) or +1 slippage (red) occurring at wild-type and mutant alleles.
- the dinucleotide signatures generated as a result of Acul slippage byproducts and the complementary adaptors to capture them are indicated.
- FIGS. 18 A- 18 D show the design of DTECT assays to avoid indel interference in CRISPR-mediated HDR experiments.
- FIG. 18 A shows the InDelphi prediction (https://indelphi.giffordlab.mit.edu) of indel-containing alleles in the TP53 locus.
- the dinucleotides targeted to simultaneously introduce the TP53 R209fs*6 mutation and a G > T mutation in the PAM by CRISPR-dependent HDR are indicated in green and red, respectively.
- the Cas9 cleavage site is indicated in black.
- the dinucleotide signatures captured to detect the TP53 R209fs*6 and PAM mutations are shown in purple.
- the presence of indel interference in the distinct predicted alleles is indicated.
- MH, microhomology.
- FIG. 18 B shows the DTECT-based quantification of the TP53 R209fs*6 and PAM mutations introduced by HDR using a single ssODN donor template, as shown in FIG. 18 A .
- Adaptors specific for the WT (CT and TG; green and red) or edited (TT; purple) signatures were used for quantification.
- HDR efficiency determined by NGS is also shown.
- FIG. 18 C shows the schematic representation of the design of DTECT experiments to avoid interference of indels formed at DSBs during CRISPR-mediated HDR.
- Cas9-mediated DSBs are induced at a distance from a targeted dinucleotide (green) sufficient to avoid mutation of the targeted dinucleotide by indels (blue).
- the pattern of indel mutations is predicted using the InDelphi website.
- FIG. 18 D shows the schematics of alleles generated by CRISPR-mediated HDR, including the unedited allele (green), indel-containing alleles (blue) and the HDR-edited allele (purple).
- DTECT captures both the unedited and the indel-containing alleles using an adaptor specific for the WT dinucleotide signature, while the HDR-edited allele is captured using an adaptor specific for the edited dinucleotide signature.
- the capture of indel-containing alleles with a WT adaptor ensures the accurate quantification of the frequency of the HDR-edited allele in the allele population.
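- The reasoning above can be illustrated with a small numerical sketch. It assumes, for illustration only, that the edited-allele frequency is computed as the edited signal divided by the sum of the WT-adaptor and edited-adaptor signals; the allele counts and the function name are hypothetical and are not data from the disclosure.

```python
def edited_fraction(wt_capture: float, edited_capture: float) -> float:
    """Fraction of captured material carrying the edited signature.

    Assumption: both inputs are relative quantities on the same scale
    (for example, derived from qPCR of the two capture reactions).
    """
    total = wt_capture + edited_capture
    if total <= 0:
        raise ValueError("no captured material")
    return edited_capture / total


# Hypothetical allele population after CRISPR-mediated HDR.
alleles = {"unedited": 60, "indel": 30, "hdr": 10}

# Indel-containing alleles keep the WT dinucleotide, so the WT adaptor captures them too.
wt_capture = alleles["unedited"] + alleles["indel"]
with_indels = edited_fraction(wt_capture, alleles["hdr"])

# If indel alleles were lost from the WT capture, the HDR frequency would be overestimated.
without_indels = edited_fraction(alleles["unedited"], alleles["hdr"])
```

- In this hypothetical population the HDR frequency is 10% when indel-containing alleles are counted with the WT signature, and it would be overestimated at about 14% if they were not captured.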
- The present disclosure provides a versatile method that uses standard molecular biology techniques to detect variants introduced by precision genome editing or resulting from genetic variation.
- This detection method is designated Dinucleotide signaTurE CapTure (DTECT).
- DTECT can readily identify oncogenic mutations in cancer mouse models, patient-derived xenograft models and cancer patient samples.
- One embodiment of the present disclosure is a DNA adaptor comprising: (a) one strand with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); and (b) one complementary strand with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246).
- the DNA adaptor is labeled with a detection molecule.
- The detection molecule can be a radiolabel, a fluorescent label, a biotinylated label, a non-fluorescent label, an enzyme, a hapten, a phosphorescent molecule, a chemiluminescent molecule, a chromophore, a luminescent molecule, a photoaffinity molecule, a color particle or a ligand.
- Another embodiment of the present disclosure is a method of preparing a DNA adaptor disclosed herein, comprising: (a) synthesizing one constant oligonucleotide with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); (b) synthesizing one complementary oligonucleotide with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246); (c) mixing the constant and complementary oligonucleotides; and (d) annealing the mixture to obtain the DNA adaptor.
- Another embodiment of the present disclosure is a library of DNA adaptors prepared by methods disclosed herein, the library comprises 16 DNA adaptors, wherein each DNA adaptor has a different “NN”.
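- As an illustrative sketch, the 16 complementary oligonucleotides of the library can be enumerated in silico. The constant strand is SEQ ID NO: 230 as given above; each complementary strand is taken to be the reverse complement of the constant strand followed by a 3′ “NN” dinucleotide, which reproduces the complementary sequence stated above. The function names and the dictionary layout are hypothetical conveniences.

```python
from itertools import product

# Constant strand, SEQ ID NO: 230.
CONSTANT = "CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG"
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def build_adaptor_library(constant: str = CONSTANT) -> dict:
    """Map each of the 16 'NN' dinucleotides to its complementary oligonucleotide."""
    paired_region = reverse_complement(constant)
    return {
        "".join(nn): paired_region + "".join(nn)
        for nn in product("ACGT", repeat=2)
    }


library = build_adaptor_library()
```

- Each entry of `library` pairs with the constant strand over 58 bases and leaves the two “NN” bases as a 3′ single-stranded overhang available for ligation.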
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Type IIS restriction enzyme-tagging primer, comprising: (i) extracting genomic DNA from a biological sample of interest; (ii) synthesizing the Type IIS restriction enzyme-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Type IIS restriction enzyme-tagging primer and a reverse primer; and (iv) purifying a Type IIS restriction enzyme-tagged genomic amplicon; (b) digesting the Type IIS restriction enzyme-tagged genomic amplicon with the Type IIS restriction enzyme; (c) isolating the smaller DNA fragment containing a genomic signature of interest exposed in a 3′ single-stranded overhang; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment containing the 3
- the genetic modification is selected from a base change, a deletion, or an insertion. In some embodiments, the genetic modification is selected from a single genomic change or multiple genomic changes. In some embodiments, the multiple genomic changes can occur within a single locus or distinct loci.
- the Type IIS restriction enzyme is selected from Acul, Bpml, BpuEI, BsgI, Mmel and NmeAIII. In some embodiments, the Type IIS restriction enzyme is selected from Acul and BpuEI. In some embodiments, the Type IIS restriction enzyme is Acul.
- the Type IIS restriction enzyme-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; (b) a complementary sequence of the genomic locus of interest on the 5′ side; (c) a recognition motif of the Type IIS restriction enzyme that is positioned at a predicted distance from its cleavage site to generate the genomic signature of interest; and (d) a complementary sequence of the genomic locus of interest on the 3′ side.
- the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
- the non-complementary handle sequence can have any suitable length. In some embodiments, the non-complementary handle sequence is 25 bp. In some embodiments, the non-complementary handle sequence can have any suitable sequence. In some embodiments, the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3).
- step (d)(ii) of the methods disclosed above is carried out by T4 DNA ligase.
- a further embodiment of the present disclosure is a kit for detecting a genetic modification of interest, comprising a specially designed Type IIS restriction enzyme-tagging primer disclosed herein, and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
- the Type IIS restriction enzyme is Acul.
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (b) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (c) isolating the smaller DNA fragment containing a genomic signature of interest produced by Acul-digestion; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (
- the Acul-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; and (b) a complementary sequence of the genomic locus of interest containing an Acul motif (5′-CTGAAG-3′) positioned 14 bp upstream from the genomic locus of interest.
- the Acul-tagging primer can have any suitable length. In some embodiments, the Acul-tagging primer is 60 bp.
- the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
- the non-complementary handle sequence can have any suitable length. In some embodiments, the non-complementary handle sequence is 25 bp.
- the complementary sequence has the structure of: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C, depending on the DNA sequence of the genomic locus of interest.
- the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3) and the complementary sequence is 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C.
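- The primer structure described above (25-base handle, N(15), CTGAAG, N(14)) can be sketched as follows. This is a hedged illustration only: it assumes that the motif overwrites six genomic bases in register, so that the 14 bases following the motif are the genomic bases immediately upstream of the targeted dinucleotide. The function name, the index convention and the example locus are hypothetical.

```python
HANDLE = "GCAATTCCTCACGAGACCCGTCCTG"  # SEQ ID NO: 3 (25 bases)
MOTIF = "CTGAAG"                      # Acul recognition motif
SPACER_LEN = 14                       # bases between the motif and the target
UPSTREAM_LEN = 15                     # N(15) variant of the complementary sequence


def design_tagging_primer(locus: str, target_index: int) -> str:
    """Build handle + N(15) + CTGAAG + N(14) for a targeted dinucleotide.

    target_index is the 0-based position of the first targeted base in `locus`.
    Assumption: the motif replaces the six genomic bases at its position.
    """
    spacer_start = target_index - SPACER_LEN
    motif_start = spacer_start - len(MOTIF)
    upstream_start = motif_start - UPSTREAM_LEN
    if upstream_start < 0:
        raise ValueError("not enough sequence upstream of the target")
    upstream = locus[upstream_start:motif_start]
    spacer = locus[spacer_start:target_index]
    return HANDLE + upstream + MOTIF + spacer


# Hypothetical locus: 40 upstream bases followed by the targeted dinucleotide "CC".
locus = "ACGTTGCAAGCTTGCATGCCTGCAGGTCGACTCTAGAGGA" + "CC" + "TTGGAACC"
primer = design_tagging_primer(locus, target_index=40)
```

- With these assumptions the primer is 60 bases long, in agreement with the 60 bp primer length stated above.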
- step (d)(ii) of the methods disclosed above is carried out by T4 DNA ligase.
- An additional embodiment of the present disclosure is a kit for detecting a genetic modification, comprising a specially designed Acul-tagging primer and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
- Another embodiment of the present disclosure is a method for quantifying a genomic variant in a biological system, comprising the steps of: (a) obtaining a sample from the biological system; (b) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii)synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (c) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the Acul-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a
- the genomic variant is generated by precision genome editing.
- the precision genome editing is CRISPR-dependent homology-directed repair, base editing or prime editing.
- the biological system is a mammalian cell line, an organoid, or a tissue.
- the quantification in step (f) of the methods disclosed above is carried out by quantitative PCR (qPCR).
- Still another embodiment of the present disclosure is a method for identifying and quantifying an oncogenic mutation of interest in a biological sample, comprising the steps of: (a) obtaining a biological sample; (b) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (c) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the Acul-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing
- the biological sample is obtained from a cancer animal model, a patient-derived xenograft (PDX), or a human cancer patient sample.
- the quantification in step (g) of the methods disclosed above is carried out by quantitative PCR (qPCR).
- a further embodiment of the present disclosure is a process for marker-free detection of a precision genome editing event comprising carrying out Dinucleotide signaTurE CapTure (DTECT) on a nucleic acid sequence of interest.
- DTECT can also be used to detect genetic signatures in any organism, for example, a virus.
- A method for detecting a virus variant of interest comprising the steps of: (a) obtaining a nucleic acid of the virus variant of interest from a biological sample; and (b) if the nucleic acid is DNA, carrying out Dinucleotide signaTurE CapTure (DTECT) to detect the variant of interest; or (c) if the nucleic acid is RNA, converting it to DNA by reverse transcription PCR (RT-PCR) and then carrying out DTECT to detect the variant of interest.
- This detection method is applicable to any type of virus including but not limited to a DNA virus, an RNA virus, a retrovirus, etc.
- the virus is an RNA virus.
- the virus is SARS-CoV-2.
- Plasmids for DTECT quantification and expression of base editing sgRNAs targeting BRCA1, BRCA2 and FANCD2 have been deposited to Addgene (#139321-139333, and 139511).
- HEK293T and DLD1 cell lines were obtained from ATCC.
- Cells were cultured in DMEM (ThermoFisher Scientific) supplemented with 10% Fetalgro bovine growth serum (BGS, RMBIO) and 1% penicillin-streptomycin (ThermoFisher Scientific). Cells were grown at 37° C. with 5% CO2 and tested regularly for mycoplasma.
- NIH/3T3 were maintained in DMEM supplemented with 10% bovine calf serum. Organoids were isolated and cultured as previously described (Zafra et al., 2018).
- HEK293T cells were infected with a lentivirus expressing the above construct.
- Viruses were produced in HEK293T in 6-well plates by transfecting 2 μg of FNLS-BE3-P2A-BlastR, 0.2 μg of Tat, 0.2 μg of Gag/Pol, 0.2 μg of Rev, 0.4 μg of VSV-G expressing plasmids in 250 μl of DMEM without serum.
- 9 μl of TransIT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature.
- The DNA transfection reagent mix was added dropwise to the cells and incubated at 37° C. with 5% CO2. The next day the cell medium was replaced and cells were incubated for 48 hours. The medium containing lentiviruses was then collected and utilized to infect new HEK293T cells. 48 hours after infection, blasticidin was added to the medium until the uninfected control cells were killed. FNLS-BE3 expression was determined by western blot and the base editing activity of the construct was tested using previously validated sgRNAs. Single HEK293T clones were selected for high base editing efficiency. Clones were isolated by trypsinization of the initial cell population into individual cells.
- Cell density was evaluated by counting the cells with a hemocytometer and cells were diluted to approximately 0.13 cells/μl, equivalent to 20 cells per 150 μl. Serial dilutions were prepared and 150 μl of the diluted cell mixture were seeded into 96-well plates. Single clones were expanded and further examined for FNLS-BE3 expression and activity.
- HEK293T cells were seeded at 50%-70% confluency into 24-well plates and reverse transfected with 0.25 μg of sgRNA and 0.25 μg of Cas9 expressing plasmid (Addgene #42230) with or without 0.5 μl of ssODN (40 μM) into 100 μl of DMEM without Fetalgro BGS and antibiotics. 3 μl of TransIT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. Experiments involving i53 were done by adding 0.25 μg of i53 (Addgene #77939) to the transfection mixture.
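- The dilution target stated above can be reproduced with simple arithmetic. In the sketch below, the target density comes from the text (20 cells per 150 μl), whereas the stock density and the final volume are hypothetical values chosen only to illustrate the calculation.

```python
# Target from the text: 20 cells per 150 ul.
target_cells = 20
seeding_volume_ul = 150
target_density = target_cells / seeding_volume_ul  # cells per ul

# Hypothetical hemocytometer reading and desired final volume (illustrative only).
stock_density = 500.0      # cells per ul, assumed
final_volume_ul = 15_000   # ul, assumed

# C1 * V1 = C2 * V2, solved for the stock volume V1.
stock_volume_ul = target_density * final_volume_ul / stock_density
medium_volume_ul = final_volume_ul - stock_volume_ul
```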
- the gDNAs of cell populations and individual clones were recovered by resuspending the cell pellets in the Quick Extract DNA Extraction Solution (Epicentre), followed by incubation at 65° C. for 10 min and 95° C. for 5 min.
- The isolated gDNAs were diluted in H2O, quantified using Nanodrop and stored at -20° C. or directly used in PCR reactions.
- For base editing experiments, we used cells constitutively expressing FNLS-BE3 or transfected with pCMV-BE3 (Addgene #73021) and sgRNAs, as described above. Empty plasmids (Addgene #100708) with no sgRNAs were used as controls. To determine the accuracy of the quantification of variant frequency by DTECT ( FIG.
- STOP codons were introduced into SPRTN, SMARCAL1 and PIK3R1 genes using iSTOP, as previously described (Billon et al., 2017).
- the locus was amplified by PCR and cloned into the pCR-Blunt II-TOPO vector (ThermoFisher Scientific).
- the STOP alleles were isolated by PCR amplification using gDNA that was partially edited as template.
- the PCR product was subsequently digested using restriction enzymes that specifically cleave the WT PCR alleles (i.e., Pvull for SPRTN, SfaNI for SMARCAL1 and Taqal for PIK3R1).
- the digestion reaction was loaded on a 2% agarose gel and the undigested PCR products were column purified (Zymoclean #D4008). The purified products were subsequently cloned into the pCR-Blunt II-TOPO vector (ThermoFisher Scientific). Cloned WT and STOP PCR fragments were confirmed by Sanger sequencing and are shown in FIG. 10 B .
- RFLP assays were conducted by digesting PCR amplicons of the edited genomic loci with enzymes that recognize restriction sites created or disrupted by editing of the targeted loci. Restriction digest products were run on 6% TBE polyacrylamide gels.
- HEK293T cells expressing FNLS-BE3 were seeded at 50%-70% confluency into 24-well plates and reverse transfected with 1 µg of sgRNA into 100 µl of DMEM without Fetalgro BGS and antibiotics. 3 µl of TransIT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. The DNA transfection mix was added dropwise to the cells and incubated at 37° C. with 5% CO2 for 4 days. Single clones were generated and the gDNAs of cell populations and individual clones were recovered as described above. Genomic loci were Sanger sequenced by Eton Bioscience or Genewiz. Sanger sequencing data were analyzed using Serial Cloner and viewed with SnapGene Viewer. The sequencing profiles shown in this manuscript were generated by SnapGene Viewer. Quantitative detection of the editing level using the Acul-tagged amplicon was done blindly.
- mice were injected with 0.9% sterile sodium chloride solution containing 20 µg of pLenti-FNLS-P2A-Puro and 10 µg of sgRNA vector.
- the total injection volume corresponded to 20% of the individual mouse body weight and was injected into the lateral tail vein in 5-7 seconds. All animal experiments were authorized by the regional board of Düsseldorf, Germany.
- mice harboring the Brca1 S1598F and Bard1 S563F alleles were previously described (Billing et al., 2018; Shakya et al., 2011).
- Mouse genotyping was performed using DTECT on genomic DNA extracted from mouse tails. Acul-tagging of the targeted loci was performed using 50 ng of gDNA (see DTECT protocol above). All primer sequences are listed in Table S1. Genotyping experiments were conducted blindly.
- 1.5 × 10^6 filtered whole donor Mx1-Cre + ;Jak2 V617F/+ bone marrow cells (CD45.2) were mixed with 1.5 × 10^6 wild-type competitor bone marrow cells (CD45.1) and transplanted via tail vein injection into lethally irradiated (2 × 550 Rad) CD45.1 host mice. Mice were then monitored serially for the development of MPN based on blood counts and donor chimerism by retroorbital bleed draws using heparinized microhematocrit capillary tubes (ThermoFisher Scientific). After 3 consecutive hematocrits of >65%, mice were sacrificed for peripheral blood fluorescence-activated cell sorting (FACS) analysis and DNA extraction.
- All animal procedures were conducted in accordance with the Guidelines for the Care and Use of Laboratory Animals and were approved by the Institutional Animal Care and Use Committees at Memorial Sloan Kettering Cancer Center.
- the conditional Mx1-Cre + ;Jak2 V617F/+ mice are all C57BL/6 background and have been previously described (Mullally et al., 2010).
- Automated peripheral blood counts were obtained using a ProCyte Dx (IDEXX Laboratories) according to the manufacturer’s protocol.
- RBCs were lysed and stained with monoclonal antibodies in PBS plus 1% BSA for 1 hour on ice.
- DAPI was used for live/dead cell analysis.
- Cell populations were analyzed using an LSR Fortessa (Becton Dickinson), and data were analyzed with FlowJo software (Tree Star). DNA extraction was performed using the QIAamp DNA Micro Kit (Qiagen) per manufacturer’s protocol.
- DNA samples from leukemic ALL blasts obtained at diagnosis and after relapse were provided by multiple institutions, as previously described (Oshima et al., 2016). Informed consent was obtained at study entry and samples were collected under the supervision of local Institutional Review Boards for participating institutions and analyzed under the supervision of the Columbia University Irving Medical Center Institutional Review Board. Research was conducted in compliance with ethical regulations. ALL patients received standard combination chemotherapy at diagnosis. Diagnosis and relapse samples were harvested from bone marrow. High molecular weight genomic DNA from matched diagnosis and relapse samples of ALL patients was extracted from patient leukemic blasts or from xenografts using the DNeasy Blood & Tissue Kit (Qiagen) or the AllPrep DNA/RNA Mini Kit (Qiagen).
- sgRNAs were synthesized as complementary oligonucleotides (IDT) compatible with BbsI restriction sites located into the B52 plasmid (Addgene #100708). Oligonucleotides were designed as previously described (Billon et al., 2017). Cloned sgRNAs were verified by Sanger sequencing. Sequences of the sgRNAs are available in Table S1. ssODNs used in HDR experiments were synthesized as ultramer oligos (IDT) and their sequences are available in Table S1.
- the pLenti-FNLS-P2A-Puro plasmid (Addgene #110841) (Zafra et al., 2018) was modified by replacing the puromycin resistance gene with the blasticidin resistance gene. Briefly, the blasticidin resistance gene coding sequence was amplified by PCR and recombined using Gibson assembly into FNLS-BE3-P2A. The FNLS-BE3-P2A-BlastR sequence was verified by Sanger sequencing.
- the Acul-tagging oligonucleotide enables the insertion of an Acul motif (5′-CTGAAG-3′) 14 bp away from a targeted dinucleotide. This motif is inserted as a hairpin in the middle of a sequence complementary to the targeted genomic locus.
- the Acul-tagging oligonucleotide is 60 bp-long and contains a non-complementary handle sequence of 20-25 bp. Common handle sequences used are PB547 (5′-GATCCTCTAGAGTCGACCTG-3′) (SEQ ID NO: 1) or PB1072 (5′-GCAATTCCTCACGAGACCCGTCCTG-3′) (SEQ ID NO: 3) (Table S1).
- the oligonucleotide sequence complementary to the targeted genomic locus plus the Acul motif has the following sequence: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C bases complementary to the targeted locus.
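- The design rule above (handle, then 20 or 15 complementary bases, the CTGAAG motif, and 14 further complementary bases ending just before the targeted dinucleotide) can be sketched in a few lines of Python. This is an illustrative sketch, not the design tool used in the study: the function name, its arguments and the assumption that the two complementary arms are contiguous in the genome (with the motif inserted between them) are all introduced here for illustration.

```python
ACUI_MOTIF = "CTGAAG"                   # recognition motif quoted in the text
PB547_HANDLE = "GATCCTCTAGAGTCGACCTG"   # 20-nt handle (SEQ ID NO: 1)


def design_tagging_oligo(locus, dinuc_index, handle=PB547_HANDLE,
                         left_arm=20, right_arm=14):
    """Return handle + N(left_arm) + CTGAAG + N(right_arm).

    locus       -- strand to be copied, written 5'->3' (hypothetical input)
    dinuc_index -- 0-based index of the first base of the targeted dinucleotide
    Assumption: the two arms are adjacent in the locus and the motif is
    inserted between them, so the right arm ends right before the dinucleotide.
    """
    locus = locus.upper()
    arm_start = dinuc_index - right_arm - left_arm
    if arm_start < 0:
        raise ValueError("not enough sequence upstream of the dinucleotide")
    left = locus[arm_start:arm_start + left_arm]
    right = locus[arm_start + left_arm:dinuc_index]
    return handle + left + ACUI_MOTIF + right
```

- With the 20-nt PB547 handle and a 20-base left arm, or with the 25-nt PB1072 handle and a 15-base left arm, the resulting oligonucleotide is 60 nt long, which matches the length stated above.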
- a set of 17 individual oligonucleotides constitutes the full adaptor library.
- This library contains: a) One constant oligonucleotide with the following sequence: 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTcttctaagaattcgagctcggtacccg-3′ (SEQ ID NO: 230).
- the lowercase nucleotide sequence located at the 3′-end of the constant oligonucleotide corresponds to the handle sequence used to detect the ligated products with either PB548 (5′-cgggtaccgagctcgaattc-3′) (SEQ ID NO: 2) or PB1073 (5′-cgggtaccgagctcgaattcttagaag-3′) (SEQ ID NO: 4); b) 16 variable oligonucleotides that contain a sequence complementary to the constant oligonucleotide plus one of 16 different dinucleotides at their 3′-end.
- variable oligonucleotides have the following sequence: 5′-cgggtaccgagctcgaattcttagaagAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′.
- the adaptor sequences are available in Table S1.
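- As a cross-check of the adaptor sequences given above, the 16 variable oligonucleotides can be regenerated from the constant oligonucleotide: each one is the reverse complement of the constant strand followed by one of the 16 possible 3′ dinucleotides. The sketch below is illustrative only; the function and variable names are not from the source.

```python
from itertools import product

# Constant oligonucleotide (SEQ ID NO: 230) as quoted in the text.
CONSTANT_OLIGO = "CTGGGGCACGGGTAAGAAGCATTCTGTCTCTcttctaagaattcgagctcggtacccg"
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (returned in upper case)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def variable_oligos(constant=CONSTANT_OLIGO):
    """Map each 3' dinucleotide to its variable oligonucleotide sequence."""
    stem = reverse_complement(constant)
    return {a + b: stem + a + b for a, b in product("ACGT", repeat=2)}


library = variable_oligos()
print(len(library))    # number of variable oligonucleotides
print(library["AA"])   # variable strand ending in a 3' AA
```

- The entry ending in AA reproduces the PB987 sequence listed in Table S1 (ignoring letter case).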
- the constant oligonucleotide and each variable oligonucleotide were resuspended at a concentration of 100 µM in H2O.
- the adaptor library was tested at two independent loci, as shown in FIG. 9 C .
- Acul-tagging oligonucleotides targeting the ampicillin resistance gene were designed following the rules detailed above (Table S1).
- the digested plasmid was subsequently purified on column (Zymoclean #D4008) and used as a template in PCR reactions with each Acul-tagging primer and a constant reverse primer (5′-CCAATGCTTAATCAGTGAGG-3′) (SEQ ID NO: 320) located at the 3′-side of the ampicillin resistance gene.
- the PCRs were performed in a 25 µl reaction containing: 1 µM forward and reverse primers, 0.1 mM dNTP (NEB #N0447L), 1X Q5 buffer (NEB), 20 ng of digested pUC19, 1 unit of Q5 polymerase (NEB) and water.
- the PCR program used was the following: 95° C.
- PCR reactions were loaded on a 2% agarose gel, extracted from gel and purified on column (Zymoclean #D4008). Finally, the DTECT protocol was applied as described below. Briefly, 0.5 pmol of Acul-tagging PCR products were digested by Acul for 30 min at 37° C. 10 µl of the digested products were purified with 18 µl of solid phase reversible immobilization magnetic beads (Beckman Coulter #A63881).
- Dinucleotide signature and target site (#1 or #2): specific adaptor, non-specific adaptor
- AA #1 TT, CC
- AA #2 TT, CC
- AC #1 GT, AC
- AC #2 GT, AA
- AG #1 CT, GA
- AG #2 CT, GA
- AT #1 AT, GG
- AT #2 AT, GG
- CA #1 TG, CA
- CA #2 TG
- CA CA
- CC #1 GG, CC
- CC #2 GG, CC
- CG #2 CG, AA
- CT #1 AG, TT
- CT #2
- GA #1 TC, GA
- GA #2 TC, GA
- GC #1 GC, TT
- GC #2 TT
- GG #1 CC, TT
- GT #1 AC, TG
- GT #2 AC, TG
- TA #1 TA, GG
- TA #2 TA, GG
- TC #1 CA, TG
- CA CA
- the ligated products were subsequently detected by PCR amplification using the primers PB547 (5′-gatcctctagagtcgacctg-3′) (SEQ ID NO: 1) and PB1073 (5′-cgggtaccgagctcgaattcttagaag-3′) (SEQ ID NO: 4). All primer sequences are listed in Table S1.
- the measurement of the dinucleotide capture efficiency of each adaptor was determined by ligating the 16 different adaptors to annealed oligonucleotides containing complementary dinucleotides.
- the reverse oligonucleotide (PB1449: 5′-gtagttcgccagttCTTCAGaatagtttgcgcaCAGGACGGGTCTCGTGAGGAATTGC-3′) (SEQ ID NO: 91) was phosphorylated with PNK (NEB).
- the phosphorylation reaction was conducted as follows: 5 µl of PB1449 (100 µM), 4 µl of 5X ligase buffer, 0.5 µl of PNK in a 20 µl reaction. Phosphorylation was obtained upon incubation for 1 hour at 37° C., followed by heat inactivation of PNK for 20 min at 65° C. After incubation, the phosphorylated oligonucleotide PB1449 was annealed to 16 complementary oligonucleotides with the following sequence: 5′-GCAATTCCTCACGAGACCCGTCCTGTGCGCAAACTATTCTGAAGAACTGGCGAACTACNN-3′ (SEQ ID NOs: 231-246).
- For the annealing reaction, 40 µl of 5X ligase buffer and 130 µl of H2O were added to the phosphorylation reaction. 9.5 µl of this mix were used for annealing with 0.5 µl of each of the above 16 oligos (50 µM). Annealing, which was performed as described above for the library of adaptors, resulted in a 5′-phosphorylated double-stranded DNA with an overhang of 2 nucleotides, mimicking the product of Acul digestion.
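- The volumes quoted for the phosphorylation and annealing reactions imply that the two strands are mixed in equal molar amounts. The short calculation below works through that arithmetic; it only restates the numbers given in the text, and the helper name is an illustrative choice.

```python
def pmol(volume_ul, conc_um):
    """Amount in pmol, since 1 µl of a 1 µM solution holds 1 pmol."""
    return volume_ul * conc_um


# Phosphorylation: 5 µl of PB1449 at 100 µM in a 20 µl reaction.
pb1449_total = pmol(5, 100)                      # 500 pmol

# Dilution before annealing: 20 µl + 40 µl buffer + 130 µl H2O.
diluted_volume = 20 + 40 + 130                   # 190 µl
pb1449_conc = pb1449_total / diluted_volume      # about 2.63 µM

# Annealing: 9.5 µl of the diluted mix plus 0.5 µl of a 50 µM partner oligo.
pb1449_in_annealing = pmol(9.5, pb1449_conc)
partner_in_annealing = pmol(0.5, 50)

print(round(pb1449_in_annealing, 3), round(partner_in_annealing, 3))
```

- Both strands come out at about 25 pmol in the 10 µl annealing reaction, i.e. a 1:1 molar ratio.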
- the ligation between the adaptors and the phosphorylated products was performed as follows: 1 µl of annealed oligonucleotides, 2 µl of T4 ligase buffer, 0.5 µl of T4 ligase and 0.5 µl of adaptors in a 10 µl reaction.
- the ligation reaction was incubated for 1 hour at 25° C. and 10 min at 65° C. Detection was performed using qPCR as described below in the DTECT protocol.
- the assay performed to measure the efficiency of DNA ligation was conducted in a master mix reaction equivalent to 5 µl per time point as follows: 0.5 µl of Acul digested products, 1 µl of T4 ligase buffer and 0.5 µl of adaptors with or without 0.5 µl of T4 ligase.
- the reactions were incubated at 25° C. After 5 min, 5 µl were taken from the reaction and the T4 ligase was added for 10 min at 65° C. 1 hour after the start of the ligation reaction, 5 µl were additionally taken from the reaction and heat inactivated. The rest of the reaction was incubated overnight for 16 hours and heat inactivated. The amount of products captured was determined by qPCR as described below.
- the DTECT protocol consists of 6 steps (I-VI, FIG. 1 A ).
- the genomic DNA (gDNA) is prepared using the Quick Extract Solution (Epicentre) by incubating the cells at 65° C. for 10 min and 95° C. for 5 min.
- the genomic DNA is quantified by Nanodrop, diluted to 200 ng/µl in H2O and stored at -20° C. or immediately used in PCR reactions.
- PCRs were performed in a 25 µl or 50 µl solution containing: 1 µM forward and reverse primers, 0.1 mM dNTP (NEB #N0447L), 1X Q5 buffer (NEB), 10-200 ng of gDNA, 1 unit of Q5 polymerase (NEB) and water. PCR reactions were conducted as follows: 95° C. for 30 s; 40 cycles of 95° C. for 10 s, 58° C. for 10 s, 72° C. for 45 s; and final amplification at 72° C. for 1 min.
- the purified PCR products were digested by 0.25 µl Acul (NEB #R0641L) in a 20 µl reaction containing 1X CutSmart Buffer (NEB) supplemented with 40 µM S-adenosylmethionine (SAM) and 100 ng of purified PCR product.
- the reaction was incubated for 1 hour at 37° C. with heat inactivation at 65° C. for 20 min. IV) Isolation of the Acul-digested genomic amplicon by solid phase reversible immobilization (SPRI).
- the purified 60 bp-long DNA fragments were ligated to DNA adaptors generated as described above.
- the adaptors and the purified products were ligated in the following reaction: 6.5 µl of water, 2 µl of 5X ligase buffer (ThermoFisher Scientific), 0.5 µl of T4 ligase (ThermoFisher Scientific), 0.5 µl of adaptors and 0.5 µl of purified DNA product.
- the ligation reaction was performed for 1 hour at 25° C. in a thermocycler, followed by inactivation of the T4 ligase for 10 min at 65° C.
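- When many ligations are set up in parallel (for example, one per adaptor), the shared components of the 10 µl reaction described above can be pooled into a master mix. The sketch below scales the stated per-reaction volumes; the choice of which components to pool and the 10% pipetting overage are assumptions made here, not part of the protocol.

```python
# Per-reaction volumes (µl) as listed for the ligation step.
LIGATION_UL = {
    "water": 6.5,
    "5X ligase buffer": 2.0,
    "T4 ligase": 0.5,
    "adaptors": 0.5,
    "purified DNA product": 0.5,
}


def master_mix(n_reactions, shared=("water", "5X ligase buffer", "T4 ligase"),
               overage=0.10):
    """Volumes (µl) of the shared components for n reactions.

    overage is a hypothetical 10% excess to compensate for pipetting loss.
    """
    factor = n_reactions * (1 + overage)
    return {name: round(LIGATION_UL[name] * factor, 2) for name in shared}


print(sum(LIGATION_UL.values()))   # total volume of one reaction
print(master_mix(16))              # shared components for 16 ligations
```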
- PCR amplification was performed in a 12.5 or 25 µl reaction volume containing 0.5 µM forward and reverse primers, 0.05 mM dNTP (NEB #N0447L), 1X Q5 buffer (NEB), 0.5-1 µl of ligated product, 0.1-0.2 µl of Q5 polymerase (NEB) and water.
- PCR primers (PB1072 and PB1073) contained sequences complementary to the adaptor and handle (see above).
- the PCR program used was the following: 95° C. for 1 min, and a variable number of cycles (indicated in each figure legend) of 95° C. for 10 s, 65° C. for 5 s, 72° C. for 7 s. Detection of low abundant genomic variants (~1% frequency) was generally obtained with 23-25 PCR cycles, while detection of greater amounts of edited products was achieved with 17-22 PCR cycles. 5 µl of the PCR reactions were incubated with SYBR Gold (ThermoFisher Scientific #S-11494), loaded on a 2% agarose gel and run in 1X TAE buffer until the DNA was separated. Gels were developed using LI-COR Odyssey. qPCR was performed using QuantStudio 3 (Applied Biosystems).
- qPCR reactions were performed as follows: 5 µl of 2X SYBR Gold master mix (ThermoFisher Scientific #4367659), 0.1 µl of forward and reverse primers (PB1072 and PB1073, 100 µM) and 1 µl of ligated products (diluted 1:100 in H2O) in a 10 µl reaction.
- the PCR program used in the qPCR reaction was the following: 95° C. for 10 s and 40 cycles of 60° C. 30 s, 95° C. 15 s. Quantification of the frequency of genomic variants was conducted as described below (Quantification and Statistical Analysis section).
- Samples for NGS were prepared by amplifying the edited regions of interest by PCR. Samples were sequenced by the Genome Sciences Facility at The Pennsylvania State College of Medicine or by Genewiz and the results were analyzed by Genewiz, or by using an R-based script of the Ciccia laboratory or CRISPResso2 (Clement et al., 2019). To ensure that no biases were introduced during DTECT assays, the Acul-tagging amplicons for the BRCA1 and BRCA2 mutant samples were sequenced by NGS and analyzed using an R-based script. In this analysis, 7 sequences with >6000 reads were filtered out from the analysis due to incorrect sequence.
- the editing frequency from the NGS results was determined using the formula: ((number of reads for the edited dinucleotide) / (total number of reads)) × 100.
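- The NGS editing-frequency formula translates directly into code. The sketch below assumes the reads have already been tallied per dinucleotide at the targeted position; the dictionary layout and function name are illustrative and are not taken from the R-based script or CRISPResso2.

```python
def editing_frequency(dinucleotide_counts, edited_dinucleotide):
    """Percentage of reads carrying the edited dinucleotide.

    dinucleotide_counts -- hypothetical mapping of dinucleotide -> read count
    """
    total_reads = sum(dinucleotide_counts.values())
    if total_reads == 0:
        raise ValueError("no reads to analyze")
    edited_reads = dinucleotide_counts.get(edited_dinucleotide, 0)
    return 100 * edited_reads / total_reads


# Example with made-up counts: 50 edited reads out of 1000.
print(editing_frequency({"TT": 950, "CT": 50}, "CT"))
```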
- Oligonucleotides used for PCR amplifications, Illumina sequencing adaptors and indexes are listed in Table S1.
- ThermoFisher Scientific 1 8265-017
- Chemicals, Peptides, and Recombinant Proteins (reagent; source; identifier)
- Q5 High-Fidelity DNA polymerase; NEB; M0491L
- T4 DNA ligase; ThermoFisher Scientific; 15224017
- Acul; NEB; R0641L
- rSAP; NEB; M0371L
- SybrGold (for gel staining); ThermoFisher Scientific; S-11494
- SybrGold (for qPCR); ThermoFisher Scientific; 4367659
- BamHI-HF; NEB; R3136S
- dNTPs; NEB; N0447L
- T4 Polynucleotide Kinase; NEB; M0201S
- Critical Commercial Assays (reagent; source; identifier)
- Agencourt AMPure XP magnetic beads; Beckman Coulter; A63881
- Zymoclean gel DNA recovery kit; Zymo Research; D4008
- Restriction enzymes optimal for our method exhibit the following characteristics: a) they cleave far from their recognition motif, thus enabling the incorporation of non-complementary type IIS recognition motifs into PCR primers without disrupting genomic DNA amplification ( FIGS. 1 A and 8 A ); b) they bind a single recognition motif (Bath et al., 2002) ( FIG. 8 A ); and c) they possess highly specific endonuclease activity, therefore generating a limited number of cleavage byproducts due to slippage activity (Lundin et al., 2015) ( FIG. 8 B ).
- the genomic locus of interest is PCR-amplified using a locus-specific DNA primer (red) and a DNA oligonucleotide (Acul-tagging primer) containing two regions of complementarity to the genomic locus (purple) interrupted by an Acul recognition site (Acul hairpin, green) positioned 14 bp upstream of a dinucleotide of interest ( FIG. 1 A , steps I and II).
- the larger DNA fragment (>100 bp) resulting from Acul-mediated digestion is removed using solid phase reversible immobilization (SPRI) beads ( FIG. 1 A , step IV) and the smaller DNA fragment (60 bp) containing the targeted dinucleotide is ligated to an adaptor with a 3′-overhang complementary to the exposed signature ( FIG. 1 A , step V).
- the ligated DNA products are subsequently detected by analytical or quantitative PCR (qPCR) ( FIG. 1 A , step VI).
- This method, which we named DTECT (Dinucleotide signaTurE CapTure), can be completed within 4-5 hours ( FIG. 1 A ).
- a common set of DNA primers that anneal to constant regions in the Acul-digested fragments (blue) and the ligated adaptors (brown) is utilized in all DTECT experiments ( FIG. 1 A , step VI), avoiding locus-specific amplification bias and variability in qPCR efficiency among distinct sets of samples.
- a library of 16 distinct adaptors is sufficient to capture all dinucleotide signatures that can be generated by Acul ( FIG. 1 B ).
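- Because capture relies on base pairing between the exposed dinucleotide and the adaptor overhang, the matching adaptor for a given signature is its reverse complement (for example, the TT adaptor for an AA signature and the GT adaptor for AC, as in the adaptor pairings listed earlier). The lookup can be sketched as below; the function name is illustrative.

```python
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def capturing_overhang(signature):
    """3' dinucleotide of the adaptor expected to capture a signature."""
    signature = signature.upper()
    if len(signature) != 2 or set(signature) - set("ACGT"):
        raise ValueError("signature must be two DNA bases")
    return signature.translate(COMPLEMENT)[::-1]


# All 16 signatures and their matching adaptor overhangs.
PAIRS = {a + b: capturing_overhang(a + b) for a, b in product("ACGT", repeat=2)}

# Signatures captured by an adaptor carrying the same dinucleotide.
SELF_COMPLEMENTARY = sorted(s for s, o in PAIRS.items() if s == o)
print(SELF_COMPLEMENTARY)
```

- Four of the 16 signatures (AT, CG, GC and TA) are their own reverse complement, so the specific adaptor for each of them carries the same dinucleotide as the signature itself.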
- DTECT provides a highly controlled assessment of successful and specific capture of dinucleotide signatures
- To demonstrate the feasibility of DTECT, we designed two Acul-tagging DNA primers flanking four adjacent bases (5′-TTGG-3′) on opposite DNA strands (TT and CC signatures, blue) ( FIG. 2 A ). Upon PCR amplification using Acul-tagging primers and locus-specific DNA primers, the PCR amplicons were digested and ligated to adaptors with either complementary or non-specific 3′-overhangs (GG or AA). Detection of the ligated products by PCR, as described above, revealed that the GG and AA adaptors specifically captured the DNA fragments containing the CC and TT dinucleotides, respectively ( FIG. 2 B ).
- Sanger sequencing confirmed that the amplicons of the ligated DNA products had the expected genomic sequence (purple) adjacent to the Acul motif (green) and the GG or AA adaptors (brown) ( FIGS. 9 A- 9 B ). Importantly, robust amplification of captured DNA products was observed only upon 1) capture of the Acul-digested products with complementary adaptors ( FIG. 2 B ), 2) Acul-mediated cutting and generation of 5′-phosphorylated DNA fragments ( FIGS. 2 C- 2 D ), and 3) DNA ligation by the T4 DNA ligase ( FIG. 2 D ).
- each individual DNA base can be identified by designing 4 independent Acul-tagging primers (2 on each DNA strand), thus enabling the capture of 4 distinct signatures per genomic DNA base ( FIGS. 2 E- 2 F ).
- This DTECT feature allows flexible Acul-mediated cleavage of genomic DNA amplicons containing targeted DNA sequences.
- each of the 16 possible dinucleotide signatures generated by Acul at two independent target sites can be efficiently captured using DNA adaptors containing complementary DNA overhangs ( FIG. 9 C ). Together, these studies establish DTECT as a rapid and efficient method to identify DNA bases through the capture of Acul-induced dinucleotide signatures using a common and unique set of adaptors.
- the resulting DNA fragments were then captured using adaptors complementary to WT (green) and STOP (purple) dinucleotide signatures ( FIG. 10 A ).
- qPCR analysis of the captured DNA fragments accurately determined the relative abundance of the WT and STOP alleles at the three loci indicated above ( FIG.
- FIG. 2 G demonstrates that DTECT can estimate the frequency of dinucleotide signatures in a mixed population with high precision, including variants with low abundance (1%) ( FIG. 2 G ).
- Low abundance STOP variants in SPRTN and PIK3R1 were also detectable by analytical PCR ( FIGS. 2 H- 2 I and 10 C- 10 D ), confirming the high sensitivity and accuracy of DTECT.
- direct comparison of the 16 DTECT adaptors revealed comparable efficiency in the capture of oligonucleotides containing complementary dinucleotide signatures ( FIGS. 2 J- 2 K ).
- CRISPR-mediated HDR for generating various types of disease-related mutations using single-stranded oligodeoxynucleotides (ssODNs), including a cancer-associated frameshift mutation in TP53 (i.e., R209fs*6), a missense mutation in HBB (i.e., G6V) that causes sickle cell anemia, a small tandem duplication in BRCA2 (dupAGAAGAT) identified in breast cancer, and small insertions into JAK2 and EMX1 (Paulsen et al., 2017), two genes associated with myeloproliferative disorders and Kallmann syndrome, respectively.
- Three days after co-transfection of Cas9 with site-specific sgRNAs and ssODNs into HEK293T cells, we harvested the cellular genomic DNA and utilized DTECT to determine by analytical and quantitative PCR whether the desired changes were incorporated into the targeted chromosomal loci ( FIG. 3 A ).
- a restriction fragment length polymorphism (RFLP) assay that monitors restriction sites disrupted or created by the above mutations in the targeted genomic loci was conducted in parallel.
- DTECT readily captured the specific signature of the mutant variants ( FIGS. 3 B and 11 A- 11 C ), while the RFLP assay either failed to detect or weakly detected the same mutant variants ( FIGS. 11 F- 11 H ).
- DTECT was able to discern the HDR stimulatory effect induced by i53 ( FIGS. 3 B and 11 A- 11 B ), a genetically-encoded 53BP1 inhibitor that was previously shown to increase the frequency of HDR events (Canny et al., 2018), indicating that DTECT can be employed to compare the editing levels between distinct experimental conditions.
- DTECT also clearly determined which mutations failed to be incorporated by the HDR machinery (e.g., BRCA2 dupAGAAGAT), as confirmed by NGS analysis ( FIGS. 11 D- 11 E ).
- Example 7 DTECT is Capable of Identifying Multiple Genome Editing Events Occurring within A Single Locus or Distinct Loci
- Example 8 DTECT Expedites the Derivation of Marker-Free Cell Lines Carrying Clinically Relevant Mutations and Facilitates the Genotyping of Cellular and Animal Disease Models
- ALL acute lymphoblastic leukemia
- DTECT also identified the presence of the above NT5C2 variants in the patient-derived xenograft (PDX) models generated from these relapsed ALL patients ( FIGS. 6 A and 6 D ). These studies demonstrate that DTECT can identify oncogenic mutations of interest in PDX models and cancer patient samples.
- DTECT a sensitive method for the identification of genomic DNA signatures.
- DTECT readily identifies precision genome editing events induced by CRISPR-dependent HDR, base editing and prime editing, including low abundance and complex genomic changes.
- DTECT can be employed to identify pathogenic lesions of interest, such as oncogenic mutations, in cancer mouse models, PDXs, and cancer patient specimens.
- DTECT is a rapid (~4-5 hours) and easy-to-perform detection method that relies on standard molecular biology techniques (PCR, DNA digestion and ligation) and common laboratory reagents.
- DTECT assays utilize a unique and common set of adaptors that includes positive and negative controls to ensure specificity and accuracy.
- the ease, speed and cost efficiency by which DTECT identifies genetic variants in a wide variety of cellular and animal systems should facilitate the generation and study of biological models of human diseases and expedite the detection of pathogenic variants for both pre-clinical and clinical applications.
- DTECT has three potential limitations.
- Acul-induced dinucleotide byproducts can be generated if a genomic Acul restriction site located in close proximity to the targeted dinucleotide is incorporated into the amplicon of the targeted locus.
- genomic Acul sites occur relatively infrequently and 95% of clinically relevant variants (404,393 variants) are compatible with DTECT ( FIGS. 16 A- 16 B ).
- dinucleotide byproducts may also occur due to Acul slippage activity, resulting in the cleavage of DNA molecules 13 (-1) or 15 (+1), instead of 14, bases away from the Acul recognition site.
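- To see which byproducts slippage would produce at a given locus, the dinucleotides exposed by cutting 13, 14 or 15 bases after the recognition motif can be read directly from the tagged amplicon sequence. The sketch below assumes the sequence is written 5′ to 3′ on the strand carrying CTGAAG and that the first occurrence of the motif is the one introduced by the tagging primer; both are assumptions of this illustration, as is the function name.

```python
ACUI_MOTIF = "CTGAAG"


def exposed_dinucleotides(amplicon, offsets=(13, 14, 15)):
    """Dinucleotide found at each cut distance after the first CTGAAG motif.

    Offset 14 is the intended signature; 13 and 15 model -1/+1 slippage.
    """
    seq = amplicon.upper()
    motif_start = seq.find(ACUI_MOTIF)
    if motif_start < 0:
        raise ValueError("no CTGAAG motif found")
    motif_end = motif_start + len(ACUI_MOTIF)
    exposed = {}
    for offset in offsets:
        dinuc = seq[motif_end + offset:motif_end + offset + 2]
        if len(dinuc) == 2:          # skip cuts that run off the sequence
            exposed[offset] = dinuc
    return exposed


# Test substrate from the text (handle, motif, 14 bases) with NN set to TT,
# plus one extra base (G, chosen arbitrarily) so the +1 cut is defined.
fragment = "GCAATTCCTCACGAGACCCGTCCTGTGCGCAAACTATTCTGAAGAACTGGCGAACTAC" + "TT" + "G"
print(exposed_dinucleotides(fragment))
```

- Comparing the offset-14 signature with the offset-13 and offset-15 dinucleotides shows whether a slippage product could be mistaken for the variant of interest at that site.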
- indel mutations formed at DSB sites generated by Cas nucleases in CRISPR-mediated HDR experiments can result in defective PCR amplification of indel-containing loci that have not undergone HDR and therefore cause an overestimation of the frequency of HDR events by DTECT ( FIGS. 18 A and 18 B ).
- This limitation does not affect the detection of CRISPR-mediated base editing and prime editing events, and naturally occurring genetic variants, which are accompanied by either very low frequency (Anzalone et al., 2019; Gaudelli et al., 2017; Komor et al., 2017; Yeh et al., 2018) or complete absence of DSB-induced indel formation, respectively.
- DTECT has several advantages compared to other detection methods.
- a major benefit of DTECT is its versatility, which allows the detection and quantification of nucleotide substitutions, precise base insertions and deletions using the same small set of 16 predefined adaptors ( FIGS. 1 B and 7 ).
- Each editing event can be identified using 4 distinct signatures resulting from Acul-mediated digestion of genomic DNA amplicons, indicating that the design of DTECT studies is flexible ( FIGS. 2 E- 2 F and 15 B- 15 C ).
- DTECT DTECT-based detection methods, such as ICE and EditR, in which efficiency depends on the quality of the sequencing reads, which can vary greatly between sequencing platforms, samples and reactions ( FIG. 12 B ).
- DTECT displays greater sensitivity and flexibility compared to RFLP-based assays ( FIGS. 11 A- 11 J ) and exhibits similar precision to NGS ( FIG. 3 C ) at a lower cost and with a faster turnaround time (hours vs. days/weeks).
- DTECT directly identifies genetic variants independently of genomic markers, therefore enabling the analysis of scarless and marker-free cellular and animal models generated by precision genome editing. Given its ability to identify multiple independent genetic variants simultaneously ( FIGS. 4 A- 4 C ), DTECT could expedite the generation of complex genomic changes, especially for genetic interaction studies, synthetic biology applications and molecular recording (Fahim Farzadfard, 2018).
- BRCA1/2 heterozygous mutations have been recently shown to cause genome instability induced by DNA replication stress (Billing et al., 2018; Pathania et al., 2014; Tan et al., 2017).
- DTECT could help elucidate the underlying mechanisms by which genome instability causes breast and ovarian cancer development in BRCA1/2 mutation carriers.
- DTECT can also be used to detect pathogenic variants in pre-clinical and clinical settings.
- DTECT can rapidly identify the presence of oncogenic variants in cancer mouse models ( FIGS. 15 A- 15 D ), thus facilitating the study of cancer pathogenesis and the development of novel cancer therapies.
- DTECT can also identify oncogenic mutations in samples from cancer patients and PDX mouse models ( FIGS. 6 A- 6 D ). The speed by which DTECT accurately and unambiguously identifies pathogenic variants could accelerate cancer diagnosis and expedite the testing of cancer therapies in PDX models, thus leading to more effective cancer treatments.
- DTECT protocol may further simplify the detection of desired genomic signatures and increase the sensitivity of DTECT, thus expanding the number of possible DTECT applications and enabling early diagnosis of cancer and hereditary disorders through the detection of pathogenic variants in circulating cell-free tumor and fetal DNA (Zhang et al., 2019).
- ssODNs: sequence (5′ -> 3′); targeted gene
- TTCCTTAGTCTTTCTTTGAAGCAGCAAGTATGATGAGCAAGCTTTCTCACAAGCATTTGGTTTTAAATTATGGAGTATGTGTgtttaaacCTGTGGAGACGAGAGTAAGTAAAACTACAGGCTTTCTAATGCCTTTCTCAGAGCATCTGTTTTTGTTTATATAGAAAATTCAGTTTCAGGATCA (SEQ ID NO: 225); JAK2
- AAGAAGGGCTCCCATCACATCAACCGGTGGCGCATTGCCACGAAGCAGGCCAATGGGGAGGACATCGATGTCACCTCCAATGACTAgtttaaacGGGTGGGCAACCACAAACCCACGAGGGCAGAGTGCTGCTTGCTGCTGGCCAGGCCCCTGCGTGGGCCCAAGCTGGACTCTGGCCACTCCC (SEQ ID NO: 226); EMX1
- TACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTC
- Oligo corresponds to the variable strand of the adaptor. It contains a 3′ AG, expected to ligate to CT PB987 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG CTTCTTACCCGTGCCCCAGAA (SEQ ID NO: 233) Oligo corresponds to the variable strand of the adaptor.
- Oligo corresponds to the variable strand of the adaptor. It contains a 3′ TG, expected to ligate to CA PB989 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG CTTCTTACCCGTGCCCCAGTA (SEQ ID NO: 235) Oligo corresponds to the variable strand of the adaptor.
- Oligo corresponds to the variable strand of the adaptor. It contains a 3′ CG, expected to ligate to CG PB991 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG CTTCTTACCCGTGCCCCAGCA (SEQ ID NO: 237) Oligo corresponds to the variable strand of the adaptor.
- Oligo corresponds to the variable strand of the adaptor. It contains a 3′ AC, expected to ligate to GT PB1001 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG CTTCTTACCCGTGCCCCAGAT (SEQ ID NO: 241) Oligo corresponds to the variable strand of the adaptor.
- Oligo corresponds to the variable strand of the adaptor. It contains a 3′ CC, expected to ligate to GG PB1003 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG CTTCTTACCCGTGCCCCAGGC (SEQ ID NO: 243) Oligo corresponds to the variable strand of the adaptor.
- Oligo corresponds to the variable strand of the adaptor. It contains a 3′ GT, expected to ligate to AC PB1005 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATG CTTCTTACCCGTGCCCCAGTC (SEQ ID NO: 245) Oligo corresponds to the variable strand of the adaptor.
- Oligo corresponds to the variable strand of the adaptor. It contains a 3′ TT, expected to ligate to AA
- Oligos (sgRNAs cloning): oligo; sequence (5′ -> 3′); target/notes
- oligo plate; CAC CGT ACA TAA AGG ACA CTG TGA (SEQ ID NO: 247); BRCA1 C64Y for
- oligo plate; CAC CGC AAT TCA GTA CAA TTA GGT (SEQ ID NO: 248); BRCA1 E638K for
- oligo plate; CAC CGA TTT TCT CTA ATG TTA TTA (SEQ ID NO: 249); BRCA1 E1033K for
- oligo plate; CAC CGT TTT TCG AGT GAT TCT ATT (SEQ ID NO: 250); BRCA1 E575K for
- oligo plate; CAC CGT TTT AAC AAA TGA CTT GAT (SEQ ID NO: 251); BRCA1 V9901 for
- oligo plate; CAC CGA GAC AGT TAA TAT CAC TGC (SEQ ID NO: 252); BRCA1 T922I for
- oligo plate; CAC CGT TAT
Abstract
The present disclosure provides, inter alia, specially designed DNA adaptors and methods of preparing the same. Methods and kits for carrying out and detecting marker-free precision genome editing and genetic variation using such adaptors are also provided.
Description
- The present application claims the benefit of and is a continuation of U.S. Non-provisional Pat. Application No. 17/192,836, filed Mar. 4, 2021, which claims benefit of U.S. Provisional Pat. Application Serial No. 62/985,746, filed on Mar. 5, 2020, which applications are incorporated by reference herein in their entireties.
- The present disclosure provides, inter alia, specially designed DNA adaptors and various methods and kits for carrying out and detecting marker-free precision genome editing and genetic variation using such adaptors.
- This application contains references to amino acids and/or nucleic acid sequences that have been filed as sequence listing text file “1035795-000704-seq.txt”, file size of 63 KB, created on Apr. 8, 2021. The aforementioned sequence listing is hereby incorporated by reference in its entirety pursuant to 37 C.F.R. § 1.52(e)(5).
- This invention was made with government support under grant no. GM117064, awarded by the National Institutes of Health. The government has certain rights in the invention.
- Precision genome editing allows the modeling and correction of desired genomic variants containing insertions or deletions of specific nucleotide sequences or changes in single DNA bases (Anzalone et al., 2019; Barbieri et al., 2017; Cong et al., 2013; Dow, 2015; Guo et al., 2018; Liu et al., 2018; Mali et al., 2013; Roy et al., 2018). Precision genome editing can be obtained by CRISPR-dependent homology-directed repair (HDR) of Cas9-induced DNA double-strand breaks (DSBs) (Jasin and Haber, 2016) or result from the use of alternative DSB-free methods, such as CRISPR-dependent base editing, which utilizes cytidine or adenosine deaminases fused to a nickase Cas9 (nCas9) mutant to generate base transitions (Gaudelli et al., 2017; Komor et al., 2016), and prime editing, which employs a reverse transcriptase-nCas9 fusion and a template prime editing guide RNA (pegRNA) to install into the genome a large variety of genomic changes, including transversions, transitions, small insertions and deletions (Anzalone et al., 2019).
- Genome editing has been facilitated by the development of accessible and cost-effective methods for the detection of small insertions and deletions (indels) resulting from the repair of Cas9-induced DSBs, such as the T7E1 and Surveyor nuclease assays (Mashal et al., 1995; Qiu et al., 2004; Ran et al., 2013). However, since these methods do not determine the identity of DNA bases, they are ill-suited for the detection of genomic changes introduced by precision genome editing (Germini et al., 2018). Precision genome editing events can be detected by the addition of genomic markers by CRISPR-dependent HDR or prime editing, such as silent mutations that create or disrupt restriction sites, or selectable reporters encoding for antibiotic resistance or fluorescent proteins. However, the use of genomic markers entails an elaborate experimental design that is unique for each targeted site, thus complicating the insertion of the desired genetic modifications. In addition, genomic markers can cause unintended perturbations of coding or non-coding genomic elements. Moreover, marker-based detection methods are not compatible with CRISPR-dependent base editing strategies, which induce single DNA base changes (Rees and Liu, 2018). Alternatively, methods that employ Sanger sequencing or next-generation sequencing (NGS) enable the detection of precise genomic changes without the use of genomic markers (Brinkman et al., 2014; Pinello et al., 2016). However, Sanger sequencing-based approaches suffer from low sensitivity and precision due to variable quality of the sequencing reactions and background signals that often affect the sequencing reads (Brinkman et al., 2014; Brinkman et al., 2018). 
While NGS-based detection strategies are highly sensitive (Clement et al., 2019; Lindsay et al., 2016; Pinello et al., 2016), they remain expensive and time-consuming, which limits their value for the development of mutant cell lines and animal models and for applications that require a rapid turnaround time, such as the identification of pathogenic variants in certain clinical settings. Therefore, a simple, efficient, inexpensive and rapid method that enables quantitative detection of genetic variants in complex biological systems is needed. This disclosure is directed to meeting these and other needs.
- Genome editing technologies have transformed our ability to engineer desired genomic changes within living systems. However, detecting precise genomic modifications often requires sophisticated, expensive and time-consuming experimental approaches. The present disclosure provides DTECT (Dinucleotide signaTurE CapTure), a rapid and versatile detection method that relies on the capture of targeted dinucleotide signatures resulting from the digestion of genomic DNA amplicons by the type IIS restriction enzyme AcuI. DTECT enables the accurate quantification of marker-free precision genome editing events introduced by CRISPR-dependent homology-directed repair, base editing or prime editing in various biological systems, such as mammalian cell lines, organoids and tissues. Furthermore, DTECT allows the identification of oncogenic mutations in cancer mouse models, patient-derived xenografts and human cancer patient samples; it also allows the identification of genetic modifications incurred in various infectious diseases. Ultimately, DTECT enables the capture of signatures in nucleic acids from any organism including, e.g., viruses such as SARS-CoV-2. The ease, speed and cost efficiency by which DTECT identifies genomic signatures should facilitate the generation of marker-free cellular and animal models of human disease and expedite the detection of human pathogenic variants.
- Accordingly, one embodiment of the present disclosure is a DNA adaptor comprising: (a) one strand with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); and (b) one complementary strand with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246).
- Another embodiment of the present disclosure is a method of preparing a DNA adaptor disclosed herein, comprising: (a) synthesizing one constant oligonucleotide with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); (b) synthesizing one complementary oligonucleotide with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246); (c) mixing the constant and complementary oligonucleotides; and (d) annealing the mixture to obtain the DNA adaptor.
- Another embodiment of the present disclosure is a library of DNA adaptors prepared by methods disclosed herein, the library comprising 16 DNA adaptors, wherein each DNA adaptor has a different “NN”.
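The 16-member library described above can be enumerated programmatically. The following is a minimal sketch, not part of the disclosed method: it takes the constant strand (SEQ ID NO: 230) and the shared 58-nt portion of the variable strand as given above, appends each possible “NN”, and checks that the constant strand pairs with the shared portion so that only the two variable bases remain as a 3′ overhang. The names `CONSTANT`, `VARIABLE_STEM`, `reverse_complement` and `build_adaptor_library` are illustrative assumptions.

```python
from itertools import product

# Sequences copied from the embodiment above (5' -> 3').
CONSTANT = "CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG"
# Variable strand without its two terminal "NN" bases (hypothetical name).
VARIABLE_STEM = "CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAG"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def build_adaptor_library():
    """Map each of the 16 dinucleotides to its 60-nt variable strand."""
    return {
        "".join(nn): VARIABLE_STEM + "".join(nn)
        for nn in product("ACGT", repeat=2)
    }


library = build_adaptor_library()

# The constant strand anneals to the shared 58 nt, leaving NN unpaired.
stem_is_paired = reverse_complement(VARIABLE_STEM) == CONSTANT
```

Under these assumptions, `library["AT"]` reproduces the variable strand that ends in `...CCCCAGAT`, and every entry is 60 nt long.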
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Type IIS restriction enzyme-tagging primer, comprising: (i) extracting genomic DNA from a biological sample of interest; (ii) synthesizing the Type IIS restriction enzyme-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Type IIS restriction enzyme-tagging primer and a reverse primer; and (iv) purifying a Type IIS restriction enzyme-tagged genomic amplicon; (b) digesting the Type IIS restriction enzyme-tagged genomic amplicon with the Type IIS restriction enzyme; (c) isolating the smaller DNA fragment containing a genomic signature of interest exposed in a 3′ single-stranded overhang; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment containing the 3′ overhang signature with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (e) amplifying the ligated product to detect the presence of the genetic modification.
- A further embodiment of the present disclosure is a kit for detecting a genetic modification of interest, comprising a specially designed Type IIS restriction enzyme-tagging primer disclosed herein, and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged genomic amplicon; (b) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI; (c) isolating the smaller DNA fragment containing a genomic signature of interest produced by AcuI-digestion; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (e) amplifying the ligated product to detect the presence of the genetic modification.
- An additional embodiment of the present disclosure is a kit for detecting a genetic modification, comprising a specially designed AcuI-tagging primer and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
- Another embodiment of the present disclosure is a method for quantifying a genomic variant in a biological system, comprising the steps of: (a) obtaining a sample from the biological system; (b) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged genomic amplicon; (c) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the AcuI-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (f) quantifying the genomic variant and determining its relative abundance.
- Still another embodiment of the present disclosure is a method for identifying and quantifying an oncogenic mutation of interest in a biological sample, comprising the steps of: (a) obtaining a biological sample; (b) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and (iv) purifying an AcuI-tagged genomic amplicon; (c) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the AcuI-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; (f) amplifying the ligated product to identify the presence of the oncogenic mutation of interest; and (g) quantifying the oncogenic mutation of interest, if present, and determining its frequency.
- A further embodiment of the present disclosure is a process for marker-free detection of a precision genome editing event comprising carrying out Dinucleotide signaTurE CapTure (DTECT) on a nucleic acid sequence of interest.
- Still another embodiment of the present disclosure is a method for detecting a virus variant of interest, comprising the steps of: (a) obtaining a nucleic acid of the virus variant of interest from a biological sample; and (b) if the nucleic acid is DNA, carrying out Dinucleotide signaTurE CapTure (DTECT) to detect the variant of interest; or (c) if the nucleic acid is RNA, converting it to DNA by reverse transcription PCR (RT-PCR) and then carrying out DTECT to detect the variant of interest.
-
FIGS. 1A-1C show the identification of targeted dinucleotide signatures using DTECT. -
FIG. 1A is a schematic representation of DTECT. The targeted genomic locus containing a hypothetical targeted dinucleotide (N = A, C, G or T; green) is PCR-amplified using a forward AcuI-tagging primer juxtaposed to the targeted dinucleotide and a locus-specific DNA primer (AcuI-tagging primer design and PCR, steps I and II). The AcuI-tagging primer (60 nt) consists of DNA sequences complementary to the genomic locus (purple) interrupted by a hairpin containing an AcuI recognition site (green), and a non-complementary DNA sequence (blue). The locus-specific reverse primer (red) is located at a distance >100 bp from the targeted dinucleotide. The obtained PCR product is subsequently cleaved by the AcuI restriction enzyme in a position adjacent to the targeted dinucleotide, resulting in the generation of two DNA fragments of 60 bp and >100 bp (AcuI digestion, step III). The 60 bp fragment containing the exposed signature of the targeted dinucleotide is then isolated using SPRI beads with higher affinity towards >100 bp DNA products (Small fragment isolation, step IV). The 60 bp fragment is then ligated to DNA adaptors containing 3′-overhangs of two bases complementary (specific) or not (non-specific) to the dinucleotide signature (Adaptor ligation, step V). The ligated product is then subjected to PCR amplification for analytical or quantitative detection (Detection PCR, step VI). The approximate time required for each step is indicated. -
FIG. 1B shows the schematics of the DTECT adaptor library. Control (green) and mutant (purple) dinucleotide signatures (left panel) are detected using a library of 16 unique adaptors (middle panel). The library contains adaptors with dinucleotides complementary to the control (green) or mutant (purple) signature, as well as non-specific adaptors (blue) (right panel). -
FIG. 1C shows the schematics of the positive and negative controls used in DTECT experiments to identify signatures of interest (e.g., mutant allele) in allele populations. In genomic DNA samples containing only the WT dinucleotide signature, the adaptor complementary to the WT dinucleotide signature (green) serves as a positive control, while the adaptor complementary to the mutant signature of interest (purple) and a non-specific adaptor (blue) are used as negative controls. In genomic DNA samples containing a mixture of the WT and the mutant dinucleotide signature, the adaptor complementary to the WT dinucleotide signature (green) is used as a positive control and a non-specific adaptor (blue) serves as a negative control. The adaptor complementary to the mutant dinucleotide signature (purple) is used to detect the presence of the variant of interest and quantify its frequency. -
FIGS. 2A-2K show the detection and quantification of dinucleotide signatures using DTECT. -
FIG. 2A shows the design of AcuI-tagging primers that allow the capture of two dinucleotide signatures (CC and TT; blue) on opposite DNA strands. -
FIG. 2B shows the PCR amplification (22 cycles) of the AcuI-digested DNA products containing the CC and TT signatures shown in FIG. 2A, which have been captured using GG or AA adaptors. -
FIG. 2C shows the PCR amplification (22 cycles) of DNA fragments captured as in FIG. 2B with or without dephosphorylation of the AcuI-digested products by the shrimp alkaline phosphatase (rSAP). -
FIG. 2D shows the PCR amplification (22 cycles) of DNA fragments captured as in FIG. 2B in the absence or presence of AcuI, DNA adaptors (GG adaptor for signature CC; AA adaptor for signature TT) or T4 DNA ligase. -
FIG. 2E shows the schematic representation of the AcuI-tagging primer design for detecting four possible dinucleotide signatures (#1-4) containing the same targeted base (C:G, red) in the PIK3R1 gene. -
FIG. 2F shows the detection of the four dinucleotide signatures shown in FIG. 2E by DTECT (18 PCR cycles) using specific (green) and non-specific (blue) adaptors. -
FIG. 2G shows the quantification by DTECT of the relative abundance of SMARCAL1, SPRTN and PIK3R1 WT (green) and STOP (purple) dinucleotide signatures in mixtures of WT and STOP alleles at predefined ratios. Graphs (left) represent the correlation between the frequency of WT and STOP variants determined by DTECT and the expected frequency of the same variants in the mixed populations for each of the above 3 genes. Error bars represent the s.d. of independent experiments (n = 2). Pearson correlation (r) was determined by comparing expected and DTECT-based frequency. Comparison of the mean frequency of STOP and WT signatures determined by DTECT and their expected frequency is shown in the right panel (n = 3 independent genes, SMARCAL1, SPRTN and PIK3R1). -
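The Pearson correlation (r) mentioned for FIG. 2G compares expected and DTECT-based frequencies. As a minimal sketch of that calculation, assuming paired lists of frequencies, the function below computes r from first principles. The function name `pearson_r` and the input values are hypothetical placeholders chosen only to exercise the code; they are not data from the disclosure.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(var_x * var_y)


# Hypothetical placeholder values (percent variant frequency), NOT measured data.
expected = [0.0, 25.0, 50.0, 75.0, 100.0]
measured = [2.0, 27.0, 49.0, 73.0, 98.0]
r = pearson_r(expected, measured)
```

A value of r close to 1 would indicate that the measured frequencies track the expected ones linearly; it does not by itself show that the two agree in absolute terms.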
FIG. 2H shows the representation of the AcuI-tagging primers used to detect the WT and STOP alleles of the PIK3R1 gene. The targeted dinucleotides are shown in blue, the edited base is indicated with an asterisk and part of the AcuI-tagging primer sequence is shown in purple. -
FIG. 2I shows the PCR amplification (25 cycles) of WT and STOP PIK3R1 alleles (arrow) captured using DTECT from WT:STOP allele mixtures (i.e., 100:0 and 99:1). An adaptor (CG) specific for the WT allele is used as a positive control and a non-specific adaptor (TT) is used as a negative control. An adaptor that captures the STOP PIK3R1 allele (CA) serves as an additional negative control in the reaction containing only the WT allele. Background non-specific PCR products are indicated with an asterisk. -
FIG. 2J shows the fold change variation in the frequency of capture of each of the 16 dinucleotide signatures relative to the mean dinucleotide capture frequency. Oligonucleotides containing distinct dinucleotide signatures are captured using specific adaptors. The fraction of captured material is then quantified by qPCR and normalized to the mean value obtained from the capture of all 16 dinucleotide signatures. Error bars indicate the s.d. of 4 independent experiments. Dots represent individual data points. -
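The normalization described for FIG. 2J, in which each captured fraction is divided by the mean across all signatures, can be sketched as follows. This is an illustrative sketch only: the function name `fold_change_vs_mean` and the input values are hypothetical, and the real analysis would use the qPCR-derived captured fractions for all 16 signatures.

```python
from statistics import mean


def fold_change_vs_mean(captured):
    """Divide each signature's captured fraction by the mean over all signatures."""
    if not captured:
        raise ValueError("no capture values supplied")
    average = mean(captured.values())
    if average == 0:
        raise ValueError("mean capture is zero; fold change is undefined")
    return {signature: value / average for signature, value in captured.items()}


# Hypothetical placeholder values for three signatures, NOT measured data.
example = fold_change_vs_mean({"AA": 2, "CC": 4, "GT": 6})
```

With this normalization a value of 1 means a signature is captured at the average efficiency, and the fold changes themselves average to 1.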
FIG. 2K shows the fold change variation in the frequency of capture of dinucleotide signatures with 1 A/T + 1 C/G, 2 A/T or 2 C/G bases relative to the mean dinucleotide capture frequency, determined as described in FIG. 2J. Error bars represent the s.d. of 8 mean values for dinucleotides with 1 A/T + 1 C/G and 4 mean values for dinucleotides with 2 A/T and 2 C/G, as determined in FIG. 2J. -
FIGS. 3A-3E show the detection and quantification of precision genome editing by CRISPR-mediated HDR, base editing and prime editing using DTECT. -
FIG. 3A shows the schematics of the protocol used to identify genomic changes introduced by CRISPR-dependent HDR, base editing or prime editing. In HDR experiments (blue), HEK293T cells were transfected with Cas9 and sgRNA targeting a gene of interest with or without donor DNA molecules. In base editing experiments (red), HEK293T cells were transfected with BE3 base editors with either control or base editing sgRNAs. Base editing experiments were also conducted in cells stably expressing FNLS-BE3. In prime editing experiments (grey), HEK293T cells were transfected with PE2 with or without pegRNA. Genomic DNA was then extracted from cell populations and subjected to DTECT using adaptors specific for WT (green) or edited (purple) variants. -
FIG. 3B shows the identification by DTECT of WT and HDR-edited (R209fs*6) TP53 alleles (top), WT and base-edited (Q223*) FANCD2 alleles (middle), and WT and prime-edited (CTT_ins) HEK3 alleles (bottom). Adaptors specific for the WT (CT, CA, CG; green) or edited (TT, TA; purple) signatures were utilized in DTECT experiments. Captured samples were subjected to analytical (left; 21 cycles) or quantitative PCR (right). In the HDR experiment, cells were transfected with Cas9, sgRNA and an ssODN specific for the TP53 locus with or without the HDR stimulatory factor i53. The ssODN was omitted in control reactions. In the base editing experiment, cells were transfected with BE3 and sgRNA to induce Q223* in FANCD2. In prime editing experiments, cells were transfected with PE2 and pegRNA to introduce a CTT insertion in the HEK3 locus. -
FIG. 3C provides the graphical representation of the correlation of DTECT- and NGS-based estimations of the frequency of genetic variants introduced by precision genome editing in human and mouse cells, and mouse intestinal organoids (n = 62). Data points in the dashed box (frequency <20%) of the left panel are shown enlarged on the right panel (n = 33). Error bars indicate the s.e.m. of 2-5 independent replicates. The source of the edited sample is indicated by distinct colors. -
FIG. 3D shows the schematic representation of the experiments conducted to measure the efficiency of precision genome editing in vivo using DTECT. Editing of the mouse liver was performed by hydrodynamic injection of the cytidine base editor (CBE) FNLS-BE3 and an sgRNA to introduce the Pik3ca E545K variant. DTECT (red) and NGS (green) were used to determine the efficiency of editing in the mouse liver sample. -
FIG. 3E shows the quantification by DTECT (red) and NGS (green) of the Pik3ca E545K variant introduced by CRISPR-mediated base editing in the mouse liver, as shown in FIG. 3D. Error bars indicate the s.d. of 2 independent experiments. Dots represent individual data points. -
FIGS. 4A-4C show the identification of multiple genome editing events in a single locus or distinct loci by DTECT. -
FIG. 4A shows the detection by PCR (21 cycles) of allelic mixtures induced by CRISPR-mediated base editing events occurring at a CC sequence (green) in the EMX1 gene. The sequences of the EMX1 alleles resulting from four possible C->T base transitions (CC, CT, TC, TT) induced by CRISPR-mediated base editing and the adaptors to capture them (GG, AG, GA, AA) are shown. In these experiments HEK293T cells constitutively expressing the cytidine base editor (CBE) FNLS-BE3 were transfected with sgRNA targeting the EMX1 locus. -
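The pairing between the EMX1 signatures (CC, CT, TC, TT) and the adaptors that capture them (GG, AG, GA, AA) is consistent with each adaptor overhang being the reverse complement of the signature it captures. A minimal sketch of that lookup, assuming signatures and adaptor overhangs are both written 5′ to 3′, is given below; the function name `adaptor_for_signature` is a hypothetical helper, not part of the disclosure.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def adaptor_for_signature(signature):
    """Return the adaptor dinucleotide (5'->3') expected to capture a signature."""
    signature = signature.upper()
    if len(signature) != 2 or set(signature) - set("ACGT"):
        raise ValueError("signature must be two bases from A, C, G, T")
    # Overhangs anneal antiparallel, so complement and then reverse.
    return signature.translate(_COMPLEMENT)[::-1]


emx1_adaptors = [adaptor_for_signature(s) for s in ("CC", "CT", "TC", "TT")]
```

Adaptors in the 16-member library other than the one returned here would serve as non-specific controls for that signature.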
FIG. 4B shows the schematics of the experiments conducted to detect multiple simultaneously induced variants using DTECT. HEK293T cells constitutively expressing the base editor FNLS-BE3 were transfected with two sgRNAs to introduce simultaneously the BRCA1 E638K and the BRCA2 E2772K mutations by CRISPR-mediated base editing. -
FIG. 4C shows the detection of multiple precision genome editing events introduced by CRISPR-mediated base editing in HEK293T cell populations, as illustrated in FIG. 4B. WT and edited BRCA1 and BRCA2 alleles captured using adaptors specific for the WT (TG, AG; green) or edited (TA, AA; purple) alleles were subjected to analytical (left; 21 cycles) or quantitative PCR (right). -
FIGS. 5A-5J show the DTECT-mediated identification of clinically relevant BRCA1/2 mutations generated by precision genome editing and genotyping of cell lines and animal models carrying BRCA1 or BARD1 mutations. -
FIG. 5A shows the schematic representation of the human BRCA1 protein. BRCA1 domains and ClinVar BRCA1 mutations generated in this study are indicated. -
FIG. 5B shows the quantification using DTECT (red) and NGS (green) of the editing efficiency by which 10 BRCA1 mutations are introduced into HEK293T cells by CRISPR-mediated base editing. Experiments were conducted in cells expressing the base editor FNLS-BE3 upon transfection of sgRNAs to introduce the indicated mutations. Histograms show the mean frequency of the indicated variants estimated by DTECT and error bars represent the s.d. from 2 independent DTECT assays for the same AcuI-tagged amplicon. n.d.: not determined, due to sequencing failure. -
FIG. 5C shows the analytical detection of the indicated BRCA1 mutations in HEK293T cell populations by DTECT (21 PCR cycles) using adaptors specific for WT (green) or mutant (purple) alleles. -
FIG. 5D shows the schematic representation of the human BRCA2 protein. BRCA2 domains and ClinVar BRCA2 mutations generated in this study are indicated. -
FIG. 5E shows the quantification using DTECT (red) and NGS (green) of the editing efficiency by which 13 BRCA2 mutations are introduced into HEK293T cells by CRISPR-mediated base editing, as described in FIG. 5B. -
FIG. 5F shows the analytical detection of the indicated BRCA2 mutations in HEK293T cell populations by DTECT (21 PCR cycles) using adaptors specific for WT (green) or mutant (purple) alleles. Experiments were conducted as in FIG. 5C. -
FIG. 5G shows the genotyping by DTECT-based analytical PCR (18 cycles) of single clones carrying WT and/or BRCA1 E638K mutant alleles derived from the BRCA1 E638K mutant cell population shown in FIG. 5C. WT (#4, not edited), heterozygous (#1) and homozygous (#2) BRCA1 mutant clones identified by DTECT are indicated. -
FIG. 5H shows the Sanger sequencing of WT, heterozygous and homozygous mutant amplicons shown in FIG. 5G. The targeted dinucleotide is indicated in green and part of the sequence of the AcuI-tagging primer is indicated in purple. -
FIG. 5I shows the genotyping by DTECT-based analytical PCR of Bard1 S563F (left) and Brca1 S1598F (right) knock-in mutant mice (Bard1, 18 PCR cycles; Brca1, 20 PCR cycles). gDNA for DTECT analysis was obtained from mouse tail samples. WT (Bard1 #8 and Brca1 #5), heterozygous (Bard1 #2 and Brca1 #2) and homozygous (Bard1 #3) mutant mice identified by DTECT are indicated. No homozygous Brca1 S1598F mutant mice were identified in the analyzed mouse litters due to sub-Mendelian birth ratios (Billing et al., 2018). -
FIG. 5J shows the Sanger sequencing of WT, heterozygous and homozygous mutant amplicons shown in FIG. 5I. -
FIGS. 6A-6D show the detection of oncogenic signatures in human clinical samples using DTECT. -
FIG. 6A shows the schematic representation of the experiments conducted on ALL patient-derived samples. Bone marrow samples from ALL patients were collected at diagnosis and after chemotherapy. PDXs were generated from the patient samples. The genomic DNA was recovered from the patient samples and PDX mouse models and subjected to analytical and quantitative detection of NT5C2 oncogenic mutations using DTECT. -
FIG. 6B provides the heat map showing the detection of NT5C2 oncogenic mutations in patient samples and a control sample using DTECT. Bone marrow samples from 5 patients were collected; genomic DNA was prepared and tested for the presence of 3 frequent NT5C2 mutations responsible for relapse to chemotherapy. A non-patient-derived gDNA sample was utilized as a control to estimate the levels of non-specific background in the DTECT assay. Data are shown as fold change in the frequency of mutant signatures in the patient samples relative to the control sample. -
FIG. 6C shows the graphical representation of the frequency of NT5C2 mutations determined by DTECT (red) and NGS (green) in the 5 human patient samples analyzed in FIG. 6B. Error bars indicate the s.d. of 2 independent DTECT replicates. -
FIG. 6D shows the analytical and quantitative detection of the NT5C2 R367Q mutation in PDX models generated from ALL tumors of patients #2, #4 and #5 at diagnosis and after chemotherapy relapse. WT and mutant variants were captured using adaptors specific for the WT (GA, green) or mutant (AA, purple) allele and subjected to analytical (left; 18 PCR cycles) and quantitative PCR (right). -
FIG. 7 shows the DTECT applications for the detection of precision genome editing and genetic variation. It shows the schematic representation of examples of targeted dinucleotide signatures generated by single base edits, small insertions and deletions that can be detected using DTECT. Examples of adaptors that can be used to detect the indicated genome editing events are shown on the right. -
FIGS. 8A-8D show the features of type IIS restriction enzymes compatible with DTECT and schematic representation of the AcuI digestion pattern. -
FIG. 8A shows the representation of two key features of type IIS restriction enzymes compatible with DTECT: 1) Binding of a single recognition motif (green); 2) Cleavage of a targeted DNA sequence (blue) far from the recognition motif. -
FIG. 8B shows the representation of the pattern of digestion of a type IIS enzyme, including the main digestion product and a cleavage byproduct due to slippage activity. -
FIG. 8C shows the graphical representation of the number of type IIS enzymes as a function of the distance between their recognition motif and cleavage site. -
FIG. 8D shows the pattern of cleavage of the type IIS enzyme AcuI. AcuI cleaves DNA products 14/16 bp away from its recognition site (green), leaving a 3′-overhang of 2 DNA bases (blue). -
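The cleavage geometry described for FIG. 8D can be illustrated with a small simulation. This is a sketch under stated assumptions rather than the disclosed procedure: the recognition sequence `CTGAAG` is not given in this passage and is assumed here from the commonly cited AcuI specification CTGAAG(16/14); only a site in forward orientation on the supplied top strand is handled; and the function name `acui_overhang` is hypothetical.

```python
# Assumption: AcuI recognition sequence, not stated in the text above.
ACUI_SITE = "CTGAAG"
TOP_CUT = 16     # bases downstream of the site on the top strand
BOTTOM_CUT = 14  # bases downstream of the site on the bottom strand


def acui_overhang(top_strand):
    """Simulate cutting at the first forward-oriented site on the top strand.

    Returns (top strand of the site-containing fragment, 2-nt 3' overhang),
    or None if there is no site or the sequence is too short to be cut.
    """
    top = top_strand.upper()
    start = top.find(ACUI_SITE)
    if start == -1:
        return None
    site_end = start + len(ACUI_SITE)
    top_cut = site_end + TOP_CUT
    bottom_cut = site_end + BOTTOM_CUT
    if top_cut > len(top):
        return None
    # Bases between the two cut positions stay single-stranded on the top strand.
    return top[:top_cut], top[bottom_cut:top_cut]


# Hypothetical test sequence: site, 14 spacer bases, then the dinucleotide "CT".
demo = acui_overhang("GATTACA" + ACUI_SITE + "A" * 14 + "CT" + "GGGGGGGG")
```

In this toy sequence the two bases at positions 15 and 16 downstream of the site form the exposed overhang, which corresponds to the dinucleotide signature that the adaptors capture.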
FIGS. 9A-9C show the Sanger sequencing reads of captured AcuI-digested DNA fragments and validation of the adaptor library. -
FIGS. 9A and 9B show the Sanger sequencing reads of PCR amplicons of AcuI-digested DNA products containing the TT (FIG. 9A) and CC (FIG. 9B) signatures shown in FIG. 2B, which have been captured using AA or GG adaptors. The DNA sequences of PCR primers (red), genomic locus (purple), targeted dinucleotides (blue), AcuI motif (green) and adaptors (brown) are shown. -
FIG. 9C shows the PCR amplification (18 cycles) of captured AcuI-digested DNA products by DTECT using specific (green) and non-specific (blue) DNA adaptors. Each of the 16 adaptors was tested for its ability to capture two independent dinucleotide signatures (#1 and #2). -
FIGS. 10A-10F show the identification of WT and STOP alleles in mixed solutions and quantification of non-specific dinucleotide capture and ligation efficiency in DTECT assays. -
FIG. 10A shows the schematics of the protocol used to identify and quantify WT and STOP alleles in mixed solutions, as shown in FIGS. 2G-2I. Cells were transfected with the cytidine base editor (CBE) BE3 and an sgRNA to induce a STOP codon (sgSTOP) using iSTOP. WT and STOP alleles were then cloned and mixed at different WT:STOP ratios, as indicated in FIG. 2G. DTECT was then used to capture WT and STOP signatures using adaptors specific for the WT (green) or STOP (purple) allele, as well as non-specific adaptors (blue). Captured material was then subjected to analytical or quantitative PCR. -
FIG. 10B shows the Sanger sequencing reads of WT and STOP alleles of SPRTN, SMARCAL1 and PIK3R1. The targeted dinucleotide signature is shown in green and the edited cytidine base (C->T) is indicated by the blue arrow. -
FIG. 10C shows the representation of the AcuI-tagging primers used to detect the WT and STOP alleles of the SPRTN gene. The targeted dinucleotides are shown in blue, the edited base is indicated with an asterisk, the PAM sequence is shown in red and part of the AcuI-tagging primer sequence is shown in purple. -
FIG. 10D shows the PCR amplification (25 cycles) of WT and STOP SPRTN alleles (arrow) captured using DTECT from WT:STOP allele mixtures (i.e., 100:0 and 99:1). An adaptor (AG) specific for the STOP SPRTN allele is utilized in the capture reaction, along with an adaptor specific for the WT allele (GG; positive control) and a non-specific adaptor (TT; negative control). Background non-specific PCR products are indicated with an asterisk. -
FIG. 10E shows the frequency of non-specific dinucleotide capture for each of the 16 adaptors used for DTECT. Adaptors containing the indicated dinucleotide sequences were utilized to capture Acul-digested DNA fragments with non-complementary dinucleotides and the frequency of non-specific dinucleotide capture was quantified by qPCR. Mean frequency of non-specific dinucleotide capture is shown for 2-6 independent DNA ligation reactions using DNA fragments with distinct non-complementary dinucleotides. Adaptors complementary to +1 and -1 Acul-dependent slippage events were excluded from the analysis. -
FIG. 10F shows the time course experiment to measure the efficiency of the ligation of Acul-digested products to DNA adaptors. Acul-digested products from 3 independent targets (SMARCAL1, SPRTN and PIK3R1), DNA adaptors and T4 ligase were incubated for 5 min, 1 hour or 16 hours, and the captured material was quantified by qPCR. A sample without T4 ligase was used as a negative control. The percentage of captured material at the different time points was obtained by normalization to the amount of captured material upon a 16-hour ligation reaction. Error-bars represent the s.d. of 2 independent experiments. -
FIGS. 11A-11J show the detection of CRISPR-mediated HDR and base editing events by DTECT, NGS and RFLP assays. -
FIGS. 11A-11D show the detection by analytical PCR (20 or 21 cycles) of WT and HDR-edited EMX1 (FIG. 11A), JAK2 (FIG. 11B), HBB (FIG. 11C) and BRCA2 (FIG. 11D) alleles captured using adaptors specific for the WT (green) or edited (purple) alleles. In these experiments HEK293T cells were transfected with Cas9, sgRNA and an HDR donor (ssODN) with or without the HDR stimulatory factor i53. The ssODN was omitted in control reactions. ssODNs introduce a Pmel site in EMX1 and JAK2, a sickle cell anemia mutation in HBB (i.e., G6V), and a breast cancer-associated small tandem duplication in BRCA2 (dupAGAAGAT). -
FIG. 11E shows the quantification of the efficiency of the insertion of the short tandem duplication dupAGAAGAT in the BRCA2 locus, as determined by NGS. The pie chart shows the distribution of NGS reads corresponding to HDR- and/or NHEJ-mediated repair events (HDR, red; NHEJ, blue; mixed HDR/NHEJ, green; unedited, brown) occurring at the BRCA2 locus in HEK293T cells transfected with Cas9/sgRNA and ssODN donor, with or without i53. In these experiments, the BRCA2 locus was amplified by PCR and subjected to NGS. The NGS reads were analyzed by CRISPResso. -
FIG. 11F shows the RFLP assay to monitor the gain of a Pmel restriction site introduced by ssODN-mediated HDR in the EMX1 and JAK2 loci under the same experimental conditions shown in FIG. 11A and FIG. 11B. Digested (edited) and undigested (WT) DNA products are indicated by arrows. -
FIGS. 11G-11H show the RFLP assays to monitor the loss of Ncol (FIG. 11G) or Taqal (FIG. 11H) restriction sites in the HBB and TP53 loci, respectively, resulting from the insertion of the G6V and R209fs*6 mutations under the same experimental conditions shown in FIG. 11C and FIG. 3B. Digested (WT) and undigested (edited) DNA products are indicated by arrows. -
FIG. 11I shows the detection of WT and nonsense mutant TIMELESS, SLX4 and FANCM alleles by DTECT using adaptors specific for the WT (green) or edited (purple) signatures. Experiments were performed in cells transfected with the cytidine base editor BE3 and sgRNA to induce the indicated nonsense mutations, which were detected by analytical (left; 21 cycles) or quantitative PCR (right). -
FIG. 11J shows the detection of WT and nonsense mutant TCOF1 alleles by DTECT (21 PCR cycles) using adaptors specific for the WT (GG, green) or edited (AG, purple) allele. Experiments were performed in cells transfected with BE3 and sgRNA to induce the indicated nonsense mutation in the TCOF1 gene. The introduction of the nonsense mutation was confirmed by Sanger sequencing (bottom) and by an RFLP assay that monitors the loss of an Xcml restriction site at the edited locus (right). -
FIGS. 12A-12B show the comparative analysis of DTECT-, Sanger- and NGS-based estimations of the frequency of genetic variants generated by precision genome editing. -
FIG. 12A shows the graphical representation of the frequency of mutations introduced by CRISPR-dependent HDR and base editing in human and mouse cells, and intestinal organoids. The FANCF, Pik3ca and Apc loci were edited in biological duplicate or triplicate using multiple base editors, and the resulting edited samples were previously described (Zafra et al., 2018). The BRCA1/2 loci were edited using BE3. The frequency values were determined by both DTECT (red) and NGS (green). NGS was conducted on standard PCR amplicons (FANCF, Pik3ca and Apc) or Acul-tagged amplicons (BRCA1/2) of the edited loci. Error bars represent the s.e.m. of 2-5 independent DTECT assays per edited sample. The same frequency values are plotted in the graphs shown in FIG. 3C. -
FIG. 12B shows the graphical representation of the correlation between technical duplicates obtained by DTECT (red), EditR (green) or ICE (blue). Each dot represents a distinct BRCA1/2 variant introduced in cells by precision genome editing. Technical duplicates of DTECT assays correspond to two independent ligation reactions for the same Acul-digested amplicon and Sanger-based technical duplicates correspond to two independent sequencing reactions for the same PCR amplicon. -
FIGS. 13A-13C show the detection of base editing byproducts and clinically relevant BRCA1/2 mutations introduced by precision genome editing. -
FIG. 13A shows the detection by analytical PCR (21 cycles) of allelic mixtures induced by CRISPR-mediated base editing events occurring at a CC sequence in the EMX1 gene, as shown in FIG. 4A. In these experiments HEK293T cells constitutively expressing the base editor FNLS-BE3 were transfected with a control sgRNA (top) or an sgRNA targeting the EMX1 locus (bottom). All possible 16 adaptors were used to capture EMX1 variants. Adaptors that capture the WT allele (GG) and +1 Acul slippage event (CG) are shown in green and orange. Adaptors that capture C->T base editing events (AA, AG, GA) and C->A and C->G base editing byproducts (AC, AT, CA, CG, GC) are also shown. -
FIGS. 13B-13C show the analytical detection of the indicated BRCA1 (FIG. 13B) and BRCA2 (FIG. 13C) mutations in HEK293T cell populations by DTECT (21 PCR cycles) using adaptors specific for WT (green) or mutant (purple) alleles. Experiments were conducted as in FIGS. 5C and 5F. -
FIGS. 14A-14B show the genotyping of mutant cellular clones and knock-in mice using DTECT. -
FIG. 14A shows the genotyping by DTECT-based analytical PCR (20 cycles) of HEK293T clones (17) carrying WT and/or BRCA1 E638K mutant alleles or base editing byproducts derived by single cell dilution from the BRCA1 E638K cell population shown in FIG. 5C. Heterozygous and homozygous mutant clones are indicated in blue and purple, respectively. WT clones are indicated in green and a clone with a base editing byproduct is indicated in orange. Clones #1, #2, #4 and control (CTL) are also shown in FIG. 5G. Quantification of each BRCA1 variant by qPCR is also shown (bottom). HEK293T cells have 4 BRCA1 alleles. Error bars correspond to two independent experiments. -
FIG. 14B shows the genotyping by DTECT-based analytical PCR of Bard1 S563F (top) and Brca1 S1598F (bottom) knock-in mutant mice (Bard1, 18 PCR cycles; Brca1, 20 PCR cycles). DTECT assays were conducted on gDNA isolated from mouse tail samples. Heterozygous and homozygous mutant mice are indicated in blue and purple, respectively, and WT mice are indicated in green. No homozygous Brca1 S1598F mutant mice were identified in the analyzed mouse litters due to sub-Mendelian birth ratios (Billing et al., 2018). Mice #1, #2, #3 and #8 (Bard1), and #1, #2, #5 (Brca1) are also shown in FIG. 5I. -
FIGS. 15A-15D show the detection of oncogenic mutations in a mouse model of myeloproliferative neoplasm and in ALL patients using DTECT. -
FIG. 15A shows the schematics of the experiments conducted to detect the Jak2 V617F mutation in a mouse model of myeloproliferative neoplasm. Peripheral blood was collected from mice transplanted with a mixture of bone marrow cells either wild-type (WT) or carrying an inducible Jak2 V617F mutant allele (Mx1-Cre+;Jak2V617F/+). DTECT was then utilized to determine the presence of the Jak2 V617F mutation in gDNA extracted from the collected blood samples. -
FIG. 15B shows the schematic representation of 4 Acul-induced dinucleotide signatures that enable the identification of Jak2 WT and V617F alleles. The G in red is replaced by a T in the Jak2 V617F mutant allele. -
FIG. 15C shows the identification by DTECT-based analytical PCR (20 cycles) of the Jak2 V617F mutation in the blood of a mouse model of myeloproliferative neoplasm generated as described in FIG. 15A. The Jak2 V617F mutation was identified using the 4 independent dinucleotide signatures shown in FIG. 15B. gDNA samples from peripheral blood of WT mice were used as controls (#1 and #2) in this experiment. Sanger sequencing (bottom) was conducted to confirm the results obtained using DTECT. -
FIG. 15D shows the analytical detection of the indicated NT5C2 mutations in ALL patient samples by PCR (20 cycles). The frequency of the indicated mutations in the same patient samples is shown in FIG. 6B. -
FIGS. 16A-16C show the analysis of ClinVar variants with proximal genomic Acul motifs compatible with DTECT. -
FIG. 16A shows the bioinformatic analysis of ClinVar database variants (425,580) with (80,326; blue) or without (345,254; green) genomic Acul sites in close proximity (+/- 100 bp). Variants (green, right pie chart) with a single Acul motif located 35 bp to 100 bp away on the 3′- (29,848) or 5′- (29,291) side can be detected using DTECT, as illustrated in FIG. 16C. Variants (red, right pie chart) with an Acul motif located <35 bp away (18,739) or with proximal Acul motifs on both sides (2,448) cannot be detected using DTECT. -
FIG. 16B shows the percentage and number of ClinVar variants that can (95.02%, 404,393) or cannot (4.98%, 21,187) be detected using DTECT. -
FIG. 16C shows the schematic representation of genomic loci with or without an Acul site in close proximity to the edited site. When a genomic Acul site is located 35 bp to 100 bp away from the edited site, detection of the edited site can be obtained by designing 2 Acul-tagging primers that anneal to the targeted locus between the genomic Acul site and the edited base(s). This approach allows the capture of two independent dinucleotide signatures for each targeted site with one proximal Acul site. Four independent dinucleotide signatures can be captured for targeted sites with no proximal Acul sites. -
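The variant counts quoted for FIGS. 16A-16B can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the constant and function names are hypothetical, and the grouping of categories into "detectable" and "undetectable" simply follows the figure descriptions above.

```python
# Counts quoted in the descriptions of FIGS. 16A-16B (names are hypothetical).
TOTAL_VARIANTS = 425_580
WITH_PROXIMAL_SITE = 80_326      # Acul site within +/- 100 bp
WITHOUT_PROXIMAL_SITE = 345_254  # no Acul site within +/- 100 bp
SINGLE_SITE_3P = 29_848          # single site, 35-100 bp away, 3' side
SINGLE_SITE_5P = 29_291          # single site, 35-100 bp away, 5' side
SITE_TOO_CLOSE = 18_739          # site located <35 bp away
SITES_BOTH_SIDES = 2_448         # proximal sites on both sides


def clinvar_summary():
    """Recompute the detectable/undetectable totals and percentages."""
    detectable = WITHOUT_PROXIMAL_SITE + SINGLE_SITE_3P + SINGLE_SITE_5P
    undetectable = SITE_TOO_CLOSE + SITES_BOTH_SIDES
    return {
        "detectable": detectable,
        "undetectable": undetectable,
        "detectable_pct": round(100 * detectable / TOTAL_VARIANTS, 2),
        "undetectable_pct": round(100 * undetectable / TOTAL_VARIANTS, 2),
    }


if __name__ == "__main__":
    print(clinvar_summary())
```

The four sub-categories add up to the 80,326 variants with a proximal Acul site, and the two totals add up to the 425,580 variants analyzed.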
FIGS. 17A-17B show the detection of Acul slippage events by DTECT. -
FIG. 17A shows the schematics of targeted dinucleotides (blue) and +1 (red) and -1 (orange) Acul slippage events (left). Detection of Acul slippage byproducts by DTECT (22 PCR cycles) using adaptors complementary to the targeted dinucleotide signatures (green) and to signatures generated by Acul +1 (red) or -1 (orange) slippage (right). A non-specific adaptor (blue) is used as a control. -
FIG. 17B shows the schematic representation of DNA digestion products generated by precise Acul cleavage (green) or +1 slippage (red) occurring at wild-type and mutant alleles. The dinucleotide signatures generated as a result of Acul slippage byproducts and the complementary adaptors to capture them are indicated. -
FIGS. 18A-18D show the design of DTECT assays to avoid indel interference in CRISPR-mediated HDR experiments. -
FIG. 18A shows the InDelphi prediction (https://indelphi.giffordlab.mit.edu) of indel-containing alleles in the TP53 locus. The dinucleotides targeted to simultaneously introduce the TP53 R209fs*6 mutation and a G > T mutation in the PAM by CRISPR-dependent HDR are indicated in green and red, respectively. The Cas9 cleavage site is indicated in black. The dinucleotide signatures captured to detect the TP53 R209fs*6 and PAM mutations are shown in purple. The presence of indel interference in the distinct predicted alleles is indicated. MH, microhomology. -
FIG. 18B shows the DTECT-based quantification of the TP53 R209fs*6 and PAM mutations introduced by HDR using a single ssODN donor template, as shown in FIG. 18A. Adaptors specific for the WT (CT and TG; green and red) or edited (TT; purple) signatures were used for quantification. HDR efficiency determined by NGS is also shown. -
FIG. 18C shows the schematic representation of the design of DTECT experiments to avoid interference of indels formed at DSBs during CRISPR-mediated HDR. Cas9-mediated DSBs are induced at a distance from a targeted dinucleotide (green) sufficient to avoid mutation of the targeted dinucleotide by indels (blue). The pattern of indel mutations is predicted using the InDelphi website. -
FIG. 18D shows the schematics of alleles generated by CRISPR-mediated HDR, including the unedited allele (green), indel-containing alleles (blue) and the HDR-edited allele (purple). Using the experimental design shown in FIG. 18C, DTECT captures both the unedited and the indel-containing alleles using an adaptor specific for the WT dinucleotide signature, while the HDR-edited allele is captured using an adaptor specific for the edited dinucleotide signature. The capture of indel-containing alleles with a WT adaptor ensures the accurate quantification of the frequency of the HDR-edited allele in the allele population.
- The present disclosure provides a versatile method that uses standard molecular biology techniques to detect variants introduced by precision genome editing or resulting from genetic variation. This detection method, designated Dinucleotide signaTurE CapTure (DTECT), enables accurate and sensitive quantification of marker-free precision genome editing events induced by CRISPR-dependent HDR, base editing and prime editing. In addition, we show that DTECT can readily identify oncogenic mutations in cancer mouse models, patient-derived xenograft models and cancer patient samples. These studies establish a cost-effective method for the rapid detection of genetic variants, which will aid the generation of marker-free cellular and animal models of human disease and expedite the detection of pathogenic variants for clinical applications.
- Accordingly, one embodiment of the present disclosure is a DNA adaptor comprising: (a) one strand with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); and (b) one complementary strand with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246).
- In some embodiments, the DNA adaptor is labeled with a detection molecule. Non-limiting examples of the detection molecule include a radiolabel, a fluorescent label, a biotinylated label, a non-fluorescent label, an enzyme, a hapten, a phosphorescent molecule, a chemiluminescent molecule, a chromophore, a luminescent molecule, a photoaffinity molecule, a color particle or a ligand.
- Another embodiment of the present disclosure is a method of preparing a DNA adaptor disclosed herein, comprising: (a) synthesizing one constant oligonucleotide with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); (b) synthesizing one complementary oligonucleotide with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246); (c) mixing the constant and complementary oligonucleotides; and (d) annealing the mixture to obtain the DNA adaptor.
- Another embodiment of the present disclosure is a library of DNA adaptors prepared by methods disclosed herein, wherein the library comprises 16 DNA adaptors and each DNA adaptor has a different “NN”.
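As an illustration of how the 16-member adaptor library follows from the two oligonucleotide sequences, the sketch below enumerates every “NN” dinucleotide and appends it to the 3′ end of the complementary strand. The function and variable names are hypothetical, and no mapping between a particular “NN” and a particular SEQ ID NO (231-246) is assumed, since the text does not give one.

```python
from itertools import product

# Constant strand (SEQ ID NO: 230) and the invariant part of the
# complementary strand, both written 5'->3' as in the text above.
CONSTANT_STRAND = "CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG"
COMPLEMENT_CORE = "CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAG"


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def build_adaptor_library():
    """Map each NN dinucleotide to its (constant, complementary) strand pair."""
    library = {}
    for first, second in product("ATGC", repeat=2):
        nn = first + second
        # The NN overhang sits at the 3' end of the complementary strand.
        library[nn] = (CONSTANT_STRAND, COMPLEMENT_CORE + nn)
    return library


if __name__ == "__main__":
    for nn, (_, complementary) in build_adaptor_library().items():
        print(nn, complementary)
```

The invariant 58 nucleotides of the complementary strand are the exact reverse complement of the constant strand, so annealing leaves only the two “NN” bases single-stranded.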
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Type IIS restriction enzyme-tagging primer, comprising: (i) extracting genomic DNA from a biological sample of interest; (ii) synthesizing the Type IIS restriction enzyme-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Type IIS restriction enzyme-tagging primer and a reverse primer; and (iv) purifying a Type IIS restriction enzyme-tagged genomic amplicon; (b) digesting the Type IIS restriction enzyme-tagged genomic amplicon with the Type IIS restriction enzyme; (c) isolating the smaller DNA fragment containing a genomic signature of interest exposed in a 3′ single-stranded overhang; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment containing the 3′ overhang signature with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (e) amplifying the ligated product to detect the presence of the genetic modification.
- In some embodiments, the genetic modification is selected from a base change, a deletion, or an insertion. In some embodiments, the genetic modification is selected from a single genomic change or multiple genomic changes. In some embodiments, the multiple genomic changes can occur within a single locus or distinct loci.
- In some embodiments, the Type IIS restriction enzyme is selected from Acul, Bpml, BpuEI, BsgI, Mmel and NmeAIII. In some embodiments, the Type IIS restriction enzyme is selected from Acul and BpuEI. In some embodiments, the Type IIS restriction enzyme is Acul.
- In some embodiments, the Type IIS restriction enzyme-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; (b) a complementary sequence of the genomic locus of interest on the 5′ side; (c) a recognition motif of the Type IIS restriction enzyme that is positioned at a predicted distance from its cleavage site to generate the genomic signature of interest; and (d) a complementary sequence of the genomic locus of interest on the 3′ side.
- In some embodiments, the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
- In some embodiments, the non-complementary handle sequence can have any suitable length. In some embodiments, the non-complementary handle sequence is 25 bp. In some embodiments, the non-complementary handle sequence can have any suitable sequence. In some embodiments, the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3).
- In some embodiments, the ligation in step (d)(ii) of the methods disclosed above is carried out by T4 DNA ligase.
- A further embodiment of the present disclosure is a kit for detecting a genetic modification of interest, comprising a specially designed Type IIS restriction enzyme-tagging primer disclosed herein, and a library of DNA adaptors disclosed herein, packaged together with instructions for its use. In some embodiments, the Type IIS restriction enzyme is Acul.
- Another embodiment of the present disclosure is a method for detecting a genetic modification, comprising the steps of: (a) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (b) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (c) isolating the smaller DNA fragment containing a genomic signature of interest produced by Acul-digestion; (d) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (e) amplifying the ligated product to detect the presence of the genetic modification.
- In some embodiments, the Acul-tagging primer is an oligonucleotide comprising: (a) a non-complementary handle sequence positioned on the 5′ side; and (b) a complementary sequence of the genomic locus of interest containing an Acul motif (5′-CTGAAG-3′) positioned 14 bp upstream from the genomic locus of interest.
- In some embodiments, the Acul-tagging primer can have any suitable length. In some embodiments, the Acul-tagging primer is 60 bp.
- In some embodiments, the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
- In some embodiments, the non-complementary handle sequence can have any suitable length. In some embodiments, the non-complementary handle sequence is 25 bp.
- In some embodiments, the complementary sequence has the structure of: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C, depending on the DNA sequence of the genomic locus of interest.
- In some embodiments, the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3) and the complementary sequence is 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C.
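The primer layout described above (handle, then N(15), the Acul motif, and N(14)) can be sketched as a simple string assembly. This is a minimal illustration under stated assumptions, not the disclosed design procedure: the function name and arguments are hypothetical, the targeted dinucleotide is assumed to lie immediately 3′ of the N(14) block, the six genomic bases at the motif position are assumed to be overwritten by CTGAAG, and only the strand supplied as `genomic` is considered.

```python
HANDLE = "GCAATTCCTCACGAGACCCGTCCTG"  # SEQ ID NO: 3 (25 nt)
ACUI_MOTIF = "CTGAAG"                  # Acul recognition motif
SPACER_LEN = 14                        # N(14) between motif and signature


def design_tagging_primer(genomic, dinuc_start, upstream_len=15):
    """Assemble handle + N(upstream_len) + CTGAAG + N(14) for one target.

    genomic      -- uppercase genomic sequence, 5'->3' (hypothetical input)
    dinuc_start  -- 0-based index of the first base of the targeted dinucleotide
    upstream_len -- 15 or 20 in the embodiments described above
    Returns (primer, targeted_dinucleotide).
    """
    needed = upstream_len + len(ACUI_MOTIF) + SPACER_LEN
    if dinuc_start < needed or dinuc_start + 2 > len(genomic):
        raise ValueError("not enough flanking sequence around the target")
    # The 14 genomic bases immediately 5' of the targeted dinucleotide.
    spacer = genomic[dinuc_start - SPACER_LEN:dinuc_start]
    # The motif replaces the six genomic bases 5' of the spacer.
    motif_start = dinuc_start - SPACER_LEN - len(ACUI_MOTIF)
    upstream = genomic[motif_start - upstream_len:motif_start]
    primer = HANDLE + upstream + ACUI_MOTIF + spacer
    return primer, genomic[dinuc_start:dinuc_start + 2]
```

With the default `upstream_len=15`, the assembled primer is 25 + 15 + 6 + 14 = 60 nucleotides long, which matches the 60 bp primer length mentioned above.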
- In some embodiments, the ligation in step (d)(ii) of the methods disclosed above is carried out by T4 DNA ligase.
- An additional embodiment of the present disclosure is a kit for detecting a genetic modification, comprising a specially designed Acul-tagging primer and a library of DNA adaptors disclosed herein, packaged together with instructions for its use.
- Another embodiment of the present disclosure is a method for quantifying a genomic variant in a biological system, comprising the steps of: (a) obtaining a sample from the biological system; (b) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (c) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the Acul-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; and (f) quantifying the genomic variant and determining its relative abundance.
- In some embodiments, the genomic variant is generated by precision genome editing. In some embodiments, the precision genome editing is CRISPR-dependent homology-directed repair, base editing or prime editing.
- In some embodiments, the biological system is a mammalian cell line, an organoid, or a tissue.
- In some embodiments, the quantification in step (f) of the methods disclosed above is carried out by quantitative PCR (qPCR).
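The text above states that relative abundance is determined by qPCR but does not spell out the calculation at this point. One common way to turn per-adaptor qPCR threshold cycles (Ct) into relative abundances is sketched below. It is an assumption for illustration, not the disclosed formula: it supposes that each signature is captured and amplified with the same efficiency, and the function name, argument names and default efficiency of 2.0 are all hypothetical.

```python
def relative_abundance(ct_values, efficiency=2.0):
    """Convert Ct values per captured signature into fractional abundances.

    ct_values  -- dict mapping a dinucleotide signature (e.g. "GG", "AG")
                  to the Ct measured with the matching adaptor
    efficiency -- assumed per-cycle amplification factor (2.0 = perfect doubling)
    """
    if not ct_values:
        raise ValueError("at least one Ct value is required")
    # A lower Ct means more starting material: quantity ~ efficiency ** (-Ct).
    quantities = {sig: efficiency ** (-ct) for sig, ct in ct_values.items()}
    total = sum(quantities.values())
    return {sig: qty / total for sig, qty in quantities.items()}


if __name__ == "__main__":
    # Hypothetical Ct values for a WT (GG) and an edited (AG) signature.
    print(relative_abundance({"GG": 20.0, "AG": 21.0}))
```

Under these assumptions, a one-cycle difference between two signatures corresponds to a 2:1 ratio, so the example prints roughly two thirds for the WT signature and one third for the edited signature.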
- Still another embodiment of the present disclosure is a method for identifying and quantifying an oncogenic mutation of interest in a biological sample, comprising the steps of: (a) obtaining a biological sample; (b) amplifying a genomic locus of interest using a specially designed Acul-tagging primer, comprising: (i) extracting DNA of interest; (ii) synthesizing the Acul-tagging primer based on the genomic locus of interest; (iii) amplifying the genomic locus of interest using the Acul-tagging primer and a reverse primer; and (iv) purifying an Acul-tagged genomic amplicon; (c) digesting the Acul-tagged genomic amplicon with restriction enzyme Acul; (d) isolating the smaller DNA fragment containing a genomic signature of interest produced by the Acul-digestion; (e) capturing the genomic signature of interest, comprising: (i) preparing the library of DNA adaptors disclosed herein; (ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and (iii) obtaining a ligated product; (f) amplifying the ligated product to identify the presence of the oncogenic mutation of interest; and (g) quantifying the oncogenic mutation of interest, if present, and determining its frequency.
- In some embodiments, the biological sample is obtained from a cancer animal model, a patient-derived xenograft (PDX), or a human cancer patient sample.
- In some embodiments, the quantification in step (g) of the methods disclosed above is carried out by quantitative PCR (qPCR).
- A further embodiment of the present disclosure is a process for marker-free detection of a precision genome editing event comprising carrying out Dinucleotide signaTurE CapTure (DTECT) on a nucleic acid sequence of interest.
- DTECT can also be used to detect genetic signatures in any organism, for example, a virus. Thus, still another embodiment of the present disclosure is a method for detecting a virus variant of interest, comprising the steps of: (a) obtaining a nucleic acid of the virus variant of interest from a biological sample; and (b) if the nucleic acid is DNA, carrying out Dinucleotide signaTurE CapTure (DTECT) to detect the variant of interest; or (c) if the nucleic acid is RNA, converting it to DNA by reverse transcription PCR (RT-PCR) and then carrying out DTECT to detect the variant of interest. This detection method is applicable to any type of virus including but not limited to a DNA virus, an RNA virus, a retrovirus, etc. In some embodiments, the virus is an RNA virus. In some embodiments, the virus is SARS-CoV-2.
- The following examples are provided to further illustrate the methods of the present disclosure. These examples are illustrative only and are not intended to limit the scope of the disclosure in any way.
- Plasmids for DTECT quantification and expression of base editing sgRNAs targeting BRCA1, BRCA2 and FANCD2 have been deposited to Addgene (#139321-139333, and 139511).
- HEK293T and DLD1 cell lines were obtained from ATCC. Cells were cultured in DMEM (ThermoFisher Scientific) supplemented with 10% Fetalgro bovine growth serum (BGS, RMBIO) and 1% penicillin-streptomycin (ThermoFisher Scientific). Cells were grown at 37° C. with 5% CO2 and tested regularly for mycoplasma. NIH/3T3 were maintained in DMEM supplemented with 10% bovine calf serum. Organoids were isolated and cultured as previously described (Zafra et al., 2018). To generate cells constitutively expressing FNLS-BE3-P2A-BlastR, HEK293T cells were infected with a lentivirus expressing the above construct. Viruses were produced in HEK293T in 6-well plates by transfecting 2 µg of FNLS-BE3-P2A-BlastR, 0.2 µg of Tat, 0.2 µg of Gag/Pol, 0.2 µg of Rev, 0.4 µg of VSV-G expressing plasmids in 250 µl of DMEM without serum. 9 µl of TransIT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. The DNA transfection reagent mix was added dropwise to the cells and incubated at 37° C. with 5% CO2. The next day the cell medium was replaced and cells were incubated for 48 hours. The medium containing lentiviruses was then collected and utilized to infect new HEK293T cells. 48 hours after infection, blasticidin was added to the medium until the uninfected control cells were killed. FNLS-BE3 expression was determined by western blot and the base editing activity of the construct was tested using previously validated sgRNAs. Single HEK293T clones were selected for high base editing efficiency. Clones were isolated by trypsinization of the initial cell population into individual cells. Cell density was evaluated by counting the cells with a hemocytometer and cells were diluted to approximately 0.13 cells/µl, equivalent to 20 cells per 150 µl. Serial dilutions were prepared and 150 µl of the diluted cell mixture were seeded into 96-well plates. Single clones were expanded and further examined for FNLS-BE3 expression and activity.
- To induce CRISPR-mediated HDR editing, HEK293T cells were seeded at 50%-70% confluency into 24-well plates and reverse transfected with 0.25 µg of sgRNA and 0.25 µg of Cas9 expressing plasmid (Addgene #42230) with or without 0.5 µl of ssODN (40 µM) into 100 µl of DMEM without Fetalgro BGS and antibiotics. 3 µl of TransIT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. Experiments involving i53 were done by adding 0.25 µg of i53 (Addgene #77939) to the transfection mixture. The gDNAs of cell populations and individual clones were recovered by resuspending the cell pellets in the Quick Extract DNA Extraction Solution (Epicentre), followed by incubation at 65° C. for 10 min and 95° C. for 5 min. The isolated gDNAs were diluted in H2O, quantified using Nanodrop and stored at -20° C. or directly used in PCR reactions. In base editing experiments, we used cells constitutively expressing FNLS-BE3 or transfected with pCMV-BE3 (Addgene #73021) and sgRNAs, as described above. Empty plasmids (Addgene #100708) with no sgRNAs were used as controls. To determine the accuracy of the quantification of variant frequency by DTECT (FIG. 2G), STOP codons were introduced into SPRTN, SMARCAL1 and PIK3R1 genes using iSTOP, as previously described (Billon et al., 2017). To isolate the WT alleles, the locus was amplified by PCR and cloned into the pCR-Blunt II-TOPO vector (ThermoFisher Scientific). The STOP alleles were isolated by PCR amplification using gDNA that was partially edited as template. The PCR product was subsequently digested using restriction enzymes that specifically cleave the WT PCR alleles (i.e., Pvull for SPRTN, SfaNI for SMARCAL1 and Taqal for PIK3R1). The digestion reaction was loaded on a 2% agarose gel and the undigested PCR products were column purified (Zymoclean #D4008). The purified products were subsequently cloned into the pCR-Blunt II-TOPO vector (ThermoFisher Scientific). Cloned WT and STOP PCR fragments were confirmed by Sanger sequencing and are shown in FIG. 10B. RFLP assays were conducted by digesting PCR amplicons of the edited genomic loci with enzymes that recognize restriction sites created or disrupted by editing of the targeted loci. Restriction digest products were run on 6% TBE polyacrylamide gels. Gels were run at 160 V in 1X TBE and stained for 5 min using SybrGold diluted in 1X TBE buffer. In prime editing experiments, 1 µg of pCMV-PE2 (Addgene #132775) was transfected into HEK293T cells along with 500 ng of control pegRNA (Addgene #132777) or pegRNA HEK3 insCTT (Addgene #132778). Three days after transfection, genomic DNA was recovered as above and the edited signature was identified with DTECT. Edited DLD1 (FANCF locus) and NIH/3T3 (Pik3ca and Apc loci) cell populations and mouse intestinal organoids (Pik3ca and Apc loci) were previously described (Zafra et al., 2018). Genomic DNA from the edited cell populations was used to quantify the editing efficiency by DTECT (FIG. 12A).
- In order to introduce multiple variants into the BRCA1 and BRCA2 genes, HEK293T cells expressing FNLS-BE3 were seeded at 50%-70% confluency into 24-well plates and reverse transfected with 1 µg of sgRNA into 100 µl of DMEM without Fetalgro BGS and antibiotics. 3 µl of TransIT-293 (Mirus) were added to the DNA, mixed and incubated for 15 min at room temperature. The DNA transfection mix was added dropwise to the cells and incubated at 37° C. with 5% CO2 for 4 days. Single clones were generated and the gDNAs of cell populations and individual clones were recovered as described above. Genomic loci were Sanger sequenced by Eton Bioscience or Genewiz. Sanger sequencing data were analyzed using Serial Cloner and viewed with SnapGene Viewer. The sequencing profiles shown in this manuscript were generated by SnapGene Viewer. Quantitative detection of the editing level using the Acul-tagged amplicon was done blindly.
- In vivo mouse editing was performed as previously described (Zafra et al., 2018). Briefly, eight week-old C57BL/6N mice (Charles River) were injected with 0.9% sterile sodium chloride solution containing 20 µg of pLenti-FNLS-P2A-Puro and 10 µg of sgRNA vector. The total injection volume corresponded to 20% of the individual mouse body weight and was injected into the lateral tail vein in 5-7 seconds. All animal experiments were authorized by the regional board of Karlsruhe, Germany.
- The generation of genetically engineered mice harboring the Brca1 S1598F and Bard1 S563F alleles was previously described (Billing et al., 2018; Shakya et al., 2011). Mouse genotyping was performed using DTECT on genomic DNA extracted from mouse tails. Acul-tagging of the targeted loci was performed using 50 ng of gDNA (see DTECT protocol above). All primer sequences are listed in Table S1. Genotyping experiments were conducted blindly.
- Competitive transplantation experiments were performed to assess chimerism of Jak2 V617F mutant cells in relation to wild-type support. Specifically, Mx1-Cre+;CD45.2 Jak2V617F/+ and Mx1Cre+;CD45.1 wild-type mice were dosed with polyinosine-polycytosine (PIPC) 8 weeks prior to sacrifice to induce MPN in mutant mice. On the day of sacrifice, dissected femurs and tibias were isolated and bone marrow was flushed with a syringe into PBS. Red blood cells (RBCs) were lysed in ammonium chloride-potassium bicarbonate lysis buffer for 10 min on ice. 1.5 × 10^6 filtered whole donor Mx1-Cre+;Jak2V617F/+ bone marrow cells (CD45.2) were then mixed with 1.5 × 10^6 wild-type competitor bone marrow cells (CD45.1) and transplanted via tail vein injection into lethally irradiated (2 × 550 Rad) CD45.1 host mice. Mice were then monitored serially for the development of MPN based on blood counts and donor chimerism by retroorbital bleed draws using heparinized microhematocrit capillary tubes (ThermoFisher Scientific). After 3 consecutive hematocrits of >65%, mice were sacrificed for peripheral blood fluorescence-activated cell sorting (FACS) analysis and DNA extraction. All animal procedures were conducted in accordance with the Guidelines for the Care and Use of Laboratory Animals and were approved by the Institutional Animal Care and Use Committees at Memorial Sloan Kettering Cancer Center. The conditional Mx1-Cre+;Jak2V617F/+ mice are all on a C57BL/6 background and have been previously described (Mullally et al., 2010). Automated peripheral blood counts were obtained using a ProCyte Dx (IDEXX Laboratories) according to the manufacturer’s protocol. For surface flow cytometry of mouse peripheral blood, bone marrow, and spleen, RBCs were lysed and stained with monoclonal antibodies in PBS plus 1% BSA for 1 hour on ice. For flow cytometry of erythroid lineage, bone marrow or splenic cells were stained without RBC lysis. DAPI was used for live/dead cell analysis. 
Cell populations were analyzed using an LSR Fortessa (Becton Dickinson), and data were analyzed with FlowJo software (Tree Star). DNA extraction was performed using the QIAamp DNA Micro Kit (Qiagen) per manufacturer’s protocol.
- DNA samples from leukemic ALL blasts obtained at diagnosis and after relapse were provided by multiple institutions, as previously described (Oshima et al., 2016). Informed consent was obtained at study entry and samples were collected under the supervision of local Institutional Review Boards for participating institutions and analyzed under the supervision of the Columbia University Irving Medical Center Institutional Review Board. Research was conducted in compliance with ethical regulations. ALL patients received standard combination chemotherapy at diagnosis. Diagnosis and relapse samples were harvested from bone marrow. High molecular weight genomic DNA from matched diagnosis and relapse samples of ALL patients was extracted from patient leukemic blasts or from xenografts using the DNeasy Blood & Tissue Kit (Qiagen) or the AllPrep DNA/RNA Mini Kit (Qiagen). Primary human xenograft ALL cells were passaged and harvested from the spleens of NRG (NOD.Cg-Rag1tm1Mom Il2rgtm1Wjl/SzJ, The Jackson Laboratory) mice. Whole exome sequencing was performed and analyzed as previously described (Oshima et al., 2016).
- sgRNAs were synthesized as complementary oligonucleotides (IDT) compatible with the BbsI restriction sites located in the B52 plasmid (Addgene #100708). Oligonucleotides were designed as previously described (Billon et al., 2017). Cloned sgRNAs were verified by Sanger sequencing. Sequences of the sgRNAs are available in Table S1. ssODNs used in HDR experiments were synthesized as ultramer oligos (IDT) and their sequences are available in Table S1. To generate the FNLS-BE3-P2A-BlastR plasmid, the pLenti-FNLS-P2A-Puro plasmid (Addgene #110841) (Zafra et al., 2018) was modified by replacing the puromycin resistance gene with the blasticidin resistance gene. Briefly, the blasticidin resistance gene coding sequence was amplified by PCR and recombined using Gibson assembly into FNLS-BE3-P2A. The FNLS-BE3-P2A-BlastR sequence was verified by Sanger sequencing.
- The Acul-tagging oligonucleotide enables the insertion of an Acul motif (5′-CTGAAG-3′) 14 bp away from a targeted dinucleotide. This motif is inserted as a hairpin in the middle of a sequence complementary to the targeted genomic locus. The Acul-tagging oligonucleotide is 60 bp-long and contains a non-complementary handle sequence of 20-25 bp. Common handle sequences used are PB547 (5′-GATCCTCTAGAGTCGACCTG-3′) (SEQ ID NO: 1) or PB1072 (5′-GCAATTCCTCACGAGACCCGTCCTG-3′) (SEQ ID NO: 3) (Table S1). The oligonucleotide sequence complementary to the targeted genomic locus plus the Acul motif has the following sequence: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C bases complementary to the targeted locus. Reverse primers used in Acul-tagging reactions were designed by Primer 3 (http://bioinfo.ut.ee/primer3-0.4.0/) using the default parameters with the following changes: Mispriming library = “HUMAN” for amplifying from human genomic DNA or Mispriming library = “RODENT” for amplifying from mouse genomic DNA, Primer size “min = 25, Opt = 27, Max = 30”, Primer Tm “Min = 57.0° C., Opt = 60.0° C., Max = 63.0° C.”. Reverse primers are located >100 bp away from the targeted dinucleotides. All sequences of the primers used in this study are available in Table S1.
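The layout of the Acul-tagging oligonucleotide described above (handle, then N(20) or N(15), then CTGAAG, then N(14), for a total of 60 nt) can be illustrated with a short Python sketch. This is only an illustration under stated assumptions, not the design tool used in the disclosure: the function names are hypothetical, the locus is assumed to be given as the strand the primer should match, and the motif is assumed to be inserted between two contiguous genomic segments so that the targeted dinucleotide lies immediately 3′ of the oligonucleotide.

```python
ACUI_MOTIF = "CTGAAG"                 # Acul recognition motif from the text
PB547 = "GATCCTCTAGAGTCGACCTG"        # 20-nt handle (SEQ ID NO: 1)
PB1072 = "GCAATTCCTCACGAGACCCGTCCTG"  # 25-nt handle (SEQ ID NO: 3)
OLIGO_LENGTH = 60                     # total length stated in the text
SPACER = 14                           # bases between the motif and the target


def design_tagging_oligo(locus: str, target_start: int, handle: str = PB547) -> str:
    """Build handle + N(k) + CTGAAG + N(14) for a targeted dinucleotide.

    `target_start` is the 0-based index of the first base of the targeted
    dinucleotide in `locus` (hypothetical convention for this sketch).
    k is chosen so that the oligo is 60 nt: 20 for PB547, 15 for PB1072.
    """
    upstream_len = OLIGO_LENGTH - len(handle) - len(ACUI_MOTIF) - SPACER
    motif_site = target_start - SPACER  # assumed insertion point of the motif
    if motif_site - upstream_len < 0 or target_start + 2 > len(locus):
        raise ValueError("locus too short around the targeted dinucleotide")
    upstream = locus[motif_site - upstream_len:motif_site]
    downstream = locus[motif_site:target_start]
    return handle + upstream + ACUI_MOTIF + downstream


def targeted_dinucleotide(locus: str, target_start: int) -> str:
    """Return the two bases expected to be exposed after Acul digestion."""
    return locus[target_start:target_start + 2]
```

With either handle the sketch returns a 60-nt oligonucleotide whose last 14 bases end directly before the targeted dinucleotide, matching the 5′-N(20)CTGAAGN(14)-3′ and 5′-N(15)CTGAAGN(14)-3′ layouts given above.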
- A set of 17 individual oligonucleotides constitutes the full adaptor library. This library contains: a) One constant oligonucleotide with the following sequence: 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTcttctaagaattcgagctcggtacccg-3′ (SEQ ID NO: 230). The lowercase nucleotide sequence located at the 3′-end of the constant oligonucleotide (5′-cttctaagaattcgagctcggtacccg-3′) (SEQ ID NO: 319) corresponds to the handle sequence used to detect the ligated products with either PB548 (5′-cgggtaccgagctcgaattc-3′) (SEQ ID NO: 2) or PB1073 (5′-cgggtaccgagctcgaattcttagaag-3′) (SEQ ID NO: 4); b) 16 variable oligonucleotides that contain a sequence complementary to the constant oligonucleotide plus one of 16 different dinucleotides at their 3′-end. The variable oligonucleotides have the following sequence: 5′-cgggtaccgagctcgaattcttagaagAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′. NN, with N = A, C, G or T (SEQ ID NOs: 231-246), corresponds to the dinucleotide that is different for each of the 16 oligos. The adaptor sequences are available in Table S1. The constant oligonucleotide and each variable oligonucleotide were resuspended at a concentration of 100 µM in H2O. 2.5 µl of constant oligonucleotide and 2.5 µl of each variable oligonucleotide were mixed with 1X ligase buffer (ThermoFisher Scientific) and water in a 20 µl reaction. The reactions were placed in a thermocycler and oligonucleotides were annealed by incubating them for 5 min at 95° C., followed by a gradual temperature decrease from 95° C. to 15° C. After annealing was completed, 100 µl of water were added to dilute the adaptors in a 120 µl final volume. Adaptors were frozen and stored at -20° C.
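As an illustration of the adaptor library described above, the following Python sketch enumerates the 16 variable oligonucleotides and checks that their constant portion is the reverse complement of the constant oligonucleotide. The function names are hypothetical. The pairing rule (adaptor dinucleotide = reverse complement of the exposed signature) is an inference from the specific adaptors listed for FIG. 9C (e.g., AC signature with GT adaptor), and the concentration helper simply restates the volumes given above.

```python
from itertools import product

# Sequences copied from the text (SEQ ID NO: 230 and the variable stem).
CONSTANT = "CTGGGGCACGGGTAAGAAGCATTCTGTCTCTcttctaagaattcgagctcggtacccg"
VARIABLE_STEM = "cgggtaccgagctcgaattcttagaagAGAGACAGAATGCTTCTTACCCGTGCCCCAG"

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence, preserving letter case."""
    return seq.translate(_COMPLEMENT)[::-1]


def variable_oligos() -> dict:
    """Map each 3'-dinucleotide (NN) to its full variable oligonucleotide."""
    return {a + b: VARIABLE_STEM + a + b for a, b in product("ACGT", repeat=2)}


def adaptor_for_signature(signature: str) -> str:
    """Dinucleotide of the adaptor assumed to capture an exposed signature."""
    return reverse_complement(signature.upper())


def annealed_concentration_um(stock_um: float = 100.0,
                              oligo_ul: float = 2.5,
                              final_ul: float = 120.0) -> float:
    """Concentration of each oligo after annealing and dilution to 120 µl."""
    return stock_um * oligo_ul / final_ul
```

Under these assumptions, each strand ends up at roughly 2.1 µM in the diluted adaptor stock (2.5 µl of 100 µM oligonucleotide in a 120 µl final volume).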
- The adaptor library was tested at two independent loci, as shown in
FIG. 9C . In this assay, Acul-tagging oligonucleotides targeting the ampicillin resistance gene were designed following the rules detailed above (Table S1). First, we linearized the pUC19 plasmid as follows: 1.5 µg of pUC19, 1X CutSmart Buffer (NEB) and 0.75 µl of BamHI-HF were mixed in a 30 µl reaction and incubated for 2 hours at 37° C. The digested plasmid was subsequently purified on column (Zymoclean #D4008) and used as a template in PCR reactions with each Acul-tagging primer and a constant reverse primer (5′-CCAATGCTTAATCAGTGAGG-3′) (SEQ ID NO: 320) located at the 3′-side of the ampicillin resistance gene. The PCRs were performed in a 25 µl reaction containing: 1 µM forward and reverse primers, 0.1 mM dNTP (NEB #N0447L), 1X Q5 buffer (NEB), 20 ng of digested pUC19, 1 unit of Q5 polymerase (NEB) and water. The PCR program used was the following: 95° C. for 1 min, 40 cycles of 95° C. for 10 s, 58° C. for 10 s, 72° C. for 45 s and a final amplification step of 1 min at 72° C. PCR reactions were loaded on a 2% agarose gel, extracted from gel and purified on column (Zymoclean #D4008). Finally, the DTECT protocol was applied as described below. Briefly, 0.5 pmol of Acul-tagging PCR products were digested by Acul for 30 min at 37° C. 10 µl of the digested products were purified with 18 µl of solid phase reversible immobilization magnetic beads (Beckman Coulter #A63881). 20 µl of supernatant (unbound fraction) were recovered and 0.5 µl of this supernatant were ligated using complementary and negative control adaptors for 1 hour at 25° C., followed by T4 ligase inactivation for 10 min at 65° C. The complementary and negative control adaptors used inFIG. 
9C are the following: AA #1 (Specific adaptor: TT, Non-specific adaptor: CC), AA #2 (TT, CC), AC #1 (GT, AC), AC #2 (GT, AA), AG #1 (CT, GA), AG #2 (CT, GA), AT #1 (AT, GG), AT #2 (AT, GG), CA #1 (TG, CA), CA #2 (TG, CA), CC #1 (GG, CC), CC #2 (GG, CC), CG #1 (CG, AA), CG #2 (CG, AA), CT #1 (AG, TT), CT #2 (AG, TT), GA #1 (TC, GA), GA #2 (TC, GA), GC #1 (GC, TT), GC #2 (GC, TT), GG #1 (CC, TT), GG #2 (CC, TT), GT #1 (AC, TG), GT #2 (AC, TG), TA #1 (TA, GG), TA #2 (TA, GG), TC #1 (GA, CT), TC #2 (GA, CT), TG #1 (CA, TG), TG #2 (CA, TG), TT #1 (AA, GG) and TT #2 (AA, GG). The ligated products were subsequently detected by PCR amplification using the primers PB547 (5′-gatcctctagagtcgacctg-3′) (SEQ ID NO: 1) and PB1073 (5′-cgggtaccgagctcgaattcttagaag-3′) (SEQ ID NO: 4). All primer sequences are listed in Table S1. - The measurement of the dinucleotide capture efficiency of each adaptor (
FIGS. 2J-2K) was determined by ligating the 16 different adaptors to annealed oligonucleotides containing complementary dinucleotides. To mimic the 5′ phosphorylation induced by Acul in DTECT experiments, the reverse oligonucleotide (PB1449: 5′-gtagttcgccagttCTTCAGaatagtttgcgcaCAGGACGGGTCTCGTGAGGAATTGC-3′) (SEQ ID NO: 91) was phosphorylated with PNK (NEB). The phosphorylation reaction was conducted as follows: 5 µl of PB1449 (100 µM), 4 µl of 5X ligase buffer, 0.5 µl of PNK in a 20 µl reaction. Phosphorylation was obtained upon incubation for 1 hour at 37° C., followed by heat inactivation of PNK for 20 min at 65° C. After incubation, the phosphorylated oligonucleotide PB1449 was annealed to 16 complementary oligonucleotides with the following sequence: 5′-GCAATTCCTCACGAGACCCGTCCTGTGCGCAAACTATTCTGAAGAACTGGCGAACTACNN-3′ (SEQ ID NOs: 231-246). The two Ns indicate the dinucleotide that is different for each of the 16 oligos, with N = A, C, G or T. In the annealing reaction, 40 µl of 5X ligase buffer and 130 µl of H2O were added to the phosphorylation reaction. 9.5 µl of this mix were used for annealing with 0.5 µl of each of the above 16 oligos (50 µM). Annealing, which was performed as described above for the library of adaptors, resulted in a 5′-phosphorylated double-stranded DNA with an overhang of 2 nucleotides, mimicking the product of Acul digestion. The ligation between the adaptors and the phosphorylated products was performed as follows: 1 µl of annealed oligonucleotides, 2 µl of T4 ligase buffer, 0.5 µl of T4 ligase and 0.5 µl of adaptors in a 10 µl reaction. The ligation reaction was incubated for 1 hour at 25° C. and 10 min at 65° C. Detection was performed using qPCR as described below in the DTECT protocol.
- The assay performed to measure the efficiency of DNA ligation (
FIG. 10F) was conducted in a master mix reaction equivalent to 5 µl per time point as follows: 0.5 µl of Acul-digested products, 1 µl of T4 ligase buffer and 0.5 µl of adaptors with or without 0.5 µl of T4 ligase. The reactions were incubated at 25° C. After 5 min, 5 µl were taken from the reaction and the T4 ligase was heat inactivated for 10 min at 65° C. One hour after the start of the ligation reaction, an additional 5 µl were taken from the reaction and heat inactivated. The rest of the reaction was incubated overnight for 16 hours and heat inactivated. The amount of products captured was determined by qPCR as described below.
- To calculate the frequency of non-specific dinucleotide capture shown in
FIG. 10E , Acul-generated fragments of WT SMARCAL1, SPRTN and PIK3R1 amplicons (obtained as described below) were ligated to each of the 16 library adaptors under the adaptor ligation conditions described above. The frequency of non-specific dinucleotide capture for all the adaptors non-complementary to the SMARCAL1, SPRTN and PIK3R1 dinucleotide signatures was calculated by qPCR analysis, as described below. Adaptors complementary to +1 and -1 Acul-dependent slippage events were excluded from the analysis. - The DTECT protocol consists of 6 steps (I-VI,
FIG. 1A). I) Design of the Acul-tagging primer, as described above. II) Amplification of the genomic locus of interest using the Acul-tagging primer. The genomic DNA (gDNA) is prepared using the Quick Extract Solution (Epicentre) by incubating the cells at 65° C. for 10 min and 95° C. for 5 min. The genomic DNA is quantified by Nanodrop, diluted to 200 ng/µl in H2O and stored at -20° C. or immediately used in PCR reactions. PCRs were performed in a 25 µl or 50 µl solution containing: 1 µM forward and reverse primers, 0.1 mM dNTP (NEB #N0447L), 1X Q5 buffer (NEB), 10-200 ng of gDNA, 1 unit of Q5 polymerase (NEB) and water. PCR reactions were conducted as follows: 95° C. for 30 s; 40 cycles of 95° C. for 10 s, 58° C. for 10 s, 72° C. for 45 s; and final amplification at 72° C. for 1 min. When the Acul-tagging PCR did not work on gDNA (<5% of the cases), a PCR using standard locus-specific primers was performed to amplify the targeted locus and the Acul-tagging PCR was conducted using this amplicon as template DNA. PCR products were loaded on a 2% agarose gel and run in TAE buffer. PCR products were extracted from gel and column purified (Zymo Research #D4008) and the purified products were subsequently quantified using Nanodrop. III) Digestion of the Acul-tagged genomic amplicon with Acul. The purified PCR products were digested by 0.25 µl Acul (NEB #R0641L) in a 20 µl reaction containing 1X CutSmart Buffer (NEB) supplemented with 40 µM S-adenosylmethionine (SAM) and 100 ng of purified PCR product. The reaction was incubated for 1 hour at 37° C. with heat inactivation at 65° C. for 20 min. IV) Isolation of the Acul-digested genomic amplicon by solid phase reversible immobilization (SPRI). 10 µl of the digestion reaction were subsequently mixed with 18 µl of Agencourt AMPure XP magnetic beads (Beckman Coulter #A63881) by pipetting the beads up and down 10 times (volume ratio of DNA:beads = 1:1.8) and then incubated at room temperature for 5 min. 
This procedure resulted in the binding of the larger digestion fragment (>100 bp) to the beads, while the smaller digested fragment (60 bp) remained in the supernatant. After incubation, the supernatant was isolated using a magnetic rack. 20 µl of the supernatant were recovered, diluted in 40 µl of H2O and stored at -20° C. or immediately used for capture with DNA adaptors. V) Capture of the digested 60 bp-long products using DNA adaptors. The purified 60 bp-long DNA fragments were ligated to DNA adaptors generated as described above. The adaptors and the purified products were ligated in the following reaction: 6.5 µl of water, 2 µl of 5X ligase buffer (ThermoFisher Scientific), 0.5 µl of T4 ligase (ThermoFisher Scientific), 0.5 µl of adaptors and 0.5 µl of purified DNA product. The ligation reaction was performed for 1 hour at 25° C. in a thermocycler, followed by inactivation of the T4 ligase for 10 min at 65° C. The ligated products were stored at -20° C. or used directly for detection of the captured material. VI) Analytical or quantitative detection of the captured DNA products by PCR amplification. For analytical detection, the amplification of the captured material was performed by PCR in a 12.5 or 25 µl reaction volume containing 0.5 µM forward and reverse primers, 0.05 mM dNTP (NEB #N0447L), 1X Q5 buffer (NEB), 0.5-1 µl of ligated product, 0.1-0.2 µl of Q5 polymerase (NEB) and water. PCR primers (PB1072 and PB1073) contained sequences complementary to the adaptor and handle (see above). The PCR program used was the following: 95° C. for 1 min, followed by a variable number of cycles (indicated in each figure legend) of 95° C. for 10 s, 65° C. for 5 s, 72° C. for 7 s. Detection of low-abundance genomic variants (≤1% frequency) was generally obtained with 23-25 PCR cycles, while detection of greater amounts of edited products was achieved with 17-22 PCR cycles. 
5 µl of the PCR reactions were incubated with SYBR Gold (ThermoFisher Scientific #S-11494), loaded on a 2% agarose gel and run in 1X TAE buffer until the DNA was separated. Gels were developed using LI-COR Odyssey. qPCR was performed using QuantStudio 3 (Applied Biosystems). qPCR reactions were performed as follows: 5 µl of 2X SYBR Gold master mix (ThermoFisher Scientific #4367659), 0.1 µl of forward and reverse primers (PB1072 and PB1073, 100 µM) and 1 µl of ligated products (diluted 1:100 in H2O) in a 10 µl reaction. The PCR program used in the qPCR reaction was the following: 95° C. for 10 s and 40 cycles of 60° C. for 30 s, 95° C. for 15 s. Quantification of the frequency of genomic variants was conducted as described below (Quantification and Statistical Analysis section).
- Samples for NGS were prepared by amplifying the edited regions of interest by PCR. Samples were sequenced by the Genome Sciences Facility at The Pennsylvania State College of Medicine or by Genewiz and the results were analyzed by Genewiz, or by using an R-based script of the Ciccia laboratory or CRISPResso2 (Clement et al., 2019). To ensure that no biases were introduced during DTECT assays, the Acul-tagging amplicons for the BRCA1 and BRCA2 mutant samples were sequenced by NGS and analyzed using an R-based script. In this analysis, 7 sequences with >6000 reads were filtered out from the analysis due to incorrect sequence. The editing frequency from the NGS results was determined using the formula: ((Number of reads for the edited dinucleotide) / (total number of reads)) x 100. Oligonucleotides used for PCR amplifications, Illumina sequencing adaptors and indexes are listed in Table S1.
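The NGS editing-frequency formula above can be written as a small Python function. This is a minimal sketch, not the R-based script of the Ciccia laboratory or CRISPResso2: the function name is hypothetical, and it assumes that reads are already trimmed and oriented so that the targeted dinucleotide starts at the same 0-based position in every read.

```python
from collections import Counter


def editing_frequency(reads, position, edited_dinucleotide):
    """Percent of reads carrying the edited dinucleotide at `position`.

    Implements ((reads with edited dinucleotide) / (total reads)) x 100.
    Reads too short to cover the dinucleotide are ignored (an assumption
    of this sketch, not a rule stated in the text).
    """
    counts = Counter(
        read[position:position + 2]
        for read in reads
        if len(read) >= position + 2
    )
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reads cover the targeted dinucleotide")
    return 100.0 * counts[edited_dinucleotide] / total
```

For example, if one read out of four carries the edited dinucleotide at the targeted position, the function returns 25.0.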
- Technical duplicates of each sample were performed in each qPCR reaction. A standard curve to determine the concentration of the captured material was generated using predefined concentrations of a DTECT ligation product (
FIG. 1A, step V) cloned into the pCR-Blunt II-TOPO vector (ThermoFisher Scientific; B650 plasmid, Addgene #139333) and oligos PB1072 and PB1073 (Table S1). The calculated standard curve is linear, with the following parameters: y = -3.3245x + 7.5504 and R^2 = 0.99819. To quantify the frequency of genomic variants, the mean Ct score (Mean Ct) of the two technical duplicates was calculated for each sample. The concentration of the captured material for each sample was determined using the following formula: Concentration = 10^((Mean Ct - 7.5504)/-3.3245). The relative abundance of the WT and mutant signatures was determined as follows: Frequency_Mutant = (Concentration_Mutant / (Concentration_Mutant + Concentration_WT)) x 100 and Frequency_WT = (Concentration_WT / (Concentration_Mutant + Concentration_WT)) x 100.
- R-based scripts of the Ciccia laboratory for analysis of NGS reads and ClinVar datasets are available upon request. Raw NGS reads of edited DLD1 and NIH/3T3 cells, organoids and liver samples are available under accession SRP151111 in the Sequence Read Archive. NGS reads have been deposited into the NCBI database and are accessible as BioProject # PRJNA603357. All uncropped gels, raw qPCR data and Sanger sequencing reads are available in Mendeley (https://data.mendeley.com/datasets/gtkk6sthtw/draft?a=ca72630e-56eb-4e29-bcdb-158b2c7d4123).
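The standard-curve quantification described in the preceding section can be sketched in Python as follows. The slope and intercept are the values reported above; the function names are hypothetical, and the concentration is expressed in the (unspecified) units of the standard curve.

```python
# Standard curve reported in the text: Ct = SLOPE * log10(concentration) + INTERCEPT
SLOPE = -3.3245
INTERCEPT = 7.5504


def concentration_from_ct(ct_values):
    """Concentration from the mean Ct of technical replicates.

    Implements Concentration = 10^((Mean Ct - 7.5504) / -3.3245).
    """
    mean_ct = sum(ct_values) / len(ct_values)
    return 10 ** ((mean_ct - INTERCEPT) / SLOPE)


def variant_frequencies(ct_mutant, ct_wt):
    """Return (mutant %, WT %) from the Ct values of the two adaptor reactions."""
    conc_mutant = concentration_from_ct(ct_mutant)
    conc_wt = concentration_from_ct(ct_wt)
    total = conc_mutant + conc_wt
    return 100.0 * conc_mutant / total, 100.0 * conc_wt / total
```

As a sanity check on the formula, a mutant reaction whose mean Ct is 3.3245 cycles higher than the WT reaction corresponds to a tenfold lower concentration, i.e., a mutant frequency of about 9.1%.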
-
KEY RESOURCES TABLE
REAGENT or RESOURCE | SOURCE | IDENTIFIER
Bacterial and Virus Strains
Subcloning Efficiency DH5α | ThermoFisher Scientific | 18265-017
Chemicals, Peptides, and Recombinant Proteins
Q5 High-Fidelity DNA polymerase | NEB | M0491L
T4 DNA ligase | ThermoFisher Scientific | 15224017
Acul | NEB | R0641L
rSAP | NEB | M0371L
SybrGold (for gel staining) | ThermoFisher Scientific | S-11494
SybrGold (for qPCR) | ThermoFisher Scientific | 4367659
BamHI-HF | NEB | R3136S
dNTPs | NEB | N0447L
T4 Polynucleotide Kinase | NEB | M0201S
Critical Commercial Assays
Agencourt AMPure XP magnetic beads | Beckman Coulter | A63881
Zymoclean gel DNA recovery kit | Zymo Research | D4008
Quick Extract DNA Extraction Solution | Epicentre | QE09050
Zero BLUNT II TOPO PCR Cloning kit | ThermoFisher Scientific | 450245
Deposited Data
Unprocessed images of gels | This disclosure, Mendeley Data | Raw gel images
Raw Sanger sequencing files | This disclosure, Mendeley Data | Sequences of BRCA1-2 edited cells; Repeated sequences
Raw NGS sequencing files | This disclosure, NCBI | BioProject # PRJNA603357
Raw and processed qPCR data | This disclosure, Mendeley Data | Raw and processed qPCR data
Raw and processed DTECT, ICE, EditR and NGS data | This disclosure, Mendeley Data | Quantification of BRCA1-2 variants by DTECT, ICE, EditR and NGS

H: HEK293T | ATCC | CRL-11268
Human: DLD1 | ATCC | CCL-221
Mouse: NIH/3T3 | ATCC | CRL-1658

Mouse: C57BL/6N | Charles River | C57BL/6NCrl
Mouse: Brca1S1598F/+ | Shakya et al, 2011 | N/A
Mouse: Bard1S563F/+ | Billing et al, 2018 | N/A
Mouse: Mx1Cre+;CD45.1 | Mullally et al, 2010 | N/A
Mouse: Mx1-Cre+;CD45.2 Jak2V617F/+ | Mullally et al, 2010 | N/A
Mouse: NRG | The Jackson Laboratory | 007799

Primers for PCR | This disclosure | Table S1
Oligonucleotides for sgRNA cloning | This disclosure | Table S1
ssODNs (for HDR) | This disclosure | Table S1
Oligonucleotides for adaptors | This disclosure | Table S1

Plasmid: B52 (containing 2 empty sgRNAs-expressing cassettes) | Addgene | 100708
pCMV-PE2 | Addgene | 132775
pCMV-BE3 | Addgene | 73021
DTECT - Plasmid for standard curve | This disclosure, Addgene | 139333
pTOPO-SPRTN WT | This disclosure | N/A
pTOPO-SPRTN STOP | This disclosure | N/A
pTOPO-SMARCAL1 WT | This disclosure | N/A
pTOPO-SMARCAL1 STOP | This disclosure | N/A
pTOPO-PIK3R1 WT | This disclosure | N/A
pTOPO-PIK3R1 STOP | This disclosure | N/A
pX330-U6-Chimeric_BB-CBh-hSpCas9 | Addgene | 42230
pCDNA3-Flag::UbvG08 I44A, deltaGG | Addgene | 74939
pU6-Sp-pegRNA-HEK3-CTT_ins | Addgene | 132778
Plasmids expressing sgRNAs for base editing of FANCD2, BRCA1 and BRCA2 | This disclosure, Addgene | 139321-139332, and 139511

R Studio Desktop IDE 1.0.143 | RStudio | https://www.rstudio.com
Bioconductor R packages | Bioconductor | https://www.bioconductor.org
R 3.4.1 | The R project for statistical computing | https://www.r-project.org
ClinVar database | NCBI | https://www.ncbi.nlm.nih.gov/clinvar/

Li-COR Odyssey | N/A | https://www.licor.com/bio/products/imaging_systems/odyssey
qPCR QuantStudio 3 | Applied Biosystems | N/A
- In our detection method, we take advantage of the property of type IIS restriction enzymes to generate single-stranded DNA overhangs at a specific distance from their recognition motif. Based on the above property, we hypothesized that single-stranded DNA overhangs generated by digestion of genomic DNA sequences with type IIS restriction enzymes could be captured and identified using DNA adaptors containing overhangs complementary to the exposed DNA signatures (
FIG. 1A ). To identify type IIS enzymes with efficient and accurate endonuclease activity, we analyzed the properties of known type IIS enzymes. Restriction enzymes optimal for our method exhibit the following characteristics: a) they cleave far from their recognition motif, thus enabling the incorporation of non-complementary type IIS recognition motifs into PCR primers without disrupting genomic DNA amplification (FIGS. 1A and 8A ); b) they bind a single recognition motif (Bath et al., 2002) (FIG. 8A ); and c) they possess highly specific endonuclease activity, therefore generating a limited number of cleavage byproducts due to slippage activity (Lundin et al., 2015) (FIG. 8B ). Among the >40 known type IIS endonucleases, only 6 enzymes cleave at a distance ≥14 bp from their recognition motif (Acul, Bpml, BpuEI, BsgI, Mmel and NmeAIII) (FIG. 8C ). Of those enzymes, only Acul and BpuEI have a single recognition motif, and Acul exhibits the lowest slippage activity of the two enzymes (slippage byproducts: Acul, 1.1%; BpuEI, 41.4%) (Lundin et al., 2015). In particular, upon DNA cleavage Acul exposes a dinucleotide signature located 15/16 nucleotides away from its recognition site (FIG. 8D ). Based on the above considerations, Acul is the most suitable restriction enzyme for our detection method. - In our approach, the genomic locus of interest is PCR-amplified using a locus-specific DNA primer (red) and a DNA oligonucleotide (Acul-tagging primer) containing two regions of complementarity to the genomic locus (purple) interrupted by an Acul recognition site (Acul hairpin, green) positioned 14 bp upstream of a dinucleotide of interest (
FIG. 1A, steps I and II). Tagging of the genomic amplicon with an Acul motif allows Acul-mediated digestion of the sequence of interest on the 3′-side of the targeted dinucleotide. Upon Acul-mediated digestion, the signature of the targeted dinucleotide becomes exposed (FIG. 1A, step III). To proceed with a single DNA fragment containing the targeted dinucleotide, the larger DNA fragment (>100 bp) resulting from Acul-mediated digestion is removed using solid phase reversible immobilization (SPRI) beads (FIG. 1A, step IV) and the smaller DNA fragment (60 bp) containing the targeted dinucleotide is ligated to an adaptor with a 3′-overhang complementary to the exposed signature (FIG. 1A, step V). The ligated DNA products are subsequently detected by analytical or quantitative PCR (qPCR) (FIG. 1A, step VI). This method, which we named DTECT (Dinucleotide signaTurE CapTure), can be completed within 4-5 hours (FIG. 1A). A common set of DNA primers that anneal to constant regions in the Acul-digested fragments (blue) and the ligated adaptors (brown) is utilized in all DTECT experiments (FIG. 1A, step VI), avoiding locus-specific amplification bias and variability in qPCR efficiency among distinct sets of samples. Since there are only 16 unique dinucleotides (2^4), a library of 16 distinct adaptors is sufficient to capture all dinucleotide signatures that can be generated by Acul (FIG. 1B). Given the possible use of positive and negative controls to determine the efficiency and specificity of dinucleotide capture (FIG. 1C), DTECT provides a highly controlled assessment of successful and specific capture of dinucleotide signatures.
- To demonstrate the feasibility of DTECT, we designed two Acul-tagging DNA primers flanking four adjacent bases (5′-TTGG-3′) on opposite DNA strands (TT and CC signatures, blue) (
FIG. 2A ). Upon PCR amplification using Acul-tagging primers and locus-specific DNA primers, the PCR amplicons were digested and ligated to adaptors with either complementary or non-specific 3′-overhangs (GG or AA). Detection of the ligated products by PCR, as described above, revealed that the GG and AA adaptors specifically captured the DNA fragments containing the CC and TT dinucleotides, respectively (FIG. 2B ). Sanger sequencing confirmed that the amplicons of the ligated DNA products had the expected genomic sequence (purple) adjacent to the Acul motif (green) and the GG or AA adaptors (brown) (FIGS. 9A-9B ). Importantly, robust amplification of captured DNA products was observed only upon 1) capture of the Acul-digested products with complementary adaptors (FIG. 2B ), 2) Acul-mediated cutting and generation of 5′-phosphorylated DNA fragments (FIGS. 2C-2D ), and 3) DNA ligation by the T4 DNA ligase (FIG. 2D ). We additionally showed that each individual DNA base can be identified by designing 4 independent Acul-tagging primers (2 on each DNA strand), thus enabling the capture of 4 distinct signatures per genomic DNA base (FIGS. 2E-2F ). This DTECT feature allows flexible Acul-mediated cleavage of genomic DNA amplicons containing targeted DNA sequences. In additional studies, we confirmed that each of the 16 possible dinucleotide signatures generated by Acul at two independent target sites can be efficiently captured using DNA adaptors containing complementary DNA overhangs (FIG. 9C ). Together, these studies establish DTECT as a rapid and efficient method to identify DNA bases through the capture of Acul-induced dinucleotide signatures using a common and unique set of adaptors. - Next, we examined whether DTECT can determine the relative abundance of DNA variants with distinct DNA signatures, including low abundance DNA variants. 
To this end, we transfected HEK293T cells with sgRNAs that introduce nonsense mutations into the SPRTN, PIK3R1 and SMARCAL1 genes using iSTOP, a CRISPR-mediated base editing approach that creates STOP codons within genes of interest (Billon et al., 2017) (
FIG. 10A ). We then cloned both WT and mutant alleles, which differ by a single base change (C -> T) (FIG. 10B ), and subjected them to PCR amplification using a locus-specific DNA primer and an Acul-tagging primer flanking the iSTOP-targeted DNA base (FIG. 10C ). The WT and edited PCR products were then mixed at different ratios (WT - STOP allele = 100-0, 99-1, 90-10, 75-25, 50-50, 25-75 or 10-90) and digested with Acul. The resulting DNA fragments were then captured using adaptors complementary to WT (green) and STOP (purple) dinucleotide signatures (FIG. 10A ). Remarkably, qPCR analysis of the captured DNA fragments accurately determined the relative abundance of the WT and STOP alleles at the three loci indicated above (FIG. 2G ), demonstrating that DTECT can estimate the frequency of dinucleotide signatures in a mixed population with high precision, including variants with low abundance (1%) (FIG. 2G ). Low abundance STOP variants in SPRTN and PIK3R1 were also detectable by analytical PCR (FIGS. 2H-2I and 10C-10D ), confirming the high sensitivity and accuracy of DTECT. Importantly, direct comparison of the 16 DTECT adaptors revealed comparable efficiency in the capture of oligonucleotides containing complementary dinucleotide signatures (FIGS. 2J-2K ). In addition, all adaptors exhibited low levels of non-specific capture background (mean = 0.325%, ranging from 0.16% to 0.876%) (FIG. 10E ). The above observations indicate that the adaptor ligation is conducted under optimal conditions, as confirmed by kinetic analysis of the adaptor ligation reaction (FIG. 10F ). Together, these findings demonstrate that DTECT captures dinucleotide variants and quantifies their relative abundance with high specificity and sensitivity. 
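To give a sense of the signal separation involved in the mixing experiment above, the following Python sketch estimates the expected Ct difference between the STOP and WT adaptor reactions for each WT:STOP ratio. It is a back-of-the-envelope illustration, not part of the disclosed protocol: it reuses the standard-curve slope reported in the Quantification and Statistical Analysis section and assumes that both adaptors capture their signatures with equal efficiency. The function name is hypothetical.

```python
import math

SLOPE = -3.3245  # standard-curve slope reported in the quantification section

# WT:STOP mixing ratios used in the experiment (percent WT, percent STOP)
MIXTURES = [(100, 0), (99, 1), (90, 10), (75, 25), (50, 50), (25, 75), (10, 90)]


def expected_delta_ct(wt_percent, stop_percent):
    """Expected Ct(STOP) - Ct(WT) for a given mixture.

    Returns None when one allele is absent, because the Ct difference is
    undefined if there is no template for one of the adaptors.
    """
    if wt_percent <= 0 or stop_percent <= 0:
        return None
    return SLOPE * math.log10(stop_percent / wt_percent)


for wt, stop in MIXTURES:
    print(wt, stop, expected_delta_ct(wt, stop))
```

Under these assumptions, a 1% STOP variant in a 99% WT background is expected to amplify about 6.6 cycles later than the WT signature, while a 50:50 mixture gives no Ct difference.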
- To examine the ability of DTECT to identify precise genomic changes introduced into mammalian cell populations, we utilized CRISPR-mediated HDR for generating various types of disease-related mutations using single-stranded oligodeoxynucleotides (ssODNs), including a cancer-associated frameshift mutation in TP53 (i.e., R209fs*6), a missense mutation in HBB (i.e., G6V) that causes sickle cell anemia, a small tandem duplication in BRCA2 (dupAGAAGAT) identified in breast cancer, and small insertions into JAK2 and EMX1 (Paulsen et al., 2017), two genes associated with myeloproliferative disorders and Kallmann syndrome, respectively. Three days after co-transfection of Cas9 with site-specific sgRNAs and ssODNs into HEK293T cells, we harvested the cellular genomic DNA and utilized DTECT to determine by analytical and quantitative PCR whether the desired changes were incorporated into the targeted chromosomal loci (
FIG. 3A). For comparison, a restriction fragment length polymorphism (RFLP) assay that monitors restriction sites disrupted or created by the above mutations in the targeted genomic loci was conducted in parallel. In these experiments, DTECT readily captured the specific signature of the mutant variants (FIGS. 3B and 11A-11C), while the RFLP assay either failed to detect or only weakly detected the same mutant variants (FIGS. 11F-11H). In addition, DTECT was able to discern the HDR stimulatory effect induced by i53 (FIGS. 3B and 11A-11B), a genetically encoded 53BP1 inhibitor that was previously shown to increase the frequency of HDR events (Canny et al., 2018), indicating that DTECT can be employed to compare editing levels between distinct experimental conditions. Importantly, DTECT also clearly determined which mutations failed to be incorporated by the HDR machinery (e.g., BRCA2 dupAGAAGAT), as confirmed by NGS analysis (FIGS. 11D-11E). Next, to determine whether DTECT can identify precise genomic changes introduced by CRISPR-mediated base editing in mammalian cell populations, we used a cytidine base editor to install nonsense mutations into the Fanconi anemia-associated genes FANCD2, FANCM and SLX4, the DNA replication and circadian clock gene TIMELESS and the Treacher Collins syndrome gene TCOF1. These experiments showed that DTECT was able to capture the signatures of the newly introduced variants in all of the above genes (FIGS. 3B and 11I-11J). Finally, to test whether DTECT is also able to identify genomic signatures generated by prime editing, we transiently transfected into HEK293T cells a prime editor and a pegRNA to introduce a 3-bp insertion (CTT_ins) in the HEK3 locus (Anzalone et al., 2019). As shown in FIG. 3B, DTECT specifically identified the newly created signature and quantified its frequency in the transfected cell population, indicating that DTECT is also suitable for identifying prime editing events. 
The specificity and accuracy of the above DTECT studies were confirmed by both positive and negative controls (e.g., CG and TT adaptors in the control unedited sample of FIG. 3B).
- To further confirm the accuracy of DTECT in quantifying precision genome editing, we compared the frequency of editing events determined by either DTECT or NGS across 62 samples derived from human cells, mouse cells and intestinal organoids, which were modified using CRISPR-mediated HDR or base editing (Zafra et al., 2018). As shown in
FIG. 3C (left panel) and 12A, the frequencies of editing events obtained by DTECT and NGS were comparable (mean frequency: DTECT, 35.43%; NGS, 33.47%; r = 0.9857, n = 62), indicating that the quantification of precision genome editing by DTECT is accurate. Similar to NGS, DTECT is also accurate in the detection of less abundant (< 20% frequency) variants (mean frequency: DTECT, 5.41%; NGS, 5.06%, r = 0.843, n = 33) (FIG. 3C , right panel). Together, these experiments demonstrate that DTECT precisely identifies and quantifies genetic variants introduced by precision genome editing in various biological systems. - Recent studies led to the development of Sanger sequencing-based methods, such as ICE (Synthego; https://ice.synthego.com/#/) or EditR (Kluesner et al., 2018), that enable the detection of genomic variants based on the deconvolution of chromatogram peaks. To compare DTECT with the above methods, we subjected to Sanger sequencing the genomic amplicons of 23 samples edited by precision genome editing. In these experiments, we used two primers annealing to opposite DNA strands to obtain independent sequencing duplicates of the same amplicons, and analyzed the Sanger sequencing reads using either ICE or EditR. Notably, ~10% of the sequencing reactions failed to generate high quality reads required for ICE or EditR, despite using high quality amplicons for sequencing (Mendeley dataset, Data availability section). Independent repeats using new genomic amplicons did not improve the sequencing outcome (Mendeley dataset, Data availability section). In addition, we noted that technical duplicates of Sanger sequencing reactions analyzed by ICE or EditR displayed lower levels of consistency relative to technical replicates of DTECT assays (
FIG. 12B). These studies indicate that DTECT displays greater robustness and reliability than Sanger-based detection methods, which rely heavily on the quality of Sanger sequencing reactions.
- The modeling and correction of pathogenic mutations in adult mice is critical for the development of novel approaches to therapeutic intervention against cancer and other diseases (Chadwick et al., 2017; Gao et al., 2018; Levy et al., 2020; Ryu et al., 2018; Song et al., 2020; Villiger et al., 2018; Yin et al., 2016; Yin et al., 2014). To test whether DTECT can measure editing levels in adult mouse tissue, we hydrodynamically delivered into the mouse liver (Tschaharganeh et al., 2014) a cytidine base editor and an sgRNA introducing the oncogenic Pik3ca E545K mutation (Zafra et al., 2018) (
FIG. 3D). We then used both DTECT and NGS to quantify the oncogenic Pik3ca signature in DNA samples derived from the edited livers of two mice. DTECT analysis identified base editing events in the mouse liver at a ~1-2% frequency, comparable to the editing rates obtained by NGS (FIG. 3E). This study revealed that DTECT can accurately quantify low abundance genetic variants introduced by precision genome editing in vivo.
- The above studies indicate that DTECT can determine the identity of individual genomic changes. To examine whether DTECT can also identify complex sets of mutations, we employed CRISPR-dependent base editing to target two adjacent cytosines in the EMX1 locus that had previously been converted into four distinct dinucleotide combinations (i.e., CC, CT, TC or TT) by base editing (Komor et al., 2016) (
FIG. 4A). As shown in FIG. 4A, DTECT readily distinguished each of the four combinations in an sgRNA-dependent manner, demonstrating that DTECT can identify a complex mixture of allelic variants. Furthermore, we also detected base editing byproducts (FIG. 13A), suggesting that DTECT could be used to optimize conditions that reduce the formation of these byproducts (Komor et al., 2017; Wang et al., 2017). Additionally, to determine whether DTECT can be employed to monitor genomic changes at multiple loci, we simultaneously introduced two clinically relevant point mutations into two distinct genes (i.e., BRCA1 and BRCA2) (FIG. 4B). As shown in FIG. 4C, DTECT correctly identified these genomic changes, indicating that it can readily detect complex genome editing events occurring within single or multiple genomic loci.
- Precision genome editing allows the modeling of clinically relevant gene variants. Given that DTECT enables the identification of newly created DNA signatures without requiring the insertion of markers or elaborate experimental design specific for each edited site, we tested whether DTECT could facilitate the generation of multiple cell lines harboring clinically relevant mutations. In particular, we focused our attention on mutations in the
BRCA1 and BRCA2 genes, which in heterozygosity can predispose women to the development of breast and/or ovarian cancer (Apostolou and Fostira, 2013), whereas in homozygosity they can cause Fanconi anemia (Ceccaldi et al., 2016). More than 7,000 clinically associated SNVs have been identified in BRCA1/2, according to the ClinVar database, but efforts to characterize their functional impact and pathogenic potential have been limited in part due to the challenge of generating cell lines that carry such a large number of individual homozygous and heterozygous variants. To determine whether DTECT can facilitate the production of cell lines harboring clinically relevant BRCA1/2 SNVs, we expressed a cytidine base editor in HEK293T cells along with individual sgRNAs to generate 23 different BRCA1/2 mutations identified in patients with ovarian and breast cancers, as reported in ClinVar (FIGS. 5A and 5D). We then used DTECT to determine by analytical PCR which variants were introduced in the transfected cell populations and to quantify the editing efficiency for each variant by qPCR (FIGS. 5B-5C, 5E-5F and 13B-13C). The accuracy of DTECT in the quantification of the editing events was confirmed by NGS (FIGS. 5B and 5E). The above approach proved effective for rapidly identifying cell populations with high levels of editing. Upon isolation of single clones from edited cell populations (e.g., BRCA1 E638K mutant cells), we tested whether DTECT could be used for clone genotyping. Importantly, DTECT allowed rapid genotyping of multiple clones (FIG. 14A) and accurately determined the genotype of each clone, including WT, homozygous and heterozygous mutant clones (FIGS. 5G-5H), thus expediting the production of marker-free isogenic heterozygous and homozygous mutant cells.
- Given the ability of DTECT to correctly determine the genotype of cellular clones, we then tested whether DTECT could also be applied to mouse genotyping. 
To this end, we obtained tail DNA samples from genetically engineered mice carrying knock-in mutations in Brca1 (S1598F) and its partner protein Bard1 (S563F) (Billing et al., 2018). As shown in
FIGS. 5I-5J and 14B , DTECT accurately determined the genotype of 24 Bard1 S563F mutant mice and 16 Brca1 S1598F mutant mice. These findings indicate that DTECT can be employed to rapidly determine the genotype of genetically engineered mice, thus facilitating the derivation, maintenance and analysis of marker-free animal models. - Precise and rapid detection of pathogenic variants in patients is critical for accurate diagnosis and personalized therapy. Given the ability of DTECT to identify genetic variants rapidly and accurately, we tested whether DTECT could be utilized to expedite the identification of pathogenic variants in pre-clinical and clinical settings. In particular, we examined whether DTECT could identify the presence of oncogenic variants in various biological systems. In our studies we focused our attention on the JAK2 V617F variant, which is present in the majority of patients with myeloproliferative neoplasm (MPN) (Levine et al., 2005). Mice transplanted with Jak2 V617F mutant bone marrow cells develop MPN and recapitulate the human disease (Mullally et al., 2010). Therefore, we analyzed the Jak2 V617F variant in the peripheral blood of mice transplanted with a mixture of bone marrow cells that do or do not carry an inducible Jak2 V617F variant (Bhagwat et al., 2014) (
FIG. 15A). As shown in FIGS. 15B-15C, DTECT readily distinguished wild-type from V617F mutant Jak2 in the examined mouse blood samples, as detected using any of the four distinct Acul-tagging primers specific for the targeted bases. These experiments show that DTECT can identify oncogenic signatures of interest in mouse tissues in a marker-free manner, thus enabling the tracking of genetic variants in mouse models without requiring complex selection markers.
- We next examined whether DTECT can identify the presence of specific oncogenic mutations in human samples from patients diagnosed with acute lymphoblastic leukemia (ALL), the most common form of childhood cancer (Inaba et al., 2013). Although most ALL patients respond to chemotherapy, ~20% suffer a relapse as a result of resistance to chemotherapy (Bhojwani and Pui, 2013). Moreover, secondary genetic alterations that promote chemoresistance, including mutations in the NT5C2 gene (Tzoneva et al., 2018; Tzoneva et al., 2013), are found in a large fraction of ALL relapse cases (Dieck and Ferrando, 2019; Oshima et al., 2016). To test whether DTECT can identify these relapse-specific oncogenic signatures, we obtained matched DNA samples from the bone marrow of ALL patients at diagnosis and relapse and analyzed them for the presence of three common NT5C2 mutations (R238W, K359Q and R367Q) (
FIGS. 6A-6B). Remarkably, DTECT unambiguously detected the presence of oncogenic NT5C2 variants in all five patient samples (patient #1, R238W; patients #2, #4 and #5, R367Q; patient #3, K359Q) and accurately quantified their frequency in a manner comparable to NGS (FIGS. 6B-6C and 15D). Moreover, DTECT also identified the presence of the above NT5C2 variants in the patient-derived xenograft (PDX) models generated from these relapsed ALL patients (FIGS. 6A and 6D). These studies demonstrate that DTECT can identify oncogenic mutations of interest in PDX models and cancer patient samples.
- In this study, we established DTECT as a sensitive method for the identification of genomic DNA signatures. In particular, we show that DTECT readily identifies precision genome editing events induced by CRISPR-dependent HDR, base editing and prime editing, including low abundance and complex genomic changes. In addition, we show that DTECT can be employed to identify pathogenic lesions of interest, such as oncogenic mutations, in cancer mouse models, PDXs, and cancer patient specimens. DTECT is a rapid (~4-5 hours) and easy-to-perform detection method that relies on standard molecular biology techniques (PCR, DNA digestion and ligation) and common laboratory reagents. This methodology is also not labor-intensive, given that it entails short periods (5-10 min) of sample processing followed by hands-free incubations. Importantly, DTECT assays utilize a single, common set of adaptors that includes positive and negative controls to ensure specificity and accuracy. The ease, speed and cost efficiency with which DTECT identifies genetic variants in a wide variety of cellular and animal systems (e.g., cell lines, organoids, animal models, patient samples) should facilitate the generation and study of biological models of human diseases and expedite the detection of pathogenic variants for both pre-clinical and clinical applications.
- Although highly robust, DTECT has three potential limitations. First, Acul-induced dinucleotide byproducts can be generated if a genomic Acul restriction site located in close proximity to the targeted dinucleotide is incorporated into the amplicon of the targeted locus. However, an analysis of the ClinVar database revealed that genomic Acul sites occur relatively infrequently and 95% of clinically relevant variants (404,393 variants) are compatible with DTECT (
FIGS. 16A-16B). Second, dinucleotide byproducts may also occur due to Acul slippage activity, resulting in the cleavage of DNA molecules 13 (-1) or 15 (+1), instead of 14, bases away from the Acul recognition site. Nonetheless, we found that DTECT is able to identify Acul slippage events, which occur mostly at position +1 relative to the standard Acul cleavage site (Lundin et al., 2015) (FIG. 17A). It is reasonable to anticipate that future optimization of Acul architecture and improvements in the Acul digestion protocol will limit its slippage activity. It is also important to note that Acul byproducts resulting from either genomic Acul motifs or Acul slippage activity are easily predictable based on the sequence of the nucleotides flanking the targeted dinucleotide, and they can be completely avoided by optimal design of the Acul-tagging primer and appropriate adaptor selection, as shown in FIGS. 16C and 17B. Third, indel mutations formed at DSB sites generated by Cas nucleases in CRISPR-mediated HDR experiments can result in defective PCR amplification of indel-containing loci that have not undergone HDR and therefore cause an overestimation of the frequency of HDR events by DTECT (FIGS. 18A and 18B). However, given that the mutagenic spectrum of indel mutations induced by any sgRNA is predictable (Allen et al., 2018; Leenay et al., 2019; Shen et al., 2018; van Overbeek et al., 2016) (inDelphi web portal; https://indelphi.giffordlab.mit.edu/), the negative impact of indel mutations on DTECT-based quantification of CRISPR-mediated HDR events can be avoided by introducing the desired genomic changes in indel-free regions adjacent to CRISPR-induced cut sites (FIGS. 18C and 18D). 
This limitation does not affect the detection of CRISPR-mediated base editing and prime editing events, and naturally occurring genetic variants, which are accompanied by either very low frequency (Anzalone et al., 2019; Gaudelli et al., 2017; Komor et al., 2017; Yeh et al., 2018) or complete absence of DSB-induced indel formation, respectively. - In addition to its ease of use, speed and cost efficiency, DTECT has several advantages compared to other detection methods. A major benefit of DTECT is its versatility, which allows the detection and quantification of nucleotide substitutions, precise base insertions and deletions using the same small set of 16 predefined adaptors (
FIGS. 1B and 7 ). Each editing event can be identified using 4 distinct signatures resulting from Acul-mediated digestion of genomic DNA amplicons, indicating that the design of DTECT studies is flexible (FIGS. 2E-2F and 15B-15C ). These features distinguish DTECT from strategies that employ allele-specific DNA oligonucleotides or probes to identify SNVs, which work with variable efficiency due to the competition between WT and mutant alleles and the number of variant DNA bases, thus requiring unique experimental design for the detection of each individual genetic variant. Given that both wild-type and mutant DNA signatures are captured from the same Acul-digested PCR amplicon and that a common set of PCR primers is utilized for both analytical and quantitative detection of all variants (FIG. 1A , step VI), DTECT exhibits limited technical variability across distinct experimental conditions. This aspect differentiates DTECT from Sanger sequencing-based detection methods, such as ICE and EditR, in which efficiency depends on the quality of the sequencing reads, which can vary greatly between sequencing platforms, samples and reactions (FIG. 12B ). In addition, DTECT displays greater sensitivity and flexibility compared to RFLP-based assays (FIGS. 11A-11J ) and exhibits similar precision to NGS (FIG. 3C ) at a lower cost and with a faster turnaround time (hours vs. days/weeks). Finally, DTECT directly identifies genetic variants independently of genomic markers, therefore enabling the analysis of scarless and marker-free cellular and animal models generated by precision genome editing. Given its ability to identify multiple independent genetic variants simultaneously (FIGS. 4A-4C ), DTECT could expedite the generation of complex genomic changes, especially for genetic interaction studies, synthetic biology applications and molecular recording (Fahim Farzadfard, 2018). 
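The discussion above notes that Acul byproducts are predictable from the bases flanking the targeted dinucleotide, and that a fixed set of 16 adaptors covers every signature. The sketch below illustrates that reasoning under explicit assumptions: it takes the Acul recognition motif to be CTGAAG (the motif visible in the Acul-tagging primers of Table S1), places the standard signature at the two bases that follow the 14 bases downstream of the motif on the tagged strand, and models slippage as a shift of that window by -1 or +1. The function names and the toy amplicon are hypothetical.

```python
from itertools import product

ACUI_SITE = "CTGAAG"   # motif present in the Acul-tagging primers (Table S1)
SPACER = 14            # bases between the motif and the signature (assumed)

# The 16 possible dinucleotide signatures, one per predefined adaptor.
ALL_SIGNATURES = ["".join(pair) for pair in product("ACGT", repeat=2)]


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def signatures(amplicon, site_index):
    """Dinucleotide exposed by a standard cut (0) and by -1/+1 slippage."""
    base = site_index + len(ACUI_SITE) + SPACER
    return {shift: amplicon[base + shift: base + shift + 2] for shift in (-1, 0, 1)}


def acui_sites(seq):
    """Positions of the motif on either strand, as (index, strand) pairs."""
    seq = seq.upper()
    hits = []
    for motif, strand in ((ACUI_SITE, "+"), (revcomp(ACUI_SITE), "-")):
        start = seq.find(motif)
        while start != -1:
            hits.append((start, strand))
            start = seq.find(motif, start + 1)
    return sorted(hits)


# Toy amplicon (hypothetical): 6-base handle, the motif, 14 spacer bases,
# then the bases that would be read out as the signature.
amplicon = "GGATCC" + ACUI_SITE + "ACGTACGTACGTAC" + "CATG"
predicted = signatures(amplicon, amplicon.find(ACUI_SITE))
extra_sites = [hit for hit in acui_sites(amplicon) if hit != (6, "+")]
```

For this toy amplicon the standard signature is CA, the -1 and +1 slippage products would be CC and AT, and `extra_sites` is empty because no second motif occurs on either strand. A design check of this kind could flag an Acul-tagging primer whose slippage product collides with the adaptor chosen for the other allele.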
- The ability to model clinically relevant mutations in a marker-free manner is critical for assessing their potential pathogenicity, especially in the case of genes, such as BRCA1 and BRCA2, which have thousands of clinically associated SNVs. Recent studies have led to the development of high-throughput saturation genome editing (SGE) to examine en masse the pathogenicity of BRCA1 variants (Findlay et al., 2018). Although highly useful for classifying BRCA1 SNVs, SGE requires the use of haploid cells and is therefore not compatible with the study of the functional impact of BRCA1 mutations in heterozygosity, as observed in BRCA1 mutation carriers (Apostolou and Fostira, 2013). BRCA1/2 heterozygous mutations have been recently shown to cause genome instability induced by DNA replication stress (Billing et al., 2018; Pathania et al., 2014; Tan et al., 2017). By facilitating the derivation of both heterozygous and homozygous BRCA1/2 mutant cells and animal models (
FIGS. 5A-5J), DTECT could help elucidate the underlying mechanisms by which genome instability causes breast and ovarian cancer development in BRCA1/2 mutation carriers. Our work demonstrated that DTECT can expedite the generation of a large variety of human genetic variants in various complex biological systems.
- In addition to facilitating precision genome editing, we showed that DTECT can also be used to detect pathogenic variants in pre-clinical and clinical settings. In particular, DTECT can rapidly identify the presence of oncogenic variants in cancer mouse models (
FIGS. 15A-15D ), thus facilitating the study of cancer pathogenesis and the development of novel cancer therapies. Furthermore, DTECT can also identify oncogenic mutations in samples from cancer patients and PDX mouse models (FIGS. 6A-6D ). The speed by which DTECT accurately and unambiguously identifies pathogenic variants could accelerate cancer diagnosis and expedite the testing of cancer therapies in PDX models, thus leading to more effective cancer treatments. We envision that future developments and implementations of the DTECT protocol may further simplify the detection of desired genomic signatures and increase the sensitivity of DTECT, thus expanding the number of possible DTECT applications and enabling early diagnosis of cancer and hereditary disorders through the detection of pathogenic variants in circulating cell-free tumor and fetal DNA (Zhang et al., 2019). - Collectively, our work established DTECT as a facile, rapid and cost-effective method for identifying genomic variants in various biological systems, such as mammalian cell lines, organoids, mouse tissues, PDX models and human patient samples. Given the growing number of genetic variants identified in the human population (Lek et al., 2016) and in human genetic disorders (McClellan and King, 2010), this versatile method for the detection of genomic signatures should facilitate the study of human genetic variation and expedite the diagnosis and treatment of human disease.
TABLE S1 Primers, ssODNs, adaptors and other oligos used in this disclosure. Detection primers Sequence (5′- -> 3′) Notes PB547 gatcctctagagtcgacctg (SEQ ID NO: 1) Oligos for detection (step VI) PB548 cgggtaccgagctcgaattc (SEQ ID NO: 2) Oligos for detection (step VI) PB1072 gcaattcctcacgagacccgtcctg (SEQ ID NO: 3) Oligos for detection (step VI) - Only these oligos were used for qPCR PB1073 cgggtaccgagctcgaattcttagaag (SEQ ID NO: 4) Oligos for detection (step VI) - Only these oligos were used for qPCR Acultagging primers Sequence (5′- -> 3′): Handle for detection-gDNA-Acul hairpin-gDNA Notes PB1021 gatcctctagagtcgacctgGGAGTCCCTGTCGCTAGTGGCTGAAGACGCGTCGTGGGAG (SEQ ID NO: 5) Acul for signature TT PB1022 gatcctctagagtcgacctgACAAACAGTGCCTGCAAGTCCTGAAGCGGTGTGGGGTCCA (SEQ ID NO: 6) Acul for signature CC PB1071 GCAATTCCTCACGAGACCCGTCCTGATTTCAGGGAAGAAGCTGAAGTGAATGAAAAACTT (SEQ ID NO: 7) Acul for PIK3R1-STOP PB1153 GCAATTCCTCACGAGACCCGTCCTGTGTAGTTTTACTTACCTGAAGTCTCGTCTCCACAG (SEQ ID NO: 8) Acul for JAK2 (HDR) PB1151 GCAATTCCTCACGAGACCCGTCCTGAGGACATCGATGTCACTGAAGCCTCCAATGACTAG (SEQ ID NO: 9) Acul for EMX1 (HDR) PB1019 gatcctctagagtcgacctgAAACGGCAGAAGCTGGAGGACTGAAGGGAAGGGCCTGAGT (SEQ ID NO: 10) Acul for EMX1 (Base editing) PB1080 GCAATTCCTCACGAGACCCGTCCTGGTTCAGTTTAACGACCTGAAGCAATTCTTCTGGGG (SEQ ID NO: 11) Acul for SPRTN-STOP PB1149 GCAATTCCTCACGAGACCCGTCCTGTGTGTTCACTAGCAACTGAAGCCTCAAACAGACAC (SEQ ID NO: 12) Acul for HBB (HDR) PB1211 GCAATTCCTCACGAGACCCGTCCTGGAGGAGGAGGCCCCTCTGAAGGCAGGGACACGAAG (SEQ ID NO: 13) Acul for TCOF1 (Base editing) oligo plate GAT CCT CTA GAG TCG ACC TGC CAA ATT ATA TAC CTT TTG GCT GAA GTT ATA TCA TTC TTA (SEQ ID NO: 14) BRCA1 C64Y Acul oligo plate GAT CCT CTA GAG TCG ACC TGT CTT CAC TGC TAG AAC AAC TCT GAA GAT CAA TTT GCA ATT (SEQ ID NO: 15) BRCA1 E638K Acul oligo plate GAT CCT CTA GAG TCG ACC TGA TAT TGC TTG AGC TGG CTT CCT GAA GTT TAA AAA CAT TTT (SEQ ID NO: 16) BRCA1 E1033K Acul oligo plate GAT CCT CTA GAG TCG ACC TGG GTT CAG CTT TCG TTT TGA ACT GAA 
GAG CAG ATT CTT TTT (SEQ ID NO: 17) BRCA1 E575K Acul oligo plate GAT CCT CTA GAG TCG ACC TGT CCT CTA GCA GAT TTT TCT TCT GAA GAC ATT TAG TTT TAA (SEQ ID NO: 18) BRCA1 V990I Acul oligo plate GAT CCT CTA GAG TCG ACC TGG GAA AGA ATG AGT CTA ATA TCT GAA GCA AGC CTG TAC AGA (SEQ ID NO: 19) BRCA1 T922I Acul oligo plate GAT CCT CTA GAG TCG ACC TGC ATC ATT ACC AAA TTA TAT ACT GAA GCC TTT TGG TTA TAT (SEQ ID NO: 20) BRCA1 D67N Acul oligo plate GAT CCT CTA GAG TCG ACC TGG AGG GAG GGA GCT TTA CCT TCT GAA GTC TGT CCT GGG ATT (SEQ ID NO: 21) BRCA1 E1754K Acul oligo plate GAT CCT CTA GAG TCG ACC TGG AAG AAA ATA ATC AAG AAG ACT GAA GGC AAA GCA TGG ATT (SEQ ID NO: 22) BRCA1 S1363L Acul oligo plate GAT CCT CTA GAG TCG ACC TGG CAG TGA TTT TAC ATC TAA ACT GAA GTG TCC ATT TTA GAT (SEQ ID NO: 23) BRCA1 Q1779* Acul oligo plate GAT CCT CTA GAG TCG ACC TGG ATG GAG AAG ACA TCA TCT GCT GAA GGA TTA TAC ATA TTT (SEQ ID NO: 24) BRCA2 R2842C Acul oligo plate GAT CCT CTA GAG TCG ACC TGT GAA TCT TTT TCT TTT TTT GCT GAA GAA TAG CTT ACA ATA (SEQ ID NO: 25) BRCA2 R2973H Acul oligo plate GAT CCT CTA GAG TCG ACC TGC TGA GTA TTT GGC GTC CAT CCT GAA GAT CAG ATT TAT ATT (SEQ ID NO: 26) BRCA2 S2998F Acul oligo plate GAT CCT CTA GAG TCG ACC TGC AAA TTT TTA GAT CCA GAC TCT GAA GTC AGC CAT CTT GTT (SEQ ID NO: 27) BRCA2 S3070F Acul oligo plate GAT CCT CTA GAG TCG ACC TGA GTG CAA ATT AAT TTA CCT TCT GAA GTA ACA TAA GAG ATT (SEQ ID NO: 28) BRCA2 E2772K Acul oligo plate GAT CCT CTA GAG TCG ACC TGG GAA TAT TTG ATG GTC AAC CCT GAA GAG AAA GAA TAA ATA (SEQ ID NO: 29) BRCA2 T1707I Acul oligo plate GAT CCT CTA GAG TCG ACC TGA TCT TGT TCT GAG GTG GAC CCT GAA GTA ATA GGA TTT GTC (SEQ ID NO: 30) BRCA2 V3079I Acul oligo plate GAT CCT CTA GAG TCG ACC TGT AGG AAG GCC ATG GAA TCT GCT GAA GCT GAA CAA AAG GAA (SEQ ID NO: 31) BRCA2 Q2960* Acul oligo plate GAT CCT CTA GAG TCG ACC TGA ACT GAA GCC TCT GAA AGT GCT GAA GAC TGG AAA TAC ATA (SEQ ID NO: 32) BRCA2 T544I Acul oligo plate GAT CCT CTA GAG TCG ACC TGT TTA CCA TCA CGT GCA 
CTA ACT GAA GCA AGA CAG CAA GTT (SEQ ID NO: 33) BRCA2 R2896C Acul oligo plate GAT CCT CTA GAG TCG ACC TGT GGA AGC TGG CCA GCC ACC ACT GAA GCC ACA CAG AAT TCT (SEQ ID NO: 34) BRCA2 V572I Acul oligo plate GAT CCT CTA GAG TCG ACC TGT TGC CTC TAG AAA TCA TGA CCT GAA GTA GGT TTG ACA GAA (SEQ ID NO: 35) BRCA2 V778I Acul oligo plate GAT CCT CTA GAG TCG ACC TGT TTC TCT TAT CAA CAC GAG GCT GAA GAA GTA TTT TTG ATA (SEQ ID NO: 36) BRCA2 V2102I Acul AA1 GAT CCT CTA GAG TCG ACC TGC AAA CGA CGA GCG TGA CAC CCT GAA GAC GAT GCC TGT AGC (SEQ ID NO: 37) For adaptor library testing AA2 GAT CCT CTA GAG TCG ACC TGT CGT TGG GAA CCG GAG CTG ACT GAA GAT GAA GCC ATA CCA (SEQ ID NO: 38) For adaptor library testing AC1 GAT CCT CTA GAG TCG ACC TGG AGC TGA ATG AAG CCA TAC CCT GAA GAA ACG ACG AGC GTG (SEQ ID NO: 39) For adaptor library testing AC2 GAT CCT CTA GAG TCG ACC TGG CTG AAT GAA GCC ATA CCA ACT GAA GAC GAC GAG CGT GAC (SEQ ID NO: 40) For adaptor library testing AG1 GAT CCT CTA GAG TCG ACC TGG AAC CGG AGC TGA ATG AAG CCT GAA GCA TAC CAA ACG ACG (SEQ ID NO: 41) For adaptor library testing AG2 GAT CCT CTA GAG TCG ACC TGT ACC AAA CGA CGA GCG TGA CCT GAA GAC CAC GAT GCC TGT (SEQ ID NO: 42) For adaptor library testing AT1 GAT CCT CTA GAG TCG ACC TGT GAA GCC ATA CCA AAC GAC GCT GAA GAG CGT GAC ACC ACG (SEQ ID NO: 43) For adaptor library testing AT2 GAT CCT CTA GAG TCG ACC TGA AAC GAC GAG CGT GAC ACC ACT GAA GCG ATG CCT GTA GCA (SEQ ID NO: 44) For adaptor library testing CA1 GAT CCT CTA GAG TCG ACC TGG ATC GTT GGG AAC CGG AGC TCT GAA GGA ATG AAG CCA TAC (SEQ ID NO: 45) For adaptor library testing CA2 GAT CCT CTA GAG TCG ACC TGA GCT GAA TGA AGC CAT ACC ACT GAA GAA CGA CGA GCG TGA (SEQ ID NO: 46) For adaptor library testing CC1 GAT CCT CTA GAG TCG ACC TGC TGA ATG AAG CCA TAC CAA ACT GAA GCG ACG AGC GTG ACA (SEQ ID NO: 47) For adaptor library testing CC2 GAT CCT CTA GAG TCG ACC TGA GCC ATA CCA AAC GAC GAG CCT GAA GGT GAC ACC ACG ATG (SEQ ID NO: 48) For adaptor library testing CG1 GAT CCT CTA GAG 
TCG ACC TGA CCG GAG CTG AAT GAA GCC ACT GAA GTA CCA AAC GAC GAG (SEQ ID NO: 49) For adaptor library testing CG2 GAT CCT CTA GAG TCG ACC TGA ATG AAG CCA TAC CAA ACG ACT GAA GCG AGC GTG ACA CCA (SEQ ID NO: 50) For adaptor library testing CT1 GAT CCT CTA GAG TCG ACC TGG CCA TAC CAA ACG ACG AGC GCT GAA GTG ACA CCA CGA TGC (SEQ ID NO: 51) For adaptor library testing CT2 GAT CCT CTA GAG TCG ACC TGT CAT GTA ACT CGC CTT GAT CCT GAA GGT TGG GAA CCG GAG (SEQ ID NO: 52) For adaptor library testing GA1 GAT CCT CTA GAG TCG ACC TGG GAG CTG AAT GAA GCC ATA CCT GAA GCA AAC GAC GAG CGT (SEQ ID NO: 53) For adaptor library testing GA2 GAT CCT CTA GAG TCG ACC TGG GAA CCG GAG CTG AAT GAA GCT GAA GCC ATA CCA AAC GAC (SEQ ID NO: 54) For adaptor library testing GC1 GAT CCT CTA GAG TCG ACC TGA ACC GGA GCT GAA TGA AGC CCT GAA GAT ACC AAA CGA CGA (SEQ ID NO: 55) For adaptor library testing GC2 GAT CCT CTA GAG TCG ACC TGA AGC CAT ACC AAA CGA CGA GCT GAA GCG TGA CAC CAC GAT (SEQ ID NO: 56) For adaptor library testing GG1 GAT CCT CTA GAG TCG ACC TGA CGA CGA GCG TGA CAC CAC GCT GAA GAT GCC TGT AGC AAT (SEQ ID NO: 57) For adaptor library testing GG2 GAT CCT CTA GAG TCG ACC TGA GCA ATG GCA ACA ACG TTG CCT GAA GGC AAA CTA TTA ACT (SEQ ID NO: 58) For adaptor library testing GT1 GAT CCT CTA GAG TCG ACC TGC CGG AGC TGA ATG AAG CCA TCT GAA GAC CAA ACG ACG AGC (SEQ ID NO: 59) For adaptor library testing GT2 GAT CCT CTA GAG TCG ACC TGC ATA CCA AAC GAC GAG CGT GCT GAA GAC ACC ACG ATG CCT (SEQ ID NO: 60) For adaptor library testing TA1 GAT CCT CTA GAG TCG ACC TGC TTG ATC GTT GGG AAC CGG ACT GAA GGC TGA ATG AAG CCA (SEQ ID NO: 61) For adaptor library testing TA2 GAT CCT CTA GAG TCG ACC TGA TAC CAA ACG ACG AGC GTG ACT GAA GCA CCA CGA TGC CTG (SEQ ID NO: 62) For adaptor library testing TC1 (PB1040) GAT CCT CTA GAG TCG ACC TGc cgc ttt ttt gca caa cat gCT GAA Ggg gga tca tgt aac (SEQ ID NO: 63) For adaptor library testing TC2 GAT CCT CTA GAG TCG ACC TGC GTT GCG CAA ACT ATT AAC TCT GAA GGG CGA ACT ACT TAC (SEQ 
ID NO: 64) For adaptor library testing
TG1 GAT CCT CTA GAG TCG ACC TGC GGA GCT GAA TGA AGC CAT ACT GAA GCC AAA CGA CGA GCG (SEQ ID NO: 65) For adaptor library testing
TG2 (PB1070) gat cct cta gag tcg acc tgc cat acc aaa cga cga gcg tCT GAA Gga cac cac gat gcc (SEQ ID NO: 66) For adaptor library testing
TT1 GAT CCT CTA GAG TCG ACC TGT GAC ACC ACG ATG CCT GTA GCT GAA GCA ATG GCA ACA ACG (SEQ ID NO: 67) For adaptor library testing
TT2 GAT CCT CTA GAG TCG ACC TGG CCT GTA GCA ATG GCA ACA ACT GAA GCG TTG CGC AAA CTA (SEQ ID NO: 68) For adaptor library testing
PB1477 GCAATTCCTCACGAGACCCGTCCTGACCTGAGTTCTTTCCCTGAAGCCACATCAGCGTGC (SEQ ID NO: 69) FANCD2 AcuI
PB1257 GATCCTCTAGAGTCGACCTGCCGCAGAGCTGAGAAGTTATCTGAAGTGGCAGAACAGCAT (SEQ ID NO: 70) SMARCAL1 AcuI
PB1264 gatcctctagagtcgacctgGTTTTCATTTCAGGGAAGAACTGAAGGTGAATGAAAAACT (SEQ ID NO: 71) PIK3R1 signatures
PB1265 gatcctctagagtcgacctgTCTCGTACCAAAAAGGTCCCCTGAAGGTCTGCTGTATCTC (SEQ ID NO: 72) PIK3R1 signatures
PB1266 gatcctctagagtcgacctgATCTCGTACCAAAAAGGTCCCTGAAGCGTCTGCTGTATCT (SEQ ID NO: 73) PIK3R1 signatures
PB1010 gatcctctagagtcgacctgTTTTCATTTCAGGGAAGAAGCTGAAGTGAATGAAAAACTT (SEQ ID NO: 74) PIK3R1 signatures
PB1433 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacAA (SEQ ID NO: 75) AA-Oligo to test dinucleotide capture efficiency (DTECT)
PB1434 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacAC (SEQ ID NO: 76) AC-Oligo to test dinucleotide capture efficiency (DTECT)
PB1435 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacAG (SEQ ID NO: 77) AG-Oligo to test dinucleotide capture efficiency (DTECT)
PB1436 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacAT (SEQ ID NO: 78) AT-Oligo to test dinucleotide capture efficiency (DTECT)
PB1437 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacCA (SEQ ID NO: 79) CA-Oligo to test dinucleotide capture efficiency (DTECT)
PB1438 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacCC (SEQ ID NO: 80) CC-Oligo to test dinucleotide capture efficiency (DTECT)
PB1439 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacCG (SEQ ID NO: 81) CG-Oligo to test dinucleotide capture efficiency (DTECT)
PB1440 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacCT (SEQ ID NO: 82) CT-Oligo to test dinucleotide capture efficiency (DTECT)
PB1441 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacGA (SEQ ID NO: 83) GA-Oligo to test dinucleotide capture efficiency (DTECT)
PB1442 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacGC (SEQ ID NO: 84) GC-Oligo to test dinucleotide capture efficiency (DTECT)
PB1443 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacGG (SEQ ID NO: 85) GG-Oligo to test dinucleotide capture efficiency (DTECT)
PB1444 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacGT (SEQ ID NO: 86) GT-Oligo to test dinucleotide capture efficiency (DTECT)
PB1445 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacTA (SEQ ID NO: 87) TA-Oligo to test dinucleotide capture efficiency (DTECT)
PB1446 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacTC (SEQ ID NO: 88) TC-Oligo to test dinucleotide capture efficiency (DTECT)
PB1447 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacTG (SEQ ID NO: 89) TG-Oligo to test dinucleotide capture efficiency (DTECT)
PB1448 GCAATTCCTCACGAGACCCGTCCTGtgcgcaaactattCTGAAGaactggcgaactacTT (SEQ ID NO: 90) TT-Oligo to test dinucleotide capture efficiency (DTECT)
PB1449 gtagttcgccagttCTTCAGaatagtttgcgcaCAGGACGGGTCTCGTGAGGAATTGC (SEQ ID NO: 91) Complementary 5′-phosphorylated oligo
PB1321 GCAATTCCTCACGAGACCCGTCCTGGTGGCTCCATAGGAACTGAAGGTCTTTCTCTTGTT (SEQ ID NO: 92) mouse Pik3ca (545) AcuI
PB1380 GCAATTCCTCACGAGACCCGTCCTGTTATATACCTTTTGGCTGAAGTTATATCATTCTTA (SEQ ID NO: 93) BRCA1 Cys64Tyr AcuI
PB1381 GCAATTCCTCACGAGACCCGTCCTGACTGCTAGAACAACTCTGAAGATCAATTTGCAATT (SEQ ID NO: 94) BRCA1 Glu638Lys AcuI
PB1382 GCAATTCCTCACGAGACCCGTCCTGGCTTGAGCTGGCTTCCTGAAGTTTAAAAACATTTT (SEQ ID NO: 95) BRCA1 Glu1033Lys AcuI
PB1383 GCAATTCCTCACGAGACCCGTCCTGAGCTTTCGTTTTGAACTGAAGAGCAGATTCTTTTT (SEQ ID NO: 96) BRCA1 Glu575Lys AcuI
PB1386 GCAATTCCTCACGAGACCCGTCCTGTAGCAGATTTTTCTTCTGAAGACATTTAGTTTTAA (SEQ ID NO: 97) BRCA1 Val990Ile AcuI
PB1388 GCAATTCCTCACGAGACCCGTCCTGGAATGAGTCTAATATCTGAAGCAAGCCTGTACAGA (SEQ ID NO: 98) BRCA1 Thr922Ile AcuI
PB1389 GCAATTCCTCACGAGACCCGTCCTGTTACCAAATTATATACTGAAGCCTTTTGGTTATAT (SEQ ID NO: 99) BRCA1 Asp67Asn AcuI
PB1390 GCAATTCCTCACGAGACCCGTCCTGAGGGAGCTTTACCTTCTGAAGTCTGTCCTGGGATT (SEQ ID NO: 100) BRCA1 Glu1754Lys AcuI
PB1393 GCAATTCCTCACGAGACCCGTCCTGAAATAATCAAGAAGACTGAAGGCAAAGCATGGATT (SEQ ID NO: 101) BRCA1 Ser1363Leu AcuI
PB1394 GCAATTCCTCACGAGACCCGTCCTGGATTTTACATCTAAACTGAAGTGTCCATTTTAGAT (SEQ ID NO: 102) BRCA1 Gln1779Ter AcuI
PB1396 GCAATTCCTCACGAGACCCGTCCTGAGAAGACATCATCTGCTGAAGGATTATACATATTT (SEQ ID NO: 103) BRCA2 Arg2842Cys AcuI
PB1397 GCAATTCCTCACGAGACCCGTCCTGCTTTTTCTTTTTTTGCTGAAGAATAGCTTACAATA (SEQ ID NO: 104) BRCA2 Arg2973His AcuI
PB1398 GCAATTCCTCACGAGACCCGTCCTGTATTTGGCGTCCATCCTGAAGATCAGATTTATATT (SEQ ID NO: 105) BRCA2 Ser2998Phe AcuI
PB1399 GCAATTCCTCACGAGACCCGTCCTGTTTTAGATCCAGACTCTGAAGTCAGCCATCTTGTT (SEQ ID NO: 106) BRCA2 Ser3070Phe AcuI
PB1400 GCAATTCCTCACGAGACCCGTCCTGAAATTAATTTACCTTCTGAAGTAACATAAGAGATT (SEQ ID NO: 107) BRCA2 Glu2772Lys AcuI
PB1401 GCAATTCCTCACGAGACCCGTCCTGATTTGATGGTCAACCCTGAAGAGAAAGAATAAATA (SEQ ID NO: 108) BRCA2 Thr1707Ile AcuI
PB1402 GCAATTCCTCACGAGACCCGTCCTGGTTCTGAGGTGGACCCTGAAGTAATAGGATTTGTC (SEQ ID NO: 109) BRCA2 Val3079Ile AcuI
PB1403 GCAATTCCTCACGAGACCCGTCCTGAGGCCATGGAATCTGCTGAAGCTGAACAAAAGGAA (SEQ ID NO: 110) BRCA2 Gln2960Ter AcuI
PB1405 GCAATTCCTCACGAGACCCGTCCTGAAGCCTCTGAAAGTGCTGAAGACTGGAAATACATA (SEQ ID NO: 111) BRCA2 Thr544Ile AcuI
PB1406 GCAATTCCTCACGAGACCCGTCCTGCTTATCAACACGAGGCTGAAGAAGTATTTTTGATA (SEQ ID NO: 112) BRCA2 Val2102Ile AcuI
PB1407 GCAATTCCTCACGAGACCCGTCCTGCATCACGTGCACTAACTGAAGCAAGACAGCAAGTT (SEQ ID NO: 113) BRCA2 Arg2896Cys AcuI
PB1408 GCAATTCCTCACGAGACCCGTCCTGGCTGGCCAGCCACCACTGAAGCCACACAGAATTCT (SEQ ID NO: 114) BRCA2 Val572Ile AcuI
PB1409 GCAATTCCTCACGAGACCCGTCCTGTCTAGAAATCATGACCTGAAGTAGGTTTGACAGAA (SEQ ID NO: 115) BRCA2 Val778Ile AcuI
PB1509 GCAATTCCTCACGAGACCCGTCCTGGCATTTTCTGCTGCTCTGAAGGTGAAGAAAGCCCA (SEQ ID NO: 116) Bard1 S563F AcuI
PB1513 GCAATTCCTCACGAGACCCGTCCTGgagcggatagagacaCTGAAGtatccatggtggtg (SEQ ID NO: 117) Brca1 S1598F AcuI
PB1483 GCAATTCCTCACGAGACCCGTCCTGTGTGCGAGTTCAGGACTGAAGATCACCAAAAAAGT (SEQ ID NO: 118) NT5C2 R367Q AcuI
PB1486 GCAATTCCTCACGAGACCCGTCCTGTTGGAGATCACATTTCTGAAGTTGGGGACATTTTA (SEQ ID NO: 119) NT5C2 K359Q AcuI
PB1493 GCAATTCCTCACGAGACCCGTCCTGTTTCAGGGAAAACTGCTGAAGCCTTTGCTTCTGAG (SEQ ID NO: 120) NT5C2 R238W AcuI
PB1296 GCAATTCCTCACGAGACCCGTCCTGTGATACTGAAATTGACTGAAGTAGAAGCAGAAGAT (SEQ ID NO: 121) BRCA2 dupAGAAGAT AcuI
PB1473 GCAATTCCTCACGAGACCCGTCCTGGCCAGCGAGAGATGGCTGAAGCAGAAAAGAAGACT (SEQ ID NO: 122) TIMELESS AcuI
PB1476 GCAATTCCTCACGAGACCCGTCCTGGGGCAGCGGGTGCCGCTGAAGGCGAGGACGCTGAC (SEQ ID NO: 123) SLX4 AcuI
PB1472 GCAATTCCTCACGAGACCCGTCCTGACGTTTACGGCCAGTCTGAAGTCTACCCATTCGTT (SEQ ID NO: 124) FANCM AcuI
PB1427 GCAATTCCTCACGAGACCCGTCCTGGAAGCTCGGAAAAGCCTGAAGGATCCAGGTGCTGC (SEQ ID NO: 125) FANCF AcuI
PB1430 GCAATTCCTCACGAGACCCGTCCTGATGTAGAATTAAGAACTGAAGTCATGCCTCCAGTT (SEQ ID NO: 126) AcuI Apc.1529
PB1431 GCAATTCCTCACGAGACCCGTCCTGCCCGGGGCATTTCATCTGAAGCCCAGGAGCTAGGT (SEQ ID NO: 127) AcuI Apc.492
PB1318 GCAATTCCTCACGAGACCCGTCCTGTTGAGAGTCGCTCCACTGAAGTTGCCAGCTCTGTT (SEQ ID NO: 128) AcuI Apc.1405
PB1332 GCAATTCCTCACGAGACCCGTCCTGAGCATTTGGTTTTGACTGAAGATTATGGTGTCTGT (SEQ ID NO: 129) AcuI Jak2 #1
PB1333 GCAATTCCTCACGAGACCCGTCCTGCTGGCTTTACTTACTCTGAAGCTCCTCTCCACAGA (SEQ ID NO: 130) AcuI Jak2 #2
PB1460 GCAATTCCTCACGAGACCCGTCCTGAAGCATTTGGTTTTGCTGAAGAATTATGGTGTCTG (SEQ ID NO: 131) AcuI Jak2 #3
PB1461 GCAATTCCTCACGAGACCCGTCCTGGCTGGCTTTACTTACCTGAAGTCTCCTCTCCACAG (SEQ ID NO: 132) AcuI Jak2 #4
PB1545 GCAATTCCTCACGAGACCCGTCCTGGAAGCAGGGCTTCCTCTGAAGTTCCTCTGCCATCA (SEQ ID NO: 133) AcuI HEK3
PB1301 GCAATTCCTCACGAGACCCGTCCTGGAAATTTGCGTGTGGCTGAAGAGTATTTGGATGAC (SEQ ID NO: 134) AcuI TP53 R209fs delGA
PB1535 GCAATTCCTCACGAGACCCGTCCTGAACCAGACCTCAGGCCTGAAGGGCTCATAGGGCAC (SEQ ID NO: 135) AcuI TP53 delAG (PAM)
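The genomic tagging primers listed above that begin with GCAATTCC appear to share one layout: a constant 25-nt 5′ handle, a 15-nt locus-specific stretch, the AcuI recognition sequence CTGAAG, and a final 14-nt locus-specific stretch. The Python sketch below splits a primer along those boundaries. The segment lengths are read off the listed sequences rather than stated in this section, and `split_tagging_primer` is a hypothetical helper name. The dinucleotide test oligos (PB1433 to PB1448) and the primers with other 5′ handles do not follow this exact layout and are rejected by the check.

```python
# Hypothetical helper: the layout below is inferred from the listed primers,
# not taken from a stated design rule in this section.
HANDLE = "GCAATTCCTCACGAGACCCGTCCTG"  # constant 5' handle seen in the listing
ACUI_SITE = "CTGAAG"                  # AcuI recognition sequence
DOWNSTREAM_LEN = 14                   # nucleotides observed 3' of the site


def split_tagging_primer(primer: str) -> dict:
    """Split a tagging primer into handle, upstream, site and downstream parts."""
    seq = primer.replace(" ", "").upper()
    if not seq.startswith(HANDLE):
        raise ValueError("primer does not start with the expected handle")
    body = seq[len(HANDLE):]
    site_at = len(body) - DOWNSTREAM_LEN - len(ACUI_SITE)
    if site_at < 0 or body[site_at:site_at + len(ACUI_SITE)] != ACUI_SITE:
        raise ValueError("CTGAAG not found 14 nt from the 3' end")
    return {
        "handle": HANDLE,
        "upstream": body[:site_at],
        "site": ACUI_SITE,
        "downstream": body[site_at + len(ACUI_SITE):],
    }


# PB1380 (SEQ ID NO: 93), copied from the listing.
parts = split_tagging_primer(
    "GCAATTCCTCACGAGACCCGTCCTGTTATATACCTTTTGGCTGAAGTTATATCATTCTTA"
)
print(parts["upstream"], parts["site"], parts["downstream"])
```

Checking the site position from the 3′ end, rather than searching for the first CTGAAG, avoids a false match when the locus-specific sequence happens to contain a second copy of the site.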
Standard PCR primers
Columns: primer, sequence (5′ -> 3′), notes
Ampicillin reverse CCA ATG CTT AAT CAG TGA GG (SEQ ID NO: 136) For adaptor library testing
AcuI-tagging oligo reverse AAT CGC TTG ATC ACA GAT GTA TGT A (SEQ ID NO: 137) PCR BRCA1 C64Y and BRCA1 D67N
AcuI-tagging oligo reverse GAA GAC AAA ATA TTT GGG AAA ACC T (SEQ ID NO: 138) PCR BRCA1 E638K and BRCA1 E575K
AcuI-tagging oligo reverse TCT CGT TAC TGG AAG TTA GCA CTC T (SEQ ID NO: 139) PCR BRCA1 E1033K and BRCA1 V990I
AcuI-tagging oligo reverse ATT TCA CCA TCA TCT AAC AGG TCA T (SEQ ID NO: 140) PCR BRCA1 T922I
AcuI-tagging oligo reverse CAC CTC CTG CAT TCA AAA GAT TC (SEQ ID NO: 141) PCR BRCA1 E1754K
AcuI-tagging oligo reverse GCT GCT TCA CCT TAA ATA ACA AAA A (SEQ ID NO: 142) PCR BRCA1 S1363L
AcuI-tagging oligo reverse AGG GAC ATA TGG GAA AAA GAG TTA G (SEQ ID NO: 143) PCR BRCA1 Q1779*
AcuI-tagging oligo reverse TTA GAC CTG ATA TTT CTG TCC CTT G (SEQ ID NO: 144) PCR BRCA2 R2842C
AcuI-tagging oligo reverse ACC TCT ACT ACC TAT GTG GCT TGT G (SEQ ID NO: 145) PCR BRCA2 R2973H
AcuI-tagging oligo reverse GGT TTG TAC CGG TAG TTG TTG ATA C (SEQ ID NO: 146) PCR BRCA2 S2998F and BRCA2 Q2960*
AcuI-tagging oligo reverse AAA TAG CCC TGT ACA ATG AAA AGT AGA (SEQ ID NO: 147) PCR BRCA2 S3070F and BRCA2 V3079I
AcuI-tagging oligo reverse TCA TAT ACG GCA GTA TGG TTA AGG T (SEQ ID NO: 148) PCR BRCA2 E2772K
AcuI-tagging oligo reverse GTG GCC CTA CCT CAA AAT TAT TAC T (SEQ ID NO: 149) PCR BRCA2 T1707I
AcuI-tagging oligo reverse TAT CTA CCA TGT TTG AGT GAC CTG A (SEQ ID NO: 150) PCR BRCA2 T544I and BRCA2 V572I
AcuI-tagging oligo reverse CTT CAT AAG TCA GTC TCA TCT GCA A (SEQ ID NO: 151) PCR BRCA2 V2102I
AcuI-tagging oligo reverse GTA CAG GAG GGA CAA AAA TAA AAC A (SEQ ID NO: 152) PCR BRCA2 R2896C
AcuI-tagging oligo reverse CCT TAA CTA GCT CTT TTG GGA CAA T (SEQ ID NO: 153) PCR BRCA2 V778I
PB1150 GAAAATAGACCAATAGGCAGAGAGAGTC (SEQ ID NO: 154) HBB PCR rev
PB1152 TGTCATTAAGAGAGAGACTTTTATTATTCC (SEQ ID NO: 155) EMX1 PCR rev
PB1154 ATCCATCTACCTCAGTTTCCTATATCTATC (SEQ ID NO: 156) JAK2 PCR rev
PB783 CCCTTTCCTGTAAAAACAATATAAAAA (SEQ ID NO: 157) PIK3R1 PCR rev
PB764 TTCTGGAAAATGGATCTAAAGCTAATA (SEQ ID NO: 158) TCOF1 PCR RFLP for
PB765 TCACAATTCGTAGTCCTACTTCTACCT (SEQ ID NO: 159) TCOF1 PCR RFLP rev
TP226 ACGTTGATGGCAGTTGCAGGTC (SEQ ID NO: 160) JAK2 (HDR) for
TP227 CTGACAGAGTTGCTAGACACTGGGTTG (SEQ ID NO: 161) JAK2 (HDR) rev
PB969 AACGATCTTCAATATGCTTACCAAG (SEQ ID NO: 162) HBB PCR RFLP for
PB970 CTTAACCATAGAAAAGAAGGGGAAA (SEQ ID NO: 163) HBB PCR RFLP rev
PB327 GCCATCCCCTTCTGTGAATGTTAGAC (SEQ ID NO: 164) EMX1 PCR for
PB328 GGAGATTGGAGACACGGAGAGCAG (SEQ ID NO: 165) EMX1 PCR rev
PB1302 AACTGTGCAATAGTTAAACCCATTTAC (SEQ ID NO: 166) PCR TP53 (HDR)
PB862 GTAGGTGTTCGGTAAATGTTAATGG (SEQ ID NO: 167) PCR FANCD2
PB863 AAGTCAAATCCCATACCCTACTCAT (SEQ ID NO: 168) PCR FANCD2
PB1334 TACTTGCTTTCAGTGTTGTGTTATAGG (SEQ ID NO: 169) PCR Jak2 (mouse)
PB1335 ATTTGTTTACTGTAATCCTCATCCATC (SEQ ID NO: 170) PCR Jak2 (mouse)
PB1319 GGAAAAGTTTATAGGTGTCCCTTCTAC (SEQ ID NO: 171) PCR Apc.1405
PB1320 AGCAGGTGTACTTCTGTCAGCTC (SEQ ID NO: 172) PCR Apc.1405
PB1432 AATATTCTGCAGACTGATATTCTGGTT (SEQ ID NO: 173) PCR Apc.492
PB1428 CGTTACTTAATTTTGAAAAACCTCAAC (SEQ ID NO: 174) PCR FANCF
PB1429 AGATTTGGGTTCTCTCTATAGCCATT (SEQ ID NO: 175) PCR FANCF
PB745 GACTCCAGTCAAAAATTCTCCTAGTTA (SEQ ID NO: 176) PCR FANCM
PB858 ATGTCTGCAGCTATAGTTAGGAAGC (SEQ ID NO: 177) PCR SLX4
PB859 ATCTCTCCCTGAGTTGATGAGAAG (SEQ ID NO: 178) PCR SLX4
PB764 TTCTGGAAAATGGATCTAAAGCTAATA (SEQ ID NO: 179) PCR TCOF1
PB765 TCACAATTCGTAGTCCTACTTCTACCT (SEQ ID NO: 180) PCR TCOF1
PB746 CTGTTTGTCCTAAACAAGATGTGAAT (SEQ ID NO: 181) PCR TIMELESS
PB747 CATTGGAGCAAGTTAAAACTACAAAAT (SEQ ID NO: 182) PCR TIMELESS
PB1297 CCTTAACCTCTTGATGTATGAGAAGAA (SEQ ID NO: 183) PCR BRCA2 dupAGAAGAT
PB1298 AGTACATCTAAGAAATTGAGCATCCTT (SEQ ID NO: 184) PCR BRCA2 dupAGAAGAT
PB590 GTGTGTGTGCAATTATAAAAGAAACTT (SEQ ID NO: 185) PCR SMARCAL1
PB591 GTCAGCATTAGATGAGCTACTGAGATT (SEQ ID NO: 186) PCR SMARCAL1
PB1322 CTGTTCTACTTGTTGGTGGTGATAATA (SEQ ID NO: 187) PCR mouse Pik3ca (545)
PB1323 ATGGTAAGAAATATGGTTAACACCAAG (SEQ ID NO: 188) PCR mouse Pik3ca (545)
PB1510 CTATTTTAGGTTACTGGGAACAGAATG (SEQ ID NO: 189) Oligos for Bard1 S563F genotyping
PB1511 AAACTACATAACTACAACCCAATGCTT (SEQ ID NO: 190) Oligos for Bard1 S563F genotyping
PB1514 GAACCCCATACCTGGGATCT (SEQ ID NO: 191) Oligos for Brca1 S1598F genotyping
PB1515 tcatacctcacaaggtgccta (SEQ ID NO: 192) Oligos for Brca1 S1598F genotyping
PB1548 TTATCAGTTTTGGAGGATGTACATAAA (SEQ ID NO: 193) PCR HEK3 rev
PB780 CTCCTTCCTCTTCCTACAGTACTCC (SEQ ID NO: 194) TP53 gDNA for (PAM)
Illumina primers (NGS)
Columns: primer, sequence (5′ -> 3′), notes
Primers for amplifying AcuI-tagged amplicons:
SAM175 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTTCCTCACGAGACCCGTCCTG (SEQ ID NO: 195) Adaptor constant forward - Forward primer used with all amplicons - binds AcuI-tagging primer sequence
SAM176 AGACGTGTGCTCTTCCGATCTCTTGATCACAGATGTATGTA (SEQ ID NO: 196) NGS BRCA1 C64Y AcuI
SAM177 AGACGTGTGCTCTTCCGATCTCAAAATATTTGGGAAAACCT (SEQ ID NO: 197) NGS BRCA1 E638K AcuI
SAM178 AGACGTGTGCTCTTCCGATCTTTACTGGAAGTTAGCACTCT (SEQ ID NO: 198) NGS BRCA1 E1033K AcuI
SAM179 AGACGTGTGCTCTTCCGATCTCAAAATATTTGGGAAAACCT (SEQ ID NO: 199) NGS BRCA1 E575K AcuI
SAM182 AGACGTGTGCTCTTCCGATCTTTACTGGAAGTTAGCACTCT (SEQ ID NO: 200) NGS BRCA1 V990I AcuI
SAM184 AGACGTGTGCTCTTCCGATCTACCATCATCTAACAGGTCAT (SEQ ID NO: 201) NGS BRCA1 T922I AcuI
SAM185 AGACGTGTGCTCTTCCGATCTCTTGATCACAGATGTATGTA (SEQ ID NO: 202) NGS BRCA1 D67N AcuI
SAM186 AGACGTGTGCTCTTCCGATCTCTCCTGCATTCAAAAGATTC (SEQ ID NO: 203) NGS BRCA1 E1754K AcuI
SAM189 AGACGTGTGCTCTTCCGATCTTTCACCTTAAATAACAAAAA (SEQ ID NO: 204) NGS BRCA1 S1363L AcuI
SAM190 AGACGTGTGCTCTTCCGATCTCATATGGGAAAAAGAGTTAG (SEQ ID NO: 205) NGS BRCA1 Q1779* AcuI
SAM192 AGACGTGTGCTCTTCCGATCTCCTGATATTTCTGTCCCTTG (SEQ ID NO: 206) NGS BRCA2 R2842C AcuI
SAM193 AGACGTGTGCTCTTCCGATCTTACTACCTATGTGGCTTGTG (SEQ ID NO: 207) NGS BRCA2 R2973H AcuI
SAM194 AGACGTGTGCTCTTCCGATCTGTACCGGTAGTTGTTGATAC (SEQ ID NO: 208) NGS BRCA2 S2998F AcuI
SAM195 AGACGTGTGCTCTTCCGATCTCCTGTACAATGAAAAGTAGA (SEQ ID NO: 209) NGS BRCA2 S3070F AcuI
SAM196 AGACGTGTGCTCTTCCGATCTTACGGCAGTATGGTTAAGGT (SEQ ID NO: 210) NGS BRCA2 E2772K AcuI
SAM197 AGACGTGTGCTCTTCCGATCTCCTACCTCAAAATTATTACT (SEQ ID NO: 211) NGS BRCA2 T1707I AcuI
SAM198 AGACGTGTGCTCTTCCGATCTCCTGTACAATGAAAAGTAGA (SEQ ID NO: 212) NGS BRCA2 V3079I AcuI
SAM199 AGACGTGTGCTCTTCCGATCTGTACCGGTAGTTGTTGATAC (SEQ ID NO: 213) NGS BRCA2 Q2960* AcuI
SAM201 AGACGTGTGCTCTTCCGATCTACCATGTTTGAGTGACCTGA (SEQ ID NO: 214) NGS BRCA2 T544I AcuI
SAM202 AGACGTGTGCTCTTCCGATCTTAAGTCAGTCTCATCTGCAA (SEQ ID NO: 215) NGS BRCA2 V2102I AcuI
SAM203 AGACGTGTGCTCTTCCGATCTGGAGGGACAAAAATAAAACA (SEQ ID NO: 216) NGS BRCA2 R2896C AcuI
SAM204 AGACGTGTGCTCTTCCGATCTACCATGTTTGAGTGACCTGA (SEQ ID NO: 217) NGS BRCA2 V572I AcuI
SAM205 AGACGTGTGCTCTTCCGATCTACTAGCTCTTTTGGGACAAT (SEQ ID NO: 218) NGS BRCA2 V778I AcuI
SAM113 caagcagaagacggcatacgagatTGCCTCTTgtgactggagttcagacgtgtgctcttccgatct (SEQ ID NO: 219) N711
SAM64 aatgatacggcgaccaccgagatctacacACTGCATAacactctttccctacacgacg (SEQ ID NO: 220) S506
TP370 acactctttccctacacgacgctcttccgatctGTTTAAACAGTGGAATTCTAGAGTCA (SEQ ID NO: 221) BRCA2_NGS_F
TP371 agacgtgtgctcttccgatctTTTTTGCAGCTGTGTCATCC (SEQ ID NO: 222) BRCA2 NGS R
TP372 acactctttccctacacgacgctcttccgatctGCCCCTCCTCAGCATCTTAT (SEQ ID NO: 223) TP53 NGS F
TP373 agacgtgtgctcttccgatctCTTAACCCCTCCTCCCAGAG (SEQ ID NO: 224) TP53 NGS R
ssODNs
Columns: sequence (5′ -> 3′), targeted gene
TTCCTTAGTCTTTCTTTGAAGCAGCAAGTATGATGAGCAAGCTTTCTCACAAGCATTTGGTTTTAAATTATGGAGTATGTGTgtttaaacCTGTGGAGACGAGAGTAAGTAAAACTACAGGCTTTCTAATGCCTTTCTCAGAGCATCTGTTTTTGTTTATATAGAAAATTCAGTTTCAGGATCA (SEQ ID NO: 225) JAK2
AAGAAGGGCTCCCATCACATCAACCGGTGGCGCATTGCCACGAAGCAGGCCAATGGGGAGGACATCGATGTCACCTCCAATGACTAgtttaaacGGGTGGGCAACCACAAACCCACGAGGGCAGAGTGCTGCTTGCTGCTGGCCAGGCCCCTGCGTGGGCCCAAGCTGGACTCTGGCCACTCCC (SEQ ID NO: 226) EMX1
TACATTTGCTTCTGACACAACTGTGTTCACTAGCAACCTCAAACAGACACAATGGTGCATCTGACTCCTGTCGAGAAGTCTGCCGTTACTGCCCTGTGGGGCAAGGTGAACGTGGATGAAGTTGGTGGTGAGGCCCTGGG (SEQ ID NO: 227) HBB
TCTTAGGTCTGGCCCCTCCTCAGCATCTTATCCGAGTGGAAGGAAATTTGCGTGTGGAGTATTTGGATGACAAACACTTTTCGTCATAGTGTGGTTGTGCCCTATGAGCCGCCTGAGGTCTGGTTTGCAACTGGGGTCTCTGGGAGGAGGGGTTAAGGGTGGTTGT (SEQ ID NO: 228) TP53 R209fs*6
TTGTTTAAACAGTGGAATTCTAGAGTCACACTTCCTAAAATATGCATTTTTGTTTTCACTTTTAGATATGATACTGAAATTGATAGAAGCAGAAGATAGAAGATCGGCTATAAAAAAGATAATGGAAAGGGATGACACAGCTGCAAAAACACTTGTTCTCTGTGTTTCTGACATAAT (SEQ ID NO: 229) BRCA2 dupAGAAGAT
Library of adaptors
Columns: oligo, sequence (5′ -> 3′), notes
PB984 CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAgaattcgagctcggtacccg (SEQ ID NO: 230) Oligo corresponds to the constant strand of the adaptor
PB985 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGGG (SEQ ID NO: 231) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ GG, expected to ligate to CC
PB986 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGAG (SEQ ID NO: 232) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ AG, expected to ligate to CT
PB987 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGAA (SEQ ID NO: 233) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ AA, expected to ligate to TT
PB988 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGTG (SEQ ID NO: 234) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ TG, expected to ligate to CA
PB989 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGTA (SEQ ID NO: 235) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ TA, expected to ligate to TA
PB990 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGCG (SEQ ID NO: 236) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ CG, expected to ligate to CG
PB991 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGCA (SEQ ID NO: 237) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ CA, expected to ligate to TG
PB992 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGCT (SEQ ID NO: 238) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ CT, expected to ligate to AG
PB993 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGGA (SEQ ID NO: 239) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ GA, expected to ligate to TC
PB1000 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGAC (SEQ ID NO: 240) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ AC, expected to ligate to GT
PB1001 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGAT (SEQ ID NO: 241) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ AT, expected to ligate to AT
PB1002 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGCC (SEQ ID NO: 242) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ CC, expected to ligate to GG
PB1003 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGGC (SEQ ID NO: 243) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ GC, expected to ligate to GC
PB1004 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGGT (SEQ ID NO: 244) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ GT, expected to ligate to AC
PB1005 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGTC (SEQ ID NO: 245) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ TC, expected to ligate to GA
PB1006 cgggtaccgagctcgaattcTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGTT (SEQ ID NO: 246) Oligo corresponds to the variable strand of the adaptor. It contains a 3′ TT, expected to ligate to AA
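In every row of the adaptor library above, the dinucleotide each variable strand is "expected to ligate to" is the reverse complement of its own 3′ dinucleotide (GG pairs with CC, AG with CT, TG with CA, and so on). The short Python sketch below regenerates that pairing for all 16 dinucleotides. The names `reverse_complement` and `ligation_partner` are illustrative only and do not come from the source.

```python
from itertools import product

# Watson-Crick complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Map each possible adaptor 3' dinucleotide to the dinucleotide it should pair with.
ligation_partner = {
    "".join(pair): reverse_complement("".join(pair))
    for pair in product("ACGT", repeat=2)
}

for adaptor_end, target in sorted(ligation_partner.items()):
    print(f"adaptor 3' {adaptor_end} -> ligates to {target}")
```

Four dinucleotides (AT, TA, CG, GC) are their own reverse complement, which matches the self-pairing rows for PB1001, PB989, PB990 and PB1003 in the table.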
Oligos (sgRNA cloning)
Columns: oligo, sequence (5′ -> 3′), target/notes
oligo plate CAC CGT ACA TAA AGG ACA CTG TGA (SEQ ID NO: 247) BRCA1 C64Y for
oligo plate CAC CGC AAT TCA GTA CAA TTA GGT (SEQ ID NO: 248) BRCA1 E638K for
oligo plate CAC CGA TTT TCT CTA ATG TTA TTA (SEQ ID NO: 249) BRCA1 E1033K for
oligo plate CAC CGT TTT TCG AGT GAT TCT ATT (SEQ ID NO: 250) BRCA1 E575K for
oligo plate CAC CGT TTT AAC AAA TGA CTT GAT (SEQ ID NO: 251) BRCA1 V990I for
oligo plate CAC CGA GAC AGT TAA TAT CAC TGC (SEQ ID NO: 252) BRCA1 T922I for
oligo plate CAC CGT TAT ATC ATT CTT ACA TAA (SEQ ID NO: 253) BRCA1 D67N for
oligo plate CAC CGG GGA TTC TCT TGC TCG CTT (SEQ ID NO: 254) BRCA1 E1754K for
oligo plate CAC CGT GGA TTC AAA CTT AGG TAT (SEQ ID NO: 255) BRCA1 S1363L for
oligo plate CAC CGT TAG ATC AAC TGG AAT GGA (SEQ ID NO: 256) BRCA1 Q1779* for
oligo plate CAC CGA TAT TTC GCA ATG AAA GAG (SEQ ID NO: 257) BRCA2 R2842C for
oligo plate CAC CGA CAA TAC GCA ACT TCC ACA (SEQ ID NO: 258) BRCA2 R2973H for
oligo plate CAC CGT ATA TTC TCT GTT AAC AGA (SEQ ID NO: 259) BRCA2 S2998F for
oligo plate CAC CGG TTC TGA GGT GGA CCT AAT (SEQ ID NO: 260) BRCA2 S3070F for
oligo plate CAC CGG AGA TTC TGG GGC TTC AAG (SEQ ID NO: 261) BRCA2 E2772K for
oligo plate CAC CGT AAA TAC TGC AGA TTA TGT (SEQ ID NO: 262) BRCA2 T1707I for
oligo plate CAC CGA GAA ACG ACA AAT CCT ATT (SEQ ID NO: 263) BRCA2 V3079I for
oligo plate CAC CGA AGG AAC AAG GTT TAT CAA (SEQ ID NO: 264) BRCA2 Q2960* for
oligo plate CAC CGC ATA CTG TTT GCT CAC AGA (SEQ ID NO: 265) BRCA2 T544I for
oligo plate CAC CGG CTA CAG AAT TCT GTG TGG (SEQ ID NO: 266) BRCA2 V572I for
oligo plate CAC CGA CAG AAC ATC CTT GGA AGT (SEQ ID NO: 267) BRCA2 V778I for
oligo plate AAA CTC ACA GTG TCC TTT ATG TAC (SEQ ID NO: 268) BRCA1 C64Y rev
oligo plate AAA CAC CTA ATT GTA CTG AAT TGC (SEQ ID NO: 269) BRCA1 E638K rev
oligo plate AAA CTA ATA ACA TTA GAG AAA ATC (SEQ ID NO: 270) BRCA1 E1033K rev
oligo plate AAA CAA TAG AAT CAC TCG AAA AAC (SEQ ID NO: 271) BRCA1 E575K rev
oligo plate AAA CAT CAA GTC ATT TGT TAA AAC (SEQ ID NO: 272) BRCA1 V990I rev
oligo plate AAA CGC AGT GAT ATT AAC TGT CTC (SEQ ID NO: 273) BRCA1 T922I rev
oligo plate AAA CTT ATG TAA GAA TGA TAT AAC (SEQ ID NO: 274) BRCA1 D67N rev
oligo plate AAA CAA GCG AGC AAG AGA ATC CCC (SEQ ID NO: 275) BRCA1 E1754K rev
oligo plate AAA CAT ACC TAA GTT TGA ATC CAC (SEQ ID NO: 276) BRCA1 S1363L rev
oligo plate AAA CTC CAT TCC AGT TGA TCT AAC (SEQ ID NO: 277) BRCA1 Q1779* rev
oligo plate AAA CCT CTT TCA TTG CGA AAT ATC (SEQ ID NO: 278) BRCA2 R2842C rev
oligo plate AAA CTG TGG AAG TTG CGT ATT GTC (SEQ ID NO: 279) BRCA2 R2973H rev
oligo plate AAA CTC TGT TAA CAG AGA ATA TAC (SEQ ID NO: 280) BRCA2 S2998F rev
oligo plate AAA CAT TAG GTC CAC CTC AGA ACC (SEQ ID NO: 281) BRCA2 S3070F rev
oligo plate AAA CCT TGA AGC CCC AGA ATC TCC (SEQ ID NO: 282) BRCA2 E2772K rev
oligo plate AAA CAC ATA ATC TGC AGT ATT TAC (SEQ ID NO: 283) BRCA2 T1707I rev
oligo plate AAA CAA TAG GAT TTG TCG TTT CTC (SEQ ID NO: 284) BRCA2 V3079I rev
oligo plate AAA CTT GAT AAA CCT TGT TCC TTC (SEQ ID NO: 285) BRCA2 Q2960* rev
oligo plate AAA CTC TGT GAG CAA ACA GTA TGC (SEQ ID NO: 286) BRCA2 T544I rev
oligo plate AAA CCC ACA CAG AAT TCT GTA GCC (SEQ ID NO: 287) BRCA2 V572I rev
oligo plate AAA CAC TTC CAA GGA TGT TCT GTC (SEQ ID NO: 288) BRCA2 V778I rev
PB776 CACCGAACTTcGAGATACAGCAGAC (SEQ ID NO: 289) PIK3R1 R348* for
PB777 AAACGTCTGCTGTATCTCgAAGTTC (SEQ ID NO: 290) PIK3R1 R348* rev
PB551 CACCGGGCCAGCTGGAGGCCGTCG (SEQ ID NO: 291) SPRTN Q60* for
PB552 AAACCGACGGCCTCCAGCTGGCCC (SEQ ID NO: 292) SPRTN Q60* rev
PB756 CACCGAGCcAGGTGAGGCCTGGAGG (SEQ ID NO: 293) TCOF1 Q290* for
PB757 AAACCCTCCAGGCCTCACCTgGCTC (SEQ ID NO: 294) TCOF1 Q290* rev
TP212 CACCGAATTATGGAGTATGTGTCTG (SEQ ID NO: 295) JAK2 HDR for
TP213 AAACCAGACACATACTCCATAATTC (SEQ ID NO: 296) JAK2 HDR rev
PB963 CACCGATGGTGCATCTGACTCCTG (SEQ ID NO: 297) HBB E6V HDR for
PB964 AAACCAGGAGTCAGATGCACCATC (SEQ ID NO: 298) HBB E6V HDR rev
PB1017 CACCGAGTCCGAGCAGAAGAAGAA (SEQ ID NO: 299) EMX1 Base editing for
PB1018 AAACTTCTTCTTCTGCTCGGACTC (SEQ ID NO: 300) EMX1 Base editing rev
PB325 CACCGGTCACCTCCAATGACTAGGG (SEQ ID NO: 301) EMX1 HDR for
PB326 AAACCCCTAGTCATTGGAGGTGACC (SEQ ID NO: 302) EMX1 HDR rev
PB1299 CACCGCACTTTTCGACATAGTGTGG (SEQ ID NO: 303) TP53 R209fs*6
PB1300 AAACCCACACTATGTCGAAAAGTGC (SEQ ID NO: 304) TP53 R209fs*6
PB580 CACCGCAGCATCAGAGGACTAGCTC (SEQ ID NO: 305) SMARCAL1 Q34*
PB581 AAACGAGCTAGTCCTCTGATGCTGC (SEQ ID NO: 306) SMARCAL1 Q34*
PB838 CACCGATTCCcAGCACGCTGATGTG (SEQ ID NO: 307) FANCD2 Q223* for
PB839 AAACCACATCAGCGTGCTgGGAATC (SEQ ID NO: 308) FANCD2 Q223* rev
E12 CAC CGA TAC ATT TTG TCT AGA CGT (SEQ ID NO: 309) BRCA2 V2102I for
H06 AAA CAC GTC TAG ACA AAA TGT ATC (SEQ ID NO: 310) BRCA2 V2102I rev
PB1294 CACCGTTTCACTTTTAGATATGATA (SEQ ID NO: 311) BRCA2 dupAGAAGAT for
PB1295 AAACTATCATATCTAAAAGTGAAAC (SEQ ID NO: 312) BRCA2 dupAGAAGAT rev
PB738 CACCGAAGACTCGAGCCCTCCAGCG (SEQ ID NO: 313) TIMELESS R267* for
PB739 AAACCGCTGGAGGGCTCGAGTCTTC (SEQ ID NO: 314) TIMELESS R267* rev
PB834 CACCGCAGCcAGTCAGCGTCCTCGC (SEQ ID NO: 315) SLX4 W879* for
PB835 AAACGCGAGGACGCTGACTgGCTGC (SEQ ID NO: 316) SLX4 W879* rev
PB736 CACCGGTACAACGAATGGGTAGAAC (SEQ ID NO: 317) FANCM Q572* for
PB737 AAACGTTCTACCCATTCGTTGTACC (SEQ ID NO: 318) FANCM Q572* rev
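The forward and reverse sgRNA cloning oligos above follow a pattern that can be read directly from the listed pairs: each forward oligo starts with CACC, and its partner is AAAC followed by the reverse complement of everything after that CACC prefix. The Python sketch below derives the reverse oligo from a forward oligo on that basis. The function name `reverse_cloning_oligo` is hypothetical, and the purpose of the 4-nt prefixes (presumably overhangs matching a digested vector) is an assumption, since the cloning vector and enzyme are not named in this section.

```python
# Watson-Crick complement table for unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def reverse_cloning_oligo(forward: str) -> str:
    """Derive the reverse cloning oligo from a forward oligo with a CACC prefix.

    Spaces (triplet grouping) and lower-case marking are ignored; the result
    is returned in upper case without spaces.
    """
    fwd = forward.replace(" ", "").upper()
    if not fwd.startswith("CACC"):
        raise ValueError("forward oligo is expected to start with CACC")
    return "AAAC" + reverse_complement(fwd[4:])


# BRCA1 C64Y forward oligo (SEQ ID NO: 247), copied from the listing.
print(reverse_cloning_oligo("CAC CGT ACA TAA AGG ACA CTG TGA"))
```

Applied to SEQ ID NO: 247 this reproduces the listed reverse oligo SEQ ID NO: 268 once the triplet spacing is removed, which is a quick consistency check for any pair in the table.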
- Allen, F., Crepaldi, L., Alsinet, C., Strong, A.J., Kleshchevnikov, V., De Angeli, P., Palenikova, P., Khodak, A., Kiselev, V., Kosicki, M., et al. (2018). Predicting the mutations generated by repair of Cas9-induced double-strand breaks. Nature biotechnology.
- Anzalone, A.V., Randolph, P.B., Davis, J.R., Sousa, A.A., Koblan, L.W., Levy, J.M., Chen, P.J., Wilson, C., Newby, G.A., Raguram, A., et al. (2019). Search-and-replace genome editing without double-strand breaks or donor DNA. Nature.
- Apostolou, P., and Fostira, F. (2013). Hereditary breast cancer: the era of new susceptibility genes. Biomed Res Int 2013, 747318.
- Barbieri, E.M., Muir, P., Akhuetie-Oni, B.O., Yellman, C.M., and Isaacs, F.J. (2017). Precise Editing at DNA Replication Forks Enables Multiplex Genome Engineering in Eukaryotes. Cell 171, 1453-1467 e1413.
- Bath, A.J., Milsom, S.E., Gormley, N.A., and Halford, S.E. (2002). Many type IIs restriction endonucleases interact with two recognition sites before cleaving DNA. J Biol Chem 277, 4024-4033.
- Bhagwat, N., Koppikar, P., Keller, M., Marubayashi, S., Shank, K., Rampal, R., Qi, J., Kleppe, M., Patel, H.J., Shah, S.K., et al. (2014). Improved targeting of JAK2 leads to increased therapeutic efficacy in myeloproliferative neoplasms. Blood 123, 2075-2083.
- Bhojwani, D., and Pui, C.H. (2013). Relapsed childhood acute lymphoblastic leukaemia.
Lancet Oncol 14, e205-217. - Billing, D., Horiguchi, M., Wu-Baer, F., Taglialatela, A., Leuzzi, G., Nanez, S.A., Jiang, W., Zha, S., Szabolcs, M., Lin, C.S., et al. (2018). The BRCT Domains of the BRCA1 and BARD1 Tumor Suppressors Differentially Regulate Homology-Directed Repair and Stalled Fork Protection. Mol Cell 72, 127-139 e128.
- Billon, P., Bryant, E.E., Joseph, S.A., Nambiar, T.S., Hayward, S.B., Rothstein, R., and Ciccia, A. (2017). CRISPR-Mediated Base Editing Enables Efficient Disruption of Eukaryotic Genes through Induction of STOP Codons. Mol Cell 67, 1068-1079 e1064.
- Brinkman, E.K., Chen, T., Amendola, M., and van Steensel, B. (2014). Easy quantitative assessment of genome editing by sequence trace decomposition. Nucleic Acids Res 42, e168.
- Brinkman, E.K., Kousholt, A.N., Harmsen, T., Leemans, C., Chen, T., Jonkers, J., and van Steensel, B. (2018). Easy quantification of template-directed CRISPR/Cas9 editing. Nucleic Acids Res 46, e58.
- Canny, M.D., Moatti, N., Wan, L.C.K., Fradet-Turcotte, A., Krasner, D., Mateos-Gomez, P.A., Zimmermann, M., Orthwein, A., Juang, Y.C., Zhang, W., et al. (2018). Inhibition of 53BP1 favors homology-dependent DNA repair and increases CRISPR-Cas9 genome-editing efficiency. Nat Biotechnol 36, 95-102.
- Ceccaldi, R., Sarangi, P., and D′Andrea, A.D. (2016). The Fanconi anaemia pathway: new players and new functions. Nat Rev
Mol Cell Biol 17, 337-349. - Chadwick, A.C., Wang, X., and Musunuru, K. (2017). In Vivo Base Editing of PCSK9 (Proprotein Convertase Subtilisin/Kexin Type 9) as a Therapeutic Alternative to Genome Editing. Arterioscler Thromb Vasc Biol 37, 1741-1747.
- Clement, K., Rees, H., Canver, M.C., Gehrke, J.M., Farouni, R., Hsu, J.Y., Cole, M.A., Liu, D.R., Joung, J.K., Bauer, D.E., et al. (2019). CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nature biotechnology 37, 224-226.
- Cong, L., Ran, F.A., Cox, D., Lin, S., Barretto, R., Habib, N., Hsu, P.D., Wu, X., Jiang, W., Marraffini, L.A., et al. (2013). Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819-823.
- Dieck, C.L., and Ferrando, A.A. (2019). Genetics and mechanisms of NT5C2-driven chemotherapy resistance in relapsed ALL. Blood.
- Dow, L.E. (2015). Modeling Disease In Vivo With CRISPR/Cas9.
Trends Mol Med 21, 609-621. - Fahim Farzadfard, T.K.L. (2018). Emerging applications for DNA writers and molecular recorders. Science 361, 870-875.
- Findlay, G.M., Daza, R.M., Martin, B., Zhang, M.D., Leith, A.P., Gasperini, M., Janizek, J.D., Huang, X., Starita, L.M., and Shendure, J. (2018). Accurate classification of BRCA1 variants with saturation genome editing. Nature.
- Gao, X., Tao, Y., Lamas, V., Huang, M., Yeh, W.H., Pan, B., Hu, Y.J., Hu, J.H., Thompson, D.B., Shu, Y., et al. (2018). Treatment of autosomal dominant hearing loss by in vivo delivery of genome editing agents. Nature 553, 217-221.
- Gaudelli, N.M., Komor, A.C., Rees, H.A., Packer, M.S., Badran, A.H., Bryson, D.I., and Liu, D.R. (2017). Programmable base editing of A*T to G*C in genomic DNA without DNA cleavage. Nature.
- Germini, D., Tsfasman, T., Zakharova, V.V., Sjakste, N., Lipinski, M., and Vassetzky, Y. (2018). A Comparison of Techniques to Evaluate the Effectiveness of Genome Editing. Trends Biotechnol 36, 147-159.
- Guo, X., Chavez, A., Tung, A., Chan, Y., Kaas, C., Yin, Y., Cecchi, R., Garnier, S.L., Kelsic, E.D., Schubert, M., et al. (2018). High-throughput creation and functional profiling of DNA sequence variant libraries using CRISPR-Cas9 in yeast. Nat Biotechnol 36, 540-546.
- Inaba, H., Greaves, M., and Mullighan, C.G. (2013). Acute lymphoblastic leukaemia. Lancet 381, 1943-1955.
- Jasin, M., and Haber, J.E. (2016). The democratization of gene editing: Insights from site-specific cleavage and double-strand break repair. DNA Repair (Amst) 44, 6-16.
- Kluesner, M.G., Nedveck, D.A., Lahr, W.S., Garbe, J.R., Abrahante, J.E., Webber, B.R., and Moriarity, B.S. (2018). EditR: A Method to Quantify Base Editing from Sanger Sequencing.
CRISPR J 1, 239-250. - Komor, A.C., Kim, Y.B., Packer, M.S., Zuris, J.A., and Liu, D.R. (2016). Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 420-424.
- Komor, A.C., Zhao, K.T., Packer, M.S., Gaudelli, N.M., Waterbury, A.L., Koblan, L.W., Kim, Y.B., Badran, A.H., and Liu, D.R. (2017). Improved base excision repair inhibition and bacteriophage Mu Gam protein yields C:G-to-T:A base editors with higher efficiency and product purity.
Sci Adv 3, eaao4774. - Leenay, R.T., Aghazadeh, A., Hiatt, J., Tse, D., Roth, T.L., Apathy, R., Shifrut, E., Hultquist, J.F., Krogan, N., Wu, Z., et al. (2019). Large dataset enables prediction of repair after CRISPR-Cas9 editing in primary T cells. Nature biotechnology 37, 1034-1037.
- Lek, M., Karczewski, K.J., Minikel, E.V., Samocha, K.E., Banks, E., Fennell, T., O’Donnell-Luria, A.H., Ware, J.S., Hill, A.J., Cummings, B.B., et al. (2016). Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285-291.
- Levine, R.L., Wadleigh, M., Cools, J., Ebert, B.L., Wernig, G., Huntly, B.J., Boggon, T.J., Wlodarska, I., Clark, J.J., Moore, S., et al. (2005). Activating mutation in the tyrosine kinase JAK2 in polycythemia vera, essential thrombocythemia, and myeloid metaplasia with myelofibrosis.
Cancer Cell 7, 387-397. - Levy, J.M., Yeh, W.H., Pendse, N., Davis, J.R., Hennessey, E., Butcher, R., Koblan, L.W., Comander, J., Liu, Q., and Liu, D.R. (2020). Cytosine and adenine base editing of the brain, liver, retina, heart and skeletal muscle of mice via adeno-associated viruses.
Nat Biomed Eng 4, 97-110. - Lindsay, H., Burger, A., Biyong, B., Felker, A., Hess, C., Zaugg, J., Chiavacci, E., Anders, C., Jinek, M., Mosimann, C., et al. (2016). CrispRVariants charts the mutation spectrum of genome engineering experiments. Nat Biotechnol 34, 701-702.
- Liu, Z., Lu, Z., Yang, G., Huang, S., Li, G., Feng, S., Liu, Y., Li, J., Yu, W., Zhang, Y., et al. (2018). Efficient generation of mouse models of human diseases via ABE- and BE-mediated base editing.
Nat Commun 9, 2338. - Lundin, S., Jemt, A., Terje-Hegge, F., Foam, N., Pettersson, E., Kaller, M., Wirta, V., Lexow, P., and Lundeberg, J. (2015). Endonuclease specificity and sequence dependence of type IIS restriction enzymes.
PLoS One 10, e0117059. - Mali, P., Yang, L., Esvelt, K.M., Aach, J., Guell, M., DiCarlo, J.E., Norville, J.E., and Church, G.M. (2013). RNA-guided human genome engineering via Cas9. Science 339, 823-826.
- Mashal, R.D., Koontz, J., and Sklar, J. (1995). Detection of mutations by cleavage of DNA heteroduplexes with bacteriophage resolvases.
Nat Genet 9, 177-183. - McClellan, J., and King, M.C. (2010). Genetic heterogeneity in human disease. Cell 141, 210-217.
- Mullally, A., Lane, S.W., Ball, B., Megerdichian, C., Okabe, R., Al-Shahrour, F., Paktinat, M., Haydu, J.E., Housman, E., Lord, A.M., et al. (2010). Physiological Jak2V617F expression causes a lethal myeloproliferative neoplasm with differential effects on hematopoietic stem and progenitor cells.
Cancer Cell 17, 584-596. - Oshima, K., Khiabanian, H., da Silva-Almeida, A.C., Tzoneva, G., Abate, F., Ambesi-Impiombato, A., Sanchez-Martin, M., Carpenter, Z., Penson, A., Perez-Garcia, A., et al. (2016). Mutational landscape, clonal evolution patterns, and role of RAS mutations in relapsed acute lymphoblastic leukemia. Proc Natl Acad Sci USA 113, 11306-11311.
- Pathania, S., Bade, S., Le Guillou, M., Burke, K., Reed, R., Bowman-Colin, C., Su, Y., Ting, D.T., Polyak, K., Richardson, A.L., et al. (2014). BRCA1 haploinsufficiency for replication stress suppression in primary cells.
Nature communications 5, 5496. - Paulsen, B.S., Mandal, P.K., Frock, R.L., Boyraz, B., Yadav, R., Upadhyayula, S., Gutierrez-Martinez, P., Ebina, W., Fasth, A., Kirchhausen, T., et al. (2017). Ectopic expression of RAD52 and dn53BP1 improves homology-directed repair during CRISPR-Cas9 genome editing.
Nat Biomed Eng 1, 878-888. - Pinello, L., Canver, M.C., Hoban, M.D., Orkin, S.H., Kohn, D.B., Bauer, D.E., and Yuan, G.C. (2016). Analyzing CRISPR genome-editing experiments with CRISPResso. Nature biotechnology 34, 695-697.
- Qiu, P., Shandilya, H., D′Alessio, J.M., O’Connor, K., Durocher, J., and Gerard, G.F. (2004). Mutation detection using Surveyor nuclease. Biotechniques 36, 702-707.
- Ran, F.A., Hsu, P.D., Wright, J., Agarwala, V., Scott, D.A., and Zhang, F. (2013). Genome engineering using the CRISPR-Cas9 system.
Nature protocols 8, 2281-2308. - Rees, H.A., and Liu, D.R. (2018). Base editing: precision chemistry on the genome and transcriptome of living cells.
Nat Rev Genet 19, 770-788. - Roy, K.R., Smith, J.D., Vonesch, S.C., Lin, G., Tu, C.S., Lederer, A.R., Chu, A., Suresh, S., Nguyen, M., Horecka, J., et al. (2018). Multiplexed precision genome editing with trackable genomic barcodes in yeast. Nat Biotechnol 36, 512-520.
- Ryu, S.M., Koo, T., Kim, K., Lim, K., Baek, G., Kim, S.T., Kim, H.S., Kim, D.E., Lee, H., Chung, E., et al. (2018). Adenine base editing in mouse embryos and an adult mouse model of Duchenne muscular dystrophy. Nature biotechnology 36, 536-539.
- Shakya, R., Reid, L.J., Reczek, C.R., Cole, F., Egli, D., Lin, C.S., deRooij, D.G., Hirsch, S., Ravi, K., Hicks, J.B., et al. (2011). BRCA1 tumor suppression depends on BRCT phosphoprotein binding, but not its E3 ligase activity. Science 334, 525-528.
- Shen, M.W., Arbab, M., Hsu, J.Y., Worstell, D., Culbertson, S.J., Krabbe, O., Cassa, C.A., Liu, D.R., Gifford, D.K., and Sherwood, R.I. (2018). Predictable and precise template-free CRISPR editing of pathogenic variants. Nature 563, 646-651.
- Song, C.Q., Jiang, T., Richter, M., Rhym, L.H., Koblan, L.W., Zafra, M.P., Schatoff, E.M., Doman, J.L., Cao, Y., Dow, L.E., et al. (2020). Adenine base editing in an adult mouse model of tyrosinaemia. Nat Biomed Eng 4, 125-130.
- Tan, S.L.W., Chadha, S., Liu, Y., Gabasova, E., Perera, D., Ahmed, K., Constantinou, S., Renaudin, X., Lee, M., Aebersold, R., et al. (2017). A Class of Environmental and Endogenous Toxins Induces BRCA2 Haploinsufficiency and Genome Instability. Cell 169, 1105-1118 e1115.
- Tschaharganeh, D.F., Xue, W., Calvisi, D.F., Evert, M., Michurina, T.V., Dow, L.E., Banito, A., Katz, S.F., Kastenhuber, E.R., Weissmueller, S., et al. (2014). p53-dependent Nestin regulation links tumor suppression to cellular plasticity in liver cancer. Cell 158, 579-592.
- Tzoneva, G., Dieck, C.L., Oshima, K., Ambesi-Impiombato, A., Sanchez-Martin, M., Madubata, C.J., Khiabanian, H., Yu, J., Waanders, E., Iacobucci, I., et al. (2018). Clonal evolution mechanisms in NT5C2 mutant-relapsed acute lymphoblastic leukaemia. Nature 553, 511-514.
- Tzoneva, G., Perez-Garcia, A., Carpenter, Z., Khiabanian, H., Tosello, V., Allegretta, M., Paietta, E., Racevskis, J., Rowe, J.M., Tallman, M.S., et al. (2013). Activating mutations in the NT5C2 nucleotidase gene drive chemotherapy resistance in relapsed ALL. Nat Med 19, 368-371.
- van Overbeek, M., Capurso, D., Carter, M.M., Thompson, M.S., Frias, E., Russ, C., Reece-Hoyes, J.S., Nye, C., Gradia, S., Vidal, B., et al. (2016). DNA Repair Profiling Reveals Nonrandom Outcomes at Cas9-Mediated Breaks. Mol Cell 63, 633-646.
- Villiger, L., Grisch-Chan, H.M., Lindsay, H., Ringnalda, F., Pogliano, C.B., Allegri, G., Fingerhut, R., Haberle, J., Matos, J., Robinson, M.D., et al. (2018). Treatment of a metabolic liver disease by in vivo genome base editing in adult mice. Nat Med 24, 1519-1525.
- Wang, L., Xue, W., Yan, L., Li, X., Wei, J., Chen, M., Wu, J., Yang, B., Yang, L., and Chen, J. (2017). Enhanced base editing by co-expression of free uracil DNA glycosylase inhibitor. Cell Res 27, 1289-1292.
- Yeh, W.H., Chiang, H., Rees, H.A., Edge, A.S.B., and Liu, D.R. (2018). In vivo base editing of post-mitotic sensory cells. Nature communications 9, 2184.
- Yin, H., Song, C.Q., Dorkin, J.R., Zhu, L.J., Li, Y., Wu, Q., Park, A., Yang, J., Suresh, S., Bizhanova, A., et al. (2016). Therapeutic genome editing by combined viral and non-viral delivery of CRISPR system components in vivo. Nature biotechnology 34, 328-333.
- Yin, H., Xue, W., Chen, S., Bogorad, R.L., Benedetti, E., Grompe, M., Koteliansky, V., Sharp, P.A., Jacks, T., and Anderson, D.G. (2014). Genome editing with Cas9 in adult mice corrects a disease mutation and phenotype. Nature biotechnology 32, 551-553.
- Zafra, M.P., Schatoff, E.M., Katti, A., Foronda, M., Breinig, M., Schweitzer, A.Y., Simon, A., Han, T., Goswami, S., Montgomery, E., et al. (2018). Optimized base editors enable efficient editing in cells, organoids and mice. Nature biotechnology 36, 888-893.
- Zhang, J., Li, J., Saucier, J.B., Feng, Y., Jiang, Y., Sinson, J., McCombs, A.K., Schmitt, E.S., Peacock, S., Chen, S., et al. (2019). Non-invasive prenatal sequencing for multiple Mendelian monogenic disorders using circulating cell-free fetal DNA. Nat Med 25, 439-447.
- All documents cited in this application are hereby incorporated by reference as if recited in full herein.
- Although illustrative embodiments of the present disclosure have been described herein, it should be understood that the disclosure is not limited to those described, and that various other changes or modifications may be made by one skilled in the art without departing from the scope or spirit of the disclosure.
Claims (20)
1. A method for detecting a genetic modification in a DNA sequence of interest, comprising the steps of:
(a) amplifying the DNA sequence of interest using a specially designed Type IIS restriction enzyme-tagging primer, comprising:
(i) obtaining the DNA sequence of interest from a biological sample;
(ii) synthesizing the Type IIS restriction enzyme-tagging primer based on the DNA sequence of interest;
(iii) amplifying the DNA sequence of interest using the Type IIS restriction enzyme-tagging primer and a reverse primer; and
(iv) purifying a Type IIS restriction enzyme-tagged amplicon;
(b) digesting the Type IIS restriction enzyme-tagged amplicon with the Type IIS restriction enzyme;
(c) isolating the smaller DNA fragment containing the genetic modification exposed in a 3′ single-stranded overhang;
(d) capturing the genetic modification, comprising:
(i) preparing a library of 16 DNA adaptors, wherein each DNA adaptor comprises one strand with sequence of 5′-CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG-3′ (SEQ ID NO: 230); and one complementary strand with sequence of 5′-CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAGNN-3′ with “N” corresponding to A, T, G or C (SEQ ID NOs: 231-246), and wherein each DNA adaptor has a different “NN”;
(ii) incubating the isolated smaller DNA fragment containing the 3′ overhang with the library of DNA adaptors and performing a ligation; and
(iii) obtaining a ligated product; and
(e) amplifying the ligated product to detect the presence of the genetic modification, wherein the DNA sequence of interest is a genomic locus or corresponds to a genomic locus of an RNA virus variant.
2. The method of claim 1, wherein the DNA sequence of interest corresponds to a genomic locus of an RNA virus variant, and wherein obtaining the DNA sequence of interest comprises obtaining the RNA sequence from the RNA virus variant and converting it to the corresponding DNA sequence by reverse transcription PCR (RT-PCR).
3. The method of claim 2, wherein the RNA virus is SARS-CoV-2.
4. The method of claim 1, wherein the Type IIS restriction enzyme is selected from AcuI, BpmI, BpuEI, BsgI, MmeI and NmeAIII.
5. The method of claim 4, wherein the Type IIS restriction enzyme is AcuI.
6. The method of claim 1, wherein the Type IIS restriction enzyme-tagging primer is an oligonucleotide comprising:
(a) a non-complementary handle sequence positioned on the 5′ side;
(b) a complementary sequence of the genomic locus of interest on the 5′ side;
(c) a recognition motif of the Type IIS restriction enzyme that is positioned at a predicted distance from its cleavage site to generate the genomic signature of interest; and
(d) a complementary sequence of the genomic locus of interest on the 3′ side.
7. A kit for detecting a genetic modification of interest, comprising a specially designed Type IIS restriction enzyme-tagging primer according to claim 6, and a library of DNA adaptors according to claim 1, packaged together with instructions for its use.
8. The method of claim 5, wherein the AcuI-tagging primer is an oligonucleotide comprising:
(a) a non-complementary handle sequence positioned on the 5′ side; and
(b) a complementary sequence of the genomic locus of interest containing an AcuI motif (5′-CTGAAG-3′) positioned 14 bp upstream from the genomic locus of interest.
9. The method of claim 8, wherein the reverse primer is positioned at more than 100 bp downstream of the genomic locus of interest.
10. The method of claim 8, wherein the non-complementary handle sequence is 25 bp.
11. The method of claim 8, wherein the complementary sequence has the structure of: 5′-N(20)CTGAAGN(14)-3′ or 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C, depending on the DNA sequence of the genomic locus of interest.
12. The method of claim 8, wherein the non-complementary handle sequence is 5′-GCAATTCCTCACGAGACCCGTCCTG-3′ (SEQ ID NO: 3) and the complementary sequence is 5′-N(15)CTGAAGN(14)-3′, with “N” corresponding to A, T, G or C.
13. A kit for detecting a genetic modification, comprising a specially designed AcuI-tagging primer and a library of DNA adaptors according to claim 1, packaged together with instructions for its use.
14. A method for quantifying a genomic variant in a biological system, comprising the steps of:
(a) obtaining a sample from the biological system;
(b) amplifying a DNA sequence of interest using a specially designed AcuI-tagging primer, wherein the DNA sequence of interest is a genomic locus or corresponds to a genomic locus of an RNA virus variant, comprising:
(i) obtaining the DNA sequence of interest by (1) genomic extraction or (2) obtaining the RNA sequence from the RNA virus variant and converting it to the corresponding DNA sequence by reverse transcription PCR (RT-PCR);
(ii) synthesizing the AcuI-tagging primer based on the DNA sequence of interest;
(iii) amplifying the DNA sequence of interest using the AcuI-tagging primer and a reverse primer; and
(iv) purifying an AcuI-tagged amplicon;
(c) digesting the AcuI-tagged amplicon with restriction enzyme AcuI;
(d) isolating the smaller DNA fragment containing the genomic variant of interest produced by the AcuI-digestion;
(e) capturing the genomic variant of interest, comprising:
(i) preparing the library of DNA adaptors according to claim 1;
(ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and
(iii) obtaining a ligated product; and
(f) quantifying the genomic variant and determining its relative abundance.
15. The method of claim 14, wherein the genomic variant is generated by precision genome editing.
16. The method of claim 15, wherein the precision genome editing is CRISPR-dependent homology-directed repair, base editing or prime editing.
17. The method of claim 14, wherein the quantification in step (f) is carried out by quantitative PCR (qPCR).
18. A method for identifying and quantifying an oncogenic mutation of interest in a biological sample, comprising the steps of:
(a) obtaining a biological sample;
(b) amplifying a genomic locus of interest using a specially designed AcuI-tagging primer, comprising:
(i) extracting DNA of interest;
(ii) synthesizing the AcuI-tagging primer based on the genomic locus of interest;
(iii) amplifying the genomic locus of interest using the AcuI-tagging primer and a reverse primer; and
(iv) purifying an AcuI-tagged genomic amplicon;
(c) digesting the AcuI-tagged genomic amplicon with restriction enzyme AcuI;
(d) isolating the smaller DNA fragment containing the oncogenic mutation of interest produced by the AcuI-digestion;
(e) capturing the genomic signature of interest, comprising:
(i) preparing the library of DNA adaptors according to claim 1;
(ii) incubating the isolated smaller DNA fragment with the library of DNA adaptors and performing a ligation; and
(iii) obtaining a ligated product;
(f) amplifying the ligated product to identify the presence of the oncogenic mutation of interest; and
(g) quantifying the oncogenic mutation of interest, if present, and determining its frequency.
19. The method of claim 18, wherein the biological sample is obtained from a cancer animal model, a patient-derived xenograft (PDX), or a human cancer patient sample.
20. The method of claim 18, wherein the quantification in step (g) is carried out by quantitative PCR (qPCR).
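As an illustration only, and not part of the claims, the two sequence constructs recited above can be sketched in Python: the library of 16 adaptors from claim 1(d)(i), in which the complementary strand carries a variable two-base “NN” at its 3′ end, and the tagging primer of claim 12, built as handle + N(15) + CTGAAG + N(14). The function names and the locus-specific flanking sequences in the usage note are hypothetical placeholders; only the adaptor strands, the handle, and the motif are taken from the claims.

```python
from itertools import product

# Sequences copied from claims 1 and 12 (extraction whitespace removed).
ADAPTOR_TOP = "CTGGGGCACGGGTAAGAAGCATTCTGTCTCTCTTCTAAGAATTCGAGCTCGGTACCCG"  # SEQ ID NO: 230
# Complementary strand of SEQ ID NOs: 231-246 without its variable 3' "NN".
ADAPTOR_BOTTOM_CORE = "CGGGTACCGAGCTCGAATTCTTAGAAGAGAGACAGAATGCTTCTTACCCGTGCCCCAG"
HANDLE = "GCAATTCCTCACGAGACCCGTCCTG"  # SEQ ID NO: 3 (25 nt)
MOTIF = "CTGAAG"  # recognition motif recited in claim 8

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def adaptor_library() -> dict:
    """Map each dinucleotide 'NN' to its (top strand, bottom strand) pair."""
    library = {}
    for pair in product("ATGC", repeat=2):
        nn = "".join(pair)
        library[nn] = (ADAPTOR_TOP, ADAPTOR_BOTTOM_CORE + nn)
    return library


def tagging_primer(five_prime: str, three_prime: str) -> str:
    """Assemble handle + N(15) + CTGAAG + N(14), the structure in claim 12.

    five_prime and three_prime are the locus-specific N(15) and N(14)
    stretches; choosing them for a real locus is outside this sketch.
    """
    if len(five_prime) != 15 or len(three_prime) != 14:
        raise ValueError("expected a 15-nt 5' stretch and a 14-nt 3' stretch")
    if set(five_prime + three_prime) - set("ACGT"):
        raise ValueError("sequences must contain only A, C, G, T")
    return HANDLE + five_prime + MOTIF + three_prime
```

For example, `tagging_primer("ACGTACGTACGTACG", "TGCATGCATGCATG")` (placeholder flanks, not a real locus) yields a 60-nt oligonucleotide, and `adaptor_library()` returns 16 strand pairs keyed by their “NN” dinucleotide. The check `reverse_complement(ADAPTOR_TOP) == ADAPTOR_BOTTOM_CORE` holds, so the two recited strands are fully complementary apart from the “NN” extension.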
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/850,186 US20230347311A1 (en) | 2020-03-05 | 2022-06-27 | A versatile method for the detection of marker-free precision genome editing and genetic variation |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202062985746P | 2020-03-05 | 2020-03-05 | |
US17/192,836 US11369936B2 (en) | 2020-03-05 | 2021-03-04 | Versatile method for the detection of marker-free precision genome editing and genetic variation |
US17/850,186 US20230347311A1 (en) | 2020-03-05 | 2022-06-27 | A versatile method for the detection of marker-free precision genome editing and genetic variation |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/192,836 Continuation US11369936B2 (en) | 2020-03-05 | 2021-03-04 | Versatile method for the detection of marker-free precision genome editing and genetic variation |
Publications (1)
Publication Number | Publication Date |
---|---|
US20230347311A1 (en) | 2023-11-02 |
Family ID=77663596
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/192,836 Active US11369936B2 (en) | 2020-03-05 | 2021-03-04 | Versatile method for the detection of marker-free precision genome editing and genetic variation |
US17/850,186 Abandoned US20230347311A1 (en) | 2020-03-05 | 2022-06-27 | A versatile method for the detection of marker-free precision genome editing and genetic variation |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17/192,836 Active US11369936B2 (en) | 2020-03-05 | 2021-03-04 | Versatile method for the detection of marker-free precision genome editing and genetic variation |
Country Status (1)
Country | Link |
---|---|
US (2) | US11369936B2 (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11369936B2 (en) * | 2020-03-05 | 2022-06-28 | The Trustees Of Columbia University In The City Of New York | Versatile method for the detection of marker-free precision genome editing and genetic variation |
CN116396952A (en) * | 2023-01-06 | 2023-07-07 | 之江实验室 | Pilot editing system and gene editing method |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11369936B2 (en) * | 2020-03-05 | 2022-06-28 | The Trustees Of Columbia University In The City Of New York | Versatile method for the detection of marker-free precision genome editing and genetic variation |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113388670B (en) | 2015-01-09 | 2024-02-02 | 生物辐射实验室股份有限公司 | Detecting genome editing |
CN113846144B (en) | 2015-03-17 | 2023-09-26 | 生物辐射实验室股份有限公司 | Detecting genome editing |
CN104894255B (en) | 2015-05-29 | 2019-09-06 | 石河子大学 | A kind of method and its application detecting inefficient genome editor based on polyacrylamide gel electrophoresis |
WO2017091811A2 (en) | 2015-11-25 | 2017-06-01 | Integrated Dna Technologies, Inc. | Methods for variant detection |
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11369936B2 (en) * | 2020-03-05 | 2022-06-28 | The Trustees Of Columbia University In The City Of New York | Versatile method for the detection of marker-free precision genome editing and genetic variation |
Also Published As
Publication number | Publication date |
---|---|
US11369936B2 (en) | 2022-06-28 |
US20210283567A1 (en) | 2021-09-16 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US11795501B2 (en) | Methods for next generation genome walking and related compositions and kits | |
JP7256748B2 (en) | Methods for targeted nucleic acid sequence enrichment with application to error-corrected nucleic acid sequencing | |
JP7229923B2 (en) | Methods for assessing nuclease cleavage | |
JP6998404B2 (en) | Method for enriching and determining the target nucleotide sequence | |
US10640820B2 (en) | Methods relating to the detection of recurrent and non-specific double strand breaks in the genome | |
Atkins et al. | Off-target analysis in gene editing and applications for clinical translation of CRISPR/Cas9 in HIV-1 therapy | |
JP6709778B2 (en) | Method for quantitative gene analysis of cell-free DNA (cfDNA) | |
KR102393608B1 (en) | Systems and methods to detect rare mutations and copy number variation | |
KR102505122B1 (en) | Methods for Detection of Genomic Copy Changes in DNA Samples | |
KR102598819B1 (en) | Genomewide unbiased identification of dsbs evaluated by sequencing (guide-seq) | |
US20230347311A1 (en) | A versatile method for the detection of marker-free precision genome editing and genetic variation | |
KR102580824B1 (en) | Method and Kit for Determining Reactivity to PARP inhibitor | |
CN114616343A (en) | Compositions and methods for analyzing cell-free DNA in methylation partition assays | |
JP2017509324A (en) | Error-free DNA sequencing | |
KR20220041874A (en) | gene mutation analysis | |
Billon et al. | Detection of marker-free precision genome editing and genetic variation through the capture of genomic signatures | |
US20230366009A1 (en) | Simultaneous amplification of dna and rna from single cells | |
US10870879B2 (en) | Method for the preparation of bar-coded primer sets | |
US20200399694A1 (en) | Methods of labelling nucleic acids | |
CN113195709A (en) | Compositions and methods for multiplex quantitative analysis of cell lineages | |
US20230095295A1 (en) | Phi29 mutants and use thereof | |
CN111379032B (en) | Method and kit for constructing sequencing library for simultaneously realizing genome copy number variation detection and gene mutation detection | |
KR20240004397A (en) | Compositions and methods for simultaneous genetic analysis of multiple libraries | |
JP2024515305A (en) | Nucleic Acid Concentration and Detection | |
Dunwell et al. | Adaptor template oligo-mediated sequencing (ATOM-Seq): a versatile and ultra-sensitive UMI-based NGS library preparation technology, for use with cfDNA and cfRNA |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| | AS | Assignment | Owner name: THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK, NEW YORK. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: CICCIA, ALBERTO; BILLON, PIERRE; REEL/FRAME: 060321/0354. Effective date: 20200403 |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
| | STPP | Information on status: patent application and granting procedure in general | Free format text: NON FINAL ACTION MAILED |
| | STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |