CN114525293B - 新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法 - Google Patents
新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法 Download PDFInfo
- Publication number
- CN114525293B CN114525293B CN202210127749.7A CN202210127749A CN114525293B CN 114525293 B CN114525293 B CN 114525293B CN 202210127749 A CN202210127749 A CN 202210127749A CN 114525293 B CN114525293 B CN 114525293B
- Authority
- CN
- China
- Prior art keywords
- protein
- cas9
- glu
- lys
- acr
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000010362 genome editing Methods 0.000 title claims abstract description 44
- 238000000034 method Methods 0.000 title claims abstract description 26
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 title abstract description 24
- 108091006086 inhibitor proteins Proteins 0.000 title abstract description 15
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 202
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 179
- 108091033409 CRISPR Proteins 0.000 claims abstract description 139
- 230000000694 effects Effects 0.000 claims abstract description 86
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 claims abstract description 31
- 230000005764 inhibitory process Effects 0.000 claims abstract description 19
- 230000001939 inductive effect Effects 0.000 claims abstract description 13
- 108020004414 DNA Proteins 0.000 claims description 56
- 230000002401 inhibitory effect Effects 0.000 claims description 55
- DODQJNMQWMSYGS-QPLCGJKRSA-N 4-[(z)-1-[4-[2-(dimethylamino)ethoxy]phenyl]-1-phenylbut-1-en-2-yl]phenol Chemical compound C=1C=C(O)C=CC=1C(/CC)=C(C=1C=CC(OCCN(C)C)=CC=1)/C1=CC=CC=C1 DODQJNMQWMSYGS-QPLCGJKRSA-N 0.000 claims description 35
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 33
- 101000910045 Streptococcus thermophilus (strain ATCC BAA-491 / LMD-9) CRISPR-associated endonuclease Cas9 2 Proteins 0.000 claims description 27
- 238000010354 CRISPR gene editing Methods 0.000 claims description 20
- 230000027455 binding Effects 0.000 claims description 19
- 238000003776 cleavage reaction Methods 0.000 claims description 16
- 230000007017 scission Effects 0.000 claims description 15
- 230000017730 intein-mediated protein splicing Effects 0.000 claims description 14
- 239000000758 substrate Substances 0.000 claims description 14
- 108020001507 fusion proteins Proteins 0.000 claims description 13
- 102000037865 fusion proteins Human genes 0.000 claims description 13
- 230000001419 dependent effect Effects 0.000 claims description 8
- 230000033228 biological regulation Effects 0.000 claims description 5
- 238000003384 imaging method Methods 0.000 claims description 4
- 230000004952 protein activity Effects 0.000 claims description 4
- 230000001580 bacterial effect Effects 0.000 claims description 3
- 239000003446 ligand Substances 0.000 claims description 3
- 241000206602 Eukaryota Species 0.000 claims description 2
- 238000002360 preparation method Methods 0.000 claims description 2
- 239000000126 substance Substances 0.000 claims description 2
- 230000006698 induction Effects 0.000 claims 2
- 230000007018 DNA scission Effects 0.000 abstract description 29
- 241000194017 Streptococcus Species 0.000 abstract description 15
- 230000007246 mechanism Effects 0.000 abstract description 13
- 230000004568 DNA-binding Effects 0.000 abstract description 10
- 241000894006 Bacteria Species 0.000 abstract description 5
- 210000003527 eukaryotic cell Anatomy 0.000 abstract description 2
- 235000018102 proteins Nutrition 0.000 description 164
- 210000004027 cell Anatomy 0.000 description 63
- 239000013612 plasmid Substances 0.000 description 62
- 238000002474 experimental method Methods 0.000 description 40
- 125000000539 amino acid group Chemical group 0.000 description 27
- 230000006870 function Effects 0.000 description 26
- 239000000499 gel Substances 0.000 description 24
- 210000005260 human cell Anatomy 0.000 description 21
- 108091027544 Subgenomic mRNA Proteins 0.000 description 18
- 238000003556 assay Methods 0.000 description 18
- 230000001404 mediated effect Effects 0.000 description 17
- 108091035539 telomere Proteins 0.000 description 17
- 102000055501 telomere Human genes 0.000 description 17
- 210000003411 telomere Anatomy 0.000 description 17
- 238000006243 chemical reaction Methods 0.000 description 15
- 230000008685 targeting Effects 0.000 description 12
- 241000588724 Escherichia coli Species 0.000 description 10
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 9
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 9
- 210000004899 c-terminal region Anatomy 0.000 description 9
- 102100028089 RING finger protein 112 Human genes 0.000 description 8
- 238000000338 in vitro Methods 0.000 description 8
- 239000002609 medium Substances 0.000 description 8
- 229920002401 polyacrylamide Polymers 0.000 description 7
- 108091005946 superfolder green fluorescent proteins Proteins 0.000 description 7
- 101001000998 Homo sapiens Protein phosphatase 1 regulatory subunit 12C Proteins 0.000 description 6
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 102100035620 Protein phosphatase 1 regulatory subunit 12C Human genes 0.000 description 5
- 229960000723 ampicillin Drugs 0.000 description 5
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 5
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 5
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 5
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 5
- 101150038500 cas9 gene Proteins 0.000 description 5
- 229960005091 chloramphenicol Drugs 0.000 description 5
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 5
- 238000010586 diagram Methods 0.000 description 5
- 230000010534 mechanism of action Effects 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 238000001890 transfection Methods 0.000 description 5
- 239000013598 vector Substances 0.000 description 5
- 102100028554 Dual specificity tyrosine-phosphorylation-regulated kinase 1A Human genes 0.000 description 4
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 description 4
- 101000838016 Homo sapiens Dual specificity tyrosine-phosphorylation-regulated kinase 1A Proteins 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 235000001014 amino acid Nutrition 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 239000013613 expression plasmid Substances 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 4
- 229930027917 kanamycin Natural products 0.000 description 4
- 229960000318 kanamycin Drugs 0.000 description 4
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 4
- 229930182823 kanamycin A Natural products 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 102000039446 nucleic acids Human genes 0.000 description 4
- 108020004707 nucleic acids Proteins 0.000 description 4
- 150000007523 nucleic acids Chemical class 0.000 description 4
- 239000002245 particle Substances 0.000 description 4
- 230000003389 potentiating effect Effects 0.000 description 4
- RXWNCPJZOCPEPQ-NVWDDTSBSA-N puromycin Chemical compound C1=CC(OC)=CC=C1C[C@H](N)C(=O)N[C@H]1[C@@H](O)[C@H](N2C3=NC=NC(=C3N=C2)N(C)C)O[C@@H]1CO RXWNCPJZOCPEPQ-NVWDDTSBSA-N 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 108010051110 tyrosyl-lysine Proteins 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 3
- 101150109930 ACR gene Proteins 0.000 description 3
- 229920000936 Agarose Polymers 0.000 description 3
- 108091079001 CRISPR RNA Proteins 0.000 description 3
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 3
- 241000701533 Escherichia virus T4 Species 0.000 description 3
- 239000006142 Luria-Bertani Agar Substances 0.000 description 3
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 3
- 241000193996 Streptococcus pyogenes Species 0.000 description 3
- 108010005233 alanylglutamic acid Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 108010092854 aspartyllysine Proteins 0.000 description 3
- 238000003766 bioinformatics method Methods 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000008045 co-localization Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 239000012091 fetal bovine serum Substances 0.000 description 3
- 238000000684 flow cytometry Methods 0.000 description 3
- 238000000799 fluorescence microscopy Methods 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000009456 molecular mechanism Effects 0.000 description 3
- 230000009437 off-target effect Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- KDELTXNPUXUBMU-UHFFFAOYSA-N 2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid boric acid Chemical compound OB(O)O.OB(O)O.OB(O)O.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KDELTXNPUXUBMU-UHFFFAOYSA-N 0.000 description 2
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 2
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 2
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 2
- 108010067770 Endopeptidase K Proteins 0.000 description 2
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- 108020005004 Guide RNA Proteins 0.000 description 2
- HTTJABKRGRZYRN-UHFFFAOYSA-N Heparin Chemical compound OC1C(NC(=O)C)C(O)OC(COS(O)(=O)=O)C1OC1C(OS(O)(=O)=O)C(O)C(OC2C(C(OS(O)(=O)=O)C(OC3C(C(O)C(O)C(O3)C(O)=O)OS(O)(=O)=O)C(CO)O2)NS(O)(=O)=O)C(C(O)=O)O1 HTTJABKRGRZYRN-UHFFFAOYSA-N 0.000 description 2
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 2
- 102100023823 Homeobox protein EMX1 Human genes 0.000 description 2
- 101001048956 Homo sapiens Homeobox protein EMX1 Proteins 0.000 description 2
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 2
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 2
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 2
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 2
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 2
- IRVONVRHHJXWTK-RWMBFGLXSA-N Met-Lys-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N IRVONVRHHJXWTK-RWMBFGLXSA-N 0.000 description 2
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 2
- 241000194024 Streptococcus salivarius Species 0.000 description 2
- 241000194020 Streptococcus thermophilus Species 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 2
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 2
- 241000723792 Tobacco etch virus Species 0.000 description 2
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 150000001413 amino acids Chemical class 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- 229940088710 antibiotic agent Drugs 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 239000012148 binding buffer Substances 0.000 description 2
- 238000002306 biochemical method Methods 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 108091005948 blue fluorescent proteins Proteins 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 238000002337 electrophoretic mobility shift assay Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 210000003714 granulocyte Anatomy 0.000 description 2
- 229960002897 heparin Drugs 0.000 description 2
- 229920000669 heparin Polymers 0.000 description 2
- 230000007124 immune defense Effects 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 2
- 238000013081 phylogenetic analysis Methods 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 229950010131 puromycin Drugs 0.000 description 2
- 238000011002 quantification Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 102220157758 rs777153568 Human genes 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 238000013207 serial dilution Methods 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- HEMHJVSKTPXQMS-UHFFFAOYSA-M sodium hydroxide Inorganic materials [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- -1 st1Cas9 Proteins 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- FAJIYNONGXEXAI-CQDKDKBSSA-N Ala-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 FAJIYNONGXEXAI-CQDKDKBSSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 1
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- GMRGSBAMMMVDGG-GUBZILKMSA-N Asn-Arg-Arg Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N GMRGSBAMMMVDGG-GUBZILKMSA-N 0.000 description 1
- MEFGKQUUYZOLHM-GMOBBJLQSA-N Asn-Arg-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MEFGKQUUYZOLHM-GMOBBJLQSA-N 0.000 description 1
- DQTIWTULBGLJBL-DCAQKATOSA-N Asn-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N DQTIWTULBGLJBL-DCAQKATOSA-N 0.000 description 1
- ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N Asn-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O ZZXMOQIUIJJOKZ-ZLUOBGJFSA-N 0.000 description 1
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- JQBCANGGAVVERB-CFMVVWHZSA-N Asn-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N JQBCANGGAVVERB-CFMVVWHZSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- ICDDSTLEMLGSTB-GUBZILKMSA-N Asn-Met-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ICDDSTLEMLGSTB-GUBZILKMSA-N 0.000 description 1
- KSGAFDTYQPKUAP-GMOBBJLQSA-N Asn-Met-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KSGAFDTYQPKUAP-GMOBBJLQSA-N 0.000 description 1
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- KOWYNSKRPUWSFG-IHPCNDPISA-N Asp-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC(=O)O)N KOWYNSKRPUWSFG-IHPCNDPISA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 108010040467 CRISPR-Associated Proteins Proteins 0.000 description 1
- 101710172824 CRISPR-associated endonuclease Cas9 Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 1
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- PEZINYWZBQNTIX-NAKRPEOUSA-N Cys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N PEZINYWZBQNTIX-NAKRPEOUSA-N 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 101710180995 Endonuclease 1 Proteins 0.000 description 1
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- OVQXQLWWJSNYFV-XEGUGMAKSA-N Gln-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(N)=O)C)C(O)=O)=CNC2=C1 OVQXQLWWJSNYFV-XEGUGMAKSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- RGAOLBZBLOJUTP-GRLWGSQLSA-N Gln-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N RGAOLBZBLOJUTP-GRLWGSQLSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- NMYFPKCIGUJMIK-GUBZILKMSA-N Gln-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N NMYFPKCIGUJMIK-GUBZILKMSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 1
- ZIYGTCDTJJCDDP-JYJNAYRXSA-N Glu-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZIYGTCDTJJCDDP-JYJNAYRXSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 1
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 1
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- CIWILNZNBPIHEU-DCAQKATOSA-N His-Arg-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O CIWILNZNBPIHEU-DCAQKATOSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- NQKRILCJYCASDV-QWRGUYRKSA-N His-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 NQKRILCJYCASDV-QWRGUYRKSA-N 0.000 description 1
- UXSATKFPUVZVDK-KKUMJFAQSA-N His-Lys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N UXSATKFPUVZVDK-KKUMJFAQSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 1
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 1
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 1
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- LKDXINHHSWFFJC-SRVKXCTJSA-N Lys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N LKDXINHHSWFFJC-SRVKXCTJSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- BVRNWWHJYNPJDG-XIRDDKMYSA-N Lys-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N BVRNWWHJYNPJDG-XIRDDKMYSA-N 0.000 description 1
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- OBCRZLRPJFNLAN-DCAQKATOSA-N Met-His-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OBCRZLRPJFNLAN-DCAQKATOSA-N 0.000 description 1
- HWROAFGWPQUPTE-OSUNSFLBSA-N Met-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCSC)N HWROAFGWPQUPTE-OSUNSFLBSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 1
- WTHGNAAQXISJHP-AVGNSLFASA-N Met-Lys-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WTHGNAAQXISJHP-AVGNSLFASA-N 0.000 description 1
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 102000007474 Multiprotein Complexes Human genes 0.000 description 1
- 108010085220 Multiprotein Complexes Proteins 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- 108010034522 NNQQ peptide Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229930040373 Paraformaldehyde Natural products 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- KCIKTPHTEYBXMG-BVSLBCMMSA-N Phe-Trp-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCIKTPHTEYBXMG-BVSLBCMMSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- BAKAHWWRCCUDAF-IHRRRGAJSA-N Pro-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CN=CN1 BAKAHWWRCCUDAF-IHRRRGAJSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 1
- VBPDMBAFBRDZSK-HOUAVDHOSA-N Thr-Asn-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VBPDMBAFBRDZSK-HOUAVDHOSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- LKEKWDJCJSPXNI-IRIUXVKKSA-N Thr-Glu-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LKEKWDJCJSPXNI-IRIUXVKKSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- IQGJAHMZWBTRIF-UBHSHLNASA-N Trp-Asp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IQGJAHMZWBTRIF-UBHSHLNASA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 1
- 108010028230 Trp-Ser- His-Pro-Gln-Phe-Glu-Lys Proteins 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- CWVHKVVKAQIJKY-ACRUOGEOSA-N Tyr-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=C(C=C2)O)N CWVHKVVKAQIJKY-ACRUOGEOSA-N 0.000 description 1
- OKDNSNWJEXAMSU-IRXDYDNUSA-N Tyr-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 OKDNSNWJEXAMSU-IRXDYDNUSA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- TYGHOWWWMTWVKM-HJOGWXRNSA-N Tyr-Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 TYGHOWWWMTWVKM-HJOGWXRNSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 1
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 1
- RHYOAUJXSRWVJT-GVXVVHGQSA-N Val-His-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RHYOAUJXSRWVJT-GVXVVHGQSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 101150063416 add gene Proteins 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 238000012742 biochemical analysis Methods 0.000 description 1
- 230000002051 biphasic effect Effects 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 101150059443 cas12a gene Proteins 0.000 description 1
- 101150098304 cas13a gene Proteins 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 230000009918 complex formation Effects 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 108010025678 empty spiracles homeobox proteins Proteins 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000012632 fluorescent imaging Methods 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010050848 glycylleucine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000006054 immunological memory Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000012160 loading buffer Substances 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000012139 lysis buffer Substances 0.000 description 1
- 230000001320 lysogenic effect Effects 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 230000002101 lytic effect Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 229920002866 paraformaldehyde Polymers 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 230000006555 post-translational control Effects 0.000 description 1
- 238000011533 pre-incubation Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 230000007026 protein scission Effects 0.000 description 1
- 230000016434 protein splicing Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 235000004400 serine Nutrition 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- ZNJHFNUEQDVFCJ-UHFFFAOYSA-M sodium;2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid;hydroxide Chemical compound [OH-].[Na+].OCCN1CCN(CCS(O)(=O)=O)CC1 ZNJHFNUEQDVFCJ-UHFFFAOYSA-M 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 239000012536 storage buffer Substances 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 235000008521 threonine Nutrition 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 210000001519 tissue Anatomy 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/005—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from viruses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/635—Externally inducible repressor mediated regulation of gene expression, e.g. tetR inducible by tetracyline
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2795/00—Bacteriophages
- C12N2795/00011—Details
- C12N2795/00022—New viral proteins or individual genes, new structural or functional aspects of known viral proteins or genes
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Microbiology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Virology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了新型CRISPR‑Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法。发明人从Streptococcus的移动元件中发现了九种新型的II‑A型Acr(AcrIIA24‑32)蛋白家族,这些Acr蛋白可以在细菌和真核细胞中有效抑制II‑A型Cas9直系同源蛋白。另外AcrIIA25.1和AcrIIA32.1可以同时抑制SpyCas9的DNA结合和DNA切割活性,显示出新颖而独特的抑制机制。基于此,发明人开发了新型的化学诱导型iAcr系统,建立了有效的化学可控的基因编辑的方法。本发明具有重要的应用价值。
Description
技术领域
本发明属于生物技术领域,具体涉及新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法。
背景技术
成簇的规律间隔的短回文重复序列(Clustered regularly interspaced shortpalindromic repeats,CRISPR)及其CRISPR相关蛋白(CRISPR-associated,Cas)构成了原核生物适应性免疫防御系统,以防御包括噬菌体和质粒在内的侵入性遗传元件。CRISPR-Cas介导的防御过程可分为三个阶段:(1)适应阶段:当噬菌体入侵时,噬菌体的一部分序列信息作为新的间隔子插入到细菌的CRISPR基因座,形成免疫记忆;(2)表达阶段:系统表达Cas基因以及加工形成成熟的CRISPRRNA(crRNA);(3)干扰阶段:Cas蛋白在crRNA的引导下识别并破坏靶向核酸,形成免疫防御。目前基于系统发生树以及作用机制,CRISPR-Cas系统分为2大类(Class 1和Class 2),6种类型。Class 1系统(包括I、III和IV型)使用多亚基Cas蛋白复合物靶向外源核酸;而Class 2系统(包括II、V和VI型)则依靠单一效应蛋白(如Cas9)来实现干扰,因而应用广泛。截止目前,Class 2CRISPR-Cas系统的多种亚型被鉴定并开发为多种强有力的基因编辑工具,主要包括CRISPR-Cas9、Cas12a(Cpf1)、Cas13a(C2c2)等。它们在科学研究、农业、畜牧业以及临床检测治疗等领域发挥着日益重要的作用。其中来自化脓链球菌(Streptococcuspyogenes)的Cas9(SpyCas9)是应用最为广泛的Cas效应物,并且已被发展成为一系列强有力的基因编辑工具。
然而,尽管近些年CRISPR基因编辑技术得到了长足发展,它仍面临着尚未解决的脱靶问题,使CRISPR基因编辑技术的应用受到了阻碍。为了解决这一问题,有些研究通过改造Cas蛋白或者向导RNA以提升靶向识别的特异性,或者通过可控的Cas蛋白或者导向RNA丰度与时间的表达来进一步降低脱靶效应。近年来,在原噬菌体或噬菌体中发现了一系列被称为anti-CRISPR(Acr)的蛋白,它们可以抑制多种CRISPR-Cas系统的活性,并且一些Acr蛋白(如AcrIIA4)被证明在细胞中可以通过抑制Cas9活性来降低脱靶效应。这些特性使得Acr可作为潜在的“开关”工具用于调控基于CRISPR的应用,因此Acr的探索及其分子机制研究对于基因编辑领域有着十分重要的意义。
Acr蛋白是噬菌体为了对抗原核生物CRISPR-Cas系统进化出来的一种策略,即通过编码蛋白质(即Acr蛋白)来失活原核生物的CRISPR-Cas系统。研究发现Acr蛋白广泛分布于噬菌体、原噬菌体以及侵入性遗传元件中,它们的分子量一般较小,约50-200氨基酸,并且展现出高度的序列多样性。迄今为止,88种不同类型的Acr蛋白家族已被鉴定出来,能够广泛抑制I、II、III、V和VI型CRISPR-Cas系统。由于Acr蛋白种类繁多,并展现出高度的序列多样性,因此很难通过常规的序列同源性比对方法来进行寻找。科学家已经利用多种生物信息以及生物化学等方法来寻找并确认Acr蛋白,包括基于Guilt-by-association、Self-targeting、Machine-learning的生物信息学方法,以及通过Phage-first和High-throughput的生物化学方法。值得注意的是,已经通过多种方法发现了23种AcrIIA家族。目前,只有一小部分AcrIIA蛋白的分子机制被详细地阐明。研究显示这些AcrIIA蛋白主要抑制CRISPR-Cas9免疫的第三个阶段——干扰阶段,并可粗略地分为三种策略:(1)干扰Cas与crRNA的装载,如AcrIIA15;(2)阻止Cas效应复合物与靶向核酸的结合,如AcrIIA4;(3)阻止Cas对靶向核酸的切割,如AcrIIA14。鉴于Acr蛋白能够通过不同的策略抑制CRISPR-Cas9系统,并以此开发相应的基因编辑调控工具,Acr分子机制的研究已成为基因编辑领域的研究热点。
目前这些CRISPR-Cas系统的部分Acr蛋白被鉴定出来,科学家们认为仍有广泛的Acr蛋白应存在并且未被发现。这不仅阻碍我们理解噬菌体与宿主之间的进化竞争方式,也妨碍发展新型的基于Acr的基因编辑方法。特别是考虑到Cas9的脱靶效应,发展Acr蛋白作为有效的基因编辑工具来调控Cas9的活性以解决Cas9应用的安全性问题将特别吸引人。然而在基因编辑领域,可用的Acr蛋白和利用Acr来调控Cas9活性的策略仍然有限。因此发现新型的CRISPR-Cas9抑制蛋白并改造Acr蛋白应用于可控的基因编辑的方法对于基因编辑领域有着重要的意义。
发明内容
本发明的目的在于发现新的CRISPR-Cas9抑制蛋白,并据此建立可控的基因编辑的方法。
本发明首先保护AcrIIA蛋白在细菌、细胞或真核生物中抑制Cas9基因编辑、调控和/或成像中的应用;
所述AcrIIA蛋白可为AcrIIA24蛋白、AcrIIA25蛋白、AcrIIA25.1蛋白、AcrIIA26蛋白、AcrIIA27蛋白、AcrIIA28蛋白、AcrIIA29蛋白、AcrIIA30蛋白、AcrIIA31蛋白、AcrIIA31.1蛋白、AcrIIA32蛋白或AcrIIA32.1;
所述AcrIIA24蛋白可为如下a1)或a2)或a3)或a4):
a1)氨基酸序列是SEQ ID NO:1所示的蛋白质;
a2)在SEQ ID NO:1所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
a3)将a1)或a2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
a4)与SEQ ID NO:1限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA25蛋白可为如下b1)或b2)或b3)或b4):
b1)氨基酸序列是SEQ ID NO:2所示的蛋白质;
b2)在SEQ ID NO:2所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
b3)将b1)或b2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
b4)与SEQ ID NO:2限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA25.1蛋白可为如下c1)或c2)或c3)或c4):
c1)氨基酸序列是SEQ ID NO:3所示的蛋白质;
c2)在SEQ ID NO:3所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
c3)将c1)或c2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
c4)与SEQ ID NO:3限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA26蛋白可为如下d1)或d2)或d3)或d4):
d1)氨基酸序列是SEQ ID NO:4所示的蛋白质;
d2)在SEQ ID NO:4所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
d3)将d1)或d2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
d4)与SEQ ID NO:4限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA27蛋白可为如下e1)或e2)或e3)或e4):
e1)氨基酸序列是SEQ ID NO:5所示的蛋白质;
e2)在SEQ ID NO:5所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
e3)将e1)或e2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
e4)与SEQ ID NO:5限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA28蛋白可为如下f1)或f2)或f3)或f4):
f1)氨基酸序列是SEQ ID NO:6所示的蛋白质;
f2)在SEQ ID NO:6所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
f3)将f1)或f2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
f4)与SEQ ID NO:6限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA29蛋白可为如下g1)或g2)或g3)或g4):
g1)氨基酸序列是SEQ ID NO:7所示的蛋白质;
g2)在SEQ ID NO:7所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
g3)将g1)或g2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
g4)与SEQ ID NO:7限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA30蛋白可为如下h1)或h2)或h3)或h4):
h1)氨基酸序列是SEQ ID NO:8所示的蛋白质;
h2)在SEQ ID NO:8所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
h3)将h1)或h2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
h4)与SEQ ID NO:8限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA31蛋白可为如下i1)或i2)或i3)或i4):
i1)氨基酸序列是SEQ ID NO:9所示的蛋白质;
i2)在SEQ ID NO:9所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
i3)将i1)或i2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
i4)与SEQ ID NO:9限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA31.1蛋白可为如下j1)或j2)或j3)或j4):
j1)氨基酸序列是SEQ ID NO:10所示的蛋白质;
j2)在SEQ ID NO:10所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
j3)将j1)或j2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
j4)与SEQ ID NO:10限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA32蛋白可为如下k1)或k2)或k3)或k4):
k1)氨基酸序列是SEQ ID NO:11所示的蛋白质;
k2)在SEQ ID NO:11所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
k3)将k1)或k2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
k4)与SEQ ID NO:11限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质;
所述AcrIIA32.1蛋白可为如下l1)或l2)或l3)或l4):
l1)氨基酸序列是SEQ ID NO:12所示的蛋白质;
l2)在SEQ ID NO:12所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
l3)将l1)或l2)所示的蛋白质经过一个或几个氨基酸残基的取代和/或缺失和/或添加得到的且具有抑制Cas9蛋白活性功能的蛋白质;
l4)与SEQ ID NO:12限定的氨基酸序列具有60%或60%以上同源性,且具有抑制Cas9蛋白活性功能的蛋白质。
其中,SEQ ID NO:1由104个氨基酸残基组成。SEQ ID NO:2由96个氨基酸残基组成。SEQ ID NO:3由101个氨基酸残基组成。SEQ ID NO:4由183个氨基酸残基组成。SEQ IDNO:5由79个氨基酸残基组成。SEQ ID NO:6由88个氨基酸残基组成。SEQ ID NO:7由241个氨基酸残基组成。SEQ ID NO:8由73个氨基酸残基组成。SEQ ID NO:9由69个氨基酸残基组成。SEQ ID NO:10由125个氨基酸残基组成。SEQ ID NO:11由141个氨基酸残基组成。SEQ IDNO:12由95个氨基酸残基组成。
为了使a1)、b1)、c1)、d1)、e1)、f1)、g1)、h1)、i1)、j 1)、k1)或l1)中的蛋白质便于纯化,可分别在SEQ ID NO:1-SEQ ID NO:12所示的蛋白质的氨基末端或羧基末端连接上如表1所示的标签。
表1.标签的序列
标签 | 残基 | 序列 |
Poly-Arg | 5-6(通常为5个) | RRRRR |
FLAG | 8 | DYKDDDDK |
Strep-tagII | 8 | WSHPQFEK |
c-myc | 10 | EQKLISEEDL |
上述a3)、b3)、c3)、d3)、e3)、f3)、g3)、h3)、i3)、j3)、k3)或l3)中的蛋白质,所述一个或几个氨基酸残基的取代和/或缺失和/或添加为不超过10个氨基酸残基的取代和/或缺失和/或添加。
上述a3)、b3)、c3)、d3)、e3)、f3)、g3)、h3)、i3)、j3)、k3)或l3)中的蛋白质可人工合成,也可先合成其编码基因,再进行生物表达得到。
上述a3)、b3)、c3)、d3)、e3)、f3)、g3)、h3)、i3)、j3)、k3)或l3)中的蛋白质的编码基因可通过将编码蛋白质的DNA序列中缺失一个或几个氨基酸残基的密码子,和/或进行一个或几个碱基对的错义突变,和/或在其5′端和/或3′端连上表1所示的标签的编码序列得到。
这里使用的术语“同源性”指与天然氨基酸序列的序列相似性。“同源性”包括与本发明的SEQ ID NO:1—SEQ ID NO:12所示的氨基酸序列具有60%或更高,或65%或更高,或70%或更高,或75%或更高,或80%或更高,或85%或更高,或90%或更高,或95%或更高同源性的氨基酸序列。同源性可以用计算机软件(序列比对软件如BLAST)进行评价。使用计算机软件,两个或多个序列之间的同一性可以用百分比(%)表示,其可以用来评价相关序列之间的同一性。
上述任一所述AcrIIA蛋白在抑制Cas9蛋白活性中的应用也属于本发明的保护范围。
上述应用中,所述抑制Cas9蛋白活性可表现为抑制Cas9蛋白与DNA结合和/或抑制Cas9蛋白对底物DNA的切割活性。
本发明还保护一种化学诱导型iAcr的系统;该系统采用所述AcrIIA25.1和/或所述AcrIIA32.1作为化学诱导型anti-CRISPR的蛋白。
上述系统中,采用所述AcrIIA25.1作为化学诱导型anti-CRISPR的蛋白具体可采用表4中的iA25.1(Intein-AcrIIA25.1(S76))进行。
上述系统中,采用所述AcrIIA32.1作为化学诱导型anti-CRISPR的蛋白具体可采用表4中的iA32.1(Intein-AcrIIA32.1(T40))进行。
上述任一所述系统均依赖4-羟基三苯氧胺调节。
上述任一所述系统在实现化学可控的的基因编辑中的应用也属于本发明的保护范围。
本发明还保护一种化学可控的基因编辑的方法;所述方法采用上述任一所述化学诱导型iAcr的系统进行基因编辑。
所述方法可依赖4-羟基三苯氧胺调节。
上述任一所述AcrIIA蛋白在制备化学诱导型iAcr的系统中的应用也属于本发明的保护范围。
上述任一所述AcrIIA蛋白在进行化学可控的基因编辑中的应用也属于本发明的保护范围。
上述任一所述应用中,所述AcrIIA蛋白具体可为所述AcrIIA25.1、所述AcrIIA32.1、表4中的iA25.1(Intein-AcrIIA25.1(S76))或iA32.1(Intein-AcrIIA32.1(T40))。
本申请的发明人通过guilt-by-association方法,以广泛分布的AcrIIA6作为起始标志物,在Streptococcus(链球菌)移动原件中发现了广泛分布的九种新型的II-A型Acr蛋白(AcrIIA24-32)和三个Aca(Acr-associated)(Aca11-13)蛋白家族。研究发现AcrIIA24-32是链球菌CRISPR-Cas9系统的特异性抑制剂,可在细菌和人类细胞中有效抑制II-A型Cas9蛋白,包括SpyCas9、Streptococcus thermophilus(嗜热链球菌)Cas9(CR1/St1Cas9 and CR3/St3Cas9)。在这些Acrs中,AcrIIA26、AcrIIA27、AcrIIA30和AcrIIA31强烈阻断Cas9与DNA的结合,而AcrIIA24阻止Cas9的DNA切割活性。值得注意的是,AcrIIA25.1和AcrIIA32.1可抑制SpyCas9的DNA结合和DNA裂解活性,显示出新颖而独特的抑制机制。本申请的发明人还开发了基于AcrIIA25.1和AcrIIA32.1的多种化学诱导型iAcr(inducibleanti-CRISPR)系统,是由Acr蛋白和4-羟基三苯氧胺(4-HT,4-hydroxytamoxifen)反应性内含肽(intein)组成的融合体。iAcr系统能够在人类细胞中有效的“翻译后”控制CRISPR-Cas9介导的基因编辑活性,显示出了强大的应用前景。本申请扩展了II-A型Acr蛋白,Acr抑制机制的多样性,以及基于Acr的化学可控基因编辑方法。本发明具有重要的应用价值。
附图说明
图1为九种新型AcrIIA蛋白家族(AcrIIA24-32)的鉴定。(A)链球菌噬菌体和原噬菌体基因组中候选acr、aca和相邻基因示意图。Acr基因以数字显示在深色阅读框中。箭头表示acr基因座之间蛋白序列同源性关系(以百分比所示)。Aca基因显示在白色阅读框内。其他相邻基因以灰色显示,一些已知基因结构信息(如AP2 DNA结合结构域)根据NCBI网站进行注释。星号(***)表示在大肠杆菌质粒干涉实验中检测的基因。(B)用于大肠杆菌中Acr活性分析设计的质粒和大肠杆菌质粒干涉实验示意图。Cas9,Acr以及pT质粒携带可兼容的复制子和抗性基因。AmpR,氨苄青霉素抗性;KanR,卡那霉素抗性;ChlR,氯霉素抗性。(C)柱状图显示Acr抑制Cas9直系同源蛋白(SpyCas9、St1Cas9和St3Cas9)的效率。#表示低于检测标准。n=3。值显示为平均值±SEM。
图2为AcrIIA24-32同源蛋白的分布。AcrIIA24-32同源蛋白(A-I)的最小进化系统发育树,蛋白质序列通过BLASTp搜索确定。本研究中所分析的Acr蛋白标记在物种的后面。
图3为AcrIIA24-32是链球菌CRISPR-Cas9系统特异性的抑制蛋白。A为噬菌斑测定实验示意图。调查AcrIIA24-32是否对多种II-A、II-B、II-C型Cas9同源蛋白具有广谱的抑制活性。大肠杆菌携带质粒表达Cas9、sgRNA和Acr,随后使用T4噬菌体进行侵染。T4噬菌体基因23设计被多种Cas9同源蛋白靶向(由菌株名称缩写)。B为噬菌斑测定实验使用10倍连续稀释的T4噬菌体(黑色圆圈)来评估不同Acr蛋白对不同类型Cas9直系同源蛋白的抑制作用,包括II-A型(SpyCas9,St1Cas9,St3Cas9和SaCas9),II-B型(FnCas9)和II-C型(NmeCas9)CRISPR-Cas9系统。
图4为AcrIIA24-32可在体外抑制II-A型Cas9同源蛋白。A为DNA切割实验示意图。在存在或者不存在Acr的情况下,Cas9 RNP复合物靶向DNA底物。B-D为在存在或者不存在Acr的情况下,使用SpyCas9(B)、St3Cas9(C)和St1Cas9(D)RNP复合物靶向线性化的质粒DNA。空心箭头表示未切割的线性化质粒DNA。实心箭头表示切割产物。凝胶图像代表三次独立的重复。
图5为AcrIIA24-32可以在人类细胞中抑制Cas9介导的基因编辑。A为T7E1实验示意图,用于检验Acr在HEK293T细胞中对Cas9蛋白的抑制活性。将编码Cas9、sgRNA和Acr的质粒共转染进人类细胞,随后通过T7E1实验进行分析。(B-G)为T7E1实验代表性凝胶图像,以显示Acrs对SpyCas9(B)、St3Cas9(D)和St1Cas9(F)的抑制活性。人类基因AAVS1(SpyCas9靶向)和DYRK1A(St1Cas9和St3Cas9靶向)的靶位点显示在每个凝胶图像的顶部,PAM以下划线突出显示。Acr的亚型和序号显示在胶图上方;A,AcrIIA;C,AcrIIC。空心箭头表示T7E1未消化条带(未编辑)。实心箭头表示T7E1消化的条带(已编辑)。编辑效率以“indel(%)”显示在每个泳道的底部。在不同Acrs存在下,定量SpyCas9(C)、St3Cas9(E)和St1Cas9(G)介导的基因编辑效率。n=3,值显示平均值±SEM。
图6为Acrs在人类细胞中采用多种策略失活Cas9。A为设计用于人类端粒定位荧光成像的质粒示意图,以研究Acrs对II-A型Cas9直系同源蛋白的抑制策略。将编码Cas9荧光蛋白的质粒、它们各自靶向端粒的sgRNA质粒和Acr蛋白的质粒(用蓝色荧光蛋白TagBFP标记)共转染进U20S细胞系。S**_(d)Cas9-(mCherry)3代表本实验中使用的三种质粒,包括Spy_dCas9-(mCherry)3、St1_dCas9-(mCherry)3和St3_dCas9-(mCherry)3。B为U2OS细胞中Cas9同源蛋白(Nme、Spy、St1和St3)靶向人类端粒的序列。C为Nme_dCas9-(sfGFP)3、Spy_dCas9-(mCherry)3和Acr质粒共转染的U2OS细胞的代表性图像。荧光通道显示在图的顶部,不同的Acr蛋白显示在每行的右侧。比例尺代表10μm。D为在存在不同Acr蛋白的情况下,对Spy_dCas9-(mCherry)3形成的端粒点的细胞进行定量。方法是通过具有Nme_dCas9-(sfGFP)3和Spy_dCas9-(mCherry)3共定位端粒成像细胞的百分比来计算。n=在每种条件下评分的细胞数。E-H为转染St3_dCas9-(mCherry)3(E)或St1_dCas9-(mCherry)3(G)以及Nme_dCas9-(sfGFP)3和不同的Acr质粒后U2OS细胞的代表性图像。比例尺代表10μm。使用与D中相同的方法在每种条件下定量St3_dCas9-(mCherry)3(F)和St1_dCas9-(mCherry)3(H)形成端粒点的细胞比例。
图7为Acrs对Cas9-sgRNARNP复合物的形成没有影响。A和B为EMSA实验以检测AcrIIA25.1、AcrIIA26、AcrIIA27、AcrIIA28和AcrIIA32.1对SpyCas9-sgRNARNP复合物形成的影响。该测定在非变性聚丙烯酰胺凝胶上进行分析,sgRNA由SYBR gold染色。不同反应成分(Acrs、SpyCas9和sgRNA)添加的顺序显示在虚线框上方。C为在加入sgRNA之前或之后加入AcrIIA24时,进行EMSA测定以分析AcrIIA24蛋白对St3Cas9与sgRNA结合的影响。D为在存在或不存在AcrIIA30或AcrIIA31的情况下,利用EMSA实验检测St1Cas9与sgRNA的结合情况。
图8为在体外Acrs表现出多种作用机制来抑制Cas9。A-F为在添加目标DNA之前或之后添加Acrs至反应体系进行EMSA实验,以分析不同Acr蛋白对Cas9 RNP与DNA结合的影响,包括AcrIIA25.1(A)、AcrIIA26(B)、AcrIIA27(C)、AcrIIA32.1(D)、AcrIIA24(E)、AcrIIA30、AcrIIA31(F)、Cas9 RNP(256nM)和Acr梯度(0.125、0.25、0.5、0.1、0.2、0.4、0.8和1.6μM)。实验在非变性凝胶上进行,并且靶向DNA被Cy5所标记。实验进行了三次独立的重复,并选择代表性的图片展示。G为通过在EMSA中添加额外的Mg2+来恢复Cas9对DNA的切割活性来进行DNA切割实验。RT,室温。H和I为进行DNA切割实验以分析Acrs在G所示的不同条件下对Cas9切割DNA活性的影响。SpyCas9 RNP(500nM)、St1Cas9 RNP(500nM)、Acrs(10μM)和底物DNA(50nM)(非目标链由Cy3标记)。实验重复3次,并显示了代表性凝胶图。J为总结本研究中鉴定的anti-CRISPR蛋白的不同抑制机制。AcrIIA26、AcrIIA27、AcrIIA30和AcrIIA31阻断Cas9与DNA的结合,而AcrIIA24抑制Cas9的DNA切割活性。值得注意的是,AcrIIA25.1和AcrIIA32.1可以同时抑制Cas9的DNA结合和DNA切割活性。
图9为Acrs抑制St3Cas9的多种机制。进行DNA切割测定以分析AcrIIA24、AcrIIA25和AcrIIA32.1在不同条件下对St3Cas9切割DNA活性的影响,如图8中G所示。St3Cas9 RNP(500nM)、Acrs(10μM)和底物DNA(50nM)(目标链由Cy5标记)。实验重复3次,并显示了代表性凝胶图。
图10为化学诱导型iAcr(inducible anti-CRISPR)系统的建立以实现化学可控的基因编辑。A为iAcr系统示意图。将配体依赖性内含肽插入Acr蛋白会使Acr失活。4-HT的结合可触发内含肽蛋白自剪接并恢复Acr活性以抑制Cas9。B为BFP-to-GFP报告系统的示意图,使用引导编辑以检查人类细胞中内含肽-Acr杂交体对Cas9的抑制活性。在存在或不存在4-HT(1μM)的情况下,将具有染色体整合BFP的HEK293T细胞(HEK293T-BFP细胞)用编码引导编辑器、靶向BFP的pegRNA和内含肽-Acr杂交体的质粒转染。在转染72小时后通过流式细胞术计算GFP阳性细胞的百分比。PE可以通过将CC替换为GT导致单个H66Y氨基酸替换,从而将BFP转换为GFP。目标序列和PAM序列分别以阴影和下划线显示。C为不同条件下BFP转换为GFP效率的比较。内含肽-Acr变体通过被intein替换的残基来识别。野生型Acrs包括C1(AcrIIC1)、A4(AcrIIA4)、A5(AcrIIA5)、A25.1(AcrIIA25.1)和A32.1(AcrIIA32.1)用作对照。n=3,值显示平均值±SEM。。D为T7E1测定的代表性凝胶图像,以显示Acr和iAcr蛋白在存在或不存在4-HT的情况下对SpyCas9的抑制活性。HEK293T细胞用Cas9(1μg)、sgRNA(0.5μg)和Acr(0.5或0.25μg)质粒转染。靶向序列显示在每个凝胶图像的顶部,PAM以下划线突出显示。编辑效率以“indel(%)”显示在每个泳道的底部。E和F为T7E1测定的代表性凝胶图像,用于研究Acr和iAcr蛋白在存在或不存在4-HT的情况下对SpyCas9(E)或St3Cas9(F)的抑制活性。HEK293T细胞用Cas9(1μg)、sgRNA(0.5μg)和Acr(0.25μg)质粒转染。人类EMX1(由SpyCas9靶向)和DYRK1A(由St3Cas9靶向)的靶向序列显示在每个凝胶的顶部,PAM以下划线突出显示。编辑效率以“indel(%)”显示在每个泳道的底部。
具体实施方式
下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。以下提供的实施例可作为本技术领域普通技术人员进行进一步改进的指南,并不以任何方式构成对本发明的限制。
下述实施例中的实验方法,如无特殊说明,均为常规方法,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。
下述实施例中Acr蛋白和Aca蛋白的名称、NCBI登录号以及氨基酸序列见表2。
表2
实施例
一、材料与方法
1、微生物
大肠杆菌菌株(TOP10或Mach1-T1,Biomed)用于质粒扩增和质粒干涉实验。大肠杆菌菌株(T7 Express,Biomed)用于蛋白质表达和噬菌斑测定分析。大肠杆菌通常(除非另有说明)在溶原性肉汤(LB)培养基中于37℃培养,含有适当的抗生素(需要时):氨苄青霉素(50μg/ml)、卡那霉素(50μg/ml)或氯霉素(25μg/ml)。
2、细胞系
HEK293T和HEK293T-BFP细胞在含10%(vol/vol)胎牛血清(FBS,Gibco)的DMEM培养基(Gibco)中置于37℃、5%CO2培养箱中培养。
U2OS细胞在含10%FBS的McCoy's 5A(改良)培养基(Gibco)中置于37℃、5%CO2培养箱中培养。
3、生物信息学分析
BLASTp程序用于在非冗余蛋白质数据库中搜索AcrIIA6(登录号:AVO22749.1)同源蛋白,以手动检查来自相邻候选基因是否是可能的acr基因和aca基因。MPI生物信息学工具包的HHpred用于从相邻基因中识别DNA结合域,其中筛选了aca候选者。使用aca基因进行BLASTp搜索以筛选acr候选基因,并通过生化分析进一步验证。
对于Acr蛋白质的同源蛋白分布以及系统发育分析,使用非冗余蛋白质数据库通过BLASTp程序获得Acrs的同源蛋白质序列。确定具有高同源性的序列(E值<0.001,查询覆盖率>70%),使用快速最小进化树方法、0.85最大序列差异和Grishin(蛋白质)距离模型生成距离树。
4、大肠杆菌中的质粒干涉实验
编码Acr蛋白的DNA序列由博迈德合成并连接到pBAD24载体中。使用CaCl2热休克程序将质粒转化到大肠杆菌中。简而言之,携带Acr质粒的大肠杆菌TOP10或Mach1-T1菌株在含有0.2%阿拉伯糖的LB培养基中培养过夜,然后作为感受态细胞。随后用25ngpT和25ngCas9质粒(具有匹配的间隔区或错配的间隔区)进行转化。在含有0.2%阿拉伯糖的LB培养基中恢复2小时后,将细胞接种在含50μg/ml氨苄青霉素、50μg/ml卡那霉素、25μg/ml氯霉素、1mM IPTG和0.2%阿拉伯糖的LB琼脂板,37℃孵育24-32小时。使用凝胶扫描仪(Tanon3500)拍摄克隆并通过ImageJ软件计数。每个Acr的抑制活性通过靶向pT和非靶向pT的Cas9质粒之间的转化比率来计算。
5、噬菌斑测定实验
大肠杆菌(T7 Express,Biomed)细胞与表达靶向噬菌体T4的Cas9-sgRNA的质粒和编码Acr蛋白的兼容质粒共转化。用于非靶向噬菌体T4的Cas9质粒和空pBAD24质粒(无Acr)均用作对照。将含有Acr和Cas9质粒的大肠杆菌在含50μg/ml氨苄青霉素和25μg/ml氯霉素的LB培养基中培养,并在37℃下生长过夜。第二天早上,将过夜培养物接种在含有抗生素的新鲜LB培养基中,并在37℃下生长两小时。随后,加入1mM IPTG以诱导Cas9蛋白的表达。两小时后,加入0.2%阿拉伯糖诱导Acr蛋白表达。再过两小时后,将200μl培养物与4ml融化的LB-琼脂(0.7%,补充10mM MgSO4)混合,倒在含有10mM MgSO4和0.2%阿拉伯糖的固体LB-琼脂(1.5%,包含1mM IPTG、50μg/ml氨苄青霉素和25μg/ml氯霉素)平板上。接下来,将噬菌体T4的10倍稀释液点在培养皿的表面。将培养板在37℃孵育过夜并使用凝胶扫描仪(Tanon3500)拍照。
6、蛋白质表达和纯化
将编码St1Cas9、St3Cas9或Acr蛋白的DNA序列整合到pET28a载体中,用于在大肠杆菌中表达蛋白(T7 Express,Biomed)。在含1mM IPTG和50μg/ml卡那霉素的LB培养基中常规诱导大肠杆菌细胞表达蛋白质,18℃下培养16小时。收获细胞并重悬于含1mM PMSF和溶菌酶的裂解缓冲液(50mM Tris-HCl,pH 8.0、10mM咪唑、0.5mM TCEP-NaOH和500mM NaCl)中。超声和离心后,细胞上清液与Ni-NTA琼脂糖(QIAGEN)结合,结合的蛋白质用500mM咪唑洗脱。Amicon Ultra离心过滤器(Millipore)用于浓缩蛋白质并将缓冲液交换到储存缓冲液(20mM HEPES-NaOH,pH 7.5、5%(v/v)甘油、300mM NaCl和1mM DTT)。对于Acr蛋白,在4℃下与烟草蚀刻病毒(TEV)蛋白酶孵育过夜后,进行第二轮Ni-NTA纯化以分离未标记的Acr蛋白。
7、体外DNA切割实验
底物DNA的名称和核苷酸序列见表3。
表3
注:*表示荧光素。
根据制造商的手册,使用体外T7转录试剂盒(Invitrogen)制备测定中的所有sgRNA,并使用线性化sgRNA质粒生成转录模板。SpyCas9蛋白购自Invitrogen。对于图4,构建了质粒pC002,并通过限制性内切核酸酶NotI(NEB)进一步线性化。使用NEBuffer3.1以及Cas9蛋白(500nM)、sgRNA(500nM)、靶DNA底物(30ng/μl)和过量的Acr蛋白(10μM)以10μl的总体积进行切割反应。Cas9蛋白在37℃下与sgRNA复合10分钟。然后加入Acr蛋白并在室温下再孵育20分钟。加入目标DNA并在37℃下孵育10分钟。加入1μl蛋白酶K(PK)终止反应。在1%琼脂糖/1×TAE(Tris-醋酸盐-EDTA)凝胶上分析产物。
对于图8和图9,荧光标记的底物DNA用于切割测定,是通过将用Cy5或Cy3标记的目标链和非目标链的合成寡核苷酸退火制备的。裂解测定的过程如图9中G所示。简而言之,将Cas9蛋白(500nM)、和sgRNA(500nM)于37℃混合10分钟以形成Cas9 RNP复合物,反应体系使用1×结合缓冲液(150mM KCl、5mM EDTA、5mM MgCl2、1mM DTT、5%(v/v)甘油、50μg/ml肝素、0.01%Tween 20和100μg/ml BSA、pH 7.6的20mM Tris-HCl)以消除Cas9的裂解活性。然后,以不同的顺序加入Acr蛋白(10μM)和底物DNA(50nM),并分别在室温下孵育20分钟。随后,加入MgCl2(10mM)以恢复Cas9的DNA切割活性,然后在室温下再孵育20分钟。通过添加Gel Loading Buffer II(Invitrogen)停止反应并在85℃下孵育6分钟。在12%变性PAGE凝胶上分析产物并通过Typhoon 7000(GE)进行观察。
8、内含肽-Acr(Intein-Acr)质粒的构建
将编码Acr蛋白(AcrIIA4、AcrIIA5、AcrIIA25.1或AcrIIA32.1)的DNA序列克隆至pCDNA3.1载体中以在人类细胞中表达。
合成内含肽37R3-2序列并将其插入Acr蛋白的所述位置,构建内含肽-Acr质粒。
内含肽-Acr质粒表达intein-Acr变体。intein-Acr变体的名称和氨基酸序列见表4。
表4
注:单下划线为内含肽37R3-2,双下划线为Acr。
9、T7核酸内切酶I(T7E1)检测
AAVS1、EMX1和DYRK1A基因座中的靶序列和PCR扩增的引物序列见表5。对于图5,根据制造商的方案,使用Lipofectamine LTX试剂(Invitrogen)转染24孔板中培养的HEK293T细胞,每孔用1μg Cas9质粒、0.5μg sgRNA质粒和0.5μgAcr质粒。对于图10,HEK293T细胞用1μg Cas9质粒、0.5μg sgRNA质粒、0.5或0.25μgAcr(iAcr)质粒在24孔板中转染,存在或不存在4-HT(1μM,Selleck S7827)处理。转染72小时后,使用DNeasy Blood and Tissue kit(QIAGEN)提取细胞的基因组DNA,并使用Q5 High-Fidelity 2X Master Mix(NEB)进行PCR扩增。在加入T7核酸内切酶I(NEB)之前,将与NEBuffer 2混合的PCR产物进行变性和退火,并在37℃下孵育。在3%琼脂糖/1×TAE凝胶中分离样品。使用ImageJ软件量化条带。
哺乳动物细胞基因组编辑效率的计算公式如下:
表5.T7E1检测中使用的引物和目标序列
10、人类细胞端粒荧光成像
对于成像,U2OS细胞在24孔板中的15mm玻璃底(Electron Microscopy Sciences)上培养。使用Lipofectamine LTX试剂(Invitrogen)根据制造商手册进行细胞转染,使用60ng每种dCas9质粒、300ng每种sgRNA质粒和300ngAcr质粒。转染24小时后,用4%多聚甲醛(Beyotime)固定细胞,并使用尼康A1R+共聚焦显微镜和60×油性物镜观察并成像。
细胞定量由一名实验者通过标记数字对来自每个条件的细胞进行编码。而另一位不知道这些条件的实验者在显微镜下观察并评分细胞。对于定量,只有同时表达TagBFP和mCherry荧光以及Nme_dCas9-(sfGFP)3形成绿色端粒点的细胞中评估是否存在或不存在共定位S**_(d)Cas9-(mCherry)3红色端粒点的情况。
11、凝胶迁移实验(electrophoretic mobility shift assays,EMSA)
RNAEMSA通过在存在或不存在Acrs(5μM)的情况下以图例中所示的顺序孵育Cas9蛋白(256nM)和sgRNA(256nM)来进行。反应在1×结合缓冲液(150mM KCl、5mM EDTA、5mMMgCl2、1mM DTT、5%(v/v)甘油、50μg/ml肝素、0.01%Tween 20、100μg/ml BSA和pH7.6的20mM Tris-HCl)中进行。sgRNAs和Acr蛋白以不同的顺序加入,并分别在37℃下孵育10分钟。在6%Tris-borate-EDTA(TBE)聚丙烯酰胺凝胶上分析样品,并使用Typhoon7000(GE)SYBRGold(Invitrogen)染色进行可视化。
发明人进行了DNAEMSA,简而言之,将Cas9-sgRNA复合物在1×结合缓冲液中37℃孵育10分钟,浓度如图中图例所示。随后,加入不同浓度的Acr蛋白并在室温下孵育20分钟,然后将20nM荧光标记的底物DNA(Cy5标记的目标链)加入混合物中,然后在37℃下孵育10分钟。在平行实验中,加入荧光标记的底物DNA并在37℃下孵育10分钟,然后加入不同浓度的Acr蛋白并在室温下孵育20分钟。样品通过双相聚丙烯酰胺(凝胶上半部为6%,凝胶下半部为12%)/0.5×TBE凝胶电泳分析。凝胶由Typhoon 7000(GE)可视化。所有测定均一式三份进行。
12、HEK293T-BFP细胞中PE介导的BFP-to-GFP基因编辑
首先,发明人在HEK293T细胞中的AAVS1基因座整合了BFP表达基因,构建了HEK293T-BFP细胞系。简而言之,HEK293T细胞用Cas9、AAVS1靶向的sgRNA质粒和供体载体转染,该载体包含同源序列和带有BFP和嘌呤霉素抗性基因。HEK293T-BFP细胞通过嘌呤霉素(2μg/ml)处理和流式细胞术(FACSAriaIII,BD)选择。
为了检查人类细胞中内含肽-Acr杂交体对Cas9的抑制活性,在HEK293T-BFP细胞中进行了PE介导的BFP-to-GFP基因编辑。编码prime editor-PE2的质粒购自Addgene(#132775)。BFP-targetingpegRNA质粒是通过合成含有靶标、sgRNA支架、PBS和RT模板的DNA序列构建的,并整合到U6-sgRNA载体中。HEK293T-BFP细胞在24孔板中培养,然后使用Lipofectamine LTX试剂(Invitrogen)每孔转染PE2(1μg)、BFP靶向的pegRNA(0.5μg)和Acr(0.25μg)质粒,存在或者不存在4-HT(1μM,Selleck S7827)。
二、实验结果
1、九种新型II-A型Acr(AcrIIA24-32)蛋白家族的发现
通过guilt-by-association原理,即Acr基因附近往往存在一个关联基因(Acr-associated gene,Aca),其编码的蛋白具备helix-turn-helix(HTH)保守结构域用以调节Acr的活性。利用BLAST程序对AcrIIA6(accession:AVO22749.1)进行信息检索(见图1中A)。结合Acr蛋白氨基酸序列大小(一般50-200个氨基酸),是否存在于噬菌体/原噬菌体区域等其他特点,建立了Acr候选基因。另外,利用大肠杆菌质粒干涉实验,设计了针对SpyCas9、St1Cas9和St3Cas9系统的Acr蛋白的筛选体系(见图1中B),快速准确鉴定候选的Acr蛋白。实验方法是将编码Cas9蛋白的基因克隆到具有gRNA的细菌表达质粒中,用作大肠杆菌中的外源CRISPR-Cas9系统。另将编码候选Acr蛋白的基因克隆到兼容的细菌表达质粒中,作为以供检测的外源anti-CRISPR系统。还构建了一个靶向质粒(命名为pT),该质粒可被外源CRISPR-Cas9系统识别并靶向。通过计算pT质粒在拥有外源Cas9以及候选Acr质粒的大肠杆菌的转化效率来鉴定Acr能否抑制Cas9蛋白。
通过大肠杆菌质粒干涉实验进行筛选,从Streptococcus(链球菌)的移动元件中发现了九种新型的II-A型Acr(AcrIIA24-32)蛋白家族和三个aca蛋白(见图1中A和B)。在检验的11个Acr蛋白(增加了两个Acr同源蛋白:AcrIIA25.1和AcrIIA32.1)中,5个Acr蛋白(AcrIIA25、AcrIIA27、AcrIIA28、AcrIIA32和AcrIIA32.1)对SpyCas9和St3Cas9展现了强健的抑制活性。另外,AcrIIA25.1和AcrIIA26能够显著抑制SpyCas9,然而AcrIIA24和AcrIIA29以不同的程度来失活St3Cas9。同时,发现AcrIIA30和AcrIIA31能够强烈抑制St1Cas9的活性(见图1中C)。然而,在实验中,来自相邻基因的其他Acr候选蛋白对SpyCas9、St1Cas9和St3Cas9没有抑制活性(数据未显示)。这些数据表明,九种新型II-A型Acr(AcrIIA24-32)蛋白家族的发现,以及通过质粒干涉实验确证这其中大多数Acr蛋白能够在大肠杆菌中强健地抑制Streptococcus(链球菌)来源的II-A型Cas9蛋白。
为了进一步确定AcrIIA24-32同源蛋白的分布,基于BLAST结果进行了全面的系统发育分析(见图2)。数据显示AcrIIA28、AcrIIA30和AcrIIA32的同源蛋白很少,仅分布在几个链球菌基因组或噬菌体中(见图2中E、G和I)。相比之下,其他Acr蛋白的同源蛋白更广泛地分布在链球菌移动元件中。而AcrIIA26、AcrIIA27、AcrIIA29和AcrIIA31蛋白家族在链球菌的多种菌株中存在,如唾液链球菌(S.salivarius)和化脓链球菌(S.pyogenes)(见图2中C、D、F和H)。此外,AcrIIA24和AcrIIA25同源蛋白不仅存在于链球菌基因组菌株中,而且还存在于各种链球菌噬菌体中(见图2中A和B)。数据表明AcrIIA24-32蛋白家族主要分布在链球菌中,表明AcrIIA24-32在链球菌噬菌体与宿主之间的军备竞赛中具有潜在的作用。
2、AcrIIA24-32是链球菌CRISPR-Cas9系统特异性的抑制蛋白
为了进一步调查AcrIIA24-32是否具备广谱的抑制活性,进行了噬菌斑测定实验。选择了多种研究明确的Cas9同源蛋白进行了验证,包括II-A型(SpyCas9、St1Cas9、St3Cas9和SaCas9)、II-B型(FnCas9)和II-C型(NmeCas9)CRISPR系统(图3中A)。大肠杆菌携带能够靶向T4噬菌体基因23的Cas9表达质粒,而不靶向T4噬菌体的Cas9质粒为对照。在存在或者不存在Acr的情况下,用10倍连续稀释的T4噬菌体对这些化转Cas9表达质粒的大肠杆菌进行侵染(图3中A)。发现AcrIIA30和AcrIIA31可以特异性的抑制St1Cas9,而其他Acr蛋白可以同时强烈抑制SpyCas9和St3Cas9(图3中B)。
此外,AcrIIA24-32对SaCas9、NmeCas9和FnCas9没有显示出可检测的抑制活性。数据表明AcrIIA24-32是链球菌CRISPR-Cas9系统特异性的抑制蛋白。
3、AcrIIA24-32可在体外抑制II-A型Cas9同源蛋白
在质粒干涉实验中,AcrIIA24-32中有些蛋白对链球菌的CRISPR-Cas9系统展现了弱的抑制活性(如AcrIIA29弱抑制St3Cas9)。然而在噬菌斑测定实验中,AcrIIA24-32能够将T4噬菌体恢复至与非靶向对照几乎相同的水平,表明这些Acr蛋白对II-A型Cas9同系物(SpyCas9、St1Cas9和St3Cas9)都具有强健的抑制活性(图3中B)。
为了消除这种差异并确认AcrIIA24-32对链球菌CRISPR-Cas9系统的抑制活性,纯化了Cas9和Acr蛋白并在体外进行了DNA切割实验。AcrIIA24-32抑制SpyCas9、St1Cas9和St3Cas9蛋白的活性,可以通过Cas9 RNP在存在或不存在Acr的情况下靶向线性化质粒DNA来测定(图4中A)。结果显示,DNA切割实验的结果与质粒干涉实验的结果基本一致。SpyCas9能够被AcrIIA25、AcrIIA25.1、AcrIIA26、AcrIIA27、AcrIIA28、AcrIIA32和AcrIIA32.1抑制(图4中B)。除AcrIIA25.1和AcrIIA30之外的其他Acr蛋白可以抑制St3Cas9,而AcrIIA24、AcrIIA25、AcrIIA27、AcrIIA31和AcrIIA32对St3Cas9显示出更强的抑制活性(图4中C)。此外,只有AcrIIA30和AcrIIA31可以在体外有效地失活St1Cas9,这与质粒干涉和噬菌斑测定实验的结果一致(图4中D)。还进行了不同条件的DNA切割实验,即在将sgRNA和目标DNA引入反应之前,将apo-Cas9和Acr蛋白预孵育(数据未显示)。发现在这两种反应条件下实验结果并没有显着差异,表明AcrIIA24-32主要作用于Cas9 RNP,并影响Cas9 RNP的下游功能。
4、AcrIIA24-32可以在人类细胞中抑制Cas9介导的基因编辑
鉴于各种Cas9直系同源蛋白在真核细胞中的广泛应用,检查了相应的Acrs是否可以在人类细胞中抑制Cas9蛋白。将编码Cas9、基因组靶向的sgRNA和Acrs的质粒共转染进HEK293T细胞。转染72小时后使用T7核酸内切酶1(T7E1)分析了基因编辑效率(图5中A)。人类内源性基因座AAVS1和DYRK1A被设计可用于链球菌的II-A型Cas9直系同源蛋白进行基因编辑。
引人注目的是,观察到在人体细胞中多个Acrs对SpyCas9、St1Cas9和St3Cas9表现出强烈的抑制作用(见表6)。AcrIIA25.1、AcrIIA26、AcrIIA27、AcrIIA28、AcrIIA32和AcrIIA32.1蛋白几乎完全抑制了SpyCas9的活性,这些蛋白的抑制水平与已知的强效抑制蛋白AcrIIA5相当,而AcrIIA25仅微弱地抑制SpyCas9(平均抑制活性为44%)(图5中B和C)。此外,St3Cas9介导的基因编辑可被多种Acrs不同程度地抑制,包括AcrIIA24(平均91%)、AcrIIA25(平均56%)、AcrIIA27(平均83%)、AcrIIA28(平均84%)、AcrIIA29(平均33%)、AcrIIA32(平均98%)和AcrIIA32.1(平均93%)(图5中D和E)。还发现AcrIIA30、AcrIIA31和AcrIIA31同源蛋白(AcrIIA31.1)可以有效抑制St1Cas9的活性(图5中F和G)。
因此,结果表明来自AcrIIA24-32的多个Acrs及其直系同源蛋白可以有效地在人类细胞中抑制链球菌Cas9同源蛋白(SpyCas9、St1Cas9和St3Cas9)。
表6
注:20%<抑制活性<50%,标记“+”;50%<抑制活性<80%,标记“++”;抑制活性>80%,标记“+++”;ND,没有确定。
5、Acrs在人类细胞中采用多种策略失活Cas9
为了研究Acr蛋白对Cas9的抑制机制,在U20S细胞系中执行了Cas9蛋白介导的端粒共定位荧光成像实验(图6中A和B)。选择AcrIIA24、AcrIIA25.1、AcrIIA26、AcrIIA27、AcrIIA30、AcrIIA31和AcrIIA32.1,考虑到这些蛋白在人类细胞中具备高的抑制Cas9的活性。Acrs的作用机制可以通过mCherry标记的链球菌Cas9蛋白对端粒DNA的成像进行评估,同时可以使用超折叠(super-folder,sf)GFP标记的NmeCas9作为端粒指示信号。
首先,研究了强效抑制蛋白对SpyCas9与DNA结合的影响,包括AcrIIA25.1、AcrIIA26、AcrIIA27和AcrIIA32.1,以及作为对照的AcrIIA4和AcrIIA5。观察到Spy_dCas9-(mCherry)3和Nme_dCas9-(sfGFP)3在共表达AcrIIA5的细胞中能够共定位到端粒上,而在表达AcrIIA4的细胞中,则消除了由Spy_dCas9-(mCherry)3形成的红色端粒点(图6中C)。然而,在表达AcrIIA25.1、AcrIIA26、AcrIIA27或AcrIIA32.1的细胞中均导致Spy_dCas9-(mCherry)3形成的红色端粒点丢失,而不影响Nme_dCas9-(sfGFP)3形成的绿色端粒点(图6中C)。随后,量化了在不同Acr蛋白存在的情况下Spy_dCas9-(mCherry)3形成端粒点的细胞数量。在表达AcrIIA5的细胞中观察到93.1%细胞存在Spy_dCas9-(mCherry)3形成的红色端粒点,而在表达AcrIIA25.1、AcrIIA26、AcrIIA27、AcrIIA32.1或AcrIIA4的细胞中很少观察到红色端粒点(图6中D)。这些结果证实了AcrIIA25.1、AcrIIA26、AcrIIA27和AcrIIA32.1可以在人类细胞中高效阻断SpyCas9对靶向DNA的结合。
随后,为了探索St1Cas9和St3Cas9的强效抑制蛋白的作用机制,使用St1Cas9和St3Cas9建立了类似的荧光成像系统,用于靶向U2OS细胞中的人类端粒(图6中A和B)。观察到St3_dCas9-(mCherry)3或St1_dCas9-(mCherry)3与Nme_dCas9-(sfGFP)3共定位到细胞中的端粒上(图6中E和G)。AcrIIA24对St3_dCas9-(mCherry)3对端粒点的共定位没有影响,而AcrIIA27、AcrIIA32.1、AcrIIA30和AcrIIA31可以明显消除由St3Cas9或St1Cas9形成的红色端粒点(图6中G和E)。另外,通过定量实验,在96.8%表达AcrIIA5的细胞和96.4%表达AcrIIA24的细胞中观察到St3_dCas9-(mCherry)3形成的红色端粒点,而在表达AcrIIA27或AcrIIA32.1的细胞中则没有观察到St3_dCas9-(mCherry)3形成的红色端粒点(图6中F)。此外,在72.4%表达AcrIIA5的细胞、3.4%表达AcrIIA30的细胞和0%表达AcrIIA31的细胞中观察到了St1_dCas9-(mCherry)3形成的红色端粒点(图6中H)。
结果表明,来自链球菌移动元件的Acrs采用多种抑制策略来抑制Cas9蛋白。AcrIIA25.1、AcrIIA26、AcrIIA27、AcrIIA30、AcrIIA31和AcrIIA32.1可有效阻断Cas9蛋白与DNA的结合。然而,AcrIIA24不阻止Cas9蛋白与DNA靶标的结合,表明AcrIIA24可能特异性地抑制Cas9对底物DNA的切割,其作用机制类似于AcrIIC1。
6、Acrs表现出多种作用机制来抑制Cas9
在体外进行了凝胶迁移实验(electrophoretic mobility shift assays,EMSA),以进一步检查这些Acrs对Cas9的作用机制。首先进行了RNA EMSA以确定这些Acrs是否影响Cas9-sgRNA RNP复合物的形成。发现AcrIIA25.1、AcrIIA26、AcrIIA27、AcrIIA28和AcrIIA32.1对SpyCas9-sgRNARNP的形成没有影响(图7中A和B)。AcrIIA24、AcrIIA30和AcrIIA31也没有阻止St3Cas9或St1Cas9与sgRNA的结合(图7中C和D)。
随后,进行了DNAEMSA实验以研究Acrs如何作用于Cas9 RNP以影响Cas9 RNP的下游功能。具备催化活性的Cas9蛋白与sgRNA混合形成Cas9 RNP复合物,同时在反应体系中加入10mM EDTA以消除Cas9对DNA的切割作用。一个荧光标记的底物DNA(Cy5标记的目标链)设计可被SpyCas9、St1Cas9和St3Cas9靶向。观察到AcrIIA25.1、AcrIIA26、AcrIIA27和AcrIIA32.1只有在添加目标DNA之前添加到反映体系时才有效地消除了SpyCas9RNP与DNA的结合(图8中A-D)。还发现AcrIIA25.1和AcrIIA32.1可以捕获与DNA结合的Cas9复合物,从而导致DNA超迁移(“super-shift”),而AcrIIA26、AcrIIA27则不能(图8中A-D)。
接下来,研究了AcrIIA32.1对St3Cas9的抑制机制,获得了与AcrIIA32.1对SpyCas9的相似的结果。还观察到,无论在添加目标DNA之前或之后添加AcrIIA24蛋白,AcrIIA24都可以捕获Cas9-DNA复合物,导致DNA超迁移(“super-shift”)(图8中E)。该结果表明AcrIIA24应特异性抑制St3Cas9对DNA底物的切割活性。还研究了AcrIIA30和AcrIIA31对St1Cas9的抑制机制,发现AcrIIA30和AcrIIA31只有在添加目标DNA之前添加到反映体系时才有效地阻止了St1Cas9 RNP与DNA的结合(图8中F)。与AcrIIA25.1和AcrIIA32.1一样,AcrIIA30可以与DNA结合的Cas9复合物结合,导致DNA超迁移(“super-shift”),而AcrIIA31则不能(图8中F)。
考虑到AcrIIA25.1、AcrIIA32.1和AcrIIA30的一些非常规行为,推测这些Acr蛋白不仅可以阻断Cas9与DNA结合,还可以抑制Cas9-sgRNA-DNA-Acr四元复合物中Cas9对DNA的切割活性。为了验证假设,通过在EMSA中添加额外的Mg2+来恢复Cas9对DNA的切割活性来进行DNA切割实验(图8中G)。检查了不同反应条件下Acr蛋白是否会影响Cas9的DNA切割活性,即在反应体系中添加目标DNA之前或之后添加Acrs。数据显示,通过在EMSA中添加额外的Mg2+、SpyCas9、St1Cas9和St3Cas9的DNA切割活性都得到了有效恢复(图8中H、I和图9)。与AcrIIC1蛋白相比,AcrIIA25.1、AcrIIA32.1和对照AcrIIA4在添加靶DNA之前添加时到反应体系时可以强烈抑制SpyCas9对DNA的切割活性。然而与AcrIIA4对照相比,AcrIIA25.1和AcrIIA32.1在添加目标DNA后添加到反应体系中,也可以使SpyCas9的DNA切割活性失活(图8中H)。结合EMSA和DNA切割分析,数据显示AcrIIA25.1和AcrIIA32.1可以抑制SpyCas9的DNA结合和DNA切割活性,这是一种不同于其他先前报道的Acrs的新型抑制机制(图8中J)。
进一步研究了AcrIIA24、AcrIIA25和AcrIIA32.1是否抑制了St3Cas9的DNA切割活性。发现AcrIIA24、AcrIIA25和AcrIIA32.1无论在添加靶DNA之前或之后添加,均对St3Cas9的DNA切割表现出强烈的抑制作用(图9)。结果表明AcrIIA24可以有效抑制St3Cas9的DNA切割步骤,而AcrIIA25和AcrIIA32.1抑制St3Cas9的DNA结合和DNA切割活性。还研究了AcrIIA30和AcrIIA31对St1Cas9的DNA切割活性的抑制机制,特别是考虑到AcrIIA30可以在EMSA中结合到Cas9-sgRNA-DNA三元复合物。发现只有在添加目标DNA之前添加AcrIIA30和AcrIIA31,才可以有效抑制St1Cas9介导的DNA切割,表明AcrIIA30和AcrIIA31都可以抑制St1Cas9的DNA结合而不影响St1Cas9的DNA切割活性(图8中I和J)。总之,结果表明,发现的这些Acrs表现出抑制Cas9的多种能力,其机制包括阻断DNA结合、DNA切割或两者兼而有之。
7、化学诱导型iAcr(inducible anti-CRISPR)系统的建立以实现化学可控的的基因编辑
由于AcrIIA25.1和AcrIIA32.1可以抑制Cas9的DNA结合和DNA切割活性,展现出强大的应用潜力可用于调节Cas9介导的基因组编辑。选择这两个蛋白作为开发化学诱导型anti-CRISPR的候选蛋白。还使用AcrIIA4和AcrIIA5作为对照,考虑到AcrIIA4只能阻断Cas9与DNA的结合,而AcrIIA5专门抑制Cas9的DNA裂解,通过Acr蛋白与配体依赖性内含肽(intein)37R3-2融合设计了内含肽-Acr杂合体,将内含肽插入Acr蛋白会导致Acr失活,而4-羟基三苯氧胺(4-hydroxytamoxifen,4-HT)与内含肽的结合可触发内含肽蛋白自剪接并恢复Acr活性以抑制Cas9(图10中A)。
通过替换Acrs的单个残基(半胱氨酸、丙氨酸、丝氨酸或苏氨酸)将4-HT响应性内含肽插入Acrs,因为内含肽蛋白剪接留下了单个半胱氨酸残基,替换这些残基将最大限度地减少产生的半胱氨酸点突变的影响(表4)。为了检查人类细胞中内含肽-Acr杂交体对Cas9的影响,设计了一个BFP-to-GFP报告系统,使用引导编辑(Prime editing,PE)进行(图10中B)。建立了功能性HEK293T细胞即在AAVS1位点整合了蓝色荧光蛋白(HEK293T-BFP细胞),流式细胞术分析显示HEK293T-BFP细胞中BFP的表达比例很高。PE可以通过将CC替换为GT将BFP转换为GFP,从而导致单个H66Y氨基酸替换(图10中B)。在存在或不存在4-HT(1μM)的情况下,用编码PE、BFP靶向的pegRNA和内含肽-Acr杂交体的质粒转染进HEK293T-BFP细胞(图10中B)。内含肽-Acr杂交体对Cas9的影响可以通过比较每种条件下BFP到GFP的编辑效率来计算。
结果表明,4-HT处理对人类细胞中PE和野生型(WT)Acrs的活性没有明显影响(图10中C)。在10个intein-Acr变体中,AcrIIA4(T28和A58)、AcrIIA5(S87)、AcrIIA25.1(S59)的活性不受4-HT的调控。尽管AcrIIA5(A68)、AcrIIA25.1(S30)和AcrIIA32.1(T24和A73)可以响应4-HT切换为激活状态以抑制Cas9,但这四种内含肽-Acr变体是弱4-HT依赖性的(平均1.8倍调节)。只有两种内含肽-Acr变体,AcrIIA25.1(S76)和AcrIIA32.1(T40)可以在4-HT存在下有效抑制PE介导的BFP-to-GFP编辑,表现出4-HT依赖性调节(分别3.3和3.6倍的变化)。然后我们将intein-AcrIIA25.1(S76)和intein-AcrIIA32.1(T40)分别称为iAcrIIA25.1(缩写为iA25.1)和iAcrIIA32.1(缩写为iA32.1)。
为了进一步检查iA25.1和iA32.1是否具备4-HT依赖性的活性以抑制Cas9介导的基因编辑,在HEK293T细胞中进行了T7E1实验。人类内源性基因座AAVS1和EMX1被设计用于SpyCas9的靶向编辑。与PE介导的BFP-to-GFP测定的结果一致,4-HT处理对SpyCas9和WTAcrs的活性没有影响(图10中D和E)。引人注目的是,观察到iA25.1和iA32.1的抑制活性是4-HT依赖性的。这两种蛋白质在没有4-HT的情况下对SpyCas9活性有轻微影响,但在4-HT存在的情况下,它们会切换到高活性状态以抑制Cas9(图10中D和E)。为了进一步检查iAcr的潜在应用,进行了T7E1分析,以研究iA32.1是否可以被4-HT激活以抑制人类细胞中St3Cas9介导的基因编辑。正如预期的那样,数据显示具有4-HT触发活性的iA32.1可以抑制人类细胞中St3Cas9介导的基因编辑(图10中F)。
因此,结果表明,这些iAcrs表现出有效的4-HT依赖性调节,以在人类细胞中对CRISPR-Cas9介导的基因组编辑进行翻译后控制。
以上对本发明进行了详述。对于本领域技术人员来说,在不脱离本发明的宗旨和范围,以及无需进行不必要的实验情况下,可在等同参数、浓度和条件下,在较宽范围内实施本发明。虽然本发明给出了特殊的实施例,应该理解为,可以对本发明作进一步的改进。总之,按本发明的原理,本申请欲包括任何变更、用途或对本发明的改进,包括脱离了本申请中已公开范围,而用本领域已知的常规技术进行的改变。按以下附带的权利要求的范围,可以进行一些基本特征的应用。
<110> 中国科学院生物物理研究所
<120> 新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法
<160>15
<170> PatentIn version 3.5
<210>1
<211>104
<212>PRT
<213>Artificial sequence
<400>1
Met Lys Lys Ala Gln Gln Leu Leu Lys Glu Ile Lys Thr Asn Asn Val
1 5 10 15
Ser Tyr Ala Ile Met Asp Glu Asp Asn Glu Ile Tyr Cys Asn Lys Glu
20 25 30
Thr Asn Asn Ile Met Asp Ile Tyr Gly Tyr Asp Asn Glu Asn Gly His
35 40 45
Phe Tyr Gly Val Tyr Gly Asp Val Val Asp Gly Gln Ile Asp Ser Arg
50 55 60
Tyr Phe Ser Asp Asp Ala Ile Leu Asn Ala Ile Asp Lys Leu Leu Phe
65 70 75 80
Leu Gly Asp Pro Ile Lys Arg Thr Asp Leu Pro Ser Asp Ala Asp Phe
85 90 95
Lys Arg Thr Phe Phe Phe Glu Glu
100
<210>2
<211>96
<212>PRT
<213>Artificial sequence
<400>2
Met Lys Asn Arg Leu Leu Gly Ser Arg Tyr Thr Asp Ala Ile Lys Asn
1 5 10 15
Asp Cys Gly Thr Ala Asn Lys Met Ser Asn Ile Tyr Asn Lys Leu Asn
20 25 30
Lys Asp Ser Leu Arg Glu Ile His Ser Ala Leu Tyr Gly Leu Leu Thr
35 40 45
Ala Gly Tyr Asp Ile Ser Asn Met Arg Asn Ile Glu Glu Leu Glu Lys
50 55 60
Tyr Val Asn Leu Lys Lys Ser Arg Gly Gln Leu Leu Asn Val Ser Ser
65 70 75 80
Asp Asp Ile Lys Leu Tyr His Lys Leu Phe Val Ile Arg Phe Gly Lys
85 90 95
<210>3
<211>101
<212>PRT
<213>Artificial sequence
<400>3
Met Lys Asn Gly His Met Ile Leu Gly Gln Arg Trp Thr Asn Ala Ile
1 5 10 15
Arg Asn Glu Thr Gly Thr Ser Ser Lys Met Phe Asn Leu Ser Lys Arg
20 25 30
Leu Tyr Asp Phe Lys Asp Asn Asn Leu Arg Glu Ile His Glu Ala Leu
35 40 45
Tyr Gly Leu Leu Arg Ala Gly Tyr Asp Ile Ser Asn Met Arg Asp Val
50 55 60
Glu Glu Leu Ala Lys Tyr Val Asp Val Lys Lys Ser His Gly Lys Leu
65 70 75 80
Leu Asp Val Thr Arg Asp Asp Ile Glu Leu Tyr His Arg Leu Phe Val
85 90 95
Ala Arg Phe Gly Lys
100
<210>4
<211>183
<212>PRT
<213>Artificial sequence
<400>4
Met Lys Lys Leu Tyr Ile Gln Thr Asn Gln Phe Ala Asn Gly Glu Leu
1 5 10 15
Gln Val Glu Asn Thr Ser Tyr Glu Leu Cys Asp Thr Phe Lys Glu Leu
20 25 30
Tyr Ser Val Ala Ser Asn Leu Val Asp Glu Asn Thr Leu Asn Phe Val
35 40 45
Glu Asp Asn Phe Ile Glu Gln Asn Tyr Lys Asp Glu Tyr Asn Gly Val
50 55 60
Tyr Glu Asn Asp Gly Asp Thr Gly Glu Phe Val Gly Gln Val Phe Glu
65 70 75 80
Asn Lys Val Thr Glu Glu Gln Phe Lys Glu Leu Leu Glu Gln Leu Glu
85 90 95
Ile Thr Tyr Thr Glu Phe Asp Pro Glu Glu Glu Leu Ala Lys Cys Ile
100 105 110
Ala Asn Lys Asn Arg Lys Ser Glu Phe Tyr Gly Asn Gly Leu Lys Val
115 120 125
Ile Ala Glu Tyr Leu Glu Ser Ile Ser His Glu Asp Ala Leu Ala Val
130 135 140
Val Thr Tyr Tyr Tyr Phe Tyr Phe Gly Phe Gly Tyr Glu Asp Gln Leu
145 150 155 160
Ile Ser Asp Ile Lys Asp Asp Gln Glu Asp Gly Val Lys Phe Glu His
165 170 175
Val Glu Arg Ser Glu Thr Ile
180
<210>5
<211>79
<212>PRT
<213>Artificial sequence
<400>5
Met Lys Thr Phe Asn Ile Ile Val Ser Glu Ser Ala Asn Leu Lys Glu
1 5 10 15
His Ser Ser Glu Leu Val Asp Asn Ile Ile Tyr Lys Val Glu Ala Lys
20 25 30
Asn Arg Arg Glu Ala Phe Lys Lys Ala Arg Glu Glu Tyr Ser Phe Ser
35 40 45
Ser Lys Trp Lys Phe Asn Met Arg Asp Leu Thr Ala Ile Asp Asn Thr
50 55 60
His Arg Arg Ala Trp Gly Arg Arg Tyr Leu Arg Val Glu Glu Ala
65 70 75
<210>6
<211>88
<212>PRT
<213>Artificial sequence
<400>6
Met Lys Thr Ile Phe Thr Lys Lys Gln Thr Glu Glu Leu Leu Asn Asp
1 5 10 15
Ile Ser Ile Glu Lys Gln Lys Glu Leu Phe Asn Ser Met His Asp Phe
20 25 30
Arg Ser Gln His Ala Lys Glu Ala Arg Ile Pro Gly Trp Ser Asp Lys
35 40 45
Tyr Asn Lys Leu Glu Lys Lys Met Leu Ser Asp Phe Glu Glu Val Thr
50 55 60
Gly Ile Lys Tyr Asp Thr Leu Glu Ser Glu Leu Ile Trp Asp Asn Leu
65 70 75 80
Ser Asn Lys Phe Leu Tyr Asn Ser
85
<210>7
<211>241
<212>PRT
<213>Artificial sequence
<400>7
Met Lys Pro Ser Gln Lys Ile Lys Trp Leu Leu Thr Ala Thr Gly Ile
1 5 10 15
Thr Thr Tyr Lys Ile Gly Lys Asp Ile Glu Glu Ser Thr Gln Phe Leu
20 25 30
Asp Arg Tyr Lys Asn Asp Pro Glu Lys Ile Gly Gly Met Arg Leu Glu
35 40 45
Lys Ala Glu Lys Leu Leu Glu Tyr Ile Ser Asn Leu Arg Gln Glu Asp
50 55 60
Val Ile Lys Thr Asn Trp Asn Asn Gln Gln Ile Leu Val Gln Asn Ser
65 70 75 80
Thr Glu Lys Glu Ile Thr Lys Tyr Phe Asn Ser Tyr Pro Phe Ala Ile
85 90 95
Lys Leu Asn Trp Ile Lys Pro His Lys Glu Met Phe Ile Val Asn Phe
100 105 110
Asp Thr Thr Ser Asn Lys Thr Phe Arg Lys Tyr Pro Tyr Asp Leu Lys
115 120 125
Asn Leu Tyr Phe Leu Val Asp Lys Asn Arg Asp Lys Met Ser Gln Phe
130 135 140
Ala Glu Phe Leu Ile Ile Cys Gly Arg Lys Ser His Phe Gly Gly Ser
145 150 155 160
Arg Val Leu Tyr Glu Val Glu Gly Lys Lys Tyr Gln Ile Ile Phe Ser
165 170 175
Ile Lys Arg Pro Ser Glu Leu Gly Pro Thr Ile Arg Leu Ile Asn Val
180 185 190
Val Glu Thr Asp Thr Tyr Arg Asp Asp Leu Val Pro Lys Ile Ser Glu
195 200 205
Glu Glu Ser Ile Leu Arg Ser Glu Asp Leu Asp Leu Lys Gly Lys Arg
210 215 220
Val Ser Ile Lys Asp Ser Glu Leu Leu Glu Leu Met Ser Ile Ile Asp
225 230 235 240
Asn
<210>8
<211>73
<212>PRT
<213>Artificial sequence
<400>8
Met Ile Thr Ala Asn Glu Ile Val Lys Thr His Lys Gly Ile Arg Leu
1 5 10 15
Val Gln Arg Lys Asn Glu Ser Trp Glu Glu Phe Lys Glu Arg Ile Gln
20 25 30
Glu Val Ile Ala Lys Gln Gly Asp Asn Tyr Leu Thr Gln Thr Lys Pro
35 40 45
Val His Glu Ile Lys Asn Lys Gly Thr Arg Asn Ile Arg Arg Thr Tyr
50 55 60
Val Asn Ile Leu Leu Lys Glu Gly Ala
65 70
<210>9
<211>69
<212>PRT
<213>Artificial sequence
<400>9
Met Val Thr Glu Glu Gln Leu Lys Glu Val Leu Val Gly Ile Tyr Glu
1 5 10 15
Thr Glu Tyr Lys Asp Glu Gln Thr Phe Glu Glu Tyr Ala Asp Gly Trp
20 25 30
Asp Phe Trp Ile Asp Lys Asp Gly Asp Ile Leu Ile Glu Gly Arg Gly
35 40 45
Met Lys Pro Ile Asp Gly Val Gln Lys Val Gly His Val Asp Asn Gly
50 55 60
Val Ile Tyr Ala Tyr
65
<210>10
<211>125
<212>PRT
<213>Artificial sequence
<400>10
Met Met Asp Ala Gln Thr Lys Ala Thr Lys Lys Trp Asn Lys Glu Asn
1 5 10 15
Arg Ala His Arg Asn Tyr Leu Ser Lys Arg Ser Ser Ala Arg Ser Phe
20 25 30
Ile Arg Asn His Ala Thr Gly Ser Asp Leu Asn Glu Leu Glu Glu Leu
35 40 45
Ile Ala Glu Arg Arg Asp Ala Leu Met Thr Asp Thr Glu Arg Lys Ile
50 55 60
Lys Glu Leu Ile Gln Asp Val Tyr Ala Asp Glu Leu Lys Glu Gln Pro
65 70 75 80
Trp Glu Glu Val Ala Asp Met Leu Asp Phe Trp Arg Asp Lys Asp Gly
85 90 95
Asp Leu Leu Ile Glu Gly Arg Gly Met Lys Pro Ile Asp Gly Val Glu
100 105 110
Tyr Val Gly Tyr Ala Asp Asn Gly Val Ile Trp Glu Arg
115 120 125
<210>11
<211>141
<212>PRT
<213>Artificial sequence
<400>11
Met Lys Asn Glu Asp Gly Lys Leu Val Val Ser Lys Ala His Phe Gly
1 5 10 15
Asn Met Ile Arg Asn Cys Gln Ser Val Glu Asp Phe Lys Lys Ser Phe
20 25 30
Glu Arg Leu Thr Tyr Tyr Ser Ser Glu Asn Arg Glu Ser Thr Val Arg
35 40 45
Gln Arg Leu Lys Ile Ala Glu Lys Glu Tyr Asn Phe Lys Ala Gly Val
50 55 60
Lys Glu Asp Leu Glu Ile Lys Asn Thr Thr Asp Lys Glu Ile Leu Asp
65 70 75 80
Tyr Val Arg Asn Glu Leu Ser Lys Ile Asp Ser Lys Lys Gln Ala Asp
85 90 95
Lys Asn Trp Ser Glu Lys Asn Arg Glu His Arg Asn Tyr Leu Ser Lys
100 105 110
Arg Ser Ser Ala Arg Ser Phe Ile Asn Asn Asn Ala Thr His Glu Asp
115 120 125
Leu Leu Glu Leu Lys Lys Ile Ile Glu Glu Lys Leu Lys
130 135 140
<210>12
<211>95
<212>PRT
<213>Artificial sequence
<400>12
Met Lys Asn Glu Ala Gly Lys Val Ile Val Ser Lys Ser Gln Tyr Ala
1 5 10 15
Asn Leu Ile Arg His Ala Arg Thr Val Glu Ala Phe Lys Asp Glu Phe
20 25 30
Asn Arg Ile Thr Tyr Tyr Asp Thr Leu Thr Thr Glu Ala Ser Glu Arg
35 40 45
Lys Arg Leu Arg Ile Ala Glu His Glu Tyr Lys Phe Arg Val Ser Met
50 55 60
Gln Lys Glu Leu His Gly Thr Glu Ala Glu Ile Thr Lys Asp Phe Gln
65 70 75 80
Ser Val Leu Asp Tyr Ile Gln Glu Gln Leu Lys Val Ile Val Lys
85 90 95
<210>13
<211>63
<212>PRT
<213>Artificial sequence
<400>13
Met Lys Val Asp Thr Lys Gln Ile Glu Trp Leu Leu Lys Asn Ala Ser
1 5 10 15
Gly Tyr Gln Ile Ser Lys Met Ser Gly Val Ala Gln Pro Thr Ile Ser
20 25 30
Ala Leu Ile Asn Lys Lys Arg Ser Ile Glu Asn Leu Thr Ile Glu Thr
35 40 45
Gly His Lys Leu Thr Glu Leu Ala Asn Gln Met Gln Lys Thr Pro
50 55 60
<210>14
<211>140
<212>PRT
<213>Artificial sequence
<400>14
Met Leu Ile Asn Thr Ser Arg Val Glu Met Val Leu Met Asn Lys Ala
1 5 10 15
Ile Ser Ala Tyr Arg Leu Ala Lys Glu Ile Gly Ile Gln Glu Ser Ser
20 25 30
Ile Ser Leu Leu Arg Asn Gly Lys Lys Asp Leu Asp Lys Leu Ser Leu
35 40 45
Glu Val Ala Met Arg Val Gln Ala Trp Ile Asp Ala Gly Asn Tyr Ser
50 55 60
Phe Ser Tyr Asp Tyr Ser Glu Leu Ile Glu Lys Leu Glu Ala Asp Ile
65 70 75 80
Glu Lys Gly Leu Ala Asp Glu Tyr Ile Tyr Ile Val Arg Gly Gly Tyr
85 90 95
Asn Glu Val Met Glu Lys Cys Met Ile Ile Asp Tyr Tyr Tyr Asp Pro
100 105 110
Glu Glu Ile Ala Glu Gly Asp Ile Ala Glu Lys Val Leu Thr Ser Ser
115 120 125
Ala Leu Ala Glu Met Glu Lys Asp Asn Glu Ile Phe
130 135 140
<210>15
<211>59
<212>PRT
<213>Artificial sequence
<400>15
Met Thr Asp Glu Leu Thr Ala Arg Gln Arg Ala Asp Lys Lys Trp Asn
1 5 10 15
Glu Lys Asn Arg Glu His Arg Asn Tyr Met Thr Lys Arg Ser Thr Ala
20 25 30
Arg Gly Phe Ile Arg Asn His Ala Thr Lys Glu Asp Leu Leu Glu Leu
35 40 45
Gln Lys Leu Ile Gln Glu Asn Leu Lys Lys Phe
50 55
Claims (9)
1. AcrIIA蛋白在细胞或真核生物中抑制Cas9基因编辑、调控和/或成像中的应用;
所述AcrIIA蛋白为AcrIIA32.1蛋白;
所述AcrIIA32.1蛋白为如下l1)或l2):
l1)氨基酸序列是SEQ ID NO:12所示的蛋白质;
l2)在SEQ ID NO:12所示的蛋白质的N端或/和C端连接标签得到的融合蛋白质;
所述Cas9蛋白为SpyCas9蛋白或/和St3Cas9蛋白。
2.根据权利要求1所述的应用,其特征在于:所述细胞为细菌细胞。
3.权利要求1中所述AcrIIA蛋白在抑制Cas9蛋白活性中的应用;所述Cas9蛋白为SpyCas9蛋白或/和St3Cas9蛋白。
4.根据权利要求3所述的应用,其特征在于:所述抑制Cas9蛋白活性表现为抑制SpyCas9蛋白与DNA结合和/或抑制St3Cas9蛋白对底物DNA的切割活性。
5.一种化学诱导型iAcr的产品,其特征在于:所述产品包括权利要求1中所述AcrIIA32.1、配体依赖性内含肽和4-羟基三苯氧胺;其中,所述AcrIIA32.1作为化学诱导型anti-CRISPR 的蛋白;所述iAcr为inducible anti-CRISPR。
6.权利要求5所述产品在实现化学可控的基因编辑中的应用;所述化学可控为由4-羟基三苯氧胺调控。
7.一种化学可控的基因编辑的方法,其特征在于:所述方法采用权利要求5所述化学诱导型iAcr的产品进行基因编辑;所述方法依赖4-羟基三苯氧胺调节。
8.权利要求1中所述AcrIIA蛋白在制备化学诱导型iAcr的系统中的应用;所述iAcr为inducible anti-CRISPR;所述化学诱导为4-羟基三苯氧胺诱导。
9.权利要求1中所述AcrIIA蛋白在进行化学可控的基因编辑中的应用;所述化学可控为由4-羟基三苯氧胺调控。
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311023039.0A CN117264987A (zh) | 2022-02-11 | 2022-02-11 | 工程化改造AcrIIA25.1蛋白进行化学可控基因编辑的方法 |
CN202311023426.4A CN117264988A (zh) | 2022-02-11 | 2022-02-11 | AcrIIA27蛋白在调控Cas9基因编辑中的应用 |
CN202311023337.XA CN117487831A (zh) | 2022-02-11 | 2022-02-11 | 鉴定AcrIIA31蛋白并应用于调控CRISPR-Cas9基因编辑的方法 |
CN202311022823.XA CN117286165A (zh) | 2022-02-11 | 2022-02-11 | 新型CRISPR-Cas9抑制蛋白AcrIIA24应用于基因编辑的方法 |
CN202210127749.7A CN114525293B (zh) | 2022-02-11 | 2022-02-11 | 新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210127749.7A CN114525293B (zh) | 2022-02-11 | 2022-02-11 | 新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法 |
Related Child Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311023426.4A Division CN117264988A (zh) | 2022-02-11 | 2022-02-11 | AcrIIA27蛋白在调控Cas9基因编辑中的应用 |
CN202311023337.XA Division CN117487831A (zh) | 2022-02-11 | 2022-02-11 | 鉴定AcrIIA31蛋白并应用于调控CRISPR-Cas9基因编辑的方法 |
CN202311022823.XA Division CN117286165A (zh) | 2022-02-11 | 2022-02-11 | 新型CRISPR-Cas9抑制蛋白AcrIIA24应用于基因编辑的方法 |
CN202311023039.0A Division CN117264987A (zh) | 2022-02-11 | 2022-02-11 | 工程化改造AcrIIA25.1蛋白进行化学可控基因编辑的方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114525293A CN114525293A (zh) | 2022-05-24 |
CN114525293B true CN114525293B (zh) | 2023-09-01 |
Family
ID=81622912
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311023337.XA Pending CN117487831A (zh) | 2022-02-11 | 2022-02-11 | 鉴定AcrIIA31蛋白并应用于调控CRISPR-Cas9基因编辑的方法 |
CN202311023426.4A Pending CN117264988A (zh) | 2022-02-11 | 2022-02-11 | AcrIIA27蛋白在调控Cas9基因编辑中的应用 |
CN202311022823.XA Pending CN117286165A (zh) | 2022-02-11 | 2022-02-11 | 新型CRISPR-Cas9抑制蛋白AcrIIA24应用于基因编辑的方法 |
CN202210127749.7A Active CN114525293B (zh) | 2022-02-11 | 2022-02-11 | 新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法 |
CN202311023039.0A Pending CN117264987A (zh) | 2022-02-11 | 2022-02-11 | 工程化改造AcrIIA25.1蛋白进行化学可控基因编辑的方法 |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311023337.XA Pending CN117487831A (zh) | 2022-02-11 | 2022-02-11 | 鉴定AcrIIA31蛋白并应用于调控CRISPR-Cas9基因编辑的方法 |
CN202311023426.4A Pending CN117264988A (zh) | 2022-02-11 | 2022-02-11 | AcrIIA27蛋白在调控Cas9基因编辑中的应用 |
CN202311022823.XA Pending CN117286165A (zh) | 2022-02-11 | 2022-02-11 | 新型CRISPR-Cas9抑制蛋白AcrIIA24应用于基因编辑的方法 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311023039.0A Pending CN117264987A (zh) | 2022-02-11 | 2022-02-11 | 工程化改造AcrIIA25.1蛋白进行化学可控基因编辑的方法 |
Country Status (1)
Country | Link |
---|---|
CN (5) | CN117487831A (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN115851775B (zh) * | 2022-10-18 | 2023-08-04 | 哈尔滨工业大学 | 一种Cas9蛋白抑制剂及其应用 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109152848A (zh) * | 2016-03-15 | 2019-01-04 | 马萨诸塞大学 | 抗-crispr化合物以及使用方法 |
WO2021108442A2 (en) * | 2019-11-27 | 2021-06-03 | The Regents Of The University Of California | Modulators of cas9 polypeptide activity and methods of use thereof |
-
2022
- 2022-02-11 CN CN202311023337.XA patent/CN117487831A/zh active Pending
- 2022-02-11 CN CN202311023426.4A patent/CN117264988A/zh active Pending
- 2022-02-11 CN CN202311022823.XA patent/CN117286165A/zh active Pending
- 2022-02-11 CN CN202210127749.7A patent/CN114525293B/zh active Active
- 2022-02-11 CN CN202311023039.0A patent/CN117264987A/zh active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109152848A (zh) * | 2016-03-15 | 2019-01-04 | 马萨诸塞大学 | 抗-crispr化合物以及使用方法 |
WO2021108442A2 (en) * | 2019-11-27 | 2021-06-03 | The Regents Of The University Of California | Modulators of cas9 polypeptide activity and methods of use thereof |
Non-Patent Citations (1)
Title |
---|
hypothetical protein [Streptococcus equinus];GenBank;《GenBank Database》;Accession No.WP_094140900.1 * |
Also Published As
Publication number | Publication date |
---|---|
CN117264987A (zh) | 2023-12-22 |
CN117286165A (zh) | 2023-12-26 |
CN117264988A (zh) | 2023-12-22 |
CN117487831A (zh) | 2024-02-02 |
CN114525293A (zh) | 2022-05-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11939604B2 (en) | Modified cascade ribonucleoproteins and uses thereof | |
US11946040B2 (en) | Adenine DNA base editor variants with reduced off-target RNA editing | |
US11427818B2 (en) | S. pyogenes CAS9 mutant genes and polypeptides encoded by same | |
CN107922931B (zh) | 热稳定的Cas9核酸酶 | |
US11913014B2 (en) | S. pyogenes Cas9 mutant genes and polypeptides encoded by same | |
AU2017341926B2 (en) | Epigenetically regulated site-specific nucleases | |
WO2014173955A1 (en) | Improved gene targeting and nucleic acid carrier molecule, in particular for use in plants | |
Song et al. | Discovery of potent and versatile CRISPR–Cas9 inhibitors engineered for chemically controllable genome editing | |
CN114525293B (zh) | 新型CRISPR-Cas9抑制蛋白及其改造应用于化学可控的基因编辑的方法 | |
US20240247255A1 (en) | Methods for modulating cas-effector activity | |
WO2022210748A1 (ja) | 新規なガイドrnaとの複合体形成能を有するポリペプチド | |
Kim et al. | H-NS is a Transcriptional Repressor of the CRISPR-Cas System in Acinetobacter baumannii ATCC 19606 | |
BASE | Adenine Dna Base Editor Variants With Reduced Off-target Rna Editing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |