CN114249803B - 一种高产精氨酸的工程菌及其构建方法与应用 - Google Patents
一种高产精氨酸的工程菌及其构建方法与应用 Download PDFInfo
- Publication number
- CN114249803B CN114249803B CN202111676693.2A CN202111676693A CN114249803B CN 114249803 B CN114249803 B CN 114249803B CN 202111676693 A CN202111676693 A CN 202111676693A CN 114249803 B CN114249803 B CN 114249803B
- Authority
- CN
- China
- Prior art keywords
- ala
- leu
- val
- gly
- arginine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 241000894006 Bacteria Species 0.000 title claims abstract description 44
- 238000010276 construction Methods 0.000 title claims abstract description 20
- 239000004475 Arginine Substances 0.000 title abstract description 44
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 title abstract description 43
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 144
- 235000018102 proteins Nutrition 0.000 claims abstract description 52
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 52
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 claims abstract description 31
- 229930064664 L-arginine Natural products 0.000 claims abstract description 30
- 235000014852 L-arginine Nutrition 0.000 claims abstract description 30
- 241000186226 Corynebacterium glutamicum Species 0.000 claims description 43
- 239000013598 vector Substances 0.000 claims description 35
- 238000000034 method Methods 0.000 claims description 25
- 150000001413 amino acids Chemical group 0.000 claims description 22
- 230000014509 gene expression Effects 0.000 claims description 18
- 150000007523 nucleic acids Chemical class 0.000 claims description 18
- 102000039446 nucleic acids Human genes 0.000 claims description 17
- 108020004707 nucleic acids Proteins 0.000 claims description 17
- 125000003729 nucleotide group Chemical group 0.000 claims description 16
- 239000002773 nucleotide Substances 0.000 claims description 15
- 239000012620 biological material Substances 0.000 claims description 14
- 238000004519 manufacturing process Methods 0.000 claims description 14
- 230000000694 effects Effects 0.000 claims description 9
- 238000002360 preparation method Methods 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 2
- 235000009697 arginine Nutrition 0.000 abstract description 42
- 230000035772 mutation Effects 0.000 abstract description 16
- 230000001580 bacterial effect Effects 0.000 abstract description 14
- 238000000855 fermentation Methods 0.000 abstract description 13
- 230000004151 fermentation Effects 0.000 abstract description 13
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 abstract description 10
- 125000000539 amino acid group Chemical group 0.000 abstract description 8
- 230000015572 biosynthetic process Effects 0.000 abstract description 6
- 239000004471 Glycine Substances 0.000 abstract description 5
- 238000009776 industrial production Methods 0.000 abstract description 4
- 238000003208 gene overexpression Methods 0.000 abstract 1
- 229960003121 arginine Drugs 0.000 description 39
- 108020004414 DNA Proteins 0.000 description 33
- 239000012634 fragment Substances 0.000 description 29
- 238000003752 polymerase chain reaction Methods 0.000 description 23
- 239000013612 plasmid Substances 0.000 description 22
- 239000002609 medium Substances 0.000 description 14
- 229930027917 kanamycin Natural products 0.000 description 13
- 229960000318 kanamycin Drugs 0.000 description 13
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 13
- 229930182823 kanamycin A Natural products 0.000 description 13
- 241000319304 [Brevibacterium] flavum Species 0.000 description 12
- 102000053602 DNA Human genes 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 244000005700 microbiome Species 0.000 description 10
- 238000001962 electrophoresis Methods 0.000 description 9
- 238000012408 PCR amplification Methods 0.000 description 8
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- QGZKDVFQNNGYKY-UHFFFAOYSA-N Ammonia Chemical compound N QGZKDVFQNNGYKY-UHFFFAOYSA-N 0.000 description 6
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 238000011084 recovery Methods 0.000 description 5
- 125000006850 spacer group Chemical group 0.000 description 5
- 239000000126 substance Substances 0.000 description 5
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 4
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 4
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 4
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- 241000880493 Leptailurus serval Species 0.000 description 4
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 4
- 229930006000 Sucrose Natural products 0.000 description 4
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 4
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 4
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 4
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 4
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 4
- 108010041407 alanylaspartic acid Proteins 0.000 description 4
- 108010070944 alanylhistidine Proteins 0.000 description 4
- 108010087924 alanylproline Proteins 0.000 description 4
- 235000001014 amino acid Nutrition 0.000 description 4
- 229940024606 amino acid Drugs 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010089804 glycyl-threonine Proteins 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 239000005720 sucrose Substances 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 108010061238 threonyl-glycine Proteins 0.000 description 4
- 238000013518 transcription Methods 0.000 description 4
- 230000035897 transcription Effects 0.000 description 4
- 238000011144 upstream manufacturing Methods 0.000 description 4
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000192041 Micrococcus Species 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 229910021529 ammonia Inorganic materials 0.000 description 3
- 239000004202 carbamide Substances 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000012528 membrane Substances 0.000 description 3
- 230000002018 overexpression Effects 0.000 description 3
- 239000000843 powder Substances 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- WOJJIRYPFAZEPF-YFKPBYRVSA-N 2-[[(2s)-2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]propanoyl]amino]acetate Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)CN WOJJIRYPFAZEPF-YFKPBYRVSA-N 0.000 description 2
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 2
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- OCDJOVKIUJVUMO-SRVKXCTJSA-N Arg-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N OCDJOVKIUJVUMO-SRVKXCTJSA-N 0.000 description 2
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 2
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 2
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 2
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 2
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 2
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 2
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 2
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 2
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 2
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 2
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 2
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 2
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 2
- 241000186146 Brevibacterium Species 0.000 description 2
- 241000186216 Corynebacterium Species 0.000 description 2
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 2
- CEZSLNCYQUFOSL-BQBZGAKWSA-N Cys-Arg-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O CEZSLNCYQUFOSL-BQBZGAKWSA-N 0.000 description 2
- LHLSSZYQFUNWRZ-NAKRPEOUSA-N Cys-Arg-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LHLSSZYQFUNWRZ-NAKRPEOUSA-N 0.000 description 2
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 241000589565 Flavobacterium Species 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 2
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 2
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 2
- XKPACHRGOWQHFH-IRIUXVKKSA-N Gln-Thr-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XKPACHRGOWQHFH-IRIUXVKKSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 2
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 2
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 2
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 2
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 2
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 2
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 2
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 2
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 2
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 2
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 2
- HQKADFMLECZIQJ-HVTMNAMFSA-N His-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N HQKADFMLECZIQJ-HVTMNAMFSA-N 0.000 description 2
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 2
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 2
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 2
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 2
- CKONPJHGMIDMJP-IHRRRGAJSA-N His-Val-His Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CKONPJHGMIDMJP-IHRRRGAJSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 2
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 2
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 2
- 241000588915 Klebsiella aerogenes Species 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 2
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 2
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 2
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 2
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 2
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 2
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 2
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 2
- VBGGTAPDGFQMKF-AVGNSLFASA-N Met-Lys-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O VBGGTAPDGFQMKF-AVGNSLFASA-N 0.000 description 2
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 2
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- MWUXSHHQAYIFBG-UHFFFAOYSA-N Nitric oxide Chemical compound O=[N] MWUXSHHQAYIFBG-UHFFFAOYSA-N 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 2
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 2
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 2
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 2
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 2
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 2
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 2
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 2
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 2
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 2
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 2
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 2
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 2
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 2
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 2
- WLDUCKSCDRIVLJ-NUMRIWBASA-N Thr-Gln-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O WLDUCKSCDRIVLJ-NUMRIWBASA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- IGGFFPOIFHZYKC-PBCZWWQYSA-N Thr-His-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O IGGFFPOIFHZYKC-PBCZWWQYSA-N 0.000 description 2
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 2
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 2
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 2
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 2
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 2
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 2
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 2
- NOOMDULIORCDNF-IRXDYDNUSA-N Tyr-Gly-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NOOMDULIORCDNF-IRXDYDNUSA-N 0.000 description 2
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 2
- AUZADXNWQMBZOO-JYJNAYRXSA-N Tyr-Pro-Arg Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 AUZADXNWQMBZOO-JYJNAYRXSA-N 0.000 description 2
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 2
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 2
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 101150063416 add gene Proteins 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 210000004027 cell Anatomy 0.000 description 2
- 229960005091 chloramphenicol Drugs 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- CVSVTCORWBXHQV-UHFFFAOYSA-N creatine Chemical compound NC(=[NH2+])N(C)CC([O-])=O CVSVTCORWBXHQV-UHFFFAOYSA-N 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 2
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 230000001737 promoting effect Effects 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000010998 test method Methods 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 2
- 108010020532 tyrosyl-proline Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 1
- 108700028369 Alleles Proteins 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 208000024172 Cardiovascular disease Diseases 0.000 description 1
- 241000186145 Corynebacterium ammoniagenes Species 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 101000978703 Escherichia virus Qbeta Maturation protein A2 Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 1
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 1
- RHGKLRLOHDJJDR-BYPYZUCNSA-N L-citrulline Chemical compound NC(=O)NCCC[C@H]([NH3+])C([O-])=O RHGKLRLOHDJJDR-BYPYZUCNSA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- 102000004364 Myogenin Human genes 0.000 description 1
- 108010056785 Myogenin Proteins 0.000 description 1
- KWYHDKDOAIKMQN-UHFFFAOYSA-N N,N,N',N'-tetramethylethylenediamine Chemical compound CN(C)CCN(C)C KWYHDKDOAIKMQN-UHFFFAOYSA-N 0.000 description 1
- RHGKLRLOHDJJDR-UHFFFAOYSA-N Ndelta-carbamoyl-DL-ornithine Natural products OC(=O)C(N)CCCNC(N)=O RHGKLRLOHDJJDR-UHFFFAOYSA-N 0.000 description 1
- 102000007999 Nuclear Proteins Human genes 0.000 description 1
- 108010089610 Nuclear Proteins Proteins 0.000 description 1
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 1
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 1
- 229930003270 Vitamin B Natural products 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 239000002518 antifoaming agent Substances 0.000 description 1
- 235000015278 beef Nutrition 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 229960002173 citrulline Drugs 0.000 description 1
- 235000013477 citrulline Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 229960003624 creatine Drugs 0.000 description 1
- 239000006046 creatine Substances 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 238000001784 detoxification Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- ZPWVASYFFYYZEW-UHFFFAOYSA-L dipotassium hydrogen phosphate Chemical compound [K+].[K+].OP([O-])([O-])=O ZPWVASYFFYYZEW-UHFFFAOYSA-L 0.000 description 1
- 229910000396 dipotassium phosphate Inorganic materials 0.000 description 1
- 235000019797 dipotassium phosphate Nutrition 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000011790 ferrous sulphate Substances 0.000 description 1
- 235000003891 ferrous sulphate Nutrition 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- ZRALSGWEFCBTJO-UHFFFAOYSA-N guanidine group Chemical group NC(=N)N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- BAUYGSIQEAFULO-UHFFFAOYSA-L iron(2+) sulfate (anhydrous) Chemical compound [Fe+2].[O-]S([O-])(=O)=O BAUYGSIQEAFULO-UHFFFAOYSA-L 0.000 description 1
- 229910000359 iron(II) sulfate Inorganic materials 0.000 description 1
- 229940124280 l-arginine Drugs 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 229940099596 manganese sulfate Drugs 0.000 description 1
- 239000011702 manganese sulphate Substances 0.000 description 1
- 235000007079 manganese sulphate Nutrition 0.000 description 1
- SQQMAOCOWKFBNP-UHFFFAOYSA-L manganese(II) sulfate Chemical compound [Mn+2].[O-]S([O-])(=O)=O SQQMAOCOWKFBNP-UHFFFAOYSA-L 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 238000009629 microbiological culture Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 229960003104 ornithine Drugs 0.000 description 1
- 230000000144 pharmacologic effect Effects 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 231100000572 poisoning Toxicity 0.000 description 1
- 230000000607 poisoning effect Effects 0.000 description 1
- 229920000768 polyamine Polymers 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000007065 protein hydrolysis Effects 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000004614 tumor growth Effects 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- 150000003672 ureas Chemical class 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 235000019156 vitamin B Nutrition 0.000 description 1
- 239000011720 vitamin B Substances 0.000 description 1
- 230000029663 wound healing Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/34—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Corynebacterium (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/74—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora
- C12N15/77—Vectors or expression systems specially adapted for prokaryotic hosts other than E. coli, e.g. Lactobacillus, Micromonospora for Corynebacterium; for Brevibacterium
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P13/00—Preparation of nitrogen-containing organic compounds
- C12P13/04—Alpha- or beta- amino acids
- C12P13/10—Citrulline; Arginine; Ornithine
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/10—Plasmid DNA
- C12N2800/101—Plasmid DNA for bacteria
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biophysics (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了一种高产精氨酸的工程菌及其构建方法与应用。本发明公开的YH66_03760突变体是将YH66_03760蛋白质第282位氨基酸残基由甘氨酸突变为精氨酸得到的蛋白质。本发明首先通过对YH66_03760基因进行单点突变得到YH66_03760G844A,然后通过对构建的YH66_03760或突变基因的过表达重组菌以及YH66_03760敲除重组菌进行发酵培养发现,YH66_03760基因或其突变基因可以调控细菌L‑精氨酸产量。本发明首次发现YH66_03760基因参与精氨酸的生物合成,对于培育符合工业化生产的高产、高质量菌种,以及精氨酸工业化生产具有重大的应用价值。
Description
技术领域
本发明属于生物技术领域,具体涉及一种高产精氨酸的工程菌及其构建方法与应用。
背景技术
L-精氨酸(L-argnine,简称L-Arg)是人体内所需半必需碱性氨基酸之一,作为一种含有胍基的碱性氨基酸,是生物体尿素循环的一种重要中间代谢物,具有多种独特的生理和药理作用,对于治疗生理功能、心血管疾病、激励免疫系统、维持婴儿的营养平衡、促进人体解毒等都具有良好的疗效,被专家称为是机体内运输和贮存氨基酸的重要载体,在肌内代谢中极为重要。它是合成胞浆蛋白和核蛋白的必需氨基酸;作为唯一的氨来源参与肌酸的合成;作为尿素循环的重要中间体,在肝脏中扮演着排除多余氨的角色,防止氨过量积累而引起中毒;它还具有调节人体免疫力的功能,可抑制肿瘤生长、促进受伤组织愈合等。并且,精氨酸是一氧化氮、尿素、鸟氨酸及肌丁胺的直接前体,是合成肌肉素的重要原素,且被用作聚胺、瓜氨酸及谷氨酰胺的合成。因此,L-精氨酸在医药、食品及化工领域方面具有重要而广泛的应用。据统计,目前全世界L-精氨酸需求量在15000吨以上,并且需求量以每年12%-15 %的速度增长。
L-精氨酸的生产方法主要有两种:一是蛋白质水解提取法,二是微生物发酵法。水解法存在操作费时,收率和产量低,成本高等问题,并且存在严重的污染而不适合大规模的生产。发酵法生产L-精氨酸工艺相对简单,且对环境友好,因此具有很大的发展潜力,成为国内外氨基酸工业的一个重要趋势。但目前国内的微生物发酵生产L-精氨酸的产量普遍较低,远远不能满足国内需求。
基因工程技术对于精氨酸高产菌选育具有重要的推动作用,如何利用基因工程技术构建适合于工业规模生产的高产重组菌对于提高L-精氨酸产量具有重要意义。
发明内容
本发明所要解决的技术问题是如何利用YH66_03760基因构建重组载体、重组菌和/或提高L-精氨酸的产量。所要解决的技术问题不限于如所描述的技术主题,本领域技术人员通过以下描述可以清楚地理解本文未提及的其它技术主题。
为了解决上述技术问题,本发明首先提供了一种YH66_03760突变体。
本发明提供的YH66_03760突变体是将YH66_03760蛋白质第282位氨基酸残基由甘氨酸突变为其他氨基酸残基得到的蛋白质;
所述YH66_03760蛋白质为如下A1)-A3)中的任一种:
A1)SEQ ID No.2所示的氨基酸序列组成的蛋白质;
A2)将A1)所示的氨基酸序列经过除第282位氨基酸残基以外的一个或几个氨基酸残基的取代和/或缺失和/或添加得到的与细菌产精氨酸相关的蛋白质;
A3)来源于细菌且与A1)或A2)具有95%以上同一性且与细菌产精氨酸相关的蛋白质。
上述A2)所述的蛋白质中,所述一个或几个氨基酸残基的取代和/或缺失和/或添加为不超过10个氨基酸残基的取代和/或缺失和/或添加。
上述A3)所述的蛋白质中,这里使用的术语“同一性”指与天然氨基酸序列的序列相似性。“同一性”包括与本发明的SEQ ID No.2所示的氨基酸序列具有95%或更高,或96%或更高,或 97%或更高,或98%或更高,或99%或更高同一性的氨基酸序列。同一性可以用肉眼或计算机软件进行评价。使用计算机软件,两个或多个序列之间的同一性可以用百分比(%)表示,其可以用来评价相关序列之间的同一性。
上述A1)或A2)或A3)所述的蛋白质可人工合成,也可先合成其编码基因,再进行生物表达得到。
进一步的,所述YH66_03760突变体是将YH66_03760蛋白质第144位氨基酸残基由甘氨酸突变为精氨酸得到的蛋白质(对应本发明实施例中的YH66_03760G884A蛋白质)。
更进一步的,所述YH66_03760突变体(YH66_03760G884A蛋白质)是SEQ ID No.4所示的氨基酸序列组成的蛋白质。
为了解决上述技术问题,本发明又提供了与YH66_03760突变体相关的生物材料。
本发明提供的与YH66_03760突变体相关的生物材料为如下B1)至B4)中的任一种:
B1)编码上述YH66_03760突变体的核酸分子;
B2)含有B1)所述核酸分子的表达盒;
B3)含有B1)所述核酸分子的重组载体、或含有B2)所述表达盒的重组载体;
B4)含有B1)所述核酸分子的重组微生物、或含有B2)所述表达盒的重组微生物、或含有B3)所述重组载体的重组微生物。
为了解决上述技术问题,本发明还提供了上述YH66_03760蛋白质或与上述YH66_03760 蛋白质相关的生物材料或上述YH66_03760突变体或与上述YH66_03760突变体相关的生物材料的新用途。
本发明提供了上述YH66_03760蛋白质或与上述YH66_03760蛋白质相关的生物材料或上述YH66_03760突变体或与上述YH66_03760突变体相关的生物材料在如下X1)至X4)中任一种中的应用:
X1)调控细菌精氨酸产量;
X2)构建产精氨酸工程菌;
X3)制备精氨酸;
与YH66_03760蛋白质相关的生物材料为如下D1)至D4)中的任一种:
D1)编码所述YH66_03760蛋白质的核酸分子;
D2)含有D1)所述核酸分子的表达盒;
D3)含有D1)所述核酸分子的重组载体、或含有D2)所述表达盒的重组载体;
D4)含有D1)所述核酸分子的重组微生物、或含有D2)所述表达盒的重组微生物、或含有D3)所述重组载体的重组微生物。
上述生物材料或应用中,B1)所述编码YH66_03760突变体的核酸分子为如下C1)或C2)中的任一种:
C1)核苷酸序列为SEQ ID No.3的DNA分子;
C2)将SEQ ID No.3所示的核苷酸序列经过修饰和/或一个或几个核苷酸的取代和/或缺失和/或添加得到的与C1)所示的DNA分子具有90%以上的同一性,且具有相同功能的DNA 分子。
D1)所述编码YH66_03760蛋白质的核酸分子为如下E1)或E2)中的任一种:
E1)核苷酸序列为SEQ ID No.1的DNA分子;
E2)将SEQ ID No.1所示的核苷酸序列经过修饰和/或一个或几个核苷酸的取代和/或缺失和/或添加得到的与E1)所示的DNA分子具有90%以上的同一性,且具有相同功能的DNA分子。
其中,SEQ ID No.1所示的DNA分子为谷氨酸棒杆菌(Corynebacteriumglutamicum) CGMCC No.20516中的YH66_03760基因,其编码的YH66_03760蛋白质的氨基酸序列如SEQ ID No.2所示。在本发明中通过引入点突变,得到SEQ ID No.3所示的YH66_03760G844A基因,其编码的YH66_03760G884A蛋白质的氨基酸序列如SEQ ID No.4所示。
本领域普通技术人员可以很容易地采用已知的方法,例如定向进化和点突变的方法,对本发明的编码YH66_03760蛋白质或YH66_03760突变体的核苷酸序列进行突变。那些经过人工修饰的,具有编码YH66_03760蛋白质或YH66_03760突变体的核苷酸序列90%或者更高同一性的核苷酸,只要编码YH66_03760蛋白质或YH66_03760突变体且具有相同功能,均是衍生于本发明的核苷酸序列并且等同于本发明的序列。
这里使用的术语“同一性”指与天然核酸序列的序列相似性。“同一性”包括与本发明的编码SEQ ID No.2或SEQ ID No.4所示的氨基酸序列组成的蛋白质的核苷酸序列具有90%或更高,或91%或更高,或92%或更高,或93%或更高,或94%或更高,或95%或更高,或96%或更高,或97%或更高,或98%或更高,或99%或更高同一性的核苷酸序列。同一性可以用肉眼或计算机软件进行评价。使用计算机软件,两个或多个序列之间的同一性可以用百分比 (%)表示,其可以用来评价相关序列之间的同一性。
所述严格条件是在2×SSC,0.1%SDS的溶液中,在68℃下杂交并洗膜2次,每次5min;或,0.5×SSC,0.1%SDS的溶液中,在68℃下杂交并洗膜2次,每次15min;或,0.1×SSPE (或0.1×SSC)、0.1%SDS的溶液中,65℃条件下杂交并洗膜。
上述生物材料或应用中,B2)所述含有编码YH66_03760突变体的核酸分子的表达盒是指能够在宿主细胞中表达YH66_03760突变体的DNA,该DNA不但可包括启动YH66_03760突变体基因转录的启动子,还可包括终止YH66_03760突变体基因转录的终止子。进一步,所述表达盒还可包括增强子序列。D2)所述含有编码YH66_03760蛋白质的核酸分子的表达盒是指能够在宿主细胞中表达YH66_03760蛋白质的DNA,该DNA不但可包括启动 YH66_03760基因转录的启动子,还可包括终止YH66_03760基因转录的终止子。进一步,所述表达盒还可包括增强子序列。
上述生物材料或应用中,B3)或D3)所述载体可为质粒、黏粒、噬菌体或病毒载体。所述质粒具体可为pK18mobsacB质粒或pXMJ19质粒。
在本发明的一个具体实施例中,所述重组载体为重组载体pK18-YH66_03760G844A。
在本发明的另一个具体实施例中,所述重组载体为重组载体pK18-YH66_03760OE或重组载体pK18-YH66_03760G844AOE。
在本发明的又一个具体实施例中,所述重组载体为重组载体pXMJ19-YH66_03760或重组载体pXMJ19-YH66_03760G844A。
上述生物材料中,B4)或D4)所述微生物可为酵母、细菌、藻或真菌。
进一步的,细菌可为任一具有产精氨酸能力的细菌,如来自短杆菌属(Brevibacterium)、棒杆菌属(Corynebacterium)、埃希氏菌属(Escherichia)、气杆菌属(Aerobacter)、微球菌属(Micrococcus)、黄杆菌属(Flavobacterium)或芽胞杆菌属(Bacillus)的细菌等。
更进一步的,所述细菌包括但不限于谷氨酸棒杆菌(Corynebacteriumglutamicum)、黄色短杆菌(Brevibacterium flavum)、乳酸发酵短杆菌(Brevibacteriumlactofermentum)、产谷氨酸微球菌(Micrococcus glutamicus)、产氨短杆菌(Brevibacterum ammoniagenes)、大肠杆菌(Escherichia coli)、产气气杆菌(Aerobacteraerogenes)。
在本发明的一个具体实施例中,所述微生物为谷氨酸棒杆菌(Corynebacteriumglutamicum)CGMCC No.20516,该菌株名称为YPARG01,已于2020年8月10日保藏于中国微生物菌种保藏管理委员会普通微生物中心(简称CGMCC,地址为:北京市朝阳区北辰西路1号院3号,中国科学院微生物研究所),保藏登记号为CGMCC No.20516。
上述应用中,调控为正调控。具体体现为当细菌中YH66_03760蛋白质或YH66_03760 突变体含量或活性提高时,所述细菌精氨酸产量提高;当细菌中YH66_03760蛋白质含量或活性降低时,所述细菌精氨酸产量降低。
为了解决上述技术问题,本发明还提供了提高YH66_03760蛋白质或YH66_03760突变体含量和/或活性的物质或提高YH66_03760基因或YH66_03760突变体基因表达量的物质的新用途。
本发明提供了提高YH66_03760蛋白质或YH66_03760突变体含量和/或活性的物质或提高YH66_03760基因或YH66_03760突变体基因表达量的物质在如下Y1)至Y4)中任一种中的应用:
Y1)提高细菌精氨酸产量;
Y2)构建产精氨酸工程菌;
Y3)制备精氨酸。
进一步的,所述提高YH66_03760基因表达量的物质可为YH66_03760基因或含有所述YH66_03760基因的重组载体。
所述提高YH66_03760突变体基因表达量的物质可为YH66_03760突变体基因或含有所述YH66_03760突变体基因的重组载体。
更进一步的,含有所述YH66_03760基因的重组载体具体可为重组载体pK18-YH66_03760OE或重组载体pXMJ19-YH66_03760。
含有所述YH66_03760突变体基因的重组载体具体可为重组载体pK18-YH66_03760G844AOE或重组载体pXMJ19-YH66_03760G844A。
为了解决上述技术问题,本发明还提供了一种提高细菌精氨酸产量的方法。
本发明提供的提高细菌精氨酸产量的方法为如下M1)或M2):
所述M1)包括如下步骤:将细菌基因组中的YH66_03760基因替换为YH66_03760突变体基因,实现细菌精氨酸产量的提高;
所述M2)包括如下步骤:提高细菌中YH66_03760蛋白质或YH66_03760突变体含量和/ 或活性,或提高细菌中YH66_03760基因或YH66_03760突变体基因表达量,实现细菌精氨酸产量的提高。
为了解决上述技术问题,本发明还提供了一种产精氨酸工程菌的构建方法。
本发明提供的产精氨酸工程菌的构建方法为如下N1)或N2):
所述N1)包括如下步骤:将细菌基因组中的YH66_03760基因替换为YH66_03760突变体基因,得到所述产精氨酸工程菌;
所述N2)包括如下步骤:提高细菌中YH66_03760蛋白质或YH66_03760突变体含量和/ 或活性,或提高细菌中YH66_03760基因或YH66_03760突变体基因表达量,得到所述产精氨酸工程菌;
上述任一所述应用或方法中,所述YH66_03760突变体具体为YH66_03760G884A蛋白质,具体为SEQ ID No.4所示的氨基酸序列组成的蛋白质。
所述YH66_03760突变体基因具体为YH66_03760G844A基因,具体为SEQ ID No.3所示的 DNA分子。
按照上述产精氨酸工程菌的构建方法构建得到的产精氨酸工程菌在制备精氨酸中的应用也属于本发明的保护范围。
为了解决上述技术问题,本发明最后提供了一种制备精氨酸的方法。
本发明提供的制备精氨酸的方法包括如下步骤:发酵培养按照上述产精氨酸工程菌的构建方法构建得到的产精氨酸工程菌,得到所述精氨酸。
所述发酵培养方法可按照现有技术中的常规试验方法进行。也可采用优化和改进后的常规试验方法进行。所述发酵培养条件如实施例中的表1所示。
上述任一所述应用或方法中,所述精氨酸具体为L-精氨酸。
本发明首先通过对YH66_03760基因进行单点突变得到YH66_03760G844A基因,然后通过对构建的YH66_03760或突变基因的过表达重组菌以及YH66_03760敲除重组菌进行发酵培养发现,YH66_03760基因或其突变基因可以调控细菌L-精氨酸产量。本发明首次发现YH66_03760基因参与精氨酸的生物合成,对于培育符合工业化生产的高产、高质量菌种,以及精氨酸工业化生产具有重大的应用价值。
具体实施方式
下面结合具体实施方式对本发明进行进一步的详细描述,给出的实施例仅为了阐明本发明,而不是为了限制本发明的范围。以下提供的实施例可作为本技术领域普通技术人员进行进一步改进的指南,并不以任何方式构成对本发明的限制。
下述实施例中的实验方法,如无特殊说明,均为常规方法,按照本领域内的文献所描述的技术或条件或者按照产品说明书进行。下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。
实施例1、构建包含点突变的YH66_03760基因编码区的重组载体
依据NCBI公布的黄色短杆菌(Brevibacterium flavum)ATCC15168基因组序列,设计并合成两对扩增YH66_03760基因编码区的引物,以等位基因置换的方式在谷氨酸棒杆菌(Corynebacterium glutamicum)CGMCC 20516(经测序确认该菌株染色体上保留有野生型的 YH66_03760基因)的YH66_03760基因编码区(SEQ ID No.1)中引入点突变,所述点突变为将YH66_03760基因的核苷酸序列(SEQ ID No.1)中的第844位鸟嘌呤(G)突变为腺嘌呤(A),得到SEQ ID No.3所示的DNA分子(突变的YH66_03760基因,名称为 YH66_03760G844A)。
其中,SEQ ID No.1所示的DNA分子编码氨基酸序列为SEQ ID No.2的蛋白质(所述蛋白质名称为蛋白质YH66_03760)。
SEQ ID No.3所示的DNA分子编码氨基酸序列为SEQ ID No.4的突变蛋白质(所述突变蛋白质名称为YH66_03760G884A)。所述突变蛋白质YH66_03760G884A氨基酸序列(SEQ IDNo.4)中的第282位精氨酸(R)由甘氨酸(G)突变而来。
采用NEBuilder重组技术进行基因定点突变,引物设计如下(上海invitrogen公司合成),加粗字体的碱基为突变位置:
P1:5'-CAGTGCCAAGCTTGCATGCCTGCAGGTCGACTCTAGATTGGTACTGAAGGCT CACC-3',
P2:5'-AAGAATTCCACGGTTCTCGCGCCCTGGTAACCAATGG-3',
P3:5'-CCATTGGTTACCAGGGCGCGAGAACCGTGGAATTCTT-3',
P4:5'-CAGCTATGACCATGATTACGAATTCGAGCTCGGTACCCGGCAGATCCTTGAT GTTGGG-3'。
构建方法:以黄色短杆菌ATCC15168为模板,分别采用引物P1/P2和P3/P4进行PCR扩增,获得两条分别带有突变碱基,大小分别为716bp和735bp的YH66_03760基因编码区的DNA片段(YH66_03760Up和YH66_03760Down)。
PCR扩增体系为:10×Ex Taq Buffer 5μL,dNTP Mixture(各2.5mM)4μL,Mg2+(25mM) 4μL,引物(10pM)各2μL,Ex Taq(5U/μL)0.25μL,总体积50μL。
PCR扩增反应程序为:94℃预变性5min,(94℃变性30s;52℃退火30s;72℃延伸40s;30个循环),72℃过度延伸10min。
将上述两条DNA片段(YH66_03760Up和YH66_03760Down)经琼脂糖凝胶电泳分离纯化后,与经过酶切(Xbal I/BamH I)后纯化的pK18mobsacB质粒(购自Addgene公司,用 XbalI/BamH I酶切)用NEBuilder酶(购自NEB公司)50℃连接30min,连接产物转化后长出的单克隆经PCR鉴定获得阳性重组载体pK18-YH66_03760G844A,该重组载体上含有卡那霉素抗性标记。将酶切正确的重组载体pK18-YH66_03760G844A送测序公司测序鉴定,并将含有正确点突变(G-A)的重组载体pK18-YH66_03760G844A保存备用。
重组载体pK18-YH66_03760G844A中YH66_03760G844A Up-Down DNA片段(YH66_03760Up-Down,SEQ ID No.5)大小为1414bp,由于含有突变位点,导致菌株谷氨酸棒杆菌CGMCC20516中YH66_03760基因编码区的第844位鸟嘌呤(G)变为腺嘌呤(A),最终导致编码蛋白的第282位甘氨酸(G)变为精氨酸(R)。
所述重组载体pK18-YH66_03760G844A是将pK18mobsacB载体的Xbal I和/BamH I识别位点间的片段(小片段)替换为序列表中SEQ ID No.5的第37-1376位所示的DNA片段,且保持pK18mobsacB载体的其他序列不变,得到的重组载体。
所述重组载体pK18-YH66_03760G844A含有SEQ ID No.3所示的突变基因 YH66_03760G844A的第181-1520位所示的DNA分子。
实施例2、构建包含基因YH66_03760G844A的工程菌株YPR-007
构建方法:将实施例1中的等位替换质粒(pK18-YH66_03760G844A)通过电击转化入谷氨酸棒杆菌(Corynebacterium glutamicum)CGMCC 20516中后,在培养基中进行培养,培养基配方如下:溶剂为水,溶质及其浓度分别如下:蔗糖10g/L、多聚蛋白胨10g/L、牛肉膏10g/L、酵母粉5g/L、尿素2g/L、氯化钠2.5g/L、琼脂粉20g/L,pH7.0;培养条件如下: 32℃。对培养产生的单菌落分别通过实施例1中的引物P1和通用引物M13R进行鉴定,能扩增出1421bp大小条带的菌株为阳性菌株。将阳性菌株在含15%蔗糖的培养基上培养,对培养产生的单菌落分别在含有卡那霉素和不含卡那霉素的培养基上培养,选择在不含卡那霉素的培养基上生长,而在含卡那霉素的培养基上不生长的菌株进一步采用如下引物(上海invitrogen公司合成)进行PCR鉴定:
P5:5'-GTCACCAAAAAGTTGTCGAA-3';
P6:5'-GGTTGCACCAGCAGCCAAGC-3'。
将得到的PCR扩增产物(260bp)通过95℃高温变性10min、冰浴5min后进行SSCP(Single-Strand Conformation Polymorphis)电泳(以质粒pK18-YH66_03760G844A扩增片段为阳性对照,黄色短杆菌ATCC15168扩增片段为阴性对照,水作为空白对照),SSCP电泳的PAGE的制备成分及用量分别如下:40%丙烯酰胺(丙烯酰胺终浓度为8%)8mL、ddH2O 26mL、甘油4mL、10×TBE 2mL、TEMED 40μL、10%APS 600μL,电泳条件如下:将电泳槽置入冰中,使用1×TBE缓冲液,电压120V,电泳时间10h。由于片段结构不同,电泳位置不同,因此片段电泳位置与阴性对照片段位置不一致且与阳性对照片段位置一致的菌株为等位替换成功的菌株。再次通过引物P5/P6 PCR扩增阳性菌株YH66_03760基因片段,并连接到PMD19-T载体进行测序,通过序列比对,碱基序列发生突变(G-A)的菌株为等位替换成功的阳性菌株,并被命名为YPR-007。
重组菌YPR-007是将谷氨酸棒杆菌CGMCC 20516基因组中的YH66_03760基因进行单点突变(对应于SEQ ID No.1所示YH66_03760基因的第844位的碱基G突变为碱基A),且保持谷氨酸棒杆菌CGMCC 20516的基因组中的其它序列不变得到的重组菌。
实施例3、构建基因组上过表达YH66_03760基因或YH66_03760G844A基因的工程菌株YPR-008和YPR-009
依据NCBI公布的黄色短杆菌ATCC15168基因组序列,设计并合成三对扩增上下游同源臂片段及YH66_03760或YH66_03760G844A基因编码区及启动子区的引物,以同源重组的方式在谷氨酸棒杆菌CGMCC 20516中引入YH66_03760或YH66_03760G844A基因。
引物设计如下(上海invitrogen公司合成):
P7:5'-CAGTGCCAAGCTTGCATGCCTGCAGGTCGACTCTAGCATGACGGCTGACTGG ACTC-3';
P8:5'-AAAATCTAAGACTCGGAAAAAATCGGACTCCTTAAATGGG-3';
P9:5'-CCCATTTAAGGAGTCCGATTTTTTCCGAGTCTTAGATTTT-3';
P10:5'-CTATGTGAGTAGTCGATTTATTAGGAAACGACGACGATCA-3';
P11:5'-TGATCGTCGTCGTTTCCTAATAAATCGACTACTCACATAG-3';
P12:5'-CAGCTATGACCATGATTACGAATTCGAGCTCGGTACCCTGCATAAGAAACAA CCACTT-3'。
构建方法:分别以黄色短杆菌ATCC15168或YPR-007为模板,分别采用引物P7/P8、P9/P10和P11/P12进行PCR扩增,获得上游同源臂片段806bp(对应于谷氨酸棒杆菌CGMCC20516YH66_03350基因及其启动子区或者是与上一个基因的间隔区,序列如SEQ ID No.6所示),YH66_03760基因及其启动子片段3858bp(序列如SEQ ID No.7所示)或YH66_03760G844A基因及其启动子片段3858bp(序列如SEQ ID No.8所示)及下游同源臂片段783bp(对应于谷氨酸棒杆菌CGMCC 20516YH66_03355基因及其与YH66_03350基因的部分间隔区,序列如SEQID No.9所示)。PCR反应结束后,对每个模板扩增得到的3个片段采用柱式DNA凝胶回收试剂盒分别进行电泳回收。回收后的3个片段与经过Xbal I/BamH I酶切后纯化的 pK18mobsacB质粒(购自Addgene公司)用NEBuilder酶(购自NEB公司)50℃连接30min,连接产物转化后长出的单克隆用M13引物经PCR鉴定获得阳性整合质粒(重组载体),分别为pK18-YH66_03760OE、pK18-YH66_03760G844AOE,该阳性整合质粒上含有卡那霉素抗性标记,可以通过卡那霉素筛选获得质粒整合到基因组上的重组子。
PCR反应体系为:10×Ex Taq Buffer 5μL,dNTP Mixture(各2.5mM)4μL,Mg2+(25mM) 4μL,引物(10pM)各2μL,Ex Taq(5U/μL)0.25μL,总体积50μL。
PCR反应程序为:94℃预变性5min,94℃变性30s;52℃退火30s;72℃延伸60s (30个循环),72℃过度延伸10min。
将测序正确的整合质粒(pK18-YH66_03760OE、pK18-YH66_03760G844AOE)分别电转化入谷氨酸棒杆菌CGMCC 20516,在培养基中进行培养,对培养产生的单菌落通过P13/P14引物进行PCR鉴定,PCR扩增出大小2245bp的片段的菌株为阳性菌株,扩增不到片段的菌株为原菌。将阳性菌株在含15%蔗糖的培养基上培养,对培养产生的单菌落进一步采用P15/P16 引物进行PCR鉴定,PCR扩增出大小为3453bp的片段的菌株为YH66_03760或 YH66_03760G844A基因整合到谷氨酸棒杆菌CGMCC 20516基因组上同源臂YH66_03350和下同源臂YH66_03355的间隔区上的阳性菌株,分别命名为YPR-008(不含突变点)和YPR-009 (含突变点)。
PCR鉴定引物如下所示:
P13:5'-GTCCGCTCTGTTGGTGTTCA-3'(对应上同源臂YH66_03350的外侧);
P14:5'-CTTATCTTGGGTCAGACCCA-3'(对应YH66_03760基因内部);
P15:5'-CACAGCATTTGGATCCAGAA-3'(对应YH66_03760基因内部);
P16:5'-TGGAGGAATATTCGGCCCAG-3'(对应下同源臂YH66_03355的外侧)。
重组菌YPR-008含有双拷贝的SEQ ID No.1所示的YH66_03760基因;具体地,重组菌 YPR-008是将谷氨酸棒杆菌CGMCC 20516的基因组中上同源臂YH66_03350和下同源臂YH66_03355的间隔区替换为YH66_03760基因,保持谷氨酸棒杆菌CGMCC 20516的基因组中的其它核苷酸不变得到的重组菌。含有双拷贝YH66_03760基因的重组菌可以显著和稳定地提高YH66_03760基因的表达量。重组菌YPR-009含有SEQ ID No.3所示的突变的 YH66_03760G844A基因;具体地,重组菌YPR-009是将谷氨酸棒杆菌CGMCC 20516的基因组中上同源臂YH66_03350和下同源臂YH66_03355的间隔区替换为YH66_03760G844A基因,保持谷氨酸棒杆菌CGMCC 20516的基因组中的其它核苷酸不变得到的重组菌。
实施例4、构建质粒上过表达YH66_03760基因或YH66_03760G844A基因的工程菌株YPR-010和YPR-011
依据NCBI公布的黄色短杆菌ATCC15168基因组序列,设计并合成一对扩增YH66_03760 或YH66_03760G844A基因编码区及启动子区的引物,引物设计如下(上海invitrogen公司合成):
P17:5'-GCTTGCATGCCTGCAGGTCGACTCTAGAGGATCCCCTTTTCCGAGTCTTAGAT TTT-3'(带下划线的核苷酸序列为pXMJ19上的序列),
P18:5'-ATCAGGCTGAAAATCTTCTCTCATCCGCCAAAACTTAGGAAACGACGACGAT CA-3'(带下划线的核苷酸序列为pXMJ19上的序列)。
构建方法:分别以黄色短杆菌ATCC15168或YPR-007为模板,以引物P17/P18进行PCR 扩增,获得YH66_03760基因及其启动子片段3888bp和YH66_03760G844A基因及其启动子片段3888bp,对扩增产物进行电泳并采用柱式DNA凝胶回收试剂盒进行纯化回收,回收的DNA 片段与经EcoR I酶切回收的穿梭质粒pXMJ19用NEBuilder酶(购自NEB公司)50℃连接30min,连接产物转化后长出的单克隆用M13引物经PCR鉴定获得阳性过表达质粒 pXMJ19-YH66_03760(含有YH66_03760基因)和pXMJ19-YH66_03760G844A(含有 YH66_03760G844A基因),将该质粒送测序。因质粒上含有氯霉素抗性标记,可以通过氯霉素来筛选质粒是否转化到菌株中。
PCR反应体系为:10×Ex Taq Buffer 5μL,dNTP Mixture(各2.5mM)4μL,Mg2+(25mM) 4μL,引物(10pM)各2μL,Ex Taq(5U/μL)0.25μL,总体积50μL。
PCR反应程序为:94℃预变性5min,94℃变性30s;52℃退火30s;72℃延伸60s (30个循环),72℃过度延伸10min。
将测序正确的pXMJ19-YH66_03760和pXMJ19-YH66_03760G844A质粒分别电转化入谷氨酸棒杆菌CGMCC 20516中,在培养基中进行培养,对培养产生的单菌落通过引物M13R(-48) /P18进行PCR鉴定,PCR扩增出大小为3927bp的片段的菌株为阳性菌株,其被命名为YPR-010(不含突变点)和YPR-011(含突变点)。
重组菌YPR-010含有SEQ ID No.1所示的YH66_03760基因,是通过质粒表达 YH66_03760基因的重组菌。重组菌YPR-011含有SEQ ID No.3所示的突变的 YH66_03760G844A基因,是通过质粒表达YH66_03760G844A基因的重组菌。
实施例5、构建基因组上缺失YH66_03760基因的工程菌株YPR-012
根据NCBI公布的黄色短杆菌ATCC15168的基因组序列,合成两对扩增YH66_03760基因编码区两端片段的引物,作为上下游同源臂片段。引物设计如下(上海invitrogen公司合成):
P19:5'-CAGTGCCAAGCTTGCATGCCTGCAGGTCGACTCTAGTCCTTTGGCTACTAAC CCAC-3',
P20:5'-GGGGCTTTTTACAGAAAGGTTAGAGTAATTGTTCCTTTCA-3',
P21:5'-TGAAAGGAACAATTACTCTAACCTTTCTGTAAAAAGCCCC-3',
P22:5'-CAGCTATGACCATGATTACGAATTCGAGCTCGGTACCCGTCTATTTGCTTATCG ACGT-3'。
构建方法:以黄色短杆菌ATCC15168为模板,分别采用引物P19/P20和P21/P22进行PCR 扩增,获得YH66_03760的上游同源臂片段716bp及YH66_03760的下游同源臂片段755bp。对扩增的产物进行电泳并采用柱式DNA凝胶回收试剂盒进行纯化,回收的DNA片段与经过Xbal I/BamH I酶切后纯化的pK18mobsacB质粒(购自Addgene公司)用NEBuilder酶(购自 NEB公司)50℃连接30min,连接产物转化后长出的单克隆用M13引物经PCR鉴定获得阳性敲除载体pK18-ΔYH66_03760,该质粒含有整个敲除YH66_03760同源臂片段1431bp(序列如SEQ ID No.10所示)和卡那霉素抗性作为筛选标记,将该质粒送测序。
PCR扩增反应体系为:10×Ex Taq Buffer 5μL,dNTP Mixture(各2.5mM)4μL,Mg2+(25mM)4μL,引物(10pM)各2μL,Ex Taq(5U/μL)0.25μL,总体积50μL。
PCR扩增反应程序为:94℃预变性5min,94℃变性30s;52℃退火30s;72℃延伸 90s(30个循环),72℃过度延伸10min。
将测序正确的敲除质粒pK18-ΔYH66_03760电转化入谷氨酸棒杆菌CGMCC 20516,在培养基中进行培养,对培养产生的单菌落通过如下引物(上海invitrogen公司合成)进行PCR 鉴定:
P23:5'-TCCTTTGGCTACTAACCCAC-3'(对应于谷氨酸棒杆菌CGMCC 20516 YH66_03755基因内部);
P24:5'-GTCTATTTGCTTATCGACGT-3'(对应于谷氨酸棒杆菌CGMCC 20516 YH66_03765 基因内部)。
上述PCR同时扩增出大小1357bp及4780bp的条带的菌株为阳性菌株,只扩增出4780bp 条带的菌株为原菌。阳性菌株在15%蔗糖培养基上筛选后分别在含有卡那霉素和不含卡那霉素的培养基上培养,选择在不含卡那霉素的培养基上生长,而在含卡那霉素的培养基上不生长的菌株进一步采用P23/P24引物进行PCR鉴定,扩增出大小为1357bp条带的菌株为YH66_03760基因编码区被敲除的阳性菌株。再次通过P23/P24引物PCR扩增阳性菌株YH66_03760片段,并连接到pMD19-T载体进行测序,将测序正确的菌株命名为YPR-012(谷氨酸棒杆菌CGMCC 20516上的基因组上的YH66_03760基因被敲除)。
实施例6、L-精氨酸发酵实验
将上述实施例构建的菌株(YPR-007、YPR-008、YPR-009、YPR-010、YPR-011和YPR-012) 和原始菌株谷氨酸棒杆菌CGMCC 20516在BLBIO-5GC-4-H型号的发酵罐(购自上海百仑生物科技有限公司)中以表1所示的控制工艺进行发酵实验。每个菌株重复三次。完成发酵后,收集上清,采用HPLC检测上清中的L-精氨酸产量。发酵培养基配方如下:溶剂为水,溶质及其浓度分别如下:硫酸铵14g/L,磷酸二氢钾1g/L,磷酸氢二钾1g/L,硫酸镁0.5g/L,酵母粉2g/L,硫酸亚铁18mg/L,硫酸锰4.2mg/L,生物素0.02mg/L,维生素B1 2mg/L, antifoam(CB-442)消泡0.5mL/L,70%葡萄糖(底糖)40g/L。
结果如表2所示,在谷氨酸棒杆菌中对YH66_03760基因编码区进行点突变 YH66_03760G844A及过表达,有助于L-精氨酸产量及转化率的提高,而对基因进行敲除或弱化,不利于L-精氨酸的积累。
表1、发酵控制工艺
表2、L-精氨酸发酵实验结果
菌株 | OD562nm | L-精氨酸产量(g/L) |
谷氨酸棒杆菌CGMCC 20516 | 74.6 | 86.5 |
YPR-007 | 77.9 | 89.9 |
YPR-008 | 78.0 | 89.0 |
YPR-009 | 80.1 | 90.2 |
YPR-010 | 79.2 | 89.7 |
YPR-011 | 80.8 | 90.6 |
YPR-012 | 73.8 | 87.1 |
序列表
<110> 宁夏伊品生物科技股份有限公司
<120> 一种高产精氨酸的工程菌及其构建方法与应用
<160> 10
<170> PatentIn version 3.5
<210> 1
<211> 3423
<212> DNA
<213> Artificial Sequence
<400> 1
gtgtcgactc acacatcttc aacgcttcca gcattcaaaa agatcttggt agcaaaccgc 60
ggcgaaatcg cggtccgtgc tttccgtgca gcactcgaaa ccggtgcagc cacggtagct 120
atttaccccc gtgaagatcg gggatcattc caccgctctt ttgcttctga agctgtccgc 180
attggtactg aaggctcacc agtcaaggcg tacctggaca tcgatgaaat tatcggtgca 240
gctaaaaaag ttaaagcaga tgctatttac ccgggatatg gcttcctgtc tgaaaatgcc 300
cagcttgccc gcgagtgcgc ggaaaacggc attactttta ttggcccaac cccagaggtt 360
cttgatctca ccggtgataa gtctcgtgcg gtaaccgccg cgaagaaggc tggtctgcca 420
gttttggcgg aatccacccc gagcaaaaac atcgatgaca tcgttaaaag cgctgaaggc 480
cagacttacc ccatctttgt aaaggcagtt gccggtggtg gcggacgcgg tatgcgcttt 540
gtttcttcac ctgatgagct ccgcaaattg gcaacagaag catctcgtga agctgaagcg 600
gcattcggcg acggttcggt atatgtcgaa cgtgctgtga ttaaccccca gcacattgaa 660
gtgcagatcc ttggcgatcg cactggagaa gttgtacacc tttatgaacg tgactgctca 720
ctgcagcgtc gtcaccaaaa agttgtcgaa attgcgccag cacagcattt ggatccagaa 780
ctgcgtgatc gcatttgcgc ggatgcagta aagttctgcc gctccattgg ttaccagggc 840
gcgggaaccg tggaattctt ggtcgatgaa aagggcaacc acgttttcat cgaaatgaac 900
ccacgtatcc aggttgagca caccgtgact gaagaagtca ccgaggtgga cctggtgaag 960
gcgcagatgc gcttggctgc tggtgcaacc ttgaaggaat tgggtctgac ccaagataag 1020
atcaagaccc acggtgcagc actgcagtgc cgcatcacca cggaagatcc aaacaacggc 1080
ttccgcccag ataccggaac tatcaccgcg taccgctcac caggcggagc tggcgttcgt 1140
cttgacggtg cagctcagct cggtggcgaa atcaccgcac actttgactc catgctggtg 1200
aaaatgacct gccgtggttc cgactttgaa actgctgttg ctcgtgcaca gcgcgcgttg 1260
gctgagttca ccgtgtctgg tgttgcaacc aacattggct tcttgcgcgc gctgctgcgg 1320
gaagaggact tcacttccaa gcgcatcgcc accggattta tcggcgatca cccacacctc 1380
cttcaggctc cacctgcgga tgatgagcag ggacgcatcc tggattactt ggcagatgtc 1440
accgtgaaca agcctcatgg tgtgcgtcca aaggatgttg cagcaccaat cgataagctg 1500
cccaacatca aggatctgcc actgccacgc ggttcccgtg accgcctgaa gcagcttggc 1560
ccagccgcgt ttgctcgtga tctccgtgag caggacgcac tggcagttac tgataccacc 1620
ttccgcgatg cacaccagtc tttgcttgcg acccgagtcc gctcattcgc actgaagcct 1680
gcggcagagg ccgtcgcaaa gctgactcct gagcttttgt ccgtggaggc ctggggcggc 1740
gcgacctacg atgtggcgat gcgtttcctc tttgaggatc cgtgggacag gctcgacgag 1800
ctgcgcgagg cgatgccgaa tgtaaacatt cagatgctgc ttcgcggccg caacaccgtg 1860
ggatacaccc cgtacccaga ctccgtctgc cgcgcgtttg ttaaggaagc tgccacctcc 1920
ggcgtggaca tcttccgcat cttcgacgcg cttaacgacg tctcccagat gcgtccagca 1980
atcgacgcag tcctggagac caacaccgcg gtagccgagg tggctatggc ttattctggt 2040
gatctctctg atccaaatga aaagctctac accctggatt actacctgaa gatggcagag 2100
gagatcgtca agtctggcgc tcacatcttg gccattaagg atatggctgg tctgcttcgc 2160
ccagctgcgg taaccaagct ggtcaccgca ctgcgccgtg aattcgatct gccagtgcac 2220
gtgcacaccc acgacaccgc gggtggccag ctggctacct actttgctgc agctcaagct 2280
ggtgcagatg ctgttgacgg tgcttccgca ccactgtctg gcaccacctc ccagccatcc 2340
ctgtctgcca ttgttgctgc attcgcgcac acccgtcgcg ataccggttt gagcctcgag 2400
gctgtttctg acctcgagcc gtactgggaa gcagtgcgcg gactgtacct gccatttgag 2460
tctggaaccc caggcccaac cggtcgcgtc taccgccacg aaatcccagg cggacagctg 2520
tccaacctgc gtgcacaggc caccgcactg ggccttgctg atcgcttcga gctcatcgaa 2580
gacaactacg cagccgttaa tgagatgctg ggacgcccaa ccaaggtcac cccatcctcc 2640
aaggttgttg gcgacctcgc actccacctg gttggtgcgg gtgtagatcc agcagacttt 2700
gctgcagacc cacaaaagta cgacatccca gactctgtca tcgcgttcct gcgcggcgag 2760
cttggtaacc ctccaggtgg ctggccagaa ccactgcgca cccgcgcact ggaaggccgc 2820
tccgaaggca aggcacctct gacggaagtt cctgaggaag agcaggcgca cctcgacgct 2880
gatgattcca aggaacgtcg caacagcctc aaccgcctgc tgttcccgaa gccaaccgaa 2940
gagttcctcg agcaccgtcg ccgcttcggc aacacctctg cgctggatga tcgtgaattc 3000
ttctacggcc tggtcgaagg ccgcgagact ttgatccgcc tgccagatgt gcgcacccca 3060
ctgcttgttc gcctggatgc gatctctgag ccagacgata agggtatgcg caatgttgtg 3120
gccaacgtca acggccagat ccgcccaatg cgtgtgcgtg accgctccgt tgagtctgtc 3180
accgcaaccg cagaaaaggc agattcctcc aacaagggcc atgttgctgc accattcgct 3240
ggtgttgtca ctgtgactgt tgctgaaggt gatgaggtca aggctggaga tgcagtcgca 3300
atcatcgagg ctatgaagat ggaagcaaca atcactgctt ctgttgacgg caaaatcgat 3360
cgcgttgtgg ttcctgctgc aacgaaggtg gaaggtggcg acttgatcgt cgtcgtttcc 3420
taa 3423
<210> 2
<211> 1140
<212> PRT
<213> Artificial Sequence
<400> 2
Met Ser Thr His Thr Ser Ser Thr Leu Pro Ala Phe Lys Lys Ile Leu
1 5 10 15
Val Ala Asn Arg Gly Glu Ile Ala Val Arg Ala Phe Arg Ala Ala Leu
20 25 30
Glu Thr Gly Ala Ala Thr Val Ala Ile Tyr Pro Arg Glu Asp Arg Gly
35 40 45
Ser Phe His Arg Ser Phe Ala Ser Glu Ala Val Arg Ile Gly Thr Glu
50 55 60
Gly Ser Pro Val Lys Ala Tyr Leu Asp Ile Asp Glu Ile Ile Gly Ala
65 70 75 80
Ala Lys Lys Val Lys Ala Asp Ala Ile Tyr Pro Gly Tyr Gly Phe Leu
85 90 95
Ser Glu Asn Ala Gln Leu Ala Arg Glu Cys Ala Glu Asn Gly Ile Thr
100 105 110
Phe Ile Gly Pro Thr Pro Glu Val Leu Asp Leu Thr Gly Asp Lys Ser
115 120 125
Arg Ala Val Thr Ala Ala Lys Lys Ala Gly Leu Pro Val Leu Ala Glu
130 135 140
Ser Thr Pro Ser Lys Asn Ile Asp Asp Ile Val Lys Ser Ala Glu Gly
145 150 155 160
Gln Thr Tyr Pro Ile Phe Val Lys Ala Val Ala Gly Gly Gly Gly Arg
165 170 175
Gly Met Arg Phe Val Ser Ser Pro Asp Glu Leu Arg Lys Leu Ala Thr
180 185 190
Glu Ala Ser Arg Glu Ala Glu Ala Ala Phe Gly Asp Gly Ser Val Tyr
195 200 205
Val Glu Arg Ala Val Ile Asn Pro Gln His Ile Glu Val Gln Ile Leu
210 215 220
Gly Asp Arg Thr Gly Glu Val Val His Leu Tyr Glu Arg Asp Cys Ser
225 230 235 240
Leu Gln Arg Arg His Gln Lys Val Val Glu Ile Ala Pro Ala Gln His
245 250 255
Leu Asp Pro Glu Leu Arg Asp Arg Ile Cys Ala Asp Ala Val Lys Phe
260 265 270
Cys Arg Ser Ile Gly Tyr Gln Gly Ala Gly Thr Val Glu Phe Leu Val
275 280 285
Asp Glu Lys Gly Asn His Val Phe Ile Glu Met Asn Pro Arg Ile Gln
290 295 300
Val Glu His Thr Val Thr Glu Glu Val Thr Glu Val Asp Leu Val Lys
305 310 315 320
Ala Gln Met Arg Leu Ala Ala Gly Ala Thr Leu Lys Glu Leu Gly Leu
325 330 335
Thr Gln Asp Lys Ile Lys Thr His Gly Ala Ala Leu Gln Cys Arg Ile
340 345 350
Thr Thr Glu Asp Pro Asn Asn Gly Phe Arg Pro Asp Thr Gly Thr Ile
355 360 365
Thr Ala Tyr Arg Ser Pro Gly Gly Ala Gly Val Arg Leu Asp Gly Ala
370 375 380
Ala Gln Leu Gly Gly Glu Ile Thr Ala His Phe Asp Ser Met Leu Val
385 390 395 400
Lys Met Thr Cys Arg Gly Ser Asp Phe Glu Thr Ala Val Ala Arg Ala
405 410 415
Gln Arg Ala Leu Ala Glu Phe Thr Val Ser Gly Val Ala Thr Asn Ile
420 425 430
Gly Phe Leu Arg Ala Leu Leu Arg Glu Glu Asp Phe Thr Ser Lys Arg
435 440 445
Ile Ala Thr Gly Phe Ile Gly Asp His Pro His Leu Leu Gln Ala Pro
450 455 460
Pro Ala Asp Asp Glu Gln Gly Arg Ile Leu Asp Tyr Leu Ala Asp Val
465 470 475 480
Thr Val Asn Lys Pro His Gly Val Arg Pro Lys Asp Val Ala Ala Pro
485 490 495
Ile Asp Lys Leu Pro Asn Ile Lys Asp Leu Pro Leu Pro Arg Gly Ser
500 505 510
Arg Asp Arg Leu Lys Gln Leu Gly Pro Ala Ala Phe Ala Arg Asp Leu
515 520 525
Arg Glu Gln Asp Ala Leu Ala Val Thr Asp Thr Thr Phe Arg Asp Ala
530 535 540
His Gln Ser Leu Leu Ala Thr Arg Val Arg Ser Phe Ala Leu Lys Pro
545 550 555 560
Ala Ala Glu Ala Val Ala Lys Leu Thr Pro Glu Leu Leu Ser Val Glu
565 570 575
Ala Trp Gly Gly Ala Thr Tyr Asp Val Ala Met Arg Phe Leu Phe Glu
580 585 590
Asp Pro Trp Asp Arg Leu Asp Glu Leu Arg Glu Ala Met Pro Asn Val
595 600 605
Asn Ile Gln Met Leu Leu Arg Gly Arg Asn Thr Val Gly Tyr Thr Pro
610 615 620
Tyr Pro Asp Ser Val Cys Arg Ala Phe Val Lys Glu Ala Ala Thr Ser
625 630 635 640
Gly Val Asp Ile Phe Arg Ile Phe Asp Ala Leu Asn Asp Val Ser Gln
645 650 655
Met Arg Pro Ala Ile Asp Ala Val Leu Glu Thr Asn Thr Ala Val Ala
660 665 670
Glu Val Ala Met Ala Tyr Ser Gly Asp Leu Ser Asp Pro Asn Glu Lys
675 680 685
Leu Tyr Thr Leu Asp Tyr Tyr Leu Lys Met Ala Glu Glu Ile Val Lys
690 695 700
Ser Gly Ala His Ile Leu Ala Ile Lys Asp Met Ala Gly Leu Leu Arg
705 710 715 720
Pro Ala Ala Val Thr Lys Leu Val Thr Ala Leu Arg Arg Glu Phe Asp
725 730 735
Leu Pro Val His Val His Thr His Asp Thr Ala Gly Gly Gln Leu Ala
740 745 750
Thr Tyr Phe Ala Ala Ala Gln Ala Gly Ala Asp Ala Val Asp Gly Ala
755 760 765
Ser Ala Pro Leu Ser Gly Thr Thr Ser Gln Pro Ser Leu Ser Ala Ile
770 775 780
Val Ala Ala Phe Ala His Thr Arg Arg Asp Thr Gly Leu Ser Leu Glu
785 790 795 800
Ala Val Ser Asp Leu Glu Pro Tyr Trp Glu Ala Val Arg Gly Leu Tyr
805 810 815
Leu Pro Phe Glu Ser Gly Thr Pro Gly Pro Thr Gly Arg Val Tyr Arg
820 825 830
His Glu Ile Pro Gly Gly Gln Leu Ser Asn Leu Arg Ala Gln Ala Thr
835 840 845
Ala Leu Gly Leu Ala Asp Arg Phe Glu Leu Ile Glu Asp Asn Tyr Ala
850 855 860
Ala Val Asn Glu Met Leu Gly Arg Pro Thr Lys Val Thr Pro Ser Ser
865 870 875 880
Lys Val Val Gly Asp Leu Ala Leu His Leu Val Gly Ala Gly Val Asp
885 890 895
Pro Ala Asp Phe Ala Ala Asp Pro Gln Lys Tyr Asp Ile Pro Asp Ser
900 905 910
Val Ile Ala Phe Leu Arg Gly Glu Leu Gly Asn Pro Pro Gly Gly Trp
915 920 925
Pro Glu Pro Leu Arg Thr Arg Ala Leu Glu Gly Arg Ser Glu Gly Lys
930 935 940
Ala Pro Leu Thr Glu Val Pro Glu Glu Glu Gln Ala His Leu Asp Ala
945 950 955 960
Asp Asp Ser Lys Glu Arg Arg Asn Ser Leu Asn Arg Leu Leu Phe Pro
965 970 975
Lys Pro Thr Glu Glu Phe Leu Glu His Arg Arg Arg Phe Gly Asn Thr
980 985 990
Ser Ala Leu Asp Asp Arg Glu Phe Phe Tyr Gly Leu Val Glu Gly Arg
995 1000 1005
Glu Thr Leu Ile Arg Leu Pro Asp Val Arg Thr Pro Leu Leu Val
1010 1015 1020
Arg Leu Asp Ala Ile Ser Glu Pro Asp Asp Lys Gly Met Arg Asn
1025 1030 1035
Val Val Ala Asn Val Asn Gly Gln Ile Arg Pro Met Arg Val Arg
1040 1045 1050
Asp Arg Ser Val Glu Ser Val Thr Ala Thr Ala Glu Lys Ala Asp
1055 1060 1065
Ser Ser Asn Lys Gly His Val Ala Ala Pro Phe Ala Gly Val Val
1070 1075 1080
Thr Val Thr Val Ala Glu Gly Asp Glu Val Lys Ala Gly Asp Ala
1085 1090 1095
Val Ala Ile Ile Glu Ala Met Lys Met Glu Ala Thr Ile Thr Ala
1100 1105 1110
Ser Val Asp Gly Lys Ile Asp Arg Val Val Val Pro Ala Ala Thr
1115 1120 1125
Lys Val Glu Gly Gly Asp Leu Ile Val Val Val Ser
1130 1135 1140
<210> 3
<211> 3423
<212> DNA
<213> Artificial Sequence
<400> 3
gtgtcgactc acacatcttc aacgcttcca gcattcaaaa agatcttggt agcaaaccgc 60
ggcgaaatcg cggtccgtgc tttccgtgca gcactcgaaa ccggtgcagc cacggtagct 120
atttaccccc gtgaagatcg gggatcattc caccgctctt ttgcttctga agctgtccgc 180
attggtactg aaggctcacc agtcaaggcg tacctggaca tcgatgaaat tatcggtgca 240
gctaaaaaag ttaaagcaga tgctatttac ccgggatatg gcttcctgtc tgaaaatgcc 300
cagcttgccc gcgagtgcgc ggaaaacggc attactttta ttggcccaac cccagaggtt 360
cttgatctca ccggtgataa gtctcgtgcg gtaaccgccg cgaagaaggc tggtctgcca 420
gttttggcgg aatccacccc gagcaaaaac atcgatgaca tcgttaaaag cgctgaaggc 480
cagacttacc ccatctttgt aaaggcagtt gccggtggtg gcggacgcgg tatgcgcttt 540
gtttcttcac ctgatgagct ccgcaaattg gcaacagaag catctcgtga agctgaagcg 600
gcattcggcg acggttcggt atatgtcgaa cgtgctgtga ttaaccccca gcacattgaa 660
gtgcagatcc ttggcgatcg cactggagaa gttgtacacc tttatgaacg tgactgctca 720
ctgcagcgtc gtcaccaaaa agttgtcgaa attgcgccag cacagcattt ggatccagaa 780
ctgcgtgatc gcatttgcgc ggatgcagta aagttctgcc gctccattgg ttaccagggc 840
gcgagaaccg tggaattctt ggtcgatgaa aagggcaacc acgttttcat cgaaatgaac 900
ccacgtatcc aggttgagca caccgtgact gaagaagtca ccgaggtgga cctggtgaag 960
gcgcagatgc gcttggctgc tggtgcaacc ttgaaggaat tgggtctgac ccaagataag 1020
atcaagaccc acggtgcagc actgcagtgc cgcatcacca cggaagatcc aaacaacggc 1080
ttccgcccag ataccggaac tatcaccgcg taccgctcac caggcggagc tggcgttcgt 1140
cttgacggtg cagctcagct cggtggcgaa atcaccgcac actttgactc catgctggtg 1200
aaaatgacct gccgtggttc cgactttgaa actgctgttg ctcgtgcaca gcgcgcgttg 1260
gctgagttca ccgtgtctgg tgttgcaacc aacattggct tcttgcgcgc gctgctgcgg 1320
gaagaggact tcacttccaa gcgcatcgcc accggattta tcggcgatca cccacacctc 1380
cttcaggctc cacctgcgga tgatgagcag ggacgcatcc tggattactt ggcagatgtc 1440
accgtgaaca agcctcatgg tgtgcgtcca aaggatgttg cagcaccaat cgataagctg 1500
cccaacatca aggatctgcc actgccacgc ggttcccgtg accgcctgaa gcagcttggc 1560
ccagccgcgt ttgctcgtga tctccgtgag caggacgcac tggcagttac tgataccacc 1620
ttccgcgatg cacaccagtc tttgcttgcg acccgagtcc gctcattcgc actgaagcct 1680
gcggcagagg ccgtcgcaaa gctgactcct gagcttttgt ccgtggaggc ctggggcggc 1740
gcgacctacg atgtggcgat gcgtttcctc tttgaggatc cgtgggacag gctcgacgag 1800
ctgcgcgagg cgatgccgaa tgtaaacatt cagatgctgc ttcgcggccg caacaccgtg 1860
ggatacaccc cgtacccaga ctccgtctgc cgcgcgtttg ttaaggaagc tgccacctcc 1920
ggcgtggaca tcttccgcat cttcgacgcg cttaacgacg tctcccagat gcgtccagca 1980
atcgacgcag tcctggagac caacaccgcg gtagccgagg tggctatggc ttattctggt 2040
gatctctctg atccaaatga aaagctctac accctggatt actacctgaa gatggcagag 2100
gagatcgtca agtctggcgc tcacatcttg gccattaagg atatggctgg tctgcttcgc 2160
ccagctgcgg taaccaagct ggtcaccgca ctgcgccgtg aattcgatct gccagtgcac 2220
gtgcacaccc acgacaccgc gggtggccag ctggctacct actttgctgc agctcaagct 2280
ggtgcagatg ctgttgacgg tgcttccgca ccactgtctg gcaccacctc ccagccatcc 2340
ctgtctgcca ttgttgctgc attcgcgcac acccgtcgcg ataccggttt gagcctcgag 2400
gctgtttctg acctcgagcc gtactgggaa gcagtgcgcg gactgtacct gccatttgag 2460
tctggaaccc caggcccaac cggtcgcgtc taccgccacg aaatcccagg cggacagctg 2520
tccaacctgc gtgcacaggc caccgcactg ggccttgctg atcgcttcga gctcatcgaa 2580
gacaactacg cagccgttaa tgagatgctg ggacgcccaa ccaaggtcac cccatcctcc 2640
aaggttgttg gcgacctcgc actccacctg gttggtgcgg gtgtagatcc agcagacttt 2700
gctgcagacc cacaaaagta cgacatccca gactctgtca tcgcgttcct gcgcggcgag 2760
cttggtaacc ctccaggtgg ctggccagaa ccactgcgca cccgcgcact ggaaggccgc 2820
tccgaaggca aggcacctct gacggaagtt cctgaggaag agcaggcgca cctcgacgct 2880
gatgattcca aggaacgtcg caacagcctc aaccgcctgc tgttcccgaa gccaaccgaa 2940
gagttcctcg agcaccgtcg ccgcttcggc aacacctctg cgctggatga tcgtgaattc 3000
ttctacggcc tggtcgaagg ccgcgagact ttgatccgcc tgccagatgt gcgcacccca 3060
ctgcttgttc gcctggatgc gatctctgag ccagacgata agggtatgcg caatgttgtg 3120
gccaacgtca acggccagat ccgcccaatg cgtgtgcgtg accgctccgt tgagtctgtc 3180
accgcaaccg cagaaaaggc agattcctcc aacaagggcc atgttgctgc accattcgct 3240
ggtgttgtca ctgtgactgt tgctgaaggt gatgaggtca aggctggaga tgcagtcgca 3300
atcatcgagg ctatgaagat ggaagcaaca atcactgctt ctgttgacgg caaaatcgat 3360
cgcgttgtgg ttcctgctgc aacgaaggtg gaaggtggcg acttgatcgt cgtcgtttcc 3420
taa 3423
<210> 4
<211> 1140
<212> PRT
<213> Artificial Sequence
<400> 4
Met Ser Thr His Thr Ser Ser Thr Leu Pro Ala Phe Lys Lys Ile Leu
1 5 10 15
Val Ala Asn Arg Gly Glu Ile Ala Val Arg Ala Phe Arg Ala Ala Leu
20 25 30
Glu Thr Gly Ala Ala Thr Val Ala Ile Tyr Pro Arg Glu Asp Arg Gly
35 40 45
Ser Phe His Arg Ser Phe Ala Ser Glu Ala Val Arg Ile Gly Thr Glu
50 55 60
Gly Ser Pro Val Lys Ala Tyr Leu Asp Ile Asp Glu Ile Ile Gly Ala
65 70 75 80
Ala Lys Lys Val Lys Ala Asp Ala Ile Tyr Pro Gly Tyr Gly Phe Leu
85 90 95
Ser Glu Asn Ala Gln Leu Ala Arg Glu Cys Ala Glu Asn Gly Ile Thr
100 105 110
Phe Ile Gly Pro Thr Pro Glu Val Leu Asp Leu Thr Gly Asp Lys Ser
115 120 125
Arg Ala Val Thr Ala Ala Lys Lys Ala Gly Leu Pro Val Leu Ala Glu
130 135 140
Ser Thr Pro Ser Lys Asn Ile Asp Asp Ile Val Lys Ser Ala Glu Gly
145 150 155 160
Gln Thr Tyr Pro Ile Phe Val Lys Ala Val Ala Gly Gly Gly Gly Arg
165 170 175
Gly Met Arg Phe Val Ser Ser Pro Asp Glu Leu Arg Lys Leu Ala Thr
180 185 190
Glu Ala Ser Arg Glu Ala Glu Ala Ala Phe Gly Asp Gly Ser Val Tyr
195 200 205
Val Glu Arg Ala Val Ile Asn Pro Gln His Ile Glu Val Gln Ile Leu
210 215 220
Gly Asp Arg Thr Gly Glu Val Val His Leu Tyr Glu Arg Asp Cys Ser
225 230 235 240
Leu Gln Arg Arg His Gln Lys Val Val Glu Ile Ala Pro Ala Gln His
245 250 255
Leu Asp Pro Glu Leu Arg Asp Arg Ile Cys Ala Asp Ala Val Lys Phe
260 265 270
Cys Arg Ser Ile Gly Tyr Gln Gly Ala Arg Thr Val Glu Phe Leu Val
275 280 285
Asp Glu Lys Gly Asn His Val Phe Ile Glu Met Asn Pro Arg Ile Gln
290 295 300
Val Glu His Thr Val Thr Glu Glu Val Thr Glu Val Asp Leu Val Lys
305 310 315 320
Ala Gln Met Arg Leu Ala Ala Gly Ala Thr Leu Lys Glu Leu Gly Leu
325 330 335
Thr Gln Asp Lys Ile Lys Thr His Gly Ala Ala Leu Gln Cys Arg Ile
340 345 350
Thr Thr Glu Asp Pro Asn Asn Gly Phe Arg Pro Asp Thr Gly Thr Ile
355 360 365
Thr Ala Tyr Arg Ser Pro Gly Gly Ala Gly Val Arg Leu Asp Gly Ala
370 375 380
Ala Gln Leu Gly Gly Glu Ile Thr Ala His Phe Asp Ser Met Leu Val
385 390 395 400
Lys Met Thr Cys Arg Gly Ser Asp Phe Glu Thr Ala Val Ala Arg Ala
405 410 415
Gln Arg Ala Leu Ala Glu Phe Thr Val Ser Gly Val Ala Thr Asn Ile
420 425 430
Gly Phe Leu Arg Ala Leu Leu Arg Glu Glu Asp Phe Thr Ser Lys Arg
435 440 445
Ile Ala Thr Gly Phe Ile Gly Asp His Pro His Leu Leu Gln Ala Pro
450 455 460
Pro Ala Asp Asp Glu Gln Gly Arg Ile Leu Asp Tyr Leu Ala Asp Val
465 470 475 480
Thr Val Asn Lys Pro His Gly Val Arg Pro Lys Asp Val Ala Ala Pro
485 490 495
Ile Asp Lys Leu Pro Asn Ile Lys Asp Leu Pro Leu Pro Arg Gly Ser
500 505 510
Arg Asp Arg Leu Lys Gln Leu Gly Pro Ala Ala Phe Ala Arg Asp Leu
515 520 525
Arg Glu Gln Asp Ala Leu Ala Val Thr Asp Thr Thr Phe Arg Asp Ala
530 535 540
His Gln Ser Leu Leu Ala Thr Arg Val Arg Ser Phe Ala Leu Lys Pro
545 550 555 560
Ala Ala Glu Ala Val Ala Lys Leu Thr Pro Glu Leu Leu Ser Val Glu
565 570 575
Ala Trp Gly Gly Ala Thr Tyr Asp Val Ala Met Arg Phe Leu Phe Glu
580 585 590
Asp Pro Trp Asp Arg Leu Asp Glu Leu Arg Glu Ala Met Pro Asn Val
595 600 605
Asn Ile Gln Met Leu Leu Arg Gly Arg Asn Thr Val Gly Tyr Thr Pro
610 615 620
Tyr Pro Asp Ser Val Cys Arg Ala Phe Val Lys Glu Ala Ala Thr Ser
625 630 635 640
Gly Val Asp Ile Phe Arg Ile Phe Asp Ala Leu Asn Asp Val Ser Gln
645 650 655
Met Arg Pro Ala Ile Asp Ala Val Leu Glu Thr Asn Thr Ala Val Ala
660 665 670
Glu Val Ala Met Ala Tyr Ser Gly Asp Leu Ser Asp Pro Asn Glu Lys
675 680 685
Leu Tyr Thr Leu Asp Tyr Tyr Leu Lys Met Ala Glu Glu Ile Val Lys
690 695 700
Ser Gly Ala His Ile Leu Ala Ile Lys Asp Met Ala Gly Leu Leu Arg
705 710 715 720
Pro Ala Ala Val Thr Lys Leu Val Thr Ala Leu Arg Arg Glu Phe Asp
725 730 735
Leu Pro Val His Val His Thr His Asp Thr Ala Gly Gly Gln Leu Ala
740 745 750
Thr Tyr Phe Ala Ala Ala Gln Ala Gly Ala Asp Ala Val Asp Gly Ala
755 760 765
Ser Ala Pro Leu Ser Gly Thr Thr Ser Gln Pro Ser Leu Ser Ala Ile
770 775 780
Val Ala Ala Phe Ala His Thr Arg Arg Asp Thr Gly Leu Ser Leu Glu
785 790 795 800
Ala Val Ser Asp Leu Glu Pro Tyr Trp Glu Ala Val Arg Gly Leu Tyr
805 810 815
Leu Pro Phe Glu Ser Gly Thr Pro Gly Pro Thr Gly Arg Val Tyr Arg
820 825 830
His Glu Ile Pro Gly Gly Gln Leu Ser Asn Leu Arg Ala Gln Ala Thr
835 840 845
Ala Leu Gly Leu Ala Asp Arg Phe Glu Leu Ile Glu Asp Asn Tyr Ala
850 855 860
Ala Val Asn Glu Met Leu Gly Arg Pro Thr Lys Val Thr Pro Ser Ser
865 870 875 880
Lys Val Val Gly Asp Leu Ala Leu His Leu Val Gly Ala Gly Val Asp
885 890 895
Pro Ala Asp Phe Ala Ala Asp Pro Gln Lys Tyr Asp Ile Pro Asp Ser
900 905 910
Val Ile Ala Phe Leu Arg Gly Glu Leu Gly Asn Pro Pro Gly Gly Trp
915 920 925
Pro Glu Pro Leu Arg Thr Arg Ala Leu Glu Gly Arg Ser Glu Gly Lys
930 935 940
Ala Pro Leu Thr Glu Val Pro Glu Glu Glu Gln Ala His Leu Asp Ala
945 950 955 960
Asp Asp Ser Lys Glu Arg Arg Asn Ser Leu Asn Arg Leu Leu Phe Pro
965 970 975
Lys Pro Thr Glu Glu Phe Leu Glu His Arg Arg Arg Phe Gly Asn Thr
980 985 990
Ser Ala Leu Asp Asp Arg Glu Phe Phe Tyr Gly Leu Val Glu Gly Arg
995 1000 1005
Glu Thr Leu Ile Arg Leu Pro Asp Val Arg Thr Pro Leu Leu Val
1010 1015 1020
Arg Leu Asp Ala Ile Ser Glu Pro Asp Asp Lys Gly Met Arg Asn
1025 1030 1035
Val Val Ala Asn Val Asn Gly Gln Ile Arg Pro Met Arg Val Arg
1040 1045 1050
Asp Arg Ser Val Glu Ser Val Thr Ala Thr Ala Glu Lys Ala Asp
1055 1060 1065
Ser Ser Asn Lys Gly His Val Ala Ala Pro Phe Ala Gly Val Val
1070 1075 1080
Thr Val Thr Val Ala Glu Gly Asp Glu Val Lys Ala Gly Asp Ala
1085 1090 1095
Val Ala Ile Ile Glu Ala Met Lys Met Glu Ala Thr Ile Thr Ala
1100 1105 1110
Ser Val Asp Gly Lys Ile Asp Arg Val Val Val Pro Ala Ala Thr
1115 1120 1125
Lys Val Glu Gly Gly Asp Leu Ile Val Val Val Ser
1130 1135 1140
<210> 5
<211> 1414
<212> DNA
<213> Artificial Sequence
<400> 5
cagtgccaag cttgcatgcc tgcaggtcga ctctagattg gtactgaagg ctcaccagtc 60
aaggcgtacc tggacatcga tgaaattatc ggtgcagcta aaaaagttaa agcagatgct 120
atttacccgg gatatggctt cctgtctgaa aatgcccagc ttgcccgcga gtgcgcggaa 180
aacggcatta cttttattgg cccaacccca gaggttcttg atctcaccgg tgataagtct 240
cgtgcggtaa ccgccgcgaa gaaggctggt ctgccagttt tggcggaatc caccccgagc 300
aaaaacatcg atgacatcgt taaaagcgct gaaggccaga cttaccccat ctttgtaaag 360
gcagttgccg gtggtggcgg acgcggtatg cgctttgttt cttcacctga tgagctccgc 420
aaattggcaa cagaagcatc tcgtgaagct gaagcggcat tcggcgacgg ttcggtatat 480
gtcgaacgtg ctgtgattaa cccccagcac attgaagtgc agatccttgg cgatcgcact 540
ggagaagttg tacaccttta tgaacgtgac tgctcactgc agcgtcgtca ccaaaaagtt 600
gtcgaaattg cgccagcaca gcatttggat ccagaactgc gtgatcgcat ttgcgcggat 660
gcagtaaagt tctgccgctc cattggttac cagggcgcga gaaccgtgga attcttggtc 720
gatgaaaagg gcaaccacgt tttcatcgaa atgaacccac gtatccaggt tgagcacacc 780
gtgactgaag aagtcaccga ggtggacctg gtgaaggcgc agatgcgctt ggctgctggt 840
gcaaccttga aggaattggg tctgacccaa gataagatca agacccacgg tgcagcactg 900
cagtgccgca tcaccacgga agatccaaac aacggcttcc gcccagatac cggaactatc 960
accgcgtacc gctcaccagg cggagctggc gttcgtcttg acggtgcagc tcagctcggt 1020
ggcgaaatca ccgcacactt tgactccatg ctggtgaaaa tgacctgccg tggttccgac 1080
tttgaaactg ctgttgctcg tgcacagcgc gcgttggctg agttcaccgt gtctggtgtt 1140
gcaaccaaca ttggcttctt gcgcgcgctg ctgcgggaag aggacttcac ttccaagcgc 1200
atcgccaccg gatttatcgg cgatcaccca cacctccttc aggctccacc tgcggatgat 1260
gagcagggac gcatcctgga ttacttggca gatgtcaccg tgaacaagcc tcatggtgtg 1320
cgtccaaagg atgttgcagc accaatcgat aagctgccca acatcaagga tctgccgggt 1380
accgagctcg aattcgtaat catggtcata gctg 1414
<210> 6
<211> 806
<212> DNA
<213> Artificial Sequence
<400> 6
cagtgccaag cttgcatgcc tgcaggtcga ctctagcatg acggctgact ggactcgact 60
tccatacgag gttctggaga agatctccac ccgcatcacc aacgaagttc cagatgtgaa 120
ccgcgtggtt ttggacgtaa cctccaagcc accaggaacc atcgaatggg agtaggcctt 180
aaatgagcct tcgttaagcg gcaatcacct tattggagat tgtcgctttt cccatttctc 240
cgggttttct ggaacttttt gggcgtatgc tgggaatgat tctattattg ccaaatcaga 300
aagcaggaga gacccgatga gcgaaatcct agaaacctat tgggcacccc actttggaaa 360
aaccgaagaa gccacagcac tcgtttcata cctggcacaa gcttccggcg atcccattga 420
ggttcacacc ctgttcgggg atttaggttt agacggactc tcgggaaact acaccgacac 480
tgagattgac ggctacggcg acgcattcct gctggttgca gcgctatccg tgttgatggc 540
tgaaaacaaa gcaacaggtg gcgtgaatct gggtgagctt gggggagctg ataaatcgat 600
ccggctgcat gttgaatcca aggagaacac ccaaatcaac accgcattga agtattttgc 660
gctctcccca gaagaccacg cagcagcaga tcgcttcgat gaggatgacc tgtctgagct 720
tgccaacttg agtgaagagc tgcgcggaca gctggactaa ttgtctccca tttaaggagt 780
ccgatttttt ccgagtctta gatttt 806
<210> 7
<211> 3858
<212> DNA
<213> Artificial Sequence
<400> 7
cccatttaag gagtccgatt ttttccgagt cttagatttt gagaaaaccc aggattgctt 60
tgtgcactcc tgggttttca ctttgttaag cagttttggg gaaaagtgca aagtttgcaa 120
agtttagaaa tattttaaga ggtaagatgt ctgcaggtgg aagcgtttaa atgcgttaaa 180
cttggccaaa tgtggcaacc tttgcaaggt gaaaaactgg ggcggggtta gatcctgggg 240
ggtttatttc attcactttg gcttgaagtc gtgcaggtca gaggagtgtt gcccgaaaac 300
attgagagga aaacaaaaac cgatgtttga ttgggggaat cgggggttac gatactagga 360
cgaagtgact gctatcaccc ttggcggtct cttgttgaaa ggaacaatta ctctagtgtc 420
gactcacaca tcttcaacgc ttccagcatt caaaaagatc ttggtagcaa accgcggcga 480
aatcgcggtc cgtgctttcc gtgcagcact cgaaaccggt gcagccacgg tagctattta 540
cccccgtgaa gatcggggat cattccaccg ctcttttgct tctgaagctg tccgcattgg 600
tactgaaggc tcaccagtca aggcgtacct ggacatcgat gaaattatcg gtgcagctaa 660
aaaagttaaa gcagatgcta tttacccggg atatggcttc ctgtctgaaa atgcccagct 720
tgcccgcgag tgcgcggaaa acggcattac ttttattggc ccaaccccag aggttcttga 780
tctcaccggt gataagtctc gtgcggtaac cgccgcgaag aaggctggtc tgccagtttt 840
ggcggaatcc accccgagca aaaacatcga tgacatcgtt aaaagcgctg aaggccagac 900
ttaccccatc tttgtaaagg cagttgccgg tggtggcgga cgcggtatgc gctttgtttc 960
ttcacctgat gagctccgca aattggcaac agaagcatct cgtgaagctg aagcggcatt 1020
cggcgacggt tcggtatatg tcgaacgtgc tgtgattaac ccccagcaca ttgaagtgca 1080
gatccttggc gatcgcactg gagaagttgt acacctttat gaacgtgact gctcactgca 1140
gcgtcgtcac caaaaagttg tcgaaattgc gccagcacag catttggatc cagaactgcg 1200
tgatcgcatt tgcgcggatg cagtaaagtt ctgccgctcc attggttacc agggcgcggg 1260
aaccgtggaa ttcttggtcg atgaaaaggg caaccacgtt ttcatcgaaa tgaacccacg 1320
tatccaggtt gagcacaccg tgactgaaga agtcaccgag gtggacctgg tgaaggcgca 1380
gatgcgcttg gctgctggtg caaccttgaa ggaattgggt ctgacccaag ataagatcaa 1440
gacccacggt gcagcactgc agtgccgcat caccacggaa gatccaaaca acggcttccg 1500
cccagatacc ggaactatca ccgcgtaccg ctcaccaggc ggagctggcg ttcgtcttga 1560
cggtgcagct cagctcggtg gcgaaatcac cgcacacttt gactccatgc tggtgaaaat 1620
gacctgccgt ggttccgact ttgaaactgc tgttgctcgt gcacagcgcg cgttggctga 1680
gttcaccgtg tctggtgttg caaccaacat tggcttcttg cgcgcgctgc tgcgggaaga 1740
ggacttcact tccaagcgca tcgccaccgg atttatcggc gatcacccac acctccttca 1800
ggctccacct gcggatgatg agcagggacg catcctggat tacttggcag atgtcaccgt 1860
gaacaagcct catggtgtgc gtccaaagga tgttgcagca ccaatcgata agctgcccaa 1920
catcaaggat ctgccactgc cacgcggttc ccgtgaccgc ctgaagcagc ttggcccagc 1980
cgcgtttgct cgtgatctcc gtgagcagga cgcactggca gttactgata ccaccttccg 2040
cgatgcacac cagtctttgc ttgcgacccg agtccgctca ttcgcactga agcctgcggc 2100
agaggccgtc gcaaagctga ctcctgagct tttgtccgtg gaggcctggg gcggcgcgac 2160
ctacgatgtg gcgatgcgtt tcctctttga ggatccgtgg gacaggctcg acgagctgcg 2220
cgaggcgatg ccgaatgtaa acattcagat gctgcttcgc ggccgcaaca ccgtgggata 2280
caccccgtac ccagactccg tctgccgcgc gtttgttaag gaagctgcca cctccggcgt 2340
ggacatcttc cgcatcttcg acgcgcttaa cgacgtctcc cagatgcgtc cagcaatcga 2400
cgcagtcctg gagaccaaca ccgcggtagc cgaggtggct atggcttatt ctggtgatct 2460
ctctgatcca aatgaaaagc tctacaccct ggattactac ctgaagatgg cagaggagat 2520
cgtcaagtct ggcgctcaca tcttggccat taaggatatg gctggtctgc ttcgcccagc 2580
tgcggtaacc aagctggtca ccgcactgcg ccgtgaattc gatctgccag tgcacgtgca 2640
cacccacgac accgcgggtg gccagctggc tacctacttt gctgcagctc aagctggtgc 2700
agatgctgtt gacggtgctt ccgcaccact gtctggcacc acctcccagc catccctgtc 2760
tgccattgtt gctgcattcg cgcacacccg tcgcgatacc ggtttgagcc tcgaggctgt 2820
ttctgacctc gagccgtact gggaagcagt gcgcggactg tacctgccat ttgagtctgg 2880
aaccccaggc ccaaccggtc gcgtctaccg ccacgaaatc ccaggcggac agctgtccaa 2940
cctgcgtgca caggccaccg cactgggcct tgctgatcgc ttcgagctca tcgaagacaa 3000
ctacgcagcc gttaatgaga tgctgggacg cccaaccaag gtcaccccat cctccaaggt 3060
tgttggcgac ctcgcactcc acctggttgg tgcgggtgta gatccagcag actttgctgc 3120
agacccacaa aagtacgaca tcccagactc tgtcatcgcg ttcctgcgcg gcgagcttgg 3180
taaccctcca ggtggctggc cagaaccact gcgcacccgc gcactggaag gccgctccga 3240
aggcaaggca cctctgacgg aagttcctga ggaagagcag gcgcacctcg acgctgatga 3300
ttccaaggaa cgtcgcaaca gcctcaaccg cctgctgttc ccgaagccaa ccgaagagtt 3360
cctcgagcac cgtcgccgct tcggcaacac ctctgcgctg gatgatcgtg aattcttcta 3420
cggcctggtc gaaggccgcg agactttgat ccgcctgcca gatgtgcgca ccccactgct 3480
tgttcgcctg gatgcgatct ctgagccaga cgataagggt atgcgcaatg ttgtggccaa 3540
cgtcaacggc cagatccgcc caatgcgtgt gcgtgaccgc tccgttgagt ctgtcaccgc 3600
aaccgcagaa aaggcagatt cctccaacaa gggccatgtt gctgcaccat tcgctggtgt 3660
tgtcactgtg actgttgctg aaggtgatga ggtcaaggct ggagatgcag tcgcaatcat 3720
cgaggctatg aagatggaag caacaatcac tgcttctgtt gacggcaaaa tcgatcgcgt 3780
tgtggttcct gctgcaacga aggtggaagg tggcgacttg atcgtcgtcg tttcctaata 3840
aatcgactac tcacatag 3858
<210> 8
<211> 3858
<212> DNA
<213> Artificial Sequence
<400> 8
cccatttaag gagtccgatt ttttccgagt cttagatttt gagaaaaccc aggattgctt 60
tgtgcactcc tgggttttca ctttgttaag cagttttggg gaaaagtgca aagtttgcaa 120
agtttagaaa tattttaaga ggtaagatgt ctgcaggtgg aagcgtttaa atgcgttaaa 180
cttggccaaa tgtggcaacc tttgcaaggt gaaaaactgg ggcggggtta gatcctgggg 240
ggtttatttc attcactttg gcttgaagtc gtgcaggtca gaggagtgtt gcccgaaaac 300
attgagagga aaacaaaaac cgatgtttga ttgggggaat cgggggttac gatactagga 360
cgaagtgact gctatcaccc ttggcggtct cttgttgaaa ggaacaatta ctctagtgtc 420
gactcacaca tcttcaacgc ttccagcatt caaaaagatc ttggtagcaa accgcggcga 480
aatcgcggtc cgtgctttcc gtgcagcact cgaaaccggt gcagccacgg tagctattta 540
cccccgtgaa gatcggggat cattccaccg ctcttttgct tctgaagctg tccgcattgg 600
tactgaaggc tcaccagtca aggcgtacct ggacatcgat gaaattatcg gtgcagctaa 660
aaaagttaaa gcagatgcta tttacccggg atatggcttc ctgtctgaaa atgcccagct 720
tgcccgcgag tgcgcggaaa acggcattac ttttattggc ccaaccccag aggttcttga 780
tctcaccggt gataagtctc gtgcggtaac cgccgcgaag aaggctggtc tgccagtttt 840
ggcggaatcc accccgagca aaaacatcga tgacatcgtt aaaagcgctg aaggccagac 900
ttaccccatc tttgtaaagg cagttgccgg tggtggcgga cgcggtatgc gctttgtttc 960
ttcacctgat gagctccgca aattggcaac agaagcatct cgtgaagctg aagcggcatt 1020
cggcgacggt tcggtatatg tcgaacgtgc tgtgattaac ccccagcaca ttgaagtgca 1080
gatccttggc gatcgcactg gagaagttgt acacctttat gaacgtgact gctcactgca 1140
gcgtcgtcac caaaaagttg tcgaaattgc gccagcacag catttggatc cagaactgcg 1200
tgatcgcatt tgcgcggatg cagtaaagtt ctgccgctcc attggttacc agggcgcgag 1260
aaccgtggaa ttcttggtcg atgaaaaggg caaccacgtt ttcatcgaaa tgaacccacg 1320
tatccaggtt gagcacaccg tgactgaaga agtcaccgag gtggacctgg tgaaggcgca 1380
gatgcgcttg gctgctggtg caaccttgaa ggaattgggt ctgacccaag ataagatcaa 1440
gacccacggt gcagcactgc agtgccgcat caccacggaa gatccaaaca acggcttccg 1500
cccagatacc ggaactatca ccgcgtaccg ctcaccaggc ggagctggcg ttcgtcttga 1560
cggtgcagct cagctcggtg gcgaaatcac cgcacacttt gactccatgc tggtgaaaat 1620
gacctgccgt ggttccgact ttgaaactgc tgttgctcgt gcacagcgcg cgttggctga 1680
gttcaccgtg tctggtgttg caaccaacat tggcttcttg cgcgcgctgc tgcgggaaga 1740
ggacttcact tccaagcgca tcgccaccgg atttatcggc gatcacccac acctccttca 1800
ggctccacct gcggatgatg agcagggacg catcctggat tacttggcag atgtcaccgt 1860
gaacaagcct catggtgtgc gtccaaagga tgttgcagca ccaatcgata agctgcccaa 1920
catcaaggat ctgccactgc cacgcggttc ccgtgaccgc ctgaagcagc ttggcccagc 1980
cgcgtttgct cgtgatctcc gtgagcagga cgcactggca gttactgata ccaccttccg 2040
cgatgcacac cagtctttgc ttgcgacccg agtccgctca ttcgcactga agcctgcggc 2100
agaggccgtc gcaaagctga ctcctgagct tttgtccgtg gaggcctggg gcggcgcgac 2160
ctacgatgtg gcgatgcgtt tcctctttga ggatccgtgg gacaggctcg acgagctgcg 2220
cgaggcgatg ccgaatgtaa acattcagat gctgcttcgc ggccgcaaca ccgtgggata 2280
caccccgtac ccagactccg tctgccgcgc gtttgttaag gaagctgcca cctccggcgt 2340
ggacatcttc cgcatcttcg acgcgcttaa cgacgtctcc cagatgcgtc cagcaatcga 2400
cgcagtcctg gagaccaaca ccgcggtagc cgaggtggct atggcttatt ctggtgatct 2460
ctctgatcca aatgaaaagc tctacaccct ggattactac ctgaagatgg cagaggagat 2520
cgtcaagtct ggcgctcaca tcttggccat taaggatatg gctggtctgc ttcgcccagc 2580
tgcggtaacc aagctggtca ccgcactgcg ccgtgaattc gatctgccag tgcacgtgca 2640
cacccacgac accgcgggtg gccagctggc tacctacttt gctgcagctc aagctggtgc 2700
agatgctgtt gacggtgctt ccgcaccact gtctggcacc acctcccagc catccctgtc 2760
tgccattgtt gctgcattcg cgcacacccg tcgcgatacc ggtttgagcc tcgaggctgt 2820
ttctgacctc gagccgtact gggaagcagt gcgcggactg tacctgccat ttgagtctgg 2880
aaccccaggc ccaaccggtc gcgtctaccg ccacgaaatc ccaggcggac agctgtccaa 2940
cctgcgtgca caggccaccg cactgggcct tgctgatcgc ttcgagctca tcgaagacaa 3000
ctacgcagcc gttaatgaga tgctgggacg cccaaccaag gtcaccccat cctccaaggt 3060
tgttggcgac ctcgcactcc acctggttgg tgcgggtgta gatccagcag actttgctgc 3120
agacccacaa aagtacgaca tcccagactc tgtcatcgcg ttcctgcgcg gcgagcttgg 3180
taaccctcca ggtggctggc cagaaccact gcgcacccgc gcactggaag gccgctccga 3240
aggcaaggca cctctgacgg aagttcctga ggaagagcag gcgcacctcg acgctgatga 3300
ttccaaggaa cgtcgcaaca gcctcaaccg cctgctgttc ccgaagccaa ccgaagagtt 3360
cctcgagcac cgtcgccgct tcggcaacac ctctgcgctg gatgatcgtg aattcttcta 3420
cggcctggtc gaaggccgcg agactttgat ccgcctgcca gatgtgcgca ccccactgct 3480
tgttcgcctg gatgcgatct ctgagccaga cgataagggt atgcgcaatg ttgtggccaa 3540
cgtcaacggc cagatccgcc caatgcgtgt gcgtgaccgc tccgttgagt ctgtcaccgc 3600
aaccgcagaa aaggcagatt cctccaacaa gggccatgtt gctgcaccat tcgctggtgt 3660
tgtcactgtg actgttgctg aaggtgatga ggtcaaggct ggagatgcag tcgcaatcat 3720
cgaggctatg aagatggaag caacaatcac tgcttctgtt gacggcaaaa tcgatcgcgt 3780
tgtggttcct gctgcaacga aggtggaagg tggcgacttg atcgtcgtcg tttcctaata 3840
aatcgactac tcacatag 3858
<210> 9
<211> 783
<212> DNA
<213> Artificial Sequence
<400> 9
tgatcgtcgt cgtttcctaa taaatcgact actcacatag ggtcgggcta gtcattctga 60
tcagcgaatt ccacgttcac atcgccaatt ccagagttca caaccagatt cagcattgga 120
ccttctagat cagcattgtg ggcggtgaga tctccaacat cacagcgcgc tgtgcccaca 180
ccggcggtac aacttaggct cacgggcaca tcatcgggca gggtgaccat gacttcgccg 240
atccctgagg tgatttggat gttttgttcc tgatccaatt gggtgaggtg gctgaaatcg 300
aggttcattt cacccacgcc agaggtgtag ctgctgagga gttcatcgtt ggtggggatg 360
agattgacat cgccgattcc agggtcgtct tcaaagtaga tgggatcgat atttgaaata 420
aacaggcctg cgagggcgct catgacaact ccggtaccaa ctacaccgcc gacaatccat 480
ggccacacat ggcgcttttt ctgaggcttt tgtggaggga cttgtacatc ccaggtgttg 540
tattggtttt gggcaagtgg atcccaatga ggcgcttcgg gggtttgttg cgcgaagggt 600
gcatagtagc cctcaacggg ggtgatagtg cttagatctg gttggggttg tgggtagaga 660
tcttcgtttt tcatggtggc atcctcagaa acagtgaatt cagtggtgag tagtccgcgg 720
ggtggaagtg gttgtttctt atgcagggta ccgagctcga attcgtaatc atggtcatag 780
ctg 783
<210> 10
<211> 1431
<212> DNA
<213> Artificial Sequence
<400> 10
cagtgccaag cttgcatgcc tgcaggtcga ctctagtcct ttggctacta acccacgcgc 60
caagatgcgt tccctgcgcc acggttttgt gaagctgttc tgccgccgta actctggcct 120
gatcatcggt ggtgtcgtgg tggcaccgac cgcgtctgag ctgatcctac cgatcgctgt 180
ggcagtgacc aaccgtctga cagttgctga tctggctgat accttcgcgg tgtacccatc 240
attgtcaggt tcgattactg aagcagcacg tcagctggtt caacatgatg atctaggcta 300
attttccgag tcttagattt tgagaaaacc caggattgct ttgtgcactc ctgggttttc 360
actttgttaa gcagttttgg ggaaaagtgc aaagtttgca aagtttagaa atattttaag 420
aggtaagatg tctgcaggtg gaagcgttta aatgcgttaa acttggccaa atgtggcaac 480
ctttgcaagg tgaaaaactg gggcggggtt agatcctggg gggtttattt cattcacttt 540
ggcttgaagt cgtgcaggtc agaggagtgt tgcccgaaaa cattgagagg aaaacaaaaa 600
ccgatgtttg attgggggaa tcgggggtta cgatactagg acgaagtgac tgctatcacc 660
cttggcggtc tcttgttgaa aggaacaatt actctaacct ttctgtaaaa agccccgctt 720
cttcctcatg gaggaggcgg ggctttttgg gtcaagatgg gagatgggtg agttggattt 780
ggtctgattc gacactttta aggactgaga tttgaagatc gagaccaagg ctcaaaggga 840
atccatgccg tcttgattta atactgcacc ccgctaatga aaatcattac tattaggtgt 900
catgatggac tatgcacacg attcctgctc accaactctg cgccgtgact tggaggtcac 960
cggacaagtc caacctgaga aagctgtgga tttagcagcg tcgtacgagg ggaaggttgc 1020
cgcaataacg aaggtgacct cctcaaatat ggagcatgcc atcacgcagg cctcaaaagc 1080
taaggaggtg gtggtgctca ttggtcactc cctgctgccc acatttcagg atttggaaaa 1140
agacattctg cactttcagg caggtaataa agggcgattt tctgtagcga ttgttgatcc 1200
tgatcgcagt gcagatgtgg ttgccagatt taggccaaaa cagattccgg tggcatacgt 1260
ggtgaaagat ggcgccagca ttgcggagtt caactcgctc aacaaggagc cggttgcaca 1320
atggcttgat cattttgtgt cgcgggaaac gatctccaat gaaaaagagg gggacgtcga 1380
taagcaaata gacgggtacc gagctcgaat tcgtaatcat ggtcatagct g 1431
Claims (8)
1.一种YH66_03760蛋白突变体,所述YH66_03760蛋白突变体的氨基酸序列如SEQ IDNo.4所示。
2.与权利要求1所述YH66_03760蛋白突变体相关的生物材料,所述生物材料为如下B1)至B4)中的任一种:
B1)编码权利要求1所述YH66_03760蛋白突变体的核酸分子;
B2)含有B1)所述核酸分子的表达盒;
B3)含有B1)所述核酸分子的重组载体;
B4)含有B1)所述核酸分子的重组谷氨酸棒杆菌。
3.根据权利要求2所述的生物材料,其特征在于:所述核酸分子的核苷酸序列如SEQ IDNo.3所示。
4.权利要求1所述的YH66_03760蛋白突变体或权利要求2或3中B1)至B3)所述的生物材料在如下X1)至X3)中任一种中的应用:
X1)提高谷氨酸棒杆菌中L-精氨酸产量;
X2)构建产L-精氨酸的谷氨酸棒杆菌;
X3)制备L-精氨酸。
5.一种提高谷氨酸棒杆菌中L-精氨酸产量的方法,所述方法为如下M1)或M2):
所述M1)包括如下步骤:将谷氨酸棒杆菌基因组中编码YH66_03760蛋白的基因替换为编码YH66_03760蛋白突变体的基因,实现谷氨酸棒杆菌中L-精氨酸产量的提高;
所述M2)包括如下步骤:提高谷氨酸棒杆菌中YH66_03760蛋白突变体的含量和/或活性,实现谷氨酸棒杆菌中L-精氨酸产量的提高;
所述YH66_03760蛋白的氨基酸序列如SEQ ID No.2所示;
所述YH66_03760蛋白突变体的氨基酸序列如SEQ ID No.4所示。
6.一种产L-精氨酸工程菌的构建方法,所述方法为如下N1)或N2):
所述N1)包括如下步骤:将谷氨酸棒杆菌基因组中编码YH66_03760蛋白的基因替换为编码YH66_03760蛋白突变体的基因,得到所述产L-精氨酸工程菌;
所述N2)包括如下步骤:提高谷氨酸棒杆菌中YH66_03760蛋白突变体的含量和/或活性,得到所述产L-精氨酸工程菌;
所述YH66_03760蛋白的氨基酸序列如SEQ ID No.2所示;
所述YH66_03760蛋白突变体的氨基酸序列如SEQ ID No.4所示。
7.根据权利要求6所述方法构建得到的产L-精氨酸工程菌在制备L-精氨酸中的应用。
8.一种制备L-精氨酸的方法,包括如下步骤:发酵培养根据权利要求6所述方法构建得到的产L-精氨酸工程菌,得到L-精氨酸。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111676693.2A CN114249803B (zh) | 2021-12-31 | 2021-12-31 | 一种高产精氨酸的工程菌及其构建方法与应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111676693.2A CN114249803B (zh) | 2021-12-31 | 2021-12-31 | 一种高产精氨酸的工程菌及其构建方法与应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114249803A CN114249803A (zh) | 2022-03-29 |
CN114249803B true CN114249803B (zh) | 2024-05-28 |
Family
ID=80799165
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111676693.2A Active CN114249803B (zh) | 2021-12-31 | 2021-12-31 | 一种高产精氨酸的工程菌及其构建方法与应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114249803B (zh) |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1370236A (zh) * | 1999-06-25 | 2002-09-18 | Basf公司 | 编码参与膜合成和膜转运的蛋白质的谷氨酸棒杆菌基因 |
-
2021
- 2021-12-31 CN CN202111676693.2A patent/CN114249803B/zh active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1370236A (zh) * | 1999-06-25 | 2002-09-18 | Basf公司 | 编码参与膜合成和膜转运的蛋白质的谷氨酸棒杆菌基因 |
Non-Patent Citations (1)
Title |
---|
李小曼等.γ-谷氨酰激酶基因敲除对产L-精氨酸钝齿棒杆菌 8-193生理代谢的影响.第51卷(第11期),第1476 - 1484页,尤其是图1,第1476页第2段. * |
Also Published As
Publication number | Publication date |
---|---|
CN114249803A (zh) | 2022-03-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113667682B (zh) | Yh66-rs11190基因突变体及其在制备l-缬氨酸中的应用 | |
CN114835783B (zh) | NCgl2747基因突变体及其在制备L-赖氨酸中的应用 | |
CN113683667B (zh) | Yh66-rs10865基因改造得到的工程菌及其在制备缬氨酸中的应用 | |
CN113683666B (zh) | Yh66-rs07020基因改造得到的工程菌及其在制备缬氨酸中的应用 | |
CN111471693B (zh) | 一种产赖氨酸的谷氨酸棒杆菌及其构建方法与应用 | |
CN110592109B (zh) | 一种spoT基因改造的重组菌株及其构建方法与应用 | |
CN114409751B (zh) | 一种yh66_04470基因突变的重组菌及其在制备精氨酸中的应用 | |
CN114181288B (zh) | 制备l-缬氨酸的方法及其所用的基因与该基因编码的蛋白质 | |
CN114605509B (zh) | Yh66_01475蛋白及其编码基因在调控细菌精氨酸产量中的应用 | |
CN114317583B (zh) | 构建产l-缬氨酸的重组微生物的方法及其所用核酸分子 | |
CN114349831B (zh) | aspA基因突变体、重组菌及制备L-缬氨酸的方法 | |
CN114249803B (zh) | 一种高产精氨酸的工程菌及其构建方法与应用 | |
CN115820706A (zh) | 半乳糖-1-磷酸尿苷酰基转移酶突变体及其在制备l-赖氨酸中的应用 | |
CN116063417A (zh) | Bbd29_14255基因突变体及其在制备l-谷氨酸中的应用 | |
CN114507273B (zh) | Yh66_07020蛋白及其相关生物材料在提高精氨酸产量中的应用 | |
CN114410615B (zh) | Yh66_00525蛋白及其编码基因在调控细菌精氨酸产量中的应用 | |
CN114907459B (zh) | 一种高产精氨酸的工程菌及其构建方法与应用 | |
CN114751966B (zh) | Yh66_04585蛋白及其相关生物材料在提高精氨酸产量中的应用 | |
CN114540399A (zh) | 一种制备l-缬氨酸的方法及其所用基因突变体和生物材料 | |
CN114560918B (zh) | Yh66_14275蛋白或其突变体在制备l-精氨酸中的应用 | |
CN114292315B (zh) | ppc突变体及其在制备L-缬氨酸中的应用 | |
CN115073569B (zh) | 一种yh66_09125基因突变的重组菌及其在制备精氨酸中的应用 | |
CN114277069B (zh) | 制备l-缬氨酸的方法及其所用生物材料 | |
CN114410614B (zh) | Yh66_05415蛋白或其突变体在制备l-精氨酸中的应用 | |
CN114315998B (zh) | Cey17_rs00300基因突变体及其在制备l-缬氨酸中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |