CN116790569B - 丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用 - Google Patents
丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用 Download PDFInfo
- Publication number
- CN116790569B CN116790569B CN202210361627.4A CN202210361627A CN116790569B CN 116790569 B CN116790569 B CN 116790569B CN 202210361627 A CN202210361627 A CN 202210361627A CN 116790569 B CN116790569 B CN 116790569B
- Authority
- CN
- China
- Prior art keywords
- ala
- leu
- gly
- val
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000002360 preparation method Methods 0.000 title claims description 9
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 title description 24
- 229940107700 pyruvic acid Drugs 0.000 title description 12
- 108010011939 Pyruvate Decarboxylase Proteins 0.000 claims abstract description 162
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 6
- HUMNYLRZRPPJDN-UHFFFAOYSA-N benzaldehyde Chemical compound O=CC1=CC=CC=C1 HUMNYLRZRPPJDN-UHFFFAOYSA-N 0.000 claims description 64
- QNGNSVIICDLXHT-UHFFFAOYSA-N para-ethylbenzaldehyde Natural products CCC1=CC=C(C=O)C=C1 QNGNSVIICDLXHT-UHFFFAOYSA-N 0.000 claims description 32
- 230000035772 mutation Effects 0.000 claims description 30
- 150000001875 compounds Chemical class 0.000 claims description 27
- 238000003259 recombinant expression Methods 0.000 claims description 24
- 238000000034 method Methods 0.000 claims description 22
- 102000039446 nucleic acids Human genes 0.000 claims description 18
- 108020004707 nucleic acids Proteins 0.000 claims description 18
- 150000007523 nucleic acids Chemical class 0.000 claims description 18
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 claims description 18
- 241000894006 Bacteria Species 0.000 claims description 15
- 239000013604 expression vector Substances 0.000 claims description 15
- 239000002773 nucleotide Substances 0.000 claims description 14
- 125000003729 nucleotide group Chemical group 0.000 claims description 14
- 239000013598 vector Substances 0.000 claims description 11
- 125000003118 aryl group Chemical group 0.000 claims description 10
- 230000000694 effects Effects 0.000 claims description 9
- 229940054269 sodium pyruvate Drugs 0.000 claims description 9
- 125000004435 hydrogen atom Chemical group [H]* 0.000 claims description 8
- 125000001072 heteroaryl group Chemical group 0.000 claims description 7
- 102220627580 Alsin_L38M_mutation Human genes 0.000 claims description 5
- 241000206602 Eukaryota Species 0.000 claims description 4
- 229920001184 polypeptide Polymers 0.000 claims description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 4
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 4
- 239000003054 catalyst Substances 0.000 claims description 3
- 238000012258 culturing Methods 0.000 claims description 3
- 125000002887 hydroxy group Chemical group [H]O* 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 229910021645 metal ion Inorganic materials 0.000 claims description 3
- 239000013603 viral vector Substances 0.000 claims description 2
- 239000013601 cosmid vector Substances 0.000 claims 1
- 239000013600 plasmid vector Substances 0.000 claims 1
- 238000006243 chemical reaction Methods 0.000 abstract description 112
- 239000000758 substrate Substances 0.000 abstract description 48
- 230000003287 optical effect Effects 0.000 abstract description 11
- 230000003197 catalytic effect Effects 0.000 abstract description 9
- 108010079364 N-glycylalanine Proteins 0.000 description 80
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 80
- 150000001413 amino acids Chemical group 0.000 description 63
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 48
- 235000001014 amino acid Nutrition 0.000 description 41
- 239000000243 solution Substances 0.000 description 41
- 229940024606 amino acid Drugs 0.000 description 40
- 230000000052 comparative effect Effects 0.000 description 35
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 32
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 32
- 108010005233 alanylglutamic acid Proteins 0.000 description 32
- 108010049041 glutamylalanine Proteins 0.000 description 32
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 32
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 31
- 108010089804 glycyl-threonine Proteins 0.000 description 29
- 238000004128 high performance liquid chromatography Methods 0.000 description 29
- 108020004414 DNA Proteins 0.000 description 24
- 102000053602 DNA Human genes 0.000 description 24
- 102000004190 Enzymes Human genes 0.000 description 23
- 108090000790 Enzymes Proteins 0.000 description 23
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 20
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 18
- 108010047495 alanylglycine Proteins 0.000 description 17
- 108090000623 proteins and genes Proteins 0.000 description 17
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 16
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 16
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 16
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 16
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 16
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 16
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 16
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 16
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 16
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 16
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 16
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 16
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 16
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 16
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 16
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 16
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 16
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 16
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 16
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 16
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 16
- NYDIVDKTULRINZ-AVGNSLFASA-N Arg-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NYDIVDKTULRINZ-AVGNSLFASA-N 0.000 description 16
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 16
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 16
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 16
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 16
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 16
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 16
- RRVBEKYEFMCDIF-WHFBIAKZSA-N Asn-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)C(=O)N RRVBEKYEFMCDIF-WHFBIAKZSA-N 0.000 description 16
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 16
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 16
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 16
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 16
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 16
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 16
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 16
- NJLLRXWFPQQPHV-SRVKXCTJSA-N Asp-Tyr-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O NJLLRXWFPQQPHV-SRVKXCTJSA-N 0.000 description 16
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 16
- RGNMNWULPAYDAH-JSGCOSHPSA-N Gln-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N RGNMNWULPAYDAH-JSGCOSHPSA-N 0.000 description 16
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 16
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 16
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 16
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 16
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 16
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 16
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 16
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 16
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 16
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 16
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 16
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 16
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 16
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 16
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 16
- LSQHWKPPOFDHHZ-YUMQZZPRSA-N His-Asp-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LSQHWKPPOFDHHZ-YUMQZZPRSA-N 0.000 description 16
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 16
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 16
- NDKSHNQINMRKHT-PEXQALLHSA-N His-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N NDKSHNQINMRKHT-PEXQALLHSA-N 0.000 description 16
- WSWAUVHXQREQQG-JYJNAYRXSA-N His-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O WSWAUVHXQREQQG-JYJNAYRXSA-N 0.000 description 16
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 16
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 16
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 16
- QRTVJGKXFSYJGW-KBIXCLLPSA-N Ile-Glu-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N QRTVJGKXFSYJGW-KBIXCLLPSA-N 0.000 description 16
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 16
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 16
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 16
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 16
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 16
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 16
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 16
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 16
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 16
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 16
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 16
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 16
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 16
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 16
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 16
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 16
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 16
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 16
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 16
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 16
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 16
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 16
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 16
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 16
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 16
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 16
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 16
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 16
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 16
- ZGVYWHODYWRPLK-GUBZILKMSA-N Met-Pro-Cys Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O ZGVYWHODYWRPLK-GUBZILKMSA-N 0.000 description 16
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 16
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 16
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 16
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 16
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 16
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 16
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 16
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 16
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 16
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 16
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 16
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 16
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 16
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 16
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 16
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 16
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 16
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 16
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 16
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 16
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 16
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 16
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 16
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 16
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 16
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 16
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 16
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 16
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 16
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 16
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 16
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 16
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 16
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 16
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 16
- DDHFMBDACJYSKW-AQZXSJQPSA-N Trp-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DDHFMBDACJYSKW-AQZXSJQPSA-N 0.000 description 16
- PZXUIGWOEWWFQM-SRVKXCTJSA-N Tyr-Asn-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O PZXUIGWOEWWFQM-SRVKXCTJSA-N 0.000 description 16
- CGDZGRLRXPNCOC-SRVKXCTJSA-N Tyr-Cys-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CGDZGRLRXPNCOC-SRVKXCTJSA-N 0.000 description 16
- FMOSEWZYZPMJAL-KKUMJFAQSA-N Tyr-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N FMOSEWZYZPMJAL-KKUMJFAQSA-N 0.000 description 16
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 16
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 16
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 16
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 16
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 16
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 16
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 16
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 16
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 16
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 16
- 108010093581 aspartyl-proline Proteins 0.000 description 16
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 16
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 16
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 16
- 108010025306 histidylleucine Proteins 0.000 description 16
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 16
- 108010000761 leucylarginine Proteins 0.000 description 16
- 108010057821 leucylproline Proteins 0.000 description 16
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 16
- 108010064235 lysylglycine Proteins 0.000 description 16
- 108010054155 lysyllysine Proteins 0.000 description 16
- 108010056582 methionylglutamic acid Proteins 0.000 description 16
- 108010024607 phenylalanylalanine Proteins 0.000 description 16
- 108010053725 prolylvaline Proteins 0.000 description 16
- 108010071207 serylmethionine Proteins 0.000 description 16
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 16
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- 108010038745 tryptophylglycine Proteins 0.000 description 16
- HFBFSOAKPUZCCO-ZLUOBGJFSA-N Ala-Cys-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N HFBFSOAKPUZCCO-ZLUOBGJFSA-N 0.000 description 15
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 15
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 15
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 15
- HEMHJVSKTPXQMS-UHFFFAOYSA-M Sodium hydroxide Chemical compound [OH-].[Na+] HEMHJVSKTPXQMS-UHFFFAOYSA-M 0.000 description 15
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 15
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 15
- 108010044940 alanylglutamine Proteins 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 15
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 14
- 229960002363 thiamine pyrophosphate Drugs 0.000 description 14
- 235000008170 thiamine pyrophosphate Nutrition 0.000 description 14
- 239000011678 thiamine pyrophosphate Substances 0.000 description 14
- YXVCLPJQTZXJLH-UHFFFAOYSA-N thiamine(1+) diphosphate chloride Chemical compound [Cl-].CC1=C(CCOP(O)(=O)OP(O)(O)=O)SC=[N+]1CC1=CN=C(C)N=C1N YXVCLPJQTZXJLH-UHFFFAOYSA-N 0.000 description 14
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 13
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 12
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 12
- 239000012634 fragment Substances 0.000 description 12
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 12
- 239000007788 liquid Substances 0.000 description 11
- 108090000489 Carboxy-Lyases Proteins 0.000 description 10
- 102000004031 Carboxy-Lyases Human genes 0.000 description 10
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 10
- 210000004027 cell Anatomy 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- 238000003760 magnetic stirring Methods 0.000 description 8
- 230000001580 bacterial effect Effects 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 238000006555 catalytic reaction Methods 0.000 description 7
- 239000011777 magnesium Substances 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 102000003960 Ligases Human genes 0.000 description 6
- 108090000364 Ligases Proteins 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 239000001963 growth medium Substances 0.000 description 6
- 238000010438 heat treatment Methods 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 238000012163 sequencing technique Methods 0.000 description 6
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 5
- 125000004432 carbon atom Chemical group C* 0.000 description 5
- 238000002156 mixing Methods 0.000 description 5
- 238000003756 stirring Methods 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- 241000588902 Zymomonas mobilis Species 0.000 description 4
- 239000007853 buffer solution Substances 0.000 description 4
- 238000005520 cutting process Methods 0.000 description 4
- 238000001976 enzyme digestion Methods 0.000 description 4
- 239000000499 gel Substances 0.000 description 4
- 230000001965 increasing effect Effects 0.000 description 4
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- 239000012880 LB liquid culture medium Substances 0.000 description 3
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 3
- 239000004472 Lysine Substances 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 238000011084 recovery Methods 0.000 description 3
- 210000001082 somatic cell Anatomy 0.000 description 3
- 230000002194 synthesizing effect Effects 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 2
- URLKBWYHVLBVBO-UHFFFAOYSA-N Para-Xylene Chemical group CC1=CC=C(C)C=C1 URLKBWYHVLBVBO-UHFFFAOYSA-N 0.000 description 2
- 239000001888 Peptone Substances 0.000 description 2
- 108010080698 Peptones Proteins 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 238000000246 agarose gel electrophoresis Methods 0.000 description 2
- 125000000217 alkyl group Chemical group 0.000 description 2
- -1 and e.g. Chemical group 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000007036 catalytic synthesis reaction Methods 0.000 description 2
- 239000012295 chemical reaction liquid Substances 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- ZUOUZKKEUPVFJK-UHFFFAOYSA-N diphenyl Chemical compound C1=CC=CC=C1C1=CC=CC=C1 ZUOUZKKEUPVFJK-UHFFFAOYSA-N 0.000 description 2
- 239000012154 double-distilled water Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000007852 inverse PCR Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 239000012452 mother liquor Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 238000013386 optimize process Methods 0.000 description 2
- 235000019319 peptone Nutrition 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 239000002244 precipitate Substances 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000009257 reactivity Effects 0.000 description 2
- 229920002477 rna polymer Polymers 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000013112 stability test Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 125000001637 1-naphthyl group Chemical group [H]C1=C([H])C([H])=C2C(*)=C([H])C([H])=C([H])C2=C1[H] 0.000 description 1
- 125000001622 2-naphthyl group Chemical group [H]C1=C([H])C([H])=C2C([H])=C(*)C([H])=C([H])C2=C1[H] 0.000 description 1
- DAQHMCWYXJEOCG-UHFFFAOYSA-N 2-oxopropanoic acid;sodium Chemical compound [Na].CC(=O)C(O)=O DAQHMCWYXJEOCG-UHFFFAOYSA-N 0.000 description 1
- HHDDCCUIIUWNGJ-UHFFFAOYSA-N 3-hydroxypyruvic acid Chemical compound OCC(=O)C(O)=O HHDDCCUIIUWNGJ-UHFFFAOYSA-N 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000193403 Clostridium Species 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000179039 Paenibacillus Species 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000003929 Transaminases Human genes 0.000 description 1
- 108090000340 Transaminases Proteins 0.000 description 1
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 1
- RQOMPQGUGBILAG-AVGNSLFASA-N Val-Met-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQOMPQGUGBILAG-AVGNSLFASA-N 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000008476 aike Substances 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 239000004305 biphenyl Substances 0.000 description 1
- 235000010290 biphenyl Nutrition 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 239000005515 coenzyme Substances 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 238000006114 decarboxylation reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 239000008367 deionised water Substances 0.000 description 1
- 229910021641 deionized water Inorganic materials 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 125000005843 halogen group Chemical group 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 125000005842 heteroatom Chemical group 0.000 description 1
- 150000002391 heterocyclic compounds Chemical class 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- 229940039696 lactobacillus Drugs 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 1
- 235000019341 magnesium sulphate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 229930182817 methionine Natural products 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 229910000402 monopotassium phosphate Inorganic materials 0.000 description 1
- 235000019796 monopotassium phosphate Nutrition 0.000 description 1
- 229930014626 natural product Natural products 0.000 description 1
- 230000002572 peristaltic effect Effects 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- GNSKLFRGEWLPPA-UHFFFAOYSA-M potassium dihydrogen phosphate Chemical compound [K+].OP(O)([O-])=O GNSKLFRGEWLPPA-UHFFFAOYSA-M 0.000 description 1
- LWIHDJKSTIGBAC-UHFFFAOYSA-K potassium phosphate Substances [K+].[K+].[K+].[O-]P([O-])([O-])=O LWIHDJKSTIGBAC-UHFFFAOYSA-K 0.000 description 1
- 238000012257 pre-denaturation Methods 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 235000018102 proteins Nutrition 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 230000035484 reaction time Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- AYEKOFBPNLCAJY-UHFFFAOYSA-N thiamine(1+) diphosphate(1-) Chemical compound CC1=C(CCO[P@](O)(=O)OP(O)([O-])=O)SC=[N+]1CC1=CN=C(C)N=C1N AYEKOFBPNLCAJY-UHFFFAOYSA-N 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 229910021642 ultra pure water Inorganic materials 0.000 description 1
- 239000012498 ultrapure water Substances 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 229930195727 α-lactose Natural products 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/88—Lyases (4.)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/24—Preparation of oxygen-containing organic compounds containing a carbonyl group
- C12P7/26—Ketones
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y401/00—Carbon-carbon lyases (4.1)
- C12Y401/01—Carboxy-lyases (4.1.1)
- C12Y401/01001—Pyruvate decarboxylase (4.1.1.1)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本申请公开了一种丙酮酸脱羧酶突变体及其在制备α‑羟基酮类化合物中的应用,所述丙酮酸脱羧酶突变体是在如SEQ ID NO:1所示氨基酸序列的野生型PDC的基础上发生部分位点突变而形成的,所述丙酮酸脱羧酶突变体能够应用于制备α‑羟基酮类化合物,底物转化率可达99.7%,目标产物的e.e值可达99.8%,相较于具有如SEQ ID NO:1所示氨基酸序列的野生型PDC,所述丙酮酸脱羧酶突变体具有催化活性高、光学选择性更佳、底物耐受性更强以及热稳定性理想的优点。
Description
技术领域
本申请涉及酶工程及生物制药领域,具体涉及一种丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用。
背景技术
丙酮酸脱羧酶(pyruvate decarboxylase,PDC)属于焦磷酸硫胺素(thiamindiphosphate,TPP)依依赖性的非氧化酶,PDC是一种胞内酶,广泛存在于动植物和微生物体内。PDC能够应用于α-羟基酮类化合物的合成。
α-羟基酮类化合物是一类具有重要应用价值的化合物,α-羟基酮类化合物能够作为具有生物活性的天然产物和医药中间体,也可作为紫外光固化涂料中的光引发剂,还可以衍生转化为杂环化合物。采用PDC催化合成α-羟基酮类化合物具有反应条件温和、原料低廉、产物光学纯度高的优点,具有巨大的经济价值和环保意义。但是,采用野生型PDC催化合成α-羟基酮类化合物具有催化活性低、酶稳定性差和光学选择性不足的缺点,从而限制了PDC在α-羟基酮类化合物合成技术领域的工业应用。
因此,如何改造野生型PDC以提高其催化活性和稳定性,对PDC酶法制备α-羟基酮类化合物的发展具有重要意义。
发明内容
本申请提供了一种丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用,以改善现有技术中野生型丙酮酸脱羧酶催化合成α-羟基酮类化合物存在的催化活性低、酶稳定性差和光学选择性不足的问题。
第一方面,本申请提供了一种丙酮酸脱羧酶突变体,所述丙酮酸脱羧酶突变体的氨基酸序列为与SEQ ID NO:1所示的氨基酸序列至少具有80%、至少具有85%、至少具有90%、至少具有95%、至少具有96%、至少具有97%、至少具有98%或至少具有99%相似性的氨基酸序列,且所述丙酮酸脱羧酶突变体的氨基酸序列是由SEQ ID NO:1所示的氨基酸序列发生一个或多个点突变而获得的氨基酸序列;所述丙酮酸脱羧酶突变体具有丙酮酸脱羧酶活性。
进一步地,发生所述点突变的氨基酸包括SEQ ID NO:1中第7位氨基酸T、第8位氨基酸T、第38位氨基酸L、第169位氨基酸N、第246位氨基酸A、第294位氨基酸G、第392位氨基酸W、第450位氨基酸V、第452位氨基酸Q、第472位氨基酸I、第475位氨基酸M、第476位氨基酸I、第549位氨基酸V、第551位氨基酸W、第553位氨基酸K或第555位氨基酸V中的至少一种。
进一步地,发生所述点突变的方式包括T7R、T8W、L38M、N169Y、A246M、G294K、W392A、V450E、Q452G、I472M、M475K、I476L、V549Y、W551D、K553Y或V555P中的至少一种。
进一步地,发生所述点突变的方式为以下任意一种:
(1)W392I;
(2)I472M;
(3)W551D;
(4)G294K;
(5)I476L;
(6)V555P;
(7)M475K;
(8)W551D和M475K;
(9)V549Y和W392A;
(10)T8W和A246M;
(11)N169Y和V450E;
(12)M475K、L38M和W551D;
(13)T7R、L38M和Q452G;
(14)T7R、K553Y和M475K;或
(15)T7R、L38M、W551D和M475K。
进一步地,所述丙酮酸脱羧酶突变体的氨基酸序列选自如SEQ ID NO:3、SEQ IDNO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ IDNO:17、SEQ ID NO:19、SEQ ID NO:21、SEQ ID NO:23、SEQ ID NO:25、SEQ ID NO:27、SEQ IDNO:29或SEQ ID NO:31任一所示的氨基酸序列。
第二方面,本申请提供了一种核酸分子,所述核酸分子包括用于编码如第一方面中任意一种所述的丙酮酸脱羧酶突变体的核苷酸序列。
进一步地,所述核苷酸序列选自如SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQID NO:22、SEQ ID NO:24、SEQ ID NO:26、SEQ ID NO:28、SEQ ID NO:30或SEQ ID NO:32任一所示的核苷酸序列。
第三方面,本申请提供了一种重组表达载体,所述重组表达载体包括载体,以及如第二方面中任意一种所述的核酸分子;所述载体选自质粒、粘粒、噬菌体或病毒载体。
第四方面,本申请提供了一种重组表达转化体,所述重组表达转化体包括宿主,以及引入至所述宿主体内的如第二方面中任意一种所述的核酸分子、或如第三方面中所述的重组表达载体;所述宿主选自真核生物或原核生物。
第五方面,本申请提供了一种丙酮酸脱羧酶突变体的制备方法,通过培养第四方面中所述的重组表达转化体,以及从培养物中获得所述的丙酮酸脱羧酶突变体。
第六方面,本申请提供了一种手性α-羟基酮类化合物的制备方法,以如第一方面中任意一种所述的丙酮酸脱羧酶突变体或如第五方面中所述的制备方法制得的丙酮酸脱羧酶突变体作为催化剂,第一化合物和第二化合物接触反应生成α-羟基酮类化合物;其中,第一化合物具有下面通式(Ⅰ)所示的结构:
在通式(Ⅰ),R1选自氢原子或羟基,X选自氢原子或一价金属离子。
第二化合物具有下面通式(Ⅱ)所示的结构:
在通式(Ⅱ)中,R2选自芳基或杂芳基。
进一步地,所述第一化合物选自丙酮酸、羟基丙酮酸或丙酮酸钠中的至少一种,所述第二化合物选自苯甲醛。
本申请提供了一种丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用,具有如下技术效果:
本申请中丙酮酸脱羧酶突变体是在如SEQ ID NO:1所示氨基酸序列的野生型PDC的基础上发生部分位点突变而形成的,所述丙酮酸脱羧酶突变体具有丙酮酸脱羧酶活性,所述丙酮酸脱羧酶突变体能够应用于制备α-羟基酮类化合物,以催化合成式(Ⅳ)所示结构的化合物为例,底物苯甲醛的转化率可达99.7%,目标产物的e.e值可达99.8%,相较于具有如SEQ ID NO:1所示氨基酸序列的野生型PDC,所述丙酮酸脱羧酶突变体具有催化活性高和光学选择性更佳的优点。此外,通过底物耐受性实验和热稳定性实验可知,所述丙酮酸脱羧酶突变体对底物的耐受性能和热稳定性能明显优于所述野生型PDC。
附图说明
下面结合附图,通过对本申请的具体实施方式详细描述,将使本申请的技术方案及其有益效果显而易见。
图1为实验例1中包含丙酮酸脱羧酶突变体M3的反应体系反应2h获得的反应液的HPLC图谱;
图2为实验例1中包含丙酮酸脱羧酶突变体M15的反应体系反应2h获得的反应液的HPLC图谱;
图3为实验例2中包含丙酮酸脱羧酶突变体M3的反应体系反应30min获得的反应液的HPLC图谱;
图4为实验例2中包含丙酮酸脱羧酶突变体M3的反应体系反应24h获得的反应液的HPLC图谱;
图5为实验例2中包含丙酮酸脱羧酶突变体M15的反应体系反应30min获得的反应液的HPLC图谱;
图6为实验例2中包含丙酮酸脱羧酶突变体M15的反应体系反应24h获得的反应液的HPLC图谱;
图7为实验例3中向包含丙酮酸脱羧酶突变体M3的预反应体系流加苯甲醛1h获得的反应液的HPLC图谱;
图8为实验例3中向包含丙酮酸脱羧酶突变体M3的预反应体系流加苯甲醛24h获得的反应液的HPLC图谱;
图9为实验例3中向包含丙酮酸脱羧酶突变体M15的预反应体系流加苯甲醛1h获得的反应液的HPLC图谱;
图10为实验例3中向包含丙酮酸脱羧酶突变体M15的预反应体系流加苯甲醛24h获得的反应液的HPLC图谱。
具体实施方式
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。
本申请实施例提供了一种丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用,丙酮酸脱羧酶突变体的氨基酸序列与SEQ ID NO:1所示的氨基酸序列至少具有80%、至少具有85%、至少具有90%、至少具有95%、至少具有96%、至少具有97%、至少具有98%或至少具有99%相似性的氨基酸序列,且丙酮酸脱羧酶突变体的氨基酸序列是由SEQ ID NO:1所示的氨基酸序列发生一个或多个点突变而获得的氨基酸序列,丙酮酸脱羧酶突变体具有丙酮酸脱羧酶活性。
如本申请所用,“相似性”是指两个氨基酸序列或两个核苷酸序列之间的相关性,例如:丙酮酸脱羧酶突变体的氨基酸序列与SEQ ID NO:1所示的氨基酸序列之间的相关性。在本申请实施例中,至少具有80%以上相似性可以理解为80%、81%、82%、83%、84%、85%、86%、87%、88%、89%、90%、91%、92%、93%、94%、95%、96%、97%、98%或99%的序列相似性,对应相似性的数值为整数;也可以进一步理解为80.1%、81.2%、82.3%、83.4%、84.5%、85.6%、86.7%、87.8%、88.9%、89.8%、90.3%、91.7%、92.2%、93.5%、94.8%、95.9%、96.6%、97.5%、98.4%或99.9%,但小于100%的序列相似性,对应相似性的数值为小数。
如本申请所用,“点突变”是指在SEQ ID NO:1所示的氨基酸序列中特定位点氨基酸的取代、缺失或插入。
进一步地,发生点突变的氨基酸包括SEQ ID NO:1中第7位氨基酸T、第8位氨基酸T、第38位氨基酸L、第169位氨基酸N、第246位氨基酸A、第294位氨基酸G、第392位氨基酸W、第450位氨基酸V、第452位氨基酸Q、第472位氨基酸I、第475位氨基酸M、第476位氨基酸I、第549位氨基酸V、第551位氨基酸W、第553位氨基酸K或第555位氨基酸V中的至少一种。
在本申请的一些实施例中,发生点突变的方式包括T7R、T8W、L38M、N169Y、A246M、G294K、W392A、V450E、Q452G、I472M、M475K、I476L、V549Y、W551D、K553Y或V555P中的至少一种。
如本申请所用,“氨基酸”由单字母或三字母代码表示,具有如下含义:A:Ala(丙氨酸);R:Arg(精氨酸);N:Asn(天冬酰胺);D:Asp(天冬氨酸);C:Cys(半胱氨酸);Q:Gln(谷氨酰胺);E:Glu(谷氨酸);G:Gly(甘氨酸);H:His(组氨酸);I:Ile(异亮氨酸);L:Leu(亮氨酸);K:Lys(赖氨酸);M:Met(甲硫氨酸);F:Phe(苯丙氨酸);P:Pro(脯氨酸);S:Ser(丝氨酸);T:Thr(苏氨酸);W:Trp(色氨酸);Y:Tyr(酪氨酸);V:Val(缬氨酸)。
对于氨基酸取代的点突变方式,命名方法为:原始氨基酸,原始氨基酸的位点,取代氨基酸,例如:W551D表示在SEQ ID NO:1所示氨基酸序列中第551位点处采用天冬氨酸取代原始的色氨酸;M475K表示在SEQ ID NO:1所示氨基酸序列中第475位点处采用赖氨酸取代原始的赖氨酸。
在本申请的一些实施例中,所述丙酮酸脱羧酶突变体的氨基酸序列选自如SEQ IDNO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21、SEQ ID NO:23、SEQ ID NO:25、SEQ IDNO:27、SEQ ID NO:29或SEQ ID NO:31任一所示的氨基酸序列。
本申请实施例还提供了一种核酸分子,所述核酸分子包括用于编码本申请实施例中任意一种所述的丙酮酸脱羧酶突变体的核苷酸序列。
如本申请所用,“核酸分子”是指由多个核苷酸聚合而成的生物大分子化合物,可以是通过聚合酶链式反应(PCR)或通过体外翻译产生的脱氧核糖核酸(DNA)片段、核糖核酸(RNA)片段或寡核苷酸片段中的任意一种,以及通过连接、切割、内切核酸酶作用或外切核酸酶作用中的任意一种或多种产生的片段,可以是单链的或双链的。在本申请实施例中,核酸分子包括但不限于用于编码丙酮酸脱羧酶突变体的多核苷酸。
在本申请的一些实施例中,用于编码丙酮酸脱羧酶突变体的核苷酸序列选自如SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22、SEQ ID NO:24、SEQ ID NO:26、SEQ ID NO:28、SEQ ID NO:30或SEQ ID NO:32任一所示的核苷酸序列。
为了便于理解,下表1示出了本申请实施例中涉及的丙酮酸脱羧酶突变体的具体信息,
所述丙酮酸脱羧酶突变体相对于氨基酸序列为SEQ ID NO:1的野生型丙酮酸脱羧酶存在一个或多个突变位点,表1中提供的丙酮酸脱羧酶突变体仅作为示例:
表1本申请实施例中涉及的丙酮酸脱羧酶突变体的具体信息一览表
本申请实施例还提供了一种重组表达载体,所述重组表达载体包括载体,以及如本申请实施例中任意一种所述的核酸分子。
如本申请所用,“载体”是指能够运输另一种核酸的核酸分子,例如可以是质粒、病毒、粘粒、噬菌体等。在本申请的一个实施例中,未插入外源基因的pET24a可以作为载体。
如本申请所用,“重组表达载体”是指一种DNA构建体,其含有与合适的控制序列可操作地连接的核酸分子,所述控制序列能够实现核酸分子在合适的宿主中表达。在本申请实施例中,重组表达载体是指采用分子生物学技术将用于编码丙酮酸脱羧酶突变体的核酸分子插入至载体上,由此形成的DNA构建体。
本申请实施例还提供了一种重组表达转化体,所述重组表达转化体包括宿主,以及引入至宿主体内的如本申请实施例中任意一种所述的核酸分子或重组表达载体。
如本申请所用,“重组表达转化体”是指接受了外源遗传物质(如:质粒DNA)而使遗传特性发生变化的宿主。在本申请实施例中,接受了外源遗传物质(用于编码丙酮酸脱羧酶突变体的核酸分子或重组表达载体)的工程菌株属于重组表达转化体。在本申请的一个实施例中,T1重组工程菌至T15重组工程菌均属于重组表达转化体。
如本申请所用,“宿主”是指用于表达外源基因而产生蛋白的一类生物体,宿主例如可以是真核生物、原核生物、病毒等,其中,作为宿主的真核生物包括但不限于哺乳动物细胞、酵母、真菌、昆虫细胞和植物细胞,作为宿主的原核生物包括但不限于芽孢杆菌属、梭菌属、乳酸菌属、链霉菌属、葡萄球菌属、大肠杆菌、假单胞菌属和类芽孢杆菌属。在本申请的一个实施例中,用于表达丙酮酸脱羧酶突变体的宿主为大肠杆菌BL21(DE3)。
本申请实施例还提供了一种丙酮酸脱羧酶突变体的制备方法,具体是:通过培养本申请实施例中任意一种所述的重组表达转化体,以及从培养物中获得丙酮酸脱羧酶突变体。
本申请实施例还提供了一种α-羟基酮类化合物的制备方法,具体是:以本申请实施例中任意一种所述的丙酮酸脱羧酶突变体或任意一种所述的制备方法制得的丙酮酸脱羧酶突变体作为催化剂,第一化合物和第二化合物接触反应生成α-羟基酮类化合物,其中,第一化合物具有下面通式(Ⅰ)所示的结构:
在通式(Ⅰ),R1选自氢原子或羟基,X选自氢原子或一价金属离子。
第二化合物具有下面通式(Ⅱ)所示的结构:
在通式(Ⅱ)中,R2选自芳基或杂芳基。
如本申请所用,“芳基”既包括未发生取代的芳基,又包括一个或多个氢原子任选地被其他基团取代的芳基,其他基团例如可以是卤素原子或烷基,允许存在多重取代度;“未发生取代的芳基”是指芳香环上仅包含碳原子的芳香基团,包括但不限于是苯基、1-萘基、2-萘基或联苯基。
如本申请所用,“杂芳基”是指芳基中一个或多个碳原子独立地被一个或多个杂原子(例如N、O、P和/或S)替代,例如杂芳基具有3至20个碳原子,又如杂芳基具有5至15个碳原子,又如杂芳基具有5至9个碳原子,杂芳基可以是未取代的,也可以是其上的一个氢原子或多个氢原子任选地被其他基团取代,其他基团例如可以是烷基、卤素等,允许存在多重取代度。
可以理解的是,用于制备α-羟基酮类化合物的丙酮酸脱羧酶突变体可以是携带有丙酮酸脱羧酶突变体的编码基因的重组工程菌的培养物(包括培养基),也可以是通过将所述培养物分离纯化后获得的菌体细胞、菌体细胞提取物、菌体细胞破碎物或提纯的丙酮酸脱羧酶突变体。
在本申请的一些实施例中,在所述丙酮酸脱羧酶突变体催化第一化合物和第二化合物接触反应生成α-羟基酮类化合物的反应体系中,丙酮酸脱羧酶突变体:第一化合物:第二化合物的质量比值为1:(0.7~10):(4.5~12)。若丙酮酸脱羧酶突变体的添加量过少,则对第一化合物和第二化合物的催化反应效果有限,从而出现底物过剩的现象;若丙酮酸脱羧酶突变体的添加量过多,则会造成丙酮酸脱羧酶突变体的浪费,提高了生产成本,并且为后续产物的分离纯化增加了难度。
进一步地,所述反应体系还包括焦磷酸硫胺素和Mg2+,其中,焦磷酸硫胺素用作第一化合物脱羧基反应的辅酶,Mg2+用于提升酶的反应活性,其中,催化反应如下式(Ⅲ)所示:
在本申请的一些实施例中,所述反应体系的pH为5.6至7.0,反应温度为26℃至35℃。需要说明的是,催化反应可以在振荡或搅拌的条件下进行,反应时间例如以底物残留量低于5%为准;催化反应结束后,可以依照本领域常见的分离纯化方法提取α-羟基酮类化合物,常见的分离纯化方法包括但不限于是过滤、离心、沉淀或干燥中的至少一种。
下面将对本申请实施例中的技术方案进行清楚、完整地描述。显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如Sambrook等人,分子克隆:实验室手册(New York:Cold Spring Harbor Laboratory Press,1989)中所述的条件,或按照制造厂商所建议的条件。
除非另行定义,文中所使用的所有专业与科学用语与本领域技术人员所熟悉的意义相同。此外,任何与所记载内容相似或均等的方法及材料皆可应用于本申请中。文中所述的较佳实施方法与材料仅作示范之用,但不能限制本申请的内容。
除非另有说明,以下实施例中使用的原料和试剂均为市售商品,或者可以通过本领域已知方法制备。
一、本申请实施例中涉及培养基的说明
(1)LB培养基
每100mL的LB液体培养基中,包括:1.0g的蛋白胨,0.5g的酵母粉,以及1.0g的NaCl;
对于LB固体培养基,是在LB液体培养基配方的基础上添加20g/L的琼脂;
对于含有卡那霉素抗性的LB培养基,卡那霉素的总浓度为50μg/mL。
(2)自诱导培养基
分别称取120g的酵母粉、32.25g的蛋白胨、0.75g的硫酸镁(MgSO4)、16.5g的硫酸铵((NH4)2SO4)、32.5g的磷酸二氢钾(KH2PO4)、35.5g的磷酸氢二钠(Na2HPO4)、2.5g的葡萄糖以及10g的α-乳糖,然后将各个称取好的组分全部加入至磨粉机内,充分研磨至粉状,获得粉末状的自诱导培养基。将50g粉末状的自诱导培养基溶于1L去离子水,充分混匀后调节pH至7.0,然后121℃灭菌30min。
二、本申请实施例中涉及质粒和感受态细胞的说明详见下表2:
表2本申请实施例中涉及质粒和感受态细胞的说明
三、本申请实施例中涉及的基因片段及试剂说明:
本申请实施例中涉及的基因片段,包括引物、如SEQ ID NO:2所示的核苷酸序列等由生工生物工程(上海)股份有限公司合成。
本申请实施例中涉及的限制性内切酶(如:BamH I、Nde I和Dpn I)、T4连接酶、KOD高保真酶试剂盒、胶回收试剂盒、10×T4连接酶缓冲液(Buffer)、双蒸水(ddH2O)等分子试剂均购自宝生物工程(大连)有限公司。
下面结合实施例进一步说明本申请的技术方案和有益效果。
实施例1:构建重组表达载体pET24a-ZM
运用基因挖掘技术,从NCBI数据库挖掘出源自运动发酵单胞菌(Zymomonasmobilis)的丙酮酸脱羧酶,所述丙酮酸脱羧酶的氨基酸序列如SEQ ID NO:1所示,NCBI登录号为WP_014849477.1。依据E.coli密码子偏好性进行密码子优化,以全基因合成的方法合成编码SEQ ID NO:1所示的氨基酸序列的核苷酸序列,该核苷酸序列如SEQ ID NO:2所示。
选择未插入外源基因的pET24a质粒作为载体,pET24a质粒具有酶切位点BamH I和Nde I,重组表达载体pET24a-ZM的构建方法包括如下步骤:
S1.1、在SEQ ID NO:2所示的核苷酸序列的两端分别加入酶切位点BamH I和NdeI,人工合成基因片段后,采用BamH I和Nde I限制性内切酶对合成的基因片段进行双酶切,1%琼脂糖凝胶电泳检测酶切完全后,胶回收目的基因片段,其中,双酶切后回收目的基因片段的操作根据胶回收试剂盒操作说明实施;
S1.2、采用BamH I和Nde I限制性内切酶对pET24a质粒进行双酶切,1%琼脂糖凝胶电泳检测酶切完全后,胶回收载体骨架,其中,双酶切后回收载体骨架的操作根据胶回收试剂盒操作说明实施;
S1.3、将步骤S1.1获得的目的基因片段与步骤S1.2获得的载体骨架相混合,在T4连接酶的作用下16℃连接过夜,然后将连接产物转化至DH5a感受态细胞内,挑取单克隆子测序验证,提取测序正确的重组质粒,获得包含转氨酶编码基因的重组表达载体,命名pET24a-AT,其中,重组体系为20μL,具体是:2μL的10×T4连接酶缓冲液(Buffer)、5μL的目的基因片段、5μL的载体骨架、2μL的T4连接酶以及6μL的双蒸水(ddH2O)。
实施例2构建重组工程菌的定向突变库
采用定点突变策略,以实施例1构建的pET24a-AT作为DNA模板,根据待突变的氨基酸位点利用Oligo7软件来设计点突变引物,通过在上下游突变引物的5’端以插入、替换或缺失碱基的方式引入突变,突变位点如表1所示。需要说明的是,本领域技术人员根据突变位点引入的方式,结合引物设计的基本原则可以获得突变引物的核苷酸序列。以构建W392I,上游突变引物的核苷酸序列如SEQ ID NO:33所示,下游突变引物的核苷酸序列如SEQ ID NO:34所示,共设计获得十五组突变引物对,以分别引入表1所示的十五种突变形式。
选择实施例1中构建的重组表达载体pET24a-AT为模板,分别以十五组突变引物对作为PCR引物,采用KOD高保真酶试剂盒进行反向PCR,从而获得十五种突变序列。其中,反向PCR的反应程序为:95℃预变性3min;98℃变性30s,55℃退火30s,68℃延伸3min,28个循环;72℃延伸5min。
分别使用Dpn I限制性内切酶处理十五种突变序列,酶切产物经T4连接酶连接后转化大肠杆菌BL21(DE3)感受态,随后涂布含卡那霉素的LB抗性平板,置于37℃倒置培养18h,挑选单菌落转接含卡那霉素的LB液体培养基中,挑选培养液送样测序,将测序正确的克隆子保存备用,从而获得以大肠杆菌为宿主的重组工程菌,即获得分别用于表达丙酮酸脱羧酶突变体M1至M15的重组工程菌T1至T15。
实施例3重组工程菌的诱导表达及后处理
将实施例2获得的重组工程菌接种至含50μg/mL卡那霉素的LB液体培养基中,在37℃、180r/min的条件下培养至OD600为0.6~0.8,获得种子菌液。将种子菌液以1%的体积浓度接种至新鲜的含终浓度为50μg/mL卡那霉素的自诱导培养基,置于30℃培养18h,获得培养液。将培养液在25℃、8000r/min的条件下离心10min,弃上清液以收集沉淀物,将沉淀物用pH为7.0的PB缓冲液清洗数遍,收集湿菌体备用。
通过超纯水重悬制得的湿菌体,获得菌体浓度(mg/L)为20%的菌液。采用超声波破碎法或高压均质破碎法处理菌液,破碎条件可依据实际需要自行选择。示例超声波破碎法的工作参数为:破碎1s;暂停2s;在180W的功率下,破碎10min。示例高压均质破碎法的工作参数为:在50HZ和800bar的条件下,破碎两次。
菌液破碎处理后,在4℃、12000r/min的条件下离心10min至15min,以除去细胞碎片和大分子杂质,收集上清液保存于-20℃和4℃以备用,上清液即为含有丙酮酸脱羧酶突变体的酶液。
对比例1
本对比例提供了一种丙酮酸脱羧酶,所述丙酮酸脱羧酶的氨基酸序列如SEQ IDNO:1所示,编码如SEQ ID NO:1所示的氨基酸序列的核苷酸序列如SEQ ID NO:2所示。
将重组表达载体pET24a-ZM转化入大肠杆菌BL21(DE3)感受态,随后涂布含卡那霉素的LB抗性平板,置于37℃倒置培养18h,挑选单菌落转接含卡那霉素的LB液体培养基中,挑选培养液送样测序,将测序正确的克隆子保存备用,从而获得重组工程菌T0。
将重组工程菌T0按照实施例3的方法进行诱导表达及后处理,将获得的酶液保存于-20℃和4℃以备用。
实验例1初步比较丙酮酸脱羧酶突变体M1至M15的催化反应活性和光学选择性
取100mL的烧瓶,向其中依次加入40mL的浓度为0.1mol/L的PB缓冲液(pH为6.5)、10mL的二甲基亚砜(DMSO)、50μL的浓度为50mg/mL焦磷酸硫胺素(TPP)和50μL的浓度为2mol/L的Mg2+缓冲液,温和搅拌均匀,然后加入底物:500μL的苯甲醛和0.6g的丙酮酸钠,待底物充分溶解后使用10%NaOH调整体系pH为6.5,获得母液。
将1mL的母液与5μL的含有单种丙酮酸脱羧酶突变体的酶液(实施例3制得)混合制得测活体系,一共制得十六组测活体系(分别对应为包含丙酮酸脱羧酶突变体M1的测活体系至包含丙酮酸脱羧酶突变体M15的测活体系,以及包含对比例1的丙酮酸脱羧酶的测活体系)。将各组测活体系分别置于30℃、180r/min的摇床反应2h,待反应结束后,利用高效液相色谱(High Performance Liquid Chromatography,HPLC)法对反应液进行检测分析,反应液中目标产物的结构式如下式(Ⅳ)所示:
其中,HPLC的仪器型号为岛津LC-16检测器,HPLC的工作条件如下:
(1)进样液的制备:取500μL的反应液与500μL的对二甲苯混合,充分混匀离心以收集上清液;取10μL的上清液,向其中加入990μL的流动相,震荡混匀后进样,每次进样量10μL;
(2)色谱柱:大赛璐OD-H柱,250*4.6mm,5μm。
(3)流动相的制备:将无水正己烷和无水异丙醇按照正己烷:异丙醇的体积比为9:1混合配制而成。
(4)流速:1mL/min。
(5)分析时间:20min。
(6)柱温:30℃。
根据底物化合物的减少量计算底物转化率(%),底物转化率(%)的计算公式如下式(Ⅲ):
在式(Ⅲ)中,A0为底物峰面积,A1为产物峰面积。
作为示例,图1示出了包含丙酮酸脱羧酶突变体M3的反应体系反应2h获得的反应液的HPLC图谱,以及图2示出了包含丙酮酸脱羧酶突变体M15的反应体系反应2h获得的反应产物的HPLC图谱。
各个丙酮酸脱羧酶突变体(M1至M15)以及对比例1的丙酮酸脱羧酶对底物的转化率,以及对应生成的目标产物的光学纯度详见下表3:
表3丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15以及对比例1的丙酮酸脱羧酶的初步测活数据
由表3可知,丙酮酸脱羧酶突变体M1至M15是在对比例1的丙酮酸脱羧酶的基础上作出一个或多个位点突变而获得的突变体,相较于对比例1的丙酮酸脱羧酶,M1至M15对底物的转化率显著提升,并且对应的反应体系生成的目标产物的光学纯度也明显提高。以丙酮酸脱羧酶突变体M15为例,M15对底物的转化率可达24.32%,其是对比例1的丙酮酸脱羧酶对底物的转化率的12倍;此外,包含M15的反应体系生成的目标产物的光学纯度可达99.8%,其是包含对比例1中丙酮酸脱羧酶的反应体系生成的目标产物的光学纯度的1.1倍。
此外,对于采用单一位点突变方式获得的丙酮酸脱羧酶突变体来说,采用W551D点突变方式获得的丙酮酸脱羧酶突变体的底物转化率最高,其次是M475K;对于多位点组合突变方式获得的丙酮酸脱羧酶突变体来说,采用T7R、L38M、W551D和M475K组合突变方式获得的丙酮酸脱羧酶突变体的综合催化性能最佳。
实验例2复筛比较丙酮酸脱羧酶突变体M1至M15对底物的转化率
提供装有磁力搅拌转子的洁净三口烧瓶,向三口烧瓶中依次加入55g的纯水和6.4g的丙酮酸钠,将盛装有纯水和丙酮酸钠的三口烧瓶放置于30℃的恒温磁力搅拌水浴锅中,开启搅拌,然后向三口烧瓶内依次加入6g的苯甲醛、100μL的浓度为2mol/L的Mg2+缓冲液和100μL的浓度为50mg/mL焦磷酸硫胺素(TPP)以获得混合体系,采用10%(质量/体积,w/v)NaOH调节混合体系的pH至5.95±0.2,接着继续向三口烧瓶内加入7.0mL的含有单种丙酮酸脱羧酶突变体的酶液(实施例3制得)或对比例1的丙酮酸脱羧酶酶液以获得反应体系,采用50%(质量/体积,w/v)乙酸调节反应体系的pH至6.2±0.2。将反应体系置于30℃下恒温反应24h。
待反应结束后,利用HPLC法对反应液中的目标产物(同实验例1)进行检测分析,HPLC的检测方法参照实验例1进行。作为示例,图3和图4分别示出了包含丙酮酸脱羧酶突变体M3的反应体系反应30min和24h获得的反应液的HPLC图谱,图5和图6分别示出了包含丙酮酸脱羧酶突变体M15的反应体系反应30min和24h获得的反应液的HPLC图谱。
各个丙酮酸脱羧酶突变体(M1至M15)以及对比例1的丙酮酸脱羧酶对底物(苯甲醛)的转化率数据详见下表4:
表4丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15以及对比例1的丙酮酸脱羧酶的复筛实验数据
由表4可知,在复筛实验中,丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15对底物的转化率明显优于对比例1的丙酮酸脱羧酶。以丙酮酸脱羧酶突变体M15为例,丙酮酸脱羧酶突变体M15对底物的转化率可达95.2%,是对比例1的丙酮酸脱羧酶对底物转化率的38倍。此外,对于采用单一位点突变方式获得的丙酮酸脱羧酶突变体来说,采用W551D点突变方式获得的丙酮酸脱羧酶突变体的底物转化率最高,其次是M475K;对于多位点组合突变方式获得的丙酮酸脱羧酶突变体来说,采用T7R、L38M、W551D和M475K组合突变方式获得的丙酮酸脱羧酶突变体的底物转化率最高。
实验例3优化工艺条件下比较重组工程菌T1至T15对底物的转化率
提供装有磁力搅拌转子的洁净三口烧瓶,向三口烧瓶中加入55g的纯水和6.4g的丙酮酸钠,将盛装有纯水和丙酮酸钠的三口烧瓶放置于30℃的恒温磁力搅拌水浴锅中,开启搅拌,然后向三口烧瓶内依次加入100μL的浓度为2mol/L的Mg2+缓冲液和100μL的浓度为50mg/mL焦磷酸硫胺素(TPP)以获得混合体系,采用10%(质量/体积,w/v)NaOH调节混合体系的pH至5.95±0.2,接着继续向三口烧瓶内加入7.5mL的含有单种丙酮酸脱羧酶突变体的酶液(实施例3制得)或对比例1的丙酮酸脱羧酶酶液以获得预反应体系,然后使用蠕动泵向预反应体系中缓慢滴加6g的苯甲醛(滴加速度为0.4mL/h)以进行催化反应,在催化反应过程中,采用50%(质量/体积,w/v)乙酸控制整个反应体系的pH为6.2±0.2,30℃恒温反应24h。
待反应结束后,利用HPLC法对反应液中的目标产物(同实验例1)进行检测分析,HPLC的检测方法参照实验例1进行。作为示例,图7和图8分别示出了向包含丙酮酸脱羧酶突变体M3的预反应体系流加苯甲醛1h和流加苯甲醛24h获得的反应液的HPLC图谱,图9和图10分别示出了包含丙酮酸脱羧酶突变体M15的预反应体系流加苯甲醛1h和流加苯甲醛24h获得的反应液的HPLC图谱。
各个丙酮酸脱羧酶突变体(M1至M15)以及对比例1的丙酮酸脱羧酶对底物(苯甲醛)的转化率数据详见下表5:
表5丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15以及对比例1的丙酮酸脱羧酶的优化工艺条件实验数据
由表5可知,在优化工艺条件实验中,丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15对底物的转化率明显优于对比例1的丙酮酸脱羧酶。以丙酮酸脱羧酶突变体M15为例,丙酮酸脱羧酶突变体M15对底物的转化率可达99.7%,是对比例1的丙酮酸脱羧酶对底物转化率的11倍。
实验例4比较丙酮酸脱羧酶突变体M1至M15对底物浓度的耐受性
提供装有磁力搅拌转子的洁净三口烧瓶,向三口烧瓶中依次加入60g的纯水、100μL的浓度为2mol/L的Mg2+缓冲液和100μL的浓度为50mg/mL焦磷酸硫胺素(TPP)以获得混合体系,采用10%(质量/体积,w/v)NaOH调节混合体系的pH至5.95±0.2,接着继续向三口烧瓶内加入7.0mL的含有单种丙酮酸脱羧酶突变体的酶液(实施例3制得)或对比例1的丙酮酸脱羧酶酶液,再加入预设量的苯甲醛以获得预反应体系,每种酶液分别设置四组预反应体系,四组预反应体系的区别之处仅在于:苯甲醛的添加量不同,第一组预反应体系中苯甲醛的添加量为1g(对应反应体系中苯甲醛浓度为1%),第二组预反应体系中苯甲醛的添加量为2g(对应反应体系中苯甲醛浓度为2%),第三组预反应体系中苯甲醛的添加量为4g(对应反应体系中苯甲醛浓度为3%),第四组预反应体系中苯甲醛的添加量为6g(对应反应体系中苯甲醛浓度为6%)。将盛装有预反应体系的三口烧瓶放置于30℃的恒温磁力搅拌水浴锅中,开启搅拌1h,然后向各个预反应体系中分别加入6.4g的丙酮酸钠以对应获得反应体系,采用50%(质量/体积,w/v)乙酸调节反应体系的pH至6.2±0.2。将反应体系置于30℃下恒温反应2h。
待反应结束后,利用HPLC法对反应液中的目标产物(同实验例1)进行检测分析,HPLC的检测方法参照实验例1进行。各个丙酮酸脱羧酶突变体(M1至M15)以及对比例1的丙酮酸脱羧酶在不同的底物(苯甲醛)浓度下对底物(苯甲醛)的转化率数据详见下表6:
表6丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15以及对比例1的丙酮酸脱羧酶的耐受性实验数据
由表6可知,随着反应体系中底物浓度的逐渐升高,丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15以及对比例1的丙酮酸脱羧酶对底物(苯甲醛)的转化率均呈下降趋势,但在相同的底物浓度下,丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15对底物的转化率明显高于对比例1的丙酮酸脱羧酶对底物的转化率,说明:丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15对底物的耐受性能均优于对比例1的丙酮酸脱羧酶对底物的耐受性能。
当反应体系中底物浓度由1%升高至6%时,对应丙酮酸脱羧酶突变体M15对底物的转化率下降69%,而对比例1的丙酮酸脱羧酶对底物的转化率下降80%,说明丙酮酸脱羧酶突变体M15对底物的耐受性显著优于对比例1的丙酮酸脱羧酶,并且当反应体系中底物浓度为6%时,丙酮酸脱羧酶突变体M15对底物的转化率是对比例1的丙酮酸脱羧酶对底物的转化率的60倍。
实验例5比较丙酮酸脱羧酶突变体M1至M15的热稳定性
提供装有磁力搅拌转子的洁净三口烧瓶,向三口烧瓶中依次加入55g的纯水和6.4g的丙酮酸钠,将盛装有纯水和丙酮酸钠的三口烧瓶放置于30℃的恒温磁力搅拌水浴锅中,开启搅拌,然后向三口烧瓶内依次加入6g的苯甲醛、100μL的浓度为2mol/L的Mg2+缓冲液和100μL的浓度为50mg/mL焦磷酸硫胺素(TPP)以获得混合体系,采用10%(质量/体积,w/v)NaOH调节混合体系的pH至5.95±0.2,接着继续向三口烧瓶内加入7.0mL的含有单种丙酮酸脱羧酶突变体的酶液(实施例3制得)或对比例1的丙酮酸脱羧酶酶液以获得反应体系,每种酶液设置分别设置两组反应体系,两组反应体系的区别之处仅在于:一组加入的酶液未经热处理,另一组加入的酶液是经过50℃恒温热处理60min之后的酶液。采用50%(质量/体积,w/v)乙酸调节各个反应体系的pH至6.2±0.2,将反应体系置于30℃下恒温反应2h。
待反应结束后,利用HPLC法对反应液中的目标产物(同实验例1)进行检测分析,HPLC的检测方法参照实验例1进行。各个丙酮酸脱羧酶突变体(M1至M15)以及对比例1的丙酮酸脱羧酶未经热处理对底物(苯甲醛)的转化率和热处理后对底物(苯甲醛)的转化率详见下表7:
表7丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15以及对比例1的丙酮酸脱羧酶的热稳定性实验数据
由表7可知,丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15的热稳定性能明显优于对比例1的丙酮酸脱羧酶的热稳定性能。对于丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15,虽然相较于未经热处理的底物转化率,热处理后对底物的转化率有所下降,但是下降幅度较小,以丙酮酸脱羧酶突变体M15为例,热处理后对底物的转化率仅下降6.3%,而对比例1的丙酮酸脱羧酶经热处理后基本丧失对底物的催化活性。
由实验例1至实施例5可知,丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15的综合性能明显优于对比例1的丙酮酸脱羧酶,相较于对比例1的丙酮酸脱羧酶,丙酮酸脱羧酶突变体M1至丙酮酸脱羧酶突变体M15具有催化活性更高、光学选择性更好、稳定性更佳的优点。
以上对本申请所提供的一种丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用,进行了详细介绍。本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的技术方案及其核心思想;本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请实施例的技术方案的范围。
序列表
<110> 杭州酶易生物技术有限公司
赤峰艾克制药科技股份有限公司
<120> 丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用
<130> SUP220088CN
<141> 2022-04-07
<160> 34
<170> SIPOSequenceListing 1.0
<210> 1
<211> 568
<212> PRT
<213> Zymomonas mobilis
<400> 1
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 2
<211> 1707
<212> DNA
<213> Zymomonas mobilis
<400> 2
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 3
<211> 568
<212> PRT
<213> 人工序列
<400> 3
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Ile Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 4
<211> 1707
<212> DNA
<213> 人工序列
<400> 4
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tctatcttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 5
<211> 568
<212> PRT
<213> 人工序列
<400> 5
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Met Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 6
<211> 1707
<212> DNA
<213> 人工序列
<400> 6
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatggaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 7
<211> 568
<212> PRT
<213> 人工序列
<400> 7
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 8
<211> 1707
<212> DNA
<213> 人工序列
<400> 8
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 9
<211> 568
<212> PRT
<213> 人工序列
<400> 9
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Lys Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 10
<211> 1707
<212> DNA
<213> 人工序列
<400> 10
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccacta agtggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 11
<211> 568
<212> PRT
<213> 人工序列
<400> 11
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Leu His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 12
<211> 1707
<212> DNA
<213> 人工序列
<400> 12
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgctgca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 13
<211> 568
<212> PRT
<213> 人工序列
<400> 13
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Pro Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 14
<211> 1707
<212> DNA
<213> 人工序列
<400> 14
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gccctgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 15
<211> 568
<212> PRT
<213> 人工序列
<400> 15
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 16
<211> 1707
<212> DNA
<213> 人工序列
<400> 16
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 17
<211> 568
<212> PRT
<213> 人工序列
<400> 17
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 18
<211> 1707
<212> DNA
<213> 人工序列
<400> 18
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 19
<211> 568
<212> PRT
<213> 人工序列
<400> 19
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Ala Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Tyr Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 20
<211> 1707
<212> DNA
<213> 人工序列
<400> 20
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tctgcattca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attgtataaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 21
<211> 568
<212> PRT
<213> 人工序列
<400> 21
Met Ser Tyr Thr Val Gly Thr Trp Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Met Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 22
<211> 1707
<212> DNA
<213> 人工序列
<400> 22
atgagttata ctgtcggtac ctggttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctatgaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 23
<211> 568
<212> PRT
<213> 人工序列
<400> 23
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Tyr Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Glu Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 24
<211> 1707
<212> DNA
<213> 人工序列
<400> 24
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgctatact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagaa gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 25
<211> 568
<212> PRT
<213> 人工序列
<400> 25
Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Met Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 26
<211> 1707
<212> DNA
<213> 人工序列
<400> 26
atgagttata ctgtcggtac ctatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tatgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 27
<211> 568
<212> PRT
<213> 人工序列
<400> 27
Met Ser Tyr Thr Val Gly Arg Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Met Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gly Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 28
<211> 1707
<212> DNA
<213> 人工序列
<400> 28
atgagttata ctgtcggtcg atatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tatgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctggtatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttatgatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 29
<211> 568
<212> PRT
<213> 人工序列
<400> 29
Met Ser Tyr Thr Val Gly Arg Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Trp Gly Tyr Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 30
<211> 1707
<212> DNA
<213> 人工序列
<400> 30
atgagttata ctgtcggtcg atatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tttgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa tggggttatc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 31
<211> 568
<212> PRT
<213> 人工序列
<400> 31
Met Ser Tyr Thr Val Gly Arg Tyr Leu Ala Glu Arg Leu Val Gln Ile
1 5 10 15
Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30
Leu Asp Asn Leu Leu Met Asn Lys Asn Met Glu Gln Val Tyr Cys Cys
35 40 45
Asn Glu Leu Asn Cys Gly Phe Ser Ala Glu Gly Tyr Ala Arg Ala Lys
50 55 60
Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly Ala Leu Ser Ala
65 70 75 80
Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95
Ile Ser Gly Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu
100 105 110
His His Ala Leu Gly Lys Thr Asp Tyr His Tyr Gln Leu Glu Met Ala
115 120 125
Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro Glu Glu Ala
130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys
145 150 155 160
Pro Val Tyr Leu Glu Ile Ala Cys Asn Thr Ala Ser Met Pro Cys Ala
165 170 175
Ala Pro Gly Pro Ala Ser Ala Leu Phe Asn Asp Glu Ala Ser Asp Glu
180 185 190
Ala Ser Leu Asn Ala Ala Val Asp Glu Thr Leu Lys Phe Ile Ala Asn
195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly
210 215 220
Ala Glu Glu Ala Ala Val Lys Phe Thr Asp Ala Leu Gly Gly Ala Val
225 230 235 240
Ala Thr Met Ala Ala Ala Lys Ser Phe Phe Pro Glu Glu Asn Ala Asn
245 250 255
Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu Lys
260 265 270
Thr Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn
275 280 285
Asp Tyr Ser Thr Thr Gly Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu
290 295 300
Val Leu Ala Glu Pro Arg Ser Val Val Val Asn Gly Ile Arg Phe Pro
305 310 315 320
Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335
Lys Lys Thr Gly Ser Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu
340 345 350
Leu Lys Lys Ala Ala Pro Ala Asp Pro Ser Ala Pro Leu Val Asn Ala
355 360 365
Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro Asn Thr Thr Val
370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu
385 390 395 400
Pro Asn Gly Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly
405 410 415
Trp Ser Val Pro Ala Ala Phe Gly Tyr Ala Val Gly Ala Pro Glu Arg
420 425 430
Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu Thr Ala Gln
435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu
450 455 460
Ile Asn Asn Tyr Gly Tyr Thr Ile Glu Val Lys Ile His Asp Gly Pro
465 470 475 480
Tyr Asn Asn Ile Lys Asn Trp Asp Tyr Ala Gly Leu Met Glu Val Phe
485 490 495
Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Ala Lys Gly Leu Lys Ala
500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525
Thr Asp Gly Pro Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys
530 535 540
Thr Glu Glu Leu Val Lys Asp Gly Lys Arg Val Ala Ala Ala Asn Ser
545 550 555 560
Arg Lys Pro Val Asn Lys Leu Leu
565
<210> 32
<211> 1707
<212> DNA
<213> 人工序列
<400> 32
atgagttata ctgtcggtcg atatttagcg gagcggcttg tccagattgg tctcaagcat 60
cacttcgcag tcgcgggcga ctacaacctc gtccttcttg acaacctgct tatgaacaaa 120
aacatggagc aggtttattg ctgtaacgaa ctgaactgcg gtttcagtgc agaaggttat 180
gctcgtgcca aaggcgcagc agcagccgtc gttacctaca gcgttggtgc gctttccgca 240
tttgatgcta tcggtggcgc ctatgcagaa aaccttccgg ttatcctgat ctccggtgct 300
ccgaacaaca acgaccacgc tgctggtcat gtgttgcatc atgctcttgg caaaaccgac 360
tatcactatc agttggaaat ggccaagaac atcacggccg ccgctgaagc gatttacacc 420
ccggaagaag ctccggctaa aatcgatcac gtgatcaaaa ctgctcttcg cgagaagaag 480
ccggtttatc tcgaaatcgc ttgcaacact gcttccatgc cctgcgccgc tcctggaccg 540
gcaagtgcat tgttcaatga cgaagccagc gacgaagcat ccttgaatgc agcggttgac 600
gaaaccctga aattcatcgc caaccgcgac aaagttgccg tcctcgtcgg cagcaagctg 660
cgcgctgctg gtgctgaaga agctgctgtt aaattcaccg acgctttggg cggtgcagtg 720
gctactatgg ctgctgccaa gagcttcttc ccagaagaaa atgccaatta cattggtacc 780
tcatggggcg aagtcagcta tccgggcgtt gaaaagacga tgaaagaagc cgatgcggtt 840
atcgctctgg ctcctgtctt caacgactac tccaccactg gttggacgga tatccctgat 900
cctaagaaac tggttctcgc tgaaccgcgt tctgtcgttg tcaacggcat tcgcttcccc 960
agcgttcatc tgaaagacta tctgacccgt ttggctcaga aagtttccaa gaaaaccggt 1020
tctttggact tcttcaaatc cctcaatgca ggtgaactga agaaagccgc tccggctgat 1080
ccgagtgctc cgttggtcaa cgcagaaatc gcccgtcagg tcgaagctct tctgaccccg 1140
aacacgacgg ttattgctga aaccggtgac tcttggttca atgctcagcg catgaagctc 1200
ccgaacggtg ctcgcgttga atatgaaatg cagtggggtc acattggttg gtccgttcct 1260
gccgccttcg gttatgccgt cggtgctccg gaacgtcgca acatcctcat ggttggtgat 1320
ggttccttcc agctgacggc tcaggaagtt gctcagatgg ttcgcctgaa actgccggtt 1380
atcatcttct tgatcaataa ctatggttac accatcgaag ttaagatcca tgatggtccg 1440
tacaacaaca tcaagaactg ggattatgcc ggtctgatgg aagtgttcaa cggtaacggt 1500
ggttatgaca gcggtgctgc taaaggcctg aaggctaaaa ccggtggcga actggcagaa 1560
gctatcaagg ttgctctggc aaacaccgac ggcccaaccc tgatcgaatg cttcatcggt 1620
cgtgaagact gcactgaaga attggtcaaa gatggtaagc gcgttgctgc cgccaacagc 1680
cgtaagcctg ttaacaagct cctctag 1707
<210> 33
<211> 30
<212> DNA
<213> 人工序列
<400> 33
attttcaatg ctcagcgcat gaagctcccg 30
<210> 34
<211> 30
<212> DNA
<213> 人工序列
<400> 34
agagtcaccg gtttcagcaa taaccgtcgt 30
Claims (10)
1.一种丙酮酸脱羧酶突变体,其特征在于,所述丙酮酸脱羧酶突变体的氨基酸序列是由SEQ ID NO:1所示的氨基酸序列发生一个或多个点突变而获得的氨基酸序列;所述丙酮酸脱羧酶突变体具有丙酮酸脱羧酶活性;
其中,发生所述点突变的方式为以下任意一种:
(1) W392I;
(2) I472M;
(3) W551D;
(4) G294K;
(5) I476L;
(6)V555P;
(7)M475K;
(8)W551D和M475K;
(9)V549Y和W392A;
(10) T8W和A246M;
(11) N169Y和V450E;
(12) M475K、L38M和W551D;
(13)T7R、L38M和Q452G;
(14) T7R、K553Y和M475K;或
(15) T7R、L38M、W551D和M475K。
2.根据权利要求1所述的丙酮酸脱羧酶突变体,其特征在于,所述丙酮酸脱羧酶突变体的氨基酸序列选自如SEQ ID NO:3、SEQ ID NO:5、SEQ ID NO:7、SEQ ID NO:9、SEQ ID NO:11、SEQ ID NO:13、SEQ ID NO:15、SEQ ID NO:17、SEQ ID NO:19、SEQ ID NO:21、SEQ IDNO:23、SEQ ID NO:25、SEQ ID NO:27、SEQ ID NO:29或SEQ ID NO:31任一所示的氨基酸序列。
3.一种核酸分子,其特征在于,所述核酸分子编码如权利要求1或2中所述的丙酮酸脱羧酶突变体。
4.根据权利要求3所述的核酸分子,其特征在于,所述核苷酸序列选自如SEQ ID NO:4、SEQ ID NO:6、SEQ ID NO:8、SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18、SEQ ID NO:20、SEQ ID NO:22、SEQ ID NO:24、SEQ ID NO:26、SEQ ID NO:28、SEQ ID NO:30或SEQ ID NO:32任一所示的核苷酸序列。
5.一种重组表达载体,其特征在于,所述重组表达载体包括载体,以及如权利要求3或4所述的核酸分子;所述载体选自质粒、粘粒或病毒载体。
6.根据权利要求5所述的重组表达载体,其特征在于,所述载体选自噬菌体。
7.一种重组表达转化体,其特征在于,所述重组表达转化体包括宿主,以及引入至所述宿主体内的如权利要求3或4所述的核酸分子、或如权利要求5或6所述的重组表达载体;所述宿主选自真核生物或原核生物。
8.一种丙酮酸脱羧酶突变体的制备方法,其特征在于,通过培养如权利要求7所述的重组表达转化体,以及从培养物中获得所述的丙酮酸脱羧酶突变体。
9.一种手性α-羟基酮类化合物的制备方法,其特征在于,以如权利要求1或2中所述的丙酮酸脱羧酶突变体或如权利要求8所述的制备方法制得的丙酮酸脱羧酶突变体作为催化剂,第一化合物和第二化合物接触反应生成α-羟基酮类化合物;
其中,第一化合物具有下面通式(Ⅰ)所示的结构:
(Ⅰ)
在通式(Ⅰ),R1选自氢原子或羟基,X选自氢原子或一价金属离子;
第二化合物具有下面通式(Ⅱ)所示的结构:
(Ⅱ)
在通式(Ⅱ)中,R2选自芳基或杂芳基。
10.根据权利要求9所述的制备方法,其特征在于,所述第一化合物选自丙酮酸钠,所述第二化合物选自苯甲醛。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210361627.4A CN116790569B (zh) | 2022-04-07 | 2022-04-07 | 丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210361627.4A CN116790569B (zh) | 2022-04-07 | 2022-04-07 | 丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116790569A CN116790569A (zh) | 2023-09-22 |
CN116790569B true CN116790569B (zh) | 2024-04-26 |
Family
ID=88038428
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210361627.4A Active CN116790569B (zh) | 2022-04-07 | 2022-04-07 | 丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116790569B (zh) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102994390A (zh) * | 2011-09-19 | 2013-03-27 | 浙江齐成碳能科技有限公司 | 一种利用点突变进行构建的基因工程能源微生物 |
CN106520716A (zh) * | 2016-10-28 | 2017-03-22 | 杭州酶易生物技术有限公司 | 一种嗜热酮还原酶突变体及其应用 |
CN107312766A (zh) * | 2017-08-07 | 2017-11-03 | 上海凌凯医药科技有限公司 | 一种酶活提高的丙酮酸脱羧酶突变体 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BR112012025948A2 (pt) * | 2010-04-13 | 2015-10-06 | Embio Ltd | um processo recombinante para a produção de r-aromático e hidroxicetonas. |
WO2015137418A1 (ja) * | 2014-03-11 | 2015-09-17 | 味の素株式会社 | 熱安定性向上リジン脱炭酸酵素変異体を用いる1,5-ペンタジアミンの製造方法 |
-
2022
- 2022-04-07 CN CN202210361627.4A patent/CN116790569B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102994390A (zh) * | 2011-09-19 | 2013-03-27 | 浙江齐成碳能科技有限公司 | 一种利用点突变进行构建的基因工程能源微生物 |
CN106520716A (zh) * | 2016-10-28 | 2017-03-22 | 杭州酶易生物技术有限公司 | 一种嗜热酮还原酶突变体及其应用 |
CN107312766A (zh) * | 2017-08-07 | 2017-11-03 | 上海凌凯医药科技有限公司 | 一种酶活提高的丙酮酸脱羧酶突变体 |
Non-Patent Citations (3)
Title |
---|
The replacement of Trp392 by alanine influences the decarboxylase/carboligase activity and stability of pyruvate decarboxylase from Zymomonas mobilis;Heike BRUHN et al.;《Eur. J. Biochern.》;第234卷;650-655 * |
Translocation of Zymomonas mobilis pyruvate decarboxylase to periplasmic compartment for production of acetaldehyde outside the cytosol;Elina Balodite et al.;《MicrobiologyOpen.》;第8卷(第e809期);1-6 * |
丙酮酸脱羧酶及其应用研究;朱碧云等;《生命科学》;第22卷(第11期);1184-1191 * |
Also Published As
Publication number | Publication date |
---|---|
CN116790569A (zh) | 2023-09-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111269900B (zh) | 一种l-氨基酸脱氨酶突变体的制备及其应用 | |
CN109777763B (zh) | 一株用于l-茶氨酸生产的基因工程菌及其构建与应用 | |
CN111321129B (zh) | 工程化酮还原酶多肽及其应用 | |
CN112143764B (zh) | 一种生物酶催化制备布瓦西坦中间体化合物的方法 | |
CN109468291B (zh) | 一种羰基还原酶EbSDR8突变体及其构建方法和应用 | |
CN111808829B (zh) | 一种γ-谷氨酰甲胺合成酶突变体及其应用 | |
CN113088501A (zh) | 一种用于生产l-草铵膦的谷氨酸脱氢酶突变体及l-草铵膦生产方法 | |
CN113832120B (zh) | 甲醛转化突变蛋白及其应用 | |
CN111454918B (zh) | 一种烯醇还原酶突变体及其在制备(r)-香茅醛中的应用 | |
CN116790569B (zh) | 丙酮酸脱羧酶突变体及其在制备α-羟基酮类化合物中的应用 | |
CN113652408A (zh) | 羰基还原酶突变体及其在(r)-4-氯-3-羟基丁酸乙酯合成中的应用 | |
CN118291419A (zh) | 一种热稳定转氨酶的突变体及其应用 | |
CN109182286B (zh) | 一种改进的氰基还原酶及其在合成3-氯吡嗪-2甲胺中的应用 | |
CN114277022B (zh) | 一种高活性和高热稳定性的腈水合酶突变体 | |
CN105593368B (zh) | 2,3-丁二醇的生成能力得到增加的重组微生物及利用其的2,3-丁二醇的生产方法 | |
CN115975964A (zh) | 一种高活性酮基泛解酸内酯还原酶突变体及其编码基因和应用 | |
CN110938608A (zh) | 醛酮还原酶突变体、编码基因及其在合成(s)-tcpe中的应用 | |
JP2009089649A (ja) | クロストリジウム・クルベリのジアホラーゼ遺伝子およびその利用 | |
CN115011569B (zh) | 一种老黄酶NemR-PS突变体及其在制备(S)-香茅醇中的应用 | |
CN114752574B (zh) | 一种酶催化体系、加氢酶及制备方法和应用 | |
CN112625993B (zh) | 微生物转化法制备α-酮戊二酸 | |
CN110004119B (zh) | ε-酮酯还原酶突变体及其催化合成(R)-α-硫辛酸前体的应用 | |
CN112481229B (zh) | 一种ω转氨酶及其突变体、重组质粒、基因工程菌及其应用 | |
CN109370997B (zh) | 一种苯丙氨酸氨基变位酶突变体 | |
CN109337891B (zh) | 一种热稳定性提高的苯丙氨酸氨基变位酶突变体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |