CN113583983A - 一种融合蛋白或其变体及其在制备骨化二醇中的应用 - Google Patents
一种融合蛋白或其变体及其在制备骨化二醇中的应用 Download PDFInfo
- Publication number
- CN113583983A CN113583983A CN202010369514.XA CN202010369514A CN113583983A CN 113583983 A CN113583983 A CN 113583983A CN 202010369514 A CN202010369514 A CN 202010369514A CN 113583983 A CN113583983 A CN 113583983A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- asp
- arg
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 102000037865 fusion proteins Human genes 0.000 title claims abstract description 55
- 108020001507 fusion proteins Proteins 0.000 title claims abstract description 55
- JWUBBDSIWDLEOM-UHFFFAOYSA-N 25-Hydroxycholecalciferol Natural products C1CCC2(C)C(C(CCCC(C)(C)O)C)CCC2C1=CC=C1CC(O)CCC1=C JWUBBDSIWDLEOM-UHFFFAOYSA-N 0.000 title claims abstract description 32
- 235000021318 Calcifediol Nutrition 0.000 title claims abstract description 26
- JWUBBDSIWDLEOM-DTOXIADCSA-N calcidiol Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@@H](CCCC(C)(C)O)C)=C\C=C1\C[C@@H](O)CCC1=C JWUBBDSIWDLEOM-DTOXIADCSA-N 0.000 title claims abstract description 26
- 229960004361 calcifediol Drugs 0.000 title claims abstract description 26
- 238000002360 preparation method Methods 0.000 title claims abstract description 11
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 34
- 150000001413 amino acids Chemical class 0.000 claims abstract description 32
- 238000000034 method Methods 0.000 claims abstract description 19
- 238000004519 manufacturing process Methods 0.000 claims abstract description 6
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 6
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 claims description 18
- 238000006243 chemical reaction Methods 0.000 claims description 18
- 230000004927 fusion Effects 0.000 claims description 15
- 108010050375 Glucose 1-Dehydrogenase Proteins 0.000 claims description 12
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 claims description 12
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 claims description 12
- QYSXJUFSXHHAJI-YRZJJWOYSA-N vitamin D3 Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C\C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-YRZJJWOYSA-N 0.000 claims description 12
- QYSXJUFSXHHAJI-XFEUOLMDSA-N Vitamin D3 Natural products C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)CCCC(C)C)=C/C=C1\C[C@@H](O)CCC1=C QYSXJUFSXHHAJI-XFEUOLMDSA-N 0.000 claims description 11
- 235000005282 vitamin D3 Nutrition 0.000 claims description 11
- 239000011647 vitamin D3 Substances 0.000 claims description 11
- 229940021056 vitamin d3 Drugs 0.000 claims description 11
- 101710088194 Dehydrogenase Proteins 0.000 claims description 10
- 239000013604 expression vector Substances 0.000 claims description 10
- 239000000852 hydrogen donor Substances 0.000 claims description 10
- 238000003259 recombinant expression Methods 0.000 claims description 10
- XJLXINKUBYWONI-DQQFMEOOSA-N [[(2r,3r,4r,5r)-5-(6-aminopurin-9-yl)-3-hydroxy-4-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl] [(2s,3r,4s,5s)-5-(3-carbamoylpyridin-1-ium-1-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphate Chemical compound NC(=O)C1=CC=C[N+]([C@@H]2[C@H]([C@@H](O)[C@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-DQQFMEOOSA-N 0.000 claims description 8
- 239000002773 nucleotide Substances 0.000 claims description 8
- 125000003729 nucleotide group Chemical group 0.000 claims description 8
- 241000588724 Escherichia coli Species 0.000 claims description 7
- 108010006519 Molecular Chaperones Proteins 0.000 claims description 7
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 claims description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 claims description 6
- 238000003780 insertion Methods 0.000 claims description 6
- 230000037431 insertion Effects 0.000 claims description 6
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 claims description 6
- 239000013598 vector Substances 0.000 claims description 6
- 239000005515 coenzyme Substances 0.000 claims description 5
- 238000012217 deletion Methods 0.000 claims description 5
- 230000037430 deletion Effects 0.000 claims description 5
- 102000007698 Alcohol dehydrogenase Human genes 0.000 claims description 4
- 108010021809 Alcohol dehydrogenase Proteins 0.000 claims description 4
- BDAGIHXWWSANSR-UHFFFAOYSA-M Formate Chemical compound [O-]C=O BDAGIHXWWSANSR-UHFFFAOYSA-M 0.000 claims description 4
- 108090000698 Formate Dehydrogenases Proteins 0.000 claims description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 4
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 claims description 4
- 239000007810 chemical reaction solvent Substances 0.000 claims description 4
- 239000006184 cosolvent Substances 0.000 claims description 4
- 239000008103 glucose Substances 0.000 claims description 4
- 125000002791 glucosyl group Chemical group C1([C@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims description 4
- 238000005805 hydroxylation reaction Methods 0.000 claims description 4
- ODLHGICHYURWBS-LKONHMLTSA-N trappsol cyclo Chemical compound CC(O)COC[C@H]([C@H]([C@@H]([C@H]1O)O)O[C@H]2O[C@@H]([C@@H](O[C@H]3O[C@H](COCC(C)O)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](COCC(C)O)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](COCC(C)O)[C@H]([C@@H]([C@H]3O)O)O[C@H]3O[C@H](COCC(C)O)[C@H]([C@@H]([C@H]3O)O)O3)[C@H](O)[C@H]2O)COCC(O)C)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O)[C@@H]3O[C@@H]1COCC(C)O ODLHGICHYURWBS-LKONHMLTSA-N 0.000 claims description 4
- 229920000858 Cyclodextrin Polymers 0.000 claims description 3
- 238000006722 reduction reaction Methods 0.000 claims description 3
- GSNUFIFRDBKVIE-UHFFFAOYSA-N DMF Natural products CC1=CC=C(C)O1 GSNUFIFRDBKVIE-UHFFFAOYSA-N 0.000 claims description 2
- 229920004890 Triton X-100 Polymers 0.000 claims description 2
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 claims description 2
- 229920000053 polysorbate 80 Polymers 0.000 claims description 2
- HFHDHCJBZVLPGP-UHFFFAOYSA-N schardinger α-dextrin Chemical compound O1C(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC(C(O)C2O)C(CO)OC2OC(C(C2O)O)C(CO)OC2OC2C(O)C(O)C1OC2CO HFHDHCJBZVLPGP-UHFFFAOYSA-N 0.000 claims description 2
- 238000012216 screening Methods 0.000 claims description 2
- 230000027756 respiratory electron transport chain Effects 0.000 abstract description 17
- 102000004169 proteins and genes Human genes 0.000 abstract description 16
- 238000006555 catalytic reaction Methods 0.000 abstract description 7
- JWUBBDSIWDLEOM-DCHLRESJSA-N 25-Hydroxyvitamin D3 Natural products C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@@H](CCCC(C)(C)O)C)=C/C=C1\C[C@@H](O)CCC1=C JWUBBDSIWDLEOM-DCHLRESJSA-N 0.000 abstract description 6
- JWUBBDSIWDLEOM-NQZHSCJISA-N 25-hydroxy-3 epi cholecalciferol Chemical compound C1([C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@@H](CCCC(C)(C)O)C)=CC=C1C[C@H](O)CCC1=C JWUBBDSIWDLEOM-NQZHSCJISA-N 0.000 abstract description 6
- 238000009776 industrial production Methods 0.000 abstract description 3
- 108010050848 glycylleucine Proteins 0.000 description 24
- 239000013612 plasmid Substances 0.000 description 19
- 108010053725 prolylvaline Proteins 0.000 description 19
- 101150053185 P450 gene Proteins 0.000 description 18
- 108010036413 histidylglycine Proteins 0.000 description 17
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 16
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 16
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 16
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 16
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 15
- 102000004190 Enzymes Human genes 0.000 description 15
- 108090000790 Enzymes Proteins 0.000 description 15
- 108010005233 alanylglutamic acid Proteins 0.000 description 15
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 14
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 14
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 13
- 241000894006 Bacteria Species 0.000 description 13
- 108010085203 methionylmethionine Proteins 0.000 description 13
- 239000000047 product Substances 0.000 description 13
- 102000002004 Cytochrome P-450 Enzyme System Human genes 0.000 description 12
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 12
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 11
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 11
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 11
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 10
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 10
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 10
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 10
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 10
- 108010070944 alanylhistidine Proteins 0.000 description 10
- 108010062796 arginyllysine Proteins 0.000 description 10
- 108010016616 cysteinylglycine Proteins 0.000 description 10
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 10
- 108010092114 histidylphenylalanine Proteins 0.000 description 10
- 108010027338 isoleucylcysteine Proteins 0.000 description 10
- 108010070643 prolylglutamic acid Proteins 0.000 description 10
- FOWHQTWRLFTELJ-FXQIFTODSA-N Ala-Asp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N FOWHQTWRLFTELJ-FXQIFTODSA-N 0.000 description 9
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 9
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 9
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 9
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 9
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 9
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 9
- BOESUSAIMQGVJD-RYQLBKOJSA-N Trp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BOESUSAIMQGVJD-RYQLBKOJSA-N 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 9
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 9
- 108010092854 aspartyllysine Proteins 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- ZEIYPKQQLSUPOT-QORCZRPOSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-phenylpropanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 ZEIYPKQQLSUPOT-QORCZRPOSA-N 0.000 description 8
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 8
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 8
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 8
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 8
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 8
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 8
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 8
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 8
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 8
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 8
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 8
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 8
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 8
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 8
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 8
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 8
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 8
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 8
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 8
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 8
- LTXLIIZACMCQTO-GUBZILKMSA-N Gln-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N LTXLIIZACMCQTO-GUBZILKMSA-N 0.000 description 8
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 8
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 8
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 8
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 8
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 8
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 8
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 8
- BNMRSWQOHIQTFL-JSGCOSHPSA-N Gly-Val-Phe Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 BNMRSWQOHIQTFL-JSGCOSHPSA-N 0.000 description 8
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 8
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 8
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 8
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 8
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 8
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 8
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 8
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 8
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 8
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 8
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 8
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 8
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 8
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 8
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 8
- DNDVVILEHVMWIS-LPEHRKFASA-N Met-Asp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DNDVVILEHVMWIS-LPEHRKFASA-N 0.000 description 8
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 8
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 8
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 8
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 8
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 8
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 8
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 8
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 8
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 8
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 8
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 8
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 8
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 8
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 8
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 8
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 8
- XIHGJKFSIDTDKV-LYARXQMPSA-N Thr-Phe-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XIHGJKFSIDTDKV-LYARXQMPSA-N 0.000 description 8
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 8
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 8
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 8
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 8
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 8
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 8
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 8
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 8
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 8
- 108010066829 alanyl-glutamyl-aspartylprolyine Proteins 0.000 description 8
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 8
- 108010087924 alanylproline Proteins 0.000 description 8
- 210000004027 cell Anatomy 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- 108010091871 leucylmethionine Proteins 0.000 description 8
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 8
- 108010047079 phenylalanyl-leucyl-arginyl-phenylalanine Proteins 0.000 description 8
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 8
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 239000007788 liquid Substances 0.000 description 7
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 6
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 6
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 6
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 6
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 6
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 6
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 6
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 5
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 5
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 5
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 5
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 5
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 5
- MTANSHNQTWPZKP-KKUMJFAQSA-N Arg-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O MTANSHNQTWPZKP-KKUMJFAQSA-N 0.000 description 5
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 5
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 5
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 5
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 5
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 5
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 5
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 5
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 5
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 5
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 5
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 5
- 108091005804 Peptidases Proteins 0.000 description 5
- OCSACVPBMIYNJE-GUBZILKMSA-N Pro-Arg-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O OCSACVPBMIYNJE-GUBZILKMSA-N 0.000 description 5
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 5
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 5
- 239000004365 Protease Substances 0.000 description 5
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 5
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 230000003197 catalytic effect Effects 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 108010031719 prolyl-serine Proteins 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 4
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 4
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 4
- OCDJOVKIUJVUMO-SRVKXCTJSA-N Arg-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N OCDJOVKIUJVUMO-SRVKXCTJSA-N 0.000 description 4
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 4
- OMKZPCPZEFMBIT-SRVKXCTJSA-N Arg-Met-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OMKZPCPZEFMBIT-SRVKXCTJSA-N 0.000 description 4
- ANRZCQXIXGDXLR-CWRNSKLLSA-N Asn-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)N)N)C(=O)O ANRZCQXIXGDXLR-CWRNSKLLSA-N 0.000 description 4
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 4
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 4
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 4
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 4
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 4
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 4
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 4
- YKUAGFAXQRYUQW-KKUMJFAQSA-N His-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O YKUAGFAXQRYUQW-KKUMJFAQSA-N 0.000 description 4
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 4
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 4
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 4
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 4
- WSPQHZOMTFFWGH-XGEHTFHBSA-N Met-Thr-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(O)=O WSPQHZOMTFFWGH-XGEHTFHBSA-N 0.000 description 4
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 4
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 4
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 4
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 4
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 4
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 4
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 4
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 4
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 4
- DZKFGCNKEVMXFA-JUKXBJQTSA-N Tyr-Ile-His Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O DZKFGCNKEVMXFA-JUKXBJQTSA-N 0.000 description 4
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 4
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 4
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 4
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 4
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 108010004073 cysteinylcysteine Proteins 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010034529 leucyl-lysine Proteins 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 239000000758 substrate Substances 0.000 description 4
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 3
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 3
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 3
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 3
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 3
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 3
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 3
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 3
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 3
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 3
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 3
- 241000194107 Bacillus megaterium Species 0.000 description 3
- SKSJPIBFNFPTJB-NKWVEPMBSA-N Cys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CS)N)C(=O)O SKSJPIBFNFPTJB-NKWVEPMBSA-N 0.000 description 3
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 3
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 3
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 3
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 3
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 3
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 3
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 3
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 3
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 3
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 3
- JSQIXEHORHLQEE-MEYUZBJRSA-N His-Phe-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JSQIXEHORHLQEE-MEYUZBJRSA-N 0.000 description 3
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 3
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 3
- 239000012880 LB liquid culture medium Substances 0.000 description 3
- 241000880493 Leptailurus serval Species 0.000 description 3
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 3
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 3
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 3
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 3
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 3
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 3
- BVXXDMUMHMXFER-BPNCWPANSA-N Met-Ala-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVXXDMUMHMXFER-BPNCWPANSA-N 0.000 description 3
- DSWOTZCVCBEPOU-IUCAKERBSA-N Met-Arg-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCNC(N)=N DSWOTZCVCBEPOU-IUCAKERBSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 3
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 3
- 241000316848 Rhodococcus <scale insect> Species 0.000 description 3
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 3
- 241001052560 Thallis Species 0.000 description 3
- DGOJNGCGEYOBKN-BWBBJGPYSA-N Thr-Cys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(=O)O)N)O DGOJNGCGEYOBKN-BWBBJGPYSA-N 0.000 description 3
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 3
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- 108010038633 aspartylglutamate Proteins 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 3
- 108010040030 histidinoalanine Proteins 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 108010000761 leucylarginine Proteins 0.000 description 3
- 230000014759 maintenance of location Effects 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 108010005942 methionylglycine Proteins 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 239000010802 sludge Substances 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- 241000384157 Acinetobacter sp. OC4 Species 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- UGLPMYSCWHTZQU-AUTRQRHGSA-N Ala-Ala-Tyr Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UGLPMYSCWHTZQU-AUTRQRHGSA-N 0.000 description 2
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 2
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 2
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 2
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 2
- JGIAYNNXZKKKOW-KKUMJFAQSA-N Asn-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N JGIAYNNXZKKKOW-KKUMJFAQSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- UWOPETAWXDZUJR-ACZMJKKPSA-N Asp-Cys-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O UWOPETAWXDZUJR-ACZMJKKPSA-N 0.000 description 2
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 2
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 2
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 2
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 2
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 2
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 2
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 2
- JFSNBQJNDMXMQF-XHNCKOQMSA-N Gln-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O JFSNBQJNDMXMQF-XHNCKOQMSA-N 0.000 description 2
- JKGHMESJHRTHIC-SIUGBPQLSA-N Gln-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JKGHMESJHRTHIC-SIUGBPQLSA-N 0.000 description 2
- YGNPTRVNRUKVLA-DCAQKATOSA-N Gln-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N YGNPTRVNRUKVLA-DCAQKATOSA-N 0.000 description 2
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 2
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 2
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 2
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 2
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 2
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 2
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 2
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 2
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 2
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 2
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 2
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 241000204087 Pseudonocardia autotrophica Species 0.000 description 2
- 102000018120 Recombinases Human genes 0.000 description 2
- 108010091086 Recombinases Proteins 0.000 description 2
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 2
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 2
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 2
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- MQBTXMPQNCGSSZ-OSUNSFLBSA-N Thr-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N MQBTXMPQNCGSSZ-OSUNSFLBSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 2
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010027371 asparaginyl-leucyl-prolyl-arginine Proteins 0.000 description 2
- 229960005091 chloramphenicol Drugs 0.000 description 2
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 108010078144 glutaminyl-glycine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 108010003700 lysyl aspartic acid Proteins 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- JVBXVOWTABLYPX-UHFFFAOYSA-L sodium dithionite Chemical compound [Na+].[Na+].[O-]S(=O)S([O-])=O JVBXVOWTABLYPX-UHFFFAOYSA-L 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- 108010076441 Ala-His-His Proteins 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- FUKFQILQFQKHLE-DCAQKATOSA-N Ala-Lys-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O FUKFQILQFQKHLE-DCAQKATOSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- ISCYZXFOCXWUJU-KZVJFYERSA-N Ala-Thr-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O ISCYZXFOCXWUJU-KZVJFYERSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- SSQHYGLFYWZWDV-UVBJJODRSA-N Ala-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O SSQHYGLFYWZWDV-UVBJJODRSA-N 0.000 description 1
- IJPNNYWHXGADJG-GUBZILKMSA-N Arg-Ala-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O IJPNNYWHXGADJG-GUBZILKMSA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- NXVGBGZQQFDUTM-XVYDVKMFSA-N Asn-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N NXVGBGZQQFDUTM-XVYDVKMFSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- HCAUEJAQCXVQQM-ACZMJKKPSA-N Asn-Glu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HCAUEJAQCXVQQM-ACZMJKKPSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 1
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 1
- LKVKODXGSAFOFY-VEVYYDQMSA-N Asp-Met-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LKVKODXGSAFOFY-VEVYYDQMSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101100380241 Caenorhabditis elegans arx-2 gene Proteins 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- PQHYZJPCYRDYNE-QWRGUYRKSA-N Cys-Gly-Phe Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PQHYZJPCYRDYNE-QWRGUYRKSA-N 0.000 description 1
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010074122 Ferredoxins Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- KZKBJEUWNMQTLV-XDTLVQLUSA-N Gln-Ala-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZKBJEUWNMQTLV-XDTLVQLUSA-N 0.000 description 1
- APWLZZSLCXLDCF-CIUDSAMLSA-N Gln-Cys-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCSC)C(O)=O APWLZZSLCXLDCF-CIUDSAMLSA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 1
- AMHIFFIUJOJEKJ-SZMVWBNQSA-N Gln-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N AMHIFFIUJOJEKJ-SZMVWBNQSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 1
- HHRAEXBUNGTOGZ-IHRRRGAJSA-N Gln-Phe-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O HHRAEXBUNGTOGZ-IHRRRGAJSA-N 0.000 description 1
- NPMFDZGLKBNFOO-SRVKXCTJSA-N Gln-Pro-His Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NPMFDZGLKBNFOO-SRVKXCTJSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- SRZLHYPAOXBBSB-HJGDQZAQSA-N Glu-Arg-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SRZLHYPAOXBBSB-HJGDQZAQSA-N 0.000 description 1
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 1
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- QYPKJXSMLMREKF-BPUTZDHNSA-N Glu-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N QYPKJXSMLMREKF-BPUTZDHNSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- XNOWYPDMSLSRKP-GUBZILKMSA-N Glu-Met-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O XNOWYPDMSLSRKP-GUBZILKMSA-N 0.000 description 1
- SOEPMWQCTJITPZ-SRVKXCTJSA-N Glu-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N SOEPMWQCTJITPZ-SRVKXCTJSA-N 0.000 description 1
- JZJGEKDPWVJOLD-QEWYBTABSA-N Glu-Phe-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JZJGEKDPWVJOLD-QEWYBTABSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- YFGONBOFGGWKKY-VHSXEESVSA-N Gly-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)CN)C(=O)O YFGONBOFGGWKKY-VHSXEESVSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 1
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 1
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 1
- YIGCZZKZFMNSIU-RWMBFGLXSA-N His-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YIGCZZKZFMNSIU-RWMBFGLXSA-N 0.000 description 1
- CTEMYIWDSVICKS-WDSOQIARSA-N His-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N CTEMYIWDSVICKS-WDSOQIARSA-N 0.000 description 1
- WHKLDLQHSYAVGU-ACRUOGEOSA-N His-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WHKLDLQHSYAVGU-ACRUOGEOSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 1
- MCGOGXFMKHPMSQ-AVGNSLFASA-N His-Val-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MCGOGXFMKHPMSQ-AVGNSLFASA-N 0.000 description 1
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 1
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 1
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 1
- IBSGMIPRBMPMHE-IHRRRGAJSA-N Leu-Met-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O IBSGMIPRBMPMHE-IHRRRGAJSA-N 0.000 description 1
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- RDFIVFHPOSOXMW-ACRUOGEOSA-N Leu-Tyr-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RDFIVFHPOSOXMW-ACRUOGEOSA-N 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- PGBPWPTUOSCNLE-JYJNAYRXSA-N Lys-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N PGBPWPTUOSCNLE-JYJNAYRXSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 1
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 1
- SPNKGZFASINBMR-IHRRRGAJSA-N Lys-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N SPNKGZFASINBMR-IHRRRGAJSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- WLXGMVVHTIUPHE-ULQDDVLXSA-N Lys-Phe-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O WLXGMVVHTIUPHE-ULQDDVLXSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- ZRACLHJYVRBJFC-ULQDDVLXSA-N Met-Lys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZRACLHJYVRBJFC-ULQDDVLXSA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- SQPZCTBSLIIMBL-BPUTZDHNSA-N Met-Trp-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SQPZCTBSLIIMBL-BPUTZDHNSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 102000005431 Molecular Chaperones Human genes 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 101710192343 NADPH:adrenodoxin oxidoreductase, mitochondrial Proteins 0.000 description 1
- 102100036777 NADPH:adrenodoxin oxidoreductase, mitochondrial Human genes 0.000 description 1
- NVNLLIYOARQCIX-MSHCCFNRSA-N Nisin Chemical compound N1C(=O)[C@@H](CC(C)C)NC(=O)C(=C)NC(=O)[C@@H]([C@H](C)CC)NC(=O)[C@@H](NC(=O)C(=C/C)/NC(=O)[C@H](N)[C@H](C)CC)CSC[C@@H]1C(=O)N[C@@H]1C(=O)N2CCC[C@@H]2C(=O)NCC(=O)N[C@@H](C(=O)N[C@H](CCCCN)C(=O)N[C@@H]2C(NCC(=O)N[C@H](C)C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCSC)C(=O)NCC(=O)N[C@H](CS[C@@H]2C)C(=O)N[C@H](CC(N)=O)C(=O)N[C@H](CCSC)C(=O)N[C@H](CCCCN)C(=O)N[C@@H]2C(N[C@H](C)C(=O)N[C@@H]3C(=O)N[C@@H](C(N[C@H](CC=4NC=NC=4)C(=O)N[C@H](CS[C@@H]3C)C(=O)N[C@H](CO)C(=O)N[C@H]([C@H](C)CC)C(=O)N[C@H](CC=3NC=NC=3)C(=O)N[C@H](C(C)C)C(=O)NC(=C)C(=O)N[C@H](CCCCN)C(O)=O)=O)CS[C@@H]2C)=O)=O)CS[C@@H]1C NVNLLIYOARQCIX-MSHCCFNRSA-N 0.000 description 1
- 108010053775 Nisin Proteins 0.000 description 1
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 1
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 1
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 1
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- CJZTUKSFZUSNCC-FXQIFTODSA-N Pro-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 CJZTUKSFZUSNCC-FXQIFTODSA-N 0.000 description 1
- WPQKSRHDTMRSJM-CIUDSAMLSA-N Pro-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 WPQKSRHDTMRSJM-CIUDSAMLSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 1
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 1
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 1
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- CNUIHOAISPKQPY-HSHDSVGOSA-N Pro-Thr-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CNUIHOAISPKQPY-HSHDSVGOSA-N 0.000 description 1
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 101710104207 Probable NADPH:adrenodoxin oxidoreductase, mitochondrial Proteins 0.000 description 1
- 102220617537 Probable serine carboxypeptidase CPVL_T70R_mutation Human genes 0.000 description 1
- 241000187693 Rhodococcus rhodochrous Species 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 1
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- QSHKTZVJGDVFEW-GUBZILKMSA-N Ser-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CO)N QSHKTZVJGDVFEW-GUBZILKMSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 1
- LOHBIDZYHQQTDM-IXOXFDKPSA-N Thr-Cys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LOHBIDZYHQQTDM-IXOXFDKPSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- XUGYQLFEJYZOKQ-NGTWOADLSA-N Thr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XUGYQLFEJYZOKQ-NGTWOADLSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 1
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- XEVHXNLPUBVQEX-DVJZZOLTSA-N Thr-Trp-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N)O XEVHXNLPUBVQEX-DVJZZOLTSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- AXEJRUGTOJPZKG-XGEHTFHBSA-N Thr-Val-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O AXEJRUGTOJPZKG-XGEHTFHBSA-N 0.000 description 1
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 1
- PEYSVKMXSLPQRU-FJHTZYQYSA-N Trp-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PEYSVKMXSLPQRU-FJHTZYQYSA-N 0.000 description 1
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 1
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 1
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 1
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 1
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 1
- ZZDFLJFVSNQINX-HWHUXHBOSA-N Trp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O ZZDFLJFVSNQINX-HWHUXHBOSA-N 0.000 description 1
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- YGKVNUAKYPGORG-AVGNSLFASA-N Tyr-Asp-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YGKVNUAKYPGORG-AVGNSLFASA-N 0.000 description 1
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- KEANSLVUGJADPN-LKTVYLICSA-N Tyr-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N KEANSLVUGJADPN-LKTVYLICSA-N 0.000 description 1
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- DLYOEFGPYTZVSP-AEJSXWLSSA-N Val-Cys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N DLYOEFGPYTZVSP-AEJSXWLSSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 101150092805 actc1 gene Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000010411 cooking Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 150000002009 diols Chemical class 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000003311 flocculating effect Effects 0.000 description 1
- 239000003517 fume Substances 0.000 description 1
- 239000007789 gas Substances 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 1
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- -1 hydroxypropyl- Chemical group 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 238000006317 isomerization reaction Methods 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 239000012452 mother liquor Substances 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 239000004309 nisin Substances 0.000 description 1
- 235000010297 nisin Nutrition 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000011164 ossification Effects 0.000 description 1
- 238000006213 oxygenation reaction Methods 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 238000007142 ring opening reaction Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108010036387 trimethionine Proteins 0.000 description 1
- 108700004896 tripeptide FEG Proteins 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010003137 tyrosyltyrosine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0077—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with a reduced iron-sulfur protein as one donor (1.14.15)
- C12N9/0081—Cholesterol monooxygenase (cytochrome P 450scc)(1.14.15.6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/02—Preparation of oxygen-containing organic compounds containing a hydroxy group
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Organic Chemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
本发明提供了一种融合蛋白或其变体,其包括K1和RhFR,所述K1的氨基酸序列如SEQ ID NO:1所示,所述RhFR的氨基酸序列如SEQ ID NO:5的第466‑773位的氨基酸所示。本发明还提供了一种所述融合蛋白或其变体的制备方法和应用。将本发明的融合蛋白或其变体应用于催化VD3合成25‑羟基维生素D3(骨化二醇)时无需额外添加电子传递相关蛋白,操作简便,且所述融合蛋白中的电子传递效率高,催化效率高,所得骨化二醇的产量显著提高,且降低了生产的成本,适于工业化生产。
Description
技术领域
本发明涉及生物技术领域,具体涉及一种融合蛋白或其变体及其基因、其制备方法,以及融合蛋白或其变体在催化VD3制备骨化二醇中的用途。
背景技术
25-羟基维生素D3,又名骨化二醇,是维生素D3(VD3)的活性代谢物,具有较强的生理活性。传统合成25-羟基维生素D3的方法是化学合成法,需要进行多步的基团保护与脱保护,然后经光照反应、开环和异构化的到25-羟基维生素D3。近年来对VD3的微生物转化方面的研究得到了快速的发展,但是转化率低,而提高活性VD3产量的关键之一在于筛选获得转化效率高的关键代谢酶细胞色素P450酶。
细胞色素P450酶广泛存在于原核生物与真核生物中,其介导的单加氧反应通常需要电子传递链铁氧还蛋白与铁氧还蛋白还原酶的参与,且高效的电子传递体系能有效地提高P450的反应效率。也有自洽型P450的报道,反应过程中无需再添加电子传递相关的蛋白。目前,已报道2种自洽型P450,即来源于巨大芽孢杆菌Bacillus megaterium的P450BM3和来源红球菌Rhodococcus的P450 RhFRed,其蛋白结构上包含P450结构域与电子传递结构域,反应过程中无需再添加电子传递相关的蛋白。但目前未见自洽型P450用于催化VD3产生骨化二醇的报道。
Tamura等(BioChemical and Biophysical Research Communications 2009,385,170–175)对Pseudonocardia autotrophica来源的P450酶进行了改造得到了4个位点突变的突变体Vdh-K1(T70R、V156L、E216M和E384R),并测得用于VD3的催化时Vdh-K1是野生型Vdhwt的活性的21.6倍。但是用于VD3的催化需要电子传递体系的参与,操作繁琐、成本较高,且催化效率仍旧不是很高。Tamura等(ChemBioChem 2013,14,2284-2291)对Pseudonocardiaautotrophica来源的P450酶进行了改造(进行了T107A等突变),并在红球菌中进行异源表达,通过添加乳酸链球菌素以提高细胞膜的通透性,减少底物VD3传质阻力,最终催化VD3两小时得到的骨化二醇的产量达到573μg/mL,方法中采用了电子传递系统AciFdx和AciFdxR,但产率还有待提高。
因此现有技术中急需一种催化VD3制备骨化二醇时催化效率高、所得产物的产量很高且操作简便、成本较低的方法。
发明内容
本发明所要解决的技术问题是针对现有技术中用细胞色素P450酶催化VD3制备骨化二醇时产率低、成本高、操作繁琐等缺陷,提供了一种融合蛋白或其变体及其基因、其制备方法与用其催化VD3制备骨化二醇的应用。将本发明的融合蛋白或其变体应用于催化VD3合成25-羟基维生素D3(骨化二醇)时无需额外添加电子传递相关蛋白,操作简便,且所述融合蛋白中的电子传递效率高,催化效率高,所得骨化二醇的产量显著提高,且降低了生产的成本,适于工业化生产。
提高P450酶催化VD3产率的方法众多,包括寻求高效的P450酶、优化反应条件等,本发明人经过大量摸索,意外发现将特定种类的P450通过特定的方式改造为自洽型P450酶时,能够显著提高催化VD3制备骨化二醇时的催化效率,并显著提高骨化二醇的产量。
为了解决上述技术问题,本发明第一方面提供了一种融合蛋白或其变体,其包括K1和RhFR,所述K1的氨基酸序列如SEQ ID NO:1所示,所述RhFR的氨基酸序列如SEQ ID NO:5的第466-773位的氨基酸所示。
较佳地,所述融合蛋白从N端至C端依次为K1和RhFR。
较佳地,所述K1和RhFR之间通过连接子(linker)进行连接,所述连接子的氨基酸序列优选如SEQ ID NO:5的第445-465位的氨基酸所示。
较佳地,所述融合蛋白或其变体与分子伴侣共表达,所述分子伴侣优选为Gro7。在本发明某一较佳实施例中,表达所述融合蛋白时,可以加入分子伴侣例如Gro7伴侣蛋白,使得所述融合蛋白与所述分子伴侣共表达。本发明中,所述Gro7伴侣蛋白可以是商购来源的,例如可以是购自Biovector Science Lab,Inc的Gro7。
较佳地,所述变体为在所述RhFR的N端发生氨基酸的插入或缺失;更佳地,所述变体为在所述RhFR的N端发生1-14个氨基酸的插入或缺失;进一步更佳地,所述变体为在所述RhFR的N端发生3-6个氨基酸的插入或缺失。
较佳地,所述融合蛋白或其变体的氨基酸序列如SEQ ID NO:9、SEQ ID NO:11、SEQID NO:13、SEQ ID NO:15、SEQ ID NO:17或SEQ ID NO:19所示。
更佳地,编码所述融合蛋白或其变体的核苷酸序列如SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18或SEQ ID NO:20所示。
为了解决上述技术问题,本发明第二方面提供了一种融合基因,其编码如本发明第一方面所述的融合蛋白或其变体。
为了解决上述技术问题,本发明第三方面提供了一种重组表达载体,所述重组表达载体含有如本发明第二方面所述的融合基因。
较佳地,所述重组表达载体的骨架载体为pET28a。
为了解决上述技术问题,本发明第四方面提供了一种转化体,其包括如本发明第二方面所述的融合基因或者如本发明第三方面所述的重组表达载体。
较佳地,所转化体通过在宿主中导入如本发明第二方面所述的融合基因或者如本发明第三方面所述的重组表达载体获得。
更佳地,所述宿主为大肠杆菌,优选为大肠杆菌E.coli BL21(DE3)细胞。
为了解决上述技术问题,本发明第五方面提供了一种融合蛋白或其变体的制备方法,其包括以下步骤:
(1)获得如本发明第四方面所述的转化体;
(2)筛选所述转化体,表达并纯化所述融合蛋白或其变体。
为了解决上述技术问题,本发明第六方面提供了一种骨化二醇的制备方法,所述制备方法包括以下步骤:在反应溶剂、还原型辅酶NADH/NADPH的存在下,将如本发明第一方面所述的融合蛋白或其变体催化维生素D3进行羟化反应即可。
较佳地,所述维生素D3为助溶剂预溶的维生素D3;所述助溶剂优选包括DMSO、吐温80、Triton X100、甲醇、乙醇、异丙醇和DMF中的一种或多种,例如为乙醇。
较佳地,所述方法还包括在进行所述羟化反应前,在所述反应溶剂中加入环糊精的步骤,所述环糊精例如可以为羟丙基-β-环糊精的步骤;所述羟丙基-β-环糊精占反应体系的质量体积百分比优选为0.05%-0.4%,例如0.25%。
较佳地,所述反应的温度为20~33℃,例如为22℃、25℃、28℃或30℃。本发明人实验过程中发现,不在此温度范围内时,所得产物的产量会有所降低。
较佳地,所述反应的pH为6.0~8.0,例如为7.4。本发明人在实验过程中发现,pH不在本发明规定的范围内时,所得产物的产量会有所降低。
较佳地,所述维生素D3的浓度为1g/L-10g/L,例如为1g/L、2g/L、3g/L、4g/L、5g/L、6g/L、7g/L、8g/L、9g/L或10g/L。本发明人实验过程中发现,VD3浓度过高,无法溶解或者可能底物抑制酶活,反应不完全,浓度过低,产量也很低。
较佳地,所述NADH/NADPH与所述维生素D3的摩尔比为0.001:1~2:1,例如0.2:1。
较佳地,所述的制备方法还包括以下步骤:在脱氢酶以及供氢体的存在下,将氧化型辅酶NAD+/NADP+进行还原反应,得到所述的还原型辅酶NADH/NADPH即可;
更佳地,所述的脱氢酶为葡萄糖脱氢酶、醇脱氢酶或甲酸脱氢酶;和/或,所述的供氢体为葡萄糖、异丙醇或甲酸盐;
进一步更佳地,当所述的脱氢酶为醇脱氢酶时,所述的供氢体为异丙醇;当所述的脱氢酶为葡萄糖脱氢酶时,所述的供氢体为葡萄糖;当所述的脱氢酶为甲酸脱氢酶时,所述的供氢体为甲酸盐。
为了解决上述技术问题,本发明第七方面提供了一种如本发明第一方面所述的融合蛋白或其变体、如本发明第二方面的融合基因、如本发明第三方面所述的重组表达载体、或如本发明第四方面所述的转化体在制备骨化二醇中的应用。
在符合本领域常识的基础上,上述各优选条件,可任意组合,即得本发明各较佳实例。
本发明所用试剂和原料均市售可得。
本发明的积极进步效果在于:将本发明的融合蛋白或其变体应用于催化VD3合成25-羟基维生素D3(骨化二醇)时无需额外添加电子传递相关蛋白,操作简便,且所述融合蛋白中的电子传递效率高,催化效率高,所得骨化二醇的产量显著提高,且降低了生产的成本,适于工业化生产。在本发明某一较佳实施例中,本发明的融合蛋白或其变体催化VD3时,所得骨化二醇的产量高达4.427g/L。
附图说明
图1为实施例6中K1-RhFR-I6-Gro7的检测图谱。
图2为VD3底物对照品的图谱结果。
图3为骨化二醇对照品的图谱结果。
具体实施方式
下面通过实施例的方式进一步说明本发明,但并不因此将本发明限制在所述的实施例范围之中。下列实施例中未注明具体条件的实验方法,按照常规方法和条件,或按照商品说明书选择。
产物的HPLC分析方法
色谱条件:色谱柱:Poroshell EC-C18(4.0μm,4.6×150mm);检测波长:265nm;流速:1mL/min;柱温:35℃;进样体积:10μL。梯度洗脱程序如下:0-8min,H2O:乙腈=85%:15%;8-20min,H2O:乙腈=0:100%;20-21min,H2O:乙腈=85%:15%;21-27min,H2O:乙腈=85%:15%。
pET28a购买自Novagen公司;DpnI酶、NdeI酶、HindIII酶购买自Thermo Fisher公司;ExnaseⅡ酶购买至南京诺唯赞生物科技有限公司;E.coli BL21(DE3)感受态细胞购买自北京鼎国昌盛生物技术有限责任公司;NAD+购买自深圳邦泰生物工程有限公司;维生素D3、质粒提取试剂盒购买自生工生物工程(上海)股份有限公司。
实施例1融合蛋白酶菌株的构建
1.1蛋白序列分析
本实施例将能催化VD3的P450酶Vdh-K1与P450 BM3和P450 RhFRed的还原区域进行融合,构建自洽型P450融合蛋白,具体步骤如下:
根据编码NCBI上已报道的细胞色素P450酶SEQ ID NO:1、3、5的基因序列SEQ IDNO:2、4、6全基因合成各基因。合成基因的公司为苏州金唯智生物科技有限公司(苏州工业园区星湖街218号生物纳米科技园C3楼)。各基因信息如表1所示。
表1
利用Discovery studio软件分析BM3蛋白的氨基酸序列,发现N端的第1-459位氨基酸为蛋白的P450结构域;第460-479位氨基酸为P450结构域与电子传递结构域的Linker;第480-1048位氨基酸为蛋白的电子传递结构域。
利用Discovery studio软件分析RhFRed蛋白的氨基酸序列,发现N端的第24-444位氨基酸为蛋白的P450结构域;第445-465位氨基酸为P450结构域与电子传递结构域的Linker(连接子);第466-773位氨基酸为蛋白的电子传递结构域。
1.2K1、FdR、Fdx蛋白工程菌株以及融合蛋白工程菌株构建
将实施例1.1合成的K1、BM3、RhFRed以及合成的电子传递系统AciFdR、AciFdx(如下表2所示)基因分别连pET28a,酶切位点NdeI&HindIII,将酶连好的载体转化至宿主E.coli BL21(DE3)感受态细胞,分别得到含有K1、BM3、RhFRed、AciFdR、AciFdx的工程菌株。利用上海生工的质粒提取试剂盒提取质粒,分别得到pET28a-K1、pET28a-BM3、pET28a-RhFRed、pET28a-FdR、pET28a-Fdx质粒。
将所得pET28a-K1、pET28a-FdR质粒分别与Gro7伴侣质粒共转E.coli BL21,得到BL21-pET28a-K1-Gro7和BL21-pET28a-FdR-Gro7工程菌。将pET28a-Fdx质粒转化BL21,得到BL21-pET28a-Fdx工程菌。
表2
酶编号 | 基因来源 | NCBI登录号 |
AciFdx | Acinetobacter sp.OC4 | BAE78451.1 |
AciFdR | Acinetobacter sp.OC4 | BAE78453.1 |
以原质粒pET28a-K1为模板,K1-Rh-F1、K1-Rh-R1为引物,扩增目的片段ΔK1-1。以pET28a-RhFRed质粒为模板,RhFR-F1、RhFR-R1为引物,扩增5’端与ΔK1-1的3’端具有15bP同源臂,且3’端与ΔK1-1的5’端具有15bP同源臂的载体片段RhFR。将PCR产物用Dpn1消化,37℃,2小时。反应完成后ΔK1-1和RhFR用重组酶ExnaseⅡ进行重组,37℃,0.5小时,重组产物转化至BL21感受态细胞,涂布在含有50μg/mL卡那霉素的LB培养基,37℃培养过夜,得到BL21-pET28a-K1-RhFR转化子,即BL21-pET28a-K1-RhFR工程菌。挑取BL21-pET28a-K1-RhFR转化子接种至含50μg/mL卡那霉素的5ml LB液体培养基中,37℃震荡培养6h,提取pET28a-K1-RhFR质粒,将pET28a-K1-RhFR质粒与Gro7伴侣质粒共转BL21,得到BL21-pET28a-K1-RhFR-Gro7工程菌。引物及其序列如表2所示。
以原质粒pET28a-K1为模板,K1-BM3-F1、K1-BM3-R1为引物,扩增目的片段ΔK1-2。以pET28a-BM3质粒为模板,BM3R-F1、BM3R-R1为引物,扩增5’端与ΔK1-2的3’端具有15bP同源臂,且3’端与K1-2的5’端具有15bP同源臂的载体片段BM3R。将PCR产物用Dpn1消化,37℃,2小时。反应完成后ΔK1-2和BM3R用重组酶ExnaseⅡ进行重组,37℃,0.5小时,重组产物转化至BL21感受态细胞,涂布在含有50μg/mL卡那霉素的LB培养基,37℃培养过夜,得到BL21-pET28a-K1-BM3R转化子,即BL21-pET28a-K1-BM3R工程菌。挑取BL21-pET28a-K1-BM3R转化子接种至含50μg/mL卡那霉素的5ml LB液体培养基中,37℃震荡培养6h,提取pET28a-K1-BM3R质粒,将pET28a-K1-BM3R质粒与Gro7伴侣质粒(购自Biovector Science Lab,Inc)共转BL21,得到BL21-pET28a-K1-BM3R-Gro7工程菌。引物及其序列如表3所示。
表3
实施例2融合蛋白酶的制备
分别将实施例1中构建好的BL21-pET28a-K1-Gro7、BL21-pET28a-FdR-Gro7、BL21-pET28a-Fdx工程菌以及融合蛋白工程菌BL21-pET28a-K1-RhFR、BL21-pET28a-K1-RhFR-Gro7、BL21-pET28a-K1-BM3R、BL21-pET28a-K1-BM3R-Gro7的单菌落接种至含50μg/ml卡那霉素与50μg/ml氯霉素的5ml LB液体培养基中,37℃震荡培养12h。按2v/v%接种量转接至50ml同样含50μg/ml卡那霉素与50μg/ml氯霉素的新鲜LB液体培养基中,37℃震荡至OD600达到0.8左右时,加入IPTG至其终浓度为0.5mM,22℃诱导培养22h。培养结束后,将培养液10000rpm离心10min,弃上清液,收集菌体(即菌泥),置于-20℃冰箱中保存,待用。
分别将各融合蛋白的菌泥用100mM PBS7.4以1:4(W/V)的比例进行均质,均质液用4000rpm离心20min,弃沉淀。上清用2‰的PEI进行絮凝,4000rpm离心20min,上清即为K1-Gro7、AciFdR-Gro7、AciFdx蛋白以及融合蛋白K1-BM3R、K1-BM3R-Gro7、K1-RhFR、K1-RhFR-Gro7的粗酶液。
实施例3融合蛋白P450活性测定
采用CO差光谱法测定K1融合蛋白P450浓度。
测定方法:分别取待测样品(即实施例2中的融合蛋白粗酶液)1mL于2根10mL离心管,标为对照管、样品管。将样品拿至通风橱,先取一根离心管装适量水,将CO管道插入水中,调节三通阀至CO的出气速度约为1秒一个气泡。对照管与样品管分别加入1mg连二亚硫酸钠粉末,反复颠倒使连二亚硫酸钠溶解完全并混合均匀。分别将对照管与样品管液体转移至比色皿中,于紫外分光光度计上扫描400-500nm的吸光值。
酶浓度计算:
CP450=(ΔA450-ΔA490)/(ε450·L)
其中:
CP450,所测样品中P450酶的浓度,单位nmol/mL;
ΔA450,A450样品-A450对照的差值;
ΔA490,A490样品-A490对照的差值;
ε450,P450摩尔吸光系数,为0.091mL/nmol-1·cm-1。
L,光程,1cm。
测定结果如下表4所示:
表4
上述所得融合蛋白均具有P450活性,说明融合蛋白构建成功。
实施例4葡萄糖脱氢酶(GDH)基因的获取和表达
根据来源于枯草芽胞杆菌(Bacillμs sμbtilis)168(NCBI登录号为NP_388275.1)的葡萄糖脱氢酶基因序列,全合成葡萄糖脱氢酶基因。
葡萄糖脱氢酶基因连pET21a,酶切位点NdeI&HindIII,将酶连好的载体转化至宿主E.coli BL21(DE3)感受态细胞,得到含有葡萄糖脱氢酶基因的工程菌株。将含有葡萄糖脱氢酶基因的工程菌在经平皿划线活化后,挑单菌落接种至含100μg/ml氨苄青霉素的5mlLB液体培养基中,37℃震荡培养12h。按2%(v/v)接种量转接至50ml同样含100μg/ml氨苄青霉素的新鲜LB液体培养基中,37℃震荡至OD600达到0.8左右时,加入IPTG至其终浓度为0.5mM,18℃诱导培养16h。培养结束后,将培养液10000rpm离心10min,弃上清液,收集菌体(即得到葡萄糖脱氢酶菌泥),置于-20℃冰箱中保存,待用。
实施例5融合蛋白酶体外催化VD3
底物VD3(购自上海德默医药科技有限公司)用乙醇配制成浓度为50g/L的母液,并加入25%的羟丙基-β-环糊精助溶。使用实施例2中的粗酶液进行体外酶催化反应,反应体系如表5所示。
表5
①加入蛋白粗酶液为K1-Gro7粗酶液的反应体系
②加入蛋白粗酶液为各融合蛋白的反应体系
28℃下,反应14h,随后取样100μL,加入500μL乙醇与400μL乙腈,12000rpm离心3min,上清液过膜除杂后进行HPLC检测。检测结果如下表6所示:
表6
其中,K1-RhFR和K1-RhFR-Gro7的催化能力高于K1-BM3R和K1-BM3R-Gro7,故后续针对融合蛋白K1-RhFR进行Linker的优化。
实施例6融合蛋白Linker优化
为了进一步提高融合蛋白K1-RhFR的活性,设计引物将融合蛋白上K1-RhFR的天然Linker延长或缩短。I3、I6表示在天然Linker的N端分别插入3个、6个氨基酸;D3、D6表示在天然Linker的N端分别删除3个、6个氨基酸;I14表示在N端插入14个氨基酸,具体序列见表7。
表7
以pET28a-K1-RhFR为模板,以I3F、I3R为引物进行PCR,将PCR产物用Dpn1消化,37℃,2小时。反应完成后PCR产物转化至BL21感受态细胞,涂布在含有50μg/mL卡那霉素的LB培养基,37℃培养过夜,得到BL21-pET28a-K1-RhFR-I3转化子。挑取BL21-pET28a-K1-RhFR-I3转化子接种至含50μg/mL卡那霉素的5ml LB液体培养基中,37℃震荡培养6h,提取pET28a-K1-RhFR-I3质粒,将pET28a-K1-RhFR-I3质粒与Gro7伴侣质粒共转BL21,得到BL21-pET28a-K1-RhFR-I3-Gro7工程菌。引物及其序列如表3所示。
采用上述同样的方法,分别获得BL21-pET28a-K1-RhFR-I6-Gro7、BL21-pET28a-K1-RhFR-D3-Gro7、BL21-pET28a-K1-RhFR-D6-Gro7、BL21-pET28a-K1-RhFR-I14-Gro7工程菌,即带有不同Linker长度的工程菌。
按照实施例2同样的方法,得到下表8中融合蛋白酶粗酶液。
按照实施例5同样的方法,将所得融合蛋白酶粗酶液用于体外催化VD3,结果如下表7所示。其中以K1-RhFR-I6-Gro7为例,催化后所得产物的检测图谱如图1所示,保留时间19.050min为VD3,10.884min为骨化二醇。图2为VD3对照品(购自上海德默医药科技有限公司)的图谱,保留时间为19.020min,图3为骨化二醇对照品(购自国家标准物质网)的图谱,保留时间为10.920min。可见,该实施例中VD3与产物骨化二醇的出峰时间与各自对照品的出峰时间基本一致,该实施例制备得到骨化二醇,其他粗酶液的结果也与K1-RhFR-I6-Gro7一致,所得产物中VD3和骨化二醇的出峰时间与各自对照品的出峰时间均基本一致。
表8
SEQUENCE LISTING
<110> 上海弈柯莱生物医药科技有限公司
<120> 一种融合蛋白或其变体及其在制备骨化二醇中的应用
<130> P19014220C
<160> 38
<170> PatentIn version 3.5
<210> 1
<211> 403
<212> PRT
<213> Pseudonocardia autotrophica
<400> 1
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala
<210> 2
<211> 1209
<212> DNA
<213> Pseudonocardia autotrophica
<400> 2
atggcactga ccaccaccgg taccgaacag catgacctgt ttagcggtac cttttggcag 60
aatccgcatc cggcgtatgc agcactgcgt gcagaagatc cggttcgtaa actggcactg 120
ccggatggtc cggtgtggct gctgacccgt tatgcagatg ttcgtgaagc atttgttgat 180
ccgcgtctga gtaaagattg gcgtcatcgt ctgccggaag atcagcgtgc cgatatgccg 240
gcaaccccga ccccgatgat gattctgatg gacccgccgg atcatacacg tttacgtaaa 300
ctggttggtc gtagttttac cgttcgtcgt atgaatgaac tggaaccgcg tattaccgaa 360
attgcagatg gtctgctggc aggtctgccg accgatggtc cggttgatct gatgcgtgaa 420
tatgcatttc agattccggt tcaggttata tgtgaactgc tgggtctgcc ggcagaagat 480
cgtgatgatt tttcagcatg gtcaagtgtg ctggttgatg attctccggc agatgataaa 540
aatgccgcaa tgggtaaact gcatggttat ctgtcagatc tgctggaacg taaacgtacc 600
gaaccggatg atgcactgct gagtagcctg ctggcggttt ctgatatgga tggtgatcgt 660
ctgtctcagg aagaactggt tgcaatggca atgctgctgc tgattgcagg tcatgaaacc 720
accgttaatc tgattggtaa tggtgtgctg gcactgctga cccatccgga tcagcgtaaa 780
ctgttagctg aagatccgag tctgattagc tcagcagttg aagaatttct gcgttttgat 840
tctccggtta gccaggcacc gatccgtttt accgctgaag atgttaccta tagtggtgtt 900
accattccgg caggtgaaat ggttatgctg ggtctggcag cagcaaatcg cgatgcagat 960
tggatgccgg aaccggatcg tctggatatt acccgtgatg caagtggtgg tgttttcttt 1020
ggtcatggta ttcatttttg tctgggtgcg cagctggcac gtctggaagg tcgtgtggca 1080
attggtcgtc tgtttgcaga tcgtccggaa ctggcactgg cagttggtct ggatgaactg 1140
gtgtatcgtc gtagcaccct ggttcgtggt ctgagtagga tgccggtgac aatgggtccg 1200
cgttcagca 1209
<210> 3
<211> 1049
<212> PRT
<213> Bacillus megaterium
<400> 3
Met Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys
1 5 10 15
Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys
20 25 30
Ile Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg
35 40 45
Val Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp
50 55 60
Glu Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg
65 70 75 80
Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn
85 90 95
Trp Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala
100 105 110
Met Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val
115 120 125
Gln Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu
130 135 140
Asp Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn
145 150 155 160
Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr
165 170 175
Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala
180 185 190
Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu
195 200 205
Asp Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg
210 215 220
Lys Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn
225 230 235 240
Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg
245 250 255
Tyr Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly
260 265 270
Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu
275 280 285
Gln Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro
290 295 300
Ser Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn
305 310 315 320
Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala
325 330 335
Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp
340 345 350
Glu Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp
355 360 365
Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser
370 375 380
Ala Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala
385 390 395 400
Cys Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly
405 410 415
Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu
420 425 430
Asp Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys
435 440 445
Ala Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr
450 455 460
Glu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn
465 470 475 480
Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly
485 490 495
Thr Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro
500 505 510
Gln Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly
515 520 525
Ala Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn
530 535 540
Ala Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val
545 550 555 560
Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala
565 570 575
Thr Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala
580 585 590
Lys Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp
595 600 605
Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp
610 615 620
Val Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys
625 630 635 640
Ser Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu
645 650 655
Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu
660 665 670
Leu Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu
675 680 685
Leu Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile
690 695 700
Pro Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly
705 710 715 720
Leu Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu
725 730 735
Ala His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln
740 745 750
Tyr Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met
755 760 765
Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu
770 775 780
Leu Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr
785 790 795 800
Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser
805 810 815
Glu Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile
820 825 830
Ser Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser
835 840 845
Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile
850 855 860
Ala Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys
865 870 875 880
Phe Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu
885 890 895
Thr Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg
900 905 910
Gly Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu
915 920 925
Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr
930 935 940
Leu Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr
945 950 955 960
Leu His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val
965 970 975
Gln His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp
980 985 990
Gln Gly Ala His Phe Tyr Ile Cys Gly Asp Gly Ser Gln Met Ala Pro
995 1000 1005
Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gln
1010 1015 1020
Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gln Gln Leu Glu Glu
1025 1030 1035
Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly
1040 1045
<210> 4
<211> 3147
<212> DNA
<213> Bacillus megaterium
<400> 4
atgacaatta aagaaatgcc tcagccaaaa acgtttggag agcttaaaaa tttaccgtta 60
ttaaacacag ataaaccggt tcaagctttg atgaaaattg cggatgaatt aggagaaatc 120
tttaaattcg aggcgcctgg tcgtgtaacg cgctacttat caagtcagcg tctaattaaa 180
gaagcatgcg atgaatcacg ctttgataaa aacttaagtc aagcgcttaa atttgtacgt 240
gattttgcag gagacgggtt atttacaagc tggacgcacg aaaaaaattg gaaaaaagcg 300
cataatatct tacttccaag cttcagtcag caggcaatga aaggctatca tgcgatgatg 360
gtcgatatcg ccgtgcagct tgttcaaaag tgggagcgtc taaatgcaga tgagcatatt 420
gaagtacccg aagatatgac acgtttaacg cttgatacaa ttggtctttg cggctttaac 480
tatcgcttta acagctttta ccgagatcag cctcatccat ttattacaag tatggtccgt 540
gcactggatg aagcaatgaa caagctgcag cgagcaaatc cagacgaccc agcttatgat 600
gaaaacaagc gccagtttca agaagatatc aaggtgatga acgacctagt agataaaatt 660
attgcagatc gcaaagcaag cggtgaacaa agcgatgatt tattaacgca tatgctaaac 720
ggaaaagatc cagaaacagg tgagccgctt gatgacgaga acattcgcta tcaaattatt 780
acattcttaa ttgcgggaca cgaaacaaca agcggtcttt tatcatttgc gctgtatttc 840
ttagtgaaaa atccacatgt attacaaaaa gcagcagaag aagcagcacg agttctagta 900
gatcctgttc caagctacaa acaagtcaaa cagcttaaat atgtcggcat ggtcttaaac 960
gaagcgctgc gcttatggcc aactgctcct gcgttttccc tatatgcaaa agaagatacg 1020
gtgcttggag gagaatatcc tttagaaaaa ggcgacgaac taatggttct gattcctcag 1080
cttcaccgtg ataaaacaat ttggggagac gatgtggaag agttccgtcc agagcgtttt 1140
gaaaatccaa gtgcgattcc gcagcatgcg tttaaaccgt ttggaaacgg tcagcgtgcg 1200
tgtatcggtc agcagttcgc tcttcatgaa gcaacgctgg tacttggtat gatgctaaaa 1260
cactttgact ttgaagatca tacaaactac gagctggata ttaaagaaac tttaacgtta 1320
aaacctgaag gctttgtggt aaaagcaaaa tcgaaaaaaa ttccgcttgg cggtattcct 1380
tcacctagca ctgaacagtc tgctaaaaaa gtacgcaaaa aggcagaaaa cgctcataat 1440
acgccgctgc ttgtgctata cggttcaaat atgggaacag ctgaaggaac ggcgcgtgat 1500
ttagcagata ttgcaatgag caaaggattt gcaccgcagg tcgcaacgct tgattcacac 1560
gccggaaatc ttccgcgcga aggagctgta ttaattgtaa cggcgtctta taacggtcat 1620
ccgcctgata acgcaaagca atttgtcgac tggttagacc aagcgtctgc tgatgaagta 1680
aaaggcgttc gctactccgt atttggatgc ggcgataaaa actgggctac tacgtatcaa 1740
aaagtgcctg cttttatcga tgaaacgctt gccgctaaag gggcagaaaa catcgctgac 1800
cgcggtgaag cagatgcaag cgacgacttt gaaggcacat atgaagaatg gcgtgaacat 1860
atgtggagtg acgtagcagc ctactttaac ctcgacattg aaaacagtga agataataaa 1920
tctactcttt cacttcaatt tgtcgacagc gccgcggata tgccgcttgc gaaaatgcac 1980
ggtgcgtttt caacgaacgt cgtagcaagc aaagaacttc aacagccagg cagtgcacga 2040
agcacgcgac atcttgaaat tgaacttcca aaagaagctt cttatcaaga aggagatcat 2100
ttaggtgtta ttcctcgcaa ctatgaagga atagtaaacc gtgtaacagc aaggttcggc 2160
ctagatgcat cacagcaaat ccgtctggaa gcagaagaag aaaaattagc tcatttgcca 2220
ctcgctaaaa cagtatccgt agaagagctt ctgcaatacg tggagcttca agatcctgtt 2280
acgcgcacgc agcttcgcgc aatggctgct aaaacggtct gcccgccgca taaagtagag 2340
cttgaagcct tgcttgaaaa gcaagcctac aaagaacaag tgctggcaaa acgtttaaca 2400
atgcttgaac tgcttgaaaa atacccggcg tgtgaaatga aattcagcga atttatcgcc 2460
cttctgccaa gcatacgccc gcgctattac tcgatttctt catcacctcg tgtcgatgaa 2520
aaacaagcaa gcatcacggt cagcgttgtc tcaggagaag cgtggagcgg atatggagaa 2580
tataaaggaa ttgcgtcgaa ctatcttgcc gagctgcaag aaggagatac gattacgtgc 2640
tttatttcca caccgcagtc agaatttacg ctgccaaaag accctgaaac gccgcttatc 2700
atggtcggac cgggaacagg cgtcgcgccg tttagaggct ttgtgcaggc gcgcaaacag 2760
ctaaaagaac aaggacagtc acttggagaa gcacatttat acttcggctg ccgttcacct 2820
catgaagact atctgtatca agaagagctt gaaaacgccc aaagcgaagg catcattacg 2880
cttcataccg ctttttctcg catgccaaat cagccgaaaa catacgttca gcacgtaatg 2940
gaacaagacg gcaagaaatt gattgaactt cttgatcaag gagcgcactt ctatatttgc 3000
ggagacggaa gccaaatggc acctgccgtt gaagcaacgc ttatgaaaag ctatgctgac 3060
gttcaccaag tgagtgaagc agacgctcgc ttatggctgc agcagctaga agaaaaaggc 3120
cgatacgcaa aagacgtgtg ggctggg 3147
<210> 5
<211> 773
<212> PRT
<213> Rhodococcus
<400> 5
Met Ser Ala Ser Val Pro Ala Ser Ala Pro Ala Cys Pro Val Asp His
1 5 10 15
Ala Ala Leu Ala Gly Gly Cys Pro Val Ser Ala Asn Ala Ala Ala Phe
20 25 30
Asp Pro Phe Gly Ser Ala Tyr Gln Thr Asp Pro Ala Glu Ser Leu Arg
35 40 45
Trp Ser Arg Asp Glu Glu Pro Val Phe Tyr Ser Pro Glu Leu Gly Tyr
50 55 60
Trp Val Val Thr Arg Tyr Glu Asp Val Lys Ala Val Phe Arg Asp Asn
65 70 75 80
Ile Leu Phe Ser Pro Ala Ile Ala Leu Glu Lys Ile Thr Pro Val Ser
85 90 95
Ala Glu Ala Thr Ala Thr Leu Ala Arg Tyr Asp Tyr Ala Met Ala Arg
100 105 110
Thr Leu Val Asn Glu Asp Glu Pro Ala His Met Pro Arg Arg Arg Ala
115 120 125
Leu Met Asp Pro Phe Thr Pro Lys Glu Leu Ala His His Glu Ala Met
130 135 140
Val Arg Arg Leu Thr Arg Glu Tyr Val Asp Arg Phe Val Glu Ser Gly
145 150 155 160
Lys Ala Asp Leu Val Asp Glu Met Leu Trp Glu Val Pro Leu Thr Val
165 170 175
Ala Leu His Phe Leu Gly Val Pro Glu Glu Asp Met Ala Thr Met Arg
180 185 190
Lys Tyr Ser Ile Ala His Thr Val Asn Thr Trp Gly Arg Pro Ala Pro
195 200 205
Glu Glu Gln Val Ala Val Ala Glu Ala Val Gly Arg Phe Trp Gln Tyr
210 215 220
Ala Gly Thr Val Leu Glu Lys Met Arg Gln Asp Pro Ser Gly His Gly
225 230 235 240
Trp Met Pro Tyr Gly Ile Arg Lys Gln Arg Glu Met Pro Asp Val Val
245 250 255
Thr Asp Ser Tyr Leu His Ser Met Met Met Ala Gly Ile Val Ala Ala
260 265 270
His Glu Thr Thr Ala Asn Ala Ser Ala Asn Ala Phe Lys Leu Leu Leu
275 280 285
Glu Asn Arg Ala Val Trp Glu Glu Ile Cys Ala Asp Pro Ser Leu Ile
290 295 300
Pro Asn Ala Val Glu Glu Cys Leu Arg His Ser Gly Ser Val Ala Ala
305 310 315 320
Trp Arg Arg Val Ala Thr Ala Asp Thr Arg Ile Gly Asp Val Asp Ile
325 330 335
Pro Ala Gly Ala Lys Leu Leu Val Val Asn Ala Ser Ala Asn His Asp
340 345 350
Glu Arg His Phe Glu Arg Pro Asp Glu Phe Asp Ile Arg Arg Pro Asn
355 360 365
Ser Ser Asp His Leu Thr Phe Gly Tyr Gly Ser His Gln Cys Met Gly
370 375 380
Lys Asn Leu Ala Arg Met Glu Met Gln Ile Phe Leu Glu Glu Leu Thr
385 390 395 400
Thr Arg Leu Pro His Met Glu Leu Val Pro Asp Gln Glu Phe Thr Tyr
405 410 415
Leu Pro Asn Thr Ser Phe Arg Gly Pro Asp His Val Trp Val Gln Trp
420 425 430
Asp Pro Gln Ala Asn Pro Glu Arg Thr Asp Pro Ala Val Leu His Arg
435 440 445
His Gln Pro Val Thr Ile Gly Glu Pro Ala Ala Arg Ala Val Ser Arg
450 455 460
Thr Val Thr Val Glu Arg Leu Asp Arg Ile Ala Asp Asp Val Leu Arg
465 470 475 480
Leu Val Leu Arg Asp Ala Gly Gly Lys Thr Leu Pro Thr Trp Thr Pro
485 490 495
Gly Ala His Ile Asp Leu Asp Leu Gly Ala Leu Ser Arg Gln Tyr Ser
500 505 510
Leu Cys Gly Ala Pro Asp Ala Pro Ser Tyr Glu Ile Ala Val His Leu
515 520 525
Asp Pro Glu Ser Arg Gly Gly Ser Arg Tyr Ile His Glu Gln Leu Glu
530 535 540
Val Gly Ser Pro Leu Arg Met Arg Gly Pro Arg Asn His Phe Ala Leu
545 550 555 560
Asp Pro Gly Ala Glu His Tyr Val Phe Val Ala Gly Gly Ile Gly Ile
565 570 575
Thr Pro Val Leu Ala Met Ala Asp His Ala Arg Ala Arg Gly Trp Ser
580 585 590
Tyr Glu Leu His Tyr Cys Gly Arg Asn Arg Ser Gly Met Ala Tyr Leu
595 600 605
Glu Arg Val Ala Gly His Gly Asp Arg Ala Ala Leu His Val Ser Glu
610 615 620
Glu Gly Thr Arg Ile Asp Leu Ala Ala Leu Leu Ala Glu Pro Ala Pro
625 630 635 640
Gly Val Gln Ile Tyr Ala Cys Gly Pro Gly Arg Leu Leu Ala Gly Leu
645 650 655
Glu Asp Ala Ser Arg Asn Trp Pro Asp Gly Ala Leu His Val Glu His
660 665 670
Phe Thr Ser Ser Leu Ala Ala Leu Asp Pro Asp Val Glu His Ala Phe
675 680 685
Asp Leu Glu Leu Arg Asp Ser Gly Leu Thr Val Arg Val Glu Pro Thr
690 695 700
Gln Thr Val Leu Asp Ala Leu Arg Ala Asn Asn Ile Asp Val Pro Ser
705 710 715 720
Asp Cys Glu Glu Gly Leu Cys Gly Ser Cys Glu Val Ala Val Leu Asp
725 730 735
Gly Glu Val Asp His Arg Asp Thr Val Leu Thr Lys Ala Glu Arg Ala
740 745 750
Ala Asn Arg Gln Met Met Thr Cys Cys Ser Arg Ala Cys Gly Asp Arg
755 760 765
Leu Ala Leu Arg Leu
770
<210> 6
<211> 2319
<212> DNA
<213> Rhodococcus
<400> 6
atgagtgcat cagttccggc gtcggcgccg gcgtgtcccg tcgaccacgc ggccctggcg 60
ggcggctgcc cggtgtcggc gaacgccgcg gcgttcgatc cgttcggttc cgcgtaccag 120
accgatccgg ccgagtcgct gcgctggtcc cgcgacgagg agccggtgtt ctacagcccc 180
gaactcggct actgggtcgt cacccggtac gaggatgtga aggcggtgtt ccgcgacaac 240
atcctgttct cgccggcgat cgcgctggag aagatcactc ccgtctcggc ggaggccacc 300
gccaccctcg cccggtacga ctacgccatg gcccggaccc tcgtgaacga ggacgagccc 360
gcccacatgc cgcgccgccg cgcgctcatg gatccgttca ccccgaagga actggcgcac 420
cacgaggcga tggtgcgacg gctcacgcgc gaatacgtcg accgcttcgt cgaatccggc 480
aaggccgacc tggtggacga gatgctgtgg gaggttccgc tcaccgtcgc cctgcacttc 540
ctcggcgtgc cggaggagga catggcgacg atgcgcaagt actcgatcgc gcacaccgtg 600
aacacctggg gccgccccgc gcccgaggag caggtggccg tcgccgaggc ggtcggcagg 660
ttctggcagt acgcgggcac ggtgctcgag aagatgcggc aggacccgtc gggacacggc 720
tggatgccct acgggatccg caagcagcgg gagatgccgg acgtcgtcac cgactcctac 780
ctgcactcga tgatgatggc cggcatcgtc gccgcgcacg agaccacggc caacgcgtcc 840
gcgaacgcgt tcaagctgct gctcgagaac cgcgcggtgt gggaggagat ctgcgcggat 900
ccgtcgctga tccccaacgc cgtcgaggag tgcctgcgcc actccgggtc cgtggcggcg 960
tggcgacggg tggccaccgc cgacacccgc atcggcgacg tcgacatccc cgccggcgcc 1020
aagctgctcg tcgtcaacgc gtccgccaac cacgacgagc gccacttcga gcgccccgac 1080
gagttcgaca tccggcgccc gaactcgagc gaccatctca ccttcgggta cggcagccac 1140
cagtgcatgg gcaagaacct ggcccgcatg gagatgcaga tcttcctcga ggaactcacc 1200
acgcggcttc cccacatgga actcgtaccc gatcaggagt tcacctacct gccgaatacg 1260
tccttccgcg gacccgacca cgtgtgggtg cagtgggatc cgcaggcgaa tcccgagcgc 1320
accgatcctg ctgtgctgca ccggcatcaa ccggtcacca tcggagaacc cgccgcccgg 1380
gcggtgtccc gcaccgtcac cgtcgagcgc ctggaccgga tcgccgacga cgtgctgcgc 1440
ctcgtcctgc gcgacgccgg cggaaagaca ttacccacgt ggactcccgg cgcccatatc 1500
gacctcgacc tcggcgcgct gtcgcgccag tactccctgt gcggcgcgcc cgatgcgccg 1560
agctacgaga ttgccgtgca cctggatccc gagagccgcg gcggttcgcg ctacatccac 1620
gaacagctcg aggtgggaag cccgctccgg atgcgcggcc ctcggaacca tttcgcgctc 1680
gaccccggcg ccgagcacta cgtgttcgtc gccggcggca tcggcatcac cccagtcctg 1740
gccatggccg accacgcccg cgcccggggg tggagctacg aactgcacta ctgcggccga 1800
aaccgttccg gcatggccta tctcgagcgt gtcgccgggc acggtgaccg ggccgccctg 1860
cacgtgtccg aggaaggcac ccggatcgac ctcgccgccc tcctcgccga gcccgccccc 1920
ggcgtccaga tctacgcgtg cgggcccggg cggctgctcg ccggactcga ggacgcgagc 1980
cggaactggc ccgacggggc gctgcacgtc gagcacttca cctcgtccct cgcggcgctc 2040
gatccggacg tcgagcacgc cttcgacctc gaactgcgtg actcggggct gaccgtgcgg 2100
gtcgaaccca cccagaccgt cctcgacgcg ttgcgcgcca acaacatcga cgtgcccagc 2160
gactgcgagg aaggcctctg cggctcgtgc gaggtcgccg tcctcgacgg cgaggtcgac 2220
catcgcgaca cggtgctgac caaggccgag cgggcggcga accggcagat gatgacctgc 2280
tgctcgcgtg cctgtggcga ccggctggcc ctgcgactc 2319
<210> 7
<211> 993
<212> PRT
<213> Artificial Sequence
<220>
<223> K1-BM3R氨基酸序列
<400> 7
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala Pro Ser Pro Ser Thr Glu Gln Ser Ala Lys Lys Val Arg
405 410 415
Lys Lys Ala Glu Asn Ala His Asn Thr Pro Leu Leu Val Leu Tyr Gly
420 425 430
Ser Asn Met Gly Thr Ala Glu Gly Thr Ala Arg Asp Leu Ala Asp Ile
435 440 445
Ala Met Ser Lys Gly Phe Ala Pro Gln Val Ala Thr Leu Asp Ser His
450 455 460
Ala Gly Asn Leu Pro Arg Glu Gly Ala Val Leu Ile Val Thr Ala Ser
465 470 475 480
Tyr Asn Gly His Pro Pro Asp Asn Ala Lys Gln Phe Val Asp Trp Leu
485 490 495
Asp Gln Ala Ser Ala Asp Glu Val Lys Gly Val Arg Tyr Ser Val Phe
500 505 510
Gly Cys Gly Asp Lys Asn Trp Ala Thr Thr Tyr Gln Lys Val Pro Ala
515 520 525
Phe Ile Asp Glu Thr Leu Ala Ala Lys Gly Ala Glu Asn Ile Ala Asp
530 535 540
Arg Gly Glu Ala Asp Ala Ser Asp Asp Phe Glu Gly Thr Tyr Glu Glu
545 550 555 560
Trp Arg Glu His Met Trp Ser Asp Val Ala Ala Tyr Phe Asn Leu Asp
565 570 575
Ile Glu Asn Ser Glu Asp Asn Lys Ser Thr Leu Ser Leu Gln Phe Val
580 585 590
Asp Ser Ala Ala Asp Met Pro Leu Ala Lys Met His Gly Ala Phe Ser
595 600 605
Thr Asn Val Val Ala Ser Lys Glu Leu Gln Gln Pro Gly Ser Ala Arg
610 615 620
Ser Thr Arg His Leu Glu Ile Glu Leu Pro Lys Glu Ala Ser Tyr Gln
625 630 635 640
Glu Gly Asp His Leu Gly Val Ile Pro Arg Asn Tyr Glu Gly Ile Val
645 650 655
Asn Arg Val Thr Ala Arg Phe Gly Leu Asp Ala Ser Gln Gln Ile Arg
660 665 670
Leu Glu Ala Glu Glu Glu Lys Leu Ala His Leu Pro Leu Ala Lys Thr
675 680 685
Val Ser Val Glu Glu Leu Leu Gln Tyr Val Glu Leu Gln Asp Pro Val
690 695 700
Thr Arg Thr Gln Leu Arg Ala Met Ala Ala Lys Thr Val Cys Pro Pro
705 710 715 720
His Lys Val Glu Leu Glu Ala Leu Leu Glu Lys Gln Ala Tyr Lys Glu
725 730 735
Gln Val Leu Ala Lys Arg Leu Thr Met Leu Glu Leu Leu Glu Lys Tyr
740 745 750
Pro Ala Cys Glu Met Lys Phe Ser Glu Phe Ile Ala Leu Leu Pro Ser
755 760 765
Ile Arg Pro Arg Tyr Tyr Ser Ile Ser Ser Ser Pro Arg Val Asp Glu
770 775 780
Lys Gln Ala Ser Ile Thr Val Ser Val Val Ser Gly Glu Ala Trp Ser
785 790 795 800
Gly Tyr Gly Glu Tyr Lys Gly Ile Ala Ser Asn Tyr Leu Ala Glu Leu
805 810 815
Gln Glu Gly Asp Thr Ile Thr Cys Phe Ile Ser Thr Pro Gln Ser Glu
820 825 830
Phe Thr Leu Pro Lys Asp Pro Glu Thr Pro Leu Ile Met Val Gly Pro
835 840 845
Gly Thr Gly Val Ala Pro Phe Arg Gly Phe Val Gln Ala Arg Lys Gln
850 855 860
Leu Lys Glu Gln Gly Gln Ser Leu Gly Glu Ala His Leu Tyr Phe Gly
865 870 875 880
Cys Arg Ser Pro His Glu Asp Tyr Leu Tyr Gln Glu Glu Leu Glu Asn
885 890 895
Ala Gln Ser Glu Gly Ile Ile Thr Leu His Thr Ala Phe Ser Arg Met
900 905 910
Pro Asn Gln Pro Lys Thr Tyr Val Gln His Val Met Glu Gln Asp Gly
915 920 925
Lys Lys Leu Ile Glu Leu Leu Asp Gln Gly Ala His Phe Tyr Ile Cys
930 935 940
Gly Asp Gly Ser Gln Met Ala Pro Ala Val Glu Ala Thr Leu Met Lys
945 950 955 960
Ser Tyr Ala Asp Val His Gln Val Ser Glu Ala Asp Ala Arg Leu Trp
965 970 975
Leu Gln Gln Leu Glu Glu Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala
980 985 990
Gly
<210> 8
<211> 2979
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-BM3R核苷酸序列
<400> 8
atggctctga ccaccaccgg taccgaacag cacgacctgt tctctggtac cttctggcag 60
aacccgcacc cggcttacgc tgctctgcgt gctgaagacc cggttcgtaa actggctctg 120
ccggacggtc cggtttggct gctgacccgt tacgctgacg ttcgtgaagc tttcgttgac 180
ccgcgtctgt ctaaagactg gcgtcaccgt ctgccggaag accagcgtgc tgacatgccg 240
gctaccccga ccccgatgat gatcctgatg gacccgccgg accacacccg tctgcgtaaa 300
ctggttggtc gttctttcac cgttcgtcgt atgaacgaac tggaaccgcg tatcaccgaa 360
atcgctgacg gtctgctggc tggtctgccg accgacggtc cggttgacct gatgcgtgaa 420
tacgctttcc agatcccggt tcaggttatc tgcgaactgc tgggtctgcc ggctgaagac 480
cgtgacgact tctctgcttg gtcttctgtt ctggttgacg actctccggc tgacgacaaa 540
aacgctgcta tgggtaaact gcacggttac ctgtctgacc tgctggaacg taaacgtacc 600
gaaccggacg acgctctgct gtcttctctg ctggctgttt ctgacatgga cggtgaccgt 660
ctgtctcagg aagaactggt tgctatggct atgctgctgc tgatcgctgg tcacgaaacc 720
accgttaacc tgatcggtaa cggtgttctg gctctgctga cccacccgga ccagcgtaaa 780
ctgctggctg aagacccgtc tctgatctct tctgctgttg aagaattcct gcgtttcgac 840
tctccggttt ctcaggctcc gatccgtttc accgctgaag acgttaccta ctctggtgtt 900
accatcccgg ctggtgaaat ggttatgctg ggtctggctg ctgctaaccg tgacgctgac 960
tggatgccgg aaccggaccg tctggacatc acccgtgacg cttctggtgg tgttttcttc 1020
ggtcacggta tccacttctg cctgggtgct cagctggctc gtctggaagg tcgtgttgct 1080
atcggtcgtc tgttcgctga ccgtccggaa ctggctctgg ctgttggtct ggacgaactg 1140
gtttaccgtc gttctaccct ggttcgtggt ctgtctcgta tgccggttac catgggtccg 1200
cgttctgctc cgtctccgtc taccgaacag tctgctaaaa aagttcgtaa aaaagctgaa 1260
aacgctcaca acaccccgct gctggttctg tacggttcta acatgggtac cgctgaaggt 1320
accgctcgtg acctggctga catcgctatg tctaaaggtt tcgctccgca ggttgctacc 1380
ctggactctc acgctggtaa cctgccgcgt gaaggtgctg ttctgatcgt taccgcttct 1440
tacaacggtc acccgccgga caacgctaaa cagttcgttg actggctgga ccaggcttct 1500
gctgacgaag ttaaaggtgt tcgttactct gttttcggtt gcggtgacaa aaactgggct 1560
accacctacc agaaagttcc ggctttcatc gacgaaaccc tggctgctaa aggtgctgaa 1620
aacatcgctg accgtggtga agctgacgct tctgacgact tcgaaggtac ctacgaagaa 1680
tggcgtgaac acatgtggtc tgacgttgct gcttacttca acctggacat cgaaaactct 1740
gaagacaaca aatctaccct gtctctgcag ttcgttgact ctgctgctga catgccgctg 1800
gctaaaatgc acggtgcttt ctctaccaac gttgttgctt ctaaagaact gcagcagccg 1860
ggttctgctc gttctacccg tcacctggaa atcgaactgc cgaaagaagc ttcttaccag 1920
gaaggtgacc acctgggtgt tatcccgcgt aactacgaag gtatcgttaa ccgtgttacc 1980
gctcgtttcg gtctggacgc ttctcagcag atccgtctgg aagctgaaga agaaaaactg 2040
gctcacctgc cgctggctaa aaccgtttct gttgaagaac tgctgcagta cgttgaactg 2100
caggacccgg ttacccgtac ccagctgcgt gctatggctg ctaaaaccgt ttgcccgccg 2160
cacaaagttg aactggaagc tctgctggaa aaacaggctt acaaagaaca ggttctggct 2220
aaacgtctga ccatgctgga actgctggaa aaatacccgg cttgcgaaat gaaattctct 2280
gaattcatcg ctctgctgcc gtctatccgt ccgcgttact actctatctc ttcttctccg 2340
cgtgttgacg aaaaacaggc ttctatcacc gtttctgttg tttctggtga agcttggtct 2400
ggttacggtg aatacaaagg tatcgcttct aactacctgg ctgaactgca ggaaggtgac 2460
accatcacct gcttcatctc taccccgcag tctgaattca ccctgccgaa agacccggaa 2520
accccgctga tcatggttgg tccgggtacc ggtgttgctc cgttccgtgg tttcgttcag 2580
gctcgtaaac agctgaaaga acagggtcag tctctgggtg aagctcacct gtacttcggt 2640
tgccgttctc cgcacgaaga ctacctgtac caggaagaac tggaaaacgc tcagtctgaa 2700
ggtatcatca ccctgcacac cgctttctct cgtatgccga accagccgaa aacctacgtt 2760
cagcacgtta tggaacagga cggtaaaaaa ctgatcgaac tgctggacca gggtgctcac 2820
ttctacatct gcggtgacgg ttctcagatg gctccggctg ttgaagctac cctgatgaaa 2880
tcttacgctg acgttcacca ggtttctgaa gctgacgctc gtctgtggct gcagcagctg 2940
gaagaaaaag gtcgttacgc taaagacgtt tgggctggt 2979
<210> 9
<211> 732
<212> PRT
<213> Artificial Sequence
<220>
<223> K1-RhFR氨基酸序列
<400> 9
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala Val Leu His Arg His Gln Pro Val Thr Ile Gly Glu Pro
405 410 415
Ala Ala Arg Ala Val Ser Arg Thr Val Thr Val Glu Arg Leu Asp Arg
420 425 430
Ile Ala Asp Asp Val Leu Arg Leu Val Leu Arg Asp Ala Gly Gly Lys
435 440 445
Thr Leu Pro Thr Trp Thr Pro Gly Ala His Ile Asp Leu Asp Leu Gly
450 455 460
Ala Leu Ser Arg Gln Tyr Ser Leu Cys Gly Ala Pro Asp Ala Pro Ser
465 470 475 480
Tyr Glu Ile Ala Val His Leu Asp Pro Glu Ser Arg Gly Gly Ser Arg
485 490 495
Tyr Ile His Glu Gln Leu Glu Val Gly Ser Pro Leu Arg Met Arg Gly
500 505 510
Pro Arg Asn His Phe Ala Leu Asp Pro Gly Ala Glu His Tyr Val Phe
515 520 525
Val Ala Gly Gly Ile Gly Ile Thr Pro Val Leu Ala Met Ala Asp His
530 535 540
Ala Arg Ala Arg Gly Trp Ser Tyr Glu Leu His Tyr Cys Gly Arg Asn
545 550 555 560
Arg Ser Gly Met Ala Tyr Leu Glu Arg Val Ala Gly His Gly Asp Arg
565 570 575
Ala Ala Leu His Val Ser Glu Glu Gly Thr Arg Ile Asp Leu Ala Ala
580 585 590
Leu Leu Ala Glu Pro Ala Pro Gly Val Gln Ile Tyr Ala Cys Gly Pro
595 600 605
Gly Arg Leu Leu Ala Gly Leu Glu Asp Ala Ser Arg Asn Trp Pro Asp
610 615 620
Gly Ala Leu His Val Glu His Phe Thr Ser Ser Leu Ala Ala Leu Asp
625 630 635 640
Pro Asp Val Glu His Ala Phe Asp Leu Glu Leu Arg Asp Ser Gly Leu
645 650 655
Thr Val Arg Val Glu Pro Thr Gln Thr Val Leu Asp Ala Leu Arg Ala
660 665 670
Asn Asn Ile Asp Val Pro Ser Asp Cys Glu Glu Gly Leu Cys Gly Ser
675 680 685
Cys Glu Val Ala Val Leu Asp Gly Glu Val Asp His Arg Asp Thr Val
690 695 700
Leu Thr Lys Ala Glu Arg Ala Ala Asn Arg Gln Met Met Thr Cys Cys
705 710 715 720
Ser Arg Ala Cys Gly Asp Arg Leu Ala Leu Arg Leu
725 730
<210> 10
<211> 2196
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-RhFR核苷酸序列
<400> 10
atggctctga ccaccaccgg taccgaacag cacgacctgt tctctggtac cttctggcag 60
aacccgcacc cggcttacgc tgctctgcgt gctgaagacc cggttcgtaa actggctctg 120
ccggacggtc cggtttggct gctgacccgt tacgctgacg ttcgtgaagc tttcgttgac 180
ccgcgtctgt ctaaagactg gcgtcaccgt ctgccggaag accagcgtgc tgacatgccg 240
gctaccccga ccccgatgat gatcctgatg gacccgccgg accacacccg tctgcgtaaa 300
ctggttggtc gttctttcac cgttcgtcgt atgaacgaac tggaaccgcg tatcaccgaa 360
atcgctgacg gtctgctggc tggtctgccg accgacggtc cggttgacct gatgcgtgaa 420
tacgctttcc agatcccggt tcaggttatc tgcgaactgc tgggtctgcc ggctgaagac 480
cgtgacgact tctctgcttg gtcttctgtt ctggttgacg actctccggc tgacgacaaa 540
aacgctgcta tgggtaaact gcacggttac ctgtctgacc tgctggaacg taaacgtacc 600
gaaccggacg acgctctgct gtcttctctg ctggctgttt ctgacatgga cggtgaccgt 660
ctgtctcagg aagaactggt tgctatggct atgctgctgc tgatcgctgg tcacgaaacc 720
accgttaacc tgatcggtaa cggtgttctg gctctgctga cccacccgga ccagcgtaaa 780
ctgctggctg aagacccgtc tctgatctct tctgctgttg aagaattcct gcgtttcgac 840
tctccggttt ctcaggctcc gatccgtttc accgctgaag acgttaccta ctctggtgtt 900
accatcccgg ctggtgaaat ggttatgctg ggtctggctg ctgctaaccg tgacgctgac 960
tggatgccgg aaccggaccg tctggacatc acccgtgacg cttctggtgg tgttttcttc 1020
ggtcacggta tccacttctg cctgggtgct cagctggctc gtctggaagg tcgtgttgct 1080
atcggtcgtc tgttcgctga ccgtccggaa ctggctctgg ctgttggtct ggacgaactg 1140
gtttaccgtc gttctaccct ggttcgtggt ctgtctcgta tgccggttac catgggtccg 1200
cgttctgctg ttctgcaccg tcaccagccg gttaccatcg gtgaaccggc tgctcgtgct 1260
gtttctcgta ccgttaccgt tgaacgtctg gaccgtatcg ctgacgacgt tctgcgtctg 1320
gttctgcgtg acgctggtgg taaaaccctg ccgacctgga ccccgggtgc tcacatcgac 1380
ctggacctgg gtgctctgtc tcgtcagtac tctctgtgcg gtgctccgga cgctccgtct 1440
tacgaaatcg ctgttcacct ggacccggaa tctcgtggtg gttctcgtta catccacgaa 1500
cagctggaag ttggttctcc gctgcgtatg cgtggtccgc gtaaccactt cgctctggac 1560
ccgggtgctg aacactacgt tttcgttgct ggtggtatcg gtatcacccc ggttctggct 1620
atggctgacc acgctcgtgc tcgtggttgg tcttacgaac tgcactactg cggtcgtaac 1680
cgttctggta tggcttacct ggaacgtgtt gctggtcacg gtgaccgtgc tgctctgcac 1740
gtttctgaag aaggtacccg tatcgacctg gctgctctgc tggctgaacc ggctccgggt 1800
gttcagatct acgcttgcgg tccgggtcgt ctgctggctg gtctggaaga cgcttctcgt 1860
aactggccgg acggtgctct gcacgttgaa cacttcacct cttctctggc tgctctggac 1920
ccggacgttg aacacgcttt cgacctggaa ctgcgtgact ctggtctgac cgttcgtgtt 1980
gaaccgaccc agaccgttct ggacgctctg cgtgctaaca acatcgacgt tccgtctgac 2040
tgcgaagaag gtctgtgcgg ttcttgcgaa gttgctgttc tggacggtga agttgaccac 2100
cgtgacaccg ttctgaccaa agctgaacgt gctgctaacc gtcagatgat gacctgctgc 2160
tctcgtgctt gcggtgaccg tctggctctg cgtctg 2196
<210> 11
<211> 735
<212> PRT
<213> Artificial Sequence
<220>
<223> K1-RhFR-I3氨基酸序列
<400> 11
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala Gly Gly Ser Val Leu His Arg His Gln Pro Val Thr Ile
405 410 415
Gly Glu Pro Ala Ala Arg Ala Val Ser Arg Thr Val Thr Val Glu Arg
420 425 430
Leu Asp Arg Ile Ala Asp Asp Val Leu Arg Leu Val Leu Arg Asp Ala
435 440 445
Gly Gly Lys Thr Leu Pro Thr Trp Thr Pro Gly Ala His Ile Asp Leu
450 455 460
Asp Leu Gly Ala Leu Ser Arg Gln Tyr Ser Leu Cys Gly Ala Pro Asp
465 470 475 480
Ala Pro Ser Tyr Glu Ile Ala Val His Leu Asp Pro Glu Ser Arg Gly
485 490 495
Gly Ser Arg Tyr Ile His Glu Gln Leu Glu Val Gly Ser Pro Leu Arg
500 505 510
Met Arg Gly Pro Arg Asn His Phe Ala Leu Asp Pro Gly Ala Glu His
515 520 525
Tyr Val Phe Val Ala Gly Gly Ile Gly Ile Thr Pro Val Leu Ala Met
530 535 540
Ala Asp His Ala Arg Ala Arg Gly Trp Ser Tyr Glu Leu His Tyr Cys
545 550 555 560
Gly Arg Asn Arg Ser Gly Met Ala Tyr Leu Glu Arg Val Ala Gly His
565 570 575
Gly Asp Arg Ala Ala Leu His Val Ser Glu Glu Gly Thr Arg Ile Asp
580 585 590
Leu Ala Ala Leu Leu Ala Glu Pro Ala Pro Gly Val Gln Ile Tyr Ala
595 600 605
Cys Gly Pro Gly Arg Leu Leu Ala Gly Leu Glu Asp Ala Ser Arg Asn
610 615 620
Trp Pro Asp Gly Ala Leu His Val Glu His Phe Thr Ser Ser Leu Ala
625 630 635 640
Ala Leu Asp Pro Asp Val Glu His Ala Phe Asp Leu Glu Leu Arg Asp
645 650 655
Ser Gly Leu Thr Val Arg Val Glu Pro Thr Gln Thr Val Leu Asp Ala
660 665 670
Leu Arg Ala Asn Asn Ile Asp Val Pro Ser Asp Cys Glu Glu Gly Leu
675 680 685
Cys Gly Ser Cys Glu Val Ala Val Leu Asp Gly Glu Val Asp His Arg
690 695 700
Asp Thr Val Leu Thr Lys Ala Glu Arg Ala Ala Asn Arg Gln Met Met
705 710 715 720
Thr Cys Cys Ser Arg Ala Cys Gly Asp Arg Leu Ala Leu Arg Leu
725 730 735
<210> 12
<211> 2205
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-RhFR-I3核苷酸序列
<400> 12
atggcactga ccaccaccgg taccgaacag catgacctgt ttagcggtac cttttggcag 60
aatccgcatc cggcgtatgc agcactgcgt gcagaagatc cggttcgtaa actggcactg 120
ccggatggtc cggtgtggct gctgacccgt tatgcagatg ttcgtgaagc atttgttgat 180
ccgcgtctga gtaaagattg gcgtcatcgt ctgccggaag atcagcgtgc cgatatgccg 240
gcaaccccga ccccgatgat gattctgatg gacccgccgg atcatacacg tttacgtaaa 300
ctggttggtc gtagttttac cgttcgtcgt atgaatgaac tggaaccgcg tattaccgaa 360
attgcagatg gtctgctggc aggtctgccg accgatggtc cggttgatct gatgcgtgaa 420
tatgcatttc agattccggt tcaggttata tgtgaactgc tgggtctgcc ggcagaagat 480
cgtgatgatt tttcagcatg gtcaagtgtg ctggttgatg attctccggc agatgataaa 540
aatgccgcaa tgggtaaact gcatggttat ctgtcagatc tgctggaacg taaacgtacc 600
gaaccggatg atgcactgct gagtagcctg ctggcggttt ctgatatgga tggtgatcgt 660
ctgtctcagg aagaactggt tgcaatggca atgctgctgc tgattgcagg tcatgaaacc 720
accgttaatc tgattggtaa tggtgtgctg gcactgctga cccatccgga tcagcgtaaa 780
ctgttagctg aagatccgag tctgattagc tcagcagttg aagaatttct gcgttttgat 840
tctccggtta gccaggcacc gatccgtttt accgctgaag atgttaccta tagtggtgtt 900
accattccgg caggtgaaat ggttatgctg ggtctggcag cagcaaatcg cgatgcagat 960
tggatgccgg aaccggatcg tctggatatt acccgtgatg caagtggtgg tgttttcttt 1020
ggtcatggta ttcatttttg tctgggtgcg cagctggcac gtctggaagg tcgtgtggca 1080
attggtcgtc tgtttgcaga tcgtccggaa ctggcactgg cagttggtct ggatgaactg 1140
gtgtatcgtc gtagcaccct ggttcgtggt ctgagtagga tgccggtgac aatgggtccg 1200
cgttcagcag gcggaagtgt gctgcaccgg catcaaccgg tcaccatcgg agaacccgcc 1260
gcccgggcgg tgtcccgcac cgtcaccgtc gagcgcctgg accggatcgc cgacgacgtg 1320
ctgcgcctcg tcctgcgcga cgccggcgga aagacattac ccacgtggac tcccggcgcc 1380
catatcgacc tcgacctcgg cgcgctgtcg cgccagtact ccctgtgcgg cgcgcccgat 1440
gcgccgagct acgagattgc cgtgcacctg gatcccgaga gccgcggcgg ttcgcgctac 1500
atccacgaac agctcgaggt gggaagcccg ctccggatgc gcggccctcg gaaccatttc 1560
gcgctcgacc ccggcgccga gcactacgtg ttcgtcgccg gcggcatcgg catcacccca 1620
gtcctggcca tggccgacca cgcccgcgcc cgggggtgga gctacgaact gcactactgc 1680
ggccgaaacc gttccggcat ggcctatctc gagcgtgtcg ccgggcacgg tgaccgggcc 1740
gccctgcacg tgtccgagga aggcacccgg atcgacctcg ccgccctcct cgccgagccc 1800
gcccccggcg tccagatcta cgcgtgcggg cccgggcggc tgctcgccgg actcgaggac 1860
gcgagccgga actggcccga cggggcgctg cacgtcgagc acttcacctc gtccctcgcg 1920
gcgctcgatc cggacgtcga gcacgccttc gacctcgaac tgcgtgactc ggggctgacc 1980
gtgcgggtcg aacccaccca gaccgtcctc gacgcgttgc gcgccaacaa catcgacgtg 2040
cccagcgact gcgaggaagg cctctgcggc tcgtgcgagg tcgccgtcct cgacggcgag 2100
gtcgaccatc gcgacacggt gctgaccaag gccgagcggg cggcgaaccg gcagatgatg 2160
acctgctgct cgcgtgcctg tggcgaccgg ctggccctgc gactc 2205
<210> 13
<211> 738
<212> PRT
<213> Artificial Sequence
<220>
<223> K1-RhFR-I6氨基酸序列
<400> 13
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala Gly Gly Ser Gly Gly Ser Val Leu His Arg His Gln Pro
405 410 415
Val Thr Ile Gly Glu Pro Ala Ala Arg Ala Val Ser Arg Thr Val Thr
420 425 430
Val Glu Arg Leu Asp Arg Ile Ala Asp Asp Val Leu Arg Leu Val Leu
435 440 445
Arg Asp Ala Gly Gly Lys Thr Leu Pro Thr Trp Thr Pro Gly Ala His
450 455 460
Ile Asp Leu Asp Leu Gly Ala Leu Ser Arg Gln Tyr Ser Leu Cys Gly
465 470 475 480
Ala Pro Asp Ala Pro Ser Tyr Glu Ile Ala Val His Leu Asp Pro Glu
485 490 495
Ser Arg Gly Gly Ser Arg Tyr Ile His Glu Gln Leu Glu Val Gly Ser
500 505 510
Pro Leu Arg Met Arg Gly Pro Arg Asn His Phe Ala Leu Asp Pro Gly
515 520 525
Ala Glu His Tyr Val Phe Val Ala Gly Gly Ile Gly Ile Thr Pro Val
530 535 540
Leu Ala Met Ala Asp His Ala Arg Ala Arg Gly Trp Ser Tyr Glu Leu
545 550 555 560
His Tyr Cys Gly Arg Asn Arg Ser Gly Met Ala Tyr Leu Glu Arg Val
565 570 575
Ala Gly His Gly Asp Arg Ala Ala Leu His Val Ser Glu Glu Gly Thr
580 585 590
Arg Ile Asp Leu Ala Ala Leu Leu Ala Glu Pro Ala Pro Gly Val Gln
595 600 605
Ile Tyr Ala Cys Gly Pro Gly Arg Leu Leu Ala Gly Leu Glu Asp Ala
610 615 620
Ser Arg Asn Trp Pro Asp Gly Ala Leu His Val Glu His Phe Thr Ser
625 630 635 640
Ser Leu Ala Ala Leu Asp Pro Asp Val Glu His Ala Phe Asp Leu Glu
645 650 655
Leu Arg Asp Ser Gly Leu Thr Val Arg Val Glu Pro Thr Gln Thr Val
660 665 670
Leu Asp Ala Leu Arg Ala Asn Asn Ile Asp Val Pro Ser Asp Cys Glu
675 680 685
Glu Gly Leu Cys Gly Ser Cys Glu Val Ala Val Leu Asp Gly Glu Val
690 695 700
Asp His Arg Asp Thr Val Leu Thr Lys Ala Glu Arg Ala Ala Asn Arg
705 710 715 720
Gln Met Met Thr Cys Cys Ser Arg Ala Cys Gly Asp Arg Leu Ala Leu
725 730 735
Arg Leu
<210> 14
<211> 2214
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-RhFR-I6
<400> 14
atggcactga ccaccaccgg taccgaacag catgacctgt ttagcggtac cttttggcag 60
aatccgcatc cggcgtatgc agcactgcgt gcagaagatc cggttcgtaa actggcactg 120
ccggatggtc cggtgtggct gctgacccgt tatgcagatg ttcgtgaagc atttgttgat 180
ccgcgtctga gtaaagattg gcgtcatcgt ctgccggaag atcagcgtgc cgatatgccg 240
gcaaccccga ccccgatgat gattctgatg gacccgccgg atcatacacg tttacgtaaa 300
ctggttggtc gtagttttac cgttcgtcgt atgaatgaac tggaaccgcg tattaccgaa 360
attgcagatg gtctgctggc aggtctgccg accgatggtc cggttgatct gatgcgtgaa 420
tatgcatttc agattccggt tcaggttata tgtgaactgc tgggtctgcc ggcagaagat 480
cgtgatgatt tttcagcatg gtcaagtgtg ctggttgatg attctccggc agatgataaa 540
aatgccgcaa tgggtaaact gcatggttat ctgtcagatc tgctggaacg taaacgtacc 600
gaaccggatg atgcactgct gagtagcctg ctggcggttt ctgatatgga tggtgatcgt 660
ctgtctcagg aagaactggt tgcaatggca atgctgctgc tgattgcagg tcatgaaacc 720
accgttaatc tgattggtaa tggtgtgctg gcactgctga cccatccgga tcagcgtaaa 780
ctgttagctg aagatccgag tctgattagc tcagcagttg aagaatttct gcgttttgat 840
tctccggtta gccaggcacc gatccgtttt accgctgaag atgttaccta tagtggtgtt 900
accattccgg caggtgaaat ggttatgctg ggtctggcag cagcaaatcg cgatgcagat 960
tggatgccgg aaccggatcg tctggatatt acccgtgatg caagtggtgg tgttttcttt 1020
ggtcatggta ttcatttttg tctgggtgcg cagctggcac gtctggaagg tcgtgtggca 1080
attggtcgtc tgtttgcaga tcgtccggaa ctggcactgg cagttggtct ggatgaactg 1140
gtgtatcgtc gtagcaccct ggttcgtggt ctgagtagga tgccggtgac aatgggtccg 1200
cgttcagcag gcggaagtgg cggaagtgtg ctgcaccggc atcaaccggt caccatcgga 1260
gaacccgccg cccgggcggt gtcccgcacc gtcaccgtcg agcgcctgga ccggatcgcc 1320
gacgacgtgc tgcgcctcgt cctgcgcgac gccggcggaa agacattacc cacgtggact 1380
cccggcgccc atatcgacct cgacctcggc gcgctgtcgc gccagtactc cctgtgcggc 1440
gcgcccgatg cgccgagcta cgagattgcc gtgcacctgg atcccgagag ccgcggcggt 1500
tcgcgctaca tccacgaaca gctcgaggtg ggaagcccgc tccggatgcg cggccctcgg 1560
aaccatttcg cgctcgaccc cggcgccgag cactacgtgt tcgtcgccgg cggcatcggc 1620
atcaccccag tcctggccat ggccgaccac gcccgcgccc gggggtggag ctacgaactg 1680
cactactgcg gccgaaaccg ttccggcatg gcctatctcg agcgtgtcgc cgggcacggt 1740
gaccgggccg ccctgcacgt gtccgaggaa ggcacccgga tcgacctcgc cgccctcctc 1800
gccgagcccg cccccggcgt ccagatctac gcgtgcgggc ccgggcggct gctcgccgga 1860
ctcgaggacg cgagccggaa ctggcccgac ggggcgctgc acgtcgagca cttcacctcg 1920
tccctcgcgg cgctcgatcc ggacgtcgag cacgccttcg acctcgaact gcgtgactcg 1980
gggctgaccg tgcgggtcga acccacccag accgtcctcg acgcgttgcg cgccaacaac 2040
atcgacgtgc ccagcgactg cgaggaaggc ctctgcggct cgtgcgaggt cgccgtcctc 2100
gacggcgagg tcgaccatcg cgacacggtg ctgaccaagg ccgagcgggc ggcgaaccgg 2160
cagatgatga cctgctgctc gcgtgcctgt ggcgaccggc tggccctgcg actc 2214
<210> 15
<211> 729
<212> PRT
<213> Artificial Sequence
<220>
<223> K1-RhFR-D3氨基酸序列
<400> 15
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala Arg His Gln Pro Val Thr Ile Gly Glu Pro Ala Ala Arg
405 410 415
Ala Val Ser Arg Thr Val Thr Val Glu Arg Leu Asp Arg Ile Ala Asp
420 425 430
Asp Val Leu Arg Leu Val Leu Arg Asp Ala Gly Gly Lys Thr Leu Pro
435 440 445
Thr Trp Thr Pro Gly Ala His Ile Asp Leu Asp Leu Gly Ala Leu Ser
450 455 460
Arg Gln Tyr Ser Leu Cys Gly Ala Pro Asp Ala Pro Ser Tyr Glu Ile
465 470 475 480
Ala Val His Leu Asp Pro Glu Ser Arg Gly Gly Ser Arg Tyr Ile His
485 490 495
Glu Gln Leu Glu Val Gly Ser Pro Leu Arg Met Arg Gly Pro Arg Asn
500 505 510
His Phe Ala Leu Asp Pro Gly Ala Glu His Tyr Val Phe Val Ala Gly
515 520 525
Gly Ile Gly Ile Thr Pro Val Leu Ala Met Ala Asp His Ala Arg Ala
530 535 540
Arg Gly Trp Ser Tyr Glu Leu His Tyr Cys Gly Arg Asn Arg Ser Gly
545 550 555 560
Met Ala Tyr Leu Glu Arg Val Ala Gly His Gly Asp Arg Ala Ala Leu
565 570 575
His Val Ser Glu Glu Gly Thr Arg Ile Asp Leu Ala Ala Leu Leu Ala
580 585 590
Glu Pro Ala Pro Gly Val Gln Ile Tyr Ala Cys Gly Pro Gly Arg Leu
595 600 605
Leu Ala Gly Leu Glu Asp Ala Ser Arg Asn Trp Pro Asp Gly Ala Leu
610 615 620
His Val Glu His Phe Thr Ser Ser Leu Ala Ala Leu Asp Pro Asp Val
625 630 635 640
Glu His Ala Phe Asp Leu Glu Leu Arg Asp Ser Gly Leu Thr Val Arg
645 650 655
Val Glu Pro Thr Gln Thr Val Leu Asp Ala Leu Arg Ala Asn Asn Ile
660 665 670
Asp Val Pro Ser Asp Cys Glu Glu Gly Leu Cys Gly Ser Cys Glu Val
675 680 685
Ala Val Leu Asp Gly Glu Val Asp His Arg Asp Thr Val Leu Thr Lys
690 695 700
Ala Glu Arg Ala Ala Asn Arg Gln Met Met Thr Cys Cys Ser Arg Ala
705 710 715 720
Cys Gly Asp Arg Leu Ala Leu Arg Leu
725
<210> 16
<211> 2187
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-RhFR-D3核苷酸序列
<400> 16
atggcactga ccaccaccgg taccgaacag catgacctgt ttagcggtac cttttggcag 60
aatccgcatc cggcgtatgc agcactgcgt gcagaagatc cggttcgtaa actggcactg 120
ccggatggtc cggtgtggct gctgacccgt tatgcagatg ttcgtgaagc atttgttgat 180
ccgcgtctga gtaaagattg gcgtcatcgt ctgccggaag atcagcgtgc cgatatgccg 240
gcaaccccga ccccgatgat gattctgatg gacccgccgg atcatacacg tttacgtaaa 300
ctggttggtc gtagttttac cgttcgtcgt atgaatgaac tggaaccgcg tattaccgaa 360
attgcagatg gtctgctggc aggtctgccg accgatggtc cggttgatct gatgcgtgaa 420
tatgcatttc agattccggt tcaggttata tgtgaactgc tgggtctgcc ggcagaagat 480
cgtgatgatt tttcagcatg gtcaagtgtg ctggttgatg attctccggc agatgataaa 540
aatgccgcaa tgggtaaact gcatggttat ctgtcagatc tgctggaacg taaacgtacc 600
gaaccggatg atgcactgct gagtagcctg ctggcggttt ctgatatgga tggtgatcgt 660
ctgtctcagg aagaactggt tgcaatggca atgctgctgc tgattgcagg tcatgaaacc 720
accgttaatc tgattggtaa tggtgtgctg gcactgctga cccatccgga tcagcgtaaa 780
ctgttagctg aagatccgag tctgattagc tcagcagttg aagaatttct gcgttttgat 840
tctccggtta gccaggcacc gatccgtttt accgctgaag atgttaccta tagtggtgtt 900
accattccgg caggtgaaat ggttatgctg ggtctggcag cagcaaatcg cgatgcagat 960
tggatgccgg aaccggatcg tctggatatt acccgtgatg caagtggtgg tgttttcttt 1020
ggtcatggta ttcatttttg tctgggtgcg cagctggcac gtctggaagg tcgtgtggca 1080
attggtcgtc tgtttgcaga tcgtccggaa ctggcactgg cagttggtct ggatgaactg 1140
gtgtatcgtc gtagcaccct ggttcgtggt ctgagtagga tgccggtgac aatgggtccg 1200
cgttcagcac ggcatcaacc ggtcaccatc ggagaacccg ccgcccgggc ggtgtcccgc 1260
accgtcaccg tcgagcgcct ggaccggatc gccgacgacg tgctgcgcct cgtcctgcgc 1320
gacgccggcg gaaagacatt acccacgtgg actcccggcg cccatatcga cctcgacctc 1380
ggcgcgctgt cgcgccagta ctccctgtgc ggcgcgcccg atgcgccgag ctacgagatt 1440
gccgtgcacc tggatcccga gagccgcggc ggttcgcgct acatccacga acagctcgag 1500
gtgggaagcc cgctccggat gcgcggccct cggaaccatt tcgcgctcga ccccggcgcc 1560
gagcactacg tgttcgtcgc cggcggcatc ggcatcaccc cagtcctggc catggccgac 1620
cacgcccgcg cccgggggtg gagctacgaa ctgcactact gcggccgaaa ccgttccggc 1680
atggcctatc tcgagcgtgt cgccgggcac ggtgaccggg ccgccctgca cgtgtccgag 1740
gaaggcaccc ggatcgacct cgccgccctc ctcgccgagc ccgcccccgg cgtccagatc 1800
tacgcgtgcg ggcccgggcg gctgctcgcc ggactcgagg acgcgagccg gaactggccc 1860
gacggggcgc tgcacgtcga gcacttcacc tcgtccctcg cggcgctcga tccggacgtc 1920
gagcacgcct tcgacctcga actgcgtgac tcggggctga ccgtgcgggt cgaacccacc 1980
cagaccgtcc tcgacgcgtt gcgcgccaac aacatcgacg tgcccagcga ctgcgaggaa 2040
ggcctctgcg gctcgtgcga ggtcgccgtc ctcgacggcg aggtcgacca tcgcgacacg 2100
gtgctgacca aggccgagcg ggcggcgaac cggcagatga tgacctgctg ctcgcgtgcc 2160
tgtggcgacc ggctggccct gcgactc 2187
<210> 17
<211> 726
<212> PRT
<213> Artificial Sequence
<220>
<223> K1-RhFR-D6氨基酸序列
<400> 17
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala Pro Val Thr Ile Gly Glu Pro Ala Ala Arg Ala Val Ser
405 410 415
Arg Thr Val Thr Val Glu Arg Leu Asp Arg Ile Ala Asp Asp Val Leu
420 425 430
Arg Leu Val Leu Arg Asp Ala Gly Gly Lys Thr Leu Pro Thr Trp Thr
435 440 445
Pro Gly Ala His Ile Asp Leu Asp Leu Gly Ala Leu Ser Arg Gln Tyr
450 455 460
Ser Leu Cys Gly Ala Pro Asp Ala Pro Ser Tyr Glu Ile Ala Val His
465 470 475 480
Leu Asp Pro Glu Ser Arg Gly Gly Ser Arg Tyr Ile His Glu Gln Leu
485 490 495
Glu Val Gly Ser Pro Leu Arg Met Arg Gly Pro Arg Asn His Phe Ala
500 505 510
Leu Asp Pro Gly Ala Glu His Tyr Val Phe Val Ala Gly Gly Ile Gly
515 520 525
Ile Thr Pro Val Leu Ala Met Ala Asp His Ala Arg Ala Arg Gly Trp
530 535 540
Ser Tyr Glu Leu His Tyr Cys Gly Arg Asn Arg Ser Gly Met Ala Tyr
545 550 555 560
Leu Glu Arg Val Ala Gly His Gly Asp Arg Ala Ala Leu His Val Ser
565 570 575
Glu Glu Gly Thr Arg Ile Asp Leu Ala Ala Leu Leu Ala Glu Pro Ala
580 585 590
Pro Gly Val Gln Ile Tyr Ala Cys Gly Pro Gly Arg Leu Leu Ala Gly
595 600 605
Leu Glu Asp Ala Ser Arg Asn Trp Pro Asp Gly Ala Leu His Val Glu
610 615 620
His Phe Thr Ser Ser Leu Ala Ala Leu Asp Pro Asp Val Glu His Ala
625 630 635 640
Phe Asp Leu Glu Leu Arg Asp Ser Gly Leu Thr Val Arg Val Glu Pro
645 650 655
Thr Gln Thr Val Leu Asp Ala Leu Arg Ala Asn Asn Ile Asp Val Pro
660 665 670
Ser Asp Cys Glu Glu Gly Leu Cys Gly Ser Cys Glu Val Ala Val Leu
675 680 685
Asp Gly Glu Val Asp His Arg Asp Thr Val Leu Thr Lys Ala Glu Arg
690 695 700
Ala Ala Asn Arg Gln Met Met Thr Cys Cys Ser Arg Ala Cys Gly Asp
705 710 715 720
Arg Leu Ala Leu Arg Leu
725
<210> 18
<211> 2178
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-RhFR-D6核苷酸序列
<400> 18
atggcactga ccaccaccgg taccgaacag catgacctgt ttagcggtac cttttggcag 60
aatccgcatc cggcgtatgc agcactgcgt gcagaagatc cggttcgtaa actggcactg 120
ccggatggtc cggtgtggct gctgacccgt tatgcagatg ttcgtgaagc atttgttgat 180
ccgcgtctga gtaaagattg gcgtcatcgt ctgccggaag atcagcgtgc cgatatgccg 240
gcaaccccga ccccgatgat gattctgatg gacccgccgg atcatacacg tttacgtaaa 300
ctggttggtc gtagttttac cgttcgtcgt atgaatgaac tggaaccgcg tattaccgaa 360
attgcagatg gtctgctggc aggtctgccg accgatggtc cggttgatct gatgcgtgaa 420
tatgcatttc agattccggt tcaggttata tgtgaactgc tgggtctgcc ggcagaagat 480
cgtgatgatt tttcagcatg gtcaagtgtg ctggttgatg attctccggc agatgataaa 540
aatgccgcaa tgggtaaact gcatggttat ctgtcagatc tgctggaacg taaacgtacc 600
gaaccggatg atgcactgct gagtagcctg ctggcggttt ctgatatgga tggtgatcgt 660
ctgtctcagg aagaactggt tgcaatggca atgctgctgc tgattgcagg tcatgaaacc 720
accgttaatc tgattggtaa tggtgtgctg gcactgctga cccatccgga tcagcgtaaa 780
ctgttagctg aagatccgag tctgattagc tcagcagttg aagaatttct gcgttttgat 840
tctccggtta gccaggcacc gatccgtttt accgctgaag atgttaccta tagtggtgtt 900
accattccgg caggtgaaat ggttatgctg ggtctggcag cagcaaatcg cgatgcagat 960
tggatgccgg aaccggatcg tctggatatt acccgtgatg caagtggtgg tgttttcttt 1020
ggtcatggta ttcatttttg tctgggtgcg cagctggcac gtctggaagg tcgtgtggca 1080
attggtcgtc tgtttgcaga tcgtccggaa ctggcactgg cagttggtct ggatgaactg 1140
gtgtatcgtc gtagcaccct ggttcgtggt ctgagtagga tgccggtgac aatgggtccg 1200
cgttcagcac cggtcaccat cggagaaccc gccgcccggg cggtgtcccg caccgtcacc 1260
gtcgagcgcc tggaccggat cgccgacgac gtgctgcgcc tcgtcctgcg cgacgccggc 1320
ggaaagacat tacccacgtg gactcccggc gcccatatcg acctcgacct cggcgcgctg 1380
tcgcgccagt actccctgtg cggcgcgccc gatgcgccga gctacgagat tgccgtgcac 1440
ctggatcccg agagccgcgg cggttcgcgc tacatccacg aacagctcga ggtgggaagc 1500
ccgctccgga tgcgcggccc tcggaaccat ttcgcgctcg accccggcgc cgagcactac 1560
gtgttcgtcg ccggcggcat cggcatcacc ccagtcctgg ccatggccga ccacgcccgc 1620
gcccgggggt ggagctacga actgcactac tgcggccgaa accgttccgg catggcctat 1680
ctcgagcgtg tcgccgggca cggtgaccgg gccgccctgc acgtgtccga ggaaggcacc 1740
cggatcgacc tcgccgccct cctcgccgag cccgcccccg gcgtccagat ctacgcgtgc 1800
gggcccgggc ggctgctcgc cggactcgag gacgcgagcc ggaactggcc cgacggggcg 1860
ctgcacgtcg agcacttcac ctcgtccctc gcggcgctcg atccggacgt cgagcacgcc 1920
ttcgacctcg aactgcgtga ctcggggctg accgtgcggg tcgaacccac ccagaccgtc 1980
ctcgacgcgt tgcgcgccaa caacatcgac gtgcccagcg actgcgagga aggcctctgc 2040
ggctcgtgcg aggtcgccgt cctcgacggc gaggtcgacc atcgcgacac ggtgctgacc 2100
aaggccgagc gggcggcgaa ccggcagatg atgacctgct gctcgcgtgc ctgtggcgac 2160
cggctggccc tgcgactc 2178
<210> 19
<211> 746
<212> PRT
<213> Artificial Sequence
<220>
<223> K1-RhFR-I14氨基酸序列
<400> 19
Met Ala Leu Thr Thr Thr Gly Thr Glu Gln His Asp Leu Phe Ser Gly
1 5 10 15
Thr Phe Trp Gln Asn Pro His Pro Ala Tyr Ala Ala Leu Arg Ala Glu
20 25 30
Asp Pro Val Arg Lys Leu Ala Leu Pro Asp Gly Pro Val Trp Leu Leu
35 40 45
Thr Arg Tyr Ala Asp Val Arg Glu Ala Phe Val Asp Pro Arg Leu Ser
50 55 60
Lys Asp Trp Arg His Arg Leu Pro Glu Asp Gln Arg Ala Asp Met Pro
65 70 75 80
Ala Thr Pro Thr Pro Met Met Ile Leu Met Asp Pro Pro Asp His Thr
85 90 95
Arg Leu Arg Lys Leu Val Gly Arg Ser Phe Thr Val Arg Arg Met Asn
100 105 110
Glu Leu Glu Pro Arg Ile Thr Glu Ile Ala Asp Gly Leu Leu Ala Gly
115 120 125
Leu Pro Thr Asp Gly Pro Val Asp Leu Met Arg Glu Tyr Ala Phe Gln
130 135 140
Ile Pro Val Gln Val Ile Cys Glu Leu Leu Gly Leu Pro Ala Glu Asp
145 150 155 160
Arg Asp Asp Phe Ser Ala Trp Ser Ser Val Leu Val Asp Asp Ser Pro
165 170 175
Ala Asp Asp Lys Asn Ala Ala Met Gly Lys Leu His Gly Tyr Leu Ser
180 185 190
Asp Leu Leu Glu Arg Lys Arg Thr Glu Pro Asp Asp Ala Leu Leu Ser
195 200 205
Ser Leu Leu Ala Val Ser Asp Met Asp Gly Asp Arg Leu Ser Gln Glu
210 215 220
Glu Leu Val Ala Met Ala Met Leu Leu Leu Ile Ala Gly His Glu Thr
225 230 235 240
Thr Val Asn Leu Ile Gly Asn Gly Val Leu Ala Leu Leu Thr His Pro
245 250 255
Asp Gln Arg Lys Leu Leu Ala Glu Asp Pro Ser Leu Ile Ser Ser Ala
260 265 270
Val Glu Glu Phe Leu Arg Phe Asp Ser Pro Val Ser Gln Ala Pro Ile
275 280 285
Arg Phe Thr Ala Glu Asp Val Thr Tyr Ser Gly Val Thr Ile Pro Ala
290 295 300
Gly Glu Met Val Met Leu Gly Leu Ala Ala Ala Asn Arg Asp Ala Asp
305 310 315 320
Trp Met Pro Glu Pro Asp Arg Leu Asp Ile Thr Arg Asp Ala Ser Gly
325 330 335
Gly Val Phe Phe Gly His Gly Ile His Phe Cys Leu Gly Ala Gln Leu
340 345 350
Ala Arg Leu Glu Gly Arg Val Ala Ile Gly Arg Leu Phe Ala Asp Arg
355 360 365
Pro Glu Leu Ala Leu Ala Val Gly Leu Asp Glu Leu Val Tyr Arg Arg
370 375 380
Ser Thr Leu Val Arg Gly Leu Ser Arg Met Pro Val Thr Met Gly Pro
385 390 395 400
Arg Ser Ala Glu Leu Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu
405 410 415
Asn Val Leu His Arg His Gln Pro Val Thr Ile Gly Glu Pro Ala Ala
420 425 430
Arg Ala Val Ser Arg Thr Val Thr Val Glu Arg Leu Asp Arg Ile Ala
435 440 445
Asp Asp Val Leu Arg Leu Val Leu Arg Asp Ala Gly Gly Lys Thr Leu
450 455 460
Pro Thr Trp Thr Pro Gly Ala His Ile Asp Leu Asp Leu Gly Ala Leu
465 470 475 480
Ser Arg Gln Tyr Ser Leu Cys Gly Ala Pro Asp Ala Pro Ser Tyr Glu
485 490 495
Ile Ala Val His Leu Asp Pro Glu Ser Arg Gly Gly Ser Arg Tyr Ile
500 505 510
His Glu Gln Leu Glu Val Gly Ser Pro Leu Arg Met Arg Gly Pro Arg
515 520 525
Asn His Phe Ala Leu Asp Pro Gly Ala Glu His Tyr Val Phe Val Ala
530 535 540
Gly Gly Ile Gly Ile Thr Pro Val Leu Ala Met Ala Asp His Ala Arg
545 550 555 560
Ala Arg Gly Trp Ser Tyr Glu Leu His Tyr Cys Gly Arg Asn Arg Ser
565 570 575
Gly Met Ala Tyr Leu Glu Arg Val Ala Gly His Gly Asp Arg Ala Ala
580 585 590
Leu His Val Ser Glu Glu Gly Thr Arg Ile Asp Leu Ala Ala Leu Leu
595 600 605
Ala Glu Pro Ala Pro Gly Val Gln Ile Tyr Ala Cys Gly Pro Gly Arg
610 615 620
Leu Leu Ala Gly Leu Glu Asp Ala Ser Arg Asn Trp Pro Asp Gly Ala
625 630 635 640
Leu His Val Glu His Phe Thr Ser Ser Leu Ala Ala Leu Asp Pro Asp
645 650 655
Val Glu His Ala Phe Asp Leu Glu Leu Arg Asp Ser Gly Leu Thr Val
660 665 670
Arg Val Glu Pro Thr Gln Thr Val Leu Asp Ala Leu Arg Ala Asn Asn
675 680 685
Ile Asp Val Pro Ser Asp Cys Glu Glu Gly Leu Cys Gly Ser Cys Glu
690 695 700
Val Ala Val Leu Asp Gly Glu Val Asp His Arg Asp Thr Val Leu Thr
705 710 715 720
Lys Ala Glu Arg Ala Ala Asn Arg Gln Met Met Thr Cys Cys Ser Arg
725 730 735
Ala Cys Gly Asp Arg Leu Ala Leu Arg Leu
740 745
<210> 20
<211> 2238
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-RhFR-I14核苷酸序列
<400> 20
atggcactga ccaccaccgg taccgaacag catgacctgt ttagcggtac cttttggcag 60
aatccgcatc cggcgtatgc agcactgcgt gcagaagatc cggttcgtaa actggcactg 120
ccggatggtc cggtgtggct gctgacccgt tatgcagatg ttcgtgaagc atttgttgat 180
ccgcgtctga gtaaagattg gcgtcatcgt ctgccggaag atcagcgtgc cgatatgccg 240
gcaaccccga ccccgatgat gattctgatg gacccgccgg atcatacacg tttacgtaaa 300
ctggttggtc gtagttttac cgttcgtcgt atgaatgaac tggaaccgcg tattaccgaa 360
attgcagatg gtctgctggc aggtctgccg accgatggtc cggttgatct gatgcgtgaa 420
tatgcatttc agattccggt tcaggttata tgtgaactgc tgggtctgcc ggcagaagat 480
cgtgatgatt tttcagcatg gtcaagtgtg ctggttgatg attctccggc agatgataaa 540
aatgccgcaa tgggtaaact gcatggttat ctgtcagatc tgctggaacg taaacgtacc 600
gaaccggatg atgcactgct gagtagcctg ctggcggttt ctgatatgga tggtgatcgt 660
ctgtctcagg aagaactggt tgcaatggca atgctgctgc tgattgcagg tcatgaaacc 720
accgttaatc tgattggtaa tggtgtgctg gcactgctga cccatccgga tcagcgtaaa 780
ctgttagctg aagatccgag tctgattagc tcagcagttg aagaatttct gcgttttgat 840
tctccggtta gccaggcacc gatccgtttt accgctgaag atgttaccta tagtggtgtt 900
accattccgg caggtgaaat ggttatgctg ggtctggcag cagcaaatcg cgatgcagat 960
tggatgccgg aaccggatcg tctggatatt acccgtgatg caagtggtgg tgttttcttt 1020
ggtcatggta ttcatttttg tctgggtgcg cagctggcac gtctggaagg tcgtgtggca 1080
attggtcgtc tgtttgcaga tcgtccggaa ctggcactgg cagttggtct ggatgaactg 1140
gtgtatcgtc gtagcaccct ggttcgtggt ctgagtagga tgccggtgac aatgggtccg 1200
cgttcagcag aactgcagag tgcaaaaaaa gttcgtaaaa aagcagaaaa tgtgctgcac 1260
cggcatcaac cggtcaccat cggagaaccc gccgcccggg cggtgtcccg caccgtcacc 1320
gtcgagcgcc tggaccggat cgccgacgac gtgctgcgcc tcgtcctgcg cgacgccggc 1380
ggaaagacat tacccacgtg gactcccggc gcccatatcg acctcgacct cggcgcgctg 1440
tcgcgccagt actccctgtg cggcgcgccc gatgcgccga gctacgagat tgccgtgcac 1500
ctggatcccg agagccgcgg cggttcgcgc tacatccacg aacagctcga ggtgggaagc 1560
ccgctccgga tgcgcggccc tcggaaccat ttcgcgctcg accccggcgc cgagcactac 1620
gtgttcgtcg ccggcggcat cggcatcacc ccagtcctgg ccatggccga ccacgcccgc 1680
gcccgggggt ggagctacga actgcactac tgcggccgaa accgttccgg catggcctat 1740
ctcgagcgtg tcgccgggca cggtgaccgg gccgccctgc acgtgtccga ggaaggcacc 1800
cggatcgacc tcgccgccct cctcgccgag cccgcccccg gcgtccagat ctacgcgtgc 1860
gggcccgggc ggctgctcgc cggactcgag gacgcgagcc ggaactggcc cgacggggcg 1920
ctgcacgtcg agcacttcac ctcgtccctc gcggcgctcg atccggacgt cgagcacgcc 1980
ttcgacctcg aactgcgtga ctcggggctg accgtgcggg tcgaacccac ccagaccgtc 2040
ctcgacgcgt tgcgcgccaa caacatcgac gtgcccagcg actgcgagga aggcctctgc 2100
ggctcgtgcg aggtcgccgt cctcgacggc gaggtcgacc atcgcgacac ggtgctgacc 2160
aaggccgagc gggcggcgaa ccggcagatg atgacctgct gctcgcgtgc ctgtggcgac 2220
cggctggccc tgcgactc 2238
<210> 21
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-Rh-F1
<400> 21
accggctggc cctgcgactc taaaagcttg cggccgcact 40
<210> 22
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-Rh-R1
<400> 22
ggttgatgcc ggtgcagcac tgctgaacgc ggacccattg 40
<210> 23
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> RhFR-F1
<400> 23
caatgggtcc gcgttcagca gtgctgcacc ggcatcaacc 40
<210> 24
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> RhFR-R1
<400> 24
agtgcggccg caagctttta gagtcgcagg gccagccggt 40
<210> 25
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-BM3-F1
<400> 25
caaaagacgt gtgggctggg taaaagcttg cggccgcact 40
<210> 26
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> K1-BM3-R1
<400> 26
tgttcagtgc taggtgaagg tgctgaacgc ggacccattg 40
<210> 27
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> BM3R-F1
<400> 27
caatgggtcc gcgttcagca ccttcaccta gcactgaaca 40
<210> 28
<211> 40
<212> DNA
<213> Artificial Sequence
<220>
<223> BM3R-R1
<400> 28
agtgcggccg caagctttta cccagcccac acgtcttttg 40
<210> 29
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> I3-F
<400> 29
ccgcgttcag caggcggaag tgtgctgcac cgg 33
<210> 30
<211> 33
<212> DNA
<213> Artificial Sequence
<220>
<223> I3-R
<400> 30
ccggtgcagc acacttccgc ctgctgaacg cgg 33
<210> 31
<211> 38
<212> DNA
<213> Artificial Sequence
<220>
<223> I6-F
<400> 31
gcgttcagca ggcggaagtg gcggaagtgt gctgcacc 38
<210> 32
<211> 38
<212> DNA
<213> Artificial Sequence
<220>
<223> I6-R
<400> 32
ggtgcagcac acttccgcca cttccgcctg ctgaacgc 38
<210> 33
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> D3-F
<400> 33
ggtccgcgtt cagcacggca tcaaccggtc 30
<210> 34
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> D3-R
<400> 34
gaccggttga tgccgtgctg aacgcggacc 30
<210> 35
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> D6-F
<400> 35
ggtccgcgtt cagcaccggt caccatcgga 30
<210> 36
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> D6-R
<400> 36
tccgatggtg accggtgctg aacgcggacc 30
<210> 37
<211> 72
<212> DNA
<213> Artificial Sequence
<220>
<223> I14-F
<400> 37
atgccggtgc agcacatttt ctgctttttt acgaactttt tttgcactct gcagttctgc 60
tgaacgcgga cc 72
<210> 38
<211> 72
<212> DNA
<213> Artificial Sequence
<220>
<223> I14-R
<400> 38
ggtccgcgtt cagcagaact gcagagtgca aaaaaagttc gtaaaaaagc agaaaatgtg 60
ctgcaccggc at 72
Claims (10)
1.一种融合蛋白或其变体,其特征在于,其包括K1和RhFR,所述K1的氨基酸序列如SEQID NO:1所示,所述RhFR的氨基酸序列如SEQ ID NO:5的第466-773位的氨基酸所示。
2.如权利要求1所述的融合蛋白或其变体,其特征在于,所述融合蛋白从N端至C端依次为K1和RhFR;
和/或,所述K1和RhFR之间通过连接子进行连接,所述连接子的氨基酸序列优选如SEQID NO:5的第445-465位的氨基酸所示;
和/或,所述融合蛋白或其变体与分子伴侣共表达,所述分子伴侣优选为Gro7。
3.如权利要求1或2所述的融合蛋白或其变体,其特征在于,所述变体为在所述RhFR的N端发生氨基酸的插入或缺失,优选在所述RhFR的N端发生1-14个更优选发生3-6个氨基酸的插入或缺失;
较佳地,所述融合蛋白或其变体的氨基酸序列如SEQ ID NO:9、SEQ ID NO:11、SEQ IDNO:13、SEQ ID NO:15、SEQ ID NO:17或SEQ ID NO:19所示;
更佳地,编码所述融合蛋白或其变体的核苷酸序列如SEQ ID NO:10、SEQ ID NO:12、SEQ ID NO:14、SEQ ID NO:16、SEQ ID NO:18或SEQ ID NO:20所示。
4.一种融合基因,其特征在于,其编码如权利要求1~3任一项所述的融合蛋白或其变体。
5.一种重组表达载体,其特征在于,所述重组表达载体含有如权利要求4所述的融合基因;
较佳地,所述重组表达载体的骨架载体为pET28a。
6.一种转化体,其特征在于,其包括如权利要求4所述的融合基因或者如权利要求5所述的重组表达载体;
较佳地,所转化体通过在宿主中导入所述融合基因或者所述重组表达载体获得,所述宿主优选为大肠杆菌,更优选为大肠杆菌E.coli BL21(DE3)细胞。
7.一种融合蛋白或其变体的制备方法,其包括以下步骤:
(1)获得如权利要求6所述的转化体;
(2)筛选所述转化体,表达并纯化所述融合蛋白或其变体。
8.一种骨化二醇的制备方法,其特征在于,所述制备方法包括以下步骤:在反应溶剂、还原型辅酶NADH/NADPH的存在下,将如权利要求1~3任一项所述的融合蛋白或其变体催化维生素D3进行羟化反应即可;
较佳地:
所述维生素D3为助溶剂预溶的维生素D3;所述助溶剂优选包括DMSO、吐温80、TritonX100、甲醇、乙醇、异丙醇和DMF中的一种或多种,例如为乙醇;
和/或,所述方法还包括在进行所述羟化反应前,在所述反应溶剂中加入环糊精的步骤,所述环糊精例如为羟丙基-β-环糊精;所述羟丙基-β-环糊精占反应体系的质量体积百分比优选为0.05%-0.4%,例如0.25%;
和/或,所述反应的温度为20~33℃,例如为22℃、25℃、28℃或30℃;
和/或,所述反应的pH为6.0~8.0,例如为7.4;
和/或,所述维生素D3的浓度为1g/L-10g/L,例如为2g/L、3g/L、4g/L、5g/L、6g/L、7g/L、8g/L或9g/L;
和/或,所述NADH/NADPH与所述维生素D3的摩尔比为0.001:1~2:1,例如0.2:1。
9.如权利要求8所述的方法,其特征在于,所述的制备方法还包括以下步骤:在脱氢酶以及供氢体的存在下,将氧化型辅酶NAD+/NADP+进行还原反应,得到所述的NADH/NADPH即可;
较佳地,所述的脱氢酶为葡萄糖脱氢酶、醇脱氢酶或甲酸脱氢酶;和/或,所述的供氢体为葡萄糖、异丙醇或甲酸盐;
更佳地,当所述的脱氢酶为醇脱氢酶时,所述的供氢体为异丙醇;当所述的脱氢酶为葡萄糖脱氢酶时,所述的供氢体为葡萄糖;当所述的脱氢酶为甲酸脱氢酶时,所述的供氢体为甲酸盐。
10.一种如权利要求1~3任一项所述的融合蛋白或其变体、如权利要求4所述的融合基因、如权利要求5所述的重组表达载体、或如权利要求6所述的转化体在制备骨化二醇中的应用。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010369514.XA CN113583983A (zh) | 2020-04-30 | 2020-04-30 | 一种融合蛋白或其变体及其在制备骨化二醇中的应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010369514.XA CN113583983A (zh) | 2020-04-30 | 2020-04-30 | 一种融合蛋白或其变体及其在制备骨化二醇中的应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113583983A true CN113583983A (zh) | 2021-11-02 |
Family
ID=78237807
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010369514.XA Pending CN113583983A (zh) | 2020-04-30 | 2020-04-30 | 一种融合蛋白或其变体及其在制备骨化二醇中的应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113583983A (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116240246A (zh) * | 2021-12-20 | 2023-06-09 | 浙江金朗博药业有限公司 | 利用过氧化物酶合成骨化二醇的方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090081758A1 (en) * | 2007-09-21 | 2009-03-26 | The Regents Of The University Of Michigan | Chimeric Cytochrome P450 Proteins and Methods of Use |
CN101675157A (zh) * | 2007-03-01 | 2010-03-17 | 美露香株式会社 | 来自多药运出蛋白缺损株的转化株以及使用该转化株的微生物转化方法 |
KR20170103405A (ko) * | 2016-03-04 | 2017-09-13 | 인하대학교 산학협력단 | 코돈 최적화된 사이클로스포린 특이적 p450 수산화효소 및 이를 이용한 대장균 기반의 최적화된 사이클로스포린 생전환 방법 |
-
2020
- 2020-04-30 CN CN202010369514.XA patent/CN113583983A/zh active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101675157A (zh) * | 2007-03-01 | 2010-03-17 | 美露香株式会社 | 来自多药运出蛋白缺损株的转化株以及使用该转化株的微生物转化方法 |
US20090081758A1 (en) * | 2007-09-21 | 2009-03-26 | The Regents Of The University Of Michigan | Chimeric Cytochrome P450 Proteins and Methods of Use |
KR20170103405A (ko) * | 2016-03-04 | 2017-09-13 | 인하대학교 산학협력단 | 코돈 최적화된 사이클로스포린 특이적 p450 수산화효소 및 이를 이용한 대장균 기반의 최적화된 사이클로스포린 생전환 방법 |
Non-Patent Citations (2)
Title |
---|
SHENGYING LI等: "Engineering and analysis of a self-sufficient biosynthetic cytochrome P450 PikC fused to the RhFRED reductase domain", 《J AM CHEM SOC.》, vol. 129, no. 43, 31 October 2007 (2007-10-31), pages 2 - 4, XP055142363, DOI: 10.1021/ja075842d * |
YOSHIAKI YASUTAKE等: "Structural evidence for enhancement of sequential vitamin D3 hydroxylation activities by directed evolution of cytochrome P450 vitamin D3 hydroxylase", 《J BIOL CHEM.》, vol. 285, no. 41, 8 October 2010 (2010-10-08), pages 31194 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116240246A (zh) * | 2021-12-20 | 2023-06-09 | 浙江金朗博药业有限公司 | 利用过氧化物酶合成骨化二醇的方法 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10787651B2 (en) | Bradyrhizobium monooxygenase and use thereof for preparation of chiral sulfoxide | |
US11345900B2 (en) | Monooxygenase and use in preparation of optically pure sulfoxide | |
CN106929521B (zh) | 一种醛酮还原酶基因重组共表达载体、工程菌及其应用 | |
CN115011616B (zh) | 一种乙醛脱氢酶基因rkaldh及其应用 | |
CN108728421B (zh) | 一种羰基还原酶突变体及其用途 | |
CN112813013B (zh) | 一种生产羟基酪醇的重组大肠杆菌及其应用 | |
CN113430216B (zh) | 一种苯丙酮单加氧酶及在拉唑类药物制备中的应用 | |
CN109504645B (zh) | 异亮氨酸双加氧酶、突变体及在合成4-羟基异亮氨酸中的应用 | |
CN111996176B (zh) | 羰基还原酶突变体及其应用 | |
CN113151201A (zh) | 高热稳定性高活性异丁香酚单加氧酶突变体及其应用 | |
CN113817693B (zh) | 一种短链羰基还原酶PpYSDR突变体、编码基因、重组表达载体、基因工程菌及应用 | |
CN114672525B (zh) | N-乙酰基-5-甲氧基色胺的生物合成方法及其应用 | |
CN112442490B (zh) | 一种转化酶及其在产s-雌马酚中的应用 | |
CN113583983A (zh) | 一种融合蛋白或其变体及其在制备骨化二醇中的应用 | |
CN106957812B (zh) | 一种产细胞色素p450酶及其电子传递系统工程菌的构建及其应用 | |
CN111394289B (zh) | 一种基因工程菌及其应用,生产前列腺素e2的方法 | |
CN113493756A (zh) | 一种基因工程菌及其应用 | |
CN114395571B (zh) | 三角褐指藻zep1基因、蛋白及应用 | |
CN113322291A (zh) | 一种手性氨基醇类化合物的合成方法 | |
CN114507650B (zh) | 亮氨酸脱氢酶突变体及其在合成(s)-邻氯苯甘氨酸中的应用 | |
CN110904062A (zh) | 一株高产l-丙氨酸的菌株 | |
CN112410353B (zh) | 一种fkbS基因、含其的基因工程菌及其制备方法和用途 | |
CN114854714A (zh) | 一种菜豆源环氧化物酶突变体、基因、载体、工程菌及制备方法和应用 | |
CN111254180B (zh) | 一种酶法拆分制备(s)-1,2,3,4-四氢异喹啉-3-甲酸的方法 | |
CN111254170B (zh) | 一种多酶耦合制备(s)-1,2,3,4-四氢异喹啉-3-甲酸的方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant after: Yikelai Biotechnology (Group) Co.,Ltd. Address before: Room 3114, building B, 555 Dongchuan Road, Minhang District, Shanghai 200240 Applicant before: Ecolab Biotechnology (Shanghai) Co.,Ltd. |