CA2392109A1 - Mammalian receptor proteins; related reagents and methods - Google Patents
Mammalian receptor proteins; related reagents and methods Download PDFInfo
- Publication number
- CA2392109A1 CA2392109A1 CA002392109A CA2392109A CA2392109A1 CA 2392109 A1 CA2392109 A1 CA 2392109A1 CA 002392109 A CA002392109 A CA 002392109A CA 2392109 A CA2392109 A CA 2392109A CA 2392109 A1 CA2392109 A1 CA 2392109A1
- Authority
- CA
- Canada
- Prior art keywords
- ser
- leu
- pro
- glu
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims abstract description 108
- 239000003153 chemical reaction reagent Substances 0.000 title claims description 34
- 108020003175 receptors Proteins 0.000 title abstract description 90
- 102000005962 receptors Human genes 0.000 title abstract description 89
- 239000012634 fragment Substances 0.000 claims abstract description 70
- 241000288906 Primates Species 0.000 claims abstract description 67
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 65
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 54
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 54
- 239000000203 mixture Substances 0.000 claims abstract description 37
- 108090000623 proteins and genes Proteins 0.000 claims description 170
- 102000004169 proteins and genes Human genes 0.000 claims description 147
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 139
- 210000004027 cell Anatomy 0.000 claims description 121
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 113
- 229920001184 polypeptide Polymers 0.000 claims description 103
- 230000027455 binding Effects 0.000 claims description 91
- 108010057085 cytokine receptors Proteins 0.000 claims description 66
- 102000003675 cytokine receptors Human genes 0.000 claims description 66
- 239000002773 nucleotide Substances 0.000 claims description 63
- 125000003729 nucleotide group Chemical group 0.000 claims description 63
- 150000001413 amino acids Chemical class 0.000 claims description 60
- 241000282414 Homo sapiens Species 0.000 claims description 54
- 150000001875 compounds Chemical class 0.000 claims description 44
- 230000004927 fusion Effects 0.000 claims description 29
- 239000000427 antigen Substances 0.000 claims description 28
- 108091007433 antigens Proteins 0.000 claims description 27
- 102000036639 antigens Human genes 0.000 claims description 27
- 239000000758 substrate Substances 0.000 claims description 24
- 239000000523 sample Substances 0.000 claims description 23
- 239000002299 complementary DNA Substances 0.000 claims description 21
- 239000005557 antagonist Substances 0.000 claims description 18
- 238000009396 hybridization Methods 0.000 claims description 18
- 238000006467 substitution reaction Methods 0.000 claims description 18
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 16
- 239000000556 agonist Substances 0.000 claims description 16
- 239000013604 expression vector Substances 0.000 claims description 16
- 239000007787 solid Substances 0.000 claims description 16
- 238000001514 detection method Methods 0.000 claims description 13
- 239000000126 substance Substances 0.000 claims description 13
- 150000003839 salts Chemical class 0.000 claims description 12
- 230000000890 antigenic effect Effects 0.000 claims description 11
- 230000013595 glycosylation Effects 0.000 claims description 11
- 238000006206 glycosylation reaction Methods 0.000 claims description 11
- 238000003780 insertion Methods 0.000 claims description 10
- 230000037431 insertion Effects 0.000 claims description 10
- 239000000872 buffer Substances 0.000 claims description 9
- 238000012217 deletion Methods 0.000 claims description 9
- 230000037430 deletion Effects 0.000 claims description 9
- 239000012528 membrane Substances 0.000 claims description 9
- 238000002703 mutagenesis Methods 0.000 claims description 9
- 231100000350 mutagenesis Toxicity 0.000 claims description 9
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 claims description 9
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 claims description 8
- 238000011161 development Methods 0.000 claims description 8
- 239000004033 plastic Substances 0.000 claims description 8
- 229920003023 plastic Polymers 0.000 claims description 8
- 238000000746 purification Methods 0.000 claims description 8
- 230000010076 replication Effects 0.000 claims description 8
- 239000011780 sodium chloride Substances 0.000 claims description 7
- 108091026890 Coding region Proteins 0.000 claims description 6
- 238000007911 parenteral administration Methods 0.000 claims description 6
- 238000011200 topical administration Methods 0.000 claims description 6
- 230000000699 topical effect Effects 0.000 claims description 6
- 241000238631 Hexapoda Species 0.000 claims description 5
- 239000011324 bead Substances 0.000 claims description 5
- 210000004962 mammalian cell Anatomy 0.000 claims description 5
- 239000007850 fluorescent dye Substances 0.000 claims description 4
- 230000002285 radioactive effect Effects 0.000 claims description 4
- 230000001580 bacterial effect Effects 0.000 claims description 3
- 230000035479 physiological effects, processes and functions Effects 0.000 claims description 3
- 210000003527 eukaryotic cell Anatomy 0.000 claims description 2
- 210000005260 human cell Anatomy 0.000 claims description 2
- 210000001236 prokaryotic cell Anatomy 0.000 claims description 2
- 210000005253 yeast cell Anatomy 0.000 claims description 2
- 102000014150 Interferons Human genes 0.000 claims 1
- 108010050904 Interferons Proteins 0.000 claims 1
- 229940079322 interferon Drugs 0.000 claims 1
- 230000001225 therapeutic effect Effects 0.000 abstract description 12
- 235000018102 proteins Nutrition 0.000 description 144
- 239000003446 ligand Substances 0.000 description 82
- 235000001014 amino acid Nutrition 0.000 description 46
- 229940024606 amino acid Drugs 0.000 description 46
- 108020004414 DNA Proteins 0.000 description 42
- 239000013598 vector Substances 0.000 description 38
- 230000014509 gene expression Effects 0.000 description 32
- 210000001744 T-lymphocyte Anatomy 0.000 description 30
- 108020004635 Complementary DNA Proteins 0.000 description 25
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 25
- 241000282326 Felis catus Species 0.000 description 24
- 108010034529 leucyl-lysine Proteins 0.000 description 23
- 102000004127 Cytokines Human genes 0.000 description 22
- 108090000695 Cytokines Proteins 0.000 description 22
- 230000000694 effects Effects 0.000 description 22
- 230000000284 resting effect Effects 0.000 description 22
- 241000894007 species Species 0.000 description 22
- 238000013519 translation Methods 0.000 description 21
- 125000003275 alpha amino acid group Chemical group 0.000 description 20
- 238000003556 assay Methods 0.000 description 20
- 230000014616 translation Effects 0.000 description 20
- 238000010804 cDNA synthesis Methods 0.000 description 19
- 230000008569 process Effects 0.000 description 18
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 17
- 108010026333 seryl-proline Proteins 0.000 description 17
- 241000880493 Leptailurus serval Species 0.000 description 16
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 16
- 230000002068 genetic effect Effects 0.000 description 16
- 238000003752 polymerase chain reaction Methods 0.000 description 16
- 108010061238 threonyl-glycine Proteins 0.000 description 16
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 15
- 239000000047 product Substances 0.000 description 15
- 238000003018 immunoassay Methods 0.000 description 14
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 13
- 210000001616 monocyte Anatomy 0.000 description 13
- 108090000790 Enzymes Proteins 0.000 description 12
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 12
- 241000699666 Mus <mouse, genus> Species 0.000 description 12
- 230000004071 biological effect Effects 0.000 description 12
- 230000001605 fetal effect Effects 0.000 description 12
- 230000002163 immunogen Effects 0.000 description 12
- 238000002372 labelling Methods 0.000 description 12
- 108010057821 leucylproline Proteins 0.000 description 12
- 230000035772 mutation Effects 0.000 description 12
- 230000002441 reversible effect Effects 0.000 description 12
- 238000012216 screening Methods 0.000 description 12
- 210000000952 spleen Anatomy 0.000 description 12
- 238000012360 testing method Methods 0.000 description 12
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 11
- 102000004190 Enzymes Human genes 0.000 description 11
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 11
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 11
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 11
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 11
- 229940088598 enzyme Drugs 0.000 description 11
- 108020001507 fusion proteins Proteins 0.000 description 11
- 102000037865 fusion proteins Human genes 0.000 description 11
- 108010050848 glycylleucine Proteins 0.000 description 11
- PGHMRUGBZOYCAA-ADZNBVRBSA-N ionomycin Chemical compound O1[C@H](C[C@H](O)[C@H](C)[C@H](O)[C@H](C)/C=C/C[C@@H](C)C[C@@H](C)C(/O)=C/C(=O)[C@@H](C)C[C@@H](C)C[C@@H](CCC(O)=O)C)CC[C@@]1(C)[C@@H]1O[C@](C)([C@@H](C)O)CC1 PGHMRUGBZOYCAA-ADZNBVRBSA-N 0.000 description 11
- PGHMRUGBZOYCAA-UHFFFAOYSA-N ionomycin Natural products O1C(CC(O)C(C)C(O)C(C)C=CCC(C)CC(C)C(O)=CC(=O)C(C)CC(C)CC(CCC(O)=O)C)CCC1(C)C1OC(C)(C(C)O)CC1 PGHMRUGBZOYCAA-UHFFFAOYSA-N 0.000 description 11
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 11
- -1 sequence Substances 0.000 description 11
- 210000001519 tissue Anatomy 0.000 description 11
- 108090000978 Interleukin-4 Proteins 0.000 description 10
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 10
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 10
- 108091028043 Nucleic acid sequence Proteins 0.000 description 10
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 10
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 10
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 10
- 230000006870 function Effects 0.000 description 10
- 108010077515 glycylproline Proteins 0.000 description 10
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 10
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 10
- 230000004048 modification Effects 0.000 description 10
- 238000012986 modification Methods 0.000 description 10
- 108010029020 prolylglycine Proteins 0.000 description 10
- 108010038745 tryptophylglycine Proteins 0.000 description 10
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 9
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 9
- KIZIOFNVSOSKJI-CIUDSAMLSA-N Leu-Ser-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N KIZIOFNVSOSKJI-CIUDSAMLSA-N 0.000 description 9
- CAVGLNOOIFHJOF-SRVKXCTJSA-N Lys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N CAVGLNOOIFHJOF-SRVKXCTJSA-N 0.000 description 9
- OBPCXINRFKHSRY-SDDRHHMPSA-N Met-Met-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N OBPCXINRFKHSRY-SDDRHHMPSA-N 0.000 description 9
- 241001465754 Metazoa Species 0.000 description 9
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 9
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 9
- KUSYCSMTTHSZOA-DZKIICNBSA-N Phe-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N KUSYCSMTTHSZOA-DZKIICNBSA-N 0.000 description 9
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 9
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 9
- 241000283984 Rodentia Species 0.000 description 9
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 9
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 9
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 9
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 9
- 108010047495 alanylglycine Proteins 0.000 description 9
- 108010008355 arginyl-glutamine Proteins 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 210000003719 b-lymphocyte Anatomy 0.000 description 9
- 230000000295 complement effect Effects 0.000 description 9
- 108010078144 glutaminyl-glycine Proteins 0.000 description 9
- 108010092114 histidylphenylalanine Proteins 0.000 description 9
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 9
- 239000000243 solution Substances 0.000 description 9
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 8
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 8
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 8
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 8
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 8
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 8
- BIJDDZBDSJLWJY-PJODQICGSA-N Trp-Ala-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O BIJDDZBDSJLWJY-PJODQICGSA-N 0.000 description 8
- 108010047857 aspartylglycine Proteins 0.000 description 8
- 239000003814 drug Substances 0.000 description 8
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 8
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 8
- 108010009298 lysylglutamic acid Proteins 0.000 description 8
- 108020004999 messenger RNA Proteins 0.000 description 8
- 238000002360 preparation method Methods 0.000 description 8
- 239000007790 solid phase Substances 0.000 description 8
- 238000010561 standard procedure Methods 0.000 description 8
- 239000013589 supplement Substances 0.000 description 8
- 108010003137 tyrosyltyrosine Proteins 0.000 description 8
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 7
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 7
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 7
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 7
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 7
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 7
- 108010065920 Insulin Lispro Proteins 0.000 description 7
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 7
- 108010074338 Lymphokines Proteins 0.000 description 7
- 102000008072 Lymphokines Human genes 0.000 description 7
- 108010076504 Protein Sorting Signals Proteins 0.000 description 7
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 7
- 108700012920 TNF Proteins 0.000 description 7
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 7
- BOBZBMOTRORUPT-XIRDDKMYSA-N Trp-Ser-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 BOBZBMOTRORUPT-XIRDDKMYSA-N 0.000 description 7
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 7
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 7
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 7
- 108010038633 aspartylglutamate Proteins 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 108010069495 cysteinyltyrosine Proteins 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 230000028993 immune response Effects 0.000 description 7
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 7
- 108010027338 isoleucylcysteine Proteins 0.000 description 7
- 108010054155 lysyllysine Proteins 0.000 description 7
- 108010017391 lysylvaline Proteins 0.000 description 7
- 238000004519 manufacturing process Methods 0.000 description 7
- 108010051242 phenylalanylserine Proteins 0.000 description 7
- 230000026731 phosphorylation Effects 0.000 description 7
- 238000006366 phosphorylation reaction Methods 0.000 description 7
- 108010004914 prolylarginine Proteins 0.000 description 7
- 238000000926 separation method Methods 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 6
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 6
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 6
- BQBPFMNVOWDLHO-XIRDDKMYSA-N Arg-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N BQBPFMNVOWDLHO-XIRDDKMYSA-N 0.000 description 6
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 6
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 6
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 6
- NZJDBCYBYCUEDC-UBHSHLNASA-N Asp-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N NZJDBCYBYCUEDC-UBHSHLNASA-N 0.000 description 6
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 6
- ZRUBWRCKIVDCFS-XPCJQDJLSA-N Asp-Leu-Thr-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZRUBWRCKIVDCFS-XPCJQDJLSA-N 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 6
- 241000206602 Eukaryota Species 0.000 description 6
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 6
- BETSEXMYBWCDAE-SZMVWBNQSA-N Gln-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N BETSEXMYBWCDAE-SZMVWBNQSA-N 0.000 description 6
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 6
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 6
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 6
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 6
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 6
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 6
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 6
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 6
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 6
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 6
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 6
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 6
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 6
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 6
- PHKBGZKVOJCIMZ-SRVKXCTJSA-N Met-Pro-Arg Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PHKBGZKVOJCIMZ-SRVKXCTJSA-N 0.000 description 6
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- OPEVYHFJXLCCRT-AVGNSLFASA-N Phe-Gln-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O OPEVYHFJXLCCRT-AVGNSLFASA-N 0.000 description 6
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 6
- ANESFYPBAJPYNJ-SDDRHHMPSA-N Pro-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ANESFYPBAJPYNJ-SDDRHHMPSA-N 0.000 description 6
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 6
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 6
- MCPXQHVVCPTRIM-HJOGWXRNSA-N Pro-Trp-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)[C@@H]1CCCN1 MCPXQHVVCPTRIM-HJOGWXRNSA-N 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 6
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 6
- WMZVVNLPHFSUPA-BPUTZDHNSA-N Ser-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 WMZVVNLPHFSUPA-BPUTZDHNSA-N 0.000 description 6
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 6
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 6
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 6
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 6
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 6
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 6
- 108010068380 arginylarginine Proteins 0.000 description 6
- 108010060035 arginylproline Proteins 0.000 description 6
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 6
- 230000009260 cross reactivity Effects 0.000 description 6
- 230000001747 exhibiting effect Effects 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 6
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 6
- 108010018006 histidylserine Proteins 0.000 description 6
- 210000004408 hybridoma Anatomy 0.000 description 6
- 238000002955 isolation Methods 0.000 description 6
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 6
- 210000004072 lung Anatomy 0.000 description 6
- 210000004698 lymphocyte Anatomy 0.000 description 6
- 238000010369 molecular cloning Methods 0.000 description 6
- 230000003472 neutralizing effect Effects 0.000 description 6
- 230000000737 periodic effect Effects 0.000 description 6
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 108010070643 prolylglutamic acid Proteins 0.000 description 6
- 108010053725 prolylvaline Proteins 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 108010073969 valyllysine Proteins 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 5
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 5
- 238000002965 ELISA Methods 0.000 description 5
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 5
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 5
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 5
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 5
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 5
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 5
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 5
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 5
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 5
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 5
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 5
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 5
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 5
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 5
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 5
- 108010003201 RGH 0205 Proteins 0.000 description 5
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 5
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 5
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 5
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 5
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 5
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 5
- VDPRBUOZLIFUIM-GUBZILKMSA-N Val-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N VDPRBUOZLIFUIM-GUBZILKMSA-N 0.000 description 5
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N biotin Natural products N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- 108010016616 cysteinylglycine Proteins 0.000 description 5
- 239000002552 dosage form Substances 0.000 description 5
- 108010040030 histidinoalanine Proteins 0.000 description 5
- 210000000987 immune system Anatomy 0.000 description 5
- 208000026278 immune system disease Diseases 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- 210000002540 macrophage Anatomy 0.000 description 5
- 239000002609 medium Substances 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 239000013612 plasmid Substances 0.000 description 5
- 238000001742 protein purification Methods 0.000 description 5
- 239000001397 quillaja saponaria molina bark Substances 0.000 description 5
- 239000011347 resin Substances 0.000 description 5
- 229920005989 resin Polymers 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 229930182490 saponin Natural products 0.000 description 5
- 150000007949 saponins Chemical class 0.000 description 5
- 210000002966 serum Anatomy 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 4
- CSAHOYQKNHGDHX-ACZMJKKPSA-N Ala-Gln-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CSAHOYQKNHGDHX-ACZMJKKPSA-N 0.000 description 4
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 4
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 4
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 4
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 4
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 4
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 4
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 4
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 4
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 4
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 4
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 4
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 4
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 4
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 4
- WPXPYZPGSGWQSC-DCAQKATOSA-N Cys-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WPXPYZPGSGWQSC-DCAQKATOSA-N 0.000 description 4
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 4
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 4
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 4
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 4
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 4
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 4
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 4
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 4
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 4
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 4
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 4
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 4
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 4
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 4
- 101000799466 Homo sapiens Thrombopoietin receptor Proteins 0.000 description 4
- 108010002350 Interleukin-2 Proteins 0.000 description 4
- 102000015696 Interleukins Human genes 0.000 description 4
- 108010063738 Interleukins Proteins 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 4
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 4
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 4
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 4
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 4
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 4
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 4
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 4
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 4
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 4
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 4
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 4
- WYNIRYZIFZGWQD-BPUTZDHNSA-N Met-Trp-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WYNIRYZIFZGWQD-BPUTZDHNSA-N 0.000 description 4
- 241000699670 Mus sp. Species 0.000 description 4
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 4
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 4
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 4
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 4
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 4
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 4
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 4
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 4
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 4
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 4
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 4
- ZYJMLBCDFPIGNL-JYJNAYRXSA-N Pro-Tyr-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O ZYJMLBCDFPIGNL-JYJNAYRXSA-N 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 4
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 4
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 4
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 4
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 4
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 4
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 4
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 4
- WPSDXXQRIVKBAY-NKIYYHGXSA-N Thr-His-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O WPSDXXQRIVKBAY-NKIYYHGXSA-N 0.000 description 4
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 4
- WYLAVUAWOUVUCA-XVSYOHENSA-N Thr-Phe-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WYLAVUAWOUVUCA-XVSYOHENSA-N 0.000 description 4
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 4
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 4
- 102100034196 Thrombopoietin receptor Human genes 0.000 description 4
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 4
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 4
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 4
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 4
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 4
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 4
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 4
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 4
- BGXVHVMJZCSOCA-AVGNSLFASA-N Val-Pro-Lys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N BGXVHVMJZCSOCA-AVGNSLFASA-N 0.000 description 4
- SSYBNWFXCFNRFN-GUBZILKMSA-N Val-Pro-Ser Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O SSYBNWFXCFNRFN-GUBZILKMSA-N 0.000 description 4
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 4
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 4
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 125000003277 amino group Chemical group 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 230000009137 competitive binding Effects 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010015792 glycyllysine Proteins 0.000 description 4
- 108010084389 glycyltryptophan Proteins 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 108010085325 histidylproline Proteins 0.000 description 4
- 230000003053 immunization Effects 0.000 description 4
- 238000002649 immunization Methods 0.000 description 4
- 210000003734 kidney Anatomy 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 238000004091 panning Methods 0.000 description 4
- 238000010647 peptide synthesis reaction Methods 0.000 description 4
- 230000000144 pharmacologic effect Effects 0.000 description 4
- 108010073101 phenylalanylleucine Proteins 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 230000004936 stimulating effect Effects 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- 238000002560 therapeutic procedure Methods 0.000 description 4
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 4
- 108010084932 tryptophyl-proline Proteins 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 4
- 108010009962 valyltyrosine Proteins 0.000 description 4
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 3
- KKTUQAYCCLMNOA-UHFFFAOYSA-N 2,3-diaminobenzoic acid Chemical compound NC1=CC=CC(C(O)=O)=C1N KKTUQAYCCLMNOA-UHFFFAOYSA-N 0.000 description 3
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 3
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 3
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 3
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 3
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 3
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 3
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 3
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 3
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 3
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 3
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 3
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 3
- BKZFBJYIVSBXCO-KKUMJFAQSA-N Asn-Phe-His Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O BKZFBJYIVSBXCO-KKUMJFAQSA-N 0.000 description 3
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 3
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 3
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 3
- UFAQGGZUXVLONR-AVGNSLFASA-N Asp-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)O UFAQGGZUXVLONR-AVGNSLFASA-N 0.000 description 3
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 3
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 3
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 3
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 3
- 108090001008 Avidin Proteins 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 3
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 3
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 3
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 3
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 3
- DDNIZQDYXDENIT-FXQIFTODSA-N Gln-Glu-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N DDNIZQDYXDENIT-FXQIFTODSA-N 0.000 description 3
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 3
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 3
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 3
- CMFBOXUBWMZZMD-BPUTZDHNSA-N Gln-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CMFBOXUBWMZZMD-BPUTZDHNSA-N 0.000 description 3
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 3
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 3
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 3
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 3
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 3
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 3
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 3
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 3
- 102000005720 Glutathione transferase Human genes 0.000 description 3
- 108010070675 Glutathione transferase Proteins 0.000 description 3
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 3
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 3
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 3
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 3
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 3
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 3
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 3
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 3
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 3
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 3
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 3
- WTOAPTKSZJJWKK-HTFCKZLJSA-N Ile-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WTOAPTKSZJJWKK-HTFCKZLJSA-N 0.000 description 3
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 3
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 3
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 3
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 3
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 3
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 3
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 3
- 108060003951 Immunoglobulin Proteins 0.000 description 3
- 108010002586 Interleukin-7 Proteins 0.000 description 3
- 108010002335 Interleukin-9 Proteins 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 3
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 3
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 3
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 3
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 3
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 3
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 3
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 3
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 3
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 3
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 3
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 3
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 3
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 3
- QXEVZBXTDTVPCP-GMOBBJLQSA-N Met-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCSC)N QXEVZBXTDTVPCP-GMOBBJLQSA-N 0.000 description 3
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 3
- 206010028980 Neoplasm Diseases 0.000 description 3
- 108091005461 Nucleic proteins Proteins 0.000 description 3
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 3
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 3
- YDUGVDGFKNXFPL-IXOXFDKPSA-N Phe-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YDUGVDGFKNXFPL-IXOXFDKPSA-N 0.000 description 3
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 3
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 3
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 3
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 3
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 3
- VPBQDHMASPJHGY-JYJNAYRXSA-N Pro-Trp-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CO)C(=O)O VPBQDHMASPJHGY-JYJNAYRXSA-N 0.000 description 3
- 229920002684 Sepharose Polymers 0.000 description 3
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 3
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 3
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 3
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 3
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 3
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 3
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 3
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 3
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 3
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 3
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 3
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 3
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 3
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 3
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 3
- MXKUGFHWYYKVDV-SZMVWBNQSA-N Trp-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(C)C)C(O)=O MXKUGFHWYYKVDV-SZMVWBNQSA-N 0.000 description 3
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 3
- WVRUKYLYMFGKAN-IHRRRGAJSA-N Tyr-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 WVRUKYLYMFGKAN-IHRRRGAJSA-N 0.000 description 3
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 3
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 3
- AHHJARQXFFGOKF-NRPADANISA-N Val-Glu-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N AHHJARQXFFGOKF-NRPADANISA-N 0.000 description 3
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 3
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 3
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 3
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 3
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 3
- 239000004480 active ingredient Substances 0.000 description 3
- 239000002671 adjuvant Substances 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 229960002685 biotin Drugs 0.000 description 3
- 235000020958 biotin Nutrition 0.000 description 3
- 239000011616 biotin Substances 0.000 description 3
- 239000011575 calcium Substances 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 230000036755 cellular response Effects 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 239000003937 drug carrier Substances 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000001400 expression cloning Methods 0.000 description 3
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 3
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 3
- 230000001279 glycosylating effect Effects 0.000 description 3
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010010147 glycylglutamine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 239000003102 growth factor Substances 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 102000018358 immunoglobulin Human genes 0.000 description 3
- 230000001965 increasing effect Effects 0.000 description 3
- 230000015788 innate immune response Effects 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 230000031864 metaphase Effects 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 3
- 108010090894 prolylleucine Proteins 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 210000004988 splenocyte Anatomy 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 238000010396 two-hybrid screening Methods 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 238000005406 washing Methods 0.000 description 3
- VUDQSRFCCHQIIU-UHFFFAOYSA-N 1-(3,5-dichloro-2,6-dihydroxy-4-methoxyphenyl)hexan-1-one Chemical compound CCCCCC(=O)C1=C(O)C(Cl)=C(OC)C(Cl)=C1O VUDQSRFCCHQIIU-UHFFFAOYSA-N 0.000 description 2
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 2
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 2
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 2
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 2
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- MNBHKGYCLBUIBC-UFYCRDLUSA-N Arg-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCNC(N)=N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MNBHKGYCLBUIBC-UFYCRDLUSA-N 0.000 description 2
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 2
- OWSMKCJUBAPHED-JYJNAYRXSA-N Arg-Pro-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OWSMKCJUBAPHED-JYJNAYRXSA-N 0.000 description 2
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 2
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 2
- KUYKVGODHGHFDI-ACZMJKKPSA-N Asn-Gln-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O KUYKVGODHGHFDI-ACZMJKKPSA-N 0.000 description 2
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 2
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 2
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- UPJGYXRAPJWIHD-CIUDSAMLSA-N Cys-Asn-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UPJGYXRAPJWIHD-CIUDSAMLSA-N 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- IIGHQOPGMGKDMT-SRVKXCTJSA-N Cys-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N IIGHQOPGMGKDMT-SRVKXCTJSA-N 0.000 description 2
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 2
- UXBYDFKFHMYYPL-XIRDDKMYSA-N Cys-His-Trp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UXBYDFKFHMYYPL-XIRDDKMYSA-N 0.000 description 2
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 2
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 2
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 2
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 2
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- 238000009007 Diagnostic Kit Methods 0.000 description 2
- 241000224495 Dictyostelium Species 0.000 description 2
- QOSSAOTZNIDXMA-UHFFFAOYSA-N Dicylcohexylcarbodiimide Chemical compound C1CCCCC1N=C=NC1CCCCC1 QOSSAOTZNIDXMA-UHFFFAOYSA-N 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108010092408 Eosinophil Peroxidase Proteins 0.000 description 2
- 102100037362 Fibronectin Human genes 0.000 description 2
- 108010067306 Fibronectins Proteins 0.000 description 2
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 2
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 2
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 2
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 2
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 2
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 2
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 2
- HGBHRZBXOOHRDH-JBACZVJFSA-N Gln-Tyr-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HGBHRZBXOOHRDH-JBACZVJFSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 2
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 2
- PBFGQTGPSKWHJA-QEJZJMRPSA-N Glu-Asp-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PBFGQTGPSKWHJA-QEJZJMRPSA-N 0.000 description 2
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 2
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 2
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- GYAUWXXORNTCHU-QWRGUYRKSA-N Gly-Cys-Tyr Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GYAUWXXORNTCHU-QWRGUYRKSA-N 0.000 description 2
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 2
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 2
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 2
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 2
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 2
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 2
- ISSDODCYBOWWIP-GJZGRUSLSA-N Gly-Pro-Trp Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ISSDODCYBOWWIP-GJZGRUSLSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 2
- JKSMZVCGQWVTBW-STQMWFEESA-N Gly-Trp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O JKSMZVCGQWVTBW-STQMWFEESA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 2
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 2
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 2
- AJTBOTWDSRSUDV-ULQDDVLXSA-N His-Phe-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O AJTBOTWDSRSUDV-ULQDDVLXSA-N 0.000 description 2
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 2
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 2
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 2
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 2
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 2
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 2
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 2
- 108010058683 Immobilized Proteins Proteins 0.000 description 2
- 102000004551 Interleukin-10 Receptors Human genes 0.000 description 2
- 108010017550 Interleukin-10 Receptors Proteins 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 2
- IDGRADDMTTWOQC-WDSOQIARSA-N Leu-Trp-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IDGRADDMTTWOQC-WDSOQIARSA-N 0.000 description 2
- ZGGVHTQAPHVMKM-IHPCNDPISA-N Leu-Trp-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N ZGGVHTQAPHVMKM-IHPCNDPISA-N 0.000 description 2
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 2
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 2
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 2
- GFWLIJDQILOEPP-HSCHXYMDSA-N Lys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N GFWLIJDQILOEPP-HSCHXYMDSA-N 0.000 description 2
- CNGOEHJCLVCJHN-SRVKXCTJSA-N Lys-Pro-Glu Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O CNGOEHJCLVCJHN-SRVKXCTJSA-N 0.000 description 2
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 2
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- HMZPYMSEAALNAE-ULQDDVLXSA-N Lys-Val-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMZPYMSEAALNAE-ULQDDVLXSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 2
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 2
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 2
- 102100039364 Metalloproteinase inhibitor 1 Human genes 0.000 description 2
- 230000004988 N-glycosylation Effects 0.000 description 2
- 108091034117 Oligonucleotide Proteins 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- 108010033276 Peptide Fragments Proteins 0.000 description 2
- 102000007079 Peptide Fragments Human genes 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 2
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 2
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 2
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 2
- SRILZRSXIKRGBF-HRCADAONSA-N Phe-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N SRILZRSXIKRGBF-HRCADAONSA-N 0.000 description 2
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 2
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- APXXVISUHOLGEE-ILWGZMRPSA-N Phe-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=CC=C4)N)C(=O)O APXXVISUHOLGEE-ILWGZMRPSA-N 0.000 description 2
- ZTVSVSFBHUVYIN-UFYCRDLUSA-N Phe-Tyr-Met Chemical compound C([C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=C(O)C=C1 ZTVSVSFBHUVYIN-UFYCRDLUSA-N 0.000 description 2
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 2
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 2
- 108091000080 Phosphotransferase Proteins 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 2
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 2
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 2
- XJROSHJRQTXWAE-XGEHTFHBSA-N Pro-Cys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XJROSHJRQTXWAE-XGEHTFHBSA-N 0.000 description 2
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 2
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 2
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 2
- UEHYFUCOGHWASA-HJGDQZAQSA-N Pro-Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 UEHYFUCOGHWASA-HJGDQZAQSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 2
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 2
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 2
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 2
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 2
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 2
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 2
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 2
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 2
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 2
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 2
- QDDJNKWPTJHROJ-UFYCRDLUSA-N Pro-Tyr-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 QDDJNKWPTJHROJ-UFYCRDLUSA-N 0.000 description 2
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- 102000001253 Protein Kinase Human genes 0.000 description 2
- 108020005091 Replication Origin Proteins 0.000 description 2
- 101100497639 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) CYR1 gene Proteins 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 2
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 2
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 2
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- KJMOINFQVCCSDX-XKBZYTNZSA-N Ser-Gln-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KJMOINFQVCCSDX-XKBZYTNZSA-N 0.000 description 2
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 2
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 2
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 2
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 2
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 2
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 2
- PXIPVTKHYLBLMZ-UHFFFAOYSA-N Sodium azide Chemical compound [Na+].[N-]=[N+]=[N-] PXIPVTKHYLBLMZ-UHFFFAOYSA-N 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- VEWZSFGRQDUAJM-YJRXYDGGSA-N Thr-Cys-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O VEWZSFGRQDUAJM-YJRXYDGGSA-N 0.000 description 2
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 2
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 2
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 2
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- XGUAUKUYQHBUNY-SWRJLBSHSA-N Thr-Trp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XGUAUKUYQHBUNY-SWRJLBSHSA-N 0.000 description 2
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 2
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 2
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 2
- KXIQQAWIPDDVOE-BPUTZDHNSA-N Trp-Pro-Cys Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O KXIQQAWIPDDVOE-BPUTZDHNSA-N 0.000 description 2
- IKUMWSDCGQVGHC-UMPQAUOISA-N Trp-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O IKUMWSDCGQVGHC-UMPQAUOISA-N 0.000 description 2
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 2
- UPUNWAXSLPBMRK-XTWBLICNSA-N Trp-Thr-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UPUNWAXSLPBMRK-XTWBLICNSA-N 0.000 description 2
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 2
- XQMGDVVKFRLQKH-BBRMVZONSA-N Trp-Val-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O)=CNC2=C1 XQMGDVVKFRLQKH-BBRMVZONSA-N 0.000 description 2
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 2
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 2
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 2
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 2
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 2
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 2
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 2
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 2
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 2
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 2
- JHDZONWZTCKTJR-KJEVXHAQSA-N Tyr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JHDZONWZTCKTJR-KJEVXHAQSA-N 0.000 description 2
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 2
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 2
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- LZRWTJSPTJSWDN-FKBYEOEOSA-N Val-Trp-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O)N LZRWTJSPTJSWDN-FKBYEOEOSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 239000000654 additive Substances 0.000 description 2
- 239000000443 aerosol Substances 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010036533 arginylvaline Proteins 0.000 description 2
- 238000004166 bioassay Methods 0.000 description 2
- 238000007413 biotinylation Methods 0.000 description 2
- 230000006287 biotinylation Effects 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 210000001185 bone marrow Anatomy 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 230000008614 cellular interaction Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 238000012875 competitive assay Methods 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 210000004443 dendritic cell Anatomy 0.000 description 2
- 238000003745 diagnosis Methods 0.000 description 2
- 239000003085 diluting agent Substances 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 125000002228 disulfide group Chemical group 0.000 description 2
- 238000007877 drug screening Methods 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 238000003094 enzyme-multiplied immunoassay technique Methods 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010054666 glycyl-leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010037850 glycylvaline Proteins 0.000 description 2
- 210000003714 granulocyte Anatomy 0.000 description 2
- 230000003100 immobilizing effect Effects 0.000 description 2
- 230000016784 immunoglobulin production Effects 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000007918 intramuscular administration Methods 0.000 description 2
- 238000001990 intravenous administration Methods 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 210000005067 joint tissue Anatomy 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 210000001165 lymph node Anatomy 0.000 description 2
- 238000010841 mRNA extraction Methods 0.000 description 2
- 239000006249 magnetic particle Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 2
- 230000000877 morphologic effect Effects 0.000 description 2
- 239000003960 organic solvent Substances 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 description 2
- 210000001986 peyer's patch Anatomy 0.000 description 2
- 239000008194 pharmaceutical composition Substances 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 2
- 239000010452 phosphate Substances 0.000 description 2
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 2
- 150000008300 phosphoramidites Chemical class 0.000 description 2
- 102000020233 phosphotransferase Human genes 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 230000003389 potentiating effect Effects 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108060006633 protein kinase Proteins 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 230000028327 secretion Effects 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 230000019491 signal transduction Effects 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 239000003381 stabilizer Substances 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 238000005556 structure-activity relationship Methods 0.000 description 2
- 210000001550 testis Anatomy 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 230000005030 transcription termination Effects 0.000 description 2
- 102000027257 transmembrane receptors Human genes 0.000 description 2
- 108091008578 transmembrane receptors Proteins 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- SBKVPJHMSUXZTA-MEJXFZFPSA-N (2S)-2-[[(2S)-2-[[(2S)-1-[(2S)-5-amino-2-[[2-[[(2S)-1-[(2S)-6-amino-2-[[(2S)-2-[[(2S)-5-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-(1H-indol-3-yl)propanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-4-methylpentanoyl]amino]-5-oxopentanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]acetyl]amino]-5-oxopentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylsulfanylbutanoyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 SBKVPJHMSUXZTA-MEJXFZFPSA-N 0.000 description 1
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- WHTVZRBIWZFKQO-AWEZNQCLSA-N (S)-chloroquine Chemical compound ClC1=CC=C2C(N[C@@H](C)CCCN(CC)CC)=CC=NC2=C1 WHTVZRBIWZFKQO-AWEZNQCLSA-N 0.000 description 1
- RNAMYOYQYRYFQY-UHFFFAOYSA-N 2-(4,4-difluoropiperidin-1-yl)-6-methoxy-n-(1-propan-2-ylpiperidin-4-yl)-7-(3-pyrrolidin-1-ylpropoxy)quinazolin-4-amine Chemical compound N1=C(N2CCC(F)(F)CC2)N=C2C=C(OCCCN3CCCC3)C(OC)=CC2=C1NC1CCN(C(C)C)CC1 RNAMYOYQYRYFQY-UHFFFAOYSA-N 0.000 description 1
- NGNBDVOYPDDBFK-UHFFFAOYSA-N 2-[2,4-di(pentan-2-yl)phenoxy]acetyl chloride Chemical compound CCCC(C)C1=CC=C(OCC(Cl)=O)C(C(C)CCC)=C1 NGNBDVOYPDDBFK-UHFFFAOYSA-N 0.000 description 1
- VNDWQCSOSCCWIP-UHFFFAOYSA-N 2-tert-butyl-9-fluoro-1,6-dihydrobenzo[h]imidazo[4,5-f]isoquinolin-7-one Chemical compound C1=2C=CNC(=O)C=2C2=CC(F)=CC=C2C2=C1NC(C(C)(C)C)=N2 VNDWQCSOSCCWIP-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- WOVKYSAHUYNSMH-RRKCRQDMSA-N 5-bromodeoxyuridine Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-RRKCRQDMSA-N 0.000 description 1
- WRDABNWSWOHGMS-UHFFFAOYSA-N AEBSF hydrochloride Chemical compound Cl.NCCC1=CC=C(S(F)(=O)=O)C=C1 WRDABNWSWOHGMS-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- VIGKUFXFTPWYER-BIIVOSGPSA-N Ala-Cys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N VIGKUFXFTPWYER-BIIVOSGPSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- CZPAHAKGPDUIPJ-CIUDSAMLSA-N Ala-Gln-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CZPAHAKGPDUIPJ-CIUDSAMLSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- DCUCOIYYUBILPS-GUBZILKMSA-N Ala-Leu-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DCUCOIYYUBILPS-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- QPBSRMDNJOTFAL-AICCOOGYSA-N Ala-Leu-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QPBSRMDNJOTFAL-AICCOOGYSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 1
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- CQJHFKKGZXKZBC-BPNCWPANSA-N Ala-Pro-Tyr Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CQJHFKKGZXKZBC-BPNCWPANSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 101710187573 Alcohol dehydrogenase 2 Proteins 0.000 description 1
- 101710133776 Alcohol dehydrogenase class-3 Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 1
- QPOARHANPULOTM-GMOBBJLQSA-N Arg-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N QPOARHANPULOTM-GMOBBJLQSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- NAARDJBSSPUGCF-FXQIFTODSA-N Arg-Cys-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N NAARDJBSSPUGCF-FXQIFTODSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 1
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- DDBMKOCQWNFDBH-RHYQMDGZSA-N Arg-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O DDBMKOCQWNFDBH-RHYQMDGZSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- XXAOXVBAWLMTDR-ZLUOBGJFSA-N Asn-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N XXAOXVBAWLMTDR-ZLUOBGJFSA-N 0.000 description 1
- QRHYAUYXBVVDSB-LKXGYXEUSA-N Asn-Cys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QRHYAUYXBVVDSB-LKXGYXEUSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- OFQPMRDJVWLMNJ-CIUDSAMLSA-N Asn-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N OFQPMRDJVWLMNJ-CIUDSAMLSA-N 0.000 description 1
- SUEIIIFUBHDCCS-PBCZWWQYSA-N Asn-His-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUEIIIFUBHDCCS-PBCZWWQYSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- VOGCFWDZYYTEOY-DCAQKATOSA-N Asn-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N VOGCFWDZYYTEOY-DCAQKATOSA-N 0.000 description 1
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 1
- QGABLMITFKUQDF-DCAQKATOSA-N Asn-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGABLMITFKUQDF-DCAQKATOSA-N 0.000 description 1
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- FMNBYVSGRCXWEK-FOHZUACHSA-N Asn-Thr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O FMNBYVSGRCXWEK-FOHZUACHSA-N 0.000 description 1
- UXHYOWXTJLBEPG-GSSVUCPTSA-N Asn-Thr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UXHYOWXTJLBEPG-GSSVUCPTSA-N 0.000 description 1
- UPAGTDJAORYMEC-VHWLVUOQSA-N Asn-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)N)N UPAGTDJAORYMEC-VHWLVUOQSA-N 0.000 description 1
- YNQMEIJEWSHOEO-SRVKXCTJSA-N Asn-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YNQMEIJEWSHOEO-SRVKXCTJSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- AKPLMZMNJGNUKT-ZLUOBGJFSA-N Asp-Asp-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O AKPLMZMNJGNUKT-ZLUOBGJFSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- WLKVEEODTPQPLI-ACZMJKKPSA-N Asp-Gln-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WLKVEEODTPQPLI-ACZMJKKPSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- CMCIMCAQIULNDJ-CIUDSAMLSA-N Asp-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N CMCIMCAQIULNDJ-CIUDSAMLSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- YVHGKXAOSVBGJV-CIUDSAMLSA-N Asp-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N YVHGKXAOSVBGJV-CIUDSAMLSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- KNOGLZBISUBTFW-QRTARXTBSA-N Asp-Trp-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O KNOGLZBISUBTFW-QRTARXTBSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 101100455762 Aspergillus niger (strain CBS 513.88 / FGSC A1513) lysA gene Proteins 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 208000004736 B-Cell Leukemia Diseases 0.000 description 1
- WOVKYSAHUYNSMH-UHFFFAOYSA-N BROMODEOXYURIDINE Natural products C1C(O)C(CO)OC1N1C(=O)NC(=O)C(Br)=C1 WOVKYSAHUYNSMH-UHFFFAOYSA-N 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 101100392772 Caenorhabditis elegans gln-2 gene Proteins 0.000 description 1
- 101100533230 Caenorhabditis elegans ser-2 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 201000009030 Carcinoma Diseases 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 1
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- DVKQPQKQDHHFTE-ZLUOBGJFSA-N Cys-Cys-Asn Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)N DVKQPQKQDHHFTE-ZLUOBGJFSA-N 0.000 description 1
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 1
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 1
- UYYZZJXUVIZTMH-AVGNSLFASA-N Cys-Glu-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UYYZZJXUVIZTMH-AVGNSLFASA-N 0.000 description 1
- LKUCSUGWHYVYLP-GHCJXIJMSA-N Cys-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N LKUCSUGWHYVYLP-GHCJXIJMSA-N 0.000 description 1
- ZLHPWFSAUJEEAN-KBIXCLLPSA-N Cys-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N ZLHPWFSAUJEEAN-KBIXCLLPSA-N 0.000 description 1
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- BBQIWFFTTQTNOC-AVGNSLFASA-N Cys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N BBQIWFFTTQTNOC-AVGNSLFASA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- UEHCDNYDBBCQEL-CIUDSAMLSA-N Cys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N UEHCDNYDBBCQEL-CIUDSAMLSA-N 0.000 description 1
- OEDPLIBVQGRKGZ-AVGNSLFASA-N Cys-Tyr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O OEDPLIBVQGRKGZ-AVGNSLFASA-N 0.000 description 1
- ZFHXNNXMNLWKJH-HJPIBITLSA-N Cys-Tyr-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZFHXNNXMNLWKJH-HJPIBITLSA-N 0.000 description 1
- VXDXZGYXHIADHF-YJRXYDGGSA-N Cys-Tyr-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VXDXZGYXHIADHF-YJRXYDGGSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 1
- ATFSDBMHRCDLBV-BPUTZDHNSA-N Cys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N ATFSDBMHRCDLBV-BPUTZDHNSA-N 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000233756 Fabriciana elisa Species 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 1
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 1
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 1
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 1
- CXFUMJQFZVCETK-FXQIFTODSA-N Gln-Cys-Gln Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O CXFUMJQFZVCETK-FXQIFTODSA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- AJDMYLOISOCHHC-YVNDNENWSA-N Gln-Gln-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AJDMYLOISOCHHC-YVNDNENWSA-N 0.000 description 1
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- FYAULIGIFPPOAA-ZPFDUUQYSA-N Gln-Ile-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O FYAULIGIFPPOAA-ZPFDUUQYSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- CVRUVYDNRPSKBM-QEJZJMRPSA-N Gln-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N CVRUVYDNRPSKBM-QEJZJMRPSA-N 0.000 description 1
- CTJRFALAOYAJBX-NWLDYVSISA-N Gln-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N)O CTJRFALAOYAJBX-NWLDYVSISA-N 0.000 description 1
- WPJDPEOQUIXXOY-AVGNSLFASA-N Gln-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WPJDPEOQUIXXOY-AVGNSLFASA-N 0.000 description 1
- OACQOWPRWGNKTP-AVGNSLFASA-N Gln-Tyr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O OACQOWPRWGNKTP-AVGNSLFASA-N 0.000 description 1
- YLABFXCRQQMMHS-AVGNSLFASA-N Gln-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YLABFXCRQQMMHS-AVGNSLFASA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- QXQDADBVIBLBHN-FHWLQOOXSA-N Gln-Tyr-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QXQDADBVIBLBHN-FHWLQOOXSA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- BBFCMGBMYIAGRS-AUTRQRHGSA-N Gln-Val-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BBFCMGBMYIAGRS-AUTRQRHGSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- PBEQPAZRHDVJQI-SRVKXCTJSA-N Glu-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N PBEQPAZRHDVJQI-SRVKXCTJSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 1
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- HOIPREWORBVRLD-XIRDDKMYSA-N Glu-Met-Trp Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O HOIPREWORBVRLD-XIRDDKMYSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- JYXKPJVDCAWMDG-ZPFDUUQYSA-N Glu-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N JYXKPJVDCAWMDG-ZPFDUUQYSA-N 0.000 description 1
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- JPUNZXVHHRZMNL-XIRDDKMYSA-N Glu-Pro-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JPUNZXVHHRZMNL-XIRDDKMYSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 1
- DDXZHOHEABQXSE-NKIYYHGXSA-N Glu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O DDXZHOHEABQXSE-NKIYYHGXSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- UQULNJAARAXSPO-ZCWPNWOLSA-N Glu-Thr-Thr-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UQULNJAARAXSPO-ZCWPNWOLSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- KCCNSVHJSMMGFS-NRPADANISA-N Glu-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KCCNSVHJSMMGFS-NRPADANISA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- SXRSQZLOMIGNAQ-UHFFFAOYSA-N Glutaraldehyde Chemical compound O=CCCCC=O SXRSQZLOMIGNAQ-UHFFFAOYSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- IVSWQHKONQIOHA-YUMQZZPRSA-N Gly-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN IVSWQHKONQIOHA-YUMQZZPRSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 1
- TTYVAUJGNMVTRN-GJZGRUSLSA-N Gly-Met-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)CN TTYVAUJGNMVTRN-GJZGRUSLSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- IMRNSEPSPFQNHF-STQMWFEESA-N Gly-Ser-Trp Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O IMRNSEPSPFQNHF-STQMWFEESA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 1
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- QZAFGJNKLMNDEM-DCAQKATOSA-N His-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 QZAFGJNKLMNDEM-DCAQKATOSA-N 0.000 description 1
- NOQPTNXSGNPJNS-YUMQZZPRSA-N His-Asn-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O NOQPTNXSGNPJNS-YUMQZZPRSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- CMQOGWZUKPHLHL-DCAQKATOSA-N His-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N CMQOGWZUKPHLHL-DCAQKATOSA-N 0.000 description 1
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 1
- HIAHVKLTHNOENC-HGNGGELXSA-N His-Glu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HIAHVKLTHNOENC-HGNGGELXSA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 1
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 1
- IGBBXBFSLKRHJB-BZSNNMDCSA-N His-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 IGBBXBFSLKRHJB-BZSNNMDCSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- XDIVYNSPYBLSME-DCAQKATOSA-N His-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N XDIVYNSPYBLSME-DCAQKATOSA-N 0.000 description 1
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 1
- LNDVNHOSZQPJGI-AVGNSLFASA-N His-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CN=CN1 LNDVNHOSZQPJGI-AVGNSLFASA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- RXKFKJVJVHLRIE-XIRDDKMYSA-N His-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CN=CN3)N RXKFKJVJVHLRIE-XIRDDKMYSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- JVEKQAYXFGIISZ-HOCLYGCPSA-N His-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JVEKQAYXFGIISZ-HOCLYGCPSA-N 0.000 description 1
- PDLQNLSEJXOQNQ-IHPCNDPISA-N His-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CN=CN1 PDLQNLSEJXOQNQ-IHPCNDPISA-N 0.000 description 1
- YBDOQKVAGTWZMI-XIRDDKMYSA-N His-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N YBDOQKVAGTWZMI-XIRDDKMYSA-N 0.000 description 1
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 1
- YKUAGFAXQRYUQW-KKUMJFAQSA-N His-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O YKUAGFAXQRYUQW-KKUMJFAQSA-N 0.000 description 1
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001057504 Homo sapiens Interferon-stimulated gene 20 kDa protein Proteins 0.000 description 1
- 101001055144 Homo sapiens Interleukin-2 receptor subunit alpha Proteins 0.000 description 1
- 101000852964 Homo sapiens Interleukin-27 subunit beta Proteins 0.000 description 1
- 101000946889 Homo sapiens Monocyte differentiation antigen CD14 Proteins 0.000 description 1
- 101000845170 Homo sapiens Thymic stromal lymphopoietin Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- WNQKUUQIVDDAFA-ZPFDUUQYSA-N Ile-Gln-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N WNQKUUQIVDDAFA-ZPFDUUQYSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- CCYGNFBYUNHFSC-MGHWNKPDSA-N Ile-His-Phe Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CCYGNFBYUNHFSC-MGHWNKPDSA-N 0.000 description 1
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- AKQFLPNANHNTLP-VKOGCVSHSA-N Ile-Pro-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N AKQFLPNANHNTLP-VKOGCVSHSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 1
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- JDCQDJVYUXNCGF-SPOWBLRKSA-N Ile-Ser-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JDCQDJVYUXNCGF-SPOWBLRKSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- WKSHBPRUIRGWRZ-KCTSRDHCSA-N Ile-Trp-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N WKSHBPRUIRGWRZ-KCTSRDHCSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- PRTZQMBYUZFSFA-XEGUGMAKSA-N Ile-Tyr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)NCC(=O)O)N PRTZQMBYUZFSFA-XEGUGMAKSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 1
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 1
- 108010054267 Interferon Receptors Proteins 0.000 description 1
- 102000001617 Interferon Receptors Human genes 0.000 description 1
- 102000000589 Interleukin-1 Human genes 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 102000004553 Interleukin-11 Receptors Human genes 0.000 description 1
- 108010017521 Interleukin-11 Receptors Proteins 0.000 description 1
- 102100026878 Interleukin-2 receptor subunit alpha Human genes 0.000 description 1
- 102100036712 Interleukin-27 subunit beta Human genes 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 102000004889 Interleukin-6 Human genes 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 102100037792 Interleukin-6 receptor subunit alpha Human genes 0.000 description 1
- 101710185757 Interleukin-6 receptor subunit alpha Proteins 0.000 description 1
- 102100026244 Interleukin-9 receptor Human genes 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 102000042838 JAK family Human genes 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical group NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 101150118523 LYS4 gene Proteins 0.000 description 1
- 208000006404 Large Granular Lymphocytic Leukemia Diseases 0.000 description 1
- 208000018142 Leiomyosarcoma Diseases 0.000 description 1
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 1
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 1
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 1
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- TVEOVCYCYGKVPP-HSCHXYMDSA-N Leu-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N TVEOVCYCYGKVPP-HSCHXYMDSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 1
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- SQUFDMCWMFOEBA-KKUMJFAQSA-N Leu-Ser-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SQUFDMCWMFOEBA-KKUMJFAQSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- SXOFUVGLPHCPRQ-KKUMJFAQSA-N Leu-Tyr-Cys Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(O)=O SXOFUVGLPHCPRQ-KKUMJFAQSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 1
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- SFQPJNQDUUYCLA-BJDJZHNGSA-N Lys-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N SFQPJNQDUUYCLA-BJDJZHNGSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- GUYHHBZCBQZLFW-GUBZILKMSA-N Lys-Gln-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUYHHBZCBQZLFW-GUBZILKMSA-N 0.000 description 1
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- JPYPRVHMKRFTAT-KKUMJFAQSA-N Lys-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N JPYPRVHMKRFTAT-KKUMJFAQSA-N 0.000 description 1
- PIXVFCBYEGPZPA-JYJNAYRXSA-N Lys-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N PIXVFCBYEGPZPA-JYJNAYRXSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 1
- 108010046938 Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102100028123 Macrophage colony-stimulating factor 1 Human genes 0.000 description 1
- PEEHTFAAVSWFBL-UHFFFAOYSA-N Maleimide Chemical compound O=C1NC(=O)C=C1 PEEHTFAAVSWFBL-UHFFFAOYSA-N 0.000 description 1
- 108010038049 Mating Factor Proteins 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- JUXONJROIXKHEV-GUBZILKMSA-N Met-Cys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCNC(N)=N JUXONJROIXKHEV-GUBZILKMSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- TZHFJXDKXGZHEN-IHRRRGAJSA-N Met-His-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O TZHFJXDKXGZHEN-IHRRRGAJSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- QEDGNYFHLXXIDC-DCAQKATOSA-N Met-Pro-Gln Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O QEDGNYFHLXXIDC-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- TUZSWDCTCGTVDJ-PJODQICGSA-N Met-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 TUZSWDCTCGTVDJ-PJODQICGSA-N 0.000 description 1
- BAQCROVBDNBEEB-UBYUBLNFSA-N Metrizamide Chemical compound CC(=O)N(C)C1=C(I)C(NC(C)=O)=C(I)C(C(=O)N[C@@H]2[C@H]([C@H](O)[C@@H](CO)OC2O)O)=C1I BAQCROVBDNBEEB-UBYUBLNFSA-N 0.000 description 1
- 102100035877 Monocyte differentiation antigen CD14 Human genes 0.000 description 1
- 102000013967 Monokines Human genes 0.000 description 1
- 108010050619 Monokines Proteins 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 101100125854 Mus musculus Il18 gene Proteins 0.000 description 1
- 108010021466 Mutant Proteins Proteins 0.000 description 1
- 102000008300 Mutant Proteins Human genes 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- BZQFBWGGLXLEPQ-UHFFFAOYSA-N O-phosphoryl-L-serine Natural products OC(=O)C(N)COP(O)(O)=O BZQFBWGGLXLEPQ-UHFFFAOYSA-N 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 241000414651 Omosita colon Species 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 238000010222 PCR analysis Methods 0.000 description 1
- 229930040373 Paraformaldehyde Natural products 0.000 description 1
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- CUMXHKAOHNWRFQ-BZSNNMDCSA-N Phe-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CUMXHKAOHNWRFQ-BZSNNMDCSA-N 0.000 description 1
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 1
- OWCLJDXHHZUNEL-IHRRRGAJSA-N Phe-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OWCLJDXHHZUNEL-IHRRRGAJSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 1
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- ROOQMPCUFLDOSB-FHWLQOOXSA-N Phe-Phe-Gln Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CC=CC=C1 ROOQMPCUFLDOSB-FHWLQOOXSA-N 0.000 description 1
- TXJJXEXCZBHDNA-ACRUOGEOSA-N Phe-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N TXJJXEXCZBHDNA-ACRUOGEOSA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- CXMSESHALPOLRE-MEYUZBJRSA-N Phe-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O CXMSESHALPOLRE-MEYUZBJRSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- VGTJSEYTVMAASM-RPTUDFQQSA-N Phe-Thr-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VGTJSEYTVMAASM-RPTUDFQQSA-N 0.000 description 1
- AOKZOUGUMLBPSS-PMVMPFDFSA-N Phe-Trp-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O AOKZOUGUMLBPSS-PMVMPFDFSA-N 0.000 description 1
- WDOCBGZHAQQIBL-IHPCNDPISA-N Phe-Trp-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 WDOCBGZHAQQIBL-IHPCNDPISA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- 102000009097 Phosphorylases Human genes 0.000 description 1
- 108010073135 Phosphorylases Proteins 0.000 description 1
- 108010047620 Phytohemagglutinins Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- JARJPEMLQAWNBR-GUBZILKMSA-N Pro-Asp-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JARJPEMLQAWNBR-GUBZILKMSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 1
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 1
- AUQGUYPHJSMAKI-CYDGBPFRSA-N Pro-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 AUQGUYPHJSMAKI-CYDGBPFRSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- HOTVCUAVDQHUDB-UFYCRDLUSA-N Pro-Phe-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 HOTVCUAVDQHUDB-UFYCRDLUSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 1
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 101100408135 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) phnA gene Proteins 0.000 description 1
- 239000012614 Q-Sepharose Substances 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 108020005067 RNA Splice Sites Proteins 0.000 description 1
- 206010039491 Sarcoma Diseases 0.000 description 1
- 101100289883 Schizosaccharomyces pombe (strain 972 / ATCC 24843) lys2 gene Proteins 0.000 description 1
- 229920005654 Sephadex Polymers 0.000 description 1
- 239000012507 Sephadex™ Substances 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- UCXDHBORXLVBNC-ZLUOBGJFSA-N Ser-Asn-Cys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O UCXDHBORXLVBNC-ZLUOBGJFSA-N 0.000 description 1
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 1
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- MPPHJZYXDVDGOF-BWBBJGPYSA-N Ser-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CO MPPHJZYXDVDGOF-BWBBJGPYSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- ULVMNZOKDBHKKI-ACZMJKKPSA-N Ser-Gln-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ULVMNZOKDBHKKI-ACZMJKKPSA-N 0.000 description 1
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- MOQDPPUMFSMYOM-KKUMJFAQSA-N Ser-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N MOQDPPUMFSMYOM-KKUMJFAQSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 1
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 1
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 1
- XPVIVVLLLOFBRH-XIRDDKMYSA-N Ser-Trp-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)CO)C(O)=O XPVIVVLLLOFBRH-XIRDDKMYSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 1
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 108010071390 Serum Albumin Proteins 0.000 description 1
- 102000007562 Serum Albumin Human genes 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- STGXWWBXWXZOER-MBLNEYKQSA-N Thr-Ala-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 STGXWWBXWXZOER-MBLNEYKQSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 1
- LOHBIDZYHQQTDM-IXOXFDKPSA-N Thr-Cys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LOHBIDZYHQQTDM-IXOXFDKPSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 1
- HPQHHRLWSAMMKG-KATARQTJSA-N Thr-Lys-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N)O HPQHHRLWSAMMKG-KATARQTJSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 1
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 1
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 1
- FBQHKSPOIAFUEI-OWLDWWDNSA-N Thr-Trp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O FBQHKSPOIAFUEI-OWLDWWDNSA-N 0.000 description 1
- KHTIUAKJRUIEMA-HOUAVDHOSA-N Thr-Trp-Asp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 KHTIUAKJRUIEMA-HOUAVDHOSA-N 0.000 description 1
- GJOBRAHDRIDAPT-NGTWOADLSA-N Thr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H]([C@@H](C)O)N GJOBRAHDRIDAPT-NGTWOADLSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 102100031294 Thymic stromal lymphopoietin Human genes 0.000 description 1
- 102000002689 Toll-like receptor Human genes 0.000 description 1
- 108020000411 Toll-like receptor Proteins 0.000 description 1
- 240000001085 Trapa natans Species 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 1
- TWJDQTTXXZDJKV-BPUTZDHNSA-N Trp-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O TWJDQTTXXZDJKV-BPUTZDHNSA-N 0.000 description 1
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- FKAPNDWDLDWZNF-QEJZJMRPSA-N Trp-Asp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FKAPNDWDLDWZNF-QEJZJMRPSA-N 0.000 description 1
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- OBAMASZCXDIXSS-SZMVWBNQSA-N Trp-Glu-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N OBAMASZCXDIXSS-SZMVWBNQSA-N 0.000 description 1
- UDCHKDYNMRJYMI-QEJZJMRPSA-N Trp-Glu-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UDCHKDYNMRJYMI-QEJZJMRPSA-N 0.000 description 1
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 1
- DVWAIHZOPSYMSJ-ZVZYQTTQSA-N Trp-Glu-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 DVWAIHZOPSYMSJ-ZVZYQTTQSA-N 0.000 description 1
- NXQAOORHSYJRGH-AAEUAGOBSA-N Trp-Gly-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 NXQAOORHSYJRGH-AAEUAGOBSA-N 0.000 description 1
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 1
- BONYBFXWMXBAND-GQGQLFGLSA-N Trp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BONYBFXWMXBAND-GQGQLFGLSA-N 0.000 description 1
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
- ZHZLQVLQBDBQCQ-WDSOQIARSA-N Trp-Lys-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ZHZLQVLQBDBQCQ-WDSOQIARSA-N 0.000 description 1
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 1
- KXFYAQUYJKOQMI-QEJZJMRPSA-N Trp-Ser-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 KXFYAQUYJKOQMI-QEJZJMRPSA-N 0.000 description 1
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 1
- MPYZGXUYLNPSNF-NAZCDGGXSA-N Trp-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)O MPYZGXUYLNPSNF-NAZCDGGXSA-N 0.000 description 1
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 1
- YCQKQFKXBPJXRY-PMVMPFDFSA-N Trp-Tyr-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCCCN)C(=O)O)N YCQKQFKXBPJXRY-PMVMPFDFSA-N 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- SGFIXFAHVWJKTD-KJEVXHAQSA-N Tyr-Arg-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SGFIXFAHVWJKTD-KJEVXHAQSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- XKDOQXAXKFQWQJ-SRVKXCTJSA-N Tyr-Cys-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O XKDOQXAXKFQWQJ-SRVKXCTJSA-N 0.000 description 1
- WEFIPBYPXZYPHD-HJPIBITLSA-N Tyr-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N WEFIPBYPXZYPHD-HJPIBITLSA-N 0.000 description 1
- YLRLHDFMMWDYTK-KKUMJFAQSA-N Tyr-Cys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 YLRLHDFMMWDYTK-KKUMJFAQSA-N 0.000 description 1
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 1
- UXUFNBVCPAWACG-SIUGBPQLSA-N Tyr-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N UXUFNBVCPAWACG-SIUGBPQLSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- ADECJAKCRKPSOR-ULQDDVLXSA-N Tyr-His-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ADECJAKCRKPSOR-ULQDDVLXSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 1
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- IGXLNVIYDYONFB-UFYCRDLUSA-N Tyr-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 IGXLNVIYDYONFB-UFYCRDLUSA-N 0.000 description 1
- VBFVQTPETKJCQW-RPTUDFQQSA-N Tyr-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VBFVQTPETKJCQW-RPTUDFQQSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- RCMWNNJFKNDKQR-UFYCRDLUSA-N Tyr-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 RCMWNNJFKNDKQR-UFYCRDLUSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- QFHRUCJIRVILCK-YJRXYDGGSA-N Tyr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O QFHRUCJIRVILCK-YJRXYDGGSA-N 0.000 description 1
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- XTOCLOATLKOZAU-JBACZVJFSA-N Tyr-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N XTOCLOATLKOZAU-JBACZVJFSA-N 0.000 description 1
- JQOMHZMWQHXALX-FHWLQOOXSA-N Tyr-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JQOMHZMWQHXALX-FHWLQOOXSA-N 0.000 description 1
- MJUTYRIMFIICKL-JYJNAYRXSA-N Tyr-Val-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MJUTYRIMFIICKL-JYJNAYRXSA-N 0.000 description 1
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 1
- GOPQNCQSXBJAII-ULQDDVLXSA-N Tyr-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GOPQNCQSXBJAII-ULQDDVLXSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- PVPAOIGJYHVWBT-KKHAAJSZSA-N Val-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N)O PVPAOIGJYHVWBT-KKHAAJSZSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- FRUYSSRPJXNRRB-GUBZILKMSA-N Val-Cys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FRUYSSRPJXNRRB-GUBZILKMSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 1
- LMSBRIVOCYOKMU-NRPADANISA-N Val-Gln-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N LMSBRIVOCYOKMU-NRPADANISA-N 0.000 description 1
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- MANXHLOVEUHVFD-DCAQKATOSA-N Val-His-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N MANXHLOVEUHVFD-DCAQKATOSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 1
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 1
- WBAJDGWKRIHOAC-GVXVVHGQSA-N Val-Lys-Gln Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O WBAJDGWKRIHOAC-GVXVVHGQSA-N 0.000 description 1
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- PWCJARIQERIIGF-BZSNNMDCSA-N Val-Met-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N PWCJARIQERIIGF-BZSNNMDCSA-N 0.000 description 1
- MJFSRZZJQWZHFQ-SRVKXCTJSA-N Val-Met-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N MJFSRZZJQWZHFQ-SRVKXCTJSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 208000023940 X-Linked Combined Immunodeficiency disease Diseases 0.000 description 1
- INAPMGSXUVUWAF-GCVPSNMTSA-N [(2r,3s,5r,6r)-2,3,4,5,6-pentahydroxycyclohexyl] dihydrogen phosphate Chemical compound OC1[C@H](O)[C@@H](O)C(OP(O)(O)=O)[C@H](O)[C@@H]1O INAPMGSXUVUWAF-GCVPSNMTSA-N 0.000 description 1
- 239000002250 absorbent Substances 0.000 description 1
- 230000002745 absorbent Effects 0.000 description 1
- 230000009102 absorption Effects 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000008065 acid anhydrides Chemical class 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 210000000577 adipose tissue Anatomy 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 238000003450 affinity purification method Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000001270 agonistic effect Effects 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 125000001931 aliphatic group Chemical group 0.000 description 1
- 150000001336 alkenes Chemical class 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- 102000004139 alpha-Amylases Human genes 0.000 description 1
- 108090000637 alpha-Amylases Proteins 0.000 description 1
- 229940024171 alpha-amylase Drugs 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 150000001408 amides Chemical class 0.000 description 1
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 1
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 235000011130 ammonium sulphate Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 150000008064 anhydrides Chemical class 0.000 description 1
- 238000003975 animal breeding Methods 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000003042 antagnostic effect Effects 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 230000002917 arthritic effect Effects 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 102000012740 beta Adrenergic Receptors Human genes 0.000 description 1
- 108010079452 beta Adrenergic Receptors Proteins 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000008512 biological response Effects 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 229940098773 bovine serum albumin Drugs 0.000 description 1
- 125000005997 bromomethyl group Chemical group 0.000 description 1
- 229950004398 broxuridine Drugs 0.000 description 1
- 239000007975 buffered saline Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 150000001718 carbodiimides Chemical class 0.000 description 1
- 150000001720 carbohydrates Chemical group 0.000 description 1
- PFKFTWBEEFSNDU-UHFFFAOYSA-N carbonyldiimidazole Chemical compound C1=CN=CN1C(=O)N1C=CN=C1 PFKFTWBEEFSNDU-UHFFFAOYSA-N 0.000 description 1
- 239000013592 cell lysate Substances 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000013522 chelant Substances 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- VDQQXEISLMTGAB-UHFFFAOYSA-N chloramine T Chemical compound [Na+].CC1=CC=C(S(=O)(=O)[N-]Cl)C=C1 VDQQXEISLMTGAB-UHFFFAOYSA-N 0.000 description 1
- 125000002668 chloroacetyl group Chemical group ClCC(=O)* 0.000 description 1
- 125000004218 chloromethyl group Chemical group [H]C([H])(Cl)* 0.000 description 1
- 229960003677 chloroquine Drugs 0.000 description 1
- WHTVZRBIWZFKQO-UHFFFAOYSA-N chloroquine Natural products ClC1=CC=C2C(NC(C)CCCN(CC)CC)=CC=NC2=C1 WHTVZRBIWZFKQO-UHFFFAOYSA-N 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000014107 chromosome localization Effects 0.000 description 1
- 238000001142 circular dichroism spectrum Methods 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000006957 competitive inhibition Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000010205 computational analysis Methods 0.000 description 1
- 238000009833 condensation Methods 0.000 description 1
- 230000005494 condensation Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000003431 cross linking reagent Substances 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 231100000433 cytotoxic Toxicity 0.000 description 1
- 230000001472 cytotoxic effect Effects 0.000 description 1
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 1
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000022811 deglycosylation Effects 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 229950006137 dexfosfoserine Drugs 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000002050 diffraction method Methods 0.000 description 1
- 238000009792 diffusion process Methods 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 229910000397 disodium phosphate Inorganic materials 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 238000007878 drug screening assay Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 208000037828 epithelial carcinoma Diseases 0.000 description 1
- 102000015694 estrogen receptors Human genes 0.000 description 1
- 108010038795 estrogen receptors Proteins 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 230000003328 fibroblastic effect Effects 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000002875 fluorescence polarization Methods 0.000 description 1
- 238000004108 freeze drying Methods 0.000 description 1
- 210000000232 gallbladder Anatomy 0.000 description 1
- 108010074605 gamma-Globulins Proteins 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 1
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 125000004970 halomethyl group Chemical group 0.000 description 1
- 210000003958 hematopoietic stem cell Anatomy 0.000 description 1
- 230000002607 hemopoietic effect Effects 0.000 description 1
- 239000000833 heterodimer Substances 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 210000004754 hybrid cell Anatomy 0.000 description 1
- 125000004029 hydroxymethyl group Chemical group [H]OC([H])([H])* 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000010166 immunofluorescence Methods 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 238000007901 in situ hybridization Methods 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000002757 inflammatory effect Effects 0.000 description 1
- 230000028709 inflammatory response Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000000266 injurious effect Effects 0.000 description 1
- 108010085650 interferon gamma receptor Proteins 0.000 description 1
- 108040006849 interleukin-2 receptor activity proteins Proteins 0.000 description 1
- 108040006862 interleukin-9 receptor activity proteins Proteins 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010045069 keyhole-limpet hemocyanin Proteins 0.000 description 1
- 201000010260 leiomyoma Diseases 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 238000000670 ligand binding assay Methods 0.000 description 1
- 108020001756 ligand binding domains Proteins 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010043322 lysyl-tryptophyl-alpha-lysine Proteins 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 210000003519 mature b lymphocyte Anatomy 0.000 description 1
- 238000002483 medication Methods 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 229960000554 metrizamide Drugs 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000001002 morphogenetic effect Effects 0.000 description 1
- 230000000921 morphogenic effect Effects 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 210000000754 myometrium Anatomy 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 230000010807 negative regulation of binding Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 238000001216 nucleic acid method Methods 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- JRZJOMJEPLMPRA-UHFFFAOYSA-N olefin Natural products CCCCCCCC=C JRZJOMJEPLMPRA-UHFFFAOYSA-N 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 210000002741 palatine tonsil Anatomy 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 229920002866 paraformaldehyde Polymers 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 108010011903 peptide receptors Proteins 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 239000012071 phase Substances 0.000 description 1
- 239000005011 phenolic resin Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- NMHMNPHRMNGLLB-UHFFFAOYSA-N phloretic acid Chemical compound OC(=O)CCC1=CC=C(O)C=C1 NMHMNPHRMNGLLB-UHFFFAOYSA-N 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- USRGIUJOYOXOQJ-GBXIJSLDSA-N phosphothreonine Chemical compound OP(=O)(O)O[C@H](C)[C@H](N)C(O)=O USRGIUJOYOXOQJ-GBXIJSLDSA-N 0.000 description 1
- DCWXELXMIBXGTH-UHFFFAOYSA-N phosphotyrosine Chemical group OC(=O)C(N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-UHFFFAOYSA-N 0.000 description 1
- 230000035790 physiological processes and functions Effects 0.000 description 1
- 230000006461 physiological response Effects 0.000 description 1
- 230000001885 phytohemagglutinin Effects 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 229920000098 polyolefin Polymers 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 238000004393 prognosis Methods 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 238000002708 random mutagenesis Methods 0.000 description 1
- 229910052761 rare earth metal Inorganic materials 0.000 description 1
- 150000002910 rare earth metals Chemical group 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000001525 receptor binding assay Methods 0.000 description 1
- 102000027426 receptor tyrosine kinases Human genes 0.000 description 1
- 108091008598 receptor tyrosine kinases Proteins 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 102000037983 regulatory factors Human genes 0.000 description 1
- 108091008025 regulatory factors Proteins 0.000 description 1
- 239000012508 resin bead Substances 0.000 description 1
- 230000004043 responsiveness Effects 0.000 description 1
- 238000003757 reverse transcription PCR Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000011896 sensitive detection Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 230000000405 serological effect Effects 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 230000007781 signaling event Effects 0.000 description 1
- 230000009131 signaling function Effects 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 210000000813 small intestine Anatomy 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 239000006104 solid solution Substances 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000006277 sulfonation reaction Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 229960000814 tetanus toxoid Drugs 0.000 description 1
- 150000003568 thioethers Chemical class 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 101150044170 trpE gene Proteins 0.000 description 1
- 108010029384 tryptophyl-histidine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 108010071635 tyrosyl-prolyl-arginine Proteins 0.000 description 1
- 108010077037 tyrosyl-tyrosyl-phenylalanine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 210000004291 uterus Anatomy 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000003245 working effect Effects 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/715—Receptors; Cell surface antigens; Cell surface determinants for cytokines; for lymphokines; for interferons
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; AVICULTURE; APICULTURE; PISCICULTURE; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Immunology (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Toxicology (AREA)
- Cell Biology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
Nucleic acids encoding mammalian, e.g., primate, receptors, purified receptor proteins and fragments thereof. Antibodies, both polyclonal and monoclonal, are also provided. Methods of using the compositions for both diagnostic and therapeutic utilities are described.
Description
MAMMALIAN RECEPTOR PROTEINS;
RELATED REAGENTS AND METHODS
This filing claims priority to U.S. Patent Application 09/443,060, filed November 18, 1999, and U.S. Application 60/170,320, filed December 13, 1999, each of which is incorporated herein by reference.
FIELD OF THE INVENTION
The present invention relates to compositions and methods for affecting mammalian physiology, including immune system function. In particular, it provides methods to regulate development and/or the immune system. Diagnostic and therapeutic uses of these materials are also disclosed.
BACKGROUND OF THE INVENTION
Recombinant DNA technology refers generally to techniques of integrating genetic information from a donor source into vectors for subsequent processing, such as through introduction into a host, whereby the transferred genetic information is copied and/or expressed in the new environment. Commonly, the genetic information exists in the form of complementary DNA
(cDNA) derived from messenger RNA (mRNA) coding for a desired protein product. The carrier is frequently a plasmid having the capacity to incorporate cDNA for later replication in a host and, in some cases, actually to control expression of the cDNA
and thereby direct synthesis of the encoded product in the host.
See, e.g., Sambrook, et al. (1989) Molecular Cloning: A
Laboratory Manual, (2d ed.) vols.~ 1-3, CSH Press, NY.
For some time, it has been known that the mammalian immune response is based on a series of complex cellular interactions, called the "immune network". Recent research has provided new insights into the inner workings of this network. While it remains clear that much of the immune response does, in fact, revolve around the network-like interactions of lymphocytes, macrophages, granulocytes, and other cells, immunologists now generally hold the opinion that soluble proteins, known as lymphokines, cytokines, or monokines, play critical roles in controlling these cellular interactions. Thus, there is considerable interest in the isolation, characterization, and mechanisms of action of cell modulatory factors, an understanding of which will lead to significant advancements in the diagnosis and therapy of numerous medical abnormalities, e.g., immune system disorders.
Lymphokines apparently mediate cellular activities in a 10~ variety of ways. See, e.g., Paul (ed. 1996) Fundamental Immunoloav 3d ed., Raven Press, New York; and Thomson (ed. 1994) The Cytokine Handbook 2d ed., Academic Press, San Diego. They have been shown to support the proliferation, growth, and/or differentiation of pluripotential hematopoietic stem cells into vast numbers of progenitors comprising diverse cellular lineages which make up a complex immune system. Proper and balanced interactions between the cellular components are necessary for a healthy immune response. The different cellular lineages often respond in a different manner when lymphokines are administered in conjunction with other agents.
Cell lineages especially important to the immune response include two classes of lymphocytes: B-cells, which can produce and secrete immunoglobulins (proteins with the capability of recognizing and binding to foreign matter to effect its removal), and T-cells of various subsets that secrete lymphokines and induce or suppress the B-cells and various other cells (including other T-cells) making up the immune network.
These lymphocytes interact with many other cell types.
Research to better understand and treat various immune disorders has been hampered by the general inability to maintain cells of the immune system in vitro. Immunologists have discovered that culturing many of these cells can be accomplished through the use of T-cell and other cell supernatants, which contain various growth factors, including many of the lymphokines.
Various growth and regulatory factors exist which modulate morphogenetic development. Many receptors for cytokines are known. Often, there are at least two critical subunits in the functional receptor. See, e.g., Gonda and D'Andrea (1997) Blood 89:355-369; Presky, et al. (1996) Proc. Nat'1 Acad. Sci..USA
93:14002-14007; Drachman and Kaushansky (1995) Curr. Opin.
Hematol. 2:22-28; Theze (1994) Eur. Cytokine Netw. 5:353-368;
and Lemmon and Schlessinger (1994) Trends Biochem. Sci. 19:459-463.
From the foregoing, it is evident that the discovery and development of new receptors, including ones similar to known 10~ receptors for lymphokines, should contribute to new therapies .
In particular, the discovery and understanding of novel receptors for lymphokine-like molecules which enhance or potentiate the beneficial activities of other lymphokines would be highly advantageous. The present invention provides new receptors for ligands exhibiting similarity to cytokine like compositions and related compounds, and methods for their use.
SUMMARY OF THE INVENTION
The present invention is directed to novel receptors related to cytokine receptors, e.g., primate, cytokine receptor like molecular structures, designated DNAX Cytokine Receptor Subunits (DCRS), and their biological activities. In particular, it provides descriptions of subunits designated DCRS3 (referring to two embodiments designated DCRS3.1 and DCRS3.2) and DCRS4 (referring to three embodiments designated DCRS4.1, DCRS4.2, and DCRS4.3). It includes nucleic acids coding for the polypeptides themselves and methods for their production and use. The nucleic acids of the invention are characterized, in part, by their homology to cloned complementary. DNA (cDNA) sequences enclosed herein.
The present invention provides a composition of matter selected from: a substantially pure or recombinant: DCRS3 polypeptide comprising: at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 2 or 25; a substantially pure or recombinant DCRS3 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 2 or 25; a natural sequence DCRS3 comprising mature SEQ ID NO: 2 or 25; a fusion polypeptide comprising DCRS3 sequence; or DCRS4 polypeptide comprising: at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 5, 28, or 31; a substantially pure or recombinant DCRS4 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 5, 28, or 31; a natural sequence DCRS4 comprising mature SEQ ID NO: 5, 28, or 10~ 31; or a fusion polypeptide comprising DCRS4 sequence. In certain embodiments, the invention embraces such a substantially pure or isolated antigenic DCRS3 or DRS4 polypeptide, wherein the distinct nonoverlapping segments of identity: include one of at least eight amino acids; include one of at least four amino acids and a second of at least five amino acids; include at least three segments of at least four, five, and six amino acids, or include one of at least twelve amino acids. Other embodiments include wherein the: DCRS3 polypeptide: comprises a mature sequence of Table 1; is an unglycosylated form of DCRS3;
is from a primate, such as a human; comprises at least seventeen amino acids of SEQ ID NO: 2 or 25; exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID
NO: 2 or 25; comprises a sequence of at least 3 amino acids on each side across an exon boundary; is a natural allelic variant of DCRS3; has a length at least about 30 amino acids; exhibits at least two non-overlapping epitopes which are specific for a primate DCRS3; is glycosylated; has a molecular weight of at least 30 kD with natural glycosylation; is a synthetic polypeptide; is attached to a solid substrate; is conjugated to another chemical moiety; is a 5-fold or less substitution from natural sequence; or is a deletion or insertion variant from a natural sequence; or DCRS4 polypeptide: comprises a mature sequence of Table 3; is an unglycosylated form of DCRS4; is from a primate, such as a human; comprises at least seventeen amino acids of SEQ ID NO: 5; exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID NO: 5, 28, or 31; comprises a sequence of at least 3 amino acids on each side across an exon boundary; is a natural allelic variant of DCRSS;
has a length at least about 30 amino acids; exhibits at least two non-overlapping epitopes which are specific for a primate DCRS5; is glycosylated; has a molecular weight of at least 30 kD
RELATED REAGENTS AND METHODS
This filing claims priority to U.S. Patent Application 09/443,060, filed November 18, 1999, and U.S. Application 60/170,320, filed December 13, 1999, each of which is incorporated herein by reference.
FIELD OF THE INVENTION
The present invention relates to compositions and methods for affecting mammalian physiology, including immune system function. In particular, it provides methods to regulate development and/or the immune system. Diagnostic and therapeutic uses of these materials are also disclosed.
BACKGROUND OF THE INVENTION
Recombinant DNA technology refers generally to techniques of integrating genetic information from a donor source into vectors for subsequent processing, such as through introduction into a host, whereby the transferred genetic information is copied and/or expressed in the new environment. Commonly, the genetic information exists in the form of complementary DNA
(cDNA) derived from messenger RNA (mRNA) coding for a desired protein product. The carrier is frequently a plasmid having the capacity to incorporate cDNA for later replication in a host and, in some cases, actually to control expression of the cDNA
and thereby direct synthesis of the encoded product in the host.
See, e.g., Sambrook, et al. (1989) Molecular Cloning: A
Laboratory Manual, (2d ed.) vols.~ 1-3, CSH Press, NY.
For some time, it has been known that the mammalian immune response is based on a series of complex cellular interactions, called the "immune network". Recent research has provided new insights into the inner workings of this network. While it remains clear that much of the immune response does, in fact, revolve around the network-like interactions of lymphocytes, macrophages, granulocytes, and other cells, immunologists now generally hold the opinion that soluble proteins, known as lymphokines, cytokines, or monokines, play critical roles in controlling these cellular interactions. Thus, there is considerable interest in the isolation, characterization, and mechanisms of action of cell modulatory factors, an understanding of which will lead to significant advancements in the diagnosis and therapy of numerous medical abnormalities, e.g., immune system disorders.
Lymphokines apparently mediate cellular activities in a 10~ variety of ways. See, e.g., Paul (ed. 1996) Fundamental Immunoloav 3d ed., Raven Press, New York; and Thomson (ed. 1994) The Cytokine Handbook 2d ed., Academic Press, San Diego. They have been shown to support the proliferation, growth, and/or differentiation of pluripotential hematopoietic stem cells into vast numbers of progenitors comprising diverse cellular lineages which make up a complex immune system. Proper and balanced interactions between the cellular components are necessary for a healthy immune response. The different cellular lineages often respond in a different manner when lymphokines are administered in conjunction with other agents.
Cell lineages especially important to the immune response include two classes of lymphocytes: B-cells, which can produce and secrete immunoglobulins (proteins with the capability of recognizing and binding to foreign matter to effect its removal), and T-cells of various subsets that secrete lymphokines and induce or suppress the B-cells and various other cells (including other T-cells) making up the immune network.
These lymphocytes interact with many other cell types.
Research to better understand and treat various immune disorders has been hampered by the general inability to maintain cells of the immune system in vitro. Immunologists have discovered that culturing many of these cells can be accomplished through the use of T-cell and other cell supernatants, which contain various growth factors, including many of the lymphokines.
Various growth and regulatory factors exist which modulate morphogenetic development. Many receptors for cytokines are known. Often, there are at least two critical subunits in the functional receptor. See, e.g., Gonda and D'Andrea (1997) Blood 89:355-369; Presky, et al. (1996) Proc. Nat'1 Acad. Sci..USA
93:14002-14007; Drachman and Kaushansky (1995) Curr. Opin.
Hematol. 2:22-28; Theze (1994) Eur. Cytokine Netw. 5:353-368;
and Lemmon and Schlessinger (1994) Trends Biochem. Sci. 19:459-463.
From the foregoing, it is evident that the discovery and development of new receptors, including ones similar to known 10~ receptors for lymphokines, should contribute to new therapies .
In particular, the discovery and understanding of novel receptors for lymphokine-like molecules which enhance or potentiate the beneficial activities of other lymphokines would be highly advantageous. The present invention provides new receptors for ligands exhibiting similarity to cytokine like compositions and related compounds, and methods for their use.
SUMMARY OF THE INVENTION
The present invention is directed to novel receptors related to cytokine receptors, e.g., primate, cytokine receptor like molecular structures, designated DNAX Cytokine Receptor Subunits (DCRS), and their biological activities. In particular, it provides descriptions of subunits designated DCRS3 (referring to two embodiments designated DCRS3.1 and DCRS3.2) and DCRS4 (referring to three embodiments designated DCRS4.1, DCRS4.2, and DCRS4.3). It includes nucleic acids coding for the polypeptides themselves and methods for their production and use. The nucleic acids of the invention are characterized, in part, by their homology to cloned complementary. DNA (cDNA) sequences enclosed herein.
The present invention provides a composition of matter selected from: a substantially pure or recombinant: DCRS3 polypeptide comprising: at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 2 or 25; a substantially pure or recombinant DCRS3 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 2 or 25; a natural sequence DCRS3 comprising mature SEQ ID NO: 2 or 25; a fusion polypeptide comprising DCRS3 sequence; or DCRS4 polypeptide comprising: at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 5, 28, or 31; a substantially pure or recombinant DCRS4 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 5, 28, or 31; a natural sequence DCRS4 comprising mature SEQ ID NO: 5, 28, or 10~ 31; or a fusion polypeptide comprising DCRS4 sequence. In certain embodiments, the invention embraces such a substantially pure or isolated antigenic DCRS3 or DRS4 polypeptide, wherein the distinct nonoverlapping segments of identity: include one of at least eight amino acids; include one of at least four amino acids and a second of at least five amino acids; include at least three segments of at least four, five, and six amino acids, or include one of at least twelve amino acids. Other embodiments include wherein the: DCRS3 polypeptide: comprises a mature sequence of Table 1; is an unglycosylated form of DCRS3;
is from a primate, such as a human; comprises at least seventeen amino acids of SEQ ID NO: 2 or 25; exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID
NO: 2 or 25; comprises a sequence of at least 3 amino acids on each side across an exon boundary; is a natural allelic variant of DCRS3; has a length at least about 30 amino acids; exhibits at least two non-overlapping epitopes which are specific for a primate DCRS3; is glycosylated; has a molecular weight of at least 30 kD with natural glycosylation; is a synthetic polypeptide; is attached to a solid substrate; is conjugated to another chemical moiety; is a 5-fold or less substitution from natural sequence; or is a deletion or insertion variant from a natural sequence; or DCRS4 polypeptide: comprises a mature sequence of Table 3; is an unglycosylated form of DCRS4; is from a primate, such as a human; comprises at least seventeen amino acids of SEQ ID NO: 5; exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID NO: 5, 28, or 31; comprises a sequence of at least 3 amino acids on each side across an exon boundary; is a natural allelic variant of DCRSS;
has a length at least about 30 amino acids; exhibits at least two non-overlapping epitopes which are specific for a primate DCRS5; is glycosylated; has a molecular weight of at least 30 kD
5 with natural glycosylation; is a synthetic polypeptide; is attached to a solid substrate; is conjugated to another chemical moiety; is a 5-fold or less substitution from natural sequence;
or is a deletion or insertion variant from a natural sequence.
Still other embodiments include a composition comprising: a 10~ substantially pure DCRS3 and another cytokine receptor family member; a sterile DCRS3 polypeptide; the DCRS3 polypeptide and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration; a substantially pure DCRS4 and another cytokine receptor family member; a sterile DCRS4 polypeptide; the DCRS4 polypeptide and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration. Fusion polypeptide embodiments include those comprising: mature protein sequence of Table 1 or 3; a detection or purification tag, including a FLAG, His6, or Ig sequence; or sequence of another interferon receptor protein. Kit embodiments include those comprising such a polypeptide, and: a compartment comprising the protein or polypeptide; or instructions for use or disposal of reagents in the kit.
Binding compound embodiments include, e.g., a binding compound comprising an antigen binding site from an antibody, which specifically binds to a natural: DCRS3 polypeptide, wherein: the binding compound is in a container; the DCRS3 polypeptide is from a human; the binding compound is an Fv, Fab, or Fab2 fragment; the binding compound is conjugated to another chemical moiety; or the antibody: is raised against a peptide sequence of a mature polypeptide of Table 1; is raised against a mature DCRS3; is raised to a purified human DCRS3; is immunoselected; is a polyclonal antibody; binds to a denatured DCRS3; exhibits a Kd to antigen of at least 30 ~M; is attached to a solid substrate, including a bead or plastic membrane; is in a sterile composition; or is detectably labeled, including a radioactive or fluorescent label; or DCRS4 polypeptide, wherein:
the binding compound is in a container; the DCRS4 polypeptide is from a human; the binding compound is an Fv, Fab, or Fab2 fragment; the binding compound is conjugated to another chemical moiety; or the antibody: is raised against a peptide sequence of a mature polypeptide of Table 3; is raised against a mature DCRS4; is raised to a purified human DCRS4; is immunoselected;
10~ is a polyclonal antibody; binds to a denatured DCRS4; exhibits a Kd to antigen of at least 30 ~M; is attached to a solid substrate, including a bead or plastic membrane; is in a sterile composition; or is detectably labeled, including a radioactive or fluorescent label. Kits include those comprising the binding compound, and: a compartment comprising the binding compound; or instructions for use or disposal of reagents in the kit.
Methods are provided, e.g., of producing an antigen: antibody complex, comprising contacting under appropriate conditions: a primate DCRS3 polypeptide with a described antibody, thereby allowing the complex to form; or a primate DCRS4 polypeptide with a described antibody, thereby allowing the complex to form. This includes wherein: the complex is purified from other cytokine receptors; the complex is purified from other antibody; the contacting is with a sample comprising another cytokine; the contacting allows quantitative detection of the antigen; the contacting is with a sample comprising the antibody; or the contacting allows quantitative detection of the antibody.
Various related compositions are provided, e.g., a composition comprising: a sterile binding compound, as described, or the described binding compound and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration.
Nucleic acid embodiments include, e.g., an isolated or recombinant nucleic acid encoding the DCRS3 polypeptide, wherein the: DCRS3 is from a human; or the nucleic acid: encodes an antigenic peptide sequence of Table 1; encodes a plurality of antigenic peptide sequences of Table 1; exhibits identity over at least thirteen nucleotides to a natural cDNA encoding the segment; is an expression vector; further comprises an origin of replication; is from a natural source; comprises a detectable label; comprises synthetic nucleotide sequence; is less than 6 kb, preferably less than 3 kb; is from a primate; comprises a natural full length coding sequence; is a hybridization probe for a gene encoding the DCRS3; or is a PCR primer, PCR product, 10~ or mutagenesis primer; or an isolated or recombinant nucleic acid encoding the DCRS4 polypeptide, wherein the: DCRS4 is from a human; or the nucleic acid: encodes an antigenic peptide sequence of Table 3; encodes a plurality of antigenic peptide sequences of Table 3; exhibits identity over at least thirteen nucleotides to a natural cDNA encoding the segment; is an expression vector; further comprises an origin of replication;
is from a natural source; comprises a detectable label;
comprises synthetic nucleotide sequence; is less than 6 kb, preferably less than 3 kb; is from a primate; comprises a natural full length coding sequence; is a hybridization probe for a gene encoding the DCRS4; or is a PCR primer, PCR product, or mutagenesis primer. Other embodiments of the invention include a cell or tissue comprising the described recombinant nucleic acid. Preferably, the cell is: a prokaryotic cell; a eukaryotic cell; a bacterial cell; a yeast cell; an insect cell;
a mammalian cell; a mouse cell; a primate cell; or a human cell.
Kit embodiments include those comprising a described nucleic acid, and: a compartment comprising the nucleic acid; a compartment further comprising a primate DCRS3 or DCRS4 polypeptide; or instructions for use or disposal of reagents in the kit.
Alternative nucleic acid embodiments include a nucleic acid which: hybridizes under wash conditions of 30 minutes at 30° C
and less than 2M salt to the coding portion of: SEQ ID NO: 1, 24, 4, 27, or 30; or exhibits identity over a stretch of at least about 30 nucleotides to a primate DCRS3 or DCRS4.
Preferred embodiments include those wherein: the wash conditions are at 45° C and/or 500 mM salt; the wash conditions are at 55°
C and/or 150 mM salt; the stretch is at least 55 nucleotides; or the stretch is at least 75 nucleotides.
Other methods include those of modulating physiology or development of a cell or tissue culture cells comprising contacting the cell with an agonist or antagonist of a mammalian DCRS3 or DCRS4. Preferably, the cell is transformed with a nucleic acid encoding a DCRS3 or DCRS4 and another cytokine receptor subunit.
10~
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
OUTLINE
I. General II. Activities III. Nucleic acids A. encoding fragments, sequence, probes B. mutations, chimeras, fusions C. making nucleic acids D. vectors, cells comprising IV. Proteins, Peptides A. fragments, sequence, immunogens, antigens B. muteins C. agonists/antagonists, functional equivalents D. making proteins V. Making nucleic acids, proteins A. synthetic B. recombinant C. natural sources VI. Antibodies A. polyclonals B. monoclonal C . fragment s ; Kd D, anti-idiotypic antibodies E. hybridoma cell lines VII. Kits and Methods to quantify DCRS
A. ELISA
B. assay mRNA encoding C. qualitative/quantitative D. kits VIII. Therapeutic compositions, methods A. combination compositions B. unit dose C. administration IX. Screening X. Ligands I. General The present invention provides the amino acid sequences and DNA sequences of mammalian, herein primate, cytokine receptor-like subunit molecules, these designated DNAX Cytokine Receptor Subunit 3 (DCRS3; 50R) and DNAX Cytokine Receptor Subunit 4 (DCRS4; cytor) having particular defined properties, both structural and biological. Various cDNAs encoding these 10~ molecules were obtained from primate, e.g., human, cDNA sequence libraries. Other primate or other mammalian counterparts would also be desired.
Some of the standard methods applicable are described or referenced, e.g., in Maniatis, et al. (1982) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor Press; Sambrook, et al. (1989) Molecular Cloning: A
Laboratory Manual, (2d ed.), vols. 1-3, CSH Press, NY; Ausubel, et al., Bioloay, Greene Publishing Associates, Brooklyn, NY; or Ausubel, et al. (1987 and periodic supplements) Current Protocols in Molecular Bioloav, Greene/Wiley, New York; each of which is incorporated herein by reference.
Nucleotide (SEQ ID NO: 1) and corresponding amino acid sequence (SEQ ID NO: 2) of a human DCRS3 coding segment are shown in Table 1; likewise for the DCRS3.2 as SEQ ID NO: 24 and 25; comparison of DCRS3.1 and DCRS3.2 polypeptide sequences is shown also in Table 1. Reverse translations based upon the universal genetic code are provided in Table 2; comparison of the encoding nucleic acid sequences is also presented in Table 2. The sequences are derived from genomic sequence at chromosome location clones CIT987SK-582J2 HUAC004525 and CIT987-SKA-67085 HUAC002303, at 16p12, and other cDNA sequences. The predicted signal sequence is indicated, but may depend on cell type, or may be a few residues in either direction. The transmembrane segment (SEQ ID NO: 2) is predicted to run from about 1eu248-ser264 (g1u242-his268). Predicted fibronectin domain runs from about asn128-tyr220; cytokine receptor WS box from about trp224-ser228; conserved disulfide motif between cys6-cys26; second conserved disulfide linkage at cys65-cys89;
five N glycosylation sites at Asn residues 61, 97, 121, 128, and 145; seven CAMP PK sites at lys4; 1ys68; 1ys184; arg191; arg201;
1ys202; and 1ys292; fourteen Ca phosphorylation sites at thr7l, 5 ser130, ser187, ser205, ser237, ser182, ser195, ser310, ser317, thr323, ser374, ser385, ser403, and thr499; five myristoly sites at 91y174, 91y303, 91y439, 91y449, and 91y466; four PKC
phosphorylation sites at ser7, ser147, ser180, and ser264; and one tyrosine kinase site at 1ys163.
10~ Exon boundaries are predicted to be about between nucleotides 949-c50, 9230-9231, 8284-9285, a484-8485, 9597-a598, 9775-a776, 8875-9876, and 8957-a958. Because the sequences have been derived from genomic sequence, in which the introns have not been spliced out, particularly important compositions will be those which encode segments across the boundaries, e.9., both nucleic acid sequence and amino acid sequence. The segments will comprise, e.9., segments across the boundary which may comprise 8, 9, 11, 13, 15, 17 20, 25, 30, 35, 50, or more nucleotides on either or both sides adjacent to an exon boundary, or 4, 5, 6, 7, or 8 amino acids on either or both sides adjacent a boundary. The lengths on either side need not be the same for purposes of novelty, e.9., three amino acids on one side and 5 on the other side. Thus, e.9., compositions are provided comprising, e.9, 15 contiguous nucleotides across a boundary, of which at least 6 are from each side. Similarly, compositions are provided, e.9., comprising at least 3 amino acids from each side of the exon boundary, with a matching of at least 8 amino acids across the boundary. Also provided are compositions comprising a plurality of such segments across multiple exon boundaries, which different segments need not have the same length limitations. Thus, the invention provides a nucleic acid comprising, e.9., at least 5 nucleotides in each side across the exon 1/2 boundary, and at least 4 nucleotides on either side of the the exon 3/4, 4/5, 5/6, and/or 6/7 boundaries. Natural sequence compositions would be preferred.
Nucleotide (SEQ ID NO: 4) and corresponding amino acid sequence (SEQ.ID NO: 5) of a human DCRS4 coding segment are shown in Table 3; likewise for the DCRS4.2 as SEQ ID NO: 27 and 28 and the DCRS4.3 as SEQ ID N0: 30 and 31; comparison of DCRS4 polypeptide sequences is shown also in Table 3. Reverse translations based upon the universal genetic code are provided in Table 4; comparison of the encoding nucleic acid sequences is also presented in Table 4. The sequence of DCRS4.1 is derived from genomic sequence at chromosome location 6q24.1-25.2, within some 50 kb of IFNyRl chain. The predicted DCRS4.1 signal sequence is indicated, but may depend on cell type, or may be a 10~ few residues in either direction. This embodiment of the receptor lacks a transmembrane segment, which is unusual, but there is precedent for soluble forms of cytokine receptor subunits. See, e.g, IL-l2Ra (p40 subunit) and the EBI3 receptor subunit homolog. For the DCRS4.1, the predicted cytokine receptor domain from prol0-arg49; conserved disulfide motif between cys57-cys65; five N glycosylation sites at Asn residues 35, 131, 136, 157, and 174; four cAMP PK sites at arg30, 1ys98, 1ys106, and 1ys156; eight Ca phosphorylation sites at thr4, thr60, ser64, thr68, thr7l, ser159, ser176, and ser220; three myristoly sites at g1y89, g1y103, and g1y186; three PKC
phosphorylation sites at ser7, ser97, and ser217; one amidation site at tyr79; one CAMP phosphorylation site at 1ys98; and two CK2 phosphorylation sites at sera and ser159. Exon boundaries are predicted to be about between nucleotides c59-a60; t197-a198, 8206-a207, 8430-c431, and g601-a602. Alignment with the other DCRS4 embodiments is provided. As described above, compositions with sequence across the exon boundaries are provided.
3 0 Table 1: Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiments (DCRS3.1; 50R). Primate, e.g., human embodiment (see SEQ ID NO: 1 and 2). Predicted signal sequence indicated, but may vary by a few positions and depending upon cell type.
3 5 atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly gcc ctc gag ggg atg gag agg aag ctc tgc agt ccc aag cca ccc ccc 96 4 0 Ala Leu Glu Gly Met Glu Arg Lys Leu Cys Ser Pro Lys Pro Pro Pro acc aag gcc tct ctc ccc act gac cct cca ggc tgg ggc tgc ccc gac 144 Thr Lys Ala Ser Leu Pro Thr Asp Pro Pro Gly Trp Gly Cys Pro Asp ctc gtc tgc tac acc gat tac ctc cag acg gtc atc tgc atc ctg gaa 192 Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu atg tgg aac ctc cac ccc agc acg ctc acc ctt acc tgg ata ctt tct 240 Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Ile Leu Ser 15' aat aat act ggg tgc tat atc aag gac aga aca ctg gac ctc agg caa 288 Asn Asn Thr Gly Cys Tyr Ile Lys Asp Arg Thr Leu Asp Leu Arg Gln gac cag tat gaa gag ctg aag gac gag gcc acc tcc tgc agc ctc cac 336 Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His 2 0 agg tcg gcc cac aat gcc acg cat gcc acc tac acc tgc cac atg gat 384 Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp gta ttc cac ttc atg gcc gac gac att ttc agt gtc aac atc aca gac 432 2 5 Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp cag tct ggc aac tac tcc cag gag tgt ggc agc ttt ctc ctg get gag 480 Gln Ser Gly Asn Tyr Ser Gln Glu Cys Gly Ser Phe Leu Leu Ala Glu agc aga cag tat aat atc tcc tgg cgc tca gat tac gaa gac cct gcc 528 Ser Arg Gln Tyr Asn Ile Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala ttc tac atg ctg aag ggc aag ctt cag tat gag ctg cag tac agg aac 576 Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn 4 0 cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc tca gtg 624 Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa gac tcg 672 4 5 Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser agc tat gag ctg cag gtg cgg gca ggg ccc atg cct ggc tcc tcc tac 720 Ser Tyr Glu Leu Gln Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc ttt cag acc cag 768 Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Phe Gln Thr Gln tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt ctc ctc 816 Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag acc cat 864 Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc cct gag 912 Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc aag aaa 960 Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys 15' tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga ccc tgg 1008 Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac cca cca 1056 Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro 2 0 cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa cca gca 1104 Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg ccg aca 1152 25 Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat cgg cca 1200 Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca gag ggg 1248 Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca gcc ctg 1296 Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu 4 0 gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac cca ctc 1344 Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca get ggc 1392 4 5 Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga cta aag 1440 Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc tgg ggt 1488 Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca ccc ctg 1536 Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc tct gac 1584 Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac gaa gga 1632 Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg cca ctt 1680 Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu 15' 525 530 535 540 tcg agc cct gga ccc cag gcc agc taa 1707 Ser Ser Pro Gly Pro Gln Ala Ser MPRGWAAPLLLLLLQGALEGMERKLCSPKPPPTKASLPTDPPGWGCPDLVCYTDYLQTVICILEMWNLHPSTLTLTW
ILSNNTGCYIKDRTLDLRQDQYEELKDEATSCSLHRSAHNATHATYTCHMDVFHFMADDIFSVNITDQSGNYSQECG
SFLLAESRQYNISWRSDYEDPAFYMLKGKLQYELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELQVR
AGPMPGSSYQGTWSEWSDPVIFQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLKTHPLWRLWKKIWAVPSPERFFM
PLYKGCSGDFKKWVGAPFTGSSLELGPWSPEVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTA
QNSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDAGLEPSPGLEDPLLDAGTTVLSCGCVS
AGSPGLGGPLGSLLDRLKPPLADGEDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECDFTS
PGDEGPPRSYLRQWWIPPPLSSPGPQAS
Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiments (DCRS3.2; SEQ ID N0: 24 and 25):
atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly ggc tgg ggc tgc ccc gac ctc gtc tgc tac acc gat tac ctc cag acg 96 Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr gtc atc tgc atc ctg gaa atg tgg aac ctc cac ccc agc acg ctc acc 144 Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr 4 5 ctt acc tgg caa gac cag tat gaa gag ctg aag gac gag gcc acc tcc 192 Leu Thr Trp Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser tgc agc ctc cac agg tcg gcc cac aat gcc acg cat gcc acc tac acc 240 Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr tgc cac atg gat gta ttc cac ttc atg gcc gac gac att ttc agt gtc 288 Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val aac atc aca gac cag tct ggc aac tac tcc cag gan tgt ggc agc ttt 336 Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Xaa Cys Gly Ser Phe ctc ctg get gag agc atc aag ccg get ccc cct ttc aac gtg act gtg 384 Leu Leu Ala Glu Ser Ile Lys Pro Ala Pro Pro Phe Asn Val Thr Val 5 acc ttc tca gga cag tat aat atn tcc tgg cgc tca gat tac gaa gac 432 Thr Phe Ser Gly Gln Tyr Asn Xaa Ser Trp Arg Ser Asp Tyr Glu Asp cct gcc ttc tac atg ctg aaa ggc aag ctt caa tat gag ctg cag tac 480 10 Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr agg aac cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc 528 Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile 15' 145 150 155 tca gtg gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa 576 Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys gac tcg agc tat gag ctg can gtg cgg gca ggg ccc atg cct ggc tcc 624 Asp Ser Ser Tyr Glu Leu Xaa Val Arg Ala Gly Pro Met Pro Gly Ser 2 5 tcc tac cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc tgt cag 672 Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Cys Gln acc cag tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt 720 3 0 Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu ctc ctc ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag 768 Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys acc cat cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc 816 Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser cct gag cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc 864 Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe 4 5 aag aaa tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga 912 Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly ccc tgg agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac 960 Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His cca cca cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa 1008 Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu cca gca gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg 1056 Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp ccg aca gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat 1104 Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp cgg cca tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca 1152 Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala gag ggg cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca 1200 15' Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro gcc ctg gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac 1248 Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp cca ctc ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca 1296 Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser get ggc agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga 1344 Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg 3 0 cta aag cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc 1392 Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro tgg ggt ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca 1440 3 5 Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser ccc ctg gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc 1488 Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly tct gac tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac 1536 Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp gaa gga ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg 1584 Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro cca ctt tcg agc cct gga ccc cag gcc agc taa 1617 Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser MPRGWAAPLLLLLLQGGWGCPDLVCYTDYLQTVICILEMWNLHPSTLTLTWQDQYEELKDEATSCSLHRSAHNATHA
TYTCHMDVFHFMADDIFSVNITDQSGNYSQXCGSFLLAESIKPAPPFNVTVTFSGQYNXSWRSDYEDPAFYMLKGKL
QYELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELXVRAGPMPGSSYQGTWSEWSDPVICQTQSEELK
EGWNPHLLLLLLLVIVFIPAFWSLKTHPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPWSP
EVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQNSGGSAYSEERDRPYGLVSIDTVTVLDAE
or is a deletion or insertion variant from a natural sequence.
Still other embodiments include a composition comprising: a 10~ substantially pure DCRS3 and another cytokine receptor family member; a sterile DCRS3 polypeptide; the DCRS3 polypeptide and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration; a substantially pure DCRS4 and another cytokine receptor family member; a sterile DCRS4 polypeptide; the DCRS4 polypeptide and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration. Fusion polypeptide embodiments include those comprising: mature protein sequence of Table 1 or 3; a detection or purification tag, including a FLAG, His6, or Ig sequence; or sequence of another interferon receptor protein. Kit embodiments include those comprising such a polypeptide, and: a compartment comprising the protein or polypeptide; or instructions for use or disposal of reagents in the kit.
Binding compound embodiments include, e.g., a binding compound comprising an antigen binding site from an antibody, which specifically binds to a natural: DCRS3 polypeptide, wherein: the binding compound is in a container; the DCRS3 polypeptide is from a human; the binding compound is an Fv, Fab, or Fab2 fragment; the binding compound is conjugated to another chemical moiety; or the antibody: is raised against a peptide sequence of a mature polypeptide of Table 1; is raised against a mature DCRS3; is raised to a purified human DCRS3; is immunoselected; is a polyclonal antibody; binds to a denatured DCRS3; exhibits a Kd to antigen of at least 30 ~M; is attached to a solid substrate, including a bead or plastic membrane; is in a sterile composition; or is detectably labeled, including a radioactive or fluorescent label; or DCRS4 polypeptide, wherein:
the binding compound is in a container; the DCRS4 polypeptide is from a human; the binding compound is an Fv, Fab, or Fab2 fragment; the binding compound is conjugated to another chemical moiety; or the antibody: is raised against a peptide sequence of a mature polypeptide of Table 3; is raised against a mature DCRS4; is raised to a purified human DCRS4; is immunoselected;
10~ is a polyclonal antibody; binds to a denatured DCRS4; exhibits a Kd to antigen of at least 30 ~M; is attached to a solid substrate, including a bead or plastic membrane; is in a sterile composition; or is detectably labeled, including a radioactive or fluorescent label. Kits include those comprising the binding compound, and: a compartment comprising the binding compound; or instructions for use or disposal of reagents in the kit.
Methods are provided, e.g., of producing an antigen: antibody complex, comprising contacting under appropriate conditions: a primate DCRS3 polypeptide with a described antibody, thereby allowing the complex to form; or a primate DCRS4 polypeptide with a described antibody, thereby allowing the complex to form. This includes wherein: the complex is purified from other cytokine receptors; the complex is purified from other antibody; the contacting is with a sample comprising another cytokine; the contacting allows quantitative detection of the antigen; the contacting is with a sample comprising the antibody; or the contacting allows quantitative detection of the antibody.
Various related compositions are provided, e.g., a composition comprising: a sterile binding compound, as described, or the described binding compound and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration.
Nucleic acid embodiments include, e.g., an isolated or recombinant nucleic acid encoding the DCRS3 polypeptide, wherein the: DCRS3 is from a human; or the nucleic acid: encodes an antigenic peptide sequence of Table 1; encodes a plurality of antigenic peptide sequences of Table 1; exhibits identity over at least thirteen nucleotides to a natural cDNA encoding the segment; is an expression vector; further comprises an origin of replication; is from a natural source; comprises a detectable label; comprises synthetic nucleotide sequence; is less than 6 kb, preferably less than 3 kb; is from a primate; comprises a natural full length coding sequence; is a hybridization probe for a gene encoding the DCRS3; or is a PCR primer, PCR product, 10~ or mutagenesis primer; or an isolated or recombinant nucleic acid encoding the DCRS4 polypeptide, wherein the: DCRS4 is from a human; or the nucleic acid: encodes an antigenic peptide sequence of Table 3; encodes a plurality of antigenic peptide sequences of Table 3; exhibits identity over at least thirteen nucleotides to a natural cDNA encoding the segment; is an expression vector; further comprises an origin of replication;
is from a natural source; comprises a detectable label;
comprises synthetic nucleotide sequence; is less than 6 kb, preferably less than 3 kb; is from a primate; comprises a natural full length coding sequence; is a hybridization probe for a gene encoding the DCRS4; or is a PCR primer, PCR product, or mutagenesis primer. Other embodiments of the invention include a cell or tissue comprising the described recombinant nucleic acid. Preferably, the cell is: a prokaryotic cell; a eukaryotic cell; a bacterial cell; a yeast cell; an insect cell;
a mammalian cell; a mouse cell; a primate cell; or a human cell.
Kit embodiments include those comprising a described nucleic acid, and: a compartment comprising the nucleic acid; a compartment further comprising a primate DCRS3 or DCRS4 polypeptide; or instructions for use or disposal of reagents in the kit.
Alternative nucleic acid embodiments include a nucleic acid which: hybridizes under wash conditions of 30 minutes at 30° C
and less than 2M salt to the coding portion of: SEQ ID NO: 1, 24, 4, 27, or 30; or exhibits identity over a stretch of at least about 30 nucleotides to a primate DCRS3 or DCRS4.
Preferred embodiments include those wherein: the wash conditions are at 45° C and/or 500 mM salt; the wash conditions are at 55°
C and/or 150 mM salt; the stretch is at least 55 nucleotides; or the stretch is at least 75 nucleotides.
Other methods include those of modulating physiology or development of a cell or tissue culture cells comprising contacting the cell with an agonist or antagonist of a mammalian DCRS3 or DCRS4. Preferably, the cell is transformed with a nucleic acid encoding a DCRS3 or DCRS4 and another cytokine receptor subunit.
10~
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
OUTLINE
I. General II. Activities III. Nucleic acids A. encoding fragments, sequence, probes B. mutations, chimeras, fusions C. making nucleic acids D. vectors, cells comprising IV. Proteins, Peptides A. fragments, sequence, immunogens, antigens B. muteins C. agonists/antagonists, functional equivalents D. making proteins V. Making nucleic acids, proteins A. synthetic B. recombinant C. natural sources VI. Antibodies A. polyclonals B. monoclonal C . fragment s ; Kd D, anti-idiotypic antibodies E. hybridoma cell lines VII. Kits and Methods to quantify DCRS
A. ELISA
B. assay mRNA encoding C. qualitative/quantitative D. kits VIII. Therapeutic compositions, methods A. combination compositions B. unit dose C. administration IX. Screening X. Ligands I. General The present invention provides the amino acid sequences and DNA sequences of mammalian, herein primate, cytokine receptor-like subunit molecules, these designated DNAX Cytokine Receptor Subunit 3 (DCRS3; 50R) and DNAX Cytokine Receptor Subunit 4 (DCRS4; cytor) having particular defined properties, both structural and biological. Various cDNAs encoding these 10~ molecules were obtained from primate, e.g., human, cDNA sequence libraries. Other primate or other mammalian counterparts would also be desired.
Some of the standard methods applicable are described or referenced, e.g., in Maniatis, et al. (1982) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor Press; Sambrook, et al. (1989) Molecular Cloning: A
Laboratory Manual, (2d ed.), vols. 1-3, CSH Press, NY; Ausubel, et al., Bioloay, Greene Publishing Associates, Brooklyn, NY; or Ausubel, et al. (1987 and periodic supplements) Current Protocols in Molecular Bioloav, Greene/Wiley, New York; each of which is incorporated herein by reference.
Nucleotide (SEQ ID NO: 1) and corresponding amino acid sequence (SEQ ID NO: 2) of a human DCRS3 coding segment are shown in Table 1; likewise for the DCRS3.2 as SEQ ID NO: 24 and 25; comparison of DCRS3.1 and DCRS3.2 polypeptide sequences is shown also in Table 1. Reverse translations based upon the universal genetic code are provided in Table 2; comparison of the encoding nucleic acid sequences is also presented in Table 2. The sequences are derived from genomic sequence at chromosome location clones CIT987SK-582J2 HUAC004525 and CIT987-SKA-67085 HUAC002303, at 16p12, and other cDNA sequences. The predicted signal sequence is indicated, but may depend on cell type, or may be a few residues in either direction. The transmembrane segment (SEQ ID NO: 2) is predicted to run from about 1eu248-ser264 (g1u242-his268). Predicted fibronectin domain runs from about asn128-tyr220; cytokine receptor WS box from about trp224-ser228; conserved disulfide motif between cys6-cys26; second conserved disulfide linkage at cys65-cys89;
five N glycosylation sites at Asn residues 61, 97, 121, 128, and 145; seven CAMP PK sites at lys4; 1ys68; 1ys184; arg191; arg201;
1ys202; and 1ys292; fourteen Ca phosphorylation sites at thr7l, 5 ser130, ser187, ser205, ser237, ser182, ser195, ser310, ser317, thr323, ser374, ser385, ser403, and thr499; five myristoly sites at 91y174, 91y303, 91y439, 91y449, and 91y466; four PKC
phosphorylation sites at ser7, ser147, ser180, and ser264; and one tyrosine kinase site at 1ys163.
10~ Exon boundaries are predicted to be about between nucleotides 949-c50, 9230-9231, 8284-9285, a484-8485, 9597-a598, 9775-a776, 8875-9876, and 8957-a958. Because the sequences have been derived from genomic sequence, in which the introns have not been spliced out, particularly important compositions will be those which encode segments across the boundaries, e.9., both nucleic acid sequence and amino acid sequence. The segments will comprise, e.9., segments across the boundary which may comprise 8, 9, 11, 13, 15, 17 20, 25, 30, 35, 50, or more nucleotides on either or both sides adjacent to an exon boundary, or 4, 5, 6, 7, or 8 amino acids on either or both sides adjacent a boundary. The lengths on either side need not be the same for purposes of novelty, e.9., three amino acids on one side and 5 on the other side. Thus, e.9., compositions are provided comprising, e.9, 15 contiguous nucleotides across a boundary, of which at least 6 are from each side. Similarly, compositions are provided, e.9., comprising at least 3 amino acids from each side of the exon boundary, with a matching of at least 8 amino acids across the boundary. Also provided are compositions comprising a plurality of such segments across multiple exon boundaries, which different segments need not have the same length limitations. Thus, the invention provides a nucleic acid comprising, e.9., at least 5 nucleotides in each side across the exon 1/2 boundary, and at least 4 nucleotides on either side of the the exon 3/4, 4/5, 5/6, and/or 6/7 boundaries. Natural sequence compositions would be preferred.
Nucleotide (SEQ ID NO: 4) and corresponding amino acid sequence (SEQ.ID NO: 5) of a human DCRS4 coding segment are shown in Table 3; likewise for the DCRS4.2 as SEQ ID NO: 27 and 28 and the DCRS4.3 as SEQ ID N0: 30 and 31; comparison of DCRS4 polypeptide sequences is shown also in Table 3. Reverse translations based upon the universal genetic code are provided in Table 4; comparison of the encoding nucleic acid sequences is also presented in Table 4. The sequence of DCRS4.1 is derived from genomic sequence at chromosome location 6q24.1-25.2, within some 50 kb of IFNyRl chain. The predicted DCRS4.1 signal sequence is indicated, but may depend on cell type, or may be a 10~ few residues in either direction. This embodiment of the receptor lacks a transmembrane segment, which is unusual, but there is precedent for soluble forms of cytokine receptor subunits. See, e.g, IL-l2Ra (p40 subunit) and the EBI3 receptor subunit homolog. For the DCRS4.1, the predicted cytokine receptor domain from prol0-arg49; conserved disulfide motif between cys57-cys65; five N glycosylation sites at Asn residues 35, 131, 136, 157, and 174; four cAMP PK sites at arg30, 1ys98, 1ys106, and 1ys156; eight Ca phosphorylation sites at thr4, thr60, ser64, thr68, thr7l, ser159, ser176, and ser220; three myristoly sites at g1y89, g1y103, and g1y186; three PKC
phosphorylation sites at ser7, ser97, and ser217; one amidation site at tyr79; one CAMP phosphorylation site at 1ys98; and two CK2 phosphorylation sites at sera and ser159. Exon boundaries are predicted to be about between nucleotides c59-a60; t197-a198, 8206-a207, 8430-c431, and g601-a602. Alignment with the other DCRS4 embodiments is provided. As described above, compositions with sequence across the exon boundaries are provided.
3 0 Table 1: Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiments (DCRS3.1; 50R). Primate, e.g., human embodiment (see SEQ ID NO: 1 and 2). Predicted signal sequence indicated, but may vary by a few positions and depending upon cell type.
3 5 atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly gcc ctc gag ggg atg gag agg aag ctc tgc agt ccc aag cca ccc ccc 96 4 0 Ala Leu Glu Gly Met Glu Arg Lys Leu Cys Ser Pro Lys Pro Pro Pro acc aag gcc tct ctc ccc act gac cct cca ggc tgg ggc tgc ccc gac 144 Thr Lys Ala Ser Leu Pro Thr Asp Pro Pro Gly Trp Gly Cys Pro Asp ctc gtc tgc tac acc gat tac ctc cag acg gtc atc tgc atc ctg gaa 192 Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu atg tgg aac ctc cac ccc agc acg ctc acc ctt acc tgg ata ctt tct 240 Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Ile Leu Ser 15' aat aat act ggg tgc tat atc aag gac aga aca ctg gac ctc agg caa 288 Asn Asn Thr Gly Cys Tyr Ile Lys Asp Arg Thr Leu Asp Leu Arg Gln gac cag tat gaa gag ctg aag gac gag gcc acc tcc tgc agc ctc cac 336 Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His 2 0 agg tcg gcc cac aat gcc acg cat gcc acc tac acc tgc cac atg gat 384 Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp gta ttc cac ttc atg gcc gac gac att ttc agt gtc aac atc aca gac 432 2 5 Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp cag tct ggc aac tac tcc cag gag tgt ggc agc ttt ctc ctg get gag 480 Gln Ser Gly Asn Tyr Ser Gln Glu Cys Gly Ser Phe Leu Leu Ala Glu agc aga cag tat aat atc tcc tgg cgc tca gat tac gaa gac cct gcc 528 Ser Arg Gln Tyr Asn Ile Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala ttc tac atg ctg aag ggc aag ctt cag tat gag ctg cag tac agg aac 576 Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn 4 0 cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc tca gtg 624 Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa gac tcg 672 4 5 Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser agc tat gag ctg cag gtg cgg gca ggg ccc atg cct ggc tcc tcc tac 720 Ser Tyr Glu Leu Gln Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc ttt cag acc cag 768 Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Phe Gln Thr Gln tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt ctc ctc 816 Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag acc cat 864 Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc cct gag 912 Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc aag aaa 960 Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys 15' tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga ccc tgg 1008 Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac cca cca 1056 Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro 2 0 cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa cca gca 1104 Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg ccg aca 1152 25 Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat cgg cca 1200 Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca gag ggg 1248 Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca gcc ctg 1296 Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu 4 0 gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac cca ctc 1344 Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca get ggc 1392 4 5 Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga cta aag 1440 Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc tgg ggt 1488 Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca ccc ctg 1536 Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc tct gac 1584 Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac gaa gga 1632 Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg cca ctt 1680 Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu 15' 525 530 535 540 tcg agc cct gga ccc cag gcc agc taa 1707 Ser Ser Pro Gly Pro Gln Ala Ser MPRGWAAPLLLLLLQGALEGMERKLCSPKPPPTKASLPTDPPGWGCPDLVCYTDYLQTVICILEMWNLHPSTLTLTW
ILSNNTGCYIKDRTLDLRQDQYEELKDEATSCSLHRSAHNATHATYTCHMDVFHFMADDIFSVNITDQSGNYSQECG
SFLLAESRQYNISWRSDYEDPAFYMLKGKLQYELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELQVR
AGPMPGSSYQGTWSEWSDPVIFQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLKTHPLWRLWKKIWAVPSPERFFM
PLYKGCSGDFKKWVGAPFTGSSLELGPWSPEVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTA
QNSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDAGLEPSPGLEDPLLDAGTTVLSCGCVS
AGSPGLGGPLGSLLDRLKPPLADGEDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECDFTS
PGDEGPPRSYLRQWWIPPPLSSPGPQAS
Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiments (DCRS3.2; SEQ ID N0: 24 and 25):
atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly ggc tgg ggc tgc ccc gac ctc gtc tgc tac acc gat tac ctc cag acg 96 Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr gtc atc tgc atc ctg gaa atg tgg aac ctc cac ccc agc acg ctc acc 144 Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr 4 5 ctt acc tgg caa gac cag tat gaa gag ctg aag gac gag gcc acc tcc 192 Leu Thr Trp Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser tgc agc ctc cac agg tcg gcc cac aat gcc acg cat gcc acc tac acc 240 Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr tgc cac atg gat gta ttc cac ttc atg gcc gac gac att ttc agt gtc 288 Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val aac atc aca gac cag tct ggc aac tac tcc cag gan tgt ggc agc ttt 336 Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Xaa Cys Gly Ser Phe ctc ctg get gag agc atc aag ccg get ccc cct ttc aac gtg act gtg 384 Leu Leu Ala Glu Ser Ile Lys Pro Ala Pro Pro Phe Asn Val Thr Val 5 acc ttc tca gga cag tat aat atn tcc tgg cgc tca gat tac gaa gac 432 Thr Phe Ser Gly Gln Tyr Asn Xaa Ser Trp Arg Ser Asp Tyr Glu Asp cct gcc ttc tac atg ctg aaa ggc aag ctt caa tat gag ctg cag tac 480 10 Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr agg aac cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc 528 Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile 15' 145 150 155 tca gtg gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa 576 Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys gac tcg agc tat gag ctg can gtg cgg gca ggg ccc atg cct ggc tcc 624 Asp Ser Ser Tyr Glu Leu Xaa Val Arg Ala Gly Pro Met Pro Gly Ser 2 5 tcc tac cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc tgt cag 672 Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Cys Gln acc cag tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt 720 3 0 Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu ctc ctc ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag 768 Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys acc cat cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc 816 Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser cct gag cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc 864 Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe 4 5 aag aaa tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga 912 Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly ccc tgg agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac 960 Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His cca cca cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa 1008 Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu cca gca gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg 1056 Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp ccg aca gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat 1104 Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp cgg cca tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca 1152 Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala gag ggg cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca 1200 15' Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro gcc ctg gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac 1248 Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp cca ctc ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca 1296 Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser get ggc agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga 1344 Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg 3 0 cta aag cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc 1392 Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro tgg ggt ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca 1440 3 5 Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser ccc ctg gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc 1488 Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly tct gac tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac 1536 Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp gaa gga ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg 1584 Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro cca ctt tcg agc cct gga ccc cag gcc agc taa 1617 Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser MPRGWAAPLLLLLLQGGWGCPDLVCYTDYLQTVICILEMWNLHPSTLTLTWQDQYEELKDEATSCSLHRSAHNATHA
TYTCHMDVFHFMADDIFSVNITDQSGNYSQXCGSFLLAESIKPAPPFNVTVTFSGQYNXSWRSDYEDPAFYMLKGKL
QYELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELXVRAGPMPGSSYQGTWSEWSDPVICQTQSEELK
EGWNPHLLLLLLLVIVFIPAFWSLKTHPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPWSP
EVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQNSGGSAYSEERDRPYGLVSIDTVTVLDAE
GPCTWPCSCEDDGYPALDLDAGLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLKPPLADGEDWAGG
LPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECDFTSPGDEGPPRSYLRQWWIPPPLSSPGPQAS
Polypeptide sequence comparison of DCRS3.2 and DCRS3.1:
DCRS3.2 1 MPRGWAAPLLLLLLQG--------------------------GWGCPDLV 24 DCRS3.1 1 MPRGWAAPLLLLLLQGALEGMERKLCSPKPPPTKASLPTDPPGWGCPDLV 50 **************** ********
DCRS3.2 25 CYTDYLQTVICILEMWNLHPSTLTLTW------------------QDQYE 56 DCRS3.1 51 CYTDYLQTVICILEMWNLHPSTLTLTWILSNNTGCYIKDRTLDLRQDQYE 100 *************************** *****
DCRS3.2 57 ELKDEATSCSLHRSAHNATHATYTCHMDVFHFMADDIFSVNITDQSGNYS 106 15' DCRS3.1 101 ELKDEATSCSLHRSAHNATHATYTCHMDVFHFMADDIFSVNITDQSGNYS 150 **************************************************
DCRS3.2 107 QXCGSFLLAESIKPAPPFNVTVTFSGQYNXSWRSDYEDPAFYMLKGKLQY 156 DCRS3.1 151 QECGSFLLAE--------------SRQYNISWRSDYEDPAFYMLKGKLQY 186 2 0 * ******** * *** ********************
DCRS3.2 157 ELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELXVRAGPMP 206 DCRS3.1 187 ELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELQVRAGPMP 236 ****************************************** *******
DCRS3.2 207 GSSYQGTWSEWSDPVICQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLK 256 DCRS3.1 237 GSSYQGTWSEWSDPVIFQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLK 286 **************** *********************************
3 O DCRS3.2 257 THPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPW 306 DCRS3.1 287 THPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPW 336 **************************************************
DCRS3.2 307 SPEVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQ 356 3 5 DCRS3.1 337 SPEVPSTLEWSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQ 386 **************************************************
DCRS3.2 357 NSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDA 406 DCRS3.1 387 NSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDA 436 4 0 **************************************************
DCRS3.2 407 GLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLKPPLADG 456 DCRS3.1 437 GLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLKPPLADG 486 **************************************************
DCRS3.2 457 EDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECD 506 DCRS3.1 487 EDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECD 536 **************************************************
5 O DCRS3.2 507 FTSPGDEGPPRSYLRQWWIPPPLSSPGPQAS 538 DCRS3.1 537 FTSPGDEGPPRSYLRQWWIPPPLSSPGPQAS 568 ********************************
Table 2: Reverse Translation of primate, e.g., human, DCRS3.1 (SEQ ID NO:
3). N may be A, C, G, or T.
ATGCCNMGNGGNTGGGCNGCNCCNYTNYTNYTNYTNYTNYTNCARGGNGCNYTNGARGGNATGGARMGNAARYTNTG
YWSNCCNAARCCNCCNCCNACNAARGCNWSNYTNCCNACNGAYCCNCCNGGNTGGGGNTGYCCNGAYYTNGTNTGYT
AYACNGAYTAYYTNCARACNGTNATHTGYATHYTNGARATGTGGAAYYTNCAYCCNWSNACNYTNACNYTNACNTGG
ATHYTNWSNAAYAAYACNGGNTGYTAYATHAARGAYMGNACNYTNGAYYTNMGNCARGAYCARTAYGARGARYTNAA
LPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECDFTSPGDEGPPRSYLRQWWIPPPLSSPGPQAS
Polypeptide sequence comparison of DCRS3.2 and DCRS3.1:
DCRS3.2 1 MPRGWAAPLLLLLLQG--------------------------GWGCPDLV 24 DCRS3.1 1 MPRGWAAPLLLLLLQGALEGMERKLCSPKPPPTKASLPTDPPGWGCPDLV 50 **************** ********
DCRS3.2 25 CYTDYLQTVICILEMWNLHPSTLTLTW------------------QDQYE 56 DCRS3.1 51 CYTDYLQTVICILEMWNLHPSTLTLTWILSNNTGCYIKDRTLDLRQDQYE 100 *************************** *****
DCRS3.2 57 ELKDEATSCSLHRSAHNATHATYTCHMDVFHFMADDIFSVNITDQSGNYS 106 15' DCRS3.1 101 ELKDEATSCSLHRSAHNATHATYTCHMDVFHFMADDIFSVNITDQSGNYS 150 **************************************************
DCRS3.2 107 QXCGSFLLAESIKPAPPFNVTVTFSGQYNXSWRSDYEDPAFYMLKGKLQY 156 DCRS3.1 151 QECGSFLLAE--------------SRQYNISWRSDYEDPAFYMLKGKLQY 186 2 0 * ******** * *** ********************
DCRS3.2 157 ELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELXVRAGPMP 206 DCRS3.1 187 ELQYRNRGDPWAVSPRRKLISVDSRSVSLLPLEFRKDSSYELQVRAGPMP 236 ****************************************** *******
DCRS3.2 207 GSSYQGTWSEWSDPVICQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLK 256 DCRS3.1 237 GSSYQGTWSEWSDPVIFQTQSEELKEGWNPHLLLLLLLVIVFIPAFWSLK 286 **************** *********************************
3 O DCRS3.2 257 THPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPW 306 DCRS3.1 287 THPLWRLWKKIWAVPSPERFFMPLYKGCSGDFKKWVGAPFTGSSLELGPW 336 **************************************************
DCRS3.2 307 SPEVPSTLEVYSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQ 356 3 5 DCRS3.1 337 SPEVPSTLEWSCHPPRSPAKRLQLTELQEPAELVESDGVPKPSFWPTAQ 386 **************************************************
DCRS3.2 357 NSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDA 406 DCRS3.1 387 NSGGSAYSEERDRPYGLVSIDTVTVLDAEGPCTWPCSCEDDGYPALDLDA 436 4 0 **************************************************
DCRS3.2 407 GLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLKPPLADG 456 DCRS3.1 437 GLEPSPGLEDPLLDAGTTVLSCGCVSAGSPGLGGPLGSLLDRLKPPLADG 486 **************************************************
DCRS3.2 457 EDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECD 506 DCRS3.1 487 EDWAGGLPWGGRSPGGVSESEAGSPLAGLDMDTFDSGFVGSDCSSPVECD 536 **************************************************
5 O DCRS3.2 507 FTSPGDEGPPRSYLRQWWIPPPLSSPGPQAS 538 DCRS3.1 537 FTSPGDEGPPRSYLRQWWIPPPLSSPGPQAS 568 ********************************
Table 2: Reverse Translation of primate, e.g., human, DCRS3.1 (SEQ ID NO:
3). N may be A, C, G, or T.
ATGCCNMGNGGNTGGGCNGCNCCNYTNYTNYTNYTNYTNYTNCARGGNGCNYTNGARGGNATGGARMGNAARYTNTG
YWSNCCNAARCCNCCNCCNACNAARGCNWSNYTNCCNACNGAYCCNCCNGGNTGGGGNTGYCCNGAYYTNGTNTGYT
AYACNGAYTAYYTNCARACNGTNATHTGYATHYTNGARATGTGGAAYYTNCAYCCNWSNACNYTNACNYTNACNTGG
ATHYTNWSNAAYAAYACNGGNTGYTAYATHAARGAYMGNACNYTNGAYYTNMGNCARGAYCARTAYGARGARYTNAA
RGAYGARGCNACNWSNTGYWSNYTNCAYMGNWSNGCNCAYAAYGCNACNCAYGCNACNTAYACNTGYCAYATGGAYG
TNTTYCAYTTYATGGCNGAYGAYATHTTYWSNGTNAAYATHACNGAYCARWSNGGNAAYTAYWSNCARGARTGYGGN
WSNTTYYTNYTNGCNGARWSNMGNCARTAYAAYATHWSNTGGMGNWSNGAYTAYGARGAYCCNGCNTTYTAYATGYT
NAARGGNAARYTNCARTAYGARYTNCARTAYMGNAAYMGNGGNGAYCCNTGGGCNGTNWSNCCNMGNMGNAARYTNA
THWSNGTNGAYWSNMGNWSNGTNWSNYTNYTNCCNYTNGARTTYMGNAARGAYWSNWSNTAYGARYTNCARGTNMGN
GCNGGNCCNATGCCNGGNWSNWSNTAYCARGGNACNTGGWSNGARTGGWSNGAYCCNGTNATHTTYCARACNCARWS
NGARGARYTNAARGARGGNTGGAAYCCNCAYYTNYTNYTNYTNYTNYTNYTNGTNATHGTNTTYATHCCNGCNTTYT
GGWSNYTNAARACNCAYCCNYTNTGGMGNYTNTGGAARAARATHTGGGCNGTNCCNWSNCCNGARMGNTTYTTYATG
CCNYTNTAYAARGGNTGYWSNGGNGAYTTYAARAARTGGGTNGGNGCNCCNTTYACNGGNWSNWSNYTNGARYTNGG
NCCNTGGWSNCCNGARGTNCCNWSNACNYTNGARGTNTAYWSNTGYCAYCCNCCNMGNWSNCCNGCNAARMGNYTNC
ARYTNACNGARYTNCARGARCCNGCNGARYTNGTNGARWSNGAYGGNGTNCCNAARCCNWSNTTYTGGCCNACNGCN
CARAAYWSNGGNGGNWSNGCNTAYWSNGARGARMGNGAYMGNCCNTAYGGNYTNGTNWSNATHGAYACNGTNACNGT
NYTNGAYGCNGARGGNCCNTGYACNTGGCCNTGYWSNTGYGARGAYGAYGGNTAYCCNGCNYTNGAYYTNGAYGCNG
GNYTNGARCCNWSNCCNGGNYTNGARGAYCCNYTNYTNGAYGCNGGNACNACNGTNYTNWSNTGYGGNTGYGTNWSN
15' GCNGGNWSNCCNGGNYTNGGNGGNCCNYTNGGNWSNYTNYTNGAYMGNYTNAARCCNCCNYTNGCNGAYGGNGARGA
YTGGGCNGGNGGNYTNCCNTGGGGNGGNMGNWSNCCNGGNGGNGTNWSNGARWSNGARGCNGGNWSNCCNYTNGCNG
GNYTNGAYATGGAYACNTTYGAYWSNGGNTTYGTNGGNWSNGAYTGYWSNWSNCCNGTNGARTGYGAYTTYACNWSN
CCNGGNGAYGARGGNCCNCCNMGNWSNTAYYTNMGNCARTGGGTNGTNATHCCNCCNCCNYTNWSNWSNCCNGGNCC
NCARGCNWSN
Reverse Translation of primate, e.g., human, DCRS3.2 (SEQ ID NO: 26). N may be A, C, G, or T.
ATGCCNMGNGGNTGGGCNGCNCCNYTNYTNYTNYTNYTNYTNCARGGNGGNTGGGGNTGYCCNGAYYTNGTNTGYTA
YACNGAYTAYYTNCARACNGTNATHTGYATHYTNGARATGTGGAAYYTNCAYCCNWSNACNYTNACNYTNACNTGGC
ARGAYCARTAYGARGARYTNAARGAYGARGCNACNWSNTGYWSNYTNCAYMGNWSNGCNCAYAAYGCNACNCAYGCN
ACNTAYACNTGYCAYATGGAYGTNTTYCAYTTYATGGCNGAYGAYATHTTYWSNGTNAAYATHACNGAYCARWSNGG
NAAYTAYWSNCARNNNTGYGGNWSNTTYYTNYTNGCNGARWSNATHAARCCNGCNCCNCCNTTYAAYGTNACNGTNA
CNTTYWSNGGNCARTAYAAYNNNWSNTGGMGNWSNGAYTAYGARGAYCCNGCNTTYTAYATGYTNAARGGNAARYTN
CARTAYGARYTNCARTAYMGNAAYMGNGGNGAYCCNTGGGCNGTNWSNCCNMGNMGNAARYTNATHWSNGTNGAYWS
NMGNWSNGTNWSNYTNYTNCCNYTNGARTTYMGNAARGAYWSNWSNTAYGARYTNNNNGTNMGNGCNGGNCCNATGC
CNGGNWSNWSNTAYCARGGNACNTGGWSNGARTGGWSNGAYCCNGTNATHTGYCARACNCARWSNGARGARYTNAAR
GARGGNTGGAAYCCNCAYYTNYTNYTNYTNYTNYTNYTNGTNATHGTNTTYATHCCNGCNTTYTGGWSNYTNAARAC
NCAYCCNYTNTGGMGNYTNTGGAAR.AARATHTGGGCNGTNCCNWSNCCNGARMGNTTYTTYATGCCNYTNTAYAARG
GNTGYWSNGGNGAYTTYAARAARTGGGTNGGNGCNCCNTTYACNGGNWSNWSNYTNGARYTNGGNCCNTGGWSNCCN
GARGTNCCNWSNACNYTNGARGTNTAYWSNTGYCAYCCNCCNMGNWSNCCNGCNAARMGNYTNCARYTNACNGARYT
NCARGARCCNGCNGARYTNGTNGARWSNGAYGGNGTNCCNAARCCNWSNTTYTGGCCNACNGCNCARAAYWSNGGNG
GNWSNGCNTAYWSNGARGARMGNGAYMGNCCNTAYGGNYTNGTNWSNATHGAYACNGTNACNGTNYTNGAYGCNGAR
GGNCCNTGYACNTGGCCNTGYWSNTGYGARGAYGAYGGNTAYCCNGCNYTNGAYYTNGAYGCNGGNYTNGARCCNWS
NCCNGGNYTNGARGAYCCNYTNYTNGAYGCNGGNACNACNGTNYTNWSNTGYGGNTGYGTNWSNGCNGGNWSNCCNG
GNYTNGGNGGNCCNYTNGGNWSNYTNYTNGAYMGNYTNAARCCNCCNYTNGCNGAYGGNGARGAYTGGGCNGGNGGN
YTNCCNTGGGGNGGNMGNWSNCCNGGNGGNGTNWSNGARWSNGARGCNGGNWSNCCNYTNGCNGGNYTNGAYATGGA
YACNTTYGAYWSNGGNTTYGTNGGNWSNGAYTGYWSNWSNCCNGTNGARTGYGAYTTYACNWSNCCNGGNGAYGARG
Nucleic acid sequence comparison of two DCRS3 embodiments:
DCRS3.2 1 ATGCCGCGTGGCTGGGCCGCCCCCTTGCTCCTGCTGCTGCTCCAGGGAGC 50 50 DCRS3.1 1 ATGCCGCGTGGCTGGGCCGCCCCCTTGCTCCTGCTGCTGCTCCAGGGA-- 49 ************************************************
DCRS3.2 51 CCTCGAGGGGATGGAGAGGAAGCTCTGCAGTCCCAAGCCACCCCCCACCA 100 DCRS3.1 49 -_________________________________________________ 49 DCRS3.2 101 AGGCCTCTCTCCCCACTGACCCTCCAGGCTGGGGCTGCCCCGACCTCGTC 150 DCRS3.1 50 ---------------------------GCTGGGGCTGCCCCGACCTCGTC 72 ***********************
DCRS3.2 151 TGCTACACCGATTACCTCCAGACGGTCATCTGCATCCTGGAAATGTGGAA 200 DCRS3.1 73 TGCTACACCGATTACCTCCAGACGGTCATCTGCATCCTGGAAATGTGGAA 122 **************************************************
DCRS3.2 201 CCTCCACCCCAGCACGCTCACCCTTACCTGGATACTTTCTAATAATACTG 250 DCRS3.1 123 CCTCCACCCCAGCACGCTCACCCTTACCTGG------------------- 153 *******************************
DCRS3.2 251 GGTGCTATATCAAGGACAGAACACTGGACCTCAGGCAAGACCAGTATGAA 300 15' DCRS3.1 154 -----------------------------------C~GACCAGTATGAA 168 ***************
DCRS3.2 301 GAGCTGAAGGACGAGGCCACCTCCTGCAGCCTCCACAGGTCGGCCCACAA 350 DCRS3.1 169 GAGCTGAAGGACGAGGCCACCTCCTGCAGCCTCCACAGGTCGGCCCACAA 218 2 0 **************************************************
DCRS3.2 351 TGCCACGCATGCCACCTACACCTGCCACATGGATGTATTCCACTTCATGG 400 DCRS3.1 219 TGCCACGCATGCCACCTACACCTGCCACATGGATGTATTCCACTTCATGG 268 **************************************************
DCRS3.2 401 CCGACGACATTTTCAGTGTCAACATCACAGACCAGTCTGGCAACTACTCC 450 DCRS3.1 269 CCGACGACATTTTCAGTGTCAACATCACAGACCAGTCTGGCAACTACTCC 318 **************************************************
3 O DCRS3.2 451 CAGGAGTGTGGCAGCTTTCTCCTGGCTGAGAGCA---------------- 484 DCRS3.1 319 CAGGANTGTGGCAGCTTTCTCCTGGCTGAGAGCATCAAGCCGGCTCCCCC 368 ***** ****************************
DCRS3.2 485 --------------------------GACAGTATAATATCTCCTGGCGCT 508 DCRS3.1 369 TTTCAACGTGACTGTGACCTTCTCAGGACAGTATAATATNTCCTGGCGCT 418 ************* **********
DCRS3.2 509 CAGATTACGAAGACCCTGCCTTCTACATGCTGAAGGGCAAGCTTCAGTAT 558 DCRS3.1 419 CAGATTACGAAGACCCTGCCTTCTACATGCTGAAAGGCAAGCTTCAATAT 468 4 0 ********************************** *********** ***
DCRS3.2 559 GAGCTGCAGTACAGGAACCGGGGAGACCCCTGGGCTGTGAGTCCGAGGAG 608 DCRS3.1 469 GAGCTGCAGTACAGGAACCGGGGAGACCCCTGGGCTGTGAGTCCGAGGAG 518 **************************************************
DCRS3.2 609 AAAGCTGATCTCAGTGGACTCAAGAAGTGTCTCCCTCCTCCCCCTGGAGT 658 DCRS3.1 519 AAAGCTGATCTCAGTGGACTCAAGAAGTGTCTCCCTCCTCCCCCTGGAGT 568 **************************************************
DCRS3.2 659 TCCGCAAAGACTCGAGCTATGAGCTGCAGGTGCGGGCAGGGCCCATGCCT 708 DCRS3.1 569 TCCGCAAAGACTCGAGCTATGAGCTGCANGTGCGGGCAGGGCCCATGCCT 618 **************************** *********************
DCRS3.2 709 GGCTCCTCCTACCAGGGGACCTGGAGTGAATGGAGTGACCCGGTCATCTT 758 DCRS3.1 619 GGCTCCTCCTACCAGGGGACCTGGAGTGAATGGAGTGACCCGGTCATCTG 668 *************************************************
DCRS3.2 759 TCAGACCCAGTCAGAGGAGTTAAAGGAAGGCTGGAACCCTCACCTGCTGC 808 DCRS3.1 669 TCAGACCCAGTCAGAGGAGTTAAAGGAAGGCTGGAACCCTCACCTGCTGC 718 **************************************************
DCRS3.2 809 TTCTCCTCCTGCTTGTCATAGTCTTCATTCCTGCCTTCTGGAGCCTGAAG 858 DCRS3.1 719 TTCTCCTCCTGCTTGTCATAGTCTTCATTCCTGCCTTCTGGAGCCTGAAG 768 **************************************************
10 DCRS3.2 859 ACCCATCCATTGTGGAGGCTATGGAAGAAGATATGGGCCGTCCCCAGCCC 908 DCRS3.1 769 ACCCATCCATTGTGGAGGCTATGGAAGAAGATATGGGCCGTCCCCAGCCC 818 **************************************************
DCRS3.2 909 TGAGCGGTTCTTCATGCCCCTGTACAAGGGCTGCAGCGGAGACTTCAAGA 958 15' DCRS3.1 819 TGAGCGGTTCTTCATGCCCCTGTACAAGGGCTGCAGCGGAGACTTCAAGA 868 **************************************************
DCRS3.2 959 AATGGGTGGGTGCACCCTTCACTGGCTCCAGCCTGGAGCTGGGACCCTGG 1008 DCRS3.1 869 AATGGGTGGGTGCACCCTTCACTGGCTCCAGCCTGGAGCTGGGACCCTGG 918 2 0 **************************************************
DCRS3.2 1009 AGCCCAGAGGTGCCCTCCACCCTGGAGGTGTACAGCTGCCACCCACCACG 1058 DCRS3.1 919 AGCCCAGAGGTGCCCTCCACCCTGGAGGTGTACAGCTGCCACCCACCACG 968 **************************************************
DCRS3.2 1059 GAGCCCGGCCAAGAGGCTGCAGCTCACGGAGCTACAAGAACCAGCAGAGC 1108 DCRS3.1 969 GAGCCCGGCCAAGAGGCTGCAGCTCACGGAGCTACAAGAACCAGCAGAGC 1018 **************************************************
3 0 DCRS3.2 1109 TGGTGGAGTCTGACGGTGTGCCCAAGCCCAGCTTCTGGCCGACAGCCCAG 1158 DCRS3.1 1019 TGGTGGAGTCTGACGGTGTGCCCAAGCCCAGCTTCTGGCCGACAGCCCAG 1068 **************************************************
DCRS3.2 1159 AACTCGGGGGGCTCAGCTTACAGTGAGGAGAGGGATCGGCCATACGGCCT 1208 3 5 DCRS3.1 1069 AACTCGGGGGGCTCAGCTTACAGTGAGGAGAGGGATCGGCCATACGGCCT 1118 **************************************************
DCRS3.2 1209 GGTGTCCATTGACACAGTGACTGTGCTAGATGCAGAGGGGCCATGCACCT 1258 DCRS3.1 1119 GGTGTCCATTGACACAGTGACTGTGCTAGATGCAGAGGGGCCATGCACCT 1168 4 0 **************************************************
DCRS3.2 1259 GGCCCTGCAGCTGTGAGGATGACGGCTACCCAGCCCTGGACCTGGATGCT 1308 DCRS3.1 1169 GGCCCTGCAGCTGTGAGGATGACGGCTACCCAGCCCTGGACCTGGATGCT 1218 **************************************************
DCRS3.2 1309 GGCCTGGAGCCCAGCCCAGGCCTAGAGGACCCACTCTTGGATGCAGGGAC 1358 DCRS3.1 1219 GGCCTGGAGCCCAGCCCAGGCCTAGAGGACCCACTCTTGGATGCAGGGAC 1268 **************************************************
5 O DCRS3.2 1359 CACAGTCCTGTCCTGTGGCTGTGTCTCAGCTGGCAGCCCTGGGCTAGGAG 1408 DCRS3.1 1269 CACAGTCCTGTCCTGTGGCTGTGTCTCAGCTGGCAGCCCTGGGCTAGGAG 1318 **************************************************
DCRS3.2 1409 GGCCCCTGGGAAGCCTCCTGGACAGACTAAAGCCACCCCTTGCAGATGGG 1458 5 5 DCRS3.1 1319 GGCCCCTGGGAAGCCTCCTGGACAGACTAAAGCCACCCCTTGCAGATGGG 1368 **************************************************
DCRS3.2 1459 GAGGACTGGGCTGGGGGACTGCCCTGGGGTGGCCGGTCACCTGGAGGGGT 1508 DCRS3.1 1369 GAGGACTGGGCTGGGGGACTGCCCTGGGGTGGCCGGTCACCTGGAGGGGT 1418 **************************************************
DCRS3.2 1509 CTCAGAGAGTGAGGCGGGCTCACCCCTGGCCGGCCTGGATATGGACACGT 1558 DCRS3.1 1419 CTCAGAGAGTGAGGCGGGCTCACCCCTGGCCGGCCTGGATATGGACACGT 1468 **************************************************
DCRS3.2 1559 TTGACAGTGGCTTTGTGGGCTCTGACTGCAGCAGCCCTGTGGAGTGTGAC 1608 DCRS3.1 1469 TTGACAGTGGCTTTGTGGGCTCTGACTGCAGCAGCCCTGTGGAGTGTGAC 1518 **************************************************
DCRS3.2 1609 TTCACCAGCCCCGGGGACGAAGGACCCCCCCGGAGCTACCTCCGCCAGTG 1658 15' DCRS3.1 1519 TTCACCAGCCCCGGGGACGAAGGACCCCCCCGGAGCTACCTCCGCCAGTG 1568 **************************************************
DCRS3.2 1659 GGTGGTCATTCCTCCGCCACTTTCGAGCCCTGGACCCCAGGCCAGCTAA 1707 DCRS3.1 1569 GGTGGTCATTCCTCCGCCACTTTCGAGCCCTGGACCCCAGGCCAGCTAA 1617 2 0 *************************************************
Table 3: Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiment 4 (DCRS4.1; cytor). Primate, e.g., human embodiment (see SEQ ID N0: 4 and 5). Predicted signal sequence indicated, but may vary 2 5 by a few positions and depending upon cell type.
atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln 3 5 agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 4 0 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa cct tat tac ggg agg agg ggc aaa aat aaa aat aaa ggg aat cct 336 Glu Pro Tyr Tyr Gly Arg Arg Gly Lys Asn Lys Asn Lys Gly Asn Pro 55 tgg ggg cca aaa caa agt aaa cgg aaa tca aag ggg aac cag aag acc 384 Trp Gly Pro Lys Gln Ser Lys Arg Lys Ser Lys Gly Asn Gln Lys Thr aac aca gtg act gcc cca get gcc ctg aag gca ttt get gga tgt gca 432 Asn Thr Val Thr Ala Pro Ala Ala Leu Lys Ala Phe Ala Gly Cys Ala aaa ata gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg 480 Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu ttg gta att ctc cat get cca aat tta cca tat aga tac caa aag gaa 528 Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu aaa aat gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt 576 15' Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe ata att aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get 624 Ile Ile Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala cac aga gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt 672 His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys gta gtg get gaa ata tat cag ccc atg tta gac aga aga agt cag aga 720 Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg 3 0 agt gaa gag aga tgt gtg gaa att cca tga 750 Ser Glu Glu Arg Cys Val Glu Ile Pro MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPGRALTGNSSVYFVQYKIYGQRQWKNKED
CWGTQELSCDLTSETSDIQEPYYGRRGKNKNKGNPWGPKQSKRKSKGNQKTNTVTAPAALKAFAGCAKIDPPVMNIT
QVNGSLLVILHAPNLPYRYQKEKNVSIEDYYELLYRVFIINNSLEKEQKVYEGAHRAVEIEALTPHSSYCWAEIYQ
PMLDRRSQRSEERCVEIP.
Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like 4 0 embodiments (DCRS4.2, cytorX700; SEQ ID NO: 27 and 28):
ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTCTTCCTTACTGGTGTAGCAGGAACTCAGTCAACGCA
TGAGTCTCTGAAGCCTCAGAGGGTACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCCGGGAGGG
CACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATATGGACAGAGACAATGGAAAAATAAAGAAGAC
TGTTGGGGTACTCAAGAACTCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAACCTTATTACGGGAGGGT
GAGGGCGGCCTCGGCTGGGAGCTACTCAGAATGGAGCATGACGCCGCGGTTCACTCCCTGGTGGGAAACAAAAATAG
ATCCTCCAGTCATGAATATAACCCAAGTCAATGGCTCTTTGTTGGTAATTCTCCATGCTCCAAATTTACCATATAGA
TACCAAAAGGAAAAA.AATGTATCTATAGAAGATTACTATGAACTACTATACCGAGTTTTTATAATTAACAATTCACT
AGAAAAGGAGCAAAAGGTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCTCTAACACCACACTCCAGCTACT
GTGTAGTGGCTGAAATATATCAGCCCATGTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCA
TGA
atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu 55 -20 -15 -to act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln ccc ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln 15' 60 65 70 75 gaa cct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Pro Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aca aaa ata 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Thr Lys Ile 2 5 gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg ttg gta 432 Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val att ctc cat get cca aat tta cca tat aga tac caa aag gaa aaa aat 480 3 0 Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt ata att 528 Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get cac aga 576 Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt gta gtg 624 Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val get gaa ata tat cag ccc atg tta gac aga aga agt cag aga agt gaa 672 Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu gag aga tgt gtg gaa att cca tga 696 Glu Arg Cys Val Glu Ile Pro >cytorX700 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPGRALTGNSSVYFVQYKIYGQRQWKNKED
CWGTQELSCDLTSETSDIQEPYYGRVRAASAGSYSEWSMTPRFTPWWETKIDPPVMNITQVNGSLLVILHAPNLPYR
YQKEKNVSIEDYYELLYRVFIINNSLEKEQKVYEGAHRAVEIEALTPHSSYCWAEIYQPMLDRRSQRSEERCVEIP
Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiments (DCRS4.3 cytorX600; SEQ ID NO: 30 and 31):
ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTTTTCCTTACTGGTGTAGCAGGAACTCAGTCAACGCA
TGAGTCTCTGAAGCCTCAGAGGGTACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCTGGGAGGG
CACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATATGGACAGAGACAATGGAAAAATAAAGAAGAC
TGTTGGGGTACTCAAGAACTCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAATCTTATTACGGGAGGGT
GAGGGCGGCCTCGGCTGGGAGCTACTCAGAATGGAGCATGACGCCGCGGTTCACTCCCTGGTGGGAAAGAGCAAAAG
GTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCTCTAACACCACACTCCAGCTACTGTGTAGTGGCTGAAAT
ATATCAGCCCACGTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCATGA
atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttt ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu 15' act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln 2 0 agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 2 5 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa tct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Ser Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser 4 0 gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aga gca aaa 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Arg Ala Lys ggt tta tgaaggggct cacagagcgg ttgaaattga agctctaaca ccacactcca 440 4 5 Gly Leu gctactgtgt agtggctgaa atatatcagc ccacgttaga cagaagaagt cagagaagtg 500 aagagagatg tgtggaaatt ccatga 526 >cytorX600 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPGRALTGNSSVYFVQYKIYGQRQWKNKED
CWGTQELSCDLTSETSDIQESYYGRVRAASAGSYSEWSMTPRFTPWWERAKGL.
5 5 Polypeptide sequence comparison of DCRS4.1, DCRS4.2 and DCRS4.3:
DCRS4.1 1 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPG 50 DCRS4.2 1 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPG 50 DCRS4.3 1 MMPKIICFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPG 50 5 **************************************************
DCRS4.1 51 RALTGNSSWFVQYKIYGQRQWKNKEDCWGTQELSCDLTSETSDIQEPYY 100 DCRS4.2 51 RALTGNSSWFVQYKIYGQRQWKNKEDCWGTQELSCDLTSETSDIQEPYY 100 DCRS4.3 51 RALTGNSSWFVQYKIYGQRQWKNKEDCWGTQELSCDLTSETSDIQESYY 100 10 *********************************************** **
DCRS4.1 101 GR-RGKNKNKGNPWGPKQSKRKSKGNQKTNTVTAPAALKAFAGCAKIDPP 149 DCRS4.2 101 GRVRAASAGSYSEWS--MTPRFTP-----------------WWETKIDPP 131 DCRS4.3 101 GRVRAASAGSYSEWS--MTPRFTP-----------------WWE------ 119 15' ** * _ * . * .
DCRS4.1 150 VMNITQVNGSLLVILHAPNLPYRYQKEKNVSIEDYYELLYRVFIINNSLE 199 DCRS4.2 132 VMNITQVNGSLLVILHAPNLPYRYQKEKNVSIEDYYELLYRVFIINNSLE 181 DCRS4.3 123 -------------------------RAKGL 130 20 .. *..
DCRS4.1 200 KEQKWEGAHRAVEIEALTPHSSYCWAEIYQPMLDRRSQRSEERCVEIP 249 DCRS4.2 182 KEQKVYEGAHRAVEIEALTPHSSYCWAEIYQPMLDRRSQRSEERCVEIP 231 DCRS4.3 131 130 Table 4: Reverse Translation of primate, e.g., human, DCRS4.1 (SEQ ID N0: 6).
N may be A, C, G, or T.
ATGATGCCNAARCAYTGYTTYYTNGGNTTYYTNATHWSNTTYTTYYTNACNGGNGTNGCNGGNACNCARWSNACNCA
YGARWSNYTNAARCCNCARMGNGTNCARTTYCARWSNMGNAAYTTYCAYAAYATHYTNCARTGGCARCCNGGNMGNG
CNYTNACNGGNAAYWSNWSNGTNTAYTTYGTNCARTAYAARATHTAYGGNCARMGNCARTGGAARAAYAARGARGAY
TGYTGGGGNACNCARGARYTNWSNTGYGAYYTNACNWSNGARACNWSNGAYATHCARGARCCNTAYTAYGGNMGNMG
NGGNAARAAYAARAAYAARGGNAAYCCNTGGGGNCCNAARCARWSNAARMGNAARWSNAARGGNAAYCARAAR.ACNA
AYACNGTNACNGCNCCNGCNGCNYTNAARGCNTTYGCNGGNTGYGCNAARATHGAYCCNCCNGTNATGAAYATHACN
CARGTNAAYGGNWSNYTNYTNGTNATHYTNCAYGCNCCNAAYYTNCCNTAYMGNTAYCARAARGARAARAAYGTNWS
NATHGARGAYTAYTAYGARYTNYTNTAYMGNGTNTTYATHATHAAYAAYWSNYTNGARAARGARCARAARGTNTAYG
ARGGNGCNCAYMGNGCNGTNGARATHGARGCNYTNACNCCNCAYWSNWSNTAYTGYGTNGTNGCNGARATHTAYCAR
CCNATGYTNGAYMGNMGNWSNCARMGNWSNGARGARMGNTGYGTNGARATHCCN
Reverse Translation of primate, e.g., human, DCRS4.2 (SEQ ID NO: 29).
N may be A, C, G, or T.
ATGATGCCNAARCAYTGYTTYYTNGGNTTYYTNATHWSNTTYTTYYTNACNGGNGTNGCNGGNACNCARWSNACNCA
YGARWSNYTNAARCCNCARMGNGTNCARTTYCARWSNMGNAAYTTYCAYAAYATHYTNCARTGGCARCCNGGNMGNG
CNYTNACNGGNAAYWSNWSNGTNTAYTTYGTNCARTAYAARATHTAYGGNCARMGNCARTGGAARAAYAARGARGAY
TGYTGGGGNACNCARGARYTNWSNTGYGAYYTNACNWSNGARACNWSNGAYATHCARGARCCNTAYTAYGGNMGNGT
NMGNGCNGCNWSNGCNGGNWSNTAYWSNGARTGGWSNATGACNCCNMGNTTYACNCCNTGGTGGGARACNAARATHG
AYCCNCCNGTNATGAAYATHACNCARGTNAAYGGNWSNYTNYTNGTNATHYTNCAYGCNCCNAAYYTNCCNTAYMGN
TAYCARAARGA~2AARAAYGTNWSNATHGARGAYTAYTAYGARYTNYTNTAYMGNGTNTTYATHATHAAYAAYWSNYT
NGARAARGARCARAARGTNTAYGARGGNGCNCAYMGNGCNGTNGARATHGARGCNYTNACNCCNCAYWSNWSNTAYT
GYGTNGTNGCNGARATHTAYCARCCNATGYTNGAYMGNMGNWSNCARMGNWSNGARGARMGNTGYGTNGARATHCCN
55 Reverse Translation of primate, e.g., human, DCRS4.3 (SEQ ID N0: 32). N may be A, C, G, or T.
ATGATGCCNAARCAYTGYTTYYTNGGNTTYYTNATHWSNTTYTTYYTNACNGGNGTNGCNGGNACNCARWSNACNCA
YGARWSNYTNAARCCNCARMGNGTNCARTTYCARWSNMGNAAYTTYCAYAAYATHYTNCARTGGCARCCNGGNMGNG
CNYTNACNGGNAAYWSNWSNGTNTAYTTYGTNCARTAYAARATHTAYGGNCARMGNCARTGGAARAAYAARGARGAY
TGYTGGGGNACNCARGARYTNWSNTGYGAYYTNACNWSNGARACNWSNGAYATHCARGARWSNTAYTAYGGNMGNGT
NMGNGCNGCNWSNGCNGGNWSNTAYWSNGARTGGWSNATGACNCCNMGNTTYACNCCNTGGTGGGARMGNGCNAARG
GNYTN
Nucleic acid sequence comparison of three DCRS4 embodiments:
DCRS4.1 1 ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTCTTCCTTAC 50 DCRS4.2 1 ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTCTTCCTTAC 50 DCRS4.3 1 ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTTTTCCTTAC 50 ***************************************** ********
DCRS4.1 51 TGGTGTAGCAGGAACTCAGTCAACGCATGAGTCTCTGAAGCCTCAGAGGG 100 DCRS4.2 51 TGGTGTAGCAGGAACTCAGTCAACGCATGAGTCTCTGAAGCCTCAGAGGG 100 DCRS4.3 51 TGGTGTAGCAGGAACTCAGTCAACGCATGAGTCTCTGAAGCCTCAGAGGG 100 15' **************************************************
DCRS4.1 101 TACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCTGGG 150 DCRS4.2 101 TACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCCGGG 150 DCRS4.3 101 TACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCTGGG 150 2 0 ********************************************** ***
DCRS4.1 151 AGGGCACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATA 200 DCRS4.2 151 AGGGCACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATA 200 DCRS4.3 151 AGGGCACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATA 200 2 5 **************************************************
DCRS4.1 201 TGGACAGAGACAATGGAAAAATAAAGAAGACTGTTGGGGTACTCAAGAAC 250 DCRS4.2 201 TGGACAGAGACAATGGAAAAATAAAGAAGACTGTTGGGGTACTCAAGAAC 250 DCRS4.3 201 TGGACAGAGACAATGGAAAAATAAAGAAGACTGTTGGGGTACTCAAGAAC 250 3 0 **************************************************
DCRS4.1 251 TCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAACCTTATTAC 300 DCRS4.2 251 TCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAACCTTATTAC 300 DCRS4.3 251 TCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAATCTTATTAC 300 3 5 ***************************************** ********
DCRS4.1 301 GGGAGGAGGGGCAAAAATAAAAATAAAGGGAATCCTTGGGGGCCAAAACA 350 DCRS4.2 301 GGGAGGGTG-----------------AGGGCGGCCTCGGC---------- 323 DCRS4.3 301 GGGAGGGTG-----------------AGGGCGGCCTCGGC---------- 323 4 0 ****** * **** *** **
DCRS4.1 351 AAGTAAACGGAAATCAAAGGGGAACCAGAAGACCAACACAGTGACTGCCC 400 DCRS4.2 324 ---TGGGAGCTACTCAGAATGGAGCATGA-------CGCCGCGGTTCACT 363 DCRS4.3 324 ---TGGGAGCTACTCAGAATGGAGCATGA-------CGCCGCGGTTCACT 363 45 * * * *** * *** * **
DCRS4.1 401 CAGCTGCCCTGAAGGCATTTGCTGGATGTGCAAAAATAGATCCTCCAGTC 450 DCRS4.2 364 C------CCTGGTGGGAA-----------ACAAAAATAGATCCTCCAGTC 396 DCRS4.3 364 C------CCTGGTGGGAAAGAGCAAAAGGTTTATGAAGGGGCTCACAGA- 406 50 * **** ** * * * * * ***
DCRS4.1 451 ATGAATATAACCCAAGTC--AATGGCTCTTTGTTGGTAATTCTCCATGCT 498 DCRS4.2 397 ATGAATATAACCCAAGTC--AATGGCTCTTTGTTGGTAATTCTCCATGCT 444 DCRS4.3 407 GCGGTTGAAATTGAAGCTCTAACACCACACTCCAGCTACTGTGTAGTGGC 456 55 * * ** *** ** * * * * ** * **
DCRS4.1 499 CCAAATTTACCATATAGATACCAAAAGGAAAAAAATGTATCTATAGAAGA 548 DCRS4.2 445 CCAAATTTACCATATAGATACCAAAAGGAAAAAAATGTATCTATAGAAGA 494 DCRS4.3 457 TGAAATATATCA-GCCCACGTTAGACAGAAGAAGTCAGAGAAGT-GAAGA 504 **** ** ** * * * *** ** * * *****
DCRS4.1 549 TTACTATGAACTACTATACCGAGTTTTTATAATTAACAATTCACTAGAAA 598 DCRS4.2 495 TTACTATGAACTACTATACCGAGTTTTTATAATTAACAATTCACTAGAAA 544 DCRS4.3 505 GAGATGTGTGGAAATTCCATGA 526 * ** * * **
DCRS4.1 599 AGGAGCAAAAGGTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCT 648 DCRS4.2 545 AGGAGCAAAAGGTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCT 594 DCRS4.3 527 526 15~
DCRS4.1 649 CTAACACCACACTCCAGCTACTGTGTAGTGGCTGAAATATATCAGCCCAT 698 DCRS4.2 595 CTAACACCACACTCCAGCTACTGTGTAGTGGCTGAAATATATCAGCCCAT 644 DCRS4.3 527 526 DCRS4.1 699 GTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCAT 748 DCRS4.2 645 GTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCAT 694 DCRS4.3 527 526 DCRS4.1 749 GA 750 DCRS4.2 695 GA 696 DCRS4.3 527 526 Table 5: Alignment of various cytokine receptor subunits with DCRS3.1.
IL-2R is SEQ ID NO: 7; IL-9R is SEQ ID NO: 8; GM/IL-3/5 receptor b subunit common (ILRbc) is SEQ ID N0: 9; TPOR is SEQ ID NO: 10; and IL-7R
3 5 is SEQ ID NO: 11 (see GenBank).
DCRS3_HU LCS--PKPPPT----KASLPTDPPGWGC-PDLVCYTDYLQTVICILEMWN--LHP--STL
4 O ILRbc HU ILTPNGNEDTTADFFLTTMPTDSLSVST-LPLPEVQCFVFNVEYMNCTWNSSSEPQPTNL
TPOR_HU LLASDSEPLKC---FSRTFEDLTCFWDE-EEAAPSGTYQLLYAYPREKPR--ACP--LSS
HU
HU TLTWILSNNTGCYIKDR-----TLDLRQ-DQYE--ELKDEA-TSCSLHR-----SAHNAT
_ SSPWLLFTSNQAPG----G-THKCILR--GSECTWLPPE--AVLVPSD--------NFT
HU TLHYWYKNSDNDK-------VQKCSHY--------LFSEEITSGCQLQK-K---EIHLYQ
ILRbc _ QSMPHFGTRYVCQFPDQ--EEVRLFFPLHLWVKNVFLNQTRTQRVLFVDSVGLPAPPSII
TPOR
HU
_ TTNLEFEICGALV------EVKCLNFR--------KLQEIYFIETKKFL---------LI
HU
HU HATYTCHMDVFHF----MADDIFS--VNITDQSGNYSQECGSFLLAESRQYNIS---WRS
_ ITFHHCMSGREQVS---LVDPEYLPRRHVKLDPPSDLQS-----NISSGHCILT---WSI
HU
HU TFWQLQDPREPRR---QATQMLKLQNLVIPWAPENLTL----HKLSESQLELN---WNN
ILRbc _ KAMGGSQPGELQISWEEPAPEISDFLRYELRYGPRDPKNS---TGPTVIQLIATETCCPA
HU
TPOR
_ GKSNICVK-VGEKS---LTCKKIDLTTIVKPEAPFDLSVI---YREGANDFWT---FNT
HU DYEDPAFYMLKGKLQYELQYRNRGDPWAVSPRRKLIS-----VDSRSVSLLPLEFRKDSS
_ SPALEPMTTLLS---YELAFKKQEEAWEQAQHRDHIV-----G-VTWLILEAFELDPGFI
HU RFLNHCLEHLV-------QYRTD---WDHSWTEQSV------DYRHKFSLPSVDGQKRYT
ILRbc _ LQRPHSASALD-----QSPCAQPTMPWQDGPKQTSPSREASALTAEGGSCLISGLQPGNS
HU
TPOR
_ SHLQKKWKVLMHD-VAYRQEKDENKWTHVNLSST---------KLTLLQRK--LQPAAM
DCRS3_HU YELQVRAGPMPGSSYQG-------TWSEWSDPVIFQTQSEELK
ILRbc_HU FRVRSRFNPLCGSAQ---------HWSEWSHPIHWGSNTSKEN
TPOR_HU YWLQLRSEPDGISLGG--------SWGSWSLPVTVDLPGDAVA
15' IL-7R HU YEIKVRSIPDHYFKG---------FWSEWSPSYYFRTPEINNS
- . . * *, ** , Alignment of various cytokine receptor subunits with DCRS4.1. IL-lORb is the beta subunit of IL-lOR, human is SEQ ID N0: 12, mouse is SEQ ID NO: 13; INaRl 2 0 is the beta subunit of IFNa with human SEQ ID N0: 14 and mouse SEQ ID N0:
15;
INgR is interferon gamma receptor subunit alpha with human SEQ ID NO: 16 and mouse SEQ ID NO: 17; IL-lORa is the alpha receptor subunit with mouse SEQ ID
N0: 18 and human SEQ ID NO: 19; INgS (SEQ ID NO: 20) is the beta receptor subunit for INFg; Zcytor7 (SEQ ID N0: 21) and CYTOR11 (SEQ ID NO: 22) are 2 5 from patent filings from Zymogenetics, and INaR2 (SEQ ID NO: 23) is the beta subunit of the receptor for IFNa.
IL-lORb Hu PENVRMNSVNFKNILQWES-PAFAKGNL--TFTAQYLSY---------RIFQDKCMNTTL
Mu PEKVRMNSVNFKNILQWEV-PAFPKTNL--TFTAQYESY---------RSFQDHCKRTAS
3 O IL-lORb _ PQKVEVDIIDDNFILRWNR-SDESVGNV--TFSFDYQKTGMD-----NWIKLSGCQNITS
HU
INaRl _ PENIDWIIDDNYTLKWSS-HGESMGSV--TFSAEYRTKDEA-----KWLKVPECQHTTT
INaRl MU
_ PTNVTIESYNMNPIVYWEY-QIMPQVP---VFTVEVKNYGVK-----NSEWIDACINISH
INgR HU
INgR MU PTNVLIKSYNLNPWCWEY-QNMSQTP---IFTVQVKVY--------SGSWTDSCTNISD
Mu PSYVWFEARFFQHILHWKP-IPNQSEST--YYEVALKQYGNS-----TWNDIHICRKAQA
3 5 IL-lORa _ PPSVWFEAEFFHHILHWTP-IPNQSEST--CYEVALLRYGIE-----SWNSISNC--SQT
IL-lORa Hu INgS HU PLNPRLHLYNDEQILTWEP-SPSSNDPRPWYQVEYSFIDGSW----HRLLEPNCTDITE
Zcytor7 Hu PANITFLSINMKNVLQWTPPEGLQGVKV--TYTVQYFIYGQK-----KWLNKSECRNINR
HU LQHVKFQSSNFENILTWDS-GPEGTPDT--VYSIEYKTYGER-----DWVAKKGCQRITR
_ SCTFKISLRNFRSILSWEL-KNHSIVPTHYTLLYTIMSKPE------DLKWKNCANTTR
HU
4 0 INaR2 _ PQRVQFQSRNFHNILQWQPGRALTGNSS--VYFVQYKIYGQR-----QWKNKEDCWGTQE
Hu TECDFSSLSK------YGDHTLRVRAEFADEHSDWVNIT-FCPVDDTIIGPPG--MQVEV
iL-lORb _ TQCDFSHLSK------YGDYTVRVRAELADEHSEWVNVT-FCPVEDTIIGPPE--MQIES
Mu 4 5 IL-lORb _ TKCNFSSLKLN----VYEEIKLRIRAEKEN-TSSWYEVDSFTPFRKAQIGPPE--VHLEA
HU
INaRl _ TKCEFSLLDTN----VYIKTQFRVRAEEGNSTSSWNEVDPFIPFYTAHMSPPE--VRLEA
INaRl MU
_ HYCNISDHVGDP----SNSLWVRVKARVGQKESAYAKSEEFAVCRDGKIGPPKLDIR-KE
INgR HU
INgR MU HCCNIYGQIMYP----DVSAWARVKAKVGQKESDYARSKEFLMCLKGKVGPPGLEIRRKK
Mu LSCDLTTFTLDLYHR-SYGYRARVRAVDNSQYSNWTTTETRFTVDEVILTVDS--VTLKA
5 O IL-lORa _ LSYDLTAVTLDLYH--SNGYRARVRAVDGSRHSNWTVTNTRFSVDEVTLTVGS--VNLEI
Hu IL-lORa _ TKCDLTGGGRLKLFPHPFTVFLRVRAKRGNLTSKWVGLEPFQHYENVTVGPPKN-ISVTP
INgS HU
Zcytor7 Hu TYCDLSAETSDY----EHQYYAKVKAIWGTKCSKWAESGRFYPFLETQIGPPE--VALTT
HU KSCNLTVETGN----LTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDV-TCISK
_ SFCDLTDEWRS-----THEAYVTVLEGFSGNTTLFSCSHNFWLAIDMSFEPPE--FEIVG
HU
55 INaR2 _ LSCDLTSETSD-----IQEPYYGRRGKNKNKGNPWGPKQSKRKSKGNQKTNTVT-APAAL
Hu LADSLHMRFLAPKIENEYE---TWTMKNVYNSWTYNVQYWKNGTDEKFQ-ITPQYDFEVL
IL-lORb _ LAESLHLRFSAPQIENEPE---TWTLKNIYDSWAYRVQYWKNGTNEKFQ-WSPYDSEVL
Mu IL-lORb _ EDKAIVIHISPGTKDSV-----MWALD--GLSFTYSLLIWKNSSGVEER-IENIYSRHKI
HU
INaRl _ EDKAILVHISPPGQDGN-----MWALE--KPSFSYTIRIWQKSSSDKKT-INSTYYVEKI
MU
INaRl _ EKQIMIDIFHPSVFVNGDEQEVDYDPETTCYIRWNVYVRMNGS-EIQY-KILTQKEDDC
INgR HU
INgR MU EEQLSVLVFHPEVWNGESQGTMFGDGSTCYTFDYTVWEHNRSGEILH-TKHTVEKEEC
Mu MDGIIYGTIHPPRPTITPA--GDEYEQVFKDLRWKISIRKFS--ELKN-ATKRVKQETF
IL-lORa _ HNGFILGKIQLPRPKMAPA--NDTYESIFSHFREYEIAIRKVPG-NFTF-THKKVKHENF
IL-lORa Hu INgS HU GKGSLVIHFSPPFDVFHG------------ATFQYLVHYWEKSETQQEQ-VEGPFKSNSI
Zcytor7 Hu DEKSISWLTAPEKWKRNPEDLPVSMQQIYSNLKYNVSVLNTKSNRTWS-QCVTNHTLVL
HU VRSIQMIVHPTPTPIRAGDG-HRLTLEDIFHDLFYHLELQVNRTYQMHL-GGKQREYEFF
_ FTNHINVMVKFPSIVEEEL-------Q---FDLSLVIEEQSEGIVKKHKPEIKGNMSGNF
HU
INaR2 _ KAFAGCAKIDPPVMNITQ------------VNGSLLVILHAPNLPYRYQ-KEKNVSIEDY
15' IL-lORb_Hu RN-----------------LEPWTTYCVQVRGFLPDRN----------KAGEWSEPVCEQ
IL-lORb_Mu RN-----------------LEPWTTYCIQVQGFLLDQN----------RTGEWSEPICER
INaRl HU YK-----------------LSPETTYCLKVKAALLTSW----------KIGWSPVHCIK
MU PE-----------------LLPETTYCLEVKAIHPSLK----------KHSNYSTVQCIS
2 0 INaRl _ DEIQCQLAI--------PVSSLNSQYCVSAEGVLHVWG----------VTTEKSKEVCIT
INgR HU
INgR MU NETLCELNI--------SVSTLDSRYCISVDGISSFWQ----------VRTEKSKDVCIP
Mu TLT---------------VPIGVRKFCVKVLPRLESRI----------NKAEWSEEQCLL
IL-lORa _ SLL---------------TSGEVGEFCVQVKPSVASRS----------NKGMWSKEECIS
Hu IL-lORa _ VLG---------------NLKPYRWCLQTEAQLILKNKK------IRPHGLLSNVSCHE
2 5 INgS HU
Hu TW-----------------LEPNTLYCVHVESFVPGPP----------RRAQPSEKQCAR
Zcytor7 _ GLTPDTEFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYL
HU
_ TYIID-------------KLIPNTNYCVSVYLEHSDEQ-----------AVIKSPLKCTL
INaR2 HU
_ YE-_--_-__-_____-_____--_LLYRVFIINNSLE-_--_-_-_-~Q~yEGAHRA
IL-lORb Hu TTHDETVP-IL-lORb_Mu TGNDEITP-INaRl_HU TTVENELPP
3 5 INaRl_MU TTVANKMPV
INgR HU IFNSSIKG-INgR MU PFHDDRKD-IL-lORa_Mu ITTEQYFT-IL-lORa_Hu LT-RQYFT-4 0 INgS HU TTANASAR
Zcytor7_Hu TLKDQSSE
CYTOR11_HU SYRYVTKPP
INaR2_HU LPPGQESES
Table 5 shows comparison of sequences of cytokine receptor subunits with the primate, e.g., human, DCRS3.1 (50R), and DCRS4.1 (cytor). Both of the new genes are likely alpha type receptor subunits, and thus should bind to ligand without the need for a beta subunit. Based upon structural features, the ligand for the DCRS3 subunits are likely to be a member of the family of cytokines which includes IL-2, IL-4, IL-7, IL-9, and the additional cytokines which signal through IL-2y common receptor-like subunits IL-13, IL-15, and the TSLP ligand.
Similarly, the ligand for the DCRS4 receptor subunits are probably a ligand in the IL-10 or IFN families, which may be a multi-subunit cytokine, analogous to IL-6 and IL-12.
5 As used herein, the term DCRS3 shall be used to describe a protein comprising an amino acid sequence shown in Table 1;
likewise with DCRS4 and Table 3. In many cases, a substantial fragment thereof will be functionally or structurally equivalent, including, e.g., an extracellular or intracellular 10~ domain. The invention also includes a protein variation of a DCRS3 allele whose sequence is provided, e.g., a mutein or soluble extracellular construct. Typically, such agonists or antagonists will exhibit less than about 10°s sequence differences, and thus will often have between 1- and 11-fold 15 substitutions, e.g., 2-, 3-, 5-, 7-fold, and others. It also encompasses allelic and other variants, e.g., natural polymorphic, of the protein described. Typically, it will bind to its corresponding biological ligand, perhaps in a dimerized state with an alpha receptor subunit, with high affinity, e.g., 20 at least about 100 nM, usually better than about 30 nM, preferably better than about 10 nM, and more preferably at better than about 3 nM. The term shall also be used herein to refer to related naturally occurring forms, e.g., alleles, polymorphic variants, and metabolic variants of the mammalian 25 protein. Preferred forms of the receptor complexes will bind the appropriate ligand with an affinity and selectivity appropriate for a ligand-receptor interaction.
This invention also encompasses combinations of proteins or peptides having substantial amino acid sequence identity with 30 the amino acid sequences in Tables 1 or 3. It will include sequence variants with relatively few substitutions, e.g., preferably less than about 3-5.
A substantial polypeptide "fragment", or "segment", is a stretch of amino acid residues of at least about 8 amino acids, generally at least 10 amino acids, more generally at least 12 amino acids, often at least 14 amino acids, more often at least 16 amino acids, typically at least 18 amino acids, more typically at least 20 amino acids, usually at least 22 amino acids, more usually at least 24 amino acids, preferably at least 26 amino acids, more preferably at least 28 amino acids, and, in particularly preferred embodiments, at least about 30 or more amino acids. Sequences of segments of different proteins can be compared to one another over appropriate length stretches. In many situations, fragments may exhibit functional properties of the intact subunits, e.g., the extracellular domain of the transmembrane receptor may retain the ligand binding features, 10~ and may be used to prepare a soluble receptor-like complex.
Amino acid sequence homology, or sequence identity, is determined by optimizing residue matches. In some comparisons, gaps may be introduces, as required. See, e.g., Needleham, et al., (1970) J. Mol. Biol. 48:443-453; Sankoff, et al., (1983) chapter one in Time Warps String Edits and Macromolecules: The Theory and Practice of Seauence Comparison, Addison-Wesley, Reading, MA; and software packages from IntelliGenetics, Mountain View, CA; and the University of Wisconsin Genetics Computer Group (GCG), Madison, WI; each of which is incorporated herein by reference. This changes when considering conservative substitutions as matches. Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Homologous amino acid sequences are intended to include natural allelic and interspecies variations in the cytokine sequence. Typical homologous proteins or peptides will have from 50-100% homology (if gaps can be introduced), to 60-100% homology (if conservative substitutions are included) with an amino acid sequence segment of Table 1 or 3. Homology measures will be at least about 70%, generally at least 76%, more generally at least 81%, often at least 85%, more often at least 88%, typically at least 90%, more typically at least 92%, usually at least 94%, more usually at least 95%, preferably at least 96%, and more preferably at least 97%, and in particularly preferred embodiments, at least 98% or more. The degree of homology will vary with the length of the compared segments. Homologous proteins or peptides, such as the allelic variants, will share most biological activities with the embodiments described in Table 1 or 3.
As used herein, the term "biological activity" is used to describe, without limitation, effects on inflammatory responses, innate immunity, and/or morphogenic development by cytokine-like ligands. For example, these receptors should mediate phosphatase or phosphorylase activities, which activities are 10~ easily measured by standard procedures. See, e.g., Hardie, et al. (eds. 1995) The Protein Kinase FactBook vols. I and II, Academic Press, San Diego, CA; Hanks, et al. (1991) Meth.
Enzymol. 200:38-62; Hunter, et al. (1992) Cell 70:375-388; Lewin (1990) Cell 61:743-752; Pines, et al. (1991) Cold Spring Harbor Sump. Ouant. Biol. 56:449-463; and Parker, et al. (1993) Nature 363:736-738. The receptors, or portions thereof, may be useful as phosphate labeling enzymes to label general or specific substrates. The subunits may also be functional immunogens to elicit recognizing antibodies, or antigens capable of binding antibodies.
The terms ligand, agonist, antagonist, and analog of, e.g., a DCRS3, include molecules that modulate the characteristic cellular responses to cytokine ligand proteins, as well as molecules possessing the more standard structural binding competition features of ligand-receptor interactions, e.g., where the receptor is a natural receptor or an antibody. The cellular responses likely are typically mediated through receptor tyrosine kinase pathways.
Also, a ligand is a molecule which serves either as a natural ligand to which said receptor, or an analog thereof, binds, or a molecule which is a functional analog of the natural ligand. The functional analog may be a ligand with structural modifications, or may be a wholly unrelated molecule which has a molecular shape which interacts with the appropriate ligand binding determinants. The ligands may serve as agonists or antagonists, see, e.g., Goodman, et al. (eds. 1990) Goodman &
Gilman's~ The Pharmacological Bases of Therapeutics, Pergamon Press, New York.
Rational drug design may also be based upon structural studies of the molecular shapes of a receptor or antibody and other effectors or ligands. See, e.g., Herz, et al. (1997) J.
ReceQt Sianal Transduct. Res. 17:671-776; and Chaiken, et al.
(1996) Trends Biotechnol. 14:369-375. Effectors may be other proteins which mediate other functions in response to ligand binding, or other proteins which normally interact with the 10~ receptor. One means for determining which sites interact with specific other proteins is a physical structure determination, e.g., x-ray crystallography or 2 dimensional NMR techniques.
These will provide guidance as to which amino acid residues form molecular contact regions. For a detailed description of protein structural determination, see, e.g., Blundell and Johnson (1976) Protein Crystallography, Academic Press, New York, which is hereby incorporated herein by reference.
II. Activities The cytokine receptor-like proteins will have a number of different biological activities, e.g., modulating cell proliferation, or in phosphate metabolism, being added to or removed from specific substrates, typically proteins. Such will generally result in modulation of an inflammatory function, other innate immunity response, or a morphological effect, as typical of cytokine or interleukin signaling. The subunit may have a specific low affinity binding to the ligand.
The receptors may signal through the JAK pathway. See, e.g., Ihle, et al. (1997) Stem Cells 15(suppl. 1):105-111;
Silvennoinen, et al. (1997) APMIS 105:497-509; Levy (1997) Cytokine Growth Factor Review 8:81-90; Winston and Hunter (1996) Current Biol. 6:668-671; Barrett (1996) Baillieres Clin.
Gastroenterol. 10:1-15; and Briscoe, et al. (1996) Philos.
Trans. R. Soc. Lond. B. Biol. Sci. 351:167-171.
The biological activities of the cytokine receptor subunits will be related to addition or removal of phosphate moieties to substrates, typically in a specific manner, but occasionally in a non specific manner. Substrates may be identified, or conditions for enzymatic activity may be assayed by standard methods, e.g., as described in Hardie, et al. (eds. 1995) The Protein Kinase FactBook vols. I and II, Academic Press, San Diego, CA; Hanks, et al. (1991) Meth. Enzymol. 200:38-62;
Hunter, et al. (1992) Cell 70:375-388; Lewin (1990) Cell 61:743-752; Pines, et al. (1991) Cold Spring Harbor Symp. Ouant. Biol.
56:449-463; and Parker, et al. (1993) Nature 363:736-738.
The receptor subunits may combine with other subunits, 10~ e.g., beta subunits, to form functional complexes, e.g., which may be useful for binding ligand or preparing antibodies. These will have substantial diagnostic uses, including detection or quantitation.
ILI. Nucleic Acids This invention contemplates use of isolated nucleic acid or fragments, e.g., which encode these or closely related proteins, or fragments thereof, e.9., to encode a corresponding polypeptide_, preferably one which is biologically active. In addition, this invention covers isolated or recombinant DNAs which encode combinations of such proteins or polypeptides having characteristic sequences, e.g., of DCRS3s or DCRS4s.
Typically, the nucleic acid is capable of hybridizing, under appropriate conditions, with a nucleic acid sequence segment shown in Tables 1 or 3, but preferably not with a corresponding segment of other receptors, e.g., described in Table 5. Said biologically active protein or polypeptide can be a full length protein, or fragment, and will typically have a segment of amino acid sequence highly homologous, e.g., exhibiting significant stretches of identity, to ones shown in Tables 1 or 3. Further, this invention covers the use of isolated or recombinant nucleic acid, or fragments thereof, which encode proteins having fragments which are equivalent to DCRS3 or DCRS4 proteins. The isolated nucleic acids can have the respective regulatory sequences in the 5' and 3' flanks, e.g., promoters, enhancers, poly-A addition signals, and others from the natural gene.
Combinations, as described, are also provided.
An "isolated" nucleic acid is a nucleic acid, e.g., an RNA, DNA, or a mixed polymer, which is substantially pure, e.g., separated from other components which naturally accompany a native sequence, such as ribosomes, polymerases, and flanking 5 genomic sequences from the originating species. The term embraces a nucleic acid sequence which has been removed from its naturally occurring environment, and includes recombinant or cloned DNA isolates, which are thereby distinguishable from naturally occurring compositions, and chemically synthesized 10~ analogs or analogs biologically synthesized by heterologous systems. A substantially pure molecule includes isolated forms of the molecule, either completely or substantially pure.
An isolated nucleic acid will generally be a homogeneous composition of molecules, but will, in some embodiments, contain 15 heterogeneity, preferably minor. This heterogeneity is typically found at the polymer ends or portions not critical to a desired biological function or activity.
A "recombinant" nucleic acid is typically defined either by its method of production or its structure. In reference to its 20 method of production, e.g., a product made by a process, the process is use of recombinant nucleic acid techniques, e.g., involving human intervention in the nucleotide sequence.
Typically this intervention involves in vitro manipulation, although under certain circumstances it may involve more 25 classical animal breeding techniques. Alternatively, it can be a nucleic acid made by generating a sequence comprising fusion of two fragments which are not naturally contiguous to each other, but is meant to exclude products of nature, e.g., naturally occurring mutants as found in their natural state.
30 Thus, for example, products made by transforming cells with an unnaturally occurring vector is encompassed, as are nucleic acids comprising sequence derived using any synthetic oligonucleotide process. Such a process is often done to replace a codon with a redundant codon encoding the same or a 35 conservative amino acid, while typically introducing or removing a restriction enzyme sequence recognition site. Alternatively, the process is performed to join together nucleic acid segments of desired functions to generate a single genetic entity comprising a desired combination of functions not found in the commonly available natural forms, e.g., encoding a fusion protein. Restriction enzyme recognition sites are often the target of such artificial manipulations, but other site specific targets, e.g., promoters, DNA replication sites, regulation sequences, control sequences, or other useful features may be incorporated by design. A similar concept is intended for a recombinant, e.g., fusion, polypeptide. This will include a 10~ dimeric repeat. Specifically included are synthetic nucleic acids which, by genetic code redundancy, encode equivalent polypeptides to fragments of DCRS3 or DCRS4 and fusions of sequences from various different related molecules, e.g., other cytokine receptor family members.
A "fragment" in a nucleic acid context is a contiguous segment of at least about 17 nucleotides, generally at least 21 nucleotides, more generally at least 25 nucleotides, ordinarily at least 30 nucleotides, more ordinarily at least 35 nucleotides, often at least 39 nucleotides, more often at least 45 nucleotides, typically at least 50 nucleotides, more typically at least 55 nucleotides, usually at least 60 nucleotides, more usually at least 66 nucleotides, preferably at least 72 nucleotides, more preferably at least 79 nucleotides, and in particularly preferred embodiments will be at least 85 or more nucleotides. Typically, fragments of different genetic sequences can be compared to one another over appropriate length stretches, particularly defined segments such as the domains described below.
A nucleic acid which codes for a DCRS3 or DCRS4 will be particularly useful to identify genes, mRNA, and cDNA species which code for itself or closely related proteins, as well as DNAs which code for polymorphic, allelic, or other genetic variants, e.g., from different individuals or related species.
Preferred probes for such screens are those regions of the interleukin which are conserved between different polymorphic variants or which contain nucleotides which lack specificity, and will preferably be full length or nearly so. In other situations, polymorphic variant specific sequences will be more useful .
This invention further covers recombinant nucleic acid molecules and fragments having a nucleic acid sequence identical to or highly homologous to the isolated DNA set forth herein.
In particular, the sequences will often be operably linked to DNA segments which control transcription, translation, and DNA
replication. These additional segments typically assist in expression of the desired nucleic acid segment.
10~ Homologous, or highly identical, nucleic acid sequences, when compared to one another, e.g., DCRS3 sequences, exhibit significant similarity. The standards for homology in nucleic acids are either measures for homology generally used in the art by sequence comparison or based upon hybridization conditions.
Comparative hybridization conditions are described in greater detail below.
Substantial identity in the nucleic acid sequence comparison context means either that the segments, or their complementary strands, when compared, are identical when optimally aligned, with appropriate nucleotide insertions or deletions, in at least about 60% of the nucleotides, generally at least 66%, ordinarily at least 71%, often at least 76%, more often at least 80%, usually at least 84%, more usually at least 88%, typically at least 91%, more typically at least about 93%, preferably at least about 95%, more preferably at least about 96 to 98% or more, and in particular embodiments, as high at about 99% or more of the nucleotides, including, e.g., segments encoding structural domains such as the segments described below. Alternatively, substantial identity will exist when the segments will hybridize under selective hybridization conditions, to a strand or its complement, typically using a sequence derived from Tables 1 or 3. Typically, selective hybridization will occur when there is at least about 55%
homology over a stretch of at least about 14 nucleotides, more typically at least about 65%, preferably at least about 75%, and more preferably at least about 90%. See, Kanehisa (1984) Nucl.
Acids Res. 12:203-213, which is incorporated herein by reference. The length of homology comparison, as described, may be over longer stretches, and in certain embodiments will be over a stretch of at least about 17 nucleotides, generally at least about 20 nucleotides, ordinarily at least about 24 nucleotides, usually at least about 28 nucleotides, typically at least about 32 nucleotides, more typically at least about 40 nucleotides, preferably at least about 50 nucleotides, and more preferably at least about 75 to 100 or more nucleotides. This includes, e.g., 125, 150, 175, 200, 225, 250, 275, 300, 325, 10~ 350, 375, 400, 425, 450, 475, 500, 525, 544, and other lengths.
Stringent conditions, in referring to homology in the hybridization context, will be stringent combined conditions of salt, temperature, organic solvents, and other parameters typically controlled in hybridization reactions. Stringent temperature conditions will usually include temperatures in excess of about 30 C, more usually in excess of about 37 C, typically in excess of about 45 C, more typically in excess of about 55 C, preferably in excess of about 65 C, and more preferably in excess of about 70 C. Stringent salt conditions will ordinarily be less than about 500 mM, usually less than about 400 mM, more usually less than about 300 mM, typically less than about 200 mM, preferably less than about 100 mM, and more preferably less than about 80 mM, even down to less than about 20 mM. However, the combination of parameters is much more important than the measure of any single parameter. See, e.g., Wetmur and Davidson (1968) J. Mol. Biol. 31:349-370, which is hereby incorporated herein by reference.
The isolated DNA can be readily modified by nucleotide substitutions, nucleotide deletions, nucleotide insertions, and inversions of nucleotide stretches. These modifications result in novel DNA sequences which encode this protein or its derivatives. These modified sequences can be used to produce mutant proteins (muteins) or to enhance the expression of variant species. Enhanced expression may involve gene amplification, increased transcription, increased translation, and other mechanisms. Such mutant DCRS-like derivatives include predetermined or site-specific mutations of the protein or its fragments, including silent mutations using genetic code degeneracy. "Mutant DCRS3" as used herein encompasses a polypeptide otherwise falling within the homology definition of the DCRS3 as set forth above, but having an amino acid sequence which differs from that of other cytokine receptor-like proteins as found in nature, whether by way of deletion, substitution, or insertion. In particular, "site specific mutant DCRS3"
encompasses a protein having substantial sequence identity with a protein of Table 1, and typically shares most of the 10~ biological activities or effects of the forms disclosed herein.
Likewise in reference to DCRS4.
Although site specific mutation sites are predetermined, mutants need not be site specific. Mammalian DCRS3 mutagenesis can be achieved by making amino acid insertions or deletions in the gene, coupled with expression. Substitutions, deletions, insertions, or many combinations may be generated to arrive at a final construct. Insertions include amino- or carboxy- terminal fusions. Random mutagenesis can be conducted at a target codon and the expressed mammalian DCRS3 mutants can then be screened for the desired activity, providing some aspect of a structure-activity relationship. Methods for making substitution mutations at predetermined sites in DNA having a known sequence are well known in the art, e.g., by M13 primer mutagenesis. See also Sambrook, et al. (1989) and Ausubel, et al. (1987 and periodic Supplements).
The mutations in the DNA normally should not place coding sequences out of reading frames and preferably will not create complementary regions that could hybridize to produce secondary mRNA structure such as loops or hairpins.
The phosphoramidite method described by Beaucage and Carruthers (1981) Tetra. Letts. 22:1859-1862, will produce suitable synthetic DNA fragments. A double stranded fragment will often be obtained either by synthesizing the complementary strand and annealing the strand together under appropriate conditions or by adding the complementary strand using DNA
polymerase with an appropriate primer sequence.
Polymerase chain reaction (PCR) techniques can often be applied in mutagenesis. Alternatively, mutagenesis primers are commonly used methods for generating defined mutations at predetermined sites. See, e.g., Innis, et al. (eds. 1990) PCR
5 Protocols: A Guide to Methods and Applications Academic Press, San Diego, CA; and Dieffenbach and Dveksler (1995; eds.) PCR
Primer: A Laboratory Manual Cold Spring Harbor Press, CSH, NY.
Certain embodiments of the invention are directed to combination compositions comprising the receptor or ligand 10~ sequences described. In other embodiments, functional portions of the sequences may be joined to encode fusion proteins. In other forms, variants of the described sequences may be substituted.
15 IV. Proteins, Peptides As described above, the present invention encompasses primate DCRS3, e.g., whose sequences are disclosed in Table l, and described above. Allelic and other variants are also contemplated, including, e.g., fusion proteins combining 20 portions of such sequences with others, including, e.g., epitope tags and functional domains.
The present invention also provides recombinant proteins, e.g., heterologous fusion proteins using segments from these primate or rodent proteins. A heterologous fusion protein is a 25 fusion of proteins or segments which are naturally not normally fused in the same manner. Thus, the fusion product of a DCRS3 with another cytokine receptor is a continuous protein molecule having sequences fused in a typical peptide linkage, typically made as a single translation product and exhibiting properties, 30 e.g., sequence or antigenicity, derived from each source peptide. A similar concept applies to heterologous nucleic acid sequences. Combinations of various designated proteins into complexes are also provided.
In addition, new constructs may be made from combining 35 similar functional or structural domains from other related proteins, e.g., cytokine receptors or Toll-like receptors, including species variants. For example, ligand-binding or other segments may be "swapped" between different new fusion polypeptides or fragments. See, e.g., Cunningham, et al. (1989) Science 243:1330-1336; and O'Dowd, et al. (1988) J. Biol. Chem.
263:15985-15992, each of which is incorporated herein by reference. Thus, new chimeric polypeptides exhibiting new combinations of specificities will result from the functional linkage of receptor-binding specificities. For example, the ligand binding domains from other related receptor molecules may be added or substituted for other domains of this or related 10~ proteins. The resulting protein will often have hybrid function and properties. For example, a fusion protein may include a targeting domain which may serve to provide sequestering of the fusion protein to a particular subcellular organelle.
Candidate fusion partners and sequences can be selected from various sequence data bases, e.g., GenBank, c/o IntelliGenetics, Mountain View, CA; and BCG, University of Vdisconsin Biotechnology Computing Group, Madison, WI, which are each incorporated herein by reference. In particular, combinations of polypeptide sequences provided in Tables 1 and 3 are particularly preferred. Variant forms of the proteins may be substituted in the described combinations.
The present invention particularly provides muteins which bind cytokine-like ligands, and/or which are affected in signal transduction. Structural alignment of human DCRS3 or DCRS4 with other members of the cytokine receptor family show conserved features/residues. See Table 5. Alignment of human DCRS3 or DCRS4 sequence with other members of the cytokine receptor family indicates various structural and functionally shared features. See also, Bazan, et al. (1996) Nature 379:591; Lodi, et al. (1994) Science 263:1762-1766; Sayle and Milner-White (1995) TIBS 20:374-376; and Gronenberg, et al. (1991) Protein Enaineerina 4:263-269.
Substitutions with either mouse sequences or human sequences are particularly preferred. Conversely, conservative substitutions away from the ligand binding interaction regions will probably preserve most signaling activities; and conservative substitutions away from the intracellular domains will probably preserve most ligand binding properties.
"Derivatives" of primate DCRS3 include amino acid sequence mutants, glycosylation variants, metabolic derivatives and covalent or aggregative conjugates with other chemical moieties.
Covalent derivatives can be prepared by linkage of functionalities to groups which are found in DCRS3 amino acid side chains or at the N- or C- termini, e.g., by means which are well known in the art. These derivatives can include, without 10~ limitation, aliphatic esters or amides of the carboxyl terminus, or of residues containing carboxyl side chains, O-acyl derivatives of hydroxyl group-containing residues, and N-acyl derivatives of the amino terminal amino acid or amino-group containing residues, e.g., lysine or arginine. Acyl groups are selected from the group of alkyl-moieties, including C3 to C18 normal alkyl, thereby forming alkanoyl aroyl species.
In particular, glycosylation alterations are included, e.g., made by modifying the glycosylation patterns of a polypeptide during its synthesis and processing, or in further processing steps. Particularly preferred means for accomplishing this are by exposing the polypeptide to glycosylating enzymes derived from cells which normally provide such processing, e.g., mammalian glycosylation enzymes.
Deglycosylation enzymes are also contemplated. Also embraced are versions of the same primary amino acid sequence which have other minor modifications, including phosphorylated amino acid residues, e.g., phosphotyrosine, phosphoserine, or phosphothreonine.
A major group of derivatives are covalent conjugates of the receptors or fragments thereof with other proteins of polypeptides. These derivatives can be synthesized in recombinant culture such as N- or C-terminal fusions or by the use of agents known in the art for their usefulness in cross-linking proteins through reactive side groups. Preferred derivatization sites with cross-linking agents are at free amino groups, carbohydrate moieties, and cysteine residues.
Fusion polypeptides between the receptors and other homologous or heterologous proteins are also provided.
Homologous polypeptides may be fusions between different receptors, resulting in, for instance, a hybrid protein exhibiting binding specificity for multiple different cytokine ligands, or a receptor which may have broadened or weakened specificity of substrate effect. Likewise, heterologous fusions may be constructed which would exhibit a combination of properties or activities of the derivative proteins. Typical 10~ examples are fusions of a reporter polypeptide, e.g., luciferase, with a segment or domain of a receptor, e.g., a ligand-binding segment, so that the presence or location of a desired ligand may be easily determined. See, e.g., Dull, et al., U.S. Patent No. 4,859,609, which is hereby incorporated herein by reference. Other gene fusion partners include glutathione-S-transferase (GST), bacterial f~-galactosidase, trpE, Protein A, f3-lactamase, alpha amylase, alcohol dehydrogenase, and yeast alpha mating factor. See, e.g., Godowski, et al. (1988) Science 241:812-816. Labeled proteins will often be substituted in the described combinations of proteins.
The phosphoramidite method described by Beaucage and Carruthers (1981) Tetra. Letts. 22:1859-1862, will produce suitable synthetic DNA fragments. A double stranded fragment will often be obtained either by synthesizing the complementary strand and annealing the strand together under appropriate conditions or by adding the complementary strand using DNA
polymerase with an appropriate primer sequence.
Such polypeptides may also have amino acid residues which have been chemically modified by phosphorylation, sulfonation, biotinylation, or the addition or removal of other moieties, particularly those which have molecular shapes similar to phosphate groups. In some embodiments, the modifications will be useful labeling reagents, or serve as purification targets, e.9., affinity ligands.
Fusion proteins will typically be made by either recombinant nucleic acid methods or by synthetic polypeptide methods. Techniques for nucleic acid manipulation and expression are described generally, for example, in Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed.), Vols. 1-3, Cold Spring Harbor Laboratory, and Ausubel, et al.
(eds. 1987 and periodic supplements) Current Protocols in Molecular Bioloay, Greene/Wiley, New York, which are each incorporated herein by reference. Techniques for synthesis of polypeptides -are described, for example, in Merrifield (1963) J.
Amer. Chem. Soc. 85:2149-2156; Merrifield (1986) Science 232:
10~ 341-347; and Atherton, et al. (1989) Solid Phase Peptide Synthesis: A Practical Approach, IRL Press, Oxford; each of which is incorporated herein by reference. See also Dawson, et al. (1994) Science 266:776-779 for methods to make larger polypeptides.
This invention also contemplates the use of derivatives of a DCRS3 or DCRS4 other than variations in amino acid sequence or glycosylation. Such derivatives may involve covalent or aggregative association with chemical moieties. These derivatives generally fall into three classes: (1) salts, (2) side chain and terminal residue covalent modifications, and (3) adsorption complexes, for example with cell membranes. Such covalent or aggregative derivatives are useful as immunogens, as reagents in immunoassays, or in purification methods such as for affinity purification of a receptor or other binding molecule, e.g., an antibody. For example, a cytokine ligand can be immobilized by covalent bonding to a solid support such as cyanogen bromide-activated Sepharose, by methods which are well known in the art, or adsorbed onto polyolefin surfaces, with or without glutaraldehyde cross-linking, for use in the assay or purification of a cytokine receptor, antibodies, or other similar molecules. The ligand can also be labeled with a detectable group, for example radioiodinated by the chloramine T
procedure, covalently bound to rare earth chelates, or conjugated to another fluorescent moiety for use in diagnostic assays.
A combination, e.g., including a DCRS3 or DCRS4, of this invention can be used as an immunogen for the production of antisera or antibodies specific, e.g., capable of distinguishing between other cytokine receptor family members, for the combinations described. The complexes can be used to screen monoclonal antibodies or antigen-binding fragments prepared by 5 immunization with various forms of impure preparations containing the protein. In particular, the term "antibodies"
also encompasses antigen binding fragments of natural antibodies, e.g., Fab, Fab2, Fv, etc. A purified DCRS3 can also be used as a reagent to detect antibodies generated in response 10~ to the presence of elevated levels of expression, or immunological disorders which lead to antibody production to the endogenous receptor. Additionally, DCRS3 fragments may also serve as immunogens to produce the antibodies of the present invention, as described immediately below. For example, this 15 invention contemplates antibodies having binding affinity to or being raised against the amino acid sequences shown in Table 1, fragments thereof, or various homologous peptides. In particular, this invention contemplates antibodies having binding affinity to, or having been raised against, specific 20 fragments which are predicted to be, or actually are, exposed at the exterior protein surface of a native DCRS3. Complexes of combinations of proteins will also be useful, and antibody preparations thereto can be made.
The blocking of physiological response to the receptor 25 ligands may result from the inhibition of binding of the ligand to the receptor, likely through competitive inhibition. Thus, in vitro assays of the present invention will often use antibodies or antigen binding segments of these antibodies, or fragments attached to solid phase substrates. These assays will 30 also allow for the diagnostic determination of the effects of either ligand binding region mutations and modifications, or other mutations and modifications, e.g., which affect signaling or enzymatic function.
This invention also contemplates the use of competitive 35 drug screening assays, e.g., where neutralizing antibodies to the receptor complexes or fragments compete with a test compound for binding to a ligand or other antibody. In this manner, the neutralizing antibodies or fragments can be used to detect the presence of a polypeptide which shares one or more binding sites to a receptor and can also be used to occupy binding sites on a receptor that might otherwise bind a ligand.
V. Making Nucleic Acids and Protein DNA which encodes the protein or fragments thereof can be obtained by chemical synthesis, screening cDNA libraries, or by screening genomic libraries prepared from a wide variety of cell 10~ lines or tissue samples. Natural sequences can be isolated using standard methods and the sequences provided herein, e.g., in Tables 1 or 3. Other species counterparts can be identified by hybridization techniques, or by various PCR techniques, combined with or by searching in sequence databases, e.g., GenBank.
This DNA can be expressed in a wide variety of host cells for the synthesis of a full-length receptor or fragments which can in turn, for example, be used to generate polyclonal or monoclonal antibodies; for binding studies; for construction and expression of modified ligand binding or kinase/phosphatase domains; and for structure/function studies. Variants or fragments can be expressed in host cells that are transformed or transfected with appropriate expression vectors. These molecules can be substantially free of protein or cellular contaminants, other than those derived from the recombinant host, and therefore are particularly useful in pharmaceutical compositions when combined with a pharmaceutically acceptable carrier and/or diluent. The protein, or portions thereof, may be expressed as fusions with other proteins. Combinations of the described proteins, or nucleic acids encoding them, are particularly interesting.
Expression vectors are typically self-replicating DNA or RNA constructs containing the desired receptor gene or its fragments, usually operably linked to suitable genetic control elements that are recognized in a suitable host cell. These control elements are capable of effecting expression within a suitable host. The multiple genes may be coordinately expressed, and may be on a polycistronic message. The specific type of control elements necessary to effect expression will depend upon the eventual host cell used. Generally, the genetic control elements can include a prokaryotic promoter system or a eukaryotic promoter expression control system, and typically include a transcriptional promoter, an optional operator to control the onset of transcription, transcription enhancers to elevate the level of mRNA expression, a sequence that encodes a suitable ribosome binding site, and sequences that terminate 10~ transcription and translation. Expression vectors also usually contain an origin of replication that allows the vector to replicate independently of the host cell.
The vectors of this invention include those which contain DNA which encodes a combination of proteins, as described, or a biologically active equivalent polypeptide. The DNA can be under the control of a viral promoter and can encode a selection marker. This invention further contemplates use of such expression vectors which are capable of expressing eukaryotic cDNAs coding for such proteins in a prokaryotic or eukaryotic host, where the vector is compatible with the host and where the eukaryotic cDNAs are inserted into the vector such that growth of the host containing the vector expresses the cDNAs in question. Usually, expression vectors are designed for stable replication in their host cells or for amplification to greatly increase the total number of copies of the desirable gene per cell. It is not always necessary to require that an expression vector replicate in a host cell, e.g., it is possible to effect transient expression of the protein or its fragments in various hosts using vectors that do not contain a replication origin that is recognized by the host cell. It is also possible to use vectors that cause integration of the protein encoding portions into the host DNA by recombination.
Vectors, as used herein, comprise plasmids, viruses, bacteriophage, integratable DNA fragments, and other vehicles which enable the integration of DNA fragments into the genome of the host. Expression vectors are specialized vectors which contain genetic control elements that effect expression of operably linked genes. Plasmids are the most commonly used form of vector but all other forms of vectors which serve an equivalent function and which are, or become, known in the art are suitable for use herein. See, e.g., Pouwels, et al. (1985 and Supplements) Cloning Vectors: A Laboratory Manual, Elsevier, N.Y., and Rodriguez, et al. (eds. 1988) Vectors: A Survey of Molecular Cloninct Vectors and Their Uses, Buttersworth, Boston, which are incorporated herein by reference.
Transformed cells are cells, preferably mammalian, that 10~ have been transformed or transfected with vectors constructed using recombinant DNA techniques. Transformed host cells usually express the desired proteins, but for purposes of cloning, amplifying, and manipulating its DNA, do not need to express.the subject proteins. This invention further contemplates culturing transformed cells in a nutrient medium, thus permitting the proteins to accumulate. The proteins can be recovered, either from the culture or, in certain instances, from the culture medium.
For purposes of this invention, nucleic sequences are operably linked when they are functionally related to each other. For example, DNA for a presequence or secretory leader is operably linked to a polypeptide if it is expressed as a preprotein or participates in directing the polypeptide to the cell membrane or in secretion of the polypeptide. A promoter is operably linked to a coding sequence if it controls the transcription of the polypeptide; a ribosome binding site is operably linked to a coding sequence if it is positioned to permit translation. Usually, operably linked means contiguous and in reading frame, however, certain genetic elements such as repressor genes are not contiguously linked but still bind to operator sequences that in turn control expression.
Suitable host cells include prokaryotes, lower eukaryotes, and higher eukaryotes. Prokaryotes include both gram negative and gram positive organisms, e.g., E. coli and B. subtilis.
Lower eukaryotes include yeasts, e.g., S. cerevisiae and Pichia, and species of the genus Dictyostelium. Higher eukaryotes include established tissue culture cell lines from animal cells, both of non-mammalian origin, e.g., insect cells, and birds, and of mammalian origin, e.g., human, primates, and rodents.
Prokaryotic host-vector systems include a wide variety of vectors for many different species. As used herein, E. coli and its vectors will be used generically to include equivalent vectors used in other prokaryotes. A representative vector for amplifying DNA is pBR322 or many of its derivatives. Vectors that can be used to express the receptor or its fragments include, but are not limited to, such vectors as those 10~ containing the lac promoter (pUC-series); trp promoter (pBR322-trp); Ipp promoter (the pIN-series); lambda-pP or pR
promoters (pOTS); or hybrid promoters such as ptac (pDR540).
See Brosius, et al. (1988) "Expression Vectors Employing Lambda-, trp-, lac-, and Ipp-derived Promoters", in Vectors: A
Survey of Molecular Clonincl Vectors and Their Uses, (eds.
Rodriguez and Denhardt), Buttersworth, Boston, Chapter 10, pp.
205-236, which is incorporated herein by reference.
Lower eukaryotes, e.g., yeasts and Dictyostelium, may be transformed with DCRS3 or DCRS4 sequence containing vectors.
For purposes of this invention, the most common lower eukaryotic host is the baker's yeast, Saccharomvces cerevisiae. It will be used to generically represent lower eukaryotes although a number of other strains and species are also available. Yeast vectors typically consist of a replication origin (unless of the integrating type), a selection gene, a promoter, DNA encoding the receptor or its fragments, and sequences for translation termination, polyadenylation, and transcription termination.
Suitable expression vectors for yeast include such constitutive promoters as 3-phosphoglycerate kinase and various other glycolytic enzyme gene promoters or such inducible promoters as the alcohol dehydrogenase 2 promoter or metallothionine promoter. Suitable vectors include derivatives of the following types: self-replicating low copy number (such as the YRp-series), self-replicating high copy number (such as the YEp-series); integrating types (such as the YIp-series), or mini-chromosomes (such as the YCp-series).
Higher eukaryotic tissue culture cells are normally the preferred host cells for expression of the functionally active interleukin or receptor proteins. In principle, many higher eukaryotic tissue culture cell lines are workable, e.g., insect 5 baculovirus expression systems, whether from an invertebrate or vertebrate source. However, mammalian cells are preferred.
Transformation or transfection and propagation of such cells has become a routine procedure. Examples of useful cell lines include HeLa cells, Chinese hamster ovary (CHO) cell lines, baby 10~ rat kidney (BRK) cell lines, insect cell lines, bird cell lines, and monkey (COS) cell lines. Expression vectors for such cell lines usually include an origin of replication, a promoter, a translation initiation site, RNA splice sites (if genomic DNA is used), a polyadenylation site, and a transcription termination 15 site. These vectors also usually contain a selection gene or amplification gene. Suitable expression vectors may be plasmids, viruses, or retroviruses carrying promoters derived, e.g., from such sources as from adenovirus, SV40, parvoviruses, vaccinia virus, or cytomegalovirus. Representative examples of 20 suitable expression vectors include pCDNAl; pCD, see Okayama, et al. (1985) Mol. Cell Biol. 5:1136-1142; pMClneo PolyA, see Thomas, et al. (1987) Cell 51:503-512; and a baculovirus vector such as pAC 373 or pAC 610.
For secreted proteins and some membrane proteins, an open 25 reading frame usually encodes a polypeptide that consists of a mature or secreted product covalently linked at its N-terminus to a signal peptide. The signal peptide is cleaved prior to secretion of the mature, or active, polypeptide. The cleavage site can be predicted with a high degree of accuracy from 30 empirical rules, e.g., von-Heijne (1986) Nucleic Acids Research 14:4683-4690 and Nielsen, et al. (1997) Protein Ena. 10:1-12, and the precise amino acid composition of the signal peptide often does not appear to be critical to its function, e.g., Randall, et al. (1989) Science 243:1156-1159; Kaiser et al.
35 (1987) Science 235:312-317. The mature proteins of the invention can be readily determined using standard methods.
It will often be desired to express these polypeptides in a system which provides a specific or defined glycosylation pattern. In this case, the usual pattern will be that provided naturally by the expression system. However, the pattern will be modifiable by exposing the polypeptide, e.g., an unglycosylated form, to appropriate glycosylating proteins introduced into a heterologous expression system. For example, the receptor gene may be co-transformed with one or more genes encoding mammalian or other glycosylating enzymes. Using this 10~ approach, certain mammalian glycosylation patterns will be achievable in prokaryote or other cells. Expression in prokaryote cells will typically lead to unglycosylated forms of protein.
The source of DCRS3 or DCRS4 can be a eukaryotic or prokaryotic host expressing recombinant DCRS, such,as is described above. The source can also be a cell line, but other mammalian cell lines are also contemplated by this invention, with the preferred cell line being from the human species.
Now that the sequences are known, a primate DCRS3 or DCRS4, fragments, or derivatives thereof can be prepared by conventional processes for synthesizing peptides. These include processes such as are described in Stewart and Young (1984) Solid Phase Pe,~tide Synthesis, Pierce Chemical Co., Rockford, IL; Bodanszky and Bodanszky (1984) The Practice of Peptide Synthesis, Springer-Verlag, New York; and Bodanszky (1984) The Principles of Peptide Synthesis, Springer-Verlag, New York; all of each which are incorporated herein by reference. For example, an azide process, an acid chloride process, an acid anhydride process, a mixed anhydride process, an active ester process (for example, p-nitrophenyl ester, N-hydroxysuccinimide ester, or cyanomethyl ester), a carbodiimidazole process, an oxidative-reductive process, or a dicyclohexylcarbodiimide (DCCD)/additive process can be used. Solid phase and solution phase syntheses are both applicable to the foregoing processes.
Similar techniques can be used with partial DCRS3 or DCRS4 sequences.
DCRS3 or DCRS4 proteins, fragments, or derivatives are suitably prepared in accordance with the above processes as typically employed in peptide synthesis, generally either by a so-called stepwise process which comprises condensing an amino acid to the terminal amino acid, one by one in sequence, or by coupling peptide fragments to the terminal amino acid. Amino groups that are not being used in the coupling reaction typically must be protected to prevent coupling at an incorrect location.
10~ If a solid phase synthesis is adopted, the C-terminal amino acid is bound to an insoluble carrier or support through its carboxyl group. The insoluble carrier is not particularly limited as long as it has a binding capability to a reactive carboxyl group. Examples of such insoluble carriers include halomethyl resins, such as chloromethyl resin or bromomethyl resin, hydroxymethyl resins, phenol resins, tert-alkyloxycarbonylhydrazidated resins, and the like.
An amino group-protected amino acid is bound in sequence through condensation of its activated carboxyl group and the reactive amino group of the previously formed peptide or chain, to synthesize the peptide step by step. After synthesizing the complete sequence, the peptide is split off from the insoluble carrier to produce the peptide. This solid-phase approach is generally described by Merrifield, et al. (1963) in J. Am. Chem.
Soc. 85:2149-2156, which is incorporated herein by reference.
The prepared protein and fragments thereof can be isolated and purified from the reaction mixture by means of peptide separation, for example, by extraction, precipitation, electrophoresis, various forms of chromatography, and the like.
The receptors of this invention can be obtained in varying degrees of purity depending upon desired uses. Purification can be accomplished by use of the protein purification techniques disclosed herein, see below, or by the use of the antibodies herein described in methods of immunoabsorbant affinity chromatography. This immunoabsorbant affinity chromatography is carried out by first linking the antibodies to a solid support and then contacting the linked antibodies with solubilized lysates of appropriate cells, lysates of other cells expressing the receptor, or lysates or supernatants of cells producing the protein as a result of DNA techniques, see below.
Generally, the purified protein will be at least about 40%
pure, ordinarily at least about 50% pure, usually at least about 60% pure, typically at least about 70% pure, more typically at least about 80% pure, preferable at least about 90% pure and more preferably at least about 95% pure, and in particular embodiments, 97%-99% or more. Purity will usually be on a 10~ weight basis, but can also be on a molar basis. Different assays will be applied as appropriate. Individual proteins may be purified and thereafter combined.
VI. Antibodies Antibodies can be raised to various mammalian, e.g., primate DCRS3 or DCRS4 proteins and fragments thereof, both in naturally occurring native forms and in their recombinant forms, the difference being that antibodies to the active receptor are more likely to recognize epitopes which are only present in the native conformations. Denatured antigen detection can also be useful in, e.g., Western analysis. Anti-idiotypic antibodies are also contemplated, which would be useful as agonists or antagonists of a natural receptor or an antibody.
Antibodies, including binding fragments and single chain versions, against predetermined fragments of the protein can be raised by immunization of animals with conjugates of the fragments with immunogenic proteins. Monoclonal antibodies are prepared from cells secreting the desired antibody. These antibodies can be screened for binding to normal or defective protein, or screened for agonistic or antagonistic activity.
These monoclonal antibodies will usually bind with at least a KD
of about 1 mM, more usually at least about 300 ~M, typically at least about 100~M, more typically at least about 30 ~M, preferably at least about 10 ~M, and more preferably at least about 3 ~M or better.
The antibodies, including antigen binding fragments, of this invention can have significant diagnostic or therapeutic value. They can be potent antagonists that bind to the receptor and inhibit binding to ligand or inhibit the ability of the receptor to elicit a biological response, e.g., act on its substrate. They also can be useful as non-neutralizing antibodies and can be coupled to toxins or radionuclides to bind producing cells, or cells localized to the source of the interleukin. Further, these antibodies can be conjugated to drugs or other therapeutic agents, either directly or indirectly by means of a linker.
10~ The antibodies of this invention can also be useful in diagnostic applications. As capture or non-neutralizing antibodies, they might bind to the receptor without inhibiting ligand or substrate binding. As neutralizing antibodies, they can be useful in competitive binding assays. They will also be useful in detecting or quantifying ligand. They may be used as reagents for Western blot analysis, or for immunoprecipitation or immunopurification of the respective protein. Likewise, nucleic acids and proteins may be immobilized to solid substrates for affinity purification or detection methods. The substrates may be, e.g., solid resin beads or sheets of plastic.
Protein fragments may be joined to other materials, particularly polypeptides, as fused or covalently joined polypeptides to be used as immunogens. Mammalian cytokine receptors and fragments may be fused or covalently linked to a variety of immunogens, such as keyhole limpet hemocyanin, bovine serum albumin, tetanus toxoid, etc. See Microbiolocrv, Hoeber Medical Division, Harper and Row, 1969; Landsteiner (1962) Specificity of Serological Reactions, Dover Publications, New York; and Williams, et al. (1967) Methods in Immunology and Immunochemistry, Vol. 1, Academic Press, New York; each of which are incorporated herein by reference, for descriptions of methods of preparing polyclonal antisera. A typical method involves hyperimmunization of an animal with an antigen. The blood of the animal is then collected shortly after the repeated immunizations and the gamma globulin is isolated.
In some instances, it is desirable to prepare monoclonal antibodies from various mammalian hosts, such as mice, rodents, primates, humans, etc. Description of techniques for preparing such monoclonal antibodies may be found in, e.g., Stites, et al.
(eds.) Basic and Clinical Immunoloav (4th ed.), Lange Medical Publications, Los Altos, CA, and references cited therein;
5 Harlow and Lane (1988) Antibodies: A Laboratory Manual, CSH
Press; Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, New York; and particularly in Kohler and Milstein (1975) in Nature 256: 495-497, which discusses one method of generating monoclonal antibodies. Each 10~ of these references is incorporated herein by reference.
Summarized briefly, this method involves injecting an animal with an immunogen. The animal is then sacrificed and cells taken from its spleen, which are then fused with myeloma cells.
The result is a hybrid cell or "hybridoma" that is capable of 15 reproducing in vitro. The population of hybridomas is then screened to isolate individual clones, each of which secrete a single antibody species to the immunogen. In this manner, the individual antibody species obtained are the products of immortalized and cloned single B cells from the immune animal 20 generated in response to a specific site recognized on the immunogenic substance.
Other suitable techniques involve in vitro exposure of lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of antibodies in phage or similar 25 vectors. See, Huse, et al. (1989) "Generation of a Large Combinatorial Library of the Immunoglobulin Repertoire in Phage Lambda," Science 246:1275-1281; and Ward, et al. (1989) Nature 341:544-546, each of which is hereby incorporated herein by reference. The polypeptides and antibodies of the present 30 invention may be used with or without modification, including chimeric or humanized antibodies. Frequently, the polypeptides and antibodies will be labeled by joining, either covalently or non-covalently, a substance which provides for a detectable signal. A wide variety of labels and conjugation techniques are 35 known and are reported extensively in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent moieties, chemiluminescent moieties, magnetic particles, and the like. Patents, teaching the use of such labels include U.S.
Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345;
4,277,437; 4,275,149; and 4,366,241. Also, recombinant or chimeric immunoglobulins may be produced, see Cabilly, U.S.
Patent No. 4,816,567; or made in transgenic mice, see Mendez, et al. (1997) Nature Genetics 15:146-156. These references are incorporated herein by reference.
The antibodies of this invention can also be used for 10~ affinity chromatography in isolating DCRS3 proteins or peptides.
Columns can be prepared where the antibodies are linked to a solid support, e.g., particles, such as agarose, Sephadex, or the like, where a cell lysate may be passed through the column, the column washed, followed by increasing concentrations of a mild denaturant, whereby the purified protein will be released.
Alternatively, the protein may be used to purify antibody.
Appropriate cross absorptions or depletions may be applied.
The antibodies may also be used to screen expression libraries for particular expression products. Usually the antibodies used in such a procedure will be labeled with a moiety allowing easy detection of presence of antigen by antibody binding.
Antibodies raised against a cytokine receptor will also be used to raise anti-idiotypic antibodies. These will be useful in detecting or diagnosing various immunological conditions related to expression of the protein or cells which express the protein. They also will be useful as agonists or antagonists of the ligand, which may be competitive inhibitors or substitutes for naturally occurring ligands.
A cytokine receptor protein that specifically binds to or that is specifically immunoreactive with an antibody generated against a defined immunogen, such as an immunogen consisting of the amino acid sequence of SEQ ID NO: 2, 25, 5, 28, or 31, is typically determined in an immunoassay. The immunoassay typically uses a polyclonal antiserum which was raised, e.g., to a protein of SEQ ID NO: 2, 25, 5, 28, or 31. This antiserum is selected to have low crossreactivity against other cytokine receptor family members, e.g., IL-11 receptor subunit alpha, IL-6 receptor subunit alpha, or p40, preferably from the same species, and any such crossreactivity is removed by immunoabsorption prior to use in the immunoassay.
In order to produce antisera for use in an immunoassay, the protein, e.g., of SEQ ID NO: 2, 25, 5, 28, or 31, is isolated as described herein. For example, recombinant protein may be produced in a mammalian cell line. An appropriate host, e.g., an inbred strain of mice such as Balb/c, is immunized with the 10~ selected protein, typically using a standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization protocol (see Harlow and Lane, supra). Alternatively, a synthetic peptide derived from the sequences disclosed herein and conjugated to a carrier protein can be used an immunogen.
Polyclonal sera are collected and titered against the immunogen protein in an immunoassay, e.g., a solid phase immunoassay with the immunogen immobilized on a solid support. Polyclonal antisera with a titer of 104 or greater are selected and tested for their cross reactivity against other cytokine receptor family members, e.g., IL-2, IL-7, IL-9, or EPO receptor subunit, using a competitive binding immunoassay such as the one described in Harlow and Lane, supra, at pages 570-573.
Preferably at least two cytokine receptor family members are used in this determination. These cytokine receptor family members can be produced as recombinant proteins and isolated using standard molecular biology and protein chemistry techniques as described herein.
Immunoassays in the competitive binding format can be used for the crossreactivity determinations. For example, the protein of SEQ ID N0: 2, 25, 5, 28, or 31 can be immobilized to a solid support. Proteins added to the assay compete with the binding of the antisera to the immobilized antigen. The ability of the above proteins to compete with the binding of the antisera to the immobilized protein is compared to the proteins, e.g., of IL-2, IL-7, IL-9, or EPO receptor subunit. The percent crossreactivity for the above proteins is calculated, using standard calculations. Those antisera with less than 10°s crossreactivity with each of the proteins listed above are selected and pooled. The cross-reacting antibodies are then removed from the pooled antisera by immunoabsorption with the above-listed proteins.
The immunoabsorbed and pooled antisera are then used in a competitive binding immunoassay as described above to compare a second protein to the immunogen protein (e. g., DCRS3 like protein of SEQ ID NO: 2). In order to make this comparison, the two proteins are each assayed at a wide range of concentrations 10~ and the amount of each protein required to inhibit 50°s of the binding of the antisera to the immobilized protein is determined. If the amount of the second protein required is less than twice the amount of the protein of the selected protein or proteins that is required, then the second protein is said to specifically bind to an antibody generated to the immunogen.
It is understood that these cytokine receptor proteins are members of a family of homologous proteins that comprise at least 6 so far identified genes. For a particular gene product, such as a DCRS3 or DCRS4, the term refers not only to the amino acid sequences disclosed herein, but also to other proteins that are allelic, non-allelic, or species variants. It is also understood that the terms include nonnatural mutations introduced by deliberate mutation using conventional recombinant technology such as single site mutation, or by excising short sections of DNA encoding the respective proteins, or by substituting new amino acids, or adding new amino acids. Such minor alterations typically will substantially maintain the immunoidentity of the original molecule and/or its biological activity. Thus, these alterations include proteins that are specifically immunoreactive with a designated naturally occurring DCRS3 or DCRS4 protein. The biological properties of the altered proteins can be determined by expressing the protein in an appropriate cell line and measuring the appropriate effect, e.g., upon transfected lymphocytes. Particular protein modifications considered minor would include conservative substitution of amino acids with similar chemical properties, as described above for the cytokine receptor family as a whole. By aligning a protein optimally with the protein of the cytokine receptors and by using the conventional immunoassays described herein to determine immunoidentity, one can determine the protein compositions of the invention.
VII. Kits and quantitation Both naturally occurring and recombinant forms of~the cytokine receptor like molecules of this invention are 10~ particularly useful in kits and assay methods. For example, these methods would also be applied to screening for binding activity, e.g., ligands for these proteins. Several methods of automating assays have been developed in recent years so as to permit screening of tens of thousands of compounds per year.
See, e.g., a BIOMEK automated workstation, Beckman Instruments, Palo Alto, California, and Fodor, et al. (1991) Science 251:767-773, which is incorporated herein by reference. The latter describes means for testing binding by a plurality of defined polymers synthesized on a solid substrate. The development of suitable assays to screen for a ligand or agonist/antagonist homologous proteins can be greatly facilitated by the availability of large amounts of purified, soluble cytokine receptors in an active state such as is provided by this invention.
Purified DCRS3 or DCRS4 can be coated directly onto plates for use in the aforementioned ligand screening techniques.
However, non-neutralizing antibodies to these proteins can be used as capture antibodies to immobilize the respective receptor on the solid phase, useful, e.9., in diagnostic uses.
This invention also contemplates use of DCRS3 or DCRS4, fragments thereof, peptides, and their fusion products in a variety of diagnostic kits and methods for detecting the presence of the protein or its ligand. Alternatively, or additionally, antibodies against the molecules may be incorporated into the kits and methods. Typically the kit will have a compartment containing either a DCRS3 or DCRS4 peptide or gene segment or a reagent which recognizes one or the other.
Typically, recognition reagents, in the case of peptide, would be a receptor or antibody, or in the case of a gene segment, would usually be a hybridization probe.
A preferred kit for determining the concentration of, e.g., 5 DCRS3 in a sample would typically comprise a labeled compound, e.g., ligand or antibody, having known binding affinity for DCRS3, a source of DCRS3 (naturally occurring or recombinant) as a positive control, and a means for separating the bound from free labeled compound, for example a solid phase for 10~ immobilizing DCRS3 in the test sample. Compartments containing reagents, and instructions, will normally be provided.
Appropriate nucleic acid or protein containing kits are also provided.
Antibodies, including antigen binding fragments, specific 15 for mammalian DCRS3 or a peptide fragment, or receptor fragments are useful in diagnostic applications to detect the presence of elevated levels of ligand and/or its fragments. Diagnostic assays may be homogeneous (without a separation step between free reagent and antibody-antigen complex) or heterogeneous 20 (with a separation step). Various commercial assays exist, such as radioimmunoassay (RIA), enzyme-linked immunosorbent assay (ELISA), enzyme immunoassay (EIA), enzyme-multiplied immunoassay technique (EMIT), substrate-labeled fluorescent immunoassay (SLFIA) and the like. For example, unlabeled antibodies can be 25 employed by using a second antibody which is labeled and which recognizes the antibody to a cytokine receptor or to a particular fragment thereof. These assays have also been extensively discussed in the literature. See, e.g., Harlow and Lane (1988) Antibodies: A Laboratory Manual, CSH., and Coligan 30 (ed. 1991 and periodic supplements) Current Protocols In Immunoloctv Greene/Wiley, New York.
Anti-idiotypic antibodies may have similar use to serve as agonists or antagonists of cytokine receptors. These should be useful as therapeutic reagents under appropriate circumstances.
35 Frequently, the reagents for diagnostic assays are supplied in kits, so as to optimize the sensitivity of the assay. For the subject invention, depending upon the nature of the assay, the protocol, and the label, either labeled or unlabeled antibody, or labeled ligand is provided. This is usually in conjunction with other additives, such as buffers, stabilizers, materials necessary for signal production such as substrates for enzymes, and the like. Preferably, the kit will also contain instructions for proper use and disposal of the contents after use. Typically the kit has compartments for each useful reagent, and will contain instructions for proper use and disposal of reagents. Desirably, the reagents are provided as a 10~ dry lyophilized powder, where the reagents may be reconstituted in an aqueous medium having appropriate concentrations for performing the assay.
The aforementioned constituents of the diagnostic assays may be used without modification or may be modified in a variety of ways. For example, labeling may be achieved by covalently or non-covalently joining a moiety which directly or indirectly provides a detectable signal. In many of these assays, a test compound, cytokine receptor, or antibodies thereto can be labeled either directly or indirectly. Possibilities for direct labeling include label groups: radiolabels such as 125I, enzymes (U. S. Pat. No. 3,645,090) such as peroxidase and alkaline phosphatase, and fluorescent labels (U. S. Pat. No. 3,940,475) capable of monitoring the change in fluorescence intensity, wavelength shift, or fluorescence polarization. Both of the patents are incorporated herein by reference. Possibilities for indirect labeling include biotinylation of one constituent followed by binding to avidin coupled to one of the above label groups.
There are also numerous methods of separating the bound from the free ligand, or alternatively the bound from the free test compound. The cytokine receptor can be immobilized on various matrixes followed by washing. Suitable matrices include plastic such as an ELISA plate, filters, and beads. Methods of immobilizing the receptor to a matrix include, without limitation, direct adhesion to plastic, use of a capture antibody, chemical coupling, and biotin-avidin. The last step in this approach involves the precipitation of antibody/antigen complex by any of several methods including those utilizing, e.g., an organic solvent such as polyethylene glycol or a salt such as ammonium sulfate. Other suitable separation techniques include, without limitation, the fluorescein antibody magnetizable particle method described in Rattle, et al. (1984) Clin. Chem. 30:1457-1461, and the double antibody magnetic particle separation as described in U.S. Pat. No. 4,659,678, each of which is incorporated herein by reference.
The methods for linking protein or fragments to various 10~ labels have been extensively reported in the literature and do not require detailed discussion here. Many of the techniques involve the use of activated carboxyl groups either through the use of carbodiimide or active esters to form peptide bonds, the formation of thioethers by reaction of a mercapto group with an activated halogen such as chloroacetyl, or an activated olefin such as maleimide, for linkage, or the like. Fusion proteins will also find use in these applications.
Another diagnostic aspect of this invention involves use of oligonucleotide or polynucleotide sequences taken from the sequence of an cytokine receptor. These sequences can be used as probes for detecting levels of the respective cytokine receptor in patients suspected of having an immunological disorder. The preparation of both RNA and DNA nucleotide sequences, the labeling of the sequences, and the preferred size of the sequences has received ample description and discussion in the literature. Normally an oligonucleotide probe should have at least about 14 nucleotides, usually at least about 18 nucleotides, and the polynucleotide probes may be up to several kilobases. Various labels may be employed, most commonly radionuclides, particularly 32P. However, other techniques may also be employed, such as using biotin modified nucleotides for introduction into a polynucleotide. The biotin then serves as the site for binding to avidin or antibodies, which may be labeled with a wide variety of labels, such as radionuclides, fluorescers, enzymes, or the like. Alternatively, antibodies may be employed which can recognize specific duplexes, including DNA duplexes, RNA duplexes, DNA-RNA hybrid duplexes, or DNA-protein duplexes. The antibodies in turn may be labeled and the assay carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. The use of probes to the novel anti-sense RNA may be carried out in conventional techniques such as nucleic acid hybridization, plus and minus screening, recombinational probing, hybrid released translation (HRT), and hybrid arrested translation (HART). This also includes amplification techniques such as polymerase chain 10~ reaction (PCR).
Diagnostic kits which also test for the qualitative or quantitative presence of other markers are also contemplated.
Diagnosis or prognosis may depend on the combination of multiple indications used as markers. Thus, kits may test for combinations of markers. See, e.9., Viallet, et al. (1989) Progress in Growth Factor Res. 1:89-97.
VIII. Therapeutic Utility This invention provides reagents with significant therapeutic value. See, e.g., Levitzk_i (1996) Curr. Opin. Cell Biol. 8:239-244. The cytokine receptors (naturally occurring or recombinant), fragments thereof, mutein receptors, and antibodies, along with compounds identified as having binding affinity to the receptors or antibodies, should be useful in the treatment of conditions exhibiting abnormal expression of the receptors of their ligands. Such abnormality will typically be manifested by immunological disorders. Additionally, this invention should provide therapeutic value in various diseases or disorders associated with abnormal expression or abnormal triggering of response to the ligand. For example, the IL-1 ligands have been suggested to be involved in morphologic development, e.g., dorso-ventral polarity determination, and immune responses, particularly the primitive innate responses.
See, e.g., Sun, et al. (1991) Eur. J. Biochem. 196:247-254; and Hultmark (1994) Nature 367:116-117.
Recombinant cytokine receptors, muteins, agonist or antagonist antibodies thereto, or antibodies can be purified and then administered to a patient. These reagents can be combined for therapeutic use with additional active ingredients, e.g., in conventional pharmaceutically acceptable carriers or diluents, along with physiologically innocuous stabilizers and excipients.
These combinations can be sterile, e.g., filtered, and placed into dosage forms as by lyophilization in dosage vials or storage in stabilized aqueous preparations. This invention also contemplates use of antibodies or. binding fragments thereof which are not complement binding.
10~ Ligand screening using cytokine receptor or fragments thereof can be performed to identify molecules having binding affinity to the receptors. Subsequent biological assays can then be utilized to determine if a putative ligand can provide competitive binding, which can block intrinsic stimulating activity. Receptor fragments can be used as a blocker or antagonist in that it blocks the activity of ligand. Likewise, a compound having intrinsic stimulating activity can activate the receptor and is thus an agonist in that it simulates the activity of ligand, e.g., inducing signaling. This invention further contemplates the therapeutic use of antibodies to cytokine receptors as antagonists.
The quantities of reagents necessary for effective therapy will depend upon many different factors, including means of administration, target site, reagent physiological life, pharmacological life, physiological state of the patient, and other medicants administered. Thus, treatment dosages should be titrated to optimize safety and efficacy. Typically, dosages used in vitro may provide useful guidance in the amounts useful for in situ administration of these reagents. Animal testing of effective doses for treatment of particular disorders will provide further predictive indication of human dosage. Various considerations are described, e.g., in Gilman, et al. (eds.
1990) Goodman and Gilman's~ The Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Reminaton's Pharmaceutical Sciences, 17th ed. (1990), Mack Publishing Co., Easton, Penn.; each of which is hereby incorporated herein by reference. Methods for administration are discussed therein and below, e.g., for oral, intravenous, intraperitoneal, or intramuscular administration, transdermal diffusion, and others.
Pharmaceutically acceptable carriers will include water, saline, buffers, and other compounds described, e.g., in the Merck 5 Index, Merck & Co., Rahway, New Jersey. Because of the likely high affinity binding, or turnover numbers, between a putative ligand and its receptors, low dosages of these reagents would be initially expected to be effective. And the signaling pathway suggests extremely low amounts of ligand may have effect. Thus, 10~ dosage ranges would ordinarily be expected to be in amounts lower than 1 mM concentrations, typically less than about 10 ~M
concentrations, usually less than about 100 nM, preferably less than about 10 pM (picomolar), and most preferably less than about 1 fM (femtomolar), with an appropriate carrier. Slow 15 release formulations, or slow release apparatus will often be utilized for continuous administration.
Cytokine receptors, fragments thereof, and antibodies or its fragments, antagonists, and agonists, may be administered directly to the host to be treated or, depending on the size of 20 the compounds, it may be desirable to conjugate them to carrier proteins such as ovalbumin or serum albumin prior to their administration. Therapeutic formulations may be administered in many conventional dosage formulations. While it is possible for the active ingredient to be administered alone, it is preferable 25 to present it as a pharmaceutical formulation. Formulations comprise at least one active ingredient, as defined above, together with one or more acceptable carriers thereof. Each carrier must be both pharmaceutically and physiologically acceptable in the sense of being compatible with the other 30 ingredients and not injurious to the patient. Formulations include those suitable for oral, rectal, nasal, or parenteral (including subcutaneous, intramuscular, intravenous and intradermal) administration. The formulations may conveniently be presented in unit dosage form and may be prepared by methods 35 well known in the art of pharmacy. See, e.g., Gilman, et al.
(eds. 1990) Goodman and Gilman's: The Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Reminaton's Pharmaceutical Sciences, 17th ed. (1990), Mack Publishing Co., Easton, Penn.; Avis, et al. (eds. 1993) Pharmaceutical Dosage Forms: Parenteral Medications Dekker, NY; Lieberman, et al.
(eds. 1990) Pharmaceutical Dosage Forms: Tablets Dekker, NY; and Lieberman, et al. (eds. 1990) Pharmaceutical Dosage Forms:
Disperse Systems Dekker, NY. The therapy of this invention may be combined with or used in association with other therapeutic agents, particularly agonists or antagonists of other cytokine receptor family members.
10~
IX. Screening Drug screening using DCRS3 or DCRS4 or fragments thereof can be performed to identify compounds having binding affinity to the receptor subunit, including isolation of associated components. Subsequent biological assays can then be utilized to determine if the compound has intrinsic stimulating activity and is therefore a blocker or antagonist in that it blocks the activity of the ligand. Likewise, a compound having intrinsic stimulating activity can activate the receptor and is thus an agonist in that it simulates the activity of a cytokine ligand.
This invention further contemplates the therapeutic use of antibodies to the receptor as cytokine agonists or antagonists.
Similarly, complexes comprising multiple proteins may be used to screen for ligands or reagents capable of recognizing the complex. Most cytokine receptors comprise at least two subunits, which may be the same, or distinct. Alternatively, the transmembrane receptor may bind to a complex comprising a cytokine-like ligand associated with another soluble protein serving, e.g., as a second receptor subunit.
One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant DNA molecules expressing, e.g., a DCRS3 in combination with another cytokine receptor subunit. Cells may be isolated which express a receptor in isolation from other functional receptors. Such cells, either in viable or fixed form, can be used for standard antibody/antigen or ligand/receptor binding assays. See also, Parce, et al. (1989) Science 246:243-247; and Owicki, et al. (1990) Proc. Nat'1 Acad.
Sci. USA 87:4007-4011, which describe sensitive methods to detect cellular responses. Competitive assays are particularly useful, where the cells (source of putative ligand) are contacted and incubated with a labeled receptor or antibody having known binding affinity to the ligand, such as 125I_ antibody, and a test sample whose binding affinity to the binding composition is being measured. The bound and free labeled binding compositions are then separated to assess the degree of ligand binding. The amount of test compound bound is 10~ inversely proportional to the amount of labeled receptor binding to the known source. Many techniques can be used to separate bound from free ligand to assess the degree of ligand binding.
This separation step could typically involve a procedure such as adhesion to filters followed by washing, adhesion to plastic followed by washing, or centrifugation of the cell membranes.
Viable cells could also be used to screen for the effects of drugs on cytokine mediated functions, e.g., second messenger levels, i.e., Ca++; cell proliferation; inositol phosphate pool changes; and others. Some detection methods allow for elimination of a separation step, e.g., a proximity sensitive detection system. Calcium sensitive dyes will be useful for detecting Ca++ levels, with a fluorimeter or a fluorescence cell sorting apparatus.
X. Ligands The descriptions of DCRS3 or DCRS4 herein provide means to identify ligands, as described above. Such ligand should bind specifically to the respective receptor with reasonably high affinity. Various constructs are made available which allow either labeling of the receptor to detect its ligand. For example, directly labeling cytokine receptor, fusing onto it markers for secondary labeling, e.g., FLAG or other epitope tags, etc., will allow detection of receptor. This can be histological, as an affinity method for biochemical purification, or labeling or selection in an expression cloning approach. A two-hybrid selection system may also be applied making appropriate constructs with the available cytokine receptor sequences. See, e.g., Fields and Song (1989) Nature 340:245-246.
The broad scope of this invention is best understood with reference to the following examples, which are not intended to limit the inventions to the specific embodiments.
EXAMPLES
I. General Methods 10' Some of the standard methods are described or referenced, e.g., in Maniatis, et al. (1982) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor Press;
Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual, (2d ed.), vols. 1-3, CSH Press, NY; or Ausubel, et al. (1987 and Supplements) Current Protocols in Molecular Biology, Greene/Wiley, New York. Methods for protein purification include such methods as ammonium sulfate precipitation, column chromatography, electrophoresis, centrifugation, crystallization, and others. See, e.g., Ausubel, et al. (1987 and periodic supplements); Coligan, et al. (ed. 1996) and periodic supplements, Current Protocols In Protein Science Greene/Wiley, New York; Deutscher (1990) "Guide to Protein Purification" in Methods in Enzymoloay, vol. 182, and other volumes in this series; and manufacturer's literature on use of protein purification products, e.g., Pharmacia, Piscataway, N.J., or Bio-Rad, Richmond, CA. Combination with recombinant techniques allow fusion to appropriate segments, e.g., to a FLAG
sequence or an equivalent which can be fused via a protease-removable sequence. See, e.g., Hochuli (1990) "Purification of Recombinant Proteins with Metal Chelate Absorbent" in Setlow (ed.) Genetic Engineering, Principle and Methods 12:87-98, Plenum Press, N.Y.; and Crowe, et al. (1992) QIAexpress: The High Level Expression & Protein Purification System QUIAGEN, Inc., Chatsworth, CA.
Computer sequence analysis is performed, e.g., using available software programs, including those from the GCG (U.
Wisconsin) and GenBank sources. Public sequence databases were also used, e.g., from GenBank and others.
Many techniques applicable to IL-10 receptors may be applied to DCRS3 or DCRS4, as described, e.g., in USSN
08/110,683 (IL-10 receptor), which is incorporated herein by reference.
II. Computational Analysis Human sequences related to cytokine receptors were identified from genomic sequence database using, e.g., the BLAST
5 server (Altschul, et al. (1994) Nature Genet. 6:119-129).
Standard analysis programs may be used to evaluate structure, e.g., PHD (Rost and Sander (1994) Proteins 19:55-72) and DSC
(King and Sternberg (1996) Protein Sci. 5:2298-2310). Standard comparison software includes, e.9., Altschul, et al. (1990) J.
10~ Mol. Biol. 215:403-10; Waterman (1995) Introduction to Computational Bioloq_y~ Maps Seauences and Genomes Chapman &
Hall; Lander and Waterman (eds. 1995) Calculatin4 the Secrets of Life Applications of the Mathematical Sciences in Molecular Biolocry National Academy Press; and Speed and Waterman (eds.
15 1996) Genetic Ma~pina and DNA Seauencing (IMA Volumes in Mathematics and Its Applications, Vol 81) Springer Verlag.
III. Cloning of full-length DCRS3 or DCRS4 cDNAs; Chromosomal localization 20 PCR primers derived from DCRS3 or DCRS4 sequence are used to probe a human cDNA library. Sequences may be derived, e.g., from Table 1 or 3, preferably those adjacent the ends of sequences. Full length cDNAs for primate, rodent, or other species are cloned, e.g., by DNA hybridization screening of 25 7~gt10 phage. PCR reactions are conducted using T. aquaticus Taqplus DNA polymerase (Stratagene) under appropriate conditions.
For experimental confirmation of localization, chromosome spreads are prepared. In situ hybridization is performed on 30 chromosome preparations obtained from phytohemagglutinin-stimulated human lymphocytes cultured for 72 h. 5-bromodeoxyuridine was added for the final seven hours of culture (60 ~g/ml of medium), to ensure a posthybridization chromosomal banding of good quality.
35 A PCR fragment, amplified with the help of primers, is cloned into an appropriate vector. The vector is labeled by nick-translation with 3H. The radiolabeled probe is hybridized to metaphase spreads at final concentration of 200 ng/ml of hybridization solution as described in Mattei, et al. (1985) Hum. Genet. 69:327-331.
After coating with nuclear track emulsion (KODAK NTB2), slides are exposed. To avoid any slipping of silver grains during the banding procedure, chromosome spreads are first stained with buffered Giemsa solution and metaphase photographed. R-banding is then performed by the fluorochrome-photolysis-Giemsa (FPG) method and metaphases rephotographed before analysis.
10~ Similar appropriate methods are used for other species.
IV. Localization of DCRS mRNA
Human multiple tissue (Cat# 1, 2) and cancer cell line blots (Cat# 7757-1), containing approximately 2 ~g of poly(A)+
RNA per lane, are purchased from Clontech (Palo Alto, CA).
Probes are radiolabeled with [a-32P] dATP, e.g., using the Amersham Rediprime random primer labeling kit (RPN1633).
Prehybridization and hybridizations are performed, e.g., at 65°
C in 0.5 M Na2HP04, 7% SDS, 0.5 M EDTA (pH 8.0). High stringency washes are conducted, e.g., at 65° C with two initial washes in 2 x SSC, 0.1% SDS for 40 min followed by a subsequent wash in 0.1 x SSC, O.lo SDS for 20 min. Membranes are then exposed at -70° C to X-Ray film (Kodak) in the presence of intensifying screens. More detailed studies by cDNA library Southerns are performed with selected appropriate human DCRS3 clones to examine their expression in hemopoietic or other cell subsets.
Alternatively, two appropriate primers are selected from Tables 1 or 3. RT-PCR is used on an appropriate mRNA sample selected for the presence of message to produce a cDNA, e.g., a sample which expresses the gene.
Full length clones may be isolated by hybridization of cDNA
libraries from appropriate tissues pre-selected by PCR signal.
Northern blots can be performed.
Message for genes encoding DCRS3 or DCRS4 will be assayed by appropriate technology, e.g., PCR, immunoassay, hybridization, or otherwise. Tissue and organ cDNA preparations are available, e.g., from Clontech, Mountain View, CA.
Identification of sources of natural expression are useful, as described. And the identification of functional receptor subunit pairings will allow for prediction of what cells express the combination of receptor subunits which will result in a physiological responsiveness to each of the cytokine ligands.
For mouse distribution, e.g., Southern Analysis can be performed: DNA (5 fig) from a primary amplified cDNA library was digested with appropriate restriction enzymes to release the 10~ inserts, run on a to agarose gel and transferred to a nylon membrane (Schleicher and Schuell, Keene, NH).
Samples for mouse mRNA isolation may include: resting mouse fibroblastic L cell line (C200); Braf:ER (Braf fusion to estrogen receptor) transfected cells, control (C201); T cells, TH1 polarized (Me114 bright, CD4+ cells from spleen, polarized for 7 days with IFN-y and anti IL-4; T200); T cells, TH2 polarized (Me114 bright, CD4+ cells from spleen, polarized for 7 days with IL-4 and anti-IFN-y; T201); T cells, highly TH1 polarized (see Openshaw, et al. (1995) J. Exp. Med. 182:1357-1367; activated with anti-CD3 for 2, 6, 16 h pooled; T202); T
cells, highly TH2 polarized (see Openshaw, et al. (1995) J. Exp.
Med. 182:1357-1367; activated with anti-CD3 for 2, 6, 16 h pooled; T203); CD44- CD25+ pre T cells, sorted from thymus (T204); TH1 T cell clone D1.1, resting for 3 weeks after last stimulation with antigen (T205); TH1 T cell clone D1.1, 10 ~g/ml ConA stimulated 15 h (T206); TH2 T cell clone CDC35, resting for 3 weeks after last stimulation with antigen (T207); TH2 T cell clone CDC35, 10 ~g/ml ConA stimulated 15 h (T208); Me114+ naive T cells from spleen, resting (T209); Me114+ T cells, polarized to Thl with IFN-y/IL-12/anti-IL-4 for 6, 12, 24 h pooled (T210);
Me114+ T cells, polarized to Th2 with IL-4/anti-IFN-y for 6, 13, 24 h pooled (T211); unstimulated mature B cell leukemia cell line A20 (B200); unstimulated B cell line CH12 (B201);
unstimulated large B cells from spleen (B202); B cells from total spleen, LPS activated (B203); metrizamide enriched dendritic cells from spleen, resting (D200); dendritic cells from bone marrow, resting (D201); monocyte cell line RAW 264.7 activated with LPS 4 h (M200); bone-marrow macrophages derived with GM and M-CSF (M201); macrophage cell line J774, resting (M202); macrophage cell line J774 + LPS + anti-IL-10 at 0.5, 1, 3, 6, 12 h pooled (M203); macrophage cell line J774 + LPS + IL-10 at 0.5, 1, 3, 5, 12 h pooled(M204); aerosol challenged mouse lung tissue, Th2 primers, aerosol OVA challenge 7, 14, 23 h pooled (see Garlisi, et al. (1995) Clinical Immunology and Immunopathologv 75:75-83; X206); Nippostrongulus-infected lung tissue (see Coffman, et al. (1989) Science 245:308-310; X200);
10~ total adult lung, normal (0200); total lung, rag-1 (see Schwarz, et al. (1993) Immunodeficiency 4:249-252; 0205); IL-10 K.O.
spleen (see Kuhn, et al. (1991) Cell 75:263-274; X201); total adult spleen, normal (0201); total spleen, rag-1 (0207); IL-10 K.O. Peyer's patches (0202); total Peyer's patches, normal (0210); IL-10 K.O. mesenteric lymph nodes (X203); total mesenteric lymph nodes, normal (0211); IL-10 K.O. colon (X203);
total colon, normal (0212); NOD mouse pancreas (see Makino, et al. (1980) Jikken Dobutsu 29:1-13; X205); total thymus, rag-1 (0208); total kidney, rag-1 (0209); total heart, rag-1 (0202);
total brain, rag-1 (0203); total testes, rag-1 (0204); total liver, rag-1 (0206); rat normal joint tissue (0300); and rat arthritic joint tissue (X300).
Samples for human mRNA isolation may include: peripheral blood mononuclear cells (monocytes, T cells, NK cells, granulocytes, B cells), resting (T100); peripheral blood mononuclear cells, activated with anti-CD3 for 2, 6, 12 h pooled (T101); T cell, THO clone Mot 72, resting (T102); T cell, THO
clone Mot 72, activated with anti-CD28 and anti-CD3 for 3, 6, 12 h pooled (T103); T cell, THO clone Mot 72, anergic treated with specific peptide for 2, 7, 12 h pooled (T104); T cell, TH1 clone HY06, resting (T107); T cell, TH1 clone HY06, activated with anti-CD28 and anti-CD3 for 3, 6, 12 h pooled (T108); T cell, TH1 clone HY06, anergic treated with specific peptide for 2, 6, 12 h pooled (T109); T cell, TH2 clone HY935, resting (T110); T cell, TH2 clone HY935, activated with anti-CD28 and anti-CD3 for 2, 7, 12 h pooled (T111); T cells CD4+CD45R0- T cells polarized 27 days in anti-CD28, IL-4, and anti IFN-y, TH2 polarized, activated with anti-CD3 and anti-CD28 4 h (T116); T cell tumor lines Jurkat and Hut78, resting (T117); T cell clones, pooled AD130.2, Tc783.12, Tc783.13, Tc783.58, Tc782.69, resting (T118); T cell random ys T cell clones, resting (T119); Splenocytes, resting (B100); Splenocytes, activated with anti-CD40 and IL-4 (B101); B
cell EBV lines pooled WT49, RSB, JY, CVIR, 721.221, RM3, HSY, resting (B102); B cell line JY, activated with PMA and ionomycin for 1, 6 h pooled (B103); NK 20 clones pooled, resting (K100);
NK 20 clones pooled, activated with PMA and ionomycin for 6 h 10~ (K101); NKL clone, derived from peripheral blood of LGL leukemia patient, IL-2 treated (K106); NK cytotoxic clone 640-A30-1, resting (K107); hematopoietic precursor line TF1, activated with PMA and ionomycin for 1, 6 h pooled (C100); U937 premonocytic line, resting (M100); U937 premonocytic line, activated with PMA
and ionomycin for 1, 6 h pooled (M101); elutriated monocytes, activated with LPS, IFNy, anti-IL-10 for 1, 2, 6, 12, 24 h pooled (M102); elutriated monocytes, activated with LPS, IFNy, IL-10 for l, 2, 6, 12, 24 h. pooled (M103); elutriated monocytes, activated with LPS, IFNy, anti-IL-10 for 4, 16 h pooled (M106); elutriated monocytes, activated with LPS, IFNy, IL-10 for 4, 16 h pooled (M107); elutriated monocytes, activated LPS for 1 h (M108);
elutriated monocytes, activated LPS for 6 h (M109); DC 70%
CDla+, from CD34+ GM-CSF, TNFa 12 days, resting (D101); DC 70%
CDla+, from CD34+ GM-CSF, TNFa 12 days, activated with PMA and ionomycin for 1 hr (D102); DC 70% CDla+, from CD34+ GM-CSF, TNFa 12 days, activated with PMA and ionomycin for 6 hr (D103); DC
95% CDla+, from CD34+ GM-CSF, TNFa 12 days FACS sorted, activated with PMA and ionomycin for 1, 6 h pooled (D104); DC
95% CD14+, ex CD34+ GM-CSF, TNFa 12 days FACS sorted, activated with PMA and ionomycin 1, 6 hr pooled (D105); DC CDla+ CD86+, from CD34+ GM-CSF, TNFa 12 days FACS sorted, activated with PMA
and ionomycin for l, 6 h pooled (D106); DC from monocytes GM-CSF, IL-4 5 days, resting (D107); DC from monocytes GM-CSF, IL-4 5 days, resting (D108); DC from monocytes GM-CSF, IL-4 5 days, activated LPS 4, 16 h pooled (D109); DC from monocytes GM-CSF, IL-4 5 days, activated TNFa, monocyte supe for 4, 16 h pooled (D110); leiomyoma L11 benign tumor (X101); normal myometrium M5 (0115); malignant leiomyosarcoma GS1 (X103); lung fibroblast sarcoma line MRC5, activated with PMA and ionomycin for 1, 6 h pooled (C101); kidney epithelial carcinoma cell line CHA, activated with PMA and ionomycin for 1, 6 h pooled (C102);
5 kidney fetal 28 wk male (0100); lung fetal 28 wk male (0101);
liver fetal 28 wk male (0102); heart fetal 28 wk male (0103);
brain fetal 28 wk male (0104); gallbladder fetal 28 wk male (0106); small intestine fetal 28 wk male (0107); adipose tissue fetal 28 wk male (0108); ovary fetal 25 wk female (0109); uterus 10~ fetal 25 wk female (0110); testes fetal 28 wk male (0111);
spleen fetal 28 wk male (0112); adult placenta 28 wk (0113); and tonsil inflamed, from 12 year old (X100).
Similar samples may isolated in other species for evaluation.
V. Cloning of species counterparts of DCRS3 or DCRS4 Various strategies are used to obtain species counterparts of DCRS3 or DCRS4, preferably from other primates or rodents.
One method is by cross hybridization using closely related species DNA probes. It may be useful to go into evolutionarily similar species as intermediate steps. Another method is by using specific PCR primers based on the identification of blocks of similarity or difference between genes, e.g., areas of highly conserved or nonconserved polypeptide or nucleotide sequence.
Antibody based screening methods are also available, e.g., in expression cloning.
VI. Production of mammalian DCRS3 or DCRS4 protein An appropriate, e.g., GST, fusion construct is engineered for expression, e.g., in E. coli. For example, a mouse IGIF
pGex plasmid is constructed and transformed into E. coli.
Freshly transformed cells are grown, e.g., in LB medium containing 50 ~g/ml ampicillin and induced with IPTG (Sigma, St.
Louis, MO). After overnight induction, the bacteria are harvested and the pellets containing, e.g., DCRS3, protein are isolated. The pellets are homogenized, e.9., in TE buffer (50 mM Tris-base pH 8.0, 10 mM EDTA and 2 mM pefabloc) in 2 liters.
This material is passed through a microfluidizer (Microfluidics, Newton, MA) three times. The fluidized supernatant is spun down on a Sorvall GS-3 rotor for 1 h at 13,000 rpm. The resulting supernatant containing the cytokine receptor protein is filtered and passed over a glutathione-SEPHAROSE column equilibrated in 50 mM Tris-base pH 8Ø The fractions containing the DCRS3-GST
fusion protein are pooled and cleaved, e.g., with thrombin (Enzyme Research Laboratories, Inc., South Bend, IN). The cleaved pool is then passed over a Q-SEPHAROSE column 10~ equilibrated in 50 mM Tris-base. Fractions containing DCRS3 are pooled and diluted in cold distilled H20, to lower the conductivity, and passed back over a fresh Q-Sepharose column, alone or in succession with an immunoaffinity antibody column.
Fractions containing DCRS3 protein are pooled, aliquoted, and stored in the -70° C freezer.
Comparison of the CD spectrum with cytokine receptor protein may suggest that the protein is correctly folded. See Hazuda, et al. (1969) J. Biol. Chem. 264:1689-1693.
VII. Preparation of antibodies specific for DCRS3 or DCRS4 Inbred Balb/c mice are immunized intraperitoneally with recombinant forms of the protein, e.g., purified DCRS3 or stable transfected NIH-3T3 cells. Animals are boosted at appropriate time points with protein, with or without additional adjuvant, to further stimulate antibody production. Serum is collected, or hybridomas produced with harvested spleens.
Alternatively, Balb/c mice are immunized with cells transformed with the gene or fragments thereof, either endogenous or exogenous cells, or with isolated membranes enriched for expression of the antigen. Serum is collected at the appropriate time, typically after numerous further administrations. Various gene therapy techniques may be useful, e.g., in producing protein in situ, for generating an immune response. Serum or antibody preparations may be cross-absorbed or immunoselected to prepare substantially purified antibodies of defined specificity and high affinity.
Monoclonal antibodies may be made. For example, splenocytes are fused with an appropriate fusion partner and hybridomas are selected in growth medium by standard procedures.
Hybridoma supernatants are screened for the presence of antibodies which bind to DCRS3, e.g., by ELISA or other assay.
Antibodies which specifically recognize specific DCRS3 embodiments may also be selected or prepared.
In another method, synthetic peptides or purified protein are presented to an immune system to generate monoclonal or 10~ polyclonal antibodies. See, e.g., Coligan (ed. 1991) Current Protocols in Immunoloay Wiley/Greene; and Harlow and Lane (1989) Antibodies: A Laboratory Manual Cold Spring Harbor Press. In appropriate situations, the binding reagent is either labeled as described above, e.g., fluorescence or otherwise, or immobilized to a substrate for panning methods. Nucleic acids may also be introduced into cells in an animal to produce the antigen, which serves to elicit an immune response. See, e.g., Wang, et al.
(1993) Proc. Nat'1. Acad. Sci. 90:4156-4160; Barry, et al.
(1994) BioTechnigues 16:616-619; and Xiang, et al. (1995) Immunity 2: 129-135.
VIII. Production of fusion proteins with DCRS
Various fusion constructs are made with DCRS. A portion of the appropriate gene is fused to an epitope tag, e.g., a FLAG
tag, or to a two hybrid system construct. See, e.g., Fields and Song (1989) Nature 340:245-246.
The epitope tag may be used in an expression cloning procedure with detection with anti-FLAG antibodies to detect a binding partner, e.g., ligand for the respective cytokine receptor. The two hybrid system may also be used to isolate proteins which specifically bind to DCRS.
IX. Structure activity relationship Information on the criticality of particular residues is determined using standard procedures and analysis. Standard mutagenesis analysis is performed, e.g., by generating many different variants at determined positions, e.g., at the positions identified above, and evaluating biological activities of the variants. This may be performed to the extent of determining positions which modify activity, or to focus on specific positions to determine the residues which can be substituted to either retain, block, or modulate biological activity.
Alternatively, analysis of natural variants can indicate what positions tolerate natural mutations. This may result from populational analysis of variation among individuals, or across 10~ strains or species. Samples from selected individuals are analyzed, e.g., by PCR analysis and sequencing. This allows evaluation of population polymorphisms.
X. Isolation of a ligand for DCRS
A cytokine receptor can be used as a specific binding reagent to 'identify its binding partner, by taking advantage of its specificity of binding, much like an antibody would be used.
The binding receptor may be a heterodimer of receptor subunits;
or may involve, e.g., a complex of the DCRS with another subunit. A binding reagent is either labeled as described above, e.g., fluorescence or otherwise, or immobilized to a substrate for panning methods.
The binding composition is used to screen an expression library made from a cell line which expresses a binding partner, i.e., ligand, preferably membrane associated. Standard staining techniques are used to detect or sort surface expressed ligand, or surface expressing transformed cells are screened by panning.
Screening of intracellular expression is performed by various staining or immunofluorescence procedures. See also McMahan, et al. (1991) EMBO J. 10:2821-2832.
For example, on day 0, precoat 2-chamber permanox slides with 1 ml per chamber of fibronectin, 10 ng/ml in PBS, for 30 min at room temperature. Rinse once with PBS. Then plate COS
cells at 2-3 x 105 cells per chamber in 1.5 ml of growth media.
Incubate overnight at 37°C.
On day 1 for each sample, prepare 0.5 ml of a solution of 66 ~g/ml DEAE-dextran, 66 ~M chloroquine, and 4 ~g DNA in serum free DME. For each set, a positive control is prepared, e.g., of DCRS-FLAG cDNA at 1 and 1/200 dilution, and a negative mock.
Rinse cells with serum free DME. Add the DNA solution and incubate 5 hr at 37°C. Remove the medium and add 0.5 ml 10%
DMSO in DME for 2.5 min. Remove and wash once with DME. Add 1.5 ml growth medium and incubate overnight.
On day 2, change the medium. On days 3 or 4, the cells are fixed and stained. Rinse the cells twice with Hank's Buffered Saline Solution (HBSS) and fix in 4% paraformaldehyde 10~ (PFA)/glucose for 5 min. Wash 3X with HBSS. The slides may be stored at -80°C after all liquid is removed. For each chamber, 0.5 ml incubations are performed as follows. Add HBSS/saponin (0.1%) with 32 ~1/ml of 1 M NaN3 for 20 min. Cells are then washed with HBSS/saponin 1X. Add appropriate DCRS or DCRS/antibody complex to cells and incubate for 30 min. wash cells twice with HBSS/saponin. If appropriate, add first antibody for 30 min. Add second antibody, e.g., Vector anti-mouse antibody, at 1/200 dilution, and incubate for 30 min.
Prepare ELISA solution, e.9., Vector Elite ABC horseradish peroxidase solution, and preincubate for 30 min. Use, e.g., 1 drop of solution A (avidin) and 1 drop solution B (biotin) per 2.5 ml HBSS/saponin. Wash cells twice with HBSS/saponin. Add ABC HRP solution and incubate for 30 min. Wash cells twice with HBSS, second wash for 2 min, which closes cells. Then add Vector diaminobenzoic acid (DAB) for 5 to 10 min. Use 2 drops of buffer plus 4 drops DAB plus 2 drops of H202 per 5 ml of glass distilled water. Carefully remove chamber and rinse slide in water. Air dry for a few minutes, then add 1 drop of Crystal Mount and a cover slip. Bake for 5 min at 85-90°C.
Evaluate positive staining of pools and progressively subclone to isolation of single genes responsible for the binding.
Alternatively, receptor reagents are used to affinity purify or sort out cells expressing a putative ligand. See, e.g., Sambrook, et al. or Ausubel, et al.
Another strategy is to screen for a membrane bound receptor by panning. The receptor cDNA is constructed as described above. Immobilization may be achieved by use of appropriate antibodies which recognize, e.g., a FLAG sequence of a DCRS
fusion construct, or by use of antibodies raised against the first antibodies. Recursive cycles of selection and 5 amplification lead to enrichment of appropriate clones and eventual isolation of receptor expressing clones.
Phage expression libraries can be screened by mammalian DCRS. Appropriate label techniques, e.g., anti-FLAG antibodies, will allow specific labeling of appropriate clones.
10~
All citations herein are incorporated herein by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
15 Many modifications and variations of this invention can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. The specific embodiments described herein are offered by way of example only, and the invention is to be limited by the terms of the appended claims, 20 along with the full scope of equivalents to which such claims are entitled; and the invention is not to be limited by the specific embodiments that have been presented herein by way of example.
SEQUENCE LISTING
<110> Schering Corporation <120> Mammalian Receptor Proteins; Related Reagents and Methods <130> DX01086K PCT
<140>
<141>
<150> US 09/443,060 <151> 1999-11-18 <160> 32 <170> PatentIn Ver. 2.0 <210> 1 <211> 1707 <212> DNA
<213> primate; surmised Homo Sapiens <220>
<221> CDS
<222> (1)..(1704) <220>
<221> mat peptide <222> (61)..(1704) <400> 1 atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly gcc ctc gag ggg atg gag agg aag ctc tgc agt ccc aag cca ccc ccc 96 Ala Leu Glu Gly Met Glu Arg Lys Leu Cys Ser Pro Lys Pro Pro Pro acc aag gcc tct ctc ccc act gac cct cca ggc tgg ggc tgc ccc gac 144 Thr Lys Ala Ser Leu Pro Thr Asp Pro Pro Gly Trp Gly Cys Pro Asp ctc gtc tgc tac acc gat tac ctc cag acg gtc atc tgc atc ctg gaa 192 Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu atg tgg aac ctc cac ccc agc acg ctc acc ctt acc tgg ata ctt tct 240 Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Ile Leu Ser aat aat act ggg tgc tat atc aag gac aga aca ctg gac ctc agg caa 288 Asn Asn Thr Gly Cys Tyr Ile Lys Asp Arg Thr Leu Asp Leu Arg Gln gac cag tat gaa gag ctg aag gac gag gcc acc tcc tgc agc ctc cac 336 Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His agg tcg gcc cac aat gcc acg cat gcc acc tac acc tgc cac atg gat 384 Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp gta ttc cac ttc atg gcc gac gac att ttc agt gtc aac atc aca gac 432 Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp cag tct ggc aac tac tcc cag gag tgt ggc agc ttt ctc ctg get gag 480 Gln Ser Gly Asn Tyr Ser Gln Glu Cys Gly Ser Phe Leu Leu Ala Glu agc aga cag tat aat atc tcc tgg cgc tca gat tac gaa gac cct gcc 528 Ser Arg Gln Tyr Asn Ile Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala ttc tac atg ctg aag ggc aag ctt cag tat gag ctg cag tac agg aac 576 Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc tca gtg 624 Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa gac tcg 672 Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser agc tat gag ctg cag gtg cgg gca ggg ccc atg cct ggc tcc tcc tac 720 Ser Tyr Glu Leu Gln Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc ttt cag acc cag 768 Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Phe Gln Thr Gln tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt ctc ctc 816 Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag acc cat 864 Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc cct gag 912 Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc aag aaa 960 Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga ccc tgg 1008 Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac cca cca 1056 Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa cca gca 1104 Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg ccg aca 1152 Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat cgg cca 1200 Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca gag ggg 1248 Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca gcc ctg 1296 Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac cca ctc 1344 Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca get ggc 1392 Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga cta aag 1440 Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc tgg ggt 1488 Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca ccc ctg 1536 Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc tct gac 1584 Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac gaa gga 1632 Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg cca ctt 1680 Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu tcg agc cct gga ccc cag gcc agc taa 1707 Ser Ser Pro Gly Pro Gln Ala Ser <210> 2 <211> 568 <212> PRT
<213> primate; surmised Homo sapiens <400> 2 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly Ala Leu Glu Gly Met Glu Arg Lys Leu Cys Ser Pro Lys Pro Pro Pro Thr Lys Ala Ser Leu Pro Thr Asp Pro Pro Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Ile Leu Ser Asn Asn Thr Gly Cys Tyr Ile Lys Asp Arg Thr Leu Asp Leu Arg Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Glu Cys Gly Ser Phe Leu Leu Ala Glu Ser Arg Gln Tyr Asn Ile Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser Ser Tyr Glu Leu Gln Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Phe Gln Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser <210> 3 <211> 1704 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (6). (1704) <223> n may be a, c, g, or t <400> 3 atgccnmgng gntgggcngc nccnytnytn ytnytnytny tncarggngc nytngarggn 60 atggarmgna arytntgyws nccnaarccn ccnccnacna argcnwsnyt nccnacngay 120 ccnccnggnt ggggntgycc ngayytngtn tgytayacng aytayytnca racngtnath 180 tgyathytng aratgtggaa yytncayccn wsnacnytna cnytnacntg gathytnwsn 240 aayaayacng gntgytayat haargaymgn acnytngayy tnmgncarga ycartaygar 300 garytnaarg aygargcnac nwsntgywsn ytncaymgnw sngcncayaa ygcnacncay 360 gcnacntaya cntgycayat ggaygtntty cayttyatgg cngaygayat httywsngtn 420 aayathacng aycarwsngg naaytaywsn cargartgyg gnwsnttyyt nytngcngar 480 wsnmgncart ayaayathws ntggmgnwsn gaytaygarg ayccngcntt ytayatgytn 540 aarggnaary tncartayga rytncartay mgnaaymgng gngayccntg ggcngtnwsn 600 ccnmgnmgna arytnathws ngtngaywsn mgnwsngtnw snytnytncc nytngartty 660 mgnaargayw snwsntayga rytncargtn mgngcnggnc cnatgccngg nwsnwsntay 720 carggnacnt ggwsngartg gwsngayccn gtnathttyc aracncarws ngargarytn 780 aargarggnt ggaayccnca yytnytnytn ytnytnytny tngtnathgt nttyathccn 840 gcnttytggw snytnaarac ncayccnytn tggmgnytnt ggaaraarat htgggcngtn 900 ccnwsnccng armgnttytt yatgccnytn tayaarggnt gywsnggnga yttyaaraar 960 tgggtnggng cnccnttyac nggnwsnwsn ytngarytng gnccntggws nccngargtn 1020 ccnwsnacny tngargtnta ywsntgycay ccnccnmgnw snccngcnaa rmgnytncar 1080 ytnacngary tncargarcc ngcngarytn gtngarwsng ayggngtncc naarccnwsn 1140 ttytggccna cngcncaraa ywsnggnggn wsngcntayw sngargarmg ngaymgnccn 1200 tayggnytng tnwsnathga yacngtnacn gtnytngayg cngarggncc ntgyacntgg 1260 ccntgywsnt gygargayga yggntayccn gcnytngayy tngaygcngg nytngarccn 1320 wsnccnggny tngargaycc nytnytngay gcnggnacna cngtnytnws ntgyggntgy 1380 gtnwsngcng gnwsnccngg nytnggnggn ccnytnggnw snytnytnga ymgnytnaar 1440 ccnccnytng cngayggnga rgaytgggcn ggnggnytnc cntggggngg nmgnwsnccn 1500 ggnggngtnw sngarwsnga rgcnggnwsn ccnytngcng gnytngayat ggayacntty 1560 gaywsnggnt tygtnggnws ngaytgywsn wsnccngtng artgygaytt yacnwsnccn 1620 ggngaygarg gnccnccnmg nwsntayytn mgncartggg tngtnathcc nccnccnytn 1680 wsnwsnccng gnccncargc nwsn 1704 <210> 4 <211> 750 <212> DNA
<213> primate; surmised Homo Sapiens <220>
<221> CDS
<222> (1)..(747) <220>
<221> mat peptide <222> (64)..(747) <400> 4 atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa cct tat tac ggg agg agg ggc aaa aat aaa aat aaa ggg aat cct 336 Glu Pro Tyr Tyr Gly Arg Arg Gly Lys Asn Lys Asn Lys Gly Asn Pro tgg ggg cca aaa caa agt aaa cgg aaa tca aag ggg aac cag aag acc 384 Trp Gly Pro Lys Gln Ser Lys Arg Lys Ser Lys Gly Asn Gln Lys Thr aac aca gtg act gcc cca get gcc ctg aag gca ttt get gga tgt gca 432 Asn Thr Val Thr Ala Pro Ala Ala Leu Lys Ala Phe Ala Gly Cys Ala aaa ata gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg 480 Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu ttg gta att ctc cat get cca aat tta cca tat aga tac caa aag gaa 528 Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu aaa aat gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt 576 Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe ata att aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get 624 Ile Ile Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala cac aga gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt 672 His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys gta gtg get gaa ata tat cag ccc atg tta gac aga aga agt cag aga 720 Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg agt gaa gag aga tgt gtg gaa att cca tga 750 Ser Glu Glu Arg Cys Val Glu Ile Pro <210> 5 <211> 249 <212> PRT
<213> primate; surmised Homo sapiens <400> 5 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln Glu Pro Tyr Tyr Gly Arg Arg Gly Lys Asn Lys Asn Lys Gly Asn Pro Trp Gly Pro Lys Gln Ser Lys Arg Lys Ser Lys.Gly Asn Gln Lys Thr Asn Thr Val Thr Ala Pro Ala Ala Leu Lys Ala Phe Ala Gly Cys Ala Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu Glu Arg Cys Val Glu Ile Pro <210> 6 <211> 747 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (1). (747) <223> n may be a, c, g, or t <400> 6 atgatgccna arcaytgytt yytnggntty ytnathwsnt tyttyytnac nggngtngcn 60 ggnacncarw snacncayga rwsnytnaar ccncarmgng tncarttyca rwsnmgnaay 120 ttycayaaya thytncartg gcarccnggn mgngcnytna cnggnaayws nwsngtntay 180 ttygtncart ayaarathta yggncarmgn cartggaara ayaargarga ytgytggggn 240 acncargary tnwsntgyga yytnacnwsn garacnwsng ayathcarga rccntaytay 300 ggnmgnmgng gnaaraayaa raayaarggn aayccntggg gnccnaarca rwsnaarmgn 360 aarwsnaarg gnaaycaraa racnaayacn gtnacngcnc cngcngcnyt naargcntty 420 gcnggntgyg cnaarathga yccnccngtn atgaayatha cncargtnaa yggnwsnytn 480 ytngtnathy tncaygcncc naayytnccn taymgntayc araargaraa raaygtnwsn 540 athgargayt aytaygaryt nytntaymgn gtnttyatha thaayaayws nytngaraar 600 garcaraarg tntaygargg ngcncaymgn gcngtngara thgargcnyt nacnccncay 660 wsnwsntayt gygtngtngc ngarathtay carccnatgy tngaymgnmg nwsncarmgn 720 wsngargarm gntgygtnga rathccn 747 <210> 7 <211> 210 <212> PRT
<213> primate <400> 7 Val Asn Gly Thr Ser Gln Phe Thr Cys Phe Tyr Asn Ser Arg Ala Asn Ile Ser Cys Val Trp Ser Gln Asp Gly Ala Leu Gln Asp Thr Ser Cys Gln Val His Ala Trp Pro Asp Arg Arg Arg Trp Asn Gln Thr Cys Glu Leu Leu Pro Val Ser Gln Ala Ser Trp Ala Cys Asn Leu Ile Leu Gly Ala Pro Asp Ser Gln Lys Leu Thr Thr Val Asp Ile Val Thr Leu Arg Val Leu Cys Arg Glu Gly Val Arg Trp Arg Val Met Ala Ile Gln Asp Phe Lys Pro Phe Glu Asn Leu Arg Leu Met Ala Pro Ile Ser Leu Gln Val Val His Val Glu Thr His Arg Cys Asn Ile Ser Trp Glu Ile Ser Gln Ala Ser His Tyr Phe Glu Arg His Leu Glu Phe Glu Ala Arg Thr Leu Ser Pro Gly His Thr Trp Glu Glu Ala Pro Leu Leu Thr Leu Lys Gln Lys Gln Glu Trp Ile Cys Leu Glu Thr Leu Thr Pro Asp Thr Gln Tyr Glu Phe Gln Val Arg Val Lys Pro Leu Gln Gly Glu Phe Thr Thr Trp Ser Pro Trp Ser Gln Pro Leu Ala Phe Arg Thr Lys Pro Ala Ala Leu Gly <210> 8 <211> 231 <212> PRT
<213> primate <400> 8 Ile Cys Ile Cys Thr Cys Val Cys Leu Gly Val Ser Val Thr Gly Glu Gly Gln Gly Pro Arg Ser Arg Thr Phe Thr Cys Leu Thr Asn Asn Ile Leu Arg Ile Asp Cys His Trp Ser Ala Pro Glu Leu Gly Gln Gly Ser Ser Pro Trp Leu Leu Phe Thr Ser Asn Gln Ala Pro Gly Gly Thr His Lys Cys Ile Leu Arg Gly Ser Glu Cys Thr Val Val Leu Pro Pro Glu Ala Val Leu Val Pro Ser Asp Asn Phe Thr Ile Thr Phe His His Cys Met Ser Gly Arg Glu Gln Val Ser Leu Val Asp Pro Glu Tyr Leu Pro Arg Arg His Val Lys Leu Asp Pro Pro Ser Asp Leu Gln Ser Asn Ile Ser Ser Gly His Cys Ile Leu Thr Trp Ser Ile Ser Pro Ala Leu Glu Pro Met Thr Thr Leu Leu Ser Tyr Glu Leu Ala Phe Lys Lys Gln Glu Glu Ala Trp Glu Gln Ala Gln His Arg Asp His Ile Val Gly Val Thr Trp Leu Ile Leu Glu Ala Phe Glu Leu Asp Pro Gly Phe Ile His Glu Ala Arg Leu Arg Val Gln Met Ala Thr Leu Glu Asp Asp Val Val Glu Glu Glu Arg Tyr Thr Gly Gln Trp Ser Glu Trp Ser Gln Pro Val Cys Phe Gln Ala Pro Gln Arg Gln <210> 9 <211> 216 <212> PRT
<213> primate <400> 9 Ile Cys Ile Cys Thr Cys Val Cys Leu Gly Val Ser Val Thr Gly Glu Gly Gln Gly Pro Arg Ser Arg Thr Phe Thr Cys Leu Thr Asn Asn Ile Leu Arg Ile Asp Cys His Trp Ser Ala Pro Glu Leu Gly Gln Gly Thr Leu His Tyr Trp Tyr Lys Asn Ser Asp Asn Asp Lys Val Gln Lys Cys Ser His Tyr Leu Phe Ser Glu Glu Ile Thr Ser Gly Cys Gln Leu Gln Lys Lys Glu Ile His Leu Tyr Gln Thr Phe Val Val Gln Leu Gln Asp Pro Arg Glu Pro Arg Arg Gln Ala Thr Gln Met Leu Lys Leu Gln Asn Leu Val Ile Pro Trp Ala Pro Glu Asn Leu Thr Leu His Lys Leu Ser Glu Ser Gln Leu Glu Leu Asn Trp Asn Asn Arg Phe Leu Asn His Cys Leu Glu His Leu Val Gln Tyr Arg Thr Asp Trp Asp His Ser Trp Thr Glu Gln Ser Val Asp Tyr Arg His Lys Phe Ser Leu Pro Ser Val Asp Gly Gln Lys Arg Tyr Thr Phe Arg Val Arg Ser Arg Phe Asn Pro Leu Cys Gly Ser Ala Gln His Trp Ser Glu Trp Ser His Pro Ile His Trp Gly Ser Asn Thr Ser Lys Glu Asn <210> 10 <211> 257 <212> PRT
<213> primate <400> 10 Leu Leu Ala Ser Asp Ser Glu Pro Leu Lys Cys Phe Ser Arg Thr Phe Glu Asp Leu Thr Cys Phe Trp Asp Glu Glu Glu Ala Ala Pro Ser Gly Thr Tyr Gln Leu Leu Tyr Ala Tyr Pro Arg Glu Lys Pro Arg Ala Cys Pro Leu Ser Ser Gln Ser Met Pro His Phe Gly Thr Arg Tyr Val Cys Gln Phe Pro Asp Gln Glu Glu Val Arg Leu Phe Phe Pro Leu His Leu Trp Val Lys Asn Val Phe Leu Asn Gln Thr Arg Thr Gln Arg Val Leu Phe Val Asp Ser Val Gly Leu Pro Ala Pro Pro Ser Ile Ile Lys Ala Met Gly Gly Ser Gln Pro Gly Glu Leu Gln Ile Ser Trp Glu Glu Pro Ala Pro Glu Ile Ser Asp Phe Leu Arg Tyr Glu Leu Arg Tyr Gly Pro Arg Asp Pro Lys Asn Ser Thr Gly Pro Thr Val Ile Gln Leu Ile Ala Thr Glu Thr Cys Cys Pro Ala Leu Gln Arg Pro His Ser Ala Ser Ala Leu Asp Gln Ser Pro Cys Ala Gln Pro Thr Met Pro Trp Gln Asp Gly Pro Lys Gln Thr Ser Pro Ser Arg Glu Ala Ser Ala Leu Thr Ala Glu Gly Gly Ser Cys Leu Ile Ser Gly Leu Gln Pro Gly Asn Ser Tyr Trp Leu Gln Leu Arg Ser Glu Pro Asp Gly Ile Ser Leu Gly Gly Ser Trp Gly Ser Trp Ser Leu Pro Val Thr Val Asp Leu Pro Gly Asp Ala Val Ala <210> 11 <211> 217 <212> PRT
<213> primate <400> 11 Val Ser Gly Glu Ser Gly Tyr Ala Gln Asn Gly Asp Leu Glu Asp Ala Glu Leu Asp Asp Tyr Ser Phe Ser Cys Tyr Ser Gln Leu Glu Val Asn Gly Ser Gln His Ser Leu Thr Cys Ala Phe Glu Asp Pro Asp Val Asn Thr Thr Asn Leu Glu Phe Glu Ile Cys Gly Ala Leu Val Glu Val Lys Cys Leu Asn Phe Arg Lys Leu Gln Glu Ile Tyr Phe Ile Glu Thr Lys Lys Phe Leu Leu Ile Gly Lys Ser Asn Ile Cys Val Lys Val Gly Glu Lys Ser Leu Thr Cys Lys Lys Ile Asp Leu Thr Thr Ile Val Lys Pro Glu Ala Pro Phe Asp Leu Ser Val Ile Tyr Arg Glu Gly Ala Asn Asp Phe Val Val Thr Phe Asn Thr Ser His Leu Gln Lys Lys Tyr Val Lys Val Leu Met His Asp Val Ala Tyr Arg Gln Glu Lys Asp Glu Asn Lys Trp Thr His Val Asn Leu Ser Ser Thr Lys Leu Thr Leu Leu Gln Arg Lys Leu Gln Pro Ala Ala Met Tyr Glu Ile Lys Val Arg Ser Ile Pro 180. 185 190 Asp His Tyr Phe Lys Gly Phe Trp Ser Glu Trp Ser Pro Ser Tyr Tyr Phe Arg Thr Pro Glu Ile Asn Asn Ser <210> 12 <211> 196 <212> PRT
<213> primate <400> 12 Pro Glu Asn Val Arg Met Asn Ser Val Asn Phe Lys Asn Ile Leu Gln Trp Glu Ser Pro Ala Phe Ala Lys Gly Asn Leu Thr Phe Thr Ala Gln Tyr Leu Ser Tyr Arg Ile Phe Gln Asp Lys Cys Met Asn Thr Thr Leu Thr Glu Cys Asp Phe Ser Ser Leu Ser Lys Tyr Gly Asp His Thr Leu Arg Val Arg Ala Glu Phe Ala Asp Glu His Ser Asp Trp Val Asn Ile Thr Phe Cys Pro Val Asp Asp Thr Ile Ile Gly Pro Pro Gly Met Gln Val Glu Val Leu Ala Asp Ser Leu His Met Arg Phe Leu Ala Pro Lys Ile Glu Asn Glu Tyr Glu Thr Trp Thr Met Lys Asn Val Tyr Asn Ser Trp Thr Tyr Asn Val Gln Tyr Trp Lys Asn Gly Thr Asp Glu Lys Phe Gln Ile Thr Pro Gln Tyr Asp Phe Glu Val Leu Arg Asn Leu Glu Pro Trp Thr Thr Tyr Cys Val Gln Val Arg Gly Phe Leu Pro Asp Arg Asn Lys Ala Gly Glu Trp Ser Glu Pro Val Cys Glu Gln Thr Thr His Asp Glu Thr Val Pro <210> 13 <211> 196 <212> PRT
<213> rodent <400> 13 Pro Glu Lys Val Arg Met Asn Ser Val Asn Phe Lys Asn Ile Leu Gln Trp Glu Val Pro Ala Phe Pro Lys Thr Asn Leu Thr Phe Thr Ala Gln Tyr Glu Ser Tyr Arg Ser Phe Gln Asp His Cys Lys Arg Thr Ala Ser Thr Gln Cys Asp Phe Ser His Leu Ser Lys Tyr Gly Asp Tyr Thr Val Arg Val Arg Ala Glu Leu Ala Asp Glu His Ser Glu Trp Val Asn Val Thr Phe Cys Pro Val Glu Asp Thr Ile Ile Gly Pro Pro Glu Met Gln Ile Glu Ser Leu Ala Glu Ser Leu His Leu Arg Phe Ser Ala Pro Gln Ile Glu Asn Glu Pro Glu Thr Trp Thr Leu Lys Asn Ile Tyr Asp Ser Trp Ala Tyr Arg Val Gln Tyr Trp Lys Asn Gly Thr Asn Glu_Lys Phe Gln Val Val Ser Pro Tyr Asp Ser Glu Val Leu Arg Asn Leu Glu Pro Trp Thr Thr Tyr Cys Ile Gln Val Gln Gly Phe Leu Leu Asp Gln Asn Arg Thr Gly Glu Trp Ser Glu Pro Ile Cys Glu Arg Thr Gly Asn Asp Glu Ile Thr Pro <210> 14 <211> 199 <212> PRT
<213> primate <400> 14 Pro Gln Lys Val Glu Val Asp Ile Ile Asp Asp Asn Phe Ile Leu Arg Trp Asn Arg Ser Asp Glu Ser Val Gly Asn Val Thr Phe Ser Phe Asp Tyr Gln Lys Thr Gly Met Asp Asn Trp Ile Lys Leu Ser Gly Cys Gln Asn Ile Thr Ser Thr Lys Cys Asn Phe Ser Ser Leu Lys Leu Asn Val Tyr Glu Glu Ile Lys Leu Arg Ile Arg Ala Glu Lys Glu Asn Thr Ser Ser Trp Tyr Glu Val Asp Ser Phe Thr Pro Phe Arg Lys Ala Gln Ile Gly Pro Pro Glu Val His Leu Glu Ala Glu Asp Lys Ala Ile Val Ile His Ile Ser Pro Gly Thr Lys Asp Ser Val Met Trp Ala Leu Asp Gly Leu Ser Phe Thr Tyr Ser Leu Leu Ile Trp Lys Asn Ser Ser Gly Val Glu Glu Arg Ile Glu Asn Ile Tyr Ser Arg His Lys Ile Tyr Lys Leu Ser Pro Glu Thr Thr Tyr Cys Leu Lys Val Lys Ala Ala Leu Leu Thr Ser Trp Lys Ile Gly Val Tyr Ser Pro Val His Cys Ile Lys Thr Thr Val Glu Asn Glu Leu Pro Pro <210> 15 <211> 200 <212> PRT
<213> rodent <400> 15 Pro Glu Asn Ile Asp Val Tyr Ile Ile Asp Asp Asn Tyr Thr Leu,Lys Trp Ser Ser His Gly Glu Ser Met Gly Ser Val Thr Phe Ser Ala Glu Tyr Arg Thr Lys Asp Glu Ala Lys Trp Leu Lys Val Pro Glu Cys Gln His Thr Thr Thr Thr Lys Cys Glu Phe Ser Leu Leu Asp Thr Asn Val Tyr Ile Lys Thr Gln Phe Arg Val Arg Ala Glu Glu Gly Asn Ser Thr Ser Ser Trp Asn Glu Val Asp Pro Phe Ile Pro Phe Tyr Thr Ala His Met Ser Pro Pro Glu Val Arg Leu Glu Ala Glu Asp Lys Ala Ile Leu Val His Ile Ser Pro Pro Gly Gln Asp Gly Asn Met Trp Ala Leu Glu Lys Pro Ser Phe Ser Tyr Thr Ile Arg Ile Trp Gln Lys Ser Ser Ser Asp Lys Lys Thr Ile Asn Ser Thr Tyr Tyr Val Glu Lys Ile Pro Glu Leu Leu Pro Glu Thr Thr Tyr Cys Leu Glu Val Lys Ala Ile His Pro Ser Leu Lys Lys His Ser Asn Tyr Ser Thr Val Gln Cys Ile Ser Thr Thr Val Ala Asn Lys Met Pro Val <210> 16 <211> 214 <212> PRT
<213> primate <400> 16 Pro Thr Asn Val Thr Ile Glu Ser Tyr Asn Met Asn Pro Ile Val Tyr Trp Glu Tyr Gln Ile Met Pro Gln Val Pro Val Phe Thr Val Glu Val Lys Asn Tyr Gly Val Lys Asn Ser Glu Trp Ile Asp Ala Cys Ile Asn Ile Ser His His Tyr Cys Asn Ile Ser Asp His Val Gly Asp Pro Ser Asn Ser Leu Trp Val Arg Val,Lys Ala Arg Val Gly Gln Lys Glu Ser Ala Tyr Ala Lys Ser Glu Glu Phe Ala Val Cys Arg Asp Gly Lys Ile Gly Pro Pro Lys Leu Asp Ile Arg Lys Glu Glu Lys Gln Ile Met Ile Asp Ile Phe His Pro Ser Val Phe Val Asn Gly Asp Glu Gln Glu Val Asp Tyr Asp Pro Glu Thr Thr Cys Tyr Ile Arg Val Tyr Asn Val Tyr Val Arg Met Asn Gly Ser Glu Ile Gln Tyr Lys Ile Leu Thr Gln Lys Glu Asp Asp Cys Asp Glu Ile Gln Cys Gln Leu Ala Ile Pro Val Ser Ser Leu Asn Ser Gln Tyr Cys Val Ser Ala Glu Gly Val Leu His Val Trp Gly Val Thr Thr Glu Lys Ser Lys Glu Val Cys Ile Thr Ile Phe Asn Ser Ser Ile Lys Gly <210> 17 <211> 213 <212> PRT
<213> rodent <400> 17 Pro Thr Asn Val Leu Ile Lys Ser Tyr Asn Leu Asn Pro Val Val Cys Trp Glu Tyr Gln Asn Met Ser Gln Thr Pro Ile Phe Thr Val Gln Val Lys Val Tyr Ser Gly Ser Trp Thr Asp Ser Cys Thr Asn Ile Ser Asp His Cys Cys Asn Ile Tyr Gly Gln Ile Met Tyr Pro Asp Val Ser Ala Trp Ala Arg Val Lys Ala Lys Val Gly Gln Lys Glu Ser Asp Tyr Ala Arg Ser Lys Glu Phe Leu Met Cys Leu Lys Gly Lys Val Gly Pro Pro Gly Leu Glu Ile Arg Arg Lys Lys Glu Glu Gln Leu Ser Val Leu Val Phe His Pro Glu Val Val Val Asn Gly Glu Ser Gln Gly Thr Met Phe Gly Asp Gly Ser Thr Cys Tyr Thr Phe Asp Tyr Thr Val Tyr Val Glu His Asn Arg Ser Gly Glu Ile Leu His Thr Lys His Thr Val Glu Lys Glu Glu Cys Asn Glu Thr Leu Cys Glu Leu Asn Ile Ser Val Ser Thr Leu Asp Ser Arg Tyr Cys Ile Ser Val Asp Gly Ile Ser Ser Phe Trp Gln Val Arg Thr Glu Lys Ser Lys Asp Val Cys Ile Pro Pro Phe His Asp Asp Arg Lys Asp <210> 18 <211> 207 <212> PRT
<213> rodent <400> 18 Pro Ser Tyr Val Trp Phe Glu Ala Arg Phe Phe Gln His Ile Leu His Trp Lys Pro Ile Pro Asn Gln Ser Glu Ser Thr Tyr Tyr Glu Val Ala Leu Lys Gln Tyr Gly Asn Ser Thr Trp Asn Asp Ile His Ile Cys Arg Lys Ala Gln Ala Leu Ser Cys Asp Leu Thr Thr Phe Thr Leu Asp Leu Tyr His Arg Ser Tyr Gly Tyr Arg Ala Arg Val Arg Ala Val Asp Asn Ser Gln Tyr Ser Asn Trp Thr Thr Thr Glu Thr Arg Phe Thr Val Asp Glu Val Ile Leu Thr Val Asp Ser Val Thr Leu Lys Ala Met Asp Gly Ile Ile Tyr Gly Thr Ile His Pro Pro Arg Pro Thr Ile Thr Pro Ala Gly Asp Glu Tyr Glu Gln Val Phe Lys Asp Leu Arg Val Tyr Lys Ile Ser Ile Arg Lys Phe Ser Glu Leu Lys Asn Ala Thr Lys Arg Val Lys Gln Glu Thr Phe Thr Leu Thr Val Pro Ile Gly Val Arg Lys Phe Cys Val Lys Val Leu Pro Arg Leu Glu Ser Arg Ile Asn Lys Ala Glu Trp Ser Glu Glu Gln Cys Leu Leu Ile Thr Thr Glu Gln Tyr Phe Thr <210> 19 <211> 204 <212> PRT
<213> primate <400> 19 Pro Pro Ser Val Trp Phe Glu Ala Glu Phe Phe His His Ile Leu His Trp Thr Pro Ile Pro Asn Gln Ser Glu Ser Thr Cys Tyr Glu Val Ala Leu Leu Arg Tyr Gly Ile Glu Ser Trp Asn Ser Ile Ser Asn Cys Ser Gln Thr Leu Ser Tyr Asp Leu Thr Ala Val Thr Leu Asp Leu Tyr His Ser Asn Gly Tyr Arg Ala Arg Val Arg Ala Val Asp Gly Ser Arg His Ser Asn Trp Thr Val Thr Asn Thr Arg Phe Ser Val Asp Glu Val Thr Leu Thr Val Gly Ser Val Asn Leu Glu Ile His Asn Gly Phe Ile Leu Gly Lys Ile Gln Leu Pro Arg Pro Lys Met Ala Pro Ala Asn Asp Thr Tyr Glu Ser Ile Phe Ser His Phe Arg Glu Tyr Glu Ile Ala Ile Arg Lys Val Pro Gly Asn Phe Thr Phe Thr His Lys Lys Val Lys His Glu Asn Phe Ser Leu Leu Thr Ser Gly Glu Val Gly Glu Phe Cys Val Gln Val Lys Pro Ser Val Ala Ser Arg Ser Asn Lys Gly Met Trp Ser Lys Glu Glu Cys Ile Ser Leu Thr Arg Gln Tyr Phe Thr <210> 20 <211> 208 <212> PRT
<213> primate <400> 20 Pro Leu Asn Pro Arg Leu His Leu Tyr Asn Asp Glu Gln Ile Leu Thr Trp Glu Pro Ser Pro Ser Ser Asn Asp Pro Arg Pro Val Val Tyr Gln Val Glu Tyr Ser Phe Ile Asp Gly Ser Trp His Arg Leu Leu Glu Pro Asn Cys Thr Asp Ile Thr Glu Thr Lys Cys Asp Leu Thr Gly Gly Gly Arg Leu Lys Leu Phe Pro His Pro Phe Thr Val Phe Leu Arg Val Arg Ala Lys Arg Gly Asn Leu Thr Ser Lys Trp Val Gly Leu Glu Pro Phe Gln His Tyr Glu Asn Val Thr Val Gly Pro Pro Lys Asn Ile Ser Val Thr Pro Gly Lys Gly Ser Leu Val Ile His Phe Ser Pro Pro Phe Asp Val Phe His Gly Ala Thr Phe Gln Tyr Leu Val His Tyr Trp Glu Lys Ser Glu Thr Gln Gln Glu Gln Val Glu Gly Pro Phe Lys Ser Asn Ser Ile Val Leu Gly Asn Leu Lys Pro Tyr Arg Val Tyr Cys Leu Gln Thr Glu Ala Gln Leu Ile Leu Lys Asn Lys Lys Ile Arg Pro His Gly Leu Leu Ser Asn Val Ser Cys His Glu Thr Thr Ala Asn Ala Ser Ala Arg <210> 21 <211> 207 <212> PRT
<213> primate <400> 21 Pro Ala Asn Ile Thr Phe Leu Ser Ile Asn Met Lys Asn Val Leu Gln Trp Thr Pro Pro Glu Gly Leu Gln Gly Val Lys Val Thr Tyr Thr Val Gln Tyr Phe Ile Tyr Gly Gln Lys Lys Trp Leu Asn Lys Ser Glu Cys Arg Asn Ile Asn Arg Thr Tyr Cys Asp Leu Ser Ala Glu Thr Ser Asp Tyr Glu His Gln Tyr Tyr Ala Lys Val Lys Ala Ile Trp Gly Thr Lys Cys Ser Lys Trp Ala Glu Ser Gly Arg Phe Tyr Pro Phe Leu Glu Thr Gln Ile Gly Pro Pro Glu Val Ala Leu Thr Thr Asp Glu Lys Ser Ile Ser Val Val Leu Thr Ala Pro Glu Lys Trp Lys Arg Asn Pro Glu Asp Leu Pro Val Ser Met Gln Gln Ile Tyr Ser Asn Leu Lys Tyr Asn Val Ser Val Leu Asn Thr Lys Ser Asn Arg Thr Trp Ser Gln Cys Val Thr Asn His Thr Leu Val Leu Thr Trp Leu Glu Pro Asn Thr Leu Tyr Cys Val His Val Glu Ser Phe Val Pro Gly Pro Pro Arg Arg Ala Gln Pro Ser Glu Lys Gln Cys Ala Arg Thr Leu Lys Asp Gln Ser Ser Glu <210> 22 <211> 234 <212> PRT
<213> primate <400> 22 Leu Gln His Val Lys Phe Gln Ser Ser Asn Phe Glu Asn Ile Leu Thr Trp Asp Ser Gly Pro Glu Gly Thr Pro Asp Thr Val Tyr Ser Ile Glu Tyr Lys Thr Tyr Gly Glu Arg Asp Trp Val Ala Lys Lys Gly Cys Gln Arg Ile Thr Arg Lys Ser Cys Asn Leu Thr Val Glu Thr Gly Asn Leu Thr Glu Leu Tyr Tyr Ala Arg Val Thr Ala Val Ser Ala Gly Gly Arg Ser Ala Thr Lys Met Thr Asp Arg Phe. Ser Ser Leu Gln His Thr Thr Leu Lys Pro Pro Asp Val Thr Cys Ile Ser Lys Val Arg Ser Ile Gln Met Ile Val His Pro Thr Pro Thr Pro Ile Arg Ala Gly Asp Gly His Arg Leu Thr Leu Glu Asp Ile Phe His Asp Leu Phe Tyr His Leu Glu Leu Gln Val Asn Arg Thr Tyr Gln Met His Leu Gly Gly Lys Gln Arg Glu Tyr Glu Phe Phe Gly Leu Thr Pro Asp Thr Glu Phe Leu Gly Thr Ile Met Ile Cys Val Pro Thr Trp Ala Lys Glu Ser Ala Pro Tyr Met Cys Arg Val Lys Thr Leu Pro Asp Arg Thr Trp Thr Tyr Ser Phe Ser Gly Ala Phe Leu Phe Ser Met Gly Phe Leu Val Ala Val Leu Cys Tyr Leu Ser Tyr Arg Tyr Val Thr Lys Pro Pro <210> 23 <211> 201 <212> PRT
<213> primate <400> 23 Ser Cys Thr Phe Lys Ile Ser Leu Arg Asn Phe Arg Ser Ile Leu Ser Trp Glu Leu Lys Asn His Ser Ile Val Pro Thr His Tyr Thr Leu Leu Tyr Thr Ile Met Ser Lys Pro Glu Asp Leu Lys Val Val Lys Asn Cys Ala Asn Thr Thr Arg Ser Phe Cys Asp Leu Thr Asp Glu Trp Arg Ser Thr His Glu Ala Tyr Val Thr Val Leu Glu Gly Phe Ser Gly Asn Thr Thr Leu Phe Ser Cys Ser His Asn Phe Trp Leu Ala Ile Asp Met Ser Phe Glu Pro Pro Glu Phe Glu Ile Val Gly Phe Thr Asn His Ile Asn Val Met Val Lys Phe Pro Ser Ile Val Glu Glu Glu Leu Gln Phe Asp Leu Ser Leu Val Ile Glu Glu Gln Ser Glu Gly Ile Val Lys Lys His Lys Pro Glu Ile Lys Gly Asn Met Ser Gly Asn Phe Thr Tyr Ile Ile Asp Lys Leu Ile Pro Asn Thr Asn Tyr Cys Val Ser Val Tyr Leu Glu His Ser Asp Glu Gln Ala Val Ile Lys Ser Pro Leu Lys Cys Thr Leu Leu Pro Pro Gly Gln Glu Ser Glu Ser <210> 24 <211> 1617 <212> DNA
<213> primate; surmised Homo sapiens <220>
<221> CDS
<222> (1)..(1614) <220>
<221> mat~eptide <222> (61)..(1614) <220>
<221> misc_feature <222> (1). (1617) <223> n may be a, c, g, or t; translated amino acid depends on genetic code <400> 24 atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly ggc tgg ggc tgc ccc gac ctc gtc tgc tac acc gat tac ctc cag acg 96 Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr gtc atc tgc atc ctg gaa atg tgg aac ctc cac ccc agc acg ctc acc 144 Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr ctt acc tgg caa gac cag tat gaa gag ctg aag gac gag gcc acc tcc 192 Leu Thr Trp Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser tgc agc ctc cac agg tcg gcc cac aat gcc acg cat gcc acc tac acc 240 Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr tgc cac atg gat gta ttc cac ttc atg gcc gac gac att ttc agt gtc 288 Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val aac atc aca gac cag tct ggc aac tac tcc cag gan tgt ggc agc ttt 336 Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Xaa Cys Gly Ser Phe ctc ctg get gag agc atc aag ccg get ccc cct ttc aac gtg act gtg 384 Leu Leu Ala Glu Ser Ile Lys Pro Ala Pro Pro Phe Asn Val Thr Val acc ttc tca gga cag tat aat atn tcc tgg cgc tca gat tac gaa gac 432 Thr Phe Ser Gly Gln Tyr Asn Xaa Ser Trp Arg Ser Asp Tyr Glu Asp cct gcc ttc tac atg ctg aaa ggc aag ctt caa tat gag ctg cag tac 480 Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr agg aac cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc 528 Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile tca gtg gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa 576 Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys gac tcg agc tat gag ctg can gtg cgg gca ggg ccc atg cct ggc tcc 624 Asp Ser Ser Tyr Glu Leu Xaa Val Arg Ala Gly Pro Met Pro Gly Ser tcc tac cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc tgt cag 672 Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Cys Gln acc cag tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt 720 Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu ctc ctc ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag 768 Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys acc cat cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc 816 Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser cct gag cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc 864 Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe aag aaa tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga 912 Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly ccc tgg agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac 960 Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His cca cca cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa 1008 Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu cca gca gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg 1056 Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp ccg aca gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat 1104 Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp cgg cca tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca 1152 Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala gag ggg cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca 1200 Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro gcc ctg gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac 1248 Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp cca ctc ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca 1296 Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser get ggc agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga 1344 Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg cta aag cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc 1392 Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro tgg ggt ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca 1440 Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser ccc ctg gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc 1488 Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly tct gac tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac 1536 Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp gaa gga ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg 1584 Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro cca ctt tcg agc cct gga ccc cag gcc agc taa 1617 Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser <210> 25 <211> 538 <212> PRT
<213> primate; surmised Homo Sapiens <400> 25 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Xaa Cys Gly Ser Phe Leu Leu Ala Glu Ser Ile Lys Pro Ala Pro Pro Phe Asn Val Thr Val Thr Phe Ser Gly Gln Tyr Asn Xaa Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser Ser Tyr Glu Leu Xaa Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Cys Gln Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser <210> 26 <211> 1614 <212> DNA
<213> Artificial Sequence <220>
<221> misc_feature <222> (1) . (1614) <223> n may be a, c, g, or t <220>
<223> Description of Artificial Sequence: reverse translation <400> 26 atgccnmgng gntgggcngc nccnytnytn ytnytnytny tncarggngg ntggggntgy 60 ccngayytng tntgytayac ngaytayytn caracngtna thtgyathyt ngaratgtgg 120 aayytncayc cnwsnacnyt nacnytnacn tggcargayc artaygarga rytnaargay 180 gargcnacnw sntgywsnyt ncaymgnwsn gcncayaayg cnacncaygc nacntayacn 240 tgycayatgg aygtnttyca yttyatggcn gaygayatht tywsngtnaa yathacngay 300 carwsnggna aytaywsnca rnnntgyggn wsnttyytny tngcngarws nathaarccn 360 gcnccnccnt tyaaygtnac ngtnacntty wsnggncart ayaaynnnws ntggmgnwsn 420 gaytaygarg ayccngcntt ytayatgytn aarggnaary tncartayga rytncartay 480 mgnaaymgng gngayccntg ggcngtnwsn ccnmgnmgna arytnathws ngtngaywsn 540 mgnwsngtnw snytnytncc nytngartty mgnaargayw snwsntayga rytnnnngtn 600 mgngcnggnc cnatgccngg nwsnwsntay carggnacnt ggwsngartg gwsngayccn 660 gtnathtgyc aracncarws ngargarytn aargarggnt ggaayccnca yytnytnytn 720 ytnytnytny tngtnathgt nttyathccn gcnttytggw snytnaarac ncayccnytn 780 tggmgnytnt ggaaraarat htgggcngtn ccnwsnccng armgnttytt yatgccnytn 840 tayaarggnt gywsnggnga yttyaaraar tgggtnggng cnccnttyac nggnwsnwsn 900 ytngarytng gnccntggws nccngargtn ccnwsnacny tngargtnta ywsntgycay 960 ccnccnmgnw snccngcnaa rmgnytncar ytnacngary tncargarcc ngcngarytn 1020 gtngarwsng ayggngtncc naarccnwsn ttytggccna cngcncaraa ywsnggnggn 1080 wsngcntayw sngargarmg ngaymgnccn tayggnytng tnwsnathga yacngtnacn 1140 gtnytngayg cngarggncc ntgyacntgg ccntgywsnt gygargayga yggntayccn 1200 gcnytngayy tngaygcngg nytngarccn wsnccnggny tngargaycc nytnytngay 1260 gcnggnacna cngtnytnws ntgyggntgy gtnwsngcng gnwsnccngg nytnggnggn 1320 ccnytnggnw snytnytnga ymgnytnaar ccnccnytng cngayggnga rgaytgggcn 1380 ggnggnytnc cntggggngg nmgnwsnccn ggnggngtnw sngarwsnga rgcnggnwsn 1440 ccnytngcng gnytngayat ggayacntty gaywsnggnt tygtnggnws ngaytgywsn 1500 wsnccngtng artgygaytt yacnwsnccn ggngaygarg gnccnccnmg nwsntayytn 1560 mgncartggg tngtnathcc nccnccnytn wsnwsnccng gnccncargc nwsn 1614 <210> 27 <211> 696 <212> DNA
<213> primate; surmised Homo sapiens <220>
<221> CDS
<222> (1)..(693) <220>
<221> mat~eptide <222> (64)..(693) <400> 27 atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln ccc ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa cct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Pro Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aca aaa ata 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Thr Lys Ile gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg ttg gta 432 Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val att ctc cat get cca aat tta cca tat aga tac caa aag gaa aaa aat 480 Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt ata att 528 Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get cac aga 576 Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt gta gtg 624 Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val get gaa ata tat cag ccc atg tta gac aga aga agt cag aga agt gaa 672 Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu gag aga tgt gtg gaa att cca tga 696 Glu Arg Cys Val Glu Ile Pro <210> 28 <211> 231 <212> PRT
<213> primate; surmised Homo sapiens <400> 28 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln Glu Pro Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Thr Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile Asn Asn Ser Leu Glu.Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu Glu Arg Cys Val Glu Ile Pro <210> 29 <211> 693 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (1) . (693) <223> n may be a, c, g, or t <400> 29 atgatgccna arcaytgytt yytnggntty ytnathwsnt tyttyytnac nggngtngcn 60 ggnacncarw snacncayga rwsnytnaar ccncarmgng tncarttyca rwsnmgnaay 120 ttycayaaya thytncartg gcarccnggn mgngcnytna cnggnaayws nwsngtntay 180 ttygtncart ayaarathta yggncarmgn cartggaara ayaargarga ytgytggggn 240 acncargary tnwsntgyga yytnacnwsn garacnwsng ayathcarga rccntaytay 300 ggnmgngtnm gngcngcnws ngcnggnwsn taywsngart ggwsnatgac nccnmgntty 360 acnccntggt gggaracnaa rathgayccn ccngtnatga ayathacnca rgtnaayggn 420 wsnytnytng tnathytnca ygcnccnaay ytnccntaym gntaycaraa rgaraaraay 480 gtnwsnathg argaytayta ygarytnytn taymgngtnt tyathathaa yaaywsnytn 540 garaargarc araargtnta ygarggngcn caymgngcng tngarathga rgcnytnacn 600 ccncaywsnw sntaytgygt ngtngcngar athtaycarc cnatgytnga ymgnmgnwsn 660 carmgnwsng argarmgntg ygtngarath ccn 693 <210> 30 <211> 526 <212> DNA
<213> primate; surmised Homo Sapiens <220>
<221> CDS
<222> (1)..(390) <220>
<221> mat~eptide <222> (64)..(390) <400> 30 atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttt ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa tct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Ser Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aga gca aaa 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Arg Ala Lys ggt tta tgaaggggct cacagagcgg ttgaaattga agctctaaca ccacactcca 440 Gly Leu gctactgtgt agtggctgaa atatatcagc ccacgttaga cagaagaagt cagagaagtg 500 aagagagatg tgtggaaatt ccatga 526 <210> 31 <211> 130 <212> PRT
<213> primate; surmised Homo Sapiens <400> 31 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln Glu Ser Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Arg Ala Lys Gly Leu <210> 32 <211> 390 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (1). (390) <223> n may be a, c, g,or t <400> 32 atgatgccna arcaytgytt yytnggntty ytnathwsnt tyttyytnac nggngtngcn 60 ggnacncarw snacncayga rwsnytnaar ccncarmgng tncarttyca rwsnmgnaay 120 ttycayaaya thytncartg gcarccnggn mgngcnytna cnggnaayws nwsngtntay 180 ttygtncart ayaarathta yggncarmgn cartggaara ayaargarga ytgytggggn 240 acncargary tnwsntgyga yytnacnwsn garacnwsng ayathcarga rwsntaytay 300 ggnmgngtnm gngcngcnws ngcnggnwsn taywsngart ggwsnatgac nccnmgntty 360 acnccntggt gggarmgngc naarggnytn 390
TNTTYCAYTTYATGGCNGAYGAYATHTTYWSNGTNAAYATHACNGAYCARWSNGGNAAYTAYWSNCARGARTGYGGN
WSNTTYYTNYTNGCNGARWSNMGNCARTAYAAYATHWSNTGGMGNWSNGAYTAYGARGAYCCNGCNTTYTAYATGYT
NAARGGNAARYTNCARTAYGARYTNCARTAYMGNAAYMGNGGNGAYCCNTGGGCNGTNWSNCCNMGNMGNAARYTNA
THWSNGTNGAYWSNMGNWSNGTNWSNYTNYTNCCNYTNGARTTYMGNAARGAYWSNWSNTAYGARYTNCARGTNMGN
GCNGGNCCNATGCCNGGNWSNWSNTAYCARGGNACNTGGWSNGARTGGWSNGAYCCNGTNATHTTYCARACNCARWS
NGARGARYTNAARGARGGNTGGAAYCCNCAYYTNYTNYTNYTNYTNYTNYTNGTNATHGTNTTYATHCCNGCNTTYT
GGWSNYTNAARACNCAYCCNYTNTGGMGNYTNTGGAARAARATHTGGGCNGTNCCNWSNCCNGARMGNTTYTTYATG
CCNYTNTAYAARGGNTGYWSNGGNGAYTTYAARAARTGGGTNGGNGCNCCNTTYACNGGNWSNWSNYTNGARYTNGG
NCCNTGGWSNCCNGARGTNCCNWSNACNYTNGARGTNTAYWSNTGYCAYCCNCCNMGNWSNCCNGCNAARMGNYTNC
ARYTNACNGARYTNCARGARCCNGCNGARYTNGTNGARWSNGAYGGNGTNCCNAARCCNWSNTTYTGGCCNACNGCN
CARAAYWSNGGNGGNWSNGCNTAYWSNGARGARMGNGAYMGNCCNTAYGGNYTNGTNWSNATHGAYACNGTNACNGT
NYTNGAYGCNGARGGNCCNTGYACNTGGCCNTGYWSNTGYGARGAYGAYGGNTAYCCNGCNYTNGAYYTNGAYGCNG
GNYTNGARCCNWSNCCNGGNYTNGARGAYCCNYTNYTNGAYGCNGGNACNACNGTNYTNWSNTGYGGNTGYGTNWSN
15' GCNGGNWSNCCNGGNYTNGGNGGNCCNYTNGGNWSNYTNYTNGAYMGNYTNAARCCNCCNYTNGCNGAYGGNGARGA
YTGGGCNGGNGGNYTNCCNTGGGGNGGNMGNWSNCCNGGNGGNGTNWSNGARWSNGARGCNGGNWSNCCNYTNGCNG
GNYTNGAYATGGAYACNTTYGAYWSNGGNTTYGTNGGNWSNGAYTGYWSNWSNCCNGTNGARTGYGAYTTYACNWSN
CCNGGNGAYGARGGNCCNCCNMGNWSNTAYYTNMGNCARTGGGTNGTNATHCCNCCNCCNYTNWSNWSNCCNGGNCC
NCARGCNWSN
Reverse Translation of primate, e.g., human, DCRS3.2 (SEQ ID NO: 26). N may be A, C, G, or T.
ATGCCNMGNGGNTGGGCNGCNCCNYTNYTNYTNYTNYTNYTNCARGGNGGNTGGGGNTGYCCNGAYYTNGTNTGYTA
YACNGAYTAYYTNCARACNGTNATHTGYATHYTNGARATGTGGAAYYTNCAYCCNWSNACNYTNACNYTNACNTGGC
ARGAYCARTAYGARGARYTNAARGAYGARGCNACNWSNTGYWSNYTNCAYMGNWSNGCNCAYAAYGCNACNCAYGCN
ACNTAYACNTGYCAYATGGAYGTNTTYCAYTTYATGGCNGAYGAYATHTTYWSNGTNAAYATHACNGAYCARWSNGG
NAAYTAYWSNCARNNNTGYGGNWSNTTYYTNYTNGCNGARWSNATHAARCCNGCNCCNCCNTTYAAYGTNACNGTNA
CNTTYWSNGGNCARTAYAAYNNNWSNTGGMGNWSNGAYTAYGARGAYCCNGCNTTYTAYATGYTNAARGGNAARYTN
CARTAYGARYTNCARTAYMGNAAYMGNGGNGAYCCNTGGGCNGTNWSNCCNMGNMGNAARYTNATHWSNGTNGAYWS
NMGNWSNGTNWSNYTNYTNCCNYTNGARTTYMGNAARGAYWSNWSNTAYGARYTNNNNGTNMGNGCNGGNCCNATGC
CNGGNWSNWSNTAYCARGGNACNTGGWSNGARTGGWSNGAYCCNGTNATHTGYCARACNCARWSNGARGARYTNAAR
GARGGNTGGAAYCCNCAYYTNYTNYTNYTNYTNYTNYTNGTNATHGTNTTYATHCCNGCNTTYTGGWSNYTNAARAC
NCAYCCNYTNTGGMGNYTNTGGAAR.AARATHTGGGCNGTNCCNWSNCCNGARMGNTTYTTYATGCCNYTNTAYAARG
GNTGYWSNGGNGAYTTYAARAARTGGGTNGGNGCNCCNTTYACNGGNWSNWSNYTNGARYTNGGNCCNTGGWSNCCN
GARGTNCCNWSNACNYTNGARGTNTAYWSNTGYCAYCCNCCNMGNWSNCCNGCNAARMGNYTNCARYTNACNGARYT
NCARGARCCNGCNGARYTNGTNGARWSNGAYGGNGTNCCNAARCCNWSNTTYTGGCCNACNGCNCARAAYWSNGGNG
GNWSNGCNTAYWSNGARGARMGNGAYMGNCCNTAYGGNYTNGTNWSNATHGAYACNGTNACNGTNYTNGAYGCNGAR
GGNCCNTGYACNTGGCCNTGYWSNTGYGARGAYGAYGGNTAYCCNGCNYTNGAYYTNGAYGCNGGNYTNGARCCNWS
NCCNGGNYTNGARGAYCCNYTNYTNGAYGCNGGNACNACNGTNYTNWSNTGYGGNTGYGTNWSNGCNGGNWSNCCNG
GNYTNGGNGGNCCNYTNGGNWSNYTNYTNGAYMGNYTNAARCCNCCNYTNGCNGAYGGNGARGAYTGGGCNGGNGGN
YTNCCNTGGGGNGGNMGNWSNCCNGGNGGNGTNWSNGARWSNGARGCNGGNWSNCCNYTNGCNGGNYTNGAYATGGA
YACNTTYGAYWSNGGNTTYGTNGGNWSNGAYTGYWSNWSNCCNGTNGARTGYGAYTTYACNWSNCCNGGNGAYGARG
Nucleic acid sequence comparison of two DCRS3 embodiments:
DCRS3.2 1 ATGCCGCGTGGCTGGGCCGCCCCCTTGCTCCTGCTGCTGCTCCAGGGAGC 50 50 DCRS3.1 1 ATGCCGCGTGGCTGGGCCGCCCCCTTGCTCCTGCTGCTGCTCCAGGGA-- 49 ************************************************
DCRS3.2 51 CCTCGAGGGGATGGAGAGGAAGCTCTGCAGTCCCAAGCCACCCCCCACCA 100 DCRS3.1 49 -_________________________________________________ 49 DCRS3.2 101 AGGCCTCTCTCCCCACTGACCCTCCAGGCTGGGGCTGCCCCGACCTCGTC 150 DCRS3.1 50 ---------------------------GCTGGGGCTGCCCCGACCTCGTC 72 ***********************
DCRS3.2 151 TGCTACACCGATTACCTCCAGACGGTCATCTGCATCCTGGAAATGTGGAA 200 DCRS3.1 73 TGCTACACCGATTACCTCCAGACGGTCATCTGCATCCTGGAAATGTGGAA 122 **************************************************
DCRS3.2 201 CCTCCACCCCAGCACGCTCACCCTTACCTGGATACTTTCTAATAATACTG 250 DCRS3.1 123 CCTCCACCCCAGCACGCTCACCCTTACCTGG------------------- 153 *******************************
DCRS3.2 251 GGTGCTATATCAAGGACAGAACACTGGACCTCAGGCAAGACCAGTATGAA 300 15' DCRS3.1 154 -----------------------------------C~GACCAGTATGAA 168 ***************
DCRS3.2 301 GAGCTGAAGGACGAGGCCACCTCCTGCAGCCTCCACAGGTCGGCCCACAA 350 DCRS3.1 169 GAGCTGAAGGACGAGGCCACCTCCTGCAGCCTCCACAGGTCGGCCCACAA 218 2 0 **************************************************
DCRS3.2 351 TGCCACGCATGCCACCTACACCTGCCACATGGATGTATTCCACTTCATGG 400 DCRS3.1 219 TGCCACGCATGCCACCTACACCTGCCACATGGATGTATTCCACTTCATGG 268 **************************************************
DCRS3.2 401 CCGACGACATTTTCAGTGTCAACATCACAGACCAGTCTGGCAACTACTCC 450 DCRS3.1 269 CCGACGACATTTTCAGTGTCAACATCACAGACCAGTCTGGCAACTACTCC 318 **************************************************
3 O DCRS3.2 451 CAGGAGTGTGGCAGCTTTCTCCTGGCTGAGAGCA---------------- 484 DCRS3.1 319 CAGGANTGTGGCAGCTTTCTCCTGGCTGAGAGCATCAAGCCGGCTCCCCC 368 ***** ****************************
DCRS3.2 485 --------------------------GACAGTATAATATCTCCTGGCGCT 508 DCRS3.1 369 TTTCAACGTGACTGTGACCTTCTCAGGACAGTATAATATNTCCTGGCGCT 418 ************* **********
DCRS3.2 509 CAGATTACGAAGACCCTGCCTTCTACATGCTGAAGGGCAAGCTTCAGTAT 558 DCRS3.1 419 CAGATTACGAAGACCCTGCCTTCTACATGCTGAAAGGCAAGCTTCAATAT 468 4 0 ********************************** *********** ***
DCRS3.2 559 GAGCTGCAGTACAGGAACCGGGGAGACCCCTGGGCTGTGAGTCCGAGGAG 608 DCRS3.1 469 GAGCTGCAGTACAGGAACCGGGGAGACCCCTGGGCTGTGAGTCCGAGGAG 518 **************************************************
DCRS3.2 609 AAAGCTGATCTCAGTGGACTCAAGAAGTGTCTCCCTCCTCCCCCTGGAGT 658 DCRS3.1 519 AAAGCTGATCTCAGTGGACTCAAGAAGTGTCTCCCTCCTCCCCCTGGAGT 568 **************************************************
DCRS3.2 659 TCCGCAAAGACTCGAGCTATGAGCTGCAGGTGCGGGCAGGGCCCATGCCT 708 DCRS3.1 569 TCCGCAAAGACTCGAGCTATGAGCTGCANGTGCGGGCAGGGCCCATGCCT 618 **************************** *********************
DCRS3.2 709 GGCTCCTCCTACCAGGGGACCTGGAGTGAATGGAGTGACCCGGTCATCTT 758 DCRS3.1 619 GGCTCCTCCTACCAGGGGACCTGGAGTGAATGGAGTGACCCGGTCATCTG 668 *************************************************
DCRS3.2 759 TCAGACCCAGTCAGAGGAGTTAAAGGAAGGCTGGAACCCTCACCTGCTGC 808 DCRS3.1 669 TCAGACCCAGTCAGAGGAGTTAAAGGAAGGCTGGAACCCTCACCTGCTGC 718 **************************************************
DCRS3.2 809 TTCTCCTCCTGCTTGTCATAGTCTTCATTCCTGCCTTCTGGAGCCTGAAG 858 DCRS3.1 719 TTCTCCTCCTGCTTGTCATAGTCTTCATTCCTGCCTTCTGGAGCCTGAAG 768 **************************************************
10 DCRS3.2 859 ACCCATCCATTGTGGAGGCTATGGAAGAAGATATGGGCCGTCCCCAGCCC 908 DCRS3.1 769 ACCCATCCATTGTGGAGGCTATGGAAGAAGATATGGGCCGTCCCCAGCCC 818 **************************************************
DCRS3.2 909 TGAGCGGTTCTTCATGCCCCTGTACAAGGGCTGCAGCGGAGACTTCAAGA 958 15' DCRS3.1 819 TGAGCGGTTCTTCATGCCCCTGTACAAGGGCTGCAGCGGAGACTTCAAGA 868 **************************************************
DCRS3.2 959 AATGGGTGGGTGCACCCTTCACTGGCTCCAGCCTGGAGCTGGGACCCTGG 1008 DCRS3.1 869 AATGGGTGGGTGCACCCTTCACTGGCTCCAGCCTGGAGCTGGGACCCTGG 918 2 0 **************************************************
DCRS3.2 1009 AGCCCAGAGGTGCCCTCCACCCTGGAGGTGTACAGCTGCCACCCACCACG 1058 DCRS3.1 919 AGCCCAGAGGTGCCCTCCACCCTGGAGGTGTACAGCTGCCACCCACCACG 968 **************************************************
DCRS3.2 1059 GAGCCCGGCCAAGAGGCTGCAGCTCACGGAGCTACAAGAACCAGCAGAGC 1108 DCRS3.1 969 GAGCCCGGCCAAGAGGCTGCAGCTCACGGAGCTACAAGAACCAGCAGAGC 1018 **************************************************
3 0 DCRS3.2 1109 TGGTGGAGTCTGACGGTGTGCCCAAGCCCAGCTTCTGGCCGACAGCCCAG 1158 DCRS3.1 1019 TGGTGGAGTCTGACGGTGTGCCCAAGCCCAGCTTCTGGCCGACAGCCCAG 1068 **************************************************
DCRS3.2 1159 AACTCGGGGGGCTCAGCTTACAGTGAGGAGAGGGATCGGCCATACGGCCT 1208 3 5 DCRS3.1 1069 AACTCGGGGGGCTCAGCTTACAGTGAGGAGAGGGATCGGCCATACGGCCT 1118 **************************************************
DCRS3.2 1209 GGTGTCCATTGACACAGTGACTGTGCTAGATGCAGAGGGGCCATGCACCT 1258 DCRS3.1 1119 GGTGTCCATTGACACAGTGACTGTGCTAGATGCAGAGGGGCCATGCACCT 1168 4 0 **************************************************
DCRS3.2 1259 GGCCCTGCAGCTGTGAGGATGACGGCTACCCAGCCCTGGACCTGGATGCT 1308 DCRS3.1 1169 GGCCCTGCAGCTGTGAGGATGACGGCTACCCAGCCCTGGACCTGGATGCT 1218 **************************************************
DCRS3.2 1309 GGCCTGGAGCCCAGCCCAGGCCTAGAGGACCCACTCTTGGATGCAGGGAC 1358 DCRS3.1 1219 GGCCTGGAGCCCAGCCCAGGCCTAGAGGACCCACTCTTGGATGCAGGGAC 1268 **************************************************
5 O DCRS3.2 1359 CACAGTCCTGTCCTGTGGCTGTGTCTCAGCTGGCAGCCCTGGGCTAGGAG 1408 DCRS3.1 1269 CACAGTCCTGTCCTGTGGCTGTGTCTCAGCTGGCAGCCCTGGGCTAGGAG 1318 **************************************************
DCRS3.2 1409 GGCCCCTGGGAAGCCTCCTGGACAGACTAAAGCCACCCCTTGCAGATGGG 1458 5 5 DCRS3.1 1319 GGCCCCTGGGAAGCCTCCTGGACAGACTAAAGCCACCCCTTGCAGATGGG 1368 **************************************************
DCRS3.2 1459 GAGGACTGGGCTGGGGGACTGCCCTGGGGTGGCCGGTCACCTGGAGGGGT 1508 DCRS3.1 1369 GAGGACTGGGCTGGGGGACTGCCCTGGGGTGGCCGGTCACCTGGAGGGGT 1418 **************************************************
DCRS3.2 1509 CTCAGAGAGTGAGGCGGGCTCACCCCTGGCCGGCCTGGATATGGACACGT 1558 DCRS3.1 1419 CTCAGAGAGTGAGGCGGGCTCACCCCTGGCCGGCCTGGATATGGACACGT 1468 **************************************************
DCRS3.2 1559 TTGACAGTGGCTTTGTGGGCTCTGACTGCAGCAGCCCTGTGGAGTGTGAC 1608 DCRS3.1 1469 TTGACAGTGGCTTTGTGGGCTCTGACTGCAGCAGCCCTGTGGAGTGTGAC 1518 **************************************************
DCRS3.2 1609 TTCACCAGCCCCGGGGACGAAGGACCCCCCCGGAGCTACCTCCGCCAGTG 1658 15' DCRS3.1 1519 TTCACCAGCCCCGGGGACGAAGGACCCCCCCGGAGCTACCTCCGCCAGTG 1568 **************************************************
DCRS3.2 1659 GGTGGTCATTCCTCCGCCACTTTCGAGCCCTGGACCCCAGGCCAGCTAA 1707 DCRS3.1 1569 GGTGGTCATTCCTCCGCCACTTTCGAGCCCTGGACCCCAGGCCAGCTAA 1617 2 0 *************************************************
Table 3: Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiment 4 (DCRS4.1; cytor). Primate, e.g., human embodiment (see SEQ ID N0: 4 and 5). Predicted signal sequence indicated, but may vary 2 5 by a few positions and depending upon cell type.
atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln 3 5 agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 4 0 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa cct tat tac ggg agg agg ggc aaa aat aaa aat aaa ggg aat cct 336 Glu Pro Tyr Tyr Gly Arg Arg Gly Lys Asn Lys Asn Lys Gly Asn Pro 55 tgg ggg cca aaa caa agt aaa cgg aaa tca aag ggg aac cag aag acc 384 Trp Gly Pro Lys Gln Ser Lys Arg Lys Ser Lys Gly Asn Gln Lys Thr aac aca gtg act gcc cca get gcc ctg aag gca ttt get gga tgt gca 432 Asn Thr Val Thr Ala Pro Ala Ala Leu Lys Ala Phe Ala Gly Cys Ala aaa ata gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg 480 Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu ttg gta att ctc cat get cca aat tta cca tat aga tac caa aag gaa 528 Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu aaa aat gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt 576 15' Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe ata att aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get 624 Ile Ile Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala cac aga gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt 672 His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys gta gtg get gaa ata tat cag ccc atg tta gac aga aga agt cag aga 720 Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg 3 0 agt gaa gag aga tgt gtg gaa att cca tga 750 Ser Glu Glu Arg Cys Val Glu Ile Pro MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPGRALTGNSSVYFVQYKIYGQRQWKNKED
CWGTQELSCDLTSETSDIQEPYYGRRGKNKNKGNPWGPKQSKRKSKGNQKTNTVTAPAALKAFAGCAKIDPPVMNIT
QVNGSLLVILHAPNLPYRYQKEKNVSIEDYYELLYRVFIINNSLEKEQKVYEGAHRAVEIEALTPHSSYCWAEIYQ
PMLDRRSQRSEERCVEIP.
Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like 4 0 embodiments (DCRS4.2, cytorX700; SEQ ID NO: 27 and 28):
ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTCTTCCTTACTGGTGTAGCAGGAACTCAGTCAACGCA
TGAGTCTCTGAAGCCTCAGAGGGTACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCCGGGAGGG
CACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATATGGACAGAGACAATGGAAAAATAAAGAAGAC
TGTTGGGGTACTCAAGAACTCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAACCTTATTACGGGAGGGT
GAGGGCGGCCTCGGCTGGGAGCTACTCAGAATGGAGCATGACGCCGCGGTTCACTCCCTGGTGGGAAACAAAAATAG
ATCCTCCAGTCATGAATATAACCCAAGTCAATGGCTCTTTGTTGGTAATTCTCCATGCTCCAAATTTACCATATAGA
TACCAAAAGGAAAAA.AATGTATCTATAGAAGATTACTATGAACTACTATACCGAGTTTTTATAATTAACAATTCACT
AGAAAAGGAGCAAAAGGTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCTCTAACACCACACTCCAGCTACT
GTGTAGTGGCTGAAATATATCAGCCCATGTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCA
TGA
atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu 55 -20 -15 -to act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln ccc ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln 15' 60 65 70 75 gaa cct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Pro Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aca aaa ata 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Thr Lys Ile 2 5 gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg ttg gta 432 Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val att ctc cat get cca aat tta cca tat aga tac caa aag gaa aaa aat 480 3 0 Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt ata att 528 Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get cac aga 576 Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt gta gtg 624 Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val get gaa ata tat cag ccc atg tta gac aga aga agt cag aga agt gaa 672 Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu gag aga tgt gtg gaa att cca tga 696 Glu Arg Cys Val Glu Ile Pro >cytorX700 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPGRALTGNSSVYFVQYKIYGQRQWKNKED
CWGTQELSCDLTSETSDIQEPYYGRVRAASAGSYSEWSMTPRFTPWWETKIDPPVMNITQVNGSLLVILHAPNLPYR
YQKEKNVSIEDYYELLYRVFIINNSLEKEQKVYEGAHRAVEIEALTPHSSYCWAEIYQPMLDRRSQRSEERCVEIP
Nucleotide and polypeptide sequences of DNAX Cytokine Receptor Subunit like embodiments (DCRS4.3 cytorX600; SEQ ID NO: 30 and 31):
ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTTTTCCTTACTGGTGTAGCAGGAACTCAGTCAACGCA
TGAGTCTCTGAAGCCTCAGAGGGTACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCTGGGAGGG
CACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATATGGACAGAGACAATGGAAAAATAAAGAAGAC
TGTTGGGGTACTCAAGAACTCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAATCTTATTACGGGAGGGT
GAGGGCGGCCTCGGCTGGGAGCTACTCAGAATGGAGCATGACGCCGCGGTTCACTCCCTGGTGGGAAAGAGCAAAAG
GTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCTCTAACACCACACTCCAGCTACTGTGTAGTGGCTGAAAT
ATATCAGCCCACGTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCATGA
atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttt ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu 15' act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln 2 0 agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 2 5 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa tct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Ser Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser 4 0 gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aga gca aaa 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Arg Ala Lys ggt tta tgaaggggct cacagagcgg ttgaaattga agctctaaca ccacactcca 440 4 5 Gly Leu gctactgtgt agtggctgaa atatatcagc ccacgttaga cagaagaagt cagagaagtg 500 aagagagatg tgtggaaatt ccatga 526 >cytorX600 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPGRALTGNSSVYFVQYKIYGQRQWKNKED
CWGTQELSCDLTSETSDIQESYYGRVRAASAGSYSEWSMTPRFTPWWERAKGL.
5 5 Polypeptide sequence comparison of DCRS4.1, DCRS4.2 and DCRS4.3:
DCRS4.1 1 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPG 50 DCRS4.2 1 MMPKHCFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPG 50 DCRS4.3 1 MMPKIICFLGFLISFFLTGVAGTQSTHESLKPQRVQFQSRNFHNILQWQPG 50 5 **************************************************
DCRS4.1 51 RALTGNSSWFVQYKIYGQRQWKNKEDCWGTQELSCDLTSETSDIQEPYY 100 DCRS4.2 51 RALTGNSSWFVQYKIYGQRQWKNKEDCWGTQELSCDLTSETSDIQEPYY 100 DCRS4.3 51 RALTGNSSWFVQYKIYGQRQWKNKEDCWGTQELSCDLTSETSDIQESYY 100 10 *********************************************** **
DCRS4.1 101 GR-RGKNKNKGNPWGPKQSKRKSKGNQKTNTVTAPAALKAFAGCAKIDPP 149 DCRS4.2 101 GRVRAASAGSYSEWS--MTPRFTP-----------------WWETKIDPP 131 DCRS4.3 101 GRVRAASAGSYSEWS--MTPRFTP-----------------WWE------ 119 15' ** * _ * . * .
DCRS4.1 150 VMNITQVNGSLLVILHAPNLPYRYQKEKNVSIEDYYELLYRVFIINNSLE 199 DCRS4.2 132 VMNITQVNGSLLVILHAPNLPYRYQKEKNVSIEDYYELLYRVFIINNSLE 181 DCRS4.3 123 -------------------------RAKGL 130 20 .. *..
DCRS4.1 200 KEQKWEGAHRAVEIEALTPHSSYCWAEIYQPMLDRRSQRSEERCVEIP 249 DCRS4.2 182 KEQKVYEGAHRAVEIEALTPHSSYCWAEIYQPMLDRRSQRSEERCVEIP 231 DCRS4.3 131 130 Table 4: Reverse Translation of primate, e.g., human, DCRS4.1 (SEQ ID N0: 6).
N may be A, C, G, or T.
ATGATGCCNAARCAYTGYTTYYTNGGNTTYYTNATHWSNTTYTTYYTNACNGGNGTNGCNGGNACNCARWSNACNCA
YGARWSNYTNAARCCNCARMGNGTNCARTTYCARWSNMGNAAYTTYCAYAAYATHYTNCARTGGCARCCNGGNMGNG
CNYTNACNGGNAAYWSNWSNGTNTAYTTYGTNCARTAYAARATHTAYGGNCARMGNCARTGGAARAAYAARGARGAY
TGYTGGGGNACNCARGARYTNWSNTGYGAYYTNACNWSNGARACNWSNGAYATHCARGARCCNTAYTAYGGNMGNMG
NGGNAARAAYAARAAYAARGGNAAYCCNTGGGGNCCNAARCARWSNAARMGNAARWSNAARGGNAAYCARAAR.ACNA
AYACNGTNACNGCNCCNGCNGCNYTNAARGCNTTYGCNGGNTGYGCNAARATHGAYCCNCCNGTNATGAAYATHACN
CARGTNAAYGGNWSNYTNYTNGTNATHYTNCAYGCNCCNAAYYTNCCNTAYMGNTAYCARAARGARAARAAYGTNWS
NATHGARGAYTAYTAYGARYTNYTNTAYMGNGTNTTYATHATHAAYAAYWSNYTNGARAARGARCARAARGTNTAYG
ARGGNGCNCAYMGNGCNGTNGARATHGARGCNYTNACNCCNCAYWSNWSNTAYTGYGTNGTNGCNGARATHTAYCAR
CCNATGYTNGAYMGNMGNWSNCARMGNWSNGARGARMGNTGYGTNGARATHCCN
Reverse Translation of primate, e.g., human, DCRS4.2 (SEQ ID NO: 29).
N may be A, C, G, or T.
ATGATGCCNAARCAYTGYTTYYTNGGNTTYYTNATHWSNTTYTTYYTNACNGGNGTNGCNGGNACNCARWSNACNCA
YGARWSNYTNAARCCNCARMGNGTNCARTTYCARWSNMGNAAYTTYCAYAAYATHYTNCARTGGCARCCNGGNMGNG
CNYTNACNGGNAAYWSNWSNGTNTAYTTYGTNCARTAYAARATHTAYGGNCARMGNCARTGGAARAAYAARGARGAY
TGYTGGGGNACNCARGARYTNWSNTGYGAYYTNACNWSNGARACNWSNGAYATHCARGARCCNTAYTAYGGNMGNGT
NMGNGCNGCNWSNGCNGGNWSNTAYWSNGARTGGWSNATGACNCCNMGNTTYACNCCNTGGTGGGARACNAARATHG
AYCCNCCNGTNATGAAYATHACNCARGTNAAYGGNWSNYTNYTNGTNATHYTNCAYGCNCCNAAYYTNCCNTAYMGN
TAYCARAARGA~2AARAAYGTNWSNATHGARGAYTAYTAYGARYTNYTNTAYMGNGTNTTYATHATHAAYAAYWSNYT
NGARAARGARCARAARGTNTAYGARGGNGCNCAYMGNGCNGTNGARATHGARGCNYTNACNCCNCAYWSNWSNTAYT
GYGTNGTNGCNGARATHTAYCARCCNATGYTNGAYMGNMGNWSNCARMGNWSNGARGARMGNTGYGTNGARATHCCN
55 Reverse Translation of primate, e.g., human, DCRS4.3 (SEQ ID N0: 32). N may be A, C, G, or T.
ATGATGCCNAARCAYTGYTTYYTNGGNTTYYTNATHWSNTTYTTYYTNACNGGNGTNGCNGGNACNCARWSNACNCA
YGARWSNYTNAARCCNCARMGNGTNCARTTYCARWSNMGNAAYTTYCAYAAYATHYTNCARTGGCARCCNGGNMGNG
CNYTNACNGGNAAYWSNWSNGTNTAYTTYGTNCARTAYAARATHTAYGGNCARMGNCARTGGAARAAYAARGARGAY
TGYTGGGGNACNCARGARYTNWSNTGYGAYYTNACNWSNGARACNWSNGAYATHCARGARWSNTAYTAYGGNMGNGT
NMGNGCNGCNWSNGCNGGNWSNTAYWSNGARTGGWSNATGACNCCNMGNTTYACNCCNTGGTGGGARMGNGCNAARG
GNYTN
Nucleic acid sequence comparison of three DCRS4 embodiments:
DCRS4.1 1 ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTCTTCCTTAC 50 DCRS4.2 1 ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTCTTCCTTAC 50 DCRS4.3 1 ATGATGCCTAAACATTGCTTTCTAGGCTTCCTCATCAGTTTTTTCCTTAC 50 ***************************************** ********
DCRS4.1 51 TGGTGTAGCAGGAACTCAGTCAACGCATGAGTCTCTGAAGCCTCAGAGGG 100 DCRS4.2 51 TGGTGTAGCAGGAACTCAGTCAACGCATGAGTCTCTGAAGCCTCAGAGGG 100 DCRS4.3 51 TGGTGTAGCAGGAACTCAGTCAACGCATGAGTCTCTGAAGCCTCAGAGGG 100 15' **************************************************
DCRS4.1 101 TACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCTGGG 150 DCRS4.2 101 TACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCCGGG 150 DCRS4.3 101 TACAATTTCAGTCCCGAAATTTTCACAACATTTTGCAATGGCAGCCTGGG 150 2 0 ********************************************** ***
DCRS4.1 151 AGGGCACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATA 200 DCRS4.2 151 AGGGCACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATA 200 DCRS4.3 151 AGGGCACTTACTGGCAACAGCAGTGTCTATTTTGTGCAGTACAAAATATA 200 2 5 **************************************************
DCRS4.1 201 TGGACAGAGACAATGGAAAAATAAAGAAGACTGTTGGGGTACTCAAGAAC 250 DCRS4.2 201 TGGACAGAGACAATGGAAAAATAAAGAAGACTGTTGGGGTACTCAAGAAC 250 DCRS4.3 201 TGGACAGAGACAATGGAAAAATAAAGAAGACTGTTGGGGTACTCAAGAAC 250 3 0 **************************************************
DCRS4.1 251 TCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAACCTTATTAC 300 DCRS4.2 251 TCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAACCTTATTAC 300 DCRS4.3 251 TCTCTTGTGACCTTACCAGTGAAACCTCAGACATACAGGAATCTTATTAC 300 3 5 ***************************************** ********
DCRS4.1 301 GGGAGGAGGGGCAAAAATAAAAATAAAGGGAATCCTTGGGGGCCAAAACA 350 DCRS4.2 301 GGGAGGGTG-----------------AGGGCGGCCTCGGC---------- 323 DCRS4.3 301 GGGAGGGTG-----------------AGGGCGGCCTCGGC---------- 323 4 0 ****** * **** *** **
DCRS4.1 351 AAGTAAACGGAAATCAAAGGGGAACCAGAAGACCAACACAGTGACTGCCC 400 DCRS4.2 324 ---TGGGAGCTACTCAGAATGGAGCATGA-------CGCCGCGGTTCACT 363 DCRS4.3 324 ---TGGGAGCTACTCAGAATGGAGCATGA-------CGCCGCGGTTCACT 363 45 * * * *** * *** * **
DCRS4.1 401 CAGCTGCCCTGAAGGCATTTGCTGGATGTGCAAAAATAGATCCTCCAGTC 450 DCRS4.2 364 C------CCTGGTGGGAA-----------ACAAAAATAGATCCTCCAGTC 396 DCRS4.3 364 C------CCTGGTGGGAAAGAGCAAAAGGTTTATGAAGGGGCTCACAGA- 406 50 * **** ** * * * * * ***
DCRS4.1 451 ATGAATATAACCCAAGTC--AATGGCTCTTTGTTGGTAATTCTCCATGCT 498 DCRS4.2 397 ATGAATATAACCCAAGTC--AATGGCTCTTTGTTGGTAATTCTCCATGCT 444 DCRS4.3 407 GCGGTTGAAATTGAAGCTCTAACACCACACTCCAGCTACTGTGTAGTGGC 456 55 * * ** *** ** * * * * ** * **
DCRS4.1 499 CCAAATTTACCATATAGATACCAAAAGGAAAAAAATGTATCTATAGAAGA 548 DCRS4.2 445 CCAAATTTACCATATAGATACCAAAAGGAAAAAAATGTATCTATAGAAGA 494 DCRS4.3 457 TGAAATATATCA-GCCCACGTTAGACAGAAGAAGTCAGAGAAGT-GAAGA 504 **** ** ** * * * *** ** * * *****
DCRS4.1 549 TTACTATGAACTACTATACCGAGTTTTTATAATTAACAATTCACTAGAAA 598 DCRS4.2 495 TTACTATGAACTACTATACCGAGTTTTTATAATTAACAATTCACTAGAAA 544 DCRS4.3 505 GAGATGTGTGGAAATTCCATGA 526 * ** * * **
DCRS4.1 599 AGGAGCAAAAGGTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCT 648 DCRS4.2 545 AGGAGCAAAAGGTTTATGAAGGGGCTCACAGAGCGGTTGAAATTGAAGCT 594 DCRS4.3 527 526 15~
DCRS4.1 649 CTAACACCACACTCCAGCTACTGTGTAGTGGCTGAAATATATCAGCCCAT 698 DCRS4.2 595 CTAACACCACACTCCAGCTACTGTGTAGTGGCTGAAATATATCAGCCCAT 644 DCRS4.3 527 526 DCRS4.1 699 GTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCAT 748 DCRS4.2 645 GTTAGACAGAAGAAGTCAGAGAAGTGAAGAGAGATGTGTGGAAATTCCAT 694 DCRS4.3 527 526 DCRS4.1 749 GA 750 DCRS4.2 695 GA 696 DCRS4.3 527 526 Table 5: Alignment of various cytokine receptor subunits with DCRS3.1.
IL-2R is SEQ ID NO: 7; IL-9R is SEQ ID NO: 8; GM/IL-3/5 receptor b subunit common (ILRbc) is SEQ ID N0: 9; TPOR is SEQ ID NO: 10; and IL-7R
3 5 is SEQ ID NO: 11 (see GenBank).
DCRS3_HU LCS--PKPPPT----KASLPTDPPGWGC-PDLVCYTDYLQTVICILEMWN--LHP--STL
4 O ILRbc HU ILTPNGNEDTTADFFLTTMPTDSLSVST-LPLPEVQCFVFNVEYMNCTWNSSSEPQPTNL
TPOR_HU LLASDSEPLKC---FSRTFEDLTCFWDE-EEAAPSGTYQLLYAYPREKPR--ACP--LSS
HU
HU TLTWILSNNTGCYIKDR-----TLDLRQ-DQYE--ELKDEA-TSCSLHR-----SAHNAT
_ SSPWLLFTSNQAPG----G-THKCILR--GSECTWLPPE--AVLVPSD--------NFT
HU TLHYWYKNSDNDK-------VQKCSHY--------LFSEEITSGCQLQK-K---EIHLYQ
ILRbc _ QSMPHFGTRYVCQFPDQ--EEVRLFFPLHLWVKNVFLNQTRTQRVLFVDSVGLPAPPSII
TPOR
HU
_ TTNLEFEICGALV------EVKCLNFR--------KLQEIYFIETKKFL---------LI
HU
HU HATYTCHMDVFHF----MADDIFS--VNITDQSGNYSQECGSFLLAESRQYNIS---WRS
_ ITFHHCMSGREQVS---LVDPEYLPRRHVKLDPPSDLQS-----NISSGHCILT---WSI
HU
HU TFWQLQDPREPRR---QATQMLKLQNLVIPWAPENLTL----HKLSESQLELN---WNN
ILRbc _ KAMGGSQPGELQISWEEPAPEISDFLRYELRYGPRDPKNS---TGPTVIQLIATETCCPA
HU
TPOR
_ GKSNICVK-VGEKS---LTCKKIDLTTIVKPEAPFDLSVI---YREGANDFWT---FNT
HU DYEDPAFYMLKGKLQYELQYRNRGDPWAVSPRRKLIS-----VDSRSVSLLPLEFRKDSS
_ SPALEPMTTLLS---YELAFKKQEEAWEQAQHRDHIV-----G-VTWLILEAFELDPGFI
HU RFLNHCLEHLV-------QYRTD---WDHSWTEQSV------DYRHKFSLPSVDGQKRYT
ILRbc _ LQRPHSASALD-----QSPCAQPTMPWQDGPKQTSPSREASALTAEGGSCLISGLQPGNS
HU
TPOR
_ SHLQKKWKVLMHD-VAYRQEKDENKWTHVNLSST---------KLTLLQRK--LQPAAM
DCRS3_HU YELQVRAGPMPGSSYQG-------TWSEWSDPVIFQTQSEELK
ILRbc_HU FRVRSRFNPLCGSAQ---------HWSEWSHPIHWGSNTSKEN
TPOR_HU YWLQLRSEPDGISLGG--------SWGSWSLPVTVDLPGDAVA
15' IL-7R HU YEIKVRSIPDHYFKG---------FWSEWSPSYYFRTPEINNS
- . . * *, ** , Alignment of various cytokine receptor subunits with DCRS4.1. IL-lORb is the beta subunit of IL-lOR, human is SEQ ID N0: 12, mouse is SEQ ID NO: 13; INaRl 2 0 is the beta subunit of IFNa with human SEQ ID N0: 14 and mouse SEQ ID N0:
15;
INgR is interferon gamma receptor subunit alpha with human SEQ ID NO: 16 and mouse SEQ ID NO: 17; IL-lORa is the alpha receptor subunit with mouse SEQ ID
N0: 18 and human SEQ ID NO: 19; INgS (SEQ ID NO: 20) is the beta receptor subunit for INFg; Zcytor7 (SEQ ID N0: 21) and CYTOR11 (SEQ ID NO: 22) are 2 5 from patent filings from Zymogenetics, and INaR2 (SEQ ID NO: 23) is the beta subunit of the receptor for IFNa.
IL-lORb Hu PENVRMNSVNFKNILQWES-PAFAKGNL--TFTAQYLSY---------RIFQDKCMNTTL
Mu PEKVRMNSVNFKNILQWEV-PAFPKTNL--TFTAQYESY---------RSFQDHCKRTAS
3 O IL-lORb _ PQKVEVDIIDDNFILRWNR-SDESVGNV--TFSFDYQKTGMD-----NWIKLSGCQNITS
HU
INaRl _ PENIDWIIDDNYTLKWSS-HGESMGSV--TFSAEYRTKDEA-----KWLKVPECQHTTT
INaRl MU
_ PTNVTIESYNMNPIVYWEY-QIMPQVP---VFTVEVKNYGVK-----NSEWIDACINISH
INgR HU
INgR MU PTNVLIKSYNLNPWCWEY-QNMSQTP---IFTVQVKVY--------SGSWTDSCTNISD
Mu PSYVWFEARFFQHILHWKP-IPNQSEST--YYEVALKQYGNS-----TWNDIHICRKAQA
3 5 IL-lORa _ PPSVWFEAEFFHHILHWTP-IPNQSEST--CYEVALLRYGIE-----SWNSISNC--SQT
IL-lORa Hu INgS HU PLNPRLHLYNDEQILTWEP-SPSSNDPRPWYQVEYSFIDGSW----HRLLEPNCTDITE
Zcytor7 Hu PANITFLSINMKNVLQWTPPEGLQGVKV--TYTVQYFIYGQK-----KWLNKSECRNINR
HU LQHVKFQSSNFENILTWDS-GPEGTPDT--VYSIEYKTYGER-----DWVAKKGCQRITR
_ SCTFKISLRNFRSILSWEL-KNHSIVPTHYTLLYTIMSKPE------DLKWKNCANTTR
HU
4 0 INaR2 _ PQRVQFQSRNFHNILQWQPGRALTGNSS--VYFVQYKIYGQR-----QWKNKEDCWGTQE
Hu TECDFSSLSK------YGDHTLRVRAEFADEHSDWVNIT-FCPVDDTIIGPPG--MQVEV
iL-lORb _ TQCDFSHLSK------YGDYTVRVRAELADEHSEWVNVT-FCPVEDTIIGPPE--MQIES
Mu 4 5 IL-lORb _ TKCNFSSLKLN----VYEEIKLRIRAEKEN-TSSWYEVDSFTPFRKAQIGPPE--VHLEA
HU
INaRl _ TKCEFSLLDTN----VYIKTQFRVRAEEGNSTSSWNEVDPFIPFYTAHMSPPE--VRLEA
INaRl MU
_ HYCNISDHVGDP----SNSLWVRVKARVGQKESAYAKSEEFAVCRDGKIGPPKLDIR-KE
INgR HU
INgR MU HCCNIYGQIMYP----DVSAWARVKAKVGQKESDYARSKEFLMCLKGKVGPPGLEIRRKK
Mu LSCDLTTFTLDLYHR-SYGYRARVRAVDNSQYSNWTTTETRFTVDEVILTVDS--VTLKA
5 O IL-lORa _ LSYDLTAVTLDLYH--SNGYRARVRAVDGSRHSNWTVTNTRFSVDEVTLTVGS--VNLEI
Hu IL-lORa _ TKCDLTGGGRLKLFPHPFTVFLRVRAKRGNLTSKWVGLEPFQHYENVTVGPPKN-ISVTP
INgS HU
Zcytor7 Hu TYCDLSAETSDY----EHQYYAKVKAIWGTKCSKWAESGRFYPFLETQIGPPE--VALTT
HU KSCNLTVETGN----LTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDV-TCISK
_ SFCDLTDEWRS-----THEAYVTVLEGFSGNTTLFSCSHNFWLAIDMSFEPPE--FEIVG
HU
55 INaR2 _ LSCDLTSETSD-----IQEPYYGRRGKNKNKGNPWGPKQSKRKSKGNQKTNTVT-APAAL
Hu LADSLHMRFLAPKIENEYE---TWTMKNVYNSWTYNVQYWKNGTDEKFQ-ITPQYDFEVL
IL-lORb _ LAESLHLRFSAPQIENEPE---TWTLKNIYDSWAYRVQYWKNGTNEKFQ-WSPYDSEVL
Mu IL-lORb _ EDKAIVIHISPGTKDSV-----MWALD--GLSFTYSLLIWKNSSGVEER-IENIYSRHKI
HU
INaRl _ EDKAILVHISPPGQDGN-----MWALE--KPSFSYTIRIWQKSSSDKKT-INSTYYVEKI
MU
INaRl _ EKQIMIDIFHPSVFVNGDEQEVDYDPETTCYIRWNVYVRMNGS-EIQY-KILTQKEDDC
INgR HU
INgR MU EEQLSVLVFHPEVWNGESQGTMFGDGSTCYTFDYTVWEHNRSGEILH-TKHTVEKEEC
Mu MDGIIYGTIHPPRPTITPA--GDEYEQVFKDLRWKISIRKFS--ELKN-ATKRVKQETF
IL-lORa _ HNGFILGKIQLPRPKMAPA--NDTYESIFSHFREYEIAIRKVPG-NFTF-THKKVKHENF
IL-lORa Hu INgS HU GKGSLVIHFSPPFDVFHG------------ATFQYLVHYWEKSETQQEQ-VEGPFKSNSI
Zcytor7 Hu DEKSISWLTAPEKWKRNPEDLPVSMQQIYSNLKYNVSVLNTKSNRTWS-QCVTNHTLVL
HU VRSIQMIVHPTPTPIRAGDG-HRLTLEDIFHDLFYHLELQVNRTYQMHL-GGKQREYEFF
_ FTNHINVMVKFPSIVEEEL-------Q---FDLSLVIEEQSEGIVKKHKPEIKGNMSGNF
HU
INaR2 _ KAFAGCAKIDPPVMNITQ------------VNGSLLVILHAPNLPYRYQ-KEKNVSIEDY
15' IL-lORb_Hu RN-----------------LEPWTTYCVQVRGFLPDRN----------KAGEWSEPVCEQ
IL-lORb_Mu RN-----------------LEPWTTYCIQVQGFLLDQN----------RTGEWSEPICER
INaRl HU YK-----------------LSPETTYCLKVKAALLTSW----------KIGWSPVHCIK
MU PE-----------------LLPETTYCLEVKAIHPSLK----------KHSNYSTVQCIS
2 0 INaRl _ DEIQCQLAI--------PVSSLNSQYCVSAEGVLHVWG----------VTTEKSKEVCIT
INgR HU
INgR MU NETLCELNI--------SVSTLDSRYCISVDGISSFWQ----------VRTEKSKDVCIP
Mu TLT---------------VPIGVRKFCVKVLPRLESRI----------NKAEWSEEQCLL
IL-lORa _ SLL---------------TSGEVGEFCVQVKPSVASRS----------NKGMWSKEECIS
Hu IL-lORa _ VLG---------------NLKPYRWCLQTEAQLILKNKK------IRPHGLLSNVSCHE
2 5 INgS HU
Hu TW-----------------LEPNTLYCVHVESFVPGPP----------RRAQPSEKQCAR
Zcytor7 _ GLTPDTEFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYL
HU
_ TYIID-------------KLIPNTNYCVSVYLEHSDEQ-----------AVIKSPLKCTL
INaR2 HU
_ YE-_--_-__-_____-_____--_LLYRVFIINNSLE-_--_-_-_-~Q~yEGAHRA
IL-lORb Hu TTHDETVP-IL-lORb_Mu TGNDEITP-INaRl_HU TTVENELPP
3 5 INaRl_MU TTVANKMPV
INgR HU IFNSSIKG-INgR MU PFHDDRKD-IL-lORa_Mu ITTEQYFT-IL-lORa_Hu LT-RQYFT-4 0 INgS HU TTANASAR
Zcytor7_Hu TLKDQSSE
CYTOR11_HU SYRYVTKPP
INaR2_HU LPPGQESES
Table 5 shows comparison of sequences of cytokine receptor subunits with the primate, e.g., human, DCRS3.1 (50R), and DCRS4.1 (cytor). Both of the new genes are likely alpha type receptor subunits, and thus should bind to ligand without the need for a beta subunit. Based upon structural features, the ligand for the DCRS3 subunits are likely to be a member of the family of cytokines which includes IL-2, IL-4, IL-7, IL-9, and the additional cytokines which signal through IL-2y common receptor-like subunits IL-13, IL-15, and the TSLP ligand.
Similarly, the ligand for the DCRS4 receptor subunits are probably a ligand in the IL-10 or IFN families, which may be a multi-subunit cytokine, analogous to IL-6 and IL-12.
5 As used herein, the term DCRS3 shall be used to describe a protein comprising an amino acid sequence shown in Table 1;
likewise with DCRS4 and Table 3. In many cases, a substantial fragment thereof will be functionally or structurally equivalent, including, e.g., an extracellular or intracellular 10~ domain. The invention also includes a protein variation of a DCRS3 allele whose sequence is provided, e.g., a mutein or soluble extracellular construct. Typically, such agonists or antagonists will exhibit less than about 10°s sequence differences, and thus will often have between 1- and 11-fold 15 substitutions, e.g., 2-, 3-, 5-, 7-fold, and others. It also encompasses allelic and other variants, e.g., natural polymorphic, of the protein described. Typically, it will bind to its corresponding biological ligand, perhaps in a dimerized state with an alpha receptor subunit, with high affinity, e.g., 20 at least about 100 nM, usually better than about 30 nM, preferably better than about 10 nM, and more preferably at better than about 3 nM. The term shall also be used herein to refer to related naturally occurring forms, e.g., alleles, polymorphic variants, and metabolic variants of the mammalian 25 protein. Preferred forms of the receptor complexes will bind the appropriate ligand with an affinity and selectivity appropriate for a ligand-receptor interaction.
This invention also encompasses combinations of proteins or peptides having substantial amino acid sequence identity with 30 the amino acid sequences in Tables 1 or 3. It will include sequence variants with relatively few substitutions, e.g., preferably less than about 3-5.
A substantial polypeptide "fragment", or "segment", is a stretch of amino acid residues of at least about 8 amino acids, generally at least 10 amino acids, more generally at least 12 amino acids, often at least 14 amino acids, more often at least 16 amino acids, typically at least 18 amino acids, more typically at least 20 amino acids, usually at least 22 amino acids, more usually at least 24 amino acids, preferably at least 26 amino acids, more preferably at least 28 amino acids, and, in particularly preferred embodiments, at least about 30 or more amino acids. Sequences of segments of different proteins can be compared to one another over appropriate length stretches. In many situations, fragments may exhibit functional properties of the intact subunits, e.g., the extracellular domain of the transmembrane receptor may retain the ligand binding features, 10~ and may be used to prepare a soluble receptor-like complex.
Amino acid sequence homology, or sequence identity, is determined by optimizing residue matches. In some comparisons, gaps may be introduces, as required. See, e.g., Needleham, et al., (1970) J. Mol. Biol. 48:443-453; Sankoff, et al., (1983) chapter one in Time Warps String Edits and Macromolecules: The Theory and Practice of Seauence Comparison, Addison-Wesley, Reading, MA; and software packages from IntelliGenetics, Mountain View, CA; and the University of Wisconsin Genetics Computer Group (GCG), Madison, WI; each of which is incorporated herein by reference. This changes when considering conservative substitutions as matches. Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Homologous amino acid sequences are intended to include natural allelic and interspecies variations in the cytokine sequence. Typical homologous proteins or peptides will have from 50-100% homology (if gaps can be introduced), to 60-100% homology (if conservative substitutions are included) with an amino acid sequence segment of Table 1 or 3. Homology measures will be at least about 70%, generally at least 76%, more generally at least 81%, often at least 85%, more often at least 88%, typically at least 90%, more typically at least 92%, usually at least 94%, more usually at least 95%, preferably at least 96%, and more preferably at least 97%, and in particularly preferred embodiments, at least 98% or more. The degree of homology will vary with the length of the compared segments. Homologous proteins or peptides, such as the allelic variants, will share most biological activities with the embodiments described in Table 1 or 3.
As used herein, the term "biological activity" is used to describe, without limitation, effects on inflammatory responses, innate immunity, and/or morphogenic development by cytokine-like ligands. For example, these receptors should mediate phosphatase or phosphorylase activities, which activities are 10~ easily measured by standard procedures. See, e.g., Hardie, et al. (eds. 1995) The Protein Kinase FactBook vols. I and II, Academic Press, San Diego, CA; Hanks, et al. (1991) Meth.
Enzymol. 200:38-62; Hunter, et al. (1992) Cell 70:375-388; Lewin (1990) Cell 61:743-752; Pines, et al. (1991) Cold Spring Harbor Sump. Ouant. Biol. 56:449-463; and Parker, et al. (1993) Nature 363:736-738. The receptors, or portions thereof, may be useful as phosphate labeling enzymes to label general or specific substrates. The subunits may also be functional immunogens to elicit recognizing antibodies, or antigens capable of binding antibodies.
The terms ligand, agonist, antagonist, and analog of, e.g., a DCRS3, include molecules that modulate the characteristic cellular responses to cytokine ligand proteins, as well as molecules possessing the more standard structural binding competition features of ligand-receptor interactions, e.g., where the receptor is a natural receptor or an antibody. The cellular responses likely are typically mediated through receptor tyrosine kinase pathways.
Also, a ligand is a molecule which serves either as a natural ligand to which said receptor, or an analog thereof, binds, or a molecule which is a functional analog of the natural ligand. The functional analog may be a ligand with structural modifications, or may be a wholly unrelated molecule which has a molecular shape which interacts with the appropriate ligand binding determinants. The ligands may serve as agonists or antagonists, see, e.g., Goodman, et al. (eds. 1990) Goodman &
Gilman's~ The Pharmacological Bases of Therapeutics, Pergamon Press, New York.
Rational drug design may also be based upon structural studies of the molecular shapes of a receptor or antibody and other effectors or ligands. See, e.g., Herz, et al. (1997) J.
ReceQt Sianal Transduct. Res. 17:671-776; and Chaiken, et al.
(1996) Trends Biotechnol. 14:369-375. Effectors may be other proteins which mediate other functions in response to ligand binding, or other proteins which normally interact with the 10~ receptor. One means for determining which sites interact with specific other proteins is a physical structure determination, e.g., x-ray crystallography or 2 dimensional NMR techniques.
These will provide guidance as to which amino acid residues form molecular contact regions. For a detailed description of protein structural determination, see, e.g., Blundell and Johnson (1976) Protein Crystallography, Academic Press, New York, which is hereby incorporated herein by reference.
II. Activities The cytokine receptor-like proteins will have a number of different biological activities, e.g., modulating cell proliferation, or in phosphate metabolism, being added to or removed from specific substrates, typically proteins. Such will generally result in modulation of an inflammatory function, other innate immunity response, or a morphological effect, as typical of cytokine or interleukin signaling. The subunit may have a specific low affinity binding to the ligand.
The receptors may signal through the JAK pathway. See, e.g., Ihle, et al. (1997) Stem Cells 15(suppl. 1):105-111;
Silvennoinen, et al. (1997) APMIS 105:497-509; Levy (1997) Cytokine Growth Factor Review 8:81-90; Winston and Hunter (1996) Current Biol. 6:668-671; Barrett (1996) Baillieres Clin.
Gastroenterol. 10:1-15; and Briscoe, et al. (1996) Philos.
Trans. R. Soc. Lond. B. Biol. Sci. 351:167-171.
The biological activities of the cytokine receptor subunits will be related to addition or removal of phosphate moieties to substrates, typically in a specific manner, but occasionally in a non specific manner. Substrates may be identified, or conditions for enzymatic activity may be assayed by standard methods, e.g., as described in Hardie, et al. (eds. 1995) The Protein Kinase FactBook vols. I and II, Academic Press, San Diego, CA; Hanks, et al. (1991) Meth. Enzymol. 200:38-62;
Hunter, et al. (1992) Cell 70:375-388; Lewin (1990) Cell 61:743-752; Pines, et al. (1991) Cold Spring Harbor Symp. Ouant. Biol.
56:449-463; and Parker, et al. (1993) Nature 363:736-738.
The receptor subunits may combine with other subunits, 10~ e.g., beta subunits, to form functional complexes, e.g., which may be useful for binding ligand or preparing antibodies. These will have substantial diagnostic uses, including detection or quantitation.
ILI. Nucleic Acids This invention contemplates use of isolated nucleic acid or fragments, e.g., which encode these or closely related proteins, or fragments thereof, e.9., to encode a corresponding polypeptide_, preferably one which is biologically active. In addition, this invention covers isolated or recombinant DNAs which encode combinations of such proteins or polypeptides having characteristic sequences, e.g., of DCRS3s or DCRS4s.
Typically, the nucleic acid is capable of hybridizing, under appropriate conditions, with a nucleic acid sequence segment shown in Tables 1 or 3, but preferably not with a corresponding segment of other receptors, e.g., described in Table 5. Said biologically active protein or polypeptide can be a full length protein, or fragment, and will typically have a segment of amino acid sequence highly homologous, e.g., exhibiting significant stretches of identity, to ones shown in Tables 1 or 3. Further, this invention covers the use of isolated or recombinant nucleic acid, or fragments thereof, which encode proteins having fragments which are equivalent to DCRS3 or DCRS4 proteins. The isolated nucleic acids can have the respective regulatory sequences in the 5' and 3' flanks, e.g., promoters, enhancers, poly-A addition signals, and others from the natural gene.
Combinations, as described, are also provided.
An "isolated" nucleic acid is a nucleic acid, e.g., an RNA, DNA, or a mixed polymer, which is substantially pure, e.g., separated from other components which naturally accompany a native sequence, such as ribosomes, polymerases, and flanking 5 genomic sequences from the originating species. The term embraces a nucleic acid sequence which has been removed from its naturally occurring environment, and includes recombinant or cloned DNA isolates, which are thereby distinguishable from naturally occurring compositions, and chemically synthesized 10~ analogs or analogs biologically synthesized by heterologous systems. A substantially pure molecule includes isolated forms of the molecule, either completely or substantially pure.
An isolated nucleic acid will generally be a homogeneous composition of molecules, but will, in some embodiments, contain 15 heterogeneity, preferably minor. This heterogeneity is typically found at the polymer ends or portions not critical to a desired biological function or activity.
A "recombinant" nucleic acid is typically defined either by its method of production or its structure. In reference to its 20 method of production, e.g., a product made by a process, the process is use of recombinant nucleic acid techniques, e.g., involving human intervention in the nucleotide sequence.
Typically this intervention involves in vitro manipulation, although under certain circumstances it may involve more 25 classical animal breeding techniques. Alternatively, it can be a nucleic acid made by generating a sequence comprising fusion of two fragments which are not naturally contiguous to each other, but is meant to exclude products of nature, e.g., naturally occurring mutants as found in their natural state.
30 Thus, for example, products made by transforming cells with an unnaturally occurring vector is encompassed, as are nucleic acids comprising sequence derived using any synthetic oligonucleotide process. Such a process is often done to replace a codon with a redundant codon encoding the same or a 35 conservative amino acid, while typically introducing or removing a restriction enzyme sequence recognition site. Alternatively, the process is performed to join together nucleic acid segments of desired functions to generate a single genetic entity comprising a desired combination of functions not found in the commonly available natural forms, e.g., encoding a fusion protein. Restriction enzyme recognition sites are often the target of such artificial manipulations, but other site specific targets, e.g., promoters, DNA replication sites, regulation sequences, control sequences, or other useful features may be incorporated by design. A similar concept is intended for a recombinant, e.g., fusion, polypeptide. This will include a 10~ dimeric repeat. Specifically included are synthetic nucleic acids which, by genetic code redundancy, encode equivalent polypeptides to fragments of DCRS3 or DCRS4 and fusions of sequences from various different related molecules, e.g., other cytokine receptor family members.
A "fragment" in a nucleic acid context is a contiguous segment of at least about 17 nucleotides, generally at least 21 nucleotides, more generally at least 25 nucleotides, ordinarily at least 30 nucleotides, more ordinarily at least 35 nucleotides, often at least 39 nucleotides, more often at least 45 nucleotides, typically at least 50 nucleotides, more typically at least 55 nucleotides, usually at least 60 nucleotides, more usually at least 66 nucleotides, preferably at least 72 nucleotides, more preferably at least 79 nucleotides, and in particularly preferred embodiments will be at least 85 or more nucleotides. Typically, fragments of different genetic sequences can be compared to one another over appropriate length stretches, particularly defined segments such as the domains described below.
A nucleic acid which codes for a DCRS3 or DCRS4 will be particularly useful to identify genes, mRNA, and cDNA species which code for itself or closely related proteins, as well as DNAs which code for polymorphic, allelic, or other genetic variants, e.g., from different individuals or related species.
Preferred probes for such screens are those regions of the interleukin which are conserved between different polymorphic variants or which contain nucleotides which lack specificity, and will preferably be full length or nearly so. In other situations, polymorphic variant specific sequences will be more useful .
This invention further covers recombinant nucleic acid molecules and fragments having a nucleic acid sequence identical to or highly homologous to the isolated DNA set forth herein.
In particular, the sequences will often be operably linked to DNA segments which control transcription, translation, and DNA
replication. These additional segments typically assist in expression of the desired nucleic acid segment.
10~ Homologous, or highly identical, nucleic acid sequences, when compared to one another, e.g., DCRS3 sequences, exhibit significant similarity. The standards for homology in nucleic acids are either measures for homology generally used in the art by sequence comparison or based upon hybridization conditions.
Comparative hybridization conditions are described in greater detail below.
Substantial identity in the nucleic acid sequence comparison context means either that the segments, or their complementary strands, when compared, are identical when optimally aligned, with appropriate nucleotide insertions or deletions, in at least about 60% of the nucleotides, generally at least 66%, ordinarily at least 71%, often at least 76%, more often at least 80%, usually at least 84%, more usually at least 88%, typically at least 91%, more typically at least about 93%, preferably at least about 95%, more preferably at least about 96 to 98% or more, and in particular embodiments, as high at about 99% or more of the nucleotides, including, e.g., segments encoding structural domains such as the segments described below. Alternatively, substantial identity will exist when the segments will hybridize under selective hybridization conditions, to a strand or its complement, typically using a sequence derived from Tables 1 or 3. Typically, selective hybridization will occur when there is at least about 55%
homology over a stretch of at least about 14 nucleotides, more typically at least about 65%, preferably at least about 75%, and more preferably at least about 90%. See, Kanehisa (1984) Nucl.
Acids Res. 12:203-213, which is incorporated herein by reference. The length of homology comparison, as described, may be over longer stretches, and in certain embodiments will be over a stretch of at least about 17 nucleotides, generally at least about 20 nucleotides, ordinarily at least about 24 nucleotides, usually at least about 28 nucleotides, typically at least about 32 nucleotides, more typically at least about 40 nucleotides, preferably at least about 50 nucleotides, and more preferably at least about 75 to 100 or more nucleotides. This includes, e.g., 125, 150, 175, 200, 225, 250, 275, 300, 325, 10~ 350, 375, 400, 425, 450, 475, 500, 525, 544, and other lengths.
Stringent conditions, in referring to homology in the hybridization context, will be stringent combined conditions of salt, temperature, organic solvents, and other parameters typically controlled in hybridization reactions. Stringent temperature conditions will usually include temperatures in excess of about 30 C, more usually in excess of about 37 C, typically in excess of about 45 C, more typically in excess of about 55 C, preferably in excess of about 65 C, and more preferably in excess of about 70 C. Stringent salt conditions will ordinarily be less than about 500 mM, usually less than about 400 mM, more usually less than about 300 mM, typically less than about 200 mM, preferably less than about 100 mM, and more preferably less than about 80 mM, even down to less than about 20 mM. However, the combination of parameters is much more important than the measure of any single parameter. See, e.g., Wetmur and Davidson (1968) J. Mol. Biol. 31:349-370, which is hereby incorporated herein by reference.
The isolated DNA can be readily modified by nucleotide substitutions, nucleotide deletions, nucleotide insertions, and inversions of nucleotide stretches. These modifications result in novel DNA sequences which encode this protein or its derivatives. These modified sequences can be used to produce mutant proteins (muteins) or to enhance the expression of variant species. Enhanced expression may involve gene amplification, increased transcription, increased translation, and other mechanisms. Such mutant DCRS-like derivatives include predetermined or site-specific mutations of the protein or its fragments, including silent mutations using genetic code degeneracy. "Mutant DCRS3" as used herein encompasses a polypeptide otherwise falling within the homology definition of the DCRS3 as set forth above, but having an amino acid sequence which differs from that of other cytokine receptor-like proteins as found in nature, whether by way of deletion, substitution, or insertion. In particular, "site specific mutant DCRS3"
encompasses a protein having substantial sequence identity with a protein of Table 1, and typically shares most of the 10~ biological activities or effects of the forms disclosed herein.
Likewise in reference to DCRS4.
Although site specific mutation sites are predetermined, mutants need not be site specific. Mammalian DCRS3 mutagenesis can be achieved by making amino acid insertions or deletions in the gene, coupled with expression. Substitutions, deletions, insertions, or many combinations may be generated to arrive at a final construct. Insertions include amino- or carboxy- terminal fusions. Random mutagenesis can be conducted at a target codon and the expressed mammalian DCRS3 mutants can then be screened for the desired activity, providing some aspect of a structure-activity relationship. Methods for making substitution mutations at predetermined sites in DNA having a known sequence are well known in the art, e.g., by M13 primer mutagenesis. See also Sambrook, et al. (1989) and Ausubel, et al. (1987 and periodic Supplements).
The mutations in the DNA normally should not place coding sequences out of reading frames and preferably will not create complementary regions that could hybridize to produce secondary mRNA structure such as loops or hairpins.
The phosphoramidite method described by Beaucage and Carruthers (1981) Tetra. Letts. 22:1859-1862, will produce suitable synthetic DNA fragments. A double stranded fragment will often be obtained either by synthesizing the complementary strand and annealing the strand together under appropriate conditions or by adding the complementary strand using DNA
polymerase with an appropriate primer sequence.
Polymerase chain reaction (PCR) techniques can often be applied in mutagenesis. Alternatively, mutagenesis primers are commonly used methods for generating defined mutations at predetermined sites. See, e.g., Innis, et al. (eds. 1990) PCR
5 Protocols: A Guide to Methods and Applications Academic Press, San Diego, CA; and Dieffenbach and Dveksler (1995; eds.) PCR
Primer: A Laboratory Manual Cold Spring Harbor Press, CSH, NY.
Certain embodiments of the invention are directed to combination compositions comprising the receptor or ligand 10~ sequences described. In other embodiments, functional portions of the sequences may be joined to encode fusion proteins. In other forms, variants of the described sequences may be substituted.
15 IV. Proteins, Peptides As described above, the present invention encompasses primate DCRS3, e.g., whose sequences are disclosed in Table l, and described above. Allelic and other variants are also contemplated, including, e.g., fusion proteins combining 20 portions of such sequences with others, including, e.g., epitope tags and functional domains.
The present invention also provides recombinant proteins, e.g., heterologous fusion proteins using segments from these primate or rodent proteins. A heterologous fusion protein is a 25 fusion of proteins or segments which are naturally not normally fused in the same manner. Thus, the fusion product of a DCRS3 with another cytokine receptor is a continuous protein molecule having sequences fused in a typical peptide linkage, typically made as a single translation product and exhibiting properties, 30 e.g., sequence or antigenicity, derived from each source peptide. A similar concept applies to heterologous nucleic acid sequences. Combinations of various designated proteins into complexes are also provided.
In addition, new constructs may be made from combining 35 similar functional or structural domains from other related proteins, e.g., cytokine receptors or Toll-like receptors, including species variants. For example, ligand-binding or other segments may be "swapped" between different new fusion polypeptides or fragments. See, e.g., Cunningham, et al. (1989) Science 243:1330-1336; and O'Dowd, et al. (1988) J. Biol. Chem.
263:15985-15992, each of which is incorporated herein by reference. Thus, new chimeric polypeptides exhibiting new combinations of specificities will result from the functional linkage of receptor-binding specificities. For example, the ligand binding domains from other related receptor molecules may be added or substituted for other domains of this or related 10~ proteins. The resulting protein will often have hybrid function and properties. For example, a fusion protein may include a targeting domain which may serve to provide sequestering of the fusion protein to a particular subcellular organelle.
Candidate fusion partners and sequences can be selected from various sequence data bases, e.g., GenBank, c/o IntelliGenetics, Mountain View, CA; and BCG, University of Vdisconsin Biotechnology Computing Group, Madison, WI, which are each incorporated herein by reference. In particular, combinations of polypeptide sequences provided in Tables 1 and 3 are particularly preferred. Variant forms of the proteins may be substituted in the described combinations.
The present invention particularly provides muteins which bind cytokine-like ligands, and/or which are affected in signal transduction. Structural alignment of human DCRS3 or DCRS4 with other members of the cytokine receptor family show conserved features/residues. See Table 5. Alignment of human DCRS3 or DCRS4 sequence with other members of the cytokine receptor family indicates various structural and functionally shared features. See also, Bazan, et al. (1996) Nature 379:591; Lodi, et al. (1994) Science 263:1762-1766; Sayle and Milner-White (1995) TIBS 20:374-376; and Gronenberg, et al. (1991) Protein Enaineerina 4:263-269.
Substitutions with either mouse sequences or human sequences are particularly preferred. Conversely, conservative substitutions away from the ligand binding interaction regions will probably preserve most signaling activities; and conservative substitutions away from the intracellular domains will probably preserve most ligand binding properties.
"Derivatives" of primate DCRS3 include amino acid sequence mutants, glycosylation variants, metabolic derivatives and covalent or aggregative conjugates with other chemical moieties.
Covalent derivatives can be prepared by linkage of functionalities to groups which are found in DCRS3 amino acid side chains or at the N- or C- termini, e.g., by means which are well known in the art. These derivatives can include, without 10~ limitation, aliphatic esters or amides of the carboxyl terminus, or of residues containing carboxyl side chains, O-acyl derivatives of hydroxyl group-containing residues, and N-acyl derivatives of the amino terminal amino acid or amino-group containing residues, e.g., lysine or arginine. Acyl groups are selected from the group of alkyl-moieties, including C3 to C18 normal alkyl, thereby forming alkanoyl aroyl species.
In particular, glycosylation alterations are included, e.g., made by modifying the glycosylation patterns of a polypeptide during its synthesis and processing, or in further processing steps. Particularly preferred means for accomplishing this are by exposing the polypeptide to glycosylating enzymes derived from cells which normally provide such processing, e.g., mammalian glycosylation enzymes.
Deglycosylation enzymes are also contemplated. Also embraced are versions of the same primary amino acid sequence which have other minor modifications, including phosphorylated amino acid residues, e.g., phosphotyrosine, phosphoserine, or phosphothreonine.
A major group of derivatives are covalent conjugates of the receptors or fragments thereof with other proteins of polypeptides. These derivatives can be synthesized in recombinant culture such as N- or C-terminal fusions or by the use of agents known in the art for their usefulness in cross-linking proteins through reactive side groups. Preferred derivatization sites with cross-linking agents are at free amino groups, carbohydrate moieties, and cysteine residues.
Fusion polypeptides between the receptors and other homologous or heterologous proteins are also provided.
Homologous polypeptides may be fusions between different receptors, resulting in, for instance, a hybrid protein exhibiting binding specificity for multiple different cytokine ligands, or a receptor which may have broadened or weakened specificity of substrate effect. Likewise, heterologous fusions may be constructed which would exhibit a combination of properties or activities of the derivative proteins. Typical 10~ examples are fusions of a reporter polypeptide, e.g., luciferase, with a segment or domain of a receptor, e.g., a ligand-binding segment, so that the presence or location of a desired ligand may be easily determined. See, e.g., Dull, et al., U.S. Patent No. 4,859,609, which is hereby incorporated herein by reference. Other gene fusion partners include glutathione-S-transferase (GST), bacterial f~-galactosidase, trpE, Protein A, f3-lactamase, alpha amylase, alcohol dehydrogenase, and yeast alpha mating factor. See, e.g., Godowski, et al. (1988) Science 241:812-816. Labeled proteins will often be substituted in the described combinations of proteins.
The phosphoramidite method described by Beaucage and Carruthers (1981) Tetra. Letts. 22:1859-1862, will produce suitable synthetic DNA fragments. A double stranded fragment will often be obtained either by synthesizing the complementary strand and annealing the strand together under appropriate conditions or by adding the complementary strand using DNA
polymerase with an appropriate primer sequence.
Such polypeptides may also have amino acid residues which have been chemically modified by phosphorylation, sulfonation, biotinylation, or the addition or removal of other moieties, particularly those which have molecular shapes similar to phosphate groups. In some embodiments, the modifications will be useful labeling reagents, or serve as purification targets, e.9., affinity ligands.
Fusion proteins will typically be made by either recombinant nucleic acid methods or by synthetic polypeptide methods. Techniques for nucleic acid manipulation and expression are described generally, for example, in Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed.), Vols. 1-3, Cold Spring Harbor Laboratory, and Ausubel, et al.
(eds. 1987 and periodic supplements) Current Protocols in Molecular Bioloay, Greene/Wiley, New York, which are each incorporated herein by reference. Techniques for synthesis of polypeptides -are described, for example, in Merrifield (1963) J.
Amer. Chem. Soc. 85:2149-2156; Merrifield (1986) Science 232:
10~ 341-347; and Atherton, et al. (1989) Solid Phase Peptide Synthesis: A Practical Approach, IRL Press, Oxford; each of which is incorporated herein by reference. See also Dawson, et al. (1994) Science 266:776-779 for methods to make larger polypeptides.
This invention also contemplates the use of derivatives of a DCRS3 or DCRS4 other than variations in amino acid sequence or glycosylation. Such derivatives may involve covalent or aggregative association with chemical moieties. These derivatives generally fall into three classes: (1) salts, (2) side chain and terminal residue covalent modifications, and (3) adsorption complexes, for example with cell membranes. Such covalent or aggregative derivatives are useful as immunogens, as reagents in immunoassays, or in purification methods such as for affinity purification of a receptor or other binding molecule, e.g., an antibody. For example, a cytokine ligand can be immobilized by covalent bonding to a solid support such as cyanogen bromide-activated Sepharose, by methods which are well known in the art, or adsorbed onto polyolefin surfaces, with or without glutaraldehyde cross-linking, for use in the assay or purification of a cytokine receptor, antibodies, or other similar molecules. The ligand can also be labeled with a detectable group, for example radioiodinated by the chloramine T
procedure, covalently bound to rare earth chelates, or conjugated to another fluorescent moiety for use in diagnostic assays.
A combination, e.g., including a DCRS3 or DCRS4, of this invention can be used as an immunogen for the production of antisera or antibodies specific, e.g., capable of distinguishing between other cytokine receptor family members, for the combinations described. The complexes can be used to screen monoclonal antibodies or antigen-binding fragments prepared by 5 immunization with various forms of impure preparations containing the protein. In particular, the term "antibodies"
also encompasses antigen binding fragments of natural antibodies, e.g., Fab, Fab2, Fv, etc. A purified DCRS3 can also be used as a reagent to detect antibodies generated in response 10~ to the presence of elevated levels of expression, or immunological disorders which lead to antibody production to the endogenous receptor. Additionally, DCRS3 fragments may also serve as immunogens to produce the antibodies of the present invention, as described immediately below. For example, this 15 invention contemplates antibodies having binding affinity to or being raised against the amino acid sequences shown in Table 1, fragments thereof, or various homologous peptides. In particular, this invention contemplates antibodies having binding affinity to, or having been raised against, specific 20 fragments which are predicted to be, or actually are, exposed at the exterior protein surface of a native DCRS3. Complexes of combinations of proteins will also be useful, and antibody preparations thereto can be made.
The blocking of physiological response to the receptor 25 ligands may result from the inhibition of binding of the ligand to the receptor, likely through competitive inhibition. Thus, in vitro assays of the present invention will often use antibodies or antigen binding segments of these antibodies, or fragments attached to solid phase substrates. These assays will 30 also allow for the diagnostic determination of the effects of either ligand binding region mutations and modifications, or other mutations and modifications, e.g., which affect signaling or enzymatic function.
This invention also contemplates the use of competitive 35 drug screening assays, e.g., where neutralizing antibodies to the receptor complexes or fragments compete with a test compound for binding to a ligand or other antibody. In this manner, the neutralizing antibodies or fragments can be used to detect the presence of a polypeptide which shares one or more binding sites to a receptor and can also be used to occupy binding sites on a receptor that might otherwise bind a ligand.
V. Making Nucleic Acids and Protein DNA which encodes the protein or fragments thereof can be obtained by chemical synthesis, screening cDNA libraries, or by screening genomic libraries prepared from a wide variety of cell 10~ lines or tissue samples. Natural sequences can be isolated using standard methods and the sequences provided herein, e.g., in Tables 1 or 3. Other species counterparts can be identified by hybridization techniques, or by various PCR techniques, combined with or by searching in sequence databases, e.g., GenBank.
This DNA can be expressed in a wide variety of host cells for the synthesis of a full-length receptor or fragments which can in turn, for example, be used to generate polyclonal or monoclonal antibodies; for binding studies; for construction and expression of modified ligand binding or kinase/phosphatase domains; and for structure/function studies. Variants or fragments can be expressed in host cells that are transformed or transfected with appropriate expression vectors. These molecules can be substantially free of protein or cellular contaminants, other than those derived from the recombinant host, and therefore are particularly useful in pharmaceutical compositions when combined with a pharmaceutically acceptable carrier and/or diluent. The protein, or portions thereof, may be expressed as fusions with other proteins. Combinations of the described proteins, or nucleic acids encoding them, are particularly interesting.
Expression vectors are typically self-replicating DNA or RNA constructs containing the desired receptor gene or its fragments, usually operably linked to suitable genetic control elements that are recognized in a suitable host cell. These control elements are capable of effecting expression within a suitable host. The multiple genes may be coordinately expressed, and may be on a polycistronic message. The specific type of control elements necessary to effect expression will depend upon the eventual host cell used. Generally, the genetic control elements can include a prokaryotic promoter system or a eukaryotic promoter expression control system, and typically include a transcriptional promoter, an optional operator to control the onset of transcription, transcription enhancers to elevate the level of mRNA expression, a sequence that encodes a suitable ribosome binding site, and sequences that terminate 10~ transcription and translation. Expression vectors also usually contain an origin of replication that allows the vector to replicate independently of the host cell.
The vectors of this invention include those which contain DNA which encodes a combination of proteins, as described, or a biologically active equivalent polypeptide. The DNA can be under the control of a viral promoter and can encode a selection marker. This invention further contemplates use of such expression vectors which are capable of expressing eukaryotic cDNAs coding for such proteins in a prokaryotic or eukaryotic host, where the vector is compatible with the host and where the eukaryotic cDNAs are inserted into the vector such that growth of the host containing the vector expresses the cDNAs in question. Usually, expression vectors are designed for stable replication in their host cells or for amplification to greatly increase the total number of copies of the desirable gene per cell. It is not always necessary to require that an expression vector replicate in a host cell, e.g., it is possible to effect transient expression of the protein or its fragments in various hosts using vectors that do not contain a replication origin that is recognized by the host cell. It is also possible to use vectors that cause integration of the protein encoding portions into the host DNA by recombination.
Vectors, as used herein, comprise plasmids, viruses, bacteriophage, integratable DNA fragments, and other vehicles which enable the integration of DNA fragments into the genome of the host. Expression vectors are specialized vectors which contain genetic control elements that effect expression of operably linked genes. Plasmids are the most commonly used form of vector but all other forms of vectors which serve an equivalent function and which are, or become, known in the art are suitable for use herein. See, e.g., Pouwels, et al. (1985 and Supplements) Cloning Vectors: A Laboratory Manual, Elsevier, N.Y., and Rodriguez, et al. (eds. 1988) Vectors: A Survey of Molecular Cloninct Vectors and Their Uses, Buttersworth, Boston, which are incorporated herein by reference.
Transformed cells are cells, preferably mammalian, that 10~ have been transformed or transfected with vectors constructed using recombinant DNA techniques. Transformed host cells usually express the desired proteins, but for purposes of cloning, amplifying, and manipulating its DNA, do not need to express.the subject proteins. This invention further contemplates culturing transformed cells in a nutrient medium, thus permitting the proteins to accumulate. The proteins can be recovered, either from the culture or, in certain instances, from the culture medium.
For purposes of this invention, nucleic sequences are operably linked when they are functionally related to each other. For example, DNA for a presequence or secretory leader is operably linked to a polypeptide if it is expressed as a preprotein or participates in directing the polypeptide to the cell membrane or in secretion of the polypeptide. A promoter is operably linked to a coding sequence if it controls the transcription of the polypeptide; a ribosome binding site is operably linked to a coding sequence if it is positioned to permit translation. Usually, operably linked means contiguous and in reading frame, however, certain genetic elements such as repressor genes are not contiguously linked but still bind to operator sequences that in turn control expression.
Suitable host cells include prokaryotes, lower eukaryotes, and higher eukaryotes. Prokaryotes include both gram negative and gram positive organisms, e.g., E. coli and B. subtilis.
Lower eukaryotes include yeasts, e.g., S. cerevisiae and Pichia, and species of the genus Dictyostelium. Higher eukaryotes include established tissue culture cell lines from animal cells, both of non-mammalian origin, e.g., insect cells, and birds, and of mammalian origin, e.g., human, primates, and rodents.
Prokaryotic host-vector systems include a wide variety of vectors for many different species. As used herein, E. coli and its vectors will be used generically to include equivalent vectors used in other prokaryotes. A representative vector for amplifying DNA is pBR322 or many of its derivatives. Vectors that can be used to express the receptor or its fragments include, but are not limited to, such vectors as those 10~ containing the lac promoter (pUC-series); trp promoter (pBR322-trp); Ipp promoter (the pIN-series); lambda-pP or pR
promoters (pOTS); or hybrid promoters such as ptac (pDR540).
See Brosius, et al. (1988) "Expression Vectors Employing Lambda-, trp-, lac-, and Ipp-derived Promoters", in Vectors: A
Survey of Molecular Clonincl Vectors and Their Uses, (eds.
Rodriguez and Denhardt), Buttersworth, Boston, Chapter 10, pp.
205-236, which is incorporated herein by reference.
Lower eukaryotes, e.g., yeasts and Dictyostelium, may be transformed with DCRS3 or DCRS4 sequence containing vectors.
For purposes of this invention, the most common lower eukaryotic host is the baker's yeast, Saccharomvces cerevisiae. It will be used to generically represent lower eukaryotes although a number of other strains and species are also available. Yeast vectors typically consist of a replication origin (unless of the integrating type), a selection gene, a promoter, DNA encoding the receptor or its fragments, and sequences for translation termination, polyadenylation, and transcription termination.
Suitable expression vectors for yeast include such constitutive promoters as 3-phosphoglycerate kinase and various other glycolytic enzyme gene promoters or such inducible promoters as the alcohol dehydrogenase 2 promoter or metallothionine promoter. Suitable vectors include derivatives of the following types: self-replicating low copy number (such as the YRp-series), self-replicating high copy number (such as the YEp-series); integrating types (such as the YIp-series), or mini-chromosomes (such as the YCp-series).
Higher eukaryotic tissue culture cells are normally the preferred host cells for expression of the functionally active interleukin or receptor proteins. In principle, many higher eukaryotic tissue culture cell lines are workable, e.g., insect 5 baculovirus expression systems, whether from an invertebrate or vertebrate source. However, mammalian cells are preferred.
Transformation or transfection and propagation of such cells has become a routine procedure. Examples of useful cell lines include HeLa cells, Chinese hamster ovary (CHO) cell lines, baby 10~ rat kidney (BRK) cell lines, insect cell lines, bird cell lines, and monkey (COS) cell lines. Expression vectors for such cell lines usually include an origin of replication, a promoter, a translation initiation site, RNA splice sites (if genomic DNA is used), a polyadenylation site, and a transcription termination 15 site. These vectors also usually contain a selection gene or amplification gene. Suitable expression vectors may be plasmids, viruses, or retroviruses carrying promoters derived, e.g., from such sources as from adenovirus, SV40, parvoviruses, vaccinia virus, or cytomegalovirus. Representative examples of 20 suitable expression vectors include pCDNAl; pCD, see Okayama, et al. (1985) Mol. Cell Biol. 5:1136-1142; pMClneo PolyA, see Thomas, et al. (1987) Cell 51:503-512; and a baculovirus vector such as pAC 373 or pAC 610.
For secreted proteins and some membrane proteins, an open 25 reading frame usually encodes a polypeptide that consists of a mature or secreted product covalently linked at its N-terminus to a signal peptide. The signal peptide is cleaved prior to secretion of the mature, or active, polypeptide. The cleavage site can be predicted with a high degree of accuracy from 30 empirical rules, e.g., von-Heijne (1986) Nucleic Acids Research 14:4683-4690 and Nielsen, et al. (1997) Protein Ena. 10:1-12, and the precise amino acid composition of the signal peptide often does not appear to be critical to its function, e.g., Randall, et al. (1989) Science 243:1156-1159; Kaiser et al.
35 (1987) Science 235:312-317. The mature proteins of the invention can be readily determined using standard methods.
It will often be desired to express these polypeptides in a system which provides a specific or defined glycosylation pattern. In this case, the usual pattern will be that provided naturally by the expression system. However, the pattern will be modifiable by exposing the polypeptide, e.g., an unglycosylated form, to appropriate glycosylating proteins introduced into a heterologous expression system. For example, the receptor gene may be co-transformed with one or more genes encoding mammalian or other glycosylating enzymes. Using this 10~ approach, certain mammalian glycosylation patterns will be achievable in prokaryote or other cells. Expression in prokaryote cells will typically lead to unglycosylated forms of protein.
The source of DCRS3 or DCRS4 can be a eukaryotic or prokaryotic host expressing recombinant DCRS, such,as is described above. The source can also be a cell line, but other mammalian cell lines are also contemplated by this invention, with the preferred cell line being from the human species.
Now that the sequences are known, a primate DCRS3 or DCRS4, fragments, or derivatives thereof can be prepared by conventional processes for synthesizing peptides. These include processes such as are described in Stewart and Young (1984) Solid Phase Pe,~tide Synthesis, Pierce Chemical Co., Rockford, IL; Bodanszky and Bodanszky (1984) The Practice of Peptide Synthesis, Springer-Verlag, New York; and Bodanszky (1984) The Principles of Peptide Synthesis, Springer-Verlag, New York; all of each which are incorporated herein by reference. For example, an azide process, an acid chloride process, an acid anhydride process, a mixed anhydride process, an active ester process (for example, p-nitrophenyl ester, N-hydroxysuccinimide ester, or cyanomethyl ester), a carbodiimidazole process, an oxidative-reductive process, or a dicyclohexylcarbodiimide (DCCD)/additive process can be used. Solid phase and solution phase syntheses are both applicable to the foregoing processes.
Similar techniques can be used with partial DCRS3 or DCRS4 sequences.
DCRS3 or DCRS4 proteins, fragments, or derivatives are suitably prepared in accordance with the above processes as typically employed in peptide synthesis, generally either by a so-called stepwise process which comprises condensing an amino acid to the terminal amino acid, one by one in sequence, or by coupling peptide fragments to the terminal amino acid. Amino groups that are not being used in the coupling reaction typically must be protected to prevent coupling at an incorrect location.
10~ If a solid phase synthesis is adopted, the C-terminal amino acid is bound to an insoluble carrier or support through its carboxyl group. The insoluble carrier is not particularly limited as long as it has a binding capability to a reactive carboxyl group. Examples of such insoluble carriers include halomethyl resins, such as chloromethyl resin or bromomethyl resin, hydroxymethyl resins, phenol resins, tert-alkyloxycarbonylhydrazidated resins, and the like.
An amino group-protected amino acid is bound in sequence through condensation of its activated carboxyl group and the reactive amino group of the previously formed peptide or chain, to synthesize the peptide step by step. After synthesizing the complete sequence, the peptide is split off from the insoluble carrier to produce the peptide. This solid-phase approach is generally described by Merrifield, et al. (1963) in J. Am. Chem.
Soc. 85:2149-2156, which is incorporated herein by reference.
The prepared protein and fragments thereof can be isolated and purified from the reaction mixture by means of peptide separation, for example, by extraction, precipitation, electrophoresis, various forms of chromatography, and the like.
The receptors of this invention can be obtained in varying degrees of purity depending upon desired uses. Purification can be accomplished by use of the protein purification techniques disclosed herein, see below, or by the use of the antibodies herein described in methods of immunoabsorbant affinity chromatography. This immunoabsorbant affinity chromatography is carried out by first linking the antibodies to a solid support and then contacting the linked antibodies with solubilized lysates of appropriate cells, lysates of other cells expressing the receptor, or lysates or supernatants of cells producing the protein as a result of DNA techniques, see below.
Generally, the purified protein will be at least about 40%
pure, ordinarily at least about 50% pure, usually at least about 60% pure, typically at least about 70% pure, more typically at least about 80% pure, preferable at least about 90% pure and more preferably at least about 95% pure, and in particular embodiments, 97%-99% or more. Purity will usually be on a 10~ weight basis, but can also be on a molar basis. Different assays will be applied as appropriate. Individual proteins may be purified and thereafter combined.
VI. Antibodies Antibodies can be raised to various mammalian, e.g., primate DCRS3 or DCRS4 proteins and fragments thereof, both in naturally occurring native forms and in their recombinant forms, the difference being that antibodies to the active receptor are more likely to recognize epitopes which are only present in the native conformations. Denatured antigen detection can also be useful in, e.g., Western analysis. Anti-idiotypic antibodies are also contemplated, which would be useful as agonists or antagonists of a natural receptor or an antibody.
Antibodies, including binding fragments and single chain versions, against predetermined fragments of the protein can be raised by immunization of animals with conjugates of the fragments with immunogenic proteins. Monoclonal antibodies are prepared from cells secreting the desired antibody. These antibodies can be screened for binding to normal or defective protein, or screened for agonistic or antagonistic activity.
These monoclonal antibodies will usually bind with at least a KD
of about 1 mM, more usually at least about 300 ~M, typically at least about 100~M, more typically at least about 30 ~M, preferably at least about 10 ~M, and more preferably at least about 3 ~M or better.
The antibodies, including antigen binding fragments, of this invention can have significant diagnostic or therapeutic value. They can be potent antagonists that bind to the receptor and inhibit binding to ligand or inhibit the ability of the receptor to elicit a biological response, e.g., act on its substrate. They also can be useful as non-neutralizing antibodies and can be coupled to toxins or radionuclides to bind producing cells, or cells localized to the source of the interleukin. Further, these antibodies can be conjugated to drugs or other therapeutic agents, either directly or indirectly by means of a linker.
10~ The antibodies of this invention can also be useful in diagnostic applications. As capture or non-neutralizing antibodies, they might bind to the receptor without inhibiting ligand or substrate binding. As neutralizing antibodies, they can be useful in competitive binding assays. They will also be useful in detecting or quantifying ligand. They may be used as reagents for Western blot analysis, or for immunoprecipitation or immunopurification of the respective protein. Likewise, nucleic acids and proteins may be immobilized to solid substrates for affinity purification or detection methods. The substrates may be, e.g., solid resin beads or sheets of plastic.
Protein fragments may be joined to other materials, particularly polypeptides, as fused or covalently joined polypeptides to be used as immunogens. Mammalian cytokine receptors and fragments may be fused or covalently linked to a variety of immunogens, such as keyhole limpet hemocyanin, bovine serum albumin, tetanus toxoid, etc. See Microbiolocrv, Hoeber Medical Division, Harper and Row, 1969; Landsteiner (1962) Specificity of Serological Reactions, Dover Publications, New York; and Williams, et al. (1967) Methods in Immunology and Immunochemistry, Vol. 1, Academic Press, New York; each of which are incorporated herein by reference, for descriptions of methods of preparing polyclonal antisera. A typical method involves hyperimmunization of an animal with an antigen. The blood of the animal is then collected shortly after the repeated immunizations and the gamma globulin is isolated.
In some instances, it is desirable to prepare monoclonal antibodies from various mammalian hosts, such as mice, rodents, primates, humans, etc. Description of techniques for preparing such monoclonal antibodies may be found in, e.g., Stites, et al.
(eds.) Basic and Clinical Immunoloav (4th ed.), Lange Medical Publications, Los Altos, CA, and references cited therein;
5 Harlow and Lane (1988) Antibodies: A Laboratory Manual, CSH
Press; Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, New York; and particularly in Kohler and Milstein (1975) in Nature 256: 495-497, which discusses one method of generating monoclonal antibodies. Each 10~ of these references is incorporated herein by reference.
Summarized briefly, this method involves injecting an animal with an immunogen. The animal is then sacrificed and cells taken from its spleen, which are then fused with myeloma cells.
The result is a hybrid cell or "hybridoma" that is capable of 15 reproducing in vitro. The population of hybridomas is then screened to isolate individual clones, each of which secrete a single antibody species to the immunogen. In this manner, the individual antibody species obtained are the products of immortalized and cloned single B cells from the immune animal 20 generated in response to a specific site recognized on the immunogenic substance.
Other suitable techniques involve in vitro exposure of lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of antibodies in phage or similar 25 vectors. See, Huse, et al. (1989) "Generation of a Large Combinatorial Library of the Immunoglobulin Repertoire in Phage Lambda," Science 246:1275-1281; and Ward, et al. (1989) Nature 341:544-546, each of which is hereby incorporated herein by reference. The polypeptides and antibodies of the present 30 invention may be used with or without modification, including chimeric or humanized antibodies. Frequently, the polypeptides and antibodies will be labeled by joining, either covalently or non-covalently, a substance which provides for a detectable signal. A wide variety of labels and conjugation techniques are 35 known and are reported extensively in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent moieties, chemiluminescent moieties, magnetic particles, and the like. Patents, teaching the use of such labels include U.S.
Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345;
4,277,437; 4,275,149; and 4,366,241. Also, recombinant or chimeric immunoglobulins may be produced, see Cabilly, U.S.
Patent No. 4,816,567; or made in transgenic mice, see Mendez, et al. (1997) Nature Genetics 15:146-156. These references are incorporated herein by reference.
The antibodies of this invention can also be used for 10~ affinity chromatography in isolating DCRS3 proteins or peptides.
Columns can be prepared where the antibodies are linked to a solid support, e.g., particles, such as agarose, Sephadex, or the like, where a cell lysate may be passed through the column, the column washed, followed by increasing concentrations of a mild denaturant, whereby the purified protein will be released.
Alternatively, the protein may be used to purify antibody.
Appropriate cross absorptions or depletions may be applied.
The antibodies may also be used to screen expression libraries for particular expression products. Usually the antibodies used in such a procedure will be labeled with a moiety allowing easy detection of presence of antigen by antibody binding.
Antibodies raised against a cytokine receptor will also be used to raise anti-idiotypic antibodies. These will be useful in detecting or diagnosing various immunological conditions related to expression of the protein or cells which express the protein. They also will be useful as agonists or antagonists of the ligand, which may be competitive inhibitors or substitutes for naturally occurring ligands.
A cytokine receptor protein that specifically binds to or that is specifically immunoreactive with an antibody generated against a defined immunogen, such as an immunogen consisting of the amino acid sequence of SEQ ID NO: 2, 25, 5, 28, or 31, is typically determined in an immunoassay. The immunoassay typically uses a polyclonal antiserum which was raised, e.g., to a protein of SEQ ID NO: 2, 25, 5, 28, or 31. This antiserum is selected to have low crossreactivity against other cytokine receptor family members, e.g., IL-11 receptor subunit alpha, IL-6 receptor subunit alpha, or p40, preferably from the same species, and any such crossreactivity is removed by immunoabsorption prior to use in the immunoassay.
In order to produce antisera for use in an immunoassay, the protein, e.g., of SEQ ID NO: 2, 25, 5, 28, or 31, is isolated as described herein. For example, recombinant protein may be produced in a mammalian cell line. An appropriate host, e.g., an inbred strain of mice such as Balb/c, is immunized with the 10~ selected protein, typically using a standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization protocol (see Harlow and Lane, supra). Alternatively, a synthetic peptide derived from the sequences disclosed herein and conjugated to a carrier protein can be used an immunogen.
Polyclonal sera are collected and titered against the immunogen protein in an immunoassay, e.g., a solid phase immunoassay with the immunogen immobilized on a solid support. Polyclonal antisera with a titer of 104 or greater are selected and tested for their cross reactivity against other cytokine receptor family members, e.g., IL-2, IL-7, IL-9, or EPO receptor subunit, using a competitive binding immunoassay such as the one described in Harlow and Lane, supra, at pages 570-573.
Preferably at least two cytokine receptor family members are used in this determination. These cytokine receptor family members can be produced as recombinant proteins and isolated using standard molecular biology and protein chemistry techniques as described herein.
Immunoassays in the competitive binding format can be used for the crossreactivity determinations. For example, the protein of SEQ ID N0: 2, 25, 5, 28, or 31 can be immobilized to a solid support. Proteins added to the assay compete with the binding of the antisera to the immobilized antigen. The ability of the above proteins to compete with the binding of the antisera to the immobilized protein is compared to the proteins, e.g., of IL-2, IL-7, IL-9, or EPO receptor subunit. The percent crossreactivity for the above proteins is calculated, using standard calculations. Those antisera with less than 10°s crossreactivity with each of the proteins listed above are selected and pooled. The cross-reacting antibodies are then removed from the pooled antisera by immunoabsorption with the above-listed proteins.
The immunoabsorbed and pooled antisera are then used in a competitive binding immunoassay as described above to compare a second protein to the immunogen protein (e. g., DCRS3 like protein of SEQ ID NO: 2). In order to make this comparison, the two proteins are each assayed at a wide range of concentrations 10~ and the amount of each protein required to inhibit 50°s of the binding of the antisera to the immobilized protein is determined. If the amount of the second protein required is less than twice the amount of the protein of the selected protein or proteins that is required, then the second protein is said to specifically bind to an antibody generated to the immunogen.
It is understood that these cytokine receptor proteins are members of a family of homologous proteins that comprise at least 6 so far identified genes. For a particular gene product, such as a DCRS3 or DCRS4, the term refers not only to the amino acid sequences disclosed herein, but also to other proteins that are allelic, non-allelic, or species variants. It is also understood that the terms include nonnatural mutations introduced by deliberate mutation using conventional recombinant technology such as single site mutation, or by excising short sections of DNA encoding the respective proteins, or by substituting new amino acids, or adding new amino acids. Such minor alterations typically will substantially maintain the immunoidentity of the original molecule and/or its biological activity. Thus, these alterations include proteins that are specifically immunoreactive with a designated naturally occurring DCRS3 or DCRS4 protein. The biological properties of the altered proteins can be determined by expressing the protein in an appropriate cell line and measuring the appropriate effect, e.g., upon transfected lymphocytes. Particular protein modifications considered minor would include conservative substitution of amino acids with similar chemical properties, as described above for the cytokine receptor family as a whole. By aligning a protein optimally with the protein of the cytokine receptors and by using the conventional immunoassays described herein to determine immunoidentity, one can determine the protein compositions of the invention.
VII. Kits and quantitation Both naturally occurring and recombinant forms of~the cytokine receptor like molecules of this invention are 10~ particularly useful in kits and assay methods. For example, these methods would also be applied to screening for binding activity, e.g., ligands for these proteins. Several methods of automating assays have been developed in recent years so as to permit screening of tens of thousands of compounds per year.
See, e.g., a BIOMEK automated workstation, Beckman Instruments, Palo Alto, California, and Fodor, et al. (1991) Science 251:767-773, which is incorporated herein by reference. The latter describes means for testing binding by a plurality of defined polymers synthesized on a solid substrate. The development of suitable assays to screen for a ligand or agonist/antagonist homologous proteins can be greatly facilitated by the availability of large amounts of purified, soluble cytokine receptors in an active state such as is provided by this invention.
Purified DCRS3 or DCRS4 can be coated directly onto plates for use in the aforementioned ligand screening techniques.
However, non-neutralizing antibodies to these proteins can be used as capture antibodies to immobilize the respective receptor on the solid phase, useful, e.9., in diagnostic uses.
This invention also contemplates use of DCRS3 or DCRS4, fragments thereof, peptides, and their fusion products in a variety of diagnostic kits and methods for detecting the presence of the protein or its ligand. Alternatively, or additionally, antibodies against the molecules may be incorporated into the kits and methods. Typically the kit will have a compartment containing either a DCRS3 or DCRS4 peptide or gene segment or a reagent which recognizes one or the other.
Typically, recognition reagents, in the case of peptide, would be a receptor or antibody, or in the case of a gene segment, would usually be a hybridization probe.
A preferred kit for determining the concentration of, e.g., 5 DCRS3 in a sample would typically comprise a labeled compound, e.g., ligand or antibody, having known binding affinity for DCRS3, a source of DCRS3 (naturally occurring or recombinant) as a positive control, and a means for separating the bound from free labeled compound, for example a solid phase for 10~ immobilizing DCRS3 in the test sample. Compartments containing reagents, and instructions, will normally be provided.
Appropriate nucleic acid or protein containing kits are also provided.
Antibodies, including antigen binding fragments, specific 15 for mammalian DCRS3 or a peptide fragment, or receptor fragments are useful in diagnostic applications to detect the presence of elevated levels of ligand and/or its fragments. Diagnostic assays may be homogeneous (without a separation step between free reagent and antibody-antigen complex) or heterogeneous 20 (with a separation step). Various commercial assays exist, such as radioimmunoassay (RIA), enzyme-linked immunosorbent assay (ELISA), enzyme immunoassay (EIA), enzyme-multiplied immunoassay technique (EMIT), substrate-labeled fluorescent immunoassay (SLFIA) and the like. For example, unlabeled antibodies can be 25 employed by using a second antibody which is labeled and which recognizes the antibody to a cytokine receptor or to a particular fragment thereof. These assays have also been extensively discussed in the literature. See, e.g., Harlow and Lane (1988) Antibodies: A Laboratory Manual, CSH., and Coligan 30 (ed. 1991 and periodic supplements) Current Protocols In Immunoloctv Greene/Wiley, New York.
Anti-idiotypic antibodies may have similar use to serve as agonists or antagonists of cytokine receptors. These should be useful as therapeutic reagents under appropriate circumstances.
35 Frequently, the reagents for diagnostic assays are supplied in kits, so as to optimize the sensitivity of the assay. For the subject invention, depending upon the nature of the assay, the protocol, and the label, either labeled or unlabeled antibody, or labeled ligand is provided. This is usually in conjunction with other additives, such as buffers, stabilizers, materials necessary for signal production such as substrates for enzymes, and the like. Preferably, the kit will also contain instructions for proper use and disposal of the contents after use. Typically the kit has compartments for each useful reagent, and will contain instructions for proper use and disposal of reagents. Desirably, the reagents are provided as a 10~ dry lyophilized powder, where the reagents may be reconstituted in an aqueous medium having appropriate concentrations for performing the assay.
The aforementioned constituents of the diagnostic assays may be used without modification or may be modified in a variety of ways. For example, labeling may be achieved by covalently or non-covalently joining a moiety which directly or indirectly provides a detectable signal. In many of these assays, a test compound, cytokine receptor, or antibodies thereto can be labeled either directly or indirectly. Possibilities for direct labeling include label groups: radiolabels such as 125I, enzymes (U. S. Pat. No. 3,645,090) such as peroxidase and alkaline phosphatase, and fluorescent labels (U. S. Pat. No. 3,940,475) capable of monitoring the change in fluorescence intensity, wavelength shift, or fluorescence polarization. Both of the patents are incorporated herein by reference. Possibilities for indirect labeling include biotinylation of one constituent followed by binding to avidin coupled to one of the above label groups.
There are also numerous methods of separating the bound from the free ligand, or alternatively the bound from the free test compound. The cytokine receptor can be immobilized on various matrixes followed by washing. Suitable matrices include plastic such as an ELISA plate, filters, and beads. Methods of immobilizing the receptor to a matrix include, without limitation, direct adhesion to plastic, use of a capture antibody, chemical coupling, and biotin-avidin. The last step in this approach involves the precipitation of antibody/antigen complex by any of several methods including those utilizing, e.g., an organic solvent such as polyethylene glycol or a salt such as ammonium sulfate. Other suitable separation techniques include, without limitation, the fluorescein antibody magnetizable particle method described in Rattle, et al. (1984) Clin. Chem. 30:1457-1461, and the double antibody magnetic particle separation as described in U.S. Pat. No. 4,659,678, each of which is incorporated herein by reference.
The methods for linking protein or fragments to various 10~ labels have been extensively reported in the literature and do not require detailed discussion here. Many of the techniques involve the use of activated carboxyl groups either through the use of carbodiimide or active esters to form peptide bonds, the formation of thioethers by reaction of a mercapto group with an activated halogen such as chloroacetyl, or an activated olefin such as maleimide, for linkage, or the like. Fusion proteins will also find use in these applications.
Another diagnostic aspect of this invention involves use of oligonucleotide or polynucleotide sequences taken from the sequence of an cytokine receptor. These sequences can be used as probes for detecting levels of the respective cytokine receptor in patients suspected of having an immunological disorder. The preparation of both RNA and DNA nucleotide sequences, the labeling of the sequences, and the preferred size of the sequences has received ample description and discussion in the literature. Normally an oligonucleotide probe should have at least about 14 nucleotides, usually at least about 18 nucleotides, and the polynucleotide probes may be up to several kilobases. Various labels may be employed, most commonly radionuclides, particularly 32P. However, other techniques may also be employed, such as using biotin modified nucleotides for introduction into a polynucleotide. The biotin then serves as the site for binding to avidin or antibodies, which may be labeled with a wide variety of labels, such as radionuclides, fluorescers, enzymes, or the like. Alternatively, antibodies may be employed which can recognize specific duplexes, including DNA duplexes, RNA duplexes, DNA-RNA hybrid duplexes, or DNA-protein duplexes. The antibodies in turn may be labeled and the assay carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. The use of probes to the novel anti-sense RNA may be carried out in conventional techniques such as nucleic acid hybridization, plus and minus screening, recombinational probing, hybrid released translation (HRT), and hybrid arrested translation (HART). This also includes amplification techniques such as polymerase chain 10~ reaction (PCR).
Diagnostic kits which also test for the qualitative or quantitative presence of other markers are also contemplated.
Diagnosis or prognosis may depend on the combination of multiple indications used as markers. Thus, kits may test for combinations of markers. See, e.9., Viallet, et al. (1989) Progress in Growth Factor Res. 1:89-97.
VIII. Therapeutic Utility This invention provides reagents with significant therapeutic value. See, e.g., Levitzk_i (1996) Curr. Opin. Cell Biol. 8:239-244. The cytokine receptors (naturally occurring or recombinant), fragments thereof, mutein receptors, and antibodies, along with compounds identified as having binding affinity to the receptors or antibodies, should be useful in the treatment of conditions exhibiting abnormal expression of the receptors of their ligands. Such abnormality will typically be manifested by immunological disorders. Additionally, this invention should provide therapeutic value in various diseases or disorders associated with abnormal expression or abnormal triggering of response to the ligand. For example, the IL-1 ligands have been suggested to be involved in morphologic development, e.g., dorso-ventral polarity determination, and immune responses, particularly the primitive innate responses.
See, e.g., Sun, et al. (1991) Eur. J. Biochem. 196:247-254; and Hultmark (1994) Nature 367:116-117.
Recombinant cytokine receptors, muteins, agonist or antagonist antibodies thereto, or antibodies can be purified and then administered to a patient. These reagents can be combined for therapeutic use with additional active ingredients, e.g., in conventional pharmaceutically acceptable carriers or diluents, along with physiologically innocuous stabilizers and excipients.
These combinations can be sterile, e.g., filtered, and placed into dosage forms as by lyophilization in dosage vials or storage in stabilized aqueous preparations. This invention also contemplates use of antibodies or. binding fragments thereof which are not complement binding.
10~ Ligand screening using cytokine receptor or fragments thereof can be performed to identify molecules having binding affinity to the receptors. Subsequent biological assays can then be utilized to determine if a putative ligand can provide competitive binding, which can block intrinsic stimulating activity. Receptor fragments can be used as a blocker or antagonist in that it blocks the activity of ligand. Likewise, a compound having intrinsic stimulating activity can activate the receptor and is thus an agonist in that it simulates the activity of ligand, e.g., inducing signaling. This invention further contemplates the therapeutic use of antibodies to cytokine receptors as antagonists.
The quantities of reagents necessary for effective therapy will depend upon many different factors, including means of administration, target site, reagent physiological life, pharmacological life, physiological state of the patient, and other medicants administered. Thus, treatment dosages should be titrated to optimize safety and efficacy. Typically, dosages used in vitro may provide useful guidance in the amounts useful for in situ administration of these reagents. Animal testing of effective doses for treatment of particular disorders will provide further predictive indication of human dosage. Various considerations are described, e.g., in Gilman, et al. (eds.
1990) Goodman and Gilman's~ The Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Reminaton's Pharmaceutical Sciences, 17th ed. (1990), Mack Publishing Co., Easton, Penn.; each of which is hereby incorporated herein by reference. Methods for administration are discussed therein and below, e.g., for oral, intravenous, intraperitoneal, or intramuscular administration, transdermal diffusion, and others.
Pharmaceutically acceptable carriers will include water, saline, buffers, and other compounds described, e.g., in the Merck 5 Index, Merck & Co., Rahway, New Jersey. Because of the likely high affinity binding, or turnover numbers, between a putative ligand and its receptors, low dosages of these reagents would be initially expected to be effective. And the signaling pathway suggests extremely low amounts of ligand may have effect. Thus, 10~ dosage ranges would ordinarily be expected to be in amounts lower than 1 mM concentrations, typically less than about 10 ~M
concentrations, usually less than about 100 nM, preferably less than about 10 pM (picomolar), and most preferably less than about 1 fM (femtomolar), with an appropriate carrier. Slow 15 release formulations, or slow release apparatus will often be utilized for continuous administration.
Cytokine receptors, fragments thereof, and antibodies or its fragments, antagonists, and agonists, may be administered directly to the host to be treated or, depending on the size of 20 the compounds, it may be desirable to conjugate them to carrier proteins such as ovalbumin or serum albumin prior to their administration. Therapeutic formulations may be administered in many conventional dosage formulations. While it is possible for the active ingredient to be administered alone, it is preferable 25 to present it as a pharmaceutical formulation. Formulations comprise at least one active ingredient, as defined above, together with one or more acceptable carriers thereof. Each carrier must be both pharmaceutically and physiologically acceptable in the sense of being compatible with the other 30 ingredients and not injurious to the patient. Formulations include those suitable for oral, rectal, nasal, or parenteral (including subcutaneous, intramuscular, intravenous and intradermal) administration. The formulations may conveniently be presented in unit dosage form and may be prepared by methods 35 well known in the art of pharmacy. See, e.g., Gilman, et al.
(eds. 1990) Goodman and Gilman's: The Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Reminaton's Pharmaceutical Sciences, 17th ed. (1990), Mack Publishing Co., Easton, Penn.; Avis, et al. (eds. 1993) Pharmaceutical Dosage Forms: Parenteral Medications Dekker, NY; Lieberman, et al.
(eds. 1990) Pharmaceutical Dosage Forms: Tablets Dekker, NY; and Lieberman, et al. (eds. 1990) Pharmaceutical Dosage Forms:
Disperse Systems Dekker, NY. The therapy of this invention may be combined with or used in association with other therapeutic agents, particularly agonists or antagonists of other cytokine receptor family members.
10~
IX. Screening Drug screening using DCRS3 or DCRS4 or fragments thereof can be performed to identify compounds having binding affinity to the receptor subunit, including isolation of associated components. Subsequent biological assays can then be utilized to determine if the compound has intrinsic stimulating activity and is therefore a blocker or antagonist in that it blocks the activity of the ligand. Likewise, a compound having intrinsic stimulating activity can activate the receptor and is thus an agonist in that it simulates the activity of a cytokine ligand.
This invention further contemplates the therapeutic use of antibodies to the receptor as cytokine agonists or antagonists.
Similarly, complexes comprising multiple proteins may be used to screen for ligands or reagents capable of recognizing the complex. Most cytokine receptors comprise at least two subunits, which may be the same, or distinct. Alternatively, the transmembrane receptor may bind to a complex comprising a cytokine-like ligand associated with another soluble protein serving, e.g., as a second receptor subunit.
One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant DNA molecules expressing, e.g., a DCRS3 in combination with another cytokine receptor subunit. Cells may be isolated which express a receptor in isolation from other functional receptors. Such cells, either in viable or fixed form, can be used for standard antibody/antigen or ligand/receptor binding assays. See also, Parce, et al. (1989) Science 246:243-247; and Owicki, et al. (1990) Proc. Nat'1 Acad.
Sci. USA 87:4007-4011, which describe sensitive methods to detect cellular responses. Competitive assays are particularly useful, where the cells (source of putative ligand) are contacted and incubated with a labeled receptor or antibody having known binding affinity to the ligand, such as 125I_ antibody, and a test sample whose binding affinity to the binding composition is being measured. The bound and free labeled binding compositions are then separated to assess the degree of ligand binding. The amount of test compound bound is 10~ inversely proportional to the amount of labeled receptor binding to the known source. Many techniques can be used to separate bound from free ligand to assess the degree of ligand binding.
This separation step could typically involve a procedure such as adhesion to filters followed by washing, adhesion to plastic followed by washing, or centrifugation of the cell membranes.
Viable cells could also be used to screen for the effects of drugs on cytokine mediated functions, e.g., second messenger levels, i.e., Ca++; cell proliferation; inositol phosphate pool changes; and others. Some detection methods allow for elimination of a separation step, e.g., a proximity sensitive detection system. Calcium sensitive dyes will be useful for detecting Ca++ levels, with a fluorimeter or a fluorescence cell sorting apparatus.
X. Ligands The descriptions of DCRS3 or DCRS4 herein provide means to identify ligands, as described above. Such ligand should bind specifically to the respective receptor with reasonably high affinity. Various constructs are made available which allow either labeling of the receptor to detect its ligand. For example, directly labeling cytokine receptor, fusing onto it markers for secondary labeling, e.g., FLAG or other epitope tags, etc., will allow detection of receptor. This can be histological, as an affinity method for biochemical purification, or labeling or selection in an expression cloning approach. A two-hybrid selection system may also be applied making appropriate constructs with the available cytokine receptor sequences. See, e.g., Fields and Song (1989) Nature 340:245-246.
The broad scope of this invention is best understood with reference to the following examples, which are not intended to limit the inventions to the specific embodiments.
EXAMPLES
I. General Methods 10' Some of the standard methods are described or referenced, e.g., in Maniatis, et al. (1982) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor Press;
Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual, (2d ed.), vols. 1-3, CSH Press, NY; or Ausubel, et al. (1987 and Supplements) Current Protocols in Molecular Biology, Greene/Wiley, New York. Methods for protein purification include such methods as ammonium sulfate precipitation, column chromatography, electrophoresis, centrifugation, crystallization, and others. See, e.g., Ausubel, et al. (1987 and periodic supplements); Coligan, et al. (ed. 1996) and periodic supplements, Current Protocols In Protein Science Greene/Wiley, New York; Deutscher (1990) "Guide to Protein Purification" in Methods in Enzymoloay, vol. 182, and other volumes in this series; and manufacturer's literature on use of protein purification products, e.g., Pharmacia, Piscataway, N.J., or Bio-Rad, Richmond, CA. Combination with recombinant techniques allow fusion to appropriate segments, e.g., to a FLAG
sequence or an equivalent which can be fused via a protease-removable sequence. See, e.g., Hochuli (1990) "Purification of Recombinant Proteins with Metal Chelate Absorbent" in Setlow (ed.) Genetic Engineering, Principle and Methods 12:87-98, Plenum Press, N.Y.; and Crowe, et al. (1992) QIAexpress: The High Level Expression & Protein Purification System QUIAGEN, Inc., Chatsworth, CA.
Computer sequence analysis is performed, e.g., using available software programs, including those from the GCG (U.
Wisconsin) and GenBank sources. Public sequence databases were also used, e.g., from GenBank and others.
Many techniques applicable to IL-10 receptors may be applied to DCRS3 or DCRS4, as described, e.g., in USSN
08/110,683 (IL-10 receptor), which is incorporated herein by reference.
II. Computational Analysis Human sequences related to cytokine receptors were identified from genomic sequence database using, e.g., the BLAST
5 server (Altschul, et al. (1994) Nature Genet. 6:119-129).
Standard analysis programs may be used to evaluate structure, e.g., PHD (Rost and Sander (1994) Proteins 19:55-72) and DSC
(King and Sternberg (1996) Protein Sci. 5:2298-2310). Standard comparison software includes, e.9., Altschul, et al. (1990) J.
10~ Mol. Biol. 215:403-10; Waterman (1995) Introduction to Computational Bioloq_y~ Maps Seauences and Genomes Chapman &
Hall; Lander and Waterman (eds. 1995) Calculatin4 the Secrets of Life Applications of the Mathematical Sciences in Molecular Biolocry National Academy Press; and Speed and Waterman (eds.
15 1996) Genetic Ma~pina and DNA Seauencing (IMA Volumes in Mathematics and Its Applications, Vol 81) Springer Verlag.
III. Cloning of full-length DCRS3 or DCRS4 cDNAs; Chromosomal localization 20 PCR primers derived from DCRS3 or DCRS4 sequence are used to probe a human cDNA library. Sequences may be derived, e.g., from Table 1 or 3, preferably those adjacent the ends of sequences. Full length cDNAs for primate, rodent, or other species are cloned, e.g., by DNA hybridization screening of 25 7~gt10 phage. PCR reactions are conducted using T. aquaticus Taqplus DNA polymerase (Stratagene) under appropriate conditions.
For experimental confirmation of localization, chromosome spreads are prepared. In situ hybridization is performed on 30 chromosome preparations obtained from phytohemagglutinin-stimulated human lymphocytes cultured for 72 h. 5-bromodeoxyuridine was added for the final seven hours of culture (60 ~g/ml of medium), to ensure a posthybridization chromosomal banding of good quality.
35 A PCR fragment, amplified with the help of primers, is cloned into an appropriate vector. The vector is labeled by nick-translation with 3H. The radiolabeled probe is hybridized to metaphase spreads at final concentration of 200 ng/ml of hybridization solution as described in Mattei, et al. (1985) Hum. Genet. 69:327-331.
After coating with nuclear track emulsion (KODAK NTB2), slides are exposed. To avoid any slipping of silver grains during the banding procedure, chromosome spreads are first stained with buffered Giemsa solution and metaphase photographed. R-banding is then performed by the fluorochrome-photolysis-Giemsa (FPG) method and metaphases rephotographed before analysis.
10~ Similar appropriate methods are used for other species.
IV. Localization of DCRS mRNA
Human multiple tissue (Cat# 1, 2) and cancer cell line blots (Cat# 7757-1), containing approximately 2 ~g of poly(A)+
RNA per lane, are purchased from Clontech (Palo Alto, CA).
Probes are radiolabeled with [a-32P] dATP, e.g., using the Amersham Rediprime random primer labeling kit (RPN1633).
Prehybridization and hybridizations are performed, e.g., at 65°
C in 0.5 M Na2HP04, 7% SDS, 0.5 M EDTA (pH 8.0). High stringency washes are conducted, e.g., at 65° C with two initial washes in 2 x SSC, 0.1% SDS for 40 min followed by a subsequent wash in 0.1 x SSC, O.lo SDS for 20 min. Membranes are then exposed at -70° C to X-Ray film (Kodak) in the presence of intensifying screens. More detailed studies by cDNA library Southerns are performed with selected appropriate human DCRS3 clones to examine their expression in hemopoietic or other cell subsets.
Alternatively, two appropriate primers are selected from Tables 1 or 3. RT-PCR is used on an appropriate mRNA sample selected for the presence of message to produce a cDNA, e.g., a sample which expresses the gene.
Full length clones may be isolated by hybridization of cDNA
libraries from appropriate tissues pre-selected by PCR signal.
Northern blots can be performed.
Message for genes encoding DCRS3 or DCRS4 will be assayed by appropriate technology, e.g., PCR, immunoassay, hybridization, or otherwise. Tissue and organ cDNA preparations are available, e.g., from Clontech, Mountain View, CA.
Identification of sources of natural expression are useful, as described. And the identification of functional receptor subunit pairings will allow for prediction of what cells express the combination of receptor subunits which will result in a physiological responsiveness to each of the cytokine ligands.
For mouse distribution, e.g., Southern Analysis can be performed: DNA (5 fig) from a primary amplified cDNA library was digested with appropriate restriction enzymes to release the 10~ inserts, run on a to agarose gel and transferred to a nylon membrane (Schleicher and Schuell, Keene, NH).
Samples for mouse mRNA isolation may include: resting mouse fibroblastic L cell line (C200); Braf:ER (Braf fusion to estrogen receptor) transfected cells, control (C201); T cells, TH1 polarized (Me114 bright, CD4+ cells from spleen, polarized for 7 days with IFN-y and anti IL-4; T200); T cells, TH2 polarized (Me114 bright, CD4+ cells from spleen, polarized for 7 days with IL-4 and anti-IFN-y; T201); T cells, highly TH1 polarized (see Openshaw, et al. (1995) J. Exp. Med. 182:1357-1367; activated with anti-CD3 for 2, 6, 16 h pooled; T202); T
cells, highly TH2 polarized (see Openshaw, et al. (1995) J. Exp.
Med. 182:1357-1367; activated with anti-CD3 for 2, 6, 16 h pooled; T203); CD44- CD25+ pre T cells, sorted from thymus (T204); TH1 T cell clone D1.1, resting for 3 weeks after last stimulation with antigen (T205); TH1 T cell clone D1.1, 10 ~g/ml ConA stimulated 15 h (T206); TH2 T cell clone CDC35, resting for 3 weeks after last stimulation with antigen (T207); TH2 T cell clone CDC35, 10 ~g/ml ConA stimulated 15 h (T208); Me114+ naive T cells from spleen, resting (T209); Me114+ T cells, polarized to Thl with IFN-y/IL-12/anti-IL-4 for 6, 12, 24 h pooled (T210);
Me114+ T cells, polarized to Th2 with IL-4/anti-IFN-y for 6, 13, 24 h pooled (T211); unstimulated mature B cell leukemia cell line A20 (B200); unstimulated B cell line CH12 (B201);
unstimulated large B cells from spleen (B202); B cells from total spleen, LPS activated (B203); metrizamide enriched dendritic cells from spleen, resting (D200); dendritic cells from bone marrow, resting (D201); monocyte cell line RAW 264.7 activated with LPS 4 h (M200); bone-marrow macrophages derived with GM and M-CSF (M201); macrophage cell line J774, resting (M202); macrophage cell line J774 + LPS + anti-IL-10 at 0.5, 1, 3, 6, 12 h pooled (M203); macrophage cell line J774 + LPS + IL-10 at 0.5, 1, 3, 5, 12 h pooled(M204); aerosol challenged mouse lung tissue, Th2 primers, aerosol OVA challenge 7, 14, 23 h pooled (see Garlisi, et al. (1995) Clinical Immunology and Immunopathologv 75:75-83; X206); Nippostrongulus-infected lung tissue (see Coffman, et al. (1989) Science 245:308-310; X200);
10~ total adult lung, normal (0200); total lung, rag-1 (see Schwarz, et al. (1993) Immunodeficiency 4:249-252; 0205); IL-10 K.O.
spleen (see Kuhn, et al. (1991) Cell 75:263-274; X201); total adult spleen, normal (0201); total spleen, rag-1 (0207); IL-10 K.O. Peyer's patches (0202); total Peyer's patches, normal (0210); IL-10 K.O. mesenteric lymph nodes (X203); total mesenteric lymph nodes, normal (0211); IL-10 K.O. colon (X203);
total colon, normal (0212); NOD mouse pancreas (see Makino, et al. (1980) Jikken Dobutsu 29:1-13; X205); total thymus, rag-1 (0208); total kidney, rag-1 (0209); total heart, rag-1 (0202);
total brain, rag-1 (0203); total testes, rag-1 (0204); total liver, rag-1 (0206); rat normal joint tissue (0300); and rat arthritic joint tissue (X300).
Samples for human mRNA isolation may include: peripheral blood mononuclear cells (monocytes, T cells, NK cells, granulocytes, B cells), resting (T100); peripheral blood mononuclear cells, activated with anti-CD3 for 2, 6, 12 h pooled (T101); T cell, THO clone Mot 72, resting (T102); T cell, THO
clone Mot 72, activated with anti-CD28 and anti-CD3 for 3, 6, 12 h pooled (T103); T cell, THO clone Mot 72, anergic treated with specific peptide for 2, 7, 12 h pooled (T104); T cell, TH1 clone HY06, resting (T107); T cell, TH1 clone HY06, activated with anti-CD28 and anti-CD3 for 3, 6, 12 h pooled (T108); T cell, TH1 clone HY06, anergic treated with specific peptide for 2, 6, 12 h pooled (T109); T cell, TH2 clone HY935, resting (T110); T cell, TH2 clone HY935, activated with anti-CD28 and anti-CD3 for 2, 7, 12 h pooled (T111); T cells CD4+CD45R0- T cells polarized 27 days in anti-CD28, IL-4, and anti IFN-y, TH2 polarized, activated with anti-CD3 and anti-CD28 4 h (T116); T cell tumor lines Jurkat and Hut78, resting (T117); T cell clones, pooled AD130.2, Tc783.12, Tc783.13, Tc783.58, Tc782.69, resting (T118); T cell random ys T cell clones, resting (T119); Splenocytes, resting (B100); Splenocytes, activated with anti-CD40 and IL-4 (B101); B
cell EBV lines pooled WT49, RSB, JY, CVIR, 721.221, RM3, HSY, resting (B102); B cell line JY, activated with PMA and ionomycin for 1, 6 h pooled (B103); NK 20 clones pooled, resting (K100);
NK 20 clones pooled, activated with PMA and ionomycin for 6 h 10~ (K101); NKL clone, derived from peripheral blood of LGL leukemia patient, IL-2 treated (K106); NK cytotoxic clone 640-A30-1, resting (K107); hematopoietic precursor line TF1, activated with PMA and ionomycin for 1, 6 h pooled (C100); U937 premonocytic line, resting (M100); U937 premonocytic line, activated with PMA
and ionomycin for 1, 6 h pooled (M101); elutriated monocytes, activated with LPS, IFNy, anti-IL-10 for 1, 2, 6, 12, 24 h pooled (M102); elutriated monocytes, activated with LPS, IFNy, IL-10 for l, 2, 6, 12, 24 h. pooled (M103); elutriated monocytes, activated with LPS, IFNy, anti-IL-10 for 4, 16 h pooled (M106); elutriated monocytes, activated with LPS, IFNy, IL-10 for 4, 16 h pooled (M107); elutriated monocytes, activated LPS for 1 h (M108);
elutriated monocytes, activated LPS for 6 h (M109); DC 70%
CDla+, from CD34+ GM-CSF, TNFa 12 days, resting (D101); DC 70%
CDla+, from CD34+ GM-CSF, TNFa 12 days, activated with PMA and ionomycin for 1 hr (D102); DC 70% CDla+, from CD34+ GM-CSF, TNFa 12 days, activated with PMA and ionomycin for 6 hr (D103); DC
95% CDla+, from CD34+ GM-CSF, TNFa 12 days FACS sorted, activated with PMA and ionomycin for 1, 6 h pooled (D104); DC
95% CD14+, ex CD34+ GM-CSF, TNFa 12 days FACS sorted, activated with PMA and ionomycin 1, 6 hr pooled (D105); DC CDla+ CD86+, from CD34+ GM-CSF, TNFa 12 days FACS sorted, activated with PMA
and ionomycin for l, 6 h pooled (D106); DC from monocytes GM-CSF, IL-4 5 days, resting (D107); DC from monocytes GM-CSF, IL-4 5 days, resting (D108); DC from monocytes GM-CSF, IL-4 5 days, activated LPS 4, 16 h pooled (D109); DC from monocytes GM-CSF, IL-4 5 days, activated TNFa, monocyte supe for 4, 16 h pooled (D110); leiomyoma L11 benign tumor (X101); normal myometrium M5 (0115); malignant leiomyosarcoma GS1 (X103); lung fibroblast sarcoma line MRC5, activated with PMA and ionomycin for 1, 6 h pooled (C101); kidney epithelial carcinoma cell line CHA, activated with PMA and ionomycin for 1, 6 h pooled (C102);
5 kidney fetal 28 wk male (0100); lung fetal 28 wk male (0101);
liver fetal 28 wk male (0102); heart fetal 28 wk male (0103);
brain fetal 28 wk male (0104); gallbladder fetal 28 wk male (0106); small intestine fetal 28 wk male (0107); adipose tissue fetal 28 wk male (0108); ovary fetal 25 wk female (0109); uterus 10~ fetal 25 wk female (0110); testes fetal 28 wk male (0111);
spleen fetal 28 wk male (0112); adult placenta 28 wk (0113); and tonsil inflamed, from 12 year old (X100).
Similar samples may isolated in other species for evaluation.
V. Cloning of species counterparts of DCRS3 or DCRS4 Various strategies are used to obtain species counterparts of DCRS3 or DCRS4, preferably from other primates or rodents.
One method is by cross hybridization using closely related species DNA probes. It may be useful to go into evolutionarily similar species as intermediate steps. Another method is by using specific PCR primers based on the identification of blocks of similarity or difference between genes, e.g., areas of highly conserved or nonconserved polypeptide or nucleotide sequence.
Antibody based screening methods are also available, e.g., in expression cloning.
VI. Production of mammalian DCRS3 or DCRS4 protein An appropriate, e.g., GST, fusion construct is engineered for expression, e.g., in E. coli. For example, a mouse IGIF
pGex plasmid is constructed and transformed into E. coli.
Freshly transformed cells are grown, e.g., in LB medium containing 50 ~g/ml ampicillin and induced with IPTG (Sigma, St.
Louis, MO). After overnight induction, the bacteria are harvested and the pellets containing, e.g., DCRS3, protein are isolated. The pellets are homogenized, e.9., in TE buffer (50 mM Tris-base pH 8.0, 10 mM EDTA and 2 mM pefabloc) in 2 liters.
This material is passed through a microfluidizer (Microfluidics, Newton, MA) three times. The fluidized supernatant is spun down on a Sorvall GS-3 rotor for 1 h at 13,000 rpm. The resulting supernatant containing the cytokine receptor protein is filtered and passed over a glutathione-SEPHAROSE column equilibrated in 50 mM Tris-base pH 8Ø The fractions containing the DCRS3-GST
fusion protein are pooled and cleaved, e.g., with thrombin (Enzyme Research Laboratories, Inc., South Bend, IN). The cleaved pool is then passed over a Q-SEPHAROSE column 10~ equilibrated in 50 mM Tris-base. Fractions containing DCRS3 are pooled and diluted in cold distilled H20, to lower the conductivity, and passed back over a fresh Q-Sepharose column, alone or in succession with an immunoaffinity antibody column.
Fractions containing DCRS3 protein are pooled, aliquoted, and stored in the -70° C freezer.
Comparison of the CD spectrum with cytokine receptor protein may suggest that the protein is correctly folded. See Hazuda, et al. (1969) J. Biol. Chem. 264:1689-1693.
VII. Preparation of antibodies specific for DCRS3 or DCRS4 Inbred Balb/c mice are immunized intraperitoneally with recombinant forms of the protein, e.g., purified DCRS3 or stable transfected NIH-3T3 cells. Animals are boosted at appropriate time points with protein, with or without additional adjuvant, to further stimulate antibody production. Serum is collected, or hybridomas produced with harvested spleens.
Alternatively, Balb/c mice are immunized with cells transformed with the gene or fragments thereof, either endogenous or exogenous cells, or with isolated membranes enriched for expression of the antigen. Serum is collected at the appropriate time, typically after numerous further administrations. Various gene therapy techniques may be useful, e.g., in producing protein in situ, for generating an immune response. Serum or antibody preparations may be cross-absorbed or immunoselected to prepare substantially purified antibodies of defined specificity and high affinity.
Monoclonal antibodies may be made. For example, splenocytes are fused with an appropriate fusion partner and hybridomas are selected in growth medium by standard procedures.
Hybridoma supernatants are screened for the presence of antibodies which bind to DCRS3, e.g., by ELISA or other assay.
Antibodies which specifically recognize specific DCRS3 embodiments may also be selected or prepared.
In another method, synthetic peptides or purified protein are presented to an immune system to generate monoclonal or 10~ polyclonal antibodies. See, e.g., Coligan (ed. 1991) Current Protocols in Immunoloay Wiley/Greene; and Harlow and Lane (1989) Antibodies: A Laboratory Manual Cold Spring Harbor Press. In appropriate situations, the binding reagent is either labeled as described above, e.g., fluorescence or otherwise, or immobilized to a substrate for panning methods. Nucleic acids may also be introduced into cells in an animal to produce the antigen, which serves to elicit an immune response. See, e.g., Wang, et al.
(1993) Proc. Nat'1. Acad. Sci. 90:4156-4160; Barry, et al.
(1994) BioTechnigues 16:616-619; and Xiang, et al. (1995) Immunity 2: 129-135.
VIII. Production of fusion proteins with DCRS
Various fusion constructs are made with DCRS. A portion of the appropriate gene is fused to an epitope tag, e.g., a FLAG
tag, or to a two hybrid system construct. See, e.g., Fields and Song (1989) Nature 340:245-246.
The epitope tag may be used in an expression cloning procedure with detection with anti-FLAG antibodies to detect a binding partner, e.g., ligand for the respective cytokine receptor. The two hybrid system may also be used to isolate proteins which specifically bind to DCRS.
IX. Structure activity relationship Information on the criticality of particular residues is determined using standard procedures and analysis. Standard mutagenesis analysis is performed, e.g., by generating many different variants at determined positions, e.g., at the positions identified above, and evaluating biological activities of the variants. This may be performed to the extent of determining positions which modify activity, or to focus on specific positions to determine the residues which can be substituted to either retain, block, or modulate biological activity.
Alternatively, analysis of natural variants can indicate what positions tolerate natural mutations. This may result from populational analysis of variation among individuals, or across 10~ strains or species. Samples from selected individuals are analyzed, e.g., by PCR analysis and sequencing. This allows evaluation of population polymorphisms.
X. Isolation of a ligand for DCRS
A cytokine receptor can be used as a specific binding reagent to 'identify its binding partner, by taking advantage of its specificity of binding, much like an antibody would be used.
The binding receptor may be a heterodimer of receptor subunits;
or may involve, e.g., a complex of the DCRS with another subunit. A binding reagent is either labeled as described above, e.g., fluorescence or otherwise, or immobilized to a substrate for panning methods.
The binding composition is used to screen an expression library made from a cell line which expresses a binding partner, i.e., ligand, preferably membrane associated. Standard staining techniques are used to detect or sort surface expressed ligand, or surface expressing transformed cells are screened by panning.
Screening of intracellular expression is performed by various staining or immunofluorescence procedures. See also McMahan, et al. (1991) EMBO J. 10:2821-2832.
For example, on day 0, precoat 2-chamber permanox slides with 1 ml per chamber of fibronectin, 10 ng/ml in PBS, for 30 min at room temperature. Rinse once with PBS. Then plate COS
cells at 2-3 x 105 cells per chamber in 1.5 ml of growth media.
Incubate overnight at 37°C.
On day 1 for each sample, prepare 0.5 ml of a solution of 66 ~g/ml DEAE-dextran, 66 ~M chloroquine, and 4 ~g DNA in serum free DME. For each set, a positive control is prepared, e.g., of DCRS-FLAG cDNA at 1 and 1/200 dilution, and a negative mock.
Rinse cells with serum free DME. Add the DNA solution and incubate 5 hr at 37°C. Remove the medium and add 0.5 ml 10%
DMSO in DME for 2.5 min. Remove and wash once with DME. Add 1.5 ml growth medium and incubate overnight.
On day 2, change the medium. On days 3 or 4, the cells are fixed and stained. Rinse the cells twice with Hank's Buffered Saline Solution (HBSS) and fix in 4% paraformaldehyde 10~ (PFA)/glucose for 5 min. Wash 3X with HBSS. The slides may be stored at -80°C after all liquid is removed. For each chamber, 0.5 ml incubations are performed as follows. Add HBSS/saponin (0.1%) with 32 ~1/ml of 1 M NaN3 for 20 min. Cells are then washed with HBSS/saponin 1X. Add appropriate DCRS or DCRS/antibody complex to cells and incubate for 30 min. wash cells twice with HBSS/saponin. If appropriate, add first antibody for 30 min. Add second antibody, e.g., Vector anti-mouse antibody, at 1/200 dilution, and incubate for 30 min.
Prepare ELISA solution, e.9., Vector Elite ABC horseradish peroxidase solution, and preincubate for 30 min. Use, e.g., 1 drop of solution A (avidin) and 1 drop solution B (biotin) per 2.5 ml HBSS/saponin. Wash cells twice with HBSS/saponin. Add ABC HRP solution and incubate for 30 min. Wash cells twice with HBSS, second wash for 2 min, which closes cells. Then add Vector diaminobenzoic acid (DAB) for 5 to 10 min. Use 2 drops of buffer plus 4 drops DAB plus 2 drops of H202 per 5 ml of glass distilled water. Carefully remove chamber and rinse slide in water. Air dry for a few minutes, then add 1 drop of Crystal Mount and a cover slip. Bake for 5 min at 85-90°C.
Evaluate positive staining of pools and progressively subclone to isolation of single genes responsible for the binding.
Alternatively, receptor reagents are used to affinity purify or sort out cells expressing a putative ligand. See, e.g., Sambrook, et al. or Ausubel, et al.
Another strategy is to screen for a membrane bound receptor by panning. The receptor cDNA is constructed as described above. Immobilization may be achieved by use of appropriate antibodies which recognize, e.g., a FLAG sequence of a DCRS
fusion construct, or by use of antibodies raised against the first antibodies. Recursive cycles of selection and 5 amplification lead to enrichment of appropriate clones and eventual isolation of receptor expressing clones.
Phage expression libraries can be screened by mammalian DCRS. Appropriate label techniques, e.g., anti-FLAG antibodies, will allow specific labeling of appropriate clones.
10~
All citations herein are incorporated herein by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.
15 Many modifications and variations of this invention can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. The specific embodiments described herein are offered by way of example only, and the invention is to be limited by the terms of the appended claims, 20 along with the full scope of equivalents to which such claims are entitled; and the invention is not to be limited by the specific embodiments that have been presented herein by way of example.
SEQUENCE LISTING
<110> Schering Corporation <120> Mammalian Receptor Proteins; Related Reagents and Methods <130> DX01086K PCT
<140>
<141>
<150> US 09/443,060 <151> 1999-11-18 <160> 32 <170> PatentIn Ver. 2.0 <210> 1 <211> 1707 <212> DNA
<213> primate; surmised Homo Sapiens <220>
<221> CDS
<222> (1)..(1704) <220>
<221> mat peptide <222> (61)..(1704) <400> 1 atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly gcc ctc gag ggg atg gag agg aag ctc tgc agt ccc aag cca ccc ccc 96 Ala Leu Glu Gly Met Glu Arg Lys Leu Cys Ser Pro Lys Pro Pro Pro acc aag gcc tct ctc ccc act gac cct cca ggc tgg ggc tgc ccc gac 144 Thr Lys Ala Ser Leu Pro Thr Asp Pro Pro Gly Trp Gly Cys Pro Asp ctc gtc tgc tac acc gat tac ctc cag acg gtc atc tgc atc ctg gaa 192 Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu atg tgg aac ctc cac ccc agc acg ctc acc ctt acc tgg ata ctt tct 240 Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Ile Leu Ser aat aat act ggg tgc tat atc aag gac aga aca ctg gac ctc agg caa 288 Asn Asn Thr Gly Cys Tyr Ile Lys Asp Arg Thr Leu Asp Leu Arg Gln gac cag tat gaa gag ctg aag gac gag gcc acc tcc tgc agc ctc cac 336 Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His agg tcg gcc cac aat gcc acg cat gcc acc tac acc tgc cac atg gat 384 Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp gta ttc cac ttc atg gcc gac gac att ttc agt gtc aac atc aca gac 432 Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp cag tct ggc aac tac tcc cag gag tgt ggc agc ttt ctc ctg get gag 480 Gln Ser Gly Asn Tyr Ser Gln Glu Cys Gly Ser Phe Leu Leu Ala Glu agc aga cag tat aat atc tcc tgg cgc tca gat tac gaa gac cct gcc 528 Ser Arg Gln Tyr Asn Ile Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala ttc tac atg ctg aag ggc aag ctt cag tat gag ctg cag tac agg aac 576 Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc tca gtg 624 Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa gac tcg 672 Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser agc tat gag ctg cag gtg cgg gca ggg ccc atg cct ggc tcc tcc tac 720 Ser Tyr Glu Leu Gln Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc ttt cag acc cag 768 Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Phe Gln Thr Gln tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt ctc ctc 816 Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag acc cat 864 Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc cct gag 912 Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc aag aaa 960 Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga ccc tgg 1008 Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac cca cca 1056 Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa cca gca 1104 Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg ccg aca 1152 Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat cgg cca 1200 Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca gag ggg 1248 Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca gcc ctg 1296 Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac cca ctc 1344 Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca get ggc 1392 Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga cta aag 1440 Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc tgg ggt 1488 Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca ccc ctg 1536 Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc tct gac 1584 Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac gaa gga 1632 Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg cca ctt 1680 Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu tcg agc cct gga ccc cag gcc agc taa 1707 Ser Ser Pro Gly Pro Gln Ala Ser <210> 2 <211> 568 <212> PRT
<213> primate; surmised Homo sapiens <400> 2 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly Ala Leu Glu Gly Met Glu Arg Lys Leu Cys Ser Pro Lys Pro Pro Pro Thr Lys Ala Ser Leu Pro Thr Asp Pro Pro Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Ile Leu Ser Asn Asn Thr Gly Cys Tyr Ile Lys Asp Arg Thr Leu Asp Leu Arg Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Glu Cys Gly Ser Phe Leu Leu Ala Glu Ser Arg Gln Tyr Asn Ile Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser Ser Tyr Glu Leu Gln Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Phe Gln Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser <210> 3 <211> 1704 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (6). (1704) <223> n may be a, c, g, or t <400> 3 atgccnmgng gntgggcngc nccnytnytn ytnytnytny tncarggngc nytngarggn 60 atggarmgna arytntgyws nccnaarccn ccnccnacna argcnwsnyt nccnacngay 120 ccnccnggnt ggggntgycc ngayytngtn tgytayacng aytayytnca racngtnath 180 tgyathytng aratgtggaa yytncayccn wsnacnytna cnytnacntg gathytnwsn 240 aayaayacng gntgytayat haargaymgn acnytngayy tnmgncarga ycartaygar 300 garytnaarg aygargcnac nwsntgywsn ytncaymgnw sngcncayaa ygcnacncay 360 gcnacntaya cntgycayat ggaygtntty cayttyatgg cngaygayat httywsngtn 420 aayathacng aycarwsngg naaytaywsn cargartgyg gnwsnttyyt nytngcngar 480 wsnmgncart ayaayathws ntggmgnwsn gaytaygarg ayccngcntt ytayatgytn 540 aarggnaary tncartayga rytncartay mgnaaymgng gngayccntg ggcngtnwsn 600 ccnmgnmgna arytnathws ngtngaywsn mgnwsngtnw snytnytncc nytngartty 660 mgnaargayw snwsntayga rytncargtn mgngcnggnc cnatgccngg nwsnwsntay 720 carggnacnt ggwsngartg gwsngayccn gtnathttyc aracncarws ngargarytn 780 aargarggnt ggaayccnca yytnytnytn ytnytnytny tngtnathgt nttyathccn 840 gcnttytggw snytnaarac ncayccnytn tggmgnytnt ggaaraarat htgggcngtn 900 ccnwsnccng armgnttytt yatgccnytn tayaarggnt gywsnggnga yttyaaraar 960 tgggtnggng cnccnttyac nggnwsnwsn ytngarytng gnccntggws nccngargtn 1020 ccnwsnacny tngargtnta ywsntgycay ccnccnmgnw snccngcnaa rmgnytncar 1080 ytnacngary tncargarcc ngcngarytn gtngarwsng ayggngtncc naarccnwsn 1140 ttytggccna cngcncaraa ywsnggnggn wsngcntayw sngargarmg ngaymgnccn 1200 tayggnytng tnwsnathga yacngtnacn gtnytngayg cngarggncc ntgyacntgg 1260 ccntgywsnt gygargayga yggntayccn gcnytngayy tngaygcngg nytngarccn 1320 wsnccnggny tngargaycc nytnytngay gcnggnacna cngtnytnws ntgyggntgy 1380 gtnwsngcng gnwsnccngg nytnggnggn ccnytnggnw snytnytnga ymgnytnaar 1440 ccnccnytng cngayggnga rgaytgggcn ggnggnytnc cntggggngg nmgnwsnccn 1500 ggnggngtnw sngarwsnga rgcnggnwsn ccnytngcng gnytngayat ggayacntty 1560 gaywsnggnt tygtnggnws ngaytgywsn wsnccngtng artgygaytt yacnwsnccn 1620 ggngaygarg gnccnccnmg nwsntayytn mgncartggg tngtnathcc nccnccnytn 1680 wsnwsnccng gnccncargc nwsn 1704 <210> 4 <211> 750 <212> DNA
<213> primate; surmised Homo Sapiens <220>
<221> CDS
<222> (1)..(747) <220>
<221> mat peptide <222> (64)..(747) <400> 4 atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa cct tat tac ggg agg agg ggc aaa aat aaa aat aaa ggg aat cct 336 Glu Pro Tyr Tyr Gly Arg Arg Gly Lys Asn Lys Asn Lys Gly Asn Pro tgg ggg cca aaa caa agt aaa cgg aaa tca aag ggg aac cag aag acc 384 Trp Gly Pro Lys Gln Ser Lys Arg Lys Ser Lys Gly Asn Gln Lys Thr aac aca gtg act gcc cca get gcc ctg aag gca ttt get gga tgt gca 432 Asn Thr Val Thr Ala Pro Ala Ala Leu Lys Ala Phe Ala Gly Cys Ala aaa ata gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg 480 Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu ttg gta att ctc cat get cca aat tta cca tat aga tac caa aag gaa 528 Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu aaa aat gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt 576 Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe ata att aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get 624 Ile Ile Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala cac aga gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt 672 His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys gta gtg get gaa ata tat cag ccc atg tta gac aga aga agt cag aga 720 Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg agt gaa gag aga tgt gtg gaa att cca tga 750 Ser Glu Glu Arg Cys Val Glu Ile Pro <210> 5 <211> 249 <212> PRT
<213> primate; surmised Homo sapiens <400> 5 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln Glu Pro Tyr Tyr Gly Arg Arg Gly Lys Asn Lys Asn Lys Gly Asn Pro Trp Gly Pro Lys Gln Ser Lys Arg Lys Ser Lys.Gly Asn Gln Lys Thr Asn Thr Val Thr Ala Pro Ala Ala Leu Lys Ala Phe Ala Gly Cys Ala Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu Glu Arg Cys Val Glu Ile Pro <210> 6 <211> 747 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (1). (747) <223> n may be a, c, g, or t <400> 6 atgatgccna arcaytgytt yytnggntty ytnathwsnt tyttyytnac nggngtngcn 60 ggnacncarw snacncayga rwsnytnaar ccncarmgng tncarttyca rwsnmgnaay 120 ttycayaaya thytncartg gcarccnggn mgngcnytna cnggnaayws nwsngtntay 180 ttygtncart ayaarathta yggncarmgn cartggaara ayaargarga ytgytggggn 240 acncargary tnwsntgyga yytnacnwsn garacnwsng ayathcarga rccntaytay 300 ggnmgnmgng gnaaraayaa raayaarggn aayccntggg gnccnaarca rwsnaarmgn 360 aarwsnaarg gnaaycaraa racnaayacn gtnacngcnc cngcngcnyt naargcntty 420 gcnggntgyg cnaarathga yccnccngtn atgaayatha cncargtnaa yggnwsnytn 480 ytngtnathy tncaygcncc naayytnccn taymgntayc araargaraa raaygtnwsn 540 athgargayt aytaygaryt nytntaymgn gtnttyatha thaayaayws nytngaraar 600 garcaraarg tntaygargg ngcncaymgn gcngtngara thgargcnyt nacnccncay 660 wsnwsntayt gygtngtngc ngarathtay carccnatgy tngaymgnmg nwsncarmgn 720 wsngargarm gntgygtnga rathccn 747 <210> 7 <211> 210 <212> PRT
<213> primate <400> 7 Val Asn Gly Thr Ser Gln Phe Thr Cys Phe Tyr Asn Ser Arg Ala Asn Ile Ser Cys Val Trp Ser Gln Asp Gly Ala Leu Gln Asp Thr Ser Cys Gln Val His Ala Trp Pro Asp Arg Arg Arg Trp Asn Gln Thr Cys Glu Leu Leu Pro Val Ser Gln Ala Ser Trp Ala Cys Asn Leu Ile Leu Gly Ala Pro Asp Ser Gln Lys Leu Thr Thr Val Asp Ile Val Thr Leu Arg Val Leu Cys Arg Glu Gly Val Arg Trp Arg Val Met Ala Ile Gln Asp Phe Lys Pro Phe Glu Asn Leu Arg Leu Met Ala Pro Ile Ser Leu Gln Val Val His Val Glu Thr His Arg Cys Asn Ile Ser Trp Glu Ile Ser Gln Ala Ser His Tyr Phe Glu Arg His Leu Glu Phe Glu Ala Arg Thr Leu Ser Pro Gly His Thr Trp Glu Glu Ala Pro Leu Leu Thr Leu Lys Gln Lys Gln Glu Trp Ile Cys Leu Glu Thr Leu Thr Pro Asp Thr Gln Tyr Glu Phe Gln Val Arg Val Lys Pro Leu Gln Gly Glu Phe Thr Thr Trp Ser Pro Trp Ser Gln Pro Leu Ala Phe Arg Thr Lys Pro Ala Ala Leu Gly <210> 8 <211> 231 <212> PRT
<213> primate <400> 8 Ile Cys Ile Cys Thr Cys Val Cys Leu Gly Val Ser Val Thr Gly Glu Gly Gln Gly Pro Arg Ser Arg Thr Phe Thr Cys Leu Thr Asn Asn Ile Leu Arg Ile Asp Cys His Trp Ser Ala Pro Glu Leu Gly Gln Gly Ser Ser Pro Trp Leu Leu Phe Thr Ser Asn Gln Ala Pro Gly Gly Thr His Lys Cys Ile Leu Arg Gly Ser Glu Cys Thr Val Val Leu Pro Pro Glu Ala Val Leu Val Pro Ser Asp Asn Phe Thr Ile Thr Phe His His Cys Met Ser Gly Arg Glu Gln Val Ser Leu Val Asp Pro Glu Tyr Leu Pro Arg Arg His Val Lys Leu Asp Pro Pro Ser Asp Leu Gln Ser Asn Ile Ser Ser Gly His Cys Ile Leu Thr Trp Ser Ile Ser Pro Ala Leu Glu Pro Met Thr Thr Leu Leu Ser Tyr Glu Leu Ala Phe Lys Lys Gln Glu Glu Ala Trp Glu Gln Ala Gln His Arg Asp His Ile Val Gly Val Thr Trp Leu Ile Leu Glu Ala Phe Glu Leu Asp Pro Gly Phe Ile His Glu Ala Arg Leu Arg Val Gln Met Ala Thr Leu Glu Asp Asp Val Val Glu Glu Glu Arg Tyr Thr Gly Gln Trp Ser Glu Trp Ser Gln Pro Val Cys Phe Gln Ala Pro Gln Arg Gln <210> 9 <211> 216 <212> PRT
<213> primate <400> 9 Ile Cys Ile Cys Thr Cys Val Cys Leu Gly Val Ser Val Thr Gly Glu Gly Gln Gly Pro Arg Ser Arg Thr Phe Thr Cys Leu Thr Asn Asn Ile Leu Arg Ile Asp Cys His Trp Ser Ala Pro Glu Leu Gly Gln Gly Thr Leu His Tyr Trp Tyr Lys Asn Ser Asp Asn Asp Lys Val Gln Lys Cys Ser His Tyr Leu Phe Ser Glu Glu Ile Thr Ser Gly Cys Gln Leu Gln Lys Lys Glu Ile His Leu Tyr Gln Thr Phe Val Val Gln Leu Gln Asp Pro Arg Glu Pro Arg Arg Gln Ala Thr Gln Met Leu Lys Leu Gln Asn Leu Val Ile Pro Trp Ala Pro Glu Asn Leu Thr Leu His Lys Leu Ser Glu Ser Gln Leu Glu Leu Asn Trp Asn Asn Arg Phe Leu Asn His Cys Leu Glu His Leu Val Gln Tyr Arg Thr Asp Trp Asp His Ser Trp Thr Glu Gln Ser Val Asp Tyr Arg His Lys Phe Ser Leu Pro Ser Val Asp Gly Gln Lys Arg Tyr Thr Phe Arg Val Arg Ser Arg Phe Asn Pro Leu Cys Gly Ser Ala Gln His Trp Ser Glu Trp Ser His Pro Ile His Trp Gly Ser Asn Thr Ser Lys Glu Asn <210> 10 <211> 257 <212> PRT
<213> primate <400> 10 Leu Leu Ala Ser Asp Ser Glu Pro Leu Lys Cys Phe Ser Arg Thr Phe Glu Asp Leu Thr Cys Phe Trp Asp Glu Glu Glu Ala Ala Pro Ser Gly Thr Tyr Gln Leu Leu Tyr Ala Tyr Pro Arg Glu Lys Pro Arg Ala Cys Pro Leu Ser Ser Gln Ser Met Pro His Phe Gly Thr Arg Tyr Val Cys Gln Phe Pro Asp Gln Glu Glu Val Arg Leu Phe Phe Pro Leu His Leu Trp Val Lys Asn Val Phe Leu Asn Gln Thr Arg Thr Gln Arg Val Leu Phe Val Asp Ser Val Gly Leu Pro Ala Pro Pro Ser Ile Ile Lys Ala Met Gly Gly Ser Gln Pro Gly Glu Leu Gln Ile Ser Trp Glu Glu Pro Ala Pro Glu Ile Ser Asp Phe Leu Arg Tyr Glu Leu Arg Tyr Gly Pro Arg Asp Pro Lys Asn Ser Thr Gly Pro Thr Val Ile Gln Leu Ile Ala Thr Glu Thr Cys Cys Pro Ala Leu Gln Arg Pro His Ser Ala Ser Ala Leu Asp Gln Ser Pro Cys Ala Gln Pro Thr Met Pro Trp Gln Asp Gly Pro Lys Gln Thr Ser Pro Ser Arg Glu Ala Ser Ala Leu Thr Ala Glu Gly Gly Ser Cys Leu Ile Ser Gly Leu Gln Pro Gly Asn Ser Tyr Trp Leu Gln Leu Arg Ser Glu Pro Asp Gly Ile Ser Leu Gly Gly Ser Trp Gly Ser Trp Ser Leu Pro Val Thr Val Asp Leu Pro Gly Asp Ala Val Ala <210> 11 <211> 217 <212> PRT
<213> primate <400> 11 Val Ser Gly Glu Ser Gly Tyr Ala Gln Asn Gly Asp Leu Glu Asp Ala Glu Leu Asp Asp Tyr Ser Phe Ser Cys Tyr Ser Gln Leu Glu Val Asn Gly Ser Gln His Ser Leu Thr Cys Ala Phe Glu Asp Pro Asp Val Asn Thr Thr Asn Leu Glu Phe Glu Ile Cys Gly Ala Leu Val Glu Val Lys Cys Leu Asn Phe Arg Lys Leu Gln Glu Ile Tyr Phe Ile Glu Thr Lys Lys Phe Leu Leu Ile Gly Lys Ser Asn Ile Cys Val Lys Val Gly Glu Lys Ser Leu Thr Cys Lys Lys Ile Asp Leu Thr Thr Ile Val Lys Pro Glu Ala Pro Phe Asp Leu Ser Val Ile Tyr Arg Glu Gly Ala Asn Asp Phe Val Val Thr Phe Asn Thr Ser His Leu Gln Lys Lys Tyr Val Lys Val Leu Met His Asp Val Ala Tyr Arg Gln Glu Lys Asp Glu Asn Lys Trp Thr His Val Asn Leu Ser Ser Thr Lys Leu Thr Leu Leu Gln Arg Lys Leu Gln Pro Ala Ala Met Tyr Glu Ile Lys Val Arg Ser Ile Pro 180. 185 190 Asp His Tyr Phe Lys Gly Phe Trp Ser Glu Trp Ser Pro Ser Tyr Tyr Phe Arg Thr Pro Glu Ile Asn Asn Ser <210> 12 <211> 196 <212> PRT
<213> primate <400> 12 Pro Glu Asn Val Arg Met Asn Ser Val Asn Phe Lys Asn Ile Leu Gln Trp Glu Ser Pro Ala Phe Ala Lys Gly Asn Leu Thr Phe Thr Ala Gln Tyr Leu Ser Tyr Arg Ile Phe Gln Asp Lys Cys Met Asn Thr Thr Leu Thr Glu Cys Asp Phe Ser Ser Leu Ser Lys Tyr Gly Asp His Thr Leu Arg Val Arg Ala Glu Phe Ala Asp Glu His Ser Asp Trp Val Asn Ile Thr Phe Cys Pro Val Asp Asp Thr Ile Ile Gly Pro Pro Gly Met Gln Val Glu Val Leu Ala Asp Ser Leu His Met Arg Phe Leu Ala Pro Lys Ile Glu Asn Glu Tyr Glu Thr Trp Thr Met Lys Asn Val Tyr Asn Ser Trp Thr Tyr Asn Val Gln Tyr Trp Lys Asn Gly Thr Asp Glu Lys Phe Gln Ile Thr Pro Gln Tyr Asp Phe Glu Val Leu Arg Asn Leu Glu Pro Trp Thr Thr Tyr Cys Val Gln Val Arg Gly Phe Leu Pro Asp Arg Asn Lys Ala Gly Glu Trp Ser Glu Pro Val Cys Glu Gln Thr Thr His Asp Glu Thr Val Pro <210> 13 <211> 196 <212> PRT
<213> rodent <400> 13 Pro Glu Lys Val Arg Met Asn Ser Val Asn Phe Lys Asn Ile Leu Gln Trp Glu Val Pro Ala Phe Pro Lys Thr Asn Leu Thr Phe Thr Ala Gln Tyr Glu Ser Tyr Arg Ser Phe Gln Asp His Cys Lys Arg Thr Ala Ser Thr Gln Cys Asp Phe Ser His Leu Ser Lys Tyr Gly Asp Tyr Thr Val Arg Val Arg Ala Glu Leu Ala Asp Glu His Ser Glu Trp Val Asn Val Thr Phe Cys Pro Val Glu Asp Thr Ile Ile Gly Pro Pro Glu Met Gln Ile Glu Ser Leu Ala Glu Ser Leu His Leu Arg Phe Ser Ala Pro Gln Ile Glu Asn Glu Pro Glu Thr Trp Thr Leu Lys Asn Ile Tyr Asp Ser Trp Ala Tyr Arg Val Gln Tyr Trp Lys Asn Gly Thr Asn Glu_Lys Phe Gln Val Val Ser Pro Tyr Asp Ser Glu Val Leu Arg Asn Leu Glu Pro Trp Thr Thr Tyr Cys Ile Gln Val Gln Gly Phe Leu Leu Asp Gln Asn Arg Thr Gly Glu Trp Ser Glu Pro Ile Cys Glu Arg Thr Gly Asn Asp Glu Ile Thr Pro <210> 14 <211> 199 <212> PRT
<213> primate <400> 14 Pro Gln Lys Val Glu Val Asp Ile Ile Asp Asp Asn Phe Ile Leu Arg Trp Asn Arg Ser Asp Glu Ser Val Gly Asn Val Thr Phe Ser Phe Asp Tyr Gln Lys Thr Gly Met Asp Asn Trp Ile Lys Leu Ser Gly Cys Gln Asn Ile Thr Ser Thr Lys Cys Asn Phe Ser Ser Leu Lys Leu Asn Val Tyr Glu Glu Ile Lys Leu Arg Ile Arg Ala Glu Lys Glu Asn Thr Ser Ser Trp Tyr Glu Val Asp Ser Phe Thr Pro Phe Arg Lys Ala Gln Ile Gly Pro Pro Glu Val His Leu Glu Ala Glu Asp Lys Ala Ile Val Ile His Ile Ser Pro Gly Thr Lys Asp Ser Val Met Trp Ala Leu Asp Gly Leu Ser Phe Thr Tyr Ser Leu Leu Ile Trp Lys Asn Ser Ser Gly Val Glu Glu Arg Ile Glu Asn Ile Tyr Ser Arg His Lys Ile Tyr Lys Leu Ser Pro Glu Thr Thr Tyr Cys Leu Lys Val Lys Ala Ala Leu Leu Thr Ser Trp Lys Ile Gly Val Tyr Ser Pro Val His Cys Ile Lys Thr Thr Val Glu Asn Glu Leu Pro Pro <210> 15 <211> 200 <212> PRT
<213> rodent <400> 15 Pro Glu Asn Ile Asp Val Tyr Ile Ile Asp Asp Asn Tyr Thr Leu,Lys Trp Ser Ser His Gly Glu Ser Met Gly Ser Val Thr Phe Ser Ala Glu Tyr Arg Thr Lys Asp Glu Ala Lys Trp Leu Lys Val Pro Glu Cys Gln His Thr Thr Thr Thr Lys Cys Glu Phe Ser Leu Leu Asp Thr Asn Val Tyr Ile Lys Thr Gln Phe Arg Val Arg Ala Glu Glu Gly Asn Ser Thr Ser Ser Trp Asn Glu Val Asp Pro Phe Ile Pro Phe Tyr Thr Ala His Met Ser Pro Pro Glu Val Arg Leu Glu Ala Glu Asp Lys Ala Ile Leu Val His Ile Ser Pro Pro Gly Gln Asp Gly Asn Met Trp Ala Leu Glu Lys Pro Ser Phe Ser Tyr Thr Ile Arg Ile Trp Gln Lys Ser Ser Ser Asp Lys Lys Thr Ile Asn Ser Thr Tyr Tyr Val Glu Lys Ile Pro Glu Leu Leu Pro Glu Thr Thr Tyr Cys Leu Glu Val Lys Ala Ile His Pro Ser Leu Lys Lys His Ser Asn Tyr Ser Thr Val Gln Cys Ile Ser Thr Thr Val Ala Asn Lys Met Pro Val <210> 16 <211> 214 <212> PRT
<213> primate <400> 16 Pro Thr Asn Val Thr Ile Glu Ser Tyr Asn Met Asn Pro Ile Val Tyr Trp Glu Tyr Gln Ile Met Pro Gln Val Pro Val Phe Thr Val Glu Val Lys Asn Tyr Gly Val Lys Asn Ser Glu Trp Ile Asp Ala Cys Ile Asn Ile Ser His His Tyr Cys Asn Ile Ser Asp His Val Gly Asp Pro Ser Asn Ser Leu Trp Val Arg Val,Lys Ala Arg Val Gly Gln Lys Glu Ser Ala Tyr Ala Lys Ser Glu Glu Phe Ala Val Cys Arg Asp Gly Lys Ile Gly Pro Pro Lys Leu Asp Ile Arg Lys Glu Glu Lys Gln Ile Met Ile Asp Ile Phe His Pro Ser Val Phe Val Asn Gly Asp Glu Gln Glu Val Asp Tyr Asp Pro Glu Thr Thr Cys Tyr Ile Arg Val Tyr Asn Val Tyr Val Arg Met Asn Gly Ser Glu Ile Gln Tyr Lys Ile Leu Thr Gln Lys Glu Asp Asp Cys Asp Glu Ile Gln Cys Gln Leu Ala Ile Pro Val Ser Ser Leu Asn Ser Gln Tyr Cys Val Ser Ala Glu Gly Val Leu His Val Trp Gly Val Thr Thr Glu Lys Ser Lys Glu Val Cys Ile Thr Ile Phe Asn Ser Ser Ile Lys Gly <210> 17 <211> 213 <212> PRT
<213> rodent <400> 17 Pro Thr Asn Val Leu Ile Lys Ser Tyr Asn Leu Asn Pro Val Val Cys Trp Glu Tyr Gln Asn Met Ser Gln Thr Pro Ile Phe Thr Val Gln Val Lys Val Tyr Ser Gly Ser Trp Thr Asp Ser Cys Thr Asn Ile Ser Asp His Cys Cys Asn Ile Tyr Gly Gln Ile Met Tyr Pro Asp Val Ser Ala Trp Ala Arg Val Lys Ala Lys Val Gly Gln Lys Glu Ser Asp Tyr Ala Arg Ser Lys Glu Phe Leu Met Cys Leu Lys Gly Lys Val Gly Pro Pro Gly Leu Glu Ile Arg Arg Lys Lys Glu Glu Gln Leu Ser Val Leu Val Phe His Pro Glu Val Val Val Asn Gly Glu Ser Gln Gly Thr Met Phe Gly Asp Gly Ser Thr Cys Tyr Thr Phe Asp Tyr Thr Val Tyr Val Glu His Asn Arg Ser Gly Glu Ile Leu His Thr Lys His Thr Val Glu Lys Glu Glu Cys Asn Glu Thr Leu Cys Glu Leu Asn Ile Ser Val Ser Thr Leu Asp Ser Arg Tyr Cys Ile Ser Val Asp Gly Ile Ser Ser Phe Trp Gln Val Arg Thr Glu Lys Ser Lys Asp Val Cys Ile Pro Pro Phe His Asp Asp Arg Lys Asp <210> 18 <211> 207 <212> PRT
<213> rodent <400> 18 Pro Ser Tyr Val Trp Phe Glu Ala Arg Phe Phe Gln His Ile Leu His Trp Lys Pro Ile Pro Asn Gln Ser Glu Ser Thr Tyr Tyr Glu Val Ala Leu Lys Gln Tyr Gly Asn Ser Thr Trp Asn Asp Ile His Ile Cys Arg Lys Ala Gln Ala Leu Ser Cys Asp Leu Thr Thr Phe Thr Leu Asp Leu Tyr His Arg Ser Tyr Gly Tyr Arg Ala Arg Val Arg Ala Val Asp Asn Ser Gln Tyr Ser Asn Trp Thr Thr Thr Glu Thr Arg Phe Thr Val Asp Glu Val Ile Leu Thr Val Asp Ser Val Thr Leu Lys Ala Met Asp Gly Ile Ile Tyr Gly Thr Ile His Pro Pro Arg Pro Thr Ile Thr Pro Ala Gly Asp Glu Tyr Glu Gln Val Phe Lys Asp Leu Arg Val Tyr Lys Ile Ser Ile Arg Lys Phe Ser Glu Leu Lys Asn Ala Thr Lys Arg Val Lys Gln Glu Thr Phe Thr Leu Thr Val Pro Ile Gly Val Arg Lys Phe Cys Val Lys Val Leu Pro Arg Leu Glu Ser Arg Ile Asn Lys Ala Glu Trp Ser Glu Glu Gln Cys Leu Leu Ile Thr Thr Glu Gln Tyr Phe Thr <210> 19 <211> 204 <212> PRT
<213> primate <400> 19 Pro Pro Ser Val Trp Phe Glu Ala Glu Phe Phe His His Ile Leu His Trp Thr Pro Ile Pro Asn Gln Ser Glu Ser Thr Cys Tyr Glu Val Ala Leu Leu Arg Tyr Gly Ile Glu Ser Trp Asn Ser Ile Ser Asn Cys Ser Gln Thr Leu Ser Tyr Asp Leu Thr Ala Val Thr Leu Asp Leu Tyr His Ser Asn Gly Tyr Arg Ala Arg Val Arg Ala Val Asp Gly Ser Arg His Ser Asn Trp Thr Val Thr Asn Thr Arg Phe Ser Val Asp Glu Val Thr Leu Thr Val Gly Ser Val Asn Leu Glu Ile His Asn Gly Phe Ile Leu Gly Lys Ile Gln Leu Pro Arg Pro Lys Met Ala Pro Ala Asn Asp Thr Tyr Glu Ser Ile Phe Ser His Phe Arg Glu Tyr Glu Ile Ala Ile Arg Lys Val Pro Gly Asn Phe Thr Phe Thr His Lys Lys Val Lys His Glu Asn Phe Ser Leu Leu Thr Ser Gly Glu Val Gly Glu Phe Cys Val Gln Val Lys Pro Ser Val Ala Ser Arg Ser Asn Lys Gly Met Trp Ser Lys Glu Glu Cys Ile Ser Leu Thr Arg Gln Tyr Phe Thr <210> 20 <211> 208 <212> PRT
<213> primate <400> 20 Pro Leu Asn Pro Arg Leu His Leu Tyr Asn Asp Glu Gln Ile Leu Thr Trp Glu Pro Ser Pro Ser Ser Asn Asp Pro Arg Pro Val Val Tyr Gln Val Glu Tyr Ser Phe Ile Asp Gly Ser Trp His Arg Leu Leu Glu Pro Asn Cys Thr Asp Ile Thr Glu Thr Lys Cys Asp Leu Thr Gly Gly Gly Arg Leu Lys Leu Phe Pro His Pro Phe Thr Val Phe Leu Arg Val Arg Ala Lys Arg Gly Asn Leu Thr Ser Lys Trp Val Gly Leu Glu Pro Phe Gln His Tyr Glu Asn Val Thr Val Gly Pro Pro Lys Asn Ile Ser Val Thr Pro Gly Lys Gly Ser Leu Val Ile His Phe Ser Pro Pro Phe Asp Val Phe His Gly Ala Thr Phe Gln Tyr Leu Val His Tyr Trp Glu Lys Ser Glu Thr Gln Gln Glu Gln Val Glu Gly Pro Phe Lys Ser Asn Ser Ile Val Leu Gly Asn Leu Lys Pro Tyr Arg Val Tyr Cys Leu Gln Thr Glu Ala Gln Leu Ile Leu Lys Asn Lys Lys Ile Arg Pro His Gly Leu Leu Ser Asn Val Ser Cys His Glu Thr Thr Ala Asn Ala Ser Ala Arg <210> 21 <211> 207 <212> PRT
<213> primate <400> 21 Pro Ala Asn Ile Thr Phe Leu Ser Ile Asn Met Lys Asn Val Leu Gln Trp Thr Pro Pro Glu Gly Leu Gln Gly Val Lys Val Thr Tyr Thr Val Gln Tyr Phe Ile Tyr Gly Gln Lys Lys Trp Leu Asn Lys Ser Glu Cys Arg Asn Ile Asn Arg Thr Tyr Cys Asp Leu Ser Ala Glu Thr Ser Asp Tyr Glu His Gln Tyr Tyr Ala Lys Val Lys Ala Ile Trp Gly Thr Lys Cys Ser Lys Trp Ala Glu Ser Gly Arg Phe Tyr Pro Phe Leu Glu Thr Gln Ile Gly Pro Pro Glu Val Ala Leu Thr Thr Asp Glu Lys Ser Ile Ser Val Val Leu Thr Ala Pro Glu Lys Trp Lys Arg Asn Pro Glu Asp Leu Pro Val Ser Met Gln Gln Ile Tyr Ser Asn Leu Lys Tyr Asn Val Ser Val Leu Asn Thr Lys Ser Asn Arg Thr Trp Ser Gln Cys Val Thr Asn His Thr Leu Val Leu Thr Trp Leu Glu Pro Asn Thr Leu Tyr Cys Val His Val Glu Ser Phe Val Pro Gly Pro Pro Arg Arg Ala Gln Pro Ser Glu Lys Gln Cys Ala Arg Thr Leu Lys Asp Gln Ser Ser Glu <210> 22 <211> 234 <212> PRT
<213> primate <400> 22 Leu Gln His Val Lys Phe Gln Ser Ser Asn Phe Glu Asn Ile Leu Thr Trp Asp Ser Gly Pro Glu Gly Thr Pro Asp Thr Val Tyr Ser Ile Glu Tyr Lys Thr Tyr Gly Glu Arg Asp Trp Val Ala Lys Lys Gly Cys Gln Arg Ile Thr Arg Lys Ser Cys Asn Leu Thr Val Glu Thr Gly Asn Leu Thr Glu Leu Tyr Tyr Ala Arg Val Thr Ala Val Ser Ala Gly Gly Arg Ser Ala Thr Lys Met Thr Asp Arg Phe. Ser Ser Leu Gln His Thr Thr Leu Lys Pro Pro Asp Val Thr Cys Ile Ser Lys Val Arg Ser Ile Gln Met Ile Val His Pro Thr Pro Thr Pro Ile Arg Ala Gly Asp Gly His Arg Leu Thr Leu Glu Asp Ile Phe His Asp Leu Phe Tyr His Leu Glu Leu Gln Val Asn Arg Thr Tyr Gln Met His Leu Gly Gly Lys Gln Arg Glu Tyr Glu Phe Phe Gly Leu Thr Pro Asp Thr Glu Phe Leu Gly Thr Ile Met Ile Cys Val Pro Thr Trp Ala Lys Glu Ser Ala Pro Tyr Met Cys Arg Val Lys Thr Leu Pro Asp Arg Thr Trp Thr Tyr Ser Phe Ser Gly Ala Phe Leu Phe Ser Met Gly Phe Leu Val Ala Val Leu Cys Tyr Leu Ser Tyr Arg Tyr Val Thr Lys Pro Pro <210> 23 <211> 201 <212> PRT
<213> primate <400> 23 Ser Cys Thr Phe Lys Ile Ser Leu Arg Asn Phe Arg Ser Ile Leu Ser Trp Glu Leu Lys Asn His Ser Ile Val Pro Thr His Tyr Thr Leu Leu Tyr Thr Ile Met Ser Lys Pro Glu Asp Leu Lys Val Val Lys Asn Cys Ala Asn Thr Thr Arg Ser Phe Cys Asp Leu Thr Asp Glu Trp Arg Ser Thr His Glu Ala Tyr Val Thr Val Leu Glu Gly Phe Ser Gly Asn Thr Thr Leu Phe Ser Cys Ser His Asn Phe Trp Leu Ala Ile Asp Met Ser Phe Glu Pro Pro Glu Phe Glu Ile Val Gly Phe Thr Asn His Ile Asn Val Met Val Lys Phe Pro Ser Ile Val Glu Glu Glu Leu Gln Phe Asp Leu Ser Leu Val Ile Glu Glu Gln Ser Glu Gly Ile Val Lys Lys His Lys Pro Glu Ile Lys Gly Asn Met Ser Gly Asn Phe Thr Tyr Ile Ile Asp Lys Leu Ile Pro Asn Thr Asn Tyr Cys Val Ser Val Tyr Leu Glu His Ser Asp Glu Gln Ala Val Ile Lys Ser Pro Leu Lys Cys Thr Leu Leu Pro Pro Gly Gln Glu Ser Glu Ser <210> 24 <211> 1617 <212> DNA
<213> primate; surmised Homo sapiens <220>
<221> CDS
<222> (1)..(1614) <220>
<221> mat~eptide <222> (61)..(1614) <220>
<221> misc_feature <222> (1). (1617) <223> n may be a, c, g, or t; translated amino acid depends on genetic code <400> 24 atg ccg cgt ggc tgg gcc gcc ccc ttg ctc ctg ctg ctg ctc cag gga 48 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly ggc tgg ggc tgc ccc gac ctc gtc tgc tac acc gat tac ctc cag acg 96 Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr gtc atc tgc atc ctg gaa atg tgg aac ctc cac ccc agc acg ctc acc 144 Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr ctt acc tgg caa gac cag tat gaa gag ctg aag gac gag gcc acc tcc 192 Leu Thr Trp Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser tgc agc ctc cac agg tcg gcc cac aat gcc acg cat gcc acc tac acc 240 Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr tgc cac atg gat gta ttc cac ttc atg gcc gac gac att ttc agt gtc 288 Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val aac atc aca gac cag tct ggc aac tac tcc cag gan tgt ggc agc ttt 336 Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Xaa Cys Gly Ser Phe ctc ctg get gag agc atc aag ccg get ccc cct ttc aac gtg act gtg 384 Leu Leu Ala Glu Ser Ile Lys Pro Ala Pro Pro Phe Asn Val Thr Val acc ttc tca gga cag tat aat atn tcc tgg cgc tca gat tac gaa gac 432 Thr Phe Ser Gly Gln Tyr Asn Xaa Ser Trp Arg Ser Asp Tyr Glu Asp cct gcc ttc tac atg ctg aaa ggc aag ctt caa tat gag ctg cag tac 480 Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr agg aac cgg gga gac ccc tgg get gtg agt ccg agg aga aag ctg atc 528 Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile tca gtg gac tca aga agt gtc tcc ctc ctc ccc ctg gag ttc cgc aaa 576 Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys gac tcg agc tat gag ctg can gtg cgg gca ggg ccc atg cct ggc tcc 624 Asp Ser Ser Tyr Glu Leu Xaa Val Arg Ala Gly Pro Met Pro Gly Ser tcc tac cag ggg acc tgg agt gaa tgg agt gac ccg gtc atc tgt cag 672 Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Cys Gln acc cag tca gag gag tta aag gaa ggc tgg aac cct cac ctg ctg ctt 720 Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu ctc ctc ctg ctt gtc ata gtc ttc att cct gcc ttc tgg agc ctg aag 768 Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys acc cat cca ttg tgg agg cta tgg aag aag ata tgg gcc gtc ccc agc 816 Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser cct gag cgg ttc ttc atg ccc ctg tac aag ggc tgc agc gga gac ttc 864 Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe aag aaa tgg gtg ggt gca ccc ttc act ggc tcc agc ctg gag ctg gga 912 Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly ccc tgg agc cca gag gtg ccc tcc acc ctg gag gtg tac agc tgc cac 960 Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His cca cca cgg agc ccg gcc aag agg ctg cag ctc acg gag cta caa gaa 1008 Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu cca gca gag ctg gtg gag tct gac ggt gtg ccc aag ccc agc ttc tgg 1056 Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp ccg aca gcc cag aac tcg ggg ggc tca get tac agt gag gag agg gat 1104 Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp cgg cca tac ggc ctg gtg tcc att gac aca gtg act gtg cta gat gca 1152 Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala gag ggg cca tgc acc tgg ccc tgc agc tgt gag gat gac ggc tac cca 1200 Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro gcc ctg gac ctg gat get ggc ctg gag ccc agc cca ggc cta gag gac 1248 Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp cca ctc ttg gat gca ggg acc aca gtc ctg tcc tgt ggc tgt gtc tca 1296 Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser get ggc agc cct ggg cta gga ggg ccc ctg gga agc ctc ctg gac aga 1344 Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg cta aag cca ccc ctt gca gat ggg gag gac tgg get ggg gga ctg ccc 1392 Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro tgg ggt ggc cgg tca cct gga ggg gtc tca gag agt gag gcg ggc tca 1440 Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser ccc ctg gcc ggc ctg gat atg gac acg ttt gac agt ggc ttt gtg ggc 1488 Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly tct gac tgc agc agc cct gtg gag tgt gac ttc acc agc ccc ggg gac 1536 Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp gaa gga ccc ccc cgg agc tac ctc cgc cag tgg gtg gtc att cct ccg 1584 Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro cca ctt tcg agc cct gga ccc cag gcc agc taa 1617 Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser <210> 25 <211> 538 <212> PRT
<213> primate; surmised Homo Sapiens <400> 25 Met Pro Arg Gly Trp Ala Ala Pro Leu Leu Leu Leu Leu Leu Gln Gly Gly Trp Gly Cys Pro Asp Leu Val Cys Tyr Thr Asp Tyr Leu Gln Thr Val Ile Cys Ile Leu Glu Met Trp Asn Leu His Pro Ser Thr Leu Thr Leu Thr Trp Gln Asp Gln Tyr Glu Glu Leu Lys Asp Glu Ala Thr Ser Cys Ser Leu His Arg Ser Ala His Asn Ala Thr His Ala Thr Tyr Thr Cys His Met Asp Val Phe His Phe Met Ala Asp Asp Ile Phe Ser Val Asn Ile Thr Asp Gln Ser Gly Asn Tyr Ser Gln Xaa Cys Gly Ser Phe Leu Leu Ala Glu Ser Ile Lys Pro Ala Pro Pro Phe Asn Val Thr Val Thr Phe Ser Gly Gln Tyr Asn Xaa Ser Trp Arg Ser Asp Tyr Glu Asp Pro Ala Phe Tyr Met Leu Lys Gly Lys Leu Gln Tyr Glu Leu Gln Tyr Arg Asn Arg Gly Asp Pro Trp Ala Val Ser Pro Arg Arg Lys Leu Ile Ser Val Asp Ser Arg Ser Val Ser Leu Leu Pro Leu Glu Phe Arg Lys Asp Ser Ser Tyr Glu Leu Xaa Val Arg Ala Gly Pro Met Pro Gly Ser Ser Tyr Gln Gly Thr Trp Ser Glu Trp Ser Asp Pro Val Ile Cys Gln Thr Gln Ser Glu Glu Leu Lys Glu Gly Trp Asn Pro His Leu Leu Leu Leu Leu Leu Leu Val Ile Val Phe Ile Pro Ala Phe Trp Ser Leu Lys Thr His Pro Leu Trp Arg Leu Trp Lys Lys Ile Trp Ala Val Pro Ser Pro Glu Arg Phe Phe Met Pro Leu Tyr Lys Gly Cys Ser Gly Asp Phe Lys Lys Trp Val Gly Ala Pro Phe Thr Gly Ser Ser Leu Glu Leu Gly Pro Trp Ser Pro Glu Val Pro Ser Thr Leu Glu Val Tyr Ser Cys His Pro Pro Arg Ser Pro Ala Lys Arg Leu Gln Leu Thr Glu Leu Gln Glu Pro Ala Glu Leu Val Glu Ser Asp Gly Val Pro Lys Pro Ser Phe Trp Pro Thr Ala Gln Asn Ser Gly Gly Ser Ala Tyr Ser Glu Glu Arg Asp Arg Pro Tyr Gly Leu Val Ser Ile Asp Thr Val Thr Val Leu Asp Ala Glu Gly Pro Cys Thr Trp Pro Cys Ser Cys Glu Asp Asp Gly Tyr Pro Ala Leu Asp Leu Asp Ala Gly Leu Glu Pro Ser Pro Gly Leu Glu Asp Pro Leu Leu Asp Ala Gly Thr Thr Val Leu Ser Cys Gly Cys Val Ser Ala Gly Ser Pro Gly Leu Gly Gly Pro Leu Gly Ser Leu Leu Asp Arg Leu Lys Pro Pro Leu Ala Asp Gly Glu Asp Trp Ala Gly Gly Leu Pro Trp Gly Gly Arg Ser Pro Gly Gly Val Ser Glu Ser Glu Ala Gly Ser Pro Leu Ala Gly Leu Asp Met Asp Thr Phe Asp Ser Gly Phe Val Gly Ser Asp Cys Ser Ser Pro Val Glu Cys Asp Phe Thr Ser Pro Gly Asp Glu Gly Pro Pro Arg Ser Tyr Leu Arg Gln Trp Val Val Ile Pro Pro Pro Leu Ser Ser Pro Gly Pro Gln Ala Ser <210> 26 <211> 1614 <212> DNA
<213> Artificial Sequence <220>
<221> misc_feature <222> (1) . (1614) <223> n may be a, c, g, or t <220>
<223> Description of Artificial Sequence: reverse translation <400> 26 atgccnmgng gntgggcngc nccnytnytn ytnytnytny tncarggngg ntggggntgy 60 ccngayytng tntgytayac ngaytayytn caracngtna thtgyathyt ngaratgtgg 120 aayytncayc cnwsnacnyt nacnytnacn tggcargayc artaygarga rytnaargay 180 gargcnacnw sntgywsnyt ncaymgnwsn gcncayaayg cnacncaygc nacntayacn 240 tgycayatgg aygtnttyca yttyatggcn gaygayatht tywsngtnaa yathacngay 300 carwsnggna aytaywsnca rnnntgyggn wsnttyytny tngcngarws nathaarccn 360 gcnccnccnt tyaaygtnac ngtnacntty wsnggncart ayaaynnnws ntggmgnwsn 420 gaytaygarg ayccngcntt ytayatgytn aarggnaary tncartayga rytncartay 480 mgnaaymgng gngayccntg ggcngtnwsn ccnmgnmgna arytnathws ngtngaywsn 540 mgnwsngtnw snytnytncc nytngartty mgnaargayw snwsntayga rytnnnngtn 600 mgngcnggnc cnatgccngg nwsnwsntay carggnacnt ggwsngartg gwsngayccn 660 gtnathtgyc aracncarws ngargarytn aargarggnt ggaayccnca yytnytnytn 720 ytnytnytny tngtnathgt nttyathccn gcnttytggw snytnaarac ncayccnytn 780 tggmgnytnt ggaaraarat htgggcngtn ccnwsnccng armgnttytt yatgccnytn 840 tayaarggnt gywsnggnga yttyaaraar tgggtnggng cnccnttyac nggnwsnwsn 900 ytngarytng gnccntggws nccngargtn ccnwsnacny tngargtnta ywsntgycay 960 ccnccnmgnw snccngcnaa rmgnytncar ytnacngary tncargarcc ngcngarytn 1020 gtngarwsng ayggngtncc naarccnwsn ttytggccna cngcncaraa ywsnggnggn 1080 wsngcntayw sngargarmg ngaymgnccn tayggnytng tnwsnathga yacngtnacn 1140 gtnytngayg cngarggncc ntgyacntgg ccntgywsnt gygargayga yggntayccn 1200 gcnytngayy tngaygcngg nytngarccn wsnccnggny tngargaycc nytnytngay 1260 gcnggnacna cngtnytnws ntgyggntgy gtnwsngcng gnwsnccngg nytnggnggn 1320 ccnytnggnw snytnytnga ymgnytnaar ccnccnytng cngayggnga rgaytgggcn 1380 ggnggnytnc cntggggngg nmgnwsnccn ggnggngtnw sngarwsnga rgcnggnwsn 1440 ccnytngcng gnytngayat ggayacntty gaywsnggnt tygtnggnws ngaytgywsn 1500 wsnccngtng artgygaytt yacnwsnccn ggngaygarg gnccnccnmg nwsntayytn 1560 mgncartggg tngtnathcc nccnccnytn wsnwsnccng gnccncargc nwsn 1614 <210> 27 <211> 696 <212> DNA
<213> primate; surmised Homo sapiens <220>
<221> CDS
<222> (1)..(693) <220>
<221> mat~eptide <222> (64)..(693) <400> 27 atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttc ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln ccc ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa cct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Pro Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aca aaa ata 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Thr Lys Ile gat cct cca gtc atg aat ata acc caa gtc aat ggc tct ttg ttg gta 432 Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val att ctc cat get cca aat tta cca tat aga tac caa aag gaa aaa aat 480 Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn gta tct ata gaa gat tac tat gaa cta cta tac cga gtt ttt ata att 528 Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile aac aat tca cta gaa aag gag caa aag gtt tat gaa ggg get cac aga 576 Asn Asn Ser Leu Glu Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg gcg gtt gaa att gaa get cta aca cca cac tcc agc tac tgt gta gtg 624 Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val get gaa ata tat cag ccc atg tta gac aga aga agt cag aga agt gaa 672 Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu gag aga tgt gtg gaa att cca tga 696 Glu Arg Cys Val Glu Ile Pro <210> 28 <211> 231 <212> PRT
<213> primate; surmised Homo sapiens <400> 28 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln Glu Pro Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Thr Lys Ile Asp Pro Pro Val Met Asn Ile Thr Gln Val Asn Gly Ser Leu Leu Val Ile Leu His Ala Pro Asn Leu Pro Tyr Arg Tyr Gln Lys Glu Lys Asn Val Ser Ile Glu Asp Tyr Tyr Glu Leu Leu Tyr Arg Val Phe Ile Ile Asn Asn Ser Leu Glu.Lys Glu Gln Lys Val Tyr Glu Gly Ala His Arg Ala Val Glu Ile Glu Ala Leu Thr Pro His Ser Ser Tyr Cys Val Val Ala Glu Ile Tyr Gln Pro Met Leu Asp Arg Arg Ser Gln Arg Ser Glu Glu Arg Cys Val Glu Ile Pro <210> 29 <211> 693 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (1) . (693) <223> n may be a, c, g, or t <400> 29 atgatgccna arcaytgytt yytnggntty ytnathwsnt tyttyytnac nggngtngcn 60 ggnacncarw snacncayga rwsnytnaar ccncarmgng tncarttyca rwsnmgnaay 120 ttycayaaya thytncartg gcarccnggn mgngcnytna cnggnaayws nwsngtntay 180 ttygtncart ayaarathta yggncarmgn cartggaara ayaargarga ytgytggggn 240 acncargary tnwsntgyga yytnacnwsn garacnwsng ayathcarga rccntaytay 300 ggnmgngtnm gngcngcnws ngcnggnwsn taywsngart ggwsnatgac nccnmgntty 360 acnccntggt gggaracnaa rathgayccn ccngtnatga ayathacnca rgtnaayggn 420 wsnytnytng tnathytnca ygcnccnaay ytnccntaym gntaycaraa rgaraaraay 480 gtnwsnathg argaytayta ygarytnytn taymgngtnt tyathathaa yaaywsnytn 540 garaargarc araargtnta ygarggngcn caymgngcng tngarathga rgcnytnacn 600 ccncaywsnw sntaytgygt ngtngcngar athtaycarc cnatgytnga ymgnmgnwsn 660 carmgnwsng argarmgntg ygtngarath ccn 693 <210> 30 <211> 526 <212> DNA
<213> primate; surmised Homo Sapiens <220>
<221> CDS
<222> (1)..(390) <220>
<221> mat~eptide <222> (64)..(390) <400> 30 atg atg cct aaa cat tgc ttt cta ggc ttc ctc atc agt ttt ttc ctt 48 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu act ggt gta gca gga act cag tca acg cat gag tct ctg aag cct cag 96 Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln agg gta caa ttt cag tcc cga aat ttt cac aac att ttg caa tgg cag 144 Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln cct ggg agg gca ctt act ggc aac agc agt gtc tat ttt gtg cag tac 192 Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr aaa ata tat gga cag aga caa tgg aaa aat aaa gaa gac tgt tgg ggt 240 Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly act caa gaa ctc tct tgt gac ctt acc agt gaa acc tca gac ata cag 288 Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln gaa tct tat tac ggg agg gtg agg gcg gcc tcg get ggg agc tac tca 336 Glu Ser Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser gaa tgg agc atg acg ccg cgg ttc act ccc tgg tgg gaa aga gca aaa 384 Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Arg Ala Lys ggt tta tgaaggggct cacagagcgg ttgaaattga agctctaaca ccacactcca 440 Gly Leu gctactgtgt agtggctgaa atatatcagc ccacgttaga cagaagaagt cagagaagtg 500 aagagagatg tgtggaaatt ccatga 526 <210> 31 <211> 130 <212> PRT
<213> primate; surmised Homo Sapiens <400> 31 Met Met Pro Lys His Cys Phe Leu Gly Phe Leu Ile Ser Phe Phe Leu Thr Gly Val Ala Gly Thr Gln Ser Thr His Glu Ser Leu Lys Pro Gln Arg Val Gln Phe Gln Ser Arg Asn Phe His Asn Ile Leu Gln Trp Gln Pro Gly Arg Ala Leu Thr Gly Asn Ser Ser Val Tyr Phe Val Gln Tyr Lys Ile Tyr Gly Gln Arg Gln Trp Lys Asn Lys Glu Asp Cys Trp Gly Thr Gln Glu Leu Ser Cys Asp Leu Thr Ser Glu Thr Ser Asp Ile Gln Glu Ser Tyr Tyr Gly Arg Val Arg Ala Ala Ser Ala Gly Ser Tyr Ser Glu Trp Ser Met Thr Pro Arg Phe Thr Pro Trp Trp Glu Arg Ala Lys Gly Leu <210> 32 <211> 390 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence: reverse translation <220>
<221> misc_feature <222> (1). (390) <223> n may be a, c, g,or t <400> 32 atgatgccna arcaytgytt yytnggntty ytnathwsnt tyttyytnac nggngtngcn 60 ggnacncarw snacncayga rwsnytnaar ccncarmgng tncarttyca rwsnmgnaay 120 ttycayaaya thytncartg gcarccnggn mgngcnytna cnggnaayws nwsngtntay 180 ttygtncart ayaarathta yggncarmgn cartggaara ayaargarga ytgytggggn 240 acncargary tnwsntgyga yytnacnwsn garacnwsng ayathcarga rwsntaytay 300 ggnmgngtnm gngcngcnws ngcnggnwsn taywsngart ggwsnatgac nccnmgntty 360 acnccntggt gggarmgngc naarggnytn 390
Claims (20)
1. A composition of matter selected from:
a) a substantially pure or recombinant DCRS3 polypeptide comprising at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 2 or 25;
b) a substantially pure or recombinant DCRS3 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 2 or 25;
c) a natural sequence DCRS3 comprising mature SEQ ID NO: 2 or 25;
d) a fusion polypeptide comprising DCRS3 sequence;
e) a substantially pure or recombinant DCRS4 polypeptide comprising at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 5, 28, or 31;
f) a substantially pure or recombinant DCRS4 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 5, 28, or 31;
g) a natural sequence DCRS4 comprising mature SEQ ID NO:
5, 28, or 31; or h) a fusion polypeptide comprising DCRS4 sequence.
a) a substantially pure or recombinant DCRS3 polypeptide comprising at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 2 or 25;
b) a substantially pure or recombinant DCRS3 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 2 or 25;
c) a natural sequence DCRS3 comprising mature SEQ ID NO: 2 or 25;
d) a fusion polypeptide comprising DCRS3 sequence;
e) a substantially pure or recombinant DCRS4 polypeptide comprising at least three distinct nonoverlapping segments of at least four amino acids identical to segments of SEQ ID NO: 5, 28, or 31;
f) a substantially pure or recombinant DCRS4 polypeptide comprising at least two distinct nonoverlapping segments of at least five amino acids identical to segments of SEQ ID NO: 5, 28, or 31;
g) a natural sequence DCRS4 comprising mature SEQ ID NO:
5, 28, or 31; or h) a fusion polypeptide comprising DCRS4 sequence.
2. The substantially pure or isolated antigenic DCRS3 or DCRS4 polypeptide of Claim 1, wherein said distinct nonoverlapping segments of identity:
a) include one of at least eight amino acids;
b) include one of at least four amino acids and a second of at least five amino acids;
c) include at least three segments of at least four, five, and six amino acids, or d) include one of at least twelve amino acids.
a) include one of at least eight amino acids;
b) include one of at least four amino acids and a second of at least five amino acids;
c) include at least three segments of at least four, five, and six amino acids, or d) include one of at least twelve amino acids.
3. The composition of matter of Claim 1, wherein said:
a) DCRS3 polypeptide:
i) comprises a mature sequence of Table 1;
ii) is an unglycosylated form of DCRS3;
iii) is from a primate, such as a human;
iv) comprises at least seventeen amino acids of SEQ
ID NO: 2 or 25;
v) exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID NO: 2 or 25;
vi) comprises a sequence of at least 3 amino acids on each side across an exon boundary;
vii) is a natural allelic variant of DCRS3;
viii) has a length at least about 30 amino acids;
ix) exhibits at least two non-overlapping epitopes which are specific for a primate DCRS3;
x) is glycosylated;
xi) has a molecular weight of at least 30 kD with natural glycosylation;
xii) is a synthetic polypeptide;
xiii) is attached to a solid substrate;
xiv) is conjugated to another chemical moiety;
xv) is a 5-fold or less substitution from natural sequence; or xvi) is a deletion or insertion variant from a natural sequence; or b) DCRS4 polypeptide:
i) comprises a mature sequence of Table 3;
ii) is an unglycosylated form of DCRS4;
iii) is from a primate, such as a human;
iv) comprises at least seventeen amino acids of SEQ
ID NO: 5, 28, or 31;
v) exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID NO: 5, 28, or 31;
vi) comprises a sequence of at least 3 amino acids on each side across an exon boundary;
vii) is a natural allelic variant of DCRS4;
viii) has a length at least about 30 amino acids;
ix) exhibits at least two non-overlapping epitopes which are specific for a primate DCRS4;
x) is glycosylated;
xi) has a molecular weight of at least 30 kD with natural glycosylation;
xii) is a synthetic polypeptide;
xiii) is attached to a solid substrate;
xiv) is conjugated to another chemical moiety;
xv) is a 5-fold or less substitution from natural sequence; or xvi) is a deletion or insertion variant from a natural sequence.
a) DCRS3 polypeptide:
i) comprises a mature sequence of Table 1;
ii) is an unglycosylated form of DCRS3;
iii) is from a primate, such as a human;
iv) comprises at least seventeen amino acids of SEQ
ID NO: 2 or 25;
v) exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID NO: 2 or 25;
vi) comprises a sequence of at least 3 amino acids on each side across an exon boundary;
vii) is a natural allelic variant of DCRS3;
viii) has a length at least about 30 amino acids;
ix) exhibits at least two non-overlapping epitopes which are specific for a primate DCRS3;
x) is glycosylated;
xi) has a molecular weight of at least 30 kD with natural glycosylation;
xii) is a synthetic polypeptide;
xiii) is attached to a solid substrate;
xiv) is conjugated to another chemical moiety;
xv) is a 5-fold or less substitution from natural sequence; or xvi) is a deletion or insertion variant from a natural sequence; or b) DCRS4 polypeptide:
i) comprises a mature sequence of Table 3;
ii) is an unglycosylated form of DCRS4;
iii) is from a primate, such as a human;
iv) comprises at least seventeen amino acids of SEQ
ID NO: 5, 28, or 31;
v) exhibits at least four nonoverlapping segments of at least seven amino acids of SEQ ID NO: 5, 28, or 31;
vi) comprises a sequence of at least 3 amino acids on each side across an exon boundary;
vii) is a natural allelic variant of DCRS4;
viii) has a length at least about 30 amino acids;
ix) exhibits at least two non-overlapping epitopes which are specific for a primate DCRS4;
x) is glycosylated;
xi) has a molecular weight of at least 30 kD with natural glycosylation;
xii) is a synthetic polypeptide;
xiii) is attached to a solid substrate;
xiv) is conjugated to another chemical moiety;
xv) is a 5-fold or less substitution from natural sequence; or xvi) is a deletion or insertion variant from a natural sequence.
4. A composition comprising:
a) a substantially pure DCRS3 and another cytokine receptor family member;
b) a sterile DCRS3 polypeptide of Claim 1;
c) said DCRS3 polypeptide of Claim 1 and a carrier, wherein said carrier is:
i) an aqueous compound, including water, saline, and/or buffer; and/or ii) formulated for oral, rectal, nasal, topical, or parenteral administration; or d) a substantially pure DCRS4 and another cytokine receptor family member;
e) a sterile DCRS4 polypeptide of Claim 1;
f) said DCRS4 polypeptide of Claim 1 and a carrier, wherein said carrier is:
i) an aqueous compound, including water, saline, and/or buffer; and/or ii) formulated for oral, rectal, nasal, topical, or parenteral administration.
a) a substantially pure DCRS3 and another cytokine receptor family member;
b) a sterile DCRS3 polypeptide of Claim 1;
c) said DCRS3 polypeptide of Claim 1 and a carrier, wherein said carrier is:
i) an aqueous compound, including water, saline, and/or buffer; and/or ii) formulated for oral, rectal, nasal, topical, or parenteral administration; or d) a substantially pure DCRS4 and another cytokine receptor family member;
e) a sterile DCRS4 polypeptide of Claim 1;
f) said DCRS4 polypeptide of Claim 1 and a carrier, wherein said carrier is:
i) an aqueous compound, including water, saline, and/or buffer; and/or ii) formulated for oral, rectal, nasal, topical, or parenteral administration.
5. The fusion polypeptide of Claim 1, comprising:
a) mature protein sequence of Table 1;
b) mature protein sequence of Table 1;
c) a detection or purification tag, including a FLAG, His6, or Ig sequence; or d) sequence of another cytokine receptor protein.
a) mature protein sequence of Table 1;
b) mature protein sequence of Table 1;
c) a detection or purification tag, including a FLAG, His6, or Ig sequence; or d) sequence of another cytokine receptor protein.
6. A kit comprising a polypeptide of Claim 1, and:
a) a compartment comprising said protein or polypeptide;
or b) instructions for use or disposal of reagents in said kit.
a) a compartment comprising said protein or polypeptide;
or b) instructions for use or disposal of reagents in said kit.
7. A binding compound comprising an antigen binding site from an antibody, which specifically binds to a natural:
A) DCRS3 polypeptide of Claim 1, wherein:
a) said binding compound is in a container;
b) said polypeptide is from a human;
c) said binding compound is an Fv, Fab, or Fab2 fragment;
d) said binding compound is conjugated to another chemical moiety; or e) said antibody:
i) is raised against a peptide sequence of a mature polypeptide of Table 1;
ii) is raised against a mature DCRS3;
iii) is raised to a purified human DCRS3;
iv) is immunoselected;
v) is a polyclonal antibody;
vi) binds to a denatured DCRS3;
vii) exhibits a Kd to antigen of at least 30 µM;
viii) is attached to a solid substrate, including a bead or plastic membrane;
ix) is in a sterile composition; or x) is detectably labeled, including a radioactive or fluorescent label; or B) DCRS4 polypeptide of Claim 1, wherein:
a) said binding compound is in a container;
b) said polypeptide is from a human;
c) said binding compound is an Fv, Fab, or Fab2 fragment;
d) said binding compound is conjugated to another chemical moiety; or e) said antibody:
i) is raised against a peptide sequence of a mature polypeptide of Table 3;
ii) is raised against a mature DCRS4;
iii) is raised to a purified human DCRS4;
iv) is immunoselected;
v) is a polyclonal antibody;
vi) binds to a denatured DCRS4;
vii) exhibits a Kd to antigen of at least 30 µM;
viii) is attached to a solid substrate, including a bead or plastic membrane;
ix) is in a sterile composition; or x) is detectably labeled, including a radioactive or fluorescent label.
A) DCRS3 polypeptide of Claim 1, wherein:
a) said binding compound is in a container;
b) said polypeptide is from a human;
c) said binding compound is an Fv, Fab, or Fab2 fragment;
d) said binding compound is conjugated to another chemical moiety; or e) said antibody:
i) is raised against a peptide sequence of a mature polypeptide of Table 1;
ii) is raised against a mature DCRS3;
iii) is raised to a purified human DCRS3;
iv) is immunoselected;
v) is a polyclonal antibody;
vi) binds to a denatured DCRS3;
vii) exhibits a Kd to antigen of at least 30 µM;
viii) is attached to a solid substrate, including a bead or plastic membrane;
ix) is in a sterile composition; or x) is detectably labeled, including a radioactive or fluorescent label; or B) DCRS4 polypeptide of Claim 1, wherein:
a) said binding compound is in a container;
b) said polypeptide is from a human;
c) said binding compound is an Fv, Fab, or Fab2 fragment;
d) said binding compound is conjugated to another chemical moiety; or e) said antibody:
i) is raised against a peptide sequence of a mature polypeptide of Table 3;
ii) is raised against a mature DCRS4;
iii) is raised to a purified human DCRS4;
iv) is immunoselected;
v) is a polyclonal antibody;
vi) binds to a denatured DCRS4;
vii) exhibits a Kd to antigen of at least 30 µM;
viii) is attached to a solid substrate, including a bead or plastic membrane;
ix) is in a sterile composition; or x) is detectably labeled, including a radioactive or fluorescent label.
8. A kit comprising said binding compound of Claim 7, and:
a) a compartment comprising said binding compound; or b) instructions for use or disposal of reagents in said kit.
a) a compartment comprising said binding compound; or b) instructions for use or disposal of reagents in said kit.
9. A method of producing an antigen: antibody complex, comprising contacting under appropriate conditions a primate:
a) DCRS3 polypeptide with an antibody of Claim 7, thereby allowing said complex to form; or b) DCRS4 polypeptide with an antibody of Claim 7, thereby allowing said complex to form.
a) DCRS3 polypeptide with an antibody of Claim 7, thereby allowing said complex to form; or b) DCRS4 polypeptide with an antibody of Claim 7, thereby allowing said complex to form.
10. The method of Claim 9, wherein:
a) said complex is purified from other cytokine receptors;
b) said complex is purified from other antibody;
c) said contacting is with a sample comprising an interferon;
d) said contacting allows quantitative detection of said antigen;
e) said contacting is with a sample comprising said antibody; or f) said contacting allows quantitative detection of said antibody.
a) said complex is purified from other cytokine receptors;
b) said complex is purified from other antibody;
c) said contacting is with a sample comprising an interferon;
d) said contacting allows quantitative detection of said antigen;
e) said contacting is with a sample comprising said antibody; or f) said contacting allows quantitative detection of said antibody.
11. A composition comprising:
a) a sterile binding compound of Claim 7, or b) said binding compound of Claim 7 and a carrier, wherein said carrier is:
i) an aqueous compound, including water, saline, and/or buffer; and/or ii) formulated for oral, rectal, nasal, topical, or parenteral administration.
a) a sterile binding compound of Claim 7, or b) said binding compound of Claim 7 and a carrier, wherein said carrier is:
i) an aqueous compound, including water, saline, and/or buffer; and/or ii) formulated for oral, rectal, nasal, topical, or parenteral administration.
12. An isolated or recombinant nucleic acid encoding said:
A) DCRS3 polypeptide of Claim 1, wherein said:
a) DCRS3 is from a human; or b) said nucleic acid:
i) encodes an antigenic peptide sequence of Table 1;
ii) encodes a plurality of antigenic peptide sequences of Table 1;
iii) exhibits identity over at least thirteen nucleotides to a natural cDNA encoding said segment;
iv) is an expression vector;
v) further comprises an origin of replication;
vi) is from a natural source;
vii) comprises a detectable label;
viii) comprises synthetic nucleotide sequence;
ix) is less than 6 kb, preferably less than 3 kb;
x) is from a primate;
xi) comprises a natural full length coding sequence;
xii) is a hybridization probe for a gene encoding said DCRS3; or xiii) is a PCR primer, PCR product, or mutagenesis primer; or B) DCRS4 polypeptide of Claim 1, wherein said:
a) DCRS4 is from a human; or b) said nucleic acid:
i) encodes an antigenic peptide sequence of Table 3;
ii) encodes a plurality of antigenic peptide sequences of Table 3;
iii) exhibits identity over at least thirteen nucleotides to a natural cDNA encoding said segment;
iv) is an expression vector;
v) further comprises an origin of replication;
vi) is from a natural source;
vii) comprises a detectable label;
viii) comprises synthetic nucleotide sequence;
ix) is less than 6 kb, preferably less than 3 kb;
x) is from a primate;
xi) comprises a natural full length coding sequence;
xii) is a hybridization probe for a gene encoding said DCRS4; or xiii) is a PCR primer, PCR product, or mutagenesis primer.
A) DCRS3 polypeptide of Claim 1, wherein said:
a) DCRS3 is from a human; or b) said nucleic acid:
i) encodes an antigenic peptide sequence of Table 1;
ii) encodes a plurality of antigenic peptide sequences of Table 1;
iii) exhibits identity over at least thirteen nucleotides to a natural cDNA encoding said segment;
iv) is an expression vector;
v) further comprises an origin of replication;
vi) is from a natural source;
vii) comprises a detectable label;
viii) comprises synthetic nucleotide sequence;
ix) is less than 6 kb, preferably less than 3 kb;
x) is from a primate;
xi) comprises a natural full length coding sequence;
xii) is a hybridization probe for a gene encoding said DCRS3; or xiii) is a PCR primer, PCR product, or mutagenesis primer; or B) DCRS4 polypeptide of Claim 1, wherein said:
a) DCRS4 is from a human; or b) said nucleic acid:
i) encodes an antigenic peptide sequence of Table 3;
ii) encodes a plurality of antigenic peptide sequences of Table 3;
iii) exhibits identity over at least thirteen nucleotides to a natural cDNA encoding said segment;
iv) is an expression vector;
v) further comprises an origin of replication;
vi) is from a natural source;
vii) comprises a detectable label;
viii) comprises synthetic nucleotide sequence;
ix) is less than 6 kb, preferably less than 3 kb;
x) is from a primate;
xi) comprises a natural full length coding sequence;
xii) is a hybridization probe for a gene encoding said DCRS4; or xiii) is a PCR primer, PCR product, or mutagenesis primer.
13. A cell or tissue comprising said recombinant nucleic acid of Claim 12.
14. The cell of Claim 13, wherein said cell is:
a) a prokaryotic cell;
b) a eukaryotic cell;
c) a bacterial cell;
d) a yeast cell;
e) an insect cell;
f) a mammalian cell;
g) a mouse cell;
h) a primate cell; or i) a human cell.
a) a prokaryotic cell;
b) a eukaryotic cell;
c) a bacterial cell;
d) a yeast cell;
e) an insect cell;
f) a mammalian cell;
g) a mouse cell;
h) a primate cell; or i) a human cell.
15. A kit comprising said nucleic acid of Claim 12, and:
a) a compartment comprising said nucleic acid;
b) a compartment further comprising a primate DCRS3 or DCRS4 polypeptide; or c) instructions for use or disposal of reagents in said kit.
a) a compartment comprising said nucleic acid;
b) a compartment further comprising a primate DCRS3 or DCRS4 polypeptide; or c) instructions for use or disposal of reagents in said kit.
16. A nucleic acid which:
a) hybridizes under wash conditions 30 minutes at 30°C
and less than 2M salt to the coding portion of SEQ ID
NO: 1 or 24; or b) exhibits identity over a stretch at least about 30 nucleotides to a primate DCRS3;
a) hybridizes under wash conditions of 30 minutes at 30°C
and less than 2M salt to the coding portion of SEQ ID
NO: 4, 27, or 30; or b) exhibits identity over a stretch of at least about 30 nucleotides to a primate DCRS4.
a) hybridizes under wash conditions 30 minutes at 30°C
and less than 2M salt to the coding portion of SEQ ID
NO: 1 or 24; or b) exhibits identity over a stretch at least about 30 nucleotides to a primate DCRS3;
a) hybridizes under wash conditions of 30 minutes at 30°C
and less than 2M salt to the coding portion of SEQ ID
NO: 4, 27, or 30; or b) exhibits identity over a stretch of at least about 30 nucleotides to a primate DCRS4.
17. The nucleic acid of Claim 16, wherein:
a) said wash conditions are at 45° C and/or 500 mM salt;
or b) said stretch is at least 55 nucleotides.
a) said wash conditions are at 45° C and/or 500 mM salt;
or b) said stretch is at least 55 nucleotides.
18. The nucleic acid of Claim 16, wherein:
a) said wash conditions are at 45° C and/or 150 mM salt;
or b) said stretch is at least 75 nucleotides.
a) said wash conditions are at 45° C and/or 150 mM salt;
or b) said stretch is at least 75 nucleotides.
19. A method of modulating physiology or development of a cell or tissue culture cells comprising contacting said cell with an agonist or antagonist of a mammalian DCRS3 or DCRS4.
20. The method of Claim 19, wherein said cell is transformed with a nucleic acid encoding a DCRS3 or DCRS4 and another cytokine receptor subunit.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US44306099A | 1999-11-18 | 1999-11-18 | |
US09/443,060 | 1999-11-18 | ||
US17032099P | 1999-12-13 | 1999-12-13 | |
US60/170,320 | 1999-12-13 | ||
PCT/US2000/031363 WO2001036467A2 (en) | 1999-11-18 | 2000-11-16 | Mammalian receptor proteins; related reagents and methods |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2392109A1 true CA2392109A1 (en) | 2001-05-25 |
Family
ID=26865975
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002392109A Abandoned CA2392109A1 (en) | 1999-11-18 | 2000-11-16 | Mammalian receptor proteins; related reagents and methods |
Country Status (6)
Country | Link |
---|---|
EP (1) | EP1230368A2 (en) |
JP (1) | JP2003523179A (en) |
AU (1) | AU1919201A (en) |
CA (1) | CA2392109A1 (en) |
MX (1) | MXPA02005058A (en) |
WO (1) | WO2001036467A2 (en) |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6057128A (en) | 1998-03-17 | 2000-05-02 | Genetics Institute, Inc. | MU-1, member of the cytokine receptor family |
US7198789B2 (en) | 1998-03-17 | 2007-04-03 | Genetics Institute, Llc | Methods and compositions for modulating interleukin-21 receptor activity |
AU2253301A (en) | 1999-12-03 | 2001-06-12 | Zymogenetics Inc. | Human cytokine receptor |
US20030170823A1 (en) | 1999-12-23 | 2003-09-11 | Presnell Scott R. | Novel cytokine ZCYTO18 |
WO2002012345A2 (en) | 2000-08-08 | 2002-02-14 | Zymogenetics, Inc. | Soluble zcytor 11 cytokine receptors |
US7094570B2 (en) | 2003-03-11 | 2006-08-22 | Ludwig Institute For Cancer Research | Isolated nucleic acid molecules which encode a soluble IL-TIF/IL-22 receptor or binding protein which binds to IL-TIF/IL-22, and uses thereof |
US7268223B2 (en) | 2000-09-22 | 2007-09-11 | Wyeth | Isolated nucleic acid molecules which encode a soluble IL-TIF receptor or binding protein which binds to IL-TIF/IL-22, and uses thereof |
EP1191035A3 (en) * | 2000-09-25 | 2002-04-17 | Schering Aktiengesellschaft | Three members of the cytokin-receptor class II family |
EP1404831A2 (en) * | 2001-01-12 | 2004-04-07 | Genetics Institute, LLC | Type 2 cytokine receptor and nucleic acids encoding same |
JP2004532624A (en) | 2001-03-02 | 2004-10-28 | ザイモジェネティクス,インコーポレイティド | Mouse cytokine receptor |
WO2002077174A2 (en) | 2001-03-27 | 2002-10-03 | Zymogenetics, Inc. | Human cytokine receptor |
JP4504023B2 (en) | 2002-03-22 | 2010-07-14 | ザイモジェネティクス インコーポレーティッド | Anti-IL-TIF antibodies and methods for use in inflammation |
GB0216661D0 (en) * | 2002-07-17 | 2002-08-28 | Ares Trading Sa | Protein |
KR20050119120A (en) | 2003-03-14 | 2005-12-20 | 와이어쓰 | Antibodies against human il-21 receptor and uses therefor |
MXPA05010134A (en) | 2003-03-24 | 2005-11-16 | Zymogenetics Inc | Anti-il-20 antibodies and binding partners and methods of using in inflammation. |
EP1812476B1 (en) | 2004-10-22 | 2010-07-21 | ZymoGenetics, Inc. | Anti-il-22ra antibodies and binding partners and methods of using in inflammation |
NZ712762A (en) | 2005-12-02 | 2017-03-31 | Genentech Inc | Compositions and methods for the treatment of diseases and disorders associated with cytokine signaling involving antibodies that bind to il-22 and il-22r |
US12012441B2 (en) | 2020-10-26 | 2024-06-18 | Neptune Biosciences Llc | Engineered human IL-21 cytokines and methods for using the same |
WO2023187201A1 (en) * | 2022-04-01 | 2023-10-05 | Universite Catholique De Louvain | Thrombopoietin receptor fragments and uses thereof |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5198359A (en) * | 1989-03-07 | 1993-03-30 | Boehringer Ingelheim International Gmbh | Recombinant protein receptor for il-2 |
US5945511A (en) * | 1997-02-20 | 1999-08-31 | Zymogenetics, Inc. | Class II cytokine receptor |
US5965704A (en) * | 1997-08-05 | 1999-10-12 | Zymogenetics, Inc. | Class two cytokine receptor-11 |
WO1999040195A1 (en) * | 1998-02-06 | 1999-08-12 | Schering Corporation | Mammalian receptor proteins; related reagents and methods |
AU5104799A (en) * | 1998-08-04 | 2000-02-28 | Regeneron Pharmaceuticals, Inc. | Novel orphan cytokine receptors |
EP1124851A1 (en) * | 1998-11-06 | 2001-08-22 | Smithkline Beecham Corporation | Hnovilr |
AU5142300A (en) * | 1999-05-18 | 2000-12-05 | Millennium Pharmaceuticals, Inc. | Il-9/il-2 receptor-like molecules and uses thereof |
AU2253301A (en) * | 1999-12-03 | 2001-06-12 | Zymogenetics Inc. | Human cytokine receptor |
CA2395369C (en) * | 1999-12-23 | 2014-07-08 | Zymogenetics, Inc. | Cytokine zcyto18 |
-
2000
- 2000-11-16 EP EP00982122A patent/EP1230368A2/en not_active Withdrawn
- 2000-11-16 JP JP2001538956A patent/JP2003523179A/en not_active Withdrawn
- 2000-11-16 WO PCT/US2000/031363 patent/WO2001036467A2/en not_active Application Discontinuation
- 2000-11-16 MX MXPA02005058A patent/MXPA02005058A/en unknown
- 2000-11-16 CA CA002392109A patent/CA2392109A1/en not_active Abandoned
- 2000-11-16 AU AU19192/01A patent/AU1919201A/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
AU1919201A (en) | 2001-05-30 |
EP1230368A2 (en) | 2002-08-14 |
WO2001036467A2 (en) | 2001-05-25 |
MXPA02005058A (en) | 2002-11-07 |
WO2001036467A3 (en) | 2002-05-10 |
JP2003523179A (en) | 2003-08-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2001261351B2 (en) | Mammalian receptor proteins; related reagents and methods | |
US20050009145A1 (en) | Mammalian receptor proteins; related reagents and methods | |
EP1315743A2 (en) | Mammalian genes; related reagents and methods | |
EP0977779A1 (en) | Mammalian cytokines; related reagents and methods | |
AU2001261351A1 (en) | Mammalian receptor proteins; related reagents and methods | |
WO1999040195A1 (en) | Mammalian receptor proteins; related reagents and methods | |
CA2392109A1 (en) | Mammalian receptor proteins; related reagents and methods | |
CA2374391C (en) | Mammalian receptor proteins; related reagents and methods | |
CA2323236A1 (en) | Human receptor proteins; related reagents and methods | |
US7812126B2 (en) | DIRS1 polypeptides | |
US20050106673A1 (en) | Mammalian receptor proteins; related reagents and methods | |
US6326472B1 (en) | Human receptor proteins; related reagents and methods | |
AU2007202362B2 (en) | Mammalian receptor proteins; related reagents and methods | |
MXPA00008848A (en) | Human receptor proteins;related reagents and methods |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Dead |