EP1169447A1 - Genes and expression products from hematopoietic cells - Google Patents
Genes and expression products from hematopoietic cells
Info
- Publication number
- EP1169447A1, EP00922147A
- Authority
- EP
- European Patent Office
- Prior art keywords
- sequence
- polypeptide
- dna
- cells
- sequences
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 108090000623 proteins and genes Proteins 0.000 title claims description 141
- 230000014509 gene expression Effects 0.000 title claims description 60
- 210000003958 hematopoietic stem cell Anatomy 0.000 title description 13
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 126
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 121
- 229920001184 polypeptide Polymers 0.000 claims abstract description 119
- 238000000034 method Methods 0.000 claims abstract description 56
- 102000040430 polynucleotide Human genes 0.000 claims abstract description 53
- 108091033319 polynucleotide Proteins 0.000 claims abstract description 53
- 239000002157 polynucleotide Substances 0.000 claims abstract description 53
- 241000282414 Homo sapiens Species 0.000 claims abstract description 47
- 230000008569 process Effects 0.000 claims abstract description 5
- 230000001965 increasing effect Effects 0.000 claims abstract description 4
- 210000004027 cell Anatomy 0.000 claims description 141
- 108020004414 DNA Proteins 0.000 claims description 70
- 239000012634 fragment Substances 0.000 claims description 62
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 52
- 108091026890 Coding region Proteins 0.000 claims description 48
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 28
- 150000007523 nucleic acids Chemical class 0.000 claims description 25
- 102000039446 nucleic acids Human genes 0.000 claims description 23
- 108020004707 nucleic acids Proteins 0.000 claims description 23
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 11
- 239000002609 medium Substances 0.000 claims description 8
- 210000002901 mesenchymal stem cell Anatomy 0.000 claims description 8
- 230000012010 growth Effects 0.000 claims description 7
- 230000000295 complement effect Effects 0.000 claims description 6
- 239000001963 growth medium Substances 0.000 claims description 5
- 241000124008 Mammalia Species 0.000 claims description 4
- 150000001875 compounds Chemical class 0.000 claims description 3
- 238000000338 in vitro Methods 0.000 claims description 3
- 108090000144 Human Proteins Proteins 0.000 claims description 2
- 102000003839 Human Proteins Human genes 0.000 claims description 2
- 230000003053 immunization Effects 0.000 claims 1
- 230000000638 stimulation Effects 0.000 claims 1
- 230000002759 chromosomal effect Effects 0.000 abstract description 14
- 238000013507 mapping Methods 0.000 abstract description 13
- 201000010099 disease Diseases 0.000 abstract description 11
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 11
- 230000035772 mutation Effects 0.000 abstract description 8
- 230000003394 haemopoietic effect Effects 0.000 abstract description 6
- 239000002243 precursor Substances 0.000 abstract description 5
- 102000004169 proteins and genes Human genes 0.000 description 65
- 108020004635 Complementary DNA Proteins 0.000 description 53
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 53
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 53
- 239000002299 complementary DNA Substances 0.000 description 46
- 238000010804 cDNA synthesis Methods 0.000 description 45
- 239000013598 vector Substances 0.000 description 35
- 239000000047 product Substances 0.000 description 31
- 230000000875 corresponding effect Effects 0.000 description 22
- 210000000349 chromosome Anatomy 0.000 description 19
- 150000001413 amino acids Chemical class 0.000 description 18
- 210000001185 bone marrow Anatomy 0.000 description 18
- 239000013604 expression vector Substances 0.000 description 18
- 239000013615 primer Substances 0.000 description 18
- 239000003153 chemical reaction reagent Substances 0.000 description 17
- 108020004999 messenger RNA Proteins 0.000 description 16
- 239000013612 plasmid Substances 0.000 description 16
- 210000001519 tissue Anatomy 0.000 description 15
- 230000001580 bacterial effect Effects 0.000 description 14
- 210000000130 stem cell Anatomy 0.000 description 14
- 238000004458 analytical method Methods 0.000 description 13
- 101150101112 7 gene Proteins 0.000 description 12
- 210000004700 fetal blood Anatomy 0.000 description 12
- 108700026244 Open Reading Frames Proteins 0.000 description 11
- 239000000523 sample Substances 0.000 description 11
- 238000002360 preparation method Methods 0.000 description 10
- 238000000746 purification Methods 0.000 description 10
- 238000009396 hybridization Methods 0.000 description 9
- 210000005087 mononuclear cell Anatomy 0.000 description 9
- 239000002773 nucleotide Substances 0.000 description 9
- 125000003729 nucleotide group Chemical group 0.000 description 9
- 238000013518 transcription Methods 0.000 description 9
- 230000035897 transcription Effects 0.000 description 9
- 239000008280 blood Substances 0.000 description 8
- 210000000601 blood cell Anatomy 0.000 description 8
- 239000003623 enhancer Substances 0.000 description 8
- 230000002068 genetic effect Effects 0.000 description 8
- 102000005962 receptors Human genes 0.000 description 8
- 108020003175 receptors Proteins 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 7
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 210000004369 blood Anatomy 0.000 description 7
- 238000002955 isolation Methods 0.000 description 7
- 239000000463 material Substances 0.000 description 7
- 238000000636 Northern blotting Methods 0.000 description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 description 6
- 125000000539 amino acid group Chemical group 0.000 description 6
- 230000003321 amplification Effects 0.000 description 6
- 230000027455 binding Effects 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 239000003550 marker Substances 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 239000000126 substance Substances 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 5
- 108091092195 Intron Proteins 0.000 description 5
- 241001465754 Metazoa Species 0.000 description 5
- 238000010240 RT-PCR analysis Methods 0.000 description 5
- 230000000692 anti-sense effect Effects 0.000 description 5
- 230000000694 effects Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 230000011132 hemopoiesis Effects 0.000 description 5
- 210000003917 human chromosome Anatomy 0.000 description 5
- 230000000813 microbial effect Effects 0.000 description 5
- 230000028327 secretion Effects 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 230000002103 transcriptional effect Effects 0.000 description 5
- 239000003981 vehicle Substances 0.000 description 5
- 102000004127 Cytokines Human genes 0.000 description 4
- 108090000695 Cytokines Proteins 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 108091034117 Oligonucleotide Proteins 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 230000010261 cell growth Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 230000004069 differentiation Effects 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 239000000284 extract Substances 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 238000004519 manufacturing process Methods 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 4
- 230000003248 secreting effect Effects 0.000 description 4
- 238000001890 transfection Methods 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 3
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 101000958041 Homo sapiens Musculin Proteins 0.000 description 3
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 3
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 3
- 210000001744 T-lymphocyte Anatomy 0.000 description 3
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108091007433 antigens Proteins 0.000 description 3
- 102000036639 antigens Human genes 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 239000000872 buffer Substances 0.000 description 3
- -1 clones Substances 0.000 description 3
- 239000013599 cloning vector Substances 0.000 description 3
- 239000003636 conditioned culture medium Substances 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 102000046949 human MSC Human genes 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000008488 polyadenylation Effects 0.000 description 3
- 102000054765 polymorphisms of proteins Human genes 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 230000000717 retained effect Effects 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- 230000004936 stimulating effect Effects 0.000 description 3
- 238000006467 substitution reaction Methods 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 230000014621 translational initiation Effects 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- 108091093088 Amplicon Proteins 0.000 description 2
- 108020005544 Antisense RNA Proteins 0.000 description 2
- XKRFYHLGVUSROY-UHFFFAOYSA-N Argon Chemical compound [Ar] XKRFYHLGVUSROY-UHFFFAOYSA-N 0.000 description 2
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 241000699800 Cricetinae Species 0.000 description 2
- 239000003155 DNA primer Substances 0.000 description 2
- 102100031780 Endonuclease Human genes 0.000 description 2
- 102000004190 Enzymes Human genes 0.000 description 2
- 108090000790 Enzymes Proteins 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 2
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 2
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 2
- HVLSXIKZNLPZJJ-TXZCQADKSA-N HA peptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HVLSXIKZNLPZJJ-TXZCQADKSA-N 0.000 description 2
- 101001030211 Homo sapiens Myc proto-oncogene protein Proteins 0.000 description 2
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 2
- 101100335081 Mus musculus Flt3 gene Proteins 0.000 description 2
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 2
- 101710160107 Outer membrane protein A Proteins 0.000 description 2
- 102000011755 Phosphoglycerate Kinase Human genes 0.000 description 2
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 2
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 238000002105 Southern blotting Methods 0.000 description 2
- 241000187747 Streptomyces Species 0.000 description 2
- 108700005078 Synthetic Genes Proteins 0.000 description 2
- 101001099217 Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8) Triosephosphate isomerase Proteins 0.000 description 2
- 108091036066 Three prime untranslated region Proteins 0.000 description 2
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 230000008827 biological function Effects 0.000 description 2
- 239000012472 biological sample Substances 0.000 description 2
- 229910000389 calcium phosphate Inorganic materials 0.000 description 2
- 239000001506 calcium phosphate Substances 0.000 description 2
- 235000011010 calcium phosphates Nutrition 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 230000011712 cell development Effects 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 238000003200 chromosome mapping Methods 0.000 description 2
- 239000003184 complementary RNA Substances 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002708 enhancing effect Effects 0.000 description 2
- 239000012894 fetal calf serum Substances 0.000 description 2
- 230000001605 fetal effect Effects 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- 210000004754 hybrid cell Anatomy 0.000 description 2
- 210000004408 hybridoma Anatomy 0.000 description 2
- 230000002163 immunogen Effects 0.000 description 2
- 238000007901 in situ hybridization Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 238000012423 maintenance Methods 0.000 description 2
- 201000001441 melanoma Diseases 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 210000005259 peripheral blood Anatomy 0.000 description 2
- 239000011886 peripheral blood Substances 0.000 description 2
- 210000001778 pluripotent stem cell Anatomy 0.000 description 2
- 230000035755 proliferation Effects 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 2
- 210000003954 umbilical cord Anatomy 0.000 description 2
- 238000001262 western blot Methods 0.000 description 2
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- 108010005465 AC133 Antigen Proteins 0.000 description 1
- 102000005908 AC133 Antigen Human genes 0.000 description 1
- 102100031585 ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Human genes 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- 208000036762 Acute promyelocytic leukaemia Diseases 0.000 description 1
- 108020004491 Antisense DNA Proteins 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 208000011691 Burkitt lymphomas Diseases 0.000 description 1
- 101150118155 Cd34 gene Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 208000010833 Chronic myeloid leukaemia Diseases 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- 102000007644 Colony-Stimulating Factors Human genes 0.000 description 1
- 108010071942 Colony-Stimulating Factors Proteins 0.000 description 1
- 206010052360 Colorectal adenocarcinoma Diseases 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 241000701022 Cytomegalovirus Species 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 1
- 108010092408 Eosinophil Peroxidase Proteins 0.000 description 1
- 102100028471 Eosinophil peroxidase Human genes 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 1
- 241000700662 Fowlpox virus Species 0.000 description 1
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 1
- 102100036263 Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Human genes 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 1
- 102000004457 Granulocyte-Macrophage Colony-Stimulating Factor Human genes 0.000 description 1
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 1
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- 101000777636 Homo sapiens ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Proteins 0.000 description 1
- 101100220044 Homo sapiens CD34 gene Proteins 0.000 description 1
- 101001001786 Homo sapiens Glutamyl-tRNA(Gln) amidotransferase subunit C, mitochondrial Proteins 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 206010058467 Lung neoplasm malignant Diseases 0.000 description 1
- 108091027974 Mature messenger RNA Proteins 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 101100336468 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gem-1 gene Proteins 0.000 description 1
- 108091092724 Noncoding DNA Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 108010004729 Phycoerythrin Proteins 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 239000002202 Polyethylene glycol Substances 0.000 description 1
- 208000006664 Precursor Cell Lymphoblastic Leukemia-Lymphoma Diseases 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 239000013614 RNA sample Substances 0.000 description 1
- 230000004570 RNA-binding Effects 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000256248 Spodoptera Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 101150006914 TRP1 gene Proteins 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000004873 anchoring Methods 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000003816 antisense DNA Substances 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 229910052786 argon Inorganic materials 0.000 description 1
- 238000010420 art technique Methods 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 102000015736 beta 2-Microglobulin Human genes 0.000 description 1
- 108010081355 beta 2-Microglobulin Proteins 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 208000019065 cervical carcinoma Diseases 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- 230000014107 chromosome localization Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 229940047120 colony stimulating factors Drugs 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 210000001608 connective tissue cell Anatomy 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 230000001351 cycling effect Effects 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 231100000676 disease causative agent Toxicity 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 210000002889 endothelial cell Anatomy 0.000 description 1
- 230000000925 erythroid effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 239000012847 fine chemical Substances 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 230000009395 genetic defect Effects 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 238000012254 genetic linkage analysis Methods 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 239000003102 growth factor Substances 0.000 description 1
- 208000014951 hematologic disease Diseases 0.000 description 1
- 230000002489 hematologic effect Effects 0.000 description 1
- 208000018706 hematopoietic system disease Diseases 0.000 description 1
- 229920000140 heteropolymer Polymers 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 102000053563 human MYC Human genes 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 238000002169 hydrotherapy Methods 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000028993 immune response Effects 0.000 description 1
- 206010022000 influenza Diseases 0.000 description 1
- 239000003112 inhibitor Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 101150066555 lacZ gene Proteins 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 201000005296 lung carcinoma Diseases 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 210000004698 lymphocyte Anatomy 0.000 description 1
- 208000003747 lymphoid leukemia Diseases 0.000 description 1
- 210000003738 lymphoid progenitor cell Anatomy 0.000 description 1
- 239000006166 lysate Substances 0.000 description 1
- 230000002934 lysing effect Effects 0.000 description 1
- 238000010841 mRNA extraction Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000031864 metaphase Effects 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 238000013508 migration Methods 0.000 description 1
- 230000005012 migration Effects 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 238000007479 molecular analysis Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 210000000865 mononuclear phagocyte system Anatomy 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 210000003643 myeloid progenitor cell Anatomy 0.000 description 1
- 239000005445 natural material Substances 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 229910052759 nickel Inorganic materials 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 229940046166 oligodeoxynucleotide Drugs 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 230000011164 ossification Effects 0.000 description 1
- 230000009818 osteogenic differentiation Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 230000007030 peptide scission Effects 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 210000004180 plasmocyte Anatomy 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002987 primer (paints) Substances 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 230000002062 proliferating effect Effects 0.000 description 1
- 229940121649 protein inhibitor Drugs 0.000 description 1
- 239000012268 protein inhibitor Substances 0.000 description 1
- 230000030788 protein refolding Effects 0.000 description 1
- 208000009305 pseudorabies Diseases 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 238000009256 replacement therapy Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 210000002966 serum Anatomy 0.000 description 1
- 239000004017 serum-free culture medium Substances 0.000 description 1
- 238000001542 size-exclusion chromatography Methods 0.000 description 1
- 231100001055 skeletal defect Toxicity 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- YZHUMGUJCQRKBT-UHFFFAOYSA-M sodium chlorate Chemical compound [Na+].[O-]Cl(=O)=O YZHUMGUJCQRKBT-UHFFFAOYSA-M 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- LNKSLLXPNOTUHF-UHFFFAOYSA-M sodium;2-amino-2-(hydroxymethyl)propane-1,3-diol;dodecyl sulfate Chemical compound [Na+].OCC(N)(CO)CO.CCCCCCCCCCCCOS([O-])(=O)=O LNKSLLXPNOTUHF-UHFFFAOYSA-M 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 238000004448 titration Methods 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000005945 translocation Effects 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/52—Cytokines; Lymphokines; Interferons
Definitions
- This invention relates to newly identified polynucleotide sequences corresponding to transcription products of human genes, to complete gene sequences associated therewith, to gene expression products thereof, and to uses for the foregoing, especially where these involve hematopoiesis and the bone marrow microenvironment. More specifically, the invention disclosed herein relates to a novel gene that is expressed in CD34+ hematopoietic stem and/or progenitor cells (HSPCs) but not in CD34- hematopoietic cells.
- HSPCs: hematopoietic stem and/or progenitor cells
- circulating blood cells are products of the terminal differentiation of a number of determined precursor cells.
- precursor cells for example, precursors of white cells, red cells and platelets
- Marrow films and biopsy specimens have contributed much information about the condition of the hematopoietic process in the living organism, especially humans.
- researchers have defined a pluripotent stem cell, which can give rise to any and all types of blood cells. In particular, this pluripotent stem cell, found in the marrow, will differentiate along one of two well defined pathways.
- the stem cell will differentiate into either a myeloid stem cell, ultimately giving rise to all of the final differentiated blood cells except B and T lymphocytes, or will differentiate into a lymphoid stem cell, which itself will eventually differentiate into either plasma cells or T lymphocytes (T cells).
- T cells: T lymphocytes
- CD has been understood as describing either "cluster designation" or "cluster of differentiation" and refers to a molecule recognized by a "cluster" of monoclonal antibodies useful in identifying the stage of differentiation of the cells and thus in distinguishing one class of hematological cells from another, including cells operating at different stages of the hematopoietic process.
- CD34 is a protein of about 105 to 120 kD in size and is present on hematopoietic stem cells.
- HSPC: human hematopoietic stem/progenitor cells
- BM: bone marrow
- MNC: mononuclear cells
- CD34+ HSPC are also found in neonatal cord blood (CB), and are enriched in peripheral blood after mobilization by cytokines and chemotherapy (Krause et al., Blood, 87, 1 (1996)).
- CD34+ cells account for ~1-4% of MNC in BM and ~0.5% in CB and mobilized peripheral blood (mPB).
- Purified CD34+ cells can engraft in bone marrow and generate blood/lymphoid cells for years in patients after transplantation.
- CD34+ cells can form colonies of hematopoietic cells in culture.
- CD34- cells: mononuclear cells that lack CD34 expression
- CD34- cells are largely mature hematopoietic cells of various differentiated lineages, and have lost the ability to form colonies in culture (Krause et al., 1996).
- the invention herein discloses diagnostic and therapeutic applications of a novel secreted protein and its encoding gene, the latter gene being specifically expressed in CD34+ hematopoietic stem/progenitor cells.
- hematopoietic stem/progenitor cells are part of a relatively rare population of mononuclear cells present in bone marrow and in blood from the umbilical cord and which express the CD34 cell surface antigen (such cells being denoted CD34+).
- CD34+: cells bearing the CD34 cell surface antigen
- the CD34 gene is not exclusively expressed in HSPCs (for example, endothelial cells also express high levels of CD34).
- blood cells obtained from umbilical cord were found to express a novel protein if the cells possessed the CD34 cell surface antigen; the protein was not expressed if CD34 was not detected on the cells.
- Figure 1 shows the nucleotide sequence for the novel gene disclosed according to the present invention (SEQ ID NO: 1), which contains the open reading frame (SEQ ID NO: 2) corresponding to the deduced protein of Figure 2 and the two relevant DpnII restriction sites useful in cloning the gene and which provide the fragment utilized.
- Figure 1B is a continuation of Figure 1A.
- Figure 2 shows the deduced amino acid sequence (SEQ ID NO: 3) for the open reading frame of the sequence disclosed in Figure 1 and thus for the C17 protein.
- Figure 3 shows the effect of C17 protein on proliferation of mesenchymal stem cells in serum-free culture.
- the C17 protein allows for proliferation rates equivalent to that supported by serum-containing medium.
- One aspect of the present invention is directed to nucleic acids and isolated DNA sequences and molecules, and fragments thereof (and corresponding isolated RNA sequences, and fragments thereof) showing sequence homology with, or capable of hybridizing to, the DNA sequence identified in Figure 1 (SEQ ID NO: 1).
- the present invention is also directed to fragments or portions of such sequences which contain at least 15 bases, preferably at least 30 bases, more preferably at least 50 bases and most preferably at least 80 bases, and to those sequences which are at least 60%, preferably at least 80%, and most preferably at least 95% identical thereto, and to DNA (or RNA) sequences encoding the same polypeptide as the sequence of Figure 1, including fragments and portions thereof and, when derived from natural sources, includes alleles thereof.
- the term "percent identity" or "percent identical," when referring to a sequence, means that a sequence is compared to a claimed or described sequence after alignment of the sequence to be compared (the "Compared Sequence") with the described or claimed sequence (the "Reference Sequence").
- C is the number of differences between the Reference Sequence and the Compared Sequence over the length of alignment between the Reference Sequence and the Compared Sequence wherein (i) each base or amino acid in the Reference Sequence that does not have a corresponding aligned base or amino acid in the Compared Sequence and (ii) each gap in the Reference Sequence and (iii) each aligned base or amino acid in the Reference Sequence that is different from an aligned base or amino acid in the Compared Sequence, constitutes a difference; and R is the number of bases or amino acids in the Reference Sequence over the length of the alignment with the Compared Sequence with any gap created in the Reference Sequence also being counted as a base or amino acid.
- the Compared Sequence has the specified minimum percent identity to the Reference Sequence even though alignments may exist in which the hereinabove calculated Percent Identity is less than the specified Percent Identity.
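The definitions of C and R above can be turned into a small calculation. The sketch below is illustrative only: it assumes the two sequences are already aligned to equal length with `-` as the gap character, and it assumes the conventional form Percent Identity = 100 × [1 − (C/R)], which the surviving definitions of C and R imply but which does not appear verbatim in this text. The function name `percent_identity` is hypothetical.

```python
def percent_identity(reference: str, compared: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences.

    Assumption: Percent Identity = 100 * (1 - C / R), with C and R as
    defined in the text. Both inputs must already be aligned (same length).
    """
    if len(reference) != len(compared):
        raise ValueError("aligned sequences must have the same length")

    # Drop columns in which both sequences carry a gap; they hold no information.
    columns = [
        (r, c)
        for r, c in zip(reference.upper(), compared.upper())
        if not (r == gap and c == gap)
    ]
    if not columns:
        raise ValueError("alignment is empty")

    # R: reference positions over the alignment, with every gap created in the
    # Reference Sequence also counted, i.e. every remaining alignment column.
    r_count = len(columns)

    # C: a column is a difference when (i) a reference residue has no aligned
    # partner, (ii) the reference has a gap, or (iii) the residues differ.
    # All three cases reduce to "the two symbols in the column are not equal".
    c_count = sum(1 for r, c in columns if r != c)

    return 100.0 * (1.0 - c_count / r_count)


if __name__ == "__main__":
    print(percent_identity("ACGT-CGT", "ACGTACGA"))
```

In the toy alignment, one reference gap and one mismatch over eight columns give C = 2 and R = 8.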
- a further aspect of the present invention is directed to a DNA sequence (as well as the corresponding RNA sequence) which is or contains a DNA sequence identical to one contained in Figure 1 (SEQ ID NO: 1).
- a DNA sequence according to the present invention is hybridizable under stringent conditions with a DNA sequence identified in Figure 1 and set forth in the Sequence Listing (SEQ ID NO: 1).
- stringent conditions means hybridization will occur only if there is at least 97% identity between the sequences.
- Yet another aspect of the present invention is directed to an isolated DNA (or RNA) sequence or molecule comprising at least the coding region of a human gene (or a DNA sequence encoding the same polypeptide as such coding region), in particular an expressed human gene, which human gene comprises a DNA sequence homologous with, or contributing to, the sequence depicted in Figure 1 (SEQ ID NO: 1), or one at least 90%, preferably at least 95%, and most preferably at least 98%, identical thereto, as well as fragments or portions of the coding region which encode a polypeptide having a similar function to the polypeptide encoded by said coding region.
- the isolated DNA (or RNA) sequence can include only the coding region of the expressed gene (or fragment or portion thereof as hereinabove indicated) or can further include all or a portion of the non-coding DNA (or RNA) of the expressed human gene.
- sequences homologous with and contributing to the sequence shown in Figure 1 are from the coding region of a human gene.
- the present invention also relates to vectors or plasmids which include such DNA (or RNA) sequences, as well as the use of the DNA (or RNA) sequences.
- The sequence depicted in Figure 1 (SEQ ID NO: 1) is hybridizable with actual DNA and RNA sequences as derived from different human tissues. The distribution of this sequence in various human tissues was determined from database matchings for other human sequences.
- the polynucleotides of the present invention may be in the form of RNA or in the form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA.
- the DNA may be double-stranded or single-stranded, and if single-stranded may be the coding strand or non-coding (anti-sense) strand.
- the coding sequence which encodes the mature polypeptide may be identical to the coding sequence shown in Figure 1 or may be a different coding sequence, which coding sequence, as a result of the redundancy or degeneracy of the genetic code, encodes the same mature polypeptide as the DNA of Figure 1 (SEQ ID NO: 1).
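To illustrate the degeneracy point, the sketch below translates two different coding sequences with the standard genetic code and shows that they yield the same peptide. The two short sequences are made-up toy inputs, not taken from Figure 1, and the helper name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code, written in the usual T, C, A, G codon order.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(coding_dna: str) -> str:
    """Translate a coding strand (5'->3') up to the first stop codon."""
    dna = coding_dna.upper()
    peptide = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the polypeptide
            break
        peptide.append(amino_acid)
    return "".join(peptide)


if __name__ == "__main__":
    # Toy sequences that differ at several nucleotides (illustrative only).
    first = "ATGCTGAGCTAA"
    second = "ATGTTATCTTGA"
    print(translate(first), translate(second))
```

Both toy sequences encode Met-Leu-Ser even though they differ at six nucleotide positions, which is the sense in which a "different coding sequence" can encode the same mature polypeptide.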
- the polynucleotide which codes for the polypeptide of Figure 2 may include, but is not limited to: only the coding sequence for the mature polypeptide; the coding sequence for the mature polypeptide and additional coding sequence such as a leader or secretory sequence, a proprotein sequence and a membrane anchor; the coding sequence for the mature polypeptide (and optionally additional coding sequence) and non-coding sequence, such as introns or non-coding sequence 5' and/or 3' of the coding sequence for the mature polypeptide.
- polynucleotide as used for the present invention encompasses a polynucleotide which includes only coding sequence for the polypeptide as well as a polynucleotide which includes additional coding and/or non-coding sequences.
- the present invention further relates to variants of the hereinabove described polynucleotides which encode for fragments, analogs and derivatives of the polypeptide having the amino acid sequence of Figure 2 (SEQ ID NO: 3).
- Variants of the polynucleotide may be naturally occurring allelic variants of the polynucleotide or a non-naturally occurring variant of the polynucleotide.
- nucleic acids, or polynucleotides, according to the present invention may have coding sequences which are naturally occurring allelic variants of the coding sequence shown in Figure 1.
- an allelic variant is an alternate form of a polynucleotide sequence which may have a substitution, deletion or addition of one or more nucleotides, which does not substantially alter the function of the encoded polypeptide.
- the present invention also includes polynucleotides, wherein the coding sequence for the mature polypeptide may be fused in the same reading frame to a polynucleotide sequence which aids in expression and secretion of a polypeptide from a host cell, for example, a leader sequence which functions as a secretory sequence for controlling transport of a polypeptide from the cell and a transmembrane anchor which facilitates attachment of the polypeptide to a cellular membrane.
- the polypeptide having a leader sequence is a preprotein and may have the leader sequence cleaved by the host cell to form the mature polypeptide.
- the polynucleotides may also encode for a proprotein which is the mature protein plus additional 5' amino acid residues.
- a mature protein having a prosequence is a proprotein and is often an inactive form of the protein. Once the prosequence is cleaved an active mature protein remains.
- a polynucleotide according to the present invention may code for a mature protein, for a protein having a prosequence, for a protein having a transmembrane anchor or for a polypeptide having a prosequence, a presequence (leader sequence) and a transmembrane anchor.
- the polynucleotides of the present invention may also have the coding sequence fused in frame to a marker sequence which allows for purification of the polypeptide of the present invention.
- the marker sequence may be a hexa-histidine tag supplied by a pQE-9 vector to provide for purification of the mature polypeptide fused to the marker in the case of a bacterial host, or, for example, the marker sequence may be a hemagglutinin (HA) tag when a mammalian host, e.g. COS-7 cells, is used.
- the HA tag corresponds to an epitope derived from the influenza hemagglutinin protein (Wilson, I., et al., Cell, 37:767 (1984)).
- Fragments of the full length polynucleotide of the present invention may be used as hybridization probes for a cDNA library to isolate the full length cDNA and to isolate other cDNAs which have a high sequence similarity to the gene or similar biological activity.
- Probes of this type preferably have at least 15 bases, may have at least 30 bases and even 50 or more bases.
- the probe may also be used to identify a cDNA clone corresponding to a full-length transcript and a genomic clone or clones that contain the complete gene including regulatory and promoter regions, exons, and introns.
- An example of a screen comprises isolating the coding region of the gene by using the known DNA sequence to synthesize an oligonucleotide probe. Labeled oligonucleotides having a sequence complementary to that of the gene of the present invention are used to screen a library of human cDNA, genomic DNA or mRNA to determine which members of the library the probe hybridizes to.
- a polynucleotide according to the present invention may have at least 15 bases, preferably at least 30 bases, and more preferably at least 50 bases which hybridize to a polynucleotide of the present invention and which has an identity thereto, as hereinabove described, and which may or may not retain activity.
- Such polynucleotides may be employed as probes for the polynucleotide of Figure 1, for example, for recovery of the polynucleotide or as a diagnostic probe or as a PCR primer.
- polynucleotides according to the present invention may also occur in the form of mixtures of polynucleotides hybridizable to some extent with the sequence of Figure 1 (SEQ ID NO: 1), including any and all fragments thereof, and which polynucleotide mixtures may be composed of any number of such polynucleotides, or fragments thereof, including mixtures having at least 10, perhaps at least 30 such sequences, or fragments thereof.
- coding regions comprise only a small portion of the human genome
- identification and mapping of transcribed regions and coding regions of chromosomes is of significant interest.
- human sequences are valuable for chromosome mapping, human identification, identification of tissue type and origin, forensic identification, and locating disease-associated genes (i.e., genes that are associated with an inherited human disease, whether through mutation, deletion, or faulty gene expression) on the chromosome.
- Various aspects of the present invention include each of the individual sequences, corresponding partial and complete cDNAs, genomic DNA, mRNA, antisense strands, PCR primers, coding regions, and constructs.
- Expression vectors and polypeptide expression products are also within the scope of the present invention, along with antibodies, especially monoclonal antibodies, to such expression products.
- cistron means the segment of DNA (or DNA segment) involved in producing a polypeptide chain; it includes regions preceding and following the coding region (5'- and 3'-untranslated regions, or UTRs, also called leader and trailer sequences, regions, or segments) as well as intervening sequences (introns) between individual coding segments (exons), which intronic regions are typically removed during processing of post-transcriptional RNA to form the final translatable mRNA product. Of course, by their nature, cDNAs contain no intronic sequences.
- DNA segment refers to a DNA polymer, in the form of a separate fragment or as a component of a larger DNA construct, which has been derived from DNA isolated at least once in substantially pure form, i.e., free of contaminating endogenous materials and in a quantity or concentration enabling identification, manipulation, and recovery of the segment and its component nucleotide sequences by standard biochemical methods, for example, using a cloning vector.
- segments are provided in the form of an open reading frame uninterrupted by internal nontranslated sequences, or introns, which are typically present in eukaryotic genes. Sequences of non-translated DNA may be present downstream from the open reading frame, where the same do not interfere with manipulation or expression of the coding regions.
- nucleic acids and polypeptide expression products disclosed according to the present invention may be in "enriched form."
- enriched means that the concentration of the material is at least about 2, 5, 10, 100, or 1000 times its natural concentration (for example), advantageously 0.01%, by weight, preferably at least about 0.1% by weight. Enriched preparations of about 0.5%, 1%, 5%, 10%, and 20% by weight are also contemplated.
- sequences, constructs, vectors, clones, and other materials comprising the present invention can advantageously be in enriched or isolated form.
- DNA and RNA sequences, and polypeptides, disclosed in accordance with the present invention will commonly be in isolated form.
- isolated means that the material is removed from its original environment (e.g., the natural environment if it is naturally occurring).
- a naturally-occurring polynucleotide or DNA present in a living animal is not isolated, but the same polynucleotide or DNA, separated from some or all of the coexisting materials in the natural system, is isolated.
- DNA could be part of a vector and/or such polynucleotide could be part of a composition, and still be isolated in that such vector or polynucleotide is not part of its natural environment.
- the DNA and RNA sequences, or polypeptides, disclosed in accordance with the present invention may also be in "purified" form.
- the term "purified" does not require absolute purity; rather, it is intended as a relative definition, and can include preparations that are highly purified or preparations that are only partially purified, as those terms are understood by those of skill in the relevant art.
- Individual clones isolated from a cDNA library have been conventionally purified to electrophoretic homogeneity.
- the cDNA clones are obtained via manipulation of a partially purified naturally occurring substance (messenger RNA). By conversion of mRNA into a cDNA library, pure individual cDNA clones can be isolated from the synthetic library by clonal selection.
- creating a cDNA library from RNA and subsequently isolating individual clones from that library results in an approximately 10^6-fold purification of the native message.
- Purification of starting material or natural material to at least one order of magnitude, preferably two or three orders, and more preferably four or five orders of magnitude is expressly contemplated.
- a claimed polynucleotide which has a purity of preferably 0.001%, or at least 0.01% or 0.1%, and even desirably 1% by weight or greater, is expressly contemplated.
- coding region refers to that portion of a human gene which either naturally or normally codes for the expression product of that gene in its natural genomic environment, i.e., the region coding in vivo for the native expression product of the gene.
- the coding region can be from a normal, mutated or altered gene, or can even be from a DNA sequence, or gene, wholly synthesized in the laboratory using methods well known to those of skill in the art of DNA synthesis.
- nucleotide sequence refers to a heteropolymer of deoxyribonucleotides.
- DNA segments encoding the proteins provided by this invention are assembled from cDNA fragments and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a synthetic gene which is capable of being expressed in a recombinant transcriptional unit comprising regulatory elements derived from a microbial or viral operon.
- expression product means that polypeptide or protein that is the natural transcription product of the gene and any nucleic acid sequence coding equivalents resulting from genetic code degeneracy and thus coding for the same amino acid(s).
- fragment when referring to a coding sequence, means a portion of DNA comprising less than the complete human coding region whose expression product retains essentially the same biological function or activity as the expression product of the complete coding region.
- primer means a short nucleic acid sequence that is paired with one strand of DNA and provides a free 3'OH end at which a DNA polymerase starts synthesis of a deoxyribonucleotide chain.
- promoter means a region of DNA involved in binding of RNA polymerase to initiate transcription.
- ORF: open reading frame
- exon means any segment of an interrupted gene that is represented in the mature RNA product.
- reference to a DNA sequence includes both single stranded and double stranded DNA.
- specific sequence, unless the context indicates otherwise, refers to the single-stranded DNA of such sequence, the duplex of such sequence with its complement (double-stranded DNA) and the complement of such sequence.
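The relationship between a single strand and its complement can be sketched in a few lines. This is a minimal illustration that assumes the complement is reported 5'->3' (the reverse complement); the function name `reverse_complement` and the example input are hypothetical, and only the four unambiguous DNA bases are handled.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(strand: str) -> str:
    """Return the complementary strand, written 5'->3'."""
    return strand.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    single_strand = "ATGC"  # toy input, not from the Sequence Listing
    print(single_strand, reverse_complement(single_strand))
```

Applying the function twice returns the original strand, which is a convenient sanity check when handling both strands of a duplex.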
- the overall approach to identification of cDNAs involved with the mesenchymal differentiation process in hMSCs involved measurement of gene expression during osteogenic differentiation of the cells as grown in culture.
- Cells were harvested and the total RNA content thereof was recovered.
- reverse transcriptase and polymerase chain reaction procedures were used to produce and amplify the corresponding cDNAs, which were then screened to find regulated DNA sequences that were subsequently purified and cloned. These clones were then sequenced and used to determine a consensus sequence (one based upon the most commonly occurring bases at each nucleotide position in a sequence after the contributing sequences are aligned by residue position).
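The consensus rule described above (most commonly occurring base at each aligned position) can be sketched as follows. The sketch assumes the clone sequences have already been aligned to equal length; the function name `consensus` and the three toy reads are hypothetical. Ties are resolved by first occurrence, which is an arbitrary choice made here for illustration.

```python
from collections import Counter


def consensus(aligned_sequences: list[str]) -> str:
    """Majority-rule consensus of equal-length, pre-aligned sequences."""
    if not aligned_sequences:
        raise ValueError("at least one sequence is required")
    length = len(aligned_sequences[0])
    if any(len(seq) != length for seq in aligned_sequences):
        raise ValueError("sequences must be aligned to the same length")

    result = []
    for column in zip(*aligned_sequences):
        # most_common(1) returns the most frequent base in this column.
        base, _count = Counter(column).most_common(1)[0]
        result.append(base)
    return "".join(result)


if __name__ == "__main__":
    reads = ["ACGTAC", "ACGTTC", "ACCTAC"]  # toy aligned clone reads
    print(consensus(reads))
```

In the toy input, each of the two discordant positions is outvoted two to one, so the consensus equals the first read.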
- Probes based on these cDNAs were used to identify the relevant transcripts, using Northern Blotting Analysis methods well known in the art.
- the nucleotide sequence disclosed according to the present invention was found to be expressed in CD34-bearing cells of cord blood as well as of bone marrow, with the full transcript being about 1.1 kb (as determined by Northern Blot Hybridization Analysis).
- the sequence contained an open reading frame coding for a polypeptide of 136 amino acids (the latter showing no significant homology to any of the known proteins in GenBank and was therefore considered to be novel).
- Hydropathy analysis of the deduced peptide sequence indicates a signal peptide of 19 amino acids at the N-terminus of the protein, suggesting that it is secreted by CD34+ cells.
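A hydropathy analysis of this kind is commonly done with a sliding-window average over a per-residue hydrophobicity scale. The text does not say which scale or window was used, so the sketch below assumes the Kyte-Doolittle scale and a caller-chosen window; the function name `hydropathy_profile` and the toy peptide are hypothetical, and the peptide is not the C17 sequence.

```python
# Kyte-Doolittle hydropathy values (assumed scale; not named in the text).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein: str, window: int = 9) -> list[float]:
    """Mean hydropathy of every window of `window` consecutive residues."""
    residues = protein.upper()
    if not 1 <= window <= len(residues):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [KYTE_DOOLITTLE[aa] for aa in residues]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


if __name__ == "__main__":
    # Toy peptide: hydrophobic N-terminal stretch followed by charged residues.
    profile = hydropathy_profile("LLLLLKKKKK", window=5)
    print([round(value, 2) for value in profile])
```

A sustained run of strongly positive window averages at the N-terminus is the kind of feature that would suggest a signal peptide; dedicated signal-peptide predictors apply additional criteria beyond this simple average.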
- Each of the DNA sequences identified herein can be used in numerous ways as polynucleotide reagents.
- the sequences can be used as diagnostic probes for the presence of a specific mRNA in a particular cell type as well as in genetic linkage analysis (polymorphisms). Further, the sequences can be used as probes for locating gene regions associated with genetic disease.
- the nucleotide and gene sequences of the present invention are also valuable for chromosome identification. Each sequence is specifically targeted to and can hybridize with a particular location on an individual human chromosome. Moreover, there is a current need for identifying particular sites on the chromosome.
- the mapping of the polynucleotides to specific chromosomes according to the present invention is an important first step in correlating those sequences with genes associated with disease, such as diseases affecting bone formation or skeletal abnormalities.
- sequences can be mapped to chromosomes by preparing PCR primers (preferably 15-30 bp) from the sequences disclosed herein. Computer analysis of these sequences is used to rapidly select primers that do not span more than one exon in the corresponding genomic DNA, which would otherwise complicate the amplification process. These primers are then used for PCR screening of somatic cell hybrids containing individual human chromosomes. Only those hybrids containing the human gene corresponding to the sequences or subsequences disclosed herein will yield an amplified fragment.
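The primer-selection step can be sketched as a filter over candidate windows. The sketch assumes the exon-exon junction positions within the cDNA are already known (for example from a genomic alignment); those positions, the function name `single_exon_primer_starts`, and the numbers in the demo are hypothetical. It checks only the single-exon constraint and ignores melting temperature, GC content, and other real primer-design criteria.

```python
def single_exon_primer_starts(
    cdna_length: int,
    junctions: list[int],
    primer_length: int = 20,
) -> list[int]:
    """Start positions of primers that lie entirely within one exon.

    A junction at position j means one exon ends at index j - 1 and the
    next begins at index j (0-based cDNA coordinates).
    """
    if not 15 <= primer_length <= 30:
        raise ValueError("primer length should be 15-30 bases, per the text")

    starts = []
    for start in range(cdna_length - primer_length + 1):
        end = start + primer_length  # exclusive end of the primer window
        # The window spans two exons if a junction falls strictly inside it.
        if not any(start < j < end for j in junctions):
            starts.append(start)
    return starts


if __name__ == "__main__":
    # Toy cDNA of 50 bases with one assumed junction at position 25.
    print(single_exon_primer_starts(50, [25], primer_length=20))
```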
- PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular sequence to a particular chromosome. Three or more clones can be assigned per day using a single thermal cycler, as is well known in the art. Using the present invention with the same oligonucleotide primers, sublocalization can be achieved with panels of fragments from specific chromosomes or pools of large genomic clones in an analogous manner.
- Other mapping strategies that can similarly be used to map a sequence, or part of a sequence, to its chromosome include in situ hybridization, prescreening with labeled flow-sorted chromosomes and preselection by hybridization to construct chromosome specific-cDNA libraries.
- Fluorescence in situ hybridization (FISH) of a cDNA clone to a metaphase chromosomal spread can be used to provide a precise chromosomal location in one step.
- This technique can be used with cDNA as short as 500 or 600 bases; however, clones larger than 2,000 bp have a higher likelihood of binding to a unique chromosomal location with sufficient signal intensity for simple detection.
- FISH requires use of the clone from which the sequence was derived, and the longer the better. For example, 2,000 bp is good, 4,000 is better, but more than 4,000 is probably not necessary to get good results a reasonable percentage of the time.
- Reagents for chromosome mapping can be used individually (to mark a single chromosome or a single site on that chromosome) or as panels of reagents (for marking multiple sites and/or multiple chromosomes). Reagents corresponding to noncoding regions of the genes actually are preferred for mapping purposes. Coding sequences are more likely to be conserved within gene families, thus increasing the chance of cross hybridizations during chromosomal mapping.
- a cDNA precisely localized to a chromosomal region associated with the disease could be one of between 50 and 500 potential causative genes. (This assumes 1 megabase mapping resolution and one gene per 20 kb.)
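The estimate above follows from dividing the size of the mapped interval by the assumed gene density. A minimal sketch of that arithmetic is given here; the function name is hypothetical, and the 10-megabase case is an inference added to show what interval size would give the upper figure of 500 at the same density, not a value stated in the text.

```python
def candidate_genes(interval_bp, bp_per_gene=20_000):
    """Approximate number of genes in a mapped interval, assuming a
    uniform density of one gene per `bp_per_gene` bases."""
    return interval_bp // bp_per_gene


print(candidate_genes(1_000_000))   # 1 megabase resolution -> 50
print(candidate_genes(10_000_000))  # 10 megabases (inferred case) -> 500
```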
- Comparison of affected and unaffected individuals generally involves first looking for structural alterations in the chromosomes, such as deletions or translocations that are visible from chromosome spreads or detectable using PCR based on that cDNA sequence. Ultimately, complete sequencing of genes from several individuals is required to confirm the presence of a mutation and to distinguish mutations from polymorphisms.
- sequences of the invention can be used to control gene expression through triple helix formation or antisense DNA or RNA, both of which methods are based on binding of a polynucleotide sequence to DNA or RNA.
- Polynucleotides suitable for use in these methods are usually 20 to 40 bases in length and are designed to be complementary to a region of the gene involved in transcription (triple helix - see Lee et al, Nucl.
- the present invention is also a useful tool in gene therapy, which requires isolation of the disease-associated gene in question as a prerequisite to the insertion of a normal gene into an organism to correct a genetic defect.
- the high specificity of the cDNA probes according to this invention has promise of targeting such gene locations in a highly accurate manner.
- sequences of the present invention are also useful for identification of individuals from minute biological samples.
- the United States military, for example, is considering the use of restriction fragment length polymorphism (RFLP) for identification of its personnel.
- an individual's genomic DNA is digested with one or more restriction enzymes, and probed on a Southern blot to yield unique bands for identifying personnel.
- This method does not suffer from the current limitations of "Dog Tags" which can be lost, switched, or stolen, making positive identification difficult.
- the sequences of the present invention are useful as additional DNA markers for RFLP.
- RFLP is a pattern based technique, which does not require the DNA sequence of the individual to be sequenced.
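The pattern-based idea behind RFLP can be illustrated with a toy in-silico digest: a single base change that destroys a restriction site changes the set of fragment lengths, without the sequence itself having to be read. The sketch below is an assumption-laden illustration, not the procedure of the disclosure. It uses the EcoRI recognition site (GAATTC, cut after the first G), considers one strand of a linear molecule only, and uses invented allele sequences.

```python
def fragment_lengths(seq, site="GAATTC", cut_offset=1):
    """Cut a linear sequence at every occurrence of `site`, `cut_offset`
    bases into the site, and return the fragment lengths in order."""
    cuts = []
    pos = seq.find(site)
    while pos != -1:
        cuts.append(pos + cut_offset)
        pos = seq.find(site, pos + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Two invented alleles; the second has lost one site through a single base change.
allele_a = "AAAA" + "GAATTC" + "CCCCCC" + "GAATTC" + "TTTT"
allele_b = "AAAA" + "GAATTC" + "CCCCCC" + "GAGTTC" + "TTTT"

print(fragment_lengths(allele_a))  # [5, 12, 9]
print(fragment_lengths(allele_b))  # [5, 21]
```

On a Southern blot only the fragments that hybridize to the probe would be visible, so the real banding pattern is a subset of these lengths.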
- Portions of the sequences of the present invention can be used to provide an alternative technique that determines the actual base-by-base DNA sequence of selected portions of an individual's genome.
- These sequences can also be used to prepare PCR primers for amplifying and isolating such selected DNA.
- One can, for example, take part of the sequence of the invention and prepare two PCR primers from the 5' and 3' ends of the sequence, or fragment of the sequence. These are used to amplify an individual's DNA, corresponding to the sequence. The amplified DNA is sequenced.
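Preparing two primers from the 5' and 3' ends of a sequence can be sketched as follows. The forward primer is taken directly from the 5' end, and the reverse primer is the reverse complement of the 3' end so that it anneals to the opposite strand. The function names, the default primer length of 20 and the template sequence are illustrative assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def end_primers(seq, length=20):
    """Return (forward, reverse) primers taken from the two ends of `seq`."""
    if len(seq) < 2 * length:
        raise ValueError("sequence too short for two non-overlapping primers")
    forward = seq[:length]                       # same sense as the 5' end
    reverse = reverse_complement(seq[-length:])  # anneals to the 3' end
    return forward, reverse


# Invented template sequence, for illustration only.
template = "ATGGCCAAGTCCGATTACGA" + "TTTTTTTTTT" + "GGCATCAGTCCAAGCTTTAA"
forward, reverse = end_primers(template)
print(forward)
print(reverse)
```

A real design would also check melting temperature and self-complementarity, which this sketch leaves out.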
- Panels of corresponding DNA sequences from individuals can provide unique individual identifications, as each individual will have a unique set of such DNA sequences, due to allelic differences.
- the sequences of the present invention can be used to particular advantage to obtain such identification sequences from individuals and from tissue. Allelic variation occurs to some degree in the coding regions of these sequences, and to a greater degree in the noncoding regions. It is estimated that allelic variation between individual humans occurs with a frequency of about once per each 500 bases.
- Each of the fragments or complete coding sequences comprising a part of the present invention can, to some degree, be used as a standard against which DNA from an individual can be compared for identification purposes. Because greater numbers of polymorphisms occur in the noncoding regions, fewer sequences are necessary to differentiate individuals.
- once a panel of reagents from the sequences according to the present invention is used to generate a unique ID database for an individual, those same reagents can later be used to identify tissue from that individual. Positive identification of that individual, living or dead, can be made from extremely small tissue samples.
- DNA-based identification techniques are also used in forensic biology.
- PCR technology can be used to amplify DNA sequences taken from very small biological samples.
- gene sequences are amplified at specific loci known to contain a large number of allelic variations, for example the DQα class II HLA gene (Erlich, H., PCR Technology, Freeman and Co. (1992)). Once this specific area of the genome is amplified, it is digested with one or more restriction
- sequences of the present invention can be used to provide polynucleotide reagents specifically targeted to additional loci in the human genome, and can enhance the reliability of DNA-based forensic identifications. Those sequences targeted to noncoding regions are particularly appropriate. As mentioned above, actual base sequence information can be used for identification as an accurate alternative to patterns formed by restriction enzyme generated fragments. Reagents for obtaining such sequence information are within the scope of the present invention. Such reagents can comprise complete genes, parts of genes or corresponding coding regions, or fragments of at least 15 bp, preferably at least 18 bp.
- reagents capable of identifying the source of a particular tissue. Such need arises, for example, in forensics when presented with tissue of unknown origin.
- Appropriate reagents can comprise, for example, DNA probes or primers specific to particular tissue prepared from the sequences of the present invention. Panels of such reagents can identify tissue by species and/or by organ type. In a similar manner, these reagents can be used to screen tissue cultures for contamination.
- Sequences that match perfectly to several different genes can be detected by hybridizing to chromosomes: if many chromosomal loci are observed, the sequence (or a close variant) is in more than one gene.
- This problem can be circumvented by using the 3'-untranslated part of the cDNA alone as a probe for the chromosomal location or for the full-length cDNA or gene.
- the 3'-untranslated region is more likely to be unique within gene families, since there is no evolutionary pressure to conserve a coding function of this region of the mRNA.
- the cDNA libraries disclosed according to the present invention ideally use directional cloning methods so that either the 5' end of the cDNA (likely to contain coding sequence) or the 3' end (likely to be a non-coding sequence) can be selectively obtained.
- the polynucleotides of the present invention can be derived from natural sources or synthesized using known methods.
- the sequences falling within the scope of the present invention are not limited to the specific sequences described, but include human allelic and species variations thereof and portions thereof.
- the invention includes the entire coding sequence associated with the specific polynucleotide sequence of bases described in the Sequence Listing, as well as portions of the entire coding sequence. Allelic variations can be routinely determined by comparison of one sequence with a sequence from another individual of the same species.
- the invention includes sequences coding for the same amino acid sequences as do the specific sequences disclosed herein. In other words, in a coding region, substitution of one codon for another which encodes the same amino acid is expressly contemplated. (Coding regions can be determined through routine sequence analysis.)
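The substitution of one codon for another that encodes the same amino acid can be checked by translating both sequences with the standard genetic code and comparing the results. The sketch below builds the standard code table in the conventional T, C, A, G order; the function names and the example sequences are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence read in frame from its first base;
    '*' marks a stop codon and trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def encodes_same_protein(seq_a, seq_b):
    """True if both coding sequences translate to the same amino acids."""
    return translate(seq_a) == translate(seq_b)


print(translate("ATGGCTTAA"))                          # MA*
print(encodes_same_protein("ATGGCTAAA", "ATGGCCAAG"))  # True
print(encodes_same_protein("ATGGCT", "ATGGTT"))        # False
```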
- in a cDNA library there are many species of mRNA represented. Each cDNA clone can be interesting in its own right, but must be isolated from the library before further experimentation can be completed. In order to sequence any specific cDNA, it must be removed and separated (i.e. isolated and purified) from all the other sequences. This can be accomplished by many techniques known to those of skill in the art. These procedures normally involve identification of a bacterial colony containing the cDNA of interest and further amplification of that bacterium. Once a cDNA is separated from the mixed clone library, it can be used as a template for further procedures such as nucleotide sequencing.
- the present invention also includes recombinant constructs comprising one or more of the sequences as broadly described above.
- the constructs comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention has been inserted, in a forward or reverse orientation.
- the construct further comprises regulatory sequences, including for example, a promoter, operably linked to the sequence.
- Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia).
- Eukaryotic: pWLneo, pSV2cat, pOG44, pXT1, pSG (Stratagene); pSVK3, pBPV, pMSG, pSVL (Pharmacia).
- the present invention is not restricted to such constructs or sequences alone but also includes expression vehicles, which may include plasmids, viruses, or any other expression vectors, including cells and liposomes, containing any of the nucleic acids, nucleotide sequences, DNAs, RNAs, or fragments thereof, as disclosed according to the present invention. Furthermore, this will be true regardless of whether such sequences are coding sequences or non- coding sequences and whether such coding sequences code for all or part of the expression products as disclosed herein, so long as such expression products, or fragments thereof, exhibit some utility in keeping with the invention disclosed herein.
- the present invention includes an isolated DNA sequence, or nucleic acid, that expresses a human protein when in a suitable expression system, for example, a cell-free, or in vitro, expression system. Such a system may also be contained in, or be part of, a suitable expression vehicle, or vector, be that a cell, a plasmid, a virus, or other operative expression vector.
- promoter region may include a promoter different from that normally associated in vivo with the genes coding for the gene expression products and proteins disclosed according to the present invention.
- Promoter regions can be selected from any desired gene using CAT (chloramphenicol acetyltransferase) vectors or other vectors with selectable markers.
- Two appropriate vectors are pKK232-8 and pCM7.
- Particular named bacterial promoters include lacI, lacZ, T3, T7, gpt, lambda PR, and trc.
- Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the level of ordinary skill in the art.
- the present invention relates to host cells containing the above-described construct(s).
- the host cell can be a higher eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
- Introduction of the construct into the host cell can be effected by calcium phosphate transfection, DEAE-dextran mediated transfection, or electroporation (Davis, L., Dibner, M., Battey, I., Basic Methods in Molecular Biology, 1986).
- the constructs in host cells can be used in a conventional manner to produce the gene product coded by the recombinant sequence.
- the encoded polypeptide, once the sequence is known from the cDNAs or from isolation of the pure product, can be synthetically produced by conventional methods of peptide synthesis, either manual or automated.
- conventional techniques in molecular biology can be used to obtain the polypeptide.
- the present invention includes all polypeptides coded for by any and each of the DNA or RNA sequences disclosed herein, including fragments of said polypeptides, as well as derivatives and functional analogs thereof.
- amino acid sequence can be synthesized using commercially available peptide synthesizers. This is particularly useful in producing small peptides and fragments of larger polypeptides. (Fragments are useful, for example, in generating antibodies against the native polypeptide.)
- the DNA encoding the desired polypeptide can be inserted into a host organism and expressed.
- the organism can be a bacterium, yeast, cell line, or multicellular plant or animal.
- the literature is replete with examples of suitable host organisms and expression techniques.
- polynucleotide DNA or mRNA
- This methodology can be used to deliver the polypeptide to the animal, or to generate an immune response against a foreign polypeptide.
- the coding sequence can be inserted into a vector, which is then used to transfect a cell.
- the cell (which may or may not be part of a larger organism) then expresses the polypeptide.
- the present invention further relates to a polypeptide which has the amino acid sequence of Figure 2 (SEQ ID NO: 3), as well as fragments, analogs and derivatives of such polypeptide.
- fragment when referring to the polypeptide of Figure 2 (SEQ ID NO: 3), means a polypeptide which retains essentially the same biological function or activity as said polypeptide.
- an analog includes a proprotein which can be activated by cleavage of the proprotein portion to produce an active mature polypeptide.
- Such fragments, derivatives and analogs must have sufficient similarity to the polypeptide of Figure 2 (SEQ ID NO: 3) so that activity of the native polypeptide is retained.
- the polypeptide of the present invention may be a recombinant polypeptide, a natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide.
- Recombinant means that a protein is derived from recombinant (e.g., microbial or mammalian) expression systems.
- Microbial refers to recombinant proteins made in bacterial or fungal (e.g., yeast) expression systems.
- recombinant microbial defines a protein essentially free of native endogenous substances and unaccompanied by associated native glycosylation. Protein expressed in most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; protein expressed in yeast will have a glycosylation pattern different from that expressed in mammalian cells.
- the fragment, derivative or analog of the polypeptide of Figure 2 may be (i) one in which one or more of the amino acid residues are substituted with a conserved or non-conserved amino acid residue (preferably a conserved amino acid residue) and such substituted amino acid residue may or may not be one encoded by the genetic code, or (ii) one in which one or more of the amino acid residues includes a substituent group, or (iii) one in which the mature polypeptide is fused with another compound, such as a compound to increase the half-life of the polypeptide (for example, polyethylene glycol), or (iv) one in which the additional amino acids are fused to the mature polypeptide, such as a leader or secretory sequence or a sequence which is employed for purification of the mature polypeptide or a proprotein sequence.
- Such fragments, derivatives and analogs are deemed to be within the abilities of those skilled in the art in view of the teachings herein.
- polypeptides of the present invention are preferably provided in an isolated form, and preferably are purified to homogeneity. When applied to polypeptides, the term "isolated" has its already stated meaning.
- polypeptides of the present invention include the polypeptide of Figure 2 (in particular the mature polypeptide) as well as polypeptides which have at least 90% identity, more preferably at least 95% identity, and still more preferably at least 98% identity to the polypeptide of Figure 2 (SEQ ID NO: 3). They also include portions of such polypeptides, with each such portion generally containing at least 30 amino acids and more preferably at least 50 amino acids.
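The identity thresholds above can be illustrated with a simple position-by-position comparison. This is a minimal sketch under a strong assumption: the two sequences are already aligned and have equal length, with no gaps. The disclosure does not specify an alignment method, and the function names and example peptides are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percentage of identical positions between two pre-aligned,
    equal-length amino acid sequences (gap handling is not modelled)."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


def identity_tier(pct):
    """Return the highest threshold (98, 95 or 90) that `pct` meets, else None."""
    for threshold in (98, 95, 90):
        if pct >= threshold:
            return threshold
    return None


pct = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(pct, identity_tier(pct))  # 90.0 90
```

Sequences of different lengths would first need a pairwise alignment, and the identity value can then depend on how gaps are counted.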
- Fragments or portions of the polypeptides of the present invention may be employed for producing the corresponding full-length polypeptide by peptide synthesis; therefore, the fragments may be employed as intermediates for producing the full-length polypeptides. Fragments or portions of the polynucleotides of the present invention may be used to synthesize full-length polynucleotides of the present invention.
- the polypeptide disclosed in Figure 2 has growth stimulating activity when present in an in vitro growth medium containing human mesenchymal stem cells.
- stem cells in the presence of the polypeptide disclosed herein, are induced to replicate at a faster rate (as shown in Figure 3).
- recombinant C17 protein, expressed by 293 cells, was affinity purified and added to human MSC (hMSC) cultures.
- the hMSCs, maintained in serum-free conditions, typically exhibit little basal proliferative activity.
- Dose titrations of fetal calf serum (FBS) were used as a positive control.
- Recombinant C17 protein stimulated hMSC growth by about 10-fold compared to serum-free media, and at levels equivalent to 10% fetal calf serum.
- the C17 polypeptide would be present in the medium at a concentration of at least 1 picogram (pg) per ml of medium.
- the present invention also relates to vectors which include polynucleotides of the present invention, host cells which are genetically engineered with vectors of the invention and the production of polypeptides of the invention by recombinant techniques.
- Host cells are genetically engineered (transduced or transformed or transfected) with the vectors of this invention which may be, for example, a cloning vector or an expression vector, either of which may be in the form of a plasmid, a viral particle, a phage, etc.
- the engineered host cells can be cultured in conventional nutrient media modified as appropriate for activating promoters, selecting transformants or amplifying the genes of the present invention.
- the culture conditions such as temperature, pH and the like, are those previously used with the host cell selected for expression, and will be apparent to the ordinarily skilled artisan.
- the polynucleotides of the present invention may be employed for producing polypeptides by recombinant techniques.
- the polynucleotide may be included in any one of a variety of expression vectors for expressing a polypeptide.
- Such vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, and pseudorabies.
- any other vector may be used as long as it is replicable and viable in the host.
- an appropriate DNA sequence or segment may be inserted into the vector by a variety of procedures.
- the DNA sequence is inserted into the appropriate restriction endonuclease site(s) by procedures known in the art. Such procedures and others are deemed to be within the scope of those skilled in the art.
- the DNA sequence in the expression vector is operatively linked to an appropriate expression control sequence(s) (for example, a promoter sequence) to direct mRNA synthesis.
- promoters are: LTR or SV40 promoter, the E. coli lac or trp, the phage lambda PL promoter and other promoters known to control expression of genes in prokaryotic or eukaryotic cells or their viruses.
- the expression vector also contains a ribosome binding site for translation initiation and a transcription terminator.
- the vector may also include appropriate sequences for amplifying expression.
- the expression vectors preferably contain one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin resistance in E. coli.
- the vector containing the appropriate DNA sequence as hereinabove described, as well as an appropriate promoter or control sequence, may be employed to transform an appropriate host to permit the host to express the protein.
- as representative examples of appropriate hosts, there may be mentioned: bacterial cells, such as E. coli, Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells such as Drosophila S2 and Spodoptera Sf9; animal cells such as CHO, COS or Bowes melanoma; adenoviruses; plant cells, etc.
- the selection of an appropriate host is deemed to be within the scope of those skilled in the art from
- Recombinant expression vehicle or vector refers to a plasmid or phage or virus or vector, for expressing a polypeptide from a
- the expression vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcription initiation and termination sequences.
- Structural units intended for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling extracellular secretion of translated protein by a host cell.
- where recombinant protein is expressed without a leader or transport sequence, it may include an N-terminal methionine residue. This residue may or may not be subsequently cleaved from the expressed recombinant protein to provide a final product.
- Recombinant expression system means host cells which have stably integrated a recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional unit extrachromosomally.
- the cells can be prokaryotic or eukaryotic.
- Recombinant expression systems as defined herein will express heterologous protein upon induction of the regulatory elements linked to the DNA segment or synthetic gene to be expressed.
- Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free translation systems can also be employed to produce such proteins using RNAs derived from the DNA constructs of the present invention.
- Transcription of the DNA encoding the polypeptides according to the present invention by higher eukaryotes can be increased by insertion of an enhancer sequence into the vector.
- enhancers have been known for some time and are cis-acting elements of DNA, usually anywhere from 10 to 300 bp, that act on a promoter to increase transcription. Common examples include the SV40 enhancer, the cytomegalovirus early promoter enhancer, the polyoma enhancer and the enhancers found in adenovirus.
- recombinant expression vectors will include origins of replication and selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct transcription of a downstream structural sequence.
- promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), α-factor, acid phosphatase, or heat shock proteins, among others.
- the heterologous structural sequence is assembled in appropriate phase with translation initiation and termination sequences, and preferably, a leader sequence capable of directing secretion of translated protein into the periplasmic space or extracellular medium.
- the heterologous sequence can encode a fusion protein including an N-terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant product.
- Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a desired protein together with suitable translation initiation and termination signals in operable reading phase with a functional promoter.
- the vector will comprise one or more phenotypic selectable markers and an origin of replication to ensure maintenance of the vector and to, if desirable, provide amplification within the host.
- Suitable prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be employed as a matter of choice.
- useful expression vectors for bacterial use can comprise a selectable marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements of the well known cloning vector pBR322 (ATCC 37017).
- Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, WI, USA). These pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be expressed.
- the selected promoter is derepressed by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period.
- Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification.
- Various mammalian cell culture systems can also be employed to express recombinant protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell, 23:175 (1981), and other cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines.
- Mammalian expression vectors will comprise an origin of replication, a suitable promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed sequences.
- DNA sequences derived from the SV40 viral genome (for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites) may be used to provide the required nontranscribed genetic elements.
- Recombinant protein produced in bacterial culture is conveniently isolated by initial extraction from cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein refolding steps can be used, as necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents.
- the protein, its fragments or other derivatives, or analogs thereof, or cells expressing them, can be used as an immunogen to produce antibodies thereto.
- These antibodies can be, for example, polyclonal, monoclonal, chimeric, single chain, Fab fragments, or the product of an Fab expression library.
- Various procedures known in the art may be used for the production of polyclonal antibodies.
- Antibodies generated against the polypeptide corresponding to a sequence of the present invention can be obtained by direct injection of the polypeptide into an animal or by administering the polypeptide to an animal, preferably a nonhuman. The antibody so obtained will then bind the polypeptide itself.
- any technique which provides antibodies produced by continuous cell line cultures can be used. Examples include the hybridoma technique (Kohler and Milstein, 1975, Nature, 256:495-497), the trioma technique, the human B-cell hybridoma technique (Kozbor et al., 1983, Immunology Today 4:72), and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole, et al., 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96).
- the antibodies can be used in methods relating to the localization and activity of the protein sequences of the invention, e.g., for imaging these proteins, measuring levels thereof in appropriate physiological samples and the like.
- Figure 2 will permit those skilled in the art to readily locate appropriate receptors on the surfaces of the mesenchymal stem cells, as well as other cell types, and thereby confer the ability to regulate growth of such cells.
- the present invention also encompasses sequences homologous to the disclosed nucleotide and polypeptide sequences; it will of course be possible to derive structurally similar analogs containing similar functional domains, including small molecules that can mimic the functions of the C17 protein without themselves being proteinaceous in structure.
- small organic molecules may easily be developed by molecular modeling, using computer programs and algorithms, or by combinatorial methods, to mimic the domains of the C17 protein disclosed herein.
- Such mimicking structures are also considered to be encompassed by the disclosure of the present invention.
- Such chemicals can be readily synthesized and added to cell growth media, thereby stimulating the relevant receptors and enhancing the rate of cell growth. Such methods of enhancing cell growth are likewise deemed to be within the bounds of the invention disclosed herein.
- Such growth effects can easily be used to locate cell-growth stimulating receptors on the surfaces of cells.
- cells can be grown in a suitable medium to which has been added an appropriate amount of a labeled C17 protein, or homolog thereof, or small chemical analog thereof, and it can then be determined whether said homolog, or analog, stimulates the growth of the cells.
- the analog can also be introduced in a suitably labeled form, typically chemically labeled by the usual means well known to chemists, such labeling including both radiolabeled and nonradiolabeled methods, and then allowed to remain in the medium for various periods of time to allow for possible binding to a surface receptor on the surface of the cells so as to locate such receptors.
- This can then be followed by use of common isolation techniques to permit isolating, identification and characterization of the receptors, be they surface receptors or otherwise. In so doing, the growth-stimulating receptors of various cell types can be determined.
- buffers, media, reagents, cells, culture conditions and the like are not intended to be limiting, but are to be read so as to include all related materials that one of ordinary skill in the art would recognize as being of interest or value in the particular context in which that discussion is presented. For example, it is often possible to substitute one buffer system or culture medium for another and still achieve similar, if not identical, results. Those of skill in the art will have sufficient knowledge of such systems and methodologies so as to be able, without undue experimentation, to make such substitutions as will optimally serve their purposes in using the methods and procedures disclosed herein.
- MNC from 3-4 units of CB were pooled and labeled with an anti-CD34 antibody (clone QBEND/10) provided in the CD34 Progenitor Cell Isolation Kit (Miltenyi Biotec, Auburn, CA). Up to 2 billion MNC were passed through an LS+ column assembled in the VarioMACS system (Miltenyi Biotec). The CD34+ cells that were labeled with magnetic beads and retained in the column were isolated by eluting them from the column after its removal from the magnet. To ensure the elimination of CD34+ cells from the flow-through (FT) fraction, these FT cells were passed through a second column as before. The FT fraction after this double depletion was used as the CD34− cell population.
- FT: flow-through
- CD34+ cells isolated from the first and second columns were pooled. The CD34+ cell content of each population was monitored by fluorescence-activated cell sorting (FACS) staining (see below). The majority of cells were immediately lysed with TRIzol reagent (Gibco/BRL, Gaithersburg, MD).
- CD34+ cells from the bone marrow of healthy donors were isolated similarly by the PureCell Company (San Mateo, CA), according to federal and state regulations. We also used human CD34+ cells from mPB from healthy volunteers. Five days after consecutive G-CSF treatment, leukapheresed blood cells were obtained. CD34+ and CD34− cells were isolated similarly, using the Isolex-300 system (Nexell/Baxter, Irvine, CA). Bone marrow-derived mesenchymal stem cells (MSC) were isolated and expanded in culture as described in the literature (Pittenger et al., Science, 284:143 (1999)).
- CB cells before and after cell isolation were labeled with an R-phycoerythrin (R-PE)-conjugated CD34 antibody (clone HPCA-2, Becton Dickinson Immunology Systems [BDIS], San Jose, CA).
- R-PE: R-phycoerythrin
- HPCA-2 recognizes a different CD34 epitope from that recognized by QBEND/10, which is used to purify the cells.
- Antibody-labeled cells were analyzed with a BDIS FACS Calibur or Vantage instrument equipped with an argon ion laser tuned to 488 nm. Specific CD34 staining of individual MNC was recorded in the FL2 channel (for R-PE). Non-specific staining (background) was 0.1%.
- RNA samples derived from CD34+ and CD34− cells were always processed in parallel.
- Representational difference analysis (RDA) amplicon preparation and subtractive hybridization were done as described in the literature (Lisitsyn et al., Science, 259:946-951 (1993); Hubank and Schatz, Nucleic Acids Research, 22:5640-5648 (1994)), except that shorter PCR cycles (95°C, 30 sec; 72°C, 2 min) were used for preparation of amplicons (before subtraction) and difference products (after subtraction). After three rounds of subtraction, distinct bands were apparent in an agarose gel.
- The third (and final) difference products were digested with DpnII (to remove the adapter and generate GATC overhangs), and then cloned into a BamHI-digested pUC18 vector. More than 500 clones were obtained after we transformed the DH5α strain of E. coli with a small aliquot of the ligated DNA. Initially, 55 individual clones were randomly picked. The inserts of individual clones were PCR amplified and sequenced. The sequences were searched first against the GenBank non-redundant (NR) database using the BLASTN and BLASTX algorithms (Altschul et al., Nucleic Acids Res.
- The primers for CD34 cDNA amplification (298 bp) are as follows:
- CD34-5' CTGTGTCTCAACATGGCA-3' (SEQ ID NO: 4)
- CD34-3' GCCTTGATGTCACTTAGG-5' (SEQ ID NO: 5)
- The primers for C17 cDNA amplification (286 bp) are:
- C17-5' GATCACCCGCGACTTCAACC (SEQ ID NO: 6)
- C17-3' TGGCAGGACCGTAGTCACTG (SEQ ID NO: 7)
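As a quick sanity check on the C17 primer pair listed above, the following minimal sketch computes the length, GC fraction, and a rough melting temperature for each primer. The Wallace rule (2 °C per A/T, 4 °C per G/C) is a standard rule of thumb introduced here for illustration only; the source does not state how the primers were designed, and the function names are hypothetical.

```python
# Minimal sketch: basic properties of the C17 RT-PCR primers (SEQ ID NO: 6 and 7).
# The Wallace rule is a rough approximation and is an assumption of this example.

C17_FWD = "GATCACCCGCGACTTCAACC"  # C17-5' primer, as given in the text
C17_REV = "TGGCAGGACCGTAGTCACTG"  # C17-3' primer, as given in the text


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Rough melting temperature in deg C: 2*(A+T) + 4*(G+C)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


for name, primer in (("C17-5'", C17_FWD), ("C17-3'", C17_REV)):
    print(name, len(primer), f"GC={gc_fraction(primer):.2f}", f"Tm~{wallace_tm(primer)} C")
```

Both primers are 20 nt long with matching GC content, so the rule of thumb gives them the same estimated melting temperature, which is what one generally wants from a primer pair.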
- The primers for beta-2-microglobulin (β2M, as a control) cDNA amplification (270 bp) are:
- IMAGE: Integrated Molecular Analysis of Genomes and their Expression
- C17 from the IMAGE clone 786066 was amplified by PCR and cloned in-frame into the mammalian expression vector pcDNA3.1/myc-His B (Invitrogen, Carlsbad, CA), at the EcoRI and BamHI sites.
- The resulting plasmid is named pCMV.C17/myc/his.
- The recombinant C17 protein expressed from this construct is tagged with a human c-myc epitope and six histidine residues (His6) at the C-terminus (in italics below).
- VDPSSVPSFLEQKLISEEDLNSAVDHHHHHH SEQ ID NO: 10
- Human 293T-derived BOSC23 cells were transfected with the vector by calcium phosphate precipitation (Cheng et al., Nature Biotech., 14:606 (1996); Cheng et al., Gene Ther., 4:1013 (1997)). Forty-eight hours after the transfection, the cells and the conditioned media were collected. The cells were scraped from the culture dishes in the presence of a protease inhibitor cocktail (Complete™; Roche Biochemicals).
- The cells were lysed in a buffer containing 150 mM NaCl, 20 mM Tris-HCl (pH 7.4), 10% glycerol, 1% NP-40, 10 mM EDTA, 2 mM NaVO3, 100 mM NaF, and the Complete™ cocktail.
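The lysis buffer composition above can be turned into pipetting volumes with the usual dilution relation C1·V1 = C2·V2. The sketch below is illustrative only: the stock concentrations and the 50 mL batch size are assumptions that do not appear in the source, and the inhibitor cocktail (typically added separately) is left out.

```python
# Minimal sketch: volumes of stock solutions for the lysis buffer described above.
# Stock concentrations and batch size are ASSUMPTIONS for illustration only.
# Each entry is (final concentration, stock concentration) in the same unit.
COMPONENTS = {
    "NaCl (mM)": (150, 5000),
    "Tris-HCl pH 7.4 (mM)": (20, 1000),
    "glycerol (%)": (10, 100),
    "NP-40 (%)": (1, 10),
    "EDTA (mM)": (10, 500),
    "NaVO3 (mM)": (2, 200),
    "NaF (mM)": (100, 1000),
}


def stock_volumes(total_ml: float, components=COMPONENTS) -> dict:
    """Return mL of each stock (plus water) needed for total_ml of buffer."""
    volumes = {
        name: total_ml * final / stock
        for name, (final, stock) in components.items()
    }
    # Whatever volume is left is made up with water.
    volumes["water"] = total_ml - sum(volumes.values())
    return volumes


recipe = stock_volumes(50.0)  # hypothetical 50 mL batch
for name, ml in recipe.items():
    print(f"{name}: {ml:.2f} mL")
```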
- The lysates were cleared by centrifugation at 14,000 rpm for 30 min at 4°C.
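The centrifugation step is specified in rpm, which depends on the rotor. To reproduce it on different equipment, rpm is usually converted to relative centrifugal force with RCF = 1.118 × 10⁻⁵ × r × rpm², where r is the rotor radius in cm. The source does not name the rotor, so the 8 cm radius below is purely an assumption.

```python
# Minimal sketch: convert the stated 14,000 rpm into relative centrifugal force.
# The rotor radius is NOT given in the source; 8.0 cm is an assumed value.

def rcf_from_rpm(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (x g) from rotor speed and radius in cm."""
    return 1.118e-5 * radius_cm * rpm ** 2


ASSUMED_RADIUS_CM = 8.0
rcf = rcf_from_rpm(14_000, ASSUMED_RADIUS_CM)
print(f"14,000 rpm at r = {ASSUMED_RADIUS_CM} cm is about {rcf:,.0f} x g")
```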
- The cell extracts and the conditioned culture media were denatured in sample buffer under reducing conditions, and electrophoresed on a 4-20% polyacrylamide gel in SDS-Tris/glycine buffer.
- The production of rC17 was monitored by Western blot with antibodies against the c-myc or His6 epitope (from Invitrogen).
- The Stanford G3 human-hamster radiation hybrid (RH) panel was purchased from Research Genetics. Two pairs of PCR primers were designed based on the 5'-untranslated region of C17 cDNA:
- TTTGATTTTCATCACCTTTC (SEQ ID NO: 11) and CTGGTTTAATGGAGTAATGG (SEQ ID NO: 12)
- The C17 gene fragment (290 bp) was searched against dbEST using the BLASTN algorithm. See http://ncbi.nlm.nih.gov/blast/ for more details of the cDNA libraries used, score and E value. ND: not determined by the depositors, who partially sequenced the inserts either from 5' or 3' ends. At the time of the search (July 1998), a total of 2,072,964 EST (human and non-human) entries had been deposited.
- RT-PCR: reverse transcriptase-polymerase chain reaction
- C17 gene expression was readily detected in CB CD34+ cells but was undetectable in the CD34− cell population.
- A similar RT-PCR result was obtained with the cells from mPB as well as with the cells from bone marrow. Therefore, C17 gene expression is restricted to the CD34+ cell population isolated from CB, BM and mPB, three sources known to contain HSPC.
- BM CD34+ cells were cultured under two culture conditions with different cytokines. Under the first condition, BM CD34+ cells were treated with TPO, SCF and Flt3/Flk2 ligand (FL), a combination which is known to favor the maintenance of stem cells and expansion of progenitor cells (Luens et al., Blood, 91:1206-1215 (1998); Kaushansky, Blood, 92:1-3 (1998)). Under the second condition, cells were treated with five hematopoietic colony-stimulating factors (IL-3, IL-6, G-CSF, GM-CSF and EPO).
- C17 gene expression in cultured and untreated CD34+ cells was analyzed by Northern blot using the C17 RDA fragment (290 bp) as the probe. A single prominent band of ~1.0 kb was observed in untreated BM CD34+ cells as well as in cultured cells, which expressed the C17 gene at various levels. After culture for 7 to 15 days, the C17 mRNA level was elevated under condition #1 while it was slightly reduced under condition #2.
- By Northern hybridization, C17 expression was detected in human bone marrow and very weakly in lymph nodes, but was undetectable in spleen, thymus, fetal liver, and PBL.
- C17 was undetectable in another blot containing polyA+ RNA from several human cancer cell lines: HeLa S3 (cervical carcinoma), A549 (lung carcinoma), G-361 (melanoma), SW480 (colorectal adenocarcinoma), HL-60 (promyelocytic leukemia), K-562 (chronic myelogenous leukemia), Molt-4 (lymphoblastic leukemia), and Raji (Burkitt's lymphoma).
- A full-length DNA sequence of the C17 cDNA was obtained by purchasing five plasmid clones containing C17-related EST sequences (Table 1).
- The inserts of these plasmids had been partially sequenced from either the 5' or 3' end by the IMAGE Consortium members.
- The sizes of the inserts in these plasmids were determined, and the inserts were sequenced from both ends.
- The insert in IMAGE clone 786066 is the longest (~1 kb), and includes all the sequences from the other four plasmids and the C17 RDA fragment.
- The insert sequence of IMAGE clone 786066 was used for subsequent analyses.
- A putative mRNA polyadenylation signal, AATAAA, is found near the 3' end of the C17 cDNA (see, for example, SEQ ID NO: 1 at residues 979-984).
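Locating a polyadenylation signal of this kind amounts to a simple motif search. The sketch below reports 1-based, inclusive coordinates in the same style as "residues 979-984" (a 6-nt span). The full C17 cDNA sequence is not reproduced in this section, so the input is a short made-up sequence and the function name is hypothetical.

```python
# Minimal sketch: find canonical polyadenylation signals (AATAAA) in a cDNA.
# TOY_CDNA is a made-up sequence for illustration, NOT the C17 cDNA.

POLYA_SIGNAL = "AATAAA"


def find_motif(seq: str, motif: str = POLYA_SIGNAL) -> list:
    """Return (start, end) pairs, 1-based and inclusive, for every motif hit."""
    seq = seq.upper()
    hits = []
    pos = seq.find(motif)
    while pos != -1:
        hits.append((pos + 1, pos + len(motif)))
        pos = seq.find(motif, pos + 1)  # step by one so overlapping hits are kept
    return hits


TOY_CDNA = "GGCAATAAAGTTCC"
print(find_motif(TOY_CDNA))
```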
- The IMAGE clone 786066 contains a near-full-length cDNA for the C17 gene.
- The C17 cDNA contains an open reading frame of 408 nucleotides, encoding a protein of 136 amino acids (SEQ ID NO: 3).
- The presence of a Kozak sequence immediately around the first ATG suggests that it is a favorable translation start (Kozak, J. Cell Biol.
- A hydrophobicity analysis of the deduced peptide sequence shows a putative signal peptide at the N-terminus. Moreover, a defined signal peptide analysis revealed that the secretory peptide cleavage site is between the 19th and 20th amino acids thereof (Nielsen et al., Protein Engineering, 10:1-6 (1997)). There are no other hydrophobic transmembrane or GPI-anchoring signal domains in the rest of the sequence, indicating that C17 is a secreted protein.
- The C17 peptide contains 4 alpha helices, a characteristic of hematopoietic cytokines and interleukins (Bazan, Immunology Today, 11(10):350-354 (1990); Wells and de Vos, Ann. Rev. Biochem., 65:609-634 (1996)).
- The GenBank accession number for C17 is AF193766.
- The C17 protein was further characterized by cloning the C17 cDNA coding region into a mammalian expression vector to make pCMV.C17/myc/his.
- The recombinant C17 protein was tagged with both the 9E10 c-myc epitope and six histidine residues (His6) at the C-terminus, thereby facilitating detection and purification of rC17.
- Human 293T cells were transfected with the vector to allow expression of the tagged C17 gene. Forty-eight hours after transfection, both the cell extracts and the conditioned media (supernatants) from transfected cells were analyzed by Western blot using antibodies against either of the two tags.
- Anti-myc antibody recognized specific proteins in the cell extract and supernatant unique to the C17-transfected cells.
- A major 19 kD protein band is specifically recognized, which is consistent with the predicted size of 19 kD for the unprocessed, tagged C17 protein (167 amino acids total, including the signal peptide).
- A single protein band was detected, indicating that the C17 protein was indeed secreted.
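The sizes quoted above can be cross-checked with simple arithmetic: the 408-nt open reading frame gives 136 residues, the C-terminal myc/His6 tag (SEQ ID NO: 10) adds 31, and the total of 167 residues corresponds to roughly 19 kD. The sketch below uses an average of 110 Da per residue, which is a common rule of thumb and an assumption of this example rather than a value from the source.

```python
# Minimal sketch: check the stated sizes of the tagged C17 protein.
# AVG_RESIDUE_DA is a rule-of-thumb ASSUMPTION, not a value from the source.

ORF_NT = 408                                      # open reading frame length (from the text)
TAG = "VDPSSVPSFLEQKLISEEDLNSAVDHHHHHH"           # SEQ ID NO: 10, as given in the text
SIGNAL_PEPTIDE_AA = 19                            # cleavage between residues 19 and 20
AVG_RESIDUE_DA = 110.0

native_aa = ORF_NT // 3                           # residues encoded by the ORF
tagged_aa = native_aa + len(TAG)                  # unprocessed, tagged protein
processed_aa = tagged_aa - SIGNAL_PEPTIDE_AA      # after signal peptide removal

print(f"native: {native_aa} aa, tag: {len(TAG)} aa, tagged: {tagged_aa} aa")
print(f"unprocessed ~{tagged_aa * AVG_RESIDUE_DA / 1000:.1f} kD")
print(f"processed   ~{processed_aa * AVG_RESIDUE_DA / 1000:.1f} kD")
```

The estimate for the unprocessed form (about 18.4 kD) is in line with the 19 kD band reported; the processed form would be expected to run somewhat smaller under the same rough assumption.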
- rC17 was produced in large quantity by cloning C17 into a prokaryotic expression vector, pBAD/gIII (from Invitrogen).
- The putative C17 signal sequence was removed and the rest of the coding sequence was ligated in frame with a bacterial leader sequence. This allows the recombinant protein to be secreted into the periplasmic space.
- The C17 protein expressed by this vector is also tagged with the c-myc and His6 epitopes.
- Upon induction with arabinose at a concentration of 0.002% or higher, a protein of 19 kD (as predicted for the full-length rC17) was expressed at a high level.
- The radiation hybrid (RH) technique was used to map the location of the C17 gene in the human genome.
- A panel of G3 human-hamster hybrid chromosomal DNA samples was used as templates for PCR amplification with primers specific to the human C17 gene.
- The primers amplify a 200 bp fragment only if human (but not hamster) genomic DNA is present as a template.
- PCR reactions with some G3 RH DNA templates generated the predicted DNA fragment while the others failed.
- the resulting data were used to determine its chromosomal localization, based on the Stanford Human Genome Center database and algorithm (http://www-shgc.stanford.edu/).
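The idea behind radiation hybrid mapping is that a gene and a nearby marker tend to be retained (or lost) together across the hybrid panel, so their PCR-positive/negative patterns agree. The sketch below scores that agreement as a simple concordance fraction. It is a simplified stand-in: the Stanford server mentioned above uses its own statistical algorithm, and the retention patterns and marker names here are invented for illustration.

```python
# Minimal sketch: compare a gene's PCR retention pattern across an RH panel
# with those of known markers. All patterns and marker names are HYPOTHETICAL;
# real RH mapping uses statistical linkage scores rather than raw concordance.

def concordance(a: str, b: str) -> float:
    """Fraction of hybrids in which two retention patterns agree ('1' = PCR positive)."""
    if len(a) != len(b):
        raise ValueError("retention patterns must cover the same hybrids")
    return sum(x == y for x, y in zip(a, b)) / len(a)


c17_pattern = "0100110100"          # made-up results for 10 hybrids
marker_patterns = {
    "marker_A": "0100110100",
    "marker_B": "1011001011",
    "marker_C": "0100100100",
}

scores = {name: concordance(c17_pattern, p) for name, p in marker_patterns.items()}
best = max(scores, key=scores.get)
print(scores, "closest marker:", best)
```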
- This result is confirmed by the mapping of its related ESTs (Hs.13872 in the UniGene database) performed by others.
- This region co-localizes with human chromosome 4p15-16 in cytogenetic mapping.
- Some other genes associated with hematopoiesis are also localized in this region.
- CD38 is located between D4S412 and D4S1601, as is C17/Hs.13872.
- The AC133 antigen is located around D4S1601 to D4S1608.
- The latter is a recently discovered cell surface protein and is preferentially expressed in CD34+ HSPC (Yin et al., Blood, 90:5002-5012 (1997); Miraglia et al., Blood, 90:5013-5021 (1997)).
Abstract
The present invention relates to a human hematopoietic stem/progenitor cell (hHSPC) polypeptide (named the C17 polypeptide), as well as the DNA (and RNA) encoding this polypeptide. The invention also relates to methods of using the polynucleotides and polypeptides of the invention, including as markers in chromosome mapping, in the analysis of genetic fingerprints and of the possible role of genetic mutations in the disease process, and in the production of polyclonal sera or monoclonal antibodies specific for said polypeptides. Finally, the invention relates to a method for increasing the growth rate of hMSC using the polypeptide of the invention.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12946399P | 1999-04-15 | 1999-04-15 | |
US129463P | 1999-04-15 | ||
PCT/US2000/009904 WO2000063382A1 (fr) | 1999-04-15 | 2000-04-14 | Genes and expression products from hematopoietic cells |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1169447A1 true EP1169447A1 (fr) | 2002-01-09 |
Family
ID=22440077
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP00922147A Withdrawn EP1169447A1 (fr) | 1999-04-15 | 2000-04-14 | Genes and expression products from hematopoietic cells |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1169447A1 (fr) |
JP (1) | JP2002541850A (fr) |
AU (1) | AU4237700A (fr) |
WO (1) | WO2000063382A1 (fr) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6525174B1 (en) | 1997-06-06 | 2003-02-25 | Human Genome Sciences, Inc. | Precerebellin-like protein |
WO2003000729A2 (fr) * | 2001-06-20 | 2003-01-03 | Genentech, Inc. | Novel secreted polypeptide and methods of treating bone disorders |
EP1420816A4 (fr) * | 2001-08-08 | 2006-06-07 | Schering Corp | Uses of a mammalian cytokine and related reagents |
WO2009055613A2 (fr) * | 2007-10-26 | 2009-04-30 | Genentech, Inc. | Inhibition of urokinase-type plasminogen activator (uPA) activity |
EP3012269A1 (fr) | 2014-10-20 | 2016-04-27 | Erhard Hofer | Cytokine-like protein 1 (CYTL1) and its uses |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1039801A4 (fr) * | 1997-06-06 | 2003-03-26 | Human Genome Sciences Inc | 207 human secreted proteins |
- 2000
- 2000-04-14 JP JP2000612461A patent/JP2002541850A/ja not_active Withdrawn
- 2000-04-14 AU AU42377/00A patent/AU4237700A/en not_active Abandoned
- 2000-04-14 EP EP00922147A patent/EP1169447A1/fr not_active Withdrawn
- 2000-04-14 WO PCT/US2000/009904 patent/WO2000063382A1/fr not_active Application Discontinuation
Non-Patent Citations (1)
Title |
---|
See references of WO0063382A1 * |
Also Published As
Publication number | Publication date |
---|---|
WO2000063382A1 (fr) | 2000-10-26 |
AU4237700A (en) | 2000-11-02 |
WO2000063382A9 (fr) | 2001-10-11 |
JP2002541850A (ja) | 2002-12-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2443617C (fr) | Repeat sequences of the CA125 gene and their uses in diagnostic and therapeutic interventions | |
US5849528A (en) | Polynucleotides encoding a human S100 protein | |
US20020187472A1 (en) | Steap-related protein | |
NZ509178A (en) | Compositions and methods for therapy and diagnosis of prostate cancer | |
JP2002533056A (ja) | Compounds and methods for the treatment and diagnosis of lung cancer | |
JP2001508656A (ja) | Human MAGE-like protein | |
KR20020007348A (ko) | Compositions for the treatment and diagnosis of breast cancer and methods of using the same | |
CA2300364A1 (fr) | Prostate tumor polypeptides and antigenic compositions | |
US6913891B1 (en) | Human myeloid terminal differentiation response gene | |
WO2000059933A2 (fr) | Human mesenchymal genes and expression products | |
JP2002540789A5 (fr) | ||
US20070148686A1 (en) | Protein present at the surface of hematopoietic stem cells of the lymphoid line and of nk cells, and uses thereof | |
EP1169447A1 (fr) | Genes and expression products from hematopoietic cells | |
US20020090694A1 (en) | Human Hox C10 and polynucleotides encoding | |
EP1197554A1 (fr) | Proliferation differentiation factor | |
JP2003505028A (ja) | Splice variants of the CD40 receptor | |
JP2004527240A (ja) | Polynucleotides useful for modulating cancer cell proliferation | |
JP2001513640A (ja) | Novel human transmembrane 4 superfamily protein | |
US6309821B1 (en) | DNA encoding a PAC10 human homolog | |
JP2001517093A (ja) | Polynucleotides encoding growth factor-like proteins | |
US20030109027A1 (en) | TSLL2 gene | |
US7270980B2 (en) | Compounds for immunodiagnosis of prostate cancer and methods for their use | |
US20030054385A1 (en) | Human ubiquitin-conjugating enzymes | |
MXPA01012717A (es) | Modulated gene expression in gastrointestinal inflammation | |
US20020137166A1 (en) | ASIP-related proteins |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PUAI | Public reference made under article 153(3) EPC to a published international application that has entered the European phase | Free format text: ORIGINAL CODE: 0009012 |
| 17P | Request for examination filed | Effective date: 20011027 |
| AK | Designated contracting states | Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
| AX | Request for extension of the European patent | Free format text: AL;LT;LV;MK;RO;SI |
| 17Q | First examination report despatched | Effective date: 20041124 |
| STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
| 18D | Application deemed to be withdrawn | Effective date: 20050607 |