NZ510464A - Further pro polypeptides and sequences thereof - Google Patents
Further pro polypeptides and sequences thereofInfo
- Publication number
- NZ510464A NZ510464A NZ510464A NZ51046499A NZ510464A NZ 510464 A NZ510464 A NZ 510464A NZ 510464 A NZ510464 A NZ 510464A NZ 51046499 A NZ51046499 A NZ 51046499A NZ 510464 A NZ510464 A NZ 510464A
- Authority
- NZ
- New Zealand
- Prior art keywords
- polypeptide
- sequence identity
- sequence
- amino acid
- nucleic acid
- Prior art date
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 673
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 665
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 664
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 304
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 262
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 262
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 191
- 239000002773 nucleotide Substances 0.000 claims abstract description 142
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 142
- 206010028980 Neoplasm Diseases 0.000 claims abstract description 24
- 238000004519 manufacturing process Methods 0.000 claims abstract description 8
- 239000013598 vector Substances 0.000 claims abstract description 8
- 108020004414 DNA Proteins 0.000 claims description 506
- 210000004027 cell Anatomy 0.000 claims description 82
- 241000282414 Homo sapiens Species 0.000 claims description 44
- 238000000034 method Methods 0.000 claims description 43
- 230000014509 gene expression Effects 0.000 claims description 38
- 238000012258 culturing Methods 0.000 claims description 24
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 23
- 108091026890 Coding region Proteins 0.000 claims description 22
- 238000004113 cell culture Methods 0.000 claims description 22
- 230000008569 process Effects 0.000 claims description 20
- 239000003814 drug Substances 0.000 claims description 10
- 108060003951 Immunoglobulin Proteins 0.000 claims description 3
- 102000018358 immunoglobulin Human genes 0.000 claims description 3
- 239000008194 pharmaceutical composition Substances 0.000 claims description 3
- 239000003085 diluting agent Substances 0.000 claims description 2
- 239000000546 pharmaceutical excipient Substances 0.000 claims description 2
- 241000588724 Escherichia coli Species 0.000 claims 1
- 108091028043 Nucleic acid sequence Proteins 0.000 claims 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 claims 1
- 210000005253 yeast cell Anatomy 0.000 claims 1
- 102000053602 DNA Human genes 0.000 description 274
- 108090000623 proteins and genes Proteins 0.000 description 236
- 102000004169 proteins and genes Human genes 0.000 description 212
- 235000018102 proteins Nutrition 0.000 description 209
- 230000000295 complement effect Effects 0.000 description 169
- 125000000539 amino acid group Chemical group 0.000 description 126
- 238000012360 testing method Methods 0.000 description 103
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 85
- 235000001014 amino acid Nutrition 0.000 description 83
- 238000012512 characterization method Methods 0.000 description 77
- 239000005557 antagonist Substances 0.000 description 69
- 239000000556 agonist Substances 0.000 description 68
- 239000002299 complementary DNA Substances 0.000 description 68
- 239000012634 fragment Substances 0.000 description 65
- 238000009396 hybridization Methods 0.000 description 65
- 102000005962 receptors Human genes 0.000 description 60
- 108020003175 receptors Proteins 0.000 description 60
- 238000012216 screening Methods 0.000 description 56
- 108020004511 Recombinant DNA Proteins 0.000 description 50
- 230000006870 function Effects 0.000 description 47
- 102000003839 Human Proteins Human genes 0.000 description 44
- 108090000144 Human Proteins Proteins 0.000 description 44
- 230000027455 binding Effects 0.000 description 39
- 230000004071 biological effect Effects 0.000 description 39
- 102000035160 transmembrane proteins Human genes 0.000 description 38
- 108091005703 transmembrane proteins Proteins 0.000 description 38
- 230000015572 biosynthetic process Effects 0.000 description 26
- 102000004190 Enzymes Human genes 0.000 description 24
- 108090000790 Enzymes Proteins 0.000 description 24
- 102000035195 Peptidases Human genes 0.000 description 24
- 108091005804 Peptidases Proteins 0.000 description 24
- 229940088598 enzyme Drugs 0.000 description 24
- 210000001519 tissue Anatomy 0.000 description 22
- 230000001404 mediated effect Effects 0.000 description 20
- 239000000523 sample Substances 0.000 description 20
- 229940024606 amino acid Drugs 0.000 description 19
- 210000004901 leucine-rich repeat Anatomy 0.000 description 19
- 150000001413 amino acids Chemical class 0.000 description 18
- 230000002255 enzymatic effect Effects 0.000 description 18
- 210000000020 growth cone Anatomy 0.000 description 18
- 210000004379 membrane Anatomy 0.000 description 18
- 239000012528 membrane Substances 0.000 description 18
- 230000004850 protein–protein interaction Effects 0.000 description 17
- 230000004069 differentiation Effects 0.000 description 16
- 230000000694 effects Effects 0.000 description 16
- 239000000203 mixture Substances 0.000 description 16
- 238000012544 monitoring process Methods 0.000 description 16
- 102000004127 Cytokines Human genes 0.000 description 15
- 108090000695 Cytokines Proteins 0.000 description 15
- 238000011161 development Methods 0.000 description 14
- 230000018109 developmental process Effects 0.000 description 14
- 235000010335 lysozyme Nutrition 0.000 description 14
- 102000004555 Butyrophilins Human genes 0.000 description 13
- 108010017533 Butyrophilins Proteins 0.000 description 13
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 13
- 241000699666 Mus <mouse, genus> Species 0.000 description 13
- 102000012479 Serine Proteases Human genes 0.000 description 13
- 108010022999 Serine Proteases Proteins 0.000 description 13
- 230000013595 glycosylation Effects 0.000 description 13
- 230000012010 growth Effects 0.000 description 13
- 230000003993 interaction Effects 0.000 description 13
- 230000007246 mechanism Effects 0.000 description 13
- 210000002241 neurite Anatomy 0.000 description 13
- 238000011282 treatment Methods 0.000 description 13
- 102000003814 Interleukin-10 Human genes 0.000 description 12
- 108090000174 Interleukin-10 Proteins 0.000 description 12
- 102000013275 Somatomedins Human genes 0.000 description 12
- 230000004913 activation Effects 0.000 description 12
- 201000011510 cancer Diseases 0.000 description 12
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 12
- ADHFFUOAOLWHGU-JPDUFPOXSA-N (2s)-2-[[(2s)-4-amino-2-[[(2s,3r)-2-[[(2s)-2-[[(2s)-2-[[(2s,3s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-6-amino-2-[[(2s)-2-[[(2s)-2-[[2-[[(2s)-2-amino-3-hydroxypropanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]propanoyl]amino]hexanoyl]a Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@@H](N)CO)C(C)C)C1=CC=CC=C1 ADHFFUOAOLWHGU-JPDUFPOXSA-N 0.000 description 11
- 241000193468 Clostridium perfringens Species 0.000 description 11
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 11
- 108010052285 Membrane Proteins Proteins 0.000 description 11
- 230000000977 initiatory effect Effects 0.000 description 11
- 229930182817 methionine Natural products 0.000 description 11
- 101800001299 Cerebellin Proteins 0.000 description 10
- 102400001244 Cerebellin Human genes 0.000 description 10
- 102000003886 Glycoproteins Human genes 0.000 description 10
- 108090000288 Glycoproteins Proteins 0.000 description 10
- 102000000395 SH3 domains Human genes 0.000 description 10
- 108050008861 SH3 domains Proteins 0.000 description 10
- 108050003978 Semaphorin Proteins 0.000 description 10
- 230000031018 biological processes and functions Effects 0.000 description 10
- 210000000988 bone and bone Anatomy 0.000 description 10
- 238000006206 glycosylation reaction Methods 0.000 description 10
- 239000003112 inhibitor Substances 0.000 description 10
- 230000029663 wound healing Effects 0.000 description 10
- 239000004365 Protease Substances 0.000 description 9
- 210000003050 axon Anatomy 0.000 description 9
- 210000004556 brain Anatomy 0.000 description 9
- 238000001727 in vivo Methods 0.000 description 9
- 230000035772 mutation Effects 0.000 description 9
- 102100034870 Kallikrein-8 Human genes 0.000 description 8
- 101710176225 Kallikrein-8 Proteins 0.000 description 8
- 102000015494 Mitochondrial Uncoupling Proteins Human genes 0.000 description 8
- 108010050258 Mitochondrial Uncoupling Proteins Proteins 0.000 description 8
- 101710131039 Opsin-5 Proteins 0.000 description 8
- 102000014400 SH2 domains Human genes 0.000 description 8
- 108050003452 SH2 domains Proteins 0.000 description 8
- 102000014105 Semaphorin Human genes 0.000 description 8
- 108050003627 Wnt Proteins 0.000 description 8
- 230000001413 cellular effect Effects 0.000 description 8
- 238000006243 chemical reaction Methods 0.000 description 8
- 235000013336 milk Nutrition 0.000 description 8
- 210000004080 milk Anatomy 0.000 description 8
- 239000008267 milk Substances 0.000 description 8
- 230000037361 pathway Effects 0.000 description 8
- 108020004635 Complementary DNA Proteins 0.000 description 7
- 102000008100 Human Serum Albumin Human genes 0.000 description 7
- 108091006905 Human Serum Albumin Proteins 0.000 description 7
- 102000018697 Membrane Proteins Human genes 0.000 description 7
- 241001465754 Metazoa Species 0.000 description 7
- 108010056664 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyltransferase Proteins 0.000 description 7
- 102000006010 Protein Disulfide-Isomerase Human genes 0.000 description 7
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 7
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 7
- 210000002569 neuron Anatomy 0.000 description 7
- 108020003519 protein disulfide isomerase Proteins 0.000 description 7
- 230000019491 signal transduction Effects 0.000 description 7
- 108010006303 Carboxypeptidases Proteins 0.000 description 6
- 102000005367 Carboxypeptidases Human genes 0.000 description 6
- 101710146739 Enterotoxin Proteins 0.000 description 6
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 6
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 6
- 108060003306 Galactosyltransferase Proteins 0.000 description 6
- 101000939438 Homo sapiens Mitochondrial brown fat uncoupling protein 1 Proteins 0.000 description 6
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 6
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 6
- 108060004795 Methyltransferase Proteins 0.000 description 6
- 102000016397 Methyltransferase Human genes 0.000 description 6
- 102100023315 N-acetyllactosaminide beta-1,6-N-acetylglucosaminyl-transferase Human genes 0.000 description 6
- 102100026918 Phospholipase A2 Human genes 0.000 description 6
- 108010058864 Phospholipases A2 Proteins 0.000 description 6
- 241000700159 Rattus Species 0.000 description 6
- 102000018679 Tacrolimus Binding Proteins Human genes 0.000 description 6
- 230000002159 abnormal effect Effects 0.000 description 6
- 239000000427 antigen Substances 0.000 description 6
- 102000036639 antigens Human genes 0.000 description 6
- 108091007433 antigens Proteins 0.000 description 6
- 210000003169 central nervous system Anatomy 0.000 description 6
- 238000003745 diagnosis Methods 0.000 description 6
- 201000010099 disease Diseases 0.000 description 6
- 208000035475 disorder Diseases 0.000 description 6
- 239000000147 enterotoxin Substances 0.000 description 6
- 231100000655 enterotoxin Toxicity 0.000 description 6
- 210000002744 extracellular matrix Anatomy 0.000 description 6
- 230000004054 inflammatory process Effects 0.000 description 6
- 239000003446 ligand Substances 0.000 description 6
- 210000002540 macrophage Anatomy 0.000 description 6
- 230000004060 metabolic process Effects 0.000 description 6
- 239000002243 precursor Substances 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 230000035755 proliferation Effects 0.000 description 6
- 235000019833 protease Nutrition 0.000 description 6
- 108010035291 retinol dehydrogenase Proteins 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- 230000017423 tissue regeneration Effects 0.000 description 6
- 239000003053 toxin Substances 0.000 description 6
- 231100000765 toxin Toxicity 0.000 description 6
- 108700012359 toxins Proteins 0.000 description 6
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 5
- 108700016155 Acyl transferases Proteins 0.000 description 5
- 102100026663 All-trans-retinol dehydrogenase [NAD(+)] ADH7 Human genes 0.000 description 5
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 5
- 241000196324 Embryophyta Species 0.000 description 5
- 108010001687 Enterotoxin Receptors Proteins 0.000 description 5
- 102000000820 Enterotoxin Receptors Human genes 0.000 description 5
- 102000030902 Galactosyltransferase Human genes 0.000 description 5
- 101000934338 Homo sapiens Myeloid cell surface antigen CD33 Proteins 0.000 description 5
- 102000019298 Lipocalin Human genes 0.000 description 5
- 108050006654 Lipocalin Proteins 0.000 description 5
- 241000124008 Mammalia Species 0.000 description 5
- 241001529936 Murinae Species 0.000 description 5
- 102100025243 Myeloid cell surface antigen CD33 Human genes 0.000 description 5
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 5
- 101000890616 Sus scrofa Allograft inflammatory factor 1 Proteins 0.000 description 5
- 102000002248 Thyroxine-Binding Globulin Human genes 0.000 description 5
- 108010000259 Thyroxine-Binding Globulin Proteins 0.000 description 5
- 102000008219 Uncoupling Protein 2 Human genes 0.000 description 5
- 108010021111 Uncoupling Protein 2 Proteins 0.000 description 5
- 102000013814 Wnt Human genes 0.000 description 5
- 230000004075 alteration Effects 0.000 description 5
- 230000006907 apoptotic process Effects 0.000 description 5
- 230000003376 axonal effect Effects 0.000 description 5
- 210000003719 b-lymphocyte Anatomy 0.000 description 5
- 210000001638 cerebellum Anatomy 0.000 description 5
- 102000003675 cytokine receptors Human genes 0.000 description 5
- 108010057085 cytokine receptors Proteins 0.000 description 5
- 239000003102 growth factor Substances 0.000 description 5
- 230000003054 hormonal effect Effects 0.000 description 5
- 230000002779 inactivation Effects 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 108091022911 insulin-like growth factor binding Proteins 0.000 description 5
- 102000028416 insulin-like growth factor binding Human genes 0.000 description 5
- 210000003734 kidney Anatomy 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 108010048477 olfactomedin Proteins 0.000 description 5
- 239000000813 peptide hormone Substances 0.000 description 5
- 230000000704 physical effect Effects 0.000 description 5
- 230000003389 potentiating effect Effects 0.000 description 5
- 210000002307 prostate Anatomy 0.000 description 5
- 230000011664 signaling Effects 0.000 description 5
- -1 survival factors Substances 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 108091005660 ADAMTS1 Proteins 0.000 description 4
- 102100026423 Adhesion G protein-coupled receptor E5 Human genes 0.000 description 4
- 102000000905 Cadherin Human genes 0.000 description 4
- 108050007957 Cadherin Proteins 0.000 description 4
- 102000003846 Carbonic anhydrases Human genes 0.000 description 4
- 108090000209 Carbonic anhydrases Proteins 0.000 description 4
- 208000005623 Carcinogenesis Diseases 0.000 description 4
- 102000000844 Cell Surface Receptors Human genes 0.000 description 4
- 108010001857 Cell Surface Receptors Proteins 0.000 description 4
- 108010035532 Collagen Proteins 0.000 description 4
- 102000008186 Collagen Human genes 0.000 description 4
- 206010009944 Colon cancer Diseases 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- 108010092364 Glucuronosyltransferase Proteins 0.000 description 4
- 102000016354 Glucuronosyltransferase Human genes 0.000 description 4
- 101000718243 Homo sapiens Adhesion G protein-coupled receptor E5 Proteins 0.000 description 4
- 206010061218 Inflammation Diseases 0.000 description 4
- 101710201625 Leucine-rich protein Proteins 0.000 description 4
- 102100033468 Lysozyme C Human genes 0.000 description 4
- 108010054377 Mannosidases Proteins 0.000 description 4
- 102000001696 Mannosidases Human genes 0.000 description 4
- 102100029820 Mitochondrial brown fat uncoupling protein 1 Human genes 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 4
- 208000028389 Nerve injury Diseases 0.000 description 4
- 208000012902 Nervous system disease Diseases 0.000 description 4
- 208000025966 Neurological disease Diseases 0.000 description 4
- 108090000189 Neuropeptides Proteins 0.000 description 4
- 102000004316 Oxidoreductases Human genes 0.000 description 4
- 108090000854 Oxidoreductases Proteins 0.000 description 4
- 102000016611 Proteoglycans Human genes 0.000 description 4
- 108010067787 Proteoglycans Proteins 0.000 description 4
- 101710145027 Putative protein 2 Proteins 0.000 description 4
- 102000003838 Sialyltransferases Human genes 0.000 description 4
- 108090000141 Sialyltransferases Proteins 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- FPIPGXGPPPQFEQ-OVSJKPMPSA-N all-trans-retinol Chemical compound OC\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-OVSJKPMPSA-N 0.000 description 4
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 4
- 230000008827 biological function Effects 0.000 description 4
- 230000033228 biological regulation Effects 0.000 description 4
- 230000036952 cancer formation Effects 0.000 description 4
- 231100000504 carcinogenesis Toxicity 0.000 description 4
- 210000000845 cartilage Anatomy 0.000 description 4
- 210000005056 cell body Anatomy 0.000 description 4
- 230000024245 cell differentiation Effects 0.000 description 4
- 210000000170 cell membrane Anatomy 0.000 description 4
- 230000012292 cell migration Effects 0.000 description 4
- 229920001436 collagen Polymers 0.000 description 4
- 210000001787 dendrite Anatomy 0.000 description 4
- 229940079593 drug Drugs 0.000 description 4
- 239000003937 drug carrier Substances 0.000 description 4
- 235000019197 fats Nutrition 0.000 description 4
- 239000008103 glucose Substances 0.000 description 4
- 210000000987 immune system Anatomy 0.000 description 4
- 230000002401 inhibitory effect Effects 0.000 description 4
- 230000006651 lactation Effects 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 235000021243 milk fat Nutrition 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 210000001616 monocyte Anatomy 0.000 description 4
- 230000004899 motility Effects 0.000 description 4
- 230000008764 nerve damage Effects 0.000 description 4
- 210000000653 nervous system Anatomy 0.000 description 4
- 230000004770 neurodegeneration Effects 0.000 description 4
- 208000015122 neurodegenerative disease Diseases 0.000 description 4
- 102000052563 odorant-binding protein Human genes 0.000 description 4
- 108010000645 odorant-binding protein Proteins 0.000 description 4
- 239000008177 pharmaceutical agent Substances 0.000 description 4
- 230000028327 secretion Effects 0.000 description 4
- 210000000130 stem cell Anatomy 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 235000000346 sugar Nutrition 0.000 description 4
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 208000031261 Acute myeloid leukaemia Diseases 0.000 description 3
- 208000024827 Alzheimer disease Diseases 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 102000014914 Carrier Proteins Human genes 0.000 description 3
- 102100035244 Cerebellin-1 Human genes 0.000 description 3
- 101710135505 Cerebellin-1 Proteins 0.000 description 3
- 102100038423 Claudin-3 Human genes 0.000 description 3
- 108090000599 Claudin-3 Proteins 0.000 description 3
- 241000193403 Clostridium Species 0.000 description 3
- 108010015742 Cytochrome P-450 Enzyme System Proteins 0.000 description 3
- 102000003849 Cytochrome P450 Human genes 0.000 description 3
- 108050001049 Extracellular proteins Proteins 0.000 description 3
- 108091011114 FK506 binding proteins Proteins 0.000 description 3
- 206010016952 Food poisoning Diseases 0.000 description 3
- 208000019331 Foodborne disease Diseases 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 102000000521 Immunophilins Human genes 0.000 description 3
- 108010016648 Immunophilins Proteins 0.000 description 3
- 102000012750 Membrane Glycoproteins Human genes 0.000 description 3
- 108010090054 Membrane Glycoproteins Proteins 0.000 description 3
- 101100108446 Mus musculus Aifm3 gene Proteins 0.000 description 3
- 102100030741 Myelin protein P0 Human genes 0.000 description 3
- 208000033776 Myeloid Acute Leukemia Diseases 0.000 description 3
- 208000018737 Parkinson disease Diseases 0.000 description 3
- 208000037273 Pathologic Processes Diseases 0.000 description 3
- 101100362691 Prochlorococcus marinus (strain SARG / CCMP1375 / SS120) rpsP gene Proteins 0.000 description 3
- 108090000412 Protein-Tyrosine Kinases Proteins 0.000 description 3
- 102000004022 Protein-Tyrosine Kinases Human genes 0.000 description 3
- 108010036039 Serrate-Jagged Proteins Proteins 0.000 description 3
- 102000011842 Serrate-Jagged Proteins Human genes 0.000 description 3
- QJJXYPPXXYFBGM-LFZNUXCKSA-N Tacrolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1\C=C(/C)[C@@H]1[C@H](C)[C@@H](O)CC(=O)[C@H](CC=C)/C=C(C)/C[C@H](C)C[C@H](OC)[C@H]([C@H](C[C@H]2C)OC)O[C@@]2(O)C(=O)C(=O)N2CCCC[C@H]2C(=O)O1 QJJXYPPXXYFBGM-LFZNUXCKSA-N 0.000 description 3
- 108010027179 Tacrolimus Binding Proteins Proteins 0.000 description 3
- 108700031126 Tetraspanins Proteins 0.000 description 3
- 102000043977 Tetraspanins Human genes 0.000 description 3
- LFTYTUAZOPRMMI-UHFFFAOYSA-N UNPD164450 Natural products O1C(CO)C(O)C(O)C(NC(=O)C)C1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-UHFFFAOYSA-N 0.000 description 3
- 102000008200 Uncoupling Protein 3 Human genes 0.000 description 3
- 108010021098 Uncoupling Protein 3 Proteins 0.000 description 3
- 241000269370 Xenopus <genus> Species 0.000 description 3
- 102000045404 acyltransferase activity proteins Human genes 0.000 description 3
- 108700014220 acyltransferase activity proteins Proteins 0.000 description 3
- 230000002411 adverse Effects 0.000 description 3
- 239000003098 androgen Substances 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 230000020411 cell activation Effects 0.000 description 3
- 230000003915 cell function Effects 0.000 description 3
- 230000010261 cell growth Effects 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 230000000723 chemosensory effect Effects 0.000 description 3
- 210000004081 cilia Anatomy 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 235000013365 dairy product Nutrition 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 239000000032 diagnostic agent Substances 0.000 description 3
- 229940039227 diagnostic agent Drugs 0.000 description 3
- 230000029087 digestion Effects 0.000 description 3
- 230000008030 elimination Effects 0.000 description 3
- 238000003379 elimination reaction Methods 0.000 description 3
- 210000002257 embryonic structure Anatomy 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 239000003205 fragrance Substances 0.000 description 3
- 210000003714 granulocyte Anatomy 0.000 description 3
- 210000003128 head Anatomy 0.000 description 3
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 3
- 230000001506 immunosuppresive effect Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 210000000265 leukocyte Anatomy 0.000 description 3
- 150000004668 long chain fatty acids Chemical class 0.000 description 3
- 210000004698 lymphocyte Anatomy 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 3
- 108010009689 mannosyl-oligosaccharide 1,2-alpha-mannosidase Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 208000030159 metabolic disease Diseases 0.000 description 3
- 102000003819 neurexophilin Human genes 0.000 description 3
- 108090000126 neurexophilin Proteins 0.000 description 3
- 230000007472 neurodevelopment Effects 0.000 description 3
- 210000004498 neuroglial cell Anatomy 0.000 description 3
- 229920001542 oligosaccharide Polymers 0.000 description 3
- 150000002482 oligosaccharides Chemical class 0.000 description 3
- 230000005868 ontogenesis Effects 0.000 description 3
- 210000000496 pancreas Anatomy 0.000 description 3
- 230000009054 pathological process Effects 0.000 description 3
- 230000001737 promoting effect Effects 0.000 description 3
- 230000013777 protein digestion Effects 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 230000002207 retinal effect Effects 0.000 description 3
- 230000007727 signaling mechanism Effects 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- QJJXYPPXXYFBGM-SHYZHZOCSA-N tacrolimus Natural products CO[C@H]1C[C@H](CC[C@@H]1O)C=C(C)[C@H]2OC(=O)[C@H]3CCCCN3C(=O)C(=O)[C@@]4(O)O[C@@H]([C@H](C[C@H]4C)OC)[C@@H](C[C@H](C)CC(=C[C@@H](CC=C)C(=O)C[C@H](O)[C@H]2C)C)OC QJJXYPPXXYFBGM-SHYZHZOCSA-N 0.000 description 3
- 229940124597 therapeutic agent Drugs 0.000 description 3
- 230000001225 therapeutic effect Effects 0.000 description 3
- 238000002560 therapeutic procedure Methods 0.000 description 3
- XUIIKFGFIJCVMT-UHFFFAOYSA-N thyroxine-binding globulin Natural products IC1=CC(CC([NH3+])C([O-])=O)=CC(I)=C1OC1=CC(I)=C(O)C(I)=C1 XUIIKFGFIJCVMT-UHFFFAOYSA-N 0.000 description 3
- 238000010361 transduction Methods 0.000 description 3
- 230000026683 transduction Effects 0.000 description 3
- 230000032258 transport Effects 0.000 description 3
- 210000003501 vero cell Anatomy 0.000 description 3
- 239000000304 virulence factor Substances 0.000 description 3
- 230000007923 virulence factor Effects 0.000 description 3
- AZQWKYJCGOJGHM-UHFFFAOYSA-N 1,4-benzoquinone Chemical compound O=C1C=CC(=O)C=C1 AZQWKYJCGOJGHM-UHFFFAOYSA-N 0.000 description 2
- WRGQSWVCFNIUNZ-GDCKJWNLSA-N 1-oleoyl-sn-glycerol 3-phosphate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC[C@@H](O)COP(O)(O)=O WRGQSWVCFNIUNZ-GDCKJWNLSA-N 0.000 description 2
- FPIPGXGPPPQFEQ-UHFFFAOYSA-N 13-cis retinol Natural products OCC=C(C)C=CC=C(C)C=CC1=C(C)CCCC1(C)C FPIPGXGPPPQFEQ-UHFFFAOYSA-N 0.000 description 2
- 102000029791 ADAM Human genes 0.000 description 2
- 108091022885 ADAM Proteins 0.000 description 2
- 102000051388 ADAMTS1 Human genes 0.000 description 2
- 102000057234 Acyl transferases Human genes 0.000 description 2
- 108091010877 Allograft inflammatory factor 1 Proteins 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 208000001593 Bernard-Soulier syndrome Diseases 0.000 description 2
- BPYKTIZUTYGOLE-IFADSCNNSA-N Bilirubin Chemical compound N1C(=O)C(C)=C(C=C)\C1=C\C1=C(C)C(CCC(O)=O)=C(CC2=C(C(C)=C(\C=C/3C(=C(C=C)C(=O)N\3)C)N2)CCC(O)=O)N1 BPYKTIZUTYGOLE-IFADSCNNSA-N 0.000 description 2
- 102000004631 Calcineurin Human genes 0.000 description 2
- CURLTUGMZLYLDI-UHFFFAOYSA-N Carbon dioxide Chemical compound O=C=O CURLTUGMZLYLDI-UHFFFAOYSA-N 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 102000016289 Cell Adhesion Molecules Human genes 0.000 description 2
- 108010067225 Cell Adhesion Molecules Proteins 0.000 description 2
- 102100031667 Cell adhesion molecule-related/down-regulated by oncogenes Human genes 0.000 description 2
- 101800004542 Chondromodulin-1 Proteins 0.000 description 2
- 102400000676 Chondromodulin-1 Human genes 0.000 description 2
- 102000002029 Claudin Human genes 0.000 description 2
- 108050009302 Claudin Proteins 0.000 description 2
- 208000001333 Colorectal Neoplasms Diseases 0.000 description 2
- 208000028702 Congenital thrombocyte disease Diseases 0.000 description 2
- 229930105110 Cyclosporin A Natural products 0.000 description 2
- PMATZTZNYRCHOR-CGLBZJNRSA-N Cyclosporin A Chemical compound CC[C@@H]1NC(=O)[C@H]([C@H](O)[C@H](C)C\C=C\C)N(C)C(=O)[C@H](C(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)N(C)C(=O)CN(C)C1=O PMATZTZNYRCHOR-CGLBZJNRSA-N 0.000 description 2
- 108010036949 Cyclosporine Proteins 0.000 description 2
- 108010039724 Cysteine dioxygenase Proteins 0.000 description 2
- 102100035784 Decorin Human genes 0.000 description 2
- 108090000738 Decorin Proteins 0.000 description 2
- 102100030074 Dickkopf-related protein 1 Human genes 0.000 description 2
- 101710099518 Dickkopf-related protein 1 Proteins 0.000 description 2
- 101100478055 Dictyostelium discoideum cotC gene Proteins 0.000 description 2
- 102100021771 Endoplasmic reticulum mannosyl-oligosaccharide 1,2-alpha-mannosidase Human genes 0.000 description 2
- 102000006395 Globulins Human genes 0.000 description 2
- 108010044091 Globulins Proteins 0.000 description 2
- 108700023372 Glycosyltransferases Proteins 0.000 description 2
- 102000001895 Gonadotropin Receptors Human genes 0.000 description 2
- 108010040490 Gonadotropin Receptors Proteins 0.000 description 2
- 102000001398 Granzyme Human genes 0.000 description 2
- 108050003624 Granzyme M Proteins 0.000 description 2
- 101000998953 Homo sapiens Immunoglobulin heavy variable 1-2 Proteins 0.000 description 2
- 101001000631 Homo sapiens Peripheral myelin protein 22 Proteins 0.000 description 2
- 101001082860 Homo sapiens Peroxisomal membrane protein 2 Proteins 0.000 description 2
- 101000964562 Homo sapiens Zinc finger FYVE domain-containing protein 9 Proteins 0.000 description 2
- 206010020751 Hypersensitivity Diseases 0.000 description 2
- 102100036887 Immunoglobulin heavy variable 1-2 Human genes 0.000 description 2
- 102000016844 Immunoglobulin-like domains Human genes 0.000 description 2
- 108050006430 Immunoglobulin-like domains Proteins 0.000 description 2
- 102100027612 Kallikrein-11 Human genes 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- 108010006444 Leucine-Rich Repeat Proteins Proteins 0.000 description 2
- 102100022170 Leucine-rich repeats and immunoglobulin-like domains protein 1 Human genes 0.000 description 2
- 101710180792 Leucine-rich repeats and immunoglobulin-like domains protein 1 Proteins 0.000 description 2
- 102100040448 Leukocyte cell-derived chemotaxin 1 Human genes 0.000 description 2
- 101710125682 Leukocyte cell-derived chemotaxin 1 Proteins 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 102000008791 Lysozyme C Human genes 0.000 description 2
- 108050000633 Lysozyme C Proteins 0.000 description 2
- 208000001089 Multiple system atrophy Diseases 0.000 description 2
- 101001091370 Mus musculus Kallikrein-8 Proteins 0.000 description 2
- 101001086281 Mus musculus Opsin-5 Proteins 0.000 description 2
- 241000699670 Mus sp. Species 0.000 description 2
- 108010083674 Myelin Proteins Proteins 0.000 description 2
- 102000006386 Myelin Proteins Human genes 0.000 description 2
- 108010046068 N-Acetyllactosamine Synthase Proteins 0.000 description 2
- MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 description 2
- MNLRQHMNZILYPY-MDMHTWEWSA-N N-acetyl-alpha-D-muramic acid Chemical compound OC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)O[C@H](O)[C@@H]1NC(C)=O MNLRQHMNZILYPY-MDMHTWEWSA-N 0.000 description 2
- 208000008589 Obesity Diseases 0.000 description 2
- 108700020796 Oncogene Proteins 0.000 description 2
- 108010064983 Ovomucin Proteins 0.000 description 2
- 102100035917 Peripheral myelin protein 22 Human genes 0.000 description 2
- 101000844718 Rattus norvegicus Deleted in malignant brain tumors 1 protein Proteins 0.000 description 2
- 108091007187 Reductases Proteins 0.000 description 2
- 101710141795 Ribonuclease inhibitor Proteins 0.000 description 2
- 102100037968 Ribonuclease inhibitor Human genes 0.000 description 2
- 108091058545 Secretory proteins Proteins 0.000 description 2
- 102000040739 Secretory proteins Human genes 0.000 description 2
- 102000002938 Thrombospondin Human genes 0.000 description 2
- 108060008245 Thrombospondin Proteins 0.000 description 2
- AUYYCJSJGJYCDS-LBPRGKRZSA-N Thyrolar Chemical class IC1=CC(C[C@H](N)C(O)=O)=CC(I)=C1OC1=CC=C(O)C(I)=C1 AUYYCJSJGJYCDS-LBPRGKRZSA-N 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- 101710152431 Trypsin-like protease Proteins 0.000 description 2
- 108091005906 Type I transmembrane proteins Proteins 0.000 description 2
- 108091005956 Type II transmembrane proteins Proteins 0.000 description 2
- LFTYTUAZOPRMMI-CFRASDGPSA-N UDP-N-acetyl-alpha-D-glucosamine Chemical compound O1[C@H](CO)[C@@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-CFRASDGPSA-N 0.000 description 2
- 108010093894 Xanthine oxidase Proteins 0.000 description 2
- 102100033220 Xanthine oxidase Human genes 0.000 description 2
- 102100040801 Zinc finger FYVE domain-containing protein 9 Human genes 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 230000010933 acylation Effects 0.000 description 2
- 238000005917 acylation reaction Methods 0.000 description 2
- 210000003486 adipose tissue brown Anatomy 0.000 description 2
- SHGAZHPCJJPHSC-YCNIQYBTSA-N all-trans-retinoic acid Chemical compound OC(=O)\C=C(/C)\C=C\C=C(/C)\C=C\C1=C(C)CCCC1(C)C SHGAZHPCJJPHSC-YCNIQYBTSA-N 0.000 description 2
- 230000007815 allergy Effects 0.000 description 2
- AWUCVROLDVIAJX-UHFFFAOYSA-N alpha-glycerophosphate Natural products OCC(O)COP(O)(O)=O AWUCVROLDVIAJX-UHFFFAOYSA-N 0.000 description 2
- 230000000202 analgesic effect Effects 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000000845 anti-microbial effect Effects 0.000 description 2
- 230000003502 anti-nociceptive effect Effects 0.000 description 2
- 239000004599 antimicrobial Substances 0.000 description 2
- 239000003963 antioxidant agent Substances 0.000 description 2
- 230000003078 antioxidant effect Effects 0.000 description 2
- 229940114079 arachidonic acid Drugs 0.000 description 2
- 235000021342 arachidonic acid Nutrition 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 2
- 210000001593 brown adipocyte Anatomy 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000021164 cell adhesion Effects 0.000 description 2
- 230000034303 cell budding Effects 0.000 description 2
- 230000004663 cell proliferation Effects 0.000 description 2
- 210000002421 cell wall Anatomy 0.000 description 2
- 210000003756 cervix mucus Anatomy 0.000 description 2
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 229960001265 ciclosporin Drugs 0.000 description 2
- 230000004186 co-expression Effects 0.000 description 2
- 235000017471 coenzyme Q10 Nutrition 0.000 description 2
- ACTIUHUUMQJHFO-UPTCCGCDSA-N coenzyme Q10 Chemical compound COC1=C(OC)C(=O)C(C\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CC\C=C(/C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UPTCCGCDSA-N 0.000 description 2
- 208000029742 colonic neoplasm Diseases 0.000 description 2
- 210000002808 connective tissue Anatomy 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 101150065962 csg gene Proteins 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 230000016396 cytokine production Effects 0.000 description 2
- 230000001086 cytosolic effect Effects 0.000 description 2
- 239000003145 cytotoxic factor Substances 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 208000037765 diseases and disorders Diseases 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 210000004212 embryonic organizer Anatomy 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 210000000981 epithelium Anatomy 0.000 description 2
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 description 2
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 2
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 2
- 239000010931 gold Substances 0.000 description 2
- 229910052737 gold Inorganic materials 0.000 description 2
- 208000031169 hemorrhagic disease Diseases 0.000 description 2
- 230000000971 hippocampal effect Effects 0.000 description 2
- 210000003630 histaminocyte Anatomy 0.000 description 2
- 229940088597 hormone Drugs 0.000 description 2
- 239000005556 hormone Substances 0.000 description 2
- 102000053531 human UCP1 Human genes 0.000 description 2
- 210000003917 human chromosome Anatomy 0.000 description 2
- 238000005805 hydroxylation reaction Methods 0.000 description 2
- 230000036737 immune function Effects 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 230000036039 immunity Effects 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 230000001861 immunosuppressant effect Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000002757 inflammatory effect Effects 0.000 description 2
- 230000003914 insulin secretion Effects 0.000 description 2
- 102000006495 integrins Human genes 0.000 description 2
- 108010044426 integrins Proteins 0.000 description 2
- 230000008611 intercellular interaction Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 208000017169 kidney disease Diseases 0.000 description 2
- 210000000982 limb bud Anatomy 0.000 description 2
- SQEHCNOBYLQFTG-UHFFFAOYSA-M lithium;thiophene-2-carboxylate Chemical compound [Li+].[O-]C(=O)C1=CC=CS1 SQEHCNOBYLQFTG-UHFFFAOYSA-M 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 230000035800 maturation Effects 0.000 description 2
- 230000003818 metabolic dysfunction Effects 0.000 description 2
- 230000007102 metabolic function Effects 0.000 description 2
- 230000037323 metabolic rate Effects 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 210000001700 mitochondrial membrane Anatomy 0.000 description 2
- 239000003226 mitogen Substances 0.000 description 2
- 230000021332 multicellular organism growth Effects 0.000 description 2
- 210000000663 muscle cell Anatomy 0.000 description 2
- 210000005012 myelin Anatomy 0.000 description 2
- 210000000822 natural killer cell Anatomy 0.000 description 2
- 230000001537 neural effect Effects 0.000 description 2
- 235000020824 obesity Nutrition 0.000 description 2
- 230000035699 permeability Effects 0.000 description 2
- 150000003904 phospholipids Chemical class 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 230000001766 physiological effect Effects 0.000 description 2
- 230000035790 physiological processes and functions Effects 0.000 description 2
- 230000035935 pregnancy Effects 0.000 description 2
- 230000002062 proliferating effect Effects 0.000 description 2
- 235000019419 proteases Nutrition 0.000 description 2
- 239000012268 protein inhibitor Substances 0.000 description 2
- 229940121649 protein inhibitor Drugs 0.000 description 2
- 230000006916 protein interaction Effects 0.000 description 2
- ZAHRKKWIAAJSAO-UHFFFAOYSA-N rapamycin Natural products COCC(O)C(=C/C(C)C(=O)CC(OC(=O)C1CCCCN1C(=O)C(=O)C2(O)OC(CC(OC)C(=CC=CC=CC(C)CC(C)C(=O)C)C)CCC2C)C(C)CC3CCC(O)C(C3)OC)C ZAHRKKWIAAJSAO-UHFFFAOYSA-N 0.000 description 2
- 239000003642 reactive oxygen metabolite Substances 0.000 description 2
- 108091008598 receptor tyrosine kinases Proteins 0.000 description 2
- 102000027426 receptor tyrosine kinases Human genes 0.000 description 2
- 230000008929 regeneration Effects 0.000 description 2
- 238000011069 regeneration method Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 229930002330 retinoic acid Natural products 0.000 description 2
- 229960003471 retinol Drugs 0.000 description 2
- 235000020944 retinol Nutrition 0.000 description 2
- 239000011607 retinol Substances 0.000 description 2
- 210000003296 saliva Anatomy 0.000 description 2
- 230000037390 scarring Effects 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 2
- 108091006024 signal transducing proteins Proteins 0.000 description 2
- 102000034285 signal transducing proteins Human genes 0.000 description 2
- 229960002930 sirolimus Drugs 0.000 description 2
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 2
- 210000003491 skin Anatomy 0.000 description 2
- 210000000813 small intestine Anatomy 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 230000004936 stimulating effect Effects 0.000 description 2
- 230000000638 stimulation Effects 0.000 description 2
- 150000008163 sugars Chemical group 0.000 description 2
- 230000004083 survival effect Effects 0.000 description 2
- 230000002889 sympathetic effect Effects 0.000 description 2
- 210000000225 synapse Anatomy 0.000 description 2
- 230000000946 synaptic effect Effects 0.000 description 2
- 210000001138 tear Anatomy 0.000 description 2
- 239000005495 thyroid hormone Substances 0.000 description 2
- 229940036555 thyroid hormone Drugs 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 229960001727 tretinoin Drugs 0.000 description 2
- 239000012588 trypsin Substances 0.000 description 2
- 230000001810 trypsinlike Effects 0.000 description 2
- 210000004881 tumor cell Anatomy 0.000 description 2
- 235000019143 vitamin K2 Nutrition 0.000 description 2
- 239000011728 vitamin K2 Substances 0.000 description 2
- ODIGIKRIUKFKHP-UHFFFAOYSA-N (n-propan-2-yloxycarbonylanilino) acetate Chemical group CC(C)OC(=O)N(OC(C)=O)C1=CC=CC=C1 ODIGIKRIUKFKHP-UHFFFAOYSA-N 0.000 description 1
- HYPYXGZDOYTYDR-HAJWAVTHSA-N 2-methyl-3-[(2e,6e,10e,14e)-3,7,11,15,19-pentamethylicosa-2,6,10,14,18-pentaenyl]naphthalene-1,4-dione Chemical compound C1=CC=C2C(=O)C(C/C=C(C)/CC/C=C(C)/CC/C=C(C)/CC/C=C(C)/CCC=C(C)C)=C(C)C(=O)C2=C1 HYPYXGZDOYTYDR-HAJWAVTHSA-N 0.000 description 1
- 108091071337 20 family Proteins 0.000 description 1
- 108700001666 APC Genes Proteins 0.000 description 1
- 206010000234 Abortion spontaneous Diseases 0.000 description 1
- 208000002874 Acne Vulgaris Diseases 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 208000024893 Acute lymphoblastic leukemia Diseases 0.000 description 1
- 102000035485 Allograft inflammatory factor 1 Human genes 0.000 description 1
- 102100040121 Allograft inflammatory factor 1 Human genes 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 102100030009 Azurocidin Human genes 0.000 description 1
- 101710154607 Azurocidin Proteins 0.000 description 1
- 208000032791 BCR-ABL1 positive chronic myelogenous leukemia Diseases 0.000 description 1
- 206010004446 Benign prostatic hyperplasia Diseases 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 229940124294 CD33 monoclonal antibody Drugs 0.000 description 1
- 102100025222 CD63 antigen Human genes 0.000 description 1
- 102100027221 CD81 antigen Human genes 0.000 description 1
- 102100027217 CD82 antigen Human genes 0.000 description 1
- 102100037904 CD9 antigen Human genes 0.000 description 1
- 206010006895 Cachexia Diseases 0.000 description 1
- 241000244203 Caenorhabditis elegans Species 0.000 description 1
- 101100264044 Caenorhabditis elegans cwn-2 gene Proteins 0.000 description 1
- 108010042955 Calcineurin Proteins 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 101710179555 Carboxypeptidase 2 Proteins 0.000 description 1
- 108090000087 Carboxypeptidase B Proteins 0.000 description 1
- 102000003670 Carboxypeptidase B Human genes 0.000 description 1
- 102100032378 Carboxypeptidase E Human genes 0.000 description 1
- 108010058255 Carboxypeptidase H Proteins 0.000 description 1
- 108090000007 Carboxypeptidase M Proteins 0.000 description 1
- 102100032936 Carboxypeptidase M Human genes 0.000 description 1
- 108010080937 Carboxypeptidases A Proteins 0.000 description 1
- 102000000496 Carboxypeptidases A Human genes 0.000 description 1
- 206010008025 Cerebellar ataxia Diseases 0.000 description 1
- 208000015879 Cerebellar disease Diseases 0.000 description 1
- 101710163331 Chaoptin Proteins 0.000 description 1
- 208000010693 Charcot-Marie-Tooth Disease Diseases 0.000 description 1
- 108090000601 Claudin-4 Proteins 0.000 description 1
- 102100038447 Claudin-4 Human genes 0.000 description 1
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 1
- 101710201511 Coagulation factor X-activating enzyme heavy chain Proteins 0.000 description 1
- ACTIUHUUMQJHFO-UHFFFAOYSA-N Coenzym Q10 Natural products COC1=C(OC)C(=O)C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)=C(C)C1=O ACTIUHUUMQJHFO-UHFFFAOYSA-N 0.000 description 1
- 108010071942 Colony-Stimulating Factors Proteins 0.000 description 1
- 102000007644 Colony-Stimulating Factors Human genes 0.000 description 1
- 108010002947 Connectin Proteins 0.000 description 1
- 208000001819 Crigler-Najjar Syndrome Diseases 0.000 description 1
- 101710093674 Cyclic nucleotide-gated cation channel beta-1 Proteins 0.000 description 1
- 102000001493 Cyclophilins Human genes 0.000 description 1
- 108010068682 Cyclophilins Proteins 0.000 description 1
- 102000015833 Cystatin Human genes 0.000 description 1
- 102100028007 Cystatin-SA Human genes 0.000 description 1
- 101710144510 Cysteine proteinase inhibitor Proteins 0.000 description 1
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 1
- 108090000365 Cytochrome-c oxidases Proteins 0.000 description 1
- AEMOLEFTQBMNLQ-AQKNRBDQSA-N D-glucopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-AQKNRBDQSA-N 0.000 description 1
- 101100317380 Danio rerio wnt4a gene Proteins 0.000 description 1
- 108700021041 Disintegrin Proteins 0.000 description 1
- 101800001224 Disintegrin Proteins 0.000 description 1
- 108091008794 FGF receptors Proteins 0.000 description 1
- 208000001308 Fasciculation Diseases 0.000 description 1
- 102000044168 Fibroblast Growth Factor Receptor Human genes 0.000 description 1
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 1
- 241000287828 Gallus gallus Species 0.000 description 1
- 101710202596 Gap junction beta-1 protein Proteins 0.000 description 1
- 208000009139 Gilbert Disease Diseases 0.000 description 1
- 208000022412 Gilbert syndrome Diseases 0.000 description 1
- 206010018338 Glioma Diseases 0.000 description 1
- 206010018429 Glucose tolerance impaired Diseases 0.000 description 1
- JZNWSCPGTDBMEW-UHFFFAOYSA-N Glycerophosphorylethanolamin Natural products NCCOP(O)(=O)OCC(O)CO JZNWSCPGTDBMEW-UHFFFAOYSA-N 0.000 description 1
- 108010009202 Growth Factor Receptors Proteins 0.000 description 1
- 102000009465 Growth Factor Receptors Human genes 0.000 description 1
- 206010019315 Heart transplant rejection Diseases 0.000 description 1
- 102100031000 Hepatoma-derived growth factor Human genes 0.000 description 1
- 206010069382 Hereditary neuropathy with liability to pressure palsies Diseases 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000934368 Homo sapiens CD63 antigen Proteins 0.000 description 1
- 101000914479 Homo sapiens CD81 antigen Proteins 0.000 description 1
- 101000914469 Homo sapiens CD82 antigen Proteins 0.000 description 1
- 101000738354 Homo sapiens CD9 antigen Proteins 0.000 description 1
- 101000998774 Homo sapiens Insulin-like peptide INSL5 Proteins 0.000 description 1
- 101001033233 Homo sapiens Interleukin-10 Proteins 0.000 description 1
- 101001003149 Homo sapiens Interleukin-10 receptor subunit beta Proteins 0.000 description 1
- 101001091385 Homo sapiens Kallikrein-6 Proteins 0.000 description 1
- 101000777628 Homo sapiens Leukocyte antigen CD37 Proteins 0.000 description 1
- 101000980823 Homo sapiens Leukocyte surface antigen CD53 Proteins 0.000 description 1
- 101001098256 Homo sapiens Lysophospholipase Proteins 0.000 description 1
- 101000777117 Homo sapiens Mitochondrial uncoupling protein 4 Proteins 0.000 description 1
- 101000635944 Homo sapiens Myelin protein P0 Proteins 0.000 description 1
- 101000983077 Homo sapiens Phospholipase A2 Proteins 0.000 description 1
- 101001096022 Homo sapiens Phospholipase B1, membrane-associated Proteins 0.000 description 1
- 101001135375 Homo sapiens Protein PET117 homolog, mitochondrial Proteins 0.000 description 1
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 101000963987 Homo sapiens SH3 domain-binding protein 5 Proteins 0.000 description 1
- 101000863880 Homo sapiens Sialic acid-binding Ig-like lectin 6 Proteins 0.000 description 1
- 101100539367 Homo sapiens UCP1 gene Proteins 0.000 description 1
- 108010003272 Hyaluronate lyase Proteins 0.000 description 1
- 102000001974 Hyaluronidases Human genes 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- 206010060378 Hyperinsulinaemia Diseases 0.000 description 1
- 206010020772 Hypertension Diseases 0.000 description 1
- 206010062016 Immunosuppression Diseases 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 102000004218 Insulin-Like Growth Factor I Human genes 0.000 description 1
- 108090001117 Insulin-Like Growth Factor II Proteins 0.000 description 1
- 102000048143 Insulin-Like Growth Factor II Human genes 0.000 description 1
- 102100033266 Insulin-like peptide INSL5 Human genes 0.000 description 1
- 102000001617 Interferon Receptors Human genes 0.000 description 1
- 108010054267 Interferon Receptors Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 108010002352 Interleukin-1 Proteins 0.000 description 1
- 108010017550 Interleukin-10 Receptors Proteins 0.000 description 1
- 102000004551 Interleukin-10 Receptors Human genes 0.000 description 1
- 102100020788 Interleukin-10 receptor subunit beta Human genes 0.000 description 1
- 102000004388 Interleukin-4 Human genes 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 102000015696 Interleukins Human genes 0.000 description 1
- 108010063738 Interleukins Proteins 0.000 description 1
- 102100034866 Kallikrein-6 Human genes 0.000 description 1
- 101000945995 Latrodectus hasselti Alpha-latrotoxin-Lh1a Proteins 0.000 description 1
- 101000945994 Latrodectus hesperus Alpha-latrotoxin-Lhe1a Proteins 0.000 description 1
- 101000945997 Latrodectus mactans Alpha-latrotoxin-Lm1a Proteins 0.000 description 1
- 101000945996 Latrodectus tredecimguttatus Alpha-latrotoxin-Lt1a Proteins 0.000 description 1
- 102000016267 Leptin Human genes 0.000 description 1
- 108010092277 Leptin Proteins 0.000 description 1
- 102100031586 Leukocyte antigen CD37 Human genes 0.000 description 1
- 108050007732 Leukocyte cell-derived chemotaxin 2 Proteins 0.000 description 1
- 102100024221 Leukocyte surface antigen CD53 Human genes 0.000 description 1
- 102100023113 Long-chain fatty acid transport protein 4 Human genes 0.000 description 1
- 108010011449 Long-chain-fatty-acid-CoA ligase Proteins 0.000 description 1
- 102100025069 MARVEL domain-containing protein 1 Human genes 0.000 description 1
- 102100021760 Magnesium transporter protein 1 Human genes 0.000 description 1
- 108010000684 Matrix Metalloproteinases Proteins 0.000 description 1
- 102000002274 Matrix Metalloproteinases Human genes 0.000 description 1
- 102000005741 Metalloproteases Human genes 0.000 description 1
- 108010006035 Metalloproteases Proteins 0.000 description 1
- 206010027476 Metastases Diseases 0.000 description 1
- 102100040200 Mitochondrial uncoupling protein 2 Human genes 0.000 description 1
- 101710112393 Mitochondrial uncoupling protein 2 Proteins 0.000 description 1
- 108010074633 Mixed Function Oxygenases Proteins 0.000 description 1
- 102000008109 Mixed Function Oxygenases Human genes 0.000 description 1
- 108700005084 Multigene Family Proteins 0.000 description 1
- 108010085220 Multiprotein Complexes Proteins 0.000 description 1
- 102000007474 Multiprotein Complexes Human genes 0.000 description 1
- 206010028293 Muscle contractions involuntary Diseases 0.000 description 1
- 208000033761 Myelogenous Chronic BCR-ABL Positive Leukemia Diseases 0.000 description 1
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 1
- 108010046220 N-Acetylgalactosaminyltransferases Proteins 0.000 description 1
- 102000007524 N-Acetylgalactosaminyltransferases Human genes 0.000 description 1
- OVRNDRQMDRJTHS-KEWYIRBNSA-N N-acetyl-D-galactosamine Chemical compound CC(=O)N[C@H]1C(O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-KEWYIRBNSA-N 0.000 description 1
- 230000004988 N-glycosylation Effects 0.000 description 1
- 241000244206 Nematoda Species 0.000 description 1
- 101100202428 Neopyropia yezoensis atps gene Proteins 0.000 description 1
- 102000007339 Nerve Growth Factor Receptors Human genes 0.000 description 1
- 108010032605 Nerve Growth Factor Receptors Proteins 0.000 description 1
- 108010012255 Neural Cell Adhesion Molecule L1 Proteins 0.000 description 1
- 102000003797 Neuropeptides Human genes 0.000 description 1
- 230000004989 O-glycosylation Effects 0.000 description 1
- 108090000304 Occludin Proteins 0.000 description 1
- 102000012547 Olfactory receptors Human genes 0.000 description 1
- 108050002069 Olfactory receptors Proteins 0.000 description 1
- 108010078627 Oncogene Protein v-crk Proteins 0.000 description 1
- 101710160107 Outer membrane protein A Proteins 0.000 description 1
- 206010033128 Ovarian cancer Diseases 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 230000010800 Physiochemical Activity Effects 0.000 description 1
- 108010066816 Polypeptide N-acetylgalactosaminyltransferase Proteins 0.000 description 1
- 101800001357 Potential peptide Proteins 0.000 description 1
- 102400000745 Potential peptide Human genes 0.000 description 1
- 101100515690 Prochlorococcus marinus (strain SARG / CCMP1375 / SS120) nadK2 gene Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101710088675 Proline-rich peptide Proteins 0.000 description 1
- 108010043005 Prolyl Hydroxylases Proteins 0.000 description 1
- 102000004079 Prolyl Hydroxylases Human genes 0.000 description 1
- 206010060862 Prostate cancer Diseases 0.000 description 1
- 208000004403 Prostatic Hyperplasia Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 201000004681 Psoriasis Diseases 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 208000007014 Retinitis pigmentosa Diseases 0.000 description 1
- 102100040119 SH3 domain-binding protein 5 Human genes 0.000 description 1
- 108091007602 SLC58A1 Proteins 0.000 description 1
- 108090000184 Selectins Proteins 0.000 description 1
- 102000003800 Selectins Human genes 0.000 description 1
- 102000013008 Semaphorin-3A Human genes 0.000 description 1
- 108010090319 Semaphorin-3A Proteins 0.000 description 1
- 101710189648 Serine/threonine-protein phosphatase Proteins 0.000 description 1
- 208000009106 Shy-Drager Syndrome Diseases 0.000 description 1
- 108010029176 Sialic Acid Binding Ig-like Lectin 1 Proteins 0.000 description 1
- 102100029947 Sialic acid-binding Ig-like lectin 6 Human genes 0.000 description 1
- 102100032855 Sialoadhesin Human genes 0.000 description 1
- 101710154810 Snake venom metalloproteinase Proteins 0.000 description 1
- 208000010112 Spinocerebellar Degenerations Diseases 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 108010000499 Thromboplastin Proteins 0.000 description 1
- 102100030859 Tissue factor Human genes 0.000 description 1
- 102100026260 Titin Human genes 0.000 description 1
- 101710169732 Transforming growth factor beta activator LRRC32 Proteins 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- 101150076688 UCP2 gene Proteins 0.000 description 1
- 101150016260 UCP3 gene Proteins 0.000 description 1
- LFTYTUAZOPRMMI-NESSUJCYSA-N UDP-N-acetyl-alpha-D-galactosamine Chemical compound O1[C@H](CO)[C@H](O)[C@H](O)[C@@H](NC(=O)C)[C@H]1O[P@](O)(=O)O[P@](O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 LFTYTUAZOPRMMI-NESSUJCYSA-N 0.000 description 1
- 102100031358 Urokinase-type plasminogen activator Human genes 0.000 description 1
- 108090000435 Urokinase-type plasminogen activator Proteins 0.000 description 1
- 244000000188 Vaccinium ovalifolium Species 0.000 description 1
- ZYPGADGCNXOUJP-CXVPHVKISA-N Variotin Chemical compound CCCC[C@@H](O)\C=C(/C)\C=C\C=C\C(=O)N1CCCC1=O ZYPGADGCNXOUJP-CXVPHVKISA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108010066342 Virus Receptors Proteins 0.000 description 1
- 102000018265 Virus Receptors Human genes 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- PFRQBZFETXBLTP-UHFFFAOYSA-N Vitamin K2 Natural products C1=CC=C2C(=O)C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)=C(C)C(=O)C2=C1 PFRQBZFETXBLTP-UHFFFAOYSA-N 0.000 description 1
- 101150010310 WNT-4 gene Proteins 0.000 description 1
- 101150019524 WNT2 gene Proteins 0.000 description 1
- 108700020986 Wnt-2 Proteins 0.000 description 1
- 102000052556 Wnt-2 Human genes 0.000 description 1
- 102000052548 Wnt-4 Human genes 0.000 description 1
- 108700020984 Wnt-4 Proteins 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 101710197154 Zinc metalloproteinase-disintegrin VMP-II Proteins 0.000 description 1
- 101710112278 Zinc metalloproteinase-disintegrin-like Proteins 0.000 description 1
- 101710200352 Zinc metalloproteinase-disintegrin-like VMP-III Proteins 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 206010000496 acne Diseases 0.000 description 1
- 210000001789 adipocyte Anatomy 0.000 description 1
- 210000004100 adrenal gland Anatomy 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000004931 aggregating effect Effects 0.000 description 1
- 230000032683 aging Effects 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 108010044715 asialofetuin Proteins 0.000 description 1
- 230000017047 asymmetric cell division Effects 0.000 description 1
- 230000035578 autophosphorylation Effects 0.000 description 1
- 230000004009 axon guidance Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 208000027119 bilirubin metabolic disease Diseases 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 230000008436 biogenesis Effects 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 210000001185 bone marrow Anatomy 0.000 description 1
- 238000010322 bone marrow transplantation Methods 0.000 description 1
- 230000004641 brain development Effects 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 239000001569 carbon dioxide Substances 0.000 description 1
- 229910002092 carbon dioxide Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000009134 cell regulation Effects 0.000 description 1
- 230000008614 cellular interaction Effects 0.000 description 1
- 230000005754 cellular signaling Effects 0.000 description 1
- 235000012000 cholesterol Nutrition 0.000 description 1
- 210000001612 chondrocyte Anatomy 0.000 description 1
- 102000006533 chordin Human genes 0.000 description 1
- 108010008846 chordin Proteins 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 1
- 239000005516 coenzyme A Substances 0.000 description 1
- 229940093530 coenzyme a Drugs 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 229940047120 colony stimulating factors Drugs 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 238000013270 controlled release Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 108050004038 cystatin Proteins 0.000 description 1
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 1
- 239000002852 cysteine proteinase inhibitor Substances 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000006735 deficit Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000003210 demyelinating effect Effects 0.000 description 1
- 210000003298 dental enamel Anatomy 0.000 description 1
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- JXSJBGJIGXNWCI-UHFFFAOYSA-N diethyl 2-[(dimethoxyphosphorothioyl)thio]succinate Chemical compound CCOC(=O)CC(SP(=S)(OC)OC)C(=O)OCC JXSJBGJIGXNWCI-UHFFFAOYSA-N 0.000 description 1
- 102000038379 digestive enzymes Human genes 0.000 description 1
- 108091007734 digestive enzymes Proteins 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 210000003981 ectoderm Anatomy 0.000 description 1
- 230000000667 effect on insulin Effects 0.000 description 1
- 235000014103 egg white Nutrition 0.000 description 1
- 210000000969 egg white Anatomy 0.000 description 1
- 230000027721 electron transport chain Effects 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000029142 excretion Effects 0.000 description 1
- 235000021107 fermented food Nutrition 0.000 description 1
- 239000003527 fibrinolytic agent Substances 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 235000019688 fish Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 230000008571 general function Effects 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 229940097043 glucuronic acid Drugs 0.000 description 1
- 230000009036 growth inhibition Effects 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000007773 growth pattern Effects 0.000 description 1
- 230000011132 hemopoiesis Effects 0.000 description 1
- 108010037896 heparin-binding hemagglutinin Proteins 0.000 description 1
- 108010052188 hepatoma-derived growth factor Proteins 0.000 description 1
- 208000029824 high grade glioma Diseases 0.000 description 1
- 102000052620 human IL10 Human genes 0.000 description 1
- 229960002773 hyaluronidase Drugs 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 208000036796 hyperbilirubinemia Diseases 0.000 description 1
- 230000035879 hyperinsulinaemia Effects 0.000 description 1
- 206010020718 hyperplasia Diseases 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000008076 immune mechanism Effects 0.000 description 1
- 229960003444 immunosuppressant agent Drugs 0.000 description 1
- 239000003018 immunosuppressive agent Substances 0.000 description 1
- 238000002513 implantation Methods 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 230000002608 insulinlike Effects 0.000 description 1
- 229940047124 interferons Drugs 0.000 description 1
- 229940047122 interleukins Drugs 0.000 description 1
- 239000000543 intermediate Substances 0.000 description 1
- 230000004068 intracellular signaling Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 230000003907 kidney function Effects 0.000 description 1
- 230000001535 kindling effect Effects 0.000 description 1
- 150000002611 lead compounds Chemical class 0.000 description 1
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 description 1
- 229940039781 leptin Drugs 0.000 description 1
- 208000032839 leukemia Diseases 0.000 description 1
- 230000002197 limbic effect Effects 0.000 description 1
- 101150027724 lin-44 gene Proteins 0.000 description 1
- 229920006008 lipopolysaccharide Polymers 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000011360 lung alveolus development Effects 0.000 description 1
- 210000003563 lymphoid tissue Anatomy 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 230000003211 malignant effect Effects 0.000 description 1
- 201000011614 malignant glioma Diseases 0.000 description 1
- 210000005171 mammalian brain Anatomy 0.000 description 1
- 210000001161 mammalian embryo Anatomy 0.000 description 1
- 230000008009 mammalian fertilization Effects 0.000 description 1
- 210000005075 mammary gland Anatomy 0.000 description 1
- 210000004359 mandibular condyle Anatomy 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- DKHGMERMDICWDU-GHDNBGIDSA-N menaquinone-4 Chemical compound C1=CC=C2C(=O)C(C/C=C(C)/CC/C=C(C)/CC/C=C(C)/CCC=C(C)C)=C(C)C(=O)C2=C1 DKHGMERMDICWDU-GHDNBGIDSA-N 0.000 description 1
- 230000037353 metabolic pathway Effects 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 230000009401 metastasis Effects 0.000 description 1
- 230000001394 metastastic effect Effects 0.000 description 1
- 208000037819 metastatic cancer Diseases 0.000 description 1
- 208000011575 metastatic malignant neoplasm Diseases 0.000 description 1
- 206010061289 metastatic neoplasm Diseases 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 210000000258 minor salivary gland Anatomy 0.000 description 1
- 208000015994 miscarriage Diseases 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 230000002297 mitogenic effect Effects 0.000 description 1
- 230000000921 morphogenic effect Effects 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 210000000066 myeloid cell Anatomy 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 208000010125 myocardial infarction Diseases 0.000 description 1
- 210000002850 nasal mucosa Anatomy 0.000 description 1
- 230000003988 neural development Effects 0.000 description 1
- 230000004766 neurogenesis Effects 0.000 description 1
- 230000014511 neuron projection development Effects 0.000 description 1
- 230000007514 neuronal growth Effects 0.000 description 1
- 230000005015 neuronal process Effects 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 208000031237 olivopontocerebellar atrophy Diseases 0.000 description 1
- 231100000590 oncogenic Toxicity 0.000 description 1
- 230000002246 oncogenic effect Effects 0.000 description 1
- 210000002997 osteoclast Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 108010043846 ovoinhibitor Proteins 0.000 description 1
- 108010000416 ovomacroglobulin Proteins 0.000 description 1
- 230000036542 oxidative stress Effects 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001991 pathophysiological effect Effects 0.000 description 1
- 238000000059 patterning Methods 0.000 description 1
- 108010091748 peptide A Proteins 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 210000005259 peripheral blood Anatomy 0.000 description 1
- 239000011886 peripheral blood Substances 0.000 description 1
- 210000001428 peripheral nervous system Anatomy 0.000 description 1
- 208000033808 peripheral neuropathy Diseases 0.000 description 1
- 239000003016 pheromone Substances 0.000 description 1
- 150000008104 phosphatidylethanolamines Chemical class 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 230000001817 pituitary effect Effects 0.000 description 1
- 229940012957 plasmin Drugs 0.000 description 1
- 210000004180 plasmocyte Anatomy 0.000 description 1
- 230000004983 pleiotropic effect Effects 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 230000001242 postsynaptic effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 230000002335 preservative effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 150000003180 prostaglandins Chemical class 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 210000001243 pseudopodia Anatomy 0.000 description 1
- 150000003254 radicals Chemical class 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 210000003370 receptor cell Anatomy 0.000 description 1
- NPCOQXAVBJJZBQ-UHFFFAOYSA-N reduced coenzyme Q9 Natural products COC1=C(O)C(C)=C(CC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)CCC=C(C)C)C(O)=C1OC NPCOQXAVBJJZBQ-UHFFFAOYSA-N 0.000 description 1
- 230000011514 reflex Effects 0.000 description 1
- 230000029865 regulation of blood pressure Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000027756 respiratory electron transport chain Effects 0.000 description 1
- 230000000284 resting effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 102000029752 retinol binding Human genes 0.000 description 1
- 108091000053 retinol binding Proteins 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 210000003079 salivary gland Anatomy 0.000 description 1
- 210000004116 schwann cell Anatomy 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000008786 sensory perception of smell Effects 0.000 description 1
- 239000003001 serine protease inhibitor Substances 0.000 description 1
- 230000009450 sialylation Effects 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 210000000278 spinal cord Anatomy 0.000 description 1
- 208000000995 spontaneous abortion Diseases 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000003270 steroid hormone Substances 0.000 description 1
- 210000002784 stomach Anatomy 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L sulfate group Chemical group S(=O)(=O)([O-])[O-] QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 208000011580 syndromic disease Diseases 0.000 description 1
- 150000003505 terpenes Chemical class 0.000 description 1
- 230000000476 thermogenic effect Effects 0.000 description 1
- 230000008719 thickening Effects 0.000 description 1
- 229960000103 thrombolytic agent Drugs 0.000 description 1
- 210000001578 tight junction Anatomy 0.000 description 1
- 230000025366 tissue development Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 208000001510 transient familial neonatal hyperbilirubinemia Diseases 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 208000035408 type 1 diabetes mellitus 1 Diseases 0.000 description 1
- 150000003668 tyrosines Chemical class 0.000 description 1
- 229940035936 ubiquinone Drugs 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 210000003556 vascular endothelial cell Anatomy 0.000 description 1
- 230000008189 vertebrate development Effects 0.000 description 1
- 229940041603 vitamin k 3 Drugs 0.000 description 1
- 210000001849 von ebner gland Anatomy 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 238000009333 weeding Methods 0.000 description 1
- 239000011701 zinc Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Medicinal Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Cell Biology (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Toxicology (AREA)
- Genetics & Genomics (AREA)
- Immunology (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Zoology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
Abstract
An isolated nucleic acid has at least 80% sequence identity to a nucleotide sequence that encodes a polypeptide comprising the amino acid sequence shown in Figure 32(SEQ IDNO:54). Preferably the nucleotide sequence is that shown in Figure 31 (SEQ ID NO:53) or that deposited under ATCC 203287. Also described are vectors, host cells and antibodies associated with the nucleic acids and polypeptides. The polypeptide can be used in the manufacture of a pharmaceutical for treating a tumor.
Description
FURTHER PRO POLYPEPTIDES AND SEQUENCES THEREOF
FIELD OF THE INVENTION The present invention relates generally to the identification and isolation of novel DNA and to the recombinant production of novel polypeptides.
BACKGROUND OF THE INVENTION Extracellular proteins play important roles in, among other things, the formation, differentiation and maintenance of multicellular organisms. The fate of many individual cells, e.g., proliferation, migration, differentiation, or interaction with other cells, is typically governed by information received from other cells 10 and/or the immediate environment. This information is often transmitted by secreted polypeptides (for instance, mitogenicfactors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. These secreted polypeptides or signaling molecules normally pass through the cellular secretory pathway to reach their site of action in the extracellular environment.
Secreted proteins have various industrial applications, including as pharmaceuticals, diagnostics,
biosensors and bioreactors. Most protein drugs available at present, such as thrombolytic agents, interferons, interleukins, erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteins. Their receptors, which are membrane proteins, also have potential as therapeutic or diagnostic agents. Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are 20 focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. Examples of screening methods and techniques are described in the literature [see, for example, Klein et al., Proc. Natl. Acad. Sci. 93:7108-7113 (1996); U.S. Patent No. 5,536,637)].
Membrane-bound proteins and receptors can play important roles in, among other things, the formation, differentiation and maintenance of multicellular organisms. The fate of many individual cells, e.g., proliferation, 25 migration, differentiation, or interaction with other cells, is typically governed by information received from other cells and/or the immediate environment. This information is often transmitted by secreted polypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are, in turn, received and interpreted by diverse cell receptors or membrane-bound proteins. Such membrane-bound proteins and cell receptors include, but are not limited to, cytokine receptors, receptor 30 kinases, receptor phosphatases, receptors involved in cell-cell interactions, and cellular adhesin molecules like selectins and integrins. For instance, transduction of signals that regulate cell growth and differentiation is regulated in part by phosphorylation of various cellular proteins. Protein tyrosine kinases, enzymes that catalyze that process, can also act as growth factor receptors. Examples include fibroblast growth factor receptor and nerve growth factor receptor.
1
Membrane-bound proteins and receptor molecules have various industrial applications, including as pharmaceutical and diagnostic agents. Receptor immunoadhesins, for instance, can be employed as therapeutic agents to block receptor-ligand interactions. The membrane-bound proteins can also be employed for screening of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction.
Efforts are being undertaken by both industry and academia to identify new, native receptor or 5 membrane-bound proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel receptor or membrane-bound proteins.
1. PRQ1560
The tetraspan family of proteins has grown to include approximately 20 known genes from various 10 species, including drosophila. The tetraspans are also known as the transmembrane 4 (TM4) superfamily and are proposed to have an organizing function in the cell membrane. Their ability to interact with other molecules and function in such diverse activities as cell adhesion, activation and differentiation, point to a role of aggregating large molecular complexes. Skubitz, et al., J. Immunology. 157:3617-3626(1996). The tetraspan group has also emerged as a set of proteins with prominent functions in Schwann cell biology. Mirsky and 15 lessen, Curr. Opin. Neurobiol.. 6(l):89-96 (1996). Tetraspans (also sometimes called tetraspanins) are further described in Maecker, et al., FASEB. 11:428-442 (1997). Thus, members of the tetraspan family are of interest.
2. PRQ444
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. 20 Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR0444.
3. PRQ1018
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PR01018.
4. PRQ1773
The primary and rate-limiting step in retinoic acid biosynthesis requires the conversion of retinol to retinal. Retinol dehydrogenase proteins are enzymes which function to recognize holo-cellular retinol-binding protein as a substrate, thereby catalyzing the first step of retinoic acid biogenesis from its substrate. Various retinol dehydrogenase genes have been cloned and characterized, wherein the products of these genes are 35 suggested as potentially being useful for the treatment of retinitis pigmentosa, psoriasis, acne and various cancers (Chai et al., J. Biol. Chem. 270:28408-28412 (1995) and Chai et al., Gene 169:219-222 (1996)). Given the obvious importance of the retinol dehydrogenase enzymes, there is significant interest in the identification and
2
characterization of novel polypeptides having homology to a retinol dehydrogenase. We herein describe the identification and characterization of novel polypeptides having homology to a retinol dehydrogenase protein, designated herein as PRO1773 polypeptides.
. PRQ1477
Glycosylation is an important mechanism for modulating the physiochemical and biological properties of proteins in a stage- and tissue-specific manner. One of the important enzymes involved in glycosylation in Saccharomyces cerevisiae is alpha 1,2-mannosidase, an enzyme that catalyzes the conversion of Man9GlcNAc2 to Man8GlcNAc2 during the formation of N-linked oligosaccharides. The Saccharomyces cerevisiae alpha 1,2-mannosidase enzyme of is a member of the Class 1 alpha 1,2-mannosidases that are conserved from yeast to 10 mammals. Given the important roles played by the alpha 1,2-maimosidases and the mannosidases in general in glycosylation and the physiochemical activity regulated by glycosylation, there is significant interest in identifying novel polypeptides having homology to one or more mannosidases. We herein describe the identification and characterization of novei polypeptides having homology to a mannosidase protein, designated herein as PR01477 polypeptides.
6. PRQ1478
Recently, a new subfamily of galactosyltransferase genes that encode type II transmembrane proteins was identified from a mouse genomic library (Hennet et al., (1998) J. Biol. Chem. 272111:58-65). Galactosyl transferases, in general, are all of interest. Beta 1,4-galactosyltransferase is been found in two 20 subcellular compartments where it is believed to perform two distinct function. Evans, et al., loess ays. 17(3):261-268 (1995). Beta 1,4-galactosyltransferase is described as a possible transducing receptor in Dubois and Shur. Adv. Exp. Med. Biol.. 376:105-114 (19951. and further reported on in Shur. Glvcobiologv. 1(6):563-575 (1991). Expression and function of cell surface galactosyltransferase is reported on in Shur, Biochim. Bioohvs. Acta.. 988(3):389-409 (1989). Moreover, the receptor function of galactosyltransferase during 25 mammalian fertilization is described in Shur, Adv. Exp. Biol.. 207:79-93 (1986), and the receptor function during cellular interactions is described in Shur, Mol. Cell Biochem.. 61(2):143-158 (1984). Thus, it is understood that galactosyltransferases and their related proteins axe of interest.
7. PRQ831
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR0831.
8. PRQ1113
Protein-proteininteractions include receptor and antigen complexes and signaling mechanisms. As more is known about the structural and functional mechanisms underlying protein-protein interactions, protein-protein
3
interactions can be more easily manipulated to regulate the particular result of the protein-protein interaction. Thus, the underlying mechanisms of protein-protein interactions are of interest to the scientific and medical community.
All proteins containing leucine-rich repeats are thought to be involved in protein-protein interactions. Leucine-rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular 5 locations. The crystal structure of ribonuclease inhibitor protein has revealed that leucine-rich repeats correspond to beta-alpha structural units. These units are arranged so that they form a parallel beta-sheet with one surface exposed to solvent, so that the protein acquires an unusual, nonglubular shape. These two features have been indicated as responsible for the protein-binding functions of proteins containing leucine-rich repeats. See, Kobe and Deisenhofer, Trends Biochem. Sci.. 19(10):415-421 (Oct. 1994).
A study has been reported on leucine-rich proteoglycans which serve as tissue organizers, orienting and ordering collagen fibrils during ontogeny and are involved in pathological processes such as wound healing, tissue repair, and tumor stroma formation. Iozzo, R. V., Crit. Rev. Biochem. Mol. Biol.. 32(2): 141-174 (1997). Others studies implicating leucine rich proteins in wound healing and tissue repair are De La Salle, C., et al.. Vouv. Rev. Fr. Hematol. (Germany), 37(4):215-222(1995), reporting mutations in the leucine rich motif 15 in a complex associated with the Weeding disorder Bernard-Soulier syndrome, Chlemetson, K. J., Thromb. Haemost. (Germany), 74(1): 111-116 (July 1995), reporting that platelets have leucine rich repeats and Ruosiahti, E. I., et al., W09110727-A by La Jolla Cancer Research Foundation reporting that decorin binding to transforming growth factor^ has involvement in a treatment for cancer, wound healing and scarring. Related by function to this group of proteins is the insulin like growth factor (IGF), in that it is useful in wound-healing and 20 associated therapies concerned with re-growth of tissue, such as connective tissue, skin and bone; in promoting body growth in humans and animals; and in stimulating other growth-related processes. The acid labile subunit of IGF (ALS) is also of interest in that it increases the half-life of IGF and is part of the IGF complex in vivo.
Another protein which has been reported to have leucine-rich repeats is the SLIT protein which has been reported to be useful in treating neuro-degenerative diseases such as Alzheimer's disease, nerve damage such 25 as in Parkinson's disease, and for diagnosis of cancer, see, Artavanistsakonas, S. and Rothberg, J. M., W09210518-A1 by Yale University. Of particular interest is LIG-1, a membrane glycoprotein that is expressed specifically in glial cells in the mouse brain, and has leucine rich repeats and immunoglobulin-like domains. Suzuki, et al., J. Biol. Chem. (U.S.), 271(37)-.22522 (1996). Other studies reporting on the biological functions ofproteins having leucine rich repeats include: Tayar, N., etal., Mol. Cell Endocrinol.. (Ireland). 125(l-2):65-30 70(Dec. 1996) (gonadotropin receptor involvement); Miura, Y., etal., Nippon Rinsho (Japan). 54(7): 1784-1789 (July 1996) (apoptosis involvement); Harris, P. C., etal., J. Am. Soc. Nephrol.. 6(4): 1125-1133 (Oct. 1995) (kidney disease involvement).
9. PRQ1194
The nuclear genes PET117 and PET119 are required for the assembly of active cytochrome c oxidase in S. Cerevisiae, and therefore, are of interest. Also of interest are nucleic acids which have sequence identity with these genes. PET genes are further described in McEwen, et al., Curr. Genet.. 23(1):9-14 (1993).
4
. PROlllO
The bone marrow plays many important roles in the mammal. One of those roles is to provide a source of various progenitor cells that differentiate into important cells and other components of the blood and immune systems. As such, the function of the myeloid system is of extreme interest.
We herein describe the identification and characterization of novel polypeptides having homology to 5 myeloid upregulated protein, designated herein as PROl 110 polypeptides.
11. PRQ1378
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding 10 sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PRO1378.
12. PRQ1481
Efforts are being undertaken by both industry and academia to identify new, native proteins. Many 15 efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel proteins. We herein describe the identification and characterization of a novel protein designated herein as PRO 1481.
13. PRQ1189
There has been much interest in the identification of receptor proteins on stem cells and progenitor cells which may be involved in triggering proliferation or differentiation. A type II transmembrane protein was identified in proliferating progenitor cells in the outer perichondrial rim of the postnatal mandibular condyle proliferation. The investigators concluded that E25 could be a useful marker for chondro-osteogenic differentiation (Deleersnijder, et al. J. Biol. Chem. 271(321:19475-19482 (1996)).
14. PRQ1415
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and 30 characterization of a novel transmembrane polypeptide designated herein as PR01415.
. FRQ1411
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding 35 sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01411.
16. PRQ1295
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01295.
17. PRQ1359
Enzymes such as hyaluronidase, sialyltransferase, urokinase-type plasminogenactivator.plasmin, matrix metalloproteinases, and others, play central roles in the catabolism of extracellular matrix molecules. As such, these enzymes and inhibitors thereof, may play roles in metastatic cancer and the treatment thereof. Van 10 Aswegen and du Plessis, Med. Hypotheses. 48(5):443-447 (1997). For the foregoing reason, as well as their diversity in substrate specificity example, sialyltransferases are of particular interest. For example, a peptide of interest is the GalNAc alpha 2, 6-sailytransferase as described in Kurosawa, et al., J. Biol. Chem.. 269(2):1402-1409 (1994). This peptide was constructed to be secreted, and retained its catalytic activity. The expressed enzyme exhibited activity toward asialomucin and asialofetuin, but not other glycoproteins tested. As 15 sialylation is an important function, sialyltransferases such as this one, and peptides related by sequence identity, are of interest. Sialyltransferases are further described in the literature, see for example, Sjoberg, et al, J. Biol. Chem.. 271(13):7450-7459 (1996), Tsuji, J. Biochem.. 120(1): 1-13 (1996) and Harduin-Lepers, et al., Glvcobiologv. 5(8):741-758 (1995).
18. PRQ1190
Kang et al. reported the identification a novel cell surface glycoprotein of the Ig superfamily (J. Cell biol. (1997) 138(0:203-213'). Cell adhesion molecules of the Ig superfamily are implicated in a wide variety of biological processes, including cell migration, growth control, and tumorigenesis. The Kang et al. studies suggest that loss of CDO function may play a role in oncogenesis. Accordingly, the identification of additional 25 CDO-like molecules, and more generally, cell adhesion molecules of the Ig superfamily, is of interest.
19. PRQ1772
Peptidases are enzymatic proteins that function to cleave peptide substrates either in a specific or nonspecific manner. Peptidases are generally involved in a large number of very important biological processes in 30 mammalian and non-mammalian organisms. Numerous different peptidase enzymes from a variety of different mammalian and non-mammalian organisms have been both identified and characterized. The mammalian peptidase enzymes play important roles in many different biological processes including, for example, protein digestion, activation, inactivation, or modulation of peptide hormone activity, and alteration of the physical properties of proteins and enzymes.
In light of the important physiological roles played by peptidase enzymes, efforts are currently being undertaken by both industry and academia to identify new, native peptidase homologs. Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for
6
novel transmembrane proteins. Examples of screening methods and techniques are described in the literature [see, for example, Klein etal., Proc. Natl. Acad. Sci.. 93:7108-7113 (1996); U.S. Patent No. 5,536,637)]. We herein describe the identification of novel polypeptides having homology to various peptidase enzymes, designated herein as PRO1772 polypeptides.
20. PRQ1248
Putative protein-2 (PUT-2) is a homolog of the human disease genes L1CAM, G6PD and P55 (Riboldi Tunnicliffeetal., Genome Analysis, submitted). As such, there is interest in identifying novel polypeptides and encoding DNA having homology to the PUT-2 protein. We herein describe the identification and characterization of novel polypeptides having homology to PUT-2 protein, designated herein as PR01248 polypeptides.
21. PRQ1316
Dickkopf (Dkk) is a family of secreted proteins having a high degree of homology in the cysteine-rich domains (i.e., 80-90%). Dkk-1, the first discovered member, of this family has potent head-inducgin activity on the Spemann organizer. Glinka et al., Nature 391 (6665): 357-362 (1988). The Spemann organizer of the 15 amphibian embryo can be subdivided into two discrete activities, namely trunk organizer and head organizer. Dkk-1 has been found to be both sufficient and necessary to cause head induction in Xenopus embryos and is further a potent antagonist of Wnt signaling, suggesting that the Dkk genes encode an entire family of Wnt inhibitors.
Members of the Wnt gene family function in both normal development and differentiation as well as 20 in tumorigenesis. Wnts are encoded by a large gene family whose members have been found in round worms, insects, cartilaginous fish, and vertebrates. Holland et al., Dev. Suppl., 125-133 (1994). Wnt genes encode a family of secreted glycoproteins that modulate cell fate and behavior in embryos through activation of receptor-mediated signaling pathways.
Studies of mutations in Wnt genes have indicated a role for Wnts in growth control and tissue 25 patterning. In Drosophila, wingless (wg) encodes a Wnt-related gene (Rijsewik et al., Cell, 50: 649-657 (1987)) and wg mutations alter the pattern of embryonic ectoderm, neurogenesis, and imaginal disc outgrowth. Morata and Lawerence, Dev. Biol., 56: 227-240 (1977); Baker, Dev. Biol., 125: 96-108 (1988); Klingensmith and Nusse, Dev. Biol., 166: 396-414 (1994). In Caenorhabditis elegans, lin-44 encodes a Wnt homolog which is required for asymmetric cell divisions. Herman and Horvitz, Development, 120:1035-1047(1994). Knock-out 30 mutations in mice have shown Wnts to be essential for brain development (McMahon and Bradley, Cell, 62: 1073-1085 (1990); Thomas and Cappechi, Nature, 346: 847-850 (1990)), and the outgrowth of embryonic primordia for kidney (Stark et al., Nature, 372: 679-683 (1994)), tail bud (Takada et al., Genes Dev., 8: 174-189 (1994)), and limb bud. Parr and McMahon, Nature, 374: 350-353 (1995). Overexpression of Wnts in the mammary gland can result in mammary hyperplasia and tumors, ((McMahon, supra (1992); Nusse and 35 Varmus, H.E., Cell 69: 1073-1087(1992)), and precocious alveolar development. Bradbury etal., Dev. Biol., 170: 553-563 (1995). Moreover, constitutive expression of Wnt-4 in virgin hosts of transplanted mammary epithelium resulted in highly branched tissue, similar to a pregnancy-like growth pattern. Bradbury et al., Dev.
7
Biol. 170: 553-563 (1995).
The Wnt/Wg signal transduction pathway plays an important role in the biological development of the organism and has been implicated in several human cancers. This pathway also includes the tumor suppressor gene, APC. Mutations in the APC gene are associated with the development of sporadic and inherited forms of human colorectal cancer. For example, elevated levels of Wnt-2 have been observed in colorectal cancers. 5 Vider, B-Z. et al., Oncogene 12: 153-158 (1996).
22. FRQ1197
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding 10 sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01197.
23. PRQ1293
Immunoglobulins are antibody molecules, the proteins that function both as receptors for antigen on the 15 B-cell membrane and as the secreted products of the plasma cell. Like all antibody molecules, immunoglobulins perform two major functions: they bind specifically to an antigen and they participate in a limited number of biological effector functions. Therefore, new members of the Ig superfamily and fragments thereof are always of interest. Molecules which act as receptors by various viruses and those which act to regulate immune function are of particular interest. Also of particular interest are those molecules which have homology to known Ig 20 family members which act as virus receptors or regulate immune function. Thus, molecules having homology to Ig superfamily members and fragments thereof (i.e., heavy and light chain fragments) are of particular interest.
We herein describe the identification and characterization of novel polypeptides having homology to an immunoglobulin heavy chain variable region protein, designated herein as PR01293 polypeptides.
24. PRQ1380
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and 30 characterization of a novel transmembrane polypeptide designated herein as PR01380.
. PRQ1265
The identification of novel secreted proteins involved in physiological and metabolic pathways is of interest because of their potential use as pharmaceutical agents. Of particular interest is the identification of 35 novel polypeptides that are potentially involved in immune response and inflammation mechanisms. A novel polypeptide has recently been identified that is expressed in mouse B cells in response to IL-4. The gene encoding this polypeptide is referred to as interleukin-four induced gene 1, or "Figl" (Chu et al. Proc. Natl.
8
Acad. Sci (1997) 94(61:2507-25121.
26. PRQ1250
Long chain fatty acid Co A ligase is an enzymatic protein that functions to ligate together long chain fatty acids, a function that plays important roles in a variety of different physiological processes. Given the 5 importance of this enzymatic protein, efforts are currently being undertaken to identify novel long chain fatty acid Co A ligase homologs. We herein describe the identification and characterization of novel polypeptides having homology to long chain fatty acid CoA ligase, designated herein as PR01250 polypeptides.
27. PRQ1475
N-acetylglucosaminyltransferase proteins comprise a family of enzymes that provide for a variety of important biological functions in the mammalian organism. As an example, UDP-N-acetylglucosamine: alpha-3-D-mannoside beat-1,2-N-acetylglucosaminyltransferase I is an enzymatic protein that catalyzes an essential first step in the conversion of high-mannose N-glycans to hybrid and complex N-glycans (Sarkar et al., Proc. Natl. Acad. Sci. USA. 88:234-238 (1991). Given the obvious importance of the N-acetylglucosaminyltransferase 15 enzymes, there is significant interest in the identification and characterization of novel polypeptides having homology to an N-acetylglucosaminyltransferase protein. We herein describe the identification and characterization of novel polypeptides having homology to an N-acetylglucosaminyitransferase protein, designated herein as PRO1475 polypeptides.
28. PRQ1377
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PRO1377.
29. PRQ1326
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel 30 secreted protein designated herein as PRO 1326.
. PRQ1249
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to 35 identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PR01249.
9
31. PRQ1315
Many important cytokine proteins have been identified and characterized and shown to signal through specific cell surface receptor complexes. For example, the class II cytokine receptor family (CRF2) includes the interferon receptors, the interleukin-10 receptor and the tissue factor CRFB4 (Spencer et al., J. Exp. Med. 187:571-578 (1998) and Kotenko et al., EMBO J. 16:5894-5903 (1997)). Thus, the multitude of biological 5 activities exhibited by the various cytokine proteins is absolutely dependent upon the presence of cytokine receptor proteins on the surface of target cells. There is, therefore, a significant interest in identifying and characterizing novel polypeptides having homology to one or more of the cytokine receptor family. We herein describe the identification and characterization of a novel polypeptide having homology to cytokine receptor family-4 proteins, designated herein as PRO 1315 polypeptides.
32. PRQ1S99
Granzyme M is a natural killer cell serine protease. The human gene is 7.5 kilobases, has an exon-intron structure identical to other serine proteases, and is closely linked to the serine protease gene cluster on chromosome 19pl3.3. (Pilat et al., Genomics. 24:445-450 (1994)). Granzyme M has been found in two human 15 natural killer leukemia cell lines, unstimulated human peripheral blood monocytes and untreated purified CD3-CD56+ large granular lymphocytes. (Smyth et al., J. Immunol.. 151:6195-6205 (1993)).
33. PRQ1430
Reductases form a large class of enzymatic proteins found in a variety of mammalian tissues and play 20 many important roles for the proper functioning of these tissues. They are antioxidant enzymes that catalyze the conversion of reactive oxygen species to water. Abnormal levels or functioning of reductases have been implicated in several diseases and disorders including strokes, heart attacks, oxidative stress, hypertension and the development of both benign and malignant tumors. For example, malignant prostate epithelium may have lowered expression of such antioxidant enzymes [Baker et al., Prostate 32(4):229-233 (1997)]. International 25 patent application no. W09622360-A1 describes a prostate specific reductase that is useful for diagnosing and treating prostate cancer and screening new antagonists. Inhibitors of alpha-reductase have been used in the treatment of benign prostatic hyperplasia (Anderson, Drugs Aging (19961 6(51:388-396). For these reasons, the identification of new members of the reductase family has been of interest for the treatment and diagnosis of cancers and other diseases and disorders.
34. PRQ1374
Prolyl 4-hyroxylase (P4HA) catalyzes the formation of 4-hydroxyproline in collagens. Annunen, et al., J. Biol. Chem., 272(28): 17342-17348 (1997); Helaakoski, et al., PNAS USA, 92(10):4427-4431 (1995); and Hopkinson, et al., Gene, 149(2):391-392 (1994). This enzyme and molecules related thereto are of interest.
. PRQ1311
The tetraspan family of proteins, also referred to as the "transmembrane 4 (TM4) superfamily", are
proposed to have an organizing function in the cell membrane. It is believed that they interact with large molecular complexes and function in such diverse activities as cell adhesion, activation and differentiation (see Maecker et al. FASEB (1997111:428-442). Accordingly, the identification of new members of the tetraspan family of proteins is of interest. Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA 5 libraries to identify the coding sequences for novel receptor proteins.
36. PRO 1357
Ebnerin is a cell surface protein associated with von Ebner glands in mammals. Efforts are being undertaken by both industry and academia to identify new, native proteins and specifically those which possess 10 sequence homology to cell surface proteins such as ebnerin or other salivary gland-associated proteins. Many of these efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel receptor proteins. We herein describe the identification of novel polypeptides having significant homology to the von Ebner minor salivary gland-associated protein, designated herein as PRO 1357 polypeptides.
37. PRQ1244
One type of transmembrane protein that has received attention is implantation-associated uterine protein. Deficiencies or abnormalities of this protein may be a cause of miscarriage. Therefore, the identification and characterization of implantation-associated proteins is of interest.
38. PRQ1246
Bone-related sulphatase is an enzymatic protein that has been shown to degrade sulphate groups of proteoglycan sugar chains in bone tissue (Australian Patent Publication No. AU 93/44921-A, March 3, 1994). Because of its specific sulphatase activity, it has been suggested that bone-related sulphatase may find use in the 25 treatment of bone metabolic diseases. As such, there is significant interest in identifying and characterizing novel polypeptides having sequence similarity to bone-related sulphatase. We herein describe the identification and characterization of novel polypeptides having homology to bone-related sulphatase, designated herein as PR01246 polypeptides.
39. PRQ1356
Clostridium perfringens enterotoxin (CPE) is considered to be the virulence factor responsible for causing the symptoms of C. perfringens type A food poisoning and may also be involved in other human and veterinary illnesses (McClane, Toxicon. 34:1335-1343 (1996)). CPE carries out its adverse cellular functions by binding to an approximately 50 kD cell surface receptor protein designated the Clostridium perfringens 35 enterotoxin receptor (CPE-R) to form an approximately 90,000 kD complex on the surface of the cell. cDNAs encoding the CPE-R protein have been identified characterized in both human and mouse (Katahira et al., J. Cell Biol. 136:1239-1247 (1997) and Katahira et al., J. Biol. Chem. 272:26652-26658 (1997)). Since the CPE toxin
11
has been reported to cause a variety of illnesses in mammalian hosts and those illnesses are initiated by binding of the CPE toxin to the CPE-R, there is significant interest in identifying novel CPE-R homologs. We herein describe the identification and characterization of novel polypeptides having homology to the CPE-R, designated herein as PR01356 polypeptides.
40. PRQ1275
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PRO 1275.
41. PRQ1274
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel 15 secreted protein designated herein as PRO1274.
42. PRQ1412
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to 20 identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PRO 1412.
43. PRQ1S57
The identification of secretory proteins that play roles in neural development are of interest. Such 25 proteins may fmd use in the understanding of and possible treatment of neurological diseases and disorders. Chordin protein, which has been isolated from Xenopus, is a potent dorsalizing factor that regulates cell-cell interactions in the organizing centers of Xenopus head, trunk and tail development (Sasai et al., (1994) Cell 79(51:779-790: see also Mullins, (1998) Trends Genet. 14/41:127-129: and Kessel etal. (1998)) Trends Genet. 14(51:169-1711. It may be used as a component of culture medium for culturing nerve and muscle cells, and 30 may have use in the treatment of neurodegenerative diseases and neural injury (U.S. Pat. No. 5,679,783).
44. PRQ1286
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding 35 sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01286.
12
45. PRQ1294
The extracellular mucous matrix of olfactory neuroepithelium is a highly organized structure in intimate contact with chemosensory cilia that house the olfactory transduction machinery. The major protein component of this extracellular matrix is olfactomedin, a glycoprotein that is expressed in olfactory neuroepithelium and which form intermolecular disulfide bonds so as to produce a polymer (Yokoe et al., Proc. Natl. Acad. Sci. USA 5 90:4655-4659 (1993), Bal et al., Biochemistry 32:1047-1053 (1993) and Snyder et al.. Biochemistry 30:9143-9153 (1991)). It has been suggested that olfactomedin may influence the maintenance, growth or differentiation of chemosensory cilia on the apical dendrites of olfactory neurons. Given this important role, there is significant interest in identifying and characterizing novel polypeptides having homology to olfactomedin. We herein describe the identification and characterization of a novel polypeptide having homology to olfactomedin protein. 10 We herein describe the identification and characterization of novel polypeptides having homology to olfactomedin protein, designated herein as PR01294 polypeptides.
46. PRO 1347
Butyrophilin is a milk glycoprotein that constitutes more than 40% of the total protein associated with 15 the fat globule membrane in mammalian milk. Expression of butyrophilin mRNA has been shown to correlate with the onset of milk fat production toward the end pregnancy and is maintained throughout lactation. Butyrophilin has been identified in bovine, murine and human (see Taylor et al., Biochim. Biophvs. Acta 1306:1-4 (1996), Ishii et al., Riochim. Biophvs. Acta 1245:285-292 (1995), Mather et al., J. Dairy Sci. 76:3832-3850 (1993), Ogg, et al., Mamm. Genome. 7(12):900-905 (1996), Sato, et al., J. Biochem.. 20 117(1): 147-157 (1995) and Banghart et al., J. Biol. Chem. 273:4171-4179 (1998)) and is a type I transmembrane protein that is incorporated into the fat globulin membrane. It has been suggested that butyrophilin may play a role as the principle scaffold for the assembly of a complex with xanthine dehydrogenase/oxidase and other proteins that function in the budding and release of milk-fat globules from the apical surface during lactation (Banghait et al., suoral. Given that butyrophilin plays a role in mammalian milk production, there is substantial 25 interest in identifying novel butyrophilin homologs.
47. PRQ1305
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding 30 sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01305.
48. PRQ1273
The lipocalin protein family is a large group of small extracellular proteins. The family demonstrates 35 great diversity at the sequence level; however, most lipocalins share characteristic conserved sequence motifs. Lipocalins are known to be involved in retinol transport, invertebrate cryptic coloration, olfaction and pheromone transport, and prostaglandin synthesis. The lipocalins have also been implicated in the regulation of cell
13
homoeostasis and the modulation of the immune response, and as carrier proteins, to act in the general clearance of endogenous and exogenous compounds. Flower, Biochem. J.. 318(Pt 1): 1-14- (1996); Flower, FEBS Lett.. 354(1):7-11 (1994). Thus, novel members of the lipocalin protein family are of interest.
49. PRQ1302
CD33 is a cell-surface protein that is a member of the sialoadhesin family of proteins that are capable of mediating sialic-acid dependent binding with distinct specificities for both the type of sialic acid and its linkage to subterminal sugars. CD33 is specifically expressed in early myeloid and some monocyte cell lineages and has been shown to be strongly associated with various myeloid tumors including, for example, acute non-lymphocytic leukemia (ANLL). As such, CD33 has been suggested as a potential target for the treatment 10 of cancers associated with high level expression of the protein. One CD33 homolog (designated CD33L) is described in Takei et al., Cvtoeenet. Cell Genet. 78:295-300 (1997). Another study describes the use of CD33 monoclonal antibodies in bone marrow transplantation for acute myeloid leukemia. Robertson, et al., Prog. Clin. Biol. Res.. 389:47-63 (1994).
Moreover, studies have reported that members of the sialoadhesion family contribute to a range of 15 macrophage functions, both under normal conditions as well as during inflammatory reactions. Crocker, et al., Glvcoconi. J., 14(5):601-609 (1997). Moreover, these proteins are associated with diverse biological processes, i.e., hemopoiesis, neuronal development and immunity. Kelm, et al., Glvcoconi. J.. 13(6):913-926 (1996). Thus, novel polypeptides related to CD33 by sequence identity are of interest.
50. PRQ1283
Olfactory reception occurs via the interaction of odorants with the chemosensory cilia of the olfactory receptor cells located in the nasal epithelium. Based upon the diversity of nasal epithelial-associated odorant binding proteins, the mammalian olfactory system is capable of recognizing and discriminating a large number of different odorant molecules. In this regard, numerous different odorant binding proteins and their encoding 25 DNA have recently been identified and characterized (Dear etal., Biochemistry 30:10376-10382 (1991). Pevsner etal., Sdencg241:336-339(1988), Bucketal., Cell 65:175-187 (1991)andBreeretal.. J. Recent. Res. 13:527-540 (1993)). Because study of the mechanisms of odorant detection by the mammalian olfactory system are of interest, there is significant interest in identifying novel odorant binding protein. We herein describe the identification and characterizationof novel polypeptides having homology to odorant binding proteins, designated 30 herein as PR01283 polypeptides.
51. PRQ1279
Proteases are enzymatic proteins which are involved in a large number of very important biological processes in mammalian and non-mammalian organisms. Numerous different protease enzymes from a variety 35 of different mammalian and non-mammalian organisms have been both identified and characterized, including the serine proteases which exhibit specific activity toward various serine-containing proteins. The mammalian protease enzymes play important roles in biological processes such as, for example, protein digestion, activation.
14
inactivation, or modulation of peptide hormone activity, and alteration of the physical properties of proteins and enzymes.
Neuropsin is a novel serine protease whose mRNA is expressed in the central nervous system. Mouse neuropsin has been cloned, and studies have shown that it is involved in the hippocampal plasticity. Neuropsin has also been indicated as associated with extracellular matrix modifications and cell migrations. See, generally, 5 Chen, et al., Neurosci.. 7(2):5088-5097 (1995) and Chen, et al., J. Histochem. Cvtochem.. 46:313-320 (1998).
We herein describe the identification and characterization of novel polypeptides having homology to neuropsin protein, designated herein as PR01279 polypeptides.
52. PRQ1304
The immunophilins are a family of proteins that function as receptors for immunosuppressant drugs,
such as cyclosporin A, FK506, and rapamycin. The immunophilins occur in two separate classes, (1) the FK506-binding proteins (FKBPs), which bind to FK506 and rapamycin, and (2) the cyclophilins, which bind to cyclosporin A. With regard to the FK506-binding proteins, it has been reported that the FK506/FKBP complex functions to inhibit the activity of the serine/threonine protein phosphatase 2B (calcineurin), thereby 15 providing immunosuppressant activity (Gold, Mol. Neurobiol. 15:285-306 (1997V). It has also been reported that the FKJBP immunophilins are found in the mammalian nervous system and may be involved in axonal regeneration in the central nervous system through a mechanism that is independent of the process by which immunosuppression is achieved (Gold, supra). Thus, there is substantial interest in identifying novel polypeptides having homology to the FKBP immunophilins.
We herein describe the identification and characterization of novel polypeptides having homology to
FK506 binding protein, designated herein as PR01304 polypeptides.
53. PRQ1317
There is considerable interest in the identification of molecules whose expression is increased upon 25 stimulation of leukocyte populations because insights into the structure and function of these molecules may lead to farther understanding of the intracellular and intercellular events that accompany activation. One such molecule, CD97, a cell surface antigen that is rapidly upregulated upon activation on lymphocytes, has recently been the subject of several publications (see Eichler el al. in Tissue Antigens (1997) 50(5);429-438: Aust et al., Cancer Res. (1997) 57(91:1798-18061. Leukocytes strongly positive for CD97 are concentrated at sites of 30 inflammation relative to CD97 expression in normal lymphoid tissue. A soluble subunit of CD97, CD97alpha, has been found in the body fluids from inflamed tissues (Gray et al. J. Immunol. (1996) 157(121:5438-54471.
54. PRQ1303
Proteases are enzymatic proteins which are involved in a large number of very important biological 35 processes in mammalian and non-mammalian organisms. Numerous different protease enzymes from a variety of different mammalian and non-mammalian organisms have been both identified and characterized, including the serine proteases which exhibit specific activity toward various serine-containing proteins. The mammalian
protease enzymes play important roles in biological processes such as, for example, protein digestion, activation, inactivation, or modulation of peptide hormone activity, and alteration of the physical properties of proteins and enzymes.
Neuropsin is a novel serine protease whose mRNA is expressed in the central nervous system. Mouse neuropsin has been cloned, and studies have shown that it is involved in the hippocampal plasticity. Neuropsin 5 has also been indicated as associated with extracellular matrix modifications and cell migrations. See, generally, Chen, et al., J. Neurosci.. 7(2):5088-5097 (1995) and Chen, et al., J. Histochem. Cvtochem.. 46:313-320 (1998). Other studies have reported that kindling induces neuropsin mRNA in the mouse brain. Okabe, et al., Brain Res., 728(1): 116-120 (1996). Additionally, a study has reported that generation of reactive oxygen species has an important role in neuropsin transcript in the limbic areas which might be related to the disturbance in 10 avoidance learning. Akita, et al., Brain Res.. 769(l):86-96 (1997). Thus, neuropsins, and related proteins and agents, including agonists and antagonists are of interest.
55. PRQ1306
There is much interest in the identification of proteins that play roles in mammalian disease and 15 disorders which could lead to new methods of treatment. A macrophage polypeptide, daintain/allograft inflammatory factor 1 (daintain/AlFl), has been identified in the pancreas of prediabetic rats, and has been determined to have a direct effect on insulin secretion. When injected intravenously in mice in low doses, daintain/AIFl doses inhibited glucose-stimulated insulin secretion with a concomitant impairment of glucose elimination. At higher doses, daintain/AIFl potentiated glucose-stimulated insulin secretion and enhanced 20 glucose elimination. Thus, it was suggested that daintain/AIFl may have a role in connection with the pathogenesis of insulin-dependent diabetes mellitus (Chen et al. Proc. Natl Acad. Sci. (1997) 94(251:13879-13884). AIF-1 has also been implicated in both rat and human allogenic heart transplant rejection (Utans et al. Transplantation (19961 61(91:1387-1392), and may play a role in macrophage activation and function (Utans et al. J. Clin. Invest. (19951 95(61:2954-29621.
56. PRQ1336
Protein-protein interactions include receptor and antigen complexes and signalingmechanisms. As more is known about the structural and functional mechanisms underlying protein-protein interactions, protein-protein interactions can be more easily manipulated to regulate the particular result of the protein-protein interaction. 30 Thus, the underlying mechanisms of protein-protein interactions are of interest to the scientific and medical community.
Leucine-rich proteins are known to be involved in protein-protein interactions. A study has been reported on leucine-rich proteoglycans which serve as tissue organizers, orienting and ordering collagen fibrils during ontogeny and are involved in pathological processes such as wound healing, tissue repair, and tumor 35 stroma formation. Iozzo, R. V., Crit. Rev. Biochem. Mol. Biol.. 32(2):141-174 (1997). Others studies implicating leucine rich proteins in wound healing and tissue repair are De La Salle, C., et al., Vouv. Rev. Fr. Hematol. (Germany), 37(4):215-222 (1995), reporting mutations in the leucine rich motif in a complex associated
16
with the bleeding disorder Bemard-Soulier syndrome and Chlemetson, K. J., Thromb. Haemost. (Germany), 74(1): 111-116 (July 1995), reporting that platelets have leucine rich repeats.
Another protein of particular interest which has been reported to have leucine-rich repeats is the slit protein which has been reported to be useful in treating neuro-degenerative diseases such as Alzheimer's disease, nerve damage such as in Parkinson's disease, and for diagnosis of cancer, see, Artavanistsakonas, S. and 5 Rothberg, J. M., W09210518-A1 by Yale University. The slit protein has been characterized and reported to be secreted by glial cells and involved in the formation of axonal pathways in Drosophila as well as the mediation of extracellular protein interactions. Wharton and Crews, Mech. Dev.. 40(3): 141-154 91993); Rothberg and Artavanis-Tsakonas, J. Mol. Biol.. 227(2):367-370 (1992); Rothberg, et al., Genes Dev.. 4(12A):2169-2187 (1990); and Rothberg, et al., QelL 55(6): 1047-1059 (1988).
57. PRQ1278
Lysozymes are secretedenzymes that preferentially hydrolyze the [beta]-l ,4 glucosidic linkages between N-acetylmuramic acid and N-acetylgucosamine which occur in the mucopeptide cell wall structure of certain microoganisms. Lysozyme is of widespread distribution in animals and plants. It has been found in mammalian 15 secretions and tissues including saliva, tears, milk, cervical mucus, leucocytes, kidneys, etc. The identification of new members of the lysozyme family of proteins is of interest because of the variety of roles lysozymes play in metabolic function and dysfunction. Abnormal levels of lysozymes have been implicated in various disease states. Lysozymes have been reported to have anti-microbial, analgesic, and antinociceptive properties. Additional characteristics and possible uses of lysozymes are described in U.S. Pat. No. 5,618,712.
58. PRQ1298
Glycosylation can determine the fate of a protein, for example, whether it is secreted or not. Also, glycoproteins play many structural and functional roles, particularly as pan of the cell membrane. Therefore, glycosylation is of interest. Studies have reported on the growth-related coordinate regulation of the early N-25 glycosylation genes in yeast. Kukuruzinska and Lennon, Glvcobiologv. 4(4):437-443 (1994). Moreover, the relationship between protein glycosylation and fatty acylation of glycoproteins was studied in the wild-type and asparagine-linked glycosylation-deficient mutants in yeast. Appukuttan, FEBS Lett.. 255(1):139-142 (1989). The biosynthesis of asparagine-linked oligosaccharides in yeast was also studied using a mutant. Jackson, et al., Glvcobiologv. 3(4):357-364 (1993). Yeast mutants deficient in protein glycosylation have also been reported 30 in Huffacker and Robbins, PNAS. 80(24)-.7466-7470 (1983).
59. PRQ1301
Cytochrome P450 proteins form a large class of monooxygenase enzymes involved in hydroxylation. Hydroxylation reactions are important in the synthesis of cholesterol and steroid hormones. Enzymes of the 35 cytochrome P450 family play an important role in the metabolism endogenous compounds such as arachidonic acid. These enzymes are also important in the metabolism of foreign substances such as the elimination of drugs from the body [see, for example, Peterson, Aliment. Pharmacol. Ther.. 9:1-9 (1995).]. In addition, metabolites
17
generated through the cytochrome P450 pathway may play a role in carcinogenesis, blood pressure regulation and renal function [see, for example, Rahman et al., Am. J. Hvnertens.. 10:356-365 (1997)].
60. PRQ1268
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and 5 receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmehibrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PR01268.
61. PRQ1269
Granulocytes, the most common type of white blood cell, have the ability to mediate immunologic cytotoxicity against tumor cells and microorganisms. Accordingly, there has been interest in identifying various factors that are produced by these cells because of their potential use as pharmaceutical agents. Patent publication no. W09729765-A1, to Selsted, describes the identification of granulocyte peptide A which was isolated from bovine and murine granulocytes. Several uses for this peptide were identified including, a 15 therapeutic use, use as an agricultural agent, use as a preservative for food, and use as a water treatment agent.
62. PRQ1327
Neurexophilin is a protein that was discovered as a neuronal glycoprotein that was copurified with neurexin I alpha during affinity chromatography on immobilized alpha-latrotoxin (Missler et al., J. Neurosci. 20 18:3630-3638 (1998)). Recent data has shown that the mammalian brain contains four genes for neurexophilins the products of which share a common structure composed of five domains: (1) an N-terminal signal peptide, (2) a variable N-terminal domain, (3) a highly conserved central domain that is N-glycosylated, (4) a short linker region and (5) a conserved C-terminal domain that is cysteine-rich (Missler et al., supra). These data further demonstrate that the neurexophilins are proteolytically processed after synthesis and bind to alpha-neurexins. 25 The structure and characteristics of neurexophilins indicate that they may function as neuropeptides that may signal via alpha-neurexins. Therefore, there is significant interest in identifying and characterizing novel polypeptides having homology to the neurexophilins.
We herein describe the identification and characterization of novel polypeptides having homology to neurexophilin protein, designated herein as PR01327 polypeptides.
63. PRQ1382
Cerebellin is a secreted, postsynaptic neuroprotein found throughout the brain. The highest concentrations of this protein have been found in the cerebellum. It has also been detected in the pituitary, spinal cord, and adrenal glands (Satoh et al. J. Endocrinol. (1997) 15491):27-34). The feasibility of using cerebellum 35 as a quantifiable marker for the investigation of the maturation of Purkinje ceils of the cerebellum and to chart neurodevelopment has been reported (see Slemmon et al. Proc. Natl. Acad. Sci (1985) 82(20V.7145-7148>. Significantly decreased levels of cerebellin have been found in human brains obtained in post-mortem studies
18
from patients with spinocerebellar degeneration, olivopontocerebellar atrophy (OPCAQ) and Shy-Drager syndrome, suggesting that cerebellin plays important pathophysiological roles in these cerebellar diseases (Mizuno etal. Brain Res. (19951 686(11:115-118: Mizuno etal. No To Shinkei (19951 47(111:1069-10741. In view of the importance of cerebellin in neurodevelopment and in neurological diseases and disorders, the identification and characterization of members of this protein family is of interest (see also Yiangou et al. JL 5 Neurochem (19891 53(31:886-889 and Muenaini et al. Synapse (19881 2(21:125-1381.
64. PRQ1328
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to 10 identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PR01328.
65. PRQ1325
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and 15 receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PR01325.
66. PRQ1340
Cadherins are known as the principal mediators of homotypic cellular recognition and play a demonstrated role in the morphogenic direction of tissue development. Cadherins are a diverse family of proteins that have been identified in various tissues including nervous tissue (Suzuki et al., Cell Reeul.. 2:261-270 (1991)). Ksp-cadherinis a kidney-specific member of the cadherin multigene family (Thomson et al., Biol. Chem. 270:17594-17601 (1995)). Cadherins are thought to play an important role in human cancer (Yap, 25 Cancer Invest.. 16:252-261 (1998)).
67. PRQ1339
Carboxypeptidases are of interest. Carboxypeptidase E appears to be involved in the biosynthesis of a wide range of peptide hormones. Fricker, Anna. Rev. Phvsiol.. 50:309-321 (1988). This carboxypeptidase 30 has been associated with obesity. Leiter. J. Endocrinol.. 155(21:211-214(19971. Carboxypeptidase M has been reported as being a marker of macrophage maturation. Krause, et al., Immunol. Rev.. 161:119-127 (1998). Human mast cell carboxypeptidase has been reported to be associated with allergies. Goldstein, et al., Monoar. Allergy. 27:132-145 (1990). Carboxypeptidase A2 has also been reported on. Faming, et al., J. Biol. Chem.. 266(36):24606-24612 (1991). Other carboxypeptidases of particular interest which are known in the art include 35 human pancreatic carboxypeptidase 2, carboxypeptidase al and carboxypeptidase B. Therefore, novel members of the carboxypeptidase family are of interest.
19
68. PRQ1337
Of particular interest is the identification of blood-related proteins which may have potential therapeutic use or may be useful in the diagnosis of blood-related disorders. Thyroxine-binding globulin (TBG) is synthesized by the liver and secreted into the bloodstream. It is the principal thyroid hormone transport protein in human serum (Refetoff et al. Horm. Res. (1996) 45(3-51:128-138). High serum levels of TBG have been 5 found to cause hyperthyroxinaemia (Leahy etal., Postgrad Med. J. (1984) 60(7031:324-327). Accordingly, the identification and characterization of TBG proteins is of interest (see Flink et al. Proc. Natl Acad Sci. USA (1986) 83(201:7708-7712: Bartalena et al. Acta Med. Austriaca. (1988) 15 SuppI 1:12-15). including the identification of abnormal TBG proteins (see Refetoff, Endocr Rev. (19891 10(31:275-2931. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel 10 secreted proteins. Examples of screening methods and techniques are described in the literature [see, for example, Klein et al., Proc. Natl. Acad. Sci.. 93:7108-7113 (1996); U.S. Patent No. 5,536,637)].
69. PRQ1342
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and 15 receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PR01342.
70. PRQ1343
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PRO1343.
71. PRQ1480
Semaphores are a large family of transmembrane and secreted proteins, many of which are expressed in the nervous system. Members of the semaphorin family include both ligands and receptors. (Eckhardt et al„ Mol. Cell. Neurosci.- 9: 409-419 (1997)). Studies have revealed a role for semaphorins in embryonic motor and central nervous system axon guidance and synapse formation. (Catalano et al-, Mol. Cell. Neurosci.. 11: 30 173-182 (1998); Kitsukawa et al., Neuron. 19: 995-1005 (1997); Yu et al., Neuron. 20: 207-220 (1998)). Semaphorins have been shown to induce neuronal growth cone collapse and alter their pathway in vivo. (Shoji et al., Development. 125: 1275-1283 (1998)). Members of the semaphorin family have been shown to be immunologically active, inducing cytokine production in human monocytes. (Comeau et al., Immunity. 8:473-482 (1998)). Semaphorins may also play a role in cancer. Expression of a mouse semaphorin gene is known 35 to correlate with metastatic ability in mouse tumor cell lines. (Christensen et al., Cancer Res.. 58: 1238-1244 (1998)).
72. PRQ1487
Fringe is a protein which specifically blocks serrate-mediated activation of notch in the dorsal compartment of the Drosophila wing imaginal disc (see Fleming et al., Development. 124(15):2973-81 (1997); Wu et al. Science (1996) 273/5273):355-3581. Fringe protein is also involved in vertebrate development where a thickening of the apical ectodermal ridge essential for limb bud outgrowth involves an interaction between 5 dorsal cells that express radical fringe and those that do not (see Wolpert, L. Philos Trans R Soc Lond B Biol gci 1998) 353(13701:871-875: Kengaku et al. Science (1998) 280(53671:1274-1277: Cohen et al. Nat. Genet. (1997) 16(31:283-288: Johnston et al. Development (1997) 124(111:2245-2254: Laufer et al. Nature (1997) 386(66231:366-373: Rodriguez-Esteban etal. Nature (19971 386(66231:360-366:1. ). Therefore, fringe protein is of interest for both its role in development as well as its ability to regulate serrate, particularly serrate's 10 signaling abilities. Also of interest are novel polypeptides which may have a role in development and/or the regulation of serrate-like molecules. Of particular interest are novel polypeptides having homology to fringe protein.
73. PRQ1418
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01418.
74. PRQ1472
Butyrophilin is a milk glycoprotein that constitutes more than 40% of the total protein associated with the fat globule membrane in mammalian milk. Expression of butyrophilin mRNA has been shown to correlate with the onset of milk fat production toward the end pregnancy and is maintained throughout lactation. Butyrophilin has been identified in bovine, murine and human (see Taylor et al., Biochim. Biophvs. Acta 25 1306:1-4 (1996), Ishii et al., Biochim. Biophvs. Acta 1245:285-292 (1995), Mather et al., J. Dairy Sci. 76:3832-3850 (1993), Ogg, et al., Mamm. Genome. 7(12):900-905 (1996), Sato, et al., J. Biochem.. 117(1): 147-157 (1995) and Banghart et al., J. Biol. Chem. 273:4171-4179 (1998)) and is a type I transmembrane protein that is incorporated into the fat globulin membrane. It has been suggested that butyrophilin may play a role as the principle scaffold for the assembly of a complex with xanthine dehydrogenase/oxidase and other 30 proteins that function in the budding and release of milk-fat globules from the apical surface during lactation (Banghart et al., supral. Given that butyrophilin plays a role in mammalian milk production, there is substantial interest in identifying novel butyrophilin homologs. Members of the butyrophilin family are further described in Tazi-Ahnini, et al., Immunoeenetics. 47(l):55-63 (1997); Davey, et al., Gene. 199(l-2):57-62 (1997); and Mather and Jack, J. Dairy Sci.. 76(12):3832-3850 (1993).
75. PRQ1461
Proteases are enzymatic proteins which are involved in many biological processes in mammalian and
21
non-mammalian organisms including digestion, protein activation and inactivation, modulation of peptide hormone activity, and alteration of the physical properties of proteins and enzymes. Serine proteases comprise a large class of enzymes that exhibit specific activity toward various serine-containing proteins. Trypsin, which is synthesized by the pancreas and secreted to the small intestine, is a well-characterized serine protease that hydrolyzes peptide bonds of ingested proteins. Trypsin-like proteases have been characterized that are cell-5 surface proteins (see Farlev etal. Biochim Biophvs Acta (1993) 1173(3):350-352: and Levtus et al. Biochemistry (1988) 27(3): 1067-1074). It is believed that some of these trypsin-like proteins may be synthesized as a membrane-bound precursor which matures to a soluble and active protease (Yamaoka et al. J. Biol. Chem (1998) 273(19): 11895-11901).
Because of there importance in metabolism and other enzymatic processes, efforts are being undertaken 10 by both industry and academia to identify new, native serine-like proteases. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel receptor proteins.
76. FRQ1410
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and 15 receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane polypeptide designated herein as PR01410.
77. FRQ1568
The tetraspanin (or tetraspan) family of proteins has grown to include approximately twenty known genes from various species. The tetraspanins are four transmembrane domain membrane-bound molecules which include for example, CD81, CD82, CD9, CD63, CD37 and CD53. Many of these proteins have a flair for promiscuous associations with other molecules, including lineage-specific proteins, integrins, and other transpanins. In terms of function, they are involved in diverse processes such as cell activation and proliferation, 25 adhesion and motility, differentiation and cancer. One study has proposed that these functions may all relate to their ability to act as "molecular facilitators", grouping specific cell-surface proteins and thus increasing the formation and stability of functional signaling complexes. Maecker, et al., FASEB, ll(6):428-42 (1997). Another study concludes that they are responsible for changes in cell morphology, cell-ECM adhesion and cell-signaling. Skubitz, et al., J. Immunology. 157:3617-3626 (1996). Thus, new members of this family are of 30 interest.
78. PRQ1570
Proteases are enzymatic proteins which are involved in many biological processes in mammalian and non-mammalian organisms including digestion, protein activation and inactivation, modulation of peptide 35 hormone activity, and alteration of the physical properties of proteins and enzymes. Serine proteases comprise a large class of enzymes that exhibit specific activity toward various serine-containing proteins. Trypsin, which is synthesized by the pancreas and secreted to the small intestine, is a well-characterized serine protease that
22
hydrolyzes peptide bonds of ingested proteins. Trypsin-like proteases have been characterized that are cell-surface proteins (see Farley etal. Biochim Biophvs Acta (19931 1173(31:350-352: andLeytus etal. Biochemistry (1988) 27(31:1067-10741. It is believed that some of these trypsin-like proteins may be synthesized as a membrane-bound precursor which matures to a soluble and active protease (Y amaoka et al. J. Biol. Chem (1998) 273(191:11895-119011.
Of particular interest are human colon carcinoma derived serine proteases SP59, SP60 and SP67 which may be useful to screen for specific inhibitors or modulators to use in treatment of associated disease states and disorders related to these proteins. In Japanese patent J09149790-A, SP60 is reported to be identified, having accession number P W22986 and 233 amino acids.
79. PRQ1317
Members of the semaphorin family of glycoproteins play important roles in the developing nervous system, and more particularly in axonal guidance. Semaphorins have been identified in the human immune system, where they are believed to play functional roles including B-cell signaling (Hall et al. Proc. Natl. Acad. Sci (1996) 93(211:11780-501. A human semaphorin gene, useful in the diagnosis of nervous system an immune 15 disorders, is disclosed in Japanese Pat. No. J10155490-A, published June 16, 1998. The identification of additional members of the semaphorin family if of interest.
80. PRQ1780
Enzymatic proteins that may be implicated in metabolic diseases or disorders are of particular interest. 20 The enzymatic addition of sugars to fat-soiuble chemicals is an important process that increases their solubility in water and aids in their excretion. In mammals, glucuronic acid is the main sugar that is used to prevent the waste products of metabolism and fat-soluble chemicals from reaching toxic levels in the body. The UDP glucuronosyltransferases that carry out this reaction are part of a super family of UDP glycosy ltransferases found in animals, plants and bacteria. In the liver, UDP-glucuronosyltransferase conjugates bilirubin. There are a 25 number of conditions which affect UDP-glucuronosyltransferase activity resulting in unconjugated hyperbilirubinemia. These conditions include genetic disorders such as Crigler-Najjar Syndrome (see Jurgen et al., Biochem. J. (1996) 314:477-483) and Gilbert syndrome, as well as acquired conditions such as Lucey-Driscoll Syndrome. Accordingly, the identification of novel members of the glucuronosyltransferase family is of interest (see Tukey et al., J. Biol. Chem. (19931 268(20): 15260-6; and W09212987-A).
81. PRQ1486
The cerebellum contains a hexadecapeptide, termed cerebellin, that is conserved in sequence from human to chicken. Three independent, overlapping cDNA clones have been isolated from a human cerebellum cDNA library that encode the cerebellin sequence. The longest clone codes for a protein of 193 amino acids 35 generally termed precerebellin, or a cerebellin precursor. This protein has a significant similarity to the globular region of the B chain of human complement component Clq. The region of relatedness extends approximately over 145 amino acids located in the carboxyl terminus of both proteins. Unlike Clq B chain, no collagen-like
23
motifs are present in the araino-terminal regions of precerebellin. It is believed that cerebellin is not liberated from precerebellin by the classical dibasic amino acid protealytic cleavage mechanism seen in many neuropeptide precursors. The cerebellin precursor has been associated with synaptic physiology. Urade, etal., PNAS. USA. 88(3): 1069-1073 (1991). Cerebellin, its precursor, and related molecules, particularly those having sequence identity with cerebellin, are therefore of interest.
82. PRQ1433
Efforts are being undertaken by both industry and academia to identify new, native transmembrane and receptor proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and 10 characterization of a novel transmembrane polypeptide designated herein as PRO 1433.
83. PRQ1490
Enzymatic proteins play important roles in the chemical reactions involved in the digestion of foods, the biosynthesis of macromolecules, the controlled release and utilization of chemical energy, and other processes 15 necessary to sustain life. Acyltransferases are enzymes which acylate moieties. For example, acyl-glycerol-phosphate acyltransferases can act on lysophosphatidic acid as a substrate. The lysophosphatidic acid is converted to phophatidic acid and thus plays a role in forming phosphatidylethanolamine found in membranes. See, Brown, et al., Plant Mol. Biol.. 26(l):211-223 (1994). Moreover, l-acyl-sn-glycerol-3-phosphate acyltransferase (LPAAT) is an enzymatic protein that shows a preference for medium-chain-length fatty acyl-20 coenzyme A substrates. See. Knutson etal.. Plant Phvsiol. 109:999-1006(199511. Thus, acyltransferases play an important role in the biosynthesis of molecules requiring acylation.
We herein describe the identification and characterization of novel polypeptides having homology to a l-acyl-sn-glycerol-3-phosphate acyltransferase protein, designated herein as PR01490 polypeptides.
84. PRQ1482
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01482.
85. PRQ1446
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel 35 secreted protein designated herein as PR01446.
24
86. PRQ1S58
Methyltransferase enzymes catalyze the transfer of methyl groups from a donor molecule to an acceptor molecule. Methyltransferase enzymes play extremely important roles in a number of different biological processes including, for example, in the electron transport chain in the plasma membrane in prokaryotes and in the inner mitochondrial membrane in eukaiyotic cells (see, e.g., Barkovich et al., J. Biol. Chem. 272:9182-9188 5 (1997), Dibrov et al., J. Biol. Chem. 272:9175-9181 (1997), Lee et al., J. Bacteriol.. 179:1748-1754 (1997) and Marbois et al., Arch. Biochem. Biophvs. 313:83-88 (1994)). Methyltransferase enzymes have been shown to be essential for the biosynthesis of ubiquinone (coenzyme Q) and menaquinone (vitamin K2), both of which are essential isoprenoid quinone components of the respiratory electron transport chain. Given the obvious importance of the methyltransferase enzymes, there is substantial interest in identifying novel polypeptide 10 homologs of the methyltransferases. We herein describe the identification and characterization of a novel polypeptide having homology to methyltransferase enzymes, designated herein as PR01558 polypeptides.
87. PRQ1604
The identification of novel growth factors is of particular interest because of the roles they play in 15 inducing cellular growth, proliferation and differentiation in both normal states and abnormal states. The identification of growth factors that are over- or under-expressed in abnormal tissues (e.g. tumors) may lead to the development of diagnostic tools and therapeutic agents. Growth factors have been isolated from hepatoma-derived cell lines. Hepatoma-derived growth factors have been isolated from mouse (Japanese Pat. No. J09313185-A, published December 9,1997) and human (Japanese Pat. No. J06343470-A, published December 20 20, 1994) tissues. A hepatoma-derived growth factor, isolated from a human hepatoma-derived cell line, has been found to be ubiquitously expressed in several tumor-derived cell lines, as well as in normal tissues (Nakamura et al., J. Biol. Chem (1994) 269(401:25143-91. The growth factor was determined to be a novel heparin-binding protein that is mitogenic for fibroblasts.
88. PRQ1491
The neuronal cell body is usually round like any other cell. However, these cells have structures, also referred to as "processes", which grow from them to form synaptic connections. Some of these processes carry information away from the cell body; sometimes over very long distances. These long and thin processes are axons. The axon is a thin, static tube. Other processes carry information either towards the cell body, or both 30 towards and away from the cell body. These shorter and usually thicker processes are called dendrites. Both axons and dendrites are called neurites.
During development and the growth stage of neurons, neurites are formed by means of growth cones. A growth cone is the growing tip of a neurite. The growth cone is flattened and highly motile. It is where new material is added and further extension of the axon originates. Controlling where the growth cone crawls controls 35 were the axon will be laid down and thus where it will be present.
The growth cone has several definable parts. The thin, flattened, veil-like processes that stick out and retract from the leading edge are called lamellipodia. The needle-like processes that stick out and retract from
the leading edge are called microspikes or filopodia. These are the structures involved in pushing the leading edge of the growth cone forward.
The accurate navigation of growth cones to their appropriate targets requires that they Tecognize and respond to navigational cues in their immediate environment. Some of these cues encourage extension into certain areas whereas others discourage extension into others. Well characterized molecules that encourage neurite 5 outgrowth in vitro include the extracellular matrix molecule laminin and the neuronal cell surface molecule L1/G4/8D9. These molecules which promote neurite extension are generally widely distributed throughout the body. Laminin immunoreactivity is reasonably widespread in the developing central and peripheral nervous systems. Similarly, L1/G4/8D9 is present on a wide variety of neuronal processes in the developing central nervous system, particularly long projecting axons. It is, therefore, unclear whether the known outgrowth 10 promoting molecules play an important role in self-specific choices growth cones make as they decide between possible routes. Instead, their function is believed to provide a generally permissive environment in which growth cones extend and respond to more specific navigational cues.
Among these more specific cues are molecules that inhibit the motility of particular growth cones. Growth cones have been observed to lose their motile morphology and cease advancing (collapse) on contact with 15 other neurites of different types. Territory formation in vitro may mean the manifestation of a process that leads to selective fasciculation in vivo. Some growth cones have been observed to crawl along specific axonal pathways, or stereotype sequences of axonal pathways in developing embryos. Specific motility inhibiting effects could determine which of several alternative pathways a growth cone will extend on. Growth cones would be expected to prefer growing on axons that do not induce them to collapse while shunning those that do. 20 It has been observed that, for example, sympathetic growth cones will be inhibited or collapse when coming in contact with retinal neurites. Likewise, growth cones of retinal neurites will collapse when coming in contact with sympathetic neurites. It is believed that such cell activity is achieved through the presence of receptors which specifically respond to specific growth inhibition cues by the molecules which transmit specific cues pertaining to growth. Cues are believed to be present on cell surfaces, particularly on axon surfaces. 25 When nerve damage occurs, repair is impeded or incapable of occurring due to the failure of neurites to replace damaged axons or dendrites. If an existing neurite is damaged, severed or destroyed, a new neurite is incapable of growing out from the cell body to replace it. The presence of molecules which inhibit neurite growth are believed to be responsible for the difficulty in neurite regeneration. Collapsins are proteins that function to modulate the activity of molecules which modulate growth cone extension. 30 We herein describe the identification and characterization of novel polypeptides having homology to a collapsin protein, designated herein as PRO 1491 polypeptides.
89. PRQ1431
The transduction of intracellular signaling is crucial to cell processing such as differentiation, motility 35 and division. Such signal transduction is believed to occur throughout the cell in the form of complex interactions between proteins. Such protein-protein interactions are often mediated by modular domains within signaling proteins. As a result, signal transduction is now modeled as a system in which molecules act in a
26
combination, and the composition of that combination, determines the signal.
Src homology domains (e.g., SH2 and SH3) are two domains found in regions of sequence similarity of proteins involved in signal transduction. Early work on the oncogenic tyrosine kinase Src identified the SH2 domain. Since then, SH2 and SH3 domains have been found in many diverse proteins, making them among the most common type of structural motif. SH2 and SH3 domains are modular in that they fold independently of 5 the protein that contains them, their secondary structure places N- and C- termini close to one another in space, and they appear at variable locations (anywhere from N- to C-terminal) from one protein ot the next (Cohen et al., Cell 80: 237-348, 1995).
Early studies that mutated the SH2 or SH3 domain showed that these two domains were important for function, but it was not until the cloning of unrelated families of signaling proteins such as RAS-GAP, and the 10 Crk oncogene that the modular nature of these domains was revealed. These latter experiments demonstrated that RAS-GAP and Crk bound tightly to receptor tyrosine kinases upon ligand stimulation. Follow-up studies demonstrated that the mechanism of this binding was through the SH2 domain and that receptor autophosphorylation was required. Such a finding implied that activation of the receptor tyrosine kinase could be viewed as a means of changing the binding aspect of the intracellular domain, and the receptor-SH2 containing 15 protein interaction would initiate the signal transduction cascade.
SH3 domains have a more general function than that which is purported for SH2. SH3 binding proteins have been isolated by screening bacteriophage expression libraries with labeled SH3 domains. The results of these experiments showed that SH3 domains would bind to short proline-rich peptides, in particular the motif PxxP. Based on the level of knowledge present at the time of the preparation of the present patent application, 20 all of the SH3 binding sites identified have the property of being proline rich. Binding of an SH3 domain is independent of covalent modification of the binding site, such as phosphorylation as occurs with the SH2 domain. As a result, SH3-ligand interactions are usually constitutive and not inducible, although exceptions do exist. In general, SH3 domains are less likely to act as signal "switches" than as a means of assembling protein complexes via moderate-affinity interactions. Such moderate affinity interactions also imply that the SH3-mediated 25 interactions will be relatively short in duration and remodeled in response to changes in concentration of binding partners.
The resolution of binding characteristics of SH2 and SH3 domains has led to proposed compounds which would block signal transduction. Peptidomimetic ligands based on the sequence of target proteins for SH2 and SH3 domains may represent new lead compounds for the therapy of proliferative diseases that are dependent 30 upon constitutively activated tyrosine kinases (e.g., BCR/ABL in chronic myelogenous and acute lymphocytic leukemias or HER-2/Neu in breast and ovarian cancer).
90. FRQ1563
Cellular disintegrin and metalloproteinase (ADAMs) are a family of genes with a sequence similar to 35 those of snake venom metalloproteinases and disintegrins. The ADAMTS-1 gene encodes a new type of ADAM protein with respect to possessing the thrombospondin (TSP) type I motifs, the expression of which is associated with the inflammatory process (Kuno et al., J. Biol. Chem. 273:13912-13917 (1998), Kuno et al., Genomics
27
46:466-471 (1997) and Kuno et al., J. Biol. Chem. 272:556-562 (1997)). Expression of the ADAMTS-1 gene is induced in kidney and heart by in vivo administration of lipopolysaccharide, suggesting a ppssible role in the inflammation reaction. In this regard, the ADAMTS-1 protein has been suggested as playing a possible role in various inflammatory processes as well as in the development of cancer cachexia (Kuno et al., 1998, supra). We herein describe the identification and characterization of novel polypeptides having homology to ADAMTS-1 5 protein, designated herein as PR01563 polypeptides.
91. PRQ1565
Chondromodulin proteins are cartilage-generated matrix components that synergisticaily stimulate the growth and differentiation of chondrocytes (Suzuki, Connect. Tissue Res. 35:303-307 (1996)). More 10 specifically, chondromodulin-I functions to inhibit the proliferation of vascular endothelial cells and tube formation, thereby functioning to stimulate cartilage growth and inhibiting replacing cartilage by bone in an early stage. Chondromodulin-II, while not capable of inhibiting vascularization like chondromodulin-1, also functions to stimulate osteoclast differentiation and cartilage growth. As such, these two polypeptides are essential for the regulation of the formation of cartilage and endochondral bone structures. Given the extremely important 15 physiological roles played by the chondromodulin proteins, there is significant interest in identifying and characterizing novel polypeptides having homology to these proteins. We herein describe the identification and characterization of novel polypeptides having homology to cbondromodulin-I protein, designated herein as PR01565 polypeptides.
92. PRQ1571
Clostridium perfringens enterotoxin (CPE) is considered to be the virulence factor responsible for causing the symptoms of C. perfringens type A food poisoning and may also be involved in other human and veterinary illnesses (McClane, Toxicon. 34:1335-1343 (1996)). CPE carries out its adverse cellular functions by binding to an approximately 50 kD cell surface receptor protein designated the Clostridium perfringens 25 enterotoxin receptor (CPE-R) to form an approximately 90,000 kD complex on the surface of the cell. cDNAs encoding the CPE-R protein have been identified characterized in both human and mouse (Katahira et al., J. Cell Biol. 136:1239-1247 (1997) andKatahira etal., J.Biol. Chem. 272:26652-26658(1997)). Since the CPE toxin has been reported to cause a variety of illnesses in mammalian hosts and those illnesses are initiated by binding of the CPE toxin to the CPE-R, there is significant interest in identifying novel CPE-R homologs. We herein 30 describe the identification and characterization of novel polypeptides having homology to the CPE-R, designated herein as PRO 1679 polypeptides.
93. PRQ1572
Clostridium perfringens enterotoxin utilizes two structurally related membrane proteins as functional 35 receptors in vivo. Human and mouse cDNAs showing homology to the Clostridium enterotoxin receptor (CPE-R) gene have previously been cloned as described in Katahira, et al., J. Biol. Chem.. 272(42):26652-8 (1997). They have been classified into two groups, the Vero cell CPE receptor homologues and rat androgen withdrawal
28
apoptosis protein (RVP1). These receptors are thus of interest as are related molecules. Of particular interest is the use of these receptors and related molecules in the identification of modulators of these receptors.
Also of interest are members of the claudin family and molecules related thereto. Claudins are integral membrane proteins localizing at tight junctions with no sequence similarity to occludin. Furuse, et al., J. Cell Biol.. 141(7): 1539-50 (1998).
94. PRQ1573
Clostridium perfringens enterotoxin utilizes two structurally related membrane proteins as functional receptors in vivo. Human and mouse cDNAs showing homology to the Clostridium enterotoxin receptor (CPE-R) gene have previously been cloned as described in Katahira, et al., J. Biol. Chem.. 272(42):26652-8 (1997). 10 They have been classified into two groups, the Vero cell CPE receptor homologues and rat androgen withdrawal apoptosis protein (RVP1). These receptors are thus of interest as are related molecules. Of particular interest is the use of these receptors and related molecules in the identification of modulators of these receptors.
Also of interest is the ventral prostate. 1 protein (RVP.l) which is transcriptionally induced in the regressing rat prostate after castration. This protein is further described in Peacock, et al., Genomics. 15 46(3):443-9 (1997).
95. PRQ1488
Clostridium perfringens enterotoxin utilizes two structurally related membrane proteins as functional receptors in vivo. Human and mouse cDNAs showing homology to the Clostridium enterotoxin receptor 20 (CPE-R) gene have previously been cloned as described in Katahira, et al., J. Biol. Chem.. 272(42):26652-8 (1997), and Katahira, etal., J. Cell Biol.. 136(6): 1239-1247 (1997). They have been classified into two groups, the Vero cell CPE receptor homologues and rat androgen withdrawal apoptosis protein (RVP1). These receptors are thus of interest as are related molecules. Of particular interest is the use of these receptors and related molecules in the identification of modulators of these receptors.
Efforts are being undertaken by both industry and academia to identify new, native receptor proteins.
Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel receptor proteins.
96. PRQ1489
Clostridium perfringens enterotoxin (CPE) is considered to be the virulence factor responsible for causing the symptoms of C. perfringens type A food poisoning and may also be involved in other human and veterinary illnesses (McClane, Toxicon. 34:1335-1343 (1996)). CPE carries out its adverse cellular functions by binding to an approximately 50 kD cell surface receptor protein designated the Clostridium perfringens enterotoxin receptor (CPE-R) to form an approximately 90,000 kD complex on the surface of the cell. cDNAs 35 encodingtheCPE-Rproteinhavebeenidentifiedcharacterizedinbothhumanandmouse(Katahiraetal., J. Cell Biol. 136:1239-1247 (1997) and Katahira etal., J.Biol. Chem. 272:26652-26658(1997)). Since the CPE toxin has been reported to cause a variety of illnesses in mammalian hosts and those illnesses are initiated by binding
29
of the CPE toxin to the CPE-R, there is significant interest in identifying novel CPE-R homologs. We herein describe the identification and characterization of novel polypeptides having homology to the CPE-R, designated herein as PR01489 polypeptides.
97. PRQ1474
Avian egg whites are a rich source of protein inhibitors of proteinases belonging to all four mechanistic classes. Ovomucoid and ovoinhibitor are muitidomain Kazal-type inhibitors with each domain containing an actual or putative reactive site for a serine proteinase. Cystatin is a cysteine proteinase inhibitor, while ovostatin inhibits proteinases of all four mechanistic classes. For a review of these inhibitors, see Saxena and Tayyab, Cell Mol. Life Sci.. 53(1): 13-23 (1997). New members of protein inhibitors of proteinases are of interest,
particularly those having sequence identity with known inhibitors such as ovomucoid.
Serine protease inhibitors in general are of interest. Serine proteases such as neuropsin have been indicated as associated with extracellular matrix modifications and cell migrations. See, generally, Chen, et al., Neurosci.. 7(2):5088-5097 (1995) and Chen, et al., J. Histochem. Cvtochem.. 46:313-320 (1998). Another serine protease, the enamel matrix serine proteinase, is associated with the degradation of organic matrix in teeth.
Simmer, et al., J. Dent. Res.. 77(2):377-386 (1998), Overall and Limeback, Biochem J.. 256(3):965-972 (1988), and Moradian-Oldak, Connect. Tissue Res.. 35(1-41:231-238 (1996'). Thus, inhibitors of these proteases are of interest in the case that these mechanisms require control.
98. PRQ1508
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins.
Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PRO1508.
99. PRQ1555
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane protein designated herein as PRO1555.
100. PRQ1485
Lysozymes are secreted enzymes that preferentially hydrolyze the [beta] -1,4 glucosidic linkages between N-acetylmuramic acid and N-acetylgucosamine which occur in the mucopeptide cell wall structure of certain microoganisms. Lysozyme is of widespread distribution in animals and plants. It has been found in mammalian
secretions and tissues including saliva, tears, milk, cervical mucus, leucocytes, kidneys, etc. The identification of new members of the lysozyme family of proteins is of interest because of the variety of roles lysozymes play in metabolic function and dysfunction. Abnormal levels of lysozymes have been implicated in various disease
states. Lysozymes have been reported to have anti-microbial, analgesic, and antinociceptive properties. Additional characteristics and possible uses of lysozymes are described in U.S. Pat. No. 5,618,712.
Of particular interest is lysozyme C which has been recruited as a digestive enzyme in the stomachs of creatures needing to retrieve nutrients from microorganisms in fermented food. The history of lysozyme C and related proteins are further described in Qasba and Kumar, Crit. Rev. Biochem. Mol. Biol.. 32(4):255-306 5 (1997); Irwin, EXS, 75:347-361 (1996)
101. PRQ1564
Glycosylation is a common and complex form of post-translational protein modification. Although a large and increasing number of unique structures is known to exist, most arise from a series of common synthetic 10 intermediates and differ at their periphery glycosyltransferases, which recognize both the oligosaccharide acceptor and features of the underlying protein. UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase is an enzymatic protein that initiates O-glycosylation of specific serine and threonine amino acids in proteins by adding N-acetylgalactosamine to the hydroxy group of these amino acids. Since numerous important biological and physiological events are reguited by protein glycosylation, there is 15 significant interest in identifying and characterizing novel polypeptides having homology to the known glycosylation proteins. We herein describe the identification and characterization of novel polypeptides having homology to an N-acetylgalactosaminyltransferase protein, designated herein as PR01564 polypeptides.
102. PRQ17S5
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane protein designated herein as PR01755.
103. PRQ1757
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane protein designated herein as PR01757.
104. PRQ17S8
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel 35 secreted protein designated herein as PR01758.
31
105. PRQ1575
Protein Disulfide Isomerase (PDI) enhances formation of disulfide bonds in human serum albumin (HSA). Consequently, PDI assists in the formation of the overall structure of human serum albumin. Co-expression of PDI with human serum albumin increases secretion of HSA by reducing the chance of HSA structural instability and destruction by cellular proteases. Co-expression of PDI and HSA improved localization 5 in the endoplasmic reticulum of eukaryotic cells. (Hayano et al., EP-50941-A (1992)). PDI and the beta-subunit of human prolyl 4-hydroxylase have been shown to be products of the same gene. (Pihlajaniemi et al., EMBO L, 6:643-49 (1987)). In addition, copies of the CGHC-containing active site sequences of PDI have been found in an abundant luminal endoplasmic reticulum protein, Erp72. (Mazzarella et al., J. Biol. Chem.. 2:1094-1101 (1990)).
Efforts are being undertaken by both industry and academia to identify new, native receptor proteins.
Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel receptor proteins.
106. PRQ1787
Multiple de novo MPZ (P0) point mutations have been identified in a sporadic Dejerine-Sottas (DDS)
case. Warner, et al., Hum. Mutat., 10(l):21-4 (1997). DDS is a severe demyelinating peripheral neuropathy with onset in infancy, and has been associated with mutations in either PMP22 or MPZ. Moreover, mutational analysis of the MPZ, PMP22 and Cx32 genes in patients of Spanish ancestry with Charcot-Marie-Tooth disease and hereditary neuropathy with liability to pressure palsies have been reported on. Bort, et al., Hum. Genet.,
99(6):746-54 (1997). Myelin glycoprotein P0 has been reported on in a number of other studies as well (Blanquet-Grossard, et al., Clin. Genet., 48(6):281-3 (1995), Hayasaka, et al., Nat. Genet., 5(l):31-4 (1993) and Saavedra, et al., J. Mol. Evol., 29(2):149-56 (1989). Thus, proteins which may belong to the myelin pO family are of interest.
107. PRQ1781
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane protein designated herein as PR01781.
108. PRQ1556
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization
of a novel transmembrane protein designated herein as PR01556.
32
109. PRQ1759
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane protein designated herein as PRO 1759.
110. PRQ1760
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel 10 secreted protein designated herein as PR01760.
111. PRQ1561
Phospholipase A2 (PLA2) is a protein which hydrolyzes a 2-acyl ester bond of phospholipids, and examples thereof include cytosolic PLA2 and secretory PLA2 which can be clearly distinguished from each 15 other. It has been known that the cytosolic PLA2 (cPLA2) selectively hydrolyzes phospholipids containing arachidonic acid of which 2-position is esterified. Given these important biological activities, there is significant interest in identifying and characterizing novel plypeptides having homology to phospholipase A2 proteins. We herein describe the identification and characterization of novel polypeptides having homology to human phospholipase A2 protein, designated herein as PR01561 polypeptides.
112. PRQ1567
Colon specific genes (CSGs)and their expression products are described in published international application W09639419. They are useful diagnostic markers for colon cancer and for colon cancer metastasis and can also be used to screen for potential pharmaceutical and diagnostic agents. The identification of new 25 members of the CSG family is of interest.
113. PRQ1693
Insulin-like growth factors have both growth-promoting and insulin-like activities. There are two well characterized plasma IGF-binding proteins in human. The larger protein is an acid-labile protein of 53K which 30 circulates mostly as a 125 to 150 kD complex thought to be composed of IGF-I or IGF-II, the binding protein itself and an acid-labile non-IGF-binding protein with an approximate molecular mass of 100K kD. The smaller protein has an apparent molecular mass of 28K in the non-reduced form and 34K when reduced. These IGF-binding proteins have been shown to play important roles in the physiological activities played by the insulin-like growth factor proteins. As such, there is substantial interest in identifying and characterizing novel polypeptides 35 having homology to the insulin-like growth factor binding proteins. We herein describe the identification and characterization of novel polypeptides having homology to an insulin-like growth factor binding protein, designated herein as PR01693 polypeptides.
33
114. PRQ1784
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane protein designated herein as PR01784.
115. PRQ1605
N-acetylglucosaminyltransferase proteins comprise a family of enzymes that provide for a variety of important biological functions in the mammalian organism. As an example, UDP-N-acetylglucosamine: alpha-3-D-mannoside beat-l,2-N-acetyIglucosaminyltransferase I is an enzymatic protein that catalyzes an essential first 10 step in the conversion of high-mannose N-glycans to hybrid and complex N-glycans (Sarkar et al., Proc. Nad. Acad. Sci. USA. 88:234-238 (1991). Given the obvious importance of the N-acetylglucosaminyltransferase enzymes, there is significant interest in the identification and characterization of novel polypeptides having homology to an N-acetylglucosaminyltransferase protein. We herein describe the identification and characterization of novel polypeptides having homology to an N-acetylglucosaminyltransferase protein, 15 designated herein as PRQ1605 polypeptides.
116. PRQ1788
Protein-protein interactions include receptor and antigen complexes and signaling mechanisms. As more is known about the structural and functional mechanisms underlying protein-protein interactions, protein-protein 20 interactions can be more easily manipulated to regulate the particular result of the protein-protein interaction. Thus, the underlying mechanisms of protein-protein interactions are of interest to the scientific and medical community.
Proteins containing leucine-rich repeats are thought to be involved in protein-protein interactions. Leucine-rich repeats are short sequence motifs present in a number of proteins with diverse functions and cellular 25 locations. The crystal structure of ribonuclease inhibitor protein has revealed that leucine-rich repeats correspond to beta-alpha structural units. These units are arranged so that they form a parallel beta-sheet with one surface exposed to solvent, so that the protein acquires an unusual, nonglobular shape. These two features have been indicated as responsible for the protein-binding functions of proteins containing leucine-rich repeats. See, Kobe and Deisenhofer, Trends Biochem. Sci.. 19(10):415-421 (Oct. 1994).
A study has been reported on leucine-rich proteoglycans which serve as tissue organizers, orienting and ordering collagen fibrils during ontogeny and are involved in pathological processes such as wound healing, tissue repair, and tumor stroma formation. Iozzo, R. V., Crit. Rev. Biochem. Mol. Biol.. 32(2):141-174 (1997). Others studies implicating leucine rich proteins in wound healing and tissue repair have been reported including De La Salle, C., et al., Vouv. Rev. Fr. Hematol. (Germany), 37(4):215-222 (1995), reporting 35 mutations in the leucine rich motif in a complex associated with the bleeding disorder Bernard-Soulier syndrome; Chlemetson, K. J., Thromb. Haemost. (Germany), 74(1):111-116 (July 1995), reporting that platelets have leucine rich repeats and Ruoslahti, E. I., et al.; and W09110727-A by La Jolla Cancer Research Foundation,
34
reporting that decorin binding to transforming growth factor-a has involvement in a treatment for cancer, wound healing and scarring. Related by function to this group of proteins is the insulin like growth factor (IGF), in that it is useful in wound-healing and associated therapies concerned with re-growth of tissue, such as connective tissue, skin and bone; in promoting body growth in humans and animals; and in stimulating other growth-related processes. The acid labile subunit of IGF (ALS) is also of interest in that it increases the half-life of IGF and 5 is part of the IGF complex in vivo. Ollendorff, V., et al., Cell Growth Differ. 5(2):213-219 (Feb. 1994) identified the GARP gene which encodes a leucine-rich repeat-containing protein that has structural similarities with human GP lb alpha and GP V platelet proteins, and with the Chaoptin, Toll, and Connectin adhesion molecules of Drosophila.
Another protein which has been reported to have leucine-rich repeats is the SLIT protein which has been 10 reported to be useful in treating neurodegenerative diseases such as Alzheimer's disease, nerve damage such as in Parkinson's disease, and for diagnosis of cancer, see, Artavanistsakonas, S. and Rothberg, J. M., W09210518-A1 by Yale University. Of particular interest is LIG-1, a membrane glycoprotein that is expressed specifically in glial cells in the mouse brain, and has leucine rich repeats and immunoglobulin-like domains. Suzuki, et al., J. Biol. Chem. (U.S.), 27I(37):22522 (1996). Other studies reporting on the biological functions 15 of proteins having leucine rich repeats include: Tayar, N., et al., Mol. Cell Endocrinol.. (Ireland), 125(l-2):65-70 (Dec. 1996) (gonadotropin receptor involvement); Miura, Y., et al., Ninpon Rinsho (Japan), 54(7): 1784-1789 (July 1996) (apoptosis involvement); Harris, P. C., et al., J. Am. Soc. Nephrol.. 6(4): 1125-1133 (Oct. 1995) (kidney disease involvement); and Almeida, A., et al., Oncogene 16(23):2997-3002 (June 1998) (malignant glioma involvement).
117. FRQ1801
lnterleukin-10 (IL-10) is a pleiotropic immunosuppressive cytokine that has been implicated as an important regulator of the functions of myeloid and lymphoid cells. It has been demonstrated that IL-10 functions as a potent inhibitor of the activation of the synthesis of various inflammatory cytokines including, for 25 example, IL-1, IL-6, IFN-y and TNF-cc (Gesser et al., Proc. Natl. Acad. Sci. USA 94:14620-14625 (1997)). Moreover, IL-10 has been demonstrated to strongly inhibit several of the accessory activities of macrophages, thereby functioning as a potent suppressor of the effector functions of macrophages, T-cells and NK cells (Kuhn et al., Cell 75:263-274 (1993)). Furthermore, IL-10 has been strongly implicated in the regulation of B-cell, mast cell and thymocyte differentiation.
IL-10 was independently identified in two separate lines of experiments. First, cDNA clones encoding murine IL-10 were identified based upon the expression of cytokine synthesis inhibitory factor (Moore et al., Science 248:1230-1234 (1990)), wherein the human IL-10 counterpart cDNAs were subsequently identified by cross-hybridization with the murine IL-10 cDNA (Viera et al., Proc. Natl. Acad. Sci. USA 88:1172-1176 (1991)). Additionally, IL-10 was independently identified as a B-cell-derived mediator which functioned to co-35 stimulate active thymocytes (Suda et al., Cell Immunol. 129:228 (1990)).
Recently, a novel cytokine polypeptide which is member of the IL-10-related cytokine family has been identified and characterized. This novel secreted cytokine, designated IL-19, is a 177 amino acid polypeptide
having a molecular weight of approximately 20.4 kD (see WO 98/08870, published March 5,1998). It has been reported that IL-19 is specifically expressed by activated monocytes, wherein increased and/or decreased levels of IL-19 may be associated with one or more physiological disorders that are associated with increased or decreased levels of cytokine production (see WO 98/08870). Specifically, IL-19 is suggested as being capable of inhibiting the synthesis of inflammatory cytokines by cells of the immune system.
Given the obvious importance of the various cytokine polypeptides and, more specifically, immunosuppressive cytokines such as IL-10 and potentially IL-19, there is significant interest in the identification and characterization of novel cytokine polypeptides having homology to IL-10 and/or IL-19. We herein describe the identification and characterization of novel polypeptides having homology to IL-19, designated herein as PR01801 polypeptides.
118. UCP4
Uncoupling proteins or "UCPs", believed to play a role in the metabolic process, have been reported in the literature. UCPs were first found and described in the brown fat cells of hibernating animals, such as bears. UCPs were believed to help such hiberaators and other cold-weather adapted animals maintain core body 15 temperatures in cold weather by raising their body's resting metabolic rate. Because humans possess relatively small quantities of brown adipose tissue, UCPs were originally thought to play a minor role in human metabolism.
Several different human uncoupling proteins have now been described. [See, generally, Gura, Science, 280:1369-1370 (1998)]. The human uncoupling protein referred to as UCP1 was identified by Nicholls et al. 20 Nicholls et al. showed that the inner membrane of brown fat cell mitochondria was very permeable to proteins, and the investigators traced the observed permeability to aprotein, called UCP1, in the mitochondrial membrane. Nicholls et al. reported that the UCP1, by creating such permeability, reduced the number of ATPs that can be made from a food source, thus raising body metabolic rate and generating heat. [Nicholls et al., Physiol. Rev., 64, 1-64 (1984)].
It was later found that UCP1 is indeed expressed only in brown adipose tissue [Bouillaud et al., Proc.
Natl. Acad. Sci., 82:445-448 (1985); Jacobsson et al., J. Biol. Chem., 260:16250-16254 (1985)]. Genetic mapping studies have shown that the human UCP1 gene is located on chromosome 4. [Cassard et al., J. Cell. Biochem., 43:255-264 (1990)].
Another human UCP, referred to as UCPH or UCP2, has also been described. [Gimeno et al., Diabetes, 30 46:900-906 (1997); Fleury et al., Nat. Genet., 15:269-272 (1997); Boss et al., FEBS Letters, 408:39-42 (1997); see also, Wolf, Nutr. Rev. , 55:178-179(1997)]. Fleury etal. teach that the UCP2 protein has 59% amino acid identity to UCP1, and that UCP2 maps to regions of human chromosome 11 which have been linked to hyperinsulinaemia and obesity. [Fleury et al., supra]. It has also been reported that UCP2 is expressed in a variety of adult tissues, such as brain and muscle and fat cells. [Gimeno et al., supra, and Fleury et al., supra]. 35 A third human UCP, UCP3, was recently described in Boss etal., supra; Vidal-Puig et al., Biochem.
Biophys. Res. Comm., 235:79-82 (1997); Solanes et al., J. Biol. Chem., 272:25433-25436 (1997); and Gong et al., J. Biol. Chem., 272:24129-24132 (1997). [See also Great Britain Patent No. 9716886]. Solanes et al.
36
report that unlike UCP1 and UCP2, UCP3 is expressed preferentially in human skeletal muscle, and that the UCP3 gene maps to human chromosome 11, adjacent to the UCP2 gene. [Solanes et al., supra]. Gong et al. describe that the UCP3 expression can be regulated by known thermogenic stimuli, such as thyroid hormone, beta3-andrenergic agonists and leptin. [Gong et al., supra].
119. PRQ193
Efforts are being undertaken by both industry and academia to identify new, native transmembrane proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel transmembrane proteins. We herein describe the identification and characterization of a novel transmembrane protein designated herein as PR0193.
120. PRQ1130
Polypeptides such as the human 2-19 protein may function as cytokines. Cytokines are low molecular weight proteins which function to stimulate or inhibit the differentiation, proliferation or function of immune cells. Cytokine proteins often act as intercellular messengers and have multiple physiological effects. Given 15 the physiological importance of immune mechanisms in vivo, efforts are currently being undertaken to identify new, native proteins which are involved in effecting the immune system. We describe herein the identification of a novel polypeptide which has sequence similarity to the human 2-19 protein.
121. FRQ1335
Carbonic anhydrase is an enzymatic protein that which aids carbon dioxide transport and release in the dioxide and water. Thus, the actions of carbonic anhydrase are essential for a variety of important physiological reactions in the mammal. As such, there is significant interest in the identification and characterization of novel polypeptides having homology to carbonic anhydrase. We herein describe the identification and characterization 25 of novel polypeptides having homology to carbonic anhydrase, designated herein as PR01335 polypeptides.
122. PRQ1329
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding 30 sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01329.
123. PRQ1550
Efforts are being undertaken by both industry and academia to identify new, native secreted proteins. 35 Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. We herein describe the identification and characterization of a novel secreted protein designated herein as PR01550.
37
SUMMARY OF THE INVENTION
In a first aspect, the present invention provides an isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that encodes a polypeptide comprising the amino acid sequence shown in Figure 32 (SEQ ID NO: 54).
In a further aspect, the present invention provides an isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited under accession number ATCC 203287.
In a yet further aspect, the present invention provides a vector comprising a nucleic acid of any one of the previous aspects.
In a still further aspect, the present invention provides a non-human host cell comprising a vector of the previous aspect.
In another aspect, the present invention provides a process for producing a polypeptide comprising culturing a host cell of the above aspect under conditions suitable for expression of said polypeptide and recovering said polypeptide from the cell culture.
In another aspect, the present invention provides an isolated PRO1295 polypeptide having at least 80% sequence identity to the amino acid sequence shown in Figure 32 (SEQ ID NO: 54).
In another aspect the present invention provides an isolated PRO 1295 polypeptide having at least 80% sequence identity to the amino acid sequence encoded by the nucleotide deposited under ATCC 203287.
In another aspect the present invention provides an isolated polypeptide comprising a polypeptide sequence having at least about 80% amino acid sequence identity to the extracellular domain of a PRO1295 polypeptide of SEQ
In another aspect the present invention provides an isolated polypeptide having at least about 80% amino acid sequence identity to a PRO1295 polypeptide of SEQ ID NO: 54 lacking its associated signal peptide.
In another aspect, the present invention provides an isolated nucleic acid encoding a polypeptide of the above aspects.
In another aspect, the present invention provides a chimeric molecule comprising a polypeptide according to any one of the above aspects fused to a heterologous amino acid sequence.
In another aspect, the present invention provides an antibody which specifically binds to a polypeptide
ID NO: 54.
according to any one of the above aspects.
O-ICE OF M.Z
38
17 KAH 2C34
RECEIVED
In another aspect, the present invention provides a pharmaceutical composition comprising a polypeptide of any one of the above aspects and a pharmaceutically acceptable carrier, diluent or excipient.
In another aspect, the present invention provides a use of a polypeptide of any one of the above aspects in the manufacture of a medicament for treating tumor.
The invention relates to the aspects as set out above. Some materials have been divided out of the present application that were previously present and claimed herein and are to be found in New Zealand Patent Specification Nos 531662 to 531668. Not all of the stated aspects and embodiments set out below fall within the scope of the invention as claimed. These divided out materials and other non-claimed materials, are still presented in .the present specification to assist the reader to practice the claimed invention.
2. PRQ444
A cDNA clone (DNA26846-1393) has been identified that encodes a novel secreted polypeptide, designated in the present application as "PR0444,"
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR0444 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR0444 polypeptide having 15 the sequence of amino acid residues from about 1 or about 17 to about 117, inclusive of Figure 4 (SEQ ID NO:6), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR0444 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 656 and about 958, inclusive, of Figure 3 (SEQ ID NO:5). Preferably, hybridization occurs under stringent hybridization 20 and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203406 25 (DNA26846-1397), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203406 (DNA26846-1397).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 30 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 or about 17 to about 117, inclusive of Figure 4 (SEQ ID NO:6), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 10 nucleotides, more preferably at least about 20 nucleotides, and most preferably at least about 40 nucleotides and 35 produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR0444 polypeptide having the sequence of amino acid residues from about 1 or about 17 to about 117, inclusive of Figure 4 (SEQ ID NO:6), or (b) the complement of the DNA molecule of (a), and, if the DNA
40
INT^LLL'uAi PROPERTY
office OF N Z
2 4 OCT 2003
RECEIVED
molecule has at least about an 80% sequence identity, preferably at least about an 83% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR0444 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is 5 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 16 in the sequence of Figure 4 (SEQ ID NO:6).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 17 to about 117, inclusive of Figure 4 (SEQ ID NO:6), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR0444 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR0444 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR0444 polypeptide, which in one 20 embodiment, includes an amino acid sequence comprising residues 1 or about 17 to 117 of Figure 4 (SEQ ID NO:6).
In another aspect, the invention concerns an isolated PR0444 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 25 sequence of amino acid residues 1 or about 17 to about 117, inclusive of Figure 4 (SEQ ID NO:6).
In a further aspect, the invention concerns an isolated PR0444 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 17 to 117 of Figure 4 (SEQ ID NO:6).
In yet another aspect, the invention concerns an isolated PR0444 polypeptide, comprising the sequence of amino acid residues 1 or about 17 to about 117, inclusive of Figure 4 (SEQ ID NO:6), or a fragment thereof sufficient to provide a binding site for an anti-PR0444 antibody. Preferably, the PR0444 fragment retains a qualitative biological activity of a native PR0444 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 35 molecule under stringent conditions with (a) a DNA molecule encoding a PR0444 polypeptide having the sequence of amino acid residues from about 1 or about 17 to about 117, inclusive of Figure 4 (SEQ ID NO:6), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an J
41
PCT/US99/20U1
sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
3. FRQ1018
A cDNA clone (DNA56107-1415) has been identified that encodes a novel transmembrane polypeptide, designated in the present application as "PR01018".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01018 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01018 polypeptide having the sequence of amino acid residues from about 1 or about 25 to about 189, inclusive of Figure 6 (SEQ ID NO:8), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR010I8
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 129 or about 201 and about 695, inclusive, of Figure 5 (SEQ ID NO:7). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 20 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203405 (DNA56107-1415) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 25 ATCC Deposit No. 203405 (DNA56107-1415).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 25 to about 189, inclusive of Figure62 (SEQ ID 30 NO:8), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01018 polypeptide having the sequence of amino acid residues from 1 or about 25 to about 189, inclusive of Figure 6 (SEQ ID NO:8), or (b) the complement of the DNA molecule of (a), and, if 35 the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
42
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01018 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 24 in the sequence of Figure 6 (SEQ ID NO:8). The transmembrane 5 domains have been tentatively identified as extending from about amino acid position 86 to about amino acid position 103 and from about amino acid position 60 to about amino acid position 75 in the PR01018 amino acid sequence (Figure 6, SEQ ID NO:8).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 25 to about 189, inclusive of Figure 6 (SEQ ID NO:8), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01018 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 5 (SEQ ID NO:7).
In another embodiment, the invention provides isolated PR01018 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01018 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 25 to about 189 of Figure 6 (SEQ ID NO:8).
In another aspect, the invention concerns an isolated PRO 1018 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 25 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 25 to about 189, inclusive of Figure 6 (SEQ ID NO:8).
In a further aspect, the invention concerns an isolated PR01018 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 30 of residues 1 or about 25 to about 189, inclusive of Figure 6 (SEQ ID NO:8).
In yet another aspect, the invention concerns an isolated PR01018 polypeptide, comprising the sequence of amino acid residues 1 or about 25 to about 189, inclusive of Figure 6 (SEQ ID NO:8), or a fragment thereof sufficient to provide a binding site for an anti-PR01018 antibody. Preferably, the PR01018 fragment retains a qualitative biological activity of a native PRO 1018 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01018 polypeptide having the sequence of amino acid residues from about 1 or about 25 to about 189, inclusive of Figure 6 (SEQ ID NO:8),
43
or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
4. PRQ1773
A cDNA clone (DNA56406-1704) has been identified, having homology to nucleic acid encoding a retinol dehydrogenase protein that encodes a novel polypeptide, designated in the present application as "PR01773".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1773 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1773 polypeptide having 15 the sequence of amino acid residues from about 1 or about 18 to about 319, inclusive of Figure 8 (SEQ ID NO: 10), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1773 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 111 or about 162 and about 1067, inclusive, of Figure 7 (SEQ ID NO:9). Preferably, hybridization occurs under 20 stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203478 25 (DNA56406-1704) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203478 (DNA56406-1704).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 30 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 18 to about 319, inclusive of Figure 8 (SEQ ID NO: 10), or (b) the complement of the DNA of (a).
In a farther aspect, the invention concerns an isolated nucleic acid molecule having at least 525 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 35 molecule encoding a PR01773 polypeptide having the sequence of amino acid residues from 1 or about 18 to about 319, inclusive of Figure 8 (SEQ ID NO: 10), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, prefereably at least about an 85% sequence
44
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01773 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding 5 nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 17 in the sequence of Figure 8 (SEQ ID NO: 10). The transmembrane domain has been tentatively identified as extending from about amino acid position 136 to about amino acid position 152 in the PR01773 amino acid sequence (Figure 8, SEQ ID N0:10).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 18 to about 319, inclusive of Figure 8 (SEQ ID NO: 10), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1773 polypeptide coding sequence that may find IS use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 7 (SEQ ID NO:9).
In another embodiment, the invention provides isolated PRO1773 polypeptide encoded by any of the 20 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01773 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 18 to about 319 of Figure 8 (SEQ ID NO: 10).
In another aspect, the invention concerns an isolated PR01773 polypeptide, comprising an amino acid 25 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identify to the sequence of amino acid residues 1 or about 18 to about 319, inclusive of Figure 8 (SEQ ID N0:10).
In a further aspect, the invention concerns an isolated PR01773 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least 30 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 18 to about 319, inclusive of Figure 8 (SEQ ID NO: 10).
In yet another aspect, the invention concerns an isolated PR01773 polypeptide, comprising the sequence of amino acid residues 1 or about 18 to about 319, inclusive of Figure 8 (SEQ ID NO: 10), or a fragment thereof sufficient to provide a binding site for an anti-PR01773 antibody. Preferably, the PROI773 fragment retains 35 a qualitative biological activity of a native PR01773 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01773 polypeptide having the
45
sequence of amino acid residues from about 1 or about 18 toabout319, inclusive ofFigure 8 (SEQ ID NO: 10), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 5 recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1773 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01773 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO1773 polypeptide by contacting the native PRO1773 polypeptide with a candidate molecule and 10 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01773 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier.
. PRQ1477
A cDNA clone (DNA56529-1647) has been identified, having homology to nucleic acid encoding a mannosidase protein that encodes a novel polypeptide, designated in the present application as "PROMT?".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01477 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 20 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1477 polypeptide having the sequence of amino acid residues from about 1 to about 699, inclusive of Figure 10 (SEQ ID NO: 12), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01477 25 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 23 and about 2119, inclusive, of Figure 9 (SEQ ID NO: 11). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 30 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203293 (DNA56529-1647) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203293 (DNA56529-1647).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence
46
identity to the sequence of amino acid residues 1 to about 699, inclusive of Figure 10 (SEQ ID NO: 12), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 540 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1477 polypeptide having the sequence of amino acid residues from 1 to about 699, 5 inclusive of Figure 10 (SEQ ID NO: 12), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding
a PR01477 polypeptide, with or without and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane domains have been tentatively identified as extending from about amino acid position 21 to about amino acid position 40 and from about amino acid position 84 to about amino acid position 105 in the PRO 1477 amino acid sequence (Figure 10, SEQ ID NO: 12).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to about 699, inclusive of Figure 10 (SEQ ID NO: 12), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1477 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 9 (SEQ ID NO: 11).
In another embodiment, the invention provides isolated PRO1477 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01477 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 to about 699 of Figure 10 (SEQ ID NO: 12).
In another aspect, the invention concerns an isolated PR01477 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 699, inclusive of Figure 10 (SEQ ID NO: 12).
In a further aspect, the invention concerns an isolated PR01477 polypeptide, comprising an amino acid
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to about 699, inclusive of Figure 10 (SEQ ID NO: 12).
47
In yet another aspect, the invention concerns an isolated PRO1477 polypeptide, comprising the sequence of amino acid residues 1 to about 699, inclusive of Figure 10 (SEQ ID NO: 12), or a fragment thereof sufficient to provide a binding site for an anti-PR01477 antibody. Preferably, the PR01477 fragment retains a qualitative biological activity of a native PR01477 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 5 molecule under stringent conditions with (a) a DNA molecule encoding a PR01477 polypeptide having the sequence of amino acid residues from about 1 to about 699, inclusive of Figure 10 (SEQ ID NO: 12), or (b) the complement of the DNA molecuie of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 10 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01477 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01477 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 15 native PR01477 polypeptide by contacting the native PR01477 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01477 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier.
6. FRQ1478
A cDNA clone (DN A56531-1648) has been identified that encodes a novel polypeptide having sequence identity with galactosyltransferase and designated in the present application as "PR01478."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01478 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1478 polypeptide having the sequence of amino acid residues from about 1 to about 327, inclusive of Figure 12 (SEQ ID NO: 17), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01478
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 77 and about 1057, inclusive, of Figure 11 (SEQ ID NO: 16). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 35 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203286
48
PCTAJS99/20111
(DNA56531-1648), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203286 (DNA56531-1648).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 5 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 327, inclusive of Figure 12 (SEQ ID NO:17), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 10 under stringent conditions with (a) a DNA molecule encoding a PR01478 polypeptide having the sequence of amino acid residues from about 1 to about 327, inclusive of Figure 12 (SEQ ID NO: 17), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 15 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01478 polypeptide in its soluble form, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane domain (type II) has been tentatively identified as extending from about amino acid position 29 through about amino acid position 49 in the PR01478 amino acid sequence (Figure 12, SEQ ID NO: 17). Therefore, a peptide including amino acids 20 50-327, with or without amino acids 1-28, is specifically embodied herein, as well as the nucleic acid encoding such a peptide.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 25 amino acid sequence of residues 1 to about 327, inclusive of Figure 12 (SEQ ID NO: 17), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01478 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 30 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01478 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1478 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 through 327 of Figure 12 (SEQ ID NO: 17). 35 In another aspect, the invention concerns an isolated PR01478 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the
49
sequence of amino acid residues 1 to about 327, inclusive of Figure 12 (SEQ ID NO: 17).
In a further aspect, the invention concerns an isolated PRO 1478 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 through 327 of Figure 12 (SEQ ID NO: 17).
In yet another aspect, the invention concerns an isolated PRO1478 polypeptide, comprising the sequence of amino acid residues 1 to about 327, inclusive of Figure 12 (SEQ ID NO: 17), or a fragment thereof sufficient to provide a binding site for an anti-PR01478 antibody. Preferably, the PR01478 fragment retains a qualitative biological activity of a native PR01478 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 10 molecule under stringent conditions with (a) a DNA molecule encoding a PR01478 polypeptide having the sequence of amino acid residues from about 1 to about 327, inclusive of Figure 12 (SEQ ID NO: 17), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 15 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1478 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01478 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 20 native PRO1478 polypeptide, by contacting the native PRO1478 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1478 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier.
7. PRQ831
A cDNA clone (DNA56862-1343) has been identified that encodes a novel secreted polypeptide, designated in the present application as "PR0831".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR0831 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR0831 polypeptide having the sequence of amino acid residues from about 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR0831
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 40 or about 85 and about 258, inclusive, of Figure 13 (SEQ ID NO:21). Preferably, hybridization occurs under
50
stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203174 5 (DNA56862-1343) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203174 (DNA56862-1343).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 10 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucieic acid molecule having at least 470 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 15 molecule encoding a PR0831 polypeptide having the sequence of amino acid residues from 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR0831 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 15 in the sequence of Figure 14 (SEQ ID NO:22).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR0831 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 13 (SEQ ID NO:21).
In another embodiment, the invention provides isolated PR0831 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR0831 polypeptide, which in
51
certain embodiments, includes an amino acid sequence comprising residues 1 or about 16 to about 73 of Figure 14 (SEQ ID NO:22).
In another aspect, the invention concerns an isolated PR0831 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 5 sequence of amino acid residues 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22).
In a further aspect, the invention concerns an isolated PR0831 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22).
In yet another aspect, the invention concerns an isolated PR0831 polypeptide, comprising the sequence of amino acid residues 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22), or a fragment thereof sufficient to provide a binding site for an anti-PR0831 antibody. Preferably, the PR0831 fragment retains a qualitative biological activity of a native PR0831 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 15 molecule under stringent conditions with (a) a DNA molecule encoding a PR0831 polypeptide having the sequence of amino acid residues from about 1 or about 16 to about 73, inclusive of Figure 14 (SEQ ID NO:22), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell 20 comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
8. PRQ1113
A cDNA clone (DNA57254-1477) has been identified that encodes a novel polypeptide having sequence 25 identity with leucine rich repeat proteins and designated in the present application as "PR01113."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01113 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 30 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PROl 113 polypeptide having the sequence of amino acid residues from about 1 to about 616, inclusive of Figure 16 (SEQ ID NO:24), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 113 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 214 and 35 about 2061, inclusive, of Figure 15 (SEQ ID NO:23). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
52
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203289 (DNA57254-1477), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 5 Deposit No. 203289 (DNA57254-1477).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 616, inclusive of Figure 16 (SEQ ID 10 NO:24), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 113 polypeptide having the sequence of amino acid residues from about 1 to about 616, inclusive of Figure 16 (SEQ ID NO:24), or (b) the complement 15 of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PROl 113 polypeptide in its soluble, i.e. transmembrane domain deleted or inactivated variants, or is 20 complementary to such encoding nucleic acid molecule. The transmembrane domain has been tentatively identified as extending from about amino acid position 13 through about amino acid position 40 in the PROl 113 amino acid sequence (Figure 16, SEQ ID NO:24). Thus, also presented herein is a peptide comprising amino acids 41-616, and optionally 1-12 of SEQ ID NO:24, and the nucleic acids encoding the same.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues I to about 616, inclusive of Figure 16 (SEQ ID NO:24), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PROl 113 polypeptide coding sequence that may find 30 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PROl 113 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PROl 113 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 through 616 of Figure 16 (SEQ ID NO:24).
In another aspect, the invention concerns an isolated PROl 113 polypeptide, comprising an amino acid
53
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 616, inclusive of Figure 16 (SEQ ID NO:24).
In a further aspect, the invention concerns an isolated PROl 113 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 5 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 through 616 of Figure 16 (SEQ ID NO:24).
In yet another aspect, the invention concerns an isolated PROl 113 polypeptide, comprising the sequence of amino acid residues 1 to about 616, inclusive of Figure 16 (SEQ ID NO:24), or a fragment thereof sufficient to provide a binding site for an anti-PROl 113 antibody. Preferably, the PROl 113 fragment retains a qualitative 10 biological activity of a native PROl 113 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 113 polypeptide having the sequence of amino acid residues from about 1 to about 616, inclusive of Figure 16 (SEQ ID NO:24), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 15 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PROl 113 20 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 113 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PROl 113 polypeptide, by contacting the native PR01113 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PROl 113 polypeptide, 25 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
9. PROl 194
A cDNA clone (DNA57841-1522) has been identified that encodes a novel secreted polypeptide designated in the present application as "PROl 194."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PROl 194 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PROl 194 polypeptide having 35 the sequence of amino acid residues from 1 or about 22 to about 81, inclusive of Figure 18 (SEQ ID NO:29), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 194
54
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 72 and about 251, inclusive, of Figure 17 (SEQ ID NO:28). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 5 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203458 (DNA57841-1522), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203458 (DNA57841-1522).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 22 to about 81, inclusive of Figure 18 (SEQ ID NO:29), or the complement of the DNA of (a).
IS In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 194 polypeptide having the sequence of amino acid residues from about 22 to about 81, inclusive of Figure 18 (SEQ ID NO:29), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably 20 at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 25 amino acid sequence of residues 22 to about 81, inclusive of Figure 18 (SEQ ID NO:29), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PROl 194 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 30 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1194 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PROl 194 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 22 through 81 of Figure 18 (SEQ ID NO:29). 35 In another aspect, the invention concerns an isolated PROl 194 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the
55
sequence of amino acid residues 22 to about 81, inclusive of Figure 18 (SEQ ID NO:29).
In a further aspect, the invention concerns an isolated PROl 194 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 22 through 81 of Figure 18 (SEQ ID NO:29).
In yet another aspect, the invention concerns an isolated PRO 1194 polypeptide, comprising the sequence of amino acid residues 22 to about 81, inclusive of Figure 18 (SEQ ID NO:29), or a fragment thereof sufficient to provide a binding site for an anti-PROl 194 antibody. Preferably, the PROl 194 fragment retains a qualitative biological activity of a native PROl 194 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 10 molecule under stringent conditions with (a) a DNA molecule encoding a PROl 194 polypeptide having the sequence of amino acid residues from about 22 to about 81, inclusive of Figure 18 (SEQ ID NO:29), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 15 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PROl 194 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 194 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 20 native PRO 1194 polypeptide, by contacting the native PRO 1194 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PROl 194 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
10. PROmO
A cDNA clone (DNA58727-1474) has been identified, having homology to nucleic acid encoding myeloid upregulated protein that encodes a novel polypeptide, designated in the present application as "PROlllO".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 30 a PROl 110 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PROl 110 polypeptide having the sequence of amino acid residues from about 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31), or (b) 35 the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROlllO polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 131
56
and about 1096, inclusive, of Figure 19 (SEQ ID N0:30). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 5 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203171 (DNA58727-1474) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203171 (DNA58727-1474).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 15 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 110 polypeptide having the sequence of amino acid residues from 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) 20 or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PROlllO polypeptide,with or without the initiating methionine and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane domains have been tentatively identified as extending from about amino acid position 41 to about amino acid 25 position 60, from about amino acid position 66 to about amino acid position 85, from about amino acid position 101 to about amino acid position 120, from about amino acid position 137 to about amino acid position 153, from about amino acid position 171 to about amino acid position 192, from about amino acid position 205 to about amino acid position 226, from about amino acid position 235 to about amino acid position 255, and from about amino acid position 294 to about amino acid position 312 in the PROl 110 amino acid sequence (Figure 30 20, SEQ ID NO-.31).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31), or (b) the complement 35 of the DNA of (a).
Another embodiment is directed to fragments of a PROl 110 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length,
57
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 19 (SEQ ID N0:30).
In another embodiment, the invention provides isolated PROlllO polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PROlllO polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 to about 322 of Figure 20 (SEQ ID NO:31).
In another aspect, the invention concerns an isolated PROlllO polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 10 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31).
In a further aspect, the invention concerns an isolated PRO 1110 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence 15 of residues 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31).
In yet another aspect, the invention concerns an isolated PROl 110 polypeptide, comprising the sequence of amino acid residues 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31), or a fragment thereof sufficient to provide a binding site for an anti-PROl 110 antibody. Preferably, the PROl 110 fragment retains a qualitative biological activity of a native PROlllO polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PROlllO polypeptide having the sequence of amino acid residues from about 1 to about 322, inclusive of Figure 20 (SEQ ID NO:31), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 25 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PROlllO polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 110 antibody. 30 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PROlllO polypeptide by contacting the native PROlllO polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PROl 110 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
11. PRQ1378
A cDNA clone (DNA58730-1607) has been identified that encodes a novel secreted polypeptide
58
designated in the present application as "PRO1378".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01378 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 5 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01378 polypeptide having the sequence of amino acid residues from 1 or about 16 to about 335, inclusive of Figure 22 (SEQ ID NO:33), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1378 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 1365 10 and about 2369, inclusive, of Figure 21 (SEQ ID NO:32). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 15 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203221 (DNA58730-1607), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203221 (DNA58730-1607).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 20 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from 1 or about 16 to about 335, inclusive of Figure 22 (SEQ ID NO:33), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 20 25 nucleotides, preferably at least about 50 nucleotides, and more preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1378 polypeptide having the sequence of amino acid residues from about 16 to about 335, inclusive of Figure 22 (SEQ ID NO:33), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at 30 least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01378 polypeptide, with or without the N-terminal signal sequence, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 35 1 through about amino acid position 15 in the sequence of Figure 22 (SEQ ID NO:33).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more
59
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 16 to about 335, inclusive of Figure 22 (SEQ ID NO:33), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1378 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 5 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01378 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01378 polypeptide, which in one 10 embodiment, includes an amino acid sequence comprising residues 16 to 335 of Figure 22 (SEQ ID NO:33).
In another aspect, the invention concerns an isolated PR01378 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 16 to about 335, inclusive of Figure 22 (SEQ ID NO:33). 15 In a further aspect, the invention concerns an isolated PR01378 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 16 to 335 of Figure 22 (SEQ ID NO:33).
In yet another aspect, the invention concerns an isolated PR01378 polypeptide, comprising the sequence 20 of amino acid residues 16 to about 335, inclusive of Figure 22 (SEQ ID NO:33), or a fragment thereof sufficient to provide a binding site for an anti-PR01378 antibody. Preferably, the PR01378 fragment retains a qualitative biological activity of a native PR01378 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1378 polypeptide having the 25 sequence of amino acid residues from about 16 to about 335, inclusive of Figure 22 (SEQ ID NO:33), or (b) the complement of the DNA molecuie of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 30 polypeptide from the cell culture.
12. PRQ1481
A cDNA clone (DNA58732-1650) has been identified that encodes a novel polypeptide designated in the present application as "PRO 1481."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1481 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
60
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1481 polypeptide having the sequence of amino acid residues from 1 or about 24 to about 334, inclusive of Figure 24 (SEQ ID NO:41), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1481 5 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 88 and about 1321, inclusive, of Figure 23 (SEQ ID N0:40). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 10 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203290 (DNA58732-1650), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203290 (DNA58732-1650).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 24 to about 334, inclusive of Figure 24 (SEQ ID NO:41), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01481 polypeptide having the sequence of amino acid residues from about 24 to about 334, inclusive of Figure 24 (SEQ ID NO:41), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably 25 at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01481 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted, truncated or inactivated variants, or is complementary to such 30 encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 23 in the sequence of Figure 24 (SEQ ID NO:41). The transmembrane domain has been tentatively identified as extending from about amino acid position 235 through about amino acid position 262 in the PR01481 amino acid sequence (Figure 24, SEQ ID NO:41).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 35 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 24 to about 334, inclusive of Figure 24 (SEQ ID NO:41), or (b) the complement
61
of the DNA of (a).
Another embodiment is directed to fragments of a PR01481 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01481 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1481 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 24 through 334 of Figure 24 (SEQ ID NO:41).
In another aspect, the invention concerns an isolated PR01481 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 24 to about 334, inclusive of Figure 24 (SEQ ID NO:41).
In a further aspect, the invention concerns an isolated PRO 1481 polypeptide, comprising an amino acid 15 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 24 through 334 of Figure 24 (SEQ ID NO:41).
In yet another aspect, the invention concerns an isolated PRO 1481 polypeptide, comprising the sequence of amino acid residues 24 to about 334, inclusive of Figure 24 (SEQ ID NO:41), or a fragment thereof sufficient 20 to provide a binding site for an anti-PR01481 antibody. Preferably, the PR01481 fragment retains a qualitative biological activity of a native PR01481 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01481 polypeptide having the sequence of amino acid residues from about 24 to about 334, inclusive of Figure 24 (SEQ ID NO:41), or (b) 25 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01481
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01481 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO 1481 polypeptide, by contacting the native PRO 1481 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01481 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
62
13. PROl 189
A cDN A clone (DN A58828-1519) has been identified that encodes a novel polypeptidehaving homology to E25 which is designated in the present application as "PROl 189."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1189 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 189 polypeptide having the sequence of amino acid residues from about 1 to about 263, inclusive of Figure 26 (SEQ ID NO:43), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 189
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 79 and about 867, inclusive, of Figure 25 (SEQ ID NO:42). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 15 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203172 (DNA58828-1519), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 20 Deposit No. 203172 (DNA58828-1519).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 263, inclusive of Figure 26 (SEQ ID 25 NO:43), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 189 polypeptide having the sequence of amino acid residues from about 1 to about 263, inclusive of Figure 26 (SEQ ID NO:43), or (b) the complement 30 of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1189 polypeptide with its transmembrane domain deleted or inactivated, or is complementary to such 35 encoding nucleic acid molecule. The transmembrane domain has been tentatively identified as extending from about amino acid position 53 through about amino acid position 75 in the PROl 189 amino acid sequence (Figure 26, SEQ ID NO:43).
63
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 263, inclusive of Figure 26 (SEQ ID NO:43), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PROl 189 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1189 polypeptide encoded by any of the 10 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PROl 189 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 263 of Figure 26 (SEQ ID NO:43).
In another aspect, the invention concerns an isolated PROl 189 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 15 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 263, inclusive of Figure 26 (SEQ ID NO:43).
In a further aspect, the invention concerns an isolated PROl 189 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 20 of residues 1 to 263 of Figure 26 (SEQ ID NO:43).
In yet another aspect, the invention concerns an isolated PROl 189 polypeptide, comprising the sequence of amino acid residues 1 to about 263, inclusive of Figure 26 (SEQ ID NO:43), or a fragment thereof sufficient to provide a binding site for an anti-PRO 1189 antibody. Preferably, the PRO 1189 fragment retains a qualitative biological activity of a native PRO 1189 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1189 polypeptide having the sequence of amino acid residues from about 1 to about 263, inclusive of Figure 26 (SEQ ID NO:43), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 30 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PROl 189 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 189 antibody. 35 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO 1189 polypeptide, by contacting the native PROl 189 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
64
In a still further embodiment, the invention concerns a composition comprising a PRO 1189 polypeptide, or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceuticaliy acceptable carrier.
14. PRQ1415
A cDNA clone (DNA58852-1637) has been identified that encodes a novel polypeptide, designated in 5 the present application as "PR01415".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01415 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 10 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1415 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID N0:50), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01415 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 148 15 or about 223 and about 996, inclusive, of Figure 27 (SEQ ID NO:49). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 20 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203271 (DNA58852-1637) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203271 (DNA58852-1637).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID N0:50), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 30 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1415 polypeptide having the sequence of amino acid residues from 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID N0:50), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 35 identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01415 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and
65
its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 25 in the sequence of Figure 28 (SEQ ID N0:50). The transmembrane domain has been tentatively identified as extending from about amino acid position 94 to about amino acid position 118 in the PR01415 amino acid sequence (Figure 28, SEQ ID N0:50).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID N0:50), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1415 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 27 (SEQ ID NO:49).
In another embodiment, the invention provides isolated PR01415 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01415 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 26 to about 283 of Figure 28 (SEQ ID N0:50).
In another aspect, the invention concerns an isolated PR01415 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID N0:50).
In a further aspect, the invention concerns an isolated PR01415 polypeptide, comprising an amino acid 25 sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID N0:50).
In yet another aspect, the invention concerns an isolated PR01415 polypeptide, comprising the sequence of amino acid residues 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID N0:50), or a fragment 30 thereof sufficient to provide a binding site for an anti-PR01415 antibody. Preferably, the PR01415 fragment retains a qualitative biological activity of a native PR01415 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1415 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 283, inclusive of Figure 28 (SEQ ID 35 N0:50), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell
66
comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01415 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01415 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 5 native PR01415 polypeptide by contacting the native PR01415 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1415 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
IS. PRQ1411
A cDNA clone (DNA59212-1627) has been identified that encodes a novel secreted polypeptide designated in the present application as "PR01411."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01411 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1411 polypeptide having the sequence of amino acid residues from 1 or about 22 to about 440, inclusive of Figure 30 (SEQ ID NO:52), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01411
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 247 and about 1503, inclusive, of Figure 29 (SEQ ID NO:51). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 25 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203245 (DNA59212-1627), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 30 Deposit No. 203245 (DNA592I2-1627).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 22 to about 440, inclusive of Figure 30 (SEQ ID 35 NO:52), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule
67
PCT/US99/201I1
under stringent conditions with (a) a DNA molecule encoding a PR01411 polypeptide having the sequence of amino acid residues from about 22 to about 440, inclusive of Figure 30 (SEQ ID NO:52), or (b) the complement of the DNA molecule of (a), and, if die DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 22 to about 440, inclusive of Figure 30 (SEQ ID NO:52), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01411 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01411 polypeptide encoded by any of the 15 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01411 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 22 through 440 of Figure 30 (SEQ ID NO:52).
In another aspect, the invention concerns an isolated PR01411 polypeptide, comprising an amino acid 20 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 22 to about 440, inclusive of Figure 30 (SEQ ID NO:52).
In a further aspect, the invention concerns an isolated PROI411 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 25 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 22 through 440 of Figure 30 (SEQ ID NO:52).
In yet another aspect, the invention concerns an isolated PRO 1411 polypeptide, comprising the sequence of amino acid residues 22 to about 440, inclusive of Figure 30 (SEQ ID NO:52), or a fragment thereof sufficient to provide a binding site for an anti-PR01411 antibody. Preferably, the PR01411 fragment retains a qualitative 30 biological activity of a native PR01411 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01411 polypeptide having the sequence of amino acid residues from about 22 to about 440, inclusive of Figure 30 (SEQ ID NO:52), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 35 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host ceil comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the
68
polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01411 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01411 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01411 polypeptide, by contacting the native PR01411 polypeptide with a candidate molecule and 5 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1411 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
16. PROI295
A cDNA clone (DNA59218-1559) has been identified that encodes a novel secreted polypeptide designated in the present application as "PRO1295."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1295 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 15 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1295 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 280, inclusive of Figure 32 (SEQ ID NO:54), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01295 20 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 261 and about 1046, inclusive, of Figure 31 (SEQ ID NO:53). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 25 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203287 (DNA59218-1559), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203287 (DNA59218-1559).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 19 to about 280, inclusive of Figure 32 (SEQ ID NO:54), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1295 polypeptide having the sequence of
69
amino acid residues from about 19 to about 280, inclusive of Figure 32 (SEQ ID NO:54), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 5 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with die amino acid sequence of residues 19 to about 280, inclusive of Figure 32 (SEQ ID NO:54), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1295 polypeptide coding sequence that may find 10 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1295 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1295 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 19 through 280 of Figure 32 (SEQ ID NO:54).
In another aspect, the invention concerns an isolated PRO 1295 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 20 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 280, inclusive of Figure 32 (SEQ ID NO:54).
In a further aspect, the invention concerns an isolated PR01295 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 25 of residues 19 through 280 of Figure 32 (SEQ ID NO:54).
In yet another aspect, the invention concerns an isolated PRO1295 polypeptide, comprising the sequence of amino acid residues 19 to about 280, inclusive of Figure 32 (SEQ ID NO:54), or a fragment thereof sufficient to provide a binding site for an anti-PR01295 antibody. Preferably, the PRO1295 fragment retains a qualitative biological activity of a native PRO1295 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1295 polypeptide having the sequence of amino acid residues from about 19 to about 280, inclusive of Figure 32 (SEQ ID NO:54), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 35 identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
70
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01295 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01295 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01295 polypeptide, by contacting the native PR01295 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the inventionconeernsacompositioncomprisingaPR01295polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
17. PRQ1359
A cDNA clone (DNA59219-1613) has been identified that encodes a novel polypeptide having sequence 10 identity with sialytransferases and designated in the present application as "PROI359" polypeptides.
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01359 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 15 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1359 polypeptide having the sequence of amino acid residues from 1 or about 32 to about 299, inclusive of Figure 34 (SEQ ID NO:56), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01359 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 277 and 20 about 1080, inclusive, of Figure 33 (SEQ ID NO:55). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 25 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203220 (DNA59219-1613), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203220 (DNA59219-1613).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 30 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 32 to about 299, inclusive of Figure 34 (SEQ ID NO:56), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 35 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1359 polypeptide having the sequence of amino acid residues from about 32 to about 299, inclusive of Figure 34 (SEQ ID NO:56), or (b) the complement
71
of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01359 polypeptide in its soluble, i.e. transmembrane domain deleted or inactivated variants, or is 5 complementary to such encoding nucleic acid molecule. The transmembrane domain (type II) has been tentatively identified as extending from about amino acid position 9 through about amino acid position 31 in the PR01359 amino acid sequence (Figure 34, SEQ ID NO:56).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 32 to about 299, inclusive of Figure 34 (SEQ ID NO:56), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01359 polypeptide coding sequence that may fmd use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01359 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1359 polypeptide, which in one 20 embodiment, includes an amino acid sequence comprising residues 32 through 299 of Figure 34 (SEQ ID NO:56).
In another aspect, the invention concerns an isolated PR01359 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 25 sequence of amino acid residues 32 to about 299, inclusive of Figure 34 (SEQ ID NO:56).
In a further aspect, the invention concerns an isolated PR01359 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 32 through 299 of Figure 34 (SEQ ID NO:56).
In yet another aspect, the invention concerns an isolated PR01359 polypeptide, comprising the sequence of amino acid residues 32 to about 299, inclusive of Figure 34 (SEQ ID NO:56), or a fragment thereof sufficient to provide a binding site for an anti-PR01359 antibody. Preferably, the PR01359 fragment retains a qualitative biological activity of a native PR01359 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 35 molecule under stringent conditions with (a) a DNA molecule encoding a PRO1359 polypeptide having the sequence of amino acid residues from about 32 to about 299, inclusive of Figure 34 (SEQ ID NO:56), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence
72
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the ceil culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01359 5 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01359 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01359 polypeptide, by contacting the native PR01359 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1359 polypeptide, 10 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
18. PROl 190
A\:DNA clone (DNA59586-1520) has been identified that encodes a novel polypeptide designated in the present application as "PROl 190", and which has homology to CDO protein.
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PROl 190 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identiiy, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PROl 190 polypeptide having 20 the sequence of amino acid residues from about 1 to about 1115, inclusive of Figure 36 (SEQ ID NO:58), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 190 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 340 and about 3684, inclusive, of Figure 35 (SEQ ID NO:58). Preferably, hybridization occurs under stringent 25 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203288 30 (DNA59586-1520), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203288 (DNA59586-152Q).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 35 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 1115, inclusive of Figure 36 (SEQ ID NO:58), or the complement of the DNA of (a).
73
WO 00/12708 PCT/US99/20111
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 190 polypeptide having the sequence of amino acid residues from about 1 to about 1115, inclusive of Figure 36 (SEQIDNO:58), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably 5 at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PROl 190 polypeptide, with one or more of its transmembrane domains deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane domains have been tentatively 10 identified in the PROl 190 amino acid sequence shown in Figure 36 (SEQ ID NO:58) as extending from about amino acid position 16 to about amino acid position 30 and from about amino acid position 854 to about amino acid position 879.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 15 preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to about 1115, inclusive of Figure 36 (SEQ ID NO:58), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1190 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 20 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
. In another embodiment, the invention provides isolated PROl 190 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove definied.
In a specific aspect, the invention provides isolated native sequence PRO 1190 polypeptide, which in one 25 embodiment, includes an amino acid sequence comprising residues 1 to 1115 of Figure 36 (SEQ ID NO:58).
In another aspect, the invention concerns an isolated PRO 1190 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 1115, inclusive of Figure 36 (SEQ ID NO:58). 30 In a further aspect, the invention concerns an isolated PROl 190 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to 1115 of Figure 36 (SEQ ID NO:58).
In yet another aspect, the invention concerns an isolated PRO 1190 polypeptide, comprising the sequence 35 of amino acid residues 1 to about 1115, inclusive of Figure 36 (SEQ ID NO:58), or a fragment thereof sufficient to provide a binding site for an anti-PROl 190 antibody. Preferably, the PROl 190 fragment retains a qualitative biological activity of a native PROl 190 polypeptide.
74
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 190 polypeptide having the sequence of amino acid residues from about 1 to about 1115, inclusive of Figure 36 (SEQ ID NO:58), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 5 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1190 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 190 antibody. 10 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PROl 190 polypeptide, by contacting the native PROl 190 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PROl 190 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
19. PRQ1772
A cDNA clone (DNA59817-1703) has been identified, having homology to nucleic acid encoding peptidase enzymes, that encodes a novel polypeptide, designated in the present application as "PR01772".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 20 a PRO1772 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01772 polypeptide having the sequence of amino acid residues from about 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID 25 NO:63), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01772 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 93 or about 201 and about 1553, inclusive, of Figure 37 (SEQ ID NO:62). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203470 (DNA59817-1703) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 35 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203470 (DNA59817-1703).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
75
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID NO:63), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 415 5 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1772 polypeptide having the sequence of amino acid residues from 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID NO:63), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence 10 identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1772 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid 15 position 1 to about amino acid position 36 in the sequence of Figure 38 (SEQ ID NO:63). The transmembrane domain has been tentatively identified as extending from about amino acid position 313 to about amino acid position 331 in the PR01772 amino acid sequence (Figure 38, SEQ ID NO:63).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 20 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID NO:63), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1772 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 25 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 37 (SEQ ID NO:62).
In another embodiment, the invention provides isolated PRO1772 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01772 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 37 to about 487 of Figure 38 (SEQ ID NO:63).
In another aspect, the invention concerns an isolated PRO1772 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 35 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID NO:63).
In a further aspect, the invention concerns an isolated PR01772 polypeptide, comprising an amino acid
76
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID NO:63).
In yet another aspect, the invention concerns an isolated PRO1772 polypeptide, comprising the sequence of amino acid residues 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID NO.-63), or a fragment 5 thereof sufficient to provide a binding site for an anti-PR01772 antibody. Preferably, the PROI772 fragment retains a qualitative biological activity of a native PR01772 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROI772 polypeptide having the sequence of amino acid residues from about 1 or about 37 to about 487, inclusive of Figure 38 (SEQ ID 10 NO:63), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PROI772
polypeptide. In a particular embodiment, the agonist or antagonist is an anti~PROI772 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01772 polypeptide by contacting the native PR01772 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01772 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
. PRQ1248
A cDNA clone (DNA60278-1530) has been identified, having homology to nucleic acid encoding PUT-25 2, that encodes a novel polypeptide, designated in the present application as "PR01248".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01248 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 30 preferably at least about 95 % sequence identity to (a) a DNA molecuie encoding a PRO1248 polypeptide having the sequence of amino acid residues from about 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID NO:68), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1248 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 122 35 or about 182 and about 670, inclusive, of Figure 39 (SEQ ID NO:67). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
77
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203170 (DNA60278-1530) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 5 ATCC Deposit No. 203170 (DNA60278-1530).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID 10 NO:68), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01248 polypeptide having the sequence of amino acid residues from 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID NO:68), or (b) the complement of the DNA molecule of (a), and, 15 if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01248 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 20 its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 20 in the sequence of Figure 40 (SEQ ID NO:68). The transmembrane domain has been tentatively identified as extending from about amino acid position 90 to about amino acid position 112 in the PR01248 amino acid sequence (Figure 40, SEQ ID NO:68).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID NO:68), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01248 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 39 (SEQ ID NO:67).
In another embodiment, the invention provides isolated PRO1248 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01248 polypeptide, which in
78
PCT/US99/20U1
certain embodiments, includes an amino acid sequence comprising residues 1 or about 21 to about 183 of Figure 40 (SEQ ID NO:68).
In another aspect, the invention concerns an isolated PRO 1248 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 5 sequence of amino acid residues 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID NO:68).
In a further aspect, the invention concerns an isolated PRO1248 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID NO:68).
In yet another aspect, the invention concerns an isolated PR01248 polypeptide, comprising the sequence of amino acid residues 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID NO:68), or a fragment thereof sufficient to provide a binding site for an anti-PR01248 antibody. Preferably, the PR01248 fragment retains a qualitative biological activity of a native PR01248 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 15 molecule under stringent conditions with (a) a DNA molecule encoding a PRO1248 polypeptide having the sequence of amino acid residues from about 1 or about 21 to about 183, inclusive of Figure 40 (SEQ ID N0:68), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell 20 comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1248 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01248 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 25 native PR01248 polypeptide by contacting the native PR01248 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01248 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
21. PRQ1316
A cDNA clone (DNA60608-1577) has been identified, having homology to Dickkopf that encodes a novel polypeptide, designated in the present application as "PR01316."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01316 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85 % sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01316 polypeptide having
79
the sequence of amino acid residues from 1 or about 26 to about 259, inclusive of Figure 42 (SEQ ID N0:70), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01316 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 281 and about 987, inclusive, of Figure 41 (SEQ ID NO:69). Preferably, hybridization occurs under stringent 5 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203126 10 (DNA60608-1577), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203126 (DNA60608-1577).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 15 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 26 to about 259, inclusive of Figure 42 (SEQ ID N0:70), or the complement of the DNA of (a).
In a further aspect, the invention concern an isolated nucleic acid molecule having at least 15 nucleotides which hybridizes under stringent conditions with (a) a DNA molecule having a identity with a region spanning 20 either from residues 1-454 or from residues 1095-3130 of the Figure 41 (SEQ ID NO:69), or (b) the complement of the DNA molecule of (a). Alternatively, an isolated nucleic acid molecuie having at least 15 nucleotides having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, and most preferably at least about 95% sequence identity to: (a) a DNA molecule having a identity with a region spanning either from residues 1-454 or from residues 1095-3130 of the 25 Figure 41 (SEQ ID NO:69), or (b) the complement of the DNA molecule of (a).
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PROD 16 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 30 1 to about amino acid position 25 in the sequence of Figure 42 (SEQ ID N0:70). An N-glycosylation site has been identified at position 52 and a fungal Zn(2)-Cys(6) binuciear cluster has been identified at position 99.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 35 amino acid sequence of residues 26 to about 259, inclusive ofFigure 42 (SEQ ID N0:70), or (b) the complement of the DNA of (a).
In another embodiment, the invention provides isolated PR01316 polypeptide encoded by any of the
80
isolated nucleic acid sequences herein above defined.
In a specific aspect, the invention provides isolated native sequence PRO 1316 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 26 to 259 of Figure 42 (SEQ ID N0:70).
In another aspect, the invention concerns an isolated PR01316 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 5 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 26 to about 259, inclusive of Figure 42 (SEQ ID N0:70).
In a further aspect, the invention concerns an isolated PR01316 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 10 of residues 26 to 259 of Figure 42 (SEQ ID N0:70).
In yet another aspect, the invention concerns an isolated PRO 1316 polypeptide, comprising the sequence of amino acid residues 26 to about 259, inclusive of Figure 42 (SEQ ID N0:70), or a fragment thereof sufficient to provide a binding site for an anti-PR01316 antibody. Preferably, the PR01316 fragment retains a qualitative biological activity of a native PRO 1316 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01316 polypeptide having the sequence of amino acid residues from about 26 to about 259, inclusive of Figure 42 (SEQ ID N0:70), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 20 identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of the a native PR01316 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01316 antibody. 25 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01316 polypeptide, by contacting the native PR01316 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1316 polypeptide, or an agonist or antagonist as herein above defined, in combination with a pharmaceuticaliy acceptable carrier.
22. PRQ1197
A cDNA clone (DNA60611-1524) has been identified that encodes a novel secreted polypeptide designated in the present application as "PROl 197."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 35 a PROl 197 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most
81
preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1197 polypeptide having the sequence of amino acid residues from 1 or about 25 to about 363, inclusive of Figure 44 (SEQ ID NO:72), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 197 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 383 and 5 about 1399, inclusive, of Figure 43 (SEQ ID NO:71). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 10 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203175 (DNA60611-1524), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203175 (DNA60611-1524).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 15 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 25 to about 363, inclusive of Figure 44 (SEQ ID NO:72), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 20 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 197 polypeptide having the sequence of amino acid residues from about 25 to about 363, inclusive of Figure 44 (SEQ ID NO:72), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably 25 at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 25 to about 363, inclusive of Figure 44 (SEQ ID NO-.72), or (b) the complement 30 of the DNA of (a).
Another embodiment is directed to fragments of a PROl 197 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 35 In another embodiment, the invention provides isolated PROl 197 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PROl 197 polypeptide, which in one
82
PCT/US99/20U1
embodiment, includes an amino acid sequence comprising residues 25 through 363 of Figure 44 (SEQ ID NO:72).
In another aspect, the invention concerns an isolated PROl 197 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 5 sequence of amino acid residues 25 to about 363, inclusive of Figure 44 (SEQ ID NO:72).
In a further aspect, the invention concerns an isolated PROl 197 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 25 through 363 of Figure 44 (SEQ ID NO:72).
In yet another aspect, the invention concerns an isolated PROl 197 polypeptide, comprising the sequence of amino acid residues 25 to about 363, inclusive of Figure 44 (SEQ ID NO:72), or a fragment thereof sufficient to provide a binding site for an anti-PROl 197 antibody. Preferably, the PROl 197 fragment retains a qualitative biological activity of a native PROl 197 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 15 molecule under stringent conditions with (a) a DNA molecule encoding a PROl 197 polypeptide having the sequence of amino acid residues from about 25 to about 363, inclusive of Figure 44 (SEQ ID NO:72), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 20 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PROl 197 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 197 antibody.
23. PR01293
A cDNA clone (DNA60618-1557) has been identified, having homology to nucleic acid encoding an immunoglobulin heavy chain variable region protein that encodes a novel polypeptide, designated in the present application as "PR01293".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 30 a PR01293 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1293 polypeptide having the sequence of amino acid residues from about 1 or about 20 to about 341, inclusive of Figure 46 (SEQ ID 35 NO:77), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1293 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 37
83
or about 94 and about 1059, inclusive, of Figure 45 (SEQ ID NO:76). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 5 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203292 (DNA60618-1557) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203292 (DNA60618-1557).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 20 to about 341, inclusive of Figure 46 (SEQ ID NO:77), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 15 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1293 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 341, inclusive of Figure 46 (SEQ ID NO:77), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 20 identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01293 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid 25 position 1 to about amino acid position 19 in the sequence of Figure 46 (SEQ ID NO:77). The transmembrane domain has been tentatively identified as extending from about amino acid position 237 to about amino acid position 262 in the PR01293 amino acid sequence (Figure 46, SEQ ID NO:77).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 30 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 20 to about 341, inclusive of Figure 46 (SEQ ID NO:77), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1293 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 35 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 45 (SEQ ID NO:76).
84
In another embodiment, the invention provides isolated PRO 1293 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01293 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 20 to about 341 of Figure 46 (SEQ ID NO:77).
In another aspect, the invention concerns an isolated PRO1293 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 20 to about 341, inclusive of Figure 46 (SEQ ID NO:77).
In a further aspect, the invention concerns an isolated PR01293 polypeptide, comprising an amino acid 10 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 20 to about 341, inclusive of Figure 46 (SEQ ID NO:77).
In yet another aspect, the invention concerns an isolated PR01293 polypeptide, comprising the sequence of amino acid residues 1 or about 20 to about 341, inclusive of Figure 46 (SEQ ID NO:77), or a fragment 15 thereof sufficient to provide a binding site for an anti-PR01293 antibody. Preferably, the PRO1293 fragment retains a qualitative biological activity of a native PRO1293 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01293 polypeptide having the sequence of amino acid residues from about i or about 20 to about 341, inclusive of Figure 46 (SEQ ID 20 N0.77), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01293
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01293 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO 1293 polypeptide by contacting the native PR01293 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1293 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
24. PRQ1380
A cDNA clone (DNA60740-16I5) has been identified that encodes a novel multi-span transmembrane 35 polypeptide designated in the present application as "PR01380".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1380 polypeptide.
85
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01380 polypeptide having the sequence of amino acid residues from about 1 to about 470, inclusive of Figure 48 (SEQ ID NO:79), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1380
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 36 and about 1460, inclusive, of Figure 47 (SEQ ID NO:78). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 10 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203456 (DNA60740-1615), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 15 Deposit No. 203456 (DNA60740-1615).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 470, inclusive of Figure 48 (SEQ ID 20 NO:79), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01380 polypeptide having the sequence of amino acid residues from about 1 to about 470, inclusive of Figure 48 (SEQ ID N0:79), or (b) the complement 25 of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01380 polypeptide, and its soluble variants (i.e. one or more transmembrane domains deleted or 30 inactivated), or is complementary to such encoding nucleic acid molecule. Transmembrane domains have been tentatively identified at about the following amino acid positions: 50-74, 105-127,135-153, 163-183,228-252, 305-330, and 448-472 in the PR01380 amino acid sequence (Figure 48, SEQ ID NO:79).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 35 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 470, inclusive of Figure 48 (SEQ ID NO:79), or (b) the complement of the DNA of (a).
86
Another embodiment is directed to fragments of a PRO1380 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01380 polypeptide encoded by any of the 5 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1380 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 470 of Figure 48 {SEQ ID NO:79).
In another aspect, the invention concerns an isolated PR01380 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 10 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 470, inclusive of Figure 48 (SEQ ID NO:79).
Ina further aspect, the invention concerns an isolated PR01380 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 15 of residues 1 to 470 of Figure 48 (SEQ ID NO:79).
In yet another aspect, the invention concerns an isolated PR01380 polypeptide, comprising the sequence of amino acid residues 1 to about 470, inclusive of Figure 48 (SEQ ID NO:79), or a fragment thereof sufficient to provide a binding site for an anti-PR01380 antibody. Preferably, the PR01380 fragment retains a qualitative biological activity of a native PR01380 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1380 polypeptide having the sequence of amino acid residues from about 1 to about 470, inclusive of Figure 48 (SEQ ID NO:79), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 25 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
. PRQ1265
AcDNAclone (DNA60764-1533) has been identified that encodes a novel polypeptide having homology to the Figl polypeptide and designated in the present application as "PR01265."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01265 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 35 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01265 polypeptide having the sequence of amino acid residues from 1 or about about 22 to about 567, inclusive of Figure 50 (SEQ ID
87
NO:84), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl265 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 142 and about 1779, inclusive, of Figure 49 (SEQ ID NO:83). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203452 (DNA60764-1533), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 10 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203452 (DNA60764-1533).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 15 identity to the sequence of amino acid residues from about 22 to about 567, inclusive of Figure 50 (SEQ ID NO:84), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01265 polypeptide having the sequence of 20 amino acid residues from about 22 to about 567, inclusive of Figure 50 (SEQ ID NO:84), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 25 a PR01265 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 21 in the sequence of Figure 50 (SEQ ID NO: 84).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 30 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 22 to about 567, inclusive of Figure 50 (SEQ ID NO:84), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01265 polypeptide coding sequence that may find 35 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
88
In another embodiment, the invention provides isolated PRO1265 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01265 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 or about 22 to 567 of Figure 50 (SEQ ID NO:84).
In another aspect, the invention concerns an isolated PR01265 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 22 to about 567, inclusive of Figure 50 (SEQ ID NO:84).
In a further aspect, the invention concerns an isolated PR01265 polypeptide, comprising an amino acid 10 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 22 to 567 of Figure 50 (SEQ ID NO:84).
In yet another aspect, the invention concerns an isolated PRO1265 polypeptide, comprising the sequence of amino acid residues 22 to about 567, inclusive of Figure 50 (SEQ ID NO:84), or a fragment thereof sufficient 15 to provide a binding site for ananti-PR01265 antibody. Preferably, the PR01265 fragment retains a qualitative biological activity of a native PRO1265 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01265 polypeptide having the sequence of amino acid residues from about 22 to about 567, inclusive of Figure 50 (SEQ ID NO:84), or (b) 20 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
26. PRQ1250
A cDNA clone (DNA60775-1532) has been identified, having homology to nucleic acid encoding long chain fatty acid CoA ligase that encodes a novel polypeptide, designated in the present application as "PRO1250".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01250 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01250 polypeptide having 35 the sequence of amino acid residues from about 1 to about 739, inclusive of Figure 52 (SEQ ID N0:86), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1250
89
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 74 and about 2290, inclusive, of Figure 51 (SEQ ID NO:85). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 5 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203173 (DNA60775-1532) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203173 (DNA60775-1532).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 739, inclusive of Figure 52 (SEQ ID NO:86), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1250 polypeptide having the sequence of amino acid residues from 1 to about 739, inclusive of Figure 52 (SEQ ID NO:86), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85 % sequence identity, more 20 preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01250 polypeptide, with or without the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The type II 25 transmembrane domain has been tentatively identified as extending from about amino acid position 61 to about amino acid position 80 in the PRO1250 amino acid sequence (Figure 52, SEQ ID NO: 86).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 30 amino acid sequence of residues 1 to about 739, inclusive of Figure 52 (SEQ ID NO:86), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01250 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 35 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 51 (SEQ ID NO:85).
In another embodiment, the invention provides isolated PR01250 polypeptide encoded by any of the
90
isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01250 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 to about 739 of Figure 52 (SEQ ID N0:86).
In another aspect, the invention concerns an isolated PR01250 polypeptide, comprising an amino acid 5 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 739, inclusive of Figure 52 (SEQ ID NO:86).
In a further aspect, the invention concerns an isolated PRO1250 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 10 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to about 739, inclusive of Figure 52 (SEQ ID NO:86).
In yet another aspect, the invention concerns an isolated PRO1250 polypeptide, comprising the sequence of amino acid residues 1 to about 739, inclusive of Figure 52 (SEQ ID NO:86), or a fragment thereof sufficient to provide a binding site for an anti-PR01250 antibody. Preferably, the PR01250 fragment retains a qualitative 15 biological activity of a native PR01250 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (l) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01250 polypeptide having the sequence of amino acid residues from about 1 to about 739, inclusive of Figure 52 (SEQ ID NO:86), or (b) the complement of the DNA molecuie of (a), and if the test DNA molecule has at least about an 80% sequence 20 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01250 25 polypeptide. In a particular embodiment, the agonist or antagonist is an anli-PRO 1250 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01250 polypeptide by contacting the native PR01250 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1250 polypeptide, 30 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
27. PRQ1475
A cDNA clone (DNA61185-1646) has been identified, having homology to nucleic acid encoding an N-acety lglucosaminy ltransferase that encodes a novel polypeptide, designated in the present application as 35 "PR01475".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01475 polypeptide.
91
PCT/US99/201U
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecuie encoding a PRO1475 polypeptide having the sequence of amino acid residues from about 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01475
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 130 and about 2109, inclusive, of Figure 53 (SEQ ID NO:87). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 10 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203464 (DNA61185-1646) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 15 ATCC Deposit No. 203464 (DNA61I85-1646).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88), or (b) 20 the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 180 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01475 polypeptide having the sequence of amino acid residues from 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88), or (b) the complement of the DNA molecule of (a), and, if the DNA 25 molecule has at least about an 80% sequence identity, prefereably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1475 polypeptide, with or without the initiating methionine, and its soluble, i.e., transmembrane domain 30 deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane domain has been tentatively identified as extending from about amino acid position 38 to about amino acid position 55 in the PR01475 amino acid sequence (Figure 54, SEQ ID NO:88).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 35 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88), or (b) the complement of the DNA of (a).
92
PCT/US99/201I1
Another embodiment is directed to fragments of a PRO1475 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 53 (SEQ ID NO:87).
In another embodiment, the invention provides isolated PRO1475 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PRO 1475 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 to about 660 of Figure 54 (SEQ ID NO:88).
In another aspect, the invention concerns an isolated PR01475 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88).
In a further aspect, the invention concerns an isolated PR01475 polypeptide, comprising an amino acid 15 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88).
In yet another aspect, the invention concerns an isolated PR01475 polypeptide, comprising the sequence of amino acid residues 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88), or a fragment thereof sufficient 20 to provide a binding site for an anti-PR01475 antibody. Preferably, the PRO 1475 fragment retains a qualitative biological activity of a native PR01475 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1475 polypeptide having the sequence of amino acid residues from about 1 to about 660, inclusive of Figure 54 (SEQ ID NO:88), or (b) the 25 complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01475
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01475 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01475 polypeptide by contacting the native PR01475 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01475 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
93
28. PROl 377
A cDNA clone (DNA61608-1606) has been identified that encodes a novel multi-span transmembrane polypeptide designated in the present application as "PR01377."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01377 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01377 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 307, inclusive of Figure 56 (SEQ ID NO:95), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1377
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 203 and about 1069, inclusive, of Figure 55 (SEQ ID NO:94). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 15 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203239 (DNA61608-1606), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 20 Deposit No. 203239 (DNA61608-1606).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 19 to about 307, inclusive of Figure 56 (SEQ ID 25 NO:95), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1377 polypeptide having the sequence of amino acid residues from about 19 to about 307, inclusive of Figure 56 (SEQ ID NO:95), or (b) the complement 30 of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01377 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 35 one or more of its transmembrane domains deleted or inactivated, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 18 in the sequence of Figure 56 (SEQ ID N0:95). Transmembrane domain
94
has been tentatively identified as extending from about amino acid positions 37-56, 106-122, 211-20, 240-260, and 288-304 in the PR01377 amino acid sequence (Figure 56, SEQ ID NO:95).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 5 amino acid sequence of residues 19 to about 307, inclusive of Figure 56 (SEQ ID NO:95), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01377 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 10 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1377 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01377 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 19 to 307 of Figure 56 (SEQ ID NO:95). 15 In another aspect, the invention concerns an isolated PR01377 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 307, inclusive of Figure 56 (SEQ ID NO:95).
In a further aspect, the invention concerns an isolated PR01377 polypeptide, comprising an amino acid 20 sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 19 to 307 of Figure 56 (SEQ ID NO:95).
In yet another aspect, the invention concerns an isolated PRO1377 polypeptide, comprising the sequence of amino acid residues 19 to about 307, inclusive of Figure 56 (SEQ ID NO:95), or a fragment thereof sufficient 25 to provide a binding site for an anti-PR01377 antibody. Preferably, the PR01377 fragment retains a qualitative biological activity of a native PR01377 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01377 polypeptide having the sequence of amino acid residues from about 19 to about 307, inclusive of Figure 56 (SEQ ID N0:95), or (b) 30 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
29. PRQ1326
A cDNA clone (DNA62808-I582) has been identified that encodes a novel secreted polypeptide
95
designated in the present application as "PR01326."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1326 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 5 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01326 polypeptide having the sequence of amino acid residues from 1 or about 30 to about 401, inclusive of Figure 58 (SEQ ID N0:100), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecuie encoding a PR01326 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 199 and 10 about 1314, inclusive, of Figure 57 (SEQ ID NO:99). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 15 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203358 (DNA62808-1582), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203358 (DNA62808-1582).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 20 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 30 to about 401, inclusive of Figure 58 (SEQ ID NO: 100), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 25 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1326 polypeptide having the sequence of amino acid residues from about 30 to about 401, inclusive of Figure 58 (SEQ ID NO: 100), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90 % sequence identity, most 30 preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01326 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 29 in the sequence of Figure 58 (SEQ 35 ID NO: 100).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more
96
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 30 to about 401, inclusive of Figure 58 (SEQ ID N0:100), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1326 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 5 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01326 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01326 polypeptide, which in one 10 embodiment, includes an amino acid sequence comprising residues 30 to 401 of Figure 58 (SEQ ID NO: 100).
In another aspect, the invention concerns an isolated PR01326 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 30 to about 401, inclusive of Figure 58 (SEQ ID N0:100).
IS In a further aspect, the invention concerns an isolated PR01326 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 30 to 401 of Figure 58 (SEQ ID NO: 100).
In yet another aspect, the invention concerns an isolated PR01326polypeptide, comprising the sequence 20 of amino acid residues 30 to about 401, inclusive of Figure 58 (SEQ ID NO: 100), or a fragment thereof sufficient to provide a binding site for an anti-PR01326 antibody. Preferably, the PR01326 fragment retains a qualitative biological activity of a native PR01326 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01326 polypeptide having the 25 sequence of amino acid residues from about 30 to about 401, inclusive of Figure 58 (SEQ ID NO: 100), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 30 polypeptide from the cell culture.
. PRQ1249
A cDNA clone (DNA62809-1531) has been identified that encodes a novel transmembrane polypeptide, designated in the present application as "PRO1249".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01249 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
97
PCMJS99/20U1
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01249 polypeptide having the sequence of amino acid residues from about 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01249 5 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 3 or about 51 and about 3269, inclusive, of Figure 59 (SEQ ID NO: 101). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 10 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203237 (DNA62809-1531) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203237 (DNA62809-1531).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01249 polypeptide having the sequence of amino acid residues from 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 25 identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01249 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding 30 nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 16 in the sequence of Figure 60 (SEQ ID N0:102). The transmembrane domains have been tentatively identified as extending from about amino acid position 317 to about amino acid position 341, from about amino acid position 451 to about amino acid position 470, from about amino acid position 481 to about amino acid position 500, from about amino acid position 510 to about amino acid position 35 527, from about amino acid position 538 to about amino acid position 555, from about amino acid position 831 to about amino acid position 850, from about amino acid position 1016 to about amino acid position 1034 and from about amino acid position 1052 to about amino acid position 1070 in the PR01249 amino acid sequence
98
(Figure 60, SEQ ID NO: 102).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102), or (b) 5 the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01249 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 10 from the nucleotide sequence shown in Figure 59 (SEQ ID NO: 101).
In another embodiment, the invention provides isolated PRO1249 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01249 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 17to about 1089 of Figure 15 60 (SEQ ID NO: 102).
In another aspect, the invention concerns an isolated PR01249 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102). 20 In a further aspect, the invention concerns an isolated PR01249 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102).
In yet another aspect, the invention concerns an isolated PRO1249 polypeptide, comprising the sequence 25 of amino acid residues 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102), or a fragment thereof sufficient to provide a binding site for an anti-PR01249 antibody. Preferably, the PR01249 fragment retains a qualitative biological activity of a native PR01249 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01249 polypeptide having the 30 sequence of amino acid residues from about 1 or about 17 to about 1089, inclusive of Figure 60 (SEQ ID NO: 102), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 35 recovering the polypeptide from the cell culture.
99
31. PRQ1315
A cDNA clone (DNA62815-1576) has been identified, having homology to nucleic acid encoding cytokine receptor family-4 proteins that encodes a novel polypeptide, designated in the present application as "PRO 1315".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PRODIS polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1315 polypeptide having the sequence of amino acid residues from about 1 or about 29 to about 442, inclusive of Figure 62 (SEQ ID 10 NO: 104), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRODIS polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 121 or about 205 and about 1446, inclusive, of Figure 61 (SEQ ID NO: 103). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203247 (DNA62815-1576) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 20 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203247 (DNA62815-1576).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 25 identity to the sequence of amino acid residues 1 or about 29 to about 442, inclusive of Figure 62 (SEQ ID N0:104), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 500 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRODIS polypeptide having the sequence of amino acid residues from 1 ot about 29 to 30 about 442, inclusive of Figure 62 (SEQ ID NO: 104), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 35 a PR01315 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid
100
position 1 to about amino acid position 28 in the sequence of Figure 62 (SEQ ID NO: 104). The transmembrane domain has been tentatively identified as extending from about amino acid position 140 to about amino acid position 163 in the PR01315 amino acid sequence (Figure 62, SEQ ID NO: 104).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 5 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 29 to about 442, inclusive of Figure 62 (SEQ ID NO: 104), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01315 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 10 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 61 (SEQ ID NO: 103).
In another embodiment, the invention provides isolated PROD 15 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01315 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 29 to about 442 of Figure 62 (SEQ ID NO: 104).
In another aspect, the invention concerns an isolated PROD 15 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 20 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 29 to about 442, inclusive of Figure 62 (SEQ ID NO: 104).
In a further aspect, the invention concerns an isolated PRO 1315 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 25 of residues 1 or about 29 to about 442, inclusive of Figure 62 (SEQ ID NO: 104).
In yet another aspect, the invention concerns an isolated PRO 1315 polypeptide, comprising the sequence of amino acid residues 1 or about 29 to about 442, inclusive of Figure 62 (SEQ ID NO: 104), or a fragment thereof sufficient to provide a binding site for an anti-PR01315 antibody. Preferably, the PRO 1315 fragment retains a qualitative biological activity of a native PR01315 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PROD 15 polypeptide having the sequence of amino acid residues from about 1 or about 29 to about 442, inclusive of Figure 62 (SEQ ID NO: 104), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 35 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
101
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01315 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01315 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01315 polypeptide by contacting the native PR01315 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01315 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
32. PRO 1599
A cDNA clone (DNA62845-1684)has been identified that encodes a novel polypeptide having homology 10 to Granzyme M and designated in the present application as "PRO1599."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01599 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 15 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1599 polypeptide having the sequence of amino acid residues from 1 or about 31 to about 283, inclusive of Figure 64 (SEQ ID NO: 111), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1599 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 159 and 20 about 917, inclusive, of Figure 63 (SEQ ID NO: 110). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 25 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203361 (DNA62845-1684), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203361 (DNA62845-1684).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 30 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 31 to about 283, inclusive of Figure 64 (SEQ ID NO:111), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 35 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01599 polypeptide having the sequence of amino acid residues from about 31 to about 283, inclusive of Figure 64 (SEQ ID NO:111), or (b) the
102
complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01599 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is 5 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 30 in the sequence of Figure 64 (SEQ ID NO: 111).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at ieast about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 31 to about 283, inclusive of Figure 64 (SEQ ID NO:111), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01599 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PROI599 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01599 polypeptide, which in one 20 embodiment, includes an amino acid sequence comprising residues 31 to 283 of Figure 64 (SEQ ID NO:111).
In another aspect, the invention concerns an isolated PR01599 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 31 to about 283, inclusive of Figure 64 (SEQ ID NO: 111). 25 In a further aspect, the invention concerns an isolated PRO 1599 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 31 to 283 of Figure 64 (SEQ ID NO:lll).
In yet another aspect, the invention concerns an isolated PR01599 polypeptide, comprising the sequence 30 of amino acid residues 31 to about 283, inclusive of Figure 64 (SEQ ID NO:111), or a fragment thereof sufficient to provide a binding site for an anti-PR01599 antibody. Preferably, the PR01599 fragment retains a qualitative biological activity of a native PRO1599 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01599 polypeptide having the 35 sequence of amino acid residues from about 31 to about 283, inclusive of Figure 64 (SEQ ID NO: 111), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence
103
identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1599 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01599 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01599 polypeptide, by contacting the native PR01599 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01599 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
33. PRQ1430
A cDNA clone (DNA64842-1632) has been identified that encodes a novel polypeptide having homology to reductase proteins, designated in the present application as "PR01430."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 15 a PR01430 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1430 polypeptide having the sequence of amino acid residues from 1 or about 18to about 331, inclusive of Figure 66 (SEQ ID NO:116), 20 or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1430 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 33 and about 1074, inclusive, of Figure 65 (SEQ ID NO: 115). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecuie comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203278 (DNA64842-1632), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 30 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203278 (DNA64842-1632).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 35 identity to the sequence of amino acid residues from about 18 to about 331, inclusive of Figure 66 (SEQ ID NO: 116), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
104
nucleotides, and preferably at least about 100 and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01430 polypeptide having the sequence of amino acid residues from about 18 to about 331, inclusive of Figure 66 (SEQ ID NO: 116), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least 5 about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01430 polypeptide, with or without the N-terminal signal sequence, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 17 in the sequence of Figure 66 (SEQ ID NO: 116). 10 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 18 to about 331, inclusive of Figure 66 (SEQ ID NO:116), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1430 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1430 polypeptide encoded by any of the 20 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1430 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 18 to 331 of Figure 66 (SEQ ID NO:116).
In another aspect, the invention concerns an isolated PRO 1430 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 25 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 18 to about 331, inclusive of Figure 66 (SEQ ID NO:116).
In a further aspect, the invention concerns an isolated PRO 1430 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 30 of residues 18 to 331 of Figure 66 (SEQ ID NO:116).
In yet another aspect, the invention concerns an isolated PR01430polypeptide, comprising the sequence of amino acid residues 18 to about 331, inclusive of Figure 66 (SEQ ID NO:116), or a fragment thereof sufficient to provide a binding site for an anti-PR01430 antibody. Preferably, the PR01430 fragment retains a qualitative biological activity of a native PRO1430 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01430 polypeptide having the sequence of amino acid residues from about 18 to about 331, inclusive of Figure 66 (SEQ ID NO:116), or (b)
105
PCT7US99/20111
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1430
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl430 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01430 polypeptide, by contacting the native PR01430 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01430polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
34. PRQ1374
A cDNA clone (DNA64849-1604) has been identified that encodes a novel polypeptide having sequence 15 identity with P4HA and designated in the present application as "PR01374."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1374 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 20 preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01374 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 544, inclusive of Figure 68 (SEQ ID NO:118), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01374 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 78 and 25 about 1652, inclusive, of Figure 67 (SEQ ID NO: 117). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 % sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 30 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203468 (DNA64849-1604), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203468 (DNA64849-1604).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 35 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 544, inclusive of Figure 68 (SEQ ID
106
NO: 118), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1374 polypeptide having the sequence of amino acid residues from about 20 to about 544, inclusive of Figure 68 (SEQ ID NO: 118), or (b) the 5 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 to about 544, inclusive of Figure 68 (SEQ ID NO: 118), or (b) the complement of the DNA of (a).
Another embodimem is directed to fragments of a PRO 1374 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01374 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01374 polypeptide, which in one 20 embodiment, includes an amino acid sequence comprising residues 20 through 544 of Figure 68 (SEQ ID NO: 118).
In another aspect, the invention concerns an isolated PR01374 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 25 sequence of amino acid residues 20 to about 544, inclusive of Figure 68 (SEQ ID NO: 118).
In a further aspect, the invention concerns an isolated PR01374 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 20 through 544 of Figure 68 (SEQ ID NO:l 18).
In yet another aspect, the invention concerns an isolated PR01374 polypeptide, comprising the sequence of amino acid residues 20 to about 544, inclusive of Figure 68 (SEQ ID NO: 118), or a fragment thereof sufficient to provide a binding site for an anti-PR01374 antibody. Preferably, the PR01374 fragment retains a qualitative biological activity of a native PR01374 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 35 molecule under stringent conditions with (a) a DNA molecule encoding a PRO1374 polypeptide having the sequence of amino acid residues from about 20 to about 544, inclusive of Figure 68 (SEQ ID NO:118), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence
107
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or <b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1374 5 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01374 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01374 polypeptide, by contacting the native PR01374 polypeptide with a candidate molecuie and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1374 polypeptide, 10 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
. PRQ1311
A cDNA clone (DNA64863-1573) has been identified that encodes a novel tetraspan polypeptide designated in the present application as "PR01311".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1311 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01311 polypeptide having 20 the sequence of amino acid residues from 1 or about 45 to about 294, inclusive of Figure 70 (SEQ ID NO: 123), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROD 11 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 327 and about 1076, inclusive, of Figure 69 (SEQ ID NO: 122). Preferably, hybridization occurs under stringent 25 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203251 30 (DNA64863-1573), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203251 (DNA64863-1573).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 35 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 45 to about 294, inclusive of Figure 70 (SEQ ID NO: 123), or the complement of the DNA of (a).
108
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROI311 polypeptide having the sequence of amino acid residues from about 45 to about 294, inclusive of Figure 70 (SEQ ID NO: 123), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 5 preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01311 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domains deleted or inactivated variants, or is complementary to such encoding 10 nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 44 in the sequence of Figure 70 (SEQ ID NO: 123). Four transmembrane domains has been tentatively identified as extending from about amino acid 22-42,57-85, 94-116, and 230-257 in the PR01311 amino acid sequence (Figure 70, SEQ ID NO: 123).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 15 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 45 to about 294, inclusive of Figure 70 (SEQ ID NO: 123), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1311 polypeptide coding sequence that may find 20 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 13II polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1311 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 45 to 294 of Figure 70 (SEQ ID NO: 123).
In another aspect, the invention concerns an isolated PR01311 polypeptide, comprising an amino acid sequence having at ieast about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 30 sequence of amino acid residues 45 to about 294, inclusive of Figure 70 (SEQ ID NO: 123).
In a further aspect, the invention concerns an isolated PR01311 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 45 to 294 of Figure 70 (SEQ ID NO: 123).
In yet another aspect, the invention concerns an isolated PR01311 polypeptide, comprising the sequence of amino acid residues 45 to about 294, inclusive of Figure 70 (SEQ ID NO: 123), or a fragment thereof sufficient to provide a binding site for an anti-PR01311 antibody. Preferably, the PR01311 fragment retains
109
a qualitative biological activity of a native PRO 1311 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01311 polypeptide having the sequence of amino acid residues from about 45 to about 294, inclusive of Figure 70 (SEQ ID NO: 123), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 5 identity, preferably at least about an 85% sequence identity, more preferably at ieast about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
36. PRQ1357
A cDNA clone (DNA64881-1602) has been identified, having homology to nucleic acid encoding the von Ebner minor salivary gland protein that encodes a novel polypeptide, designated in the present application as "PRO 1357".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 15 a PR01357 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01357 polypeptide having the sequence of amino acid residues from about 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID 20 NO: 128), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01357 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 74 or about 137 and about 1525, inclusive, ofFigure71 (SEQ ID NO: 127). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203240 (DNA64881-1602) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 30 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203240 (DNA64881-1602).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 35 identity to the sequence of amino acid residues 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID NO: 128), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 40
110
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1357 polypeptide having the sequence of amino acid residues from 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID NO: 128), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 5 identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01357 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 21 in the sequence of Figure 72 (SEQ 10 ID NO: 128).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID NO: 128), or (b) 15 the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01357 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 20 from the nucleotide sequence shown in Figure 71 (SEQ ID NO:127).
In another embodiment, the invention provides isolated PR01357 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01357 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 22 to about 484 of Figure 25 72 (SEQ ID NO: 128).
In another aspect, the invention concerns an isolated PR01357 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID NO: 128). 30 In a further aspect, the invention concerns an isolated PR01357 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID NO: 128).
In yet another aspect, the invention concerns an isolated PR01357 polypeptide, comprising the sequence 35 of amino acid residues 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID NO: 128), or a fragment thereof sufficient to provide a binding site for an anti-PR01357 antibody. Preferably, the PR01357 fragment retains a qualitative biological activity of a native PR01357 polypeptide.
Ill
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1357 polypeptide having the sequence of amino acid residues from about 1 or about 22 to about 484, inclusive of Figure 72 (SEQ ID NO: 128), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1357 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01357 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01357 polypeptide by contacting the native PR01357 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1357 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
39. PRQ1356
A cDNA clone (DNA64886-1601) has been identified, having homology to nucleic acid encoding Clostridium perfringens enterotoxin receptor, that encodes a novel polypeptide, designated in the present application as "PR01356".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01356 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01356 polypeptide having the sequence of amino acid residues from about 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID NO: 134), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01356 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 122 or about 194 and about 811, inclusive, of Figure 77 (SEQ ID NO: 133). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
112
(followedby page 117) 2 4 OCT 2003
received
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203241 (DNA64886-1601) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 5 ATCC Deposit No. 203241 (DNA64886-1601).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID 10 NO: 134), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 20 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROI356 polypeptide having the sequence of amino acid residues from 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID NO: 134), or (b) the complement of the DNA molecule of (a), and, 15 if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01356 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 20 its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 24 in the sequence of Figure 78 (SEQ ID NO: 134). The transmembrane domains have been tentatively identified as extending from about amino acid position 82 to about amino acid position 102, from about amino acid position 117 to about amino acid position 140 and from about amino acid 25 position 163 to about amino acid position 182 in the PR01356 amino acid sequence (Figure 78, SEQ ID NO: 134).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 30 amino acid sequence of residues 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID NO: 134), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01356 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 35 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 77 (SEQ ID NO: 133).
In another embodiment, the invention provides isolated PRO1356 polypeptide encoded by any of the
117
isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PRO1356 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 25 to about 230 of Figure 78 (SEQ ID NO: 134).
In another aspect, the invention concerns an isolated PR01356 polypeptide, comprising an amino acid 5 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID NO: 134).
In a further aspect, the invention concerns an isolated PRO1356 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 10 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID NO: 134).
In yet another aspect, the invention concerns an isolated PR01356 polypeptide, comprising the sequence of amino acid residues 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID NO: 134), or a fragment thereof sufficient to provide a binding site for an anti-PR01356 antibody. Preferably, the PR01356 fragment 15 retains a qualitative biological activity of a native PR01356 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl356 polypeptide having the sequence of amino acid residues from about 1 or about 25 to about 230, inclusive of Figure 78 (SEQ ID NO:134), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 20 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host ceil comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01356 25 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01356 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01356 polypeptide by contacting the native PRO 1356 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01356polypeptide, 30 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
40. PRQ1275
A cDNA clone (DNA64888-1542) has been identified that encodes a novel secreted polypeptide designated in the present application as "PR01275."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1275 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
118
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1275 polypeptide having the sequence of amino acid residues from about 26 to about 119, inclusive of Figure 80 (SEQ ID NO: 136), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01275 5 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 112 and about 393, inclusive, of Figure 79 (SEQ ID NO: 135). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 10 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203249 (DNA64888-1542), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203249 (DNA64888-1542).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 26 to about 119, inclusive of Figure 80 (SEQ ID NO: 136), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01275 polypeptide having the sequence of amino acid residues from about 26 to about 119, inclusive of Figure 80 (SEQ ID NO:136), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 25 preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecuie comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 30 amino acid sequence of residues 26 to about 119, inclusive of Figure 80 (SEQ ID NO: 136), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1275 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 35 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01275 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
119
In a specific aspect, the invention provides isolated native sequence PRO1275 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 26 through 119 of Figure 80 (SEQ ID NO: 136).
In another aspect, the invention concerns an isolated PR01275 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 5 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 26 to about 119, inclusive of Figure 80 (SEQ ID NO: 136).
In a further aspect, the invention concerns an isolated PRO1275 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 10 of residues 26 through 119 of Figure 80 (SEQ ID NO: 136).
In yet another aspect, the invention concerns an isolated PRO1275 polypeptide, comprising the sequence of amino acid residues 26 to about 119, inclusive of Figure 80 (SEQ ID NO: 136), or a fragment thereof sufficient to provide a binding site for an anti-PR01275 antibody. Preferably, the PR01275 fragment retains a qualitative biological activity of a native PR01275 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1275 polypeptide having the sequence of amino acid residues from about 26 to about 119, inclusive of Figure 80 (SEQ ID NO: 136), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 20 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01275 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01275 antibody. 25 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01275 polypeptide, by contacting the native PR01275 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01275 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
42. FRQ1412
A cDNA clone (DNA64897-1628) has been identified that encodes a novel transmembrane polypeptide designated in die present application as "PR01412."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01412 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01412 polypeptide having
120
INTELLECTUAL PROPERTY 0FRCF OF N.Z
(followed by page 123)
24 OCT 2003
received
the sequence of amino acid residues from 1 or about 29 to about 311, inclusive of Figure 84 (SEQ ID NO: 140), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROM 12 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 226 and about 1074, inclusive, of Figure 83 (SEQ ID NO: 139). Preferably, hybridization occurs under stringent 5 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203216 10 (DNA64897-1628), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203216 (DNA64897-1628).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 15 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 29 to about 311, inclusive of Figure 84 (SEQ ID NO: 140), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 20 under stringent conditions with (a) a DNA molecuie encoding a PR01412 polypeptide having the sequence of amino acid residues from about 29 to about 311, inclusive of Figure 84 (SEQ ID NO: 140), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 25 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01412 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 28 in the sequence of Figure 84 (SEQ ID NO: 140). The transmembrane 30 domain has been tentatively identified as extending from about amino acid position 190 through about amino acid position 216 in the PR01412 amino acid sequence (Figure 84, SEQ ID N0:140).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 35 amino acid sequence of residues 29 to about 311, inclusive of Figure 84 (SEQ ID N0:140), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1412 polypeptide coding sequence that may find
123
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01412 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
Inaspecific aspect, the invention provides isolated native sequence PR01412 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 29 to 311 of Figure 84 (SEQ ID NO: 140).
In another aspect, the invention concerns an isolated PROM 12 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 10 sequence of amino acid residues 29 to about 311, inclusive of Figure 84 (SEQ ID N0:140).
In a further aspect, the invention concerns an isolated PR01412 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 29 to 311 of Figure 84 (SEQ ID N0:140).
In yet another aspect, the invention concerns an isolated PRO 1412 polypeptide, comprising the sequence of amino acid residues 29 to about 311, inclusive of Figure 84 (SEQ ID NO: 140), or a fragment thereof sufficient to provide a binding site for an anti-PR01412 antibody. Preferably, the PR01412 fragment retains a qualitative biological activity of a native PR01412 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 20 molecule under stringent conditions with (a) a DNA molecule encoding a PR01412 polypeptide having the sequence of amino acid residues from about 29 to about 311, inclusive of Figure 84 (SEQ ID NO: 140), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 25 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
43. PRQ1557
A cDNA clone (DNA64902-1667) has been identified that encodes a novel polypeptide having homology 30 to chordin and designated in the present application as "PR01557".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1557 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 35 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01557 polypeptide having the sequence of amino acid residues from 1 or about 26 to about 451, inclusive of Figure 86 (SEQ ID NO: 142), or (b) the complement of the DNA molecule of (a).
124
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01557 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 362 and about 1639, inclusive, of Figure 85 (SEQ ID NO: 141). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 5 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203317 (DNA64902-1667), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 10 Deposit No. 203317 (DNA64902-1667).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 26 to about 451, inclusive of Figure 86 (SEQ ID 15 NO: 142), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1557 polypeptide having the sequence of amino acid residues from about 26 to about 451, inclusive of Figure 86 (SEQ ID NO: 142), or (b) the 20 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1557 polypeptide, with or without the N-terminal signal sequence, or is complementary to such encoding 25 nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 25 in the sequence of Figure 86 (SEQ ID NO: 142).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 30 amino acid sequence of residues 26 to about 451, inclusive of Figure 86 (SEQ ID NO:142), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01557 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 35 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01557 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
125
In a specific aspect, the invention provides isolated native sequence PRO 1557 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 26 to 451 of Figure 86 (SEQ ID N0:142).
In another aspect, the invention concerns an isolated PR01557 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 5 sequence of amino acid residues 26 to about 451, inclusive of Figure 86 (SEQ ID NO: 142).
In a further aspect, the invention concerns an isolated PR01557 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 26 to 451 of Figure 86 (SEQ ID NO: 142).
In yet another aspect, the invention concerns an isolated PRO1557 polypeptide, comprising the sequence of amino acid residues 26 to about 451, inclusive of Figure 86 (SEQ ID NO:142), or a fragment thereof sufficient to provide a binding site for an anti-PR01557 antibody. Preferably, the PR01557 fragment retains a qualitative biological activity of a native PR01557 polypeptide.
molecule under stringent conditions with (a) a DNA molecule encoding a PR01557 polypeptide having the sequence of amino acid residues from about 26 to about 451, inclusive of Figure 86 (SEQ ID NO: 142), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 20 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01557 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01557 antibody.
native PRO1557 polypeptide by contacting the native PRO1557 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1557 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
46. PRQ1347
A cDNA clone (DNA64950-1590) has been identified that encodes a novel polypeptide having sequence identity with butyrophilin and designated in the present application as "PR01347."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01347 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a
126
(followed by page 131)
INTELLECTUAL PROPERTY OFRCE OF N.Z
24 OCT 2003
received
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1347 polypeptide having the sequence of amino acid residues from 1 or about 18 to about 500, inclusive of Figure 92 (SEQ ID NO: 148), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01347 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 234 and 5 about 1682, inclusive, of Figure 91 (SEQ ID NO: 147). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 10 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203224 (DNA64950-1590), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203224 (DNA64950-1590).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 15 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 18 to about 500, inclusive of Figure 92 (SEQ ID NO: 148), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 20 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1347 polypeptide having the sequence of amino acid residues from about 18 to about 500, inclusive of Figure 92 (SEQ ID NO: 148), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 25 preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01347 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted (or that terminus truncated) or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 30 extending from amino acid position 1 through about amino acid position 17 in the sequence of Figure 92 (SEQ ID NO: 148). The transmembrane domain has been tentatively identified as extending from about amino acid position 239 through about amino acid position 255 in the PR01347 amino acid sequence (Figure 92, SEQ ID NO: 148).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 35 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 to about 500, inclusive of Figure 92 (SEQ ID NO: 148), or (b) the
131
complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1347 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PROl 347 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01347 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 18 through 500 of Figure 92 (SEQ ID NO: 148).
In another aspect, the invention concerns an isolated PR01347 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 18 to about 500, inclusive of Figure 92 (SEQ ID NO: 148).
In a further aspect, the invention concerns an isolated PR01347 polypeptide, comprising an amino acid 15 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 18 through 500 of Figure 92 (SEQ ID NO: 148).
In yet another aspect, the invention concerns an isolated PR01347 polypeptide, comprising the sequence of amino acid residues 18 to about 500, inclusive of Figure 92 (SEQ ID NO: 148), or a fragment thereof 20 sufficient to provide a binding site for an anti-PROl347 antibody. Preferably, the PRO1347 fragment retains a qualitative biological activity of a native PRO 1347 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01347 polypeptide having the sequence of amino acid residues from about 18 to about 500, inclusive of Figure 92 (SEQ ID NO: 148), or (b) 25 the complement of the DNA molecuie of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01347
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01347 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO1347 polypeptide, by contacting the native PRO1347 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1347 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
132
47. PRQ1305
A cDNA clone (DNA64952-1568) has been identified that encodes a novel secreted polypeptide, designated in the present application as "PRO1305".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1305 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01305 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID NO: 153), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01305
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 126 or about 201 and about 899, inclusive, of Figure 93 (SEQ ID NO:152). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 15 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203222 (DNA64952-1568) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 20 ATCC Deposit No. 203222 (DNA64952-1568).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID 25 NO: 153), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01305 polypeptide having the sequence of amino acid residues from 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID NO: 153), or (b) the complement of the DNA molecule of (a), and, 30 if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01305 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is 35 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position I to about amino acid position 25 in the sequence of Figure 94 (SEQ ID NO: 153).
133
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID NO: 153), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1305 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 93 (SEQ ID NO: 152).
In another embodiment, the invention provides isolated PRO1305 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01305 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 26 to about 258 of Figure 94 (SEQ ID NO: 153).
In another aspect, the invention concerns an isolated PR01305 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID NO: 153).
In a further aspect, the invention concerns an isolated PR01305 polypeptide, comprising an amino acid 20 sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID NO: 153).
In yet another aspect, the invention concerns an isolated PRO 1305 polypeptide, comprising the sequence of amino acid residues 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID NO: 153), or a fragment 25 thereof sufficient to provide a binding site for an anti-PR01305 antibody. Preferably, the PR01305 fragment retains a qualitative biological activity of a native PR01305 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01305 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 258, inclusive of Figure 94 (SEQ ID 30 NO: 153), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
48. PRQ1273
A cDNA clone (DNA65402-1540) has been identified that encodes a novel polypeptide having sequence
134
identity with lipocalins and designated in the present application as "PRO 1273."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PROI273 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 5 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01273 polypeptide having the sequence of amino acid residues from 1 or about 21 to about 163, inclusive of Figure 96 (SEQ ID NO: 158), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01273 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 86 and 10 about 514, inclusive, of Figure 95 (SEQ ID NO: 157). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 15 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203252 (DNA65402-1540), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203252 (DNA65402-1540).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 20 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 21 to about 163, inclusive of Figure 96 (SEQ ID NO: 158), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 25 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01273 polypeptide having the sequence of amino acid residues from about 21 to about 163, inclusive of Figure 96 (SEQ ID NO: 158), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most 30 preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 to about 163, inclusive of Figure 96 (SEQ ID NO: 158), or (b) the 35 complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1273 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length,
135
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1273 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01273 polypeptide, which in one 5 embodiment, includes an amino acid sequence comprising residues 21 through 163 of Figure 96 (SEQ ID NO: 158).
In another aspect, the invention concerns an isolated PR01273 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at.least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 10 sequence of amino acid residues 21 to about 163, inclusive of Figure 96 (SEQ ID NO: 158).
In a further aspect, the invention concerns an isolated PROI273 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 21 through 163 of Figure 96 (SEQ ID NO: 158).
In yet another aspect, the invention concerns an isolated PRO1273 polypeptide, comprising the sequence of amino acid residues 21 to about 163, inclusive of Figure 96 (SEQ ID NO: 158), or a fragment thereof sufficient to provide a binding site for an anti-PR01273 antibody. Preferably, the PR01273 fragment retains a qualitative biological activity of a native PR01273 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 20 molecule under stringent conditions with (a) a DNA molecule encoding a PR01273 polypeptide having the sequence of amino acid residues from about 21 to about 163, inclusive of Figure 96 (SEQ ID NO: 158), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 25 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1273 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01273 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 30 native PR01273 polypeptide, by contacting the native PR01273 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01273 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
49. PRQ1302
A cDNA clone (DNA65403-1565) has been identified that encodes a novel polypeptide having sequence identity with CD33 and designated in the present application as "PRO 1302."
136
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01302 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01302 polypeptide having 5 the sequence of amino acid residues from 1 or about 16 to about 463, inclusive of Figure 98 (SEQ ID NO: 160), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01302 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 88 and about 1431, inclusive, of Figure 97 (SEQ ID NO: 159). Preferably, hybridization occurs under stringent 10 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203230 15 (DNA65403-1565), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203230 (DNA65403-1565).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 20 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 16 to about 463, inclusive of Figure 98 (SEQ ID NO: 160), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 25 under stringent conditions with (a) a DNA molecule encoding a PRO1302 polypeptide having the sequence of amino acid residues from about 16 to about 463, inclusive of Figure 98 (SEQ ID NO: 160), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 30 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01302 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted (or truncated form) or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 15 in the sequence of Figure 98 (SEQ ID NO: 160). 35 The transmembrane domain has been tentatively identified as extending from about amino acid position 351 through about amino acid position 370 in the PRO 1302 amino acid sequence (Figure 98, SEQ ID N0:160).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
137
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 16 to about 463, inclusive of Figure 98 (SEQ ID NO: 160), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1302 polypeptide coding sequence that may find 5 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PROI302 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PROl302 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 16 through 463 of Figure 98 (SEQ ID NO: 160).
In another aspect, the invention concerns an isolated PR01302 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 15 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 16 to about 463, inclusive of Figure 98 (SEQ ID NO: 160).
In a further aspect, the invention concerns an isolated PR01302 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence 20 of residues 16 through 463 of Figure 98 (SEQ ID NO: 160).
In yet another aspect, the invention concerns an isolated PRO 1302 polypeptide, comprising the sequence of amino acid residues 16 to about 463, inclusive of Figure 98 (SEQ ID NO: 160), or a fragment thereof sufficient to provide a binding site for an anti-PR01302 antibody. Preferably, the PR01302 fragment retains a qualitative biological activity of a native PR01302 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1302 polypeptide having the sequence of amino acid residues from about 16 to about 463, inclusive of Figure 98 (SEQ ID NO: 160), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 30 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01302 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01302 antibody. 35 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01302 polypeptide, by contacting the native PR01302 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
138
In a still further embodiment, the invention concerns a composition comprising a PROl 302 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
50. PRQ1283
A cDNA clone (DNA65404-1551) has been identified, having homology to nucleic acid encoding 5 odorant binding protein, that encodes a novel polypeptide, designated in the present application as " PRO1283".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01283 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 10 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01283 polypeptide having the sequence of amino acid residues from about 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1283 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 45 15 or about 96 and about 554, inclusive, of Figure 99 (SEQ ID NO: 161). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 20 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203244 (DNA65404-1551) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203244 (DNA65404-1551).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 30 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01283 polypeptide having the sequence of amino acid residues from 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 35 identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01283 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is
139
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 17 in the sequence of Figure 100 (SEQ ID NO: 162).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 5 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01283 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 10 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 99 (SEQ ID NO: 161).
In another embodiment, the invention provides isolated PR01283 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PRO1283 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 18 to about 170 of Figure 100 (SEQ ID NO: 162).
In another aspect, the invention concerns an isolated PRO1283 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 20 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162).
In a further aspect, the invention concerns an isolated PR01283 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 25 of residues 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162).
In yet another aspect, the invention concerns an isolated PRO 1283 polypeptide, comprising the sequence of amino acid residues 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162), or a fragment thereof sufficient to provide a binding site for an anti-PR01283 antibody. Preferably, the PR01283 fragment retains a qualitative biological activity of a native PR01283 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01283 polypeptide having the sequence of amino acid residues from about 1 or about 18 to about 170, inclusive of Figure 100 (SEQ ID NO: 162), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 35 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
140
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1283 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01283 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01283 polypeptide by contacting the native PR01283 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01283 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
51. PROl 279
A cDNA clone (DNA65405-1547) has been identified, having homology to nucleic acid encoding 10 neuropsin that encodes a novel polypeptide, designated in the present application as "PR01279".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01279 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 15 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01279 polypeptide having the sequence of amino acid residues from about 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID NO: 170), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01279 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 106 20 or about 160 and about 855, inclusive, of Figure 101 (SEQ ID NO: 169). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 25 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203476 (DNA65405-1547) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203476 (DNA65405-1547).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 30 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID NO: 170), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 35 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01279 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID NO: 170), or (b) the complement of the DNA molecule of (a), and,
141
if the DNA molecuie has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01279 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is 5 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 18 in the sequence of Figure 102 (SEQ ID NO: 170).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID NO: 170), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01279 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 101 (SEQ ID NO: 169).
In another embodiment, the invention provides isolated PRO 1279 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PROI279 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 19 to about 250 of Figure 102 (SEQ ID NO: 170).
In another aspect, the invention concerns an isolated PR01279 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 25 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID NO: 170).
In a further aspect, the invention concerns an isolated PR01279polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 30 of residues 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID NO: 170).
In yet another aspect, the invention concerns an isolated PR01279 polypeptide, comprising the sequence of amino acid residues 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID NO: 170), or a fragment thereof sufficient to provide a binding site for an anti-PR01279 antibody. Preferably, the PRO1279 fragment retains a qualitative biological activity of a native PR01279 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01279 polypeptide having the sequence of amino acid residues from about 1 or about 19 to about 250, inclusive of Figure 102 (SEQ ID
142
NO: 170), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1279
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01279 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01279 polypeptide by contacting the native PR01279 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01279 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
S3. PRQ1317
A cDNA clone (DNA65408-1578) has been identified that encodes a novel secreted polypeptide that IS shares homology with human CD97. The novel polypeptide is designated in the present application as "PR01317".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01317 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 20 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01317 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 74, inclusive of Figure 106 (SEQ ID NO: 189), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01317 25 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 60 and about 227, inclusive, of Figure 105 (SEQ ID NO: 188). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 30 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203217 (DNA65408-1578), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203217 (DNA65408-1578).
; ,
143
^followed by page 146)
INTELLECTUAL PROPERTY OFRCE OF IM.Z
24 OCT 2003 received
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 19 to about 74, inclusive of Figure 106 (SEQ ID NO: 189), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01317 polypeptide having the sequence of amino acid residues from about 19 to about 74, inclusive of Figure 106 (SEQ ID NO: 189), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 10 preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01317 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 15 extending from amino acid position 1 through about amino acid position 18 in the sequence of Figure 106 (SEQ ID NO: 189).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 20 amino acid sequence of residues 19 to about 74, inclusive of Figure 106 (SEQ ID NO: 189), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01317 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 25 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01317 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01317 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 19 to 74 of Figure 106 (SEQ ID NO: 189). 30 In another aspect, the invention concerns an isolated PR01317 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 74, inclusive of Figure 106 (SEQ ID NO: 189).
In a further aspect, the invention concerns an isolated PROI317 polypeptide, comprising an amino acid 35 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 19 to 74 of Figure 106 (SEQ ID NO: 189).
146
In yet another aspect, the invention concerns an isolated PRO 1317 polypeptide, comprising the sequence of amino acid residues 19 to about 74, inclusive of Figure 106 (SEQ ID NO: 189), or a fragment thereof sufficient to provide a binding site for an anti-PR01317 antibody. Preferably, the PR01317 fragment retains a qualitative biological activity of a native PR01317 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 5 molecule under stringent conditions with (a) a DNA molecule encoding a PR01317 polypeptide having the sequence of amino acid residues from about 19 to about 74, inclusive of Figure 106 (SEQ ID NO: 189), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 10 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
55. PRQ1306
o
A cDNA clone (DNA65410-1569) has been identified that encodes a novel polypeptide having homology to AIFl/daintain and designated in the present application as "PR01306".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01306 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01306 polypeptide having the sequence of amino acid residues from about 1 to about 150, inclusive of Figure 110 (SEQ ID NO: 196), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01306 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 106 and about 555, inclusive, of Figure 109 (SEQ ID NO: 195). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203231 (DNA65410-1569), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203231 (DNA65410-1569).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 150, inclusive of Figure 110 (SEQ ID NO: 196), or the complement of the DNA of (a).
147
(followed by page 150)
INTELLECTUAL PROPERTY OFRCE OF N.Z
2 4 OCT 2003
received
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1306 polypeptide having the sequence of amino acid residues from about 1 to about 150, inclusive of Figure 110 (SEQ ID NO: 196), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 5 preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 10 amino acid sequence of residues 1 to about 150, inclusive of Figure 110 (SEQ ID NO: 196), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1306 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 15 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01306 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1306 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 150 of Figure 110 (SEQ ID NO:196). 20 In another aspect, the invention concerns an isolated PRO 1306 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 150, inclusive of Figure 110 (SEQ ID NO: 196).
In a further aspect, the invention concerns an isolated PRO1306 polypeptide, comprising an amino acid 25 sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to 150 of Figure 110 (SEQ ID NO:196).
In yet another aspect, the invention concerns an isolated PRO1306 polypeptide, comprising the sequence of amino acid residues 1 to about 150, inclusive of Figure 110 (SEQ ID NO: 196), or a fragment thereof 30 sufficient to provide a binding site for an anti-PR01306 antibody. Preferably, the PR01306 fragment retains a qualitative biological activity of a native PR01306 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1306 polypeptide having the sequence of amino acid residues from about 1 to about 150, inclusive of Figure 110 (SEQ ID NO: 196), or (b) 35 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising
150
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01306 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01306 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 5 native PR01306 polypeptide, by contacting the native PR01306 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01306 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
56. PRQ1336
A cDNA clone (DNA65423-1595) has been identified that encodes a novel polypeptide having sequence identity with slit and designated in the present application as "PRO 1336."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01336 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01336 polypeptide having the sequence of amino acid residues from 1 or about 28 to about 1523, inclusive of Figure 112 (SEQ ID NO: 198), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01336
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 164 and about 4651, inclusive, ofFigures lllA-B(SEQIDNO:197). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 25 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203227 (DNA65423-1595), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 30 Deposit No. 203227 (DNA65423-1595).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 28 to about 1523, inclusive of Figure 112 (SEQ ID 35 NO: 198), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule
151
under stringent conditions with (a) a DNA molecule encoding a PRO 1336 polypeptide having the sequence of amino acid residues from about 28 to about 1523, inclusive of Figure 112 (SEQ ID NO: 198), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 28 to about 1523, inclusive of Figure 112 (SEQ ID NO: 198), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01336 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1336 polypeptide encoded by any of the 15 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1336 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 28 through 1523 of Figure 112 (SEQ ID NO:198).
In another aspect, the invention concerns an isolated PR01336 polypeptide, comprising an amino acid 20 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 28 to about 1523, inclusive of Figure 112 (SEQ ID N0:198).
In a further aspect, the invention concerns an isolated PRO1336 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 25 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 28 through 1523 of Figure 112 (SEQ ID NO: 198).
In yet another aspect, the invention concerns an isolated PR01336 polypeptide, comprising the sequence of amino acid residues 28 to about 1523, inclusive of Figure 112 (SEQ ID NO: 198), or a fragment thereof sufficient to provide a binding site for an anti-PR01336 antibody. Preferably, the PR01336 fragment retains 30 a qualitative biological activity of a native PR01336 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01336 polypeptide having the sequence of amino acid residues from about 28 to about 1523, inclusive of Figure 112 (SEQ ID NO: 198), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% 35 sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii)
152
recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01336 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01336 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01336 polypeptide, by contacting the native PROI336 polypeptide with a candidate molecule and 5 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1336 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
57. PRQ1278
A cDNA clone (DNA66304-1546) has been identified that encodes a novel polypeptidehavinghomology to lysozyme C and designated in the present application as "PR01278."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01278 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 15 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01278 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 148, inclusive of Figure 114 (SEQ ID N0:203), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01278 20 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 198 and about 584, inclusive, of Figure 113 (SEQ ID N0:202). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 25 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203321 (DNA66304-1546), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203321 (DNA66304-1546).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 148, inclusive of Figure 114 (SEQ ID N0:203), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01278 polypeptide having the sequence of
153
amino acid residues from about 20 to about 148, inclusive of Figure 114 (SEQ ID N0:203), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PR01278 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 19 in the sequence of Figure 114 (SEQ ID N0:203).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 to about 148, inclusive of Figure 114 (SEQ ID N0:203), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1278 polypeptide coding sequence that may find 15 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01278 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1278 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 to 148 of Figure 114 (SEQ ID N0:203).
In another aspect, the invention concerns an isolated PRO1278 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 25 sequence of amino acid residues 20 to about 148, inclusive of Figure 114 (SEQ ID N0:203).
In a further aspect, the invention concerns an isolated PR01278 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 20 to 148 of Figure 114 (SEQ ID N0:203).
In yet another aspect, the invention concerns an isolated PR01278 polypeptide, comprising the sequence of amino acid residues 20 to about 148, inclusive of Figure 114 (SEQ ID N0:203), or a fragment thereof sufficient to provide a binding site for an anti-PR01278 antibody. Preferably, the PR01278 fragment retains a qualitative biological activity of a native PRO1278 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 35 molecule under stringent conditions with (a) a DNA molecuie encoding a PROI278 polypeptide having the sequence of amino acid residues from about 20 to about 148, inclusive of Figure 114 (SEQ ID N0:203), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence
154
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1278 5 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01278 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO 1278 polypeptide, by contacting the native PRO 1278 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1278 polypeptide, 10 or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceuticaliy acceptable carrier.
58. PRQ1298
A cDNA clone (DNA66511-1563) has been identified that encodes a novel polypeptide having sequence identity with glycosyltransferases and designated in the present application as "PR01298." 15 In one embodiment, the invention provides an isolated nucleic acid molecuie comprising DNA encoding a PR01298 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01298 polypeptide having 20 the sequence of amino acid residues from 1 or about 16 to about 323, inclusive of Figure 116 (SEQ ID N0:210), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01298 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 139 and about 1062, inclusive, of Figure 115 (SEQ ID N0:209). Preferably, hybridization occurs under stringent 25 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203228 30 (DNA66511-1563), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203228 (DNA66511-1563).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 35 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 16 to about 323, inclusive of Figure 116 (SEQ ID N0:210), or the complement of the DNA of (a).
155
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1298 polypeptide having the sequence of amino acid residues from about 16 to about 323, inclusive of Figure 116 (SEQ ID N0:210), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 5 preferably at least about an 85% sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 10 amino acid sequence of residues 16 to about 323, inclusive of Figure 116 (SEQ ID N0:210), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01298 polypeptide coding sequence that may fmd use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 15 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01298 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1298 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 16 through 323 of Figure 116 (SEQ ID 20 N0:210).
In another aspect, the invention concerns an isolated PRO1298 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 16 to about 323, inclusive of Figure 116 (SEQ ID N0:210). 25 In a further aspect, the invention concerns an isolated PRO1298 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 16 through 323 of Figure 116 (SEQ ID N0:210).
In yet another aspect, the invention concerns an isolated PRO1298 polypeptide, comprising the sequence 30 of amino acid residues 16 to about 323, inclusive of Figure 116 (SEQ ID N0:210), or a fragment thereof sufficient to provide a binding site for an anti-PR01298 antibody. Preferably, the PR01298 fragment retains a qualitative biological activity of a native PRO 1298 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01298 polypeptide having the 35 sequence of amino acid residues from about 16 to about 323, inclusive of Figure 116 (SEQ ID N0:210), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence
156
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
in yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1298 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01298 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01298 polypeptide, by contacting the native PR01298 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1298 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
59. PRQ1301
A cDN A clone (DN A66512-1564) has been identified that encodes a novel polypeptide having homology to cytochrome P450 and designated in the present application as "PR01301."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 15 a PR01301 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01301 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 462, inclusive of Figure 118(SEQIDNO:212), 20 or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1301 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 97 and about 1428, inclusive, of Figure 117 (SEQ ID NO:211). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203218 (DNA66512-1564), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 30 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203218 (DNA66512-1564).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 35 identity to the sequence of amino acid residues from about 19 to about 462, inclusive of Figure 118 (SEQ ID NO:212), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
157
PCMJS99/20I11
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecuie encoding a PRO 1301 polypeptide having the sequence of amino acid residues from about 19 to about 462, inclusive of Figure 118 (SEQ ID N0:212), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 5 preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1301 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 10 through about amino acid position 18 in the sequence of Figure 118 (SEQ ID NO:212). The transmembrane domain has been tentatively identified as extending from about amino acid position 271 through about amino acid position 290 in the PR01301 amino acid sequence (Figure 118, SEQ ID NO:212).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 15 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 19 to about 462, inclusive of Figure 118 (SEQ ID NO:212), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1301 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 20 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01301 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
Inaspecific aspect, the invention provides isolated native sequence PR01301 polypeptide, which in one 25 embodiment, includes an amino acid sequence comprising residues 19 to 462 of Figure 118 (SEQ ID N0:212).
In another aspect, the invention concerns an isolated PR01301 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 462, inclusive of Figure 118 (SEQ ID N0:212). 30 In a further aspect, the invention concerns an isolated PRO 1301 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 19 to 462 of Figure 118 (SEQ ID NO:212).
In yet another aspect, the invention concerns an isolated PR01301 polypeptide, comprising the sequence 35 of amino acid residues 19 to about 462, inclusive of Figure 118 (SEQ ID NO:212), or a fragment thereof sufficient to provide a binding site for an anti-PR01301 antibody. Preferably, the PR01301 fragment retains a qualitative biological activity of a native PR01301 polypeptide.
158
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01301 polypeptide having the sequence of amino acid residues from about 19 to about 462, inclusive of Figure 118 (SEQ ID NO:212), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 5 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
60. PRQ1268
A cDNA clone (DNA66519-1535) has been identified that encodes a novel transmembrane polypeptide designated in the present application as "PR01268."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1268 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 15 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1268 polypeptide having the sequence of amino acid residues from about 1 to about 140, inclusive of Figure 120 (SEQ ID NO:214), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1268 20 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 89 and about 508, inclusive, of Figure 119 (SEQ ID NO:213). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 25 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203236 (DNA66519-1535), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203236 (DNA66519-1535).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 140, inclusive of Figure 120 (SEQ ID NO:214), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01268 polypeptide having the sequence of
159
WO 00/12708 PCT/US99/20111
amino acid residues from about 1 to about 140, inclusive of Figure 120 (SEQ ID NO:214), or (b) the complement of the DNA molecule of (a), and, if the DNA molecuie has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PR01268 polypeptide, with one or more of its soluble, i.e. transmembrane, domains deleted or inactivated, or is complementary to such encoding nucleic acid molecule. Transmembrane domains has been tentatively identified at about amino acids 12-28 (type II), 51-66, and 107-124 in the PRO 1268 amino acid sequence (Figure 120, SEQ ID N0:214).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 140, inclusive of Figure 120 (SEQ ID NO:214), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1268 polypeptide coding sequence that may find 15 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01268 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1268 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 140 of Figure 120 (SEQ ID NO:214).
In another aspect, the invention concerns an isolated PR01268 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 25 sequence of amino acid residues 1 to about 140, inclusive of Figure 120 (SEQ ID NO:214).
In a further aspect, the invention concerns an isolated PRO1268 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to 140 of Figure 120 (SEQ ID NO:214).
In yet another aspect, the invention concerns an isolated PR01268 polypeptide, comprising the sequence of amino acid residues 1 to about 140, inclusive of Figure 120 (SEQ ID NO:214), or a fragment thereof sufficient to provide a binding site for an anti-PR01268 antibody. Preferably, the PRO1268 fragment retains a qualitative biological activity of a native PR01268 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 35 molecule under stringent conditions with (a) a DNA moiecule encoding a PR01268 polypeptide having the sequence of amino acid residues from about 1 to about 140, inclusive of Figure 120 (SEQ ID NO:214), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence
160
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
61. PRQ1269
A cDNA clone (DNA66520-1536) has been identified thatencodesa novel polypeptide having homology to granulocyte peptide A and designated in the present application as "PR01269."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1269 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1269 polypeptide having the sequence of amino acid residues from 1 or about 21 to about 196, inclusive of Figure 122 (SEQ ID NO:2I6), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01269
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 86 and about 613, inclusive, of Figure 121 (SEQ ID NO:215). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 20 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203226 (DNA66520-1536), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 25 Deposit No. 203226 (DNA66520-1536).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 21 to about 196, inclusive of Figure 122 (SEQ ID 30 NO:216), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01269 polypeptide having the sequence of amino acid residues from about 21 to about 196, inclusive of Figure 122 (SEQ ID NO:216), or (b) the 35 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
161
WO 00/12708 PCT/US99/20111
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01269 polypeptide, with or without the N-teraiinal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 20 in the sequence of Figure 122 (SEQ ID NO:216).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 to about 196, inclusive of Figure 122 (SEQ ID NO:21)6, or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1269 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1269 polypeptide encoded by any of the 15 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01269 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 21 to 196 of Figure 122 (SEQ ID NO:216).
In another aspect, the invention concerns an isolated PRO1269 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 20 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 21 to about 196, inclusive of Figure 122 (SEQ ID NO:216).
In a further aspect, the invention concerns an isolated PR01269 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 25 of residues 21 to 196 of Figure 122 (SEQ ID NO:216).
In yet another aspect, the invention concerns an isolated PRO1269 polypeptide, comprising the sequence of amino acid residues 21 to about 196, inclusive of Figure 122 (SEQ ID NO:216), or a fragment thereof sufficient to provide a binding site for an anti-PR01269 antibody. Preferably, the PR01269 fragment retains a qualitative biological activity of a native PRO 1269 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1269 polypeptide having the sequence of amino acid residues from about 21 to about 196, inclusive of Figure 122 (SEQ ID NO:216), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 35 identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
162
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1269 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01269 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01269 polypeptide, by contacting the native PR01269 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1269 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
62. PRQ1327
A cDNA clone (DNA66521-1583) has been identified, having homology to nucleic acid encoding 10 neurexoplilin, that encodes a novel polypeptide, designated in the present application as "PR01327".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01327 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 15 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1327 polypeptide having the sequence of amino acid residues from about 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID NO:218), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01327 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 55 20 or about 97 and about 810, inclusive, of Figure 123 (SEQ ID NO:217). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 25 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203225 (DNA66521-1583) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203225 (DNA66521-1583).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 30 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID NO:218), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 260 35 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01327 polypeptide having the sequence of amino acid residues from 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID NO:218), or (b) the complement of the DNA molecule of (a), and,
163
WO 00/12708 PCT/US99/20111
if the DNA molecule has at least about an 80% sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01327 polypeptide, with or without the N-teiminal signal sequence and/or the initiating methionine, or is 5 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 14 in the sequence of Figure 124 (SEQ ID NO:218).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID NO:218), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1327 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 123 (SEQ ID NO:217).
In another embodiment, the invention provides isolated PR01327 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01327 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 15 to about 252 of Figure 124 (SEQ ID NO:218).
In another aspect, the invention concerns an isolated PR01327 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 25 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID NO:218).
In a further aspect, the invention concerns an isolated PRO1327 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence 30 of residues 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID NO:218).
In yet another aspect, the invention concerns an isolated PRO1327 polypeptide, comprising the sequence of amino acid residues 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID NO:218), or a fragment thereof sufficient to provide a binding site for an anti-PR01327 antibody. Preferably, the PR01327 fragment retains a qualitative biological activity of a native PR01327 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01327 polypeptide having the sequence of amino acid residues from about 1 or about 15 to about 252, inclusive of Figure 124 (SEQ ID
164
PCMJS99/201U
NO:218), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01327
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01327 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01327 polypeptide by contacting the native PR01327 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1327 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
63. PRQ1382
A cDNA clone (DNA66526-1616) has been identified that encodes a novel polypeptide having homology 15 to cerebellin and designated in the present application as "PR01382."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01382 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 20 preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01382 polypeptide having the sequence of amino acid residues from 1 or about 28 to about 201, inclusive of Figure 126 (SEQ ID N0:220), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01382 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 418 and 25 about 939, inclusive, of Figure 125 (SEQ ID NO:219). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecuie comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 30 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203246 (DNA66526-1616), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203246 (DNA66526-1616).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 35 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 28 to about 201, inclusive of Figure 126 (SEQ ID
165
N0:220), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01382 polypeptide having the sequence of amino acid residues from about 28 to about 201, inclusive of Figure 126 (SEQ ID N0:220), or (b) the 5 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1382 polypeptide, with or without the N-terminal signal sequence, or is complementary to such encoding 10 nucleic acid molecuie. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 27 in the sequence of Figure 126 (SEQ ID N0:220).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 15 amino acid sequence of residues 28 to about 201, inclusive of Figure 126 (SEQ ID N0:220), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01382 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 20 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1382 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1382 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 28 to 201 of Figure 126 (SEQ ID N0:220). 25 In another aspect, the invention concerns an isolated PRO 1382 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 28 to about 201, inclusive of Figure 126 (SEQ ID N0:220).
In a further aspect, the invention concerns an isolated PR01382 polypeptide, comprising an amino acid 30 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 28 to 201 of Figure 126 (SEQ ID N0:220).
In yet another aspect, the invention concerns an isolated PRO1382 polypeptide, comprising the sequence of amino acid residues 28 to about 201, inclusive of Figure 126 (SEQ ID N0:220), or a fragment thereof 35 sufficient to provide a binding site for an anti-PR01382 antibody. Preferably, the PR01382 fragment retains a qualitative biological activity of a native PR01382 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
166
WO 00/12708 PCT/US99/20111
molecule under stringent conditions with (a) a DNA molecule encoding a PR01382 polypeptide having the sequence of amino acid residues from about 28 to about 201, inclusive of Figure 126 (SEQ ID N0.220), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 5 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1382 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PRO!382 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 10 native PR01382 polypeptide, by contacting the native PR01382 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1382 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
64. PRQ1328
A cDNA clone (DNA66658-1584) has been identified that encodes a novel transmembrane polypeptide, designated in the present application as "PRO1328".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01328 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01328 polypeptide having the sequence of amino acid residues from about 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01328
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 9 or about 66 and about 779, inclusive, of Figure 127 (SEQ ID NO:224). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 30 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203229 (DNA66658-1584) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 35 ATCC Deposit No. 203229 (DNA66658-1584).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence
167
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 475 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 5 molecule encoding a PR01328 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1328 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 19 in the sequence of Figure 128 (SEQ ID NO:225). The transmembrane IS domains have been tentatively identified as extending from about amino acid position 32 to about amino acid position 51, from about amino acid position 119 to about amino acid position 138, from about amino acid position 152 to about amino acid position 169 and from about amino acid position 216 to about amino acid position 235 in the PR01328 amino acid sequence (Figure 128, SEQ ID NO:225).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 20 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01328 polypeptide coding sequence that may find 25 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 127 (SEQ ID NO:224).
In another embodiment, the invention provides isolated PRO1328 polypeptide encoded by any of the 30 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01328 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 20 to about 257 of Figure 128 (SEQ ID NO:225).
In another aspect, the invention concerns an isolated PR01328 polypeptide, comprising an amino acid 35 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225).
168
In a further aspect, the invention concerns an isolated PR01328 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225).
In yet another aspect, the invention concerns an isolated PR01328 polypeptide, comprising the sequence 5 of amino acid residues 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225), or a fragment thereof sufficient to provide a binding site for an anti-PR01328 antibody. Preferably, the PR01328 fragment retains a qualitative biological activity of a native PR01328 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01328 polypeptide having the 10 sequence of amino acid residues from about 1 or about 20 to about 257, inclusive of Figure 128 (SEQ ID NO:225), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 15 recovering the polypeptide from the cell culture.
65. PROl 325
A cDNA clone (DNA66659-1593) has been identified that encodes a novel transmembrane polypeptide, designated in the present application as "PR01325".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01325 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1325 polypeptide having 25 the sequence of amino acid residues from about 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01325 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 51 or about 105 and about 2546, inclusive, of Figure 129 (SEQ ID NO:226). Preferably, hybridization occurs 30 under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203269 35 (DNA66659-1593) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203269 (DNA66659-1593).
169
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01325 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 10 identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01325 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding 15 nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 18 in the sequence of Figure 130 (SEQ ID NO:227). The transmembrane domains have been tentatively identified as extending from about amino acid position 292 to about amino acid position 317, from about amino acid position 451 to about amino acid position 470, from about amino acid position 501 to about amino acid position 520, from about amino acid position 607 to about amino acid position 20 627 and from about amino acid position 751 to about amino acid position 770 in the PRO1325 amino acid sequence (Figure 130, SEQ ID NO:227).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 25 amino acid sequence of residues 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1325 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 30 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 129 (SEQ ID NO:226).
In another embodiment, the invention provides isolated PRO 1325 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PRO1325 polypeptide, which in 35 certain embodiments, includes an amino acid sequence comprising residues 1 or about 19 to about 832 of Figure 130 (SEQ ID NO:227).
In another aspect, the invention concerns an isolated PRO1325 polypeptide, comprising an amino acid
170
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227).
In a further aspect, the invention concerns an isolated PRO1325 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 5 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227).
In yet another aspect, the invention concerns an isolated PRO 1325 polypeptide, comprising the sequence of amino acid residues 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227), or a fragment thereof sufficient to provide a binding site for an anti-PROl325 antibody. Preferably, the PR01325 fragment 10 retains a qualitative biological activity of a native PR01325 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1325 polypeptide having the sequence of amino acid residues from about 1 or about 19 to about 832, inclusive of Figure 130 (SEQ ID NO:227), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 15 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
66. PRQ1340
A cDNA clone (DN A66663-1598) has been identified that encodes a novel polypeptide having homology to Ksp-cadherin and designated in the present application as "PR01340."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01340 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1340 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 807, inclusive of Figure 132 (SEQ ID NO:229), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01340
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 182 and about 2548, inclusive, of Figure 131 (SEQ ID NO:228). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 35 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203268
171
(DNA66663-1598), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203268 (DNA66663-1598).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 5 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 19 to about 807, inclusive of Figure 132 (SEQ ID NO:229), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 10 under stringent conditions with (a) a DNA molecule encoding a PRO1340 polypeptide having the sequence of amino acid residues from about 19 to about 807, inclusive of Figure 132 (SEQ ID NO:229), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 15 In a specific aspect, the invention provides an isolated nucleic acid molecuie comprising DNA encoding a PRO1340 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 18 in the sequence of Figure 132 (SEQ ID NO:229). The transmembrane 20 domain has been tentatively identified as extending from about amino acid position 762 to about amino acid position 784 in the PR01340 amino acid sequence (Figure 132, SEQ ID NO:229).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 25 amino acid sequence of residues 19 to about 807, inclusive of Figure 132 (SEQ ID NO:229), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1340 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 30 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01340 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01340 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 19 to 807 of Figure 132 (SEQ ID NO:229). 35 In another aspect, the invention concerns an isolated PR01340 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the
172
PCT/US99/201H
sequence of amino acid residues 19 to about 807, inclusive of Figure 132 (SEQ ID NO:229).
In a further aspect, the invention concerns an isolated PR01340 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 19 to 807 of Figure 132 (SEQ ID NO:229).
In yet another aspect, the invention concerns an isolated PRO1340 polypeptide, comprising the sequence of amino acid residues 19 to about 807, inclusive of Figure 132 (SEQ ID NO:229), or a fragment thereof sufficient to provide a binding site for an anti-PR01340 antibody. Preferably, the PR01340 fragment retains a qualitative biological activity of a native PRO 1340 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 10 molecule under stringent conditions with (a) a DNA molecule encoding a PRO1340 polypeptide having the sequence of amino acid residues from about 19 to about 807, inclusive of Figure 132 (SEQ ID N0.229), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 15 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01340 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01340 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 20 native PR01340 polypeptide, by contacting the native PR01340 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1340 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
67. PRQ1339
A cDNA clone (DNA66669-1597) has been identified that encodes a novel polypeptide having sequence identity with carboxypepsidases and designated in the present application as "PR01339."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01339 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01339 polypeptide having the sequence of amino acid residues from 1 or about 17 to about 421, inclusive of Figure 134 (SEQ IDN0:234), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01339
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 58 and about 1271, inclusive, of Figure 133 (SEQ ID N0:233). Preferably, hybridization occurs under stringent
173
hybridization and wash conditions.
In a farther aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203272 5 (DNA66669-1597), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203272 (DNA66669-1597).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 10 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 17 to about 421, inclusive of Figure 134 (SEQ ID NO:234), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 15 under stringent conditions with (a) a DNA molecule encoding a PR01339 polypeptide having the sequence of amino acid residues from about 17 to about 421, inclusive of Figure 134 (SEQ ID NO:234), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least abouta 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 20 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 17 to about 421, inclusive of Figure 134 (SEQ ID NO:234), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01339 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01339 polypeptide encoded by any of the 30 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01339 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 17 through 421 of Figure 134 (SEQ ID NO:234).
In another aspect, the invention concerns an isolated PRO 1339 polypeptide, comprising an amino acid 35 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 17 to about 421, inclusive of Figure 134 (SEQ ID N0:234).
174
PCT/US99/201U
In a further aspect, the invention concerns an isolated PRO 1339 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 17 through 421 of Figure 134 (SEQ ID NO:234).
In yet another aspect, the invention concerns an isolated PRO1339 polypeptide, comprising the sequence 5 of amino acid residues 17 to about 421, inclusive of Figure 134 (SEQ ID NO:234), or a fragment thereof sufficient to provide a binding site for an anti-PR01339 antibody. Preferably, the PR01339 fragment retains a qualitative biological activity of a native PR01339 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1339 polypeptide having the 10 sequence of amino acid residues from about 17 to about 421, inclusive of Figure 134 (SEQ ID NO:234), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 15 polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01339 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01339 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01339 polypeptide, by contacting the native PR01339 polypeptide with a candidate molecule and 20 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1339 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
68. PRQ1337
A cDNA clone (DNA66672- I586)has been identified that encodes a novel polypeptide having homology to human thyroxine-binding globulin designated in die present application as "PR01337".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1337 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 30 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01337 polypeptide having the sequence of amino acid residues from 1 or about 21 to about 417, inclusive of Figure 136 (SEQ ID NO:236), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROI337 35 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 120 and about 1310, inclusive, of Figure 135 (SEQ ID NO:235). Preferably, hybridization occurs under stringent hybridization and wash conditions.
175
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203265 (DNA66672-66672), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 5 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203265 (DNA66672-66672).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 10 identity to the sequence of amino acid residues from about 21 to about 417, inclusive of Figure 136 (SEQ ID NO:236), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1337 polypeptide having the sequence of 15 amino acid residues from about 21 to about 417, inclusive of Figure 136 (SEQ ID NO:236), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecuie.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 20 a PR01337 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 20 in the sequence of Figure 136 (SEQ ID NO:236).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 to about 417, inclusive of Figure 136 (SEQ ID NO:236), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01337 polypeptide coding sequence that may find 30 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1337 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01337 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 21 to 417 of Figure 136 (SEQ ID NO:236).
In another aspect, the invention concerns an isolated PRO1337 polypeptide, comprising an amino acid
176
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 21 to about 417, inclusive of Figure 136 (SEQ ID NO:236).
In a further aspect, the invention concerns an isolated PRO 1337 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 5 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 21 to 417 of Figure 136 (SEQ ID NO:236).
In yet another aspect, the invention concerns an isolated PRO1337 polypeptide, comprising the sequence of amino acid residues 21 to about 417, inclusive of Figure 136 (SEQ ID NO:236), or a fragment thereof sufficient to provide a binding site for an anti-PR01337 antibody. Preferably, the PRO 1337 fragment retains 10 a qualitative biological activity of a native PR01337 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01337 polypeptide having the sequence of amino acid residues from about 21 to about 417, inclusive of Figure 136 (SEQ ID NO:236), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 15 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01337 20 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01337 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01337 polypeptide, by contacting the native PR01337 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01337 polypeptide, 25 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
69. PRQ1342
A cDNA clone (DNA66674-1599) has been identified that encodes a novel transmembrane polypeptide designated in the present application as "PR01342".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01342 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01342 polypeptide having 35 the sequence of amino acid residues from 1 or about 21 to about 596, inclusive of Figure 138 (SEQ ID NO:243), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01342
177
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 299 and about 2026, inclusive, of Figure 137 (SEQ ID NO:242). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 5 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203281 (DNA66674-1599), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203281 (DNA66674-1599).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 21 to about 596, inclusive of Figure 138 (SEQ ID NO:243), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01342 polypeptide having the sequence of amino acid residues from about 21 to about 596, inclusive of Figure 138 (SEQ ID NO:243), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 20 preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01342 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants (i.e. transmembrane domain deleted or inactivated), or is complementary to such encoding 25 nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 20 in the sequence of Figure 138 (SEQ ID NO:243). The transmembrane domain has been tentatively identified as extending from about amino acid position 510 to about amino acid position 532 in the PR01342 amino acid sequence (Figure 138, SEQ ID NO:243).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 30 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 to about 596, inclusive of Figure 138 (SEQ ID NO:243), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1342 polypeptide coding sequence that may find 35 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
178
In another embodiment, the invention provides isolated PRO1342 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1342 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 21 to 596 of Figure 138 (SEQ ID N0.243).
In another aspect, the invention concerns an isolated PR01342 polypeptide, comprising an amino acid 5 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 21 to about 596, inclusive of Figure 138 (SEQ ID NO:243).
In a further aspect, the invention concerns an isolated PR01342 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 10 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 to 596 of Figure 138 (SEQ ID N0:243).
In yet another aspect, the invention concerns an isolated PRO1342 polypeptide, comprising the sequence of amino acid residues 21 to about 596, inclusive of Figure 138 (SEQ ID NO:243), or a fragment thereof sufficient to provide a binding site for an anti-PR01342 antibody. Preferably, the PR01342 fragment retains 15 a qualitative biological activity of a native PR01342 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01342 polypeptide having the sequence of amino acid residues from about 21 to about 596, inclusive of Figure 138 (SEQ ID NO:243), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 20 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
70. PRQ1343
A cDNA clone (DNA66675-1587) has been identified that encodes a novel secreted polypeptide, designated in the present application as "PR01343".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01343 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01343 polypeptide having the sequence of ammo acid residues from about 1 or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1343
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 71 or about 146 and about 811, inclusive, of Figure 139(SEQIDNO:247). Preferably, hybridization occurs under
179
stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203282 5 (DNA66675-1587) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203282 (DNA66675-1587).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 10 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 15 molecule encoding a PRO 1343 polypeptide having the sequence of amino acid residues from 1 or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01343 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 25 in the sequence of Figure 140 (SEQ ID NO:248).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1343 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 139 (SEQ ID NO:247).
In another embodiment, the invention provides isolated PRO1343 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01343 polypeptide, which in
180
PCT/US99/201U
certain embodiments, includes an amino acid sequence comprising residues 1 or about 26 to about 247 of Figure 140 (SEQ ID NO:248).
In another aspect, the invention concerns an isolated PR01343 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 5 sequence of amino acid residues 1 or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248).
In a further aspect, the invention concerns an isolated PROI343 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues I or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248).
In yet another aspect, the invention concerns an isolated PR01343 polypeptide, comprising the sequence of amino acid residues 1 or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248), or a fragment thereof sufficient to provide a binding site for an anti-PR01343 antibody. Preferably, the PR01343 fragment retains a qualitative biological activity of a native PRO1343 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 15 molecule under stringent conditions with (a) a DNA molecule encoding a PR01343 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 247, inclusive of Figure 140 (SEQ ID NO:248), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host 20 cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
71. PROI48Q
A cDNA clone (DNA67962-1649) has been identified that encodes a novel polypeptide having homology 25 to Semaphorin C and designated in the present application as "PRO1480."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01480 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 30 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1480 polypeptide having the sequence of amino acid residues from about 1 to about 837, inclusive of Figure 142 (SEQ ID NO:253), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01480 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 241 and 35 about 2751, inclusive, of Figure 141 (SEQ ID NO:252). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
181
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203291 (DNA67962-1649), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 5 Deposit No. 203291 (DNA67962-1649).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 837, inclusive of Figure 142 (SEQ ID 10 NO:253), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01480 polypeptide having the sequence of amino acid residues from about 1 to about 837, inclusive of Figure 142 (SEQ ID NO:253), or (b) the 15 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01480 polypeptide, its soluble variants, (i.e. transmembrane domains deleted or inactivated) or is 20 complementary to such encoding nucleic acid molecule. Transmembrane domains have been tentatively identified as extending from about amino acid position 23 to about amino acid position 46 (type II) and about amino acid position 718 to about amino acid position 738 in the PR01480 amino acid sequence (Figure 142, SEQ ID NO:253).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 837, inclusive of Figure 142 (SEQ ID NO:253), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1480 polypeptide coding sequence that may find 30 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1480 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1480 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 837 of Figure 142 (SEQ ID NO:253).
In another aspect, the invention concerns an isolated PRO 1480 polypeptide, comprising an amino acid
182
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 837, inclusive of Figure 142 (SEQ ID NO:253).
In a further aspect, the invention concerns an isolated PR01480 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 5 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to 837 of Figure 142 (SEQ ID NO:253).
In yet another aspect, the invention concerns an isolated PRO1480 polypeptide, comprising the sequence of amino acid residues 1 to about 837, inclusive of Figure 142 (SEQ ID NO:253), or a fragment thereof sufficient to provide a binding site for an anti-PR01480 antibody. Preferably, the PR01480 fragment retains 10 a qualitative biological activity of a native PRO1480 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01480 polypeptide having the sequence of amino acid residues from about 1 to about 837, inclusive of Figure 142 (SEQ ID NO:253), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 15 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01480 20 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01480 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR0148Q polypeptide, by contacting the native PR01480 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1480 polypeptide, 25 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
72. PRQ1487
A cDNA clone (DNA68836-1656) has been identified that encodes a novel polypeptide having homology to fringe protein and designated in the present application as "PR01487".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1487 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01487 polypeptide having 35 the sequence of amino acid residues from 1 or about 24 to about 802, inclusive of Figure 144 (SEQ ID N0:260), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1487
183
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 558 and about 2894, inclusive, of Figures 143A-B (SEQ ID NO:259). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 5 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203455 (DNA68836-1656), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203455 (DNA68836-1656).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 24 to about 802, inclusive of Figure 144 (SEQ ID N0:260), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1487 polypeptide having the sequence of amino acid residues from about 24 to about 802, inclusive of Figure 144 (SEQ ID N0:260), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 20 preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecuie.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01487 polypeptide, with or without the N-terminal signal sequence and/or the initiating, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 25 extending from amino acid position 1 through about amino acid position 23 in the sequence of Figure 144 (SEQ ID N0:260).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 30 amino acid sequence of residues 24 to about 802, inclusive of Figure 144 (SEQ ID N0:260), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1487 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 35 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1487 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
184
In a specific aspect, the invention provides isolated native sequence PRO1487 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 24 to 802 of Figure 144 (SEQ ID N0:260).
In another aspect, the invention concerns an isolated PRO 1487 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 5 sequence of amino acid residues 24 to about 802, inclusive of Figure 144 (SEQ ID N0:260).
In a further aspect, the invention concerns an isolated PRO1487 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 24 to 802 of Figure 144 (SEQ ID N0:260).
In yet another aspect, the invention concerns an isolated PRO 1487 polypeptide, comprising the sequence of amino acid residues 24 to about 802, inclusive of Figure 144 (SEQ ID N0:260), or a fragment thereof sufficient to provide a binding site for an anti-PR01487 antibody. Preferably, the PR01487 fragment retains a qualitative biological activity of a native PRO1487 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 15 molecule under stringent conditions with (a) a DNA molecule encoding a PRO1487 polypeptide having the sequence of amino acid residues from about 24 to about 802, inclusive of Figure 144 (SEQ ID N0:260), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 20 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01487 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01487 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 25 native PR01487 polypeptide, by contacting the native PR01487 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01487 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
73. PRQ1418
A cDNA clone (DNA68864-1629) has been identified that encodes a novel secreted polypeptide designated in the present application as "PRO 1418."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01418 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01418 polypeptide having
185
the sequence of amino acid residues from 1 or about 20 to about 350, inclusive of Figure 146 (SEQ ID NO:265), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concents an isolated nucleic acid molecule encoding a PR01418 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 195 and about 1187, inclusive, of Figure 145 (SEQ ID NO:264). Preferably, hybridization occurs under stringent 5 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203276 10 (DNA68864-1629), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203276 (DNA68864-1629).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 15 identity, moTe preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 350, inclusive of Figure 146 (SEQ ID NO:265), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 20 under stringent conditions with (a) a DNA molecule encoding a PR01418 polypeptide having the sequence of amino acid residues from about 20 to about 350, inclusive of Figure 146 (SEQ ID NO:265), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 25 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 to about 350, inclusive of Figure 146 (SEQ ID NO:265), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01418 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01418 polypeptide encoded by any of the 35 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01418 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 through 350 of Figure 146 (SEQ ID
186
NO:265).
In another aspect, the invention concerns an isolated PR01418 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 83% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 20 to about 350, inclusive of Figure 146 (SEQ ID NO:265).
In a further aspect, the invention concerns an isolated PR01418 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 20 through 350 of Figure 146 (SEQ ID NO:265).
In yet another aspect, the invention concerns an isolated PRO 1418 polypeptide, comprising the sequence 10 of amino acid residues 20 to about 350, inclusive of Figure 146 (SEQ ID NO:265), or a fragment thereof sufficient to provide a binding site for an anti-PR01418 antibody. Preferably, the PR01418 fragment retains a qualitative biological activity of a native PR01418 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01418 polypeptide having the 15 sequence of amino acid residues from about 20 to about 350, inclusive of Figure 146 (SEQ ID NO:265), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identiiy, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 20 polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01418 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01418 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01418 polypeptide, by contacting the native PRO 1418 polypeptide with a candidate molecule and 25 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1418 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
74. PRQ1472
A cDN A clone (DN A68866-1644) has been identified that encodes a novel polypeptide having sequence identity with butyrophilin and designated in the present application as "PR01472."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1472 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 35 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1472 polypeptide having the sequence of amino acid residues from 1 or about 18 to about 466, inclusive of Figure 148 (SEQ ID NO:267),
187
or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01472 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 185 and about 1531, inclusive, of Figure 147 {SEQ ID NO:266). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203283 (DNA68866-1644), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 10 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203283 (DNA68866-1644).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 15 identity to the sequence of amino acid residues from about 18 to about 466, inclusive of Figure 148 (SEQ ID NO:267), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01472 polypeptide having the sequence of 20 amino acid residues from about 18 to about 466, inclusive of Figure 148 (SEQ ID NO:267), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 25 a PR01472 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domains deleted or inactivated variants, or is complementary to such encoding nucleic acid molecuie. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 1-17 in the sequence of Figure 148 (SEQ ID NO:267). The transmembrane domains have been tentatively identified as being from about amino acid position 131 through about amino acid 30 position 150 and from about amino acid position 235 through about amino acid position 259 in the PRO 1472 amino acid sequence (Figure 148, SEQ ID NO:267). It is understood that PR01472 can be manipulated to contain only particular regions given the information herein, e.g. to have only the extracellular or cytoplasmic regions only, or to have the carboxyl end truncated wherein the second transmembrane domain is deleted.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 35 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 to about 466, inclusive of Figure 148 (SEQ ID NO:267), or (b) the
188
PCTAJS99/20111
complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1472 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01472 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1472 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 18 through 466 of Figure 148 (SEQ ID N0:267).
In another aspect, the invention concerns an isolated PR01472 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 18 to about 466, inclusive of Figure 148 (SEQ ID NO:267).
In a further aspect, the invention concerns an isolated PR01472 polypeptide, comprising an amino acid 15 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 through 466 of Figure 148 (SEQ ID NO:267).
In yet another aspect, the invention concerns an isolated PR01472polypeptide, comprising the sequence of amino acid residues 18 to about 466, inclusive of Figure 148 (SEQ ID NO:267), or a fragment thereof 20 sufficient to provide a binding site for an anti-PR01472 antibody. Preferably, the PRO1472 fragment retains a qualitative biological activity of a native PRO1472 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1472 polypeptide having the sequence of amino acid residues from about 18 to about 466, inclusive of Figure 148 (SEQ ID NO:267), or (b) 25 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01472
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01472 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01472 polypeptide, by contacting the native PR01472 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01472 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
189
75. PROl 461
A cDNA clone (DNA68871-1638) has been identified that encodes a novel polypeptidehaving homology to serine protease and designated in the present application as "PR01461".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01461 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR0146I polypeptide having the sequence of amino acid residues from about 1 to about 423, inclusive of Figure 150 (SEQ ID NO:269), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01461
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 32 and about 1300, inclusive, of Figure 149 (SEQ ID NO:268). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 15 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203280 (DNA68871-68871), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 20 Deposit No. 203280 (DNA68871-68871).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 423, inclusive of Figure 150 (SEQ ID 25 NO:269), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01461 polypeptide having the sequence of amino acid residues from about 1 to about 423, inclusive of Figure 150 (SEQ ID NO:269), or (b) the 30 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01461 polypeptide, with or without the initiating methionine, and its soluble variants (i.e. transmembrane 35 domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. A type II transmembrane domain has been tentatively identified as extending from about amino acid position 21 to about amino acid position 40 in the PR01461 amino acid sequence (Figure 150, SEQ ID NO:269).
190
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 423, inclusive of Figure 150 (SEQ ID NO:269), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 2461 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01461 polypeptide encoded by any of the 10 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1461 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 423 of Figure 150 (SEQ ID NO:269).
In another aspect, the invention concerns an isolated PR01461 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 15 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 423, inclusive of Figure 150 (SEQ ID NO:269).
In a further aspect, the invention concerns an isolated PR01461 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 20 of residues 1 to 423 of Figure 150 (SEQ ID NO:269).
in yet another aspect, the invention concerns an isolated PR01461 polypeptide, comprising the sequence of amino acid residues 1 to about 423, inclusive of Figure 150 (SEQ ID NO:269), or a fragment thereof sufficient to provide a binding site for an anti-PR01461 antibody. Preferably, the PR01461 fragment retains a qualitative biological activity of a native PR01461 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01461 polypeptide having the sequence of amino acid residues from about 1 to about 423, inclusive of Figure 150 (SEQ ID NO:269), or (b) the complement of the DNA molecuie of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 30 identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1461 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01461 antibody. 35 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01461 polypeptide, by contacting the native PR01461 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
191
In a still further embodiment, the invention concerns a composition comprising a PRO 1461 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
76. PRQ1410
A cDNA clone (DNA68874-1622) has been identified that encodes a novel transmembrane polypeptide, 5 designated in the present application as "PR01410".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01410 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 10 preferably at least about 95 % sequence identiiy to (a) a DNA molecule encoding a PR01410 polypeptide having the sequence of amino acid residues from about 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID NO:271), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1410 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 152 15 orabout212andabout865, inclusive, ofFigure 151 (SEQ ID N0:270). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 20 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203277 (DNA68874-1622) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203277 (DNA68874-1622).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID N0:271), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 30 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01410 polypeptide having the sequence of amino acid residues from 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID NO:271), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 35 identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01410 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and
192
its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 20 in the sequence of Figure 152 (SEQ ID NO:271). The transmembrane domain has been tentatively identified as extending from about amino acid position 194 to about amino acid position 220 in the PROI410 amino acid sequence (Figure 152, SEQ ID NO:271).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID NO:271), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1410 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 151 (SEQ ID N0.270).
In another embodiment, the invention provides isolated PR01410 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01410 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 21 to about 238 of Figure 152 (SEQ ID NO:271).
In another aspect, the invention concerns an isolated PRO 1410 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID NO:271).
In a further aspect, the invention concerns an isolated PR01410 polypeptide, comprising an amino acid 25 sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID N0.271).
In yet another aspect, the invention concerns an isolated PR01410 polypeptide, comprising the sequence of amino acid residues 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID NO:271), or a fragment 30 thereof sufficient to provide a binding site for an anti-PROI410 antibody. Preferably, the PR01410 fragment retains a qualitative biological activity of a native PR01410 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01410 polypeptide having the sequence of amino acid residues from about 1 or about 21 to about 238, inclusive of Figure 152 (SEQ ID 35 NO:271), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host
193
ceil comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1410 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01410 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 5 native PR01410 polypeptide by contacting the native PROI410 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1410 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
77. PRQ1568
A cDNA clone (DNA68880-1676) has been identified that encodes a novel polypeptide having sequence identity with tetraspanins and designated in the present application as "PR01568."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1568 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1568 polypeptide having the sequence of amino acid residues from 1 or about 34 to about 305, inclusive ofFigure 154 (SEQ ID NO:273), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1568
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 307 and about 1122, inclusive, of Figure 153 (SEQ ID NO:272). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 25 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably al least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203319 (DNA68880-1676), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 30 Deposit No. 203319 (DNA68880-1676).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 34 to about 305, inclusive of Figure 154 (SEQ ID 35 NO:273), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule
194
under stringent conditions with (a) a DNA molecule encoding a PR01568 polypeptide having the sequence of amino acid residues from about 34 to about 305, inclusive of Figure 154 (SEQ ID NO:273), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
S In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01568 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 33 in the sequence of Figure 154 (SEQ ID NO:273). The transmembrane 10 domains have been tentatively identified as extending from about amino acids 12-35,57-86,94-114 and 226-248 in the PR01568 amino acid sequence (Figure 154, SEQ ID NO:273).
In another aspect, the invention concerns an isolated nucleic acid molecuie comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the 15 amino acid sequence of residues 34 to about 305, inclusive of Figure 154 (SEQ ID NO:273), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1568 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 20 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01568 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1568 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 34 through 305 of Figure 154 (SEQ ID 25 NO:273).
In another aspect, the invention concerns an isolated PRO1568 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 34 to about 305, inclusive of Figure 154 (SEQ ID NO:273). 30 In a further aspect, the invention concerns an isolated PRO1568 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 34 through 305 of Figure 154 (SEQ ID N0.273).
In yet another aspect, the invention concerns an isolated PR01568 polypeptide, comprising the sequence 35 of amino acid residues 34 to about 305, inclusive of Figure 154 (SEQ ID N0:273), or a fragment thereof sufficient to provide a binding site for an anti-PR01568 antibody. Preferably, the PR01568 fragment retains a qualitative biological activity of a native PR01568 polypeptide.
195
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1568 polypeptide having the sequence of amino acid residues from about 34 to about 305, inclusive of Figure 154 (SEQ ID NO:273), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 5 identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01568 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01568 antibody. 10 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01568 polypeptide, by contacting the native PR01568 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01568 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
78. FR01570
A cDNA clone (DNA68885-1678) has been identified that encodes a novel polypeptide having sequence identity with SP60 and designated in the present application as "PR01570." In particular, for the first time, Applicants have identified an additional 199 amino acids on the amino terminal end of the protein previously 20 identified as SP60.
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01570 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 25 preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1570 polypeptide having the sequence of amino acid residues from about 1 to about 432, inclusive ofFigure 156 (SEQ ID N0:275), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01570 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 210 and 30 about 1505, inclusive, of Figure 155 (SEQ ID NO:274). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 35 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203311 (DNA68885-I678), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC
196
Deposit No. 203311 (DNA68885-1678).
In a stili further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 432, inclusive of Figure 156 (SEQ ID 5 NO:275), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01570 polypeptide having the sequence of amino acid residues from about 1 to about 432, inclusive of Figure 156 (SEQ ID NO:275), or (b) the 10 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. In a preferred embodiment, the probes provided herein are from the amino terminal end of the peptide identified in Figure 1, defined as amino acids 1-199 of SEQ ID NO:275.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01570 polypeptide, in a form which is secreted and is soluble, i.e. transmembrane domain deleted, truncated or inactivated variants.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 20 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 432, inclusive of Figure 156 (SEQ ID NO:275), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1570 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 25 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. Preferably, the probes are from the amino terminal end as provided herein.
In another embodiment, the invention provides isolated PRO1570 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1570 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 through 432 of Figure 156 (SEQ ID NO-.275).
In another aspect, the invention concerns an isolated PR01570 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 35 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 432, inclusive ofFigure 156 (SEQ ID NO:275).
In a further aspect, the invention concerns an isolated PRO 1570 polypeptide, comprising an amino acid
197
sequence scoring at least about 80% positives, preferably at least about 83% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 through 432 ofFigure 156 (SEQ ID NO:275).
In yet another aspect, the invention concerns an isolated PRO1570 polypeptide, comprising the sequence of amino acid residues 1 to about 432, inclusive of Figure 156 (SEQ ID NO:275), or a fragment thereof 5 sufficient to provide a binding site for an anti-PROl570 antibody. Preferably, the PRO1570 fragment retains a qualitative biological activity of a native PR01570 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01570 polypeptide having the sequence of amino acid residues from about 1 to about 432, inclusive of Figure 156 (SEQ ID NO:275), or (b) 10 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 83% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01570
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01570 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01570 polypeptide, by contacting the native PR01570 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1570 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
79. PROl 317
A cDNA clone (DNA71166-1685) has been identified that encodes a novel polypeptide having homology 25 to semaphorin B and designated in the present application as "PR01317".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01317 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 30 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01317 polypeptide having the sequence of amino acid residues from 1 or about 31 to about 761, inclusive of Figure 158 (SEQ ID NO:277), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01317 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 195 and 35 about 2387, inclusive, of Figure 157 (SEQ ID NO:276). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
198
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203355 (DNA71166-1685), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 5 Deposit No. 203355 (DNA71166-1685).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 31 to about 761, inclusive of Figure 158 (SEQ ID 10 NO:277), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01317 polypeptide having the sequence of amino acid residues from about 31 to about 761, inclusive of Figure 158 (SEQ ID NO:277), or (b) the 15 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01317 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 20 its soluble variants (i.e. transmembrane domains deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 30 in the sequence of Figure 158 (SEQ ID NO:277). Transmembrane domains have been tentatively identified as extending from about amino acid positions 13-31,136-156,222-247, 474-490, and 685-704 in the PR01317 amino acid sequence (Figure 158, SEQ ID NO:277). 25 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 31 to about 761, inclusive of Figure 158 (SEQ ID NO:277), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01317 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01317 polypeptide encoded by any of the 35 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1317 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 31 to 761 of Figure 158 (SEQ ID NO:277).
199
In another aspect, the invention concerns an isolated PRO 1317 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 31 to about 761, inclusive of Figure 158 (SEQ ID NO:277).
In a further aspect, the invention concerns an isolated PR01317 polypeptide, comprising an amino acid 5 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 31 to 761 ofFigure 158 (SEQ ID NO:277).
In yet another aspect, the invention concerns an isolated PR01317 polypeptide, comprising the sequence of amino acid residues 31 to about 761, inclusive of Figure 158 (SEQ ID NO:277), or a fragment thereof 10 sufficient to provide a binding site for an anti-PR01317 antibody. Preferably, the PR01317 fragment retains a qualitative biological activity of a native PR01317 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01317 polypeptide having the sequence of amino acid residues from about 31 to about 761, inclusive of Figure 158 (SEQ ID N0:277), or (b) 15 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01317
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01317 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01317 polypeptide, by contacting the native PR01317 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01317 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
80. PRQ1780
A cDN A clone (DN A71169-1709) has been identified that encodes a novel polypeptide having homology 30 to glucuronosyltransferase and designated in the present application as "PRO1780".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01780 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 3 5 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1780 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 523, inclusive ofFigure 160 (SEQ ID NO-.282), or (b) the complement of the DNA molecule of (a).
200
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1780 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 125 and about 1636, inclusive, of Figure 159 (SEQ ID N0.281). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 5 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203467 (DNA71169-1709), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 10 Deposit No. 203467 (DNA71169-1709).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 523, inclusive of Figure 160 (SEQ ID 15 NO:282), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1780 polypeptide having the sequence of amino acid residues from about 20 to about 523, inclusive of Figure 160 (SEQ ID NO:282), or (b) the complement of the 20 DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01780 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 25 its soluble variants (i.e. transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 19 in the sequence of Figure 160 (SEQ ID NO:282). The transmembrane domain has been tentatively identified as extending from about amino acid position 483 to about amino acid position 504 in the PR01780 amino acid sequence (Figure 160, SEQ ID NO:282).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 to about 523, inclusive of Figure 160 (SEQ ID NO:282), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01780 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50
201
nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01780 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01780 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 to 523 ofFigure 160 (SEQ ID NO:282).
In another aspect, the invention concerns an isolated PRO 1780 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 20 to about 523, inclusive ofFigure 160 (SEQ ID NO:282).
In a further aspect, the invention concerns an isolated PRO 1780 polypeptide, comprising an amino acid 10 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 to 523 of Figure 160 (SEQ ID NO:282).
In yet another aspect, the invention concerns an isolated PRO1780 polypeptide, comprising the sequence of amino acid residues 20 to about 523, inclusive of Figure 160 (SEQ ID NO:282), or a fragment thereof 15 sufficient to provide a binding site for an anti-PR01780 antibody. Preferably, the PR01780 fragment retains a qualitative biological activity of a native PR01780 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1780 polypeptide having the sequence of amino acid residues from about 20 to about 523, inclusive of Figure 160 (SEQ ID NO:282), or (b) 20 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecuie under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01780
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01780 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PROI780 polypeptide, by contacting the native PR01780 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1780 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
81. PRQ1486
A cDNA clone (DNA71180-1655) has been identified that encodes a novel polypeptide having sequence 35 identity with cerebellin, particularly precerebellin, and designated in the present application as "PR01486."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01486 polypeptide.
202
PCT/US99/201H
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01486 polypeptide having the sequence of amino acid residues from 1 or about 33 to about 205, inclusive ofFigure 162 (SEQ ID NO:287), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01486
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 568 and about 1086, inclusive, of Figure 161 (SEQ ID NO:286). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 10 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203403 (DNA71180-1655), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 15 Deposit No. 203403 (DNA71180-1655).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 33 to about 205, inclusive of Figure 162 (SEQ ID 20 N0:287), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01486 polypeptide having the sequence of amino acid residues from about 33 to about 205, inclusive of Figure 162 (SEQ ID NO:287), or (b) the 25 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 30 preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 33 to about 205, inclusive of Figure 162 (SEQ ID NO:287), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO I486 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 35 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01486 polypeptide encoded by any of the
203
isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01486 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 33 through 205 of Figure 162 (SEQ ID NO:287).
In another aspect, the invention concerns an isolated PRO 1486 polypeptide, comprising an amino acid 5 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 33 to about 205, inclusive of Figure 162 (SEQ ID NO:287).
In a further aspect, the invention concerns an isolated PRO1486 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 10 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 33 through 205 of Figure 162 (SEQ ID NO:287).
In yet another aspect, the invention concerns an isolated PRO 1486 polypeptide, comprising the sequence of amino acid residues 33 to about 205, inclusive of Figure 162 (SEQ ID NO:287), or a fragment thereof sufficient to provide a binding site for an anti-PR01486 antibody. Preferably, the PR01486 fragment retains 15 a qualitative biological activity of a native PR01486 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1486 polypeptide having the sequence of amino acid residues from about 33 to about 205, inclusive ofFigure 162 (SEQ ID NO:287), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 20 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1486 25 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01486 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01486 polypeptide, by contacting the native PR01486 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1486 polypeptide, 30 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
82. PRQ1433
A cDNA clone (DNA71184-1634) has been identified that encodes a novel transmembrane polypeptide, designated in the present application as "PR01433".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01433 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
204
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1433 polypeptide having the sequence of amino acid residues from about 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1433 5 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 185 and about 1348, inclusive, of Figure 163 (SEQ ID NO:291). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 10 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203266 (DNA71184-1634) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203266 (DNA7I184-1634).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 250
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01433 polypeptide having the sequence of amino acid residues from 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85 % sequence identity, more 25 preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01433 polypeptide, with or without the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane 30 domain has been tentatively identified as extending from about amino acid position 76 to about amino acid position 97 in the PR01433 amino acid sequence (Figure 164, SEQ ID NO:292).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 35 amino acid sequence of residues 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01433 polypeptide coding sequence that may find
205
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 163 (SEQ ID NO:291).
In another embodiment, the invention provides isolated PR01433 polypeptide encoded by any of the 5 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01433 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 to about 388 of Figure 164 (SEQ ID NO:292).
In another aspect, the invention concerns an isolated PR01433 polypeptide, comprising an amino acid 10 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292).
In a further aspect, the invention concerns an isolated PR01433 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 15 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292).
In yet another aspect, the invention concents an isolated PR01433 polypeptide, comprising the sequence of amino acid residues 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292), or a fragment thereof sufficient to provide a binding site for an anti-PR01433 antibody. Preferably, the PR01433 fragment retains 20 a qualitative biological activity of a native PR01433 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01433 polypeptide having the sequence of amino acid residues from about 1 to about 388, inclusive of Figure 164 (SEQ ID NO:292), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 25 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01433 30 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01433 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01433 polypeptide by contacting the native PR01433 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1433 polypeptide, 35 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
206
83. PRQ1490
A cDNA clone (DNA71213-1659) has been identified, having homology to nucleic acid encoding a 1-acyl-sn-glycerol-3-phosphate acyltransferase protein that encodes a novel polypeptide, designated in the present application as "PR01490".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PRO1490 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01490 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 368, inclusive of Figure 166 (SEQ ID 10 NO:297), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1490 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 272 or about 347 and about 1375, inclusive, of Figure 165 (SEQ ID NO:296). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203401 (DNA71213-1659) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 20 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203401 (DNA71213-1659).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 25 identity to the sequence of amino acid residues 1 or about 26 to about 368, inclusive of Figure 166 (SEQ ID NO:297), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 285 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01490 polypeptide having the sequence of amino acid residues from 1 or about 26 to 30 about 368, inclusive of Figure 166 (SEQ ID NO:297), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 35 a PR01490 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid
207
position 1 to about amino acid position 25 in the sequence ofFigure 166 (SEQ ID NO:297). The transmembrane domains have been tentatively identified as extending from about amino acid position 307 to about amino acid position 323 and from about amino acid position 335 to about amino acid position 352 in the PRO1490 amino acid sequence (Figure 166, SEQ ID NO:297).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 5 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 26 to about 368, inclusive of Figure 166 (SEQ ID NO:297), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1490 polypeptide coding sequence that may find 10 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 165 (SEQ ID NO:296).
In another embodiment, the invention provides isolated PR01490 polypeptide encoded by any of the 15 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01490 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 26 to about 368 ofFigure 166 (SEQ ID NO:297).
In another aspect, the invention concerns an isolated PRO1490 polypeptide, comprising an amino acid 20 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 26 to about 368, inclusive of Figure 166 (SEQ ID NO:297).
In a further aspect, the invention concerns an isolated PRO1490 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 25 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 26 to about 368, inclusive of Figure 166 (SEQ ID NO:297).
In yet another aspect, the invention concerns an isolated PR01490 polypeptide, comprising the sequence of amino acid residues 1 or about 26 to about 368, inclusive of Figure 166 (SEQ ID NO:297), or a fragment thereof sufficient to provide a binding site for an anti-PR01490 antibody. Preferably, the PR01490 fragment 30 retains a qualitative biological activity of a native PR01490 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01490 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 368, inclusive of Figure 166 (SEQ ID NO:297), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 35 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii)
208
recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1490 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01490 antibody.
In a further embodiment, the invendon concerns a method of identifying agonists or antagonists of a native PR01490 polypeptide by contacting the native PR01490 polypeptide with a candidate molecule and 5 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1490 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
84. PROl 482
A cDNA clone (DNA71234-1651) has been identified that encodes a novel secreted polypeptide,
designated in the present application as "PRO1482".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1482 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 15 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01482 polypeptide having the sequence of amino acid residues from about 1 or about 29 to about 143, inclusive of Figure 168 (SEQ ID N0:302), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1482 20 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 33 orabout 117 and about 461, inclusive, ofFigure 167(SEQIDNO:301). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 25 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203402 (DNA71234-1651) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203402 (DNA71234-1651).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 29 to about 143, inclusive of Figure 168 (SEQ ID N0:302), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 260
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1482 polypeptide having the sequence of amino acid residues from 1 or about 29 to
209
about 143, inclusive ofFigure 168 (SEQ ID N0:302), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PR01482 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 28 in the sequence of Figure 168 (SEQ ID N0:302).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 29 to about 143, inclusive of Figure 168 (SEQ ID N0:302), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01482polypeptide coding sequence that may find 15 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 167 (SEQ ID N0:301).
In another embodiment, the invention provides isolated PR01482 polypeptide encoded by any of the 20 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01482 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 29 to about 143 of Figure 168 (SEQ ID N0:302).
In another aspect, the invention concerns an isolated PR01482 polypeptide, comprising an amino acid 25 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 29 to about 143, inclusive of Figure 168 (SEQ ID N0:302).
In a further aspect, the invention concerns an isolated PRO 1482 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 30 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 29 to about 143, inclusive of Figure 168 (SEQ ID N0:302).
In yet another aspect, the invention concerns an isolated PR01482 polypeptide, comprising the sequence of amino acid residues 1 or about 29 to about 143, inclusive of Figure 168 (SEQ ID N0:302), or a fragment thereof sufficient to provide a binding site for an anti-PR01482 antibody. Preferably, the PR01482 fragment 35 retains a qualitative biological activity of a native PR01482 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01482 polypeptide having the
210
sequence of amino acid residues from about 1 or about 29 to about 143, inclusive of Figure 168 (SEQ ID N0:302), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 5 recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01482 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01482 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01482 polypeptide by contacting the native PR01482 polypeptide with a candidate molecule and 10 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01482 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a phanoaceutically acceptable carrier.
85. PRQ1446
A cDNA clone (DNA71277-1636) has been identified that encodes a novel secreted polypeptide designated in the present application as "PR01446."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1446 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 20 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01446 polypeptide having the sequence of amino acid residues from 1 or about 16 to about 109, inclusive ofFigure 170(SEQIDNO:304), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1446 25 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 197 and about 478, inclusive, of Figure 169 (SEQ ID N0:303). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 30 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203285 (DNA71277-1636), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203285 (DNA7I277-1636).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence
211
identity to the sequence of amino acid residues from about 16 to about 109, inclusive of Figure 170 (SEQ ID N0:304), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1446 polypeptide having the sequence of 5 amino acid residues from about 16 to about 109, inclusive of Figure 170 (SEQ ID N0:304), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 16 to about 109, inclusive of Figure 170 (SEQ ID N0:304), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01446 polypeptide coding sequence that may find 15 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1446 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01446polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 16 through 109 of Figure 170 (SEQ ID N0:304).
In another aspect, the invention concerns an isolated PR01446 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 25 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 16 to about 109, inclusive of Figure 170 (SEQ ID N0.304).
In a further aspect, the invention concerns an isolated PR01446 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 30 of residues 16 through 109 of Figure 170 (SEQ ID N0:304).
In yet another aspect, the invention concerns an isolated PR01446polypeptide, comprising the sequence of amino acid residues 16 to about 109, inclusive of Figure 170 (SEQ ID N0.304), or a fragment thereof sufficient to provide a binding site for an anti-PR01446 antibody. Preferably, the PR01446 fragment retains a qualitative biological activity of a native PR01446 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01446 polypeptide having the sequence of amino acid residues from about 16 to about 109, inclusive ofFigure 170 (SEQ ID N0:304), or (b)
212
PCTAJS99/20111
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01446
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01446 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01446 polypeptide, by contacting the native PR01446 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1446 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
86. PRQ1S58
A cDNA clone (DNA71282-1668) has been identified, having homology to nucleic acid encoding 15 methyltransferase enzymes that encodes a novel polypeptide, designated in the present application as "PR01558".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01558 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 20 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identiiy to (a) a DNA molecule encoding a PR01558 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 262, inclusive of Figure 172 (SEQ ID N0:306), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1558 25 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 84 or about 159 and about 869, inclusive, ofFigure 171 (SEQ ID N0:305). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 30 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203312 (DNA71282-1668) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203312 (DNA7I282-1668).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence
213
identiiy to the sequence of amino acid residues 1 or about 26 to about 262, inclusive of Figure 172 (SEQ ID N0:306), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR015S8 polypeptide having the sequence of amino acid residues from 1 or about 26 to 5 about 262, inclusive of Figure 172 (SEQ ID N0:306), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 10 a PR01558 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 25 in the sequence ofFigure 172 (SEQ ID N0:306). The transmembrane domains have been tentatively identified as extending from about amino acid position 8 to about amino acid 15 position 30 and from about amino acid position 109 to about amino acid position 130 in the PR01558 amino acid sequence (Figure 172, SEQ ID N0:306).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 20 amino acid sequence of residues 1 or about 26 to about 262, inclusive of Figure 172 (SEQ ID N0:306), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01558 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 25 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 171 (SEQ ID N0:305).
In another embodiment, the invention provides isolated PR01558 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PRO1558 polypeptide, which in 30 certain embodiments, includes an amino acid sequence comprising residues 1 or about 26 to about 262 of Figure 172 (SEQ ID N0:306
In another aspect, the invention concerns an isolated PR01558 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85 % sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 35 sequence of amino acid residues 1 or about 26 to about 262, inclusive of Figure 17(SEQ ID N0:306
In a further aspect, the invention concerns an isolated PR01558 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least
214
about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 26 to about 262, inclusive of Figure 17(SEQ ID N0:306
In yet another aspect, the invention concerns an isolated PR01558 polypeptide, comprising the sequence of amino acid residues 1 or about 26 to about 262, inclusive of Figure 172 (SEQ ID N0:306), or a fragment thereof sufficient to provide a binding site for an anti-PR01558 antibody. Preferably, the PR01558 fragment 5 retains a qualitative biological activity of a native PR01S58 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1558 polypeptide having the sequence of amino acid residues from about 1 or about 26 to about 262, inclusive of Figure 172 (SEQ ID N0.306), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 10 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1558 15 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01558 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01558 polypeptide by contacting the native PR01558 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01558 polypeptide, 20 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
87. PROl 604
A cDNA clone (DNA71286-1687)has been identified that encodes a novel polypeptide having homology to hepatoma-derived growth factor (HDGF) designated in the present application as "PRO1604". 25 In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01604 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1604 polypeptide having 30 the sequence of amino acid residues from 1 or about 14 to about 671, inclusive of Figure 174 (SEQ ID NO :308), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1604 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 104 and about 2077, inclusive, of Figure 173 (SEQ ID N0:307). Preferably, hybridization occurs under stringent 35 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least
215
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203357 (DNA71286-1687), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203357 (DNA71286-1687).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 14 to about 671, inclusive of Figure 174 (SEQ ID N0:308), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1604 polypeptide having the sequence of amino acid residues from about 14 to about 671, inclusive of Figure 174 (SEQ ID N0:308), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, 15 preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1604 polypeptide, with or without the N-terminal signal sequence, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 20 1 through about amino acid position 13 in the sequence of Figure 174 (SEQ ID N0:308).
In another aspect, the invention concents an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 14 to about 671, inclusive of Figure 174 (SEQ ID N0:308), or (b) the 25 complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01604 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 30 In another embodiment, the invention provides isolated PRO1604 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01604polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 14 to 671 of Figure 174 (SEQ ID N0:308).
In another aspect, the invention concerns an isolated PR01604 polypeptide, comprising an amino acid 35 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 14 to about 671, inclusive of Figure 174 (SEQ ID N0:308).
216
In a further aspect, the invention concerns an isolated PR01604 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 14 to 671 of Figure 174 (SEQ ID NO:308).
In yet another aspect, the invention concerns an isolated PR01604 polypeptide, comprising the sequence 5 of amino acid residues 14 to about 671, inclusive of Figure 174 (SEQ ID N0:308), or a fragment thereof sufficient to provide a binding site for an anti-PRQI604 antibody. Preferably, the PR01604 fragment retains a qualitative biological activity of a native PR01604 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1604 polypeptide having the 10 sequence of amino acid residues from about 14 to about 671, inclusive of Figure 174 (SEQ ID N0:308), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 15 polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PROl 604 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl604 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01604 polypeptide, by contacting the native PR01604 polypeptide with a candidate molecule and 20 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1604 polypeptide, or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceuticaliy acceptable carrier.
88. PRQ1491
A cDNA clone (DNA71883-1660) has been identified, having homology to nucleic acid encoding a collapsin protein, that encodes a novel polypeptide, designated in the present application as "PR01491".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01491 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 30 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01491 polypeptide having the sequence of amino acid residues from about 1 or about 37 to about 777, inclusive of Figure 176 (SEQ ID N0.310), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01491 35 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 107 or about 215 and about 2437, inclusive, of Figure 175 (SEQ ID N0:309). Preferably, hybridization occurs under stringent hybridization and wash conditions.
217
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203475 (DNA71883-1660) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 5 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203475 (DNA71883-1660).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 10 identity to the sequence of amino acid residues 1 or about 37 to about 777, inclusive of Figure 176 (SEQ ID N0:310), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 1,670 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01491 polypeptide having the sequence of amino acid residues from 1 or about 37 to 15 about 777, inclusive of Figure 176 (SEQ ID N0:310), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 20 a PR01491 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 36 in the sequence ofFigure 176 (SEQ ID N0:310).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 37 to about 777, inclusive of Figure 176 (SEQ ID N0:310), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01491 polypeptide coding sequence that may find 30 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 175 (SEQ ID N0:309).
In another embodiment, the invention provides isolated PR01491 polypeptide encoded by any of the 35 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01491 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 37 to about 777 of Figure
218
176 (SEQ ID N0:310).
In another aspect, the invention concerns an isolated PR01491 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 37 to about 777, inclusive of Figure 176 (SEQ ID N0r310).
In a further aspect, the invention concerns an isolated PR01491 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 37 to about 777, inclusive of Figure 176 (SEQ ID N0:310).
In yet another aspect, the invention concerns an isolated PRO 1491 polypeptide, comprising the sequence 10 of amino acid residues 1 or about 37 to about 777, inclusive of Figure 176 (SEQ ID N0:310), or a fragment thereof sufficient to provide a binding site for an anti-PR01491 antibody. Preferably, the PROI491 fragment retains a qualitative biological activity of a native PRO 1491 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01491 polypeptide having the 15 sequence of amino acid residues from about 1 or about 37 to about 777, inclusive of Figure 176 (SEQ ID N0:310), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 20 recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1491 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROI491 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO 1491 polypeptide by contacting the native PRO 1491 polypeptide with a candidate molecule and 25 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01491 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
89. PRQ1431
A cDNA clone (DNA73401-1633) has been identified having a domain with homology to SH3 that encodes a novel polypeptide, which has been designated in the present application as "PR0143P.
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01431 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 35 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1431 polypeptide having the sequence of amino acid residues from about 1 to about 370, inclusive of Figure 178 (SEQ ID NO:315) or
219
PCT/US99/201I1
(b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1431 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between residues 1 to about 133S and about 1560 to about 3934, inclusive, of Figure 177 (SEQ ID NO:314). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns (a) an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203273 (DNA73401-1633) or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 10 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203273 (DNA73401-1633).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 15 identity to the sequence of amino acid residues from about 1 to about 370, inclusive, of Figure 178 (SEQ ID NO:315), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 15 nucleotides that is produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01431 polypeptide having the sequence of amino acid residues from about 1 to about 20 370, inclusive, of Figure 178 (SEQ ID NO:315), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 370, inclusive, of Figure 178 (SEQ ID NO-.315), inclusive, ofFigure 178 (SEQ ID NO:315).
In another embodiment, the invention provides isolated PR01431 polypeptide encoded by any of the 30 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01431 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 370, inclusive, of Figure 178 (SEQ ID NO-.315).
In another aspect, the invention concerns an isolated PR01431 polypeptide, comprising an amino acid 35 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 370, inclusive, of Figure 178 (SEQ ID NO:315).
220
In a further aspect, the invention concerns an isolated PRO 1431 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to 370 ofFigure 178 (SEQ ID NO:315).
In yet another aspect, the invention concerns an isolated PR01431 or PR01432 polypeptide, comprising 5 the sequence of amino acid residues 1 to about 370, inclusive, or Figure 178 (SEQ ID N0.315), inclusive, of Figure 178 (SEQ ID N0:315), or a fragment thereof sufficient to provide a binding site for an anti-PR01431 antibody. Preferably, the PR0I431 fragment retains a qualitative biological activity of a native PR01431 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 10 molecule under stringent conditions with (a) a DNA molecule encoding a PR01431 polypeptide having the sequence of amino acid residues from about 1 to about 370, inclusive, of Figure 178 (SEQ ID N0:315), inclusive, of Figure 178 (SEQ ID N0:315), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at ieast about a 95% sequence identity 15 to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1431 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01431 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 20 native PR01431 polypeptide, by contacting the native PR01431 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01431 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
90. PRQ1563
A cDNA clone (DNA73492-1671) has been identified, having homology to nucleic acid encoding ADAMTS-1 that encodes a novel polypeptide, designated in the present application as "PR01563".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01563 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1563 polypeptide having the sequence of amino acid residues from about 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID NO:317), or (b) the complement of the DNA molecule of (a).
in another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01563
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 419 or about 563 and about 2929, inclusive, of Figures 179A-B (SEQ ID NO:316). Preferably, hybridization occurs
221
under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203324 5 (DNA73492-1671) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203324 (DNA73492-1671).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 10 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID N0.-317), or (b) the complement of the DNA of (a).
In a farther aspect, the invention concerns an isolated nucleic acid molecule having at least 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 15 molecule encoding a PR01563 polypeptide having the sequence of amino acid residues from 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID N0:317), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1563 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 48 in the sequence of Figure 180 (SEQ ID NO:317).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID NO:3I7), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1563 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figures 179 A-B (SEQ ID N0.316).
In another embodiment, the invention provides isolated PR01563 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01563 polypeptide, which in
222
certain embodiments, includes an amino acid sequence comprising residues 1 or about 49 to about 837 of Figure 180 (SEQ ID NO:317).
In another aspect, the invention concerns an isolated PR01563 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 5 sequence of amino acid residues 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID N0:3I7).
In a further aspect, the invention concerns an isolated PRO1563 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID N0:317).
In yet another aspect, the invention concerns an isolated PRO1563 polypeptide, comprising the sequence of amino acid residues 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID NO:317), or a fragment thereof sufficient to provide a binding site for an anti-PR01563 antibody. Preferably, the PR01563 fragment retains a qualitative biological activity of a native PR01563 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 15 molecule under stringent conditions with (a) a DNA molecule encoding a PR01563 polypeptide having the sequence of amino acid residues from about 1 or about 49 to about 837, inclusive of Figure 180 (SEQ ID NO:317), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host 20 cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01563 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01563 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 25 native PR01563 polypeptide by contacting the native PR01563 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1563 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
91. PROl 565
A cDNA clone (DNA73727-1673) has been identified, having homology to nucleic acid encoding a chondromodulin protein that encodes a novel polypeptide, designated in the present application as "PR01565".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1565 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01565 polypeptide having
223
the sequence of amino acid residues from about 1 or about 41 to about 317, inclusive of Figure 182 (SEQ ID NO:322), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01565 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 59 or about 179 and about 1009, inclusive, of Figure 181 (SEQ ID NO:321). Preferably, hybridization occurs 5 under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203459 10 (DNA73727-1673) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203459 (DNA73727-1673).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 15 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 41 to about 317, inclusive of Figure 182 (SEQ ID NO:322), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 410 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 20 molecule encoding a PRO1565 polypeptide having the sequence of amino acid residues from 1 or about 41 to about 317, inclusive ofFigure 182 (SEQ ID NO:322), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01565 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 40 in the sequence ofFigure 182 (SEQ ID N0.322). The transmembrane 30 domain has been tentatively identified as extending from about amino acid position 25 to about amino acid position 47 in the PR01565 amino acid sequence (Figure 182, SEQ ID NO:322).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 35 amino acid sequence of residues 1 or about 41 to about 317, inclusive of Figure 182 (SEQ ID NO:322), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01565 polypeptide coding sequence that may find
224
PCT/US99/20I11
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 181 (SEQ ID NO:321).
In another embodiment, the invention provides isolated PRO1565 polypeptide encoded by any of the 5 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PRO1565 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 41 to about 317 ofFigure 182 (SEQ ID NO:322).
In another aspect, the invention concerns an isolated PROI565 polypeptide, comprising an amino acid 10 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 41 to about 317, inclusive of Figure 182 (SEQ ID NO:322).
In a further aspect, the invention concerns an isolated PR01565 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 15 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 41 to about 317, inclusive of Figure 182 (SEQ ID N0:322).
In yet another aspect, the invention concerns an isolated PR01565 polypeptide, comprising the sequence of amino acid residues 1 or about 41 to about 317, inclusive ofFigure 182 (SEQ ID NO:322), or a fragment thereof sufficient to provide a binding site for an anti-PR01565 antibody. Preferably, the PR01565 fragment 20 retains a qualitative biological activity of a native PRO1565 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01565 polypeptide having the sequence of amino acid residues from about 1 or about 41 to about 317, inclusive of Figure 182 (SEQ ID NO:322), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 25 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01565 30 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01565 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01565 polypeptide by contacting the native PR01565 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01565 polypeptide, 35 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
225
92. PRQ1571
A cDNA clone (DNA73730-1679) has been identified, having homology to nucleic acid encoding the Clostridium perfringens enterotoxin receptor (CPE-R) that encodes a novel polypeptide, designated in the present application as "PR01571".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PR01571 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1571 polypeptide having the sequence of amino acid residues from about 1 or about 22 to about 239, inclusive of Figure 184 (SEQ ID 10 NO:324), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1571 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 90 or about 153 and about 806, inclusive, ofFigure 183 (SEQ ID NO:323). Preferably, hybridization occurs under stringent hybridization and wash conditions.
IS In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203320 (DNA73730-1679) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 20 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203320 (DNA73730-1679).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 25 identity to the sequence of amino acid residues I or about 22 to about 239, inclusive of Figure 184 (SEQ ID NO:324), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 910 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01571 polypeptide having the sequence of amino acid residues from 1 or about 22 to 30 about 239, inclusive of Figure 184 (SEQ ID NO:324), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 35 a PR01571 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid
226
position 1 to about amino acid position 21 in the sequence of Figure 184 (SEQ IDNO:324). The transmembrane domains have been tentatively identified as extending from about amino acid position 82 to about amino acid position 103, from about amino acid position 1 IS to about amino acid position 141 and from about amino acid position 160 to about amino acid position 182 in the PR01571 amino acid sequence (Figure 184, SEQ ID NO:324).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 22 to about 239, inclusive of Figure 184 (SEQ ID NO:324), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01571 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 183 (SEQ ID N0.323).
In another embodiment, the invention provides isolated PRO 1571 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01571 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 22 to about 239 of Figure 184 (SEQ ID N0:324).
In another aspect, the invention concerns an isolated PR01571 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 22 to about 239, inclusive of Figure 184 (SEQ ID N0.324).
In a further aspect, the invention concerns an isolated PR01571 polypeptide, comprising an amino acid 25 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 22 to about 239, inclusive of Figure 184 (SEQ ID NO:324).
In yet another aspect, the invention concerns an isolated PRO 1571 polypeptide, comprising the sequence of amino acid residues 1 or about 22 to about 239, inclusive of Figure 184 (SEQ ID NO:324), or a fragment 30 thereof sufficient to provide a binding site for an anti-PR01571 antibody. Preferably, the PR01571 fragment retains a qualitative biological activity of a native PRO 1571 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01571 polypeptide having the sequence of amino acid residues from about 1 or about 22 to about 239, inclusive of Figure 184 (SEQ ID 35 N0:324), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host
227
cell comprising Che test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01S71 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01571 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 5 native PR01571 polypeptide by contacting the native PR01571 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1571 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
93. PRQ1572
A cDNA clone (DNA73734-1680) has been identified that encodes a novel polypeptide having sequence identity with CPE-R and designated in the present application as "PR01572."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01572 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1572 polypeptide having the sequence of amino acid residues from 1 or about 24 to about 261, inclusive of Figure 186 (SEQ ID NO:326), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1572
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 159 and about 872, inclusive, of Figure 185 (SEQ ID NO:325). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 25 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203363 (DNA73734-1680), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 30 Deposit No. 203363 (DNA73734-1680).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 24 to about 261, inclusive of Figure 186 (SEQ ID 35 NO:326), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule
228
under stringent conditions with (a) a DNA molecule encoding a PR01572 polypeptide having the sequence of amino acid residues from about 24 to about 261, inclusive of Figure 186 (SEQ ID NO:326), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01572 polypeptide, with or without the N-tenninal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 23 in the sequence of Figure 186 (SEQ ID NO:326). The transmembrane 10 domains have been tentatively identified as approximately at about 81-100,121-141 and 173-194 in the PR01572 amino acid sequence (Figure 186, SEQ ID NO:326).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 15 amino acid sequence of residues 24 to about 261, inclusive of Figure 186 (SEQ ID NO:326), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01572 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 20 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01572 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01572 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 24 through 261 of Figure 186 (SEQ ID 25 NO-.326).
In another aspect, the invention concerns an isolated PR01572 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 24 to about 261, inclusive of Figure 186 (SEQ ID NO:326). 30 In a further aspect, the invention concerns an isolated PR01572 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 24 through 261 of Figure 186 (SEQ ID N0:326).
In yet another aspect, the invention concerns an isolated PR01572 polypeptide, comprising the sequence 35 of amino acid residues 24 to about 261, inclusive of Figure 186 (SEQ ID NO:326), or a fragment thereof sufficient to provide a binding site for an anti-PR01572 antibody. Preferably, the PR01572 fragment retains a qualitative biological activity of a native PR01572 polypeptide.
229
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01572 polypeptide having the sequence of amino acid residues from about 24 to about 261, inclusive of Figure 186 (SEQ ID NO:326), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 5 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01572 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01572 antibody. 10 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01572 polypeptide, by contacting the native PR01572 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1572 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
94. PRQ1S73
A cDNA clone (DNA73735-1681) has been identified that encodes a novel polypeptide having sequence identity with CPE-R and designated in the present application as "PR01573".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 20 a PRO1573 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01573 polypeptide having the sequence of amino acid residues from 1 or about 18 to about 225, inclusive of Figure 188 (SEQ ID N0.328), 25 or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1573 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 148 and about 771, inclusive, of Figure 187 (SEQ ID N0.327). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203356 (DNA73735-1681), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 35 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203356 (DNA73735-1681).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
230
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 18 to about 225, inclusive of Figure 188 (SEQ ID NO:328), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 5 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1573 polypeptide having the sequence of amino acid residues from about 18 to about 225, inclusive of Figure 188 (SEQ ID NO:328), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most 10 preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01573 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 15 1 through about amino acid position 17 in the sequence of Figure 188 (SEQ ID NO:328). The transmembrane domains have been tentatively identified as at approximately 82-101, 118-145 and 164-188 in the PR01573 amino acid sequence (Figure 188, SEQ ID NO:328).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 20 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 to about 225, inclusive of Figure 188 (SEQ ID NO:328), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01573 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 25 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01573 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1573 polypeptide, which in one 30 embodiment, includes an amino acid sequence comprising residues 18 through 225 of Figure 188 (SEQ ID N0:328).
In another aspect, the invention concerns an isolated PRO 1573 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 35 sequence of amino acid residues 18 to about 225, inclusive of Figure 188 (SEQ ID NO:328).
In a further aspect, the invention concerns an isolated PRO 1573 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least
231
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 18 through 225 of Figure 188 (SEQ ID NO:328).
In yet another aspect, the invention concerns an isolated PRO 1573 polypeptide, comprising the sequence of amino acid residues 18 to about 225, inclusive of Figure 188 (SEQ ID NO:328), or a fragment thereof sufficient to provide a binding site for an anti-PR01573 antibody. Preferably, the PR01573 fragment retains 5 a qualitative biological activity of a native PR01573 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01573 polypeptide having the sequence of amino acid residues from about 18 to about 225, inclusive of Figure 188 (SEQ ID NO:328), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 10 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01573 15 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01573 antibody.
In a farther embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01573 polypeptide, by contacting the native PR01573 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1573 polypeptide, 20 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
95. PRQ1488
A cDNA clone (DN A73736-1657)has been identified that encodes a novel polypeptide having homology to Clostridium perfringens enterotoxin receptor (CPE-R), designated in the present application as "PR01488". 25 In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01488 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1488 polypeptide having 30 the sequence of amino acid residues from about 1 to about 220, inclusive of Figure 190 (SEQ ID N0:330), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01488 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 6 and about 665, inclusive, of Figure 189 (SEQ ID NO:329). Preferably, hybridization occurs under stringent 35 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least
232
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203466 (DNA73736-1657), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203466 (DNA73736-1657).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 220, inclusive of Figure 190 (SEQ ID N0:330), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1488 polypeptide having the sequence of amino acid residues from about 1 to about 220, inclusive of Figure 190 (SEQ ID N0:330), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 15 preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01488 polypeptide, with or without the initiating methionine, and its soluble variants (i.e. transmembrane domains deleted or inactivated), or is complementary to such encoding nucleic acid molecule. Transmembrane 20 domains has been tentatively identified as being located at about amino acid positions 8-30, 82-102, 121-140, and 166-186 in the PR01488 amino acid sequence (Figure 190, SEQ ID N0.330).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 25 amino acid sequence of residues 1 to about 220, inclusive of Figure 190 (SEQ ID N0:330), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1488 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to aboui 50 30 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO1488 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
Ina specific aspect, the invention provides isolated native sequence PR01488 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues I to 220 of Figure 190 (SEQ ID N0:330). 35 In another aspect, the invention concerns an isolated PRO1488 polypeptide, comprising an amino acid sequence having al least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the
233
sequence of amino acid residues 1 to about 220, inclusive of Figure 190 (SEQ ID N0:330).
In a further aspect, the invention concerns an isolated PR01488 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to 220 ofFigure 190 (SEQ ID N0:330).
In yet another aspect, the invention concerns an isolated PR01488 polypeptide, comprising the sequence of amino acid residues 1 to about 220, inclusive of Figure 190 (SEQ ID N0:330), or a fragment thereof sufficient to provide a binding site for an anti-PR01488 antibody. Preferably, the PR01488 fragment retains a qualitative biological activity of a native PRO1488 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 10 molecule under stringent conditions with (a) a DNA molecule encoding a PR01488 polypeptide having the sequence of amino acid residues from about 1 to about 220, inclusive of Figure 190 (SEQ ID N0:330), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 15 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1488 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01488 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 20 native PR01488 polypeptide, by contacting the native PR01488 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1488 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
96. PROl 489
A cDNA clone (DNA73737-1658) has been identified, having homology to nucleic acid encoding the Clostridium perfringens enterotoxin receptor (CPE-R) that encodes a novel polypeptide, designated in the present application as "PR01489".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 30 a PR01489 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01489 polypeptide having the sequence of amino acid residues from about 1 to about 173, inclusive of Figure 192 (SEQ ID N0:332), or 35 (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1489 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 264
234
and about 782, inclusive, ofFigure 191 (SEQ ID N0.331). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 5 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203412 (DNA73737-1658) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203412 (DNA73737-1658).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 173, inclusive of Figure 192 (SEQ ID NO:332), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 25 15 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1489 polypeptide having the sequence of amino acid residues from 1 to about 173, inclusive of Figure 192 (SEQ ID NO:332), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) 20 or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01489 polypeptide, with or without the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane domains have been tentatively identified as extending from about amino acid position 31 to about amino acid 25 position 51, from about amino acid position 71 to about amino acid position 90 and from about amino acid position 112 to about amino acid position 133 in the PR01489 amino acid sequence (Figure 192, SEQ ID NO:332).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 30 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 173, inclusive of Figure 192 (SEQ ID NO:332), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01489 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 35 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 191 (SEQ ID NO:331).
235
PCT/US99/20T11
In another embodiment, the invention provides isolated PR01489 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01489 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 to about 173 ofFigure 192 (SEQ ID NO:332).
In another aspect, the invention concerns an isolated PRO 1489 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 173, inclusive ofFigure 192 (SEQ ID NO:332).
In a further aspect, the invention concerns an isolated PR01489 polypeptide, comprising an amino acid 10 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 to about 173, inclusive ofFigure 192 (SEQ ID NO:332).
In yet another aspect, the invention concerns an isolated PR01489 polypeptide, comprising the sequence of amino acid residues 1 to about 173, inclusive of Figure 192 (SEQ ID NO:332), or a fragment thereof 15 sufficient to provide a binding site for an anti-PR01489 antibody. Preferably, the PR01489 fragment retains a qualitative biological activity of a native PRO1489 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01489 polypeptide having the sequence of amino acid residues from about 1 to about 173, inclusive of Figure 192 (SEQ ID NO:332), or (b) 20 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identiiy, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1489
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01489 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01489 polypeptide by contacting the native PR01489 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1489 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
97. PRQ1474
A cDNA clone (DNA73739-1645) has been identified that encodes a novel polypeptide having sequence 35 identity with ovomucoid and designated in the present application as "PRO1474."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01474 polypeptide.
236
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1474 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 85, inclusive ofFigure 194 (SEQ ID NO:334), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01474
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 102 and about 299, inclusive, of Figure 193 (SEQ ID NO:333). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 10 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203270 (DNA73739-1645), or (b) the complement of the DNA molecule of (a). In apreferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 15 Deposit No. 203270 (DNA73739-1645).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 85, inclusive of Figure 194 (SEQ ID 20 NO:334), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1474 polypeptide having the sequence of amino acid residues from about 20 to about 85, inclusive of Figure 194 (SEQ ID NO:334), or (b) the 25 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 30 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 20 to about 85, inclusive of Figure 194 (SEQ ID NO:334), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1474 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 35 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01474 polypeptide encoded by any of the
237
isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01474 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 through 85 of Figure 194 (SEQ ID N0:334).
In another aspect, the invention concerns an isolated PR01474 polypeptide, comprising an amino acid 5 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 20 to about 85, inclusive of Figure 194 (SEQ ID N0:334).
In a further aspect, the invention concerns an isolated PRO1474 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 10 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 20 through 85 of Figure 194 (SEQ ID NO:334).
In yet another aspect, the invention concerns an isolated PRO1474 polypeptide, comprising the sequence of amino acid residues 20 to about 85, inclusive of Figure 194 (SEQ ID NO:334), or a fragment thereof sufficient to provide a binding site for an anti-PR01474 antibody. Preferably, the PR01474 fragment retains 15 a qualitative biological activity of a native PRO1474 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecuie encoding a PR01474 polypeptide having the sequence of amino acid residues from about 20 to about 85, inclusive of Figure 194 (SEQ ID NO:334), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 20 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01474 25 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01474 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO1474 polypeptide, by contacting the native PRO1474 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01474 polypeptide, 30 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
98. PRQ1508
A cDNA clone (DNA73742-1662) has been identified that encodes a novel secreted polypeptide and designated in the present application as "PR01508."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01508 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
238
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1508 polypeptide having the sequence of amino acid residues from 1 or about 31 to about 148, inclusive of Figure 196 (SEQ ID NO:336), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01508 5 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 160 and about 513, inclusive, of Figure 195 (SEQ ID NO:335). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 10 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203316 (DNA73742-1662), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203316 (DNA73742-1662).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 31 to about 148, inclusive of Figure 196 (SEQ ID NO:336), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01508 polypeptide having the sequence of amino acid residues from about 31 to about 148, inclusive of Figure 196 (SEQ ID NO:336), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 25 preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01508 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 30 extending from amino acid position 1 through about amino acid position 30 in the sequence of Figure 196 (SEQ ID N0:336).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 35 amino acid sequence of residues 31 to about 148, inclusive of Figure 196 (SEQ ID NO:336), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01508 polypeptide coding sequence that may find
239
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1508 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01508 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 31 to 148 of Figure 196 (SEQ ID NO:336).
In another aspect, the invention concerns an isolated PR01508 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 10 sequence of amino acid residues 31 to about 148, inclusive of Figure 196 (SEQ ID NO:336).
In a further aspect, the invention concerns an isolated PRO1508 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 31 to 148 ofFigure 196 (SEQ ID NO:336).
In yet another aspect, the invention concerns an isolated PRO1508 polypeptide, comprising the sequence of amino acid residues 31 to about 148, inclusive of Figure 196 (SEQ ID NO:336), or a fragment thereof sufficient to provide a binding site for an anti-PR01508 antibody. Preferably, the PRO1508 fragment retains a qualitative biological activity of a native PR01508 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 20 molecule under stringent conditions with (a) a DNA molecule encoding a PRO1508 polypeptide having the sequence of amino acid residues from about 31 to about 148, inclusive of Figure 196 (SEQ ID NO:336), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 25 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
99. PRQ1555
A cDNA clone (DNA73744-1665) has been identified that encodes a novel transmembrane polypeptide 30 designated in the present application as "PR01555".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01555 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 35 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1555 polypeptide having the sequence of amino acid residues from 1 or about 32 to about 246, inclusive ofFigure 198 (SEQ ID N0:338), or (b) the complement of the DNA molecule of (a).
240
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1555 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 83 and about 827, inclusive, of Figure 197 (SEQ ID NO:337). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 5 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203322 (DNA73744-1665), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 10 Deposit No. 203322 (DNA73744-1665).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 32 to about 246, inclusive of Figure 198 (SEQ ID 15 NO:338), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecuie having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1555 polypeptide having the sequence of amino acid residues from about 32 to about 246, inclusive of Figure 198 (SEQ ID NO:338), or (b) the 20 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01555 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 25 its soluble variants (i.e. transmembrane domains deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 31 in the sequence of Figure 198 (SEQ ID NO:338). Two transmembrane domains have been tentatively identified as extending from about amino acid position 1 to about aminn acid position 32, and from about amino acid position 195 through about amino acid position 217, in the PRO 1555 30 amino acid sequence (Figure 198, SEQ ID NO:338).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 32 to about 246, inclusive of Figure 198 (SEQ ID NO:338), or (b) the 35 complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01555 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length,
241
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PROI555 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PROI555 polypeptide, which in one 5 embodiment, includes an amino acid sequence comprising residues 32 to 246 ofFigure 198 (SEQ ID NO:338).
In another aspect, the invention concerns an isolated PR01555 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 32 to about 246, inclusive ofFigure 198 (SEQ ID NO:338). 10 In a further aspect, the invention concerns an isolated PRO 1555 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 32 to 246 of Figure 198 (SEQ ID NO:338).
In yet another aspect, the invention concerns an isolated PR01555 polypeptide, comprising the sequence 15 of amino acid residues 32 to about 246, inclusive of Figure 198 (SEQ ID NO:338), or a fragment thereof sufficient to provide a binding site for an anti-PR01555 antibody. Preferably, the PR01555 fragment "-tains a qualitative biological activity of a native PRO1555 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1555 polypeptide having the 20 sequence of amino acid residues from about 32 to about 246, inclusive of Figure 198 (SEQ ID NO:338), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 25 polypeptide from the cell culture.
100. PRO 1485
A cDN A clone (DNA73746-1654) has been identified that encodes a novel polypeptide having sequence identity with lysozyme, and more particularly, lysozyme C precursor, and designated in the present application 30 as "PR01485."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01485 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 35 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01485 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 148, inclusive ofFigure 200 (SEQ ID N0:340), or (b) the complement of the DNA molecule of (a).
242
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1485 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 205 and about 594, inclusive, of Figure 199 (SEQ ID NO:339). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having S at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203411 (DNA73746-1654), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 10 Deposit No. 203411 (DNA73746-1654).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 19 to about 148, inclusive ofFigure 200 (SEQ ID 15 N0:340), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecuie under stringent conditions with (a) a DNA molecule encoding a PRO1485 polypeptide having the sequence of amino acid residues from about 19 to about 148, inclusive of Figure 200 (SEQ ID N0:340), or (b) the 20 complement of die DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 25 preferably at least about 90% positives, most preferably at least about 95% positives when compared with die amino acid sequence of residues 19 to about 148, inclusive of Figure 200 (SEQ ID N0:340), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01485 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 30 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01485 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1485 polypeptide, which in one 35 embodiment, includes an amino acid sequence comprising residues 19 through 148 of Figure 200 (SEQ ID N0:340).
In another aspect, the invention concerns an isolated PRO1485 polypeptide, comprising an amino acid
243
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 148, inclusive of Figure 200 (SEQ ID N0:340).
In a further aspect, the invention concerns an isolated PR01485 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 5 about 90 % positives, most preferabty at least about 95 % positives when compared with the amino acid sequence of residues 19 through 148 of Figure 200 (SEQ ID NO:340).
In yet another aspect, the invention concerns an isolated PR01485 polypeptide, comprising the sequence of amino acid residues 19 to about 148, inclusive of Figure 200 (SEQ ID N0:340), or a fragment thereof sufficient to provide a binding she for an anti-PR01485 antibody. Preferably, the PR01485 fragment retains 10 a qualitative biological activity of a native PR01485 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01485 polypeptide having the sequence of amino acid residues from about 19 to about 148, inclusive of Figure 200 (SEQ ID N0:340), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 15 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecuie under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01485 20 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01485 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01485 polypeptide, by contacting the native PR01485 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1485 polypeptide, 25 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
101. PROl 564
A cDNA clone (DNA73760-1672) has been identified, having homology to nucleic acid encoding an N-acetylgalactosaminyltransferase protein that encodes a novel polypeptide, designated in the present application 30 as "PRO1564".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1564 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 35 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01564 polypeptide having the sequence of amino acid residues from about 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID NO:347), or (b) the complement of the DNA molecule of (a).
244
la another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01564 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 462 or about 546 and about 2378, inclusive, of Figure 201 (SEQ ID NO:346). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 5 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203314 (DNA73760-1672) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, die nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 10 ATCC Deposit No. 203314 (DNA73760-1672).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID 15 NO:347), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01564 polypeptide having the sequence of amino acid residues from 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID NO:347), or (b) the complement of the DNA molecule of (a), and, 20 if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01564 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 25 its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid positionl toaboutaminoacidposition28inthesequenceofFigure202(SEQIDNO:347). The transmembrane domain has been tentatively identified as extending from about amino acid position 11 to about amino acid position 36 in the PR01564 amino acid sequence (Figure 202, SEQ ID NO:347).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID NO:347), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1564 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50
245
nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 201 (SEQ ID NO:346).
In another embodiment, the invention provides isolated PR01S64 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PRO1564 polypeptide, which in 5 certain embodiments, includes an amino acid sequence comprising residues 1 or about 29 to about 639 of Figure 202 (SEQ ID NO:347).
In another aspect, the invention concerns an isolated PR01564 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 10 sequence of amino acid residues 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID N0:347).
In a further aspect, the invention concerns an isolated PR01564 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID NO:347).
In yet another aspect, the invention concerns an isolated PR01564 polypeptide, comprising the sequence of amino acid residues 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID NO:347), or a fragment thereof sufficient to provide a binding site for an anti-PROl564 antibody. Preferably, the PR01564 fragment retains a qualitative biological activity of a native PR01564 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 20 molecule under stringent conditions with (a) a DNA molecule encoding a PR01564 polypeptide having the sequence of amino acid residues from about 1 or about 29 to about 639, inclusive of Figure 202 (SEQ ID NO:347), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host 25 cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01564 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01564 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 30 native PR01564 polypeptide by contacting the native PR01564 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1564 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
102. PRQ1755
A cDNA clone (DNA76396-1698) has been identified that encodes a novel transmembrane polypeptide designated in the present application as "PR01755".
246
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01755 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO1755 polypeptide having 5 the sequence of amino acid residues from 1 or about 32 to about 276, inclusive ofFigure 204 (SEQ ID NO:352), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1755 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 151 and about 885, inclusive, of Figure 203 (SEQ ID NO:351). Preferably, hybridization occurs under stringent 10 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203471 15 (DNA76396-1698), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203471 (DNA76396-1698).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 20 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 32 to about 276, inclusive of Figure 204 (SEQ ID NO:352), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 25 under stringent conditions with (a) a DNA molecule encoding a PR01755 polypeptide having the sequence of amino acid residues from about 32 to about 276, inclusive of Figure 204 (SEQ ID NO:352), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 30 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01755 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants (i.e. transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 31 in the sequence of Figure 204 (SEQ ID NO:352). The transmembrane 35 domain has been tentatively identified as extending from about amino acid position 178 to about amino acid position 198 in the PR01755 amino acid sequence (Figure 204, SEQ ID NO:352).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
247
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 32 to about 276, inclusive of Figure 204 (SEQ ID NO:352), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01755 polypeptide coding sequence that may find 5 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1755 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1755 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 32 to 276 of Figure 204 (SEQ ID NO:352).
In another aspect, the invention concerns an isolated PR01755 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 15 sequence of amino acid residues 32 to about 276, inclusive of Figure 204 (SEQ ID NO:352).
In a further aspect, the invention concerns an isolated PR01755 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 32 to 276 of Figure 204 (SEQ ID NO:352).
In yet another aspect, the invention concerns an isolated PR01755 polypeptide, comprising the sequence of amino acid residues 32 to about 276, inclusive of Figure 204 (SEQ ID NO:352), or a fragment thereof sufficient to provide a binding site for an anti-PR01755 antibody. Preferably, the PR01755 fragment retains a qualitative biological activity of a native PRO1755 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 25 molecule under stringent conditions with (a) a DNA molecule encoding a PR01755 polypeptide having the sequence of amino acid residues from about 32 to about 276, inclusive of Figure 204 (SEQ ID NO:352), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 30 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1755 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01755 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 35 native PR01755 polypeptide, by contacting the native PR01755 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1755 polypeptide,
248
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier. 103. PRQ1757
A cDNA clone (DNA76398-1699) has been identified that encodes a novel transmembrane polypeptide, designated in the present application as "PRO1757".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01757 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROI757 polypeptide having 10 the sequence of amino acid residues from about 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID NO:354), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01757 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 59 or about 116 and about 121, inclusive, ofFigure 205 (SEQIDNO:353). Preferably, hybridization occurs under 15 stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identiiy to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203474 20 (DNA76398-1699) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203474 (DNA76398-1699).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 25 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID NO:354), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 125 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 30 molecule encoding a PRO1757 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID NO:354), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecuie comprising DNA encoding a PR01757 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding
249
nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 19 in the sequence ofFigure 206 (SEQ ID NO:354). The transmembrane domain has been tentatively identified as extending from about amino acid position 91 to about amino acid position 110 in the PR01757 amino acid sequence (Figure 206, SEQ ID NO:354).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 5 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID NO:354), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01757 polypeptide coding sequence that may find 10 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 205 (SEQ ID NO:353).
In another embodiment, the invention provides isolated PR01757 polypeptide encoded by any of the 15 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01757 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 20 to about 121 ofFigure 206 (SEQ ID NO:354).
In another aspect, the invention concerns an isolated PR01757 polypeptide, comprising an amino acid 20 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID NO:354).
In a further aspect, the invention concerns an isolated PR01757 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 25 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID NO:354).
In yet another aspect, the invention concerns an isolated PRO1757 polypeptide, comprising the sequence of amino acid residues 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID NO:354), or a fragment thereof sufficient to provide a binding site for an anti-PR01757 antibody. Preferably, the PR01757 fragment 30 retains a qualitative biological activity of a native PR01757 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01757 polypeptide having the sequence of amino acid residues from about 1 or about 20 to about 121, inclusive of Figure 206 (SEQ ID N0:354), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 35 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii)
250
recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01757 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01757 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01757 polypeptide by contacting the native PR01757 polypeptide with a candidate molecule and 5 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01757 polypeptide, or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceuticaliy acceptable carrier.
104. PRQ1758
A cDNA clone (DNA76399-1700) has been identified that encodes a novel secreted polypeptide designated in the present application as "PR01758".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01758 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 15 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1758 polypeptide having the sequence of amino acid residues from 1 or about 16 to about 157, inclusive ofFigure 208 (SEQ ID NO:356), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01758 20 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 123 and about 548, inclusive, of Figure 207 (SEQ ID NO:355). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 25 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203472 (DNA76399-1700), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203472 (DNA76399-1700).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 16 to about 157, inclusive of Figure 208 (SEQ ID NO:356), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01758 polypeptide having the sequence of
251
PCT/US99/20I11
amino acid residues from about 16 to about 157, inclusive of Figure 208 (SEQ ID N0:356), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PRO 1758 polypeptide, with or without the N-terminal signal sequence, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 15 in the sequence of Figure 208 (SEQ ID NO:356).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 16 to about 157, inclusive of Figure 208 (SEQ ID NO:356), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1758 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01758 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01758 polypeptide, which in one 20 embodiment, includes an amino acid sequence comprising residues 16 to 157 ofFigure 208 (SEQ ID NO:356).
In another aspect, the invention concerns an isolated PR01758 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 16 to about 157, inclusive of Figure 208 (SEQ ID NO:356). 25 In a further aspect, the invention concerns an isolated PR01758 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 16 to 157 of Figure 208 (SEQ ID NO:356).
In yet another aspect, the invention concerns an isolated PR01758 polypeptide, comprising the sequence 30 of amino acid residues 16 to about 157, inclusive of Figure 208 (SEQ ID NO:356), or a fragment thereof sufficient to provide a binding site for an anti-PROI758 antibody. Preferably, the PR01758 fragment retains a qualitative biological activity of a native PR01758 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01758 polypeptide having the 35 sequence of amino acid residues from about 16 to about 157, inclusive of Figure 208 (SEQ ID N0:356), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence
252
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
105. FRQ1S7S
A cDNA clone (DNA76401-1683) has been identified that encodes a novel polypeptide having homology to protein disulfide isomerase and designated in the present application as "PRO 1575."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01575 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 10 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01575 polypeptide having the sequence of amino acid residues from 1 or about 21 to about 273, inclusive ofFigure 210 (SEQIDNO:358), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1575 IS polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 82 and about 840, inclusive, of Figure 209 (SEQ ID NO:357). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 20 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203360 (DNA76401-1683), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203360 (DNA76401-1683).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 21 to about 273, inclusive of Figure 210 (SEQ ID NO:358), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01575 polypeptide having the sequence of amino acid residues from about 21 to about 273, inclusive of Figure 210 (SEQ ID NO:358), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 35 preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding
253
a PR01575 polypeptide, its soluble variants, (i.e. transmembrane domain and/or signal peptide deleted or inactivated) or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 20 in the sequence of Figure 210 (SEQ ID N0:358). The transmembrane domain has been tentatively identified as extending from about amino acid position 143 to about amino acid position 162 in the PR0157S amino acid sequence (Figure 5 210, SEQ ID NO:358).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 to about 273, inclusive of Figure 210 (SEQ ID NO:358), or (b) the 10 complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1575 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 15 In another embodiment, the invention provides isolated PR01575 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01575 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 21 to 273 ofFigure 210 (SEQ ID NO:358).
In another aspect, the invention concerns an isolated PR01575 polypeptide, comprising an amino acid 20 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 21 to about 273, inclusive of Figure 210 (SEQ ID N0:358).
In a further aspect, the invention concerns an isolated PR01575 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 25 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 21 to 273 ofFigure 210 (SEQ ID NO:358).
In yet another aspect, the invention concerns an isolated PRO1575 polypeptide, comprising the sequence of amino acid residues 21 to about 273, inclusive of Figure 210 (SEQ ID NO:358), or a fragment thereof sufficient to provide a binding site for an anti-PR01575 antibody. Preferably, the PR01575 fragment retains 30 a qualitative biological activity of a native PRO1575 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1575 polypeptide having the sequence of amino acid residues from about 21 to about 273, inclusive of Figure 210 (SEQ ID NO:358), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 35 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the
254
polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1575 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01575 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01575 polypeptide, by contacting the native PR01575 polypeptide with a candidate molecule and 5 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01575 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
106. PRO 1787
A cDN A clone (DN A76510-2504) has been identified that encodes a novel polypeptide having sequence identity with myelin pO and designated in the present application as "PRO1787."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01787 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 15 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01787 polypeptide having the sequence of amino acid residues from 1 or about 38 to about 269, inclusive ofFigure 212 (SEQ ID NO:364), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1787 20 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 274 and about 969, inclusive, of Figure 211 (SEQ ID NO:363). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 25 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203477 (DNA76510-2504), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203477 (DNA76510-2504).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 38 to about 269, inclusive of Figure 212 (SEQ ID NO:364), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1787 polypeptide having the sequence of
255
amino acid residues from about 38 to about 269, inclusive of Figure 212 (SEQ ID NO:364), or (b) the complementof the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 5 a PR01787 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 37 in the sequence of Figure 212 (SEQ ID NO:364). The transmembrane domain has been tentatively identified as extending from about amino acid position 161 through about amino acid 10 position 183 in the PR01787 amino acid sequence (Figure 212, SEQ ID NO:364).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 38 to about 269, inclusive of Figure 212 (SEQ ID NO:364), or (b) the 15 complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01787 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 20 In another embodiment, the invention provides isolated PRO1787 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01787 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 38 through 269 of Figure 212 (SEQ ID N0:364).
In another aspect, the invention concerns an isolated PR01787 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 38 to about 269, inclusive of Figure 212 (SEQ ID NO:364).
In a further aspect, the invention concerns an isolated PR01787 polypeptide, comprising an amino acid 30 sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 38 through 269 of Figure 212 (SEQ ID NO:364).
In yet another aspect, the invention concerns an isolated PRO1787 polypeptide, comprising the sequence of amino acid residues 38 to about 269, inclusive of Figure 212 (SEQ ID NO:364), or a fragment thereof 35 sufficient to provide a binding site for an anti-PR01787 antibody. Preferably, the PR01787 fragment retains a qualitative biological activity of a native PR01787 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
256
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1787 polypeptide having the sequence of amino acid residues from about 38 to about 269, inclusive of Figure 212 (SEQ ID NO:364), or (b) the complement of the DNA molecuie of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 5 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1787 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01787 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 10 native PR01787 polypeptide, by contacting the native PR01787 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01787 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
107. PRQ1781
A cDNA clone (DNA76522-2500) has been identified that encodes a novel transmembrane polypeptide designated in the present application as "PRO 1781".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01781 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01781 polypeptide having the sequence of amino acid residues from 1 or about 20 to about 373, inclusive of Figure 214 (SEQ ID NO:366), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01781
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 78 and about 1139, inclusive, ofFigure 213 (SEQ ID NO:365). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 30 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203469 (DNA76522-2500), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 35 Deposit No. 203469 (DNA76522-2500).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence
257
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 20 to about 373, inclusive of Figure 214 (SEQ ID NO:366), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule baving at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 5 under stringent conditions with (a) a DNA molecule encoding a PR01781 polypeptide having the sequence of amino acid residues from about 20 to about 373, inclusive of Figure 214 (SEQ ID NO:36), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 10 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01781 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble variants (i.e. transmembrane domain deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 19 in the sequence of Figure 214 (SEQ ID NO:366). The transmembrane 15 domain has been tentatively identified as extending from about amino acid position 39 to about amino acid position 60 in the PR01781 amino acid sequence (Figure 214, SEQ ID NO:366).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 20 amino acid sequence of residues 20 to about 373, inclusive of Figure 214 (SEQ ID NO:366), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1781 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 25 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01781 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01781 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 20 to 373 ofFigure 214 (SEQ ID NO:366). 30 In another aspect, the invention concerns an isolated PR01781 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 20 to about 373, inclusive of Figure 214 (SEQ ID NO:366).
In a further aspect, the invention concerns an isolated PR01781 polypeptide, comprising an amino acid 35 sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 20 to 373 of Figure 214 (SEQ ID N0:366).
258
In yet another aspect, the invention concerns an isolated PRO 1781 polypeptide, comprising the sequence of amino acid residues 20 to about 373, inclusive of Figure 214 (SEQ ID NO:366), or a fragment thereof sufficient to provide a binding site for an anti-PROI781 antibody. Preferably, the PROI781 fragment retains a qualitative biological activity of a native PR01781 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 5 molecule under stringent conditions with (a) a DNA molecule encoding a PR01781 polypeptide having the sequence of amino acid residues from about 20 to about 373, inclusive of Figure 214 (SEQ ID NO:366), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising 10 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
108. PRQ1556
A cDNA clone (DNA76529-1666) has been identified that encodes a novel transmembrane polypeptide 15 designated in the present application as "PRO 1556".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01556 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 20 preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01556 polypeptide having the sequence of amino acid residues from 1 or about 25 to about 269, inclusive ofFigure 216 (SEQ ID N0:372), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1556 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 160 and 25 about 891, inclusive, of Figure 215 (SEQ ID NO:371). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 30 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203315 (DNA76529-1666), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203315 (DNA76529-1666).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 35 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 25 to about 269, inclusive of Figure 216 (SEQ ID
259
NO:372), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1556 polypeptide having the sequence of amino acid residues from about 25 to about 269, inclusive of Figure 216 (SEQ ID NO:372), or (b) the 5 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01556 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 10 its soluble variants (i.e. transmembrane domains deleted or inactivated), or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 24 in the sequence of Figure 216 (SEQ ID N0:372). Two transmembrane domains have been tentatively identified as extending from about amino acid position 11 to about amino acid position 25 and from about amino acid position 226 to about amino acid position 243 in the PR01556 amino acid 15 sequence (Figure 216, SEQ ID NO:372).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 25 to about 269, inclusive of Figure 216 (SEQ ID NO:372), or (b) the 20 complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01556 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 25 In another embodiment, the invention provides isolated PR01556 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01556 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 25 to 269 ofFigure 216 (SEQ ID NO:372).
In another aspect, the invention concerns an isolated PR01556 polypeptide, comprising an amino acid 30 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 25 to about 269, inclusive of Figure 216 (SEQ ID NO:372).
In a further aspect, the invention concerns an isolated PR01556 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 35 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 25 to 269 of Figure 216 (SEQ ID NO:372).
In yet another aspect, the invention concerns an isolated PR01556 polypeptide, comprising the sequence
260
PCT/U S99/20111
of amino acid residues 25 to about 269, inclusive of Figure 216 (SEQ ID NO:372), or a fragment thereof sufficient to provide a binding site for an anti-PR01556 antibody. Preferably, the PR01556 fragment retains a qualitative biological activity of a native PRO1556 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01556 polypeptide having the 5 sequence of amino acid residues from about 25 to about 269, inclusive of Figure 216 (SEQ ID NO:372), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 10 polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1556 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01556 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01556 polypeptide, by contacting the native PR01556 polypeptide with a candidate molecule and 15 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1556 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
109. PRQ1759
A cDNA clone (DNA76531-1701) has been identified that encodes a novel polypeptide having multiple transmembrane domains, designated in the present application as "PR01759."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1759 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 25 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01759 polypeptide having the sequence of amino acid residues from 1 or about 19 to about 450, inclusive ofFigure 218 (SEQ ID NO:374), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01759 30 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 179 and about 1474, inclusive, of Figure 217 (SEQ ID NO:373). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 35 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203465 (DNA76531-1701), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic
261
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203465 (DNA76531-1701).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 5 identity to the sequence of amino acid residues from about 19 to about 450, inclusive of Figure 218 (SEQ ID NO:374), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1759 polypeptide having the sequence of 10 amino acid residues from about 19 to about 450, inclusive of Figure 218 (SEQ ID NO:374), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 15 a PR01759 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domains deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 18 in the sequence of Figure 218 (SEQ ID N0:374). The transmembrane domains have been tentatively identified as being at about amino acids 1-19 (possibly a signal peptide), 41-55, 20 75-94,127-143,191-213, 249-270,278-299,314-330, 343-359,379-394, and 410-430 in the PR01759 amino acid sequence (Figure 218, SEQ ID NO:374).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 25 amino acid sequence of residues 19 to about 450, inclusive of Figure 218 (SEQ ID NO:374), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01759 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 30 nucleotides in length, and most preferably frotn about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PRO 1759 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1759 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 19 through 450 of Figure 218 (SEQ ID 35 NO:374).
In another aspect, the invention concerns an isolated PR01759 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more
262
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 19 to about 450, inclusive of Figure 218 (SEQ ID NO:374).
In a further aspect, the invention concerns an isolated PR01759 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence 5 of residues 19 through 450 of Figure 218 (SEQ ID NO:374).
In yet another aspect, the invention concerns an isolated PR01759polypeptide, comprising the sequence of amino acid residues 19 to about 450, inclusive of Figure 218 (SEQ ID NO:374), or a fragment thereof sufficient to provide a binding site for an anti-PR01759 antibody. Preferably, the PR01759 fragment retains a qualitative biological activity of a native PR01759 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA
molecule under stringent conditions with (a) a DNA molecule encoding a PR01759 polypeptide having the sequence of amino acid residues from about 19 to about 450, inclusive of Figure 218 (SEQ ID NO:374), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 15 identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01759 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01759 antibody. 20 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01759 polypeptide, by contacting the native PR01759 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1759 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
110. PROl 760
A cDNA clone (DNA76532-1702) has been identified that encodes a novel secreted polypeptide, designated in the present application as "PR01760."
In one embodiment, the invention provides an isolated nucleic acid molecuie comprising DNA encoding 30 a PR01760 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1760 polypeptide having the sequence of amino acid residues from 1 or about 21 to about 188, inclusive ofFigure 220 (SEQ ID NO:376), 35 or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01760 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 120 and
263
about 623, inclusive, of Figure 219 (SEQ ID NO:375). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 5 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203473 (DNA76532-1702), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203473 (DNA76532-1702).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 10 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 21 to about 188, inclusive of Figure 220 (SEQ ID NO:376), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 15 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1760 polypeptide having the sequence of amino acid residues from about 21 to about 188, inclusive of Figure 220 (SEQ ID NO:376), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % sequence identity, most 20 preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 to about 188, inclusive of Figure 220 (SEQ ID NO:376), or (b) the 25 complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1760 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 30 In another embodiment, the invention provides isolated PR01760 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01760 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 21 through 188 of Figure 220 (SEQ ID NO:376).
In another aspect, the invention concerns an isolated PR01760 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the
264
sequence of amino acid residues 21 to about 188, inclusive of Figure 220 (SEQ ID NO:376).
In a further aspect, the invention concerns an isolated PR01760 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 21 through 188 of Figure 220 (SEQ ID NO:376).
In yet another aspect, theinventionconcernsanisolatedPR01760polypeptide, comprising the sequence of amino acid residues 21 to about 188, inclusive of Figure 220 (SEQ ID NO:376), or a fragment thereof sufficient to provide a binding site for an anti-PROI760 antibody. Preferably, the PR01760 fragment retains a qualitative biological activity of a native PR01760 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 10 molecule under stringent conditions with (a) a DNA molecule encoding a PRO1760 polypeptide having the sequence of amino acid residues from about 21 to about 188, inclusive ofFigure 220 (SEQ ID NO:376), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 15 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01760 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01760 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 20 native PR01760 polypeptide, by contacting the native PR01760 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PROl760polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
112. PRQ1567
A cDN A clone (DNA76541-1675) has been identified that encodes a novel polypeptidehaving homology to the expression product of the colon specific gene, CSG6, and is designated in the present application as "PR01567".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 30 a PR01567 polypeptide.
265
(followed by page 268)
INTELLECTUAL PROPERTY OFFICE OF N Z
2 4 OCT 2003 Received
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01567 polypeptide having the sequence of amino acid residues from 1 or about 23 to about 178, inclusive ofFigure 224 (SEQ ID NO:383), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1567
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 175 and about 642, inclusive, of Figure 223 (SEQ ID NO:382). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 10 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203409 (DNA76541-1675), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 15 Deposit No. 203409 (DNA76541-1675).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 23 to about 178, inclusive of Figure 224 (SEQ ID 20 NO:383), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01567 polypeptide having the sequence of amino acid residues from about 23 to about 178, inclusive of Figure 224 (SEQ ID NO:383), or (b) the 25 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01567 polypeptide, with or without the N-terminal signal sequence, or is complementary to such encoding 30 nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 22 in the sequence of Figure 224 (SEQ ID NO:383).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 35 amino acid sequence of residues 23 to about 178, inclusive of Figure 224 (SEQ ID NO:383), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1567 polypeptide coding sequence that may fmd
268
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01567 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defmed.
In a specific aspect, the invention provides isolated native sequence PROIS67 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 23 to 178 ofFigure 224 (SEQ ID NO:383).
In another aspect, the invention concerns an isolated PR01567 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 10 sequence of amino acid residues 23 to about 178, inclusive of Figure 224 (SEQ ID NO:383).
In a further aspect, the invention concerns an isolated PRO1567 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 23 to 178 of Figure 224 (SEQ ID NO:383).
In yet another aspect, the invention concerns an isolated PRO1567 polypeptide, comprising the sequence of amino acid residues 23 to about 178, inclusive of Figure 224 (SEQ ID NO:383), or a fragment thereof sufficient to provide a binding site for an anti-PR01567 antibody. Preferably, the PR01567 fragment retains a qualitative biological activity of a native PR01567 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 20 molecule under stringent conditions with (a) a DNA molecule encoding a PR01567 polypeptide having the sequence of amino acid residues from about 23 to about 178, inclusive of Figure 224 (SEQ ID N0:383), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 25 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01567 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01567 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 30 native PR01567 polypeptide, by contacting the native PR01567 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR0I567 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
113. PRO 1693
A cDNA clone (DNA77301-1708) has been identified, having homology to nucleic acid encoding an insulin-like growth factor binding protein that encodes a novel polypeptide, designated in the present application
269
as "PR01693".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01693 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 5 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1693 polypeptide having the sequence of amino acid residues from about 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID NO:385), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01693 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 508 10 or about 607 and about 2046, inclusive, of Figure 225 (SEQ ID NO:384). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 15 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203407 (DNA77301-1708) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203407 (DNA77301-1708).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 20 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID NO:385), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 175 25 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01693 polypeptide having the sequence of amino acid residues from 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID NO-.385), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 30 identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01693 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid 35 position 1 to about amino acid position 33 in the sequence of Figure 226 (SEQ ID NO:385). The transmembrane domain has been tentatively identified as extending from about amino acid position 420 to about amino acid position 442 in the PR01693 amino acid sequence (Figure 226, SEQ ID NO:385).
270
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID NO:385), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1693 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 225 (SEQ ID NO:384).
In another embodiment, the invention provides isolated PR01693 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01693 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 34 to about 513 ofFigure 226 (SEQ ID NO:385).
In another aspect, the invention concerns an isolated PR01693 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID NO:385).
In a further aspect, the invention concerns an isolated PR01693 polypeptide, comprising an amino acid 20 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID NO:385).
In yet another aspect, the invention concerns an isolated PR01693 polypeptide, comprising the sequence of amino acid residues 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID NO:385), or a fragment 25 thereof sufficient to provide a binding site for an anti-PR01693 antibody. Preferably, the PR01693 fragment retains a qualitative biological activity of a native PR01693 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01693 polypeptide having the sequence of amino acid residues from about 1 or about 34 to about 513, inclusive of Figure 226 (SEQ ID 30 NO:385), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01693
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01693 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a
271
native PRO1693 polypeptide by contacting the native PRO1693 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1693 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
114. PRQ1784
A cDNA clone (DNA77303-2502) has been identified that encodes a novel transmembrane polypeptide designated in the present application as "PR01784."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01784 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO1784 polypeptide having the sequence of amino acid residues from 1 or about 30 to about 146, inclusive ofFigure 228 (SEQ ID N0:390), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01784
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 155 and about 505, inclusive, of Figure 227 (SEQ ID NO:389). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 20 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203479 (DNA77303-2502), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 25 Deposit No. 203479 (DNA77303-2502).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 30 to about 146, inclusive of Figure 228 (SEQ ID 30 N0:390), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecuie having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01784 polypeptide having the sequence of amino acid residues from about 30 to about 146, inclusive of Figure 228 (SEQ ID N0:390), or (b) the 35 complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
272
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01784 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its-soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 29 in the sequence of Figure 228 (SEQ ID N0:390). The transmembrane 5 domain has been tentatively identified as extending from about amino acid position 52 through about amino acid position 70 in the PR01784 amino acid sequence (Figure 228, SEQ ID N0:390).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 10 amino acid sequence of residues 30 to about 146, inclusive of Figure 228 (SEQ ID N0:390), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1784 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 15 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01784 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PROl784 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 30 through 146 of Figure 228 (SEQ ID 20 N0:390).
In another aspect, the invention concerns an isolated PRO1784 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 30 to about 146, inclusive of Figure 228 (SEQ ID N0:390). 25 In a further aspect, the invention concerns an isolated PR01784 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85 % positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 30 through 146 of Figure 228 (SEQ ID N0:390).
In yet another aspect, the invention concerns an isolated PRO1784 polypeptide, comprising the sequence 30 of amino acid residues 30 to about 146, inclusive of Figure 228 (SEQ ID N0:390), or a fragment thereof sufficient to provide a binding site for an anti-PR01784 antibody. Preferably, the PRO1784 fragment retains a qualitative biological activity of a native PR01784 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1784 polypeptide having the 35 sequence of amino acid residues from about 30 to about 146, inclusive of Figure 228 (SEQ ID N0:390), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence
273
PCTAJS99/20111
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1784 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01784 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01784 polypeptide, by contacting the native PR01784 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01784 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
115. PRQ1605
A cDNA clone (DNA77648-1688) has been identified, having homology to nucleic acid encoding a glycosyltransferase protein that encodes a novel polypeptide, designated in the present application as "PR01605".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01605 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01605 polypeptide having 20 the sequence of amino acid residues from about 1 or about 27 to about 140, inclusive of Figure 230 (SEQ ID NO:395), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1605 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 425 or about 503 and about 844, inclusive, ofFigure 229 (SEQ ID NO:394). Preferably, hybridization occurs under 25 stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit 203408 30 (DNA77648-1688) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, die nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203408 (DNA77648-1688).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 35 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 27 to about 140, inclusive of Figure 230 (SEQ ID NO:395), or (b) the complement of the DNA of (a).
274
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 380 nucleotides and produced by hybridizing a test DNA molecuie under stringent conditions with (a) a DNA molecule encoding a PR01605 polypeptide having the sequence of amino acid residues from 1 or about 27 to about 140, inclusive ofFigure 230 (SEQ ID NO:395), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 5 identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01605 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 10 extending from about amino acid position 1 to about amino acid position 26 in the sequence of Figure 230 (SEQ ID NO:395).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 15 amino acid sequence of residues 1 or about 27 to about 140, inclusive of Figure 230 (SEQ ID NO:395), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO1605 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 20 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 229 (SEQ ID NO:394).
In another embodiment, the invention provides isolated PRO1605 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01605 polypeptide, which in 25 certain embodiments, includes an amino acid sequence comprising residues 1 or about 27 to about 140 of Figure 230 (SEQ ID NO:395).
In another aspect, the invention concerns an isolated PRO1605 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 30 sequence of amino acid residues 1 or about 27 to about 140, inclusive of Figure 230 (SEQ ID NO:395).
In a further aspect, the invention concerns an isolated PR01605 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 27 to about 140, inclusive of Figure 230 (SEQ ID NO:395).
In yet another aspect, the invention concerns an isolated PR01605 polypeptide, comprising the sequence of amino acid residues 1 or about 27 to about 140, inclusive of Figure 230 (SEQ ID NO:395), or a fragment thereof sufficient to provide a binding site for an anti-PR01605 antibody. Preferably, the PR01605 fragment
275
retains a qualitative biological activity of a native PRO1605 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1605 polypeptide having the sequence of amino acid residues from about 1 or about 27 to about 140, inclusive of Figure 230 (SEQ ID NO:395), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 5 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing ahost cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concents agonists and antagonists of a native PR01605 10 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01605 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO1605 polypeptide by contacting the native PRO1605 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO 1605 polypeptide, 15 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
116. PRQ1788
A cDN A clone (DN A77652-2505) has been identified that encodes a novel polypeptide having homology to leucine-rich repeat proteins and designated in the present application as "PR01788."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01788 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01788 polypeptide having 25 the sequence of amino acid residues from 1 or about 17 to about 353, inclusive ofFigure 232 (SEQ ID NO:397), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01788 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 112 and about 1122, inclusive, of Figure 231 (SEQ ID NO:396). Preferably, hybridization occurs under stringent 30 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203480 35 (DNA77652-2505), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203480 (DNA77652-2505).
276
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 17 to about 353, inclusive of Figure 232 (SEQ ID NO:397), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1788 polypeptide having the sequence of amino acid residues from about 17 to about 353, inclusive of Figure 232 (SEQ ID NO:397), or (b) the complement of the DNA molecule of (a), and, if the DNA molecuie has at least about an 80% sequence identity, 10 preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01788 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding 15 nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 16 in the sequence of Figure 232 (SEQ ID NO:397). Transmembrane domains have been tentatively identified as extending from about amino acid position 215 through about amino acid position 232 and about amino acid position 287 through about amino acid position 304 in the PRO 1788 amino acid sequence (Figure 232, SEQ ID NO:397).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 17 to about 353, inclusive of Figure 232 (SEQ ID NO:397), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01788 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01788 polypeptide encoded by any of the 30 isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 1788 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 17 to 353 of Figure 232 (SEQ ID NO-.397).
In another aspect, the invention concerns an isolated PR01788 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at Least about 85% sequence identity, more 35 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 17 to about 353, inclusive of Figure 232 (SEQ ID NO:397).
In a further aspect, the invention concerns an isolated PRO1788 polypeptide, comprising an amino acid
277
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 17 to 353 of Figure 232 (SEQ ID NO:397).
In yet another aspect, the invention concerns an isolated PRO1788 polypeptide, comprising the sequence of amino acid residues 17 to about 353, inclusive of Figure 232 (SEQ ID NO:397), or a fragment thereof 5 sufficient to provide a binding site for an anti-PR01788 antibody. Preferably, the PR01788 fragment retains a qualitative biological activity of a native PR01788 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO1788 polypeptide having the sequence of amino acid residues from about 17 to about 353, inclusive of Figure 232 (SEQ ID N0:397), or (b) 10 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01788
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01788 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR01788 polypeptide, by contacting the native PR01788 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1788 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
117. PRQ1801
A cDNA clone (DNA83500-2506) has been identified, having homology to nucleic acid encoding IL-19 25 polypeptide, that encodes a novel polypeptide, designated in the present application as "PR01801".
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding aPR01801 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 30 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1801 polypeptide having the sequence of amino acid residues from about 1 or about 43 to about 261, inclusive of Figure 234 (SEQ ID N0:402), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01801 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 109 35 or about 235 and about 891, inclusive, of Figure 233 (SEQ ID N0:401). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having
278
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203391 (DNA83500-2506) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 5 ATCC Deposit No. 203391 (DNA83500-2506).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 43 to about 261, inclusive of Figure 234 (SEQ ID 10 N0:402), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 30 nucleotides, usually at least about 50 nucleotides, more usually at least about 100 nucleotides and generally at least about 150 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01801 polypeptide having the sequence of amino acid residues from 1 or 15 about 43 to about 261, inclusive of Figure 234 (SEQ ID N0:402), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 20 a PRO 1801 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 42 in the sequence ofFigure 234 (SEQ ID N0:402).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 25 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 43 to about 261, inclusive of Figure 234 (SEQ ID N0:402), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01801 polypeptide coding sequence that may find 30 use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 233 (SEQ ID N0:401).
In another embodiment, the invention provides isolated PRO 1801 polypeptide encoded by any of the 35 isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PROl 801 polypeptide, which in certain embodiments, includes an amino acid sequence comprising residues 1 or about 43 to about 261 ofFigure
279
234 (SEQ ID N0:402).
In another aspect, the invention concerns an isolated PRO 1801 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 43 to about 261, inclusive of Figure 234 (SEQ ID NO:4Q2).
In a further aspect, the invention concerns an isolated PR01801 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 or about 43 to about 261, inclusive of Figure 234 (SEQ ID N0:402).
In yet another aspect, the invention concerns an isolated PR01801 polypeptide, comprising the sequence 10 of amino acid residues 1 or about 43 to about 261, inclusive of Figure 234 (SEQ ID NO:402), or a fragment thereof sufficient to provide a binding site for an anti-PROl801 antibody. Preferably, the PROl801 fragment retains a qualitative biological activity of a native PR01801 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01801 polypeptide having the 15 sequence of amino acid residues from about 1 or about 43 to about 261, inclusive of Figure 234 (SEQ ID N0:402), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 20 recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PR01801 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01801 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PRO 1801 polypeptide by contacting the native PRO 1801 polypeptide with a candidate molecule and 25 monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR01801 polypeptide, or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
Another embodiment of the present invention is directed to a method of inhibiting the production of an inflammatory cytokine by a cell capable of producing that inflammatory cytokine, wherein the method comprises 30 the step of contacting the cell with a PR01801 polypeptide, wherein the production of the inflammatory cytokine is inhibited. The ceil may be, for example, a T-cell, an NK cell or a macrophage and the inflammatory cytokine whose production is inhibited may be, for example, IL-1, IL-6, IFN-y or TNF-a.
A further embodiment of the present invention is directed to a method for the treatment of an individual in need of immunosuppression, wherein the method comprises the step of administering to the individual an 35 immunosuppressive amount of a PR01801 polypeptide. The individual in need of immunosuppression may suffer from an autoimmune disease, such as rheumatoid arthritis, myasthenia gravis, insulin-dependent diabetes mellitus, systemic lupus erythematosus, thyroiditis or colitis, or from septic shock, endotoxic shock or any other
280
type of disorder where immunosuppression is desired. The individual may also be one who has received or is to receive a tissue transplant, where the method serves to inhibit rejection of the tissue transplant.
Other embodiments will become evident upon a reading of the present specification.
118. UCP4
A cDNA clone (DNA77568-1626) has been identified, having certain homologies to some known human uncoupling proteins, that encodes a novel polypeptide, designated in the present application as "UCP4."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a UCP4 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 10 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a UCP4 polypeptide having the sequence of amino acid residues from about 1 to about 323, inclusive of Figure 236 (SEQ ID N0:406), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a UCP4 polypeptide 15 comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 40 and about 1011 inclusive, of Figure 235 (SEQ ID N0:405). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85 % sequence identity, more preferably at least 20 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203134, or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203134. In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding 25 a polypeptide having at least about 80 % sequence identity, preferably at least about 85 % sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 323, inclusive of Figure 236 (SEQ ID N0:406), or the complement of the DNA of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 30 encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to about 323, inclusive of Figure 236 (SEQ ID N0:406), or (b) the complement of the DNA of (a).
Further embodiments of the invention are directed to fragments of the UCP4 coding sequence, which 35 are sufficiently long to be used as hybridization probes. Preferably, such fragments contain at least about 20 to about 80 consecutive bases included in the sequence of Figure 235 (SEQ ID N0:405). Optionally, such fragments include the N-terminus or the C-terminus of the sequence of Figure 236 (SEQ ID N0:406).
281
In another embodiment, the invention provides isolated UCP4 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence UCP4 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 to 323 of Figure 236 (SEQ ID N0:406).
In another aspect, the invention concerns an isolated UCP4 polypeptide, comprising an amino acid 5 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 323, inclusive of Figure 236 (SEQ ID N0:406).
In a further aspect, the invention concerns an isolated UCP4 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 10 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 1 to 323 of Figure 236 (SEQ ID N0:406).
In yet another aspect, the invention concerns an isolated UCP4 polypeptide, comprising the sequence of amino acid residues 1 to about 323, inclusive of Figure 236 (SEQ ID N0:406), or a fragment thereof sufficient to, for instance, provide a binding site for an anti-UCP4 antibody. Preferably, the UCP4 fragment 15 retains at least one biological activity of a native UCP4 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a UCP4 polypeptide having the sequence of amino acid residues from about 1 to about 323, inclusive of Figure 236 (SEQ ID N0:406), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 20 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of the native UCP4 25 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-UCP4 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native UCP4 polypeptide, by contacting the native UCP4 polypeptide with a candidate molecule and monitoring the desired activity. The invention also provides therapeutic methods and diagnostic methods using UCP4.
In a still further embodiment, the invention concerns a composition comprising a UCP4 polypeptide, 30 or an agonist or antagonist as hereinabove defined, in combination with a carrier.
119. FRQ193
A cDNA clone (DNA23322-1393) has been identified that encodes a novel multi-transmembrane polypeptide, designated in the present application as "PR0193."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR0193 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity,
282
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR0193 polypeptide having the sequence of amino acid residues from about 1 to about 158, inclusive of Figure 238 (SEQ ID N0:410), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR0193 5 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 138 and about 611, inclusive, of Figure 237 (SEQ ID N0:409). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about. 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 10 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No.203400 (DNA23322-1393), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203400 (DNA23322-1393).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 1 to about 158, inclusive of Figure 238 (SEQ ID N0:410), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR0193 polypeptide having the sequence of amino acid residues from about 1 to about 158, inclusive of Figure 238 (SEQ ID N0:410), or (b) the complement of the DNA molecuie of (a), and, if the DNA molecule has at least about an 80 % sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% 25 sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
in a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR0193 polypeptide in its soluble form, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The transmembrane domain has been tentatively 30 identified as extending from about amino acid positions 23-42,60-80,97-117 and 128-148 in the PR0193 amino acid sequence (Figure 238, SEQ ID N0:410).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 35 amino acid sequence of residues 1 to about 158, inclusive of Figure 238 (SEQ ID N0:410), or (b) the complement of the DNA of (a).
In another embodiment, the invention provides isolated PRO 193 polypeptide encoded by any of the
283
isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO 193 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 1 through 158 of Figure 238 (SEQ ID N0:410).
In another aspect, the invention concerns an isolated PR0193 polypeptide, comprising an amino acid 5 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 to about 158, inclusive of Figure 238 (SEQ ID N0:410).
In a further aspect, the invention concerns an isolated PRO 193 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 10 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 through 158 of Figure 238 (SEQ ID N0:410).
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR0193 polypeptide having the sequence of amino acid residues from about 1 to about 158, inclusive of Figure 238 (SEQ ID N0:410), or (b) 15 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of die polypeptide, and (iii) recovering the polypeptide from tbe cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of the a native PR0193
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR0193 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a native PR0193 polypeptide, by contacting the native PR0193 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PR0193 polypeptide,
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier.
121. PRQ1335
A cDNA clone (DNA62812-1594) has been identified, having homology to nucleic acid encoding carbonic anhydrase that encodes a novel polypeptide, designated in the present application as "PRO1335*.
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO1335 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01335 polypeptide having the sequence of amino acid residues from about 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID NO:423), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01335
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 271 or about 316 and about 1281, inclusive, of Figure 241 (SEQ ID NO:422). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 10 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203248 (DNA62812-1594) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 15 ATCC Deposit No. 203248 (DNA62812-1594).
In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID 20 NO:423), or (b) the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 180 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01335 polypeptide having the sequence of amino acid residues from 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID NO:423), or (b) the complement of the DNA molecule of (a), and, 25 if the DNA molecuie has at least about an 80 % sequence identity, prefereably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01335 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 30 its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid position 1 to about amino acid position 15 in the sequence of Figure 242 (SEQ ID NO:423). The transmembrane domain has been tentatively identified as extending from about amino acid position 291 to about amino acid position 310 in the PR01335 amino acid sequence (Figure 242, SEQ ID NO:423).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the
287
amino acid sequence of residues 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID NO:423), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01335 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 5 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived from the nucleotide sequence shown in Figure 241 (SEQ ID NO:422).
In another embodiment, the invention provides isolated PR01335 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a specific aspect, the invention provides isolated native sequence PR01335 polypeptide, which in 10 certain embodiments, includes an amino acid sequence comprising residues 1 or about 16 to about 337 ofFigure 242 (SEQ ID NO:423).
In another aspect, the invention concerns an isolated PR01335 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the IS sequence of amino acid residues 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID NO:423).
In a further aspect, the invention concerns an isolated PR01335 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID NO:423).
In yet another aspect, the invention concerns an isolated PROI335 polypeptide, comprising the sequence of amino acid residues 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID NO:423), or a fragment thereof sufficient to provide a binding site for an anti-PR01335 antibody. Preferably, the PR01335 fragment retains a qualitative biological activity of a native PR01335 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 25 molecule under stringent conditions with (a) a DNA molecule encoding a PR01335 polypeptide having the sequence of amino acid residues from about 1 or about 16 to about 337, inclusive of Figure 242 (SEQ ID NO:423), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host 30 cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
In yet anotber embodiment, the invention concerns agonists and antagonists of a native PR01335 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PRO!335 antibody.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 35 native PR01335 polypeptide by contacting the native PR01335 polypeptide with a candidate molecule and monitoring a biological activity mediated by said polypeptide.
In a still further embodiment, the invention concerns a composition comprising a PRO1335 polypeptide,
288
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuticaliy acceptable carrier. 122. PROl 329
A cDNA clone (DNA66660-1585) has been identified that encodes a novel polypeptide designated in the present application as "PR01329."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01329 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROI329 polypeptide having 10 the sequence of amino acid residues from 1 or about 17 to about 209, inclusive ofFigure 244 (SEQ ID N0.429), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01329 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 138 and about 716, inclusive, of Figure 243 (SEQ ID NO:428). Preferably, hybridization occurs under stringent 15 hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203279 20 (DNA66660-1585), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203279 (DNA66660-1585).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 25 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 17 to about 209, inclusive of Figure 244 (SEQ ID NO:429), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 30 under stringent conditions with (a) a DNA molecule encoding a PRO1329 polypeptide having the sequence of amino acid residues from about 17 to about 209, inclusive of Figure 244 (SEQ ID NO:429), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 35 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01329 polypeptide, with or without the N-tenninal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as
289
extending from amino acid position 1 through about amino acid position 16 in the sequence ofFigure 244 (SEQ ID NO:429).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 5 amino acid sequence of residues 17 to about 209, inclusive of Figure 244 (SEQ ID NO:429), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PRO 1329 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 10 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01329 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PRO1329 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 17 to 209 ofFigure 244 (SEQ ID NO:429). 15 In another aspect, the invention concerns an isolated PR01329 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 17 to about 209, inclusive of Figure 244 (SEQ ID N0:429).
In a further aspect, the invention concerns an isolated PR01329 polypeptide, comprising an amino acid 20 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 17 to 209 of Figure 244 (SEQ ID N0:429).
In yet another aspect, the invention concerns an isolated PRO1329 polypeptide, comprising the sequence of amino acid residues 17 to about 209, inclusive of Figure 244 (SEQ ID NO:429), or a fragment thereof 25 sufficient to provide a binding site for an anti-PR01329 antibody. Preferably, the PR01329 fragment retains a qualitative biological activity of a native PR01329 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01329 polypeptide having the sequence of amino acid residues from about 17 to about 209, inclusive of Figure 244 (SEQ ID N0:429), or (b) 30 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
123. PRQ1550
A cDNA clone (DNA76393-1664) has been identified that encodes a novel secreted polypeptide and
290
designated in the present application as "PR01550."
In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PR01550 polypeptide.
In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PR01550 polypeptide having the sequence of amino acid residues from 1 or about 31 to about 243, inclusive ofFigure 246 (SEQ ID NO:232), or (b) the complement of the DNA molecule of (a).
In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01550 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 228 and about 866, inclusive, of Figure 245 (SEQ ID NO:231). Preferably, hybridization occurs under stringent hybridization and wash conditions.
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203323 (DNA76393-1664), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC
1
Deposit No. 203323 (DNA76393-1664).
In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues from about 31 to about 243, inclusive of Figure 246 (SEQ ID NO:232), or the complement of the DNA of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01550 polypeptide having the sequence of amino acid residues from about 31 to about 243, inclusive of Figure 246 (SEQ ID NO:232), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule.
In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding a PRO 1550 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 30 in the sequence of Figure 246 (SEQ ID NO:232).
In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence of residues 31 to about 243, inclusive of Figure 246 (SEQ ID NO:232), or (b) the complement of the DNA of (a).
Another embodiment is directed to fragments of a PR01550 polypeptide coding sequence that may find use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length.
In another embodiment, the invention provides isolated PR01550 polypeptide encoded by any of the isolated nucleic acid sequences hereinabove defined.
In a specific aspect, the invention provides isolated native sequence PR01550 polypeptide, which in one embodiment, includes an amino acid sequence comprising residues 31 to 243 of Figure 246 (SEQ ID NO:232).
In another aspect, the invention concerns an isolated PRO1550 polypeptide, comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the sequence of amino acid residues 31 to about 243, inclusive of Figure 246 (SEQ ID NO:232).
In a further aspect, the invention concerns an isolated PR01550 polypeptide, comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence of residues 31 to 243 ofFigure 246 (SEQ ID NO:232).
In yet another aspect, the invention concerns an isolated PRO 1550 polypeptide, comprising the sequence of amino acid residues 31 to about 243, inclusive of Figure 246 (SEQ ID NO:232), or a fragment thereof sufficient to provide a binding site for an anti-PR01550 antibody. Preferably, the PR01550 fragment retains a qualitative biological activity of a native PRO 1550 polypeptide.
In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1550 polypeptide having the sequence of amino acid residues from about 31 to about 243, inclusive of Figure 246 (SEQ ID NO:232), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence identity to (a) or (b), (ii) culturing a host cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the polypeptide from the cell culture.
124. Additional Embodiments
In other embodiments of the present invention, the invention provides vectors comprising DNA encoding any of the herein described polypeptides. Host cell comprising any such vector are also provided. By way of example, the host cells may be CHO cells, E. coli, or yeast. A process for producing any of the herein described polypeptides is further provided and comprises culturing host cells under conditions suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell culture.
In other embodiments, the invention provides chimeric moleculescomprising any of the herein described polypeptides fused to a heterologous polypeptide or amino acid sequence. Example of such chimeric molecules comprise any of the herein described polypeptides fused to an epitope tag sequence or a Fc region of an immunoglobulin.
In another embodiment, the invention provides an antibody which specifically binds to any of the above 5 or below described polypeptides. Optionally, the antibody is a monoclonal antibody, humanized antibody, antibody fragment or single-chain antibody.
In yet other embodiments, the invention provides oligonucleotide probes useful for isolating genomic and cDNA nucleotide sequences, wherein those probes may be derived from any of the above or below described nucleotide sequences.
In other embodiments, the invention provides an isolated nucleic acid molecule comprising a nucleotide sequence that encodes a PRO polypeptide.
In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 15 84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identiiy, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet 20 more preferably at least about 94 % sequence identity, yet more preferably at least about 95 % sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to (a) a DNA molecule encoding a PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein or an extracellular 25 domain of a transmembrane protein, with or without the signal peptide, as disclosed herein, or (b) the complement of the DNA molecule of (a).
In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 30 84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88 % sequence identity, yet more preferably at least about 89 % sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 % sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet 35 more preferably at least about 94% sequence identity, yet more preferably at least about 95 % sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99%
293
sequence identity to (a) a DNA molecule comprising the coding sequence of a full-length PRO polypeptide cDNA as disclosed herein, the coding sequence of a PRO polypeptide lacking the signal peptide as disclosed herein or the coding sequence of an extracellular domain of a transmembrane PRO polypeptide, with or without the signal peptide, as disclosed herein, or (b) the complement of the DNA molecule of (a).
In a further aspect, the invention concerns an isolated nucleic acid molecule comprising a nucleotide 5 sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84 % sequence identity, yet more preferably at least about 85 % sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% 10 sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91 % sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95 % sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more 15 preferably at least about 99% sequence identity to (a) a DNA molecule that encodes the same mature polypeptide encoded by any of the human protein cDNAs deposited with the ATCC as disclosed herein, or (b) the complement of the DNA molecule of (a).
Another aspect the invention provides an isolated nucleic acid molecule comprising a nucleotide sequence encoding a PRO polypeptide which is either transmembrane domain-deleted or transmembrane domain-20 inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain(s) of such polypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein described PRO polypeptides are contemplated.
Another embodiment is directed to fragments of a PRO polypeptide coding sequence that may find use as, for example, hybridization probes or for encoding fragments of a PRO polypeptide that may optionally 25 encode a polypeptide comprising a binding site for an anti-PRO antibody. Such nucleic acid fragments are usually at least about 20 nucleotides in length, preferably at least about 30 nucleotides in length, more preferably at least about 40 nucleotides in length, yet more preferably at least about 50 nucleotides in length, yet more preferably at least about 60 nucleotides in length, yet more preferably at least about 70 nucleotides in length, yet more preferably at least about 80 nucleotides in length, yet more preferably at least about 90 nucleotides in 30 length, yet more preferably at least about 100 nucleotides in length, yet more preferably at least about 110 nucleotides in length, yet more preferably at least about 120 nucleotides in length, yet more preferably at least about 130 nucleotides in length, yet more preferably at least about 140 nucleotides in length, yet more preferably at least about 150 nucleotides in length, yet more preferably at least about 160 nucleotides in length, yet more preferably at least about 170 nucleotides in length, yet more preferably at least about 180 nucleotides in length, 35 yet more preferably at least about 190 nucleotides in length, yet more preferably at least about 200 nucleotides in length, yet more preferably at least about 250 nucleotides in length, yet more preferably at least about 300 nucleotides in length, yet more preferably at least about 350 nucleotides in length, yet more preferably at least
294
about 400 nucleotides in length, yet more preferably at least about 450 nucleotides in length, yet more preferably at least about 500 nucleotides in length, yet more preferably at least about 600 nucleotides in length, yet more preferably at least about 700 nucleotides in length, yet more preferably at least about 800 nucleotides in length, yet more preferably at least about 900 nucleotides in length and yet more preferably at least about 1000 nucleotides in length, wherein in this context the term "about" means the referenced nucleotide sequence length 5 plus or minus 10% of that referenced length. It is noted that novel fragments of a PRO polypeptide-encoding nucleotide sequence may be determined in a routine manner by aligning the PRO polypeptide-encoding nucleotide sequence with other known nucleotide sequences using any of a number of well known sequence alignment programs and determining which PRO polypeptide-encoding nucleotide sequence fragments) are novel. All of such PRO polypeptide-encoding nucleotide sequences are contemplated herein. Also contemplated are the PRO 10 polypeptide fragments encoded by these nucleotide molecule fragments, preferably those PRO polypeptide fragments that comprise a binding site for an anti-PRO antibody.
In another embodiment, the invention provides isolated PRO polypeptide encoded by any of the isolated nucleic acid sequences hereinabove identified.
In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid 15 sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% 20 sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95 % sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more 25 preferably at least about 99% sequence identity to a PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein or an extracellular domain of a transmembrane protein, with or without the signal peptide, as disclosed herein.
In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid sequence having at least about 80% sequence identity, preferably at least about 81 % sequence identity, more 30 preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 35 91 % sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least about 93 % sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at least about 95 % sequence identity, yet more preferably at least about 96 % sequence identity, yet more preferably
295
at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% sequence identity to an amino acid sequence encoded by any of the human protein cDNAs deposited with the ATCC as disclosed herein.
In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid sequence scoring at least about 80% positives, preferably at least about 81 % positives, more preferably at least 5 about 82% positives, yet more preferably at least about 83 % positives, yet more preferably at least about 84% positives, yet more preferably at least about 85 % positives, yet more preferably at least about 86% positives, yet more preferably at least about 87% positives, yet more preferably at least about 88% positives, yet more preferably at least about 89% positives, yet more preferably at least about 90% positives, yet more preferably at least about 91 % positives, yet more preferably at least about 92 % positives, yet more preferably at least about 10 93% positives, yet more preferably at least about 94% positives, yet more preferably at least about 95% positives, yet more preferably at least about 96% positives, yet more preferably at least about 97% positives, yet more preferably at least about 98% positives and yet more preferably at least about 99% positives when compared with the amino acid sequence of a PRO polypeptide having a full-length amino acid sequence as disclosed herein, an amino acid sequence lacking the signal peptide as disclosed herein or an extracellular domain 15 of a transmembrane protein, with or without the signal peptide, as disclosed herein.
In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal signal sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein those processes comprise culturing a host cell comprising a vector which comprises the appropriate encoding 20 nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO polypeptide from the cell culture.
Another aspect the invention provides an isolated PRO polypeptide which is either transmembrane domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 25 appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and recovering the PRO polypeptide from the cell culture.
In yet another embodiment, the invention concerns agonists and antagonists of a native PRO polypeptide as defined herein. In a particular embodiment, the agonist or antagonist is an anti-PRO antibody or a small molecule.
In a further embodiment, the invention concerns a method of identifying agonists or antagonists to a
PRO polypeptide which comprise contacting the PRO polypeptide with a candidate molecule and monitoring a biological activity mediated by said PRO polypeptide. Preferably, the PRO polypeptide is a native PRO polypeptide.
In a still further embodiment, the invention concerns a composition of matter comprising a PRO 35 polypeptide, or an agonist or antagonist of a PRO polypeptide as herein described, or an anti-PRO antibody, in combination with a carrier. Optionally, the carrier is a pharmaceuticaliy acceptable carrier.
Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an agonist
296
or antagonist thereof as hereinbefore described, or an anti-PRO antibody, for the preparation of a medicament useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or antagonist thereof or an anti-PRO antibody.
BRIEF DESCRIPTION OF THE DRAWINGS 5 Figure 1 shows a nucleotide sequence (SEQ ID NO:3) of anative sequence PR01560 (UNQ767) cDNA,
wherein SEQ ID NO:3 is a clone designated herein as "DNA19902-1669". The start and stop codons are shown in bold and underlined fcmt.
Figure 2 shows the amino acid sequence (SEQ ID NO:4) derived from the coding sequence of SEQ ID NO:3 shown in Figure 1.
Figure 3 shows a nucleotide sequence (SEQ ID NO:5) of a native sequence PR0444 (UNQ328) cDNA,
wherein SEQ ID NO:5 is a clone designated herein as "DNA26846-1397". The start and stop codons are shown in bold and underlined font.
Figure 4 shows the amino acid sequence (SEQ ID NO:6) derived from the coding sequence of SEQ ID NO:5 shown in Figure 3.
Figure 5 shows a nucleotide sequence (SEQ ID NO:7) of anative sequence PRO 1018 (UNQ501) cDNA,
wherein SEQ ID NO:7 is a clone designated herein as "DNA56107-1415". The start and stop codons are shown in bold and underlined font.
Figure 6 shows the amino acid sequence (SEQ ID NO:8) derived from the coding sequence of SEQ ID NO:7 shown in Figure 5.
Figure 7 shows a nucleotide sequence (SEQ ID NO:9) of a native sequence PR01773 (UNQ835) cDNA,
wherein SEQ ID NO:9 is a clone designated herein as "DNA56406-1704". The start and stop codons are shown in bold and underlined font.
Figure 8 shows the amino acid sequence (SEQ ID NO: 10) derived from the coding sequence of SEQ ID NO:9 shown in Figure 7.
Figure 9 shows a nucleotide sequence (SEQ ID NO: 11) of a native sequence PR01477 (UNQ747)
cDNA, wherein SEQ ID NO:ll is a clone designated herein as "DNA56529-1647". The start and stop codons are shown in bold and underlined font.
Figure 10 shows the amino acid sequence (SEQ ID NO: 12) derived from the coding sequence of SEQ ID NO: 11 shown in Figure 9.
Figure 11 shows a nucleotide sequence (SEQ ID NO: 16) of a native sequence PR01478 (UNQ748)
cDNA, wherein SEQ ID NO:16 is a clone designated herein as "DNA56531-1648". The start and stop codons are shown in bold and underlined font.
Figure 12 shows the amino acid sequence (SEQ ID NO: 17) derived from the coding sequence of SEQ ID NO:16 shown in Figure 11.
Figure 13 shows a nucleotide sequence (SEQ ID NO:21) of a native sequence PR0831 (UNQ471)
cDNA, wherein SEQ ID NO:21 is a clone designated herein as "DNA56862-1343". The start and stop codons are shown in bold and underlined font.
297
Figure 14 shows the amino acid sequence (SEQ ID NO:22) derived from the coding sequence of SEQ ID NO:21 shown in Figure 13.
Figure 15 shows a nucleotide sequence (SEQ ID NO:23) of a native sequence PROl 113 (UNQ556) cDNA, wherein SEQ ID NO:23 is a clone designated herein as "DNA57254-1477". The start and stop codons axe shown in bold and underlined font.
Figure 16 shows the amino acid sequence (SEQ ID NO:24) derived from the coding sequence of SEQ
ID NO:23 shown in Figure 15.
Figure 17 shows a nucleotide sequence (SEQ ID NO:28) of a native sequence PROl 194 (UNQ607) cDNA, wherein SEQ ID NO:28 is a clone designated herein as "DNA57841-1522". The start and stop codons are shown in bold and underlined font.
Figure 18 shows the amino acid sequence (SEQ ID NO:29) derived from the coding sequence of SEQ
ID NO:28 shown in Figure 17.
Figure 19 shows a nucleotide sequence (SEQ ID N0:30) of a native sequence PROlllO (UNQ553) cDNA, wherein SEQ ID N0:30 is a clone designated herein as "DNA58727-1474". The start and stop codons are shown in bold and underlined font.
Figure 20 shows the amino acid sequence (SEQ ID NO:31) derived from the coding sequence of SEQ
ID N0:30 shown in Figure 19.
Figure 21 shows a nucleotide sequence (SEQ ID NO:32) of a native sequence PR01378 (UNQ715) cDNA, wherein SEQ ID NO:32 is a clone designated herein as "DNA58730-1607". The start and stop codons are shown in bold and underlined font.
Figure 22 shows the amino acid sequence (SEQ ID NO:33) derived from the coding sequence of SEQ
ID NO:32 shown in Figure 21.
Figure 23 shows a nucleotide sequence (SEQ ID N0:40) of a native sequence PR01481 (UNQ750) cDNA, wherein SEQ ID N0:40 is a clone designated herein as "DNA58732-1650". The start and stop codons are shown in bold and underlined font.
Figure 24 shows the amino acid sequence (SEQ ID NO:41) derived from the coding sequence of SEQ
ID N0:40 shown in Figure 23.
Figure 25 shows a nucleotide sequence (SEQ ID N0:42) of a native sequence PROl 189 (UNQ603) cDNA, wherein SEQ ID NO:42 is a clone designated herein as "DNA58828-1519". The start and stop codons are shown in bold and underlined font.
Figure 26 shows the amino acid sequence (SEQ ID NO:43) derived from the coding sequence of SEQ
ID NO:42 shown in Figure 25.
Figure 27 shows a nucleotide sequence (SEQ ID NO:49) of a native sequence PR01415 (UNQ731) cDNA, wherein SEQ ID N0:49 is a clone designated herein as "DNA58852-1637". The start and stop codons are shown in bold and underlined font.
Figure 28 shows the amino acid sequence (SEQ ID N0:50) derived from the coding sequence of SEQ
ID N0:49 shown in Figure 27.
298
Figure 29 shows a nucleotide sequence (SEQ ID NO:51) of a native sequence PR01411 (UNQ729) cDNA, wherein SEQ ID NO:5I is a clone designated herein as "DNA59212-I627". The start and stop codons are shown in bold and underlined font.
Figure 30 shows the amino acid sequence (SEQ ID NO:52) derived from the coding sequence of SEQ ID NO:5I shown in Figure 29.
Figure 31 shows a nucleotide sequence (SEQ ID NO:53) of a native sequence PRO1295 (UNQ664)
cDNA, wherein SEQ ID NO:53 is a clone designated herein as "DNA59218-1559". The start and stop codons are shown in bold and underlined font.
Figure 32 shows the amino acid sequence (SEQ ID NO:54) derived from the coding sequence of SEQ ID NO:53 shown in Figure 31.
Figure 33 shows a nucleotide sequence (SEQ ID NO:55) of a native sequence PR01359 (UNQ708)
cDNA, wherein SEQ ID N0:55 is a clone designated herein as "DNA59219-1613". The start and stop codons are shown in bold and underlined font.
Figure 34 shows the amino acid sequence (SEQ ID NO:56) derived from the coding sequence of SEQ ID NO:55 shown in Figure 33.
Figure 35 shows a nucleotide sequence (SEQ ID NO:57) of a native sequence PROl 190 (UNQ604)
cDNA, wherein SEQ ID NO:57 is a clone designated herein as "DNA59586-1520". The start and stop codons are shown in bold and underlined font.
Figure 36 shows the amino acid sequence (SEQ ID NO:58) derived from the coding sequence of SEQ ID NO:57 shown in Figure 35.
Figure 37 shows a nucleotide sequence (SEQ ID NO:62) of a native sequence PR01772 (UNQ834)
cDNA, wherein SEQ ID NO:62 is a clone designated herein as "DNA59817-1703". The start and stop codons are shown in bold and underlined font.
Figure 38 shows the amino acid sequence (SEQ ID NO:63) derived from the coding sequence of SEQ ID NO:62 shown in Figure 37.
Figure 39 shows a nucleotide sequence (SEQ ID NO:67) of a native sequence PR01248 (UNQ631)
cDNA, wherein SEQ ID N0:67 is a clone designated herein as "DNA60278-1530". The start and stop codons are shown in bold and underlined font.
Figure 40 shows the amino acid sequence (SEQ ID NO :68) derived from the coding sequence of SEQ ID NO:67 shown in Figure 39.
Figure 41 shows a nucleotide sequence (SEQ ID NO:69) of a native sequence PR01316 (UNQ682)
cDNA, wherein SEQ ID NO:69 is a clone designated herein as "DNA60608-1577". The start and stop codons are shown in bold and underlined font.
Figure 42 shows the amino acid sequence (SEQ ID N0:70) derived from the coding sequence of SEQ ID NO:69 shown in Figure 41.
Figure 43 shows a nucleotide sequence (SEQ ID NO:71) of a native sequence PROl 197 (UNQ610)
cDNA, wherein SEQ ID NO:71 is a clone designated herein as "DNA60611-1524". The start and stop codons are shown in bold and underlined font.
299
PCT/US99/201H
Figure 44 shows the amino acid sequence (SEQ ID NO:72) derived from the coding sequence of SEQ ID NO:71 shown in Figure 43.
Figure 45 shows a nucleotide sequence (SEQ ID NO:76) of a native sequence PR01293 (UNQ662) cDNA, wherein SEQ ID NO:76 is a clone designated herein as "DNA60618-1557". The start and stop codons are shown in bold and underlined font.
Figure 46 shows the amino acid sequence (SEQ ID NO:77) derived from the coding sequence of SEQ
ID NO:76 shown in Figure 45.
Figure 47 shows a nucleotide sequence (SEQ ID NO:78) of a native sequence PR01380 (UNQ717) cDNA, wherein SEQ ID NO:78 is a clone designated herein as "DNA60740-1615". The start and stop codons are shown in bold and underlined font.
Figure 48 shows the amino acid sequence (SEQ ID NO:79) derived from the coding sequence of SEQ
ID NO:78 shown in Figure 47.
Figure 49 shows a nucleotide sequence (SEQ ID NO:83) of a native sequence PRO1265 (UNQ636) cDNA, wherein SEQ ID NO:83 is a clone designated herein as "DNA60764-1533". The start and stop codons are shown in bold and underlined font.
Figure 50 shows the amino acid sequence (SEQ ID NO:84) derived from the coding sequence of SEQ
ID NO:83 shown in Figure 49.
Figure 51 shows a nucleotide sequence (SEQ ID NO:85) of a native sequence PRO 1250 (UNQ633) cDNA, wherein SEQ ID NO:85 is a clone designated herein as "DNA60775-1532". The start and stop codons are shown in bold and underlined font.
Figure 52 shows the amino acid sequence (SEQ ID NO:86) derived from the coding sequence of SEQ
ID NO:85 shown in Figure 51.
Figure 53 shows a nucleotide sequence (SEQ ID NO:87) of a native sequence PRO1475 (UNQ746) cDNA, wherein SEQ ID NO:87 is a clone designated herein as "DNA61185-1646". The start and stop codons are shown in bold and underlined font.
Figure 54 shows the amino acid sequence (SEQ ID NO:88) derived from the coding sequence of SEQ
ID NO:87 shown in Figure 53.
Figure 55 shows a nucleotide sequence (SEQ ID NO:94) of a native sequence PR01377 (UNQ714) cDNA, wherein SEQ ID NO:94 is a clone designated herein as "DNA61608-1606". The start and stop codons are shown in bold and underlined font.
Figure 56 shows the amino acid sequence (SEQ ID NO:95) derived from the coding sequence of SEQ
ID NO:94 shown in Figure 55.
Figure 57 shows a nucleotide sequence (SEQ ID NO:99) of a native sequence PR01326 (UNQ686) cDNA, wherein SEQ ID NO:99 is a clone designated herein as "DNA62808-1582". The start and stop codons are shown in bold and underlined font.
Figure 58 shows the amino acid sequence (SEQ ID NO: 100) derived from the coding sequence of SEQ
ID NO:99 shown in Figure 57.
300
Figure 59 shows a nucleotide sequence (SEQ ID NO: 101) of a native sequence PR01249 (UNQ632) cDNA, wherein SEQ IDNO.lOl is a clone designated herein as "DNA62809-1531". The start and stop codons are shown in bold and underlined font.
Figure 60 shows the amino acid sequence (SEQ ID NO: 102) derived from the coding sequence of SEQ ID NO.IOO shown in Figure 59.
Figure 61 shows a nucleotide sequence (SEQ ID NO: 103) of a native sequence PR01315 (UNQ681)
cDNA, wherein SEQ ID NO: 103 is a clone designated herein as "DNA62815-1578". The start and stop codons are shown in bold and underlined font.
Figure 62 shows the amino acid sequence (SEQ ID NO: 104) derived from the coding sequence of SEQ ID NO: 103 shown in Figure 61.
Figure 63 shows a nucleotide sequence (SEQ ID NO:l 10) of a native sequence PR01549 (UNQ782)
cDNA, wherein SEQ ID NO:l 10 is a clone designated herein as "DNA62845-1684". The start and stop codons are shown in bold and underlined font.
Figure 64 shows the amino acid sequence (SEQ ID NO: 111) derived from the coding sequence of SEQ ID NO:l 10 shown in Figure 63.
Figure 65 shows a nucleotide sequence (SEQ ID NO: 115) of a native sequence PR01430 (UNQ736)
cDNA, wherein SEQ ID NO: 115 is a clone designated herein as "DNA64842-1632". The start and stop codons are shown in bold and underlined font.
Figure 66 shows the amino acid sequence (SEQ ID NO: 116) derived from the coding sequence of SEQ ID NO: 115 shown in Figure 65.
Figure 67 shows a nucleotide sequence (SEQ ID NO: 117) of a native sequence PR01374 (UNQ711)
cDNA, wherein SEQ ID NO: 117 is a clone designated herein as "DNA64849-1604". The start and stop codons are shown in bold and underlined font.
Figure 68 shows the amino acid sequence (SEQ ID NO: 118) derived from the coding sequence of SEQ ID NO: 117 shown in Figure 67.
Figure 69 shows a nucleotide sequence (SEQ ID NO: 122) of a native sequence PRO 1311 (UNQ677)
cDNA, wherein SEQ ID NO: 122 is a clone designated herein as "DNA64863-1573". The start and stop codons are shown in bold and underlined font.
Figure 70 shows the amino acid sequence (SEQ ID NO: 123) derived from the coding sequence of SEQ ID NO: 122 shown in Figure 69.
Figure 71 shows a nucleotide sequence (SEQ ID NO: 127) of a native sequence PRO 1357 (UNQ706)
cDNA, wherein SEQ ID NO: 127 is a clone designated herein as "DNA64881-1602". The start and stop codons are shown in bold and underlined font.
Figure 72 shows the amino acid sequence (SEQ ID NO: 128) derived from the coding sequence of SEQ ID NO: 127 shown in Figure 71.
Figure 73 shows a nucleotide sequence (SEQ ID NO: 129) of a native sequence PR01244 (UNQ628)
cDNA, wherein SEQ ID NO: 129 is a clone designated herein as "DNA64883-1526". The start and stop codons are shown in bold and underlined font.
301
Figure 74 shows the amino acid sequence (SEQ ID NO: 130) derived from the coding sequence of SEQ ID NO: 129 shown in Figure 73.
Figure 75 shows a nucleotide sequence (SEQ ID NO: 131) of a native sequence PRO1246 (UNQ630) cDNA, wherein SEQ ID NO: 131 is a clone designated herein as "DNA64885-1529". The start and stop codons are shown in bold and underlined font.
Figure 76 shows the amino acid sequence (SEQ ID NO: 132) derived from the coding sequence of SEQ
ID NO:131 shown in Figure 75.
Figure 77 shows a nucleotide sequence (SEQ ID NO: 133) of a native sequence PRO 1356 (UNQ705) cDNA, wherein SEQ ID NO: 133 is a clone designated herein as "DNA64886-1601". The start and stop codons are shown in bold and underlined font.
Figure 78 shows the amino acid sequence (SEQ ID NO: 134) derived from the coding sequence of SEQ
ID NO: 133 shown in Figure 77.
Figure 79 shows a nucleotide sequence (SEQ ID NO: 135) of a native sequence PR01275 (UNQ645) cDNA, wherein SEQ ID NO: 135 is a clone designated herein as "DNA64888-1542". The start and stop codons are shown in bold and underlined font.
Figure 80 shows the amino acid sequence (SEQ ID NO: 136) derived from the coding sequence of SEQ
ID NO: 135 shown in Figure 79.
Figure 81 shows a nucleotide sequence (SEQ ID NO: 137) of a native sequence PR01274 (UNQ644) cDNA, wherein SEQ ID NO: 137 is a clone designated herein as "DNA64889-1542". The start and stop codons are shown in bold and underlined font.
Figure 82 shows the amino acid sequence (SEQ ID NO: 138) derived from the coding sequence of SEQ
ID NO: 137 shown in Figure 81.
Figure 83 shows a nucleotide sequence (SEQ ID NO: 139) of a native sequence PR01412 (UNQ730) cDNA, wherein SEQ ID NO:139 is a clone designated herein as "DNA64897-1628". The start and stop codons are shown in bold and underlined font.
Figure 84 shows the amino acid sequence (SEQ ID NO: 140) derived from the coding sequence of SEQ
ID NO: 139 shown in Figure 83.
Figure 85 shows a nucleotide sequence (SEQ ID NO: 141) of a native sequence PR01557 (UNQ765) cDNA, wherein SEQ ID NO: 141 is a clone designated herein as "DNA64902-1667". The start and stop codons are shown in bold and underlined font.
Figure 86 shows the amino acid sequence (SEQ ID NO: 142) derived from the coding sequence of SEQ
ID NO: 141 shown in Figure 85.
Figure 87 shows a nucleotide sequence (SEQ ID NO: 143) of a native sequence PRO 1286 (UNQ655) cDN A, wherein SEQ ID NO: 143 is a clone designated herein as "DNA64903-1553". The start and stop codons are shown in bold and underlined font.
Figure 88 shows the amino acid sequence (SEQ ID NO: 144) derived from the coding sequence of SEQ
ID NO: 143 shown in Figure 87.
302
Figure 89 shows a nucleotide sequence (SEQ ID NO: 145) of a native sequence PR01294 (UNQ663) cDNA, wherein SEQ ID NO: 145 is a clone designated herein as "DNA64905-1558". The start and stop codons are shown in bold and underlined font.
Figure 90 shows the amino acid sequence (SEQ ID NO: 146) derived from the coding sequence of SEQ ID NO: 145 shown in Figure 89.
Figure 91 shows a nucleotide sequence (SEQ ID NO: 147) of a native sequence PR01347 (UNQ702)
cDNA, wherein SEQ ID NO: 147 is a clone designated herein as "DNA64950-1590". The start and stop codons are shown in bold and underlined font.
Figure 92 shows the amino acid sequence (SEQ ID NO: 148) derived from the coding sequence of SEQ ID NO: 147 shown in Figure 91.
Figure 93 shows a nucleotide sequence (SEQ ID NO: 152) of a native sequence PR01305 (UNQ671)
cDNA, wherein SEQ ID NO: 152 is a clone designated herein as "DNA64952-1568". The start and stop codons are shown in bold and underlined font.
Figure 94 shows the amino acid sequence (SEQ ID NO: 153) derived from the coding sequence of SEQ ID NO: 152 shown in Figure 93.
Figure 95 shows a nucleotide sequence (SEQ ID NO: 157) of a native sequence PRO1273 (UNQ643)
cDNA, wherein SEQ ID NO: 157 is a clone designated herein as "DNA65402-1540". The start and stop codons are shown in bold and underlined font.
Figure 96 shows the amino acid sequence (SEQ ID NO: 158) derived from the coding sequence of SEQ ID NO: 157 shown in Figure 95.
Figure 97 shows a nucleotide sequence (SEQ ID NO: 159) of a native sequence PR01302 (UNQ668)
cDNA, wherein SEQ ID NO: 159 is a clone designated herein as "DNA65403-1565". The start and stop codons are shown in bold and underlined font.
Figure 98 shows the amino acid sequence (SEQ ID NO: 160) derived from the coding sequence of SEQ ID NO: 159 shown in Figure 97.
Figure 99 shows a nucleotide sequence (SEQ ID NO: 161) of a native sequence PR01283 (UNQ653)
cDNA, wherein SEQ ID NO: 161 is a clone designated herein as "DNA65404-1551". The start and stop codons are shown in bold and underlined font.
Figure 100 shows the amino acid sequence (SEQ ID NO: 162) derived from the coding sequence of SEQ ID NO: 161 shown in Figure 99.
Figure 101 shows a nucleotide sequence (SEQ ID NO: 169) of a native sequence PR01279 (UNQ649)
cDNA, wherein SEQ ID NO: 169 is a clone designated herein as "DNA65405-1547". The start and stop codons are shown in bold and underlined font.
Figure 102 shows the amino acid sequence (SEQ ID NO: 170) derived from the coding sequence of SEQ ID NO: 169 shown in Figure 101.
Figure 103 shows a nucleotide sequence (SEQ ID NO: 179) of a native sequence PR01304 (UNQ670)
cDNA, wherein SEQ ID NO: 179 is a clone designated herein as "DNA65406-1567". The start and stop codons are shown in bold and underlined font.
303
Figure 104 shows the amino acid sequence (SEQ ID NO: 180) derived from the coding sequence of SEQ ID NO: 179 shown in Figure 103.
Figure 105 shows a nucleotide sequence (SEQ ID NO: 188) of a native sequence PRO 1317 (UNQ683) cDNA, wherein SEQ ID NO: 188 is a clone designated herein as "DNA65408-1578". The start and stop codons are shown in bold and underlined font.
Figure 106 shows the amino acid sequence (SEQ ID NO: 189) derived from the coding sequence of SEQ
ID NO:188 shown in Figure 105.
Figure 107 shows a nucleotide sequence (SEQ ID NO: 193) of a native sequence PR01303 (UNQ669) cDNA, wherein SEQ ID NO: 193 is a clone designated herein as "DNA65409-1566". The start and stop codons are shown in bold and underlined font.
Figure 108showstheaminoacidsequence(SEQIDNO:194)derivedfromthecodingsequenceofSEQ
ID NO:193 shown in Figure 107.
Figure 109 shows a nucleotide sequence (SEQ ID NO-.195) of a native sequence PR013Q6 (UNQ672) cDNA, wherein SEQ ID NO: 195 is a clone designated herein as "DNA65410-1569". The start and stop codons are shown in bold and underlined font.
Figure 110 shows the amino acid sequence (SEQ ID NO: 196) derived from the coding sequence of SEQ
ID NO: 195 shown in Figure 109.
Figures 111A-B show a nucleotide sequence (SEQ ID NO: 197) of a native sequence PR01336 (UNQ691) cDNA, wherein SEQ ID NO: 197 is a clone designated herein as "DNA65423-1595". The start and stop codons are shown in bold and underlined font.
Figure 112 shows the amino acid sequence (SEQ ID NO: 198) derived from the coding sequence of SEQ
ID NO: 198 shown in Figures 111A-B.
Figure 113 shows a nucleotide sequence (SEQ ID N0:202) of a native sequence PR01278 (UNQ648) cDNA, wherein SEQ ID N0:202 is a clone designated herein as "DNA66304-1546". The start and stop codons are shown in bold and underlined font.
Figure 114 shows the amino acid sequence (SEQ ID N0:203) derived from the coding sequence of SEQ
ID N0:202 shown in Figure 113.
Figure 115 shows a nucleotide sequence (SEQ ID N0:209) of anative sequence PRO1298 (UNQ666) cDN A, wherein SEQ ID N0:209 is a clone designated herein as "DN A66511-1563". The start and stop codons are shown in bold and underlined font.
Figure 116 shows the amino acid sequence (SEQ ID N0:210) derived from the coding sequence of SEQ
ID N0:209 shown in Figure 115.
Figure 117 shows a nucleotide sequence (SEQ ID NO:211) of a native sequence PROI301 (UNQ667) cDNA, wherein SEQ ID NO:211 is a clone designated herein as "DNA66512-1564". The start and stop codons are shown in bold and underlined font.
Figure 118 shows the amino acid sequence (SEQ ID NO:212) derived from the coding sequence of SEQ
ID NO:211 shown in Figure 117.
304
Figure 119 shows a nucleotide sequence (SEQ ID NO:213) of a native sequence PR01268 (UNQ638) cDNA, wherein SEQ ID NO:213 is a clone designated herein as "DNA66519-1535". The start and stop codons are shown in bold and underlined font.
Figure 120 shows the amino acid sequence (SEQ ID NO:214) derived from the coding sequence of SEQ ID NO:213 shown in Figure 119.
Figure 121 shows a nucleotide sequence (SEQ ID NO:215) of a native sequence PR01269 (UNQ639)
cDNA, wherein SEQ ID NO:215 is a clone designated herein as "DNA66520-1536". The start and stop codons are shown in bold and underlined font.
Figure 122 shows the amino acid sequence (SEQ ID NO:216) derived from the coding sequence of SEQ ID NO:215 shown in Figure 121.
Figure 123 shows a nucleotide sequence (SEQ ID NO:217) of a native sequence PR01327 (UNQ687)
cDNA, wherein SEQ ID NO:217 is a clone designated herein as "DNA66521-1583". The start and stop codons are shown in bold and underlined font.
Figure 124 shows the amino acid sequence (SEQ ID NO:218) derived from the coding sequence of SEQ ID NO:217 shown in Figure 123.
Figure 125 shows a nucleotide sequence (SEQ ID NO:219) of a native sequence PR01382 (UNQ718)
cDNA, wherein SEQ ID NO:219 is a clone designated herein as "DNA66526-1616". The start and stop codons are shown in bold and underlined font.
Figure 126 shows the amino acid sequence (SEQ ID N0:220) derived from the coding sequence of SEQ ID NO:219 shown in Figure 125.
Figure 127 shows a nucleotide sequence (SEQ ID NO:224) of a native sequence PR01328 (UNQ688)
cDNA, wherein SEQ ID NO:224 is a clone designated herein as "DNA66658-1584". The start and stop codons are shown in bold and underlined font.
Figure 128 shows the amino acid sequence (SEQ ID NO:225) derived from the coding sequence of SEQ ID NO:224 shown in Figure 127.
Figure 129 shows a nucleotide sequence (SEQ ID NO:226) of a native sequence PR01325 (UNQ685)
cDNA, wherein SEQ ID NO;226 is a clone designated herein as "DNA66659-1593". The start and stop codons are shown in bold and underlined font.
Figure 130 shows the amino acid sequence (SEQ ID NO:227) derived from the coding sequence of SEQ ID NO:226 shown in Figure 129.
Figure 131 shows a nucleotide sequence (SEQ ID N0.228) of a native sequence PR01340 (UNQ695)
cDNA, wherein SEQ ID NO:228 is a clone designated herein as "DNA66663-1598". The start and stop codons are shown in bold and underlined font.
Figure 132 shows the amino acid sequence (SEQ ID N0.229) derived from the coding sequence of SEQ ID NO:228 shown in Figure 131.
Figure 133 shows a nucleotide sequence (SEQ ID NO:233) of a native sequence PR01339 (UNQ694)
cDNA, wherein SEQ ID NO:233 is a clone designated herein as "DNA66669-1597". The start and stop codons are shown in bold and underlined font.
305
Figure 134 shows the amino acid sequence (SEQ ID NO:234) derived from the coding sequence of SEQ ID NO:233 shown in Figure 133.
Figure 135 shows a nucleotide sequence (SEQ ID NO:235) of a native sequence PR01337 (UNQ692) cDNA, wherein SEQ ID NO:235 is a clone designated herein as "DNA66672-1586". The start and stop codons are shown in bold and underlined font.
Figure 136 shows the amino acid sequence (SEQ ID NO :236) derived from the coding sequence of SEQ
ID NO:235 shown in Figure 135.
Figure 137 shows a nucleotide sequence (SEQ ID NO:242) of a native sequence PR01342 (UNQ697) cDNA, wherein SEQ ID NO:242 is a clone designated herein as "DNA66674-1599". The start and stop codons , are shown in bold and underlined font.
Figure 138 shows the amino acid sequence (SEQ ID NO:243) derived from the coding sequence of SEQ
ID NO:242 shown in Figure 137.
Figure 139 shows a nucleotide sequence (SEQ ID NO:247) of a native sequence PR01343 (UNQ698) cDNA, wherein SEQ ID NO:247 is a clone designated herein as "DN A66675-1587". The start and stop codons are shown in bold and underlined font.
Figure 140 shows the amino acid sequence (SEQ ID NO:248) derived from the coding sequence of SEQ
ID NO:247 shown in Figure 139.
Figure 141 shows a nucleotide sequence (SEQ ID NO:252) of a native sequence PRO1480 (UNQ749) cDNA, wherein SEQ ID NO:252 is a clone designated herein as "DNA67962-1649". The start and stop codons are shown in bold and underlined font.
Figure 142 shows the amino acid sequence (SEQ ID NO:253) derived from the coding sequence of SEQ
ID NO:252 shown in Figure 141.
Figures 143A-B show a nucleotide sequence (SEQ ID NO:259) of a native sequence PR01487 (UNQ756) cDNA, wherein SEQ ID NO:259 is a clone designated herein as "DNA68836-1656". The start and stop codons are shown in bold and underlined font.
Figure 144 shows the amino acid sequence (SEQ ID N0:260) derived from the coding sequence of SEQ
ID NO:259 shown in Figures 143A-B.
Figure 145 shows a nucleotide sequence (SEQ ID NO:264) of anative sequence PR01418 (UNQ732) cDNA, wherein SEQ ID NO:264 is a clone designated herein as "DNA68864-1629". The start and stop codons are shown in bold and underlined font.
Figure 146 shows the amino acid sequence (SEQ ID NO-.265) derived from the coding sequence of SEQ
ID NO:264 shown in Figure 145.
Figure 147 shows a nucleotide sequence (SEQ ID NO:266) of a native sequence PRO1472 (UNQ744) cDNA, wherein SEQ ID NO:266 is a clone designated herein as "DNA68866-1644". The start and stop codons are shown in bold and underlined font.
Figure 148 shows the amino acid sequence (SEQ ID NO:267) derived from the coding sequence of SEQ
ID NO:266 shown in Figure 147.
306
Figure 149 shows a nucleotide sequence (SEQ ID NO:268) of a native sequence PR01461 (UNQ742) cDNA, wherein SEQ ID NO:268 is a clone designated herein as "DNA68871 -1638". The start and stop codons are shown in bold and underlined font.
Figure 150 shows the amino acid sequence (SEQ ID NO:269) derived from the coding sequence of SEQ ID NO:268 shown in Figure 149.
Figure 151 shows a nucleotide sequence (SEQ ID N0:270) of a native sequence PR01410 (UNQ728)
cDNA, wherein SEQ ID N0:270 is a clone designated herein as "DNA68874-1622". The start and stop codons are shown in bold and underlined font.
Figure 152 shows the amino acid sequence (SEQ ID NO:271) derived from the coding sequence of SEQ ID N0:270 shown in Figure 151.
Figure 153 shows a nucleotide sequence (SEQ ID NO:272) of a native sequence PR01568 (UNQ774)
cDNA, wherein SEQ ID NO:272 is a clone designated herein as "DNA68880-1676". The start and stop codons are shown in bold and underlined font.
Figure 154 shows the amino acid sequence (SEQ ID NO:273) derived from the coding sequence of SEQ ID NO:272 shown in Figure 153.
Figure 155 shows a nucleotide sequence (SEQ ID NO:274) of a native sequence PRO1570 (UNQ776)
cDNA, wherein SEQ ID NO:274 is a clone designated herein as "DNA68885-1678". The start and stop codons are shown in bold and underlined font.
Figure 156 shows the amino acid sequence (SEQ ID NO:275) derived from the coding sequence of SEQ ID NO:274 shown in Figure 155.
Figure 157 shows a nucleotide sequence (SEQ ID NO:276) of a native sequence PR01317 (UNQ783)
cDNA, wherein SEQ ID NO:276 is a clone designated herein as "DNA71166-1685". The start and stop codons are shown in bold and underlined font.
Figure 158 shows the amino acid sequence (SEQ ID NO:277) derived from the coding sequence of SEQ ID NO:276 shown in Figure 157.
Figure 159 shows a nucleotide sequence (SEQ ID NO:281) of a native sequence PR01780 (UNQ842)
cDNA, wherein SEQ ID N0:281 is a clone designated herein as "DNA71169-I709". The start and stop codons are shown in bold and underlined font.
Figure 160 shows the amino acid sequence (SEQ ID NO:282) derived from the coding sequence of SEQ ID NO:281 shown in Figure 159.
Figure 161 shows a nucleotide sequence (SEQ ID NO:286) of a native sequence PR01486 (UNQ755)
cDNA, wherein SEQ ID NO:286 is a clone designated herein as "DNA71180-1655". The start and stop codons are shown in bold and underlined font.
Figure 162 shows the amino acid sequence (SEQ ID NO:287) derived from the coding sequence of SEQ ID NO:286 shown in Figure 161.
Figure 163 shows a nucleotide sequence (SEQ ID NO:291) of a native sequence PR01433 (UNQ738)
cDNA, wherein SEQ ID NO:291 is a clone designated herein as "DNA71184-1634". The start and stop codons are shown in bold and underlined font.
307
Figure 164 shows the amino acid sequence (SEQ ID NO:292) derived from the coding sequence of SEQ ID NO:29I shown in Figure 163.
Figure 165 shows a nucleotide sequence (SEQ ID NO:296) of a native sequence PR01490 (UNQ759) cDNA, wherein SEQ ID NO:296 is a clone designated herein as "DNA71213-1659". The start and stop codons are shown in bold and underlined font.
Figure 166 shows the amino acid sequence (SEQ ID NO:297) derived from the coding sequence of SEQ
ID NO:296 shown in Figure 165.
Figure 167 shows a nucleotide sequence (SEQ ID N0:301) of a native sequence PR01482 (UNQ751) cDNA, wherein SEQ ID N0:301 is a clone designated herein as "DNA71234-1651". The start and stop codons are shown in bold and underlined font.
Figure 168 shows the amino acid sequence (SEQ ID N0:302) derived from the coding sequence of SEQ
ID N0:301 shown in Figure 167.
Figure 169 shows a nucleotide sequence (SEQ ID N0:303) of a native sequence PR01446 (UNQ740) cDNA, wherein SEQ IDNO:303 is a clone designated herein as "DNA71277-1636". The start and stop codons are shown in bold and underlined font.
Figure 170 shows the amino acid sequence (SEQ ID N0:304) derived from the coding sequence of SEQ
ID N0:303 shown in Figure 169.
Figure 171 shows a nucleotide sequence (SEQ ID N0:305) of a native sequence PR01558 (UNQ766) cDNA, wherein SEQ ID N0:305 is a clone designated herein as "DNA71282-1668". The start and stop codons are shown in bold and underlined font.
Figure 172 shows the amino acid sequence (SEQ ID N0:306) derived from the coding sequence of SEQ
ID N0:305 shown in Figure 171.
Figure 173 shows a nucleotide sequence (SEQ ID N0:307) of a native sequence PR01604 (UNQ785) cDNA, wherein SEQ ID N0:307 is a clone designated herein as "DNA71286-1687". The start and stop codons are shown in bold and underlined font.
Figure 174 shows the amino acid sequence (SEQ ID N0:308) derived from the coding sequence of SEQ
ID N0:307 shown in Figure 173.
Figure 175 shows a nucleotide sequence (SEQ ID N0:309) of a native sequence PR01491 (UNQ760) cDNA, wherein SEQ ID N0:309 is a clone designated herein as "DNA71883-1660". The start and stop codons are shown in bold and underlined font.
Figure 176 shows the amino acid sequence (SEQ ID NO:310) derived from the coding sequence of SEQ
ID N0:309 shown in Figure 175.
Figure 177 shows a nucleotide sequence (SEQ ID NO:314) of a native sequence PR01431 (UNQ737) cDNA, wherein SEQ ID NO:314 is a clone designated herein as "DNA73401-1633". The start and stop codons are shown in bold and underlined font.
Figure 178 shows the amino acid sequence (SEQ ID NO:315) derived from the coding sequence of SEQ
ID NO:314 shown in Figure 177.
308
Figures 179A-B show a nucleotide sequence (SEQ ID NO:316) of a native sequence PR01563 (UNQ769) cDNA, wherein SEQ ID NO:316 is a clone designated herein as "DNA73492-1671". The start and stop codons are shown in bold and underlined font.
Figure 180 shows the amino acid sequence (SEQ ID NO:317) derived from the coding sequence of SEQ ID N0.316 shown in Figures 179A-B.
Figure 181 shows a nucleotide sequence (SEQ ID NO:321) of a native sequence PR01565 (UNQ771)
cDNA, wherein SEQ ID NO:32I is a clone designated herein as "DNA73727-1673". The start and stop codons are shown in bold and underlined font.
Figure 182 shows the amino acid sequence (SEQ ID NO:322) derived from the coding sequence of SEQ ID NO:321 shown in Figure 181.
Figure 183 shows a nucleotide sequence (SEQ ID NO:323) of a native sequence PR01571 (UNQ777)
cDNA, wherein SEQ ID NO:323 is a clone designated herein as "DNA73730-1679". The start and stop codons are shown in bold and underlined font.
Figure 184 shows the amino acid sequence (SEQ ID NO:324) derived from the coding sequence of SEQ ID NO:323 shown in Figure 183.
Figure 185 shows a nucleotide sequence (SEQ ID NO:325) of a native sequence PR01572 (UNQ778)
cDNA, wherein SEQ ID NO:325 is a clone designated herein as "DNA73734-1680". The start and stop codons are shown in bold and underlined font.
Figure 186 shows the amino acid sequence (SEQ ID NO:326) derived from the coding sequence of SEQ ID NO:325 shown in Figure 185.
Figure 187 shows a nucleotide sequence (SEQ ID NO:327) of a native sequence PR01573 (UNQ779)
cDNA, wherein SEQ ID NO:327 is a clone designated herein as "DN A73735-1681". The start and stop codons are shown in bold and underlined font.
Figure 188 shows the amino acid sequence (SEQ ID NO:328) derived from the coding sequence of SEQ ID NO:327 shown in Figure 187.
Figure 189 shows a nucleotide sequence (SEQ ID NO:329) of a native sequence PR01488 (UNQ757)
cDNA, wherein SEQ ID NO:329 is a clone designated herein as "DNA73736-1657". The start and stop codons are shown in bold and underlined font.
Figure 190 shows the amino acid sequence (SEQ ID N0:330) derived from the coding sequence of SEQ ID NO:329 shown in Figure 189.
Figure 191 shows a nucleotide sequence (SEQ ID NO:331) of a native sequence PR01489 (UNQ758)
cDNA, wherein SEQ ID NO:331 is a clone designated herein as "DNA73737-1658". The start and stop codons are shown in bold and underlined font.
Figure 192 shows the amino acid sequence (SEQ ID NO:332) derived from the coding sequence of SEQ ID NO:331 shown in Figure 191.
Figure 193 shows a nucleotide sequence (SEQ ID NO:333) of a native sequence PR01474 (UNQ745)
cDNA, wherein SEQ ID NO:333 is a clone designated herein as "DNA73739-1645". The start and stop codons are shown in bold and underlined font.
309
Figure 194 shows the amino acid sequence (SEQ ID NO:334) derived from the coding sequence of SEQ ID NO:333 shown in Figure 193.
Figure 195 shows a nucleotide sequence (SEQ ID NO:335) of a native sequence PRO1508 (UNQ761) cDNA, wherein SEQ ID NO:335 is a clone designated herein as "DNA73742-1662". The start and stop codons are shown in bold and underlined font.
Figure 196 shows the amino acid sequence (SEQ ID NO:336) derived from the coding sequence of SEQ
ID NO:335 shown in Figure 195.
Figure 197 shows a nucleotide sequence (SEQ ID NO:337) of a native sequence PR01555 (UNQ763) cDNA, wherein SEQ ID NO:337 is a clone designated herein as "DNA73744-1665". The start and stop codons are shown in bold and underlined font.
Figure 198 shows the amino acid sequence (SEQ ID NO:338) derived from the coding sequence of SEQ
ID NO:337 shown in Figure 197.
Figure 199 shows a nucleotide sequence (SEQ ID NO:339) of a native sequence PR01485 (UNQ754) cDNA, wherein SEQ ID NO:339 is a clone designated herein as "DNA73746-1654". The start and stop codons are shown in bold and underlined font.
Figure 200 shows the amino acid sequence (SEQ ID N0:340) derived from the coding sequence of SEQ
ID NO:339 shown in Figure 199.
Figure 201 shows a nucleotide sequence (SEQ ID NO:346) of a native sequence PR01564 (UNQ770) cDNA, wherein SEQ ID NO:346 is a clone designated herein as "DNA73760-1672". The start and stop codons are shown in bold and underlined font.
Figure 202 shows the amino acid sequence (SEQ ID NO:347) derived from the coding sequence of SEQ
ID NO:346 shown in Figure 201.
Figure 203 shows a nucleotide sequence (SEQ ID NO:351) of a native sequence PRO1755 (UNQ828) cDNA, wherein SEQ ID NO:351 is a clone designated herein as "DNA76396-1698". The start and stop codons are shown in bold and underlined font.
Figure 204 shows die amino acid sequence (SEQ ID NO:352) derived from the coding sequence of SEQ
ID NO:351 shown in Figure 203.
Figure 205 shows a nucleotide sequence (SEQ ID NO:353) of a native sequence PR01757 (UNQ830) cDNA, wherein SEQ ID N0:353 is a clone designated herein as "DNA76398-1699". The start and stop codons axe shown in bold and underlined font.
Figure 206 shows the amino acid sequence (SEQ ID NO:354) derived from the coding sequence of SEQ
ID NO:353 shown in Figure 205.
Figure 207 shows a nucleotide sequence (SEQ ID NO:355) of a native sequence PR01758 (UNQ831) cDNA, wherein SEQ ID NO:355 is aclone designated herein as "DNA76399-1700". The start and stop codons are shown in bold and underlined font.
Figure 208 shows the amino acid sequence (SEQ ID NO:356) derived from the coding sequence of SEQ
ID NO-.355 shown in Figure 207.
310
Figure 209 shows a nucleotide sequence (SEQ ID NO:357) of a native sequence PR01575 (TJNQ781) cDNA, wherein SEQ ID NO:357 is a clone designated herein as "DNA76401-1683". The start and stop codons are shown in bold and underlined font.
Figure 210 shows the amino acid sequence (SEQ ID NO:358) derived from the coding sequence of SEQ ID NO:357 shown in Figure 209.
Figure 211 shows a nucleotide sequence (SEQ ID NO:363) of a native sequence PR01787 (UNQ849)
cDNA, wherein SEQ ID NO:363 is a clone designated herein as "DNA76510-2504". The start and stop codons are shown in bold and underlined font.
Figure 212 shows the amino acid sequence (SEQ ID NO:364) derived from the coding sequence of SEQ ID NO:363 shown in Figure 211.
Figure 213 shows a nucleotide sequence (SEQ ID NO:365) of a native sequence PR01781 (UNQ843)
cDNA, wherein SEQ ID N0:365 is a clone designated herein as "DNA76522-2500". The start and stop codons are shown in bold and underlined font.
Figure 214 shows the amino acid sequence (SEQ ID NO:366) derived from the coding sequence of SEQ ID NO:365 shown in Figure 213.
Figure 215 shows a nucleotide sequence (SEQ ID NO:371) of a native sequence PR01556 (UNQ764)
cDNA, wherein SEQ ID N0:371 is a clone designated herein as "DNA76529-1666". The start and stop codons are shown in bold and underlined font.
Figure 216 shows the amino acid sequence (SEQ ID NO.-372) derived from the coding sequence of SEQ ID NO:371 shown in Figure 215.
Figure 217 shows a nucleotide sequence (SEQ ID NO:373) of a native sequence PR01759 (UNQ832)
cDNA, wherein SEQ IDNO:373 is a clone designated herein as "DNA76531-1701". The start and stop codons are shown in bold and underlined font.
Figure 218 shows the amino acid sequence (SEQ ID NO:374) derived from the coding sequence of SEQ ID NO:373 shown in Figure 217.
Figure 219 shows a nucleotide sequence (SEQ ID NO:375) of a native sequence PR01760 (UNQ833)
cDNA, wherein SEQ ID NO:375 is a clone designated herein as "DNA76532-1702". The start and stop codons are shown in bold and underlined font.
Figure 220 shows the amino acid sequence (SEQ ID NO:376) derived from the coding sequence of SEQ ID NO:375 shown in Figure 219.
Figure 221 shows a nucleotide sequence (SEQ ID NO:377) of a native sequence PR01561 (UNQ768)
cDNA, wherein SEQ ID NO:377 is a clone designated herein as "DNA76538-1670". The start and stop codons are shown in bold and underlined font.
Figure 222 shows the amino acid sequence (SEQ ID NO:378) derived from the coding sequence of SEQ ID NO:377 shown in Figure 221.
Figure 223 shows a nucleotide sequence (SEQ ID NO:382) of a native sequence PR01567 (UNQ773)
cDNA, wherein SEQ ID N0:382 is a clone designated herein as "DNA76541-1675". The start and stop codons are shown in bold and underlined font.
311
Figure 224 shows the amino acid sequence (SEQ ID NO:383) derived from the coding sequence of SEQ ID NO:382 shown in Figure 223.
Figure 225 shows a nucleotide sequence (SEQ ID NO:384) of a native sequence PRO 1693 (UNQ803) cDNA, wherein SEQ ID NO:384 is a clone designated herein as "DNA77301-1693". The start and stop codons are shown in bold and underlined font.
Figure 226 shows the amino acid sequence (SEQ ID NO:385) derived from the coding sequence of SEQ
ID NO:384 shown in Figure 225.
Figure 227 shows a nucleotide sequence (SEQ ID NO:389) of a native sequence PR01784 (UNQ846) cDNA, wherein SEQ ID NO:389 is a clone designated herein as "DNA77303-2502". The start and stop codons are shown in bold and underlined font.
Figure 228 shows the amino acid sequence (SEQ ID N0:390) derived from the coding sequence of SEQ
ID NO:389 shown in Figure 227.
Figure 229 shows a nucleotide sequence (SEQ ID NO:394) of a native sequence PRO 1605 (UNQ786) cDNA, wherein SEQ ID NO:394 is a clone designated herein as "DNA77648-1688". The start and stop codons are shown in bold and underlined font.
Figure 230 shows the amino acid sequence (SEQ ID NO:395) derived from the coding sequence of SEQ
ID NO:394 shown in Figure 229.
Figure 231 shows a nucleotide sequence (SEQ ID NO:396) of a native sequence PR01788 (UNQ850) cDNA, wherein SEQ ID NO:396 is a clone designated herein as "DNA77652-2505". The start and stop codons are shown in bold and underlined font.
Figure 232 shows the amino acid sequence (SEQ ID NO:397) derived from the coding sequence of SEQ
ID NO:396 shown in Figure 231.
Figure 233 shows a nucleotide sequence (SEQ ID N0:401) of a native sequence PR01801 (UNQ852) cDNA, wherein SEQ ID N0:401 is a clone designated herein as "DNA83500-2506". The start and stop codons are shown in bold and underlined font.
Figure 234 shows the amino acid sequence (SEQ ID N0:402) derived from the coding sequence of SEQ
ID N0:401 shown in Figure 233.
Figure 235 shows a nucleotide sequence (SEQ ID N0:405) of a native sequence UCP4 cDNA, wherein SEQ ID N0:405 is a clone designated herein as "DNA77568-1626". The start and stop codons are shown in bold and underlined font.
Figure 236 shows the amino acid sequence (SEQ ID NO :406) derived from the coding sequence of SEQ
ID N0:405 shown in Figure 235.
Figure 237 shows a nucleotide sequence (SEQ ID N0:409) of a native sequence PR0193 cDNA, wherein SEQ ID N0:409 is a clone designated herein as "DNA23322-1393". The start and stop codons are shown in bold and underlined font.
Figure 238 shows the amino acid sequence (SEQ ID N0:410) derived from the coding sequence of SEQ
ID N0:409 shown in Figure 237.
312
Figure 239 shows a nucleotide sequence (SEQ ID NO:414) of a native sequence PROl 130 cDNA, wherein SEQ ID NO:414 is a done designated herein as "DNA59814-1486". The start and stop codons are shown in bold and underlined font.
Figure 240 shows the amino acid sequence (SEQ ID NO:415) derived from the coding sequence of SEQ ID NO:414 shown in Figure 239.
Figure 241 shows a nucleotide sequence (SEQ ID NO:422) of a native sequence PR01335 cDNA, wherein SEQ ID NO:422 is a clone designated herein as "DNA62812-1594". The start and stop codons are shown in bold and underlined font.
Figure 242 shows the amino acid sequence (SEQ ID NO:423) derived from the coding sequence of SEQ ID NO:422 shown in Figure 241.
Figure 243 shows a nucleotide sequence (SEQ ID NO:428) of a native sequence PR01329 cDNA, wherein SEQ ID NO:428 is a clone designated herein as "DNA66660-1585". The start and stop codons are shown in bold and underlined font.
Figure 244 shows the amino acid sequence (SEQ ID NO:429) derived from the coding sequence of SEQ ID NO:428 shown in Figure 243.
Figure 245 shows a nucleotide sequence (SEQ ID NO:231) of a native sequence PR01550 cDNA, wherein SEQ ID NO:231 is a clone designated herein as "DNA76393-1664". The start and stop codons are shown in bold and underlined font.
Figure 246 shows the amino acid sequence (SEQ ID NO:232) derived from the coding sequence of SEQ ID NO:231 shown in Figure 245.
Figures 247A-D show hypothetical exemplifications for using the below described method to determine % amino acid sequence identity (Figures 247A-B) and % nucleic acid sequence identity (Figures 247C-D) using the ALIGN-2 sequence comparison computer program, wherein "PRO" represents the amino acid sequence of a hypothetical PEACH polypeptide of interest, "Comparison Protein" represents the amino acid sequence of a polypeptide against which the "PRO" polypeptide of interest is being compared, "PRO-DNA" represents a hypothetical PEACH-encoding nucleic acid sequence of interest, "Comparison DNA" represents the nucleotide sequence of a nucleic acid molecule against which the "PRO-DNA" nucleic acid molecule of interest is being compared, "X, "Y" and "Z" each represent different hypothetical amino acid residues and "N", "L" and "V" each represent different hypothetical nucleotides.
Figures 248A-Q provide the complete source code for the ALIGN-2 sequence comparison computer program. This source code may be routinely compiled for use on a UNIX operating system to provide the ALIGN-2 sequence comparison computer program.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS I. Definitions
The terms "PRO polypeptide" and "PRO" or "UCP"as used herein and when immediately followed by a numerical designation refer to various polypeptides, wherein the complete designation (i.e., PRO/number) refers to specific polypeptide sequences as described herein. The terms "PRO/number polypeptide" and
313 f OFRCE OF i\|.z
24 OCT 2003 received
"PRO/number" wherein the term "number" is provided as an actual numerical designation as used herein encompass native sequence polypeptides and polypeptide variants (which are further defined herein). The PRO polypeptides described herein may be isolated from a variety of sources, such as from human tissue types or from another source, or prepared by recombinant or synthetic methods.
A "native sequence PRO polypeptide" or "UCP" comprises a polypeptide having the same amino acid 5 sequence as the corresponding PRO polypeptide derived from nature. Such native sequence PRO polypeptides can be isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence PRO polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific PRO polypeptide (e.g., an extracellular domain sequence), naturally-occurring variant forms (e.g., alternatively spliced forms) and naturally-occurring allelic variants of the polypeptide. In various embodiments of the 10 invention, native sequence PRO polypeptides are mature or full-length native sequence polypeptides comprising the full-length amino acids sequences shown in the accompanying figures. Start and stop codons are shown in bold font and underlined in the figures. However, while the PRO polypeptide disclosed in the accompanying figures are shown to begin with methionine residues designated herein as amino acid position I in the figures, it is conceivable and possible that other methionine residues located either upstream or downstream from the IS amino acid position 1 in the figures may be employed as the starting amino acid residue for the PRO polypeptides.
The PRO polypeptide "extracellular domain" or "ECD" refers to a form of the PRO polypeptide which is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD will have less than 1 % of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of 20 such domains. It will be understood that any transmembrane domains identified for the PRO polypeptides of the present invention are identified pursuant to criteria routinely employed in the art for identifying that type of hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more than about 5 amino acids at either end of the domain as initially identified herein. Optionally, therefore, an extracellular domain of a PRO polypeptide may contain from about 5 or fewer amino acids on either side of the 25 transmembrane domain/extracellular domain boundary as identified in the Examples or specification and such polypeptides, with or without the associated signal peptide, and nucleic acid encoding them, are comtempiated by the present invention.
The approximate location of the "signal peptides" of the various PRO polypeptides disclosed herein are shown in the accompanying figures. It is noted, however, that the C-terminal boundary of a signal peptide may 30 vary, but most likely by no more than about 5 amino acids on either side of the signal peptide C-terminal boundary as initially identified herein, wherein the C-terminal boundary of the signal peptide may be identified pursuant to criteria routinely employed in the art for identifying that type of amino acid sequence element (e.g., Nielsen et al., Prot. Eng. 10:1-6 (1997) and von Heinje et al., Nucl. Acids. Res. 14:4683-4690 (1986)). Moreover, it is also recognized that, in some cases, cleavage of a signal sequence from a secreted polypeptide 35 is not entirely uniform, resulting in more than one secreted species. These mature polypeptides, where the signal peptide is cleaved within no more than about 5 amino acids on either side of the C-terminal boundary of the signal peptide as identified herein, and the polynucleotides encoding them, are contemplated by the present
314
invention.
"PRO polypeptide variant" means an active PRO polypeptide as defined above or below having at least about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other fragment of 5 a full-length PRO polypeptide sequence as disclosed herein. Such PRO polypeptide variants include, for instance, PRO polypeptides wherein one or more amino acid residues are added, or deleted, at the N- or C-terminus of the full-length native amino acid sequence. Ordinarily, a PRO polypeptide variant will have at least about 80% amino acid sequence identity, preferably at least about 81 % amino acid sequence identity, more preferably at least about 82% amino acid sequence identity, more preferably at least about 83% amino acid 10 sequence identity, more preferably at least about 84% amino acid sequence identity, more preferably at least about 85 % amino acid sequence identity, more preferably at least about 86% amino acid sequence identity, more preferably at least about 87% amino acid sequence identity, more preferably at least about 88% amino acid sequence identity, more preferably at least about 89% amino acid sequence identity, more preferably at least about 90% amino acid sequence identity, more preferably at least about 91 % amino acid sequence identity, more IS preferably at least about 92% amino acid sequence identity, more preferably at least about 93% amino acid sequence identity, more preferably at least about 94% amino acid sequence identity, more preferably at least about 95% amino acid sequence identity, more preferably at least about 96% amino acid sequence identity, more preferably at least about 97% amino acid sequence identity, more preferably at least about 98% amino acid sequence identity and most preferably at least about 99% amino acid sequence identity with a full-length native 20 sequence PRO polypeptide sequence as disclosed herein, a PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, PRO variant polypeptides are at least about 10 amino acids in length, often at least about 20 amino acids in length, more often at least about 30 amino acids in length, more often at least about 40 amino acids in 25 length, more often at least about 50 amino acids in length, more often at least about 60 amino acids in length, more often at least about 70 amino acids in length, more often at least about 80 amino acids in length, more often at least about 90 amino acids in length, more often at least about 100 amino acids in length, more often at least about 150 amino acids in length, more often at least about 200 amino acids in length, more often at least about 300 amino acids in length, or more.
"Percent (%) amino acid sequence identity" with respect to the PRO polypeptide sequences identified herein is defined as the percentage of amino acid residues in a candidate sequence that are identical with the amino acid residues in the specific PRO polypeptide sequence, after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. Alignment for purposes of determining percent amino acid 35 sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. Those skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed
315
PCTAJS99/20111
to achieve maximal alignment over the full length of the sequences being compared. For purposes herein, however, % amino acid sequence identity values are generated using the WU-BLAST-2 computer program (Altschul et al., Methods in Enzvmologv 266:460-480 (1996)). Most of the WU-BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the adjustable parameters, are set with the following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11, and scoring matrix 5 = BLOSUM62. For purposes herein, a % amino acid sequence identity value is determined by dividing (a) the number of matching identical amino acid residues between the amino acid sequence of the PRO polypeptide of interest having a sequence derived from the native PRO polypeptide and the comparison amino acid sequence of interest (i.e., the sequence against which the PRO polypeptide of interest is being compared which may be a PRO variant polypeptide) as determined by WU-BLAST-2 by (b) die total number of amino acid residues of 10 the PRO polypeptide of interest. For example, in the statement "a polypeptide comprising an the amino acid sequence A which has or having at least 80% amino acid sequence identity to the amino acid sequence B", the amino acid sequence A is the comparison amino acid sequence of interest and the amino acid sequence B is the amino acid sequence of the PRO polypeptide of interest.
Unless specifically stated otherwise, all % amino acid sequence identity values used herein are obtained 15 as described in the immediately preceding paragraph using the WU-BLAST-2 computer program. However, % amino acid sequence identity values may also be obtained as described below by using the sequence comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in Figures 248A-Q. The ALIGN-2 sequence comparison computer program was authored by Genentecb, Inc. and the source code shown in Figures 248A-Q has been filed with user documentation in the 20 U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. TXU510087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be compiled from the source code provided in Figures 248A-Q. The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary.
In situations where ALIGN-2 is employed for amino acid sequence comparisons, the % amino acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence identity to, with, or against a given amino acid sequence B) is calculated as follows:
100 times the fraction X/Y
where X is the number of amino acid residues scored as identical matches by the sequence alignment program ALIGN-2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid 35 sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B to A. As examples of % amino acid sequence identity calculations using this method, Figures 247A-B demonstrate how to calculate the % amino acid sequence identity of the amino acid sequence designated
316
"Comparison Protein" to the amino acid sequence designated "PRO".
Percent amino acid sequence identity may also be determined using the sequence comparison program NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be downloaded from http://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several search parameters, wherein all of those search parameters are set to default values including, for example, 5 unmask = yes, strand = all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass e-value — 0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62.
In situations where NCBI-BLAST2 is employed for amino acid sequence comparisons, the % amino acid sequence identity of a given amino acid sequence A to, with, or against a given amino acid sequence B 10 (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % amino acid sequence identity to, with, or against a given amino acid sequence B) is calculated as follows:
100 times the fraction X/Y
where X is the number of amino acid residues scored as identical matches by the sequence alignment program NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, the % amino acid sequence identity of A to B will not equal the % amino acid sequence identity of B to A.
"PRO variant polynucleotide" or "PRO variant nucleic acid sequence" means a nucleic acid molecule which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid sequence identity with a nucleotide acid sequence encoding a full-length native sequence PRO polypeptide sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal peptide, as 25 disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, a PRO variant polynucleotide will have at least about 80% nucleic acid sequence identity, more preferably at least about 81% nucleic acid sequence identity, more preferably at least about 82% nucleic acid sequence identity, more preferably at least about 83% nucleic acid sequence identity, more preferably at least about 84% nucleic acid sequence identity, more preferably at least about 85% nucleic acid sequence identity, 30 more preferably at least about 86% nucleic acid sequence identity, more preferably at least about 87% nucleic acid sequence identity, more preferably at least about 88% nucleic acid sequence identity, more preferably at least about 89% nucleic acid sequence identity, more preferably at least about 90% nucleic acid sequence identity, more preferably at least about 91% nucleic acid sequence identity, more preferably at least about 92% nucleic acid sequence identity, more preferably at least about 93% nucleic acid sequence identity, more 35 preferably at least about 94% nucleic acid sequence identity, more preferably at least about 95% nucleic acid sequence identity, more preferably at least about 96% nucleic acid sequence identity, more preferably at least about 97% nucleic acid sequence identity, more preferably at least about 98% nucleic acid sequence identity and
317
yet more preferably at least about 99% nucleic acid sequence identity with the nucleic acid sequence encoding a full-length native sequence PRO polypeptide sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide as disclosed herein, an extracellular domain of a PRO polypeptide, with or without the signal sequence, as disclosed herein or any other fragment of a full-length PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide sequence.
Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in length, often at least about
60 nucleotides in length, more often at least about 90 nucleotides in length, more often at least about 120 nucleotides in length, more often at least about 150 nucleotides in length, more often at least about 180 nucleotides in length, more often at least about 210 nucleotides in length, more often at least about 240 nucleotides in length, more often at least about 270 nucleotides in length, more often at least about 300 10 nucleotides in length, more often at least about 450 nucleotides in length, more often at least about 600 nucleotides in length, more often at least about 900 nucleotides in length, or more.
"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences identified herein is defined as the percentage of nucleotides in a candidate sequence that are identical with the nucleotides in the PRO nucleic acid sequence of interest, after aligning the sequences and introducing gaps, if 15 necessary, to achieve the maximum percent sequence identity. Alignment for purposes of determining percent nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) software. For purposes herein, however, % nucleic acid sequence identity values are generated using the WU-BLAST-2 computer program (Altschul et al., Methods in Enzvmology 266:460-480 (1996)). Most of the WU-20 BLAST-2 search parameters are set to the default values. Those not set to default values, i.e., the adjustable parameters, are set with the following values: overlap span = 1, overlap fraction = 0.125, word threshold (T) = 11, and scoring matrix = BLOSUM62. For purposes herein, a % nucleic acid sequence identity value is determined by dividing (a) the number of matching identical nucleotides between the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecule of interest having a sequence derived from the native 25 sequence PRO polypeptide-encoding nucleic acid and the comparison nucleic acid molecule of interest (i.e., the sequence against which the PRO polypeptide-encoding nucleic acid molecule of interest is being compared which may be a variant PRO polynucleotide) as determined by WU-BLAST-2 by (b) the total number of nucleotides of the PRO polypeptide-encoding nucleic acid molecule of interest. For example, in the statement "an isolated nucleic acid molecuie comprising a nucleic acid sequence A which has or having at least 80% nucleic acid 30 sequence identity to the nucleic acid sequence B", the nucleic acid sequence A is the comparison nucleic acid molecule of interest and the nucleic acid sequence B is the nucleic acid sequence of the PRO polypeptide-encoding nucleic acid molecuie of interest.
Unless specifically stated otherwise, all % nucleic acid sequence identity values used herein are obtained as described in the immediately preceding paragraph using the WU-BLAST-2 computer program. However, 35 % nucleic acid sequence identity values may also be obtained as described below by using the sequence comparison computer program ALIGN-2, wherein the complete source code for the ALIGN-2 program is provided in Figures 248A-Q. The ALIGN-2 sequence comparison computer program was authored by
318
Genentech, Inc. and the source code shown in Figures 248A-Q has been filed with user documentation in the U.S. Copyright Office, Washington D.C., 20559, where it is registered under U.S. Copyright Registration No. TXU510087. The ALIGN-2 program is publicly available through Genentech, Inc., South San Francisco, California or may be compiled from the source code provided in Figures 248A-Q. The ALIGN-2 program should be compiled for use on a UNIX operating system, preferably digital UNIX V4.0D. All sequence comparison parameters are set by the ALIGN-2 program and do not vary.
In situations where ALIGN-2 is employed for nucleic acid sequence comparisons, the % nucleic acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows:
100 times the fraction W/Z
where W is the number of nucleotides scored as identical matches by the sequence alignment program ALIGN-2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be 15 appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to
C. As examples of % nucleic acid sequence identity calculations, Figures 247C-D demonstrate how to calculate the % nucleic acid sequence identity of the nucleic acid sequence designated "Comparison DNA" to the nucleic acid sequence designated "PRO-DNA".
Percent nucleic acid sequence identity may also be determined using the sequence comparison program
NCBI-BLAST2 (Altschul et al., Nucleic Acids Res. 25:3389-3402 (1997)). The NCBI-BLAST2 sequence comparison program may be downloaded from http://www.ncbi.nlm.nih.gov. NCBI-BLAST2 uses several search parameters, wherein all of those search parameters are set to default values including, for example, unmask = yes, strand = all, expected occurrences = 10, minimum low complexity length = 15/5, multi-pass 25 e-value = 0.01, constant for multi-pass = 25, dropoff for final gapped alignment = 25 and scoring matrix = BLOSUM62.
In situations where NCBI-BLAST2 is employed for sequence comparisons, the % nucleic acid sequence identity of a given nucleic acid sequence C to, with, or against a given nucleic acid sequence D (which can alternatively be phrased as a given nucleic acid sequence C that has or comprises a certain % nucleic acid 30 sequence identity to, with, or against a given nucleic acid sequence D) is calculated as follows:
100 times the fraction W/Z
where W is the number of nucleotides scored as identical matches by the sequence alignment program NCBI-35 BLAST2 in that program's alignment of C and D, and where Z is the total number of nucleotides in D. It will be appreciated that where the length of nucleic acid sequence C is not equal to the length of nucleic acid sequence
D, the % nucleic acid sequence identity of C to D will not equal the % nucleic acid sequence identity of D to
319
C.
In other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an active PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO variant polypeptides may be those that are encoded by a PRO variant polynucleotide.
Hie term "positives", in the context of sequence comparison performed as described above, includes residues in the sequences compared that are not identical but have similar properties (e.g. as a result of conservative substitutions, see Table 1 below). For purposes herein, the % value of positives is determined by dividing (a) the number of amino acid residues scoring a positive value between the PRO polypeptide amino acid sequence of interest having a sequence derived from the native PRO polypeptide sequence and the comparison 10 amino acid sequence of interest (i.e., the amino acid sequence against which the PRO polypeptide sequence is being compared) as determined in the BLOSUM62 matrix of WU-BLAST-2 by (b) the total number of amino acid residues of the PRO polypeptide of interest.
Unless specifically stated otherwise, the % value of positives is calculated as described in the immediately preceding paragraph. However, in the context of the amino acid sequence identity comparisons IS performed as described for ALIGN-2 and NCBI-BLAST2 above, includes amino acid residues in the sequences compared that are not only identical, but also those that have similar properties. Amino acid residues that score a positive value to an amino acid residue of interest are those that are either identical to the amino acid residue of interest or are a preferred substitution (as defined in Table 1 below) of the amino acid residue of interest.
For amino acid sequence comparisons using ALIGN-2 or NCBI-BLAST2, the % value of positives of 20 a given amino acid sequence A to, with, or against a given amino acid sequence B (which can alternatively be phrased as a given amino acid sequence A that has or comprises a certain % positives to, with, or against a given amino acid sequence B) is calculated as follows:
100 times the fraction X/Y
where X is the number of amino acid residues scoring a positive value as defined above by the sequence alignment program ALIGN-2 or NCBI-BLAST2 in that program's alignment of A and B, and where Y is the total number of amino acid residues in B. It will be appreciated that where the length of amino acid sequence A is not equal to the length of amino acid sequence B, the % positives of A to B will not equal the % positives 30 of B to A.
"Isolated," when used to describe the various polypeptides disclosed herein, means polypeptide that has been identified and separated and/or recovered from a component of its natural environment. Contaminant components of its natural environment are materials that would typically interfere with diagnostic or therapeutic uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous 35 solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, or (2) to homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, preferably,
320
silver stain. Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one component of the PRO polypeptide natural environment will not be present. Ordinarily, however, isolated polypeptide will be prepared by at least one purification step.
An "isolated" PRO polypeptide-encoding nucleic acid is a nucleic acid molecule that is identified and separated from at least one contaminant nucleic acid molecuie with which it is ordinarily associated in the natural 5 source of the PRO polypeptide nucleic acid. An isolated PRO polypeptide nucleic acid molecule is other than in the form or setting in which it is found in nature. Isolated PRO polypeptide nucleic acid molecules therefore are distinguished from the specific PRO polypeptide nucleic acid molecule as it exists in natural cells. However, an isolated PRO polypeptide nucleic acid molecule includes PRO polypeptide nucleic acid molecules contained in cells that ordinarily express the PRO polypeptide where, for example, the nucleic acid molecule is in a 10 chromosomal location different from that of natural cells.
The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic ceils are known to utilize promoters, polyadenylation signals, and enhancers.
Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 20 "operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory ieader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice.
The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-PRO 25 monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody compositions with poiyepitopic specificity, single chain anti-PRO antibodies, and fragments of anti-PRO antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical except for possible naturally-occurring mutations that may be present in minor amounts. 30 "Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and generally is an empirical calculation dependent upon probe length, washing temperature, and salt concentration. In general, longer probes require higher temperatures for proper annealing, while shorter probes need lower temperatures. Hybridization generally depends on the ability of denatured DNA to reanneal when complementary strands are present in an environment below their melting temperature. The higher the degree 35 of desired homology between the probe and hybridizable sequence, the higher the relative temperature which can be used. As a result, it follows that higher relative temperatures would tend to make the reaction conditions more stringent, while lower temperatures less so. For additional details and explanation of stringency of
321
hybridization reactions, see Ausubel et al., Current Protocols in Molecular Biology. Wiley Interscience Publishers, (1995).
"Stringent conditions" or "high stringency conditions", as defined herein, may be identified by those that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50°C; (2) employ during hybridization a S denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1 % Ficoll/0.1 % polyvinylpyrrolidone/50mMsodium phosphate buffer at pH 6.5 with 750 mM sodium chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 pg/ml), 0.1 % SDS, and 10% dextran sulfate at 42°C, with washes at 42°C 10 in0.2xSSC (sodium chloride/sodium citrate) and 50% formamide at 55 °C, followedby a high-stringency wash consisting of 0.1 x SSC containing EDTA at 55 °C.
"Moderately stringent conditions" may be identified as described by Sambrook et al., Molecular Cloning: A Laboratory Manual. New York: Cold Spring Harbor Press, 1989, and include the use of washing solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 15 described above. An example of moderately stringent conditions is overnight incubation at 37°C in a solution comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon sperm DNA, followed by washing the filters in 1 x SSC at about 37-50°C. The skilled artisan will recognize how to adjust the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 20 The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against which an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does not substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid 25 residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino acid residues).
As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin constant domains. Structurally, the immunoadhesins comprise a fusion of an amino acid sequence with the 30 desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is "heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a ligand. The immunoglobulin constant domain sequence in the immunoadhesin may be obtained from any immunoglobulin, such as IgG-1, IgG-2, IgG-3, or IgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD 35 or IgM.
"Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain a biological and/or an immunological activity of native or naturally-occurring PRO, wherein "biological" activity
322
refers to a biological function (either inhibitory or stimulatory) caused by a native or naturally-occurring PRO other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native or naturally-occurring PRO and an "immunological" activity refers to the ability to induce the production of an antibody against an antigenic epitope possessed by a native or naturally-occurring PRO.
The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 5 blocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a similar manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological activity of a native PRO polypeptide disclosed herein. Suitable agonist ot antagonist molecules specifically include agonist or antagonist antibodies or antibody fragments, fragments or amino acid sequence variants of native PRO polypeptides, peptides, antisense oligonucleotides, small organic molecules, etc. Methods for 10 identifying agonists or antagonists of a PRO polypeptide may comprise contacting a PRO polypeptide with a candidate agonist or antagonist molecule and measuring a detectable change in one or more biological activities normally associated with the PRO polypeptide.
"Treatment" refers to both therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of 15 treatment include those already with the disorder as well as those prone to have the disorder or those in whom the disorder is to be prevented.
"Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. "Intermittent" administration is treatment that is not consecutively done without interruption, but rather is cyclic 20 in nature.
"Mammal" for purposes of treatment refers to any animal classified as a mammal, including humans, domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, cattle, horses, sheep, pigs, goats, rabbits, etc. Preferably, the mammal is human.
Administration "in combination with" one or more further therapeutic agents includes simultaneous 25 (concurrent) and consecutive administration in any order.
"Carriers" as used herein include pharmaceuticaliy acceptable carriers, excipients, or stabilizers which are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 30 ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt-forming counterfoils such as sodium; and/or nonionic surfactants such as TWEEN™, polyethylene glycol (PEG), 35 and PLURONICS™.
"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or variable region of the intact antibody. Examples of antibody fragments include Fab, Fab", F(ab') 2, and Fv
323
fragments; diabodies; linear antibodies (Zapata et al., Protein Eng. 8(10): 1057-1062 [1995]); single-chain antibody molecules; and multispecific antibodies formed from antibody fragments.
Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" fragments, each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the ability to crystallize readily. Pepsin treatment yields an F(ab')2 fragment that has two antigen-combining sites 5 and is still capable of cross-linking antigen.
"Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -binding site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent association. It is in this configuration that the three CDRs of each variable domain interact to define an antigen-binding site on the surface of the VH-VL dimer. Collectively, the six CDRs confer antigen-binding specificity 10 to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding site.
The Fab fragment also contains the constant domain of the light chain and the first constant domain (CHI) of the heavy chain. Fab fragments differ from Fab* fragments by the addition of a few residues at the 15 carboxy terminus of the heavy chain CHI domain including one or more cysteines from die antibody hinge region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant domains bear a free thiol group. F(ab')2 antibody fragments originally were produced as pairs of Fab' fragments which have hinge cysteines between them. Other chemical couplings of antibody fragments are also known.
The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 20 of two clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant domains.
Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, and IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgGl, IgG2, IgG3, IgG4, IgA, 25 and IgA2.
"Single-chain Fv" or "sFv" antibody fragments comprise the VH and VL domains of antibody, wherein these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a polypeptide linker between the VH and VL domains which enables the sFv to form the desired structure for antigen binding. For a review of sFv, see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 113, 30 Rosenburg and Moore eds., Springer-Verlag, New York, pp. 269-315 (1994).
The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the same polypeptide chain (VH - VL). By using a linker that is too short to allow pairing between the two domains on the same chain, the domains are forced to pair with the complementary domains of another chain and create 35 two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/11161; and Holiinger et al., Proc. Nad. Acad. Sci. USA. 90:6444-6448 (1993).
An "isolated" antibody is one which has been identified and separated and/or recovered from a
324
component of its natural environment. Contaminant components of its natural environment are materials which would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) to greater than 95 % by weight of antibody as determined by the Lowry method, and most preferably more than 99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid 5 sequence by use of a spuming cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody in situ within recombinant cells since at least one component of the antibody's natural environment will not be present. Ordinarily, however, isolated antibody will be prepared by at least one purification step.
The word "label" when used herein refers to a detectable compound or composition which is conjugated 10 directly or indirectly to the antibody so as to generate a "labeled" antibody. The label may be detectable by itself (e.g. radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical alteration of a substrate compound or composition which is detectable.
By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can adhere. Examples of solid phases encompassed herein include those formed partially or entirely of glass (e.g., 15 controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay plate; in others it is a purification column (e.g., an affinity chromatography column). This term also includes a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 4,275,149.
A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 20 which is useful for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a mammal. The components of the liposome are commonly arranged in a bilayer formation, similar to the lipid arrangement of biological membranes.
A "small molecule" is defined herein to have a molecular weight below about 500 Daltons.
II. Compositions and Methods of the Invention
The present invention provides newly identified and isolated nucleotidesequencesencoding polypeptides referred to in the present application as PRO polypeptides. In particular, cDNAs encoding various PRO polypeptides have been identified and isolated, as disclosed in further detail in the Examples below. It is noted that proteins produced in separate expression rounds may be given different PRO numbers but the UNQ number 30 is unique for any given DNA and the encoded protein, and will not be changed. However, for sake of simplicity, in the present specification the protein encoded by the full length native nucleic acid molecules disclosed herein as well as all further native homologues and variants included in the foregoing definition of PRO, will be referred to as "PRO/number", regardless of their origin or mode of preparation.
As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 35 actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of the deposited clone using routine methods in the art. The predicted amino acid sequence can be determined from the nucleotide sequence using routine skill. For the PRO polypeptides and encoding nucleic acids described
325
herein, Applicants have identified what is believed to be the reading frame best identifiable with the sequence information available at the time.
A. Full-Leneth PRO Polypeptides 1. PRQ1560
Using the WU-BLAST2 sequence alignment computer program, the full-length native sequence
PR01560 (shown in Figure 2 and SEQ ID NO:4) has certain amino acid sequence identity with Tspan-6, identified after the discovery of the present invention herein. Accordingly, it is presently believed that PR01560 disclosed in the present application is a newly identified member of the tetraspan family.
2. PRQ444
The DNA26846-I397 clone was isolated from a human fetal lung library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA26846-1397 clone encodes a secreted factor. As far as is known, the DNA26846-1397 sequence encodes a novel factor designated herein as PR0444. Using the WU-BLAST2 sequence alignment computer program, no significant sequence identity 15 with known proteins was revealed.
3. PRQ1018
The DNA56107-1415 clone was isolated from a human ovary tumor tissue library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. As far as is known, the 20 DNA56107-1415 sequence encodes a novel factor designated herein as PR01018; using the WU-BLAST2 sequence alignment computer program, no significant sequence identities to any known proteins were revealed.
4. PRQ1773
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the 25 full-length native sequence PRO1773 (shown in Figure 8 and SEQ ID NO: 10) has certain amino acid sequence identity with a portion of the retinol dehydrogenase type II protein of rattus norvegicus (ROH2_RAT). Accordingly, it is presently believed that PR01773 disclosed in the present application is a newly identified member of the retinol dehydrogenase protein family and may possess activity typical of that protein family.
5. PRQ1477
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PR01477 (shown in Figure 10 and SEQ ID NO: 12) has certain amino acid sequence identity with the mannosyl-oiigosaccharide 1,2-alpha-mannosidase protein (A54408). Accordingly, it is presently believed that PRO1477 disclosed in the present application is a newly identified member of the mannosidase 35 protein family and may possess activity typical of the mannosidase protein family.
326
6. PRQ1478
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01478 (shown in Figure 12 and SEQ ID NO: 17) has certain amino acid sequence identity with galactosyltransferases. Accordingly, it is presently believed that PR01478 disclosed in the present application is a newly identified member of the galactosyltransferase family and may possess at least one shared mechanism 5 with other members of this family.
7. PRQ831
The DNA56862-1343 clone was isolated from a human uterus library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA56862-1343 clone does encode a 10 secreted factor. As far as is known, the DNAS6862-1343 sequence encodes a novel factor designated herein as PR0831; using the WU-BLAST2 sequence alignment computer program, no sequence identities to any known proteins were revealed.
8- PRQ1113
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PROl 113 (shown in Figure 16 and SEQ ID N0:24) has certain amino acid sequence identity with LIG-1 and SLIT. Accordingly, it is presently believed that PROH13 disclosed in the present application is a newly identified member of the leucine rich repeat family and may possess protein-protein interaction activity as is typical of this family.
9. PROl 194
As far as is known, the DNA57841-1522 sequence encodes a novel factor designated herein as PROl 194; using WU-BLAST2 sequence alignment computer programs, limited sequence identities to known proteins were revealed.
. PROlllO
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PROlllO (shown in Figure 20 and SEQ ID NO:31) has certain amino acid sequence identity with the murine myeloid upregulated protein. Accordingly, it is presently believed that PROl 110 disclosed in 30 the present application is a newly identified member of the myeloid upregulated protein family and may possess activity typical of that family.
U. PRQ1378
The DNA58730-1607 clone was isolated from a bone marrow library using a trapping technique which 35 selects for nucleotide sequences encoding secreted proteins. Thus, the DN A58730-1607 clone encodes a secreted factor. As far as is known, the DNA58730-1607 sequence encodes a novel factor designated herein as PR01378. WU-BLAST2 sequence alignment computer programs revealed some sequence identities between
327
the amino acid sequence of PR01378 with known proteins. However, they were determined to not be significant.
12. PRO 1481
As far as is known, the DNA58732-1650 sequence encodes a novel factor designated herein as 5 PR01481. Using WU-BLAST2 sequence alignment computer programs, only some sequence identities to known proteins were revealed.
13. PROl 189
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 10 sequence PROl 189 (shown in Figure 26 and SEQ ID NO:43) has certain amino acid sequence identity with die amino acid sequence of an E2S protein designated "MUSE25A_1" in the Dayhoff database. Accordingly, it is presently believed that PROl 189 disclosed in the present application is a newly identified member of the E2S protein family and may possess activity or properties typical of that family.
14. PRQ1415
The DNA58852-1637 clone was isolated from a diseased human prostate tissue library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. As far as is known, the DNA58852-1637 sequence encodes a novel factor designated herein as PR01415; using the WU-BLAST2 sequence alignment computer program, no significant sequence identities to any known proteins were revealed.
. PRQ1411
As far as is known, the DNA59212-1627 sequence encodes a novel factor designated herein as PRO 1411. However, using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins were revealed.
16. PRQ1295
As far as is known, the DNA59218-1559 sequence encodes a novel factor designated herein as PR01295. Using WU-BLAST2sequence alignment computer programs, only some sequence identities to known proteins were revealed.
17. PRQ13S9
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR013S9 (shown in Figure 34 and SEQ ID NO:56) has certain amino acid sequence identity with N-acetylgalactosamine alpha-2, 6-sialyltransferase. Accordingly, it is presently believed that PR01359 disclosed 35 in the present application is a newly identified member of the sialyltransferase family and may possess transferase activity typical of this family.
328
18. PROl 190
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PROl 190 (shown in Figure 36 and SEQ ID NO:58) has certain amino acid sequence identity with both rat and human CDO. Accordingly, it is presently believed that PROl 190 disclosed in the present application is a newly identified member of the CDO family and may possess cell adhesion activity typical of the CDO 5 family.
19. PRQ1772
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PRO1772 (shown in Figure 38 and SEQ ID NO:63) has certain amino acid sequence 10 identity with a human microsomal dipeptidase protein (P R13857). Accordingly, it is presently believed that PRO1772 disclosed in the present application is a newly identified member of the peptidase protein family and may possess activity typical of that protein family.
. PRQ1248
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PR01248 (shown in Figure 40 and SEQ ID NO:68) has amino acid sequence identity with the PUT-2 protein (AF026198_5). Accordingly, it is presently believed that PR01248 disclosed in the present application is a newly PUT-2 homolog and may possess activity typical of the PUT-2 protein.
21. PRQ1316
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01316 (shown in Figure 42 and SEQ ID N0:70) has certain amino acid sequence identity with murine dickkopf. Accordingly, it is presently believed that PR01316 disclosed in the present application is a newly identified member of the dickkopf family and may possess the ability to cause head induction from the 25 Spemann organizer and/or Wnt antagonism.
22. PROl 197
As far as is known, the DNA60611-1524 sequence encodes a novel factor designated herein as PROl 197. Using WU-BLAST2 sequence alignment computer programs, only some sequence identities to known 30 proteins were revealed as further described in the examples.
23. PRQ1293
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PR01293 (shown in Figure 46 and SEQ ID NO:77) has certain amino acid sequence identity 35 with the human Ig heavy chain V region protein (HSVCD541). Accordingly, it is presently believed that PRO 1293 disclosed in the present application is a newly identified member of the Ig superfamily of proteins and fragments thereof and may possess activity typical of that family.
329
24. PRO 1380
The DNA60740-1615 clone was isolated from a human retina library. As far as is known, the DNA60740-1615 sequence encodes a novel multi-span transmembrane polypeptide designated herein as PR01380. Using WU-BLAST2 sequence alignment computer programs, some sequence identity with known proteins were revealed.
. PRQ1265
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO1265 (shown in Figure 50 and SEQ ID NO:84) has certain amino acid sequence identity with the Figl polypeptide designated "MMU70429 1" in the Dayhoff database (version 35.45 SwissProt 35). 10 Accordingly, it is presently believed that PR01265 disclosed in the present application is a newly identified member of the FIG1 family and may possess activity typical of the FIG1 polypeptide, including activation by interleukin-4.
26. PRQ1250
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PR01250 (shown in Figure 52 and SEQ ID NO:86) has certain amino acid sequence identity with the human long chain fatty acid CoA ligase protein (LCFB HUMAN). Accordingly, it is presently believed that PR01250 disclosed in the present application is a newly identified long chain fatty acid CoA ligase homolog that may have activity typical of long chain fatty acis CoA ligase.
27. PRQ1475
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PR01475 (shown in Figure 54 and SEQ ID NO:88) has certain amino acid sequence identity with a portion of the mouse alpha-3-D-mannoside beat-l,2-N-acetylglucosaminyltransferase I protein. 25 Accordingly, it is presently believed that PRO1475 disclosed in the present application is a newly identified member of the N-acetylglucosaminyltransferase protein family and may possess activity typical of that protein family.
28. PRQ1377
As described herein, WU-BLAST2 sequence alignment computer programs were used to determine the sequence identity of the PR01377 amino acid sequence with the amino acid sequences of known proteins. While some sequence identities were revealed, they were determined to not be significant. Accordingly, as far as is known, the DNA61608 sequence encodes a novel transmembrane protein designated herein as PROI377.
29. PRQ1326
The DNA62808-1582 clone is believed to encode a secreted factor. As far as is known, the DNA62808-1582 sequence encodes a novel factor designated herein as PR01326; using WU-BLAST2 sequence alignment
330
computer programs, sequence identities to known proteins were revealed but determined not to be significant.
. FRQ1249
The DNA62809-1531 clone was isolated from a human colon tumor tissue library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. As far as is known, the DNA62809-5 1531 sequence encodes a novel factor designated herein as PR01249; using the WU-BLAST2 sequence alignment computer program, no sequence identities to any known proteins were revealed.
31. PRQ1315
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 10 native sequence PRO 1315 (shown in Figure 62 and SEQ ID N0:104) has certain amino acid sequence identity with the class II cytokine receptor 4 protein of mus musculus (MMU53696 1). Accordingly, it is presendy believed that PR01315 disclosed in the present application is a newly identified member of the cytokine reeptor protein family and may possess activity typical of that family.
32. FRQ1599
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01599 (shown in Figure 64 and SEQ ID NO:111) has certain amino acid sequence identity with Dayhoff sequence "CFAD_PIG". Accordingly, it is presently believed that PR01599 disclosed in the present application is a newly identified member of the Granzyme M family and may possess activity or properties 20 typical of the Granzyme M family.
33. PRQ1430
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO1430 (shown in Figure 66 and SEQ ID NO: 116) has certain amino acid sequence identity prostate 25 specific reductase (designated "P_W03198" in the Dayhoff database). Accordingly, it is presently believed that PR01430 disclosed in the present application is a newly identified member of the reductase family and may possess activity typical of members of the reductase family.
34. PRQ1374
As far as is known, the DNA64849-1604 sequence encodes a novel factor designated herein as
PR01374; using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins such as the human alpha subunit of P4HA were revealed. Therefore, it is believed that PR01374 is related to P4HA and may share one or more mechanisms.
35. PRQ1311
The DNA64863-1573 clone was isolated from human aortic endothelial cells and is believed to encode a novel transmembrane polypeptide designated herein as PR01311. Using WU-BLAST2 sequence alignment
331
computer programs, some sequence identities with known proteins were revealed, but were determined to not be significant.
36, PRQ13S7
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 5 native sequence PR01357 (shown in Figure 72 and SEQ ID NO: 128) has certain amino acid sequence identity with the von Ebner minor salivary gland protein of mus musculus (MMU46068 1). Accordingly, it is presently believed that PR013S7 disclosed in the present application is a newly identified von Ebner minor salivary gland protein homolog.
37. FRQ1244
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01244 (shown in Figure 74 and SEQ ID NO: 130) has certain amino acid sequence identity with a known implantation-associated protein designated "AF008554 1" on the Dayhoff database (version 35.45 SwissProt 35). Accordingly, it is presently believed that PR01244 disclosed in the present application is a newly IS identified member of the implantation-associated protein family and may possess attachment activity typical of that protein family.
38. PROl 246
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 20 native sequence PR01246 (shown in Figure 76 and SEQ ID N0:132) has certain amino acid sequence identity with the murine bone-related sulphatase-like precursor protein (P_R51355). Accordingly, it is presently believed that PRO1246 disclosed in the present application is a newly identified bone-related sulphatase homolog and may possess activity typical of bone-related sulfatase.
39. PROl 356
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PR01356 (shown in Figure 78 and SEQ ID NO: 134) has certain amino acid sequence identity with the CPE-receptor protein of mus musculus (AB000713_1). Accordingly, it is presently believed that PRO 1356 disclosed in the present application is a newly identified member of the CPE receptor family and may 30 possess activity typical of that family.
40. PRQ1275
As far as is known, the DNA64888-1542 sequence encodes a novel factor designated herein as PR01275. Using WU-BLAST2 sequence alignment computer programs, some sequence identities to known 35 proteins were revealed.
332
41. PRQ1274
As far as is known, the DNA64889-1S41 sequence encodes a novel factor designated herein as PR01274. Using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins were revealed.
42. PRQ1412
The DNA64897-1628 clone is believed to be a secreted factor. As far as is known, the DNA64897-1628 sequence encodes a novel factor designated herein as PRO 1412; using WU-BLAST2 sequence alignment computer programs, sequence identities to known proteins were revealed but determined not to be significant.
43. PRQ1557
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01SS7 (shown in Figure 86; SEQ ID NO: 142) has certain amino acid sequence identity chordin protein designated AF034606 1 in the Dayhoff database. Accordingly, it is presently believed that PR01557 disclosed in the present application is a newly identified member of the chordin family and may possess activity 15 typical of the chordin family.
44. PRQ1286
The DNA64903-1553 clone identified using techniques which selects for nucleotide sequences encoding secreted proteins. As far as is known, the DNA64903 sequence encodes a novel secreted factor designated herein 20 as PRO1286. Using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins were revealed; however, it was determined that they were not significant.
45. PRQ1294
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 25 native sequence PRO1294 (shown in Figure 90 and SEQ ID NO: 146) has certain amino acid sequence identity with the neuronal olfactomedin-related ER localized protein of the rat (173636). Accordingly, it is presently believed that PROI294 disclosed in the present application is a newly identified olfactomedin homolog and may possess activity typical of that protein.
46. PRQ1347
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO 1347 (shown in Figure 92 and SEQ ID NO: 148) has certain amino acid sequence identity with butyrophilin. Moreover, there is a transmembrane domain approximately in the middle of the sequence as is typical of butyrophilins. Accordingly, it is presently believed that PR01347 disclosed in the present application 35 is a newly identified member of the butyrophilin family and may play a role in the budding and release of milk-fat glubules during lactation.
333
47. PRQ1305
The DNA64952-1568 clone was isolated from a human fetal kidney library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA54952-1568 clone does encode a secreted factor. As far as is known, the DNA64952-1568 sequence encodes a novel factor designated herein as PR01305; using the WU-BLAST2 sequence alignment computer program, no sequence identities to 5 any known proteins were revealed.
48. PRQ1273
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO1273 (shown in Figure 96 and SEQ ID NO: 158) has certain amino acid sequence identity with a 10 lipocalin precursor. Moreover, Figure 96 shows that PR01273 has a motif conserved in lipocalins. Accordingly, it is presently believed that PR01273 disclosed in the present application is a newly identified member of the lipocalin family and shares at least one mechanism with lipocalins.
49. PRQ1302
IS Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01302 (shown in Figure 98 and SEQ ID NO: 160) has certain amino acid sequence identity with CD33L1 and CD33L2. Accordingly, it is presently believed that PR01302 disclosed in the present application is a newly identified member of the siaioadhesin family and possesses characteristics typical of this family. Specifically, PR01302 may be involved in cancer, inflammation, hemopoisis, neuronal development and/or 20 immunity.
50. PRQ1283
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PRO1283 (shown in Figure 100 and SEQ ID NO: 162) has certain amino acid sequence identity 25 with the rat odorant binding protein homolog OBP-II precursor (A40464). Accordingly, it is presently believed that PRO1283 disclosed in the present application is a newly odorant binding protein and may possess activity typical of the odorant binding proteins.
51. PRQ1279
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PRO 1279 (shown in Figure 102 and SEQ ID NO: 170) has certain amino acid sequence identity with the mouse neuropsin protein (156559). Accordingly, it is presently believed that PRO1279 disclosed in the present application is a newly identified neuropsin homolog and may possess activity typical of the neuropsin protein.
52. PRO1304
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length
334
native sequence PR01304 (shown in Figure 104 and SEQ ID NO: 180) has certain amino acid sequence identity with the FK-506 binding protein of mus musculus (AF040252 1). Accordingly, it is presently believed that PR01304 disclosed in the present application is a newly identified member of the FK506 binding protein family and may possess activity typical of that family.
S3. FRQ1317
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01317 (shown in Figure 106 and SEQ ID NO: 189) has certain amino acid sequence identity with human CD97 protein. Accordingly, it is presently believed that PR01317 disclosed in the present application is a leukocyte antigen that may be involved in leukocyte activation.
54. PRQ1303
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01303 (shown in Figure 108 and SEQ ID NO: 194) has certain amino acid sequence identity with neuropsin. Accordingly, it is presently believed that PR01303 disclosed in the present application is a newly IS identified member of the serine protease family and may possess catabolic activity typical of this family.
55. PROl 306
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01306 (shown in Figure 110 and SEQ ID NO: 196) has certain amino acid sequence identity with 20 Dayhoff sequence no. AIF1HUMAN. Accordingly, it is presently believed that PR01306 disclosed in the present application is a newly identified member of the AIFl/daintain family and may possess activity and properties typical of AIFl/daintain.
56. PROl 336
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01336 (shown in Figure 112 and SEQ ID NO: 198) has certain amino acid sequence identity with slit. Accordingly, it is presently believed that PRO1336 disclosed in the present application is a newly identified member of the EGF-repeat family and may possess protein interaction mediation activity.
57. PRQ1278
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO1278 (shown in Figure 114 and SEQ ID N0:203) has certain amino acid sequence identity lysozyme c -1 precursor designated "LYC1ANAPL" in the Dayhoff database. Accordingly, it is presently believed that PR01278 disclosed in the present application is a newly identified member of the lysozyme family 35 and may possess hydrolytic and other activity typical of the lysozyme family.
335
58. PRQ1298
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO1298 (shown in Figure 116 and SEQ ID N0.210) has certain amino acid sequence identity with glycosyltransferase alg2. Accordingly, it is presently believed that PR01298 disclosed in the present application is a newly identified member of the glycosyltransferase family and may share at least one mechanism with 5 members of this family.
59. PRQ1301
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01301 (shown in Figure 118 and SEQ ID NO:212) has consistent amino acid sequence identity with 10 cytochrome P4S0 proteins. Accordingly, it is presently believed that PR01301 disclosed in the present application is a newly identified member of the cytochrome P450 family and may possess monooxygenase activity typical of the cytochrome P450 family.
60. PRQ1268
As far as is known, the DNA66519-1535 sequence encodes a novel transmembrane polypeptide factor designated herein as PR01268. Using WU-BLAST2 sequence alignment computer programs, sequence identity to a known protein was revealed, but determined to not be significant.
61. PRQ1269
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01269 (shown in Figure 122 and SEQ ID NO:216) has certain amino acid sequence identity a bovine granulocyte peptide A precursor, designated "P_W23722" on the Dayhoff database (version 35.45 SwissProt 35). Accordingly, it is presently believed that PRO1269 disclosed in the present application is a newly identified member of the granulocyte A peptide family and may possess microbial activity typical of that family 25 of peptides.
62. PRQ1327
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PROI327 (shown in Figure 124 and SEQ ID NO:218) has certain amino acid sequence identity 30 with the rat neurexophilin-1 protein (NPH1RAT). Accordingly, it is presently believed that PR01327 disclosed in the present application is a newly identified member of the neurexophilin protein family and may possess activity typical of that protein family.
63. PRQ1382
Using WU-BLAST2 sequence alignment computer programs, it has been found that a lull-length native sequence PR01382 (shown in Figure 126 and SEQ ID N0:220) has certain amino acid sequence identity with the amino acid sequence of a known cerebellin-like glycoprotein designated "CERLRAT" in the Dayhoff
336
database. Accordingly, it is presently believed that PR01382 disclosed in the present application is a newly identified member of the cerebellin family of neuropeptides and may possess activity and properties typical of cerebellin.
64. PRQ1328
The DNA66658-1584 clone was isolated from a human diseased prostate tissue library using a trapping technique which selects for nucleotide sequences encoding proteins. As far as is known, the DNA66658-1584 sequence encodes a novel factor designated herein as PR01328; using the WU-BLAST2 sequence alignment computer program, no significant sequence identities to any known proteins were revealed.
65. PROl 325
The DNA66659-1593 clone was isolated from a human thymus tissue library using a trapping technique which selects for nucleotide sequences encoding proteins. As far as is known, the DNA66659-1593 sequence encodes a novel factor designated herein as PR01325; using the WU-BLAST2 sequence alignment computer program, no sequence identities to any known proteins were revealed.
66. PRQ1340
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01340 (shown in Figure 132 and SEQ ID NO:229) has certain amino acid sequence identity with Dayhoff sequence no. 146536. Accordingly, it is presently believed that PROl340 disclosed in the present 20 application is a newly identified member of the cadherin family and may possess activity and properties typical of the cadherin family.
67. PRQ1339
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 25 sequence PR01339 (shown in Figure 134 and SEQ ID NO:234) has certain amino acid sequence identity with human pancreatic carboxypeptidase and carboxypeptidase al. Accordingly, it is presently believed that PRO1339 disclosed in the present application is a newly identified member of the carboxypeptidase family and possesses caboxypeptidase activity.
68. PRQ1337
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01337 (shown in Figure 136 and SEQ ID NO:236) has certain amino acid sequence identity with a human TBG identified as "THBG_HUMAN" in the Dayhoff database. Accordingly, it is presently believed that PR01337 disclosed in the present application is a newly identified member of the TBG family and may 35 possess thyroid hormone transport capability and have other
337
69. PROl 342
The DNA66674-1599 clone was isolated from human esophageal tissue. As described in further detail below, using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins were revealed. The DNA66674-1599 clone appears to encode for a novel transmembrane polypeptide.
70. PRQ1343
The DNA6667S-1587 clone was isolated from a human smooth muscle cell tissue library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA66675-1587 clone does encode a secreted factor. As far as is known, the DNA66675-1587 sequence encodes a novel factor designated herein as PR01343; using the WU-BLAST2 sequence alignment computer program, no 10 significant sequence identities to any known proteins were revealed.
71. PRQ1480
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01480 (shown in Figure 142 and SEQ ID NO:253) has certain amino acid sequence identity with 15 Dayhoff sequence no. 148746. Accordingly, it is presently believed that PR01480 disclosed in the present application is a newly identified member of the Semaphorin C family
72. PRQ1487
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 20 sequence PR01487 (Figure 144; SEQ ID N0:260) has certain amino acid sequence identity with a radical fringe protein designated GGU82088 1 on the Dayhoff database. Accordingly, it is presently believed that PR01487 disclosed in the present application is a newly identified member of the hinge family and may possess activity typical of the fringe family.
73. PRQ1418
As far as is known, the DNA68864-1629 sequence encodes a novel factor designated herein as PR01418. Using WU-BLAST2 sequence alignment computer programs, sequence identities to known proteins were minimal.
74. PRQ1472
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01472 (shown in Figure 148 and SEQ ID NO:267) has certain amino acid sequence identity with butyrophilin. Accordingly, it is presently believed that PR01472 disclosed in the present application is a newly identified member of the butyrophilin family and may possess involvement in lactation.
75. PRQ1461
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native
338
sequence PR01461 (shown in Figure ISO and SEQ ID NO:269) has certain amino acid sequence identity the trypsin-like enzyme identified as "P_R89435" on the Dayhoff database. Accordingly, it is presently believed that PR01461 disclosed in the present application is a newly identified member of the serine protease family and may possess serine protease activity, and more particularly, may possess enzymatic activity typical of other trypsin-like enzymes. Homology was also found to exist between the PR01461 amino acid sequence and other 5 trypsin-like enzymes and serine proteases in the Dayhoff database.
76. PRQ1410
The DNA68874-1622 clone was isolated from a human brain meningioma tissue library using a trapping technique which selects for nucleotide sequences encoding proteins. As far as is known, the DNA68874-1622 10 sequence encodes a novel factor designated herein as PR01410; using the WU-BLAST2 sequence alignment computer program, no sequence identities to any known proteins were revealed.
77. PRQ1S68
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 15 sequence PR01568 (shown in Figure 154 and SEQ ID NO:273) has certain amino acid sequence identity to tetraspan 5 and tetraspan 4. Accordingly, it is presently believed that PR01568 disclosed in the present application is a newly identified member of the tetraspanin family and may possess molecular facilitator activity typical of this family.
78. PROl 570
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01570 (shown in Figure 156 and SEQ ID NO:275) has certain amino acid sequence identity with SP60; however, for the first time, the first 199 amino acids (or amino terminal end) of that protein are identified and presented herein. Accordingly, it is presently believed that PR01570 disclosed in the present application 25 is a newly identified member of the serine protease family and is involved in carcinoma.
79. PRQ1317
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01317 (shown in Figure 158 and SEQ ID NO:277) has certain amino acid sequence identity with 30 a known semaphorin B protein, designated "148745" on the Dayhoff database. Accordingly, it is presently believed that PR01317 disclosed in the present application is a newly identified member of the semaphorin glycoprotein family and may possess activity or properties typical of semaphorins.
80. PRQ1780
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01780 (shown in Figure 160 and SEQ ID NO:282) has certain amino acid sequence identity with a known glucuronosyltransferase designated "UDA2 RAB1T" in the Dayhoff database. Accordingly, it is
339
presently believed that PRO 1780 disclosed in the present application is a newly identified member of the gluciironosyltransferase family and may possess enzymatic activity and other properties typical of the glucuronosyltransferase family.
81. PRQ1486
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01486 (shown in Figure 162 and SEQ ID NO:287) has certain amino acid sequence identity with cerebellin I precursor. Accordingly, it is presently believed that PRO1486 disclosed in the present application is a newly identified member of the cerebellin family and shares at least one mechanism with cerebellin.
82. PRQ1433
The DNA71184-1634 clone was isolated from a human adrenal gland tissue library using a trapping technique which selects for nucleotide sequences encoding proteins. As far as is known, the DNA71184-1634 sequence encodes a novel factor designated herein as PR01433; using the WU-BLAST2 sequence alignment computer program, no sequence identities to any known proteins were revealed.
83. PROl 490
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PR01490 (shown in Figure 166 and SEQ ID NO-.297) has certain amino acid sequence identity with a portion of the l-acyl-sn-glycerol-3-phosphate acyltransferase protein (S60478). 20 Accordingly, it is presently believed that PR01490 disclosed in the present application is a newly identified member of the acyltransferase protein family and may possess activity typical of l-acyl-sn-glycerol-3-phosphate acyltransferase proteins.
84. PRQ1482
The DNA71234-1651 clone was isolated from a human adrenal gland library using a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA71234-1651 clone does encode a secreted factor. As far as is known, the DNA71234-1651 sequence encodes a novel factor designated herein as PR01482; using the WU-BLAST2 sequence alignment computer program, no sequence identities to any known proteins were revealed.
85. PROl 446
As far as is known, the DNA71277-1636 sequence encodes a novel factor designated herein as PRO1446. Using WU-BLAST2 sequence alignment computer programs, minimal sequence identities to known proteins were revealed.
86. PRQ1558
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length
340
native sequence PRO 1558 (shown in Figure 172 and SEQ ID N0:306) has significant amino acid sequence identity with a methyltransferase protein (CAMTEUCGU). Accordingly, it is presently believed that PR01558 disclosed in the present application is a newly identified member of the methyltransferase protein family and may possess activity typical of that protein family.
87. PRQ16Q4
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01604 (shown in Figure 174 and SEQ ID N0:308) has certain amino acid sequence identity with the mouse liver cancer-originated cell growth factor designated PW37483 on the Dayhoff database. Accordingly, it is presently believed that PR01604 disclosed in the present application is a newly identified 10 member of the HDGF family and may possess growth factor activity typical of other HDGFs.
88. PRQ149I
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PRO 1491 (shown in Figure 176 and SEQ ID N0:310) has certain amino acid 15 sequence identity with a portion of the collapsin-2 protein of Gallus gallus (GGU28240 1). Accordingly, it is presently believed that PR01491 disclosed in the present application is a newly identified member of the collapsin protein family and may possess activity typical of that protein family.
89. PRQ1431
It has been found that the full-length native sequence PRO 1431 [shown in Figure 178(SEQIDNO:315)
has significant sequence identity with the SID domain containing protein SHI7HUMAN. Accordingly, it is presently believed that PR01431 disclosed in the present application is a newly identified member of proteins having an SH3 domains and may possess signal transduction properties.
90. PROl 563
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of a full-length native sequence PR01563 (shown in Figure 180 and SEQ ID N0:317) has certain amino acid sequence identity with aportion of the mouse ADAMTS-1 protein (AB001735 1). Accordingly, it is presently believed that PR0I563 disclosed in the present application is a newly identified member of the ADAM protein 30 family and may possess activity typical of that protein family.
91. PRQ1S6S
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PRO 1565 (shown in Figure 182 and SEQ ID N0-.322) has certain amino acid 35 sequence identity with a portion of the chondromodulin-I protein of rattus norvegicus (AF051425_1). Accordingly, it is presently believed that PR01565 disclosed in the present application is a newly identified member of the chondromodulin protein family and may possess activity typical of that protein family.
341
92. PRQ1571
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of die full-length native sequence PR01571 (shown in Figure 184 and SEQ ID NO:324) has certain amino acid sequence identity with a portion of the human Clostridium perfringens enterotoxin receptor protein (AB000712_1). Accordingly, it is presently believed that PR01S71 disclosed in the present application is a 5 newly identified CPE-R homolog and may possess activity typical of the CPE-R protein.
93. PRQ1S72
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01S72 (shown in Figure 186 and SEQ ID NO:326) has certain amino acid sequence identity with 10 CPE-R. Accordingly, it is presently believed that PR01572 disclosed in the present application is related to CPE-R and may possess at least one shared mechanism.
94. PRQ1573
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 15 sequence PR01S73 (shown in Figure 188 and SEQ ID NO:328) has certain amino acid sequence identity with CPE-R. Accordingly, it is presently believed that PR01573 disclosed in the present application is related to CPE-R and may possesses at least one shared mechanism.
95. PROl 488
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PRO1488 (Figure 190; SEQ ID N0:330) has certain amino acid sequence identity with a known CPE-R designated "AB0007I2_1" on the Dayhoff database. Accordingly, it is presently believed that PR01488 disclosed in the present application is a newly identified member of the CPE-R family and may possess binding activity typical of the CPE-R family.
96. FRQ1489
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PR01489 (shown in Figure 192 and SEQ ID NO:332) has certain amino acid sequence identity with the Clostridium perfringens enterotoxin receptor of Cercopithecus aethiops (D88492 1).
Accordingly, it is presently believed that PR01489 disclosed in the present application is a newly identified Clostridium perfringens enterotoxin receptor homolog and may possess activity typical of the Clostridium perfringens enterotoxin receptor protein.
97. PRQ1474
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01474 (shown in Figure 194 and SEQ ID NO:334) has certain amino acid sequence identity with ovomucoid. Accordingly, it is presendy believed that PRO1474 disclosed in the present application is a newly
342
identified member of the kazal serine protease inhibitor family and may possess serine protease inhibitory activity typical of this family.
98. PROl 508
The DNA73742-1508 clone was isolated from a human diseased cartilage tissue library. As far as is S known, the DNA73742-1508 sequence encodes a novel factor designated herein as PRO1508; although, using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins were revealed.
99. PROl 555
The DNA73744-1665 clone was isolated from a human tissue library. As far as is known, the
DNA73744 sequence encodes a novel transmembrane protein designated herein as PRO1555. Using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins were revealed.
100. PROl 485
IS Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01485 (shown in Figure 200 and SEQ ID N0:340) has certain amino acid sequence identity with lysozyme C precursor peptide. Accordingly, it is presently believed that PR01485 disclosed in the present application is a newly identified member of the lysozyme family and shares at least one like mechanism.
101. PRQ1564
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of a full-length native sequence PR01564 (shown in Figure 202 and SEQ ID N0:347) has certain amino acid sequence identity with a portion of a mouse polypeptide GalNAc transferase T4 protein (MMU73819_1). Accordingly, it is presently believed that PRO1564 disclosed in the present application is a newly identified 25 member of the N-acetylgalactosaminyltransferase protein family and may possess activity typical of that protein family.
102. FRQ1755
As far as is known, the DNA76396-1698 sequence encodes a novel transmembrane protein designated 30 herein as PR01755. Although, some sequence identities to known proteins was revealed using WU-BLAST2 sequence alignment computer programs.
103. PROl 757
The DNA76398-1699 clone was isolated from a human testicular tissue library using a trapping 35 technique which selects for nucleotide sequences encoding proteins. As far as is known, the DNA76398-1699 sequence encodes a novel factor designated herein as PR01757; using the WU-BLAST2 sequence alignment computer program, no significant sequence identities to any known proteins were revealed.
343
104. PRQ17S8
The DNA76399-1700 clone was isolated from a library derived from human thymus tissue obtained from a fetus that died at 17 weeks' gestation from anencephalus. It is believed that the DNA76399-1700 clone encodes a novel secreted factor, designated herein as PR01758. Using WU-BLAST2 sequence alignment computer programs, significant sequence identity was revealed between the amino acid sequences of PRO 1758 5 and Dayhoff sequence No. AC005328_2.
105. PRQ1575
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01S7S (shown in Figure 210 and SEQ ID NO:358) has certain amino acid sequence identity with 10 Dayhoff sequence no. A12005_l. Accordingly, it is presently believed that PROI575 disclosed in the present application is a newly identified member of the protein disulfide isomerase family and may possess activity and properties typical of the disulfide isomerase family.
106. PRQ1787
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01787 (shown in Figure 212 and SEQ ID NO:364) has certain amino acid sequence identity with various species of myelin pO. Accordingly, it is presently believed that PR01787 disclosed in the present application is a newly identified member of the myelin pO protein family and may share at least one similar mechanism. It is believed that modulators of PRO1787 may be used to treat myelin pO associated disorders, such 20 as neuropathy, hereditary tooth disease, etc.
107. PRQ1781
Using WU-BLAST2 sequence alignment computer programs, some sequence identities were found between the PR01781 amino acid sequence (SEQ ID NO:366) and the amino acid sequences of known proteins, 25 but were not found to be significant. Accordingly, as far as is known, the DNA76522-2500 sequence encodes a novel protein.
108. PROl 556
The DNA76529-1666 clone was isolated from a human breast tumor tissue library. As far as is known, 30 the DNA76S29-1666 sequence encodes a novel transmembrane protein designated herein as PR01S56. Using WU-BLAST2 sequence alignment computer programs, some sequence identities to known proteins were revealed.
109. PRQ17S9
As far as is known, the DNA76531-1701 sequence encodes a novel factor designated herein as
PR017S9; using WU-BLAST2 sequence alignment computer programs, limited sequence identities to known proteins were revealed.
344
110. PRQ1760
As far as is known, the DNA76532-1702 sequence encodes a novel factor designated herein as PR01760; using WU-BLAST2 sequence alignment computer programs, limited sequence identities to known proteins were revealed.
111. PROl 561
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of a full-length native sequence PRO 1561 (shown in Figure 222 and SEQ ID NO:378) has certain amino acid sequence identity with a portion of the human phospholipase A2 protein (P_R63053). Accordingly, it is presently believed that PR01561 disclosed in the present application is a newly identified member of the 10 phospholipase A2 protein family and may possess activity typical of that protein family.
112. PRQ1S67
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01567 (Figure 224; SEQ ID NO:383) has certain amino acid sequence identity with human colon 15 specific gene CSG6 polypeptide, identified as P W06549 on the Dayhoff database. Accordingly, it is presently believed that PR01567 disclosed in the present application is a newly identified CSG expression product, and may possess properties typical of such proteins.
113. PRQ1693
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PR01693 (shown in Figure 226 and SEQ ID NO:385) has certain amino acid sequence identity with a portion of a mouse insulin-like growth factor binding protein (ALS_MOUSE). Accordingly, it is presently believed that PR01693 disclosed in the present application is a newly identified member of the insulin-like growth factor binding protein family and may possess activity typical of that protein 25 family.
114. PROl 784
As far as is known, die DNA77303-2502 sequence encodes a novel factor designated herein as PR01784; using WU-BLAST2 sequence alignment computer programs, some sequence identities to known 30 proteins were revealed.
115. PRQ1605
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PR01605 (shown in Figure 230 and SEQ ID NO:395) has certain amino acid 35 sequence identity with a portion of the human alpha-l,3-mannosylglycoprotein beta-1,6-n-acetyltransferase protein (GNT5_HUMAN). Accordingly, it is presently believed that PR01605 disclosed in the present application is a newly identified member of the glycosyltransferase protein family and may possess activity
345
typical of that protein family.
116. PRQ1788
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native sequence PR01788 (shown in Figure 232 and SEQ ID NO-.397) has certain amino acid sequence identity with 5 Dayhoff sequence " GARP_HUMAN", a leucine-rich repeat-containing protein encoded by a gene localized in the llql4 chromosomal region. Accordingly, it is presently believed that PR01788 disclosed in the present application is a newly identified member of the leucine-rich repeat-containing family and may possess activity or properties typical of the leucine-rich repeat-containing family.
117. PRQ1801
Using the WU-BLAST2 sequence alignment computer program, it has been found that a portion of the full-length native sequence PR01801 (shown in Figure 234 and SEQ ID N0:402) has certain amino acid sequence identity with a portion of the IL-19 protein (P W37935). Accordingly, it is presently believed that PR01801 disclosed in the present application is a newly identified member of the IL-10-related cytokine family 15 and may possess activity typical of that cytokine family.
118. UCP4
Using the Megalign DNASTAR computer program (and algorithms and parameters in this software set by the manufacturer) (Oxford Molecular Group, Inc.), it has been found that a full-length native sequence UCP4 20 (shown in Figure 236 and SEQ ID N0:406) has certain amino acid sequence identity with UCP3, UCP2 and UCP1. Accordingly, it is presently believed that UCP4 disclosed in the present application is a newly identified member of the human uncoupling protein family and may possess activity(s) and/or property(s) typical of that protein family, such as the ability to enhance or supress metabolic rate by affecting mitochondrial membrane potential.
119. PRQ193
The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides referred to in die present application as PR0193. In particular, Applicants have identified and isolated cDNA encoding a PR0193 polypeptide, as disclosed in further detail in the Examples below. The PR0193-encoding 30 clone was isolated from a human retina library.
120. PROl 130
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PROl 130 (shown in Figure 240 and SEQ ID NO-.415) has amino acid sequence identity with 35 the human2-19 protein. Accordingly, it is presently believed that PROl 130 disclosed in the present application is a newly identified 2-19 protein homolog.
346
121. PROl 335
Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length native sequence PR01335 (shown in Figure 242 and SEQ ID NO-.423) has certain amino acid sequence identity with the human carbonic anhydrase precursor protein (AF037335 1). Accordingly, it is presently believed that PRO1335 disclosed in the present application is a newly identified member of the carbonic anhydrase protein 5 family and may possess activity typical of that family.
122. PRQ1329
The DNA66660-1585 clone is believed to encode a secreted factor. As far as is known, the DNA66660-1585 sequence encodes a novel factor designated herein as PR01329; using WU-BLAST2 sequence alignment 10 computer programs, sequence identities to known proteins were revealed but determined not to be significant.
123. PROl 550
The DNA76393-1664 clone was isolated from a subtracted human breast tumor tissue library. As far as is known, the DNA76393-1664 sequence encodes a novel factor designated herein as PR01550; using WU-15 BLAST2 sequence alignment computer programs, sequence identities to known proteins were revealed but determined not to be significant.
B. PRO Variants
In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated that 20 PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide changes into the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the art will appreciate that amino acid changes may alter post-translational processes of the PRO, such as changing the number or position of glycosylation sites or altering the membrane anchoring characteristics.
Variations in the native full-length sequence PRO or in various domains of the PRO described herein, 25 can be made, for example, using any of the techniques and guidelines for conservative and non-conservative mutations set forth, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the PRO as compared with the native sequence PRO. Optionally the variation is by substitution of at least one amino acid with any other amino acid in one or more of the domains of the PRO. Guidance in determining which 30 amino acid residue may be inserted, substituted or deleted without adversely affecting the desired activity may be found by comparing the sequence of the PRO with that of homologous known protein molecules and minimizing the number of amino acid sequence changes made in regions of high homology. Amino acid substitutions can be the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid 35 replacements. Insertions or deletions may optionally be in the range of about 1 to 5 amino acids. The variation allowed may be determined by systematically making insertions, deletions or substitutions of amino acids in the sequence and testing the resulting variants for activity exhibited by the full-length or mature native sequence.
347
WO 00/12708 PCT/US99/20111
PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-terminus or C-terminus, or may lack internal residues, for example, when compared with a full length native protein. Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PRO polypeptide.
PRO fragments may be prepared by any of a number of conventional techniques. Desired peptide 5 fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by particular amino acid residues, or by digesting the DNA with suitable restriction enzymes and isolating the desired fragment. Yet another suitable technique involves isolating and amplifying a DNA fragment encoding a desired polypeptide fragment, by polymerase chain reaction (PCR). Oligonucleotides that define the desired 10 termini of the DNA fragment are employed at the 5* and 3' primers in die PCR. Preferably, PRO polypeptide fragments share at least one biological and/or immunological activity with the native PRO polypeptide disclosed herein.
In particular embodiments, conservative substitutions of interest are shown in Table 1 under the heading of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial IS changes, denominated exemplary substitutions in Table 1, or as further described below in reference to amino acid classes, are introduced and the products screened.
348
Table 1
Original
Exemplary
Preferred
Residue
Substitutions
Substitutions
Ala (A)
val; ieu; ile val
Arg (R)
lys; gin; asn lys
Asn(N)
gin; his; lys; arg gin
Asp (D)
glu glu
Cys (C)
ser ser
Gln(Q)
asn asn
Glu (E)
asp asp
Gly (G)
pro; ala ala
His (H)
asn; gin; lys; arg arg
De (D
leu; val; met; ala; phe;
norleucine leu
Leu (L)
norleucine; ile; val;
met; ala; phe ile
Lys (K)
arg; gin; asn arg
Met (M)
leu; phe; ile leu
Phe (F)
leu; val; ile; ala; tyr leu
Pro (P)
ala ala
Ser (S)
thr thr
Thr (T)
ser ser
Trp (W)
tyr; phe tyr
TyrOO
trp; phe; thr; ser phe
Val (V)
ile; leu; met; phe;
ala; norleucine leu
Substantial modifications in functionor immunologicalidentity of the PRO polypeptide are accomplished 30 by selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally occurring residues are divided into groups based on common side-chain properties:
(1) hydrophobic: norleucine, met, ala, val, leu, ile;
(2) neutral hydrophiiic: cys, ser, thr;
(3) acidic: asp, glu;
(4) basic: asn, gin, his, lys, arg;
(5) residues that influence chain orientation: gly, pro; and
(6) aromatic: trp, tyr, phe.
40 Non-conservative substitutions will entail exchanging a member of one of these classes for another class.
Such substituted residues also may be introduced into the conservative substitution sites or, more preferably, into the remaining (non-conserved) sites.
The variations can be made using methods known in the art such as oiigonucleotide-mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis [Carter etal., Nucl. 45 Acids Res.. 13:4331 (1986); Zoller et al., Nucl. Acids Res.. 10:6487 (1987)], cassene mutagenesis [Wells et al., Gene. 24:315 (1985)], restriction selection mutagenesis [Wells et al., Philos. Trans. R. Soc. London Ser A. 317:415 (1986)] or other known techniques can be performed on the cloned DNA to produce the PRO variant
349
DNA.
Scanning amino acid analysis can also be employed to identify one or more amino acids along a contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid among this group because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-5 chain conformation of the variant [Cunningham and Wells, Science. 244: 1081-1085 (1989)]. Alanine is also typically preferred because it is the most common amino acid. Further, it is frequently found in both buried and exposed positions [Creighton, The Proteins. (W.H. Freeman & Co., N.Y.); Chothia, J. Mot. Biol.. 150:1 (1976)]. If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used.
C. Modifications of PRO
Covalent modifications of PRO are included within the scope of this invention. One type of covalent modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic derivatizing agent that is capable of reacting with selected side chains or the N- or C- terminal residues of the PRO. Derivatization with bifunctional agents is useful, for instance, for crosslinking PRO to a water-insoluble support 15 matrix or surface for use in the method fm- purifying anti-PRO antibodies, and vice-versa. Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-azidosaiicylic acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido-1,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate.
Other modifications include deamidation of glutaminyi and asparaginyl residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the a-amino groups of lysine, arginine, and histidine side chains [T.E. Creighton, Proteins: Structure and Molecular Properties. W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-terminal carboxyl group. 25 Another type of covalent modification of the PRO polypeptide included within the scope of this invention comprises altering the native glycosylation pattern of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein to mean deleting one or more carbohydrate moieties found in native sequence PRO (either by removing the underlying glycosylation site or by deleting the glycosylation by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the 30 native sequence PRO. In addition, the phrase includes qualitative changes in the glycosylation of the native proteins, involving a change in the nature and proportions of the various carbohydrate moieties present.
Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino acid 35 sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the desired amino acids.
350
Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the art, e.g., in WO 87/05330 published 11 September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem.. pp. 259-306 (1981).
Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically 5 or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets for glycosylation. Chemical deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, et al., Arch. Biochem. Biophvs.. 259:52 (1987) and by Edge et al., Anal. Biochem.. 118:131 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth. F.nzvmol 138:350 (1987). 10 Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in themannersetforthinU.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or4,179,337.
The PRO of the present invention may also be modified in a way to form a chimeric molecule comprising PRO fused to another, heterologous polypeptide or amino acid sequence.
In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a tag polypeptide which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed at the amino- or carboxyl- terminus of the PRO. The presence of such epitope-tagged forms of the PRO can be detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the PRO to be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds 20 to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the an. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol.. &2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 9E10 antibodies thereto [Evan et al., Molecular and Cellular Biology. 5,:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et al., Protein Engineering. 3(6):547-25 553 (1990)]. Other tag polypeptides include the Flag-peptide [Hopp et al., BioTechnologv. 1204-1210 (1988)]; the KT3 epitope peptide [Martin et al., Science. 255:192-194 (1992)]; an a-tubulin epitope peptide [Skinner et al., J. Biol. Chem.. 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA. 82:6393-6397 (1990)].
In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an 30 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric molecule (also referred to as an "immunoadhesin"), such a fusion could be to the Fc region of an IgG molecuie. The Ig fusions preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PRO polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred embodiment, the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 35 regions of an IgGl molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428,130 issued June 27, 1995.
351
D. Preparation of PRO
The description below relates primarily to production of PRO by culturing cells transformed or transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative methods, which are well known in the art, may be employed to prepare PRO. For instance, the PRO sequence, or portions thereof, may be produced by direct peptide synthesis using solid-phase techniques [see, e.g., Stewart 5 etal.. Solid-Phase Peptide Synthesis. W.H. Freeman Co.. San Francisco. CAn969t: Merrifield. J. Am. Chem. Soc.. 85:2149-2154 <1963)3- In vitro protein synthesis may be performed using manual techniques or by automation. Automated synthesis may be accomplished, for instance, using an Applied Biosystems Peptide Synthesizer (Foster City, CA) using manufacturer's instructions. Various portions of the PRO may be chemically synthesized separately and combined using chemical or enzymatic methods to produce the full-length 10 PRO.
1. Isolation of DNA Encoding PRO
DNA encoding PRO may be obtained from a cDNA library prepared from tissue believed to possess the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be conveniently IS obtained from a cDNA library prepared from human tissue, such as described in the Examples. The PRO-encoding gene may also be obtained from a genomic library or by known synthetic procedures (e.g., automated nucleic acid synthesis).
Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA 20 or genomic library with the selected probe may be conducted using standard procedures, such as described in Sambrook et al., Molecular Cloning: A Laboratory Mannal rNew York: Cold Spring Harbor Laboratory Press, 1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology [Sambrook et al., sunra: Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold Spring Harbor Laboratory Press, 1995)].
The Examples below describe techniques for screening a cDNA library. The oligonucleotide sequences 25 selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library being screened. Methods of labeling are well known in the art, and include the use of radiolabels like 32P-labeled ATP, biotinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high stringency, are provided in Sambrook et al., supra.
Sequences identified in such library screening methods can be compared and aligned to other known sequences deposited and available in public databases such as GenBank or other private sequence databases. Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across the full-length sequence can be determined using methods known in the art and as described herein.
Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 35 libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using conventional primer extension procedures as described in Sambrook et al., supra, to detect precursors and processing intermediates of mRNA that may not have been reverse-transcribed into cDNA.
352
2. Selection and Transformation of Host Cells Host cells are transfected or transformed with expression or cloning vectors described herein for PRO production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting transformants, or amplifying the genes encoding the desired sequences. The culture conditions, such as media, temperature, pH and the like, can be selected by the skilled artisan without undue experimentation. In general, principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in Mammalian Cell Biotechnology: a Practical Approach. M. Butler, ed. (IRL Press, 1991) and Sambrook et al.,
Methods of eukaryotic cell transfection and prokaryotic cell transformation are known to the ordinarily skilled artisan, for example, CaCl2, CaP04, liposome-mediated and electroporation. Depending on the host cell 10 used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment employing calcium chloride, as described in Sambrook et al., supra, or electroporation is generally used for prokaryotes. Infection with Agrobacterium tumefaciens is used for transformation of certain plant cells, as described by Shaw etal., Gene, 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb, Virology. 15 52:456-457 (1978) can be employed. General aspects of mammalian cell host system transfections have been described in U.S. Patent No. 4,399,216. Transformations into yeast are typically carried out according to the method of Van Solingen etal., J. Bact.. 130:946 (1977) and Hsiao etal., Proc. Natl. Acad. Sci. (USA). 76:3829 (1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, electroporation, bacterial protoplast fusion with intact cells, or polycations, e.g., polybrene, polyoraithine, may 20 also be used. For various techniques for transforming mammalian cells, see Keown et al., Methods in Enzvmologv. 185:527-537 (1990) and Mansour et al., Nature. 336:348-352 (1988).
Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote, yeast, or higher eukaryote cells. Suitable prokaryotes include but are not limited to eubacteria, such as Gram-negative or Gram-positive organisms, for example, Enterobacteriaceae such as E. coli. Various E. coli strains are 25 publicly available, such as E. coli K12 strain MM294 (ATCC 31,446); E. coli X1776 (ATCC 31,537); E. coli strain W31I0 (ATCC 27,325) and K5 772 (ATCC 53,635). Other suitable prokaryotic host cells include Enterobacteriaceae such as Escherichia, e.g., E. coli, Enterobacter, Erwinia, Klebsiella, Proteus, Salmonella, e.g., Salmonella typhimurium, Serratia, e.g., Serratia marcescans, and Shigella, as well as Bacilli such as B. subtUis and B. licheniformis (e.g., B. licheniformis 41P disclosed in DD 266,710 published 12 April 1989), 30 Pseudomonas such as P. aeruginosa, and Streptomyces. These examples are illustrative rather than limiting. Strain W3110 is one particularly preferred host or parent host because it is a common host strain for recombinant DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For example, strain W3110 may be modified to effect a genetic mutation in the genes encoding proteins endogenous to the host, with examples of such hosts including E. coli W3110 strain 1A2, which has the complete genotype 35 tonA ; E. coli W3110 strain 9E4, which has the complete genotype tonA ptr3; E. coli W3110 strain 27C7 (ATCC 55,244), which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompTkarf; E. coli W3110 strain 37D6, which has the complete genotype tonA ptr3 phoA E15 (argF-lac)169 degP ompT rbs7
353
ilvG kaif; E. coli W3I10 strain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion mutation; and an E. coli strain having mutant periplasmic protease disclosed in U.S. Patent No. 4,946,783 issued 7 August 1990. Alternatively, in vitro methods of cloning, e.g., PCR or other nucleic acid polymerase reactions, are suitable.
In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable cloning 5 or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic host microorganism. Others include Schizosaccharomycespombe (Beach and Nurse, Nature. 290: 140 [1981]; EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al., Bio/Technology. 9:968-975 (1991)) such as, e.g., 2ST. lactis (MW98-8C, CBS683, CBS4574; Louvencourtetal., J. Bacterid.. 737 [1983]), K. fragilis (ATCC 12,424), K. bulgaricus (ATCC 16,045), K. wickeramii (ATCC 10 24,178), K. waltii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al., Bio/Technology. 8:135 (1990)), K. thermotolerans, and K. marxianus; yarrowia (EP 402,226); Pichia pastoris (EP 183,070; Sreekrishna et al., J. Basic Microbiol.. 28:265-278 [1988]); Candida; Trichoderma reesia (EP 244,234); Neurospora crassa (Case et al., Proc. Natl. Acad. Sci. USA. 76:5259-5263 [1979]); Schwanniomyces such as Schwanniomyces occidental^ (EP 394,538 published 31 October 1990); and iilamentous fungi such as, e.g., 15 Neurospora, Pemcillium, Tolypocladium (WO 91/00357 published 10 January 1991), and Aspergillus hosts such as A. nidulans (Ballance et al., Biochem. Biophvs. Res. Commun.. 112:284-289 [1983]; Tilburnet al., Gene. 26:205-221 [1983]; Yelton et al., Proc. Natl. Acad. Sci. USA. 81: 1470-1474 [1984]) and A. niger (Kelly and Hynes, EMBO J.. 4:475-479 [1985]). Methylotropic yeasts are suitable herein and include, but are not limited to, yeast capable of growth on methanol selected from the genera consisting of Hansenula, Candida, Kloeckera, 20 Pichia, Saccharomyces, Torulopsis, and Rhodotorula. A list of specific species that are exemplary of this class of yeasts may be found in C. Anthony, The Biochemistry of Methvlotronhs. 269 (1982).
Suitable host cells for the expression of glycosylated PRO are derived from multicellular organisms. Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells. 25 More specific examples include monkey kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., J._ Gen Virol.. 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad. Sci. USA. 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, Biol. Reprod.. 23:243-251 (1980)); human lung cells (W138, ATCC CCL 75); human liver cells (Hep G2, HB 8065); and mouse mammary tumor (MMT 30 060562, ATCC CCL51). The selection of the appropriate host cell is deemed to be within the skill in the art.
3. Selection and Use of a Replicable Vector
The nucleic acid (e.g., cDNA or genomic DNA) encoding PRO may be inserted into a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector 35 may, for example, be in the form of a plasmid, cosmid, viral panicle, or phage. The appropriate nucleic acid sequence may be insened into the vector by a variety of procedures. In general, DNA is inserted into an appropriate restriction endonuclease site(s) using techniques known in the an. Vector components generally
354
include, but are not limited to, one or more of a signal sequence, an origin of replication, one or more marker genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable vectors containing one or more of these components employs standard ligation techniques which are known to the skilled artisan.
The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a 5 heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site at the N-terminus of the mature protein or polypeptide. In general, the signal sequence may be a component of the vector, or it may be a part of the PRO-encoding DNA that is inserted into the vector. The signal sequence may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, penicillinase, lpp, or heat-stable enterotoxin II leaders. For yeast secretion the signal sequence may be, e.g., 10 the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyveromyces a-factor leaders, the latter described in U.S. Patent No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15 November 1990. In mammalian cell expression, mammalian signal sequences may be used to direct secretion of the protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral 15 secretory leaders.
Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative bacteria, the 2/t plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, VSV or BPV) are useful for 20 cloning vectors in mammalian cells.
Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. Typical selection genes encode proteins that (a) confer resistance to antibiotics or other toxins, e.g., ampicillin, neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients not available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 25 An example of suitable selectable markers for mammalian cells are those that enable the identification of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An appropriate host cell when wild-type DHFR is employed is the CHO cell line deficient in DHFR activity, prepared and propagated as described by Urlaub etal., Proc. Natl. Acad. Sci. USA. 77:4216 (1980). A suitable selection gene for use in yeast is the trpl gene present in the yeast plasmid YRp7 [Stinchcomb et al., Nature. 30 282:39 (1979); Kingsman et al., Gene. 7:141 (1979); Tschemper et al., Gene. 10:157 (1980)]. The trpl gene provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 [Jones, Genetics. 85:12 (1977)].
Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host cells are well 35 known. Promoters suitable for use with prokaryotic hosts include the p-lactamase and lactose promoter systems [Chang et al., Nature. 275:615 (1978); Goeddel et al., Nature. 281:544 (1979)], alkaline phosphatase, a tryptophan (trp) promoter system [Goeddel, Nucleic Acids Res.. 8:4057 (1980); EP 36,776], and hybrid
355
promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA. 80:21-25 (1983)]. Promoters for use in bacterial systems also will contain a Shine-Daigarno (S.D.) sequence operably linked to the DNA encoding PRO.
Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase [Hitzeman et al., J. Biol. Chem.. 255:2073 (1980)] or other glycolytic enzymes [Hess 5 et al., J. Adv. Enzvme Res.. 7:149 (1968); Holland, Biochemistry. 17:4900 (1978)], such as enolase, glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase,phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphateisomerase,phosphoglucose isomerase, and glucokinase.
Other yeast promoters, which are inducible promoters having the additional advantage of transcription 10 controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein, glyceraIdehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilisation Suitable vectors and promoters for use in yeast expression are further described in EP 73,657.
PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 15 obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,211,504 published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis-B virus and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g., the actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are compatible with the host cell systems.
Transcription of a DNA encoding the PRO by higher eukaryotes may be increased by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from mammalian genes (globin, elastase, albumin, a-fetoprotein, and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication 25 origin (bp 100-270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5' or 3' to the PRO coding sequence, but is preferably located at a site 5' from the promoter.
Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination of 30 transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PRO.
Still other methods, vectors, and host cells suitable for adaptation to the synthesis of PRO in recombinant vertebrate cell culture are described in Gething et al., Nature. 293:620-625 (1981); Mantei et al., 35 Nature. 281:40-46 (1979); EP 117,060; and EP 117,058.
356
4. Detecting Gene Amplification/Expression
Gene amplification and/or expression may be measured in a sample directly, for example, by conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA [Thomas, Proc. Natl. Acad. Sci. USA. 77:5201-5205 (1980)], dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled probe, based on the sequences provided herein. Alternatively, antibodies may be employed 5 that can recognize specific duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. The antibodies in turn may be labeled and the assay may be carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected.
Gene expression, alternatively, may be measured by immunological methods, such as 10 immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based on the DNA sequences provided herein or against exogenous sequence fused to PRO DNA and encoding a specific 15 antibody epitope.
. Purification of Polypeptide
Forms of PRO may be recovered from culture medium or from host cell Iysates. If membrane-bound, it can be released from the membrane using a suitable detergent solution (e.g. Triton-X 100) or by enzymatic 20 cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical means, such as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing agents.
It may be desired to purify PRO from recombinant cell proteins or polypeptides. The following procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; ethanol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as 25 DEAE; cbromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as IgG; and metal chelating columns to bind epitope-tagged forms of the PRO. Various methods of protein purification may be employed and such methods are known in the art and described for example in Deutscher, Methods in Enzvmologv. 182 (1990); Scopes, Protein Purification: Principles and Practice. Springer-Verlag, New York (1982). The purification 30 step(s) selected will depend, for example, on the nature of the production process used and the particular PRO produced.
E. Uses for PRO
Nucleotide sequences (or their complement) encoding PRO have various applications in the art of 35 molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the generation of anti-sense RNA and DNA. PRO nucleic acid will also be useful for the preparation of PRO polypeptides by the recombinant techniques described herein.
357
The full-length native sequence PRO gene, or portions thereof, may be used as hybridization probes for a cDNA library to isolate the full-length PRO cDNA or to isolate still other cDNAs (for instance, those encoding naturally-occurring variants of PRO or PRO from other species) which have a desired sequence identity to the native PRO sequence disclosed herein. Optionally, the length of the probes will be about 20 to about SO bases. The hybridization probes may be derived from at least partially novel regions of the full length native 5 nucleotide sequence wherein those regions may be determined without undue experimentation or from genomic sequences including promoters, enhancer elements and introns of native sequence PRO. By way of example, a screening method will comprise isolating the coding region of the PRO gene using the known DNA sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of labels, including radionucleotides such as nP or 35S, or enzymatic labels such as alkaline phosphatase coupled to the 10 probe via avidin/biotin coupling systems. Labeled probes having a sequence complementary to that of the PRO gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to determine which members of such libraries the probe hybridizes to. Hybridization techniques are described in further detail in the Examples below.
Any EST sequences disclosed in the present application may similarly be employed as probes, using 15 the methods disclosed herein.
Other useful fragments of the PRO nucleic acids include antisense or sense oligonucleotides comprising a singe-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target PRO mRNA (sense) or PRO DNA (antisense) sequences. Antisense or sense oligonucleotides, according to the present invention, comprise a fragment of the coding region of PRO DNA. Such a fragment generally comprises at least about 14 20 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example, Stein and Cohen (Cancer Res. 48:2659. 1988) and van der Krol et al. (BioTechnioues 6:958. 1988).
Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation of duplexes that block transcription or translation of the target sequence by one of several means, including 25 enhanced degradation of the duplexes, premature termination of transcription or translation, or by other means. The antisense oligonucleotides thus may be used to block expression of PRO proteins. Antisense or sense oligonucleotides further comprise oligonucleotides having modified sugar-phosphodiester backbones (or other sugar linkages, such as those described in WO 91/06629) and wherein such sugar linkages are resistant to endogenous nucleases. Such oligonucleotides with resistant sugar linkages are stable in vivo (i.e„ capable of 30 resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide sequences.
Other examples of sense or antisense oligonucleotides include those oligonucleotides which are covalently linked to organic moieties, such as those described in WO 90/10048, and other moieties that increases affinity of the oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Further still, intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or 35 antisense oligonucleotides to modify binding specificities of the antisense or sense oligonucleotide for the target nucleotide sequence.
Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid
358
sequence by any gene transfer method, including, for example, CaP04-mediated DNA transfection, electroporation, or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing the target nucleic acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral vectors include, but are not limited to, those derived from the murine retrovirus M-MuLV, N2 (a retrovirus 5 derived from M-MuLV), or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO 90/13641).
Sense or antisense oligonucleotides also may be introduced into a cell containing die target nucleotide sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are not limited to, cell surface receptors, growth factors, other cytokines, 10 or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does not substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide or its conjugated version into the cell.
Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target nucleic acid sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The 15 sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase.
The probes may also be employed in PCR techniques to generate a pool of sequences for identification of closely related PRO coding sequences.
Nucleotide sequences encoding a PRO can also be used to construct hybridization probes for mapping the gene which encodes that PRO and for the genetic analysis of individuals with genetic disorders. The 20 nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, and hybridization screening with libraries.
When the coding sequences for PRO encode a protein which binds to another protein (example, where the PRO is a receptor), the PRO can be used in assays to identify the other proteins or molecules involved in the 25 binding interaction. By such methods, inhibitors of the receptor/ligand binding interaction can be identified. Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding interaction. Also, the receptor PRO can be used to isolate correlative ligand(s). Screening assays can be designed to find lead compounds that mimic the biological activity of a native PRO or a receptor for PRO. Such screening assays will include assays amenable to high-throughput screening of 30 chemical libraries, making them particularly suitable for identifying small molecule drug candidates. Small molecules contemplated include synthetic organic or inorganic compounds. The assays can be performed in a variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and cell based assays, which are well characterized in the art.
Nucleic acids which encode PRO or its modified forms can also be used to generate either transgenic 35 animals or "knock out" animals which, in turn, are useful in the development and screening of therapeutically useful reagents. A transgenic animal (e.g., a mouse or rat) is an animal having cells that contain a transgene, which transgene was introduced into the animal or an ancestor of the animal at a prenatal, e.g., an embryonic
359
stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal develops. In one embodiment, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in accordance with established techniques and the genomic sequences used to generate transgenic animals that contain cells which express DNA encoding PRO. Methods for generating transgenic animals, particularly animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patent 5 Nos. 4,736,866 and 4,870,009. Typically, particular cells would be targeted for PRO transgene incorporation with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding PRO introduced into the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression of DNA encoding PRO. Such animals can be used as tester animals for reagents thought to confer protection from, for example, pathological conditions associated with its overexpression. In accordance with this facet of 10 the invention, an animal is treated with the reagent and a reduced incidence of the pathological condition, compared to untreated animals bearing the transgene, would indicate a potential therapeutic intervention for the pathological condition.
Alternatively, non-human homologues of PRO can be used to construct a PRO "knock out" animal which has a defective or altered gene encoding PRO as a result of homologous recombination between the 15 endogenous gene encoding PRO and altered genomic DNA encoding PRO introduced into an embryonic stem cell of the animal. For example, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in accordance with established techniques. A portion of the genomic DNA encoding PRO can be deleted or replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor integration. Typically, several kilobases of unaltered flanking DNA (both at the 5" and 3' ends) are included 20 in the vector [see e.g., Thomas and Capecchi, Cell. 51:503 (1987) for a description of homologous recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected [see e.g., Li et al., Cell. 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras [see e.g., Bradley, in Terutocarcinomas and Embryonic Stem 25 Cells: A Practical Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 113-152], A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal and the embryo brought to term to create a "knock out" animal. Progeny harboring the homologously recombined DNA in their germ cells can be identified by standard techniques and used to breed animals in which all cells of the animal contain the homologously recombined DNA. Knockout animals can be characterized for instance, for their ability to defend 30 against certain pathological conditions and for their development of pathological conditions due to absence of the PRO polypeptide.
Nucleic acid encoding the PRO polypeptides may also be used in gene therapy. In gene therapy applications, genes are introduced into cells in order to achieve in vivo synthesis of a therapeutically effective genetic product, for example for replacement of a defective gene. "Gene therapy" includes both conventional 35 gene therapy where a lasting effect is achieved by a single treatment, and the administration of gene therapeutic agents, which involves the one time or repeated administration of a therapeutically effective DNA or mRNA. Antisense RNAs and DNAs can be used as therapeutic agents for blocking the expression of certain genes in
360
vivo. It has already been shown that short antisense oligonucleotides can be imported into cells where they act as inhibitors, despite their low intracellular concentrations caused by their restricted uptake by the cell membrane. (Zameenik et al.. Proc. Natl. Acad. Sci. USA 83:4143-4146 [1986]). The oligonucleotides can be modified to enhance their uptake, e.g. by substituting their negatively charged phosphodiester groups by uncharged groups.
There are a variety of techniques available for introducing nucleic acids into viable cells. The techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro include the use of liposomes, electroporation, microinjection, cell fusion, DEAE-dextran, the calcium phosphate precipitation method, etc. The currently preferred in vivo gene transfer techniques include transfection with viral 10 (typically retroviral) vectors and viral coat protein-liposome mediated transfection (Dzau et al., Trends in Biotechnology 11. 205-210 [1993]). In some situations it is desirable to provide the nucleic acid source with an agent that targets the target ceils, such as an antibody specific for a cell surface membrane protein or the target cell, a ligand for a receptor on the target cell, etc. Where liposomes are employed, proteins which bind to a cell surface membrane protein associated with endocytosis may be used for targeting and/or to facilitate IS uptake, e.g. capsid proteins or fragments thereof tropic for a particular cell type, antibodies for proteins which undergo internalization in cycling, proteins that target intracellular localization and enhance intracellular half-life. The technique of receptor-mediated endocytosis is described, for example, by Wu et al., J. Biol. Chem. 262, 4429-4432 (1987); and Wagner et al., Proc. Natl. Acad. Sci. USA 87, 3410-3414 (1990). For review of gene marking and gene therapy protocols see Anderson et al., Science 256. 808-813 (1992). 20 The PRO polypeptides described herein may also be employed as molecular weight markers for protein electrophoresis purposes and the isolated nucleic acid sequences may be used for recombinantly expressing those markers.
The nucleic acid molecules encoding the PRO polypeptides or fragments thereof described herein are useful for chromosome identification. In this regard, there exists an ongoing need to identify new chromosome 25 markers, since relatively few chromosome marking reagents, based upon actual sequence data are presently available. Each PRO nucleic acid molecule of the present invention can be used as a chromosome marker.
The PRO polypeptides and nucleic acid molecules of the present invention may also be used for tissue typing, wherein the PRO polypeptides of the present invention may be differentially expressed in one tissue as compared to another. PRO nucleic acid molecules will find use for generating probes for PCR, Northern 30 analysis, Southern analysis and Western analysis.
The PRO polypeptides described herein may also be employed as therapeutic agents. The PRO polypeptides of the present invention can be formulated according to known methods to prepare pharmaceuticaliy useful compositions, whereby the PRO product hereof is combined in admixture with a pharmaceuticaliy acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing the active ingredient 35 having the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers rRmningtrin's Pharmaceutical Sciences 16th edition, Osol, A. Ed. (1980)), in the form of lyophilized formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the
361
dosages and concentrations employed, and include buffers such as phosphate, citrate and other organic acids; antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino acids such as glycine, glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides and other carbohydrates including glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as 5 mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, PLURONICS™ or PEG.
The formulations to be used for in vivo administration must be sterile. This is readily accomplished by filtration through sterile filtration membranes, prior to or following lyophilization and reconstitution.
Therapeutic compositions herein generally are placed into a container having a sterile access port, for 10 example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle.
The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical administration, or by sustained release systems.
Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may 15 vary depending on the particular use envisioned. The determination of the appropriate dosage or route of administration is well within the skill of an ordinary physician. Animal experiments provide reliable guidance for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be performed following the principles laid down by Mordenti, J. and Chappell, W. "The use of interspecies scaling in toxicokinetics" In Toxicokinetics and New Drug Development, Yacobi et al., Eds., Pergamon Press, New 20 York 1989, pp. 42-96.
When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed, normal dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day, preferably about 1 pg/kg/day to 10 mg/kg/day, depending upon the route of administration. Guidance as to particular dosages and methods of delivery is provided in the literature; see, for example, U.S. Pat. Nos. 25 4,657,760; 5,206,344; or 5,225,212. It is anticipated that different formulations will be effective for different treatment compounds and different disorders, that administration targeting one organ or tissue, for example, may necessitate delivery in a manner different from that to another organ or tissue.
Where sustained-release administration of a PRO polypeptide is desired in a formulation with release characteristics suitable for the treatment of any disease or disorder requiring administration of the PRO 30 polypeptide, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant proteins for sustained release has been successfully performed with human growth hormone (rhGH), interferon-(rhlFN- ), interleukin-2, and MN rgpl20. Johnson et al., Nat. Med.. 2:795-799 (1996); Yasuda, Biomed. Ther.. 27:1221-1223 (1993); Hora et al., Bio/Technology. 8:755-758 (1990); Cleland, "Design and Production of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems," in Vaccine Design: 35 The Subunit and Adjuvant Approach. Powell and Newman, eds, (Plenum Press: New York, 1995), pp. 439-462; WO 97/03692, WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010.
The sustained-release formulations of these proteins were developed using poly-lactic-coglycolic acid
362
PCT/US99/20U1
(PLGA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation products of PLGA, lactic and glycolic acids, can be cleared quickly within the human body. Moreover, the degradability of this polymer can be adjusted from months to years depending on its molecular weight and composition. Lewis, "Controlled release of bioactive agents from lactide/glycolide polymer," in: M. Chasm and R. Langer (Eds.), Biodegradable Polymers as Drug Delivery Systems (Marcel Dekker: New York, 1990), pp. 5 1-41.
This invention encompasses methods of screening compounds to identify those that mimic the PRO polypeptide (agonists) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides encoded by the genes identified herein, or otherwise interfere with tbe interaction of the encoded polypeptides 10 with other cellular proteins. Such screening assays will include assays amenable to high-throughput screening of chemical libraries, making them particularly suitable for identifying small molecule drug candidates.
The assays can be performed in a variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art.
All assays for antagonists are common in that they call for contacting the drug candidate with a PRO IS polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these two components to interact.
In binding assays, the interaction is binding and the complex formed can be isolated or detected in the reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or the drug candidate is immobilized on a solid phase, e.g., on a microtiter plate, by covalent or non-covalent 20 attachments. Non-covalent attachment generally is accomplished by coating the solid surface with a solution of the PRO polypeptide and drying. Alternatively, an immobilized antibody, e.g., a monoclonal antibody, specific for the PRO polypeptide to be immobilized can be used to anchor it to a solid surface. The assay is performed by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized component, e.g., the coated surface containing the anchored component. When the reaction is complete, the 25 non-reacted components are removed, e.g., by washing, and complexes anchored on the solid surface are detected. When the originally non-immobilized component carries a detectable label, the detection of label immobilized on the surface indicates that completing occurred. Where the originally non-immobilized component does not carry a label, complexing can be detected, for example, by using a labeled antibody specifically binding the immobilized complex.
If the candidate compound interacts with but does not bind to a particular PRO polypeptide encoded by a gene identified herein, its interaction with that polypeptide can be assayed by methods well known for detecting protein-protein interactions. Such assays include traditional approaches, such as, e.g., cross-linking, co-immunoprecipitation, and co-purification through gradients or chromatographic columns. In addition, protein-protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 35 (Fields and Song, Nature (London). 340:245-246 (1989); Cfaien et al., Proc. Natl. Acad. Sci. USA. 88:9578-9582 (1991)) as disclosed by Cbevray and Nathans, Proc. Natl. Acad. Sci. USA. 89:5789-5793 (19911. Many transcriptional activators, such as yeast GAL4, consist of two physically discrete modular domains, one acting
363
as the DNA-binding domain, the other one functioning as the transcription-activation domain. The yeast expression system described in the foregoing publications (generally referred to as the "two-hybrid system") takes advantage of this property, and employs two hybrid proteins, one in which the target protein is fused to the DNA-binding domain of GAL4, and another, in which candidate activating proteins are fused to the activation domain. The expression of a GALl-tocZ reporter gene under control of a GAL4-activated promoter 5 depends on reconstitution of GAL4 activity via protein-protein interaction. Colonies containing interacting polypeptides are detected with a chromogenic substrate for p-galactosidase. A complete kit (MATCHMAKER™) for identifying protein-protein interactions between two specific proteins using the two-hybrid technique is commercially available from Clontech. This system can also be extended to map protein domains involved in specific protein interactions as well as to pinpoint amino acid residues that are crucial for 10 these interactions.
Compounds that interfere with die interaction of a gene encoding a PRO polypeptide identified herein and other intra- or extracellular components can be tested as follows: usually a reaction mixture is prepared containing the product of the gene and the intra- or extracellular component under conditions and for a time allowing for the interaction and binding of the two products. To test the ability of a candidate compound to 15 inhibit binding, the reaction is run in the absence and in the presence of the test compound. In addition, a placebo may be added to a third reaction mixture, to serve as positive control. The binding (complex formation) between the test compound and the intra- or extracellular component present in the mixture is monitored as described hereinabove. The formation of a complex in the control reactions) but not in the reaction mixture containing the test compound indicates that the test compound interferes with the interaction of the test compound 20 and its reaction partner.
To assay for antagonists, the PRO polypeptide may be added to a cell along with the compound to be screened for a particular activity and the ability of the compound to inhibit the activity of interest in the presence of the PRO polypeptide indicates that the compound is an antagonist to the PRO polypeptide. Alternatively, antagonists may be detected by combining the PRO polypeptide and a potential antagonist with membrane-bound 25 PRO polypeptide receptors or recombinant receptors under appropriate conditions for a competitive inhibition assay. The PRO polypeptide can be labeled, such as by radioactivity, such that the number of PRO polypeptide molecules bound to the receptor can be used to determine the effectiveness of the potential antagonist. The gene encoding the receptor can be identified by numerous methods known to those of skill in the art, for example, ligand panning and FACS sorting. Coligan et al., Current Protocols in Immnn 1(2): Chapter 5 (1991). 30 Preferably, expression cloning is employed wherein polyadenylated RNA is prepared from a cell responsive to the PRO polypeptide and a cDNA library created from this RNA is divided into pools and used to transfect COS cells or other cells that are not responsive to the PRO polypeptide. Transfected cells that are grown on glass slides are exposed to labeled PRO polypeptide. The PRO polypeptide can be labeled by a variety of means including iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 35 incubation, the slides are subjected to autoradiographic analysis. Positive pools are identified and sub-pools are prepared and re-transfected using an interactive sub-pooling and re-screening process, eventually yielding a single clone that encodes the putative receptor.
364
- As an alternative approach for receptor identification, labeled PRO polypeptide can be photoaffinity-linked with cell membrane or extract preparations that express the receptor molecule. Cross-linked material is resolved by PAGE and exposed to X-ray film. The labeled complex containing the receptor can be excised, resolved into peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained from micro- sequencing would be used to design a set of degenerate oligonucleotide probes to screen a cDNA 5 library to identify the gene encoding the putative receptor.
In another assay for antagonists, mammalian cells or a membrane preparation expressing the receptor would be incubated with labeled PRO polypeptide in the presence of the candidate compound. The ability of the compound to enhance or block this interaction could then be measured.
More specific examples of potential antagonists include an oligonucleotide that binds to the fusions of 10 immunoglobulin with PRO polypeptide, and, in particular, antibodies including, without limitation, poly- and monoclonal antibodies and antibody fragments, single-chain antibodies, anti-idiotypic antibodies, and chimeric or humanized versions of such antibodies or fragments, as well as human antibodies and antibody fragments. Alternatively, a potential antagonist may be a closely related protein, for example, a mutated form of the PRO polypeptide that recognizes the receptor but imparts no effect, thereby competitively inhibiting the action of the 15 PRO polypeptide.
Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using antisense technology, where, e.g., an antisense RNA or DNA molecule acts to block directly the translation of mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be used to control gene expression through triple-helix formation or antisense DNA or RNA, both of which methods are 20 based on binding of a polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide is designed to be complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res.. 6:3073 (1979); Cooney et al., Science. 241: 456 (1988); Dervan et al., Science. 251:1360 (1991)), thereby 25 preventing transcription and the production of the PRO polypeptide. The antisense RNA oligonucleotide hybridizes to the mRNA in vivo and blocks translation of the mRNA molecule into the PRO polypeptide (antisense - Okano, Neurochem.. 56:560 (1991); Oligodeoxvnucleotides as Antisense Inhibitors of Gene Expression (CRC Press: Boca Raton, FL, 1988). The oligonucleotides described above can also be delivered to cells such that the antisense RNA or DNA may be expressed in vivo to inhibit production of the PRO 30 polypeptide. When antisense DNA is used, oligodeoxyribonucieotides derived from the translation-initiation site, e.g., between about -10 and +10 positions of the target gene nucleotide sequence, are preferred.
Potential antagonists include small molecules that bind to the active site, the receptor binding site, or growth factor or other relevant binding site of the PRO polypeptide, thereby blocking the normal biological activity of the PRO polypeptide. Examples of small molecules include, but are not limited to, small peptides 35 or peptide-like molecules, preferably soluble peptides, and synthetic non-peptidyl organic or inorganic compounds.
Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of RNA.
365
Ribozymes act by sequence-specific hybridization to the complementary target RNA, followed by endonucleolytic cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by known techniques. For further details see, e.g., Rossi, Current Biology. 4:469-471 (1994), and PCT publication No. WO 97/33551 (published September 18, 1997).
Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 5 and composed of deoxynucleotides. The base composition of these oligonucleotides is designed such that it promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of purines or pyrimidines on one strand of a duplex. For further details see, e.g., PCT publication No. WO 97/33551, supra.
These small molecules can be identified by any one or more of the screening assays discussed 10 hereinabove and/or by any other screening techniques well known for those skilled in the art.
F. Anti-PRO Antibodies The present invention further provides anti-PRO antibodies. Exemplary antibodies include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies.
1. Polyclonal Antibodies The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal antibodies are known to die skilled artisan. Polyclonal antibodies can be raised in a mammal, for example, by one or more injections of an immunizing agent and, if desired, an adjuvant. Typically, the immunizing agent 20 and/or adjuvant will be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The immunizing agent may include the PRO polypeptide or a fusion protein thereof. It may be useful to conjugate the immunizing agent to a protein known to be immunogenic in the mammal being immunized. Examples of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be employed include Freund's 25 complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). The immunization protocol may be selected by one skilled in the art without undue experimentation.
2. Monoclonal Antibodies The anti-PRO antibodies may, alternatively, be monoclonal antibodies. Monoclonal antibodies may be 30 prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro.
The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 35 Generally, either peripheral blood lymphocytes ("PBLs") are used if cells of human origin are desired, orspleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a
366
hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103]. Immortalized cell lines are usually transformed mammalian ceils, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine S guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells.
Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 10 preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production ofhuman monoclonal antibodies [Kozbor, J. Immunol.. 133:3001 (1984): Brodeur etal.. Monoclonal Antibody Production Techniques and Applications. Marcel Dekker, Inc., New York, (1987) pp. 51-63]. 15 Hie culture medium in which the hybridoma cells are cultured can then be assayed for the presence of monoclonal antibodies directed against PRO. Preferably, the binding specificity of monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (EL1SA). Such techniques and assays are known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the 20 Scatchard analysis of Munson and Pollard, Anal Rinchnm 107:220 (1980).
After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution procedures and grown by standard methods [Goding, supra!. Suitable culture media for this purpose include, for example, Dulbecco'sModifiedEaglelsMediumandRPMI-1640 medium. Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal.
The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.
The monoclonal antibodies may also be made by recombinant DNA methods, such as those described in U.S.Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated 30 and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred source of such DNA. Once isolated, tbe DNA may be placed into expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal 35 antibodies in the recombinant host cells. The DNA also may be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences [U.S. Patent No. 4,816,567; Morrison et al., suoral or by covalently joining to the immunoglobulin coding
367
sequence all or part of tbe coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody.
The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies are well 5 known in the art. For example, one method involves recombinant expression of immunoglobulin light chain and modified heavy chain. The heavy chain is truncated generally at any point in the Fc region so as to prevent heavy chain crosslinking. Alternatively, the relevant cysteine residues are substituted with another amino acid residue or are deleted so as to prevent crosslinking.
In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies to 10 produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known in the art.
3. Human and Humanized Antibodies
The anti-PRO antibodies of the invention may further comprise humanized antibodies or human 15 antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a complementary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 20 species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies may also comprise residues which are found neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR 25 regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin [Jones etal.. Nature. 321:522-525 (1986): Riechmannetal.. Nature. 332:323-329(1988): andPresta. Curr. Op. Struct. Biol- 2:593-596 (1992)].
Methods for humanizing non-human antibodies are well known in the art. Generally, a humanized antibody has one or more amino acid residues introduced into it from a source which is non-human. These non-human amino acid residues are often referred to as "import" residues, which are typically taken from an "import" variable domain. Humanization can be essentially performed following the method of Winter and co-workers [Jones et al., Nature. 321:522-525 (1986); Riechmann et al., Nature. 332:323-327 (1988); Verhoeyen et al., 35 Science. 23g: 1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact human variable domain has been substituted by the
368
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human antibodies in which some CDR residues and possibly some FR residues are substituted by residues from analogous sites in rodent antibodies.
Human antibodies can also be produced using various techniques known in the art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol.. 227:381 (1991); Marks etal., J. Mol. Biol.. 222:581 5 (1991)]. The techniques of Cole et al. and Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy. Alan R. Liss, p. 77 (1985) and Boerner et al., J. Immunol.. 147(11:86-95 (1991)]. Similarly, human antibodies can be made by introducing of human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody production is observed, 10 which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: Marks et al., Bio/Technology 10.779-783 (19921: Lonberg etal.. Nature 368 856-859(1994): Morrison, Nature 368.812-13 (1994); Fishwild et al.. Nature Biotechnology 14, 845-51 (1996); Neuberger, Nature Biotechnology 14. 826 15 (1996); Lonberg and Huszar, Intern. Rev. Immunol. 12 65-93 (1995).
4. Bispecific Antibodies
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding specificities for at least two different antigens. In the present case, one of the binding specificities is for the 20 PRO, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor subunit.
Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different specificities [Milstein and Cuello, Nature. 305:537-539 (1983)]. 25 Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential mixture of ten different antibody molecules, of which only one has the correct bispecific structure. The purification of die correct molecule is usually accomplished by affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker et al., EMBO L., 10:3655-3659 (1991).
Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin heavy-chain constant domain, comprising at least pan of the lunge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region (CHI) containing the site necessary for light-chain binding present in at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 35 immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable host organism. For further details of generating bispecific antibodies see, for example, Suresh etal., Methods in Enzvmotogv. 121:210 (1986).
369
PCTAJS99/20111
According to another approach described in WO 96/27011, the interface between a pair of antibody molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain. In this method, one or more small amino acid side chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar 5 size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing the yield of the heterodimer over other unwanted end-products such as homodimers.
Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 10 described in the literature. For example, bispecific antibodies can be prepared can be prepared using chemical linkage. Braman etal.. Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')j fragments. These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments generated are then convened to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 15 derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies produced can be used as agents for the selective immobilization of enzymes.
Fab' fragments may be directly recovered from E. coli and chemically coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized 20 bispecific antibody F(ab')2 molecule. Each Fab' fragment was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets.
Various technique for making and isolating bispecific antibody fragments directly from recombinant cell 25 culture have also been described. For example, bispecific antibodies have been produced using leucine zippers. Kostelny et al., J. Tmwiimnl. 148(5): 1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. This method can also be utilized for the production of antibody homodimers. The "diabody" technology 30 described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VJ by a linker which is too shon to allow pairing between the two domains on the same chain. Accordingly, the VH and VL domains of one fragment are forced to pair with the complementary VL and VH domains of another fragment, thereby forming two antigen-binding sites. Another 35 strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reponed. See, Gruber et al., J. Immunol. 152:5368 (1994).
Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared.
370
Ttitt et al., J. Immunol. 147:60 (1991).
Exemplary bispecific antibodies may bind to two different epitopes on a given PRO polypeptide herein. Alternatively, an anti-PRO polypeptide arm may be combined with an arm which binds to a triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for IgG (FcyR), such as FcyRI (CD64), FcyRII (CD32) and FcyRIII (CD16) so as to focus cellular defense mechanisms S to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO-binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, DOT A, or TETA. Another bispecific antibody of interest binds the PRO polypeptide and further binds tissue factor (TF).
. Heteroconiueate Antibodies
Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed to target immune system cells to unwanted cells [U.S. Patent No. 4,676,980], and for treatment of HIV infection 15 [WO 91/00360; WO 92/200373; EP 03089]. It is contemplated that the antibodies may be prepared in vitro using known methods in synthetic protein chemistry, including those involving crosslinking agents. For example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980.
6. Effector Function Engineering
It may be desirable to modify the antibody of the invention with respect to effector function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) may be introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The 25 homodimeric antibody thus generated may have improved internalization capability and/or increased complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., L Exp Med.. 176:1191-1195 (1992) and Shopes. I. Immunol- 148:2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity may also be prepared using heterobifunctional cross-linkers as described in Wolff etal. Cancer Research. 53:2560-2565(1993). Alternatively, an antibody can be engineered that has dual 30 Fc regions and may thereby have enhanced complement lysis and ADCC capabilities. See Stevenson etal.. Anti-Cancer Drue Design. 3: 219-230 (1989).
7. Immunoconiugates
The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent 35 such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, iungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate).
Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above.
371
Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are 5 available for the production of radioconjugated antibodies. Examples include 2l2Bi, U1I, 13lIn, S0Y, and IMRe.
Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-10 diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate),andbis-activefiuorinecoinpounds(suchas l,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in Vitetta et al., Science. 238: 1098 (1987). Carbon-14-labeled 1-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See W094/11026.
IS In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed by removal of unbound conjugate from the circulation using a clearing agent and then administration of a "ligand" (e.g., avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleotide).
8. ItrnnnnnlfTxrawnec
The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing the antibody are prepared by methods known in the art, such as described in Epstein et al., Proc. Natl. Acad. Sci. USA. §2: 3688 (1985); Hwang et al., Proc. Nad Acad. Sci. USA. 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 25 5,013,556.
Particularly useful liposomes can be generated by tbe reverse-phase evaporation method with a lipid composition comprising phosphatidylcholine,cholesterol, and PEG-derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. Fab' fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin 30 et al.. J. Biol. Chem.. 257: 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic agent (such as Doxorubicin) is optionally contained within the liposome. SeeGabizon etal.. J. National Cancer Inst.. §1(19): 1484 (1989).
9. Pharmaceutical Compositions of Antibodies 35 Antibodies specifically binding a PRO polypeptide identified herein, as well as other molecules identified by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in the form of pharmaceutical compositions.
372
If tbe PRO polypeptide is intracellular and whole antibodies are used as inhibitors, internalizing antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an antibody fragment, into cells. Where antibody fragments are used, the smallest inhibitory fragment that specifically binds to the binding domain of the target protein is preferred. For example, based upon the variable-region sequences of an antibody, peptide molecules can be designed that retain the ability to bind the target 5 protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA technology. See, e.g., Marasco etal., Proc. Natl. Acad. Sci. USA. §Q; 7889-7893 (1993). Thefornniabon herein may also contain more than one active compound as necessary for the particular indication being treated, preferably those with complementary activities that do not adversely affect each other. Alternatively, or in addition, the composition may comprise an agent that enhances its function, such as, for example, a cytotoxic 10 agent, cytokine, chemotherapeutic agent, or growth-inhibitory agent. Such molecules are suitably present in combination in amounts that are effective for the purpose intended.
The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and poly-(methylraethacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, IS liposomes, albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in macroemulsians. Such techniques are disclosed in Remington's Pharmaceutical Sciences, supra.
The formulations to be used for in vivo administration must be sterile. This is readily accomplished by filtration through sterile filtration membranes.
Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 20 include semipermeable matrices of solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped articles, e.g., films, or microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for example, poly(2-hydroxyethyl-methacrylate), or poly(vinyialcohol)), polylactides (U.S. Pat. No. 3,773,919), copolymers of L-glutamic acid and y ethyl-L-glutamate, non-degradable ethylene-vinyl acetate, degradable lactic acid-glycoiic acid copolymers such as the LUPRON DEPOT ™ (injectable 25 microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated antibodies remain in the body for a long time, they may denature or aggregate as a result of exposure to moisture at 37°C, resulting in a loss of biological activity and possible changes in immunogenicity. Rational strategies 30 can be devised for stabilization depending on the mechanism involved. For example, if the aggregation mechanism is discovered to be intermolecuiar S-S bond formation through thio-disulflde interchange, stabilization may be achieved by modifying sulfhydiyl residues, lyophilizing from acidic solutions, controlling moisture content, using appropriate additives, and developing specific polymer matrix compositions.
G. Uses for anti-PRO Antibodies
The anti-PRO antibodies of the invention have various utilities. For example, anti-PRO antibodies may be used in diagnostic assays for PRO, e.g., detecting its expression in specific cells, tissues, or serum. Various
373
diagnostic assay techniques known in the an may be used, such as competitive binding assays, direct or indirect sandwich assays and immunoprecipitation assays conducted in either heterogeneous or homogeneous phases [Zola, Monoclonal Antibodies: A Manual of Techniques CRC Press, Inc. (1987) pp. 147-158]. The antibodies used in the diagnostic assays can be labeled with a detectable moiety. The detectable moiety should be capable of producing, either directly or indirectly, a detectable signal. For example, the detectable moiety may be a 5 radioisotope, such as 3H, UC, KP, ^S, or mI, a fluorescent or chemiluminescent compound, such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the an for conjugating the antibody to the detectable moiety may be employed, including those methods described by Hunter et al., Nature. 144:945 (1962); David et al., Biochemistry. 13:1014(1974): Pain etal.. J. Immunol. Meth.. 40:219 (1981): and Nveren. J. Histochem. and 10 Cvtochem.. 30:407 (1982).
Anti-PRO antibodies also are useful for the affinity purification of PRO from recombinant cell culture or natural sources. In this process, the antibodies against PRO are immobilized on a suitable support, such a Sephadex resin or filter paper, using methods well known in the an. The immobilized antibody then is contacted with a sample containing the PRO to be purified, and thereafter the suppon is washed with a suitable solvent that IS will remove substantially all the material in the sample except the PRO, which is bound to the immobilized antibody. Finally, the suppon is washed with another suitable solvent that will release the PRO from the antibody.
The following examples are offered for illustrative purposes only, and are not intended to limit the scope of the present invention in any way.
All patent and literature references cited in the present specification are hereby incorporated by reference in their entirety.
EXAMPLES
Commercially available reagents referred to in the examples were used according to manufacturer's 25 instructions unless otherwise indicated. The source of those cells identified in the following examples, and throughout the specification, by ATCC accession numbers is the American Type Culture Collection, Manassas, VA.
EXAMPLE 1: Extracellular Domain Homology Screening to Identify Novel Polypeptides and cDNA F.nrnriinp 30 Therefor
The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about 950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The EST databases included public databases (e.g., Dayhoff, GenBank), and proprietary databases (e.g. LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or 35 BLAST-2 (Altschul et al., Methods in Enzvmologv 266:460-480 (1996)) as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons with a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA
374
sequences with tbe program "phrap" (Phil Green, University of Washington, Seattle, WA).
Using this extracellular domain homology screen, consensus DNA sequences were assembled relative to the other identified EST sequences using phrap. In addition, the consensus DNA sequences obtained were often (but not always) extended using repeated cycles of BLAST or BLAST-2 and phrap to extend the consensus sequence as far as possible using the sources of EST sequences discussed above.
Based upon the consensus sequences obtained as described above, oligonucleotides were then synthesized and used to identify by PCR a cDNA library that contained the sequence of interest and for use as probes to isolate a clone of the full-length coding sequence for a PRO polypeptide. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional 10 oligonucleotides are synthesized when the consensus sequence is greater than about 1-1.5kbp. In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR amplification, as per Ausubel et al., Current Protocols in Molecular Biology, with the PCR primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and one of the primer pairs. The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using 15 commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes etal., Science. 253:1278-1280 (1991)) in die unique Xhol and NotI sites.
EXAMPLE 2: Isolation of cDNA clones bv Amylase Screening 1. Preparation of oligo dT primed cDNA library mRNA was isolated from a human tissue of interest using reagents and protocols from Invitrogen, San Diego, CA (Fast Track 2). This RNA was used to generate an oligo dT primed cDNA library in the vector 25 pRK5D using reagents and protocols from Life Technologies, Gaithersburg, MD (Super Script Plasmid System). In this procedure, the double stranded cDNA was sized to greater than 1000 bp and the Sall/NotI linkered cDNA was cloned into XhoI/NotI cleaved vector. pRKSD is a cloning vector that has an sp6 transcription initiation site followed by an Sfil restriction enzyme site preceding the XhoI/NotI cDNA cloning sites.
2. Preparation of random primed cDNA library
A secondary cDNA library was generated in order to preferentially represent the 5' ends of the primary cDNA clones. Sp6 RNA was generated from the primary library (described above), and this RNA was used to generate a random primed cDNA library in the vector pSST-AMY.O using reagents and protocols from Life Technologies (Super Script Plasmid System, referenced above). In this procedure the double stranded cDNA 35 was sized to 500-1000 bp, linkered with blunt to NotI adaptors, cleaved with Sfil, and cloned into Sfil/NotI cleaved vector. pSST-AMY.O is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding the cDNA cloning sites and the mouse amylase sequence (the mature sequence without the secretion signal)
375
followed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus, cDNAs cloned into this vector that are fused in frame with amylase sequence will lead to the secretion of amylase from appropriately transfected yeast colonies.
3. Transformation and Detection 5 DNA from the library described in paragraph 2 above was chilled on ice to which was added electrocompetent DH10B bacteria (Life Technologies, 20 ml). The bacteria and vector mixture was then electroporated as recommended by the manufacturer. Subsequently, SOC media (Life Technologies, 1 ml) was added and the mixture was incubated at 37DC for 30 minutes. The transformants were then plated onto 20 standard ISO mm LB plates containing ampicillin and incubated for 16 hours (37°C). Positive colonies were 10 scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCl-gradient. The purified DNA was then carried on to the yeast protocols below.
The yeast methods were divided into three categories: (1) Transformation of yeast with the plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR amplification of the insert directly from the yeast colony and purification of the DNA for sequencing and further IS analysis.
The yeast strain used was HD56-5A (ATCC-90785). This strain has the following genotype: MAT alpha, ura3-52, leu2-3, leu2-112, his3-ll, his3-15, MAL+, SUC+, GAL+. Preferably, yeast mutants can be employed that have deficient post-translatkmal pathways. Such mutants may have translocation deficient alleles in sec71, sec72, sec62, with truncated sec7l being most preferred. Alternatively, antagonists (including 20 antisense nucleotides and/or ligands) which interfere with the normal operation of these genes, other proteins implicated in this post translation pathway (e.g., SEC61p, SEC72p, SEC62p, SEC63p, TDJlp or SSAlp-4p) or the complex formation of these proteins may also be preferably employed in combination with the amylase-expressing yeast.
Transformation was performed based on the protocol outlined by Gietz et al., Nucl. Acid. Res.. 2Q: 1425 25 (1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml) and grown overnight at 30°C. The YEPD broth was prepared as described in Kaiser et al., Methods in Yeast Genetics. Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 207 (1994). The overnight culture was then diluted to about 2 x 10s cells/ml (approx. 00600=0.1) into fresh YEPD broth (500 ml) and regrown to 1 x 107 cells/ml (approx. 00400=0.4-0.5).
The cells were then harvested and prepared for transformation by transfer into GS3 rotor bottles in a
Sorval GS3 rotor at 5,000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water, and centrifuged again in 50 ml falcon tubes at 3,500 rpm in a Beckman GS-6KR centrifuge. The supernatant was discarded and the cells were subsequently washed with LiAc/TE (10 ml, 10 mM Tris-HCl, 1 mM EDTA pH 7.5, 100 mM Li2OOCCH3), and resuspended into LiAc/TE (2.5 ml).
Transformation took place by mixing the prepared cells (100 pi) with freshly denatured single stranded salmon testes DNA (Lofstrand Labs, Gaithersburg, MD) and transfonning DNA (1 jug, vol. < 10 pi) in microfuge tubes. The mixture was mixed briefly by vortexing, then 40% PEG/TE (600 pi, 40% polyethylene
376
glycol-4000, 10 mM Tris-HCl, 1 mM EDTA, 100 mM Li2OOCCH3, pH 7.5) was added. This mixture was gently mixed and incubated at 30°C while agitating for 30 minutes. The cells were then heat shocked at 42°C for 15 minutes, and the reaction vessel centrifuged in a microfuge at 12,000 rpm for 5-10 seconds, decanted and resuspended into TE (500 pi, 10 mM Tris-HCl, 1 mM EDTA pH 7.5) followed by recentrifugation. The cells were then diluted into TE (1 ml) and aliquots (200 pi) were spread onto the selective media previously prepared 5 in 150 mm growth plates (VWR).
Alternatively, instead of multiple small reactions, the transformation was performed using a single, large scale reaction, wherein reagent amounts were scaled up accordingly.
The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as described in Kaiser et al., Methods in Yeast Genetics. Cold Spring Harbor Press, Cold Spring Harbor, NY, p. 10 208-210 (1994). Transformants were grown at 30°C for 2-3 days.
The detection of colonies secreting amylase was performed by including red starch in the selective growth media. Starch was coupled to the red dye (Reactive Red-120, Sigma) as per the procedure described by Biely etal., Anal. Biochem.. 172:176-179 (19881. The coupled starch was incorporated into the SCD-Ura agar plates at a final concentration of 0.15 % (w/v), and was buffered with potassium phosphate to a pH of 7.0 (50-15 100 mM final concentration).
The positive colonies were picked and streaked across fresh selective media (onto 150 mm plates) in order to obtain well isolated and identifiable single colonies. Well isolated single colonies positive for amylase secretion were detected by direct incorporation of red starch into buffered SCD-Ura agar. Positive colonies were determined by iheir ability to break down starch resulting in a clear halo around the positive colony visualized 20 directly.
4. Isolation of DNA bv PCR Amplification
When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile water (30 pi) in a 96 well plate. At this time, the positive colonies were either frozen and stored for subsequent 25 analysis or immediately amplified. An aliquot of cells (5 pi) was used as a template for tbe PCR reaction in a 25 pi volume containing: 0.5 pi Klentaq (Clontech, Palo Alto, CA); 4.0 pi 10 mM dNTP's (Perkin Elmer-Cetus); 2.5 pi Kentaq buffer (Clontech); 0.25 pi forward oligo 1; 0.25 pi reverse oligo 2; 12.5 pi distilled water. The sequence of the forward oligonucleotide 1 was:
' -TGTAAAACGACGGCC AGTT AAAT AG ACCTGC A ATT ATT A ATCT-3' (SEQ ID NO: 1) 30 The sequence of reverse oligonucleotide 2 was:
' -CAGGAAAC AGCTATGACCACCTGC AC ACCTGC A A ATCC ATT -3' (SEQ ID NO:2)
PCR was then performed as follows:
a.
Denature
92°C,
minutes
b.
3 cycles of:
Denature
Anneal
Extend
92 °C, 59"C, 72 °C,
seconds 30 seconds 60 seconds
40
c.
3 cycles of:
Denature Anneal
92°C, 57°C,
seconds 30 seconds
377
Extend 72°C, 60 seconds d. 25 cycles of: Denature 92°C, 30 seconds
Anneal 55 °C, 30 seconds
Extend 72 °C, 60 seconds e. Hold 4°C
The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase region, respectively, and amplified a 307 bp region from vector pSST-AMY.O when no insert was present.
Typically, the first 18 nucleotides of the 5' end of these oligonucleotides contained annealing sites for the sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However, signal sequence-fused cDNA resulted in considerably longer nucleotide sequences.
Following the PCR, an aliquot of the reaction (5 pi) was examined by agarose gel electrophoresis in a 1 % agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al., supra.
Clones resulting in a single strong PCR product larger than 400 bp were further analyzed by DNA sequencing after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc., Chalsworth, CA).
EXAMPLE 3: Isolation of cDNA Clones UsinP Siynal Algorithm Analysis
Various polypeptide-encoding nucleic acid sequences were identified by applying a proprietary signal 20 sequence finding algorithm developed by Genentech, Inc. (South San Francisco, CA) upon ESTs as well as clustered and assembled EST fragments from public (e.g., GenBank) and/or private (LIFESEQ®, Incyte Pharmaceuticals, Inc., Palo Alto, CA) databases. The signal sequence algorithm computes a secretion signal score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine codon(s) (ATG) at the 5' -end of the sequence or sequence fragment under consideration. The nucleotides 25 following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first ATG has the required amino acids, the second is not examined. If neither meets the requirement, the candidate sequence is not scored. In order to determine whether the EST sequence contains an authentic signal sequence, the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven sensors (evaluation parameters) known to be associated with secretion signals. Use of this algorithm resulted 30 in the identification of numerous polypeptide-encoding nucleic acid sequences.
EXAMPLE 4: Isolation of cDNA clones Encoding Human PROI56Q
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein as DNA17409. Based on the DNA17409 35 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01560.
DNA sequencing of the isolated clones isolated as described above gave the full-length DNA sequence for DNA19902-1669 [Figure 1, SEQ ID NO:3]; and the derived protein sequence for PR01560. 40 The entire coding sequence of DNA19902-1669 is included in Figure 1 (SEQ ID NO:3). Clone
378
DNA19902-1669 contains a single open reading frame with an apparent translations! initiation site at nucleotide positions 41-43, and an apparent stop codon at nucleotide positions 776-778. The predicted polypeptide precursor is 245 amino acids long. The approximate locations of the signal peptide, transmembrane domains, N-glycosylation sites, N-myristoylation sites, tyrosine kinase phosphorylation sites, and membrane lipoprotein lipid attachment sites are also indicated in Figure 2. Clone DNA19902-1669 has been deposited with the ATCC 5 and is assigned ATCC deposit no. 203454. The full-length PR01560 protein shown in Figure 2 has an estimated molecular weight of about 27,563 daltons and a pi of about 8.36.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 2 (SEQ ID NO:4), revealed sequence identity between the PR01560 amino acid sequence and the following Dayhoff sequences: AF053453_1, AF053454_1, 10 A15 HUMAN, AF054840_1, CD63_HUMAN, AF065389_1, AFQ54838_1, AF089749J, P_R27525, and PR86834.
FXAMPI.F. 5: Isolation of cDNA clones Encoding Human PRQ444
A cDNA sequence isolated in the amylase screen described in Example 2 above was designated IS DNA13121. Based upon this sequence, probes were generated and used to screen a human fetal lung library (LIB25) prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is a precursor of pRKSD that does not contain the Sfil site; see, Holmes et al., Science. 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp.
A full length clone was identified that contained a single open reading frame with an apparent 20 translational initiation site at nucleotide positions 608-610 and ending at the stop codon found at nucleotide positions 959-961 (Figure 3, SEQ ID NO:5). The predicted polypeptide precursor is 117 amino acids long, has a calculated molecular weight of approximately 12,692 daltons and an estimated pi of approximately 7.50. Analysis of the full-length PR0444 sequence shown in Figure 4 (SEQ ID NO:6) evidences the presence of a signal peptide at amino acid 1 to about amino acid 16. An analysis of the Dayhoff database (version 35.45 25 SwissProt 35) evidenced homology between the PR0444 amino acid sequence and the following Dayhoff sequences: CEF44D12_8, P_R88452, YNE1_CAEEL, A47312, AF009957_1, and A06133_l. Clone DNA26846-1397 was deposited with the ATCC on October 27,1998 and is assigned ATCC deposit no. 203406.
EXAMPLE 6: Isolation of cDNA clones Encoding Human PRQ1018 30 A cDNA clone (DNA56107-1415) encoding a native human PR01018 polypeptide was identified by a yeast screen, in a human ovary tumor cDNA library that preferentially represents the 5' ends of the primary cDNA clones. The yeast screen employed identified a single EST clone designated herein as DNA41000. The DNA41000 sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 35 homologous EST sequences. The comparison was performed using the computer program BLAST or BLAST2 f Altschul et al.. Methods in Enzvmologv. 266:460-480(1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into
379
a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is herein designated DNA44449. Oligonucleotide primers based upon the DNA44449 sequence were then synthesized and employed to screen a human ovary tumor cDNA library which resulted in the identification of the DNA56107-1415 clone shown in Figure S.
The full-length DNA56107-1415 clone shown in Figure 5 contains a single open reading frame with 5 an apparent translational initiation site at nucleotide positions 129-131 and ending at the stop codon at nucleotide positions 696-698 (Figure 5). The predicted polypeptide precursor is 189 amino acids long (Figure 6). Analysis of the full-length PR01018 sequence shown in Figure 6 (SEQ ID NO:8) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 24, transmembrane domains from about amino acid 86 to about amino acid 103 and from about amino acid 60 to about amino acid 75 and an amino acid sequence 10 block having homology to G-protein coupled receptor proteins from about amino acid 44 to about amino acid 84. Clone DNA56107-1415 has been deposited with ATCC on October 27,1998 and is assigned ATCC deposit no. 203405.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 6 (SEQ ID NO:8), evidenced significant 15 homology between the PRO 1018 amino acid sequence and the following Dayhoff sequences: CEB0399_4, S59764, YHDT HAE1N and AE000675_3.
EXAMPLE 7: Isolation of cDNA clnnm: F.nrnriinn Human PRQ1773
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 20 in Example 1 above. This consensus sequence is herein designated DNA49797. Based upon an observed homology between the DNA49797 consensus sequence and an EST sequence contained within Incyte EST clone no. 509434, Incyte EST clone no. 509434 was purchased and its insert obtained and sequenced. That sequence is herein shown in Figure 7 and is designated DNA56406-1704.
The entire nucleotide sequence of DNA56406-1704 is shown in Figure 7 (SEQ ID NO:9). Clone 25 DNA56406-1704 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 111-113 and ending at the stop codon at nucleotide positions 1068-1070 (Figure 7). The predicted polypeptide precursor is 319 amino acids long (Figure 8). The full-length PR01773 protein shown in Figure 8 has an estimated molecular weight of about 35,227 daltons and a pi of about 8.97. Analysis of the full-length PRO1773 sequence shown in Figure 8 (SEQ ID NO: 10) evidences the presence of the following: a signal peptide 30 from about amino acid 1 to about amino acid 17, a transmembrane domain from about amino acid 136 to about amino acid 152, potential N-glycosylation sites from about amino acid 161 to about amino acid 164, from about amino acid 187 to about amino acid 190 and from about amino acid 253 to about amino acid 256, a glycosaminoglycan attachment site from about amino acid 39 to about amino acid 42 and potential N-myristolation sites from about amino acid 36 to about amino acid 41, from about amino acid 42 to about amino 35 acid 47, from about amino acid 108 to about amino acid 113, from about amino acid 166 to about amino acid 171, from about amino acid 198 to about amino acid 203 and from about amino acid 207 to about amino acid 212. Clone DNA56406-1704 has been deposited with ATCC on November 17, 1998 and is assigned ATCC
380
deposit no. 203478.
An analysis of the Dayhoff database (version 3S.4S SwissProt 33), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 8 (SEQ ID NO: 10), evidenced significant homology between the PR01773 amino acid sequence and the following Dayhoff sequences: ROH2RAT, ROH3RAT, AF030513_1 ,ROHl_RAT, AF056194_1, AF057034_1, P_W18337, P_W18328, BDH_HUMAN 5 and BDH_RAT.
EXAMPJUF.g; Isolation of cDNA clones Encoding Human PRO 1477
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA52641. Based on the DNA52641 10 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0240.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 5 '-CGCCAGAAGGGCGTGATTGACGTC-3' (SEQ ID NO: 13)
reverse PCR primer 51 -CCATCCTTCTTCCC AG AC AGGCCG-3' (SEQIDNO:14)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA5264I sequence which had the following nucleotide sequence hybridization probe
'-GAAGCCTGTGTCCAGGTCCTTCAGTGAGTGGTTTGGCCTCGGTC-3' (SEQ ID NO:15) 20 In order to screen several libraries for a source of a full-length clone, DNA from die libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR0240 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal liver tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 25 PR01477 (designated herein as DNA56529-1647 [Figure 9, SEQ ID NO: 11]; and the derived protein sequence for PRO1477.
The entire nucleotide sequence of DNA56529-1647 is shown in Figure 9 (SEQ ID NO:ll). Clone DNA56529-1647 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 23-25 and ending at the stop codon at nucleotide positions 2120-2122 (Figure 9). The predicted 30 polypeptide precursor is 699 amino acids long (Figure 10). The full-length PR0240 protein shown in Figure 10 has an estimated molecular weight of about 79,553 daltons and a pi of about 7.83. Analysis of die full-length PR01477 sequence shown in Figure 10 (SEQ ID NO: 12) evidences the presence of the following: transmembrane domains from about amino acid 21 to about amino acid 40 and from about amino acid 84 to about amino acid 105. Clone DNA56529-1647 has been deposited with ATCC on September 29,1998 and is assigned 35 ATCC deposit no. 203293.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 10 (SEQ ID NO:12), evidenced significant
381
homology between the PR01477 amino acid sequence and the following Dayhoff sequences: CELT03G111, CEZC410_4, A54408, SSMAN9MAN_1, GEN12643, GEN12642, AF027156_1, P_W46900, SPAC23A1_4 and DMC86E4_5.
EXAMPLE 9: Isolation of cDNA clones F.ncoding Human PRQ1478 5 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA52719". Based on the DNA52719 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1478.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer 5'GCGAACGCTTCGAGGAGTCCTGG3' (SEQ ID NO: 18); and reverse PCR nrimer 5' GCAGTGCGGG AAGCC AC AT GGTAC3' (SEQ ID NO:19).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensusDNA52719 sequence which had the following nucleotide sequence:
hybridization probe 5'CTTCCTGAGCAGGAAGAAGATCCGGCACCACATCTACGTGCTCAAC3'(SEO ID N0:20).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01478 gene using the probe oligonucleotide and one of the PCR primers. RNA 20 for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01478 and the derived protein sequence for PR01478.
The entire coding sequence of PR01478 is included in Figure 11 (SEQ BDNO;16). Clone DNA56531-1648 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 25 77-79 and an apparent stop codon at nucleotide positions 1058-1060 of SEQ ID NO: 16. The predicted polypeptide precursor is 327 amino acids long. The type II transmembrane sequence is believed to be at about amino acids 29-49 of SEQ ID NO:17, and an N-glycosylation site is believed to be at about amino acids 154-157 of SEQ ID NO: 17. Clone DNA56531-1648 has been deposited with ATCC and is assigned ATCC deposit no. 203286. The full-length PR01478 protein shown in Figure 12 has an estimated molecular weight of about 30 37,406 daltons and a pi of about 9.3.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 12 (SEQ ID NO:17), revealed sequence identity between the PR01478 amino acid sequence and the following Dayhoff sequences: YNJ4_CAEEL, P R55706, A38781_l, NALS MOUSE, HUMHGTJ, AF048687_1, CEW02B12_11, Y09F_MYCTU, FOJO_DROME, 35 and G01936.
382
EXAMPLE 10: Isolation of cDNA clones Encoding Human PRQ831
DNA56862-1343 was identified by applying the proprietary signal sequence finding algorithm described in Example 3 above. Use of the above described signal sequence algorithm allowed identification of an EST cluster sequence from the Incyte database, designated Incyte cluster sequence no. 2SS07. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 5 databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identiiy existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington). 10 The consensus sequence obtained therefrom is herein designated as DNA55714.
In light of the sequence homology between the DNA55714 sequence and an EST sequence contained within the Merck EST clone no. AA09944S, the Merck EST clone no. AA09944S was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 13 and is herein designated as DNA56862-1343.
IS Clone DNAS6862-1343 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 40-42 and ending at the stop codon at nucleotide positions 259-261 (Figure 13). The predicted polypeptide precursor is 73 amino acids long (Figure 14). The full-length PR0831 protein shown in Figure 14 has an estimated molecular weight of about 7,879 daltons and a pi of about 7.21. Analysis of the full-length PR0831 sequence shown in Figure 14 (SEQ ID NO:22) evidences the presence of the following: a signal 20 peptide from about amino acid 1 to about amino acid 15 and an amino acid sequence block having homology to growth factor and cytokine receptor family proteins from about amino acid 3 to about amino acid 18. Clone DNA56862-1343 has been deposited with ATCC on September 1, 1998 and is assigned ATCC deposit no. 203174.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 25 alignment analysis of the full-length sequence shown in Figure 14 (SEQ ID NO:22), evidenced significant homology between the PR0831 amino acid sequence and the following Dayhoff sequences: P_W30724, HUMPPA_1,AF022238_1,4HHB_C,P_R39727,P_R39728, TRYT_MERUN, GPR5HUMAN, AB010266_3 and HSBCL3S2_1.
EXAMPLE 11: Isolation of cDNA clones Encoding Human PROl 113
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA34025". Based on the DNA34025 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 35 PROl 113.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'GAGGACTCACCAATCTGGTTCGGC3' (SEQ ID NO:25); and
383
reverse PCR primer 5'AACTGGAAAGGAAGGCTGTCTCCC3' (SEQ ID NO:26).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from die consensus DNA34025 sequence which had the following nucleotide sequence:
hybridization probe 5 "GTAAAGGAGAAGAACATCACGGTACGGGATACCAGGTGTGTTTATCCTAA3' (SEQ ID NO:27).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PROl 113 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 10 PR01113 (designated herein as DNA57254-1477 [Figure 15, SEQIDNO:23]; and the derived protein sequence for PROl 113.
The entire coding sequence of PROl 113 is shown in Figure IS (SEQ ID NO:23). Clone DNA57254-1477 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 214-216, and an apparent stop codon at nucleotide positions 2062-2064 of SEQ ID N0.23. The predicted IS polypeptide precursor is 616 amino acids long. The transmembrane domain (type II) is believed to be at about amino acids 13-40 of SEQ ID NO:24. The N-giycosylation sites and N-myristoylation sites are indicated in Figure 16. Clone DNA57254-1477 has been deposited with the ATCC and is assigned ATCC deposit no. 203289. The full-length PROl 113 protein shown in Figure 16 has an estimated molecular weight of about 68,243 daltons and a pi of about 8.66.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 16 (SEQ ID NO:24), revealed sequence identity between the PROl 113 amino acid sequence and the following Dayhoff sequences (data incorporated herein): D86983 1, A58532, SLIT_DROME, AB007865_1, AC004142_1, CELT21D128, AB003184J, DMU42767_1, MUSLRRPJ and GPCR LYMST.
EXAMPLE 12: Isolation of cDNA clones Encoding Human PRO 1194
Use of tbe signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 30 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One or more of the ESTs was derived from a human pineal gland library. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 35 University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56S11.
In light of the sequence homology between the DNA56511 sequence and an EST contained within the
384
Merck EST AA069568, the clone 382736 which includes this EST was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 17 and is herein designated as DNA57841-1522.
The full length clone shown in Figure 17 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 9-11 and ending at the stop codon found at nucleotide positions 5 252-254 (Figure 17; SEQ ID NO:28). The predicted polypeptide precursor (Figure 18, SEQ ID NO:29) is 81 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID NO:29. PROl 194 has a calculated molecular weight of approximately 9,223 daltons and an estimated pi of approximately 10.47. Clone DNA57841-1522 was deposited with the ATCC on November 3, 1998 and is assigned ATCC deposit no. 203458.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 18 (SEQ ID NO:29), revealed sequence identity between the PROl 194 amino acid sequence and the following Dayhoff sequences: PT17_YEAST, RR2CHLVU, CEK12F21, S22452, S76705, AF031898_7, A4_DROME, AF038931_1, E49905, and GSPLAERHY.
EXAMPT.F. W Isolation of cDNA clones Encoding Human PROl 110
A cDNA clone (DNA58727-1474) encoding a native human PROlllO polypeptide was identified by a yeast screen, in a human fetal kidney cDNA library that preferentially represents the 5' ends of the primary cDNA clones. The yeast screen employed identified a single EST clone designated herein as DNA45566. The 20 DNA45566 sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The comparison was performed using the computer program BLAST or BLAST2 f Altschul et al.. Methods in Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into 25 a consensus DNA sequence with the program "phrap'' (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is herein designated DNA46965. Oligonucleotide primers based upcm the DNA46965 sequence were then synthesized and employed to screen a human SK-Lu-1 adenocarcinoma cDNA library (LIB247) which resulted in the identification of the DNA58727-1474 clone shown in Figure 19.
The full-length DNA58727-1474 clone shown in Figure 19 contains a single open reading frame with 30 an apparent translational initiation site at nucleotide positions 131-133 and ending at the stop codon at nucleotide positions 1097-1099 (Figure 19). The predicted polypeptide precursor is 322 amino acids long (Figure 20). The full-length PROl 110 protein shown in Figure 20 has an estimated molecular weight of about 35,274 daltons and a pi of about 8.57. Analysis of the full-length PROlllO sequence shown in Figure 20 (SEQ ID NO:31) evidences the presence of the following: transmembrane domains from about amino acid 41 to about amino acid 35 60, from about amino acid 66 to about amino acid 85, from about amino acid 101 to about amino acid 120, from about amino acid 137 to about amino acid 153, from about amino acid 171 to about amino acid 192, from about amino acid 205 to about amino acid 226, from about amino acid 235 to about amino acid 255 and from about
385
WO 00/12708 PCT/US99/20111
amino acid 294 to about amino acid 312, a potential N-glycosylation site from about amino acid 6 to about amino acid 69, and a glycosaminoglycan attachment site from about amino acid 18 to about amino acid 21. Clone DNA58727-1474 has been deposited with ATCC on September 1, 1998 and is assigned ATCC deposit no. 203171.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 5 alignment analysis of the full-length sequence shown in Figure 20 (SEQ ID NO:31), evidenced significant homology between the PROl 110 amino acid sequence and the following Dayhoff sequences: MMMYELUPR_1, PR99799, MAL_HUMAN, P_P80929, RNMALGENEJ, S68406, PLLP RAT, MMMALPROT 1,138891 and S55622.
EXAMPLE 14: Isolation of cDNA clones Encoding Human PRQ1378
An initial DNA sequence referred to herein as DNA51941 was identified using a yeast screen, in a human bone marrow cDNA library that preferentially represents the 5' ends of the primary cDNA clones. Based on the DNA51941 sequence, the following oligonucleotides were synthesized for use as probes to isolate a clone of the full-length coding sequence for PR01377 from a bone marrow cDNA library: 15 T G T C C T T T G T C C C A G A C T T C T G T C C (SEQ ID NO:34), CTGGATGCTAATGTGTCCAGTAAATGATCCCCTTATCCCGTCGCGATGCT (SEQ ID NO:35); TTCCACTCAATGAGGTGAGCCACTC ( SEQ ID NO:36); GGCGAGCCCTAACTATCCAGGAG (SEQ ID NO:37); GGAGATCGCTGCGCTGGCCAGGTCCTCCCTGCATGGTAT (SEQ ID NO:38); and CTGCTGCAAAGCGAGCCTCTTG (SEQ ID NO:39).
The full length DNA58730-1607 clone shown in Figure 21 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 1365 to 1367 and ending at the stop codon found at nucleotide positions 2370 to 2372 (Figure 21; SEQ ID NO:32). The predicted polypeptide precursor (Figure 22, SEQ ID NO:33) is 335 amino acids long, with a signal peptide sequence at about amino acids 1-15. PR01378 has a calculated molecular weight of approximately 36,108 daltons and an estimated pi of 25 approximately 4.51.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 22 (SEQ ID NO:33), revealed some homology between the PR01378 amino acid sequence and the following Dayhoff sequences: ICAL RABIT, SP2 HUMAN, SHPSPRBB_1, SP23_HUMAN, P_W08158, and P_W08150.
Clone DNA58730-1607 was deposited with the ATCC on September 15,1998, and is assigned ATCC
deposit no. 203221.
EXAMPLE 15: Isolation of cDNA clones Encoding Hnman PRQ1481
An initial DNA sequence, referred to herein as DNA53254, was identified using a yeast screen, in a 35 human fetal kidney cDNA library that preferentially represents the 5' ends of the primary cDNA clones. Based on the DNA53254 sequence, oligonucleotides were synthesized for use as probes (or primers) to isolate a clone of the full-length coding sequence for PR01481 from a human fetal kidney cDNA library.
386
The Ml length DNA58732-1650 done shown in Figure 23 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 320-322 and ending at die stop codon found at nucleotide positions 1322-1324 (Figure 23; SEQ ID N0:40). The predicted polypeptide precursor (Figure 24, SEQ ID NO:41) is 334 amino acids long. The signal peptide is at about amino adds 1-23, and a transmembrane domain is at about amino acids 235-262 of SEQ ID NO:41. The N-glycosylation sites are indicated in Figure 24. PR01481 has a calculated molecular weight of approximately 36,294 daltons and an estimated pi of approximately 4.98. Clone DNA58732-1650 has been deposited with the ATCC and is assigned ATCC deposit no. 203290.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the Ml-length sequence shown in Figure 24 (SEQ ID NO:41), revealed sequence identity between the PR01481 amino acid sequence and the following Dayhoff sequences (data incorporated herein): YN23YEAST, S67770, H36857, YLU2_PICAN, GEN12881, CVY15035_28, YM96_YEAST, ESC1SCHPO, CELZK783 1 and S59310.
EXAMPLE 16: Isolation of cDNA clones Encoding Human PRO 1189
A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated DNA41784. The DNA41784 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and proprietary EST DNA databases (LIFESEQTM, Incyte Pharmaceuticals, Palo Alto, CA; and Genentech, South San Francisco, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA45499.
Based on the DNA45499 sequence, oligonucleotide probes were generated and used to screen a human bone marrow library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science. 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (45499.fl) 5' -G A A AGAC ACG AC AC AGC AGCTTGC-3' (SEQ ID NO:44)
forward PCR primer (45499.f2) 5'-GGGAACTGCTATCTGATGCC-3' (SEQ ID NO:45)
forward PCR primer (45499.Q) 5'-CAGGATCTCCTCTTGCAGTCTGCAGC-3' (SEQ ID NO:46)
reverse PCR nrimer (45499.rl) 5' CTTCTCG A ACC AC AT A AGTTTG AGGC AG-3' (SEQ ID NO: 47) reverse PCR primer (45499. r2) 5' -CACGATTCCCTCCACAGCAACTGGG-3' (SEQ ID NO:48).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45499 sequence which had the following nucleotide sequence:
hybridization probe (45499.pl)
' -CGCCTT ACCGCGCAGCCCG AAG ATTC ACT ATGGTG A A AATCGCCTTC A AT-3' (SEQ ID N0:230).
387
INTELLECTUAL PROPERTY OFRCE OF i\|.Z
2 h OCT 2003
received
In order to screen several libraries for a source of a full-length clone, DNA from tbe libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PROl 189 gene using the probe oligonucleotide and one of the PCR primers.
A full length clone was identified that contained a single open reading frame with an apparent translational initiation site at nucleotide positions 79-81, and a stop signal at nucleotide positions 868-870 5 (Figure 25; SEQ ID NO:42). The predicted polypeptide precursor is 263 amino acids long has a calculated molecular weight of approximately 29,741 daltons and an estimated pi of approximately 5.74. Additional features include a type II transmembrane domain at about amino acids 53-75 and a potential N-glycosylation site at about amino acids 166-169.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 10 alignment analysis of the full-length sequence shown in Figure 26 (SEQ ID NO:43), evidenced significant homology between the PROl 189 amino acid sequence and Dayhoff sequences MUSE25A1 and HS696H22_1. Additionally, some homology was revealed between the PROl 189 amino acid sequence and the following Dayhoff sequences: AF017985J, CBRG01D9_2,179662, and CHPDRBAG_1.
Clone DNA58828-1519 has been deposited with ATCC and is assigned ATCC deposit no. 203172.
EXAMPLE 17: Isolation of cDNA clones Encoding Human PRQ141S
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 150918. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 20 databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 25 Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA55720.
In light of the sequence homology between the DNA55720 sequence and an EST sequence contained within the Incyte EST cloneno. 4081476, the Incyte EST clone no. 4081476 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 27 and is herein designated as DNA58852-1637.
Clone DNA58852-1637 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 148-150 and ending at the stop codon at nucleotide positions 997-999 (Figure 27). The predicted polypeptide precursor is 283 amino acids long (Figure 28). The full-length PR01415 protein shown in Figure 28 has an estimated molecular weight of about 29,191 daltons and a pi of about 4.52. Analysis of the full-length PRO 1415 sequence shown in Figure 28 (SEQ ID N0:50) evidences the presence of the 35 following: a signal peptide from about amino acid 1 to about amino acid 25, a transmembrane domain from about amino acid 94 to about amino acid 118 and potential N-myristolation sites from about amino acid 18 to about amino acid 23, from about amino acid 40 to about amino acid 45, from about amino acid 46 to about amino acid
388
SI, from about amino acid 145 to about amino acid 150, from about amino acid 192 to about amino acid 197, from about amino acid 193 to about amino acid 198, from about amino acid 211 to about amino acid 216, from about amino acid 238 to about amino acid 243 and from about amino acid 242 to about amino acid 247. Clone DNA58852-1637 has been deposited with ATCC on September 22, 1998 and is assigned ATCC deposit no. 203271.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 28 (SEQ ID N0:50), evidenced significant homology between the PR01415 amino acid sequence and the following Dayhoff sequences: HSU66616_1, P_W24017, A38219, CD30JHUMAN, HSU78971_1, P_W22214, NFM_HUMAN, ADH 1_ASPFL, PAU93274 5 and CENB MOUSE.
EXAMPLE 18: Isolation of cDNA clones Encoding Hnman PROI411
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from an Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 15 EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One or more of the ESTs were derived from a thryroid tissue library. The homology search was performed using the computer program BLAST or BLAST2 (Altsbul etal., Methods in Enzvmologv 266:460-480(19961). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 20 University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56013.
In light of the sequence homology between die DNA560I3 sequence and an EST sequence contained within the Incyte EST 1444225, the clone including this EST was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 29 and is herein designated as DNA59212-25 1627.
The full length clone shown in Figure 29 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 184-186 and ending at the stop codon found at nucleotide positions 1504-1506 (Figure 29; SEQ ID NO:51). The predicted polypeptide precursor (Figure 30, SEQ ID NO:52) is 440 amino acids long. The signal peptide is at about amino acids 1-21, and the cell attachment site 30 is at about amino acids 301-303 of SEQ ID NO:52. PRO 1411 has a calculated molecular weight of approximately 42,208 daltons and an estimated pi of approximately 6.36. Clone DNA59212-1627 was deposited with the ATCC on September 9, 1998 and is assigned ATCC deposit no. 203245.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 30 (SEQ ID NO:52), revealed sequence identity 35 between the PRO 1411 amino acid sequence and the following Dayhoff sequences (data from database incorporated herein): MTV023_19, P_R05307, P_W26348, P_P82962, AF000949_1, EBN1JEBV, P_R95107, GRP2_PHAVU, P_R81318, andS74439_l.
389
EXAMPLE 19: Isolation of cDNA clones Encoding Human PRQ1295
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One 5 or more of the ESTs was derived from a thymus tissue library. The homology search was performed using die computer program BLAST or BLAST2 (Altshul et al., Methods in Fn?vmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 10 designated DNAS6262.
In light of the sequence homology between the DNAS6262 sequence and an EST contained within the Incyte EST 3743334, the clone including this EST was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 31 and is herein designated as DNA59218-1559.
The full length clone shown in Figure 31 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 207-209 and ending at the stop codon found at nucleotide positions 1047-1049 (Figure 31; SEQ ID NO:53). The predicted polypeptide precursor (Figure 32, SEQ ID NO:54) is 280 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:54. A targeting signal and N-glycosylation site are also indicated in Figure 54. PR01295 has a calculated molecular 20 weight of approximately 30,163 daltons and an estimated pi of approximately 6.87. Clone DNA59218-1559 was deposited with the ATCC on September 29, 1998 and is assigned ATCC deposit no. 203287.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 32 (SEQ ID NO:54), revealed sequence identity between the PRO1295 amino acid sequence and the following Dayhoff sequences (data incorporated herein): 25 AB011099_l, ILVE_MYCTU, ATTECR_2, AF010496_27, P_R15346, S37191, PER_DROMS, L2MU_ADECC and P_W34238.
EXAMPLE 20: Isolation of cDNA clon^ Encoding Human PRQ1359
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 30 cluster sequence from an Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One or more of the ESTs was derived from a sigmoid colon tissue library. The homology search was performed using the computer program BLAST or BLAST2 (Altshul etal., Methods in Enzvmologv 266:460-480 (1996^. 35 Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein
390
designated DNAS6263.
In light of the sequence homology between the DNA56263 sequence and the Incyte EST 1931418, the clone including this EST was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 33 and is herein designated as DNA59219-16I3.
The full length clone shown in Figure 33 contained a single open reading frame with an apparent 5 translational initiation site at nucleotide positions 184-186 and ending at the stop codon found at nucleotide positions 1081-1083 (Figure 33; SEQ ID NO:55). The predicted polypeptide precursor (Figure 34, SEQ ID NO:56) is 299 amino acids long. The transmembrane domain is at about amino acids 9-31 of SEQ ID NO:56. N-gylcosylation sites are at about amino acids 64-67 and 115-118ofSEQIDNO:56. PR01359 has a calculated molecular weight of approximately 34,291 daltons and an estimated pi of approximately 9.87. Clone 10 DNA59219-1613 was deposited with the ATCC on September 15, 1998 and is assigned ATCC deposit no. 203220.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 34 (SEQ ID NO:56), revealed sequence identity between the PR01359 amino acid sequence and the following Dayhoff sequences (data incorporated herein): 15 GEM14384, P_R78622, A23699_l, P_R65244, A54898, AF059321_1, RNU55938_1, BTRNAST6_1, P_R75I99 and P_R63216.
EXAMPLE 21: Isolation of cDNA clones Encoding Human PROl 190
The method described in Example 1 above allowed the identification of a single Merck/Washington 20 University EST sequence, EST no. AA339802, which is designated herein as "DNA53943". Based on the DNA53943 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PROl 190.
PCR primers (forward and reverse) were synthesized:
forward PCR primer: (53943.fl) GGGAAACACAGCAGTCATTGCCTGC (SEQ ID NO:59)
reverse PCR primer: (53943.rl) GCACACGTAGCCTGTCGCTGGAGC (SEQ ID N0:60)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA53943 sequence which had the following nucleotide sequence:
hybridization probe: (53943.pl) CACCCCAAAGCCCAGGTCCGGTACAGCGTCAAACAAGAGTGG(SEQ 30 ID NO:61)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PROl 190 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human bone marrow. 35 DNA sequencing of tbe clones isolated as described above gave the full-length DNA sequence for
PROl 190 (designated herein as DNA59586-1520 [Figure 35, SEQ ID NO:57]; and the derived protein sequence for PROl 190.
391
The entire coding sequence of PR01190 is shown in Figure 35 (SEQ ID NO:57). Clone DNA59586-1520 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 340-342 and an apparent stop codon at nucleotide positions 3685-3687. The predicted polypeptide precursor is 1115 amino acids long. The full-length PROl 190 protein shown in Figure 36 has an estimated molecular weight of about 121,188 daltons and a pi of about 7.07. Other features of the PR01190 protein include: two 5 transmembrane domains at amino acids 16-30 and 854-879; a cytochrome P450 cystein heme-iron ligand signature at amino acids 1051-1060; an N-6 adenine-specific DNA methylases signature at amino acids 1045-1051; and potential N-glycosylation sites at amino acids 65-68,76-79,98-101,189-192,275-278,518-521,726-729, and 760-763. Clone DNA59586-1520 was deposited with the ATCC on September 29, 1998, and is assigned ATCC deposit no. 203288.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 36 (SEQ ID NO:58), revealed homology between the PR01190 amino acid sequence and the following Dayhoff sequences: AF004840_1, AF004841 1, AF026465_1, HSU72391_1, P_R13144, AX01_HUMAN, GEN13349, 158164, D87212_l, A53449, and D86983 1, and KIAA0230.
EXAMPLE 22: Isolation of cDNA clones Encoding Human PRQ1772
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA45120. Based on the DNA45120 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 20 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1772.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (45120.fn 5'-CCTTCACCTGCAGTACACCATGGGC-3' (SEQIDNO:64) reverse PCR primer (45120.rn 5 '-GTCACACACAGCTCTGGCAGCTGAG-3' (SEQ ID NO:65) 25 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA45120 sequence which had the following nucleotide sequence hybridation probe (4512Q,pl)
'-CCAAGTTCAGACACCACATGTACACCAACGTCAGCGGATTGACAAGC-3' (SEQ ID NO:66) RNA for construction of the cDNA libraries was isolated from human bone marrow tissue. 30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO1772 (designated herein as DNA59817-1703 [Figure 37, SEQ ID NO:62]; and the derived protein sequence for PRO1772.
The entire nucleotide sequence of DNA59817-1703 is shown in Figure 37 (SEQ ID NO:62). Clone DNA59817-1703 contains a single open reading frame with an apparent translational initiation site at nucleotide 35 positions 93-95 and ending at the stop codon at nucleotide positions 1554-1556 (Figure 37). The predicted polypeptide precursor is 487 amino acids long (Figure 38). The full-length PR01772 protein shown in Figure 38 has an estimated molecular weight of about 53,569 daltons and a pi of about 7.68. Analysis of the full-length
392
PR01772 sequence shown in Figure 38 (SEQ ID NO:63) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 36, a transmembrane domain from about amino acid 313 to about amino acid 331, potential N-glycosylation sites from about amino acid 119 to about amino acid 122, from about amino acid 184 to about amino acid 187, from about amino acid 243 to about amino acid 246 and from about amino acid 333 to about amino acid 336, potential N-myristolation sites from about amino acid 41 5 to about amino acid 46, from about amino acid 59 to about amino acid 64, from about amino acid 73 to about amino acid 78, from about amino acid 133 to about amino acid 138, from about amino acid 182 to about amino acid 187, from about amino acid 194 to about amino acid 199, from about amino acid 324 to about amino acid 329, from about amino acid 354 to about amino acid 359, from about amino acid 357 to about amino acid 362, from about amino acid 394 to about amino acid 399, from about amino acid 427 to about amino acid 432 and 10 from about amino acid 472 to about amino acid 477 and a prokaryotic membrane lipoprotein lipid attachment site from about amino acid 136 to about amino acid 146. Clone DNA59817-1703 has been deposited withATCC on November 17, 1998 and is assigned ATCC deposit no. 203470.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 38 (SEQ ID NO:63), evidenced significant IS homology between the PR01772 amino acid sequence and the following Dayhoff sequences: P R30823, MDP1_PIG, MDP1HUMAN, P_R13857, P_R53920, MDPl_MOUSE, P_R30822, JC4222, CELF52C6_2 and MYV027 13.
EXAMPLE 23: Isolation of cDNA clones Encoding Human PRQ1248 20 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 7494. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST 25 or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56056.
In light of the sequence homology between the DNA56056 sequence and an EST contained within the 30 Merck EST clone no. AA404441, the Merck EST clone no. AA404441 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 39 and is herein designated as DNA60278-1530.
Clone DNA60278-1530 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 122-124 and ending at the stop codon at nucleotide positions 671-673 (Figure 39). 35 The predicted polypeptide precursor is 183 amino acids long (Figure 40). The full-length PR01248 protein shown in Figure 40 has an estimated molecular weight of about 20,574 daltons and a pi of about 6.60. Analysis of the full-length PR01248 sequence shown in Figure 40 (SEQ ID NO:68) evidences the presence of the
393
PCT/US99/20I11
following: a signal peptide from abom amino acid 1 to about amino acid 20, a transmembrane domain from about amino acid 90 to about amino acid 112 and potential N-glycosylation sites from about amino acid 21 to about amino acid 24, from about amino acid 38 to about amino acid 41 and from about amino acid 47 to about amino acid 50. Clone DNA60278-1530 has been deposited with ATCC on September 1, 1998 and is assigned ATCC deposit no. 203170.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 40 (SEQ ID NO:68), evidenced significant homology between the PR01248 amino acid sequence and the following Dayhoff sequences: AF026198_5, CELR12C12 5, PN0563, S64541J, PN0564, P R44881 and XLU78189_1.
EXAMPLE 24: Isolation of cDNA clones Encoding Human PRQ1316
The extracellular domain (ECD) which includes the signal sequence, if any, of publicly available databases known to contain secreted sequences were used to search various publicly available EST (Expressed Sequenced Tag) databases (GenBank, Merck/Wash. U). The search was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzymology 266:460-480 (1996)] as a comparison of the ECD 15 protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, WA).
The above search resulted in the identification of the EST, designated W55979 which showed homology with the secreted protein Dkk-1. The clone corresponding to EST W55979 (clone NbHH19W) was purchased 20 firom Merck/Washington University and the cDNA insert was obtained and sequenced in its entirety.
The nucleic acid sequence corresponding to the full length PR01316 (designated DNA60608-1577) encoded by the purchased clone, is shown in Figure 41 (SEQ ID NO:69). DNA60608-1577 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 211-213, and a stop codon at nucleotide positions 988-990 (Figure 42; SEQ ID N0:70). The predicted polypeptide precursor is 259 amino 25 acids long. Additional regions of significant interest include the nucleotide residues encoding the signal peptide (211-283), an N-glycosylation site (364-366), and the Zn(2)-Cys(6) binuclear cluster domain (505-655). Clone DNA60608-1577 has been deposited with ATCC and is assigned ATCC deposit no. 203126. The full-length PR01316 protein shown in Figure 42 has an estimated molecular weight of about 28,447 daltons and a pi of about 9.48.
Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of the full-length sequence, PR01316 shows significant amino acid sequence identity to the dickkopf family of proteins. Additionally, DNA60608 has shown homology to AF030433_1, LFE4_CHICK, COL RABIT, YQI6_CAEEL, ITB6_HUMAN, CONO_LYMST, S41033, D63483_l, D86864_l and AB001978_1.
EXAMPLE 25: Isolation of cDNA clones Encoding Human PROl 197
An initial DNA sequence, referred to herein as DNA56267, was identified using a yeast screen, in a human SK-Lu-1 adenocarcinoma cDNA library that preferentially represents the 5' ends of the primary cDNA
394
clones. ONAS6267 was used to synthesize oligonucleotides for use as probes to isolate a clone of the full-length coding sequence for PROl 197 from a human breast carcinoma cDNA library.
SEQ ID NO:73: 5'AATTC ATGGCAAAT ATTTCCCTTCCC3' (forward);
SEQ ID NO:74: 5'TGGTAAACTGGCCCAAACTCGG3' (reverse); and
SEQ ID NO:75:5 'TTAAAGTCATCCGTCCTTGGCTCAGGATTTGGAGAGCTTGCACCACCAAA3'
(probe).
The full length DNA60611-1524 clone shown in Figure 43 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 311-313 and ending at the stop codon found at nucleotide positions 1400-1402 (Figure 43; SEQ ID NO:71). The predicted polypeptide precursor (Figure 44, SEQ ID NO:72) is 363 amino acids long. The signal peptide is at about amino acids 1-24 of SEQ ID NO:72. 10 PROl 197 has a calculated molecular weight of approximately 38,825 daltons and an estimated pi of approximately 9.88. Clone DNA60611-1524 has been deposited with ATCC and is assigned ATCC deposit no. 203175.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 44 (SEQ ID NO:72), revealed sequence identity 15 between the PROl 197 amino acid sequence and the following Dayhoff sequences (information from database incorporated herein): Y144_HUMAN, 147141 (a gastric mucin, mucins are described in Ann. NY Acad. Sci.. 140(2):804-834 (1967), AMYH_YEAST, CELK06A9_3, CELZK783_1, HKR1YEAST, AB003521_1, D8789S_1, S61993 and YM96_YEAST.
EXAMPLE 26: Isolation of cDNA clones Encoding Human PRQ1293
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 115204. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, 25 CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56522. 30 In light of the sequence homology between the DNA56522 sequence and an EST contained within the
Incyte EST clone no. 2966119, the Incyte EST clone no. 2966119 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 45 and is herein designated as DNA60618-1557.
Clone DNA60618-1557 contains a single open reading frame with an apparent translational initiation 35 site at nucleotide positions 37-39 and ending at the stop codon at nucleotide positions 1060-1062 (Figure 45). The predicted polypeptide precursor is 341 amino acids long (Figure 46). The full-length PR01293 protein shown in Figure 46 has an estimated molecular weight of about 38,070 daltons and a pi of about 6.88. Analysis
395
of the full-length PR01293 sequence shown in Figure 46 (SEQ ID NO:77) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about amino acid 237 to about amino acid 262, a potential N-glycosylation site from about amino acid 205 to about amino acid 208, a cell attachment sequence from about amino acid 151 to about amino acid 152 and an amino acid sequence block having homology to coproporphyrinogen III oxidase proteins from about amino acid 115 5 to about amino acid 140. Clone DNA60618-1557 has been deposited with ATCC on September 29, 1998 and is assigned ATCC deposit no. 203292.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 46 (SEQ ID NO:77), evidenced significant homology between the PR01293 amino acid sequence and the following Dayhoff sequences: HSVCD54_1, 10 A33HUMAN, AF009220_1, HSU82279_1, AF004230_1, P_R13272, AF004231_1, AF043644_1, S44125 and HSIGGHC85_1.
EXAMPT.F. 37: Isolation of cDNA clones Encoding Human PRO 1380
A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated IS DNA45776. Based on the DNA45776 sequence, oligonucleotide probes were generated and used to screen a human retina library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain tbe Sfil site; see. Holmes et al., Science. 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer (45776.fn 5'-TTTTGCGGTCACCATTGTCTGC-3' (SEQ ID N0:80) and reverse PCR nrimer (45776.rD 5'-CGTAGGTGACACAGAAGCCCAGG-3' (SEQ ID NO:81). Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45776 sequence which had the following nucleotide sequence:
hybridization probe (45776.pd 25 5'-TACGGCATGACCGGCTCCTTTCCTATGAGGAACTCCCAGGCACTGATAT-3' (SEQ ID NO:82).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01380 gene using the probe oligonucleotide and one of the PCR primers.
A full length clone was identified that contained a single open reading frame with an apparent 30 translational initiation site at nucleotide positions 36-38, and a stop signal at nucleotide positions 1461-1463 (Figure 47; SEQ ID NO:78). The predicted polypeptide precursor is 470 amino acids long has a calculated molecular weight of approximately 51,715 daltons and an estimated pi of approximately 7.86. Additional features include transmembrane domains at about amino acids 50-74,105-127,135-153,163-183,228-252,305-330, and 448-472; potential N-glycosylation sites at about amino acids 14-17 and 84-87; and a dihydrofoiate 35 reductase signature at about amino acids 60-68.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 48 (SEQ ID NO .79), evidenced homology
396
between the PR01380 amino acid sequence and the following Dayhoff sequences: HSU81375 1, CEZK809_6, CEK02E11_1, AF034102 1, JC4196, CEF36H2_2, P_R92315, YAC2YEAST, F1707_13,andCEF44D12_3.
Clone DNA60740-1615 was deposited with the ATCC on November 3,1998, and is assigned ATCC deposit no. 203456.
EXAMPI.F- 28: Isolation of cDNA clones Encoding Hnman PRO 1265
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ® database, designated EST Cluster No. 86995. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 10 identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). One or more of the ESTs used in the assembly was derived from acDNA library prepared 15 from RNA isolated from inflamed human adenoid tissue. The consensus sequence obtained therefrom is herein designated DNA55717.
In light of the sequence homology between the DNA55717 sequence and an EST sequence contained within Incyte EST no. 20965, EST clone no. 20965 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 49 and is herein designated as DNA60764. 20 The Ml length clone shown in Figure 49 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions 1780-1782 (Figure 49; SEQ ID NO:83). The predicted polypeptide precursor (Figure 50, SEQ ID NO:84) is 567 amino acids long. PR01265 has a calculated molecular weight of approximately 62,881 daltons and an estimated pi of approximately 8.97. Additional features include a signal peptide sequence at about amino acids 25 1-21; potential N-glycosylation sites at about amino acids 54-57, 134-137,220-223, and 559-562; and a region having amino acid sequence identify with D-amino acid oxidase proteins at about amino acids 61-80.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 50 (SEQ ID NO:84), revealed significant sequence identity between the PR01265 amino acid sequence and Dayhoff sequence no. MMU70429_1. 30 Sequence homology was also found to exist between the full-length sequence shown in Figure 50 (SEQ ID NO:84) and the following additional Dayhoff sequences: BC542A_1, E69899, S76290, MTV014_14, AOFB_HUMAN, ZMJ002204_1, S45812_l, DBRNAPD_1, and CRTl_SOYBN.
Clone DNA60764-1533 was deposited with the ATCC on November 10,1998, and is assigned ATCC deposit no. 203452.
EXAMPLE 29: Isolation of cDNA clones Encoding Human PRQ1250
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
397
cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. S6523. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 5 BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNAS6103.
In light of the sequence homology between the DNA56103 sequence and an EST sequence contained within the Incyte EST clone no. 3371784, the Incyte EST clone no. 3371784 was purchased and the cDNA insert 10 was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 51 and is herein designated as DNA60775-1532.
Clone DNA60775-1532 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 74-76 and ending at the stop codon at nucleotide positions 2291-2293 (Figure 51). The predicted polypeptide precursor is 739 amino acids long (Figure 52). The full-length PR01250 protein 15 shown in Figure 52 has an estimated molecular weight of about 82,263 daltons and a pi of about 7.55. Analysis of the full-length PR01250 sequence shown in Figure 52 (SEQ ID NO:86) evidences the presence of the following: a type II transmembrane domain from about amino acid 61 to about amino acid 80, a putative AMP-binding domain signature sequence from about amino acid 314 to about amino acid 325, and potential N-glycosylation sites from about amino acid 102 to about amino acid 105, from about amino acid 588 to about 20 amino acid 591 and from about amino acid 619 to about amino acid 622. Clone DNA60775-1532 has been deposited with ATCC on September 1, 1998 and is assigned ATCC deposit no. 203173.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 52 (SEQ ID NO:86), evidenced significant homology between the PRO1250 amino acid sequence and die following Dayhoff sequences: LCFB HUMAN, 25 S56508_l, BNAMPBP2_1, BNACS7_1, CELT08BI_6, CELC46F4_2, AF008206_6, CELR07C3_11, LMU70253_2 and AF008206_7.
EXAMPLE 30: Isolation of cDNA clones Encoding Human PRQ1475
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 30 in Example 1 above. This consensus sequence is herein designated DNA45639. Based on the DNA45639 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01475.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (45639.fl) 51 -GATGGCAAAACGTGTGTTTGACACG-3' (SEQIDNO:89) forward PCR primer (45639.f2) 5'-CCTCAACCAGGCCACGGGCCAC-3' (SEQIDNO:90)
reverse PCR primer (45639.rl) 5'-CCCAGGCAGAGATGCAGTACAGGC-3' (SEQ ID NO:91)
398
reverse PCR primer (4S639.r2~i 5-CCTCCAGTAGGTGGATGGATTGGCTC-3' {SEQ ID NO:92) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA45639 sequence which had the following nucleotide sequence hybridization probe f45639.pl)
51 -CTCACCTCATGAGGATGAGGCCATGGTGCTATTCCTCAACATGGTAG-3' (SEQ ID NO:93) 5 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01475 gene using die probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal brain tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 10 PR01475 (designated herein as DNA61185-1646 [Figure 53, SEQ ID NO:87]; and the derived protein sequence for PRO1475.
The entire nucleotide sequence of DNA61185-1646 is shown in Figure 53 (SEQ ID NO:87). Clone DNA61185-1646 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 130-132 and ending at the stop codon at nucleotide positions 2110-2112 (Figure 53). The predicted 15 polypeptide precursor is 660 amino acids long (Figure 54). The full-length PR01475 protein shown in Figure 54 has an estimated molecular weight of about 75,220 daltons and a pi of about 6.76. Analysis of die full-length PRO1475 sequence shown in Figure 54 (SEQ ID NO:88) evidences the presence of the following: a transmembrane domain from about amino acid 38 to about amino acid 55 and a homologous region to mouse GNTl from about amino acid 229 to about amino acid 660. Clone DNA61185-1646 has been deposited with 20 ATCC on November 17, 1998 and is assigned ATCC deposit no. 203464.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 54 (SEQ ID NO:88), evidenced significant homology between the PR01475 amino acid sequence and the following Dayhoff sequences: GNT1MOUSE, CGU65792_1, CGU65791_1, P_R24781, CELF48E3_1, G786_HUMAN, P_W06547, GNT1_CAEEL, 25 219 HUMAN and EF07_MOUSE.
EXAMPLE 31: Isolation of cDNA clones Encoding Human PRQ1377
An initial DNA sequence, referred to herein as DNA46892, was identified using a yeast screen, in a human umbilical vein endothelial cell cDNA library that preferentially represents the 5' ends of the primary 30 cDNA clones. Based on the DNA46892 sequence, the following oligonucleotides were synthesized for use as probes to isolate a clone of the full-length coding sequence for PR01377 from a human fetal kidney cDNA libraiy: GTTGTGGGTGAATAAAGGAGGGCAG (SEQ ID NO:96), CTGTGCTCATGTTCATGGACAACTG (SEQ ID NO:97), and GGATGATTTCATCTCCATTAGCCTGCTGTCTCTGGCTATGTTGGTGGGAT(SEQ ID NO:98).
The full length DNA61608-1606 clone shown in Figure 55 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 149-151 and ending at the stop codon found at nucleotide positions 1070-1072 (Figure 55; SEQ ID NO:94). The predicted polypeptide precursor (Figure 56,
399
SEQ ID NO:95) is 307 amino acids long. PR01377 has a calculated molecular weight of approximately 32,231 daltons and an estimated pi of approximately 6.62. Additional features include: a signal peptide at about amino acids 1-18; potential N-glycosylation sites at about amino acids 29-32 and 241-244, and transmembrane domains at about amino acids 37-56, 106-122, 211-230, 240-260, and 288-304.
An analysis of the Dayhoff database (version 3S.4S SwissProt 35), using a WU-BLAST2 sequence 5 alignment analysis of the full-length sequence shown in Figure 56 (SEQ ID NO:95), revealed some homology between the PR01377 amino acid sequence and the following Dayhoff sequences: CET01D3 6, CET28F34, CEF26D10_3, S66962, ATX2_YEAST, CEHI3N06_8, S49959, YIC3_YEAST, G02273, and P_W35557.
Clone DNA61608-1606 has been deposited with ATCC and is assigned ATCC deposit no. 203239.
EXAMPLE 32: Isolation of cDNA clones Encoding Human PRQ1326
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from tbe LIFESEQ8 database, designated Incyte Cluster No. 59366, also referred herein as "DNA10295". This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 15 (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). One or more of the ESTs was derived from RNA 20 isolated from tumor tissue removed from the penis of a male with squamous ceil carcinoma. The consensus sequence obtained therefrom is herein designated DNA56257.
In light of the sequence homology between the DNA56257 sequence and an EST sequence contained within Incyte EST no. 1450878, the EST clone 1450878 was purchased and the cDNA insert was obtained and sequenced in its entirely. The sequence of this cDNA insert is shown in Figure 57 and is herein designated as 25 "DNA62808-1582".
The full length clone shown in Figure 57 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 112 to 114 and ending at the stop codon found at nucleotide positions 1315 to 1317 (Figure 57; SEQ ID NO:99). The predicted polypeptide precursor (Figure 58, SEQ ID N0:100) is 401 amino acids long. Other features of the PR01326 protein include: a signal sequence at about 30 amino acids 1-29; a ribosomal protein S3Ae homologous region at about amino acids 129-166; and potential N-glycosylation sites at about amino acids 109-112,144-147 and 398-401. PR01326 has a calculated molecular weight of approximately 45,333 daltons and an estimated pi of approximately 4.95. Clone DNA62808-1582 was deposited with the ATCC on October 20, 1998 and is assigned ATCC deposit no. 203358.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 35 alignment analysis of the full-length sequence shown in Figure 58 (SEQ ID NO: 100), revealed some homology between the PR01326 amino acid sequence and the following Dayhoff sequences: AC004013 1, HROMHCEMB J ,CEF47A4_2, A45592, MYSPJHUMAN.NFU43192_l ,ONGMBWMZ_l, CELC25A11_2,
400
CELC25A11_1, and A42184.
EXAMPLE 33: Isolation of cDNA clones Encoding Human PRO 1249
Use of die signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster no. 122605. This EST cluster sequence 5 was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 10 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56060.
In light of the sequence homology between the DNA56060 sequence and an EST sequence contained within the Incyte EST clone no. 2630770, the Incyte EST clone no. 2630770 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 59 and is herein designated 15 as DNA62809-1531.
Clone DNA62809-1531 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 3-5 and ending at the stop codon at nucleotide positions 3270-3272 (Figure 59). The predicted polypeptide precursor is 1089 amino acids long (Figure 60). The full-length PR01249 protein shown in Figure 60 has an estimated molecular weight of about 118,699 daltons and a pi of about 8.49. Analysis of 20 the full-length PR01249 sequence shown in Figure 60 (SEQ ID NO: 102) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 16, transmembrane domains from about amino acid position 317 to about amino acid position 341, from about amino acid position 451 to about amino acid position 470, from about amino acid position 481 to about amino acid position 500, from about amino acid position 510 to about amino acid position 527, from about amino acid position 538 to about amino acid position 25 555, from about amino acid position 831 to about amino acid position 850, from about amino acid position 1016 to about amino acid position 1034 and from about amino acid position 1052 to about amino acid position 1070, a leucine zipper pattern sequence from about amino acid 843 to about amino acid 864 and potential N-glycosyiations sites from about amino acid 37 to about amino acid 40 and from about amino acid 268 to about amino acid 271. Clone DNA62809-1531 has been deposited with ATCC on September 9,1998 and is assigned 30 ATCC deposit no. 203237.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 60 (SEQ ID NO: 102), evidenced significant homology between the PR01249 amino acid sequence and the following Dayhoff sequences: AC004472_3, AB004539J7, S64782, S62432, YJG2YEAST, CELC27A12_8, YKQ5 YEAST, AB009505_3, SPBC24E9_8 35 and AF060218 4.
401
EXAMPLE 34: Isolation of cDNA clones Encoding Human PRQ1315
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA35925. Based on the DNA35925 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 5 PR01315.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (35925.fn 5'-CGCTGCTGCTGTTGCTCCTGG-3' (SEQ ID N0:105)
forward PCR primer (35925.f2) 5'-CAGTGTGCCAGGACTTTG-3' (SEQ ID NO: 106)
forward PCR primer (35925.f31 5' -AGTCGCAGGCAGCGTTGG-3' (SEQ ID NO: 107)
reverse PCR primer (35925.rn 5'-CTCCTCCGAGTCTGTGTGCTCCTGC-3' (SEQ ID NO: 108)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35925 sequence which had the following nucleotide sequence hvhridiration nrobe (35925.pi)
'-GGACGGGCAGTTCCCTGTGTCTCTGGTGGTTTGCCTAAACCTGCAAACATC-3' (SEQ ID NO: 109) 15 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer par identified above. A positive library was then used to isolate clones encoding the PR01315 gene using the probe oligonucleotide and one of die PCR primers. RNA for construction of the cDNA libraries was isolated from human retina tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 20 PR01315 (designated herein as DNA62815-1576 [Figure 61, SEQ ID NO: 103]; and the derived protein sequence for PR01315.
The entire nucleotide sequence of DNA62815-I576 is shown in Figure 61 (SEQ ID NO: 103). Clone DNA62815-1576 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 121-123 and ending at the stop codon at nucleotide positions 1447-1449 (Figure 61). The predicted 25 polypeptide precursor is 442 amino acids long (Figure 62). The full-length PR01315 protein shown in Figure 62 has an estimated molecular weight of about 49,932 daltons and a pi of about 4.55. Analysis of the full-length PR01315 sequence shown in Figure 62 (SEQ ID NO: 104) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about amino acid 140 to about amino acid 163 and potential N-glycosylation sites from about amino acid 71 to about amino acid 74, 30 from about amino acid 80 to about amino acid 83, from about amino acid 89 to about amino acid 92, from about amino acid 204 to about amino acid 207 and from about amino acid 423 to about amino acid 426. Clone DNA62815-1576 has been deposited with ATCC on September 9, 1998 and is assigned ATCC deposit no. 203247.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 35 alignment analysis of the full-length sequence shown in Figure 62 (SEQ ID NO: 104), evidenced significant homology between the PR01315 amino acid sequence and the following Dayhoff sequences: MMU53696_1, NVY08571_2, B64560, STMSLPE_1, P_R80508, P_W19258, A55817, GEN14043, AE000768_7 and
402
PCT/US99/201H
RNMUC ASGP5_lpSMC.
FX A MPT.F 3S- Isolation of cDNA clones Encoding Human PRQ1599
Incyte EST no. 1491360 was identified as a sequence of interest using the techniques described in Example 1 above having a BLAST score of 70 or greater that does not encode a known protein. The nucleotide 5 sequence of EST no. 1491360 and its complementary sequence is designated herein "DNA37192". Based on the DNA37192 sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01599.
PCR primers (forward and reverse) were synthesized:
forward PCR primer: GACGTCTGCAACAGCTCCTGGAAG (37192.fl; SEQ ID NO:112)
reverse PCR primer: CGAGAAGGAAACGAGGCCGTGAG (37192.rl; SEQ ID NO:113)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA37192 sequence which had the following nucleotide sequence:
hybridization probe: TGACACTTACCATGCTCTGCACCCGCAGTGGGGACAGCCACAGA (SEQ ID 15 NO: 114).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01S99 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal liver tissue. 20 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PR01S99 (designated herein as DNA6284S-1684 [Figure 63, SEQ ID N0:110]; and the derived protein sequence for PR01599.
TheentirecodingsequenceofPR01599isshowninFigure63(SEQIDNO:HO). Clone DNA62845-1684 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 25 69-71 and an apparent stop codon at nucleotide positions 918-920. The predicted polypeptide precursor is 283 amino acids long. The full-length PR01S99 protein shown in Figure 64 has an estimated molecular weight of about 30,350 daltons and a pi of about 9.66. Additional features of PRO1599 include: a signal peptide at about amino acids 1-30; potential N-glycosylation sites at about amino acids 129-132 and 189-192; a potential cAMP and cGMP-dependent protein kinase phosphorylation site at about amino acids 263-266; potential N-30 myristoylation sites at about amino acids 28-33,55-60,174-179, and236-241; a potential amidation site at about amino acids 144-147; and a serine protease, trypsin family, histidine active site at about amino acids 70-75.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 64 (SEQ ID NO:111), revealed significant homology between the PR01599 amino acid sequence and the following Dayhoff sequence: CFADPIG. 35 Homology was also found between the PR01599 amino acids sequence and the following additional Dayhoff sequences. CFADJHUMAN; PR05421; PR55757; PR05772; GRAMHUMAN; MUSLMETJ; P_P80335; P_R55758; A42048_l; and P_W05383.
403
Clone DNA6284S-1684 was deposited with the ATCC on October 20, 1998 and is assigned ATCC deposit no. 203361.
EXAMPLE 36: Isolation of cDNA clones Encoding Human PRQ1430
A DNA sequence designated herein as DNA49433 was obtained as described in Example 1 above.
Merck EST no. T49469, which was identified as being an EST of interest from the assembly, was purchased and the cDNA insert was obtained and sequenced in entirety.
DNA sequencing of the clone as described above gave the full-length DNA sequence for PR01430, which is designated herein as "DNA64842-1632" (SEQ ID NO:115), and the derived protein sequence for PRO1430 (SEQ ID NO: 116). Clone DNA64842-1632 contains a single open reading frame with an apparent 10 translational initiation site at nucleotide positions 82-84, and an apparent stop codon at nucleotide positions 1075-1077. The full-length PRO1430 protein shown in Figure 66 has an estimated molecular weight of about 35,932 daltons and a pi of about 8.45. The predicted polypeptide precursor is 331 amino acids long. Additional features include a signal peptide at about amino acids 1-17; a potential N-glycosylation site at about amino acids 171-174, and regions of homology with short chain alcohol dehydrogenase family proteins at about amino acids 29-51, 15 116-126, 180-217, and 222-230.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 66 (SEQ ID NO: 116), revealed significant homology between the PR01430 amino acid sequence and Dayhoff sequence no. P_W03198. Homology was also found between the PR01430 amino acid sequence and the following Dayhoff sequences: MTV030_10, 20 MTV037_2, A40116_l, S42651, CEC15H11_6, SPCC736_13, SCU43704_1, S19842, OXIR_STRAT, and OXIRSTRLI.
Clone DNA64842-1632 has been deposited with ATCC and is assigned ATCC deposit no. 203278.
EXAMPLE 37: Isolation of cDNA clones Encoding Hitman PRQ1374 25 A consensus DNA sequence encoding PR01374 was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA47357". Based on the DN A47357 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01374.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 5' CGGGACAGGAGACCCAGAAAGGG3' (SEQ ID NO:I19) and;
reverse PCR primer 5'GGCCAAGTGATCCAAGGCATCTTC3' (SEQ ID N0:120).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47357 sequence which had the following nucleotide sequence:
hvbridizationprobe 5'CTGCGGGACCTGACTAGATTCTACGACAAGGTACTTTCTTTGCATGGGG5' (SEQ ID NO: 121).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
404
PCT/US99/20I11
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PRO1374 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from a human adenocarcinoma cell line.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01374 and the derived protein sequence for PR01374.
The entire coding sequence of PR01374 is shown in Figure 67 (SEQ ID NO: 117). Clone DNA64849-
1604 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 20-22 and an apparent stop codon at nucleotide positions 1653-1655 of SEQ ID NO: 117. The predicted polypeptide precursor is 544 amino acids long. The approximate locations of die signal peptide, N-glycosylation sites, leucine zipper patterns, and ribonucleotide reductase small subunit signature are indicated in Figure 68. 10 Clone DNA64849-1604 has been deposited with the ATCC and is assigned ATCC deposit no. 203468. The full-length PR01374 protein shown in Figure 68 has an estimated molecular weight of about 61,126 daltons and a pi of about 6.4.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 68 (SEQ ID NO: 118), revealed sequence identity 15 between the PR01374 amino acid sequence and the following Dayhoff sequences: CEF35G2_4, P_W37046, S44204, CET28D61, CET20B3_6, CELC14E23, CUAL_CHICK, ATM7J2_3, S74997andHIVH5994R8_l.
EXAMPLE 38: Isolation of cDNA clones Encoding Human PRQ1311
A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 20 DNA37721. The DNA37721 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and proprietary EST DNA databases (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA; Genentech, South San Franscisco, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul etal., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 25 in some cases 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated "DNA48616". Based on the DNA48616 sequence, oligonucleotide probes were generated and used to screen a human aortic endothelial cell library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRKSB is a precursor of pRK5D 30 that does not contain the Sfil site; see, Holmes et al., Science. 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp.
PCR primers (forward and reverse) were synthesized:
forward PCR primer f48616.fl1 5'-ATCATCTATTCCACCGTGTTCTGGC-3' (SEQ ID NO: 124)
reverse PCR primer (48616.rU 5' -G AC AG AGTGCTCC ATG ATG ATGTCC-3' (SEQ ID NO: 125) 35 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48616 sequence which had the following nucleotide sequence:
hybridization probe (48616.pl>
405
'-CCTGTCTGTGGGCATCTATGCAGAGGTTGAGCGGCAGAAATATAAAACCC-3' (SEQ ID NO:126) In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01311 gene using the probe oligonucleotide and one of the PCR primers.
A full length clone was identified that contained a single open reading frame with an apparent 5 translational initiation site at nucleotide positions 195-197, and a stop signal at nucleotide positions 1077-1079 (Figure 69; SEQ ID NO: 122). The predicted polypeptide precursor is 294 amino acids long has a calculated molecular weight of approximately 33,211 daltons and an estimated pi of approximately 5.35 Additional features include: a signal sequence at about amino acids 1-44; possible transmembrane domains at about amino acids 22-42, 57-85, 94-116, and 230-257; potential N-glycosylation sites at about amino acids 118-121, 1899-192, and 10 230-233; potential tyrosine kinase phosphorylation sites at about amino acids 3-11 and 129-136; potential N-myristoylation sites at about amino acids 80-85, 109-114, 180-185, 218-223, 248-253, 276-281, 285-290, and 287-292; and a cell attachment sequence at about amino acids 3-5.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 70 (SEQ ID NO: 123), evidenced some homology 15 between the PR01311 amino acid sequence and the following Dayhoff sequences: AF065389 1, AF053455 1, CD63 HUMAN, A15_HUMAN, AF043906_1, C151_HUMAN, AF053453_1, AF054838_1, P_R91446, and CD82_HUM AN.
Clone DNA64863-1573 was deposited with the ATCC on September 9, 1998, and is assigned ATCC deposit no. 203251.
EXAMPLE 39: Isolation of cDNA clones Encoding Human PRQ1357
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 69537. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 25 databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 30 Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56034.
In light of the sequence homology between the DNA56034 sequence and an EST sequence contained within the incyte EST clone no. 936239, the Incyte EST clone no. 936239 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 71 and is herein designated as DNA64881-16G2.
Clone DNA64881-1602 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 74-76 and ending at the stop codon at nucleotide positions 1526-1528 (Figure 71). The predicted polypeptide precursor is 484 amino acids long (Figure 72). The full-length PR01357 protein
406
shown in Figure 72 has an estimated molecular weight of about 52,468 daitons and a pi of about 7.14. Analysis of the full-length PR01357 sequence shown in Figure 72 (SEQ ID NO: 128) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 21, potential N-glycosylation sites from about amino acid 48 to about amino acid 51, from about amino acid 264 to about amino acid 267 and from about amino acid 401 to about amino acid 404, a glycosaminoglycan attachment site from about amino acid 412 to 5 about amino acid 415 and an amino acid sequence block having homology to the LBP/BPI/CETP family of proteins from about amino acid 407 to about amino acid 457. Clone DNA64881-1602 has been deposited with ATCC on September 9, 1998 and is assigned ATCC deposit no. 203240.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 72 (SEQ ID NO: 128), evidenced significant 10 homology between the PRO1357 amino acid sequence and the following Dayhoff sequences: MMU46068_1, S17447, MMU1_1, BPI RABIT, P_W16808, P_R21844, PSP_MOUSE, HSLBPEX1J and BTU79413_1.
EXAMPLE 40: Isolation of cDNA clones Encoding Human PRQ1244
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST IS cluster sequence from the LIFESEQ* database, designated cluster no. 7874. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA databases (LIFESEQ9, Incyte Pharmaceuticals, Palo Alto, CA; Genentech, South San Francisco, CA) to identify existing homologies. One or more of the ESTs was derived from a library constructed from tissue of the corpus cavernosum. The homology search was performed using 20 the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated "DNA56011".
In light of the sequence homology between the DNA56011 sequence and an EST sequence contained within Incyte EST No. 3202349, the EST clone no. 3202349 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 73 (SEQ ID NO: 129) and is herein designated "DNA64883-1526".
The full length clone shown in Figure 73 contained a single open reading frame with an apparent 30 translational initiation site at nucleotide positions 9-11 and ending at the stop codon found at nucleotide positions 1014-1016 (Figure 73; SEQ ID NO:129). The predicted polypeptide precursor (Figure 74, SEQ ID N0:130) is 335 amino acids long. PR01244 has a calculated molecular weight of approximately 38,037 daltons and an estimated pi of approximately 9.87. Other features include a signal peptide at about amino acids 1-29; transmembrane domains at about amino acids 183-205, 217-237, 271-287, and 301-321; potential N-35 glycosylation sites at about amino acids 71-74, and 215-218; and a cell attachment sequence at about amino acids 150-152.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
407
alignment analysis of the full-length sequence shown in Figure 74 (SEQ ID NO: 130), revealed homology between the PR01244 amino acid sequence and the following Dayhoff sequences: AF008554_1, P 485334, G02297, HUMN33S111,HUMN33S10_1,Y013_CAEEL,GEN 13255, S49758.E70107, and ERP5_MEDSA.
Clone DNA64883-1526 was deposited with the ATCC on September 9, 1998, and is assigned ATCC deposit no. 203253.
EXAMPLE 41: Isolation of cDNA clones Encoding Human PRQ1246
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 56853. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 10 databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 15 Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56021.
In light of the sequence homology between the DNA56021 sequence and an EST sequence contained within the Incyte EST clone no. 2481345, the Incyte EST clone no. 2481345 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 75 and is herein designated as DNA64885-1529.
Clone DNA64885-1529 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 119-121 and ending at the stop codon at nucleotide positions 1727-1729 (Figure 75). The predicted polypeptide precursor is 536 amino acids long (Figure 76). The full-length PRO1246 protein shown in Figure 76 has an estimated molecular weight of about 61,450 daltons and a pi of about 9.17. Analysis of the hill-length PRO1246 sequence shown in Figure 76 (SEQ ID NO: 132) evidences the presence of the 25 following: a signal peptide from about amino acid 1 to about amino acid 15, potential N-glycosylation sites from about amino acid 108 to about amino acid 111, from about amino acid 166 to about amino acid 169, from about amino acid 193 to about amino acid 196, from about amino acid 262 to about amino acid 265, from about amino acid 375 to about amino acid 378, from about amino acid 413 to about amino acid 416 and from about amino acid 498 to about amino acid 501 and amino acid sequence blocks having homology to sulfatase proteins from 30 about amino acid 286 to about amino acid 315, from about amino acid 359 to about amino acid 369 and from about amino acid 78 to about amino acid 97. Clone DNA64885-1529 has been deposited with ATCC on November 3, 1998 and is assigned ATCC deposit no. 203457.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 76 (SEQ ID NO: 132), evidenced significant 35 homology between the PR01246 amino acid sequence and the following Dayhoff sequences: PR51355, CELK09C41, BCU44852_1, IDS_HUMAN, G65169, E64903, ARSA HUMAN, GL6SJHUMAN, HSARSF 1 and GEN12648.
408
EXAMPLE 42: Isolation of cDNA clones Encoding Human PRO 1356
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 44725. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, 5 CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56023. 10 In light of the sequence homology between the DNA56023 sequence and an EST sequence contained within the Incyte EST clone no. 4071746, the Incyte EST clone no. 4071746 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 77 and is herein designated as DNA64886-1601.
Clone DNA64886-1601 contains a single open reading frame with an apparent translational initiation 15 site at nucleotide positions 122-124 and ending at the stop codon at nucleotide positions 812-814 (Figure 77). The predicted polypeptide precursor is 230 amino acids long (Figure 78). The full-length PR01356 protein shown in Figure 78 has an estimated molecular weight of about 24,549 daltons and apl of about 8.56. Analysis of the full-length PR01356 sequence shown in Figure 78 (SEQ ID NO: 134) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 24, transmembrane domains from about 20 amino acid 82 to about amino acid 102, from about amino acid 117 to about amino acid 140 and from about amino acid 163 to about amino acid 182, a potential N-glycosylation site from about amino acid 190 to about amino acid 193 and an amino acid sequence block having homology to the PMP-22/EMP/MP20 family of proteins from about amino acid 46 to about amino acid 59. Clone DNA64886-1601 has been deposited with ATCC on September 9, 1998 and is assigned ATCC deposit no. 203241.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 78 (SEQ ID NO: 134), evidenced significant homology between the PR01356 amino acid sequence and the following Dayhoff sequences: AB00014 1, AB000712_1, A39484, AF000959_1, AF035814_1,HSU89916_1,MMU19582_1,P_R30059,HUAC004125_1 and PM22_RAT.
EXAMPLE 43: Isolation of cDNA clones Encoding Human PRQ1275
A novel secreted molecuie, designated herein as DNA57700, was used to BLAST against Incyte's (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) proprietary database and Genbank's public database. Positive clones were identified and used to generate assembly files by seqext program. The search was 35 performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)] as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known
409
proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington).
A consensus DNA sequence was assembled relative to other GST sequences using repeated cycles of BLAST and phrap. This consensus sequence is designated herein "DNA59572".
Based on the DNAS9S72 consensus sequence and its relation to sequences identified in the assembly, 5 one of the clones (Incyte clone 2026581) including one of the sequences in the assembly was purchased and sequenced. Incyte clone 2026581 came from a library constructed of RNA from epidermal breast keratinocytes.
The entire coding sequence of PRO1275 is shown in Figure 79 (SEQ ID NO: 135). Clone DNA64888-1542 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 37-39 and an apparent stop codon at nucleotide positions 394-396 of SEQ ID NO:135. The predicted 10 polypeptide precursor is 119 amino acids long. The signal peptide is at about amino acids 1-25 of SEQ ID NO: 136. Clone DNA64888-1542 has been deposited with ATCC and is assigned ATCC deposit no. 203249. The full-length PR01275 protein shown in Figure 79 has an estimated molecular weight of about 13,248 daltons and a pi of about 7.78.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 15 alignment analysis of the full-length sequence shown in Figure 80 (SEQ ID NO: 136), revealed sequence identity between the PRO1275 amino acid sequence and the following Dayhoff sequences (information from database incorporated herein): B48151 (Mst98Cb), D86424 1 (high-sulfur keratin protein), P R79964 (connective tissue growth factor), CHRD RAT (chordin), MT DREPO (metallothionein), PL05 PLETR (plectoxins), P R25156 (Ig antigen), S73732_l (VLDP), AF025440_1 (OIP4) and P_R32757 (IGF-II).
EXAMPLE 44: Isolation of cDNA clones Encoding Human PRQ1274
A novel secreted molecule, designated herein as DNA57700, was used to blast against Incyte's (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) proprietary database and Genbank's public database. Positive clones were identified and used to generate assembly files by seqext program. The search was 25 performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)] as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington).
A consensus DNA sequence was assembled relative to other EST sequences using repeated cycles of
BLAST and phrap. This consensus sequence is designated herein "DNA59573".
Based on the DNA59573 consensus sequence and its relation to sequences identified in the assembly, one of the clones (Incyte clone 2623992) including one of the sequences in the assembly was purchased and sequenced. Incyte clone 2623992 came from a library constructed of RNA from epidermal breast keratinocytes. 35 TheentirecodingsequenceofPR01274isshowninFigure81(SEQIDNO:137). CloneDNA64889-
1541 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 24-26, and an apparent stop codon at nucleotide positions 354-356 of SEQ ID NO; 137. The predicted
410
polypeptide precursor is 110 amino acids long. The signal peptide is at about 1-24 of SEQ ID NO: 138. Conserved regions in the insulin family of proteins and an N-glycosylation site are indicated in Figure 82. Clone DNA64889-1541 has been deposited with ATCC and is assigned ATCC deposit no. 203250. The full-length PR01274 protein shown in Figure 82 has an estimated molecular weight of about 12,363 daltons and a pi of about 8.31.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 82 (SEQ ID NO: 138), revealed sequence identity between the PRO1274 amino acid sequence and the following Dayhoff sequences (information from database incorporated herein): CEW05B29, AF016922_1 (insulin-like growth factor 1), B48151, A53640, BTIGF2REC1 (insulin-like growth factor 2), HSNF1GEN12_1, TXA3_RADMA (neurotoxin 3), 10 CXMl_CONGE, P P61301, TXA4 RADMA (neurotoxin 4).
EXAMPLE 45: Isolation of cDNA clones Encoding Human PRQ1412
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ* database, designated Incyte Cluster No. 101368 , also referred herein as IS "DNA10643". This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 20 encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). One or more of the ESTs was derived from RNA isolated from fibroblasts of the prostate stroma removed from a male fetus. The consensus sequence obtained therefrom is herein designated "DNA58754".
In light of the sequence homology between the DNA58754 sequence and an EST sequence contained 25 within EST no. 3597385, the EST clone 3597385 was purchased and the cDNA insert was obtained and sequenced in its entirety. The sequence of this cDNA insert is shown in Figure 83 and is herein designated as "DNA64897-1628".
The full length clone shown in Figure 83 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 142 to 144 and ending at the stop codon found at nucleotide 30 positions 1075 to 1077 (Figure 83; SEQ ID NO: 139). The predicted polypeptide precursor (Figure 84, SEQ ID NO: 140) is 311 amino acids long. Other features of the PRO 1412 protein include: a signal sequence at about amino acids 1-28; a transmembrane domain at about amino acids 190-216; potential N-glycosylation sites at about amino acids 49-52, 91-94, 108-111, 128-131, 135-138 and 190-193; a tyrosine kinase phosphoiylation site at about amino acids 62-69; and a lysosome-associated membrane glycoprotein duplicated domain at about 35 amino acids 183-224. PR01412 has a calculated molecular weight of approximately 33,908 daltons and as estimated pi of approximately 6.87. Clone DNA64897-1628 was deposited with the ATCC on September 15, 1998, and is assigned ATCC deposit no. 203216.
411
An analysis of the Dayhoff database (version 3S.4S SwissProt 33), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 84 (SEQ ID NO: 140), revealed some homology between the PR01412 amino acid sequence and the following Dayhoff sequences: 150116, AF035963_1, NCA2 RAT, 161783, P_W07682, MMHCI35G15 3, S21461, MMIGL2_1, ONHIGMV9A_l and MMU70448_1.
EXAMPLE 46: Isolation of cDNA clones F.nqvling Human PRO 1557
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST sequence from the Genentech database, designated "DNA58763. This EST sequence was then compared to a variety of expressed sequence tag (EST) databases, which included the EST databases listed above, to identify 10 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained from the assembly is herein designated "DNA58763". IS In light of the sequence homology between the DNA58763 sequence and an EST sequence contained within the EST no.2267403, EST no. 2267403 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 85 and is herein designated as DNA64902-1667.
The full length clone shown in Figure 85 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 287 to 289 and ending at the stop codon found at nucleotide 20 positions 1640 to 1642 (Figure 85; SEQ ID NO: 141). The predicted polypeptide precursor (Figure 86, SEQ ID NO:142) is 451 amino acids long. PR01557 has a calculated molecular weight of approximately 49,675 daltons and an estimated pi of approximately 7.15. Additional features include: a signal sequence at about amino acids 1-25; a potential N-glycosylation site at about amino acids 114-117; a potential cAMP and cGMP-dependent protein kinase phosphorylation site at about amino acids 388-41; potential N-myristoylation sites at 2S about amino acids 54-49,66-71,146-151, and 367-372; potential amidation sites at about amino acids 36-39 and 205-208; and an ATP/GTP-binding site motif A (P-loop) at about amino acids 151-258.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), """g a WU-BLAST2 sequence alignment analysis of tbe full-length sequence shown in Figure 86 (SEQ ID NO: 142), revealed significant homology between the PR01557 amino acid sequence and Dayhoff sequence AF034606 1. Homology was also 30 found between the PR01557 amino acid sequence and the following Dayhoff sequences: P_W31559, AF031230_1, SOG DROME, CAll_MOUSE, P_R41320, CHRD RAT, P W40288, NEL_CHICK, and HSMUC5B_1.
Clone DNA64902-1667 was deposited with the ATCC on October 6, 1998, and is assigned ATCC deposit no. 203317.
F.XAMPLE 47: Isolation of cDNA clones Encoding Human PRQ1286
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
412
cluster sequence from the LIFESEQ* database, designated EST Cluster No. 86809. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identiiy existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Fnrvmnlnpv 266:460-480 (1996)). Those comparisons resulting in a 5 BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). ESTs in the assembly included those identified from tumors, cell lines, or diseased tissue. One or more of the ESTs was obtained from a cDNA library constructed from RNA isolated from diseased colon tissue. The consensus sequence obtained therefrom is herein designated DNA58822.
In light of the sequence homology between the DNA58822 sequence and an EST sequence contained within EST no. 169S434, EST clone no. 1695434 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 87 and is herein designated DNA64903-1553 (SEQ ID NO: 143).
The full length clone shown in Figure 87 contained a single open reading frame with an apparent 15 translational initiation site at nucleotide positions 93-95 and ending at the stop codon found at nucleotide positions 372-374 (Figure 87; SEQ ID NO:143). The predicted polypeptide precursor (Figure 88, SEQ ID NO:144) is 93 amino acids long, with a signal sequence at about amino acids 1-18. PR01286 has a calculated molecular weight of approximately 10, 111 daltons and an estimated pi of approximately 9.70.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 20 alignment analysis of the full-length sequence shown in Figure 88 (SEQ ID NO: 144), revealed some homology between the PR01286 amino acid sequence and the following Dayhoff sequences: SR5CARATH, CELC17H12_11, MCPD ENTAE, JQ2283, INVO_LEMCA, P_R07309, ADEVBCAGN_4, AF020947_1, CELT23H21, and MDH STRAR.
Clone DNA64903-1553 was deposited with the ATCC on September 15,1998 and is assigned ATCC 25 deposit no. 203223.
EXAMPLE 48: Isolation of cDNA clones Encoding Human PR01294
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 10559. This EST cluster 30 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) to identiiy existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 35 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA57203.
In light of the sequence homology between the DNA57203 sequence and an EST sequence contained
413
PCT/US99/20I11
within the Incyte EST clone no. 3037763, the Incyte EST clone no. 3037763 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 89 and is herein designated as DNA64905-1558.
Clone DNA64905-1558 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 110-112 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 89).
The predicted polypeptide precursor is 406 amino acids long (Figure 90). The full-length PR01294 protein shown in Figure 90 has an estimated molecular weight of about 46,038 daltons and a pi of about 6.50. Analysis of the full-length PR01294 sequence shown in Figure 90 (SEQ ID NO: 146) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 21 and potential N-glycosylation sites from about amino acid 177 to about amino acid 180 and fiom about amino acid 248 to about amino acid 251. 10 Clone DNA64905-1558 has been deposited with ATCC on September 15, 1998 and is assigned ATCC deposit no. 203233.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 90 (SEQ ID NO: 146), evidenced significant homology between the PR01294 amino acid sequence and the following Dayhoff sequences: 173636, 15 AF028740_1, AB006686S3_1, P_R98225, RNU78105_1, CELC48E7_4, CEF11C3J, SCP1_MESAU, TPM3HUMAN and CELK05B2_3.
EXAMPLE 49: Isolation of cDNA clones Encoding Human PRQ1347
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 20 in Example 1 above. This consensus sequence is designated herein "DNA47373". Based on the DNA47373 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01347.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer 5'GCGTGGTCCACCTCTACAGGGACG3' (SEQ ID NO: 149); and reverse PCR primer 5'GGAACTGACCCAGTGCTGACACC3' (SEQ ID N0:150).
Additionally, a synthetic oligonucleotide hybridization probe was constructed fiom the consensus DNA47373 sequence which had the following nucleotide sequence:
hybridization probe 5'GCAGATGCCACAGTATCAAGGCAGGACAAAACTGGTGAAGGATTC3' (SEQ ID 30 NO: 151).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01347 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human small intestine. 35 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PR01347 and tbe derived protein sequence for PR01347.
The entire coding sequence of PR01347 is shown in Figure 91 (SEQ ID NO: 147). Clone DNA64950-
414
within the Incyte EST clone no. 3037763, the Incyte EST clone no. 3037763 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 89 and is herein designated as DNA64905-1558.
Clone DNA64905-1558 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 110-112 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 89).
The predicted polypeptide precursor is 406 amino acids long (Figure 90). The full-length PR01294 protein shown in Figure 90 has an estimated molecular weight of about 46,038 daltons and a pi of about 6.50. Analysis of the full-length PR01294 sequence shown in Figure 90 (SEQ ID NO:146) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 21 and potential N-glycosylation sites from about amino acid 177 to about amino acid 180 and from about amino acid 248 to about amino acid 251. 10 Clone DNA64905-1558 has been deposited with ATCC on September 15, 1998 and is assigned ATCC deposit no. 203233.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 90 (SEQ ID NO: 146), evidenced significant homology between the PR01294 amino acid sequence and the following Dayhoff sequences: 173636, 15 AF028740_1, AB006686S3J, P_R98225, RNU78105_1, CELC48E7_4, CEF11C3J, SCP1_MESAU, TPM3_HUMAN and CELK05B2_3.
EXAMPLE 49: Isolation of cDNA clones Encoding Human PRQ1347
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 20 in Example 1 above. This consensus sequence is designated herein "DNA47373". Based on the DNA47373 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01347.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'GCGTGGTCCACCTCTACAGGGACG3' (SEQ ID NO:149); and reverse PCR primer 5'GGAACTGACCCAGTGCTGACACC3' (SEQ ID NO: 150).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47373 sequence which had the following nucleotide sequence:
hybridization probe 5'GCAGATGCCACAGTATCAAGGCAGGACAAA ACTGGTGAAGGATTC3' (SEQ ID 30 NO: 151).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01347 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human small intestine. 35 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PR01347 and the derived protein sequence for PR01347.
The entire coding sequence of PR01347 is shown in Figure 91 (SEQ ID NO: 147). Clone DNA64950-
414
1590 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 183-185, and an apparent stop codon at nucleotide positions 1683-1685 of SEQ ID NO: 147. The predicted polypeptide precursor is 500 amino acids long. The signal peptide is at about amino acids 1-17 and the transmembrane domain is at about 239-255 of SEQ ID NO: 148. Clone DNA64950-1590 has been deposited with ATCC and is assigned ATCC deposit no. 203224. The full-length PR01347 protein shown in Figure 92 5 has an estimated molecular weight of about 56,748 daltons and a pi of about 8.5.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 92 (SEQ ID NO: 148), revealed sequence identity between the PR01347 amino acid sequence and the following Dayhoff sequences (data incorporated herein): BUTY HUMAN, AF033107_1, HSU90142_1, HSU90144_1, HSB73_1, HS111M5_2, R052_HUMAN, 10 AF018080_1, HSAJ03147 4, and MOG_MOUSE.
EXAMPLE 50: Isolation of cDNA clones Encoding Hnman PRQ1305
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA38103. Based on the DNA38103 15 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01305.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer (381Q3.fn 5'-AACTGCTCTGTGGTTGGAAGCCTG-3' (SEQ ID NO: 154) 20 reverse PCR primer (381Q3.rtt 5'-CAGTCACATGGCTGACAGACCCAC-3' (SEQ ID NO: 155)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA38103 sequence which had the following nucleotide sequence hybridization probe (381Q3.pn
5AGGTTATCAGGGGCTTCACTGTGAAACCTGCAAAGAGG-3' (SEQ ID NO:156) 25 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01305 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 30 PR01305 (designated herein as DNA64952-1568 [Figure 93, SEQ ID NO:152]; and the derived protein sequence for PR01305.
The entire nucleotide sequence of DNA64952-1568 is shown in Figure 93 (SEQ ID NO: 152). Clone DNA64952-1568 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 126-128 and ending at the stop codon at nucleotide positions 900-902 (Figure 93). The predicted 35 polypeptide precursor is 258 amino acids long (Figure 94). The full-length PR01305 protein shown in Figure 94 has an estimated molecular weight of about 25,716 daltons and a pi of about 8.13. Analysis of the full-length PR01305 sequence shown in Figure 94 (SEQ ID NO-.153) evidences the presence of the following: a signal
415
PCT7US99/201H
peptide from about amino acid 1 to about amino acid 25, potential N-glycosylation sites from about amino acid 30 to about amino acid 33, from about amino acid 172 to about amino acid 175, from about amino acid 195 to about amino acid 198, from about amino acid 208 to about amino acid 211 and from about amino acid 235 to about amino acid 238 and an EGF-like domain cysteine pattern signature sequence from about amino acid 214 to about amino acid 225. Clone DNA64952-1568 has been deposited with ATCC on September 15, 1998 and 5 is assigned ATCC deposit no. 203222.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 94 (SEQ ID NO: 153), evidenced significant homology between the PR01305 amino acid sequence and the following Dayhoff sequences: CET22A3_7, LMA2_MOUSE,AFQ55580_1,AF016903_1,LMB2_MOUSE,P_R71730,LMB3_MOUSE,LMG1_HUMAN, 10 LMG1DROME and LMA5_MOUSE. As such, the PR01305 polypeptide does show homology to laminin and may be a laminin homolog.
EXAMPLE 51: Isolation of cDNA clones Encoding Human PRQ1273
An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) IS was searched and an EST was identified. This sequence was blasted against public databases and Incyte's database. The search was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)] as a comparison of the extracellular domain (ECD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA 20 sequences with the program "phrap'' (Phil Green, University of Washington, Seattle, Washington).
A consensus DNA sequence was assembled relative to other EST sequences using repeated cycles of BLAST and phrap. This consensus sequence is designated herein "DNA60747". Based on the DNA60747 consensus sequence and its relation to a sequence within the assembly of aligned sequences, Incyte clone 3541105 was purchased and sequenced in full. This Incyte clone came from a library constructed of RNA 25 isolated from seminal vesicle tissue.
The entire coding sequence of PR01273 is shown in Figure 95 (SEQ ID NO:157). Clone DNA65402-1540 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 26-28 and an apparent stop codon at nucleotide positions 515-517 of SEQ ID NO:157. The predicted polypeptide precursor is 163 amino acids long. The signal peptide is at about amino acids 1-20 and the 30 conserved region in lipocalins is at about amino acids 25-36 of SEQ ID NO: 158. Clone DNA65402-1540 has been deposited with ATCC and is assigned ATCC deposit no. 203252. The full-length PRO1273 protein shown in Figure 96 has an estimated molecular weight of about 18,045 daltons and a pi of about 4.87.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 96 (SEQ ID NO: 158), revealed sequence identity 35 between the PR01273 amino acid sequence and the following Dayhoff sequences (information from database incorporated herein): PGHD FELCA (prostaglandin^ d-isomerase precursor), S57748 (prostaglandin D synthetase precursor), LIPO_BUFMA (lipocalin precursor), S52354, QSP_CHICK, ECP191, LACB CAPHI,
416
OLFARANPI, D87752_l, and LACB_BOVIN.
EXAMPLE 52: Isolation of cDNA clones Encoding Human PRQ13Q2
A consensus DNA sequence encoding PR01302 was assembled relative to other EST sequences using repeated cycles of phrap as described in Example 1 above. This consensus sequence is designated herein 5 "DNA28742". Based on the DNA28742 consensus sequence, the assembly from which the consensus sequence was derived and other information and discoveries provided herein, the Incyte clone 3344926 (from a diseased spleen tissue library) was purchased and sequenced in full. Sequencing provided the full-length DNA sequence for PR01302 and the derived protein sequence for PR01302.
The entire coding sequence of PR01302 is shown in Figure 97 (SEQ ID NO: 159). Clone DNA65403-10 1565 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 43-45 and an apparent stop codon at nucleotide positions 1432-1435 of SEQ ID NO:159. The predicted polypeptide precursor is 463 amino acids long. The signal peptide is at about amino acids 1-15 and the transmembrane sequence is at about amino acids 351-370 of SEQ ID NO: 160. Clone DN A65403-1565 has been deposited with the ATCC and is assigned ATCC deposit no. 203230. The full-length PR01302 protein shown 15 in Figure 98 has an estimated molecular weight of about 50,082 daltons and a pi of about 7.3.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 98 (SEQ ID NO: 160), revealed sequence identity -between the PR01302 amino acid sequence and the following Dayhoff sequences (data incorporated herein): D86358_l, D86359_l, S71403_l, MAGJHUMAN, JH0593, MMSIAL2 _1, C22A_HUMAN, PGBMHUMAN, 20 PGBM HUMAN, LACH DROME, and KMLS HUMAN.
EXAMPLE 53: Isolation of cDNA clones Encoding Human PRQ1283
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA28753. Based on the DNA28753 25 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01283.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (28753.fn 5'-GGAGATGAAGACCCTGTTCCTG-3' (SEQ ID NO: 163) 30 forward PCR primer (28753.fill 5'-GGAGATGAAGACCCTGTTCCTGGGTG-3' (SEQ ID NO: 164) reverse PCR primer (28753.rl) 5'-GTCCTCCGGAAAGTCCTTATC-3' (SEQ ID NO: 165)
reverse PCR primer (28753.rllt 5'-GCCTAGTGTTCGGGAACGCAGCTTC-3' (SEQ ID NO: 166) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA28753 sequence which had the following nucleotide sequence 35 hybridization probe (28753.pl)
'-CAGGGACCTGGTACGTGAAGGCCATGGTGGTCGATAAGGACTTTCCGGAG-3' (SEQ ID NO:167) hybridization probe (28753.plD
417
'-CTGTCCTTCACCCTGGAGGAGGAGGATATCACAGGGACCTGGTAC-3' (SEQ ID NO: 168)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PRO1283 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human breast tumor tissue.
DNA sequencing of die clones isolated as described above gave the full-length DNA sequence for
PR01283 (designated herein as DNA65404-I551 [Figure 99, SEQ ID NO: 161]; and the derived protein sequence for PRO1283.
The entire nucleotide sequence of DNA65404-1551 is shown in Figure 99 (SEQ ID NO: 161). Clone DNA65404-1551 contains a single open reading frame with an apparent translational initiation site at nucleotide 10 positions 45-47 and ending at the stop codon at nucleotide positions 555-557 (Figure 99). The predicted polypeptide precursor is 170 amino acids long (Figure 100). The full-length PROI283 protein shown in Figure 100 has an estimated molecular weight of about 19,457 daltons and a pi of about 9.10. Analysis of the full-length PR01283 sequence shown in Figure 100 (SEQ ID NO: 162) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 17. Clone DNA65404-1551 has been deposited with 15 ATCC on September 9, 1998 and is assigned ATCC deposit no. 203244
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the frill-length sequence shown in Figure 100 (SEQ ID NO: 162), evidenced significant homology between the PR01283 amino acid sequence and the following Dayhoff sequences: A40464, VEGP_HUMAN, ALL1_CANFA, LALP_TRIVU, S51803, XELPDS_1, LIPO BUFMA, S52354, 20 QSP_CHICK and ERBPRAT.
EXAMPLE 54: Isolation of cDNA clones Encoding Human PRQ1279
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA30856. Based on the DNA30856 25 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1279.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (3Q856.fl1 5'-GGCTGCGGGACTGGAAGTCATCGGG-3' (SEQ ID NO:171) 30 forward PCR nrimer (30856.fin 5 -CTCCAGGCCATGAGGATTCTGCAG-3' (SEQ ID NO:172) forward PCR primer (30856. f!2> 5' -CCTCTGGTCTGTAACCAG-3' (SEQ ID NO: 173)
reverse PCR primer (3Q856.rn 5' -TCTGTGATGTTGCCGGGGTAGGCG-3' (SEQ ID NO:174)
reverse PCR primer (3Q856.rin 5'-CGTGTAGACACCAGGCTTTCGGGTG-3' (SEQ ID NO:175) reverse PCR primer (3Q856.rl2^ 5 '-CCCTTGATGATCCTGGTC-3' (SEQ ID NO: 176) 35 Additionally, synthetic oligonucleotide hybridization probes were constructed from the consensus DNA30856 sequence which had the following nucleotide sequences hybridization probe (30856.pD
418
'-AGGCCATGAGGATTCTGCAGTTAATCCTGCTTGCTCTGGCAACAGGGCTT-3' (SEQ ID NO: 177) hybridization probe (30856.pl H
'-GAGAGACCAGGATCATCAAGGGGTTCGAGTGCAAGCCTCACTC-3' (SEQ ID NO-.178)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 5 isolate clones encoding the PR01279 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human lung tumor tissue.
DNA sequencing of die clones isolated as described above gave the full-length DNA sequence for PR01279 (designated herein as DNA65405-1547 [Figure 101, SEQ ID NO:169]; and the derived protein sequence for PR01279.
The entire nucleotide sequence of DNA65405-1547 is shown in Figure 101 (SEQ ID NO: 169). Clone
DNA65405-1547 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 106-108 and ending at the stop codon at nucleotide positions 856-858 (Figure 101). The predicted polypeptide precursor is 250 amino acids long (Figure 102). The full-length PR01279 protein shown in Figure 102 has an estimated molecular weight of about 27,466 daltons and a pi of about 8.87. Analysis of the full-15 length PR01279 sequence shown in Figure 102 (SEQ ID N0:170) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 18, a serine protease, trypsin family, histidine active site from about amino acid 58 to about amino acid 63, potential N-glycosylation sites firom about amino acid 99 to about amino acid 102, from about amino acid 165 to about amino acid 168, from about amino acid 181 to about amino acid 184 and from about amino acid 210 to about amino acid 213, a glycosaminogiycan attachment 20 site from about amino acid 145 to about amino acid 148, amino acid sequence blocks present in kringle domain proteins from about amino acid 197 to about amino acid 209 and from about amino acid 47 to about amino acid 64, amino acid sequence blocks having homology to serine protease, trypsin family, histidine proteins from about amino acid 199 to about amino acid 209, from about amino acid 47 to about amino acid 63 and from about amino acid 220 to about amino acid 243 and amino acid sequence blocks having homology to apple domain proteins 25 from about amino acid 222 to about amino acid 249 and from about amino acid 189 to about amino acid 222. Clone DNA65405-1547 has been deposited with ATCC on November 17, 1998 and is assigned ATCC deposit no. 203476.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 102 (SEQ ID NO: 170), evidenced significant 30 homology between the PR01279 amino acid sequence and the following Dayhoff sequences: 156559, S55066, KLK7 RAT, KLK1RAT, KLKB RAT, KLK3_MOUSE, KLK8_RAT, AF013988_1, D78203J and HSU62801_1.
Additionally, DNA65405-1547 was obtained by purchasing the Incyte EST clone no. 2723646 and sequencing the insert of that clone, thereby giving the DNA65405-1547 sequence shown in Figure 101 (SEQ ID 35 NO-.169).
419
EXAMPLE 55: Isolation of cDNA clones Encoding Human PRQ1304
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA35745. Based on the DNA35745 consensus sequence, oligonucleotides were synthesized: 1) to identiiy by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 5 PR01304.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer (35745.fn 5'-GTGTTCTGCTGGAGCCGATGCC-3' (SEQ ID NO: 181)
forward PCR nrimer f35745.5'-GACATGGACAATGACAGG-3' (SEQ ID NO: 182)
forward PCR nrimer (3574S.f3^ 5'-CCTTTCAGGATGTAGGAG-3' (SEQ ID NO: 183)
forward PCR primer (35745,f41 5,-GATGTCTGCCACCCCAAG-3• (SEQ ID NO: 184)
reverse PCR primer (3S745.rl1 5'-GCATCCTGATATGACTTGTCACGTGGC-3' (SEQ ID NO:185) reverse PCR primer (35745.r2'> 5'-TACAAGAGGGAAGAGGAGTTGCAC-3' (SEQ ID NO:186) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35745 sequence which had the following nucleotide sequence 15 hybridization probe (35745.pit
' -GCCC ATTATGACGGCTACCTGGCTAAAGACGGCTCGAAATTCTACTGC AGCC-3(SEQ ID NO: 187) In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01304 gene using the probe oligonucleotide and one of the PCR primers. RNA 20 for construction of the cDNA libraries was isolated from human ovary tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO1304 (designated herein as DNA65406-1567 [Figure 103, SEQ ID NO: 179]; and the derived protein sequence for PR01304.
The entire nucleotide sequence of DNA65406-1567 is shown in Figure 103 (SEQ ID NO: 179). Clone 25 DNA65406-1567 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 23-25 and ending at the stop codon at nucleotide positions 689-691 (Figure 103). The predicted polypeptide precursor is 222 amino acids long (Figure 104). The full-length PR01304 protein shown in Figure 104 has an estimated molecular weight of about 25,794 daltons and a pi of about 6.24. Analysis of the full-length PR01304 sequence shown in Figure 104 (SEQ ID NO: 180) evidences the presence of the following: an 30 endoplasmic reticulum targeting sequence from about amino acid 219 to about amino acid 222, a potential N-glycosylation site from about amino acid 45 to about amino acid 48, FKBP-type peptidyl-prolyl cis-trans isomerase homology blocks from about amino acid 87 to about amino acid 123 and from about amino acid 129 to about amino acid 142 and EF-hand calcium binding domain protein homology blocks from about amino acid 202 to about amino acid 214 and from about amino acid 195 to about amino acid 214. Clone DNA65406-1567 35 has been deposited with ATCC on September 15, 1998 and is assigned ATCC deposit no. 203219.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 104 (SEQ ID NO: 180), evidenced significant
420
homology between the PR01304 amino acid sequence and the following Dayhoff sequences: AF040252_1, PR28980, S71238, CELC05C8J, VFU52045_1, S75144, FKB3_BOVIN, CELC50F2_6, CELB0511_12and P_R41781.
The DNA65406-1567 sequence was also obtained by isolating and sequencing the insert of Incyte EST clone no. 2813577.
EXAMPLE 56: Isolation of cDNA clones F.nr-nriing Human PRQ1317
Using the technique described in Example 1 above, Incyte EST no. 33598 was identified as a sequence of interest having a BLAST score of 70 or greater that did not encode a known protein. The sequence of Incyte EST no. 33598 is designated herein as "DNA36958". Based on the DNA36958 sequence, oligonucleotides can 10 be synthesized: 1) to identiiy by PCR a cDNA library that contains the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01317.
The following are suitable PCR primers (forward and reverse) that can be synthesized based on the DNA36958 sequence:
forward PCR primer: AGGGACCATTGCTTCTTCCAGGCC (36958.fl; SEQ ID N0:190)
reverse PCR primer: CGTTACATGTCTCCAAGGGGAATG (36958.rl; SEQ ID NO:191)
Additionally, a synthetic oligonucleotide hybridization probe can be constructed from the consensus DNA36958 sequence having the following nucleotide sequence:
hybridization probe: CCTGTGCTAAGTGCCCCCCAAATGCTTCCTGTGTCAATAACACTCACTGC (36958.pl; SEQ ID NO: 192)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries is screened by PCR amplification with the PCR primer pair identified above. A positive library is then used to isolate clones encoding the PR01317 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries can be isolated from tissue containing the sequence of interest, for example from peripheral blood, particularly blood taken from a patient having a high leukocyte count (e.g 25 hypereosinophilia).
The full-length DNA sequence for PR01317, designated herein as DNA65408-1578 (Figure 105; SEQ ID NO:188) was obtained by purchasing Incyte EST no. 335958, obtaining the cDNA insert, and sequencing it in its entirety. Incyte clone no. 335958 originated from a library constructed using RNA isolated from peripheral blood cells apheresed from a male patient afflicted with hypereosinophilia. 30 The entire coding sequence of PR01317 is shown in Figure 105 (SEQ ID NO: 188). Clone DNA65408-
1578 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 6-8 and an apparent stop codon at nucleotide positions 228-230. The predicted polypeptide precursor is 74 amino acids long. The full-length PR01317 protein shown in Figure 106 has an estimated molecular weight of about 7,831 daltons and a pi of about 9.08. Additional features include: a signal peptide at about amino acids 35 1-18, potential N-glycosylation sites at about amino acids 34-37 and 39-42, and a microbodies C-terminal targeting signal at amino acids 72-74.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence
421
WO 00/12708 PCT/US99/20111
alignment analysis of the full-length sequence shown in Figure 106 (SEQ ID NO: 189), revealed significant homology between the PR01317 amino acid sequence and the Dayhoff sequence designated CD97_HUMAN. Additionally, some homology was found between the PR01317 amino acid sequence and the following Dayhoff sequences: GEN12618, CELZK783_1, G156_PARPR, GIAVSPEJ, AF040387_1, S78059, 150617, XLSEKll, and NEL2 RAT.
Clone DNA65408-1578 was deposited with the ATCC on September 15, 1998, and is assigned ATCC
deposit no. 203217.
EXAMPLE 57: Isolation of cDNA clones Encoding Hnman PRQ1303
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 10 in Example 1 above. This consensus sequence is designated herein "DNA47347". Based on the DNA47347 consensus sequence and its homology to an Incyte EST within the assembly from which DNA47347 was derived, Incyte clone 1430305 (from an ileum tissue library) was purchased and sequenced in full. The sequence encoding PR01303 was thereby identified.
The entire coding sequence of PR01303 is shown in Figure 107 (SEQ ID NO: 193). Clone DNA65409-15 1566 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 121-123 and an apparent stop codon at nucleotide positions 865-867. Tbe predicted polypeptide precursor is 248 amino acids long. The signal peptide is at about amino acids 1-17 of SEQ ID NO: 194. The locations of N-glycosylation sites, active and conserved regions and domains are further indicated in Figure 194. Clone DNA6S409-1566 has been deposited with ATCC and is assigned ATCC deposit no. 203232. The full-length 20 PRO1303 protein shown in Figure 108 has an estimated molecular weight of about 26,734 daltons and a pi of about 7.9.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 108 (SEQ ID NO: 194), revealed sequence identity between the PR01303 amino acid sequence and the following Dayhoff sequences (data incorporated herein): 25 AB009849 1, PW08475, AF024605_1, A42048 1, TRY3_RAT, MMAE00066414, TRY1RAT, MMAE000663 4, MMAE000665_2, and MMAE00066412.
EXAMPLE 58: Isolation of cDNA clones Encoding Human PRO 1306
Using the method described in Example 1 above, Incyte EST No. 2449282, also referred to herein as 30 DNA5918, was identified as a sequence of interest having a BLAST score of 70 or greater that did not encode a known protein. From the DNA5918 sequence, a consensus sequence was assembled using BLAST and the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is designated herein as "DNA47399". Based on the DNA47399 consensus sequence, oligonucleotides can be synthesized: 1) to identify by PCR a cDNA library that contains die sequence of interest, and 2) for use as 35 probes to isolate a clone of the full-length coding sequence for PR01306.
The entire coding sequence of PR01306 shown in Figure 109 (SEQ ID NO: 195), was obtained by purchasing Incyte EST no. 2449282, obtaining the cDNA insert and sequencing it in its entirety. Clone
422
DNA65410-1569 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 106-108 and an apparent stop codon at nucleotide positions 556-558. The predicted polypeptide precursor is 150 amino acids long. The full-length PR01306 protein shown in Figure 110 has an estimated molecular weight of about 17,068 daltons, a pi of about 7.29, and a potential N-glycosylation site at about amino acids 131-134.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 110 (SEQ ID NO: 196), revealed significant homology between the PR01306 amino acid sequence and Dayhoff sequence AIF1JHUMAN. Homology was also shown between the PR01306 amino acid sequence and the following Dayhoff sequences: JC4902, BAR1JRAT, AF020281_1, HSU95213_1, TCH3ARATH, LEY14765J, CATR_NAEGR, S35185, and 10 AF065247_1.
Clone DNA65410-1569, was deposited with the ATCC on September 15, 1998 and is assigned ATCC deposit no. 203231.
EXAMPLE 59: Isolation of cDNA clones Encoding Human PRQ1336 15 An EST sequence was identified and entered into a proprietary Genentech database. The EST was blasted against various EST databases. The EST databases included public EST databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA), and proprietary ESTs from Genentech. The search was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)) as a comparison of the ECD protein sequences to a 6 frame 20 translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington).
A consensus DNA sequence encoding PRO1336 was assembled relative to other aligned EST sequences (forming an assembly) using phrap. This consensus sequence is designated herein "DNA43319". Based on the 25 DNA43319 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1336.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 51ATGGAG ATTCCTGCC A ACTTGCCG3' (SEQ ID N0:199); and 30 reverse PCR nrimer 5 'TTGTTGGCATTGAGGAGGAGCAGC3'. (SEQ ID N0:200).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA43319 sequence which had the following nucleotide sequence:
hybridization probe 5 'GAGGGCATCGTCGAAATACGCCTAGAACAGAACTCCATCAAAGCCATCCC3' (SEQ ID N0:201).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01336 gene using the probe oligonucleotide and one of the PCR primers. RNA
423
for construction of the cDNA libraries was isolated from human fetal lung tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO1336 (designated herein as DNA65423-1595 [Figures 111A-B, SEQ ID NO: 198]; and the derived protein sequence for PR01336.
The entire coding sequence of PR01336 is shown in Figures 111A-B (SEQ ID NO:198). Clone 5 DNA65423-1595 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 83-85 and an apparent stop codon at nucleotide positions 4652-4654 of SEQ ID NO: 198. The predicted polypeptide precursor is 1523 amino acids long. The approximate locations of the signal peptide (amino acids 1-27), aspartic acid and asparagine hydroxylation sites, EGF-like domain cystein pattern signature regions, a leucine zipper pattern region, a region conserved in immunoglobulins and major histocompatibility complexes, 10 and N-glycosyiation sites are indicated in Figure 112. Clone DNA65423-1595 has been deposited with the ATCC and is assigned ATCC deposit no. 203227. The full-length PR01336 protein shown in Figure 112 has an estimated molecular weight of about 167,715 daltons and a pi of about 8.06.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 112 (SEQ ID NO: 198), revealed sequence identity 15 between the PRO1336 amino acid sequence and the following Dayhoff sequences (data incorporated herein): SLTTDROME, CEF40E10_1, LCU58977J, AF029779J, FBP1_STRPU, NOTC_XENLA, AC004663_1, XELXDEL l, P_W05835 and HSU77720_1.
KX A MPT -F. fin- Isolation of cDNA clones Encoding Human PRO 1278 20 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein uConsen5230". In addition, the Consen5230 consensus sequence was extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as "DNA44801". Based on the DNA44801 consensus sequence, oligonucleotides were 25 synthesized: 1) to identiiy by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01278.
PCR primers (forward and reverse) were synthesized:
forward PCR primers: GCAGGCTTTGAGGATGAAGGCTGC (44801.fl; SEQ ID N0:204) and CTCATTGGCTGCCTGGTCACAGGC (44801.f2; SEQ ID N0:205)
reverse PCR primers: CCAGTCGGACAGGTCTCTCCCCTC (44801.rl; SEQ ID N0:206) and TCAGTGACCAAGGCTGAGCAGGCG (44801.r2; SEQIDN0:2Q7)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA44801 sequence which had the following nucleotide sequence:
hybridization probe: CTACACTCGTTGCAAACTGGCAAAAATATTCTCGAGGGCTGGCCTGQ44801 .pi; 35 SEQ ID N0:208)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
424
isolate clones encoding the PROI278 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human testis.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01278 (designated herein as DNA66304-1546 [Figure 113, SEQ ID N0.202]; and the derived protein sequence for PRO1278.
TheentirecodingsequenceofPR01278isshowninFigure 113(SEQIDN0:2Q2). Clone DNA66304-
1546 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 141-143 and an apparent stop codon at nucleotide positions 585-587. The predicted polypeptide precursor is 148 amino acids long. The full-length PRO1278 protein shown in Figure 114 has an estimated molecular weight of about 16,623 daltons and a pi of about 8.47. Additional features include a signal peptide sequence at about 10 amino acids 1-19; a potential N-glycosylation site at about amino acids 58-61; an alpha-lactalbumin/lysozyme C signature at about amino acids 94-112; and homolgy with alpha-lactalbumin/lysozyme C at about amino acids 35-59, 67-59 and 112-133.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 114 (SEQ ID N0:203), revealed significant 15 homology between the PROI278 amino acid sequence and the following Dayhoff sequences; LYC1ANAPL, LYC3_ANAPL, and LYC_HUMAN.
Clone DNA66304-1546 was deposited with the ATCC on October 6, 1998, and is assigned ATCC deposit no. 203321.
EXAMPLE 61: Isolation of cDNA clones Encoding Human PRO 1298
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from an Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One 25 or more of the ESTs was derived from a diseased prostate tissue libraiy. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 30 designated DNA56389.
In light of the sequence homology between the DNA56389 sequence and an EST sequence contained within an Incyte EST within the assembly from with the consensus sequence was derived, Incyte clone 3355717 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 115 and is herein designated as DNA66511-1563.
The full length clone shown in Figure 115 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 94-96 and ending at the stop codon found at nucleotide positions 1063-1065 (Figure 115; SEQ ID N0:209). The predicted polypeptide precursor (Figure 116, SEQ ID N0:210)
425
is 323 amino acids long. The signal peptide is at about amino acids 1-15 of SEQ ID N0:210. PR01298 has a calculated molecular weight of approximately 37,017 daltons and an estimated pi of approximately 8.83. Clone DNA66511-1563 was deposited with the ATCC on September 15, 1998 and is assigned ATCC deposit no. 203228.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 5 alignment analysis of the full-length sequence shown in Figure 116 (SEQIDNO:210), revealed sequence identity between the PR01298 amino acid sequence and the following Dayhoff sequences (data incorporated herein): ALG2_YEAST, CAPM_STAAU, C69098, C69255, SUS2 MAIZE, A69143, S74778, AB009527J3, AF050103_2 and BBA224769_1.
EXAMPLE 62: Isolation of cDNA clones Encoding Human PRQ1301
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ® database, designated Incyte Cluster No. 93492, also referred herein as "DNA10591". This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 15 (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). One or more of the ESTs was derived from a 20 cDNA library constructed from RNA isolated from lung tissue removed from a male with adenocarcinoma. The consensus sequence obtained therefrom is herein designated "DNA57725".
In light of the sequence homology between the DNA57725 sequence and an EST sequence contained within the EST no. 3395984, the EST clone 3395984 was purchased and the cDNA insert was obtained and sequenced in its entirety. The sequence of this cDNA insert is shown in Figure 117 and is herein designated as 25 "DNA66512-1564".
The full length clone shown in Figure 117 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 43 to 45 and ending at the stop codon found at nucleotide positions 1429 to 1431 (Figure 117; SEQIDNO:211). The predicted polypeptide precursor (Figure 118, SEQ ID NO:212) is 462 amino acids long. Other features of the PR01301 protein include: a signal sequence at about 30 amino acids 1-18; a transmembrane domain at about amino acids 271-290; a cytochrome P450 homologous region at about amino acids 134-462; and potential N-glycosylation sites at about amino acids 94-97,217-220, and 246-249. PR01301 has a calculated molecular weight of approximately 52,432 daltons and an estimated pi of approximately 6.14. Clone DNA66512-1564 was deposited with the ATCC on September 15, 1998 and is assigned ATCC deposit no. 203218.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 118 (SEQ ID NO:212), revealed some homology between the PR01301 amino acid sequence and the following Dayhoff sequences: PSU29243_1, A69975,
426
ATAC00448418, D78607_l, CEB0331_1, HUMCYTI1IA_], AF014800_1, CELT13C5_4, CELC45H4_14, and CEC54E10J.
EXAMPT .F. 61: Isolation of cDNA clones Encoding Human PRQ1268
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 5 cluster sequence from the LIFESEQ* database, designated EST No. 8879. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed Miing the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 10 of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Piil Green, University of Washington, Seattle, Washington). One or more of the ESTs was derived from a cI>NA library constructed from human brain tumor tissue taken from a cerebral meninges lesion. The consensus sequence obtained therefrom is herein designated DNA56258.
In light of the sequence homology between the DNA56258 sequence and an EST sequence contained within the Incyte EST no. 2944341, EST clone no. 2944541 vas purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 119 and is herein designated as "DNA66519-1535".
Tbe full length clone shown in Figure 119 contained a single open reading frame with an apparent 20 translational initiation site at nucleotide positions 89 to 91 and ending at the stop codon found at nucleotide positions 509 to 511 (Figure 119; SEQ ID NO:213). The predicted polypeptide precursor (Figure 120, SEQ ID NO:214) is 140 amino acids long. PRO 1268 has a calculated molecular weight of approximately 15,503 daltons and an estimated pi of approximately 6.44. Additional features include a type II transmembrane domain at about amino acids 12-28; type I transmembrane domains at a.bout amino acids 51-66 and 107-124; a potential 25 N-glycosylation site at about amino acids 79-82, and a region having homology with G-protein coupbled receptors at about amino acids 59-99.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 120 (SEQ ID NO:214), revealed some homology between the PR01268 amino acid sequence and Dayhoff sequence no. CEF39B2 9. However, the percent 30 sequence identity was determined to not be significant.
Clone DNA66519-1535 was deposited with the ATCC on September 15, 1998 and is assigned ATCC deposit no. 203236.
EXAMPLE 64: Isolation of cDNA clones Encoding Human PRO 1269 35 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the LIFESEQ® database, designated EST Cluster No. 101920. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases
427
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enrvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 5 Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56509.
In light of the sequence homology between the DNA56509 sequence and an EST sequence contained within the EST no. 103157, EST clone no.103157 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 121 and is herein designated as DNA66520-1536.
The full length clone shown in Figure 121 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 26-29 and ending at ihe stop codon found at nucleotide positions 614-616 (Figure 121; SEQ ID NO:215). The predicted polypeptide precursor (Figure 122, SEQ ID NO:216) is 196 amino acids long, with a signal peptide located at about amino acids 1-20. There is a potential N-glycosyiation site at about amino acids 112-115. PR01269 has a calculated molecular weight of approximately 15 21,731 daltons and an estimated pi of approximately 8.97.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 122 (SEQ ID NO:216), revealed significant homology between the PR01269 amino acid sequence and the amino acid sequence of Dayhoff sequence no. P_W23722. In addition, sequence homology was found between the PR01269 amino acid sequences and the 20 amino acid sequences of the following Dayhoff sequences: MMTAG7_1, MTV026 16, NAAA BPT3, S75616J, and NCP_PIG.
Clone DNA66520-1536 was deposited with the ATCC on September 15,1998, and is assigned ATCC deposit no. 203226.
EXAMPLE 65: Isolation of cDNA clones Encoding Human PRQ1327
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 173410. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 30 CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56520. 35 In light of the sequence homology between the DNA56S20 sequence and an EST sequence contained within the Incyte EST clone no. 3451760, the Incyte EST clone no. 3451760 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 123 and is herein designated
428
as DNA66521-1583.
Clone DNA66521-1583 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 55-57 and ending at the stop codon at nucleotide positions 811-813 (Figure 123). The predicted polypeptide precursor is 252 amino acids long (Figure 124). The full-length PR01327 protein shown in Figure 124 has an estimated molecular weight of about 28,127 daltons and a pi of about 8.91. Analysis of 5 the full-length PR01327 sequence shown in Figure 124 (SEQ ID NO:218) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 14, potential N-glycosylation sites from about amino acid 62 to about amino acid 65, from about amino acid 127 to about amino acid 130, from about amino acid 137 to about amino acid 140 and from about amino acid 143 to about amino acid 146 and a 2-oxo acid dehydrogenase acyltransferase homology block from about amino acid 61 to about amino acid 71. Clone 10 DNA66521-1583 has been deposited with ATCC on September 15, 1998 and is assigned ATCC deposit no. 203225.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 124 (SEQ ID NO:218), evidenced significant homology between the PR01327 amino acid sequence and the following Dayhoff sequences: NPH1_RAT, 15 NPH2_MOUSE, OTU_DROME, D40750, BB61RABIT, P_R23873, P_W09643, CAGHMGPA_1, HUMPRP11_1 and S670958J.
EXAMPLE 66: Isolation of cDNA clones Encoding Human PRQ1382
Using the method described in Example 1 above, Incyte EST no. 2719 was identified as a sequence of 20 interest having a BLAST score of 70 or greater that does not encode a known protein. The nucleotide sequence of EST no. 2719 is designated herein "DNA42842". Based on the DNA42842 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01382.
PCR primers (forward and reverse) were synthesized:
forward PCR primer ACGGCTCACCATGGGCTCCG (42842.fl; SEQ ID NO:221)
reverse PCR primer AGGAAGAGGAGCCCTTGGAGTCCG (42842.rl; SEQ ED NO:222)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA42842 sequence which had the following nucleotide sequence:
hybridization probe CGTGCTGGAGGGCAAGTGTCTGGTGGTGTGCGACTCGAAC (42842.pl; SEQ ID 30 NO:223).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01382 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from a human breast carcinoma. 35 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PR01382 (designated herein as DNA66526-1616 [Figure 125, SEQ ID NO:219]; and the derived protein sequence for PR01382.
429
The entire coding sequence of PR01382 is shown in Figure 125 (SEQ ID NO:219). Clone DNA66526-1616 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 337-339 and an apparent stop codon at nucleotide positions 940-942. The predicted polypeptide precursor is 201 amino acids long. The full-length PR01382 protein shown in Figure 126 has an estimated molecular weight of about 21,808 daltons and a pi of about 9.04. Additional features include a signal peptide at about amino acids 5 1-27; potential N-glycosylation sites at about amino acids 29-32 and 88-91; and regions of homology with Clq proteins at about amino acids 92-126, 159-178, and 191-200.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 126 (SEQ ID N0:220), revealed significant homology between the PR01382 amino acid sequence Dayhoff sequence no. CERL RAT. Homology was also 10 revealed between the PR01382 amino acid sequence and the following Dayhoff sequences: CERB HUMAN, S76975_l, A41752, HUMC1QB2_1, A57131, CA1A_HUMAN, ACR3_MOUSE, and COLELEPMA.
Clone DNA66526-1616 has been deposited with ATCC and is assigned ATCC deposit no. 203246.
EXAMPLE 67: Isolation of cDNA clones Encoding Human PRO 132ft IS Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 40671. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identiiy existing homologies. The homology search was performed using the computer program BLAST 20 or BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56749.
In light of the sequence homology between the DNA56749. sequence and an ESt sequence contained 25 within the Incyte EST clone no. 4111192, the Incyte EST clone no. 4111192 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 127 and is herein designated as DNA66658-1584.
Clone DNA66658-1584 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 9-11 and ending at the stop codon at nucleotide positions 780-782 (Figure 127). The 30 predicted polypeptide precursor is 257 amino acids long (Figure 128). The full-length PROI328 protein shown in Figure 128 has an estimated molecular weight of about 28,472 daltons and a pi of about 9.33. Analysis of the full-length PR01328 sequence shown in Figure 128 (SEQ ID NO:225) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 19, transmembrane domains from about amino acid 32 to about amino acid 51, from about amino acid 119 to about amino acid 138, from about amino 35 acid 152 to about amino acid 169 and from about amino acid 216 to about amino acid 235, a glycosaminoglycan attachment site from about amino acid 120 to about amino acid 123 and sodrum/rmerotransmitter symporter family protein homology block from about amino acid 31 to about amino acid 65. Clone DNA66658-1584 has
430
been deposited with ATCC on September 15, 1998 and is assigned ATCC deposit no. 203229.
An analysis of tbe Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 128 (SEQ ID NO:225), evidenced significant homology between the PR01328 amino acid sequence and the following Dayhoff sequences: CEVF36H2L 2, TIP2_TOBAC, AB009466 16, ATU39485_1, P_R60153, P_R77082, S73351, C69392, LEU95008_1 and 5 E64667.
EXAMPLE 68: Isolation of cDNA clones Encoding Human PRQ1325
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 139524. This EST cluster 10 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 15 assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56115.
In light of the sequence homology between the DNA56115 sequence and an EST sequence contained within the Incyte EST clone no. 3744079, the Incyte EST clone no. 3744079 was purchased and the cDN A insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 129 and is herein designated 20 as DNA66659-1593.
Clone DNA66659-1593 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 51-53 and ending at the stop codon at nucleotide positions 2547-2549 (Figure 129). The predicted polypeptide precursor is 832 amino acids long (Figure 130). The full-length PR01325 protein shown in Figure 130 has an estimated molecular weight of about 94,454 daltons and a pi of about 6.94. Analysis 25 of the full-length PR01325 sequence shown in Figure 130 (SEQ ID NO:227) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 18, transmembrane domains from about amino acid 292 to about amino acid 317, from about amino acid 451 to about amino acid 470, from about amino acid 501 to about amino acid 520, from about amino acid 607 to about amino acid 627 from about amino acid 751 to about amino acid 770, a leucine zipper pattern sequence from about amino acid 497 to about amino acid 30 518 and potential N-glycosylation sites from about amino acid 27 to about amino acid 30, from about amino acid 54 to about amino acid 57, from about amino acid 60 to about amino acid 63, from about amino acid position 123 to about amino acid position 126, from about amino acid position 141 to about amino acid position 144, from about amino acid position 165 to about amino acid position 168, from about amino acid position 364 to about amino acid position 367, from about amino acid position 476 to about amino acid position 479, from about 35 amino acid position 496 to about amino acid position 499, from about amino acid position 572 to about amino acid position 575, from about amino acid position 603 to about amino acid position 606 and from about amino acid position 699 to about amino acid position 702. Clone DNA66659-1593 has been deposited with ATCC on
431
September 22,1998 and is assigned ATCC deposit no. 203269.
An analysis of die Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 130 (SEQ ID NO:227), evidenced significant homology between the PRO1325 amino acid sequence and the following Dayhoff sequences: CELR04E51, CELZK721 5, CELC30E1_5, CELC30E1_6, CELC30E12, CEY37H2C_1, CELC30E17, CELT07H8_7 and E64006.
EXAMPLE 69: Isolation of cDNA clones Encoding Human PRQ1340
Using the method set forth in Example 1 above, Incyte EST no. 878906 was identified as a sequence of interest having a BLAST score of 70 or greater that does not encode a known protein. The nucleotide sequence of EST no. 878906 is designated herein "DNA42809", Based on the DNA42809 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01340.
PCR primers (forward and reverse) were synthesized:
forward PCR primer TCCAGGTGGACCCCACTTCAGG (42809.fl; SEQ ID N0:430)
reverse PCR nrimer GGGAGGCTTATAGGCCCAATCTGG (42809.rl; SEQ ID NO:431)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA42809 sequence which had the following nucleotide sequence:
hybridization nrobe GGCTTCAGCAGCACGTGTGAAGTCGAAGTCGCAGTCACAGATATCAATGA (42809.pl; SEQ ID NO:432)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01340 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01340 (designated herein as DNA66663-1598 [Figure 131, SEQ ID NO:228]; and the derived protein sequence for PR01340.
The entire coding sequence of PR01340 is shown in Figure 131 (SEQ ID NO:228). Clone DNA66663-1598 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 128-130 and an apparent stop codon at nucleotide positions 2549-2551. The predicted polypeptide precursor is 807 amino acids long. The full-length PR01340 protein shown in Figure 132 has an estimated molecular weight of about 87,614 daltons and a pi of about 4.83. Additional features include: a signal peptide at about amino acids 1-18; a transmembrane domain at about amino acids 762-784; a cell attachment sequence at about amino acids 492-494; potential N-glycosylation sites at about amino acids 517-520, 602-605 and 700-703; and cadherin extracellular repeat domains at about amino acids 307-351, 324-348, 67-103, 97-141 and 114-138.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 132 (SEQ ID NO:229), revealed significant homology between the PR01340 amino acid sequence and Dayhoff sequence no. 146536. Homology was also
432
intellectual property ofrce of m2
24 OCT 2003 received
revealed between the PR01340 amino acid sequence and the following Dayhoff sequences: S55396, RATPDRPT_1, CADD_CHICK, CADl_CHICK, CADB CHICK, 150180, CAD4_CHICK, G02878, and DSC1MOUSE.
Clone DNA66663-1598 has been deposited with ATCC and is assigned ATCC deposit no. 203268.
EXAMPLE 70: Isolation of cDNA clones Encoding Human PRQ1339
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA40652". Within the consensus sequence assembly was Incyte EST 2479394. Based on the consensus sequence and other discoveries and information provided herein, the clone including Incyte EST 2479394 was purchased and sequenced in full. 10 Sequencing provided the nucleic acid sequence shown in Figure 133 which includes the sequence encoding PR01339.
Clone DNA66669-1597 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 9-11 and an apparent stop codon at nucleotide positions 1272-1274 of SEQ ID NO:233. The predicted polypeptide precursor is 421 amino acids long. The signal peptide is at about amino 15 acids 1-16 of SEQ ID NO:234. The region conserved in zinc carboxypeptidases and the N-glycosylation site are indcated in Figure 134. Clone DNA66669-1597 has been deposited with the ATCC and is assigned ATCC deposit no. 203272. The full-length PR01339 protein shown in Figure 134 has an estimated molecular weight of about 47,351 daltons and a pi of about 6.61.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 20 alignment analysis of the full-length sequence shown in Figure 134 (SEQ ID NO:234), revealed sequence identity between the PRO1339 amino acid sequence and the following Dayhoff sequences (data incorporated herein): PJW01505, CBP1JHUMAN, HSA224866_1, P_R90293, YHT2_YEAST, CEF02D8_4, CEW01A8_6, P_W36815, HSU83411_1 and CBPNHUMAN.
EXAMPLE 71: Isolation of cDNA clones Encoding Human PRQ1337
Using the method described in Example 1 above, a single Incyte EST was identified (EST No. 1747546) and also referred to herein as "DNA4417". To assemble a consensus sequence, repeated cycles of BLAST and phrap were used to extend the DNA4417 sequence as far as possible using the sources of EST sequences discussed above. The consensus sequence is designated herein as "DNA45669". Based on the DNA45669 30 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01337.
PCR primers (forward and reverse) were synthesized:
forward PCR primers: CAACCATGCAAGGACAGGGCAGG (45669.fl; SEQ ID NO:237) and 35 CTTTGCTGTTGGCCTCTGTGCTCCCAACCATGCAAGGACAGGGCAGG (45669.rl; SEQ ID NO:238); reverse PCR primers: TGACTCGGGGTCTCCAAAACCAGC (45669.rl; SEQ ID NO:239) and GGTATAGGCGGAAGGCAAAGTCGG (45669.r2; SEQ ID N0:240);
433
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA45669 sequence which had the following nucleotide sequence:
hvbridizationnrobe: GGC ATCTT ACCTTT ATGG AGT ACTCTTTGCTGTTGGCCTCTGTGCTCCr45669.pl: SEQ ID NO:241).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 5 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01337 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01337 (designated herein as DNA66672-1586 [Figure 135, SEQ ID NO:235]; and the derived protein 10 sequence for PRO 1337.
TheentirecodingsequenceofPR01337isshowninFigure 135(SEQIDN0:235). CloneDNA66672-1586 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 60-62 and an apparent stop codon at nucleotide positions 1311-1313. The predicted polypeptide precursor is 417 amino acids long. The full-length PR01337 protein shown in Figure 136 has an estimated molecular weight of 15 about 46,493 daltons and a pi of about 9.79.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 136 (SEQ ID N0:236) revealed significant homology between the PR01337 amino acid sequence and the Dayhoff sequence THBG_HUMAN. Homology was also found between the PR01337 amino acid sequence and the following Dayhoff sequences: 20 KAINHUMAN, HSACT1_1, IPSP_HUMAN, G02081, HAMHPP_1, CPI6_RAT,S31507, AB000547_1, and KBP_MOUSE.
Clone DNA66672-1586 was deposited with the ATCC on September 22, 1998, and is assigned ATCC deposit no. 203265.
EXAMPLE 72: Isolation of cDNA clones Fnnnriing Human PRO 1342
A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated DNA43203. The DNA43203 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and proprietary EST DNA databases (LIFESEQTM, Incyte Pharmaceuticals, Palo Alto, CA; Genentech, South San Francisco, CA) to identify 30 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is designated herein as "DNA48360". 35 Based on the DNA48360 sequence, oligonucleotide probes were generated and used to screen a human esophageal tissue library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science.
434
253:1278-1280 (1991)), and tbe cDNA size cut was less than 2800 bp.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer: 5'-GAAGCACCAGCCTTTATCTCTTCACC-3' (48360.fl; SEQ ID NO:244)
reverse PCR primer: 5'- GTCAGAGTTGGTGGCTGTGCTAGC-3' (48360.rl; SEQ ID NO:245) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48360 sequence 5 which had the following nucleotide sequence:
hybridization probe:
'GGACCCAGGCATCTTGCTTTCCAGCCACAAAGAGACAGATGAAGATGC-3 (48360.pl; SEQ ID NO:246)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 10 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01342 gene using the probe oligonucleotide and one of the PCR primers.
A full length clone was identified that contained a single open reading frame with an apparent translational initiation site at nucleotide positions 239-241, and a stop signal at nucleotide positions 2027-2029 (Figure 137; SEQ ID NO:242). The predicted polypeptide precursor is 596 amino acids long has a calculated 15 molecular weight of approximately 57,173 daltons and an estimated pi of approximately 4.82. Additional features include: signal sequence at about amino acids 1-20; a transmembrane domain at about aminn acids 510-532; a potential N-glycosylation site at about amino acids 25-28; a glycosaminoglycan attachment site at about amino acids 325-328; and bacterial ice-nucieation protein octamer repeats at about amino acids 284-337,404-457,254-307, 359-412, 194-247, 239-292, 299-352, 134-187, 314-367, and 164-217.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 138 (SEQ ID NO:243), evidenced some homology between the PR01342 amino acid sequence and the following Dayhoff sequences: CELZC178 2, LMSAP2GN_1, D88734_, AMYH_YEAST, MMDSPPGJ, VGLXHSVEB, S52714, CELF59A6_5, CELK06A9 3, and YM96 YEAST.
Clone DNA66674-1599 was deposited with the ATCC on September 22,1998, and is assigned ATCC
deposit no. 203281.
EXAMPLE 73: Isolation of cDNA clones Encoding Human PRQ1343
A cDNA sequence isolated in the amylase screen described in Example 2 above was found, by the WU-30 BLAST2 sequence alignment computer program, to have no significant sequence identity to any known human encoding nucleic acid. This cDNA sequence is herein designated DNA48921. Probes were generated from the sequence of the DNA48921 molecule and used to screen a human smooth muscle cell tissue library prepared as described in paragraph 1 above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science. 253:1278-1280 (1991)), and the cDNA size cut was less 35 than 2800 bp.
The oligonucleotide probes employed were as follows:
forward PCR primer (48921.fl) 5'-CAATATGCATCTTGCACGTCTGG-3' (SEQ ID NO:249)
435
reverse PCR primer (48921.rl) 5,-AAGCTTCTCTGCTTCCTTTCCTGC-3, (SEQ ID N0:250) hybridization probe (48921 .p I)
' -TGACCCCATTGAGAAGGTC ATTG AAGGGATCAACCGAGGGCTG-3' (SEQ ID N0.251)
A full length clone was identified that contained a single open reading frame with an apparent translational initiation site at nucleotide positions 71-73 and a stop signal at nucleotide positions 812-814 (Figure 5 139, SEQ ID NO:247). The predicted polypeptide precursor is 247 amino acids long, has a calculated molecular weight of approximately 25,335 daltons and an estimated pi of approximately 7.0. Analysis of the full-length PR01343 sequence shown in Figure 140 (SEQ ID NO:248) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 25 and a homologous region to circumsporozoite repeats from about amino acid 35 to about amino acid 225: Clone DNA66675-1587 has been deposited with ATCC on 10 September 22, 1998 and is assigned ATCC deposit no. 203282.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 140 (SEQ ID NO:248), evidenced significant homology between the PR01343 amino acid sequence and the following Dayhoff sequences: CSP_PLACC, CEF25H82, U88974_40, BNAMRNAA_1, BOBOPC3_l, S58135, AF061832_1, BHU52040_1, 15 HUMPROFILEl and MTV023J4.
Additionally, an Incyte EST clone (Incyte EST clone no. 4701148) having homology to the DNA48921 sequence was obtained and the insert sequenced, thereby giving rise to the DNA66675-1587 sequence shown in Figure 139.
EXAMPLE 74: Isolation of cDNA clones Encoding Human PRQ1480
Using the methods described in Example 1 above, Incyte EST Nos. 550415 and 1628847 were identified as sequences of interest having BLAST scores of 70 or greater that did not encode known proteins. These sequences were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is designated herein as "DNA1395". 25 In addition, the "DNA 1395" consensus sequence was extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as "DNA40642". Based on the DNA40642 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01480. 30 PCR primers (forward and reverse) were synthesized:
forward PCR primer: AGCCCGTGCAGAATCTGCTCCTGG (40642.fl; SEQ ID N0.254)
reverse PCR primers: TGAAGCCAGGGCAGCGTCCTCTGG (40642.rl; SEQ ID NO:255);
GTACAGGCTGCAGTTGGC (40642.r2; SEQ ID N0:256)
Additionally, synthetic oligonucleotide hybridization probes were constructed from the consensus 35 DNA40642 sequence which had the following nucleotide sequence:
hvbridizationprobes: AGAAGCCATGTGAGCAAGTCCAGTTCCAGCCCAACACAGTG(40642.pl; SEQ ID NO-.257); GAGCTGCAGATCTTCTCATCGGGACAGCCCGTGCAGAATCTGCTC (40642.p2; SEQ ID
436
NO:258).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01480 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PRO1480, designated herein as DNA67962-1649 [Figure 141, SEQ ID NO.-252]; and the derived protein sequence for PR01480.
The entire coding sequence of PRO1480 is shown in Figure 141 (SEQIDNO-.252). Clone DNA67962-1649 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 10 241-243 and an apparent stop codon at nucleotide positions 27S2-27S4. The predicted polypeptide precursor is 837 amino acids long. The full-length PRO1480 protein shown in Figure 142 has an estimated molecular weight of about 92,750 daltons and a pi of about 7.04. Additional features include: transmembrane domains at about amino acids 23-46 (type II) and 718-738; potential N-glycosylation sites at about amino acids 69-72,96-99, 165-168, 410-413, 525-528, and 630-633; and a leucine zipper pattern at about amino acids 12-33. 15 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 142 (SEQ ID NO:253), revealed significant homology between the PR01480amino acid sequence and Dayhoff sequence 148746. Homology was also shown between the PR01480 amino acid sequence and the following Dayhoff sequences: S66498; P_W17658; MMU69535_1; HSU60800_1; 148745; A49069; 148747; GGU28240_1; and AF022946_1. 20 Clone DNA67962-1649 has been deposited with ATCC and is assigned ATCC deposit no. 203291.
EXAMPLE 75: Isolation of cDNA clones Encoding Human PRQ1487
A single Merck EST, HSC2ID011, referred herein as "DNA8208", was identified as an EST of interest having a BLAST score of 70 or greater that did not encode a known protein as described in Example 1 above. 25 The DNA8208 sequence was extended using repeated cycles of BLAST and the program "phrap" (Phil Green, University of Washington, Seattle, Washington) to extend the sequence as far as possible using the sources of EST sequences discussed above. The resulting consensus sequence is designated herein as "DNA68836". Based on the DNA68836 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 30 sequence for PR01487.
PCR primers (forward and reverse) were synthesized:
forward PCR primer: GTGCCACTACGGGGTGTGGACGAC (54209.fl; SEQ ID NO:261) and reverse PCR primer TCCCATTTCTTCCGTGGTGCCCAG (54209.rl; SEQ ID NO:262)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 35 DNA68836 sequence which had the following nucleotide sequence:
hybridization probe CCAGAAGAAGTCCTTCATGATGCTCAAGTACATGCACGACCACTAC (54209.pl; SEQ ID NO:263)
437
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01487 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 5 PRO1487 (designated herein as DNA68836-1656 (Figures 143A-B; SEQ ID NO:259) and the derived protein sequence for PR01487 (Figure 144; SEQ ID N0:260).
The entire coding sequence of PR01487 is shown in Figures I43A-B (SEQ ID NO:259). Clone DNA68836-1656 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 489-491 and an apparent stop codon at nucleotide positions 289S-2897. The predicted polypeptide 10 precursor is 802 amino acids long The full-length PR01487 protein shown in Figure 144 has an estimated molecular weight of about 91,812 daltons and a pi of about 9.S2. Additional features include a signal peptide at about amino acids 1-23; potential N-glycosylation sites at about amino acids 189-192, 623-626, and 796-799; and a cell attachment sequence at about amino acids 62-64.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 15 alignment analysis of the full-length sequence shown in Figure 144 (SEQ ID N0:260), revealed significant homology between the PR01487 amino acid sequence and the following Dayhoff sequences: CET24D1_1, S44860, CELC02H61, CEC38H2_3, CELC17A2_5, CET09E11_10, CEE03H4_3, CELT22B113, GGU82088_1, and CEF56H6_1.
Clone DNA68836-1656 was deposited with the ATCC on November 3,1998, and is assigned ATCC 20 deposit no. 203455.
EXAMPLE 76: Isolation of cDNA clones Encoding Human PRQ1418
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from an Incyte database. This EST cluster sequence was then compared to a variety of 25 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ9, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One or more of the ESTs was derived from a placenta tissue library. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 30 proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA58845.
In light of the sequence homology between the DNA58845 sequence and an EST included in Incyte clone 1306026, that clone was purchased and the cDNA insert was obtained and sequenced. The sequence of 35 this cDNA insert is shown in Figure 145 and is herein designated as DNA68864-1629.
The full length clone shown in Figure 145 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 138-140 and ending at the stop codon found at nucleotide
438
positions 1188-1190 (Figure 145; SEQ ID NO:264). The predicted polypeptide precursor (Figure 146, SEQ ID NO:265) is 350 amino acids long with a signal peptide at about amino acids 1-19 of SEQ ID NO:265. PR01418 has a calculated molecular weight of approximately 39,003 daltons and an estimated pi of approximately 5.59. Clone DNA68864-1629 was deposited with the ATCC on September 22, 1998 and is assigned ATCC deposit no. 203276.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 146 (SEQ ID NO:265), revealed sequence identity between the PR01418 amino acid sequence and the following Dayhoff sequences (data incorporated herein): AGA1_HAEIN (immunoglobulin al protease precursor), P_W03740, CELT23E7_1, SSN6_YEAST, MMPININ_1, AB00993 1, P_R52601, S22624, A10377_l and MUA1_XENLA.
EXAMPLE 77: Isolation of cDNA clones Encoding Human PRQ1472
An Incyte sequence was identified and put in a computer to determine whether it had homology with other proteins in databases. The EST databases included public EST databases (e.g., GenBank), and the proprietary EST database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed 15 using the computer program BLAST or BLAST2 [Altschul etal., Methods in Enzvmologv. 266:460-480 (1996)1 as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington).
A consensus DNA sequence encoding PRO1472 was assembled relative to other EST sequences using phrap. This consensus sequence is designated herein "DNA62824". Based on the DNA62824 consensus sequence and other discoveries and information provided herein, the Incyte clone including EST 1579843 (from a duodenal tissue library) found in the assembly was purchased and sequenced in full.
Sequencing provided the entire coding sequence of PRO1472 as shown in Figure 147 (SEQ ID 25 NO:266). Clone DNA68866-1644 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 134-136 and an apparent stop codon at nucleotide positions 1532-1534 of SEQ ID NO:266. The predicted polypeptide precursor is 466 amino acids long. As indicated in Figure 148, the signal peptide is at about amino acid positions 1-17 and the transmembrane domains are at about positions 131-150 and 235-259 of SEQ ID NO:267. Clone DNA68866-1644 has been deposited with ATCC and is assigned ATCC 30 deposit no. 203283. The full-length PR01472 protein shown in Figure 148 has an estimated molecular weight of about 52,279 daltons and a pi of about 6.16.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 148 (SEQ ID NO:267), revealed sequence identity between the PRO 1472 amino acid sequence and the following Dayhoff sequences (data incorporated herein): 35 BUTTHUMAN, HS45P21_1, HS45P21_3, HS45P21_5, HS45P21_4, HSU90142_1, HSU90546_1, AF033107 1, MMHC135G15 7andHSB73 1.
439
PCT/US99/201I1
EXAMPLE 78: Isolation of cDNA clones Encoding Human PRQ1461
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from die LIFESEQ* database, designated Incyte EST Cluster No. 139103, and also referred to herein as "DNA10747". The DNA10747 sequence was then compared to a variety of EST databases which included public EST databases (e.g., GenBank) and the LIFESEQ* database, to identify existing homologies.
The homology search was performed using the computer program BLAST or BLAST2 (Altshul etal., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). One or more of the ESTs used in the assembly was derived from a library constructed from pancreatic tumor tissue. The consensus 10 sequence obtained therefrom is herein designated "DNA59553".
In ligbt of the sequence homology between the DNAS9S53 sequence and an EST sequence contained within Incyte EST no. 2944541, the EST clone was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 149 and is herein designated as DNA68871-1638.
The full length clone shown in Figure 149 contained a single open reading frame with an apparent 15 translational initiation site at nucleotide positions 32-34 and ending at the stop codon found at nucleotide positions 1301-1303 (Figure 149; SEQ ID NO:268). The predicted polypeptide precursor (Figure 150, SEQ ID NO:269) is 423 amino acids long. PR01461 has a calculated molecular weight of approximately 47,696 daltons and an estimated pi of approximately 8.96. Additional features include: a type II transmembrane domain at about amino acids 21-40; an ATP/GTP-binding site motif A (P-loop) at about amino acids 359-366; a trypsin family histidine 20 active site at about amino acids 228-233; potential N-myristoylation sites at about aminn acids 179-184,213-218, 317-322, and 360-365; and potential N-glycosylation sites at about amino acids 75-78, 166-169 and 223-226.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 150 (SEQ ID NO:269), revealed significant homology between the PR0146I amino acid sequence Dayhoff sequence no. P R89435. Homology was also 25 found to exist between the PRO 1461 amino acid sequence and the following additional Dayhoff sequences: AB002134_1, P_R89430, P_W22987, HEPS_MOUSE, ENTKHUMAN, P_W22986, KAL_MOUSE, ACRO_PIG, p_R57283, and TRY7_ANOGA.
Clone DNA68871-68871 was deposited with the ATCC on September 22,1998, and is assigned ATCC deposit no.203280.
FX A MPT,F. 79: Isolation of cDNA clones Encoding Human PRQ1410
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 98502. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 35 databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a
440
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Gteen, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56451.
In light of the sequence homology between the DNA56451 sequence and an EST sequence contained within the Incyte EST clone no. 1257046, the Incyte EST clone 125046 was purchased and the cDNA insert was 5 obtained and sequenced. The sequence of this cDNA insert is shown in Figure 151 and is herein designated as DN A68874-1622.
Clone DNA68874-1622 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 152-154 and ending at the stop codon at nucleotide positions 866-868 (Figure 151). The predicted polypeptide precursor is 238 amino acids long (Figure 152). The full-length PR01410 protein 10 shown in Figure 152 has an estimated molecular weight of about 25,262 daltons and a pi of about 6.44. Analysis of the full-length PR01410 sequence shown in Figure 152 (SEQ ID NO:27i) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 20, a transmembrane domain from about amino acid 194 to about amino acid 220 and a potential N-glycosylation site from about amino acid 132 to about amino acid 135. Clone DNA68874-1622 has been deposited with ATCC on September 22,1998 and is assigned 15 ATCC deposit no. 203277
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 152 (SEQ ID NO.271), evidenced significant homology between the PR01410 amino acid sequence and die following Dayhoff sequences: 148652, PR76466, HSMHC3W36A_2, EPB4_HUMAN, P_R14256, EPA8_MOUSE, P_R77285, P_W13569, AF000560_1, and 20 ASF1_HELAN.
EXAMPLE 80: Isolation of cDNA clones Encoding Human PRQ1568
A consensus DNA sequence was assembled relative to other EST sequences using phrap to form an assembly as described in Example 1 above. The consensus sequence is designated herein "DNA54208". Based 25 on the DNA54208 consensus sequence, the assembly and other information and discoveries provided herein, a clone including an EST in the assembly was ordered and sequenced. The EST is Incyte 3089490. Sequencing in full gave the sequence shown in Figure 153.
The entire coding sequence of PR01568 is included in Figure 153 (SEQ ID NO:272). Clone DNA68880-1676 contains a single open reading frame with an apparent translational initiation site at nucleotide 30 positions 208-210 and an apparent stop codon at nucleotide positions 1123-1125 of SEQ ID NO:272. The predicted polypeptide precursor is 305 amino acids long. The' signal peptide, transmembrane regions, N-myristoylation and amidation sites axe also indicated in Figure 154. Clone DNA68880-1676 has been deposited with the ATCC and is assigned ATCC deposit no. 203319. The full-length PR01568 protein shown in Figure 154 has an estimated molecular weight of about 35,383 daltons and a pi of about 5.99. 35 An analysis of the Dayhoff database (version 35.45 SwissProt 3S), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 154 (SEQ ID NO:273), revealed sequence identity between the PRO1568 amino acid sequence and the following Dayhoff sequences (incorporated herein):
441
AF089749_1, AF054841_1, NAG2_HUMAN, CD63_HUMAN, CD82_HUMAN, P_W05732, P R86834, A15HUMAN, P W27333 and CD37HUMAN.
EXAMPLE 81: Isolation of cDNA clones Encoding Human PRO 1570
A consensus DNA sequence encoding PR01570 was assembled relative to other EST sequences using 5 phrap as described in Example 1 above to form an assemby. This consensus sequence is designated herein as "DNA65415". Based on the DNA65415 consensus sequence and other discoveries and information provided herein, the clone including Incyte EST 3232285 (from a uterine/colon cancer tissue library) was purchased and sequenced in full which gave SEQ ID NO:274.
The entire coding sequence of PR01570 is included in Figure 155 (SEQ ID NO:274). Clone 10 DNA68885-1678 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 210-212 and an apparent stop codon at nucleotide positions 1506-1508 of SEQ ID NO:274. The predicted polypeptide precursor is 432 amino acids long. Figure 275 shows a number of motifs. Clone DNA68885-1678 has been deposited with the ATCC and is assigned ATCC deposit no. 203311. The full-length PR01570 protein shown in Figure 156 has an estimated molecular weight of about 47,644 daltons and a pi of 15 about 5.18.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 156 (SEQ ID NO:275), revealed sequence identity between the PR01570 amino acid sequence and the following Dayhoff sequences (incorporated herein): P_W22986, TMS2_HUMAN, HEPS_HUMAN, P_R89435, AB002134_1, KAL_MOUSE, ACRO_HUMAN, 20 GEN12917, AF045649_1, and P_W34285.
EXAMPLE 82: Isolation of cDNA clones Encoding Human PRQ1317
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "Consen8865". In addition, the Consen8865 25 consensus sequence was extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as HDNA63334". Based on the DNA63334 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA Library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01317.
PCR primers (forward and reverse) were synthesized:
forward PCR primer: CTGCTGGTGAAATCTGGCGTGGAG (63334.fl; SEQ ID NO:278); and reverse PCR primer: GTCTGGTCCTGGCTGTCCACCCAG (63334.rl; SEQ ID NO:279).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from tbe consensus DNA63334 sequence which had the following nucleotide sequence:
hvbridizationprobe: CATCTTGTCATGTACCTGGGAACCACCACAGGGTCGCTCCACAAG(63334.pl; SEQ ID N0:280).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was
442
PCT/US99/201II
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01317 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human hippocampal tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01317 (designated herein as DNA71166-1685 [Figure 157, SEQ ID NO:276J; and the derived protein 5 sequence for PR01317.
The entire coding sequence of PRO 1317 is shown in Figure 157 (SEQ ID NO:276). Clone DNA71166-1685 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 105-107 and an apparent stop codon at nucleotide positions 2388-2390. The predicted polypeptide precursor is 761 amino acids long and has an estimated molecular weight of about 83,574 daltons and a pi of about 6.78. 10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 158 (SEQ ID NO:277), revealed significant homology between the PR01317 amino acid sequence and Dayhoff sequence no. 148745. Homology was also revealed between the PR01317 amino acid sequence the following Dayhoff sequences: 148746, GEN13418, P_W58540, P 217657, MUSC1J, P_471380, U73167_5, HSU33920_1, and GG828240_1. 15 Clone DNA71166-1685 was deposited with the ATCC on October 20, 1998, and is assigned ATCC
deposit no. 203355.
EXAMPLE 83: Isolation of cDNA clones Encoding Human PRQ1780
The DNA63837.init sequence was obtained as described in Example 1 above and was extended using 20 repeated cycles of BLAST and the program "phrap" (Phil Green, University of Washington, Seattle) to extend the consensus sequence as far as possible using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as "DNA63837". Based on the DNA63837 consensus sequence, oligonucleotides were synthesized: 1) to identiiy by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01780. 25 PCR primers (forward and reverse) were synthesized:
forward PCR primer: TGCCTTTGCTCACCTACCCCAAGG (63837.fl; SEQ ID NO:283)
reverse PCR primer: TCAGGCTGGTCTCCAAAGAGAGGG (63837.rl; SEQ ID NO:284)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA63837 sequence which had the following nucleotide sequence:
hybridization probe: CCCAAAGATGTCCACCTGGCTGCAAATGTGAAAATTGTGGACTGG (63837.pl; SEQ ID NO:285)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01780 gene using the probe oligonucleotide and one of the PCR primers. RNA 35 for construction of the cDNA libraries was isolated from a human fetal kidney.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01780 (designated herein as DNA71169-1709 [Figure 159, SEQ ID NO:281]; and the derived protein
443
sequence for PR01780.
The entire coding sequence of PR01780 is shown in Figure 159 (SEQ ID NO:281). Clone DNA71169-1709 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 68-70 and an apparent stop codon at nucleotide positions 1637-1639. The predicted polypeptide precursor is 523 amino acids long. The full-length PR01780 protein shown in Figure 160 has an estimated molecular weight of 5 about 59,581 daltons and a pi of about 8.68. Additional features include a signal peptide sequence at about amino acids 1 • 19; a transmembrane domain at about amino acids 483-504; tyrosine phosphorylation sites at about amino acids 68-74 and 425-433; N-myristoylation sites at about amino acids 16-21,301-206,370-375, and 494-499; and a leucine zipper pattern at about amino acids 493-514.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 10 alignment analysis of the full-length sequence shown in Figure 160 (SEQ ID NO:282), revealed significant homology between the PR01780 amino acid sequence and the following Dayhoff sequences: UDA2RABIT, CGTHUMAN, UD11_HUMAN, P_R26153, UDB1JRAT, HSU59209_1, AB010872_1, UDB5_MOUSE, UDB8_HUMAN, and UD14JHUMAN.
Clone DNA71169-1709 was deposited with the ATCC on November 17,1998, and is assigned ATCC 15 deposit no. 203467.
EXAMPLE 84: Isolation of cDNA clones Encoding Human PRQ1486
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA48897". Based on the DNA48897 20 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01486.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 5' AGGCAGCCACCAGCTCTGTGCTAC3' (SEQ ID NO:288); and 25 reverse PCR primer 5'CAGAGAGGGAAGATGAGGAAGCCAGAG3* (SEQ ID NO:289).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA48897 sequence which had the following nucleotide sequence:
hybridization probe 5'CTGTGCTACTGCCCTTGGACCCTGGGGACCGAGTGTCTCTGC3' (SEQ ID N0:290).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01486 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from a human adenocarcinoma cell line.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 35 PROI486 and the derived protein sequence for PR01486.
The entire coding sequence of PR01486 is included in Figure 161 (SEQ ID NO:286). Clone DNA71180-1655 contains a single open reading frame with an apparent translational initiation site at nucleotide
444
positions 472-474 and an apparent stop codon at nucleotide positions 1087-1089 of SEQ ID NO:286. The predicted polypeptide precursor is 205 amino acids long. The signal peptide is at about amino acids 1-32 of SEQ ID NO:287. Regions similar to those of Clq and an N-glycosylationi site are located as indicated in Figure 162. Clone DNA71180-1655 has been deposited with the ATCC and is assigned ATCC deposit no. 203403. The full-length PR01486protein shown in Figure 162 has an estimated molecular weight of about21,521 daltons 5 and a pi of about 7.07.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 162 (SEQ ID NO:287), revealed sequence identity between the PR01486 amino acid sequence and the following Dayhoff sequences: CERB_HUMAN, CERLJRAT, GEN11893, P_R22263, CA18_HUMAN, C1QCHUMAN, AF054891_1, A57131, 10 HUMClQb2_l, ACR3MOUSE.
EXAMPLE 85: Isolation of cDNA clones Encoding Human PRQ1433
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA45230. Based on the DNA45230 15 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01433.
PCR primers (forward and reverse) were synthesized:
forward PCR primer f45230.fl) 5,-GCTGACCTGGTTCCCATCTACTCC-3, (SEQ ID NO:293) 20 reverse PCR primer (45230.rl) 5' -CCCACAGACACCCATGACACTTCC-3' (SEQ ID NO:294)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA45230 sequence which had the following nucleotide sequence hybridization probe (4S230.pl)
'-AAGAATGAATTGTACAAAGCAGGTGATCTTCGAGGAGGGCTCCTGGGGCC-3' (SEQ ID NO-.295) 25 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01433 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human adrenal gland tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 30 PR01433 (designated herein as DNA71184-1634 [Figure 163, SEQ ID NO:291]; and the derived protein sequence for PRO1433.
The entire nucleotide sequence of DNA71184-1634 is shown in Figure 163 (SEQ ID NO:291). Clone DNA71184-1634 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 185-187 and ending at the stop codon at nucleotide positions 1349-1351 (Figure 163). The predicted 35 polypeptide precursor is 388 amino acids long (Figure 164). The full-length PRO1433 protein shown in Figure 164 has an estimated molecular weight of about 43,831 daltons and a pi of about 9.64. Analysis of the full-length PR01433 sequence shown in Figure 164 (SEQ ID NO:292) evidences the presence of the following: a
445
transmembrane domain from about amino acid 76 to about amino acid 97, potential N-glycosylation sites from about amino acid 60 to about amino acid 63, from about amino acid 173 to about amino acid 176 and from about amino acid 228 to about amino acid 231 and potential N-myristolation sites from about amino acid 10 to about amino acid IS, from about amino acid 41 to about amino acid 46, from about amino acid 84 to about amino acid 89, from about amino acid 120 to about amino acid 123, from about amino acid 169 to about amino acid 174, 5 from about amino acid 229 to about amino acid 234, from about amino acid 240 to about amino acid 245, from about amino acid 318 to about amino acid 323 and from about amino acid 378 to about amino acid 383. Clone DNA71184-1634 has been deposited with ATCC on September 22, 1998 and is assigned ATCC deposit no. 203266.
An analysis of the Dayhoff database (version 33.45 SwissProt 35), using a WU-BLAST2 sequence 10 alignment analysis of the full-length sequence shown in Figure 164 (SEQ ID NO:292), evidenced significant homology between the PR01433 amino acid sequence and the following Dayhoff sequences: CELW01A114, CEF59A14, S67138, MTV050 3, S75135 and S12411.
EXAMPLE 86: Isolation of cDNA clones Encoding Human PRQ1490 IS A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA67006. Based on tbe DNA67006 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01490.
PCR primers (forward and reverse) were synthesized:
forward PCR primer f67006.fn 5'-CTTCCTCTGTGGGTGGACCATGTG-3' (SEQ ID NO:298)
reverse PCR primer (67006.rn 5' -GCCACCTCCATGCTAACGCGG-3' (SEQ ID NO:299) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA67006 sequence which had the following nucleotide sequence 25 hybridization probe (67006.dP
'-CCAAGGTCCTCGCTAAGAAGGAGCTGCTCTACGTGCCCCTCATCG-3' (SEQ ID N0:300)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PRO1490 gene using the probe oligonucleotide and one of the PCR primers. RNA 30 for construction of the cDNA libraries was isolated from human adrenal gland tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01490 (designated herein as DNA71213-1659 [Figure 165, SEQ ID NO:296]; and the derived protein sequence for PR01490.
The entire nucleotide sequence of DNA71213-1659 is shown in Figure 165 (SEQ ID NO:296). Clone 35 DNA71213-1659 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 272-274 and ending at the stop codon at nucleotide positions 1376-1378 (Figure 165). The predicted polypeptide precursor is 368 amino acids long (Figure 166). The full-length PR01490 protein shown in Figure
446
WO 00/12708 PCT/US99/20111
166 has an estimated molecular weight of about 42,550 daltons and a pi of about 9.11. Analysis of the full-length PR01490 sequence shown in Figure 166 (SEQ ID NO:297) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 25, transmembrane domains from about amino acid 307 to about amino acid 323 and from about amino acid 335 to about amino acid 352 and tyrosine kinase phosphorylation sites from about amino acid 160 to about amino acid 168 and from about amino acid 161 to 5 about amino acid 168. Clone DNA71213-1659 has been deposited with ATCC on October 27, 1998 and is assigned ATCC deposit no. 203401.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 166 (SEQ ID NO:297), evidenced significant homology between the PRO1490 amino acid sequence and the following Dayhoff sequences: A52744_l, S60478, 10 P_R99249, P_R59712, YBP2YEAST, S54641, CELT05H4_15, CELF28B3_1, CELZK40J and YIHG_ECOLI.
EXAMPLE 87: Isolation of cDNA clones Encoding Human PRO 1482
A cDNA clone (DNA71234-1651) encoding a native human PR01482 polypeptide was identified by 15 a yeast screen, in a human adrenal gland cDNA library that preferentially represents the 5' ends of the primary cDNA clones.
The full-length DNA71234-1651 clone shown in Figure 167 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 33-35 and ending at the stop codon at nucleotide positions 462-464 (Figure 167). The predicted polypeptide precursor is 143 amino acids long (Figure 168). The 20 full-length PR01482 protein shown in Figure 168 has an estimated molecular weight of about 15,624 daltons and a pi of about 9.58. Analysis of the full-length PR01482 sequence shown in Figure 168 (SEQ ID N0:302) evidences the presence of the following: a signal peptide from about mim acid 1 to about amino acid 28. Clone DNA71234-1651 has been deposited with ATCC on October 27, 1998 and is assigned ATCC deposit no. 203402.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 168 (SEQ ID N0:302), evidenced significant homology between the PR01482 amino acid sequence and the following Dayhoff sequences: A18267_3.
FYAKfpT F SS- Isolation of cDNA clones Encoding Human PRQ1446 30 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from tbe Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One or more of the ESTs was derived from a pancreatic islet cell library. The homology search was performed using 35 the computer program BLAST or BLAST2 (Altshul etal., Methods in Enzvmologv 266:460-480 (1996Y). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green,
447
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNAS6514.
In light of the sequence homology between the DNA56514 sequence and an EST sequence contained within the Incyte EST 2380344, the clone including this EST was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 169 and is herein designated as 5 DNA71277-I636.
The full length clone shown in Figure 169 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 152-154 and ending at the stop codon found at nucleotide positions 479-481 (Figure 169; SEQ ID N0:303). The predicted polypeptide precursor (Figure 170, SEQ ID N0:304) is 109 amino acids long with a signal peptide at about amino acids 1-15 of SEQ ID N0:304. PR01446 10 has a calculated molecular weight of approximately 11,822 daltons and an estimated pi of approximately 8.63. Clone DNA71277-1636 was deposited with the ATCC on September 22, 1998 and is assigned ATCC deposit no. 203285.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 170 (SEQ ID N0:304), revealed sequence identity 15 between the PR01446 amino acid sequence and the following Dayhoff sequences (data incorporated herein); P53_CANFA, P53_FELCA,LRP1_HSV1F, OSU57338 1, S75842, P_P93722, AF002189_1, B70408, S54309 andS53365. The first in this list is further described in Kraegel, et al., Cancer Lett.. 92(2): 181-186 (1995).
EXAMPLE ?9: Isolation of cDNA clones Encoding Human PRQ1558 20 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated Incyte EST cluster sequence no. 86390. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, C A) to identify existing homologies. The homology search was performed using the computer program BLAST 25 or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA58842.
In light of the sequence homology between the DNA58842 sequence and an EST sequence contained 30 within the Incyte EST clone no. 3746964, the Incyte EST clone no. 3746964 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 171 and is herein designated as DNA71282-1668.
Clone DNA71282-1668 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 84-86 and ending at the stop codon at nucleotide positions 870-872 (Figure 171). The 35 predicted polypeptide precursor is 262 amino acids long (Figure 172). Hie full-length PR01558 protein shown in Figure 172 has an estimated molecular weight of about 28,809 daltons and a pi of about 8.80. Analysis of the full-length PRO 1558 sequence shown in Figure 172 (SEQ ID N0:306) evidences the presence of the
448
WO 00/12708 PCT/US99/20111
following: a signal peptide from about amino acid 1 to about amino acid 25, transmembrane domains from about amino acid 8 to about amino acid 30 and from about amino acid 109 to about amino acid 130, a potential N-glycosylation site from about amino acid 190 to about amino acid 193, a tyrosine kinase phosphorylation site from about amino acid 238 to about amino acid 246, potential N-myristolatkm sites from about amino acid 22 to about amino acid 27, from about amino acid 28 to about amino acid 33, from about amino acid 110 to about 5 amino acid 115, from about amino acid 205 to about amino acid 210 and from about amino acid 255 to about amino acid 260 and amidation sites from about amino acid 31 to about amino acid 34 and from about amino acid 39 to about amino acid 42. Clone DNA71282-1668 has been deposited with ATCC on October 6, 1998 and is assigned ATCC deposit no. 203312.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 10 alignment analysis of the full-length sequence shown in Figure 172 (SEQ ID N0:306), evidenced significant homology between the PRO1558 amino acid sequence and the following Dayhoff sequences: AF075724_2, MXU24657J, CAMT_EUCGU, MSU20736_1, P_R29515, B70431, JC4004, CEY32B12A_3, CELF53B3_2 and PJR13543.
EXAMPLE 90: Isolation of cDNA clones Encoding Human PRO 1604
An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) was searched. Incyte EST No. 3550440 was identified as having homology to HDGF. EST No. 3550440 was then compared to various EST databases including public EST databases (e.g. GenBank), and the LIFESEQ® database, to identify homologous EST sequences. The search was performed using the computer program 20 BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is designated herein "DNA67237".
In light of the sequence homology between the DNA67237 sequence and EST no. 3367060 from the 25 LIFESEQ® database, the clone containing Incyte EST No. 3367060 was purchased and the cDNA insert was obtained and sequenced to obtain the entire coding sequence of PR01604 which is shown in Figure 173 (SEQ ID N0:307).
Clone DNA71286-1687 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 65-67 and an apparent stop codon at nucleotide positions 2078-2080. The predicted 30 polypeptide precursor is 671 amino acids long. The full-length PR01604 protein shown in Figure 174 has an estimated molecular weight of about 74,317 daltons and a pi of about 7.62. Additional features include a signal peptide at about amino acids 1-13; potential cAMP- and cGMP-dependent protein kinase phosphorylation sites at about amino acids 156-159,171-174, and 451-454; potential N-myristoylation sites at about amino acids 46-51, 365-370, and 367-372; and a cell attachment sequence at about amino acids 661-663. 35 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 174 (SEQ ID N0:308), revealed significant homology between the PR01604 amino acid sequence and Dayhoff sequence no. P W37483. Homology was
449
also shown between the PR01604 amino acid sequence and the following additional Dayhoff sequences: AF063020J, PR66727, P_W37482, JC5661, CEC25A1_11, CEU33058_1, 138073, MST2JDROHY, and HSATRX36_1.
Clone JDNA71286-1687 was deposited with the ATCC on October 20, 1998, and is assigned ATCC deposit no. 203357.
EXAMPLE 91: Isolation of cDNA clones Encoding Human PRQ1491
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA67202. Based on the DNA67202 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 10 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01491.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (67202.fn 5'-CAACGCAGCCGTGATAAACAAGTGG-3' (SEQ ID NO:311) reverse PCR primer (672Q2.rl) 5'-GCTTGGACATGTACCAGGCCGTGG-3' (SEQ ID NO:312) 15 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA67202 sequence which had the following nucleotide sequence hybridization probe (67202.pn
'-GGCCAGACTGATTTGCTCAATTCCTGGAAGTGATGGGGCAGATAC-3' (SEQ ID NO:313)
RNA for construction of the cDNA libraries was isolated from human aortic endothelial cell tissue. 20 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PR01491 (designated herein as DNA71883-1660 [Figure 175, SEQ ID N0:309]; and the derived protein sequence for PRO 1491.
The entire nucleotide sequence of DNA71883-1660 is shown in Figure 175 (SEQ ID N0:309). Clone DNA71883-1660 contains a single open reading frame with an apparent translational initiation site at nucleotide 25 positions 107-109 and ending at the stop codon at nucleotide positions 2438-2440 (Figure 175). The predicted polypeptide precursor is 777 amino acids long (Figure 176). The full-length PR01491 protein shown in Figure 176 has an estimated molecular weight of about 89,651 daltons and a pi of about 7.97. Analysis of the full-length PR01491 sequence shown in Figure 176 (SEQ ID N0:310) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 36, potential N-glycosylation sites from about amino 30 acid 139 to about amino acid 142, from about amino acid 607 to about amino acid 610 and from about amino acid 724 to about amino acid 727, a tyrosine kinase phosphorylation site from about amino acid 571 to about amino acid 576 and a gram-positive cocci surface protein anchoring hexapeptide sequence from about amino acid 32 to about amino acid 37. Clone DNA71883-1660 has been deposited with ATCC on November 17,1998 and is assigned ATCC deposit no. 203475.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 176 (SEQ ID N0:310), evidenced significant homology between the PR01491 amino acid sequence and the following Dayhoff sequences: GGU28240_1,
450
MUSC1_1, D49423, MMSEMH_1, AB002329_1, AF022947 1, HSU33920_1, HUMLUCA191, G01856and AF022946J.
EXAMPLE 92: Isolation of cDNA clones F.nr.ndine Human PRO 1431
An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, C A) S was searched and an EST (isolated from adult brain stem tissue) was identified (1370141, DNA66505) which showed homology to SH3. RNA for construction of cDNA libraries was isolated from human bone marrow. Afull length cDNA conresponding to the isolated EST was isolated using an in vitro cloning technique (DNA73401-1633) in pRK5.
The cDNA libraries used to isolate the cDNA clones encoding human PR01431 were constructed by 10 standard methods using commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, and cloned in a defmed orientation into a suitable cloning vector (such as pRKB or pRKD; pRKSB is a precursor of pRKSD that does not contain the Sfil site; see, Holmes et al., Science, 253:1278-1280 (1991)) in the unique Xhol and NotI.
IS A cDNA clone was sequenced in entirety. The entire nucleotide sequence of DNA73401-1633 (SEQ
ID N0:314) is shown in Figure 177. Clone DNA73401-1633 contains a single open reading frame with an apparent translational initiation site at about nucleotide positions 630-632 and a stop codon at about nucleotide positions 1740-1742. The predicted polypeptide precursor encoded by DNA734Q1-1633 is 370 amino acids long. Clone DNA73401 (designated as DNA73402-1633) has been deposited with ATCC and is assigned ATCC 20 deposit no. 203273.
Based sequence alignment analysis (using the ALIGN computer program) of the full-length sequence, PRO 1431 shows significant amino acid sequence identity to SH17_HUMAN, an SH3 containing protein known as SH3P17. Additional significant identity score were found with D89164_l, AF0321I8_1, EXLP_TOBAC, YHR4 YEAST, S46992, RATP130CAS_2, AF043259_1, RATP130CAS_1 and MYSC_ACACA.
EXAMPLE 93: Isolation of cDNA clones Encoding Human PRQ1S63
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA67191. Based on the DNA67191 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 30 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01563.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (67191 .f 11 5,-CCCTGAAGCTGCCAGATGGCTCC-3, (SEQ ID N0:318)
reverse PCR primer (67191 .rH 5'-CTGTGCTCTTCGGTGCAGCCAGTC-3' (SEQ ID NO:3I9) 35 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA67191 sequence which had the following nucleotide sequence hybridization probe (67191 ,pD
451
'-CCACAGATGTGGTACTGCCTGGGGCAGTCAGCTTGCGCTACAG-3' (SEQ ID N0:320)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with tbe PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01563 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human bone marrow tissue.
DNA sequencing of the clones isolated as described above gave tbe full-length DNA sequence for
PR01563 (designated herein as DNA73492-1671 [Figures 179A-B, SEQ ID N0:316]; and the derived protein sequence for PR01563.
The entire nucleotide sequence of DNA73492-1671 is shown in Figures 179A-B (SEQ ID NO:316). Clone DNA73492-1671 contains a single open reading frame with an apparent translational initiation site at 10 nucleotide positions 419-421 and ending at the stop codon at nucleotide positions 2930-2932 (Figures 179A-B). The predicted polypeptide precursor is 837 amino acids long (Figure 180). The full-length PR01S63 protein shown in Figure 180 has an estimated molecular weight of about 90,167 daltons and a pi of about 8.39. Analysis of the full-length PR01563 sequence shown in Figure 180 (SEQ ID NO:317) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 48, a potential N-glycosylation site from 15 about amino acid 68 to about amino acid 71, glycosaminoglycan attachment sites from about amino acid 188 to about amino acid 191 and from about amino acid 772 to about amino acid 775, a cAMP- and cGMP-dependent protein kinase phosphorylation site from about amino acid 182 to about amino acid 185, a tyrosine kinase phosphorylation site from about amino acid 730 to about amino acid 736, potential N-myristolation sites from about amino acid 5 to about amino acid 10, from about amino acid 19 to about amino acid 24, from about amino 20 acid 121 to about amino acid 126, from about amino acid 125 to about amino acid 130, from about amino acid 130 to about amino acid 135, from about amino acid 147 to about amino acid 152, from about amino acid 167 to about amino acid 172, from about amino acid 168 to about amino acid 173, from about amino acid 174 to about amino acid 179, from about amino acid 323 to about amino acid 328, from about amino acid 352 to about amino acid 357, from about amino acid 539 to about amino acid 544, from about amino acid 555 to about amino 25 acid 560, from about amino acid 577 to about amino acid 582, from about amino acid 679 to about amino acid 684, from about amino acid 682 to about amino acid 687, and from about amino acid 763 to about amino acid 768, amidation sites from about amino acid 560 to about amino acid 563 and from about amino acid 834 to about amino acid 837, leucine zipper pattern sequences from about amino acid 17 to about amino acid 38 and from about amino acid 24 to about amino acid 45 and a neutral zinc metallopeptidase, zinc-binding region signature 30 sequence from about amino acid 358 to about amino acid 367. Clone DNA73492-1671 has been deposited with ATCC on October 6, 1998 and is assigned ATCC deposit no. 203324.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 180 (SEQ ID NO:317), evidenced significant homology between the PR01563 amino acid sequence and the following Dayhoff sequences: AB014588_1, 35 D67076_1, AB001735_1, P_W47028, AB002364J, P_W47029, GEN13695, P_R40823, AF005665J and DISA TRIGA.
452
EXAMPLE 94: Isolation of cDNA clones Encoding Human PRO 1565
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA67183. Based on an observed homology between the DNA67183 consensus sequence and an EST sequence contained within Incyte EST clone no. 2510320, Incyte EST clone no. 2510320 was purchased and its insert was obtained and sequenced. That 5 insert sequence is shown in Figure 181 and is herein designated DNA73727-1673 (SEQ ID NO:321).
The entire nucleotide sequence of DNA73727-1673 is shown in Figure 181 (SEQ ID NO:321). Clone DNA73727-1673 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 59-61 and ending at die stop codon at nucleotide positions 1010-1012 (Figure 181). The predicted polypeptide precursor is 317 amino acids long (Figure 182). The full-length PR01565 protein shown in Figure 10 182 has an estimated molecular weight of about 37,130 daltons and a pi of about 5.18. Analysis of the full-length PRO1565 sequence shown in Figure 182 (SEQ ID NO:322) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 40, a potential type II transmembrane domain from about amino acid 25 to about amino acid 47, potential N-glycosylation sites from about amino acid 94 to about amino acid 97 and from about amino acid 180 to about amino acid 183, glycosaminoglycan attachment sites from 15 about amino acid 92 to about amino acid 95, from about amino acid 70 to about amino acid 73, from about amino acid 85 to about amino acid 88, from about amino acid 133 to about amino acid 136, from about amino acid 148 to about amino acid 151, from about amino acid 192 to about amino acid 195 and from about amino acid 239 to about amino acid 242, potential N-myristolation sites from about amino acid 33 to about amino acid 38, from about amino acid 95 to about amino acid 100, from about amino acid 116 to about amino acid 121, 20 from about amino acid 215 to about amino acid 220 and from about amino acid 272 to about amino acid 277, a microbodies C-terminal targeting signal sequence from about amino acid 315 to about amino acid 317 and a cytochrome C family heme-binding site signature sequence from about amino acid 9 to about amino acid 14. Clone DNA73727-1673 has been deposited with ATCC on November 3, 1998 and is assigned ATCC deposit no. 203459.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 182 (SEQ ID NO:322), evidenced significant homology between the PR01565 amino acid sequence and the following Dayhoff sequences: AF051425 1, P_R65490, P_R65488, GRPE STAAU, RNU31330_1, ACCD_BRANA, D50558J, HUMAMYAB3_1, PJW34452 and P_P50629.
EXAMPLE 95: Isolation of cDNA clones Encoding Human PRQ1571
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA69559. Based on homology observed between the DNA69559 consensus sequence and an EST sequence contained within the Incyte EST clone no. 35 3140760, Incyte EST clone no. 3140760 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 183 and is herein designated as DNA73730-1679.
Clone DNA73730-1679 contains a single open reading frame with an apparent translational initiation
453
site at nucleotide positions 90-92 and ending at the stop codon at nucleotide positions 807-809 (Figure 183). The predicted polypeptide precursor is 239 amino acids long (Figure 184). The full-length PR01S71 protein shown in Figure 184 has an estimated molecular weight of about 25,699 daltons and a pi of about 8.99. Analysis of the full-length PR01571 sequence shown in Figure 184 (SEQ ID NO:324) evidences the presence of die following: a signal peptide from about amino acid 1 to about amino acid 21 and transmembrane domains from 5 about amino acid 82 to about amino acid 103, from about amino acid 115 to about amino acid 141 and from about amino acid 160 to about amino acid 182. Clone DNA73730-1679 has been deposited with ATCC on October 6, 1998 and is assigned ATCC deposit no. 203320.
An analysis of die Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 184 (SEQ ID NO.-324), evidenced significant 10 homology between the PR0157I amino acid sequence and the following Dayhoff sequences: AF072128_1, AB000712_1, AB000714_1, AF007189_1, AF000959_1, AF068863_1, P_W15288, PM22_HUMAN, PR30056 and LSU46824_1.
EXAMPLE 96: Isolation of cDNA clones Encoding Human PRQ1572 IS Using the method described in Example 1 above, a consensus sequence was obtained. Hie consensus sequence is designated herein "DNA69560". Based on the DNA69560 consensus sequence and other information provided herein, a clone including another EST (Incyte DNA3051424) from the assembly was purchased and sequenced.
The entire coding sequence of PR01573 is included in Figure 185 (SEQ ID NO:325). Clone 20 DNA73734-1680 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 90-92 and an apparent stop codon at nucleotide positions 873-875. The predicted polypeptide precursor is 261 amino acids long. The signal peptide is at about amino acids 1-23 and the transmembrane domains are. at about amino acids 81-100, 121-141, and 173-194 of SEQ ID NO-.326. One or more of the transmembrane domains can be deleted or inactivated. The locations of a N-glycosylation site, N-myristoylation sites, a tyrosine 25 kinase phosphorylation site and a prokaryotic membrane lipoprotein lipid attachment site are indicated in Figure 186. Clone DNA73734-1680 has been deposited with the ATCC and is assigned ATCC deposit no. 203363. The full-length PR01572 protein shown in Figure 186 has an estimated molecular weight of about 27,856 daltons and a pi of about 8.5.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 30 alignment analysis of the Ml-length sequence shown in Figure 186 (SEQ ID NO:326), revealed sequence identity between the PR01572 amino acid sequence and the following Dayhoff sequences (incorporated herein): AF072127_1, HSU89916_1, AB000713_1, AB000714_1, AB0007I2_1, AF000959_1, AF072128J, AF068863_1, P_W29881, andP_W58869.
EXAMPLE 97: Isolation of cDNA clones Encoding Human PRQ1573
EST 3628990 was identified in an Incyte Database, (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) and extended in a comparison to other sequences in databases to form an assembly. The alignment search
454
was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)] as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence is designated herein 5 "DNA69561".
Based on the DNA69561 consensus sequence and other information provided herein, a clone including another EST (Incyte DNA3752657) from the assembly was purchased and sequenced. This clone came from a breast tumor tissue library.
The entire coding sequence of PR01573 is included in Figure 187 (SEQ ID NO:327). Clone 10 DNA73735-1681 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 97-99 and an apparent stop codon at nucleotide positions 772-774. The predicted polypeptide precursor is 225 amino acids long. The signal peptide is at about amino acids 1-17 and the transmembrane domains are at about amino acids 82-101, 118-145, and 164-188 of SEQ ID NO:328. One or more of the transmembrane domains can be deleted or inactivated. A phosphorylation site, amidation site, and N-myristoylation sites are 15 shown in Figure 188. Clone DNA73735-1681 has been deposited with ATCC and is assigned ATCC deposit no. 203356. The full-length PR01573 protein shown in Figure 188 has an estimated molecular weight of about 24,845 daltons and a pi of about 9.07.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 188 (SEQ ID NO:328), revealed sequence identity 20 between the PRO1573 amino acid sequence and the following Dayhoff sequences (incorporated herein): AF007189_1, AB0007I41, AB000713_1, AB000712_1, A39484, AF000959_1, AF072127_, AF072128J, AF068863_1 and AF077739_1.
EXAMPLE 98: Isolation of cDNA clones Encoding Human PRO 1488 25 An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA)
was searched and EST No. 3639112H1 was identified as having homology to CPE-R. EST No. 3639112H1 is designated herein as "DNA69562". EST clone 3639112H1, which was derived from a lung tissue library of a 20-week old fetus who died from Patau's syndrome, was purchased and the cDNA insert was obtained and sequenced in its entirety. The entire nucleotide sequence of PR01488 is shown in Figure 189 (SEQ ID 30 N0:329), and is designated herein as DNA73736-1657. DNA73736-1657 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 6-8 and a stop codon at nucleotide positions 666-668 (Figure 189; SEQ ID NO:329). The predicted polypeptide precursor is 220 amino acids long.
The full-length PR01488 protein shown in Figure 190 has an estimated molecular weight of about 23,292 daltons and a pi of about 8.43. Four transmembrane domains have been identified as being located at 35 about amino acid positions 8-30, 82-102, 121-140, and 166-186.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 190 (SEQ ID N0:330), revealed significant
455
homology between the PRO1488 amino acid sequence and Dayhoff sequence AB000712 1. Homology was also found between the PR01488 amino acid sequence and the following additional Dayhoff sequences: AB000714_1, AF007189J, AF000959 1, P_W63697, MMU82758_1, AF072127_1, AF072128_1. HSU89916_1, AF068863 1, CEAF000418_1, and AF077739_1.
Clone DNA73736-1657 was deposited with the ATCC on November 17, 1998, and is assigned ATCC 5 deposit no. 203466.
EXAMPLE 99: Isolation of cDNA clones Encoding Human PRQ1489
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA69563. Based upon an observed 10 sequence similarity between the DNA69S63 consensus sequence and an EST sequence contained within the Incyte EST clone no. 3376608, Incyte EST clone no. 3376608 was purchased and its insert obtained and sequenced. That insert is herein designated DNA73737-1658.
The entire nucleotide sequence of DNA73737-1658 is shown in Figure 191 (SEQ ID NO:331). Clone DNA73737-16S8 contains a single open reading frame with an apparent translational initiation site at nucleotide 15 positions 264-266 and ending at the stop codon at nucleotide positions 783-78S (Figure 191). The predicted polypeptide precursor is 173 amino acids long (Figure 192). The full-length PR01489 protein shown in Figure 192 has an estimated molecular weight of about 18,938 daltons and a pi of about 9.99. Analysis of the full-length PRO1489 sequence shown in Figure 192 (SEQ ID NO:332) evidences the presence of the following: transmembrane domains from about amino acid 31 to about amino acid 51, from about amino acid 71 to about 20 amino acid 90 and from about amino acid 112 to about amino acid 133 and a potential N-glycosylation site from about amino acid 161 to about amino acid 164. Clone DNA73737-1658 has been deposited with ATCC on October 27, 1998 and is assigned ATCC deposit no. 203412.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO:332), evidenced significant 25 homology between the PRO 1489 amino acid sequence and the following Dayhoff sequences: AF007189 1, AB000712_1, AF000959_1, MMU82758_1, AF035814J, AF072127_1, AF072128_1, HSU89916_1, AF068863_1 and PPU50051 1.
EXAMPLE 100: Isolation of cDNA clones Encoding Human PRQ1474 30 An expressed sequence tag (EST) DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA)
was searched and an EST was identified. This EST showed homology to pancreatic secretory trypsin inhibitor.
The done which included this EST was purchased from Incyte (it came from a uterine cervical tissue library) and sequenced in full to reveal tbe nucleic acid of SEQ ID NO:333, which encodes PR01474.
The entire nucleotide sequence of PR01474 is shown in Figure 193 (SEQ ID NO:333). Clone 35 DNA73739-1645 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 45-47 and a stop codon at nucleotide positions 300-302 (Figure 193; SEQ ID NO:333). The predicted polypeptide precursor is 85 amino acids long. As indicated in Figure 194, the Kazal serine protease inhibitor
456
PCT/US99/20I11
family signature begins at about amino acid 45 of SEQ ID NO:334. Also indicated in Figure 194 is a region conserved in integral alpha chains (beginning at about amino acid 32 of SEQ ID NO:334). Clone DNA73739-1645 has been deposited with the ATCC and is assigned ATCC deposit no. 203270. The full-length PR01474 protein shown in Figure 194 has an estimated molecular weight of about 9,232 daltons and a pi of about 7.94.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 5 alignment analysis of the full-length sequence shown in Figure 194 (SEQ ID NO:334), revealed sequence identity between the PR01474 amino acid sequence and the following Dayhoff sequences (all ovomucoids, data incorporated herein by reference): IOVO_FRAER, IOVO_FRAAF, IOVO_FRACO, IOVO_CYRMO, IOVOSTRCA, H61492, C6I589, IOVO_POLPL, D61589, and IOVO_TURME.
EXAMPLE 101: Isolation of cDNA clones Encoding Human PRO 1508
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ® database, designated Incyte Cluster No. 34523, also referred herein as "DNA10047". This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public and private EST databases (e.g., GenBank and (LIFESEQ®) to identify existing 15 homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated "DNA55723".
In light of the sequence homology between the DNA55723 sequence a sequence contained within Incyte
EST no. 2989064, the EST clone 2989064 was purchased and the cDNA insert was obtained and sequenced in its entirety. The sequence of this cDNA insert is shown in Figure 195 and is herein designated as "DNA73742-1662".
The full length clone shown in Figure 195 contained a single open reading frame with an apparent 25 translational initiation site at nucleotide positions 70 to 72 and ending at the stop cotton found at nucleotide positions 514 to 516 (Figure 195; SEQ ID NO:335). The predicted polypeptide precursor (Figure 196, SEQ ID NO:335) is 148 amino acids long. Other features of the PR01508 protein include: a signal sequence at about amino acids 1-30; a tyrosine kinase phosphorylation motif at about amino acids 96-103; and N-myristoylation motifs at about amino acids 27-32, 28-33, and 140-145. PR01508 has a calculated molecular weight of 30 approximately 17,183daltonsandanestimatedpIofapproximately8.77. Clone DNA73742-1662 was deposited with the ATCC on October 6, 1998 and is assigned ATCC deposit no. 203316.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 1% (SEQ ID NO-.336), revealed some homology between the PRO1508 amino acid sequence and the following Dayhoff sequences: HSAJ3728 1; P_R74962; 35 P R74941; AF053074 1; F69515; S20706; RPB1 PLAFD; A20587 I; A51861 1; and S75947.
457
EXAMPLE 102: Isolation of cDNA clones Encoding Human PRO 1555
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ® database, designated EST cluster no. 521, and also referred to herein as "DNA10316". This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and the LIFESEQ® database to identify 5 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated "DNA56374". 10 In light of the sequence homology between the DNA56374 sequence and an EST sequence contained within Incyte EST no.2855769, EST no.2855769 was purchased and the cDNA insert was obtained and sequenced. EST no. 2855769 was derived from a library constructed from female breast fat tissue. The sequence of this cDNA insert is shown in Figure 197 and is herein designated as DNA73744-1665.
The full length clone shown in Figure 197 contained a single open reading frame with an apparent IS translational initiation site at nucleotide positions 90 to 92 and ending at the stop codon found at nucleotide positions 828 to 830 (Figure 197; SEQ ID NO:337). The predicted polypeptide precursor (Figure 198, SEQ ID NO:338) is 246 amino acids long. PR01555 has a calculated molecular weight of approximately 26,261 daltons and an estimated pi of approximately 5.65. Additional features include: a signal peptide at about amino acids 1-31; transmembrane domains at about amino acids 11-31 and 195-217; a potential N-glycosylation site 20 at about amino acids 111-114; potential casein kinase II phosphorylation sites at about amino acids 2-5,98-101, and 191-194; and potential N-myristoylation sites at about amino acids 146-151, and 192-197.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 198 (SEQ ID NO:338), revealed some homology between the PR01555 amino acid sequence and the following Dayhoff sequences: YKA4 CAEEL, 25 AB014541_l, HVSX99518_2, SSU63019_1, GEN14286, MMU68267_1, XP2_XENLA, ICP4_HSV11, PW40200, and AE001360_1.
Clone DNA73744-1665 was deposited with the ATCC on October 6, 1998, and is assigned ATCC deposit no. 203322.
EXAMPLE 103: Isolation of cDNA clones Encoding Human PRO 1485
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is designated herein "DNA44791". Based on the DNA44791 consensus sequence, oligonucleotides were synthesized: 1) to identiiy by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 35 PR01485.
PCR primers (2 forward and 2 reverse) were synthesized:
forward PCR primer 1: 51CCCTCC AAGG ATG AC A A AGGCGC 3' (SEQ ID NO:341);
458
forward PCR primer 2: 5 'GGTCAGC AGCTTTCTTGCCCTAAATCAGG 3' (SEQ ID NO:342);
reverse PCR nrimer I: S'ATCTCAGGCGGCATCCTGTCAGCC 3' (SEQ ID NO:343); and reverse PCR primer 2: 5'GTGGATGCCTGCAAGAAGGTTGGG 3' (SEQ ID NO:344).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA44791 sequence which had the following nucleotide sequence:
hybridization probe 5' AGCTTTCTTGCCCT A A ATC AGGCC AGCCTC ATC AGTCGCTGTG AC 3' (SEQ ID NO:345)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR0148S gene using the probe oligonucleotide and one of the PCR primers. RNA 10 for construction of the cDNA libraries was isolated from human testis.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01485 (designated herein as DNA73746-1654 [Figure 199, SEQ ID NO:339]; and the derived protein sequence for PR01485.
The entire coding sequence of PR01485 is shown in Figure 199 (SEQ ID N0:339). Clone DNA73746-15 1654 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 151-153 and an apparent stop codon at nucleotide positions 595-597 of SEQ ID NO:339. The predicted polypeptide precursor is 148 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID N0:340. The lysozyme C signature, CAAX box, and an N-gycosylation site are shown in Figure 200. Clone DNA73746-1654 has been deposited with ATCC and is assigned ATCC deposit no. 203411. The full-length 20 PRO1485 protein shown in Figure 200 has an estimated molecular weight of about 16,896 daltons and a pi of about 6.05.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of die full-length sequence shown in Figure 200 (SEQ ID N0:340), revealed sequence identity between the PRO1485 amino acid sequence and the following Dayhoff sequences: LYC_PHACO, P_R76684, 25 2HFL_Y, JC2144, JC5544, JC5555, JC5369, LYC2_PIG, P_R12113, and JC5380.
. EXAMPLE 104: Isolation of cDNA clones Encoding Human PRQ1564
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA67213. Based on the DNA67213 30 consensus sequence, oligonucleotides were synthesized: 1) to identiiy by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01564.
PCR primers (forward and reverse) were synthesized:
forward PCR primer r67213.fl) 5'-GGAGAGGTGGTGGCCATGGACAG-3' (SEQ ID NO:348) 35 reverse PCR primer (67213.rU 5'-CTGTCACTGCAAGGAGCCAACACC-3' (SEQ ID NO-.349)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA67213 sequence which had the following nucleotide sequence
459
hybridization probe (67213.dH
' -T ATGTCGCTGCG AGGTGGTG A A AACCTCG A ACTGT CTTTC A AGGC-3' (SEQ ID N0:350)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01564 gene using the probe oligonucleotide and one of the PCR primers. RNA 5 for construction of the cDNA libraries was isolated from human breast carcinoma tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01564 (designated herein as DNA73760-1672 [Figure 201, SEQ ID NO:346]; and the derived protein sequence for PR01564.
The entire nucleotide sequence of DNA73760-1672 is shown in Figure 201 (SEQ ID NO:346). Clone 10 DNA73760-1672 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 462-464 and ending at the stop codon at nucleotide positions 2379-2381 (Figure 201). The predicted polypeptide precursor is 639 amino acids long (Figure 202). The full-length PR01S64 protein shown in Figure 202 has an estimated molecular weight of about 73,063 daltons and a pi of about 6.84. Analysis of the full-length PR01S64 sequence shown in Figure 202 (SEQ ID NO:347) evidences the presence of the following: a 15 signal peptide from about amino acid 1 to about aminn acid 28, a trasnmembrane domain from about amino acid 11 to about amino acid 36, potential N-glycosylation sites from about amino acid 107 to about amino acid 110 and from about amino acid 574 to about amino acid 577, a tyrosine kinase phosphorylation site from about amino acid 50 to about amino acid 57, potential N-myristolation sites from about amino acid 158 to about amino acid 163, from about amino acid 236 to about amino acid 241, from about amino acid 262 to about amino acid 267, 20 from about amino acid 270 to about amino acid 275, from about amino acid 380 to about amino acid 385 and from about amino acid 513 to about amino acid518, an amidation site from about amino acid 110 to about amino acid 113 and a prokaryotic membrane lipoprotein lipid attachment site from about amino acid 15 to about amino acid 25. Clone DNA73760-1672 has been deposited with ATCC on October 6, 1998 and is assigned ATCC deposit no.203314.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 202 (SEQ ID NO:347), evidenced significant homology between the PR01564 amino acid sequence and the following Dayhoff sequences: MMU73819_1, HSY08564 1, P_W34470, P_R66402, PAGTHUMAN, CEGLY5B_1, CEGLY6A1, CEGLY6B1, AP000006_308 and E69322.
EXAMPLE 105: Isolation of cDNA clones Encoding Human PRQ1755
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ** database, designated EST Cluster No. 141872. This EST cluster sequence was then compared to a variety of ESTs from the databases listed above to identify existing homologies. The 35 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with
460
WO 00/12708 PCT/US99/20111
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated "DNA55731".
In light of the sequence homology between the DNASS731 sequence and a sequence contained within Incyte EST no. 2S7323, the EST clone was purchased and the cDNA insert was obtained and sequenced. Incyte clone 2S7323 was derived from a library constructed using RNA isolated from the hNT2 cell line (Stratagene 5 library no. STR9372310), which was derived from a human teratocarcinoma that exhibited properties characteristic of a committed neuronal precursor at an early stage of development. The sequence of this cDNA insert is shown in Figure 203 and is herein designated "DNA76396-1698". Alternatively, the DNA76396-1698 sequence can be obtained by preparing oligonucleotide probes and primers and isolating the sequence from an appropriate library (e.g. STR9372310).
The full length clone shown in Figure 203 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 58 to 60 and ending at the stop codon found at nucleotide positions 886 to 888 (Figure 203; SEQ ID NO:351). The predicted polypeptide precursor (Figure 204, SEQ ID NO-.352) is 276 amino acids long. PR01755 has a calculated molecular weight of approximately 29,426 daltons and an estimated pi of approximately 9.40. Additional features include: a signal peptide sequence at 15 about amino acids 1-31; a transmembrane domain at about amino acids 178-198; a cAMP and cGMP-dependent protein kinase phosphorylation site at about amino acids 210-213; potential N-myristoylation sites at about amino acids 117-122, 154-149, and 214-219; and a cell attachment sequence at about amino acids 149-151.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 204 (SEQ ID NO:352), revealed some homology 20 between the PR01755 amino acid sequence and the following Dayhoff sequences: APG-BRANA, P_R37743, NAU88587J, YHL1_EBV, P_W31855, CET10B10_4, AF039404_1, PRP1_HUMAN, AF038575_1, and AF053091_1.
Clone DNA76396-1698 was deposited with the ATCC on November 17, 1998, and is assigned ATCC deposit no. 203471.
EXAMPLE 106: Isolation of cDNA clones Encoding Human PRQ1757
Use of the signal sequence algorithm described in Example 3 above allowed identification of three EST sequences from the Incyte database, designated Incyte EST clones no. 2007947, 2014962 and 1912034. These EST sequences were thai clustered and assembled into a consensus DNA sequence with the program "phrap" 30 (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated as DNA56054.
In light of the sequence homology between the DNA56054 sequence and a sequence contained within the Incyte EST clone no. 2007947, the Incyte EST clone no. 2007947 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 205 and is herein designated as 35 DNA76398-1699.
Clone DNA76398-1699 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 59-61 and ending at the stop codon at nucleotide positions 422-424 (Figure 205). The
461
predicted polypeptide precursor is 121 amino acids long (Figure 206). The full-length PR01757 protein shown in Figure 206 has an estimated molecular weight of about 12,073 daltons and a pi of about 4.11. Analysis of the full-length PR01757 sequence shown in Figure 206 (SEQ ID NO:354) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about amino acid 91 to about amino acid 110, a glycosaminoglycan attachment site from about amino acid 44 to about 5 amino acid 47, a cAMP- and cGMP-dependent protein kinase phosphorylation site from about amino acid 116 to about amino acid 119 and a potential N-myristolation site from about amino acid 91 to about amino acid 96. Clone DNA76398-1699 has been deposited with ATCC on November 17, 1998 and is assigned ATCC deposit no. 203474.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 10 alignment analysis of the full-length sequence shown in Figure 206 (SEQ ID NO:354), evidenced significant homology between the PR01757 amino acid sequence and the following Dayhoff sequences: JQ0964, COLLJHSVS7, HSU70136_1, AF003473_1, D89728_l, MTFl_MOUSE, AF029777_1, HSU88153_1 and PJW05321.
EXAMPLE 107: Isolation of cDNA clones Encoding Hnman PRQ1758
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ* database, designated EST cluster No. 20926. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) from the databases mentioned above, to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 20 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56260.
In light of the sequence homology between the DNA56260 sequence and a sequence contained within 25 EST no. 2936330 from the LIFESEQ* database, the EST clone, which originated from a library constructed from thymus tissue of a fetus that died from anencephalus, was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA76399-1700.
The full length clone shown in Figure 207 contained a single open reading frame with an apparent 30 translational initiation site at nucleotide positions 78 to 80 and ending at the stop codon found at nucleotide positions 549-551 (Figure 207; SEQ ID NO:355). The predicted polypeptide precursor (Figure 208, SEQ ID NO:356) is 157 amino acids long. PR01758 has a calculated molecular weight of approximately 17,681 daltons and an estimated pi of approximately 7.65. Additional features include: a signal peptide from about amino acids 1-15; a potential N-glycosylation site at about amino acids 24-27; a cAMP- and cGMP-dependent protein kinase 35 phosphorylation site at about amino acids 27-30; a casein kinase II phosphorylation site at abom amino acids 60-63; potential N-myristoylation sites at about amino acids 17-22,50-55,129-134, and 133-138; a cell attachment sequence at about amino acids 153-155; and a cytochrome c family heme-binding site signature at about amino
462
acids 18-23.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:356), revealed significant homology between the PRO1758 amino acid sequence and Dayhoff sequence no AC005328_2. Homology was also found between the PR01758 amino acid sequence and Dayhoff sequence no. CELC46F21.
Clone DNA76399-1700 was deposited with the ATCC on November 17, 1998 and is assigned ATCC
deposit no. 203472.
EXAMPLE 108: Isolation of cDNA clones Encoding Human PRQ1575
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 10 in Example 1 above. This consensus sequence is designated herein as "DNA35699". Based on the DNA35699 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01575.
PCR primers (forward and reverse) were synthesized:
forward PCR primers: CCAGCAGTGCCCATACTCCATAGC (35699.fl; SEQ ID N0.359); TGACGAGTGGGATACACTGC (35699.f2; SEQ ID N0:360)
reverse PCR primer: GCTCTACGGAAACTTCTGCTGTGG (35699.rl; SEQ ID NO:361)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35699 sequence which had the following nucleotide sequence:
hybridization probe: ATTCCCAGGCGTGTCATTTGGGATCAGCACTGATTCTGAGGTTCTGACAC (35699.pl; SEQ ID NO:362)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01575 gene using the probe oligonucleotide and one of the PCR primers. RNA 25 for construction of the cDNA libraries was isolated from human pancreatic tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01575 (designated herein as DNA76401-1683 [Figure 209, SEQ ID NO:357]; and the derived protein sequence for PR01575.
The entire coding sequence of PR01575 is shown in Figure 209 (SEQ ID NO:357). Clone DNA76401-30 1683 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 22-24 and an apparent stop codon at nucleotide positions 841-843. The predicted polypeptide precursor is 273 amino acids long. The full-length PR01575 protein shown in Figure 210 has an estimated molecular weight of about 30,480 daltons and a pi of about 4.60. Additional features include: a signal peptide at about amino acids 1-20; a transmembrane domain at about amino acids 143-162; a potential N-glycosylation site at about amino 35 acids 100-103; and potential N-myristoylation sites at about amino acids 84-89,103-108,154-159, and 201-206.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 210 (SEQ ID N0:358), revealed significant
463
homology between the PRO1575 amino acid sequence and Dayhoff sequence A12005 1. Homology was also revealed between the PR01575 amino acid sequence and the following additional Dayhoff sequences: PP80615; PJR25297; P_R51696; A47300; PDI_DROME; P_R49829; P_R63807; DMALPADAP_1; and DRZNF6_1.
Clone DNA76401-1683 was deposited with the ATCC on October 20,1998, and is assigned ATCC deposit no. 203360.
EXAMPT-F. 1QQ- Isolation of cDNA clones Encoding Human PRQ1787
A consensus DNA sequence was assembled relative to other EST sequences using phrap to form an assembly as described in Example 1 above. This consensus sequence is designated herein "DNA45123". Based on homology of DNA45123 to Incyte EST 3618549 identified in the assembly, as well as other discoveries and 10 information provided herein, the clone including this EST was purchased and sequenced. DNA sequencing of the clone gave the full-length DNA sequence for PR01787 and the derived protein sequence for PR01787.
The entire coding sequence of PR01787 is included in Figure 211 (SEQ ID NO:363). Clone DNA76510-2504 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 163-165 and an apparent stop codon at nucleotide positions 970-972 of SEQ ID NO:363. The IS approximate locations of the signal peptide, transmembrane domain, N-glycosylationsites, N-myristoylationsites and a kinase phosphorylation site are indicated in Figure 212. The predicted polypeptide precursor is 269 amino acids long. Clone DNA76510-2504 has been deposited with the ATCC and is assigned ATCC deposit no. 203477. The full-length PR01787 protein shown in Figure 212 has an estimated molecular weight of about 29,082 daltons and a pi of about 9.02.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of die full-length sequence shown in Figure 212 (SEQ ID N0:364), revealed sequence identity between the PR01787 amino acid sequence and the following Dayhoff sequences: MYP0_RAT, MYP0_HUMAN, MYPO_BOVIN, GEN12838, HSSCN2B2_1, AF007783_1, HSU90716_1, P_W42015, XLU43330_1 and AF060231_1.
EXAMPLE 110: Isolation of cDNA clones Engine Human PRQ1781
Initial DNA sequences referred to herein as DNA58070 and DNA56340 were identified using a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that preferentially represents the 5' ends of the primary cDNA clones. These sequences were clustered and assembled into a consensus DNA sequence 30 using the computer program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence is designated herein as "DNA59575".
Based on the DNA59575 consensus sequence, the following oligonucleotides, were synthesized for use as probes to isolate a clone of the full-length coding sequence for PR01781 from a human fetal lung cDNA library: TGG AAAAGAAGTCTGGTC AGAAGGTTTAGG (SEQ ID NO-.367), 35 CATTTGGCTTCATTCTCCTGCTCTG (SEQ ID NO:368), AAAACCTCAGAACAACTCATTTTGCACC (SEQ ID NO:369) and GTCTCACCATGGTTGCTCTTGCCAAATTGTGGGAAGCAGGG(SEQ ID N0:370). The full length DNA76522-2500 clone shown in Figure 213 contained a single open reading frame with
464
an apparent translational initiation site at nucleotide positions 21 to 23 and ending at tbe stop codon found at nucleotide positions 1141-1143 (Figure 213; SEQ ID NO:365). The predicted polypeptide precursor (Figure 214, SEQ ID NO:366) is 373 amino acids long. PR01781 has a calculated molecular weight of approximately 41,221 daltons and an estimated pi of approximately 8.54. Additional features include: a possible signal peptide at about amino acids 1 -19; a transmembrane domain at about amino acids 39-60; a tyrosine phosphorylation site 5 at about amino acids 228-236; potential N-myristoylation sites at about amino acids 16-21,17-22,43-48,45-50, 47-52, 49-54, 53-58, 58-63, 59-64, 62-67, 126-131, and 142-147; amidation sites at about amino acids 22-25 and 280-283; and a prokaryotic membrane lipoprotein lipid attachment site at about amino acids 12-22.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 214 (SEQ ID NO:366), revealed some homology 10 between the PR01781 amino acid sequence and the following Dayhoff sequences: CEY4510D_5, AP000001 146, PJR10676, DACJSTRSQ, CEC40H55, P_R35204, KPU58495J, KPN16781_1, AF010403_1, and AF056116_14.
Clone DNA76522-2500 was deposited with the ATCC on November 17, 1998, and is assigned ATCC deposit no. 203469.
EXAMPLE 111: Isolation of cDNA clones Encoding Human PROI556
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ* database, designated EST Cluster No. 103158, and also referred to herein as "DNA10398". This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) 20 databases which included public EST databases (e.g., GenBank) and the LIFESEQ* database, to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 25 Washington). The consensus sequence obtained therefrom is herein designated DNA56417.
In light of the sequence homology between the DNA56417 sequence and a sequence contained within Incyte EST no. 959332, EST no. 959332 was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 215 and is herein designated as DNA76529-1666.
The full length clone shown in Figure 215 contained a single open reading frame with an apparent 30 translational initiation site at nucleotide positions 85 to 87 and ending at the stop codon found at nucleotide positions 892 to 894 (Figure 215; SEQ ID NO:371). The predicted polypeptide precursor (Figure 216, SEQ ID NO:372) is 269 amino acids long. PR01556 has a calculated molecular weight of approximately 28,004 daltons and an estimated pi of approximately 5.80. Additional features include: a signal peptide sequence at about amino acids 1-24; transmembrane domains at about amino acids 11-25 and 226-243; a potential N-35 glycosylation site at about amino acids 182-185, potential cAMP- and cGMP-dependent protein kinase phosphorylation site at about amino acids 70-73; and potential N-myristoylation sites at about amino acids 29-34, 35-39,117-122,121-126,125-130,154-159,166-171,241-246,246-251,247-252,249-254,250-255,251-256,
465
WO 00/12708 PCT/US99/20111
252-257, 253-258, 254-259, 255-260, 256-261, 257-262, and 259-264.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of die full-length sequence shown in Figure 216 (SEQ ID NO:372), revealed some homology between tbe PR01556 amino acid sequence and the following Dayhoff sequences: T8F5_4, R23B MOUSE, CANS HUMAN, P W41640, DSU51091_1, TP2B_CHICK, DVU20660_1, S43296, P_R23962, and 5 BRN1_HUMAN.
Clone DNA76529-1666 was deposited with the ATCC on October 6, 1998, and is assigned ATCC deposit no. 203315.
EXAMPLE 112: Isolation of cDNA clones Encoding Human PRQ1759 10 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST
cluster sequence from the Incyte database, designated DNA10571. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One or more of the ESTs was derived from pooled eosinophils of allergic asthmatic 15 patients. The homology search was performed using the computer program BLAST or BLAST2 (Altshul etal.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA57313.
In light of the sequence homology between the DNA57313 sequence and the Incyte EST 2434255, the clone including this EST was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 217 and is herein designated as DNA76531-1701.
The full length clone shown in Figure 217 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 125-127 and ending at the stop codon found at nucleotide 25 positions 1475-1477 (Figure 217; SEQ ID NO:373). The approximate locations of the signal peptide and transmembrane domains are indicated in Figure 218, whereas the approximate locations for N-myristoylation sites, a lipid attachment site, an amidation site and a kinase phosphorylation site are indicated in Figure 218. The predicted polypeptide precursor (Figure 218, SEQ ID NO:374) is 450 amino acids long. PR01759 has a calculated molecular weight of approximately 49,765 daltons and an estimated pi of approximately 8.14. Clone 30 DNA76531-1701 was deposited with the ATCC on November 17, 1998 and is assigned ATCC deposit no. 203465.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 218 (SEQ ID N0:374), revealed sequence identity between the PR01759 amino acid sequence and the following Dayhoff sequences: OPDE_PSEAE, 35 TH11_TRYBB, S67684, RGT2YEAST, S68362, ATSUGTRPR_1, P W17836 (Patent application W09715668-A2), F69587, A48076, and A45611.
466
EXAMPLE 113: Isolation of cDNA clones Encoding Human PRQ1760
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. One 5 or more of the ESTs was derived from a prostate tumor library. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 10 designated DNAS8798.
In light of the sequence homology between DNA58798 sequence and the Incyte EST 33S874S, the clone including this EST was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 219 and is herein designated as DNA76S32-1702.
The full length clone shown in Figure 219 contained a single open reading frame with an apparent 15 translational initiation site at nucleotide positions 60-62 and ending at the stop codon found at nucleotide positions 624-626 (Figure 219; SEQ ID NO:375). The predicted polypeptide precursor (Figure 220, SEQ ID NO:376) is 188 amino acids long. Motifs are further indicated in Figure 220. PR01760 has a calculated molecular weight of approximately 21,042 daltons and an estimated pi of approximately 5.36. Clone DNA76532-1702 was deposited with the ATCC on November 17, 1998 and is assigned ATCC deposit no. 203473. 20 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 220 (SEQ ID NO:376), revealed sequence identity between the PRO1760 amino acid sequence and the following Dayhoff sequences: CELT07F12_2, T22J18_16, ATF1C12_3, APE3 YEAST, P_W22471, SAU56908J, SCPA_STRPY, ATAC00423817, SAPURCLUS_2 and AF041468_9.
EXAMPLE 114: Isolation of cDNA clones Encoding Human PRQ1561
A consensus DNA sequence was assembled relative to other EST sequences using phrap and repeated cycles of BLAST and phrap to extend a sequence as far as possible using the EST sequences discussed above as described in Example 1 above. This consensus sequence is herein designated DNA40630. Based on the 30 DNA40630 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01561.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (40630.fn 5'-CTGCCTCCACTGCTCTGTGCTGGG-3' (SEQ ID NO:379) 35 reverse PCR primer C40630.rn 5'-CAGAGCAGTGGATGTTCCCCTGGG-3' (SEQ ID N0:380)
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40630 sequence which had the following nucleotide sequence
467
hvhriHi?afif>Ti probe (4063Q.pl)
'-CTGAACAAGATGGTCAAGCAAGTGACTGGGAAAATGCCCATCCTC-3' (SEQ ID NO:38I)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01561 gene using the probe oligonucleotide and one of the PCR primers. RNA 5 for construction of the cDNA libraries was isolated from human breast tumor tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01S61 (designated herein as DNA76538-1670 [Figure 221, SEQ ID NO:377]; and the derived protein sequence for PR01561.
The entire nucleotide sequence of DNA76S38-1670 is shown in Figure 221 (SEQ ID NO:377). Clone 10 DNA76538-1670 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 29-31 and ending at the stop codon at nucleotide positions 377-379 (Figure 221). The predicted polypeptide precursor is 116 amino acids long (Figure 222). The full-length PR01561 protein shown in Figure 222 has an estimated molecular weight of about 12,910 daltons and a pi of about 6.41. Analysis of the fiill-length PR01561 sequence shown in Figure 222 (SEQ ID NO:378) evidences the presence of the following: a 15 signal peptide from about amino acid 1 to about amino acid 17, a transmembrane domain from about amino acid 1 to about amino acid 24, a potential N-glycosylation site from about amino acid 86 to about amino acid 89, potential N-myristolation sites from about amino acid 20 to about amino acid 25 and from about amino acid 45 to about amino acid SO and a phospholipase A2 histidine active site from about amino acid 63 to about amino acid 70. Clone DNA76538-1670 has been deposited with ATCC on October 6, 1998 and is assigned ATCC 20 deposit no. 203313.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 222 (SEQ ID N0:378), evidenced significant homology between the PR01561 amino acid sequence and the following Dayhoff sequences: PR63053, PJR25416, PJR63055, P_P93363, P_R63046, PA2A_VIPAA, P_W58476, GEN13747, PA2X_HUMAN and 25 PA2A_CRODU.
In addition to the above, a sequence homology search evidenced significant homology between the DNA40630 consensus sequence and Incyte EST clone no. 1921092. As such, Incyte EST clone no. 1921092 was purchased and the insert obtained and sequenced, thereby giving rise to the DNA76538-1670 sequence shown in Figure 221 (SEQ ID N0:377).
EXAMPLE 115: Isolation of cDNA clones Encoding Human PRQ1567
A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated DNA47580. The DNA47580 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 35 (LIFESEQ1", Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater
468
that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated "DNA57246".
In light of the sequence homology between the DNA57246 sequence and EST no. 1793996 from the LIFESEQ™ database, the clone containing the EST no. 1793996, which originates from a library constructed 5 from prostate tumor tissue, was purchased and the cDNA insert was obtained and sequenced. The sequence of this cDNA insert is shown in Figure 223 (SEQ ID NO:382) and is herein designated as DNA76541-1675.
A full length clone was identified that contained a single open reading frame with an apparent translational initiation site at nucleotide positions 109-111, and a stop signal at nucleotide positions 643-645 (Fig. 223; SEQ ID NO:382). The predicted polypeptide precursor is 178 amino acids long has a calculated molecular 10 weight of approximately 19,600 daltons and an estimated pi of approximately 5.89. Additional features include a signal peptide at about amino acids 1-22; a potential N-glycosylation site at about amino acids 167-170; a protein kinase C phosphorylation site at about amino acids 107-109; and potential N-myristoylation sites at about amino acids 46-51, 72-77, and 120-125.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 15 alignment analysis of the full-length sequence shown in Figure 224 (SEQ ID NO:383), evidenced significant homology between the PR01567 amino acid sequence and human colon specific gene CSG6 polypeptide designated Dayhoff sequence "P_W06549". Homology was also found between the PR01567 amino acid sequence and the following additional Dayhoff sequences: HUAC0Q2301_1, P_246880, A49685, SPBP RAT, S42924, SPBP MOUSE, 152115, MMU03711_1, and AF041468_31.
Clone DNA76541-1675 has been deposited with the ATCC on October 27,1998, and is assigned ATCC
deposit no. 203409.
EXAMPLE 116: Isolation of cDNA clones Encoding Human PRQ1693
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 25 in Example 1 above. This consensus sequence is herein designated DNA38251. Based on the DNA38251 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1693.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (38251 .fl) 5'-CTGGGATCTGAACAGTTTCGGGGC-3' (SEQ ID NO:386)
reverse PCR primer (38251.rl) 5'-GGTCCCCAGGACATGGTCTGTCCC-3' (SEQ ID NO:387) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA38251 sequence which had the following nucleotide sequence hybridization probe (38251.p1) 35 5' -GCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCG-3' (SEQ ID NO:388)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to
469
isolate clones encoding the PR01693 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01693 (designated herein as DNA77301-1708 [Figure 225, SEQ ID NO:384]; and the derived protein sequence for PR01693.
The entire nucleotide sequence of DNA77301-1708 is shown in Figure 225 (SEQ ID NO:384). Clone
DNA77301-1708 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 508-510 and ending at the stop codon at nucleotide positions 2047-2049 (Figure 225). The predicted polypeptide precursor is 513 amino acids long (Figure 226). The full-length PR01693 protein shown in Figure 226 has an estimated molecular weight of about 58,266 daltons and a pi of about 9.84. Analysis of the full-10 length PR01693 sequence shown in Figure 226 (SEQ ID NO:385) evidences the presence of the following: a signal peptide from about aminn acid 1 to about amino acid 33, a transmembrane domain from about amino acid 420 to about amino acid 442, potential N-glycosylation sites from about amino acid 126 to about amino acid 129, from about amino acid 357 to about amino acid 360, from about amino acid 496 to about amino acid 499 and from about amino acid 504 to about amino acid 507, a cAMP- and cGMP-dependent protein kinase 15 phosphorylation site from about amino acid 465 to about amino acid 468, a tyrosine kinase phosphorylation site from about amino acid 136 to about amino acid 142 and potential N-myristolation sites from about amino acid 11 to about amino acid 16, from about amino acid 33 to about aminn acid 38, from about amino acid 245 to about amino acid 250, from about amino acid 332 to about amino acid 337, from about amino acid 497 to about aminn acid 502 and from about amino acid 507 to about amino acid 512. Clone DNA77301-1708 has been 20 deposited with ATCC on October 27, 1998 and is assigned ATCC deposit no. 203407.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:385), evidenced significant homology between the PR01693 amino acid sequence and the following Dayhoff sequences: AB007876_1, ALS_MOUSE, HSCHON03_1, P_R85889, AF062006_1, AB014462_1, A58532, MUSLRRPA_1, AB007865_1 25 and AF030435_1.
EXAMPLE 117: Isolation of cDNA clones Encoding Human PRQ1784
A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated DNA43862. Based on the DNA43862 sequence, oligonucleotide probes were generated and used to screen a 30 human fetal kidney library prepared as described in paragraph 1 above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science. 253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer (ftt 5' -CTTTTCAGTGTCACCTCAGCGATCTC-3' (SEQ ID NO:391); and 35 reverse PCR primer (rl) 5' -CC AAAACATGGAGC AGG A AC AGG-3' (SEQ ID NO:392).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA43862 sequence which had the following nucleotide sequence:
470
hybridization probe (pi)
'-CCAGTTGGTGCTCTCGGACCTACCATGCGAAGAAGATGAAATGTGTG-3' (SEQ ID NO:393).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01784 gene using the probe oligonucleotide and one of the PCR primers.
A full length clone was identified that contained a single open reading frame with an apparent translationai initiation site at nucleotide positions 68-70, and a stop signal at nucleotide positions S06-S08 (Fig. 227; SEQ ID NO:389). The predicted polypeptide precursor is 146 amino acids long has a calculated molecular weight of approximately 16,116 daltons and an estimated pi of approximately 4.99. The approximate locations of the signal peptide, transmembrane domain and N-myristoylation site are indicated in Figure 228. Clone 10 DNA77303-2502 has been deposited with the ATCC and is assigned ATCC deposit no. 203479.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 228 (SEQ ID N0:390), evidenced sequence identity between the PR01784 amino acid sequence and the following Dayhoff sequences: RNU87224 1, RNAF000114_1, P W31947, S18038, AE001300_8, AF039833_1, P_W39833_1, P_W39788, HSU87223_1, 15 NTU06712 1, and P_W31946.
RYAMPT.F. 1 ]«• Isolation of cDNA clones Encoding Human PRQ1605
A cDNA clone (DNA77648-1688) encoding a native human PRO1605 polypeptide was identified by a yeast screen, in a human fetal kidney cDNA library that preferentially represents the 5' ends of the primary 20 cDNA clones.
The full-length DNA77648-1688 clone shown in Figure 229 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 425-427 and ending at the stop codon at nucleotide positions 845-847 (Figure 229). The predicted polypeptide precursor is 140 amino acids long (Figure 230). The full-length PRO1605 protein shown in Figure 230 has an estimated molecular weight of about 15,668 daltons 25 and a pi of about 10.14. Analysis of the full-length PR01605 sequence shown in Figure 230 (SEQ ID NO:395) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 26. Clone DNA77648-1688 has been deposited with ATCC on October 27, 1998 and is assigned ATCC deposit no. 203408.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 30 alignment analysis of the full-length sequence shown in Figure 230 (SEQ ID NO:395), evidenced significant homology between the PR01605 amino acid sequence and the following Dayhoff sequences: GNT5 HUMAN, PR48975, P_W22519, MM26SPROT_l, HSU86782_1, CH60_LEPIN, HMCTHELPY, F65126, HIU08875 1 and P_R41724.
EXAMPLE 119: Isolation of cDNA clones Encoding Human PRQ1788
The extracellular domain (ECD) sequences (including the secretion signal sequence, if any) from about 950 known secreted proteins from the Swiss-Prot public database were used to search EST databases. The EST
471
databases included public EST databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in Enzvmologv. 266:460-480 (1996)] as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequences. Incyte Clone No. 2968304 was identified as a sequence of interest having a BLAST score of 70 or greater that did not encode known proteins. The nucleotide sequence 5 of Incyte Clone No. 2968304 is designated herein as "DNA6612".
In addition, the DNA6612 sequence was extended using repeated cycles of BLAST and phrap (Phil Green, University of Washington, Seattle, Washington) to extend the sequence as far as possible using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as "DNA49648". Based on the DNA49648 consensus sequence, oligonucleotides were synthesized: 1) to identify 10 by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01788.
PCR primers (forward and reverse) were synthesized:
forward PCR nrimer: CCCTGCCAGCCGAGAGCTTCACC (49648.fl; SEQ ID NO:398)
reverse PCR primer: GGTTGGTGCCCGAAAGGTCCAGC (49648.rl; SEQ ID NO:399) IS Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus
DNA49648 sequence which had the following nucleotide sequence:
hvbridizationprobe: CAACCCCAAGCTTAACTGGGCAGGAGCTGAGGTGTTTTCAGGCQ49648.pl; SEQ ID N0:400)
In order to screen several libraries for a source of a full-length clone, DNA from tbe libraries was 20 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01788 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01788 (designated herein as DNA77652-2505 [Figure 231, SEQ ID NO:396]; and the derived protein 25 sequence for PRO1788.
The entire coding sequence of PRO1788 is shown in Figure 231 (SEQ ID NO:396). Clone DNA77652-2S0S contains a single open reading frame with an apparent translational initiation site at nucleotide positions 64-66 and an apparent stop codon at nucleotide positions 1123-1125. Thepredictedpolypeptideprecursoris3S3 amino acids long. The full-length PRO1788 protein shown in Figure 232 has an estimated molecular weight of 30 about 37,847 daltons and a pi of about 6.80. Additional features of PRO 1788 include: a signal peptide at about amino acids 1-16; transmembrane domains at about amino acids 215-232 and287-304; potential N-glycosylation sites at about amino acids 74-77 and 137-140; a glycosaminoglycan attachment site at about amino acids 45-48; a tyrosine kinase phosphorylation site at about amino acids 318-325; N-myristoylation sites at about amino acids 13-18, 32-37, 88-93, 214-219, and 223-228; and a leucine zipper pattern at about amino acids 284-305. 35 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 232 (SEQ ID NO:397), revealed significant homology between the PR01788 amino acid sequence and the following Dayhoff sequences: AF030435_1;
472
AF062006_1; DMTARTAN1; G ARP_HUM AN; S42799; P_R71294; HSU88879 1; DROWHEELER_l; A58532; and AF068920 1.
Clone DNA77652-2505 was deposited with the ATCC on November 17,1998, and is assigned ATCC deposit no. 203480.
EXAMPLE 120: Isolation of cDNA clones Encoding Human PRQ1801
A proprietary expressed sequence tag (EST) DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) was searched and an EST was identified which showed homology to the IL-19 protein. This EST sequence is Incyte EST clone no. 819592 and is herein designated DNA79293. Based on the DNA79293 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 10 of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01801.
PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-CTCCTGTGGTCTCCAGATTTCAGGCCTA-3' (SEQ ID N0:403)
reverse PCR primer 5'-AGTCCTCCTTAAGATTCTGATGTCAA-3' (SEQ ID N0:404)
RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue. The cDNA 15 libraries used to isolated the cDNA clones were constructed by standard methods using commercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; pRKSB is a precursor of pRK5D that does not contain the Sfil site; see, Holmes et al., Science. 253:1278-1280 20 (1991)) in the unique Xhol and NotI sites.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01801 (designated herein as DNA83500-2506 [Figure 233, SEQ ID N0:401]; and the derived protein sequence for PR01801.
The entire nucleotide sequence of DNA83500-2506 is shown in Figure 233 (SEQ ID N0:401). Clone 25 DNA83500-2506 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 109-111 and ending at the stop codon at nucleotide positions 892-894 (Figure 233). The predicted polypeptide precursor is 261 aminn acids long (Figure 234). The full-length PR01801 protein shown in Figure 234 has an estimated molecular weight of about 29,667 daltons and a pi of about 8.76. Analysis of the full-length PR01801 sequence shown in Figure 234 (SEQ ID N0:402) evidences the presence of the following: a 30 signal peptide from about amino acid 1 to about amino acid 42, cAMP- and cGMP-dependent protein kinase phosphorylation sites from about amino acid 192 to about amino acid 195 and from about amino acid 225 to about amino acid 228 and potential N-myristolation sites from about aminn acid 42 to about amino acid 47, from about amino acid 46 to about amino acid 51 and from about amino acid 136 to about aminn acid 141. Clone DNA83500-2506 has been deposited with ATCC on October 29, 1998 and is assigned ATCC deposit no. 35 203391.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 234 (SEQ ID N0:402), evidenced significant
473
homology between the PR01801 aminn acid sequence and the following Dayhoff sequences: P_W37935, HGSJB477, P_R32277, IL10JV1ACFA, P_W46585, P_R39714, P_R71471, P_R10159, IL10_RAT and PW57201.
EXAMPLE 121: Isolation of cDNA clones Encoding Human UCP4 5 EST databases, which included public EST databases (e.g., GenBank), and a proprietary EST database
(LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA), were searched for sequences having homologies to human UCP3. The search was performed using the computer program BLAST or BLAST2 [Altschul et al., Methods in F.nrvmnlngv. 266:460-480 (1996)] as a comparison of the UCP3 protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater 10 that did not encode known proteins were clustered and assembled into consensus DNA sequences with the program AssemblLIGN and MacVector (Oxford Molecular Group, Inc.).
A DNA sequence ("fromDNA") was assembled relative to other EST sequences using AssemblLIGN software. In addition, the fromDNA sequence was extended using repeated cycles of BLAST and AssemblLIGN to extend the sequence as far as possible using the sources of EST sequences discussed above. Based on this 15 DNA sequence, oligonucleotides were synthesized to isolate a clone of the full-length coding sequences for UCP4 by PCR. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The probe sequences are typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus sequence is greater than about l-1.5kbp.
PCR primers (forward and reverse) were synthesized:
forward PCR primer CGCGGATCCCGTTATCGTCTTGCGCTACTGC (SEQ ID N0:407)
reverse PCR Primer GCGGAATTCTTAAAATGGACTGACTCCACTCATC (SEQ ID N0:408)
RNA for construction of the cDNA libraries was isolated from brain tissue. The cDNA libraries used to isolated the cDNA clones were constructed by standard methods using commercially available reagents such 25 as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al., Science. 253:1278-1280 (1991)) in the unique Xhol and NotI sites.
DNA sequencing of the clone isolated by PCR as described above gave the full-length DNA sequence for UCP4 (designated herein as DNA77568-1626 [Figure 235, SEQ ID N0:405] and the derived protein sequence for UCP4.
The entire coding sequence of UCP4 is shown in Figure 235 (SEQ ID N0:405). Clone DNA77568-1626 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 35 27-29, and an apparent stop codon at nucleotide positions 996-998. (See Figure 235; SEQ ID N0:405). The predicted polypeptide precursor is 323 amino acids long. It is presently believed that UCP4 is a membrane-bound protein and contains at least 6 transmembrane regions. These putative transmembrane regions in the
474
UCP4 amino acid sequence are illustrated in Figure 236. Clone DNA77S68-1626, contained in tbe pcDNA3 vector (Invitrogen) has been deposited with ATCC and is assigned ATCC deposit no. 203134. UCP4 polypeptide is obtained or obtainable by expressing the molecule encoded by the cDNA insert of the deposited ATCC 203134 vector. Digestion of the vector with BamHI and EcoRI restriction enzymes will yield an approximate 972 plus 34 bp insert. The full-length UCP4 protein shown in Figure 236 has an estimated 5 molecular weight of about 36,061 daltons and a pi of about 9.28.
EXAMPLE 122: Isolation of cDNA clones Encoding Human PRQ193
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. Based on this consensus sequence, oligonucleotides were synthesized: 1) to identify by 10 PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0193.
A pair of PCR primers (forward and reverse) were synthesized:
forward PCR primer 5'-GTTTGAGGAAGCTGGGATAC-3' (SEQ ID NO:411); and reverse PCR primer 5'-CCAAACTCGAGCACCTGTTC-3' (SEQ ID NO:412).
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus sequence which had the following nucleotide sequence:
hybridization probe
'-ATGGCAGGCTTCCTAGATAATTTTCGTTGGCCAGAATGTG-3' (SEQ ID NO:413).
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 20 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR0193 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human retina tissue (LIB94).
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0193 [herein designated as DNA23322-1393] (SEQ ID N0:409) and the derived protein sequence for 25 PR0193.
The entire nucleotide sequence of DNA23322-1393 is shown in Figure 237 (SEQ ID N0:409). Clone DNA23322-1393 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 138-140 and ending at the stop codon at nucleotide positions 612-614 (Figure 237). The predicted polypeptide precursor is 158 amino acids long (Figure 238). The full-length PR0193 protein shown in Figure 30 238 has an estimated molecular weight of about 17,936 and a pi of about 5.32. Clone DNA23322-1393 has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains the correct sequence, and the sequences provided herein are based on known sequencing techniques.
Still analyzing the amino acid sequence of SEQ ID N0:410, transmembrane domains are at about amino acids 23-42,60-80,97-117 and 128-148 of SEQ ID NO:410. A cell attachment sequence is at about amino acids 35 81-83 of SEQ ID N0:410. A peroxidase proximal heme-ligand domain is at about amino acids 81-83 of SEQ ID NQ:410. The corresponding nucleotides can be routinely determined given the sequences provided herein.
475
EXAMPLE 123: Isolation of cDNA clones Encoding Human PROl 130
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example 1 above. This consensus sequence is herein designated DNA34360. Based on the DNA34360 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 5 PRO 1130.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (34360.fn 5'-GCCATAGTCACGACATGGATG-3' (SEQ ID NO:416)
forward PCR nrimer (34360. fT> 5'-GGATGGCCAGAGCTGCTG-3' (SEQ ID NO:417)
forward PCR primer (3436Q.f31 5'-AAAGTACAAGTGTGGCCTCATCAAGC-3' (SEQ ID NO:418) 10 reverse PCR primer (3436Q.rn 5'-TCTGACTCCTAAGTCAGGCAGGAG-3' (SEQ ID NO:419)
reverse PCR primer (3436Q.rTl 5'-ATTCTCTCCACAGACAGCTGGTTC'3' (SEQ ID N0:420) Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA34360 sequence which had the following nucleotide sequence hybridization probe (34360.pl)
5' -GTAC AAGTGTGGCCTC ATC AAGCCCTGCCCAGCC A ACT ACTTTGCG-3' (SEQ ID NO:421)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PROl 130 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human aortic endothelial cell tissue. 20 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for
PR01130 (designated herein as DNA59814-1486 [Figure 239, SEQ ID NO:414]; and the derived protein sequence for PRO 1130.
The entire nucleotide sequence of DNA59814-1486 is shown in Figure 239 (SEQ ID NO:414). Clone DNA59814-1486 contains a single open reading frame with an apparent translational initiation site at nucleotide 25 positions 312-314 and ending at the stop codon at nucleotide positions 984-986 (Figure 239). The predicted polypeptide precursor is 224 amino acids long (Figure 240). The full-length PROl 130 protein shown in Figure 240 has an estimated molecular weight of about 24,963 daltons and a pi of about 9.64. Analysis of the full-length PROl 130 sequence shown in Figure 240 (SEQ ID NO:415) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 15, an ATP/GTP-binding site motif A from about 30 amino acid 184 to about amino acid 191 and a potential N-glycosylation site from about amino acid 107 to about amino acid 110. Clone DNA59814-1486 has been deposited with ATCC on October 20,1998 and is assigned ATCC deposit no. 203359.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 240 (SEQ ID NO:415), evidenced significant 35 homology between the PROl 130 amino acid sequence and the following Dayhoff sequences: PW06547, 216 HUMAN, D87120 1, MMU72677J, LAU04889 1, andD69319.
476
EXAMPLE 124: Isolation of cDNA clones Encoding Human PRO 1335
A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in Example l above. This consensus sequence is herein designated DNA35727. Based on the DNA35727 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 5 PR01335.
PCR primers (forward and reverse) were synthesized:
forward PCR primer (35727.fl) 5'-GTAAAGTCGCTGGCCAGC-3' (SEQ ID NO:424)
forward PCR primer (35727.f2) 5'-CCCGATCTGCCTGCTGTA-3' (SEQ ID NO:425)
reverse PCR primer (35727 .rl) 5'-CTGCACTGTATGGCCATTATTGTG-3' (SEQ ID NO:426) 10 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35727 sequence which had the following nucleotide sequence hybridization probe (35727.pl)
'-CAGAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCC-3' (SEQ ID NO:427)
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 15 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones encoding the PR01335 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of the cDNA libraries was isolated from human retina tissue.
DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR01335 (designated herein as DNA62812-1594 [Figure 241, SEQ ID NO:422]; and the derived protein 20 sequence for PR01335.
The entire nucleotide sequence of DNA62812-1594 is shown in Figure 241 (SEQ ID NO:422). Clone DNA62812-1594 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 271-273 and ending at the stop codon at nucleotide positions 1282-1284 (Figure 241). The predicted polypeptide precursor is 337 amino acids long (Figure 242). The full-length PR01335 protein shown in Figure 25 242 has an estimated molecular weight of about 37,668 daltons and a pi of about 6.27. Analysis of the full-length PR01335 sequence shown in Figure 242 (SEQ ID NO:423) evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 15, a transmembrane domain from about amino acid 291 to about amino acid 310, a potential N-glycosylation site from about amino acid 213 to about amino acid 216 and amino acid sequence blocks having homology to eukaryotic-type carbonic anhydrase proteins from about 30 amino acid 197 to about amino acid 245, from about amino acid 104 to about amino acid 140 and from about amino acid 22 to about amino acid 69. Clone DNA62812-1594 has been deposited with ATCC on September 9, 1998 and is assigned ATCC deposit no. 203248.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 242 (SEQ ID NO:423), evidenced significant 35 homology between the PR01335 amino acid sequence and the following Dayhoff sequences: AF037335_1, 138013, PTPG_MOUSE, CAH2 HUMAN , 1CAC, CAH7_HUMAN, CAH3_HUMAN, CAH1HUMAN, CAH5 HUMAN and P R41746.
477
EXAMPLE 125: Isolation of cDNA clones Encoding Human PRQ1329
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST cluster sequence from the LIFESEQ* database, designated Incyte Cluster No. 167544, also referred herein as "DNA10680". This EST cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 5 (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). One or more of the ESTs was derived from a 10 cDNA library constructed from RNA isolated from synovial membrane tissue removed from the elbow of a female with rheumatoid arthritis. The consensus sequence obtained therefrom is herein designated "DNA58836".
In light of the sequence homology between the DNA58836 sequence and a sequence contained within the Incyte EST clone no. 368774, EST clone 368774 was purchased and the cDNA insert was obtained and 15 sequenced. The sequence of this cDNA insert is shown in Figure 243 and is herein designated as DNA66660-1585.
The Ml length clone shown in Figure 243 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 90 to 92 and ending at the stop codon found at nucleotide positions 717 to 719 (Figure 243; SEQ ID NO:428). The predicted polypeptide precursor (Figure 244, SEQ 20 ID NO:429) is 209 amino acids long, with a signal sequence at about amino acids 1-16. PR01329 has a calculated molecular weight of approximately 21,588 daltons and an estimated pi of approximately 5.50. Clone DNA66660-1585 was deposited with the ATCC on September 22, 1998 and is assigned ATCC deposit no. 203279.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 25 alignment analysis of the full-length sequence shown in Figure 244 (SEQ ID NO:429), revealed some homology between the PR01329 amino acid sequence and the following Dayhoff sequences: CELK06A9 3, PROA_XANCP, CXU21300_4, MTV037_17, SYN1_RAT, 156542, S60743, BNOLE3_l, AB001573_1, and PJP80671.
F.XAMPT.F. 17fi- Isolation of cDNA clones Encoding Human PRQ1550
Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST sequence from the Merck database, designated CELT15B712, also referred herein as "DNA10022". This EST sequence was then compared to a variety of expressed sequence tag (EST) databases which included public and proprietary EST databases (e.g., GenBank and LIFESEQ*) to identify existing homologies. The homology 35 search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the
478
I
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein designated "DNA55708".
In light of die sequence homology between the DNA55708 sequence and a sequence contained within Incyte EST no. 3411659, die EST clone 3411659 was purchased and the cDNA insert was obtained and sequenced in its entirety. The sequence of this cDNA insert is shown in Figure 245 and is herein designated as "DNA76393-1664".
The Ml length clone shown in Figure 245 contained a single open reading frame with an apparent translational initiation site at nucleotide positions 138 to 140 and ending at the stop codon found at nucleotide positions 867 to 869 (Figure 245; SEQ ID NO:231). The predicted polypeptide precursor (Figure 246, SEQ ID NO:232) is 243 amino acids long. Other features of the PR01550 protein include: a signal sequence at about amino acids 1-30; a hydrophobic domain at about amino acids 195-217; and a potential N-glycosylation site at about amino acids 186-189. PRO 1550 has a calculated molecular weight of approximately 26,266 daltons and an estimated pi of approximately 8.43. Clone DNA76393-1664 was deposited with the ATCC on October 6, 1998, and is assigned ATCC deposit no. 203323.
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence alignment analysis of the full-length sequence shown in Figure 246 (SEQ ID NO:232), revealed some homology between the PR01550 amino acid sequence and the following Dayhoff sequences: CELF59E1211; CA24ASCSU; AF018082J; CA13_BOVIN; CA54HUMAN; CA34 HUMAN; HUMCOL7AlX_l; P W09643; AF053538_1; and HSEMCXIV2_1.
EXAMPLE 127: Use of PRO as a hybridization probe
The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as a probe to screen for homologous DN As (such as those encoding naturally-occurring variants of PRO) in human tissue cDNA libraries or human tissue genomic libraries.
Hybridization and washing of filters containing either library DNAs is performed under the following high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a solution of 50% formamide, 5x SSC, 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM sodium phosphate, pH 6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42°C for 20 hours. Washing of the filters is performed in an aqueous solution of 0. lx SSC and 0.1 % SDS at 42°C.
DNAs having a desired sequence identity with the DNA encoding full-length native sequence PRO can then be identified using standard techniques known in the art.
EXAMPLE 128: Expression of PRO in E. coli
This example illustrates preparation of an unglycosylated form of PRO by recombinant expression in
E. coli.
The DNA sequence encoding PRO is initially amplified using selected PCR primers. The primers should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected
479
r^uECTU^rjqoP^
i ofcjct of N.Z
2 4 OCT 2003 received
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is pBR322 (derived from E. coli; see Bolivar et al., Gene. 2:95 (1977)) which contains genes for ampicillin and tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PCR amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 5 sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an argU gene.
The ligation mixture is then used to transform a selected E. coli strain using the methods described in Sambrook et al., supra. Transfbrmants are identified by their ability to grow on LB plates and antibiotic resistant colonies are then selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 10 sequencing.
Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with antibiotics. The overnight culture may subsequently be used to inoculate a larger scale culture. The cells are then grown to a desired optical density, during which the expression promoter is turned on.
After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 15 pellet obtained by the centrifugation can be solubilized using various agents known in the art, and the solubilized PRO protein can then be purified using a metal chelating column under conditions that ailow tight binding of the protein.
PRO may be expressed in E. coli in a poly-His tagged form, using the following procedure. The DNA encoding PRO is initially amplified using selected PCR primers. The primers will contain restriction enzyme 20 sites which correspond to the restriction enzyme sites on the selected expression vector, and other useful sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation column, and proteolytic removal with enterokinase. The PCR-amplified, poly-His tagged sequences are then ligated into an expression vector, which is used to transform an E. coli host based on strain 52 (W3110 fuhA(tonA) Ion galE rpoHts(htpRts) clpP(lacIq). Transformants are first grown in LB containing 50 mg/ml 25 carbenicillin at 30°C with shaking until an 0,D.600of3-5 is reached. Cultures are then diluted 50-100 fold into CRAP media (prepared by mixing 3.57 g (NH^SO^ 0.71 g sodium citrate*2H20, 1.07 g KC1, 5.36 g Difco yeast extract, 5.36 g Sheffield hycase SF in 500 mL water, as well as 110 mM MPOS, pH 7.3, 0.55% (w/v) glucose and 7 mM MgSO*) and grown for approximately 20-30 hours at 30°C with shaking. Samples are removed to verify expression by SDS-PAGE analysis, and the bulk culture is centrifuged to pellet the cells. Cell 30 pellets are frozen until purification and refolding.
E. coli paste from 0.5 to I L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M guanidine, 20 mM Tris, pH 8 buffer. Solid sodium sulfite and sodium tetrathionate is added to make final concentrations of 0.1M and 0.02 M, respectively, and the solution is stirred overnight at 4°C. This step results in a denatured protein with all cysteine residues blocked by sulfitolization. The solution is centrifuged at 40,000 35 rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate column buffer (6 M guanidine, 20 mM Tris, pH 7.4) and filtered through 0.22 micron filters to clarify. The clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate
480
column buffer. The column is washed with additional buffer containing SO mM imidazole (Calbiochem, Utrol grade), pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the desired protein are pooled and stored at 4°C. Protein concentration is estimated by its absorbance at 280 nm using the calculated extinction coefficient based on its amino acid sequence.
The proteins are refolded by diluting the sample slowly into freshly prepared refolding buffer consisting 5 of: 20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and 1 mM EDTA. Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The refolding solution is stirred gently at 4°C for 12-36 hours. The refolding reaction is quenched by the addition of TFA to a final concentration of 0.4 % (pH of approximately 3). Before further purification of the protein, the solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 10 refolded protein is chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0.1% TFA with eiution with a gradient of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance are analyzed on SDS polyacrylamide gels and fractions containing homogeneous refolded protein are pooled. Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile since those species are the most compact with their hydrophobic interiors shielded from interaction with the IS reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition to resolving misfolded forms of proteins from the desired form, the reversed phase step also removes endotoxin from the samples.
Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 20 0.14 M sodium chloride and 4% mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins equilibrated in the formulation buffer and sterile filtered.
Many of the PRO polypeptides disclosed herein were successfully expressed as described above.
EXAMPLE 129: Expression of PRO in mammalian cells 25 This example illustrates preparation of a potentially glycosylated form of PRO by recombinant expression in mammalian cells.
The vector, pRK5 (see EP 307,247, published March 15,1989), is employed as the expression vector. Optionally, the PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insertion of the PRO DNA using ligation methods such as described in Sambrook et al., supra. The resulting vector is called pRK5-30 PRO.
In oik embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are grown to confluence in tissue culture plates in medium such as DMEM supplemented with fetal calf serum and optionally, nutrient components and/or antibiotics. About 10 ng pRK5-PRO DNA is mixed with about 1 pg DNA encoding the VA RNA gene [Thimmappaya et al., Cell. 31:543 (1982)] and dissolved in 500 /tl of 1 mM 35 Tris-HCl, 0.1 mM EDTA, 0.227 M CaCl2. To this mixture is added, dropwise, 500 jil of 50 mM HEPES (pH 7.35), 280 mM NaCl, 1.5 mM NaPO*, and a precipitate is allowed to form for 10 minutes at 25°C. The precipitate is suspended and added to the 293 cells and allowed to settle for about four hours at 37"C. The
481
PCT/US99/201I1
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days.
Approximately 24 hours after the transfections, the culture medium is removed and replaced with culture medium (alone) or culture medium containing 200 pCi/ml 33S-cysteine and 200 pCi/ml33S-methionine. After a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15% 5 SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal the presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in serum free medium) and the medium is tested in selected bioassays.
In an alternative technique, PRO may be introduced into 293 cells transiently using the dextran sulfate method described by Somparyrac et al., Proc. Natl. Acad. Sci.. 12:7575 (1981). 293 cells are grown to 10 maximal density in a spinner flask and 700 fig pRK5-PRO DNA is added. The cells are first concentrated from the spinner flask by centrifugation and washed with PBS. The DNA-dextran precipitate is incubated on the cell pellet for four hours. The cells are treated with 20% glycerol for 90 seconds, washed with tissue culture medium, and re-introduced into the spinner flask containing tissue culture medium, 5 /tg/ml bovine insulin and 0.1 pg/ml bovine transferrin. After about four days, the conditioned media is centrifuged and filtered to remove IS cells and debris. The sample containing expressed PRO can then be concentrated and purified by any selected method, such as dialysis and/or column chromatography.
In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into CHO cells using known reagents such as CaP04 or DEAE-dextran. As described above, the cell cultures can be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 20 MS-methionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medium is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected method.
Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 25 pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a polyhis tag into a Baculovirus expression vector. The poly-his tagged PRO insert can then be subcloned into a SV40 driven vector containing a selection marker such as DHFR for selection of stable clones. Finally, the CHO cells can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described above, to verify expression. The culture medium containing the expressed poly-His tagged PRO can then be 30 concentrated and purified by any selected method, such as by Ni2+-chelate affinity chromatography.
PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or in CHO cells by another stable expression procedure.
Stable expression in CHO cells is performed using the following procedure. The proteins are expressed as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 35 domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge, CH2 and CH2 domains and/or is a poly-His tagged form.
Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using
482
standard techniques as described in Ausubel et al., Current Protocols of Molecular Biology. Unit 3.16, John Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' of the DNA of interest to allow the convenient shuttling of cDNA's. The vector used expression in CHO cells is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 5 expression permits selection for stable maintenance of the plasmid following transfection.
Twelve micrograms of the desired plasmid DNA is introduced into approximately 10 million CHO cells using commercially available transfection reagents Superfect* (Quiagen), Dosper' or Fugene* (Boehringer Mannheim). The cells are grown as described in Lucas et al., supra. Approximately 3 x IO"7 cells are frozen in an ampule for further growth and production as described below.
The ampules containing the plasmid DNA are thawed by placement into water bath and mixed by vortexing. The contents are pipetted into a centrifuge tube containing 10 mLs of media and centrifuged at 1000 rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 ^m filtered PS20 with 5% 0.2 /um diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 15 filled with 150 mL selective growth medium and incubated at 37°C. After another 2-3 days, 250 mL, 500 mL and 2000 mL spinners are seeded with 3 x 10s cells/mL. The cell media is exchanged with fresh media by centrifugation and resuspension in production medium. Although any suitable CHO media may be employed, a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may actually be used. A 3L production spinner is seeded at 1.2 x lCcells/mL. On day 0, the cell number pH ie determined. On day 20 1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the temperature shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% polydimethylsiloxane emulsion, Dow Coming 365 Medical Grade Emulsion) taken. Throughout the production, the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 70 %, the cell culture is harvested by centrifugation and filtering through a 0.22 nm filter. The filtrate was either 25 stored at 4°C or immediately loaded onto columns for purification.
For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before purification, imidazole is added to the conditioned media to a concentration of 5 mM. The conditioned media is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4, buffer containing 0.3 M NaCl and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4°C. After loading, the column is washed with additional 30 equilibration buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly purified protein is subsequently desalted into a storage buffer containing 10 mM Hepes, 0.14 M NaCl and 4% mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C.
Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 35 mM Na phosphate buffer, pH 6.8. After loading, the column is washed extensively with equilibration buffer before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 ml fractions into tubes containing 275 /uL of 1 M Tris buffer, pH 9. The highly purified protein is subsequently
483
desalted into storage buffer as described above for the poly-His tagged proteins. The homogeneity is assessed by SDS polyacrylamide gels and by N-tenninal aminn acid sequencing by Edman degradation.
Many of the PRO polypeptides disclosed herein were successfully expressed as described above.
EXAMPLE 130: Expression of PRO in Yeast 5 The following method describes recombinant expression of PRO in yeast.
First, yeast expression vectors are constructed for intracellular production or secretion of PRO from the ADH2/GAPDH promoter. DNA encoding PRO and the promoter is inserted into suitable restriction enzyme sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can be cloned into the selected plasmid, together with DNA encoding the ADH2/GAPDH promoter, a native PRO 10 signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory signal/leader sequence, and linker sequences (if needed) for expression of PRO.
Yeast cells, such as yeast strain ABl 10, can then be transformed with the expression plasmids described above and cultured in selected fermentation media. The transformed yeast supernatants can be analyzed by precipitation with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with IS Coomassie Blue stain.
Recombinant PRO can subsequently be isolated and purified by removing the yeast cells from the fermentation medium by centrifugation and then concentrating the medium using selected cartridge filters. The concentrate containing PRO may further be purified using selected column chromatography resins.
Many of the PRO polypeptides disclosed herein were successfully expressed as described above.
EXAMPLE 131: Expression of PRO in Baculovirus-Infected Insect Cells
The following method describes recombinant expression of PRO in Baculovirus-infected insect cells.
The sequence coding for PRO is fused upstream of an epitope tag contained within a baculovirus expression vector. Such epitope tags include po]y-his tags and immunoglobulin tags (like Fc regions of IgG). 25 A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired portion of die coding sequence of PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to the 5' and 3' regions. The 5' primer may incorporate flanking (selected) restriction enzyme sites. The product 30 is then digested with those selected restriction enzymes and subcloned into the expression vector.
Recombinant baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus DNA (Pharmingen) into Spodopterafrugiperda ("Sf9") cells (ATCC CRL 1711) using lipofectin (commercially available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used for further amplifications. Viral infection and protein expression are performed as described by O'Reilley et 35 al., Baculovirus expression vectors: A Laboratory Manual. Oxford: Oxford University Press (1994).
Expressed poly-his tagged PRO can then be purified, for example, by Ni2+-chelate affinity chromatography as follows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by
484
Rupen et al., Nature. 362:175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 mL Hepes, pH 7.9; 12.5 mM MgCl2; 0.1 mM EDTA; 10% glycerol; 0.1% NP-40; 0.4 M KC1), and sonicated twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered through a 0.45 ura filter. A Ni2+-NTA agarose column (commercially available from Qiagen) is prepared with a bed volume of 5 5 mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is loaded onto the column at 0.5 mL per minute. The column is washed to baseline A280 with loading buffer, at which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching A2io baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 10 buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with Ni2+-NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted His,0-tagged PRO are pooled and dialyzed against loading buffer.
Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known chromatography techniques, including for instance, Protein A or protein G column chromatography. 15 Many of the PRO polypeptides disclosed herein were successfully expressed as described above.
EXAMPLE 132: Preparation of Antibodies that Bind PRO
This example illustrates preparation of monoclonal antibodies which can specifically bind PRO.
Techniques for producing the monoclonal antibodies are known in the art and are described, for 20 instance, in Goding, supra. Immunogens that may be employed include purified PRO, fusion proteins containing PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by the skilled artisan without undue experimentation.
Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's adjuvant and injected subcutaneously or intraperitoneal^ in an amount from 1-100 micrograms. Alternatively, 25 the immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and injected into the animal's hind foot pads. Tbe immunized mice are then boosted 10 to 12 days later with additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be boosted with additional immunization injections. Serum samples may be periodically obtained from the mice by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies.
After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells are harvested. The spleen cells are then fused (using 35% polyethylene glycol) to a selected murine myeloma cell line such as P3X63AgU.l, available fiom ATCC, No. CRL 1597. The fusions generate hybridoma cells which can then be plated in 96 well tissue culture plates containing HAT (hypoxanthine, aminopterin, and 35 thymidine) medium to inhibit proliferation of non-fused cells, myeloma hybrids, and spleen cell hybrids.
The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determination of "positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the art.
485
The positive hybridoma cells can be injected intraperitoneal^ into syngeneic Balb/c mice to produce ascites containing the anti-PRO monoclonal antibodies. Alternatively, the hybridoma cells can be grown in tissue culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be accomplished using ammonium sulfate precipitation, followed by gel exclusion chromatography. Alternatively, affinity chromatography based upon binding of antibody to protein A or protein G can be employed.
EXAMPT.F. 133: Purification of PRO Polypeptides Using Specific Antibodies
Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in the art of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide is purified by immunoaffinity chromatography using antibodies specific for the PRO polypeptide of interest. In 10 general, an immunoaffinity column is constructed by covalently coupling the anti-PRO polypeptide antibody to an activated chromatographic resin.
Polyclonal immunoglobulins are prepared from immune sera either by precipitation with ammonium sulfate or by purification on immobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, N.J.). Likewise, monoclonal antibodies are prepared from mouse ascites fluid by ammonium sulfate precipitation or 15 chromatography on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a chromatographic resin such as CnBr-activated SEPHAROSE™ (Pharmacia LKB Biotechnology). The antibody is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacturer's instructions.
Such an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction 20 from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the whole cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may be secreted in useful quantity into the medium in which the cells are grown.
A soluble PRO polypeptide-containing preparation is passed over the immunoaffinity column, and the 25 column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt antibody/PRO polypeptide binding (e.g., a low pH buffer such as approximately pH 2-3, or a high concentration of a chaotrope such as urea or thiocyanate ion), and PRO polypeptide is collected.
EXAMPLE 134: Drug Screening
This invention is particularly useful for screening compounds by using PRO polypeptides or binding fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed in such a test may either be free in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 35 transformed with recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be used for standard binding assays. One may measure, for example, the formation of complexes between PRO
486
WO 00/12708 PCT/US99/20111
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested.
Thus, the present invention provides methods of screening for drugs or any other agents which can affect a PRO polypeptide-associated disease or disorder. These methods comprise contacting such an agent with an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and 5 the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular agent to bind to PRO polypeptide or to interfere with the PRO poiypeptide/cell complex. 10 Another technique for drug screening provides high throughput screeningfor compounds having suitable binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13,1984. Briefly stated, large numbers of different small peptide test compounds are synthesized on a solid substrate, such as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art. IS Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the solid support.
This invention also contemplates the use of competitive drug screening assays in which neutralizing antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 20 polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any peptide which shares one or more antigenic determinants with PRO polypeptide.
EXAMPLE 13.5: Rational Drue Design
The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 25 interest (i.e., a PRO polypeptide) or of small molecules with which they interact, e.g., agonists, antagonists, or inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f., Hodgson, Bio/Technology. 9: 19-21 (1991)).
In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 30 polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascertained to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding the structure of the PRO polypeptide may be gained by modeling based on the structure of homologous proteins. In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to 35 identify efficient inhibitors. Useful examples of rational drug design may include molecules whichhave unproved activity or stability as shown by Braxton and Wells, Biochemistry. 21:7796-7801 (1992) ot which act as inhibitors, agonists, or antagonists of native peptides as shown by Athauda et al., J. Biochem.. 113:742-746
487
(1993).
It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent drug design can be based. It is possible to bypass protein crystallography altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically active antibody. As a minor image of a mirror image, the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated peptides would then act as the pharmacore.
By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available to perform such analytical studies as X-ray crystallography. In addition, knowledge of the PRO polypeptide amino acid sequence provided herein will provide guidance to those employing computer modeling techniques in place of or in addition to x-ray crystallography.
Deposit of Material
The following materials have been deposited with the American Type Culture Collection, 10801 University Blvd., Manassas, VA 20110-2209, USA (ATCC):
Material
Tid>l?2
ATCC Dep. No.
Deposit Date
DNA26846-1397
203406
October 27, 1998
DNA56107-1415
203405
October 27, 1998
DNA56406-1704
203478
November 17, 1998
DNA56529-1647
203293
September 29, 1998
DNA56531-1648
203286
September 29, 1998
DNA56862-1343
203174
September 1, 1998
DNA57254-1477
203289
September 29, 1998
DNA57841-1522
203458
November 3, 1998
DNA58727-1474
203171
September 1, 1998
DNA58730-1607
203221
September 15, 1998
DNA58732-1650
203290
September 29, 1998
DNA58828-1519
203172
September 1, 1998
DNA58852-1637
203271
September 22, 1998
DNA59212-1627
203245
September 9,1998
DNA59218-1559
203287
September 29, 1998
DNA59219-1613
203220
September 15,1998
DNA59586-1520
203288
September 29,1998
DNA59817-1703
203470
November 17,1998
DNA60278-1530
203170
September 1, 1998
DNA60608-1577
203126
August 18, 1998
40
DNA60611-1524
203175
September 1, 1998
DNA60618-1557
203292
September 29, 1998
DNA60740-1615
203456
November 3, 1998
DNA60764-1533
203452
November 10,1998
DNA60775-1532
203173
September 1, 1998
45
DNA61185-1646
203464
November 17, 1998
DNA61608-1606
203239
September 9, 1998
DNA62808-1326
203358
October 2fci19M m.
488
,TEo!^ALOFp«oreBTy "OCT 2003
_recei\/Fn
40
45
50
55
DNA62809-1531 DNA62815-1578 DNA62845-1684 DNA64842-1632 DNA64849-I604 DNA64863-1573 DNA64881-1602
203237 203247 203361 203278 203468 203251 203240
September 9, 1998 September 9, 1998 October 20, 1998 September 22, 1998 November 17, 1998 September 9, 1998 September 9, 1998
DNA64886-1601 DNA64888-1542
DNA64897-1628 DNA64902-1667
203241 203249
203216 203317
September 9, 1998 September 9, 1998
September 15, 1998 October 6, 1998
DNA64950-1590 DNA64952-1568 DNA65402-1540 DNA65403-1565 DNA65404-1551 DNA65405-1547 DNA65406-1567 DNA65408-1578
DNA65410-1569 DNA65423-1595 DNA66304-1546 DNA66511-1411 DNA66512-1564 DNA66519-1535 DNA66520-1536 DNA66521-1583 DNA66526-1616 DNA66658-1584 DNA66659-1593 DNA66663-1598 DNA66669-1597 DNA66672-1586 DNA66674-1599 DNA66675-1587 DNA67962-1649 DNA68836-1656 DNA68864-1629 DNA68866-1644 DNA68871-1638 DNA68874-1622 DNA68880-1676 DNA68885-1570 DNA71166-1685 DNA71169-1709 DNA71180-1655 DNA71184-1634 DNA71213-1659 DNA71234-1651 DNA71277-1636 DNA71282-1668
203224 203222 203252
203230 203244 203476 203219
203217
203231
203227 203321
203228
203218 203236 203226
203225 203246
203229 203269 203268 203272
203265
203281
203282 203291 203455
203276
203283 203280
203277 203319
203311 203355 203467 203403
203266
203401
203402 203285
203312
September 15, 1998 September 15, 1998 September 9, 1998 September 15, 1998 September 9, 1998 November 17, 1998 September 15, 1998 September 15, 1998
September 15, 1998 September 15, 1998 October 6, 1998 September 15,1998 September 15, 1998 September 15,1998 September 15, 1998 September 15, 1998 September 9, 1998 September 15, 1998 September 22, 1998 September 22, 1998 September 22, 1998 September 22, 1998 September 22, 1998 September 22, 1998 September 29, 1998 November 3, 1998 September 22, 1998 September 22, 1998 September 22, 1998 September 22, 1998 October 6, 1998 October 6,1998 October 20, 1998 November 17,1998 October 27, 1998 September 22,1998 October 27, 1998 October 27, 1998 September 22, 1998 October 6, 1998
489
2 4 OCT 2003
RFrciwcr\
DNA71286-1604
203357
October 20, 1998
DNA71883-1660
203475
November 17, 1998
DNA73401-1633
203273
September 22, 1998
DNA73492-167I
203324
October 6, 1998
DNA73727-1673
203459
November 3, 1998
DNA73730-1679
203320
October 6, 1998
DNA73734-1680
203363
October 20,1998
DNA73735-1681
203356
October 20, 1998
DNA73736-1657
203466
November 17, 1998
DNA73737-1658
203412
October 27, 1998
DNA73739-1645
203270
September 22, 1998
DNA73742-1662
203316
October 6, 1998
DNA73744-1665
203322
October 6, 1998
DNA73746-1654
203411
October 27, 1998
DNA73760-1672
203314
October 6, 1998
DNA76396-1698
203471
November 17, 1998
DNA76398-1699
203474
November 17, 1998
DNA76399-1700
203472
November 17, 1998
DNA76401-1683
203360
October 20, 1998
DNA76510-2504
203477
November 17, 1998
DNA76522-2500
203469
November 17, 1998
DNA76529-1666
203315
October 6, 1998
DNA76531-1701
203465
November 17, 1998
DNA76532-1702
203473
November 17, 1998
•DNA76541-1675
203409
October 27, 1998
DNA77301-1708
203407
October 27, 1998
DNA77303-2502
203479
November 17, 1998
DNA77648-1688
203408
October 27, 1998
DNA77652-2505
203480
November 17, 1998
DNA83500-2506
203391
October 29, 1998
DNA77568-1626
203134
August 18, 1998
DNA23322-1393
203400
October 27, 1998
'DNA62812-1594
203248
September 9, 1998
DNA66660-1585
203279
September 22, 1998
DNA76393-1664
203323
October 6,1998
These deposit were made under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 40 Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement between Genentech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 45 to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 USC §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular reference to 886 OG 638).
The assignee of the present application has agreed that if a culture of the materials on deposit should die or be lost or destroyed when cultivated under suitable conditions, the materials will be promptly replaced on 50 notification with another of the same. Availability of the deposited material is not to be construed as a license
490
INTELLECTUAL PROPERTY OFFICE OF N.Z
2 4 OCT 2003 received
PCTAJS99/20U1
to practice the invention in contravention of the rights granted under the authority of any government in accordance with its patent laws.
The foregoing written specification is considered to be sufficient to enable one stalled in the art to practice the invention. The present invention is not to be limited in scope by the construct deposited, since the deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 5 that are functionally equivalent are within the scope of this invention. The deposit of material herein does not constitute an admission that the written description herein contained is inadequate to enable the practice of any aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing 10 description and fall within the scope of the appended claims.
491
492
Sequence Listing
<110> Genentech, Inc.
<120> Novel Polypeptides and Nucleic Acids Encoding the Same
<130> P2830R1
<140> PCT/US99/20111 <141> 1999-09-01
<150> US 60/098,750 <151> 1998-09-01
<150> US 60/098,803 <151> 1998-09-02
<150> US 60/098,821 <151> 1998-02-09
<150> US 60/098,843 <151> 1998-02-09
<150> US 60/099,536 <151> 1998-09-09
<150> US 60/099,596 <151> 1998-09-09
<150> US 60/099,598 <151> 1998-09-09
<150> US 60/099,602 <151> 1998-09-09
<150> US 60/099,642 <151> 1998-09-09
<150> US 60/099,741 <151> 1998-10-09
<150> US 60/099,754 <151> 1998-10-09
<150> US 60/099,763 <151> 1998-10-09
<150> US 60/099,792 <151> 1998-10-09
<150> US 60/099,808 <151> 1998-10-09
<150> US 60/099,812 <151> 1998-10-09
<150> US 60/099,815 <151> 1998-10-09
<150> US 60/099,816 <151> 1998-10-09
<150> US 60/100,385 <151> 1998-15-09
<150> US 60/100,388 <151> 1998-15-09
<150> US 60/100,390 <151> 1998-15-09
<150> US 60/100,584 <151> 1998-16-09
<150> US 60/100,627 <151> 1998-16-09
<150> US 60/100,661 <151> 1998-16-09
<150> US 60/100,662 <151> 1998-16-09
<150> US 60/100,664 <151> 1998-16-09
<150> US 60/100,683 <151> 1998-17-09
<150> US 60/100,684 <151> 1998-17-09
<150> US 60/100,710 <151> 1998-17-09
<150> US 60/100,711 <151> 1998-17-09
<150> US 60/100,919 <151> 1998-17-09
<150> US 60/100,930 <151> 1998-17-09
<150> US 60/100,848 <151> 1998-18-09
<150> US 60/100,849 <151> 1998-18-09
<150> US 60/101,014 <151> 1998-18-09
<150> US 60/101,068 <151> 1998-18-09
<150> US 60/101,071 <151> 1998-18-09
<150> US 60/101,279 <151> 1998-22-09
<150> US 60/101,471
<151> 1998-23-09
<150> US 60/101,472 <151> 1998-23-09
<150> US 60/101,474 <151> 1998-23-09
<150> US 60/101,475 <151> 1998-23-09
<150> US 60/101,476 <151> 1998-23-09
<150> US 60/101,477 <151> 1998-23-09
<150> US 60/101,479 <151> 1998-23-09
<150> US 60/101,738 <151> 1998-24-09
<150> US 60/101,741 <151> 1998-24-09
<150> US 60/101,743 <151> 1998-24-09
<150> US 60/101,915 <151> 1998-24-09
<150> US 60/101,916 <151> 1998-24-09
<150> US 60/102,207 <151> 1998-29-09
<150> US 60/102,240 <151> 1998-29-09
<150> US 60/102,307 <151> 1998-29-09
<150> US 60/102,330 <151> 1998-29-09
<150> US 60/102,331 <151> 1998-29-09
<150> US 60/102,484 <151> 1998-30-09
<150> US 60/102,487 <151> 1998-30-09
<150> US 60/102,570 <151> 1998-30-09
<150> US 60/102,571 <151> 1998-30-09
<150> US 60/102,684 <151> 1998-01-10
<150> US 60/102,687 <151> 1998-01-10
<150> US 60/102,965 <151> 1998-02-10
<150> US 60/103,258 <151> 1998-06-10
<150> US 60/103,449 <151> 1998-06-10
<150> US 60/103,314 <151> 1998-07-10
<150> US 60/103,315 <151> 1998-07-10
<150> US 60/103,328 <151> 1998-07-10
<150> US 60/103,395 <151> 1998-07-10
<150> US 60/103,396 <151> 1998-07-10
<150> US 60/103,401 <151> 1998-07-10
<150> US 60/103,633 <151> 1998-08-10
<150> US 60/103,678 <151> 1998-08-10
<150> US 60/103,679 <151> 1998-08-10
<150> US 60/103,711 <151> 1998-08-10
<150> US 60/104,257 <151> 1998-14-10
<150> US 60/104,987 <151> 1998-20-10
<150> US 60/105,000 <151> 1998-20-10
<150> US 60/105,002 <151> 1998-20-10
<150> US 60/105,104 <151> 1998-21-10
<150> US 60/105,169 <151> 1998-22-10
<150> US 60/105,266 <151> 1998-22-10
<150> US 60/105,693 <151> 1998-26-10
<150> US 60/105,694 <151> 1998-26-10
<150> US 60/105,807 <151> 1998-27-10
<150> US 60/105,881 <151> 1998-27-10
<150> US 60/105,882 <151> 1998-27-10
<150> US 60/106,062 <151> 1998-27-10
<150> US 60/106,023 <151> 1998-28-10
<150> US 60/106,029 <151> 1998-28-10
<150> US 60/106,030 <151> 1998-28-10
<150> US 60/106,032 <151> 1998-28-10
<150> US 60/106,033 <151> 1998-28-10
<150> US 60/106,178 <151> 1998-28-10
<150> US 60/106,248 <151> 1998-29-10
<150> US 60/106,384 <151> 1998-29-10
<150> US 60/108,500 <151> 1998-29-10
<150> US 60/106,464 <151> 1998-30-10
<150> US 60/106,856 <151> 1998-03-11
<150> US 60/106,902 <151> 1998-03-11
<150> US 60/106,905 <151> 1998-03-11
<150> US 60/106,919
<151> 1998-03-11
<150> US 60/106,932 <151> 1998-03-11
<150> US 60/106,934 <151> 1998-03-11
<150> US 60/107,783 <151> 1998-10-11
<150> US 60/108,775 <151> 1998-17-11
<150> US 60/108,779 <151> 1998-17-11
<150> US 60/108,787 <151> 1998-17-11
<150> US 60/108,788 <151> 1998-17-11
<150> US 60/108,801 <151> 1998-17-11
<150> US 60/108,802 <151> 1998-17-11
<150> US 60/108,806 <151> 1998-17-11
<150> US 60/108,807 <151> 1998-17-11
<150> US 60/108,867 <151> 1998-17-11
<150> US 60/108,925 <151> 1998-17-11
<150> US 60/108,848 <151> 1998-18-11
<150> US 60/108,849 <151> 1998-18-11
<150> US 60/108,850 <151> 1998-18-11
<150> US 60/108,851 <151> 1998-18-11
<150> US 60/108,852 <151> 1998-18-11
<150> US 60/108,858 <151> 1998-18-11
<150> US 60/108,904 <151> 1998-18-11
498
<160> 432
<210> 1 <211> 43 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 1
tgtaaaacga cggccagtta aatagacctg caattattaa tct 43
<210> 2 <211> 41 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 2
caggaaacag ctatgaccac ctgcacacct gcaaatccat t 41
<210> 3
<211> 1110
<212> DNA
<213> Homo sapiens
<400> 3
ccaatcgccc ggtgcggtgg tgcagggtct cgggctagtc atggcgtccc
50
cgtctcggag actgcagact aaaccagtca ttacttgttt caagagcgtt
100
ctgctaatct acacttttat tttctggatc actggcgtta tccttcttgc
150
agttggcatt tggggcaagg tgagcctgga gaattacttt tctcttttaa
200
atgagaaggc caccaatgtc cccttcgtgc tcattgctac tggtaccgtc
250
attattcttt tgggcacctt tggttgtttt gctacctgcc gagcttctgc
300
atggatgcta aaactgtatg caatgtttct gactctcgtt tttttggtcg
350
aactggtcgc tgccatcgta ggatttgttt tcagacatga gattaagaac
400
agctttaaga ataattatga gaaggctttg aagcagtata actctacagg
450
agattataga agccatgcag tagacaagat ccaaaatacg ttgcattgtt
500
gtggtgtcac cgattataga gattggacag atactaatta ttactcagaa
550
aaaggatttc ctaagagttg ctgtaaactt gaagattgta ctccacagag
600
agatgcagac aaagtaaaca atgaaggttg ttttataaag gtgatgacca
650
ttatagagtc agaaatggga gtcgttgcag gaatttcctt tggagttgct
700
tgcttccaac tgattggaat ctttctcgcc tactgccwct ctcgtgccat
750
aacaaataac cagtatgaga tagtgtaacc caatgtatct gtgggcctat
800
tcctctctac ctttaaggac atttagggtc ccccctgtga attagaaagt
850
499
tgcttggctg gagaactgac aacactactt actgatagac caaaaaacta 900
caccagtagg ttgattcaat caagatgtat gtagacctaa aactacacca 950
ataggctgat tcaatcaaga tccgtgctcg cagtgggctg attcaatcaa 1000
gatgtatgtt tgctatgttc taagtccacc ttctatccca ttcatgttag 1050
atcgttgaaa ccctgtatcc ctctgaaaca ctggaagagc tagtaaattg 1100
taaatgaagt 1110
<210> 4 <211> 245 <212> PRT
<213> Homo sapiens <220>
<221> sig_peptide <222> 1-42
<223> Signal Peptide <220>
<221> TRANSMEM
<222> 19-42, 61-83, 92-114, 209-230 <223> Transmembrane Domains
<220>
<221> misc_feature <222> 69-80, 211-222
<223> Prokaryotic Membrane Lipoprotein Lipid Attachment Site. <220>
<221> misc_feature
<222> 75-81, 78-84, 210-216, 214-220, 226-232 <223> N-Myristoylation Site.
<220>
<221> misc_feature <222> 134-138
<223> N-Glycosylation Site.
<220>
<221> misc_feature <222> 160-168, 160-169
<223> Tyrosine Kinase Phosphorylation Site.
<220>
<221> unsure <222> 233
<223> unknown amino acid <400> 4
Met Ala Ser Pro Ser Arg Arg Leu Gin Thr Lys Pro Val Ile Thr 15 10 15
Cys Phe Lys Ser Val Leu Leu Ile Tyr Thr Phe Ile Phe Trp Ile 20 25 30
Thr Gly Val Ile Leu Leu Ala Val Gly Ile Trp Gly Lys Val Ser 35 40 45
500
Leu Glu Asn Tyr Phe Ser Leu Leu Asn Glu Lys Ala Thr Asn Val 50 55 60
Pro Phe Val Leu Ile Ala Thr Gly Thr Val Ile Ile Leu Leu Gly 65 70 75
Thr Phe Gly Cys Phe Ala Thr Cys Arg Ala Ser Ala Trp Met Leu 80 85 90
Lys Leu Tyr Ala Met Phe Leu Thr Leu Val Phe Leu Val Glu Leu 95 100 105
Val Ala Ala Ile Val Gly Phe Val Phe Arg His Glu Ile Lys Asn 110 115 120
Ser Phe Lys Asn Asn Tyr Glu Lys Ala Leu Lys Gin Tyr Asn Ser 125 130 135
Thr Gly Asp Tyr Arg Ser His Ala Val Asp Lys Ile Gin Asn Thr 140 145 150
Leu His Cys Cys Gly Val Thr Asp Tyr Arg Asp Trp Thr Asp Thr 155 160 165
Asn Tyr Tyr Ser Glu Lys Gly Phe Pro Lys Ser Cys Cys Lys Leu 170 175 180
Glu Asp Cys Thr Pro Gin Arg Asp Ala Asp Lys Val Asn Asn Glu 185 190 195
Gly Cys Phe Ile Lys Val Met Thr Ile Ile Glu Ser Glu Met Gly 200 205 210
Val Val Ala Gly Ile Ser Phe Gly Val Ala Cys Phe Gin Leu Ile 215 220 225
Gly Ile Phe Leu Ala Tyr Cys Xaa Ser Arg Ala Ile Thr Asn Asn 230 235 240
Gin Tyr Glu Ile Val 245
<210> 5
<211> 1218
<212> DNA
<213> Homo sapiens
<400> 5
cccacgcgtc cggcgccgtg gcctcgcgtc catctttgcc gttctctcgg
50
acctgtcaca aaggagtcgc gccgccgccg ccgccccctc cctccggtgg
100
gcccgggagg tagagaaagt cagtgccaca gcccgaccgc gctgctctga
150
gccctgggca cgcggaacgg gagggagtct gagggttggg gacgtctgtg
200
agggagggga acagccgctc gagcctgggg cgggcggacc ggactggggc
250
cggggtaggc tctggaaagg gcccgggaga gaggtggcgt tggtcagaac
300
ctgagaaaca gccgagaggt tttccaccga ggcccgcgct tgagggatct
350
501
gaagaggttc ctagaagagg gtgttccctc tttcgggggt cctcaccaga
400
agaggttctt gggggtcgcc cttctgagga ggctgcggct aacagggccc
450
agaactgcca ttggatgtcc agaatcccct gtagttgata atgttgggaa
500
taagctctgc aactttcttt ggcattcagt tgttaaaaac aaataggatg
550
caaattcctc aactccaggt tatgaaaaca gtacttggaa aactgaaaac
600
tacctaaatg atcgtctttg gttgggccgt gttcttagcg agcagaagcc
650
ttggccaggg tctgttgttg actctcgaag agcacatagc ccacttccta
700
gggactggag gtgccgctac taccatgggt aattcctgta tctgccgaga
750
tgacagtgga acagatgaca gtgttgacac ccaacagcaa caggccgaga
800
acagtgcagt acccactgct gacacaagga gccaaccacg ggaccctgtt
850
cggccaccaa ggaggggccg aggacctcat gagccaagga gaaagaaaca
900
aaatgtggat gggctagtgt tggacacact ggcagtaata cggactcttg
950
tagataagta agtatctgac tcacggtcac ctccagtgga atgaaaagtg
1000
ttctgcccgg aaccatgact ttaggactcc ttcagttcct ttaggacata
1050
ctcgccaagc cttgtgctca cagggcaaag gagaatattt taatgctccg
1100
ctgatggcag agtaaatgat aagatttgat gtttttgctt gctgtcatct
1150
actttgtctg gaaatgtcta aatgtttctg tagcagaaaa cacgataaag
1200
ctatgatctt tattagag 1218
<210> 6 <211> 117 <212> PRT
<213> Homo sapiens <220>
<221> sig_peptide <222> 1-16
<223> Signal Peptide <220>
<221> misc_feature
<222> 18-24, 32-38, 34-40, 35-41, 51-57 <223> N-Myristoylation Site.
<220>
<221> misc_feature
<222> 22-26, 50-54, 113-117
<223> Casein Kinase II Phosphorylation Site.
<400> 6
Met Ile Val Phe Gly Trp Ala Val Phe Leu Ala Ser Arg Ser Leu 15 10 15
Gly Gin Gly Leu Leu Leu Thr Leu Glu Glu His Ile Ala His Phe
502
Leu Gly Thr Gly Gly Ala Ala Thr Thr Met Gly Asn Ser Cys Ile 35 40 - 45
Cys Arg Asp Asp Ser Gly Thr Asp Asp Ser Val Asp Thr Gin Gin 50 55 60
Gin Gin Ala Glu Asn Ser Ala Val Pro Thr Ala Asp Thr Arg Ser 65 70 75
Gin Pro Arg Asp Pro Val Arg Pro Pro Arg Arg Gly Arg Gly Pro 80 85 90
His Glu Pro Arg Arg Lys Lys Gin Asn Val Asp Gly Leu Val Leu 95 100 105
Asp Thr Leu Ala Val Ile Arg Thr Leu Val Asp Lys 110 115
<210> 7
<211> 756
<212> DNA
<213> Homo sapiens
<400> 7
ggcacgaggc gctgtccacc cgggggcgtg ggagtgaggt accagattca
50
gcccatttgg ccccgacgcc tctgttctcg gaatccgggt gctgcggatt
100
gaggtcccgg ttcctaacgg actgcaagat ggaggaaggc gggaacctag
150
gaggcctgat taagatggtc catctactgg tcttgtcagg tgcctggggc
200
atgcaaatgt gggtgacctt cgtctcaggc ttcctgcttt tccgaagcct
250
tccccgacat accttcggac tagtgcagag caaactcttc cccttctact
300
tccacatctc catgggctgt gccttcatca acctctgcat cttggcttca
350
cagcatgctt gggctcagct cacattctgg gaggccagcc agctttacct
400
gctgttcctg agccttacgc tggccactgt caacgcccgc tggctggaac
450
cccgcaccac agctgccatg tgggccctgc aaaccgtgga gaaggagcga
500
ggcctgggtg gggaggtacc aggcagccac cagggtcccg atccctaccg
550
ccagctgcga gagaaggacc ccaagtacag tgctctccgc cagaatttct
600
tccgctacca tgggctgtcc tctctttgca atctgggctg cgtcctgagc
650
aatgggctct gtctcgctgg ccttgccctg gaaataagga gcctctagca
700
tgggccctgc atgctaataa atgcttcttc agaaatgaaa aaaaaaaaaa
750
aaaaaa 756
<210> 8 <211> 189 <212> PRT
<213> Homo sapiens
503
<220>
<221> sig_peptide <222> 1-24
<223> Signal Peptide <220>
<221> misc_feature
<222> 4-10, 5-11, 47-53, 170-176, 176-182 <223> N-Myristoylation Site.
<220>
<221> misc_feature <222> 44-85
<223> G-protein Coupled Receptors Proteins.
<220>
<221> misc_feature <222> 54-65
<223> Prokaryotic Mmembrane Lipoprotein Lipid Attachment Site. <220>
<221> misc_feature <222> 82-86
<223> Casein Kinase II Phosphorylation Site.
<220>
<221> TRANSMEM
<222> 86-103, 60-75
<223> Transmembrane Domain
<220>
<221> misc_feature <222> 144-151
<223> Tyrosine Kinase Phosphorylation Site.
<400> 8
Met Glu Glu Gly Gly Asn Leu Gly Gly Leu Ile Lys Met Val His 15 10 15
Leu Leu Val Leu Ser Gly Ala Trp Gly Met Gin Met Trp Val Thr 20 25 30
Phe Val Ser Gly Phe Leu Leu Phe Arg Ser Leu Pro Arg His Thr 35 40 45
Phe Gly Leu Val Gin Ser Lys Leu Phe Pro Phe Tyr Phe His Ile 50 55 60
Ser Met Gly Cys Ala Phe Ile Asn Leu Cys Ile Leu Ala Ser Gin 65 70 75
His Ala Trp Ala Gin Leu Thr Phe Trp Glu Ala Ser Gin Leu Tyr 80 85 90
Leu Leu Phe Leu Ser Leu Thr Leu Ala Thr Val Asn Ala Arg Trp 95 100 105
Leu Glu Pro Arg Thr Thr Ala Ala Met Trp Ala Leu Gin Thr Val 110 115 120
Glu Lys Glu Arg Gly Leu Gly Gly Glu Val Pro Gly Ser His Gin 125 130 135
504
Gly Pro Asp Pro Tyr Arg Gin Leu Arg Glu Lys Asp Pro Lys Tyr 140 145 150
Ser Ala Leu Arg Gin Asn Phe Phe Arg Tyr His Gly Leu Ser Ser 155 160 165
Leu Cys Asn Leu Gly Cys Val Leu Ser Asn Gly Leu Cys Leu Ala 170 175 180
Gly Leu Ala Leu Glu Ile Arg Ser Leu 185
<210> 9
<211> 1508
<212> DNA
<213> Homo sapiens
<400> 9
aattcagatt ttaagcccat tctgcagtgg aatttcatga actagcaaga
50
ggacaccatc ttcttgtatt atacaagaaa ggagtgtacc tatcacacac
100
agggggaaaa atgctctttt gggtgctagg cctcctaatc ctctgtggtt
150
ttctgtggac tcgtaaagga aaactaaaga ttgaagacat cactgataag
200
tacattttta tcactggatg tgactcgggc tttggaaact tggcagccag
250
aacttttgat aaaaagggat ttcatgtaat cgctgcctgt ctgactgaat
300
caggatcaac agctttaaag gcagaaacct cagagagact tcgtactgtg
350
cttctggatg tgaccgaccc agagaatgtc aagaggactg cccagtgggt
400
gaagaaccaa gttggggaga aaggtctctg gggtctgatc aataatgctg
450
gtgttcccgg cgtgctggct cccactgact ggctgacact agaggactac
500
agagaaccta ttgaagtgaa cctgtttgga ctcatcagtg tgacactaaa
550
tatgcttcct ttggtcaaga aagctcaagg gagagttatt aatgtctcca
600
gtgttggagg tcgccttgca atcgttggag ggggctatac tccatccaaa
650
tatgcagtgg aaggtttcaa tgacagctta agacgggaca tgaaagcttt
700
tggtgtgcac gtctcatgca ttgaaccagg attgttcaaa acaaacttgg
750
cagatccagt aaaggtaatt gaaaaaaaac tcgccatttg ggagcagctg
800
tctccagaca tcaaacaaca atatggagaa ggttacattg aaaaaagtct
850
agacaaactg aaaggcaata aatcctatgt gaacatggac ctctctccgg
900
tggtagagtg catggaccac gctctaacaa gtctcttccc taagactcat
950
tatgccgctg gaaaagatgc caaaattttc tggatacctc tgtctcacat
1000
gccagcagct ttgcaagact ttttattgtt gaaacagaaa gcagagctgg
1050
ctaatcccaa ggcagtgtga ctcagctaac cacaaatgtc tcctccaggc
1100
505
tatgaaattg gccgatttca agaacacatc tccttttcaa ccccattcct 1150
tatctgctcc aacctggact catttagatc gtgcttattt ggattgcaaa 12 00
agggagtccc accatcgctg gtggtatccc agggtccctg ctcaagtttt 1250
ctttgaaaag gagggctgga atggtacatc acataggcaa gtcctgccct 1300
gtatttaggc tttgcctgct tggtgtgatg taagggaaat tgaaagactt 1350
gcccattcaa aatgatcttt accgtggcct gccccatgct tatggtcccc 1400
agcatttaca gtaacttgtg aatgttaagt atcatctctt atctaaatat 1450 taaaagataa gtcaacccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1500
aaaaaaaa 1508
<210> 10 <211> 319 <212> PRT
<213> Homo sapiens <220>
<221> sig_peptide <222> 1-17
<223> Signal Peptide <220>
<221> misc_feature
<222> 36-47, 108-113, 166-171,198-203, 207-212 <223> N-myristoylation Sites.
<220>
<221> misc_feature <222> 39-42
<223> Glycosaminoglycan Attachment Site.
<220>
<221> TRANSMEM <222> 136-152
<223> Transmembrane Domain <220>
<221> misc_feature <222> 161-163, 187-190 and 253-256 <223> N-glycosylation Sites.
<400> 10
Met Leu Phe Trp Val Leu Gly Leu Leu 1 5
Trp Thr Arg Lys Gly Lys Leu Lys Ile 20
Tyr Ile Phe Ile Thr Gly Cys Asp Ser 35
Ala Arg Thr Phe Asp Lys Lys Gly Phe 50
Leu Thr Glu Ser Gly Ser Thr Ala Leu 65
Ile Leu Cys Gly Phe Leu 10 15
Glu Asp Ile Thr Asp Lys 25 30
Gly Phe Gly Asn Leu Ala 40 45
His Val lie Ala Ala Cys 55 60
Lys Ala Glu Thr Ser Glu 70 75
506
Arg Leu Arg Thr Val Leu Leu Asp Val Thr Asp Pro Glu Asn Val 80 85 90
Lys Arg Thr Ala Gin Trp Val Lys Asn Gin Val Gly Glu Lys Gly 95 100 105
Leu Trp Gly Leu Ile Asn Asn Ala Gly Val Pro Gly Val Leu Ala 110 115 120
Pro Thr Asp Trp Leu Thr Leu Glu Asp Tyr Arg Glu Pro Ile Glu 125 130 135
Val Asn Leu Phe Gly Leu Ile Ser Val Thr Leu Asn Met Leu Pro 140 145 150
Leu Val Lys Lys Ala Gin Gly Arg Val Ile Asn Val Ser Ser Val 155 160 165
Gly Gly Arg Leu Ala Ile Val Gly Gly Gly Tyr Thr Pro Ser Lys 170 175 180
Tyr Ala Val Glu Gly Phe Asn Asp Ser Leu Arg Arg Asp Met Lys 185 190 195
Ala Phe Gly Val His Val Ser Cys Ile Glu Pro Gly Leu Phe Lys 200 205 210
Thr Asn Leu Ala Asp Pro Val Lys Val Ile Glu Lys Lys Leu Ala 215 220 225
Ile Trp Glu Gin Leu Ser Pro Asp Ile Lys Gin Gin Tyr Gly Glu 230 235 240
Gly Tyr Ile Glu Lys Ser Leu Asp Lys Leu Lys Gly Asn Lys Ser 245 250 255
Tyr Val Asn Met Asp Leu Ser Pro Val Val Glu Cys Met Asp His 260 265 270
Ala Leu Thr Ser Leu Phe Pro Lys Thr His Tyr Ala Ala Gly Lys 275 280 285
Asp Ala Lys Ile Phe Trp Ile Pro Leu Ser His Met Pro Ala Ala 290 295 300
Leu Gin Asp Phe Leu Leu Leu Lys Gin Lys Ala Glu Leu Ala Asn 305 310 315
Pro Lys Ala Val
<210> 11
<211> 2720
<212> DNA
<213> Homo sapines
<400> 11
gcgggctgtt gacggcgctg cgatggctgc ctgcgagggc aggagaagcg 50 gagctctcgg ttcctctcag tcggacttcc tgacgccgcc agtgggcggg 100
507
gccccttggg ccgtcgccac cactgtagtc atgtacccac cgccgccgcc
150
gccgcctcat cgggacttca tctcggtgac gctgagcttt ggcgagagct
200
atgacaacag caagagttgg cggcggcgct cgtgctggag gaaatggaag
250
caactgtcga gattgcagcg gaatatgatt ctcttcctcc ttgcctttct
300
gcttttctgt ggactcctct tctacatcaa cttggctgac cattggaaag
350
ctctggcttt caggctagag gaagagcaga agatgaggcc agaaattgct
400
gggttaaaac cagcaaatcc acccgtctta ccagctcctc agaaggcgga
450
caccgaccct acatccagcg gagaacttac gggaccacct ctgagatttc cacctgcaga gtcacagaag ttagaccccc acacaaagac aagccaagac
500 550
ctgaaggatg ggacccagga ggaggccaca aaaaggcaag aagcccctgt
600
ggatccccgc ccggaaggag atccgcagag gacagtcatc agctggaggg
650
gagcggtgat cgagcctgag cagggcaccg agctcccttc aagaagagca
700
gaagtgccca ccaagcctcc cctgccaccg gccaggacac agggcacacc
750
agtgcatctg aactatcgcc agaagggcgt gattgacgtc ttcctgcatg
800
catggaaagg ataccgcaag tttgcatggg gccatgacga gctgaagcct
850
gtgtccaggt ccttcagtga gtggtttggc ctcggtctca cactgatcga
900
cgcgctggac accatgtgga tcttgggtct gaggaaagaa tttgaggaag
950
ccaggaagtg ggtgtcgaag aagttacact ttgaaaagga cgtggacgtc
1000
aacctgtttg agagcacgat ccgcatcctg ggggggctcc tgagtgccta
1050
ccacctgtct ggggacagcc tcttcctgag gaaagctgag gattttggaa
1100
atcggctaat gcctgccttc agaacaccat ccaagattcc ttactcggat
1150
gtgaacatcg gtactggagt tgcccacccg ccacggtgga cctccgacag
1200
cactgtggcc gaggtgacca gcattcagct ggagttccgg gagctctccc
1250
gtctcacagg ggataagaag tttcaggagg cagtggagaa ggtgacacag
1300
cacatccacg gcctgtctgg gaagaaggat gggctggtgc ccatgttcat
1350
caatacccac agtggcctct tcacccacct gggcgtattc acgctgggcg
1400
ccagggccga cagctactat gagtacctgc tgaagcagtg gatccagggc
1450
gggaagcagg agacacagct gctggaagac tacgtggaag ccatcgaggg
1500
tgtcagaacg cacctgctgc ggcactccga gcccagtaag ctcacctttg
1550
tgggggagct tgcccacggc cgcttcagtg ccaagatgga ccacctggtg
1600
tgcttcctgc cagggacgct ggctctgggc gtctaccacg gcctgcccgc
1650
cagccacatg gagctggccc aggagctcat ggagacttgt taccagatga
1700
508
accggcagat ggagacgggg ctgagtcccg agatcgtgca cttcaacctt
1750
tacccccagc cgggccgtcg ggacgtggag gtcaagccag cagacaggca
1800
caacctgctg cggccagaga ccgtggagag cctgttctac ctgtaccgcg
1850
tcacagggga ccgcaaatac caggactggg gctgggagat tctgcagagc
1900
ttcagccgat tcacacgggt cccctcgggt ggctattctt ccatcaacaa
1950
tgtccaggat cctcagaagc ccgagcctag ggacaagatg gagagcttct
2000
tcctggggga gacgctcaag tatctgttct tgctcttctc cgatgaccca
2050
aacctgctca gcctatctgg gcctggacgc acccctgcct ctacgtgttc agggtggatg aacaccgaag gctgctggtg cccaccctct tggggacttc
2100 2150
gggtgggcag aggcaccttg ctgggtctgt ggcattttcc aagggcccac
2200
gtagcaccgg caaccgccaa gtggcccagg ctctgaactg gctctgggct
2250
cctcctcgtc tctgctttaa tcaggacacc gtgaggacaa gtgaggccgt
2300
cagtcttggt gtgatgcggg gtgggctggg ccgctggagc ctccgcctgc
2350
ttcctccaga agacacgaat catgactcac gattgctgaa gcctgagcag
2400
gtctctgtgg gccgaccaga ggggggcttc gaggtggtcc ctggtactgg
2450
ggtgaccgag tggacagccc agggtgcagc tctgcccggg ctcgtgaagc
2500
ctcagatgtc cccaatccaa gggtctggag gggctgccgt gactccagag
2550
gcctgaggct ccagggctgg ctctggtgtt tacaagctgg actcagggat
2600
cctcctggcc gccccgcagg gggcttggag ggctggacgg caagtccgtc
2650
tagctcacgg gcccctccag tggaatgggt cttttcggtg gagataaaag
2700
ttgatttgct ctaaccgcaa
2720
<210> 12 <211> 699 <212> PRT
<213> Homo sapiens <220>
<221> TRANSMEM
<222> 21-40 and 84-105
<223> Transmembrane Domain (type II)
<400> 12
Met Ala Ala Cys Glu Gly Arg Arg Ser Gly Ala Leu Gly Ser Ser 15 10 15
Gin Ser Asp Phe Leu Thr Pro Pro Val Gly Gly Ala Pro Trp Ala 20 25 30
Val Ala Thr Thr Val Val Met Tyr Pro Pro Pro Pro Pro Pro Pro 35 40 45
509
His Arg Asp Phe Ile Ser Val Thr Leu Ser Phe Gly Glu Ser Tyr 50 55 60
Asp Asn Ser Lys Ser Trp Arg Arg Arg Ser Cys Trp Arg Lys Trp 65 70 75
Lys Gin Leu Ser Arg Leu Gin Arg Asn Met Ile Leu Phe Leu Leu 80 85 90
Ala Phe Leu Leu Phe Cys Gly Leu Leu Phe Tyr Ile Asn Leu Ala 95 100 105
Asp His Trp Lys Ala Leu Ala Phe Arg Leu Glu Glu Glu Gin Lys 110 115 120
Met Arg Pro Glu Ile Ala Gly Leu Lys Pro Ala Asn Pro Pro Val
125 130 135
Leu Pro Ala Pro Gin Lys Ala Asp Thr Asp Pro Glu Asn Leu Pro
140 145 150
Glu Ile Ser Ser Gin Lys Thr Gin Arg His Ile Gin Arg Gly Pro 155 160 165
Pro His Leu Gin Ile Arg Pro Pro Ser Gin Asp Leu Lys Asp Gly 170 175 180
Thr Gin Glu Glu Ala Thr Lys Arg Gin Glu Ala Pro Val Asp Pro 185 ' 190 195
Arg Pro Glu Gly Asp Pro Gin Arg Thr Val Ile Ser Trp Arg Gly 200 205 210
Ala Val Ile Glu Pro Glu Gin Gly Thr Glu Leu Pro Ser Arg Arg 215 220 225
Ala Glu Val Pro Thr Lys Pro Pro Leu Pro Pro Ala Arg Thr Gin 230 235 240
Gly Thr Pro Val His Leu Asn Tyr Arg Gin Lys Gly Val Ile Asp 245 250 255
Val Phe Leu His Ala Trp Lys Gly Tyr Arg Lys Phe Ala Trp Gly 260 265 270
His Asp Glu Leu Lys Pro Val Ser Arg Ser Phe Ser Glu Trp Phe 275 280 285
Gly Leu Gly Leu Thr Leu Ile Asp Ala Leu Asp Thr Met Trp Ile 290 295 300
Leu Gly Leu Arg Lys Glu Phe Glu Glu Ala Arg Lys Trp Val Ser 305 310 315
Lys Lys Leu His Phe Glu Lys Asp Val Asp Val Asn Leu Phe Glu 320 325 330
Ser Thr Ile Arg Ile Leu Gly Gly Leu Leu Ser Ala Tyr His Leu 335 340 345
Ser Gly Asp Ser Leu Phe Leu Arg Lys Ala Glu Asp Phe Gly Asn 350 355 360
510
Arg Leu Met Pro Ala Phe Arg Thr Pro Ser Lys Ile Pro Tyr Ser 365 370 375
Asp Val Asn Ile Gly Thr Gly Val Ala His Pro Pro Arg Trp Thr 380 385 390
Ser Asp Ser Thr Val Ala Glu Val Thr Ser Ile Gin Leu Glu Phe 395 400 405
Arg Glu Leu Ser Arg Leu Thr Gly Asp Lys Lys Phe Gin Glu Ala 410 415 420
Val Glu Lys Val Thr Gin His Ile His Gly Leu Ser Gly Lys Lys 425 430 435
Asp Gly Leu Val Pro Met Phe Ile Asn Thr His Ser Gly Leu Phe 440 445 450
Thr His Leu Gly Val Phe Thr Leu Gly Ala Arg Ala Asp Ser Tyr 455 460 465
Tyr Glu Tyr Leu Leu Lys Gin Trp Ile Gin Gly Gly Lys Gin Glu 470 475 480
Thr Gin Leu Leu Glu Asp Tyr Val Glu Ala Ile Glu Gly Val Arg 485 490 495
Thr His Leu Leu Arg His Ser Glu Pro Ser Lys Leu Thr Phe Val 500 505 510
Gly Glu Leu Ala His Gly Arg Phe Ser Ala Lys Met Asp His Leu 515 520 525
Val Cys Phe Leu Pro Gly Thr Leu Ala Leu Gly Val Tyr His Gly 530 535 540
Leu Pro Ala Ser His Met Glu Leu Ala Gin Glu Leu Met Glu Thr 545 550 555
Cys Tyr Gin Met Asn Arg Gin Met Glu Thr Gly Leu Ser Pro Glu 560 565 570
Ile Val His Phe Asn Leu Tyr Pro Gin Pro Gly Arg Arg Asp Val 575 580 585
Glu Val Lys Pro Ala Asp Arg His Asn Leu Leu Arg Pro Glu Thr 590 595 600
Val Glu Ser Leu Phe Tyr Leu Tyr Arg Val Thr Gly Asp Arg Lys 605 610 615
Tyr Gin Asp Trp Gly Trp Glu Ile Leu Gin Ser Phe Ser Arg Phe 620 625 630
Thr Arg Val Pro Ser Gly Gly Tyr Ser Ser Ile Asn Asn Val Gin 635 640 645
Asp Pro Gin Lys Pro Glu Pro Arg Asp Lys Met Glu Ser Phe Phe 650 655 660
Leu Gly Glu Thr Leu Lys Tyr Leu Phe Leu Leu Phe Ser Asp Asp 665 670 675
511
Pro Asn Leu Leu Ser Leu Asp Ala Tyr Val Phe Asn Thr Glu Ala 680 685 690
His Pro Leu Pro Ile Trp Thr Pro Ala 695
<210> 13 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 13 cgccagaagg gcgtgattga cgtc 24
<210> 14 <211> 24 <212> DNA
<213> Artificial sequence <220>
<223> Synthetic oligonucleotide probe
<400> 14 ccatccttct tcccagacag gccg 24
<210> 15 <211> 44 <212> DNA
<213> Artificial sequence <220>
<223> Synthetic oligonucleotide probe
<400> 15 gaagcctgtg tccaggtcct
<210> 16 <211> 1524 <212> DNA <213> Homo sapiens
<400> 16 ggcgccgcgt aggcccggga cccatgcgcc gccgcctctc gcgcagctgc cctgggagga ccctcggaag tgttccgtct gcttcttctc cctactctgg cgggcagtca ggggacaagg ccccccagag ccgccccctg cccaccgcct ggcagtgctg tcagtgagtg gtttggcctc ggtc 44
ggccgggccg gccgggctgc gagcgcctgc 50 cgcacgatgt tcccctcgcg gaggaaagcg 100 cggcaggtcc gggttgctct ccggcggcct 150 tccacctgtt cgtggcctgc ctctcgctgg 2 00 ctgcagctca gctgctctgg ggacgtggcc 2 50 gcaggagacc tcgggccctc cccgtgcctg 300 agcactggga agaagacgca tcctggggcc 350 gtgcccttcc gcgaacgctt cgaggagctc 400
512
ctggtcttcg tgccccacat gcgccgcttc ctgagcagga agaagatccg
450
gcaccacatc tacgtgctca accaggtgga ccacttcagg ttcaaccggg
500
cagcgctcat caacgtgggc ttcctggaga gcagcaacag cacggactac
550
attgccatgc acgacgttga cctgctccct ctcaacgagg agctggacta
600
tggctttcct gaggctgggc ccttccacgt ggcctccccg gagctccacc
650
ctctctacca ctacaagacc tatgtcggcg gcatcctgct gctctccaag
700
cagcactacc ggctgtgcaa tgggatgtcc aaccgcttct ggggctgggg
750
ccgcgaggac gacgagttct accggcgcat taagggagct gggctccagc
800
ttttccgccc ctcgggaatc acaactgggt acaagacatt tcgccacctg
850
catgacccag cctggcggaa gagggaccag aagcgcatcg cagctcaaaa
900
acaggagcag ttcaaggtgg acagggaggg aggcctgaac actgtgaagt
950
accatgtggc ttcccgcact gccctgtctg tgggcggggc cccctgcact
1000
gtcctcaaca tcatgttgga ctgtgacaag accgccacac cctggtgcac
1050
attcagctga gctggatgga cagtgaggaa gcctgtacct acaggccata
1100
ttgctcaggc tcaggacaag gcctcaggtc gtgggcccag ctctgacagg
1150
atgtggagtg gccaggacca agacagcaag ctacgcaatt gcagccaccc
1200
ggccgccaag gcaggcttgg gctgggccag gacacgtggg gtgcctggga
1250
cgctgcttgc catgcacagt gatcagagag aggctggggt gtgtcctgtc
1300
cgggaccccc cctgccttcc tgctcaccct actctgacct ccttcacgtg
1350
cccaggcctg tgggtagtgg ggagggctga acaggacaac ctctcatcac
1400
cctactctga cctccttcac gtgcccaggc ctgtgggtag tggggagggc
1450
tgaacaggac aacctctcat cacccccaaa aaaaaaaaaa aaaaaaaaaa
1500
aaaaaaaaaa aaaaaaaaaa aaaa 1524
<210> 17 <211> 327 <212> PRT
<213> Homo sapiens <220>
<221> sig_peptide <222> 1-42
<223> Signal peptide.
<220>
<221> misc_feature
<222> 19-25,65-71,247-253,285-291,303-310 <223> N-myristoylation site.
<220>
513
<221> misc_feature <222> 27-31
<223> cAMP- and cGMP-dependent protein kinase phosphorylation site. <220>
<221> TRANSMEM <222> 29-49
<223> Transmembrane domain (type II).
<220>
<221> misc_feature <222> 154-158
<223> N-glycosylation site.
<220>
<221> misc_feature <222> 226-233
<223> Tyrosine kinase phosphorylation site.
<400> 17
Met Phe Pro Ser Arg Arg Lys Ala Ala Gin Leu Pro Trp Glu Asp 15 10 15
Gly Arg Ser Gly Leu Leu Ser Gly Gly Leu Pro Arg Lys Cys Ser 20 25 30
Val Phe His Leu Phe Val Ala Cys Leu Ser Leu Gly Phe Phe Ser 35 40 45
Leu Leu Trp Leu Gin Leu Ser Cys Ser Gly Asp Val Ala Arg Ala 50 55 60
Val Arg Gly Gin Gly Gin Glu Thr Ser Gly Pro Pro Arg Ala Cys 65 70 75
Pro Pro Glu Pro Pro Pro Glu His Trp Glu Glu Asp Ala Ser Trp 80 85 90
Gly Pro Glu Glu Arg Lys His Phe
His Arg Leu Ala Val Leu Val Pro Phe Arg Glu Arg 95 100
Leu Leu Val Phe Val Pro His Met Arg Arg Phe Leu 110 115
Lys Ile Arg His His Ile Tyr Val Leu Asn Gin Val 125 130
Arg Phe Asn Arg Ala Ala Leu Ile Asn Val 140 145
Glu Ser Ser Asn Ser Thr Asp Tyr Ile Ala Met His 155 160
Gly Asp
Phe
Val
Phe 105
Ser 120
Asp 135
Leu 150
Asp 165
Leu Leu Pro Leu Asn Glu Glu Leu Asp Tyr Gly Phe Pro Glu Ala
170 175 180
Gly Pro Phe His Val Ala Ser Pro Glu Leu His Pro Leu Tyr His
185 190 195
Tyr Lys Thr Tyr Val Gly Gly Ile Leu Leu Leu Ser Lys Gin His
Tyr Arg Leu
Arg Glu Asp
Gin Leu Phe
Arg His Leu
Ile Ala Ala
Gly Leu Asn
Ser Val Gly Cys Asp Lys
<210> 18 <211> 23 <212> DNA <213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 18 gcgaacgctt cgaggagtcc tgg 23
<210> 19 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 19 gcagtgcggg aagccacatg gtac 24
<210> 20 <211> 46 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 20
cttcctgagc aggaagaaga tccggcacca catctacgtg ctcaac 46
<210> 21
<211> 494
<212> DNA
<213> Homo sapiens
514
200 205 210
Cys Asn Gly Met Ser Asn Arg Phe Trp Gly Trp Gly 215 220 225
Asp Glu Phe Tyr Arg Arg Ile Lys Gly Ala Gly Leu 230 235 240
Arg Pro Ser Gly Ile Thr Thr Gly Tyr Lys Thr Phe 245 250 255
His Asp Pro Ala Trp Arg Lys Arg Asp Gin Lys Arg 260 265 270
Gin Lys Gin Glu Gin Phe Lys Val Asp Arg Glu Gly 275 280 285
Thr Val Lys Tyr His Val Ala Ser Arg Thr Ala Leu 290 295 300
Gly Ala Pro Cys Thr Val Leu Asn Ile Met Leu Asp 305 310 315
Thr Ala Thr Pro Trp Cys Thr Phe Ser 320 325
515
<400> 21
caatgtttgc ctatccacct cccccaagcc cctttaccta tgctgctgct
50
aacgctgctg ctgctgctgc tgctgcttaa aggctcatgc ttggagtggg
100
gactggtcgg tgcccagaaa gtctcttctg ccactgacgc ccccatcagg
150
gattgggcct tctttccccc ttcctttctg tgtctcctgc ctcatcggcc
200
tgccatgacc tgcagccaag cccagccccg tggggaaggg gagaaagtgg
250
gggatggcta agaaagctgg gagataggga acagaagagg gtagtgggtg
300
ggctaggggg gctgccttat ttaaagtggt tgtttatgat tcttatacta
350
atttatacaa agatattaag gccctgttca ttaagaaatt gttcccttcc
400
cctgtgttca atgtttgtaa agattgttct gtgtaaatat gtctttataa
450
taaacagtta aaagctgaaa aaaaaaaaaa aaaaaaaaaa aaaa 494
<210> 22 <211> 73 <212> PRT
<213> Homo sapiens <220>
<221> sig_peptide <222> 1-15
<223> Signal peptide.
<220>
<221> misc_feature <222> 3-18
<223> Growth factor and cytokines receptors family.
<400> 22
Met Leu Leu Leu Thr Leu Leu Leu Leu Leu Leu Leu Leu Lys Gly 15 10 15
Ser Cys Leu Glu Trp Gly Leu Val Gly Ala Gin Lys Val Ser Ser 20 25 30
Ala Thr Asp Ala Pro Ile Arg Asp Trp Ala Phe Phe Pro Pro Ser 35 40 45
Phe Leu Cys Leu Leu Pro His Arg Pro Ala Met Thr Cys Ser Gin 50 55 60
Ala Gin Pro Arg Gly Glu Gly Glu Lys Val Gly Asp Gly 65 70
<210> 23
<211> 2883
<212> DNA
<213> Homo sapiens
<400> 23
gggacccatg cggccgtgac ccccggctcc ctagaggccc agcgcagccg 50
cagcggacaa aggagcatgt ccgcgccggg gaaggcccgt cctccggccg 100
516
ccataaggct ccggtcgccg ctgggcccgc gccgcgctcc tgcccgcccg
150
ggctccgggg cggcccgcta ggccagtgcg ccgccgctcg ccccgcaggc
200
cccggcccgc agcatggagc cacccggacg ccggcggggc cgcgcgcagc
250
cgccgctgtt gctgccgctc tcgctgttag cgctgctcgc gctgctggga
300
ggcggcggcg gcggcggcgc cgcggcgctg cccgccggct gcaagcacga
350
tgggcggccc cgaggggctg gcagggcggc gggcgccgcc gagggcaagg
400
tggtgtgcag cagcctggaa ctcgcgcagg tcctgccccc agatactctg
450
cccaaccgca cggtcaccct gattctgagt aacaataaga tatccgagct
500
gaagaatggc tcattttctg ggttaagtct ccttgaaaga ttggacctcc
550
gaaacaatct tattagtagt atagatccag gtgccttctg gggactgtca
600
tctctaaaaa gattggatct gacaaacaat cgaataggat gtctgaatgc
650
agacatattt atttgttttc cgaggactca ttcattatct ccaatctggt caaggaactt tcggctaaac ttgattatct ctttcgggga tgcgtcatta
700 750
cggtctttgg aattccagac tgagtatctt ttgtgtgact gtaacatact
800
gtggatgcat cgctgggtaa aggagaagaa catcacggta cgggatacca
850
ggtgtgttta tcctaagtca ctgcaggccc aaccagtcac aggcgtgaag
900
caggagctgt tgacatgcga ccctccgctt gaattgccgt ctttctacat
950
gactccatct catcgccaag ttgtgtttga aggagacagc cttcctttcc
1000
agtgcatggc ttcatatatt gatcaggaca tgcaagtgtt gtggtatcag
1050
gatgggagaa tagttgaaac cgatgaatcg caaggtattt ttgttgaaaa
1100
gaacatgatt cacaactgct ccttgattgc aagtgcccta accatttcta
1150
atattcaggc tggatctact ggaaattggg gctgtcatgt ccagaccaaa
1200
cgtgggaata atacgaggac tgtggatatt gtggtattag agagttctgc
1250
acagtactgt cctccagaga gggtggtaaa caacaaaggt gacttcagat
1300
ggcccagaac attggcaggc attactgcat atctgcagtg tacgcggaac
1350
acccatggca gtgggatata tcccggaaac ccacaggatg agagaaaagc
1400
ttggcgcaga tgtgatagag gtggcttttg ggcagatgat gattattctc
1450
gctgtcagta tgcaaatgat gtcactagag ttctttatat gtttaatcag
1500
atgcccctca atcttaccaa tgccgtggca acagctcgac agttactggc
1550
ttacactgtg gaagcagcca acttttctga caaaatggat gttatatttg
1600
tggcagaaat gattgaaaaa tttggaagat ttaccaagga ggaaaaatca
1650
517
aaagagctag gtgacgtgat ggttgacatt gcaagtaaca tcatgttggc
1700
tgatgaacgt gtcctgtggc tggcgcagag ggaagctaaa gcctgcagta
1750
ggattgtgca gtgtcttcag cgcattgcta cctaccggct agccggtgga
1800
gctcacgttt attcaacata ttcacccaat attgctctgg aagcttatgt
1850
catcaagtct actggcttca cggggatgac ctgtaccgtg ttccagaaag
1900
tggcagcctc tgatcgtaca ggactttcgg attatgggag gcgggatcca
1950
gagggaaacc tggataagca gctgagcttt aagtgcaatg tttcaaatac
2000
attttcgagt ctggcactaa aggtatgtta cattctgcaa tcatttaaga
2050
ctatttacag ttaaattaga atgctccaaa tgttctgctt cgcaaaataa
2100
ccttattaaa agattttttt ttgcaggaag ataggtatta ttgcttttgc
2150
tactgtttta aagaaaacta accaggaaga actgcattac gactttcaag
2200
ggccctaggc atttttgcct ttgattccct ttcttcacat aaaaatatca
2250
gaaattacat acatgtgaaa tttataactg aaattttatt cagtggtata tgacttaaaa aatgcaaata gtttatttat tactattgtt ttgttttttt
2300 2350
gctcctgatt ttaagacaat aagatgtttt catgggcccc taaaagtatc
2400
atgagccttt ggcactgcgc ctgccaagcc tagtggagaa gtcaaccctg
2450
agaccaggtg tttaatcaag caagctgtat atcaaaattt ttggcagaaa
2500
acacaaatat gtcatatatc tttttttaaa aaaagtattt cattgaagca
2550
agcaaaatga aagcattttt actgattttt aaaattggtg ctttagatat
2600
atttgactac actgtattga agcaaataga ggaggcacaa ctccagcacc
2650
ctaatggaac cacatttttt tcacttagct ttctgtgggc atgtgtaatt
2700
gtattctctg cggtttttaa tctcacagta ctttatttct gtcttgtccc
2750
tcaataatat cacaaacaat attccagtca ttttaatggc tgcataataa
2800
ctgatccaac aggtgttagg tgttctggtt tagtgtgagc actcaataaa
2850
tattgaatga atgaacgaaa aaaaaaaaaa aaa 2883
<210> 24 <211> 616 <212> PRT
<213> Homo sapiens <220>
<221> sig_peptide <222> 1-33
<223> Signal peptide. <220>
<221> TRANSMEM <222> 13-40
518
<223> Transmembrane domain (type II).
<400> 24
Met Glu Pro Pro Gly Arg Arg Arg Gly Arg Ala Gin Pro Pro Leu 15 10 15
Leu Leu Pro Leu Ser Leu Leu Ala Leu Leu Ala Leu Leu Gly Gly 20 25 30
Gly Gly Gly Gly Gly Ala Ala Ala Leu Pro Ala Gly Cys Lys His 35 40 45
Asp Gly Arg Pro Arg Gly Ala Gly Arg Ala Ala Gly Ala Ala Glu 50 55 60
Gly Lys Val Val Cys Ser Ser Leu Glu Leu Ala Gin Val Leu Pro 65 70 75
Pro Asp Thr Leu Pro Asn Arg Thr Val Thr Leu Ile Leu Ser Asn 80 85 90
Asn Lys Ile Ser Glu Leu Lys Asn Gly Ser Phe Ser Gly Leu Ser 95 100 105
Leu Leu Glu Arg Leu Asp Leu Arg Asn Asn Leu Ile Ser Ser Ile 110 115 120
Asp Pro Gly Ala Phe Trp Gly Leu Ser Ser Leu Lys Arg Leu Asp 125 130 135
Leu Thr Asn Asn Arg Ile Gly Cys Leu Asn Ala Asp Ile Phe Arg 140 145 150
Gly Leu Thr Asn Leu Val Arg Leu Asn Leu Ser Gly Asn Leu Phe 155 160 165
Ser Ser Leu Ser Gin Gly Thr Phe Asp Tyr Leu Ala Ser Leu Arg 170 175 180
Ser Leu Glu Phe Gin Thr Glu Tyr Leu Leu Cys Asp Cys Asn Ile 185 190 195
Leu Trp Met His Arg Trp Val Lys Glu Lys Asn Ile Thr Val Arg 200 205 210
Asp Thr Arg Cys Val Tyr Pro Lys Ser Leu Gin Ala Gin Pro Val 215 220 225
Thr Gly Val Lys Gin Glu Leu Leu Thr Cys Asp Pro Pro Leu Glu 230 235 240
Leu Pro Ser Phe Tyr Met Thr Pro Ser His Arg Gin Val Val Phe 245 250 255
Glu Gly Asp Ser Leu Pro Phe Gin Cys Met Ala Ser Tyr Ile Asp 260 265 270
Gin Asp Met Gin Val Leu Trp Tyr Gin Asp Gly Arg Ile Val Glu 275 280 285
Thr Asp Glu Ser Gin Gly Ile Phe Val Glu Lys Asn Met Ile His 290 295 300
519
Asn Cys Ser Leu Ile Ala Ser Ala Leu Thr Ile Ser Asn Ile Gin 305 310 315
Ala Gly Ser Thr Gly Asn Trp Gly Cys His Val Gin Thr Lys Arg 320 325 330
Gly Asn Asn Thr Arg Thr Val Asp Ile Val Val Leu Glu Ser Ser 335 340 345
Ala Gin Tyr Cys Pro Pro Glu Arg Val Val Asn Asn Lys Gly Asp 350 355 360
Phe Arg Trp Pro Arg Thr Leu Ala Gly Ile Thr Ala Tyr Leu Gin 365 370 375
Cys Thr Arg Asn Thr His Gly Ser Gly lie Tyr Pro Gly Asn Pro 380 385 390
Gin Asp Glu Arg Lys Ala Trp Arg Arg Cys Asp Arg Gly Gly Phe 395 400 405
Trp Ala Asp Asp Asp Tyr Ser Arg Cys Gin Tyr Ala Asn Asp Val 410 415 420
Thr Arg Val Leu Tyr Met Phe Asn Gin Met Pro Leu Asn Leu Thr 425 430 435
Asn Ala Val Ala Thr Ala Arg Gin Leu Leu Ala Tyr Thr Val Glu 440 445 450
Ala Ala Asn Phe Ser Asp Lys Met Asp Val Ile Phe Val Ala Glu 455 460 465
Met Ile Glu Lys Phe Gly Arg Phe Thr Lys Glu Glu Lys Ser Lys 470 475 480
Glu Leu Gly Asp Val Met Val Asp Ile Ala Ser Asn Ile Met Leu 485 490 495
Ala Asp Glu Arg Val Leu Trp Leu Ala Gin Arg Glu Ala Lys Ala 500 505 510
Cys Ser Arg Ile Val Gin Cys Leu Gin Arg Ile Ala Thr Tyr Arg 515 520 525
Leu Ala Gly Gly Ala His Val Tyr Ser Thr Tyr Ser Pro Asn Ile 530 535 540
Ala Leu Glu Ala Tyr Val Ile Lys Ser Thr Gly Phe Thr Gly Met 545 550 555
Thr Cys Thr Val Phe Gin Lys Val Ala Ala Ser Asp Arg Thr Gly 560 565 570
Leu Ser Asp Tyr Gly Arg Arg Asp Pro Glu Gly Asn Leu Asp Lys 575 580 585
Gin Leu Ser Phe Lys Cys Asn Val Ser Asn Thr Phe Ser Ser Leu 590 595 600
Ala Leu Lys Val Cys Tyr Ile Leu Gin Ser Phe Lys Thr Ile Tyr 605 610 615
520
Ser
<210> 25 <211> 24 <212> DNA
<213> Artificial sequence <220>
<223> Synthetic oligonucleotide probe
<400> 25 gaggactcac caatctggtt cggc 24
<210> 26 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 26 aactggaaag gaaggctgtc tccc 24
<210> 27 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 27
gtaaaggaga agaacatcac ggtacgggat accaggtgtg tttatcctaa 50
<210> 28 <211> 683 <212> DNA
<213> Homo sapiens <400> 28
gcgtggggat gtctaggagc tcgaaggtgg tgctgggcct ctcggtgctg
50
ctgacggcgg ccacagtggc cggcgtacat gtgaagcagc agtgggacca
100
gcagaggctt cgtgacggag ttatcagaga cattgagagg caaattcgga
150
aaaaagaaaa cattcgtctt ttgggagaac agattatttt gactgagcaa
200
cttgaagcag aaagagagaa gatgttattg gcaaaaggat ctcaaaaatc
250
atgacttgaa tgtgaaatat ctgttggaca gacaacacga gtttgtgtgt
300
gtgtgttgat ggagagtagc ttagtagtat cttcatcttt ttttttggtc
350
actgtccttt taaacttgat caaataaagg acagtgggtc atataagtta
400
ctgctttcag ggtcccttat atctgaataa aggagtgtgg gcagacactt
450
tttggaagag tctgtctggg tgatcctggt agaagcccca ttagggtcac
500
521
tgtccagtgc ttagggttgt tactgagaag cactgccgag cttgtgagaa 550
ggaagggatg gatagtagca tccacctgag tagtctgatc agtcggcatg 600
atgacgaagc cacgagaaca tcgacctcag aaggactgga ggaaggtgaa 650
gtggagggag agacgctcct gatcgtcgaa tcc 683
<210> 29 <211> 81 <212> PRT
<213> Homo sapiens <220>
<221> sig_peptide <222> 1-21
<223> Signal peptide.
<400> 29
Met Ser Arg Ser Ser Lys Val Val Leu Gly Leu Ser Val Leu Leu 15 10 15
Thr Ala Ala Thr Val Ala Gly Val His Val Lys Gin Gin Trp Asp 20 25 30
Gin Gin Arg Leu Arg Asp Gly Val Ile Arg Asp Ile Glu Arg Gin 35 40 45
Ile Arg Lys Lys Glu Asn Ile Arg Leu Leu Gly Glu Gin Ile Ile 50 55 60
Leu Thr Glu Gin Leu Glu Ala Glu Arg Glu Lys Met Leu Leu Ala 65 70 75
Lys Gly Ser Gin Lys Ser 80
<210> 30 <211> 2128 <212> DNA
<213> Homo sapiens <400> 30
ctgtcgtctt tgcttcagcc gcagtcgcca ctggctgcct gaggtgctct
50
tacagcctgt tccaagtgtg gcttaatccg tctccaccac cagatctttc
100
tccgtggatt cctctgctaa gaccgctgcc atgccagtga cggtaacccg
150
caccaccatc acaaccacca cgacgtcatc ttcgggcctg gggtccccca
200
tgatcgtggg gtcccctcgg gccctgacac agcccctggg tctccttcgc
250
ctgctgcagc tggtgtctac ctgcgtggcc ttctcgctgg tggctagcgt
300
gggcgcctgg acggggtcca tgggcaactg gtccatgttc acctggtgct
350
tctgcttctc cgtgaccctg atcatcctca tcgtggagct gtgcgggctc
400
caggcccgct tccccctgtc ttggcgcaac ttccccatca ccttcgcctg
450
522
ctatgcggcc ctcttctgcc tctcggcctc catcatctac cccaccacct
500
atgtccagtt cctgtcccac ggccgttcgc gggaccacgc catcgccgcc
550
accttcttct cctgcatcgc gtgtgtggct tacgccaccg aagtggcctg
600
gacccgggcc cggcccggcg agatcactgg ctatatggcc accgtacccg
650
ggctgctgaa ggtgctggag accttcgttg cctgcatcat cttcgcgttc
700
atcagcgacc ccaacctgta ccagcaccag ccggccctgg agtggtgcgt
750
ggcggtgtac gccatctgct tcatcctagc ggccatcgcc atcctgctga
800
acctggggga gtgcaccaac gtgctaccca tccccttccc cagcttcctg
850
tcggggctgg ccttgctgtc tgtcctcctc tatgccaccg cccttgttct
900
ctggcccctc taccagttcg atgagaagta tggcggccag cctcggcgct
950
cgagagatgt aagctgcagc cgcagccatg cctactacgt gtgtgcctgg
1000
gaccgccgac tggctgtggc catcctgacg gccatcaacc tactggcgta
1050
tgtggctgac ctggtgcact ctgcccacct ggtttttgtc aaggtctaag
1100
actctcccaa gaggctcccg ttccctctcc aacctctttg ttcttcttgc
1150
ccgagttttc tttatggagt acttctttcc tccgcctttc ctctgttttc
1200
ctcttcctgt ctcccctccc tcccaccttt ttctttcctt cccaattcct
1250
tgcactctaa ccagttcttg gatgcatctt cttccttccc tttcctcttg
1300
ctgtttcctt cctgtgttgt tttgttgccc acatcctgtt ttcacccctg
1350
agctgtttct ctttttcttt tctttctttt tttttttttt ttttaagacg
1400
gattctcact ctgtggccca ggctggagtg cagtggtgcg atctcagctc
1450
actgcaaccc ccgcctcctg ggttcaagcg attctcctcc cccagcctcc
1500
caagtagctg ggaggacagg tgtgagctgc cgcacccagc ctgtttctct
1550
ttttccactc ttcttttttc tcatctcttt tctgggttgc ctgtcggctt
1600
tcttatctgc ctgttttgca agcaccttct cctgtgtcct tgggagccct
1650
gagacttctt tctctccttg cctccaccca cctccaaagg tgctgagctc
1700
acatccacac cccttgcagc cgtccatgcc acagcccccc aaggggcccc
1750
attgccaaag catgcctgcc caccctcgct gtgccttagt cagtgtgtac
1800
gtgtgtgtgt gtgtgtgttt ggggggtggg gggtgggtag ctggggattg
1850
ggccctcttt ctcccagtgg aggaaggtgt gcagtgtact tcccctttaa
1900
attaaaaaac atatatatat atatatttgg aggtcagtaa tttccaatgg
1950
gcgggaggca ttaagcaccg accctgggtc cctaggcccc gcctggcact
2000
523
cagccttgcc agagattggc tccagaattt ttgccaggct tacagaacac 2050
ccactgccta gaggccatct taaaggaagc aggggctgga tgcctttcat 2100
cccaactatt ctctgtggta tgaaaaag 2128
<210> 31 <211> 322 <212> PRT
<213> Homo sapiens <400> 31
Met Pro Val Thr Val Thr Arg Thr Thr Ile Thr Thr Thr Thr Thr 15 10 15
Ser Ser Ser Gly Leu Gly Ser Pro Met Ile Val Gly Ser Pro Arg 20 25 30
Ala Leu Thr Gin Pro Leu Gly Leu Leu Arg Leu Leu Gin Leu Val 35 40 45
t
Ser Thr Cys Val Ala Phe Ser Leu Val Ala Ser Val Gly Ala Trp 50 55 60
Thr Gly Ser Met Gly Asn Trp Ser Met Phe Thr Trp Cys Phe Cys 65 70 75
Phe Ser Val Thr Leu Ile Ile Leu Ile Val Glu Leu Cys Gly Leu 80 85 90
Gin Ala Arg Phe Pro Leu Ser Trp Arg Asn Phe Pro Ile Thr Phe 95 100 105
Ala Cys Tyr Ala Ala Leu Phe Cys Leu Ser Ala Ser Ile Ile Tyr 110 115 120
Pro Thr Thr Tyr Val Gin Phe Leu Ser His Gly Arg Ser Arg Asp 125 130 135
His Ala Ile Ala Ala Thr Phe Phe Ser Cys Ile Ala Cys Val Ala 140 145 150
Tyr Ala Thr Glu Val Ala Trp Thr Arg Ala Arg Pro Gly Glu Ile 155 160 165
Thr Gly Tyr Met Ala Thr Val Pro Gly Leu Leu Lys Val Leu Glu 170 175 180
Thr Phe Val Ala Cys Ile Ile Phe Ala Phe Ile Ser Asp Pro Asn 185 190 195
Leu Tyr Gin His Gin Pro Ala Leu Glu Trp Cys Val Ala Val Tyr 200 205 210
Ala Ile Cys Phe Ile Leu Ala Ala Ile Ala Ile Leu Leu Asn Leu 215 220 225
Gly Glu Cys Thr Asn Val Leu Pro Ile Pro Phe Pro Ser Phe Leu 230 235 240
Ser Gly Leu Ala Leu Leu Ser Val Leu Leu Tyr Ala Thr Ala Leu 245 250 255
524
Val Leu Trp Pro Leu Tyr Gin Phe Asp Glu Lys Tyr Gly Gly Gin 260 265 270
Pro Arg Arg Ser Arg Asp Val Ser Cys Ser Arg Ser His Ala Tyr 275 280 285
Tyr Val Cys Ala Trp Asp Arg Arg Leu Ala Val Ala Ile Leu Thr 290 295 300
Ala Ile Asn Leu Leu Ala Tyr Val Ala Asp Leu Val His Ser Ala 305 310 315
His Leu Val Phe Val Lys Val 320
<210> 32
<211> 3680
<212> DNA
<213> Homo sapiens
<400> 32
gaacgtgcca ccatgcccag ctaatttttg tatttttagt agagacgggg
50
tttcaccatg ttggccaggc tggtcttgaa ctcgtgacct catgatccgc
100
tcacctcggc ctcccaaagt gctgggatta caggcatgag ccactgacgc
150
ctggccagcc tatgcatttt taagaaatta ttctgtatta ggtgctgtgc
200
taaacattgg gcactacagt gaccaaaaca gactgaattc cccaagagcc
250
aaagaccagt gagggagacc aacaagaaac aggaaatgca aaagagacca
300
ttattactca ctatgactaa gggtcacaaa tggggtacgt tgatggagag
350
tgatttgtta agagactaca gagggaggac agactaccaa gaggggggcc
400
aggaaagctc ctctgacgag gtggtatttc agcccaaact ggaagaatga
450
gaaagagcta gccagccatc agaatagtcc agaagagatg gggagcacta
500
cactcactac actttggcct gagaaaatag catgggattg gaggaggctg
550
ggggaacacc acttctgccg acctgggcag gaggcattga gggcttgaga
600
aagggcaatg gcagtagcag tagaaaggac agggtaggag cagggacttt
650
gcaggtggaa tcattaggtc ttatcaacag atatgggcaa gcaaagccag
700
gggagaattg atggtaatgc tgaggtttgg agccaggcta gatgggacag
750
tggtgggtga tgcaaaggaa agaggtcagg aagcagggcc agacgtgggg
800
agaaggtgtg ggggtttggt ttccatcttg ccgagtctgc cggaatgtgg
850
atgggaagac caagaggagg agcaaggggc agaggggaag ggaatcttaa
900
agaagtcctg gatgccacac tcttcttcct tcctcctctt ccctctcctc
950
agaggtctca ctcgtggttc ttcatttcct gccctgcctc catctcctct
1000
525
gggtgctggg aaagtggagg attagctgaa gttttgcttc tcggggcctg
1050
tctgaatctc cattgctttc tgggaggaca taattcacct gtcctagctt
1100
cttatcatct tacatttccc tgtagccact gggacatatg tggtgttcct
1150
tcctagctcc tgtctcctcc tcatgccttt gctgggtatg ggcatgttag
1200
ggggaaggtc attgctgtca gaggggcact gactttctaa tggtgttacc
1250
caaggtgaat gttggagaca cagtcgcgat gctgcccaag tcccggcgag
1300
ccctaactat ccaggagatc gctgcgctgg ccaggtcctc cctgcatggt
1350
atgcagcccc tcccatgttt ctggccactt tgtcctttct cctcccgttt
1400
gcacatccct ttggaactgt ttcctgtgag tacatgctgg ggtctcccct
1450
ttcttccctt gctcaggtga atctcagccc cttctcccac ccaaaggttc
1500
acatggatcc taactactgc cacccttcca cctccctgca cctgtgctcc
1550
ctggcctggt cctttaccag gcttctccac cctcccctat ctccaggtat
1600
ttcccaggtg gtgaaggacc acgtgaccaa gcctaccgcc atggcccagg
1650
gccgagtggc tcacctcatt gagtggaagg gctggagcaa gccgagtgac
1700
tcacctgctg ccctggaatc agccttttcc tcctattcag acctcagcga
1750
gggcgaacaa gaggctcgct ttgcagcagg agtggctgag cagtttgcca
1800
tcgcggaagc actgatgact caagctccga cctatgatga gcatggtctt ggactttgct cggtggatgg gggggaatgg cgaggactcc acacagacat
1850 1900
ggctgggcag ctgcccctgg ggccgcacct ccaggacctg ttcaccggcc
1950
accggttctc ccggcctgtg cgccagggct ccgtggagcc tgagagcgac
2000
tgctcacaga ccgtgtcccc agacaccctg tgctctagtc tgtgcagcct
2050
ggaggatggg ttgttgggct ccccggcccg gctggcctcc cagctgctgg
2100
gcgatgagct gcttctcgcc aaactgcccc ccagccggga aagtgccttc
2150
cgcagcctgg gcccactgga ggcccaggac tcactctaca actcgcccct
2200
cacagagtcc tgcctttccc ccgcggagga ggagccagcc ccctgcaagg
2250
actgccagcc actctgccca ccactaacgg gcagctggga acggcagcgg
2300
caagcctctg acctggcctc ttctggggtg gtgtccttag atgaggatga
2350
ggcagagcca gaggaacagt gacccacatc atgcctggca gtggcatgca
2400
tcccccggct gctgccaggg gcagagcctc tgtgcccaag tgtgggctca
2450
aggctcccag cagagctcca cagcctagag ggctcctggg agcgctcgct
2500
tctccgttgt gtgttttgca tgaaagtgtt tggagaggag gcaggggctg
2550
ggctgggggc gcatgtcctg cccccactcc cggggcttgc cgggggttgc'
2600
526
ccggggcctc tggggcatgg ctacagctgt ggcagacagt gatgttcatg
2650
ttcttaaaat gccacacaca catttcctcc tcggataatg tgaaccacta
2700
agggggttgt gactgggctg tgtgagggtg gggtgggagg gggcccagca
2750
accccccacc ctccccatgc ctctctcttc tctgcttttc ttctcacttc
2800
cgagtccatg tgcagtgctt gatagaatca cccccacctg gaggggctgg
2850
ctcctgccct cccggagcct atgggttgag ccgtccctca agggcccctg
2900
cccagctggg ctcgtgctgt gcttcattca cctctccatc gtctctaaat
2950
cttcctcttt tttcctaaag acagaaggtt tttggtctgt tttttcagtc
3000
ggatcttctc ttctctggga ggctttggaa tgatgaaagc atgtaccctc
3050
cacccttttc ctggccccct aatggggcct gggccctttc ccaacccctc
3100
ctaggatgtg cgggcagtgt gctggcgcct cacagccagc cgggctgccc
3150
attcacgcag agctctctga gcgggaggtg gaagaaagga tggctctggt
3200
tgccacagag ctgggacttc atgttcttct agagagggcc acaagagggc
3250
cacaggggtg gccgggagtt gtcagctgat gcctgctgag aggcaggaat
3300
tgtgccagtg agtgacagtc atgagggagt gtctcttctt ggggaggaaa
3350
gaaggtagag cctttctgtc tgaatgaaag gccaaggcta cagtacaggg
3400
ccccgcccca atcctgcatt gccagggtgt ccaaggtcac taatgcccac tggactgtac gtagtggagg gtttttatgg cctctggcag ttgtgggaag
3450 3500
ggtgggtggc tttagaatta agggccttgt aggctttggc aggtaagagg
3550
gcccaaggta agaacgagag ccaacgggca caagcattct atatataagt
3600
ggctcattag gtgtttattt tgttctattt aagaatttgt tttattaaat
3650
taatataaaa atctttgtaa atctctaaaa
3680
<210> 33 <211> 335 <212> PRT
<213> Homo sapiens <400> 33
Met Phe Leu Ala Thr Leu Ser Phe Leu Leu Pro Phe Ala His Pro 15 10 15
Phe Gly Thr Val Ser Cys Glu Tyr Met Leu Gly Ser Pro Leu Ser 20 25 30
Ser Leu Ala Gin Val Asn Leu Ser Pro Phe Ser His Pro Lys Val 35 40 45
His Met Asp Pro Asn Tyr Cys His Pro Ser Thr Ser Leu His Leu 50 55 60
527
Cys Ser Leu Ala Trp Ser Phe Thr Arg Leu Leu His Pro Pro Leu 65 70 75
Ser Pro Gly Ile Ser Gin Val Val Lys Asp His Val Thr Lys Pro 80 85 90
Thr Ala Met Ala Gin Gly Arg Val Ala His Leu Ile Glu Trp Lys 95 100 105
. Gly Trp Ser Lys Pro Ser Asp Ser Pro Ala Ala Leu Glu Ser Ala 110 115 120
Phe Ser Ser Tyr Ser Asp Leu Ser Glu Gly Glu Gin Glu Ala Arg 125 130 135
Phe Ala Ala Gly Val Ala Glu Gin Phe Ala Ile Ala Glu Ala Lys 140 145 150
Leu Arg Ala Trp Ser Ser Val Asp Gly Glu Asp Ser Thr Asp Asp 155 160 165
Ser Tyr Asp Glu Asp Phe Ala Gly Gly Met Asp Thr Asp Met Ala 170 175 180
Gly Gin Leu Pro Leu Gly Pro His Leu Gin Asp Leu Phe Thr Gly 185 190 195
His Arg Phe Ser Arg Pro Val Arg Gin Gly Ser Val Glu Pro Glu 200 205 210
Ser Asp Cys Ser Gin Thr Val Ser Pro Asp Thr Leu Cys Ser Ser 215 220 225
Leu Cys Ser Leu Glu Asp Gly Leu Leu Gly Ser Pro Ala Arg Leu 230 235 240
Ala Ser Gin Leu Leu Gly Asp Glu Leu Leu Leu Ala Lys Leu Pro 245 250 255
Pro Ser Arg Glu Ser Ala Phe Arg Ser Leu Gly Pro Leu Glu Ala 260 265 270
Gin Asp Ser Leu Tyr Asn Ser Pro Leu Thr Glu Ser Cys Leu Ser 275 280 285
Pro Ala Glu Glu Glu Pro Ala Pro Cys Lys Asp Cys Gin Pro Leu 290 295 300
Cys Pro Pro Leu Thr Gly Ser Trp Glu Arg Gin Arg Gin Ala Ser 305 310 315
Asp Leu Ala Ser Ser Gly Val Val Ser Leu Asp Glu Asp Glu Ala 320 325 330
Glu Pro Glu Glu Gin 335
<210> 34 <211> 25 <212> DNA
<213> Artificial Sequence
<220>
528
<223> Synthetic oligonucleotide probe
<400> 34 tgtcctttgt cccagacttc tgtcc 25
<210> 35 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 35
ctggatgcta atgtgtccag taaatgatcc ccttatcccg tcgcgatgct 50
<210> 36 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 36 ttccactcaa tgaggtgagc cactc 25
<210> 37 <211> 23 <212> DNA
<213> Artificial Sequence <22 0>
<223> Synthetic oligonucleotide probe
<400> 37 ggcgagccct aactatccag gag 23
<210> 38 <211> 39 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 38
ggagatcgct gcgctggcca ggtcctccct gcatggtat 39
<210> 39 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 39 ctgctgcaaa gcgagcctct tg 22
<210> 40 <211> 2084
;212> DNA
;213> Homo sapiens
529
:400> 40 ggttcctggg cgctctgtta cacaagcaag atacagccag ccccacctaa
50
ttttgtttcc ctggcaccct cctgctcagt gcgacattgt cacacttaac
100
ccatctgttt tctctaatgc acgacagatt cctttcagac aggacaactg
150
tgatatttca gttcctgatt gtaaatacct cctaagcctg aagcttctgt
200
tactagccat tgtgagcttc agtttcttca tctgcaaaat gggcataata
250
caatctattc ttgccacatc aagggattgt tattccttta aaaaaaaacc
300
aataccaaag aagcctacaa tgttggcctt agccaaaatt ctgttgattt
350
caacgttgtt ttattcactt ctatcgggga gccatggaaa agaaaatcaa
400
gacataaaca caacacagaa cattgcagaa gtttttaaaa caatggaaaa
450
taaacctatt tctttggaaa gtgaagcaaa cttaaactca gataaagaaa
500
atataaccac ctcaaatctc aaggcgagtc attcccctcc tttgaatcta
550
cccaacaaca gccacggaat aacagatttc tccagtaact catcagcaga
600
gcattctttg ggcagtctaa aacccacatc taccatttcc acaagccctc
650
ccttgatcca tagctttgtt tctaaagtgc cttggaatgc acctatagca
700
gatgaagatc ttttgcccat ctcagcacat cccaatgcta cacctgctct
750
gtcttcagaa aacttcactt ggtctttggt caatgacacc gtgaaaactc
800
ctgataacag ccatctgtga ttccattaca cccccttgat gttagcatcc agtggaacca tctcttcaga agtggatggc accaacttct ttaccacaaa
850 900
cagtgatagc ttcactgggt ttacccctta tcaagaaaaa acaactctac
950
agcctacctt aaaattcacc aataattcaa aactctttcc aaatacgtca
1000
gatccccaaa aagaaaatag aaatacagga atagtattcg gggccatttt
1050
aggtgctatt ctgggtgtct cattgcttac tcttgtgggc tacttgttgt
1100
gtggaaaaag gaaaacggat tcattttccc atcggcgact ttatgacgac
1150
agaaatgaac cagttctgcg attagacaat gcaccggaac cttatgatgt
1200
gagttttggg aattctagct actacaatcc aactttgaat gattcagcca
1250
tgccagaaag tgaagaaaat gcacgtgatg gcattcctat ggatgacata
1300
cctccacttc gtacttctgt atagaactaa cagcaaaaag gcgttaaaca
1350
gcaagtgtca tctacatcct agccttttga caaattcatc tttcaaaagg
1400
ttacacaaaa ttactgtcac gtggattttg tcaaggagaa tcataaaagc
1450
aggagaccag tagcagaaat gtagacagga tgtatcatcc aaaggttttc
1500
530
tttcttacaa tttttggcca tcctgaggca tttactaagt agccttaatt
1550
tgtattttag tagtattttc ttagtagaaa atatttgtgg aatcagataa
1600
aactaaaaga tttcaccatt acagccctgc ctcataacta aataataaaa
1650
attattccac caaaaaattc taaaacaatg aagatgactc tttactgctc
1700
tgcctgaagc cctagtacca taattcaaga ttgcattttc ttaaatgaaa
1750
attgaaaggg tgctttttaa agaaaatttg acttaaagct aaaaagagga
1800
catagcccag agtttctgtt attgggaaat tgaggcaata gaaatgacag
1850
acctgtattc tagtacgtta taattttcta gatcagcaca cacatgatca
1900
gcccactgag ttatgaagct gacaatgact gcattcaacg gggccatggc
1950
aggaaagctg accctaccca ggaaagtaat agcttcttta aaagtcttca
2000
aaggttttgg gaattttaac ttgtcttaat atatcttagg cttcaattat
2050
ttgggtgcct taaaaactca atgagaatca tggt 2084
<210> 41 <211> 334 <212> PRT
<213> Homo sapiens <400> 41
Met Leu Ala Leu Ala Lys Ile Leu Leu Ile Ser Thr Leu Phe Tyr 15 10 15
Ser Leu Leu Ser Gly Ser His Gly Lys Glu Asn Gin Asp Ile Asn 20 25 30
Thr Thr Gin Asn Ile Ala Glu Val Phe Lys Thr Met Glu Asn Lys 35 40 45
Pro Ile Ser Leu Glu Ser Glu Ala Asn Leu Asn Ser Asp Lys Glu 50 55 60
Asn Ile Thr Thr Ser Asn Leu Lys Ala Ser His Ser Pro Pro Leu 65 70 75
Asn Leu Pro Asn Asn Ser His Gly Ile Thr Asp Phe Ser Ser Asn 80 85 90
Ser Ser Ala Glu His Ser Leu Gly Ser Leu Lys Pro Thr Ser Thr 95 100 105
Ile Ser Thr Ser Pro Pro Leu Ile His Ser Phe Val Ser Lys Val 110 115 120
Pro Trp Asn Ala Pro Ile Ala Asp Glu Asp Leu Leu Pro Ile Ser 125 130 135
Ala His Pro Asn Ala Thr Pro Ala Leu Ser Ser Glu Asn Phe Thr 140 145 150
Trp Ser Leu Val Asn Asp Thr Val Lys Thr Pro Asp Asn Ser Ser
531
155 160 165
Ile Thr Val Ser Ile Leu Ser Ser Glu Pro Thr Ser Pro Ser Val 170 175 180
Thr Pro Leu Ile Val Glu Pro Ser Gly Trp Leu Thr Thr Asn Ser 185 190 195
Asp Ser Phe Thr Gly Phe Thr Pro Tyr Gin Glu Lys Thr Thr Leu 200 205 210
Gin Pro Thr Leu Lys Phe Thr Asn Asn Ser Lys Leu Phe Pro Asn 215 220 225
Thr Ser Asp Pro Gin Lys Glu Asn Arg Asn Thr Gly Ile Val Phe 230 235 240
Gly Ala Ile Leu Gly Ala Ile Leu Gly Val Ser Leu Leu Thr Leu 245 250 255
Val Gly Tyr Leu Leu Cys Gly Lys Arg Lys Thr Asp Ser Phe Ser 260 265 270
His Arg Arg Leu Tyr Asp Asp Arg Asn Glu Pro Val Leu Arg Leu 275 280 285
Asp Asn Ala Pro Glu Pro Tyr Asp Val Ser Phe Gly Asn Ser Ser 290 295 300
Tyr Tyr Asn Pro Thr Leu Asn Asp Ser Ala Met Pro Glu Ser Glu 305 310 315
Glu Asn Ala Arg Asp Gly Ile Pro Met Asp Asp Ile Pro Pro Leu 320 325 330
Arg Thr Ser Val
<210> 42
<211> 1594
<212> DNA
<213> Homo sapiens
<400> 42
aacaggatct cctcttgcag tctgcagccc aggacgctga ttccagcagc
50
gccttaccgc gcagcccgaa gattcactat ggtgaaaatc gccttcaata
100
cccctaccgc cgtgcaaaag gaggaggcgc ggcaagacgt ggaggccctc
150
ctgagccgca cggtcagaac tcagatactg accggcaagg agctccgagt
200
tgccacccag gaaaaagagg gctcctctgg gagatgtatg cttactctct
250
taggcctttc attcatcttg gcaggactta ttgttggtgg agcctgcatt
300
tacaagtact tcatgcccaa gagcaccatt taccgtggag agatgtgctt
350
ttttgattct gaggatcctg caaattccct tcgtggagga gagcctaact
400
tcctgcctgt gactgaggag gctgacattc gtgaggatga caacattgca
450
532
atcattgatg tgcctgtccc cagtttctct gatagtgacc ctgcagcaat
500
tattcatgac tttgaaaagg gaatgactgc ttacctggac ttgttgctgg
550
ggaactgcta tctgatgccc ctcaatactt ctattgttat gcctccaaaa
600
aatctggtag agctctttgg caaactggcg agtggcagat atctgcctca
650
aacttatgtg gttcgagaag acctagttgc tgtggaggaa attcgtgatg
700
ttagtaacct tggcatcttt atttaccaac tttgcaataa cagaaagtcc
750
ttccgccttc gtcgcagaga cctcttgctg ggtttcaaca aacgtgccat
800
tgataaatgc tggaagatta gacacttccc caacgaattt attgttgaga
850
ccaagatctg tcaagagtaa gaggcaacag atagagtgtc cttggtaata
900
agaagtcaga gatttacaat atgactttaa cattaaggtt tatgggatac
950
tcaagatatt tactcatgca tttactctat tgcttatgct ttaaaaaaag
1000
gaaaaaaaaa aaaactacta accactgcaa gctcttgtca aattttagtt
1050
taattggcat tgcttgtttt ttgaaactga aattacatga gtttcatttt
1100
ttctttgcat ttatagggtt tagatttctg aaagcagcat gaatatatca
1150
cctaacatcc tgacaataaa ttccatccgt tgtttttttt gtttgtttgt
1200
tttttctttt cctttaagta agctctttat tcatcttatg gtggagcaat
1250
tttaaaattt gaaatatttt aaattgtttt tgaacttttt gtgtaaaata
1300
tatcagatct caacattgtt ggtttctttt gtttttcatt ttgtacaact
1350
ttcttgaatt tagaaattac atctttgcag ttctgttagg tgctctgtaa
1400
ttaacctgac ttatatgtga acaattttca tgagacagtc atttttaact
1450
aatgcagtga ttctttctca ctactatctg tattgtggaa tgcacaaaat
1500
tgtgtaggtg ctgaatgctg taaggagttt aggttgtatg aattctacaa
1550
ccctataata aattttactc tatacaaaaa aaaaaaaaaa aaaa 1594
<210> 43 <211> 263 <212> PRT
<213> Homo sapiens <400> 43
Met Val Lys Ile Ala Phe Asn Thr Pro Thr Ala Val Gin Lys Glu 15 10 15
Glu Ala Arg Gin Asp Val Glu Ala Leu Leu Ser Arg Thr Val Arg 20 25 30
Thr Gin Ile Leu Thr Gly Lys Glu Leu Arg Val Ala Thr Gin Glu 35 40 45
Lys Glu Gly Ser Ser Gly Arg Cys Met Leu Thr Leu Leu Gly Leu
533
50 55 60
Ser Phe Ile Leu Ala Gly Leu Ile Val Gly Gly Ala Cys lie Tyr 65 70 75
Lys Tyr Phe Met Pro Lys Ser Thr Ile Tyr Arg Gly Glu Met Cys 80 85 90
Phe Phe Asp Ser Glu Asp Pro Ala Asn Ser Leu Arg Gly Gly Glu 95 100 105
Pro Asn Phe Leu Pro Val Thr Glu Glu Ala Asp Ile Arg Glu Asp 110 115 120
Asp Asn Ile Ala Ile Ile Asp Val Pro Val Pro Ser Phe Ser Asp 125 130 135
Ser Asp Pro Ala Ala Ile Ile His Asp Phe Glu Lys Gly Met Thr 140 145 150
Ala Tyr Leu Asp Leu Leu Leu Gly Asn Cys Tyr Leu Met Pro Leu 155 160 165
Asn Thr Ser Ile Val Met Pro Pro Lys Asn L'eu Val Glu Leu Phe 170 175 180
Gly Lys Leu Ala Ser Gly Arg Tyr Leu Pro Gin Thr Tyr Val Val 185 190 195
Arg Glu Asp Leu Val Ala Val Glu Glu Ile Arg Asp Val Ser Asn 200 205 210
Leu Gly Ile Phe Ile Tyr Gin Leu Cys Asn Asn Arg Lys Ser Phe 215 220 225
Arg Leu Arg Arg Arg Asp Leu Leu Leu Gly Phe Asn Lys Arg Ala 230 235 240
Ile Asp Lys Cys Trp Lys Ile Arg His Phe Pro Asn Glu Phe Ile 245 250 255
Val Glu Thr Lys Ile Cys Gin Glu 260
<210> 44
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 44
gaaagacacg acacagcagc ttgc 24
<210> 45
<211> 20
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
534
<400> 45 gggaactgct atctgatgcc 20
<210> 46 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 46 caggatctcc tcttgcagtc tgcagc 2 6
<210> 47 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 47
cttctcgaac cacataagtt tgaggcag 28
<210> 48 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 48 cacgattccc tccacagcaa ctggg 25
<210> 49
<211> 1969
<212> DNA
<213> Homo sapiens
<400> 49
ggaggaggga gggcgggcag gcgccagccc agagcagccc cgggcaccag
50
cacggactct ctcttccagc ccaggtgccc cccactctcg ctccattcgg
100
cgggagcacc cagtcctgta cgccaaggaa ctggtcctgg gggcaccatg
150
gtttcggcgg cagcccccag cctcctcatc cttctgttgc tgctgctggg
200
gtctgtgcct gctaccgacg cccgctctgt gcccctgaag gccacgttcc
250
tggaggatgt ggcgggtagt ggggaggccg agggctcgtc ggcctcctcc
300
ccgagcctcc cgccaccctg gaccccggcc ctcagcccca catcgatggg
350
gccccagccc acaaccctgg ggggcccatc accccccacc aacttcctgg
400
atgggatagt ggacttcttc cgccagtacg tgatgctgat tgctgtggtg
450
ggctccctgg cctttctgct gatgttcatc gtctgtgccg cggtcatcac
500
535
ccggcagaag cagaaggcct cggcctatta cccatcgtcc ttccccaaga
550
agaagtacgt ggaccagagt gaccgggccg ggggcccccg ggccttcagt
600
gaggtccccg acagagcccc cgacagcagg cccgaggaag ccctggattc
650
ctcccggcag ctccaggccg acatcttggc cgccacccag aacctcaagt
700
cccccaccag ggctgcactg ggcggtgggg acggagccag gatggtggag
750
ggcaggggcg cagaggaaga ggagaagggc agccaggagg gggaccagga
800
agtccaggga catggggtcc cagtggagac accagaggcg caggaggagc
850
cgtgctcagg ggtccttgag ggggctgtgg tggccggtga gggccaaggg
900
gagctggaag ggtctctctt gttagcccag gaagcccagg gaccagtggg
950
tccccccgaa agcccctgtg cttgcagcag tgtccacccc agtgtctaac
1000
agtcctcccg ggctgccagc cctgactgtc gggcccccaa gtggtcacct
1050
ccccgtgtat gaaaaggcct tcagccctga ctgcttcctg acactccctc
1100
cttggcctcc ctgtggtgcc aatcccagca tgtgctgatt ctacagcagg
1150
cagaaatgct ggtccccggt gccccggagg aatcttacca agtgccatca
1200
tccttcacct cagcagcccc aaagggctac atcctacagc acagctcccc
1250
tgacaaagtg agggagggca cgtgtccctg tgacagccag gataaaacat
1300
cccccaaagt gctgggatta caggcgtgag ccaccgtgcc cggcccaaac
1350
tactttttaa aacagctaca gggtaaaatc ctgcagcacc cactctggaa
1400
aatactgctc ttaattttcc tgaaggtggc cccctgtttc tagttggtcc
1450
aggattaggg atgtggggta tagggcattt aaatcctctc aagcgctctc
1500
caagcacccc cggcctgggg gtgagtttct catcccgcta ctgctgctgg
1550
gatcaggttg aatgaatgga actcttcctg tctggcctcc aaagcagcct
1600
agaagctgag gggctgtgtt tgaggggacc tccaccctgg ggaagtccga
1650
ggggctgggg aagggtttct gacgcccagc ctggagcagg ggggccctgg
1700
ccaccccctg ttgctcacac attgtctggc agcctgtgtc cacaatattc
1750
gtcagtcctc gacagggagc ctgggctccg tcctgcttta gggaggctct
1800
ggcaggaggt cctctccccc atccctccat ctggggctcc cccaacctct
1850
gcacagctct ccaggtgctg agatataatg caccagcaca ataaaccttt
1900
attccggcct gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1950
aaaaaaaaaa aaaaaaaga 1969
;210> 50
536
<211> 283 <212> PRT
<213> Homo sapiens <400> 50
Met Val Ser Ala Ala Ala Pro Ser Leu Leu Ile Leu Leu Leu Leu 15 10 15
Leu Leu Gly Ser Val Pro Ala Thr Asp Ala Arg Ser Val Pro Leu 20 25 30
Lys Ala Thr Phe Leu Glu Asp Val Ala Gly Ser Gly Glu Ala Glu 35 40 45
Gly Ser Ser Ala Ser Ser Pro Ser Leu Pro Pro Pro Trp Thr Pro 50 55 60
Ala Leu Ser Pro Thr Ser Met Gly Pro Gin Pro Thr Thr Leu Gly 65 70 75
Gly Pro Ser Pro Pro Thr Asn Phe Leu Asp Gly Ile Val Asp Phe 80 85 90
Phe Arg Gin Tyr Val Met Leu Ile Ala Val Val Gly Ser Leu Ala 95 100 105
Phe Leu Leu Met Phe Ile Val Cys Ala Ala Val Ile Thr Arg Gin 110 115 120
Lys Gin Lys Ala Ser Ala Tyr Tyr Pro Ser Ser Phe Pro Lys Lys 125 130 135
Lys Tyr Val Asp Gin Ser Asp Arg Ala Gly Gly Pro Arg Ala Phe 140 145 150
Ser Glu Val Pro Asp Arg Ala Pro Asp Ser Arg Pro Glu Glu Ala 155 160 165
Leu Asp Ser Ser Arg Gin Leu Gin Ala Asp Ile Leu Ala Ala Thr 170 175 180
Gin Asn Leu Lys Ser Pro Thr Arg Ala Ala Leu Gly Gly Gly Asp 185 190 195
Gly Ala Arg Met Val Glu Gly Arg Gly Ala Glu Glu Glu Glu Lys 200 205 210
Gly Ser Gin Glu Gly Asp Gin Glu Val Gin Gly His Gly Val Pro 215 220 225
Val Glu Thr Pro Glu Ala Gin Glu Glu Pro Cys Ser Gly Val Leu 230 235 240
Glu Gly Ala Val Val Ala Gly Glu Gly Gin Gly Glu Leu Glu Gly 245 250 255
Ser Leu Leu Leu Ala Gin Glu Ala Gin Gly Pro Val Gly Pro Pro 260 265 270
Glu Ser Pro Cys Ala Cys Ser Ser Val His Pro Ser Val 275 280
537
<210> 51 <211> 1734 <212> DNA <213> Homo sapiens
<400> 51
gtggactctg agaagcccag gcagttgagg acaggagaga gaaggctgca
50
gacccagagg gagggaggac agggagtcgg aaggaggagg acagaggagg
100
gcacagagac gcagagcaag ggcggcaagg aggagaccct ggtgggagga
150
agacactctg gagagagagg gggctgggca gagatgaagt tccaggggcc
200
cctggcctgc ctcctgctgg ccctctgcct gggcagtggg gaggctggcc
250
ccctgcagag cggagaggaa agcactggga caaatattgg ggaggccctt
300
ggacatggcc tgggagacgc cctgagcgaa ggggtgggaa aggccattgg
350
caaagaggcc ggaggggcag ctggctctaa agtcagtgag gcccttggcc
400
aagggaccag agaagcagtt ggcactggag tcaggcaggt tccaggcttt
450
ggcgcagcag atgctttggg caacagggtc ggggaagcag cccatgctct
500
gggaaacact gggcacgaga ttggcagaca ggcagaagat gtcattcgac
550
acggagcaga tgctgtccgc ggctcctggc agggggtgcc tggccacagt
600
ggtgcttggg aaacttctgg aggccatggc atctttggct ctcaaggtgg
650
ccttggaggc cagggccagg gcaatcctgg aggtctgggg actccgtggg
700
tccacggata ccccggaaac tcagcaggca gctttggaat gaatcctcag
750
ggagctccct ggggtcaagg aggcaatgga gggccaccaa actttgggac
800
caacactcag ggagctgtgg cccagcctgg ctatggttca gtgagagcca
850
gcaaccagaa tgaagggtgc acgaatcccc caccatctgg ctcaggtgga
900
ggctccagca actctggggg aggcagcggc tcacagtcgg gcagcagtgg
950
cagtggcagc gcagtggcag aatggtgaca cagcagtggc acaacaatgg agcagcagtg cagcagcagt gcggcagcag ggtggcagca tggcggcagc
1000 1050
agtggtggca gcagtggcaa cagtggtggc agcagaggtg acagcggcag
1100
tgagtcctcc tggggatcca gcaccggctc ctcctccggc aaccacggtg
1150
ggagcggcgg aggaaatgga cataaacccg ggtgtgaaaa gccagggaat
1200
gaagcccgcg ggagcgggga atctgggatt cagggcttca gaggacaggg
1250
agtttccagc aacatgaggg aaataagcaa agagggcaat cgcctccttg
1300
gaggctctgg agacaattat cgggggcaag ggtcgagctg gggcagtgga
1350
ggaggtgacg ctgttggtgg agtcaatact gtgaactctg agacgtctcc
1400
tgggatgttt aactttgaca ctttctggaa gaattttaaa tccaagctgg
1450
538
gtttcatcaa ctgggatgcc ataaacaagg accagagaag ctctcgcatc 1500
ccgtgacctc cagacaagga gccaccagat tggatgggag cccccacact 1550
ccctccttaa aacaccaccc tctcatcact aatctcagcc cttgcccttg 1600
aaataaacct tagctgcccc acaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1650
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1700
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1734
<210> 52 <211> 440 <212> PRT
<213> Homo sapiens <400> 52
Met Lys Phe Gin Gly Pro Leu Ala Cys Leu Leu Leu Ala Leu Cys 1 5 10 15
Leu Gly Ser Gly Glu Ala Gly Pro Leu Gin Ser Gly Glu Glu Ser 20 25 30
Thr Gly Thr Asn Ile Gly Glu Ala Leu Gly His Gly Leu Gly Asp 35 40 45
Ala Leu Ser Glu Gly Val Gly Lys Ala Ile Gly Lys Glu Ala Gly 50 55 60
Gly Ala Ala Gly Ser Lys Val Ser Glu Ala Leu Gly Gin Gly Thr 65 70 75
Arg Glu Ala Val Gly Thr Gly Val Arg Gin Val Pro Gly Phe Gly 80 85 90
Ala Ala Asp Ala Leu Gly Asn Arg Val Gly Glu Ala Ala His Ala 95 100 105
Leu Gly Asn Thr Gly His Glu Ile Gly Arg Gin Ala Glu Asp Val 110 115 120
Ile Arg His Gly Ala Asp Ala Val Arg Gly Ser Trp Gin Gly Val 125 130 135
Pro Gly His Ser Gly Ala Trp Glu Thr Ser Gly Gly His Gly Ile 140 145 150
Phe Gly Ser Gin Gly Gly Leu Gly Gly Gin Gly Gin Gly Asn Pro 155 160 165
Gly Gly Leu Gly Thr Pro Trp Val His Gly Tyr Pro Gly Asn Ser 170 175 180
Ala Gly Ser Phe Gly Met Asn Pro Gin Gly Ala Pro Trp Gly Gin 185 190 195
Gly Gly Asn Gly Gly Pro Pro Asn Phe Gly Thr Asn Thr Gin Gly 200 205 210
Ala Val Ala Gin Pro Gly Tyr Gly Ser Val Arg Ala Ser Asn Gin
539
215 220 225
Asn Glu Gly Cys Thr Asn Pro Pro Pro Ser Gly Ser Gly Gly Gly 230 235 240
Ser Ser Asn Ser Gly Gly Gly Ser Gly Ser Gin Ser Gly Ser Ser 245 250 255
Gly Ser Gly Ser Asn Gly Asp Asn Asn Asn Gly Ser Ser Ser Gly 260 265 270
Gly Ser Ser Ser Gly Ser Ser Ser Gly Ser Ser Ser Gly Gly Ser 275 280 285
Ser Gly Gly Ser Ser Gly Gly Ser Ser Gly Asn Ser Gly Gly Ser 290 295 300
Arg Gly Asp Ser Gly Ser Glu Ser Ser Trp Gly Ser Ser Thr Gly 305 310 315
Ser Ser Ser Gly Asn His Gly Gly Ser Gly Gly Gly Asn Gly His 320 325 330
Lys Pro Gly Cys Glu Lys Pro Gly Asn Glu Ala Arg Gly Ser Gly 335 340 345
Glu Ser Gly Ile Gin Gly Phe Arg Gly Gin Gly Val Ser Ser Asn 350 355 360
Met Arg Glu Ile Ser Lys Glu Gly Asn Arg Leu Leu Gly Gly Ser 365 370 375
Gly Asp Asn Tyr Arg Gly Gin Gly Ser Ser Trp Gly Ser Gly Gly 380 385 390
Gly Asp Ala Val Gly Gly Val Asn Thr Val Asn Ser Glu Thr Ser 395 400 405
Pro Gly Met Phe Asn Phe Asp Thr Phe Trp Lys Asn Phe Lys Ser 410 415 420
Lys Leu Gly Phe Ile Asn Trp Asp Ala Ile Asn Lys Asp Gin Arg 425 430 435
Ser Ser Arg Ile Pro 440
<210> 53 <211> 3580 <212> DNA
<213> Homo sapiens <400> 53
gaccggtccc tccggtcctg gatgtgcgga ctctgctgca gcgagggctg
50
caggcccgcc gggcggtgct caccgtgccc tggctggtgg agtttctctc
100
ctttgctgac catgttgttc ccttgctgga atattaccgg gacatcttca
150
ctctcctgct gcgcctgcac cggagcttgg tgttgtcgca ggagagtgag
200
gggaagatgt gtttcctgaa caagctgctg ctacttgctg tcctgggctg
250
540
gcttttccag attcccacag tccctgagga cttgttcttt ctggaagagg
300
gtccctcata tgcctttgag gtggacacag tagccccaga gcatggcttg
350
gacaatgcgc ctgtggtgga ccagcagctg ctctacacct gctgccccta
400
catcggagag ctccggaaac tgctcgcttc gtgggtgtca ggcagtagtg
450
gacggagtgg gggcttcatg aggaaaatca cccccaccac taccaccagc
500
ctgggagccc agccttccca gaccagccag gggctgcagg cacagctcgc
550
ccaggccttt ttccacaacc agccgccctc cttgcgccgg accgtagagt
600
tcgtggcaga aagaattgga tcaaactgtg tcaaacatat caaggctaca
650
ctggtggcag atctggtgcg ccaggcagag tcacttctcc aagagcagct
700
ggtgacacag ggagaggaag ggggagaccc agcccagctg ttggagatct
750
tgtgttccca gctgtgccct cacggggccc aggcattggc cctggggcgg
800
gagttctgtc aaaggaagag ccctggggct gtgcgggcgc tgcttccaga
850
ggagaccccg gcagccgttc tgagcagtgc agagaacatt gctgtggggc
900
ttgcaacaga gaaagcctgt gcttggctgt cagccaacat cacagcactg
950
atcaggaggg aggtgaaagc agcagtgagt cgcacacttc gagcccaggg
1000
tcctgaacct gctgcccggg gggagcggag gggctgctcc cgcgcctgac
1050
gtgctctcct tggccgtggg gccacgggac cctgacgagg gagtctcccc
1100
agagcatctg gaacagctcc taggccagct gggccagacg ctgcggtgcc
1150
gccagttcct gtgcccacct gctgagcagc atctggcaaa gtgctctgtg
1200
gagttagctt ccctcctcgt tgcagatcaa attcctatcc tagggccccc
1250
ggcacagtac aggctggaga gagggcaggc tcgaaggctt ctgcacatgc
1300
tgctttcctt gtggaaggaa gactttcagg ggccggttcc gctgcagctg
1350
ctgctgagcc caagaaatgt ggggcttctg gcagacacaa ggccaaggga
1400
gtgggacttg tgggacggat ctgctattct ggagatagag tgctacggga gcctgcctgg gctggtggag gcagcctcca aagggtctga ccaggcccag
1450 1500
tggccagggg actttgctga agaattagca acactgtcta atctgtttct
1550
agccgagccc cacctgccag aaccccagct aagagcctgt gagttggtgc
1600
agccaaaccg gggcactgtg ctggcccaga gctagggctg agaagtggcc
1650
ctgccttggg cattgcacca gaaccctgga cccccgcctc acgaggaggc
1700
ccaagtgccc aatgcagacc ctcactggtt ggggtgtagc tgggtctaca
1750
gtcagacttc ctgctctaag ggtgtcactg cctggcatcc caccacgcga
1800
541
atcctagagg aaggagagtt ggcctgattt gggattatgg cagaaaagtc
1850
cagagatgcc agtcctggag tagaagaggt ggtgtttgtt tatctcttgg
1900
atactaaatg aaatgaggtg tgtgggcttg tcaacacaga attcaagcct
1950
catttgctat cccagcatct cttaaaactt tgtagtcttg gaattcatga
2000
cagaggcaaa tgactcctgc ttaacttatg aagaaagtta aaacatgaat
2050
cttgggagtc tacattttct tatcaccagg agctggactg ccatctcctt
2100
ataaatgcct aacacaggcc gggtctggtg gctcatgcct gtaatcccag
2150
cactttgaga ggcctgaggt cggcggactg cctgaggtca ggaattcaag
2200
accagcctgg ccaacatggc aaaaccccat ctctactaaa aataaaaaaa
2250
ttattagctg ggcatggtgg tgtgtgcctg taatcccagc tactcaggag
2300
gatgaggcag gagacctgct tgaacctgga ggtggaggtt gcagtgagcc
2350
gaggtcgcac cactgcactc cagtctgggt aacagagcga gactttctag
2400
aaaaagccta acaaacagat aaggtaggac tcaaccaact gaaacctgac
2450
tttccccctg taccttcagc ccctgtgcag gtagtaacct cttgagacct
2500
ctccctgacc agggaccaag cacagggcat ttagagcttt ttagaataaa
2550
ctggttttct ttaaaaaaaa aaaaaaaaaa agggcggccg cccttttttt
2600
tttttttttt tttttttttt tttttttttt tttttttttt taaaaagggc
2650
ttttattaaa attctcccca cacgatggct cctgcaatct gccacagctc
2700
tggggcgtgt cctgtaggga aaggccctgt tttccctgag gcggggctgg
2750
gcttgtccat gggtccgcgg agctggccgt gcttggcgcc ctggcgtgtg
2800
tctagctgct tcttgccggg cacagagctg cggggtctgg gggcaccggg
2850
agctaagagc aggctctggt gcaggggtgg aggcctgtct cttaaccgac
2900
accctgaggt gctcctgaga tgctgggtcc accctgagtg gcacggggag
2950
cagctgtggc cggtgctcct tcytaggcca gtcctgggga aactaagctc
3000
gggcccttct gggaatggcc ttgcaaagac tgaggagcta cgaggatggg cgtgtgaaga gtgggtgtgg gggcgccggt gggactcatg ttgttggctg
3050 3100
cagcggcctg gagcgcctct ctcctgagcc tcagtttccc tttccgtcta
3150
atgaagaaca tgccgtctcg gtgtctcagg gctattagga cttgccctca
3200
ggaagtggcc ttggacgagc gtcatgttat tttcacaact gtcctgcgac
3250
gttggcctgg gcacgtcatg gaatggccca tgtccctctg ctgcgtggac
3300
gtcgcggtcg ggagtgcgca gccagaggcg gggccagacg tgcgcctggg
3350
ggtgagggga ggcgccccgg gagggcctca caggaagttg ggctcccgca
3400
542
ccaccaggca gggcgggctc ccgccgccgc cgccgccacc accgtccagg 3450
ggccggtaga caaagtggaa gtcgcgcttg ggctcgctgc gcagcaggta 3500
gcccttgatg cagtgcggca gcgcgtcgtc cgccagctgg aagcagcgcc 3550
cgtccaccag cacgaacagc cggtgcgcct 3580
<210> 54 <211> 280 <212> PRT
<213> Homo sapiens <400> 54
Met Cys Phe Leu Asn Lys Leu Leu Leu Leu Ala Val Leu Gly Trp 15 10 15
Leu Phe Gin Ile Pro Thr Val Pro Glu Asp Leu Phe Phe Leu Glu 20 25 30
Glu Gly Pro Ser Tyr Ala Phe Glu Val Asp Thr Val Ala Pro Glu 35 40 45
His Gly Leu Asp Asn Ala Pro Val Val Asp Gin Gin Leu Leu Tyr 50 55 60
Thr Cys Cys Pro Tyr Ile Gly Glu Leu Arg Lys Leu Leu Ala Ser 65 70 75
Trp Val Ser Gly Ser Ser Gly Arg Ser Gly Gly Phe Met Arg Lys 80 85 90
Ile Thr Pro Thr Thr Thr Thr Ser Leu Gly Ala Gin Pro Ser Gin 95 100 105
Thr Ser Gin Gly Leu Gin Ala Gin Leu Ala Gin Ala Phe Phe His 110 115 120
Asn Gin Pro Pro Ser Leu Arg Arg Thr Val Glu Phe Val Ala Glu 125 130 135
Arg Ile Gly Ser Asn Cys Val Lys His Ile Lys Ala Thr Leu Val 140 145 150
Ala Asp Leu Val Arg Gin Ala Glu Ser Leu Leu Gin Glu Gin Leu 155 160 165
Val Thr Gin Gly Glu Glu Gly Gly Asp Pro Ala Gin Leu Leu Glu 170 175 180
Ile Leu Cys Ser Gin Leu Cys Pro His Gly Ala Gin Ala Leu Ala 185 190 195
Leu Gly Arg Glu Phe Cys Gin Arg Lys Ser Pro Gly Ala Val Arg 200 205 210
Ala Leu Leu Pro Glu Glu Thr Pro Ala Ala Val Leu Ser Ser Ala 215 220 225
Glu Asn Ile Ala Val Gly Leu Ala Thr Glu Lys Ala Cys Ala Trp 230 235 240
543
Leu Ser Ala Asn Ile Thr Ala Leu Ile Arg Arg Glu Val Lys Ala 245 250 255
Ala Val Ser Arg Thr Leu Arg Ala Gin Gly Pro Glu Pro Ala Ala 260 265 270
Arg Gly Glu Arg Arg Gly Cys Ser Arg Ala 275 280
<210> 55 <211> 2401 <212> DNA
<213> Homo sapiens <400> 55
tcccttgaca ggtctggtgg ctggttcggg gtctactgaa ggctgtcttg
50
atcaggaaac tgaagactct ctgcttttgc cacagcagtt cctgcagctt
100
ccttgaggtg tgaacccaca tccctgcccc cagggccacc tgcaggacgc
150
cgacacctac ccctcagcag acgccggaga gaaatgagta gcaacaaaga
200
gcagcggtca gcagtgttcg tgatcctctt tgccctcatc accatcctca
250
tcctctacag ctccaacagt gccaatgagg tcttccatta cggctccctg
300
cggggccgta gccgccgacc tgtcaacctc aagaagtgga gcatcactga
350
cggctatgtc cccattctcg gcaacaagac actgccctct cggtgccacc
400
agtgtgtgat tgtcagcagc tccagccacc tgctgggcac caagctgggc
450
cctgagatcg agcgggctga gtgtacaatc cgcatgaatg atgcacccac
500
cactggctac tcagctgatg tgggcaacaa gaccacctac cgcgtcgtgg
550
cccattccag tgtgttccgc gtgctgagga ggccccagga gtttgtcaac
600
cggacccctg aaaccgtgtt catcttctgg gggcccccga gcaagatgca
650
gaagccccag ggcagcctcg tgcgtgtgat ccagcgagcg ggcctggtgt
700
tccccaacat ggaagcatat gccgtctctc ccggccgcat gcggcaattt
750
gacgacctct tccggggtga gacgggcaag gacagggaga agtctcattc
800
gtggttgagc acaggctggt ttaccatggt gatcgcggtg gagttgtgtg
850
accacgtgca tgtctatggc atggtccccc ccaactactg cagccagcgg
900
ccccgcctcc agcgcatgcc ctaccactac tacgagccca aggggccgga
950
cgaatgtgtc acctacatcc agaatgagca cagtcgcaag ggcaaccacc
1000
accgcttcat caccgagaaa agggtcttct catcgtgggc ccagctgtat
1050
ggcatcacct tctcccaccc ctcctggacc taggccaccc agcctgtggg
1100
acctcaggag ggtcagagga gaagcagcct ccgcccagcc gctaggccag
1150
ggaccatctt ctggccaatc aaggcttgct ggagtgtctc ccagccaatc
1200
544
agggccttga ggaggatgta tcctccagcc aatcagggcc tggggaatct
1250
gttggcgaat cagggatttg ggagtctatg tggttaatca ggggtgtctt
1300
tcttgtgcag tcagggtctg cgcacagtca atcagggtag agggggtatt
1350
tctgagtcaa tctgaggcta aggacatgtc ctttcccatg aggccttggt
1400
tcagagcccc aggaatggac cccccaatca ctccccactc tgctgggata
1450
atggggtcct gtcccaagga gctgggaact tggtgttgcc ccctcaattt
1500
ccagcaccag aaagagagat tgtgtggggg tagaagctgt ctggaggccc
1550
ggccagagaa tttgtggggt tgtggaggtt gtgggggcgg tggggaggtc
1600
ccagaggtgg gaggctggca tccaggtctt ggctctgccc tgagaccttg
1650
gacaaaccct tccccctctc tgggcaccct tctgcccaca ccagtttcca
1700
gtgcggagtc tgagaccctt tccacctccc ctacaagtgc cctcgggtct
1750
gtcctccccg tctggaccct cccagccact atcccttgct ggaaggctca
1800
gctctttggg gggtctgggg tgacctcccc acctcctgga aaactttagg
1850
gtatttttgc gcaaactcct tcagggttgg gggactctga aggaaacggg
1900
acaaaacctt aagctgtttt cttagcccct cagccagctg ccattagctt
1950
ggctcttaaa gggccaggcc tccttttctg ccctctagca gggaggtttt
2000
ccaactgttg gaggcgcctt tggggctgcc cctttgtctg gagtcactgg
2050
gggcttccga gggtctccct cgaccctctg tcgtcctggg atggctgtcg
2100
ggagctgtat cacctgggtt ctgtcccctg gctctgtatc aggcacttta
2150
ttaaagctgg gcctcagtgg ggtgtgtttg tctcctgctc ttctggagcc
2200
tggaaggaaa gggcttcagg aggaggctgt gaggctggag ggaccagatg
2250
gaggaggcca gcagctagcc attgcacact ggggtgatgg gtgggggcgg
2300
tgactgcccc agacttggtt ttgtaatgat ttgtacagga ataaacacac
2350
ctacgctccg gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
2400
a 2401
<210> 56 <211> 299 <212> PRT
<213> Homo sapiens <400> 56
Met Ser Ser Asn Lys Glu Gin Arg Ser Ala Val Phe Val Ile Leu 15 10 15
Phe Ala Leu Ile Thr Ile Leu Ile Leu Tyr Ser Ser Asn Ser Ala 20 25 30
545
Asn Glu Val Phe His Tyr Gly Ser Leu Arg Gly Arg Ser Arg Arg 35 40 45
Pro Val Asn Leu Lys Lys Trp Ser Ile Thr Asp Gly Tyr Val Pro 50 55 60
Ile Leu Gly Asn Lys Thr Leu Pro Ser Arg Cys His Gin Cys Val 65 70 75
Ile Val Ser Ser Ser Ser His Leu Leu Gly Thr Lys Leu Gly Pro 80 85 90
Glu Ile Glu Arg Ala Glu Cys Thr Ile Arg Met Asn Asp Ala Pro 95 100 105
Thr Thr Gly Tyr Ser Ala Asp Val Gly Asn Lys Thr Thr Tyr Arg 110 115 120
Val Val Ala His Ser Ser Val Phe Arg Val Leu Arg Arg Pro Gin 125 130 135
Glu Phe Val Asn Arg Thr Pro Glu Thr Val Phe Ile Phe Trp Gly 140 145 150
Pro Pro Ser Lys Met Gin Lys Pro Gin Gly Ser Leu Val Arg Val 155 160 165
Ile Gin Arg Ala Gly Leu Val Phe Pro Asn Met Glu Ala Tyr Ala 170 175 180
Val Ser Pro Gly Arg Met Arg Gin Phe Asp Asp Leu Phe Arg Gly 185 190 195
Glu Thr Gly Lys Asp Arg Glu Lys Ser His Ser Trp Leu Ser Thr 200 205 210
Gly Trp Phe Thr Met Val Ile Ala Val Glu Leu Cys Asp His Val 215 220 225
His Val Tyr Gly Met Val Pro Pro Asn Tyr Cys Ser Gin Arg Pro 230 235 240
Arg Leu Gin Arg Met Pro Tyr His Tyr Tyr Glu Pro Lys Gly Pro 245 250 255
Asp Glu Cys Val Thr Tyr Ile Gin Asn Glu His Ser Arg Lys Gly 260 265 270
Asn His His Arg Phe Ile Thr Glu Lys Arg Val Phe Ser Ser Trp 275 280 285
Ala Gin Leu Tyr Gly Ile Thr Phe Ser His Pro Ser Trp Thr 290 295
<210> 57
<211> 4277
<212> DNA
<213> Homo sapiens
<400> 57
gtttctcata gttggcgtct tctaaaggaa aaacactaaa atgaggaact 50
546
cagcggaccg ggagcgacgc agcttgaggg aagcatccct agctgttggc
100
gcagaggggc gaggctgaag ccgagtggcc cgaggtgtct gaggggctgg
150
ggcaaaggtg aaagagtttc agaacaagct tcctggaacc catgacccat
200
gaagtcttgt cgacatttat accgtctgag ggtagcagct cgaaactaga
250
agaagtggag tgttgccagg gacggcagta tctctttgtg tgaccctggc
300
ggcctatggg acgttggctt cagacctttg tgatacacca tgctgcgtgg
350
gacgatgacg gcgtggagag gaatgaggcc tgaggtcaca ctggcttgcc
400
tcctcctagc cacagcaggc tgctttgctg acttgaacga ggtccctcag
450
gtcaccgtcc agcctgcgtc caccgtccag aagcccggag gcactgtgat
500
cttgggctgc gtggtggaac ctccaaggat gaatgtaacc tggcgcctga
550
atggaaagga gctgaatggc tcggatgatg ctctgggtgt cctcatcacc
600
cacgggaccc tcgtcatcac tgcccttaac aaccacactg tgggacggta
650
ccagtgtgtg gcccggatgc ctgcgggggc tgtggccagc gtgccagcca
700
ctgtgacact agccaatctc caggacttca agttagatgt gcagcacgtg
750
attgaagtgg atgagggaaa cacagcagtc attgcctgcc acctgcctga
800
gagccacccc aaagcccagg tccggtacag cgtcaaacaa gagtggctgg
850
aggcctccag aggtaactac ctgatcatgc cctcagggaa cctccagatt
900
gtgaatgcca gccaggagga cgagggcatg tacaagtgtg cagcctacaa
950
cccagtgacc caggaagtga aaacctccgg ctccagcgac aggctacgtg
1000
tgcgccgctc caccgctgag gctgcccgca tcatctaccc cccagaggcc
1050
caaaccatca tcgtcaccaa aggccagagt ctcattctgg agtgtgtggc
1100
cagtggaatc ccacccccac gggtcacctg ggccaaggat gggtccagtg
1150
tcaccggcta caacaagacg cgcttcctgc tgagcaacct cctcatcgac
1200
accaccagcg aggaggactc aggcacctac cgctgcatgg ccgacaatgg
1250
ggttgggcag cccggggcag cggtcatcct ctacaatgtc caggtgtttg
1300
aaccccctga ggtcaccatg gagctatccc agctggtcat cccctggggc
1350
cagagtgcca gctgtggctg agcttacctg aggaatgctg tgaggtgcgt tgcccctcat gggaaccccc ctccagccag cgccctccgt cgcctccggc
1400 1450
tctcccgcag ggccctgcgc gtgctcagca tggggcctga ggacgaaggc
1500
gtctaccagt gcatggccga gaacgaggtt gggagcgccc atgccgtagt
1550
ccagctgcgg acctccaggc caagcataac cccaaggcta tggcaggatg
1600
547
ctgagctggc tactggcaca cctcctgtat caccctccaa actcggcaac
1650
cctgagcaga tgctgagggg gcaaccggcg ctccccagac ccccaacgtc
1700
agtggggcct gcttccccga agtgtccagg agagaagggg cagggggctc
1750
ccgccgaggc tcccatcatc ctcagctcgc cccgcacctc caagacagac
1800
tcatatgaac tggtgtggcg gcctcggcat gagggcagtg gccgggcgcc
1850
aatcctctac tatgtggtga aacaccgcaa gcaggtcaca aattcctctg
1900
acgattggac catctctggc attccagcca accagcaccg cctgaccctc
1950
accagacttg accccgggag cttgtatgaa gtggagatgg cagcttacaa
2000
ctgtgcggga gagggccaga cagccatggt caccttccga actggacggc
2050
ggcccaaacc cgagatcatg gccagcaaag agcagcagat ccagagagac
2100
gaccctggag ccagtcccca gagcagcagc cagccagacc acggccgcct
2150
ctccccccca gaagctcccg acaggcccac catctccacg gcctccgaga
2200
cctcagtgta cgtgacctgg attccccgtg ggaatggtgg gttcccaatc
2250
cagtccttcc gtgtggagta caagaagcta aagaaagtgg gagactggat
2300
tctggccacc agcgccatcc ccccatcgcg gctgtccgtg gagatcacgg
2350
gcctagagaa aggcacctcc tacaagtttc gagtccgggc tctgaacatg
2400
ctgggggaga gcgagcccag cgccccctct cggccctacg tggtgtcggg
2450
ctacagcggt cgcgtgtacg agaggcccgt ggcaggtcct tatatcacct
2500
tcacggatgc ggtcaatgag accaccatca tgctcaagtg gatgtacatc
2550
ccagcaagta acaacaacac cccaatccat ggcttttata tctattatcg
2600
acccacagac agtgacaatg atagtgacta caagaaggat atggtggaag
2650
gggacaagta ctggcactcc atcagccacc tgcagccaga gacctcctac
2700
gacattaaga tgcagtgctt caatgaagga ggggagagcg agttcagcaa
2750
cgtgatgatc tgtgagacca aagctcggaa gtcttctggc cagcctggtc
2800
gactgccacc cccaactctg gccccaccac agccgcccct tcctgaaacc
2850
atagagcggc cggtgggcac tggggccatg gtggctcgct ccagcgacct
2900
gccctatctg attgtcgggg tcgtcctggg ctccatcgtt ctcatcatcg
2950
tcaccttcat acaacagacc ccccttctgc tgggttttcc ttgtggaggg tcgaagtgcc cctggtctaa cttccaccct gcaaaaacat cctgcccgta
3000 3050
tactatggtg ccattgggag gactcccagg ccaccaggcc agtggacagc
3100
cctacctcag tggcatcagt ggacgggcct gtgctaatgg gatccacatg
3150
aataggggct gcccctcggc tgcagtgggc tacccgggca tgaagcccca
3200
548
gcagcactgc ccaggcgagc ttcagcagca gagtgacacc agcagcctgc
3250
tgaggcagac ccatcttggc aatggatatg acccccaaag tcaccagatc
3300
acgaggggtc ccaagtctag cccggacgag ggctctttct tatacacact
3350
gcccgacgac tccactcacc agctgctgca gccccatcac gactgctgcc
3400
aacgccagga gcagcctgct gctgtgggcc agtcaggggt gaggagagcc
3450
cccgacagtc ctgtcctgga agcagtgtgg gaccctccat ttcactcagg
3500
gcccccatgc tgcttgggcc ttgtgccagt tgaagaggtg gacagtcctg
3550
actcctgcca agtgagtgga ggagactggt gtccccagca ccccgtaggg
3600
gcctacgtag gacaggaacc tggaatgcag ctctccccgg ggccactggt
3650
gcgtgtgtct tttgaaacac cacctctcac aatttaggca gaagctgata
3700
tcccagaaag actatatatt gttttttttt taaaaaaaaa agaagaaaaa
3750
agagacagag aaaattggta tttatttttc tattatagcc atatttatat
3800
atttatgcac ttgtaaataa atgtatatgt tttataattc tggagagaca
3850
taaggagtcc tacccgttga ggttggagag ggaaaataaa gaagctgcca
3900
cctaacagga gtcacccagg aaagcaccgc acaggctggc gcgggacaga
3950
ctcctaacct ggggcctctg cagtggcagg cgaggctgca ggaggcccac
4000
agataagctg gcaagaggaa ggatcccagg cacatggttc atcacgagca
4050
tgagggaaca gcaaggggca cggtatcaca gcctggagac acccacacag
4100
atggctggat ccggtgctac gggaaacatt ttcctaagat gcccatgaga
4150
acagaccaag atgtgtacag cactatgagc attaaaaaac cttccagaat
4200
caataatccg tggcaacata tctctgtaaa aacaaacact gtaacttcta
4250
aataaatgtt tagtcttccc tgtaaaa 4277
<210> 58 <211> 1115 <212> PRT
<213> Homo sapiens <400> 58
Met Leu Arg Gly Thr Met Thr Ala Trp Arg Gly Met Arg Pro Glu 15 10 15
Val Thr Leu Ala Cys Leu Leu Leu Ala Thr Ala Gly Cys Phe Ala
25 30
Asp Leu Asn Glu Val Pro Gin Val Thr Val Gin Pro Ala Ser Thr
40 45
Val Gin Lys Pro Gly Gly Thr Val Ile Leu Gly Cys Val Val Glu 50 55 60
549
Pro Pro Arg Met Asn Val Thr Trp Arg Leu Asn Gly Lys Glu Leu 65 70 75
Asn Gly Ser Asp Asp Ala Leu Gly Val Leu Ile Thr His Gly Thr 80 85 90
Leu Val Ile Thr Ala Leu Asn Asn His Thr Val Gly Arg Tyr Gin 95 100 105
Cys Val Ala Arg Met Pro Ala Gly Ala Val Ala Ser Val Pro Ala 110 115 120
Thr Val Thr Leu Ala Asn Leu Gin Asp Phe Lys Leu Asp Val Gin 125 130 135
His Val Ile Glu Val Asp Glu Gly Asn Thr Ala Val Ile Ala Cys 140 145 150
His Leu Pro Glu Ser His Pro Lys Ala Gin Val Arg Tyr Ser Val 155 160 165
Lys Gin Glu Trp Leu Glu Ala Ser Arg Gly Asn Tyr Leu Ile Met 170 175 180
Pro Ser Gly Asn Leu Gin Ile Val Asn Ala Ser Gin Glu Asp Glu 185 190 195
Gly Met Tyr Lys Cys Ala Ala Tyr Asn Pro Val Thr Gin Glu Val 200 205 210
Lys Thr Ser Gly Ser Ser Asp Arg Leu Arg Val Arg Arg Ser Thr 215 220 225
Ala Glu Ala Ala Arg Ile Ile Tyr Pro Pro Glu Ala Gin Thr Ile 230 235 240
Ile Val Thr Lys Gly Gin Ser Leu Ile Leu Glu Cys Val Ala Ser 245 250 255
Gly Ile Pro Pro Pro Arg Val Thr Trp Ala Lys Asp Gly Ser Ser 260 265 270
Val Thr Gly Tyr Asn Lys Thr Arg Phe Leu Leu Ser Asn Leu Leu 275 280 285
Ile Asp Thr Thr Ser Glu Glu Asp Ser Gly Thr Tyr Arg Cys Met 290 295 300
Ala Asp Asn Gly Val Gly Gin Pro Gly Ala Ala Val Ile Leu Tyr 305 310 315
Asn Val Gin Val Phe Glu Pro Pro Glu Val Thr Met Glu Leu Ser 320 325 330
Gin Leu Val Ile Pro Trp Gly Gin Ser Ala Lys Leu Thr Cys Glu 335 340 345
Val Arg Gly Asn Pro Pro Pro Ser Val Leu Trp Leu Arg Asn Ala 350 355 360
Val Pro Leu Ile Ser Ser Gin Arg Leu Arg Leu Ser Arg Arg Ala 365 370 375
550
Leu Arg Val Leu Ser Met Gly Pro Glu Asp Glu Gly Val Tyr Gin
380 385 390
Cys Met Ala Glu Asn Glu Val Gly Ser Ala His Ala Val Val Gin
395 400 405
Leu Arg Thr Ser Arg Pro Ser Ile Thr Pro Arg Leu Trp Gin Asp
410 415 420
Ala Glu Leu Ala Thr Gly Thr Pro Pro Val Ser Pro Ser Lys Leu
425 430 435
Gly Asn Pro Glu Gin Met Leu Arg Gly Gin Pro Ala Leu Pro Arg
440 445 450
Pro Pro Thr Ser Val Gly Pro Ala Ser Pro Lys Cys Pro Gly Glu
455 460 465
Lys Gly Gin Gly Ala Pro Ala Glu Ala Pro Ile Ile Leu Ser Ser
470 475 480
Pro Arg Thr Ser Lys Thr Asp Ser Tyr Glu Leu Val Trp Arg Pro
485 490 495
Arg His Glu Gly Ser Gly Arg Ala Pro Ile Leu Tyr Tyr Val Val
500 505 510
Lys His Arg Lys Gin Val Thr Asn Ser Ser Asp Asp Trp Thr Ile
515 520 525
Ser Gly Ile Pro Ala Asn Gin His Arg Leu Thr Leu Thr Arg Leu
530 535 540
Asp Pro Gly Ser Leu Tyr Glu Val Glu Met Ala Ala Tyr Asn Cys
545 550 555
Ala Gly Glu Gly Gin Thr Ala Met Val Thr Phe Arg Thr Gly Arg
560 565 570
Arg Pro Lys Pro Glu Ile Met Ala Ser Lys Glu Gin Gin Ile Gin
575 580 585
Arg Asp Asp Pro Gly Ala Ser Pro Gin Ser Ser Ser Gin Pro Asp
590 595 600
His Gly Arg Leu Ser Pro Pro Glu Ala Pro Asp Arg Pro Thr Ile
605 610 . 615
Ser Thr Ala Ser Glu Thr Ser Val Tyr Val Thr Trp Ile Pro Arg
620 625 630
Gly Asn Gly Gly Phe Pro Ile Gin Ser Phe Arg Val Glu Tyr Lys
635 640 645
Lys Leu Lys Lys Val Gly Asp Trp Ile Leu Ala Thr Ser Ala Ile
650 655 660
Pro Pro Ser Arg Leu Ser Val Glu Ile Thr Gly Leu Glu Lys Gly 665 670 675
Thr Ser Tyr Lys Phe Arg Val Arg Ala Leu
Asn Met Leu Gly Glu
551
680 685 690
Ser Glu Pro Ser Ala Pro Ser Arg Pro Tyr Val Val Ser Gly Tyr
695 700 705
Ser Gly Arg Val Tyr Glu Arg Pro Val Ala Gly Pro Tyr Ile Thr
710 715 720
Phe Thr Asp Ala Val Asn Glu Thr Thr Ile Met Leu Lys Trp Met
725 730 735
Tyr Ile Pro Ala Ser Asn Asn Asn Thr Pro Ile His Gly Phe Tyr
740 745 750
Ile Tyr Tyr Arg Pro Thr Asp Ser Asp Asn Asp Ser Asp Tyr Lys
755 760 765
Lys Asp Met Val Glu Gly Asp Lys Tyr Trp His Ser Ile Ser His
770 775 780
Leu Gin Pro Glu Thr Ser Tyr Asp Ile Lys Met Gin Cys Phe Asn
785 790 795
Glu Gly Gly Glu Ser Glu Phe Ser Asn Val Met Ile Cys Glu Thr
800 805 810
Lys Ala Arg Lys Ser Ser Gly Gin Pro Gly Arg Leu Pro Pro Pro
815 820 825
Thr Leu Ala Pro Pro Gin Pro Pro Leu Pro Glu Thr Ile Glu Arg
830 835 840
Pro Val Gly Thr Gly Ala Met Val Ala Arg Ser Ser Asp Leu Pro
845 850 855
Tyr Leu lie Val Gly Val Val Leu Gly Ser Ile Val Leu Ile Ile
860 865 870
Val Thr Phe Ile Pro Phe Cys Leu Trp Arg Ala Trp Ser Lys Gin
875 880 885
Lys His Thr Thr Asp Leu Gly Phe Pro Arg Ser Ala Leu Pro Pro
890 895 900
Ser Cys Pro Tyr Thr Met Val Pro Leu Gly Gly Leu Pro Gly His
905 910 915
Gin Ala Ser Gly Gin Pro Tyr Leu Ser Gly lie Ser Gly Arg Ala
920 925 930
Cys Ala Asn Gly Ile His Met Asn Arg Gly Cys Pro Ser Ala Ala
935 940 945
Val Gly Tyr Pro Gly Met Lys Pro Gin Gin His Cys Pro Gly Glu
950 955 960
Leu Gin Gin Gin Ser Asp Thr Ser Ser Leu Leu Arg Gin Thr His
965 970 975
Leu Gly Asn Gly Tyr Asp Pro Gin Ser His Gin Ile Thr Arg Gly
980 985 990
552
Pro Lys Ser Ser Pro Asp Glu Gly Ser Phe Leu Tyr Thr Leu Pro 995 1000 1005
Asp Asp Ser
Gin Arg Gin
Arg Ala Pro
Phe His Ser
Glu Val Asp
Cys Pro Gin
Met Gin Leu
Pro Pro Leu
<210> 59 <211> 25 <212> DNA <213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 59 gggaaacaca gcagtcattg cctgc 25
<210> 60 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 60 gcacacgtag cctgtcgctg gage 24
<210> 61 <211> 42 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 61
caccccaaag cccaggtccg gtacagcgtc aaacaagagt gg 42
Thr His Gin Leu Leu Gin Pro His His Asp Cys Cys 1010 1015 1020
Glu Gin Pro Ala Ala Val Gly Gin Ser Gly Val Arg 1025 1030 1035
Asp Ser Pro Val Leu Glu Ala Val Trp Asp Pro Pro 1040 1045 1050
Gly Pro Pro Cys Cys Leu Gly Leu Val Pro Val Glu 1055 1060 1065
Ser Pro Asp Ser Cys Gin Val Ser Gly Gly Asp Trp 1070 1075 1080
His Pro Val Gly Ala Tyr Val Gly Gin Glu Pro Gly 1085 1090 1095
Ser Pro Gly Pro Leu Val Arg Val Ser Phe Glu Thr 1100 1105 1110
Thr Ile 1115
<210> 62 <211> 1661
553
<212> DNA
<213> Homo sapiens
<220>
<221> unsure <222> 678
<223> unknown base <400> 62
cgggaggctg ggtcgtcatg atccggaccc cattgtcggc ctctgcccat
50
cgcctgctcc tcccaggctc ccgcggccga cccccgcgca acatgcagcc
100
cacgggccgc gagggttccc gcgcgctcag ccggcggtat ctgcggcgtc
150
tgctgctcct gctactgctg ctgctgctgc ggcagcccgt aacccgcgcg
200
gagaccacgc cgggcgcccc cagagccctc tccacgctgg gctcccccag
250
cctcttcacc acgccgggtg tccccagcgc cctcactacc ccaggcctca
300
ctacgccagg cacccccaaa accctggacc ttcggggtcg cgcgcaggcc
350
ctgatgcgga gtttcccact cgtggacggc cacaatgacc tgccccaggt
400
cctgagacag cgttacaaga atgtgcttca ggatgttaac ctgcgaaatt
450
tcagccatgg tcagaccagc ctggacaggc ttagagacgg cctcgtgggt
500
gcccagttct ggtcagcctc cgtctcatgc cagtcccagg accagactgc
550
cgtgcgcctc gccctggagc agattgacct cattcaccgc atgtgtgcct
600
cctactctga actcgagctt gtgacctcag ctgaaggtct gaacagctct
650
caaaagctgg cctgcctcat tggcgtgnag ggtggtcact cactggacag
700
cagcctctct gtgctgcgca gtttctatgt gctgggggtg cgctacctga
750
cacttacctt cacctgcagt acaccatggg cagagagttc caccaagttc
800
agacaccaca tgtacaccaa cgtcagcgga ttgacaagct ttggtgagaa
850
agtagtagag gagttgaacc gcctgggcat gatgatagat ttgtcctatg
900
catcggacac cttgataaga agggtcctgg aagtgtctca ggctcctgtg
950
atcttctccc actcagctgc cagagctgtg tgtgacaatt tgttgaatgt
1000
tcccgatgat atcctgcagc ttctgaagaa cggtggcatc gtgatggtga
1050
cactgtccat gggggtgctg cagtgcaacc tgcttgctaa cgtgtccact
1100
gtggcagatc actttgacca catcagggca gtcattggat ctgagttcat
1150
cgggattggt ggaaattatg acgggactgg ccggttccct caggggctgg
1200
aggatgtgtc cacataccca gtcctgatag aggagttgct gagtcgtasc
1250
tggagcgagg aagagcttca aggtgtcctt cgtggaaacc tgctgcgggt
1300
cttcagacaa gtggaaaagg tgagagagga gagcagggcg cagagccccg
1350
554
tggaggctga gtttccatat gggcaactga gcacatcctg ccactcccac 1400
ctcgtgcctc agaatggaca ccaggctact catctggagg tgaccaagca 1450
gccaaccaat cgggtcccct ggaggtcctc aaatgcctcc ccataccttg 1500
ttccaggcct tgtggctgct gccaccatcc caaccttcac ccagtggctc 1550
tgctgacaca gtcggtcccc gcagaggtca ctgtggcaaa gcctcacaaa 1600
gccccctctc ctagttcatt cacaagcata tgctgagaat aaacatgtta 1650
cacatggaaa a 1661
<210> 63 <211> 487 <212> PRT
<213> Homo sapiens <220>
<221> unsure
<222> 196, 386
<223> unknown amino acid
<400> 63
Met Gin Pro Thr Gly Arg Glu Gly Ser Arg Ala Leu Ser Arg Arg 15 10 15
Tyr Leu Arg Arg Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Arg 20 25 30
Gin Pro Val Thr Arg Ala Glu Thr Thr Pro Gly Ala Pro Arg Ala 35 40 45
Leu Ser Thr Leu Gly Ser Pro Ser Leu Phe Thr Thr Pro Gly Val 50 55 60
Pro Ser Ala Leu Thr Thr Pro Gly Leu Thr Thr Pro Gly Thr Pro 65 70 75
Lys Thr Leu Asp Leu Arg Gly Arg Ala Gin Ala Leu Met Arg Ser 80 85 90
Phe Pro Leu Val Asp Gly His Asn Asp Leu Pro Gin Val Leu Arg 95 100 105
Gin Arg Tyr Lys Asn Val Leu Gin Asp Val Asn Leu Arg Asn Phe 110 115 120
Ser His Gly Gin Thr Ser Leu Asp Arg Leu Arg Asp Gly Leu Val 125 130 135
Gly Ala Gin Phe Trp Ser Ala Ser Val Ser Cys Gin Ser Gin Asp 140 145 150
Gin Thr Ala Val Arg Leu Ala Leu Glu Gin Ile Asp Leu Ile His 155 160 165
Arg Met Cys Ala Ser Tyr Ser Glu Leu Glu Leu Val Thr Ser Ala 170 175 180
555
Glu Gly Leu Asn Ser Ser Gin Lys Leu Ala Cys Leu Ile Gly Val 185 190 195
Xaa Gly Gly His Ser Leu Asp Ser Ser Leu Ser Val Leu Arg Ser 200 205 210
Phe Tyr Val Leu Gly Val Arg Tyr Leu Thr Leu Thr Phe Thr Cys 215 220 225
Ser Thr Pro Trp Ala Glu Ser Ser Thr Lys Phe Arg His His Met 230 235 240
Tyr Thr Asn Val Ser Gly Leu Thr Ser Phe Gly Glu Lys Val Val 245 250 255
Glu Glu Leu Asn Arg Leu Gly Met Met Ile Asp Leu Ser Tyr Ala 260 265 270
Ser Asp Thr Leu Ile Arg Arg Val Leu Glu Val Ser Gin Ala Pro 275 280 285
Val Ile Phe Ser His Ser Ala Ala Arg Ala Val Cys Asp Asn Leu 290 295 300
Leu Asn Val Pro Asp Asp Ile Leu Gin Leu Leu Lys Asn Gly Gly 305 310 315
Ile Val Met Val Thr Leu Ser Met Gly Val Leu Gin Cys Asn Leu 320 325 330
Leu Ala Asn Val Ser Thr Val Ala Asp His Phe Asp His Ile Arg 335 340 345
Ala Val Ile Gly Ser Glu Phe Ile Gly Ile Gly Gly Asn Tyr Asp 350 355 360
Gly Thr Gly Arg Phe Pro Gin Gly Leu Glu Asp Val Ser Thr Tyr 365 370 375
Pro Val Leu Ile Glu Glu Leu Leu Ser Arg Xaa Trp Ser Glu Glu 380 385 390
Glu Leu Gin Gly Val Leu Arg Gly Asn Leu Leu Arg Val Phe Arg 395 400 405
Gin Val Glu Lys Val Arg Glu Glu Ser Arg Ala Gin Ser Pro Val 410 415 420
Glu Ala Glu Phe Pro Tyr Gly Gin Leu Ser Thr Ser Cys His Ser 425 430 435
His Leu Val Pro Gin Asn Gly His Gin Ala Thr His Leu Glu Val 440 445 450
Thr Lys Gin Pro Thr Asn Arg Val Pro Trp Arg Ser Ser Asn Ala 455 460 465
Ser Pro Tyr Leu Val Pro Gly Leu Val Ala Ala Ala Thr Ile Pro 470 475 480
Thr Phe Thr Gin Trp Leu Cys 485
556
<210> 64 <211> 25 <212> DNA
<213> Artificial sequence <220>
<223> Synthetic oligonucleotide probe
<400> 64 ccttcacctg cagtacacca tgggc 25
<210> 65 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 65 gtcacacaca gctctggcag ctgag 25
<210> 66 <211> 47 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 66
ccaagttcag acaccacatg tacaccaacg tcagcggatt gacaagc 47
<210> 67
<211> 1564
<212> DNA
<213> Homo sapiens
<400> 67
tgctaggctc tgtcccacaa tgcacccgag agcaggagct gaaagcctct
50
aacacccaca gatccctcta tgactgcaat gtgaggtgtc cggctttgct
100
ggcccagcaa gcctgataag catgaagctc ttatctttgg tggctgtggt
150
cgggtgtttg ctggtgcccc cagctgaagc caacaagagt tctgaagata
200
tccggtgcaa atgcatctgt ccaccttata gaaacatcag tgggcacatt
250
tacaaccaga atgtatccca gaaggactgc aactgcctgc acgtggtgga
300
gcccatgcca gtgcctggcc atgacgtgga ggcctactgc ctgctgtgcg
350
agtgcaggta cgaggagcgc agcaccacca ccatcaaggt catcattgtc
400
atctacctgt ccgtggtggg tgccctgttg ctctacatgg ccttcctgat
450
gctggtggac cctctgatcc gaaagccgga tgcatacact gagcaactgc
500
acaatgagga ggagaatgag gatgctcgct ctatggcagc agctgctgca
550
557
tccctcgggg gaccccgagc aaacacagtc ctggagcgtg tggaaggtgc
600
ccagcagcgg tggaagctgc aggtgcagga gcagcggaag acagtcttcg
650
atcggcacaa gatgctcagc tagatgggct ggtgtggttg ggtcaaggcc
700
ccaacaccat ggctgccagc ttccaggctg gacaaagcag ggggctactt
750
ctcccttccc tcggttccag tcttcccttt aaaagcctgt ggcatttttc
800
ctccttctcc ctaactttag aaatgttgta cttggctatt ttgattaggg
850
aagagggatg tggtctctga tctctgttgt cttcttgggt ctttggggtt
900
gaagggaggg ggaaggcagg ccagaaggga atggagacat tcgaggcggc
950
ctcaggagtg gatgcgatct gtctctcctg gctccactct tgccgccttc
1000
cagctctgag tcttgggaat gttgttaccc ttggaagata aagctgggtc
1050
ttcaggaact cagtgtctgg gaggaaagca tggcccagca ttcagcatgt
1100
gttcctttct gcagtggttc ttatcaccac ctccctccca gccccggcgc
1150
ctcagcccca gccccagctc cagccctgag gacagctctg atgggagagc
1200
tgggccccct gagcccactg ggtcttcagg gtgcactgga agctggtgtt
1250
cgctgtcccc tgtgcacttc tcgcactggg gcatggagtg cccatgcata
1300
ctctgctgcc ggtcccctca cctgcacttg aggggtctgg gcagtccctc
1350
ctctccccag tgtccacagt cactgagcca gacggtcggt tggaacatga
1400
gactcgaggc tgagcgtgga tctgaacacc acagcccctg tacttgggtt
1450
gcctcttgtc cctgaacttc gttgtaccag tgcatggaga gaaaattttg
1500
tcctcttgtc ttagagttgt gtgtaaatca aggaagccat cattaaattg
1550
ttttatttct ctca 1564
<210> 68 <211> 183 <212> PRT
<213> Homo sapiens <400> 68
Met Lys Leu Leu Ser Leu Val Ala Val Val Gly Cys Leu Leu Val 15 10 15
Pro Pro Ala Glu Ala Asn Lys Ser Ser Glu Asp Ile Arg Cys Lys 20 25 30
Cys Ile Cys Pro Pro Tyr Arg Asn Ile Ser Gly His Ile Tyr Asn 35 40 45
Gin Asn Val Ser Gin Lys Asp Cys Asn Cys Leu His Val Val Glu 50 55 60
Pro Met Pro Val Pro Gly His Asp Val Glu Ala Tyr Cys Leu Leu 65 70 75
558
Cys Glu Cys Arg Tyr 80
Ile Ile Val Ile Tyr 95
Met Ala Phe Leu Met 110
Ala Tyr Thr Glu Gin 125
Arg Ser Met Ala Ala 140
Asn Thr Val Leu Glu 155
Leu Gin Val Gin Glu 170
Met Leu Ser
<210> 69
<211> 3170
<212> DNA
<213> Homo sapiens
<400> 69
agcgggtctc gcttgggttc cgctaatttc tgtcctgagg cgtgagactg
50
agttcatagg gtcctgggtc cccgaaccag gaagggttga gggaacacaa
100
tctgcaagcc cccgcgaccc aagtgagggg ccccgtgttg gggtcctccc
150
tccctttgca ttcccacccc tccgggcttt gcgtcttcct ggggaccccc
200
tcgccgggag atggccgcgt tgatgcggag caaggattcg tcctgctgcc
250
tgctcctact ggccgcggtg ctgatggtgg agagctcaca gatcggcagt
300
tcgcgggcca aactcaactc catcaagtcc tctctgggcg gggagacgcc
350
tggtcaggcc gccaatcgat ctgcgggcat gtaccaagga ctggcattcg
400
gcggcagtaa gaagggcaaa aacctggggc aggcctaccc ttgtagcagt
450
gataaggagt gtgaagttgg gaggtattgc cacagtcccc accaaggatc
500
atcggcctgc atggtgtgtc ggagaaaaaa gaagcgctgc caccgagatg
550
gcatgtgctg ccccagtacc cgctgcaata atggcatctg tatcccagtt
600
actgaaagca tcttaacccc tcacatcccg gctctggatg gtactcggca
650
cagagatcga aaccacggtc attactcaaa ccatgacttg ggatggcaga
700
atctaggaag accacacact aagatgtcac atataaaagg gcatgaagga
750
gacccctgcc tacgatcatc agactgcatt gaagggtttt gctgtgctcg
800
Glu Glu Arg Ser Thr Thr Thr Ile Lys Val 85 90
Leu Ser Val Val Gly Ala Leu Leu Leu Tyr 100 105
Leu Val Asp Pro Leu Ile Arg Lys Pro Asp 115 120
Leu His Asn Glu Glu Glu Asn Glu Asp Ala 130 135
Ala Ala Ala Ser Leu Gly Gly Pro Arg Ala 145 150
Arg Val Glu Gly Ala Gin Gin Arg Trp Lys 160 165
Gin Arg Lys Thr Val Phe Asp Arg His Lys 175 180
tcatttctgg gtaccaaaca tgcgactgtg ctcctccaaa ggaacatcat catggtggaa gtgataagaa aatagattag caacttgtct agagaacaag aagagtttag ttatacaaat aaaatactcc gctaaacaag taccttttga agataagaaa cagttgttta ccacaaatac actgatacaa atctctcttt aactataccc ttcacagatg gctaacagag gatagaatta agagacttaa aaaacttttt aggcacaagt tgaatatctg tcagtgtgag tgctgagatc tcctgaagta accaaaatct acgcaagaag cgaagggcct gccagactcc caattgcaga aataaggttc tatagatgat aatgggtgac atgtaaataa cacacagtgg gttgtgctgg aacctacatg tagaataact aaatgaaaac tttgtaacac aaaatcagtc ggaaggcctt ttttttttca gacaaaaaca atcctatgtg ataaattgtg gcaaaaaaat agatcattat gattggtaaa gctggatctg cgtttgttca tggctgttca catgggattt gtcctgtgtc ctcaaataat gacagagtag gcaaaccagt ggttctcatg gtcttgcaaa atgtgtgtca ctgtgaagtt agatgcagaa cacaaaaagg aaatgcagtg tgtacacatt aaattactga aggagaggtt ccagatttct tgttatacaa atggagcatt tacttctgct aatatttcca taggaagaca aaattttagt gttccttcag attcctgctc actagtaaaa ttaaagatgt ttcttaaaga tacatgtatt tactgcactg ggttttggca tctttgaaac gctatcataa cgtactatcc ctcaatttca tgaggtttca
559
gctccatcag ggctggaaat gtatggaaag gaaaatttga gtgtatttaa gaatggctaa gagaaagaaa cagccagtgt tgtggaaaat tgagtagcat tccttcagat attcaacgtt taggttctaa gttaatttac gttcaatcaa aataattgca aataaataac tttacctgta attctacgga tgaatgcatt tacttacaca ccaatatatg ttggccataa catacatact gagtaagcaa acacatagat caggggatgc tatttactat tcaaattatt ggaggtttca ttgccctcta ggggaagtct 850 tttccagcgt 900 atgccaccta 950 tcaccattga 1000 tgcattatag 1050 aataagaaac 1100 acatgaactg 1150 ttccattatg 12 00 gctattatta 1250 gtgactttcc 1300 tgctgattgc 1350 agagtttaac 1400 aaataaaatt 1450 aacagaaaat 1500 gagtcttggt 1550 aaataatggc 1600 aaacaaacag 1650 attaataaga 1700 atgacagtat 1750 atattttcca 1800 gagcagaatt 1850 tgggaaaaga 1900 cctatatttt 1950 ctgtggtaat 2 000 gaaaattggg 2050 catatgtctg 2100 acagtctaaa '2150 gcagatgaat 22 00 tattttatag 2250 caaaatgtac 23 00 taagcttctg 2350
560
actagccaat ggcatcatcc aattttcttc ccaaacctct gcagcatctg 2400
ctttattgcc aaagggctag tttcggtttt ctgcagccat tgcggttaaa 2450
aaatataagt aggataactt gtaaaacctg catattgcta atctatagac 2 500 accacagttt ctaaattctt tgaaaccact ttactacttt ttttaaactt 2550
aactcagttc taaatacttt gtctggagca caaaacaata aaaggttatc 2600
ttatagtcgt gactttaaac ttttgtagac cacaattcac tttttagttt 2 650
tcttttactt aaatcccatc tgcagtctca aatttaagtt ctcccagtag 2700
agattgagtt tgagcctgta tatctattaa aaatttcaac ttcccacata 2750
tatttactaa gatgattaag acttacattt tctgcacagg tctgcaaaaa 2800
caaaaattat aaactagtcc atccaagaac caaagtttgt ataaacaggt 2850
tgctataagc ttgtgaaatg aaaatggaac atttcaatca aacatttcct 2900
atataacaat tattatattt acaatttggt ttctgcaata tttttcttat 2950
gtccaccctt ttaaaaatta ttatttgaag taatttattt acaggaaatg 3000
ttaatgagat gtattttctt atagagatat ttcttacaga aagctttgta 3050
gcagaatata tttgcagcta ttgactttgt aatttaggaa aaatgtataa 3100
taagataaaa tctattaaat ttttctcctc taaaaactga aaaaaaaaaa 3150
aaaaaaaaaa aaaaaaaaaa 3170
<210> 70 <211> 259 <212> PRT
<213> Homo sapiens <400> 70
Met Ala Ala Leu Met Arg Ser Lys Asp 1 5
Leu Leu Ala Ala Val Leu Met Val Glu 20
Ser Arg Ala Lys Leu Asn Ser Ile Lys 35
Thr Pro Gly Gin Ala Ala Asn Arg Ser 50
Leu Ala Phe Gly Gly Ser Lys Lys Gly 65
Tyr Pro Cys Ser Ser Asp Lys Glu Cys 80
His Ser Pro His Gin Gly Ser Ser Ala 95
Lys Lys Lys Arg Cys His Arg Asp Gly 110
Ser 10
Ser 25
Ser 40
Ala 55
Lys 70
Glu 85
Cys
100
Met
115
Ser
Ser
Ser
Gly
Asn
Val
Met
Cys
Cys Cys Gin Ile Leu Gly Met Tyr Leu Gly Gly Arg Val Cys Cys Pro
Leu Leu 15
Gly Ser 30
Gly Glu 45
Gin Gly 60
Gin Ala 75
Tyr Cys 90
Arg Arg 105
Ser Thr 120
561
Arg Cys Asn
Thr Pro His Asn His Gly
Gly Arg Pro
Asp Pro Cys
Ala Arg His
Gly Glu Val
Glu Ile Phe
Val Trp Lys
Cys Gin Lys
<210> 71 <211> 1809 <212> DNA <213> Homo s<
<400> 71
tctcaatctg ctgacctcgt gatccgcctg accttgtaat ccacctacct
50
tggcctccca aagtgttggg attacaggcg tgagccaccg cgcccggcca
100
acatcacgtt tttaaaaatt gatttcttca aattcatggc aaatatttcc
150
cttcccttta acttcttatg tcagaatgag gaaggatagc tgcatttatt
200
tagtcagttt tcattgcata gtaatatttt catgtagtat tttctaagtt
250
atattttagt aattcatatg ttttagatta taggttttaa catacttgtg
300
aaaatacttg atgtgtttta aagccttggg cagaaattct gtattgttga
350
ggatttgttc ttttatcccc cttttaaagt catccgtcct tggctcagga
400
tttggagagc ttgcaccacc aaaaatggca aacatcacca gctcccagat
450
tttggaccag ttgaaagctc cgagtttggg ccagtttacc accaccccaa
500
gtacacagca gaatagtaca agtcacccta caactactac ttcttgggac
550
ctcaagcccc caacatccca gtcctcagtc ctcagtcatc ttgacttcaa
600
atctcaacct gagccatccc cagttcttag ccagttgagc cagcgacaac
650
Asn Gly Ile Cys Ile Pro Val Thr Glu Ser Ile Leu 125 130 135
Ile Pro Ala Leu Asp Gly Thr Arg His Arg Asp Arg 140 145 150
His Tyr Ser Asn His Asp Leu Gly Trp Gin Asn Leu 155 160 165
His Thr Lys Met Ser His Ile Lys Gly His Glu Gly 170 175 180
Leu Arg Ser Ser Asp Cys Ile Glu Gly Phe Cys Cys 185 190 195
Phe Trp Thr Lys Ile Cys Lys Pro Val Leu His Gin 200 205 210
Cys Thr Lys Gin Arg Lys Lys Gly Ser His Gly Leu 215 220 225
Gin Arg Cys Asp Cys Ala Lys Gly Leu Ser Cys Lys 230 235 240
Asp Ala Thr Tyr Ser Ser Lys Ala Arg Leu His Val 245 250 255
Ile apiens
562
agcaccagag ccaggcagtc actgttcctc ctcctggttt ggagtccttt
700
ccttcccagg caaaacttcg agaatcaaca cctggagaca gtccctccac
750
tgtgaacaag cttttgcagc ttcccagcac gaccattgaa aatatctctg
800
tgtctgtcca atacccccag ccagccacag cttctaagat cccaaacaca cccagcttct tcaaacttgc gcagtggaaa taagcggcgg tgcctggttc
850 900
agcagatgtc acaggattaa atgtgcagtt tggggctctg gaatttgggt
950
cagaaccttc tctctctgaa tttggatcag ctccaagcag tgaaaatagt
1000
aatcagattc ccatcagctt gtattcgaag tctttaagtg agcctttgaa
1050
tacatcttta tcaatgacca gtgcagtaca gaactccaca tatacaactt
1100
ccgtcattac ctcctgcagt ctgacaagct catcactgaa ttctgctagt
1150
ccagtagcaa tgtcttcctc ttatgaccag agttctgtgc ataacaggat
1200
cccataccaa agccctgtga gttcatcaga gtcagctcca ggaaccatca
1250
tgaatggaca tggtggtggt cgaagtcagc agacactaga cagtaagtat
1300
agcagcaagc tactcttgtc atggctggtg ccaaccaaac agaggaagag
1350
gatagctcac gtgatgtgga aaacaccagt tggtcaatgg ctcattcgtt
1400
aaaaagcagc ccttttgctt ttttgttttt ggaccaggtg ttggctgtgg
1450
tgttattaga aatgtcttaa ccacagcaag aaggaggtgg tggtctcata
1500
ttcttctgcc ctaatcagac tgcaccacaa gtgcagcata cagtatgcat
1550
tttaaagatg cttgggccag gcggggtggc tgatgcccat aatcccagtg
1600
ctttgggggg ccaaggcagg cagattgccc aagctcagga gtttgagacc
1650
accctgggca acatggtgaa actctgtctc tactaaaata cgaaaaacta
1700
gccgggtgtg gtggcggcgc gtgcctgtaa tcccagctac ttgggaggct
1750
gaggcacaag aatcgcttga gccagcttgg gctacaaagt gagactccgt
1800
ctgaaaaga 1809
<210> 72 <211> 363 <212> PRT
<213> Homo sapiens <400> 72
Met Cys Phe Lys Ala Leu Gly Arg Asn Ser Val Leu Leu Arg lie 15 10 15
Cys Ser Phe Ile Pro Leu Leu Lys Ser Ser Val Leu Gly Ser Gly 20 25 30
Phe Gly Glu Leu Ala Pro Pro Lys Met Ala Asn Ile Thr Ser Ser 35 40 45
563
Gin Ile Leu Asp Gin Leu Lys Ala Pro Ser Leu Gly Gin Phe Thr 50 55 60
Thr Thr Pro Ser Thr Gin Gin Asn Ser Thr Ser His Pro Thr Thr 65 70 75
Thr Thr Ser Trp Asp Leu Lys Pro Pro Thr Ser Gin Ser Ser Val
80 85 90
Leu Ser His Leu Asp Phe Lys Ser Gin Pro Glu Pro Ser Pro Val
95 100 105
Leu Ser Gin Leu Ser Gin Arg Gin Gin His Gin Ser Gin Ala Val 110 115 120
Thr Val Pro Pro Pro Gly Leu Glu Ser Phe Pro Ser Gin Ala Lys 125 130 135
Leu Arg Glu Ser Thr Pro Gly Asp Ser Pro Ser Thr Val Asn Lys 140 145 150
Leu Leu Gin Leu Pro Ser Thr Thr Ile Glu Asn Ile Ser Val Ser 155 160 165
Val His Gin Pro Gin Pro Lys His Ile Lys Leu Ala Lys Arg Arg 170 175 180
Ile Pro Pro Ala Ser Lys Ile Pro Ala Ser Ala Val Glu Met Pro 185 190 195
Gly Ser Ala Asp Val Thr Gly Leu Asn Val Gin Phe Gly Ala Leu 200 205 210
Glu Phe Gly Ser Glu Pro Ser Leu Ser Glu Phe Gly Ser Ala Pro 215 220 225
Ser Ser Glu Asn Ser Asn Gin Ile Pro Ile Ser Leu Tyr Ser Lys 230 235 240
Ser Leu Ser Glu Pro Leu Asn Thr Ser Leu Ser Met Thr Ser Ala 245 250 255
Val Gin Asn Ser Thr Tyr Thr Thr Ser Val Ile Thr Ser Cys Ser 260 265 270
Leu Thr Ser Ser Ser Leu Asn Ser Ala Ser Pro Val Ala Met Ser 275 280 285
Ser Ser Tyr Asp Gin Ser Ser Val His Asn Arg Ile Pro Tyr Gin 290 295 300
Ser Pro Val Ser Ser Ser Glu Ser Ala Pro Gly Thr Ile Met Asn 305 310 315
Gly His Gly Gly Gly Arg Ser Gin Gin Thr Leu Asp Ser Lys Tyr 320 325 330
Ser Ser Lys Leu Leu Leu Ser Trp Leu Val Pro Thr Lys Gin Arg 335 340 345
Lys Arg Ile Ala His Val Met Trp Lys Thr Pro Val Gly Gin Trp 350 355 360
Leu Ile Arg
564
<210> 73 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 73 aattcatggc aaatatttcc cttccc 26
<210> 74 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 74 tggtaaactg gcccaaactc gg 22
<210> 75 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 75
ttaaagtcat ccgtccttgg ctcaggattt ggagagcttg caccaccaaa 50
<210> 76
<211> 1989
<212> DNA
<213> Homo sapiens
<400> 76
gccgagtggg acaaagcctg gggctgggcg ggggccatgg cgctgccatc
50
ccgaatcctg ctttggaaac ttgtgcttct gcagagctct gctgttctcc
100
tgcactcagc ggtggaggag acggacgcgg ggctgtacac ctgcaacctg
150
caccatcact actgccacct ctacgagagc ctggccgtcc gcctggaggt
200
caccgacggc cccccggcca cccccgccta ctgggacggc gagaaggagg
250
tgctggcggt ggcgcgcggc gcacccgcgc ttctgacctg cgtgaaccgc
300
gggcacgtgt ggaccgaccg gcacgtggag gaggctcaac aggtggtgca
350
ctgggaccgg cagccgcccg gggtcccgca cgaccgcgcg gaccgcctgc
400
tggacctcta cgcgtcgggc gagcgccgcg cctacgggcc cctttttctg
450
cgcgaccgcg tggctgtggg cgcggatgcc tttgagcgcg gtgacttctc
500
565
actgcgtatc gagccgctgg aggtcgccga cgagggcacc tactcctgcc
550
acctgcacca ccattactgt ggcctgcacg aacgccgcgt cttccacctg
600
acggtcgccg aaccccacgc ggagccgccc ccccggggct ctccgggcaa
650
cggctccagc cacagcggcg ccccaggccc agaccccaca ctggcgcgcg
700
gccacaacgt catcaatgtc atcgtccccg agagccgagc ccacttcttc
750
cagcagctgg gctacgtgct ggccacgctg ctgctcttca tcctgctact
800
ggtcactgtc ctcctggccg cccgcaggcg ccgcggaggc tacgaatact
850
cggaccagaa gtcgggaaag tcaaagggga aggatgttaa cttggcggag
900
ttcgctgtgg ctgcagggga ccagatgctt tacaggagtg aggacatcca
950
gctagattac aaaaacaaca tcctgaagga gagggcggag ctggcccaca
1000
gccccctgcc tgccaagtac atcgacctag acaaagggtt ccggaaggag
1050
aactgcaaat agggaggccc tgggctcctg gctgggccag cagctgcacc
1100
tctcctgtct gtgctcctcg gggcatctcc tgatgctccg gggctcaccc
1150
cccttccagc ggctggtccc gctttcctgg aatttggcct gggcgtatgc
1200
agaggccgcc tccacacccc tcccccaggg gcttggtggc agcatagccc
1250
ccacccctgc ggcctttgct cacgggtggc cctgcccacc cctggcacaa
1300
ccaaaatccc actgatgccc atcatgccct cagacccttc tgggctctgc
1350
ccgctggggg cctgaagaca ttcctggagg acactcccat cagaacctgg
1400
cagccccaaa actggggtca gcctcagggc aggagtccca ctcctccagg
1450
gctctgctcg tccggggctg ggagatgttc ctggaggagg acactcccat
1500
cagaacttgg cagccttgaa gttggggtca gcctcggcag gagtcccact
1550
cctcctgggg tgctgcctgc caccaagagc tcccccacct gtaccaccat
1600
gtgggactcc aggcaccatc tgttctcccc agggacctgc tgacttgaat
1650
gccagccctt gctcctctgt gttgctttgg gccacctggg gctgcacccc
1700
ctgccctttc tctgccccat ccctacccta gccttgctct cagccacctt
1750
gatagtcact gggctccctg tgacttctga ccctgacacc cctcccttgg
1800
actctgcctg ggctggagtc tagggctggg gctacatttg gcttctgtac
1850
tggctgagga caggggaggg agtgaagttg gtttggggtg gcctgtgttg
1900
ccactctcag caccccacat ttgcatctgc tggtggacct gccaccatca
1950
caataaagtc cccatctgat ttttaaaaaa aaaaaaaaa :
1989
<210> 77 <211> 341
566
<212> PRT
<213> Homo sapiens
<400> 77
Met Ala Leu Pro Ser Arg Ile Leu Leu Trp Lys Leu Val Leu Leu 15 10 15
Gin Ser Ser Ala Val Leu Leu His Ser Ala Val Glu Glu Thr Asp 20 25 30
Ala Gly Leu Tyr Thr Cys Asn Leu His His His Tyr Cys His Leu 35 40 45
Tyr Glu Ser Leu Ala Val Arg Leu Glu Val Thr Asp Gly Pro Pro 50 55 60
Ala Thr Pro Ala Tyr Trp Asp Gly Glu Lys Glu Val Leu Ala Val 65 70 75
Ala Arg Gly Ala Pro Ala Leu Leu Thr Cys Val Asn Arg Gly His 80 85 90
Val Trp Thr Asp Arg His Val Glu Glu Ala Gin Gin Val Val His 95 100 105
Trp Asp Arg Gin Pro Pro Gly Val Pro His Asp Arg Ala Asp Arg 110 115 120
Leu Leu Asp Leu Tyr Ala Ser Gly Glu Arg Arg Ala Tyr Gly Pro 125 130 135
Leu Phe Leu Arg Asp Arg Val Ala Val Gly Ala Asp Ala Phe Glu 140 145 150
Arg Gly Asp Phe Ser Leu Arg Ile Glu Pro Leu Glu Val Ala Asp 155 160 165
Glu Gly Thr Tyr Ser Cys His Leu His His His Tyr Cys Gly Leu 170 175 180
His Glu Arg Arg Val Phe His Leu Thr Val Ala Glu Pro His Ala 185 190 195
Glu Pro Pro Pro Arg Gly Ser Pro Gly Asn Gly Ser Ser His Ser 200 205 210
Gly Ala Pro Gly Pro Asp Pro Thr Leu Ala Arg Gly His Asn Val 215 220 225
Ile Asn Val Ile Val Pro Glu Ser Arg Ala His Phe Phe Gin Gin 230 235 240
Leu Gly Tyr Val Leu Ala Thr Leu Leu Leu Phe Ile Leu Leu Leu 245 250 255
Val Thr Val Leu Leu Ala Ala Arg Arg Arg Arg Gly Gly Tyr Glu 260 265 270
Tyr Ser Asp Gin Lys Ser Gly Lys Ser Lys Gly Lys Asp Val Asn 275 280 285
Leu Ala Glu Phe Ala Val Ala Ala Gly Asp Gin Met Leu Tyr Arg
567
290 295 300
Ser Glu Asp Ile Gin Leu Asp Tyr Lys Asn Asn Ile Leu Lys Glu 305 310 315
Arg Ala Glu Leu Ala His Ser Pro Leu Pro Ala Lys Tyr lie Asp 320 325 330
Leu Asp Lys Gly Phe Arg Lys Glu Asn Cys Lys 335 340
<210> 78
<211> 2243
<212> DNA
<213> Homo sapiens
<400> 78
cgccggaggc agcggcggcg tggcgcagcg gcgacatggc cgttgtctca
50
gaggacgact ttcagcacag ttcaaactcc acctacggaa ccacaagcag
100
cagtctccga gctgaccagg aggcactgct tgagaagctg ctggaccgcc
150
cgccccctgg cctgcagagg cccgaggacc gcttctgtgg cacatacatc
200
atcttcttca gcctgggcat tggcagtcta ctgccatgga acttctttat
250
cactgccaag gagtactgga tgttcaaact ccgcaactcc tccagcccag
300
ccaccgggga ggaccctgag ggctcagaca tcctgaacta ctttgagagc
350
taccttgccg ttgcctccac cgtgccctcc atgctgtgcc tggtggccaa
400
cttcctgctt gtcaacaggg ttgcagtcca catccgtgtc ctggcctcac
450
tgacggtcat cctggccatc ttcatggtga taactgcact ggtgaaggtg
500
gacacttcct cctggacccg tggttttttt gcggtcacca ttgtctgcat
550
ggtgatcctc agcggtgcct ccactgtctt cagcagcagc atctacggca
600
tgaccggctc ctttcctatg aggaactccc aagcactgat atcaggagga
650
gccatgggcg ggacggtcag cgccgtggcc tcattggtgg acttggctgc
700
atccagtgat gtgaggaaca gcgccctggc cttcttcctg acggccacca
750
tcttcctcgt gctctgcatg ggactctacc tgctgctgtc caggctggag
800
tatgccaggt actacatgag gcctgttctt gcggcccatg tgttttctgg
850
tgaagaggag cttccccagg actccctcag tgccccttcg gtggcctcca
900
gattcattga ttcccacaca ccccctctcc gccccatcct gaagaagacg
950
gccagcctgg gcttctgtgt cacctacgtc ttcttcatca ccagcctcat
1000
ctaccccgcc gtctgcacca acatcgagtc cctcaacaag ggctcgggct
1050
cactgtggac caccaagttt ttcatccccc tcactacctt cctcctgtac
1100
aactttgctg acctatgtgg ccggcagctc accgcctgga tccaggtgcc
1150
568
agggcccaac agcaaggcgc tcccagggtt cgtgctcctc cggacctgcc
1200
tcatccccct cttcgtgctc tgtaactacc agccccgcgt ccacctgaag
1250
actgtggtct tccagtccga tgtgtacccc gcactcctca gctccctgct
1300
ggggctcagc aacggctacc tcagcaccct ggccctcctc tacgggccta
1350
agattgtgcc cagggagctg gctgaggcca cgggagtggt gatgtccttt
1400
tatgtgtgct tgggcttaac actgggctca gcctgctcta ccctcctggt
1450
gcacctcatc tagaagggag gacacaagga cattggtgct tcagagcctt
1500
tgaagatgag aagagagtgc aggagggctg ggggccatgg aggaaaggcc
1550
taaagtttca cttggggaca gagagcagag cacactcggg cctcatccct
1600
cccaagatgc cagtgagcca cgtccatgcc cattccgtgc aaggcagata
1650
ttccagtcat attaacagaa cactcctgag acagttgaag aagaaatagc
1700
acaaatcagg ggtactccct tcacagctga tggttaacat tccaccttct
1750
ttctagccct tcaaagatgc tgccagtgtt cgccctagag ttattacaaa
1800
gccagtgcca aaacccagcc atgggctctt tgcaacctcc cagctgcgct
1850
cattccagct gacagcgaga tgcaagcaaa tgctcagctc tccttaccct
1900
gaaggggtct ccctggaatg gaagtcccct ggcatggtca gtcctcaggc
1950
ccaagactca agtgtgcaca gacccctgtg ttctgcgggt gaacaactgc
2000
ccactaacca gactggaaaa cccagaaaga tgggccttcc atgaatgctt
2050
cattccagag ggaccagagg gcctccctgt gcaagggatc aagcatgtct
2100
ggcctgggtt ttcaaaaaaa gagggatcct catgacctgg tggtctatgg
2150
cctgggtcaa gatgagggtc tttcagtgtt cctgtttaca acatgtcaaa
2200
gccattggtt caagggcgta ataaatactt gcgtattcaa aaa 2243
<210> 79 <211> 475 <212> PRT
<213> Homo sapiens <400> 79
Met Ala Val Val Ser Glu Asp Asp Phe Gin His Ser Ser Asn Ser 15 10 15
Thr Tyr Gly Thr Thr Ser Ser Ser Leu Arg Ala Asp Gin Glu Ala 20 25 30
Leu Leu Glu Lys Leu Leu Asp Arg Pro Pro Pro Gly Leu Gin Arg 35 40 45
Pro Glu Asp Arg Phe Cys Gly Thr Tyr Ile Ile Phe Phe Ser Leu 50 55 60
569
Gly Ile Gly Ser Leu Leu Pro Trp Asn Phe Phe Ile Thr Ala Lys 65 70 75
Glu Tyr Trp Met Phe Lys Leu Arg Asn Ser Ser Ser Pro Ala Thr 80 85 90
Gly Glu Asp Pro Glu Gly Ser Asp Ile Leu Asn Tyr Phe Glu Ser 95 100 105
Tyr Leu Ala Val Ala Ser Thr Val Pro Ser Met Leu Cys Leu Val 110 115 120
Ala Asn Phe Leu Leu Val Asn Arg Val Ala Val His Ile Arg Val 125 130 135
Leu Ala Ser Leu Thr Val Ile Leu Ala Ile Phe Met Val Ile Thr 140 145 150
Ala Leu Val Lys Val Asp Thr Ser Ser Trp Thr Arg Gly Phe Phe 155 160 165
Ala Val Thr Ile Val Cys Met Val Ile Leu Ser Gly Ala Ser Thr 170 175 180
Val Phe Ser Ser Ser Ile Tyr Gly Met Thr Gly Ser Phe Pro Met 185 190 195
Arg Asn Ser Gin Ala Leu Ile Ser Gly Gly Ala Met Gly Gly Thr 200 205 210
Val Ser Ala Val Ala Ser Leu Val Asp Leu Ala Ala Ser Ser Asp 215 220 225
Val Arg Asn Ser Ala Leu Ala Phe Phe Leu Thr Ala Thr Ile Phe 230 235 240
Leu Val Leu Cys Met Gly Leu Tyr Leu Leu Leu Ser Arg Leu Glu 245 250 255
Tyr Ala Arg Tyr Tyr Met Arg Pro Val Leu Ala Ala His Val Phe 260 265 270
Ser Gly Glu Glu Glu Leu Pro Gin Asp Ser Leu Ser Ala Pro Ser 275 280 285
Val Ala Ser Arg Phe Ile Asp Ser His Thr Pro Pro Leu Arg Pro 290 295 300
Ile Leu Lys Lys Thr Ala Ser Leu Gly Phe Cys Val Thr Tyr Val 305 310 315
Phe Phe Ile Thr Ser Leu Ile Tyr Pro Ala Val Cys Thr Asn Ile 320 325 330
Glu Ser Leu Asn Lys Gly Ser Gly Ser Leu Trp Thr Thr Lys Phe 335 340 345
Phe Ile Pro Leu Thr Thr Phe Leu Leu Tyr Asn Phe Ala Asp Leu 350 355 360
Cys Gly Arg Gin Leu Thr Ala Trp Ile Gin Val Pro Gly Pro Asn
365
570
370
375
Ser Lys Ala
Pro Leu Phe
Thr Val Val
Leu Leu Gly
Tyr Gly Pro
Val Val Met
Ala Cys Ser
<210> 80 <211> 22 <212> DNA <213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 80 ttttgcggtc accattgtct gc 22
<210> 81 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 81 cgtaggtgac acagaagccc agg 23
<210> 82 <211> 49 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 82
tacggcatga ccggctcctt tcctatgagg aactcccagg cactgatat 49
<210> 83
<211> 1844
<212> DNA
<213> Homo sapiens
Leu Pro Gly Phe Val Leu Leu Arg Thr Cys Leu Ile 380 385 390
Val Leu Cys Asn Tyr Gin Pro Arg Val His Leu Lys 395 400 405
Phe Gin Ser Asp Val Tyr Pro Ala Leu Leu Ser Ser 410 415 420
Leu Ser Asn Gly Tyr Leu Ser Thr Leu Ala Leu Leu 425 430 435
Lys Ile Val Pro Arg Glu Leu Ala Glu Ala Thr Gly 440 445 450
Ser Phe Tyr Val Cys Leu Gly Leu Thr Leu Gly Ser 455 460 465
Thr Leu Leu Val His Leu Ile 470 475
<400> 83
571
gacagtggag ggcagtggag aggaccgcgc tgtcctgctg tcaccaagag
50
ctggagacac catctcccac cgagagtcat ggccccattg gccctgcacc
100
tcctcgtcct cgtccccatc ctcctcagcc tggtggcctc ccaggactgg
150
aaggctgaac gcagccaaga ccccttcgag aaatgcatgc aggatcctga
200
ctatgagcag ctgctcaagg tggtgacctg ggggctcaat cggaccctga
250
agccccagag ggtgattgtg gttggcgctg gtgtggccgg gctggtggcc
300
gccaaggtgc tcagcgatgc tggacacaag gtcaccatcc tggaggcaga
350
taacaggatc gggggccgca tcttcaccta ccgggaccag aacacgggct
400
ggattgggga gctgggagcc atgcgcatgc ccagctctca caggatcctc
450
cacaagctct gccagggcct ggggctcaac ctgaccaagt tcacccagta
500
cgacaagaac acgtggacgg aggtgcacga agtgaagctg cgcaactatg
550
tggtggagaa ggtgcccgag aagctgggct acgccttgcg tccccaggaa
600
aagggccact cgcccgaaga catctaccag atggctctca accaggccct
650
caaagacctc aaggcactgg gctgcagaaa ggcgatgaag aagtttgaaa
700
ggcacacgct cttggaatat cttctcgggg aggggaacct gagccggccg
750
gccgtgcagc ttctgggaga cgtgatgtcc gaggatggct tcttctatct
800
cagcttcgcc gaggccctcc gggcccacag ctgcctcagc gacagactcc
850
agtacagccg catcgtgggt ggctgggacc tgctgccgcg cgcgctgctg
900
agctcgctgt ccgggcttgt gctgttgaac gcgcccgtgg tggcgatgac
950
ccagggaccg cacgatgtgc acgtgcagat cgagacctct cccccggcgc
1000
ggaatctgaa ggtgctgaag gccgacgtgg tgctgctgac ggcgagcgga
1050
ccggcggtga agcgcatcac cttctcgccg ccgctgcccc gccacatgca
1100
ggaggcgctg cggaggctgc actacgtgcc ggccaccaag gtgttcctaa
1150
gcttccgcag gcccttctgg cgcgaggagc acattgaagg cggccactca
1200
aacaccgatc gcccgtcgcg catgattttc tacccgccgc cgcgcgaggg
1250
cgcgctgctg ctggcctcgt acacgtggtc ggacgcggcg gcagcgttcg
1300
ccggcttgag ccgggaagag gcgttgcgct tggcgctcga cgacgtggcg
1350
gcattgcacg ggcctgtcgt gcgccagctc tgggacggca ccggcgtcgt
1400
caagcgttgg gcggaggacc agcacagcca gggtggcttt gtggtacagc
1450
cgccggcgct ctggcaaacc gaaaaggatg actggacggt cccttatggc
1500
cgcatctact ttgccggcga gcacaccgcc tacccgcacg gctgggtgga
1550
572
gacggcggtc aagtcggcgc tgcgcgccgc catcaagatc aacagccgga 1600
aggggcctgc atcggacacg gccagccccg aggggcacgc atctgacatg 1650
gaggggcagg ggcatgtgca tggggtggcc agcagcccct cgcatgacct 17 00
ggcaaaggaa gaaggcagcc accctccagt ccaaggccag ttatctctcc 1750
aaaacacgac ccacacgagg acctcgcatt aaagtatttt cggaaaaaaa 1800
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 1844
<210> 84 <211> 567 <212> PRT
<213> Homo sapiens <400> 84
Met Ala Pro Leu Ala Leu His Leu Leu Val Leu Val Pro Ile Leu 15 10 15
Leu Ser Leu Val Ala Ser Gin Asp Trp Lys Ala Glu Arg Ser Gin 20 25 30
Asp Pro Phe Glu Lys Cys Met Gin Asp Pro Asp Tyr Glu Gin Leu 35 40 45
Leu Lys Val Val Thr Trp Gly Leu Asn Arg Thr Leu Lys Pro Gin 50 55 60
Arg Val Ile Val Val Gly Ala Gly Val Ala Gly Leu Val Ala Ala 65 70 75
Lys Val Leu Ser Asp Ala Gly His Lys Val Thr Ile Leu Glu Ala 80 85 90
Asp Asn Arg Ile Gly Gly Arg Ile Phe Thr Tyr Arg Asp Gin Asn 95 100 105
Thr Gly Trp Ile Gly Glu Leu Gly Ala Met Arg Met Pro Ser Ser 110 115 120
His Arg Ile Leu His Lys Leu Cys Gin Gly Leu Gly Leu Asn Leu 125 130 135
Thr Lys Phe Thr Gin Tyr Asp Lys Asn Thr Trp Thr Glu Val His 140 145 150
Glu Val Lys Leu Arg Asn Tyr Val Val Glu Lys Val Pro Glu Lys 155 160 165
Leu Gly Tyr Ala Leu Arg Pro Gin Glu Lys Gly His Ser Pro Glu 170 175 • 180
Asp Ile Tyr Gin Met Ala Leu Asn Gin Ala Leu Lys Asp Leu Lys 185 190 195
Ala Leu Gly Cys Arg Lys Ala Met Lys Lys Phe Glu Arg His Thr 200 205 210
Leu Leu Glu Tyr Leu Leu Gly Glu Gly Asn Leu Ser Arg Pro Ala 215 220 225
573
Val Gin Leu Leu Gly Asp Val Met Ser Glu Asp Gly Phe Phe Tyr 230 235 240
Leu Ser Phe Ala Glu Ala Leu Arg Ala His Ser Cys Leu Ser Asp 245 250 255
Arg Leu Gin Tyr Ser Arg Ile Val Gly Gly Trp Asp Leu Leu Pro 260 265 270
Arg Ala Leu Leu Ser Ser Leu Ser Gly Leu Val Leu Leu Asn Ala 275 280 285
Pro Val Val Ala Met Thr Gin Gly Pro His Asp Val His Val Gin 290 295 300
Ile Glu Thr Ser Pro Pro Ala Arg Asn Leu Lys Val Leu Lys Ala 305 310 315
Asp Val Val Leu Leu Thr Ala Ser Gly Pro Ala Val Lys Arg Ile 320 325 330
Thr Phe Ser Pro Pro Leu Pro Arg His Met Gin Glu Ala Leu Arg 335 340 345
Arg Leu His Tyr Val Pro Ala Thr Lys Val Phe Leu Ser Phe Arg 350 355 360
Arg Pro Phe Trp Arg Glu Glu His Ile Glu Gly Gly His Ser Asn 365 370 375
Thr Asp Arg Pro Ser Arg Met Ile Phe Tyr Pro Pro Pro Arg Glu 380 385 390
Gly Ala Leu Leu Leu Ala Ser Tyr Thr Trp Ser Asp Ala Ala Ala 395 400 405
Ala Phe Ala Gly Leu Ser Arg Glu Glu Ala Leu Arg Leu Ala Leu 410 415 420
Asp Asp Val Ala Ala Leu His Gly Pro Val Val Arg Gin Leu Trp 425 430 435
Asp Gly Thr Gly Val Val Lys Arg Trp Ala Glu Asp Gin His Ser 440 445 450
Gin Gly Gly Phe Val Val Gin Pro Pro Ala Leu Trp Gin Thr Glu 455 460 465
Lys Asp Asp Trp Thr Val Pro Tyr Gly Arg Ile Tyr Phe Ala Gly 470 475 480
Glu His Thr Ala Tyr Pro His Gly Trp Val Glu Thr Ala Val Lys 485 490 495
Ser Ala Leu Arg Ala Ala Ile Lys Ile Asn Ser Arg Lys Gly Pro 500 505 510
Ala Ser Asp Thr Ala Ser Pro Glu Gly His Ala Ser Asp Met Glu 515 520 525
Gly Gin Gly His Val His Gly Val Ala Ser Ser Pro Ser His Asp 530 535 540
574
Leu Ala Lys Glu Glu Gly Ser His Pro Pro Val Gin Gly Gin Leu 545 550 555
Ser Leu Gin Asn Thr Thr His Thr Arg Thr Ser His 560 565
<210> 85
<211> 3316
<212> DNA
<213> Homo sapiens
<400> 85
ctgacatggc ctgactcggg acagctcaga gcagggcaga actggggaca
50
ctctgggccg gccttctgcc tgcatggacg ctctgaagcc accctgtctc
100
tggaggaacc acgagcgagg gaagaaggac agggactcgt gtggcaggaa
150
gaactcagag ccgggaagcc cccattcact agaagcactg agagatgcgg
200
ccccctcgca gggtctgaat ttcctgctgc tgttcacaaa gatgcttttt
250
atctttaact ttttgttttc cccacttccg accccggcgt tgatctgcat
300
cctgacattt ggagctgcca tcttcttgtg gctgatcacc agacctcaac
350
ccgtcttacc tcttcttgac ctgaacaatc agtctgtggg aattgaggga
400
ggagcacgga agggggtttc ccagaagaac aatgacctaa caagttgctg
450
cttctcagat gccaagacta tgtatgaggt tttccaaaga ggactcgctg
500
tgtctgacaa tgggccctgc ttgggatata gaaaaccaaa ccagccctac
550
agatggctat cttacaaaca ggtgtctgat agagcagagt acctgggttc
600
ctgtctcttg cataaaggtt ataaatcatc accagaccag tttgtcggca
650
tctttgctca gaataggcca gagtggatca tctccgaatt ggcttgttac
700
acgtactcta tggtagctgt acctctgtat gacaccttgg gaccagaagc
750
catcgtacat attgtcaaca aggctgatat cgccatggtg atctgtgaca
800
caccccaaaa ggcattggtg ctgataggga atgtagagaa aggcttcacc
850
ccgagcctga aggtgatcat ccttatggac ccctttgatg atgacctgaa
900
gcaaagaggg gagaagagtg gaattgagat cttatcccta tatgatgctg
950
agaacctagg caaagagcac ttcagaaaac ctgtgcctcc tagcccagaa
1000
gacctgagcg tcatctgctt caccagtggg accacaggtg accccaaagg
1050
agccatgata acccatcaaa atattgtttc aaatgctgct gcctttctca
1100
aatgtgtgga gcatgcttat gagcccactc ctgatgatgt ggccatatcc
1150
tacctccctc tggctcatat gtttgagagg attgtacagg ctgttgtgta
1200
cagctgtgga gccagagttg gattcttcca aggggatatt cggttgctgg
1250
575
ctgacgacat gaagactttg aagcccacat tgtttcccgc ggtgcctcga
1300
ctccttaaca ggatctacga taaggtacaa aatgaggcca agacaccctt
1350
gaagaagttc ttgttgaagc tggctgtttc cagtaaattc aaagagcttc
1400
aaaagggtat catcaggcat gatagtttct gggacaagct catctttgca
1450
aagatccagg acagcctggg cggaagggtt cgtgtaattg tcactggagc
1500
tgcccccatg tccacttcag tcatgacatt cttccgggca gcaatgggat
1550
gtcaggtgta tgaagcttat ggtcaaacag aatgcacagg tggctgtaca
1600
tttacattac ctggggactg gacatcaggt cacgttgggg tgcccctggc
1650
ttgcaattac gtgaagctgg aagatgtggc tgacatgaac tactttacag
1700
tgaataatga tacctgaagg aggagaggtc accctgagaa tgcatcaagg gacacaggaa gtacaaacgt gccctggaca gttcaaagga gtgatggctg
1750 1800
gcttcacaca ggagacattg gtcgctggct cccgaatgga actctgaaga
1850
tcatcgaccg taaaaagaac attttcaagc tggcccaagg agaatacatt
1900
gcaccagaga agatagaaaa tatctacaac aggagtcaac cagtgttaca
1950
aatttttgta cacggggaga gcttacggtc atccttagta ggagtggtgg
2000
ttcctgacac agatgtactt ccctcatttg cagccaagct tggggtgaag
2050
ggctcctttg aggaactgtg ccaaaaccaa gttgtaaggg aagccatttt
2100
agaagacttg cagaaaattg ggaaagaaag tggccttaaa acttttgaac
2150
aggtcaaagc catttttctt catccagagc cattttccat tgaaaatggg
2200
ctcttgacac caacattgaa agcaaagcga ggagagcttt ccaaatactt
2250
tcggacccaa attgacagcc tgtatgagca catccaggat taggataagg
2300
tacttaagta cctgccggcc cactgtgcac tgcttgtgag aaaatggatt
2350
aaaaactatt cttacatttg ttttgccttt cctcctattt ttttttaacc
2400
tgttaaactc taaagccata gcttttgttt tatattgaga catataatgt
2450
gtaaacttag ttcccaaata aatcaatcct gtctttccca tcttcgatgt
2500
tgctaatatt aaggcttcag ggctactttt atcaacatgc ctgtcttcaa
2550
gatcccagtt tatgttctgt gtccttcctc atgatttcca accttaatac
2600
tattagtaac cacaagttca agggtcaaag ggaccctctg tgccttcttc
2650
tttgttttgt gataaacata acttgccaac agtctctatg cttatttaca
2700
tcttctactg ttcaaactaa gagattttta aattctgaaa aactgcttac
2750
aattcatgtt ttctagccac tccacaaacc actaaaattt tagttttagc
2800
576
ctatcactca tgtcaatcat atctatgaga caaatgtctc cgatgctctt 2850
ctgcgtaaat taaattgtgt actgaaggga aaagtttgat cataccaaac 2900
atttcctaaa ctctctagtt agatatctga cttgggagta ttaaaaattg 2950
ggtctatgac atactgtcca aaaggaatgc tgttcttaaa gcattattta 3000
cagtaggaac tggggagtaa atctgttccc tacagtttgc tgctgagctg 3050
gaagctgtgg gggaaggagt tgacaggtgg gcccagtgaa cttttccagt 3100
aaatgaagca agcactgaat aaaaacctcc tgaactggga acaaagatct 3150
acaggcaagc aagatgccca cacaacaggc ttattttctg tgaaggaacc 3200
aactgatctc ccccaccctt ggattagagt tcctgctcta ccttacccac 32 50
agataacaca tgttgtttct acttgtaaat gtaaagtctt taaaataaac 3300
tattacagat aaaaaa 3316 <210> 86 <211> 739 <212> PRT <213> Homo sapiens
<400> 86
Met Asp Ala Leu Lys Pro Pro Cys Leu Trp Arg Asn His Glu Arg 15 10 15
Gly Lys Lys Asp Arg Asp Ser Cys Gly Arg Lys Asn Ser Glu Pro 20 25 30
Gly Ser Pro His Ser Leu Glu Ala Leu Arg Asp Ala Ala Pro Ser 35 40 45
Gin Gly Leu Asn Phe Leu Leu Leu Phe Thr Lys Met Leu Phe Ile 50 55 60
Phe Asn Phe Leu Phe Ser Pro Leu Pro Thr Pro Ala Leu Ile Cys 65 70 75
Ile Leu Thr Phe Gly Ala Ala Ile Phe Leu Trp Leu Ile Thr Arg 80 85 90
Pro Gin Pro Val Leu Pro Leu Leu Asp Leu Asn Asn Gin Ser Val 95 100 105
Gly Ile Glu Gly Gly Ala Arg Lys Gly Val Ser Gin Lys Asn Asn 110 115 120
Asp Leu Thr Ser Cys Cys Phe Ser Asp Ala Lys Thr Met Tyr Glu 125 130 135
Val Phe Gin Arg Gly Leu Ala Val Ser Asp Asn Gly Pro Cys Leu 140 145 150
Gly Tyr Arg Lys Pro Asn Gin Pro Tyr Arg Trp Leu Ser Tyr Lys 155 160 165
Gin Val Ser Asp Arg Ala Glu Tyr Leu Gly Ser Cys Leu Leu His 170 175 180
577
Lys Gly Tyr Lys Ser Ser Pro Asp Gin Phe Val Gly lie Phe Ala 185 190 195
Gin Asn Arg Pro Glu Trp Ile Ile Ser Glu Leu Ala Cys Tyr Thr 200 205 210
Tyr Ser Met Val Ala Val Pro Leu Tyr Asp Thr Leu Gly Pro Glu 215 220 225
Ala Ile Val His Ile Val Asn Lys Ala Asp Ile Ala Met Val Ile 230 235 240
Cys Asp Thr Pro Gin Lys Ala Leu Val Leu Ile Gly Asn Val Glu 245 250 255
Lys Gly Phe Thr Pro Ser Leu Lys Val lie Ile Leu Met Asp Pro 260 265 270
Phe Asp Asp Asp Leu Lys Gin Arg Gly Glu Lys Ser Gly Ile Glu 275 280 285
Ile Leu Ser Leu Tyr Asp Ala Glu Asn Leu Gly Lys Glu His Phe 290 295 300
Arg Lys Pro Val Pro Pro Ser Pro Glu Asp Leu Ser Val Ile Cys 305 310 315
Phe Thr Ser Gly Thr Thr Gly Asp Pro Lys Gly Ala Met Ile Thr 320 325 330
His Gin Asn Ile Val Ser Asn Ala Ala Ala Phe Leu Lys Cys Val 335 340 345
Glu His Ala Tyr Glu Pro Thr Pro Asp Asp Val Ala Ile Ser Tyr 350 355 360
Leu Pro Leu Ala His Met Phe Glu Arg Ile Val Gin Ala Val Val 365 370 375
Tyr Ser Cys Gly Ala Arg Val Gly Phe Phe Gin Gly Asp Ile Arg 380 385 390
Leu Leu Ala Asp Asp Met Lys Thr Leu Lys Pro Thr Leu Phe Pro 395 400 405
Ala Val Pro Arg Leu Leu Asn Arg Ile Tyr Asp Lys Val Gin Asn 410 415 420
Glu Ala Lys Thr Pro Leu Lys Lys Phe Leu Leu Lys Leu Ala Val 425 430 435
Ser Ser Lys Phe Lys Glu Leu Gin Lys Gly Ile Ile Arg His Asp 440 445 450
Ser Phe Trp Asp Lys Leu Ile Phe Ala Lys Ile Gin Asp Ser Leu 455 460 465
Gly Gly Arg Val Arg Val Ile Val Thr Gly Ala Ala Pro Met Ser 470 475 480
Thr Ser Val Met Thr Phe Phe Arg Ala Ala Met Gly Cys Gin Val
578
485 490 495
Tyr Glu Ala Tyr Gly Gin Thr Glu Cys Thr Gly Gly Cys Thr Phe 500 505 510
Thr Leu Pro Gly Asp Trp Thr Ser Gly His Val Gly Val Pro Leu 515 520 525
Ala Cys Asn Tyr Val Lys Leu Glu Asp Val Ala Asp Met Asn Tyr 530 535 540
Phe Thr Val Asn Asn Glu Gly Glu Val Cys Ile Lys Gly Thr Asn 545 550 555
Val Phe Lys Gly Tyr Leu Lys Asp Pro Glu Lys Thr Gin Glu Ala 560 565 570
Leu Asp Ser Asp Gly Trp Leu His Thr Gly Asp Ile Gly Arg Trp 575 580 585
Leu Pro Asn Gly Thr Leu Lys Ile Ile Asp Arg Lys Lys Asn Ile 590 595 600
Phe Lys Leu Ala Gin Gly Glu Tyr Ile Ala Pro Glu Lys Ile Glu 605 610 615
Asn Ile Tyr Asn Arg Ser Gin Pro Val Leu Gin Ile Phe Val His 620 625 630
Gly Glu Ser Leu Arg Ser Ser Leu Val Gly Val Val Val Pro Asp 635 640 645
Thr Asp Val Leu Pro Ser Phe Ala Ala Lys Leu Gly Val Lys Gly 650 655 660
Ser Phe Glu Glu Leu Cys Gin Asn Gin Val Val Arg Glu Ala Ile 665 670 675
Leu Glu Asp Leu Gin Lys Ile Gly Lys Glu Ser Gly Leu Lys Thr 680 685 690
Phe Glu Gin Val Lys Ala Ile Phe Leu His Pro Glu Pro Phe Ser 695 700 705
Ile Glu Asn Gly Leu Leu Thr Pro Thr Leu Lys Ala Lys Arg Gly 710 715 720
Glu Leu Ser Lys Tyr Phe Arg Thr Gin Ile Asp Ser Leu Tyr Glu 725 730 735
His Ile Gin Asp
<210> 87
<211> 2725
<212> DNA
<213> Homo sapiens
<400> 87
ggaggcggag gccgcggcga gccgggccga gcagtgaggg ccctagcggg 50
gcccgagcgg ggcccggggc ccctaagcca ttcctgaagt catgggctgg 100
579
ccaggacatt ggtgacccgc caatccggta tggacgactg gaagcccagc
150
cccctcatca agccctttgg ggctcggaag aagcggagct ggtaccttac
200
ctggaagtat aaactgacaa accagcgggc cctgcggaga ttctgtcaga
250
caggggccgt gcttttcctg ctggtgactg tcattgtcaa tatcaagttg
300
atcctggaca ctcggcgagc catcagtgaa gccaatgaag acccagagcc
350
agagcaagac tatgatgagg ccctaggccg cctggagccc ccacggcgca
400
gaggcagtgg tccccggcgg gtcctggacg tagaggtgta ttcaagtcgc
450
agcaaagtat atgtggcagt ggatggcacc acggtgctgg aggatgaggc
500
ccgggagcag ggccggggca tccatgtcat tgtcctcaac caggccacgg
550
gccacgtgat ggcaaaacgt gtgtttgaca cgtactcacc tcatgaggat
600
gaggccatgg tgctattcct caacatggta gcgcccggcc gagtgctcat
650
ctgcactgtc ctctgctgag aaggatgagg gagcctgggc gctccttcca agccaggctg cctcaaggac gccctgccct acagccaagg gggctggagg
700 750
gacacatggg ccttcgtggg acgaaaagga ggtcctgtct tcggggagaa
800
acattctaag tcacctgccc tctcttcctg gggggaccca gtcctgctga
850
agacagatgt gccattgagc tcagcagaag aggcagagtg ccactgggca
900
gacacagagc tgaaccgtcg ccgccggcgc ttctgcagca aagttgaggg
950
ctatggaagt gtatgcagct gcaaggaccc cacacccatc gagttcagcc
1000
ctgacccact cccagacaac aaggtcctca atgtgcctgt ggctgtcatt
1050
gcagggaacc gacccaatta cctgtacagg atgctgcgct ctctgctttc
1100
agcccagggg gtgtctcctc agatgataac agttttcatt gacggctact
1150
atgaggaacc catggatgtg gtggcactgt ttggtctgag gggcatccag
1200
catactccca tcagcatcaa gaatgcccgc gtgtctcagc actacaaggc
1250
cagcctcact gccactttca acctgtttcc ggaggccaag tttgctgtgg
1300
ttctggaaga ggacctggac attgctgtgg attttttcag tttcctgagc
1350
caatccatcc acctactgga ggaggatgac agcctgtact gcatctctgc
1400
ctggaatgac caggggtatg aacacacggc tgaggaccca gcactactgt
1450
accgtgtgga gaccatgcct gggctgggct gggtgctcag gaggtccttg
1500
tacaaggagg agcttgagcc caagtggcct acaccggaaa agctctggga
1550
ttgggacatg tggatgcgga tgcctgaaca acgccggggc cgagagtgca
1600
tcatccctga cgtttcccga tcctaccact ttggcatcgt cggcctcaac
1650
580
atgaatggct actttcacga ggcctacttc aagaagcaca agttcaacac
17 00
ggttccaggt gtccagctca ggaatgtgga cagtctgaag aaagaagctt
1750
atgaagtgga agttcacagg ctgctcagtg aggctgaggt tctggaccac
1800
agcaagaacc cttgtgaaga ctctttcctg ccagacacag agggccacac
1850
ctacgtggcc tttattcgaa tggagaaaga tgatgacttc accacctgga
1900
cccagcttgc caagtgcctc catatctggg acctggatgt gcgtggcaac
1950
catcggggcc tgtggagatt gtttcggaag aagaaccact tcctggtggt
2000
gggggtcccg gcttccccct actcagtgaa gaagccaccc tcagtcaccc
2050
caattttcct ggagccaccc ccaaaggagg agggagcccc aggagcccca
2100
gaacagacat gagacctcct ccaggaccct gcggggctgg gtactgtgta
2150
cccccaggct ggctagccct tccctccatc ctgtaggatt ttgtagatgc
2200
tggtaggggc tggggctacc ttgtttttaa catgagactt aattactaac
2250
tccaagggga ctatttattt gggttcccct acttccttgt gctccaacac tggagaaggg cccgttcctg caggagagta agttaaaagt cctgggaatc
2300 2350
attacgatcc ctagcagctc atcctgccct ttgaataccc tcactttcca
2400
ggcctggctc agaatctaac ctatttattg actgtcctga gggccttgaa
2450
aacaggccga acctggaggg cctggatttc tttttgggct ggaatgctgc
2500
cctgagggtg gggctggctc ttactcagga aactgctgtg cccaacccat
2550
ggacaggccc agctggggcc cacatgctga cacagactca ctcagagacc
2600
cttagacact ggaccaggcc tcctctcagc cttctctttg tccagatttc
2650
caaagctgga taagttggtc attgattaaa aaaggagaag ccctctggga
2700
aaaaaaaaaa aaaaaaaaaa aaaaa 2725
<210> 88
<211> 660
<212> PRT
<213> Homo sapiens
<400> 88
Met Asp Asp Trp Lys Pro Ser Pro Leu Ile Lys Pro Phe Gly Ala 15 10 15
Arg Lys Lys Arg Ser Trp Tyr Leu Thr Trp Lys Tyr Lys Leu Thr 20 25 30
Asn Gin Arg Ala Leu Arg Arg Phe Cys Gin Thr Gly Ala Val Leu 35 40 45
Phe Leu Leu Val Thr Val Ile Val Asn Ile Lys Leu Ile Leu Asp 50 55 60
Thr Arg Arg Ala Ile Ser Glu Ala Asn Glu Asp Pro Glu Pro Glu
65
581
70
75
Gin Asp Tyr Asp Glu Ala Leu Gly Arg Leu Glu Pro Pro Arg Arg 80 85 90
Arg Gly Ser Gly Pro Arg Arg Val Leu Asp Val Glu Val Tyr Ser 95 100 105
Ser Arg Ser Lys Val Tyr Val Ala Val Asp Gly Thr Thr Val Leu 110 115 120
Glu Asp Glu Ala Arg Glu Gin Gly Arg Gly Ile His Val Ile Val 125 130 135
Leu Asn Gin Ala Thr Gly His Val Met Ala Lys Arg Val Phe-Asp 140 145 150
Thr Tyr Ser Pro His Glu Asp Glu Ala Met Val Leu Phe Leu Asn 155 160 165
Met Val Ala Pro Gly Arg Val Leu Ile Cys Thr Val Lys Asp Glu 170 175 180
Gly Ser Phe His Leu Lys Asp Thr Ala Lys Ala Leu Leu Arg Ser 185 190 195
Leu Gly Ser Gin Ala Gly Pro Ala Leu Gly Trp Arg Asp Thr Trp 200 205 210
Ala Phe Val Gly Arg Lys Gly Gly Pro Val Phe Gly Glu Lys His 215 220 225
Ser Lys Ser Pro Ala Leu Ser Ser Trp Gly Asp Pro Val Leu Leu 230 235 240
Lys Thr Asp Val Pro Leu Ser Ser Ala Glu Glu Ala Glu Cys His 245 250 255
Trp Ala Asp Thr Glu Leu Asn Arg Arg Arg Arg Arg Phe Cys Ser 260 265 270
Lys Val Glu Gly Tyr Gly Ser Val Cys Ser Cys Lys Asp Pro Thr 275 280 285
Pro Ile Glu Phe Ser Pro Asp Pro Leu Pro Asp Asn Lys Val Leu 290 295 300
Asn Val Pro Val Ala Val Ile Ala Gly Asn Arg Pro Asn Tyr Leu 305 310 315
Tyr Arg Met Leu Arg Ser Leu Leu Ser Ala Gin Gly Val Ser Pro 320 325 330
Gin Met Ile Thr Val Phe Ile Asp Gly Tyr Tyr Glu Glu Pro Met 335 340 345
Asp Val Val Ala Leu Phe Gly Leu Arg Gly Ile Gin His Thr Pro 350 355 360
Ile Ser Ile Lys Asn Ala Arg Val Ser Gin His Tyr Lys Ala Ser 365 370 375
582
Leu Thr Ala Thr Phe Asn Leu Phe Pro Glu Ala Lys Phe Ala Val 380 385 390
Val Leu Glu Glu Asp Leu Asp Ile Ala Val Asp Phe Phe Ser Phe 395 400 405
Leu Ser Gin Ser Ile His Leu Leu Glu Glu Asp Asp Ser Leu Tyr 410 415 420
Cys Ile Ser Ala Trp Asn Asp Gin Gly Tyr Glu His Thr Ala Glu 425 430 435
Asp Pro Ala Leu Leu Tyr Arg Val Glu Thr Met Pro Gly Leu Gly 440 445 450
Trp Val Leu Arg Arg Ser Leu Tyr Lys Glu Glu Leu Glu Pro Lys 455 460 465
Trp Pro Thr Pro Glu Lys Leu Trp Asp Trp Asp Met Trp Met Arg 470 475 480
Met Pro Glu Gin Arg Arg Gly Arg' Glu Cys lie Ile Pro Asp Val 485 490 495
Ser Arg Ser Tyr His Phe Gly Ile Val Gly Leu Asn Met Asn Gly 500 505 510
Tyr Phe His Glu Ala Tyr Phe Lys Lys His Lys Phe Asn Thr Val 515 520 525
Pro Gly Val Gin Leu Arg Asn Val Asp Ser Leu Lys Lys Glu Ala 530 535 540
Tyr Glu Val Glu Val His Arg Leu Leu Ser Glu Ala Glu Val Leu 545 550 555
Asp His Ser Lys Asn Pro Cys Glu Asp Ser Phe Leu Pro Asp Thr 560 565 570
Glu Gly His Thr Tyr Val Ala Phe Ile Arg Met Glu Lys Asp Asp 575 580 585
Asp Phe Thr Thr Trp Thr Gin Leu Ala Lys Cys Leu His Ile Trp 590 595 600
Asp Leu Asp Val Arg Gly Asn His Arg Gly Leu Trp Arg Leu Phe 605 610 615
Arg Lys Lys Asn His Phe Leu Val Val Gly Val Pro Ala Ser Pro 620 625 630
Tyr Ser Val Lys Lys Pro Pro Ser Val Thr Pro Ile Phe Leu Glu 635 640 645
Pro Pro Pro Lys Glu Glu Gly Ala Pro Gly Ala Pro Glu Gin Thr 650 655 660
<210> 89 <211> 25 <212> DNA
<213> Artificial Sequence
583
<220>
<223> Synthetic oligonucleotide probe
<400> 89 gatggcaaaa cgtgtgtttg acacg 25
<210> 90 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 90 cctcaaccag gccacgggcc ac 22
<210> 91 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 91 cccaggcaga gatgcagtac aggc 24
<210> 92 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 92 cctccagtag gtggatggat tggctc 26
<210> 93 <211> 47 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 93
ctcacctcat gaggatgagg ccatggtgct attcctcaac atggtag 47
<210> 94
<211> 3037
<212> DNA
<213> Homo sapiens
<400> 94
cggacgcgtg ggctgctggt gggaaggcct aaagaactgg aaagcccact 50 ctcttggaac caccacacct gtttaaagaa cctaagcacc atttaaagcc 100
actggaaatt tgttgtctag tggttgtggg tgaataaagg agggcagaat 150
584
ggatgatttc atctccatta gcctgctgtc tctggctatg ttggtgggat
200
gttacgtggc cggaatcatt cccttggctg ttaatttctc agaggaacga
250
ctgaagctgg tgactgtttt gggtgctggc cttctctgtg gaactgctct
300
ggcagtcatc gtgcctgaag gagtacatgc cctttatgaa gatattcttg
350
agggaaaaca ccaccaagca agtgaaacac ataatgtgat tgcatcagac
400
aaagcagcag aaaaatcagt tgtccatgaa catgagcaca gccacgacca
450
cacacagctg catgcctata ttggtgtttc cctcgttctg ggcttcgttt
500
tcatgttgct ggtggaccag attggtaact cccatgtgca ttctactgac
550
gatccagaag cagcaaggtc tagcaattcc aaaatcacca ccacgctggg
600
tctggttgtc catgctgcag ctgatggtgt tgctttggga gcagcagcat
650
ctacttcaca gaccagtgtc cagttaattg tgtttgtggc aatcatgcta
700
cataaggcac cagctgcttt tggactggtt tccttcttga tgcatgctgg
750
cttagagcgg aatcgaatca gaaagcactt gctggtcttt gcattggcag
800
caccagttat gtccatggtg acatacttag gactgagtaa gagcagtaaa
850
gaagcccttt cgggacattt cagaggtgaa ctttatgttg cgccacggga ccacagtaca gtggccatgc tgtcctccct ttttctctgc gaggtgggcg
900 950
gaatagggca cagccacaag cccgatgcca cgggagggag aggcctcagc
1000
cgcctggaag tggcagccct ggttctgggt tgcctcatcc ctctcatcct
1050
gtcagtagga caccagcatt aaatgttcaa ggtccagcct tggtccaggg
1100
ccgtttgcca tccagtgaga acagccggca cgtgacagct actcacttcc
1150
tcagtctctt gtctcacctt gcgcatctct acatgtattc ctagagtcca
1200
gaggggaggt gaggttaaaa cctgagtaat ggaaaagctt ttagagtaga
1250
aacacattta cgttgcagtt agctatagac atcccattgt gttatctttt
1300
aaaaggccct tgacattttg cgttttaata tttctcttaa ccctattctc
1350
agggaagatg gaatttagtt ttaaggaaaa gaggagaact tcatactcac
1400
aatgaaatag tgattatgaa aatacagtgt tctgtaatta agctatgtct
1450
ctttcttctt agtttagagg ctctgctact ttatccattg atttttaaca
1500
tggttcccac catgtaagac tggtgcttta gcatctatgc cacatgcgtt
1550
gatggaaggt catagcaccc actcacttag atgctaaagg tgattctagt
1600
taatctggga ttagggtcag gaaaatgata gcaagacaca ttgaaagctc
1650
tctttatact caaaagagat atccattgaa aagggatgtc tagagggatt
1700
taaacagctc ctttggcacg tgcctctctg aatccagcct gccattccat
1750
585
caaatggagc aggagaggtg ggaggagctt ctaaagaggt gactggtatt
1800
ttgtagcatt ccttgtcaag ttctcctttg cagaatacct gtctccacat
1850
tcctagagag gagccaagtt ctagtagttt cagttctagg ctttccttca
1900
agaacagtca gatcacaaag tgtctttgga aattaaggga tattaaattt
1950
taagtgattt ttggatggtt attgatatct ttgtagtagc tttttttaaa
2000
agactaccaa aatgtatggt tgtccttttt ttttgttttt ttttttttta
2050
attatttctc ttagcagatc agcaatccct ctagggacct aaatactagg
2100
tcagctttgg cgacactgtg tcttctcaca taaccacctg tagcaagatg
2150
gatcataaat gagaagtgtt tgcctattga tttaaagctt attggaatca
2200
tgtctcttgt ctcttcgtct tttctttgct tttcttctaa cttttccctc
2250
tagcctctcc tcgccacaat ttgctgctta ctgctggtgt taatatttgt
2300
gtgggatgaa ttcttatcag gacaaccact tctcgaactg taataatgaa
2350
gataataata tctttattct ttatcccctt caaagaaatt acctttgtgt
2400
caaatgccgc tttgttgagc ccttaaaata ccacctcctc atgtgtaaat
2450
tgacacaatc ttaacagact actaatctgg aggataattt taatttaaac ttttttcata aattgagata tttgccaaaa gcaaaagtgt tttttgtaaa
2500 2550
ccctgtcttg tcaaataagt gtataatatt gtattattaa tttattttta
2600
ctttctatac catttcaaaa cacattacac taagggggaa ccaagactag
2650
tttcttcagg gcagtggacg tagtagtttg taaaaacgtt ttctatgacg
2700
cataagctag catgcctatg atttatttcc ttcatgaatt tgtcactgga
2750
tcagcagctg tggaaataaa gcttgtgagc cctctgctgg ccacagtgag
2800
gaaagtagca caaataggat acagttgtat gtagtcattg gcaacaattg
2850
catacaattt tactaccaag agaaggtata gtatggaaag tccaaatgac
2900
ttccttgatt ggatgttaac agctgactgg tgtgagactt gaggtttcat
2950
ctagtccttc aaaactatat ggttgcctag attctctctg gaaactgact
3000
ttgtcaaata aatagcagat tgtagtgtca aaaaaaa 3 037
<210> 95 <211> 307 <212> PRT
<213> Homo sapiens <400> 95
Met Asp Asp Phe Ile Ser Ile Ser Leu Leu Ser Leu Ala Met Leu 1 5 10 15
Val Gly Cys Tyr Val Ala Gly Ile Ile Pro Leu Ala Val Asn Phe
586
Ser Glu Glu Arg Leu Lys Leu Val Thr Val Leu Gly Ala Gly Leu
40 45
Leu Cys Gly Thr Ala Leu Ala Val Ile Val Pro Glu Gly Val His
50 55 60
Ala Leu Tyr Glu Asp Ile Leu Glu Gly Lys His His Gin Ala Ser
65 70 75
Glu Thr His Asn Val Ile Ala Ser Asp Lys Ala Ala Glu Lys Ser
80 85 90
Val Val His Glu His Glu His Ser His Asp His Thr Gin Leu His
95 100 105
Ala Tyr Ile Gly Val Ser Leu Val Leu Gly Phe Val Phe Met Leu
110 115 120
Leu Val Asp Gin Ile Gly Asn Ser His Val His Ser Thr Asp Asp
125 130 135
Pro Glu Ala Ala Arg Ser Ser Asn Ser Lys Ile Thr Thr Thr Leu
140 145 150
Gly Leu Val Val His Ala Ala Ala Asp Gly Val Ala Leu Gly Ala
155 160 165
Ala Ala Ser Thr Ser Gin Thr Ser Val Gin Leu Ile Val Phe Val
170 175 180
Ala Ile Met Leu His Lys Ala Pro Ala Ala Phe Gly Leu Val Ser
185 190 195
Phe Leu Met His Ala Gly Leu Glu Arg Asn Arg Ile Arg Lys His
200 205 210
Leu Leu Val Phe Ala Leu Ala Ala Pro Val Met Ser Met Val Thr
215 220 225
Tyr Leu Gly Leu Ser Lys Ser Ser Lys Glu Ala Leu Ser Glu Val
230 235 240
Asn Ala Thr Gly Val Ala Met Leu Phe Ser Ala Gly Thr Phe Leu
245 250 255
Tyr Val Ala Thr Val His Val Leu Pro Glu Val Gly Gly Ile Gly
260 265 270
His Ser His Lys Pro Asp Ala Thr Gly Gly Arg Gly Leu Ser Arg
275 280 285
Leu Glu Val Ala Ala Leu Val Leu Gly Cys Leu Ile Pro Leu Ile
290 295 300
Leu Ser Val Gly His Gin His 305
<210> 96 <211> 25 <212> DNA
<213> Artificial Sequence
587
<22 0>
<223> Synthetic oligonucleotide probe
<400> 96 gttgtgggtg aataaaggag ggcag 25
<210> 97 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 97 ctgtgctcat gttcatggac aactg 25
<210> 98 <211> 50 <212> DNA
<213> Artificial Sequence <22 0>
<223> Synthetic oligonucleotide probe <400> 98
ggatgatttc atctccatta gcctgctgtc tctggctatg ttggtgggat 50
<210> 99
<211> 1429
<212> DNA
<213> Homo sapiens
<400> 99
gctcgaggcc ggcggcggcg ggagagcgac ccgggcggcc tcgtagcggg
50
gccccggatc cccgagtggc ggccggagcc tcgaaaagag attctcagcg
100
ctgattttga gatgatgggc ttgggaaacg ggcgtcgcag catgaagtcg
150
ccgcccctcg tgctggccgc cctggtggcc tgcatcatcg tcttgggctt
200
caactactgg attgcgagct cccggagcgt ggacctccag acacggatca
250
tggagctgga aggcagggtc cgcagggcgg ctgcagagag aggcgccgtg
300
gagctgaaga agaacgagtt ccagggagag ctggagaagc agcgggagca
350
gcttgacaaa atccagtcca gccacaactt ccagctggag agcgtcaaca
400
agctgtacca ggacgaaaag gcggttttgg tgaataacat caccacaggt
450
gagaggctca tccgagtgct gcaagaccag ttaaagaccc tgcagaggaa
500
ttacggcagg ctgcagcagg atgtcctcca gtttcagaag aaccagacca
550
acctggagag gaagttctcc tacgacctga gccagtgcat caatcagatg
600
aaggaggtga aggaacagtg tgaggagcga atagaagagg tcaccaaaaa
650
ggggaatgaa gctgtagctt ccagagacct gagtgaaaac aacgaccaga
700
588
gacagcagct ccaagccctc agtgagcctc agcccaggct gcaggcagca
750
ggcctgccac acacagaggt gccacaaggg aagggaaacg tgcttggtaa
800
cagcaagtcc cagacaccag cccccagttc cgaagtggtt ttggattcaa
850
agagacaagt tgagaaagag gaaaccaatg agatccaggt ggtgaatgag
900
gagcctcaga gggacaggct gccgcaggag ccaggccggg agcaggtggt
950
ggaagacaga cctgtaggtg gaagaggctt cgggggagcc ggagaactgg
1000
gccagacccc acaggtgcag gctgccctgt cagtgagcca ggaaaatcca
1050
gagatggagg gccctgagcg agaccagctt gtcatccccg acggacagga
1100
ggaggagcag gaagctgccg gggaagggag aaaccagcag aaactgagag
1150
gagaagatga ctacaacatg gatgaaaatg aagcagaatc tgagacagac
1200
aagcaagcag ccctggcagg gaatgacaga aacatagatg tttttaatgt
1250
tgaagatcag aaaagagaca ccataaattt acttgatcag cgtgaaaagc
1300
ggaatcatac actctgaatt gaactggaat cacatatttc acaacagggc
1350
cgaagagatg actataaaat gttcatgagg gactgaatac tgaaaactgt
1400
gaaatgtact aaataaaatg tacatctga :
1429
<210> 100 <211> 401 <212> PRT <213> Homo sapiens
<400> 100
Met Met Gly Leu Gly Asn Gly Arg Arg Ser Met Lys Ser Pro Pro
1 5 10 15
Leu Val Leu Ala Ala Leu Val Ala Cys Ile Ile Val Leu Gly Phe
25 30
Asn Tyr Trp Ile Ala Ser Ser Arg Ser Val Asp Leu Gin Thr Arg
40 45
Ile Met Glu Leu Glu Gly Arg Val Arg Arg Ala Ala Ala Glu Arg
50 55 60
Gly Ala Val Glu Leu Lys Lys Asn Glu Phe Gin Gly Glu Leu Glu
65 70 75
Lys Gin Arg Glu Gin Leu Asp Lys Ile Gin Ser Ser His Asn Phe
80 85 90
Gin Leu Glu Ser Val Asn Lys Leu Tyr Gin Asp Glu Lys Ala Val
95 100 105
Leu Val Asn Asn Ile Thr Thr Gly Glu Arg Leu Ile Arg Val Leu
110 115 120
Gin Asp Gin Leu Lys Thr Leu Gin Arg Asn Tyr Gly Arg Leu Gin
589
125 130 135
Gin Asp Val Leu Gin Phe Gin Lys Asn Gin Thr Asn Leu Glu Arg 140 145 150
Lys Phe Ser Tyr Asp Leu Ser Gin Cys Ile Asn Gin Met Lys Glu 155 160 165
Val Lys Glu Gin Cys Glu Glu Arg Ile Glu Glu Val Thr Lys Lys 170 175 180
Gly Asn Glu Ala Val Ala Ser Arg Asp Leu Ser Glu Asn Asn Asp 185 190 195
Gin Arg Gin Gin Leu Gin Ala Leu Ser Glu Pro Gin Pro Arg Leu 200 205 210
Gin Ala Ala Gly Leu Pro His Thr Glu Val Pro Gin Gly Lys Gly 215 220 225
Asn Val Leu Gly Asn Ser Lys Ser Gin Thr Pro Ala Pro Ser Ser 230 235 240
Glu Val Val Leu Asp Ser Lys Arg Gin Val Glu Lys Glu Glu Thr 245 250 255
Asn Glu Ile Gin Val Val Asn Glu Glu Pro Gin Arg Asp Arg Leu 260 265 270
Pro Gin Glu Pro Gly Arg Glu Gin Val Val Glu Asp Arg Pro Val 275 280 285
Gly Gly Arg Gly Phe Gly Gly Ala Gly Glu Leu Gly Gin Thr Pro
290 295 300
Gin Val Gin Ala Ala Leu Ser Val Ser Gin Glu Asn Pro Glu Met
305 310 315
Glu Gly Pro Glu Arg Asp Gin Leu Val Ile Pro Asp Gly Gin Glu 320 325 330
Glu Glu Gin Glu Ala Ala Gly Glu Gly Arg Asn Gin Gin Lys Leu 335 340 345
Arg Gly Glu Asp Asp Tyr Asn Met Asp Glu Asn Glu Ala Glu Ser 350 355 360
Glu Thr Asp Lys Gin Ala Ala Leu Ala Gly Asn Asp Arg Asn Ile 365 370 375
Asp Val Phe Asn Val Glu Asp Gin Lys Arg Asp Thr Ile Asn Leu 380 385 390
Leu Asp Gin Arg Glu Lys Arg Asn His Thr Leu 395 400
<210> 101
<211> 3671
<212> DNA
<213> Homo sapiens
<400> 101
ggatgcagaa agcctcagtg ttgctcttcc tggcctgggt ctgcttcctc 50
590
ttctacgctg gcattgccct cttcaccagt ggcttcctgc tcacccgttt
100
ggagctcacc aaccatagca gctgccaaga gcccccaggc cctgggtccc
150
tgccatgggg gagccaaggg aaacctgggg cctgctggat ggcttcccga
200
ttttcgcggg ttgtgttggt gctgatagat gctctgcgat ttgacttcgc
250
ccagccccag cattcacacg tgcctagaga gcctcctgtc tccctaccct
300
tcctgggcaa actaagctcc ttgcagagga tcctggagat tcagccccac
350
catgcccggc tctaccgatc tcaggttgac cctcctacca ccaccatgca
400
gcgcctcaag gccctcacca ctggctcact gcctaccttt attgatgctg
450
gtagtaactt cgccagccac gccatagtgg aagacaatct cattaagcag
500
ctcaccagtg caggaaggcg tgtagtcttc atgggagatg atacctggaa
550
agaccttttc cctggtgctt tctccaaagc tttcttcttc ccatccttca
600
atgtcagaga cctagacaca gtggacaatg gcatcctgga acacctctac
650
cccaccatgg acagtggtga atgggacgtg ctgattgctc acttcctggg
700
tgtggaccac tgtggccaca agcatggccc tcaccaccct gaaatggcca
750
agaaacttag ccagatggac caggtgatcc agggacttgt ggagcgtctg
800
gagaatgaca cactgctggt agtggctggg gaccatggga tgaccacaaa
850
tggagaccat ggaggggaca gtgagctgga ggtctcagct gctctctttc
900
tgtatagccc cacagcagtc ttccccagca ccccaccaga ggagccagag
950
gtgattcctc aagttagcct tgtgcccacg ctggccctgc tgctgggcct
1000
gcccatccca tttgggaata tcggggaagt gatggctgag ctattctcag
1050
ggggtgagga ctcccagccc cactcctctg ctttagccca agcctcagct
1100
ctccatctca atgctcagca ggtgtcccga tttcttcata cctactcagc
1150
tgctactcag gaccttcaag ctaaggagct tcatcagctg cagaacctct
1200
tctccaaggc ctctgctgac taccagtggc ttctccagag ccccaagggg
1250
gctgaggcga cactgccgac tgtgattgct gagctgcagc agttcctgcg
1300
gggagctcgg gccatgtgca tcgagtcttg ggctcgtttc tctctggtcc
1350
gcatggcggg gggtactgct ctcttggctg cttcctgctt tatctgcctg
1400
ctggcatctc agtgggcaat atccccaggc tttccattct gccctctact
1450
cctgacacct gtggcctggg gcctggttgg ggccatagcg tatgctggac
1500
tcctgggaac tattgagctg aagctagatc tagtgcttct aggggctgtg
1550
gctgcagtga gctcattcct cccttttctg tggaaagcct gggctggctg
1600
591
ggggtccaag aggcccctgg caaccctgtt tcccatccct gggcccgtcc
1650
tgttactcct gctgtttcgc ttggctgtgt tcttctctga tagttttgtt
1700
gtagctgagg ccagggccac ccccttcctt ttgggctcat tcatcctgct
1750
cctggttgtc cagcttcact gggagggcca gctgcttcca cctaagctac
1800
tcacaatgcc ccgccttggc acttcagcca caacaaaccc cccacggcac
1850
aatggtgcat atgccctgag gcttggaatt gggttgcttt tatgtacaag
1900
gctagctggg ctttttcatc gttgccctga agagacacct gtttgccact
1950
cctctccctg gctgagtcct ctggcatcca tggtgggtgg tcgagccaag
2000
aatttatggt atggagcttg tgtggcggcg ctggtggccc tgttagctgc
2050
cgtgcgcttg tggcttcgcc gctatggtaa tctcaagagc cccgagccac
2100
ccatgctctt tgtgcgctgg ggactgcccc taatggcatt gggtactgct
2150
gcctactggg cattggcgtc gggggcagat gaggctcccc cccgtctccg
2200
ggtcctggtc tctggggcat ccatggtgct gcctcgggct gtagcagggc
2250
tggctgcttc agggctcgcg ctgctgctct ggaagcctgt gacagtgctg
2300
gtgaaggctg gggcaggcgc tccaaggacc aggactgtcc tcactccctt
2350
ctcaggcccc cccacttctc aagctgactt ggattatgtg gtccctcaaa
2400
tctaccgaca catgcaggag gagttccggg gccggttaga gaggaccaaa
2450
tctcagggtc agctgctatg ccctgactgt gtcacagccc ggctgcttat tcaccctgtt cagttgggga ggccttccca gtgtctactc cttctgctgt
2500 2550
tgcatgcgga gcgcatcagc cttgtgttcc tgcttctgtt tctgcagagc
2600
ttccttctcc tacatctgct tgctgctggg atacccgtca ccacccctgg
2650
tccttttact gtgccatggc aggcagtctc ggcttgggcc ctcatggcca
2700
cacagacctt ctactccaca ggccaccagc ctgtctttcc agccatccat
2750
tggcatgcag ccttcgtggg attcccagag ggtcatggct cctgtacttg
2800
gctgcctgct ttgctagtgg gagccaacac ctttgcctcc cacctcctct
2850
ttgcagtagg ttgcccactg ctcctgctct ggcctttcct gtgtgagagt
2900
caagggctgc ggaagagaca gcagccccca gggaatgaag ctgatgccag
2950
agtcagaccc gaggaggaag aggagccact gatggagatg cggctccggg
3000
atgcgcctca gcacttctat gcagcactgc tgcagctggg cctcaagtac
3050
ctctttatcc ttggtattca gattctggcc tgtgccttgg cagcctccat
3100
ccttcgcagg catctcatgg tctggaaagt gtttgcccct aagttcatat
3150
592
ttgaggctgt gggcttcatt gtgagcagcg tgggacttct cctgggcata
3200
gctttggtga tgagagtgga tggtgctgtg agctcctggt tcaggcagct
3250
atttctggcc cagcagaggt agcctagtct gtgattactg gcacttggct
3300
acagagagtg ctggagaaca gtgtagcctg gcctgtacag gtactggatg
3350
atctgcaaga caggctcagc catactctta ctatcatgca gccaggggcc
3400
gctgacatct aggacttcat tattctataa ttcaggacca cagtggagta
3450
tgatccctaa ctcctgattt ggatgcatct gagggacaag gggggcggtc
3500
tccgaagtgg aataaaatag gccgggcgtg gtgacttgca cctataatcc
3550
cagcactttg ggaggcagag gtgggaggat tgcttggtcc caggagttca
3600
agaccagcct gtggaacata acaagacccc gtctctacta tttaaaaaaa
3650
agtgtaataa aatgataata t 3671
<210> 102 <211> 1089 <212> PRT
<213> Homo sapiens <400> 102
Met Gin Lys Ala Ser Val Leu Leu Phe Leu Ala Trp Val Cys Phe 15 10 15
Leu Phe Tyr Ala Gly lie Ala Leu Phe Thr Ser Gly Phe Leu Leu 20 25 30
Thr Arg Leu Glu Leu Thr Asn His Ser Ser Cys Gin Glu Pro Pro 35 40 45
Gly Pro Gly Ser Leu Pro Trp Gly Ser Gin Gly Lys Pro Gly Ala 50 55 60
Cys Trp Met Ala Ser Arg Phe Ser Arg Val Val Leu Val Leu Ile 65 70 75
Asp Ala Leu Arg Phe Asp Phe Ala Gin Pro Gin His Ser His Val 80 85 90
Pro Arg Glu Pro Pro Val Ser Leu Pro Phe Leu Gly Lys Leu Ser 95 100 105
Ser Leu Gin Arg Ile Leu Glu Ile Gin Pro His His Ala Arg Leu 110 115 120
Tyr Arg Ser Gin Val Asp Pro Pro Thr Thr Thr Met Gin Arg Leu 125 130 135
Lys Ala Leu Thr Thr Gly Ser Leu Pro Thr Phe Ile Asp Ala Gly 140 145 150
Ser Asn Phe Ala Ser His Ala Ile Val Glu Asp Asn Leu Ile Lys 155 160 165
Gin Leu Thr Ser Ala Gly Arg Arg Val Val Phe Met Gly Asp Asp
170
593
175
180
Thr Trp Lys Asp Leu Phe Pro Gly Ala Phe Ser Lys Ala Phe Phe 185 190 195
Phe Pro Ser Phe Asn Val Arg Asp Leu Asp Thr Val Asp Asn Gly 200 205 210
Ile Leu Glu His Leu Tyr Pro Thr Met Asp Ser Gly Glu Trp Asp 215 220 225
Val Leu Ile Ala His Phe Leu Gly Val Asp His Cys Gly His Lys 230 235 240
His Gly Pro His His Pro Glu Met Ala Lys Lys Leu Ser Gin Met 245 250 255
Asp Gin Val Ile Gin Gly Leu Val Glu Arg Leu Glu Asn Asp Thr 260 265 270
Leu Leu Val Val Ala Gly Asp His Gly Met Thr Thr Asn Gly Asp 275 280 285
His Gly Gly Asp Ser Glu Leu Glu Val Ser Ala Ala Leu Phe Leu 290 295 300
Tyr Ser Pro Thr Ala Val Phe Pro Ser Thr Pro Pro Glu Glu Pro 305 310 315
Glu Val Ile Pro Gin Val Ser Leu Val Pro Thr Leu Ala Leu Leu 320 325 330
Leu Gly Leu Pro Ile Pro Phe Gly Asn Ile Gly Glu Val Met Ala 335 340 345
Glu Leu Phe Ser Gly Gly Glu Asp Ser Gin Pro His Ser Ser Ala 350 355 360
Leu Ala Gin Ala Ser Ala Leu His Leu Asn Ala Gin Gin Val Ser 365 370 375
Arg Phe Leu His Thr Tyr Ser Ala Ala Thr Gin Asp Leu Gin Ala 380 385 390
Lys Glu Leu His Gin Leu Gin Asn Leu Phe Ser Lys Ala Ser Ala 395 400 405
Asp Tyr Gin Trp Leu Leu Gin Ser Pro Lys Gly Ala Glu Ala Thr 410 415 420
Leu Pro Thr Val Ile Ala Glu Leu Gin Gin Phe Leu Arg Gly Ala 425 430 435
Arg Ala Met Cys Ile Glu Ser Trp Ala Arg Phe Ser Leu Val Arg 440 445 450
Met Ala Gly Gly Thr Ala Leu Leu Ala Ala Ser Cys Phe Ile Cys 455 460 465
Leu Leu Ala Ser Gin Trp Ala Ile Ser Pro Gly Phe Pro Phe Cys 470 475 480
594
Pro Leu Leu Leu Thr Pro Val Ala Trp Gly Leu Val Gly Ala Ile 485 490 495
Ala Tyr Ala Gly Leu Leu Gly Thr Ile Glu Leu Lys Leu Asp Leu 500 505 510
Val Leu Leu Gly Ala Val Ala Ala Val Ser Ser Phe Leu Pro Phe 515 520 525
Leu Trp Lys Ala Trp Ala Gly Trp Gly Ser Lys Arg Pro Leu Ala 530 535 540
Thr Leu Phe Pro Ile Pro Gly Pro Val Leu Leu Leu Leu Leu Phe 545 550 555
Arg Leu Ala Val Phe Phe Ser Asp Ser Phe Val Val Ala Glu Ala 560 565 570
Arg Ala Thr Pro Phe Leu Leu Gly Ser Phe Ile Leu Leu Leu Val 575 580 585
Val Gin Leu His Trp Glu Gly Gin Leu Leu Pro Pro Lys Leu Leu 590 595 600
Thr Met Pro Arg Leu Gly Thr Ser Ala Thr Thr Asn Pro Pro Arg 605 610 615
His Asn Gly Ala Tyr Ala Leu Arg Leu Gly Ile Gly Leu Leu Leu 620 625 630
Cys Thr Arg Leu Ala Gly Leu Phe His Arg Cys Pro Glu Glu Thr 635 640 645
Pro Val Cys His Ser Ser Pro Trp Leu Ser Pro Leu Ala Ser Met 650 655 660
Val Gly Gly Arg Ala Lys Asn Leu Trp Tyr Gly Ala Cys Val Ala 665 670 675
Ala Leu Val Ala Leu Leu Ala Ala Val Arg Leu Trp Leu Arg Arg 680 685 690
Tyr Gly Asn Leu Lys Ser Pro Glu Pro Pro Met Leu Phe Val Arg 695 700 705
Trp Gly Leu Pro Leu Met Ala Leu Gly Thr Ala Ala Tyr Trp Ala 710 715 720
Leu Ala Ser Gly Ala Asp Glu Ala Pro Pro Arg Leu Arg Val Leu 725 730 735
Val Ser Gly Ala Ser Met Val Leu Pro Arg Ala Val Ala Gly Leu 740 745 750
Ala Ala Ser Gly Leu Ala Leu Leu Leu Trp Lys Pro Val Thr Val 755 760 765
Leu Val Lys Ala Gly Ala Gly Ala Pro Arg Thr Arg Thr Val Leu 770 775 780
Thr Pro Phe Ser Gly Pro Pro Thr Ser Gin Ala Asp Leu Asp Tyr 785 790 795
595
Val Val Pro Gin Ile Tyr Arg His Met Gin Glu Glu Phe Arg Gly 800 805 810
Arg Leu Glu Arg Thr Lys Ser Gin Gly Pro Leu Thr Val Ala Ala 815 820 825
Tyr Gin Leu Gly Ser Val Tyr Ser Ala Ala Met Val Thr Ala Leu 830 835 840
Thr Leu Leu Ala Phe Pro Leu Leu Leu Leu His Ala Glu Arg Ile 845 850 855
Ser Leu Val Phe Leu Leu Leu Phe Leu Gin Ser Phe Leu Leu Leu 860 865 870
His Leu Leu Ala Ala Gly Ile Pro Val Thr Thr Pro Gly Pro Phe 875 880 885
Thr Val Pro Trp Gin Ala Val Ser Ala Trp Ala Leu Met Ala Thr 890 895 900
Gin Thr Phe Tyr Ser Thr Gly His Gin Pro Val Phe Pro Ala Ile 905 910 915
His Trp His Ala Ala Phe Val Gly Phe Pro Glu Gly His Gly Ser 920 925 930
Cys Thr Trp Leu Pro Ala Leu Leu Val Gly Ala Asn Thr Phe Ala 935 940 945
Ser His Leu Leu Phe Ala Val Gly Cys Pro Leu Leu Leu Leu Trp 950 955 960
Pro Phe Leu Cys Glu Ser Gin Gly Leu Arg Lys Arg Gin Gin Pro 965 970 975
Pro Gly Asn Glu Ala Asp Ala Arg Val Arg Pro Glu Glu Glu Glu 980 985 990
Glu Pro Leu Met Glu Met Arg Leu Arg Asp Ala Pro Gin His Phe 995 1000 1005
Tyr Ala Ala Leu Leu Gin Leu Gly Leu Lys Tyr Leu Phe Ile Leu 1010 1015 1020
Gly Ile Gin Ile Leu Ala Cys Ala Leu Ala Ala Ser Ile Leu Arg 1025 1030 1035
Arg His Leu Met Val Trp Lys Val Phe Ala Pro Lys Phe Ile Phe 1040 . 1045 1050
Glu Ala Val Gly Phe Ile Val Ser Ser Val Gly Leu Leu Leu Gly 1055 1060 1065
Ile Ala Leu Val Met Arg Val Asp Gly Ala Val Ser Ser Trp Phe 1070 1075 1080
Arg Gin Leu Phe Leu Ala Gin Gin Arg 1085
<210> 103
596
<211> 1743
<212> DNA
<213> Homo sapiens
<400> 103
tgccgctgcc gccgctgctg ctgttgctcc tggcggcgcc ttggggacgg
50
gcagttccct gtgtctctgg tggtttgcct aaacctgcaa acatcacctt
100
cttatccatc aacatgaaga atgtcctaca atggactcca ccagagggtc
150
ttcaaggagt taaagttact tacactgtgc agtatttcat cacaaattgg
200
cccaccagag gtggcactga ctacagatga gaagtccatt tctgttgtcc
250
tgacagctcc agagaagtgg aagagaaatc cagaagacct tcctgtttcc
300
atgcaacaaa tatactccaa tctgaagtat aacgtgtctg tgttgaatac
350
taaatcaaac agaacgtggt cccagtgtgt gaccaaccac acgctggtgc
400
tcacctggct ggagccgaac actctttact gcgtacacgt ggagtccttc
450
gtcccagggc cccctcgccg tgctcagcct tctgagaagc agtgtgccag
500
gactttgaaa gatcaatcat cagagttcaa ggctaaaatc atcttctggt
550
atgttttgcc catatctatt accgtgtttc ttttttctgt gatgggctat
600
tccatctacc gatatatcca cgttggcaaa gagaaacacc cagcaaattt
650
gattttgatt tatggaaatg aatttgacaa aagattcttt gtgcctgctg
700
aaaaaatcgt gattaacttt atcaccctca atatctcgga tgattctaaa
750
atttctcatc aggatatgag tttactggga aaaagcagtg atgtatccag
800
ccttaatgat cctcagccca gcgggaacct gaggccccct caggaggaag
850
aggaggtgaa acatttaggg tatgcttcgc atttgatgga aattttttgt
900
gactctgaag aaaacacgga aggtacttct ctcacccagc aagagtccct
950
cagcagaaca atacccccgg ataaaacagt cattgaatat gaatatgatg
1000
tcagaaccac tgacatttgt gcggggcctg aagagcagga gctcagtttg
1050
caggaggagg tgtccacaca aggaacatta ttggagtcgc aggcagcgtt
1100
ggcagtcttg ggcccgcaaa cgttacagta ctcatacacc cctcagctcc
1150
aagacttaga ccccctggcg caggagcaca cagactcgga ggaggggccg
1200
gaggaagagc catcgacgac cctggtcgac tgggatcccc aaactggcag
1250
gctgtgtatt ccttcgctgt ccagcttcga ccaggattca gagggctgcg
1300
agccttctga gggggatggg ctcggagagg agggtcttct atctagactc
1350
tatgaggagc cggctccaga caggccacca ggagaaaatg aaacctatct
1400
catgcaattc atggaggaat gggggttata tgtgcagatg gaaaactgat
1450
597
gccaacactt ccttttgcct tttgtttcct gtgcaaacaa gtgagtcacc 1500
cctttgatcc cagccataaa gtacctggga tgaaagaagt tttttccagt 1550
ttgtcagtgt ctgtgagaat tacttatttc ttttctctat tctcatagca 1600
cgtgtgtgat tggttcatgc atgtaggtct cttaacaatg atggtgggcc 1650
tctggagtcc aggggctggc cggttgttct atgcagagaa agcagtcaat 1700
aaatgtttgc cagactgggt gcagaattta ttcaggtggg tgt 1743
<210> 104 <211> 442 <212> PRT
<213> Homo sapiens <400> 104
Met Ser Tyr Asn Gly Leu His Gin. Arg Val Phe Lys Glu Leu Lys 15 10 15
Leu Leu Thr Leu Cys Ser Ile Ser Ser Gin Ile Gly Pro Pro Glu 20 25 30
Val Ala Leu Thr Thr Asp Glu Lys Ser Ile Ser Val Val Leu Thr 35 40 45
Ala Pro Glu Lys Trp Lys Arg Asn Pro Glu Asp Leu Pro Val Ser 50 55 60
Met Gin Gin Ile Tyr Ser Asn Leu Lys Tyr Asn Val Ser Val Leu 65 70 75
Asn Thr Lys Ser Asn Arg Thr Trp Ser Gin Cys Val Thr Asn His 80 85 90
Thr Leu Val Leu Thr Trp Leu Glu Pro Asn Thr Leu Tyr Cys Val 95 100 105
His Val Glu Ser Phe Val Pro Gly Pro Pro Arg Arg Ala Gin Pro 110 115 120
Ser Glu Lys Gin Cys Ala Arg Thr Leu Lys Asp Gin Ser Ser Glu 125 130 135
Phe Lys Ala Lys Ile Ile Phe Trp Tyr Val Leu Pro Ile Ser Ile 140 145 150
Thr Val Phe Leu Phe Ser Val Met Gly Tyr Ser Ile Tyr Arg Tyr 155 160 165
Ile His Val Gly Lys Glu Lys His Pro Ala Asn Leu Ile Leu Ile 170 175 180
Tyr Gly Asn Glu Phe Asp Lys Arg Phe Phe Val Pro Ala Glu Lys 185 190 195
Ile Val Ile Asn Phe Ile Thr Leu Asn Ile Ser Asp Asp Ser Lys 200 205 210
Ile Ser His Gin Asp Met Ser Leu Leu Gly Lys Ser Ser Asp Val 215 220 225
598
Ser Ser Leu Asn Asp Pro Gin Pro Ser Gly Asn Leu Arg Pro Pro 230 235 240
Gin Glu Glu Glu Glu Val Lys His Leu Gly Tyr Ala Ser His Leu 245 250 255
Met Glu Ile Phe Cys Asp Ser Glu Glu Asn Thr Glu Gly Thr Ser 260 265 270
Leu Thr Gin Gin Glu Ser Leu Ser Arg Thr Ile Pro Pro Asp Lys 275 280 285
Thr Val Ile Glu Tyr Glu Tyr Asp Val Arg Thr Thr Asp Ile Cys 290 295 300
Ala Gly Pro Glu Glu Gin Glu Leu Ser Leu Gin Glu Glu Val Ser 305 310 315
Thr Gin Gly Thr Leu Leu Glu Ser Gin Ala Ala Leu Ala Val Leu 320 325 330
Gly Pro Gin Thr Leu Gin Tyr Ser Tyr Thr Pro Gin Leu Gin Asp 335 340 345
Leu Asp Pro Leu Ala Gin Glu His Thr Asp Ser Glu Glu Gly Pro 350 355 360
Glu Glu Glu Pro Ser Thr Thr Leu Val Asp Trp Asp Pro Gin Thr 365 370 375
Gly Arg Leu Cys Ile Pro Ser Leu Ser Ser Phe Asp Gin Asp Ser 380 385 390
Glu Gly Cys Glu Pro Ser Glu Gly Asp Gly Leu Gly Glu Glu Gly 395 400 405
Leu Leu Ser Arg Leu Tyr Glu Glu Pro Ala Pro Asp Arg Pro Pro 410 415 420
Gly Glu Asn Glu Thr Tyr Leu Met Gin Phe Met Glu Glu Trp Gly 425 430 435
Leu Tyr Val Gin Met Glu Asn 440
<210> 105
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 105
cgctgctgct gttgctcctg g 21
<210> 106
<211> 18
<212> DNA
<213> Artificial Sequence
599
<220>
<223> Synthetic oligonucleotide probe
<400> 106 cagtgtgcca ggactttg 18
<210> 107 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 107 agtcgcaggc agcgttgg 18
<210> 108 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 108 ctcctccgag tctgtgtgct cctgc 25
<210> 109 <211> 51 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 109
ggacgggcag ttccctgtgt ctctggtggt ttgcctaaac ctgcaaacat 50
c 51
<210> 110
<211> 1114
<212> DNA
<213> Homo sapiens
<400> 110
cggacgcgtg ggcggacgcg tgggcggacg cgtgggtctc tgcggggaga 50 cgccagcctg cgtctgccat ggggctcggg ttgaggggct ggggacgtcc 100 tctgctgact gtggccaccg ccctgatgct gcccgtgaag ccccccgcag 150 gctcctgggg ggcccagatc atcgggggcc acgaggtgac cccccactcc 200 aggccctaca tggcatccgt gcgcttcggg ggccaacatc actgcggagg 250 cttcctgctg cgagcccgct gggtggtctc ggccgcccac tgcttcagcc 300 acagagacct ccgcactggc ctggtggtgc tgggcgccca cgtcctgagt 350 actgcggagc ccacccagca ggtgtttggc atcgatgctc tcaccacgca 400
600
ccccgactac caccccatga cccacgccaa cgacatctgc ctgctgcggc 450
tgaacggctc tgctgtcctg ggccctgcag tggggctgct gaggctgcca 500
gggagaaggg ccaggccccc cacagcgggg acacggtgcc gggtggctgg 550
ctggggcttc gtgtctgact ttgaggagct gccgcctgga ctgatggagg 600
ccaaggtccg agtgctggac ccggacgtct gcaacagctc ctggaagggc 650
cacctgacac ttaccatgct ctgcacccgc agtggggaca gccacagacg 700
gggcttctgc tcggccgact ccggagggcc cctggtgtgc aggaaccggg 750
ctcacggcct cgtttccttc tcgggcctct ggtgcggcga ccccaagacc 800
cccgacgtgt acacgcaggt gtccgccttt gtggcctgga tctgggacgt 850
ggttcggcgg agcagtcccc agcccggccc cctgcctggg accaccaggc 900
ccccaggaga agccgcctga gccacaacct tgcggcatgc aaatgagatg 950
gccgctccag gcctggaatg ttccgtggct gggccccacg ggaagcctga 1000
tgttcagggt tggggtggga cgggcagcgg tggggcacac ccattccaca 1050
tgcaaagggc agaagcaaac ccagtaaaat gttaactgac aaaaaaaaaa 1100
aaaaaaaaaa gaaa 1114
<210> 111 <211> 283 <212> PRT
<213> Homo sapiens <400> 111
Met Gly Leu Gly Leu Arg Gly Trp Gly Arg Pro Leu Leu Thr Val 15 10 15
Ala Thr Ala Leu Met Leu Pro Val Lys Pro Pro Ala Gly Ser Trp 20 25 30
Gly Ala Gin Ile Ile Gly Gly His Glu Val Thr Pro His Ser Arg
40 45 Pro Tyr Met Ala Ser Val Arg Phe Gly Gly Gin His His Cys Gly
50 55 60
Gly Phe Leu Leu Arg Ala Arg Trp Val Val Ser Ala Ala His Cys 65 70 75
Phe Ser His Arg Asp Leu Arg Thr Gly Leu Val Val Leu Gly Ala 80 85 90
His Val Leu Ser Thr Ala Glu Pro Thr Gin Gin Val Phe Gly Ile 95 100 105
Asp Ala Leu Thr Thr His Pro Asp Tyr His Pro Met Thr His Ala 110 115 120
Asn Asp Ile Cys Leu Leu 125
Arg Leu Asn Gly Ser Ala Val Leu Gly 130 135
601
Pro Ala Val Gly Leu Leu Arg Leu Pro Gly Arg Arg Ala Arg Pro 140 145 150
Pro Thr Ala Gly Thr Arg Cys Arg Val Ala Gly Trp Gly Phe Val 155 160 165
Ser Asp Phe Glu Glu Leu Pro Pro Gly Leu Met Glu Ala Lys Val 170 175 180
Arg Val Leu Asp Pro Asp Val Cys Asn Ser Ser Trp Lys Gly His 185 190 195
Leu Thr Leu Thr Met Leu Cys Thr Arg Ser Gly Asp Ser His Arg 200 205 210
Arg Gly Phe Cys Ser Ala Asp Ser Gly Gly Pro L6u Val Cys Arg 215 220 225
Asn Arg Ala His Gly Leu Val Ser Phe Ser Gly Leu Trp Cys Gly 230 235 240
Asp Pro Lys Thr Pro Asp Val Tyr Thr Gin Val Ser Ala Phe Val 245 250 255
Ala Trp Ile Trp Asp Val Val Arg Arg Ser Ser Pro Gin Pro Gly 260 265 270
Pro Leu Pro Gly Thr Thr Arg Pro Pro Gly Glu Ala Ala 275 280
<21.0> 112 <211> 24 <212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 112 gacgtctgca acagctcctg gaag 24
<210> 113 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 113 cgagaaggaa acgaggccgt gag 23
<210> 114 <211> 44 <212> DNA <213> Artificial
Sequence
<220>
<223> Synthetic oligonucleotide probe <400> 114
tgacacttac catgctctgc acccgcagtg gggacagcca caga 44
602
<210> 115 <211> 1808 <212> DNA
<213> Homo sapiens <400> 115
gagctaccca ggcggctggt gtgcagcaag ctccgcgccg actccggacg
50
cctgacgcct gacgcctgtc cccggcccgg catgagccgc tacctgctgc
100
cgctgtcggc gctgggcacg gtagcaggcg ccgccgtgct gctcaaggac
150
tatgtcaccg gtggggcttg ccccagcaag gccaccatcc ctgggaagac
200
ggtcatcgtg acgggcgcca acacaggcat cgggaagcag accgccttgg
250
aactggccag gagaggaggc aacatcatcc tggcctgccg agacatggag
300
aagtgtgagg cggcagcaaa ggacatccgc ggggagaccc tcaatcacca
350
tgtcaacgcc cggcacctgg acttggcttc cctcaagtct atccgagagt
400
ttgcagcaaa gatcattgaa gaggaggagc gagtggacat tctaatcaac
450
aacgcgggtg tgatgcggtg cccccactgg accaccgagg acggcttcga
500
gatgcagttt ggcgttaacc acctgggtca ctttctcttg acaaacttgc
550
tgctggacaa gctgaaagcc tcagcccctt cgcggatcat caacctctcg
600
tccctggccc atgttgctgg gcacatagac tttgacgact tgaactggca
650
gacgaggaag tataacacca aagccgccta ctgccagagc aagctcgcca
700
tcgtcctctt caccaaggag ctgagccggc ggctgcaagg ctctggtgtg
750
actgtcaacg ccctgcaccc cggcgtggcc aggacagagc tgggcagaca
800
cacgggcatc catggctcca ccttctccag caccacactc gggcccatct
850
tctggctgct ggtcaagagc cccgagctgg ccgcccagcc cagcacatac
900
ctggccgtgg cggaggaact ggcggatgtt tccggaaagt acttcgatgg
950
actcaaacag aaggccccgg cccccgaggc tgaggatgag gaggtggccc
1000
ggaggctttg ggctgaaagt gcccgcctgg tgggcttaga ggctccctct
1050
gtgagggagc agcccctccc cagataacct ctggagcaga tttgaaagcc
1100
aggatggcgc ctccagaccg aggacagctg tccgccatgc ccgcagcttc
1150
ctggcactac ctgagccggg agacccagga ctggcggccg ccatgcccgc
1200
agtaggttct agggggcggt gctggccgca gtggactggc ctgcaggtga
1250
gcactgcccc gggctctggc tggttccgtc tgctctgctg ccagcagggg
1300
agaggggcca tctgatgctt cccctgggaa tctaaactgg gaatggccga
1350
ggaggaaggg gctctgtgca cttgcaggcc acgtcaggag agccagcggt
1400
603
gcctgtcggg gagggttcca aggtgctccg tgaagagcat gggcaagttg 1450
tctgacactt ggtggattct tgggtccctg tgggaccttg tgcatgcatg 1500
gtcctctctg agccttggtt tcttcagcag tgagatgctc agaataactg 1550
ctgtctccca tgatggtgtg gtacagcgag ctgttgtctg gctatggcat 1600
ggctgtgccg ggggtgtttg ctgagggctt cctgtgccag agcccagcca 1650
gagagcaggt gcaggtgtca tcccgagttc aggctctgca cggcatggag 1700
tgggaacccc accagctgct gctacaggac ctgggattgc ctgggactcc 1750
caccttccta tcaattctca tggtagtcca aactgcagac tctcaaactt 1800
gctcattt 1808
<210> 116 <211> 331 <212> PRT
<213> Homo sapiens <400> 116
Met Ser Arg Tyr Leu Leu Pro Leu Ser Ala Leu Gly Thr Val Ala 1 5 10 15
Gly Ala Ala Val Leu Leu Lys Asp Tyr Val Thr Gly Gly Ala Cys 20 25 30
Pro Ser Lys Ala Thr Ile Pro Gly Lys Thr Val Ile Val Thr Gly 35 40 45
Ala Asn Thr Gly Ile Gly Lys Gin Thr Ala Leu Glu Leu Ala Arg 50 55 60
Arg Gly Gly Asn Ile Ile Leu Ala Cys Arg Asp Met Glu Lys Cys 65 70 75
Glu Ala Ala Ala Lys Asp Ile Arg Gly Glu Thr Leu Asn His His 80 85 90
Val Asn Ala Arg His Leu Asp Leu Ala Ser Leu Lys Ser Ile Arg 95 100 105
Glu Phe Ala Ala Lys Ile Ile Glu Glu Glu Glu Arg Val Asp Ile
110 115 120
Leu Ile Asn Asn Ala Gly Val Met Arg Cys Pro His Trp Thr Thr
125 130 135
Glu Asp Gly Phe Glu Met Gin Phe Gly Val Asn His Leu Gly His 140 145 150
Phe Leu Leu Thr Asn Leu Leu Leu Asp Lys Leu Lys Ala Ser Ala 155 160 165
Pro Ser Arg Ile Ile Asn Leu Ser Ser Leu Ala His Val Ala Gly 170 175 180
His Ile Asp Phe Asp Asp Leu Asn Trp Gin Thr Arg Lys Tyr Asn 185 190 195
604
Thr Lys Ala Ala Tyr Cys Gin Ser Lys Leu Ala Ile Val Leu Phe 200 205 210
Thr Lys Glu Leu Ser Arg Arg Leu Gin Gly Ser Gly Val Thr Val 215 220 225
Asn Ala Leu His Pro Gly Val Ala Arg Thr Glu Leu Gly Arg His 230 235 240
Thr Gly Ile His Gly Ser Thr Phe Ser Ser Thr Thr Leu Gly Pro 245 250 255
Ile Phe Trp Leu Leu Val Lys Ser Pro Glu Leu Ala Ala Gin Pro 260 265 270
Ser Thr Tyr Leu Ala Val Ala Glu Glu Leu Ala Asp Val Ser Gly 275 280 285
Lys Tyr Phe Asp Gly Leu Lys Gin Lys Ala Pro Ala Pro Glu Ala 290 295 300
Glu Asp Glu Glu Val Ala Arg Arg Leu Trp Ala Glu Ser Ala Arg 305 310 315
Leu Val Gly Leu Glu Ala Pro Ser Val Arg Glu Gin Pro Leu Pro 320 325 330
Arg
<210> 117 <211> 2249 <212> DNA <213> Homo sapiens
<400> 117
gaagttcgcg agcgctggca tgtggtcctg gggcgcggct ggcggcgctg
50
ctggcggtgc tggcgctcgg gacaggagac ccagaaaggg ctgcggctcg
100
gggcgacacg ttctcggcgc tgaccagcgt ggcgcgcgcc ctggcgcccg
150
agcgccggct gctggggctg ctgaggcggt acctgcgcgg ggaggaggcg
200
cggctgcggg acctgactag attctacgac aaggtacttt ctttgcatga
250
ggattcaaca acccctgtgg ctaaccctct gcttgcattt actctcatca
300
aacgcctgca gtctgactgg aggaatgtgg tacatagtct ggaggccagt
350
gagaacatcc gagctctgaa ggatggctat gagaaggtgg agcaagacct
400
tccagccttt gaggaccttg agggagcagc aagggccctg atgcggctgc
450
aggacgtgta catgctcaat gtgaaaggcc tggcccgagg tgtctttcag
500
agagtcactg gctctgccat cactgacctg tacagcccca aacggctctt
550
ttctctcaca ggggatgact gcttccaagt tggcaaggtg gcctatgaca
600
tgggggatta ttaccatgcc attccatggc tggaggaggc tgtcagtctc
650
605
ttccgaggat cttacggaga gtggaagaca gaggatgagg caagtctaga
700
agatgccttg gatcacttgg cctttgctta tttccgggca ggaaatgttt
750
cgtgtgccct cagcctctct cgggagtttc ttctctacag cccagataat
800
aagaggatgg ccaggaatgt cttgaaatat gaaaggctct tggcagagag
850
ccccaaccac gtggtagctg aggctgtcat ccagaggccc aatatacccc
900
acctgcagac cagagacacc tacgaggggc tatgtcagac cctgggttcc
950
cagcccactc tctaccagat ccctagcctc tactgttcct atgagaccaa
1000
ttccaacgcc tacctgctgc tccagcccat ccggaaggag gtcatccacc
1050
tggagcccta cattgctctc taccatgact tcgtcagtga ctcagaggct
1100
cagaaaatta gagaacttgc agaaccatgg ctacagaggt cagtggtggc
1150
atcaggggag aagcagttac aagtggagta ccgcatcagc aaaagtgcct
1200
ggctgaagga cactgttgac ccaaaactgg tgaccctcaa ccaccgcatt
1250
gctgccctca caggccttga tgtccggcct ccctatgcag agtatctgca
1300
ggtggtgaac tatggcatcg gaggacacta tgagcctcac tttgaccatg
1350
ctacgtcacc aagcagcccc ctctacagaa tgaagtcagg aaaccgagtt
1400
gcaacattta tgatctatct gagctcggtg gaagctggag gagccacagc
1450
cttcatctat gccaacctca gcgtgcctgt ggttaggaat gcagcactgt
1500
tttggtggaa cctgcacagg agtggtgaag gggacagtga cacacttcat
1550
gctggctgtc ctgtcctggt gggagataag tgggtggcca acaagtggat
1600
acatgagtat ggacaggaat tccgcagacc ctgcagctcc agccctgaag
1650
actgaactgt tggcagagag aagctggtgg agtcctgtgg ctttccagag
1700
aagccaggag ccaaaagctg gggtaggaga ggagaaagca gagcagcctc
1750
ctggaagaag gccttgtcag ctttgtctgt gcctcgcaaa tcagaggcaa
1800
gggagaggtt gttaccaggg gacactgaga atgtacattt gatctgcccc
1850
agccacggaa ggcctgagag gtcagagtag ggaagtttct gatgcacagt ggagttcaga acaaaggagg tactctctgt ggggagtgga tgggaacagg
1900 1950
acatctcaac agtctcaggt tcgatcagtg ggtcttttgg cactttgaac
2000
cttgaccaca gggaccaaga agtggcaatg aggacacctg caggaggggc
2050
tagcctgact cccagaactt taagactttc tccccactgc cttctgctgc
2100
agcccaagca gggagtgtcc ccctcccaga agcatatccc agatgagtgg
2150
tacattatat aaggattttt tttaagttga aaacaacttt cttttctttt
2200
606
tgtatgatgg ttttttaaca cagtcattaa aaatgtttat aaatcaaaa 2249
<210> 118 <211> 544 <212> PRT
<213> Homo sapiens <400> 118
Met Gly Pro Gly Ala Arg Leu Ala Ala Leu Leu Ala Val Leu Ala
10 15
Leu Gly Thr Gly Asp Pro Glu Arg Ala Ala Ala Arg Gly Asp Thr
25 30
Phe Ser Ala Leu Thr Ser Val Ala Arg Ala Leu Ala Pro Glu Arg
40 45
Arg Leu Leu Gly Leu Leu Arg Arg Tyr Leu Arg Gly Glu Glu Ala
50 55 60
Arg Leu Arg Asp Leu Thr Arg Phe Tyr Asp Lys Val Leu Ser Leu
65 70 75
His Glu Asp Ser Thr Thr Pro Val Ala Asn Pro Leu Leu Ala Phe
80 85 90
Thr Leu Ile Lys Arg Leu Gin Ser Asp Trp Arg Asn Val Val His
95 100 105
Ser Leu Glu Ala Ser Glu Asn Ile Arg Ala Leu Lys Asp Gly Tyr
110 115 120
Glu Lys Val Glu Gin Asp Leu Pro Ala Phe Glu Asp Leu Glu Gly
125 130 135
Ala Ala Arg Ala Leu Met Arg Leu Gin Asp Val Tyr Met Leu Asn
140 145 150
Val Lys Gly Leu Ala Arg Gly Val Phe Gin Arg Val Thr Gly Ser
155 160 165
Ala Ile Thr Asp Leu Tyr Ser Pro Lys Arg Leu Phe Ser Leu Thr
170 175 180
Gly Asp Asp Cys Phe Gin Val Gly Lys Val Ala Tyr Asp Met Gly
185 190 195
Asp Tyr Tyr His Ala Ile Pro Trp Leu Glu Glu Ala Val Ser Leu
200 205 210
Phe Arg Gly Ser Tyr Gly Glu Trp Lys Thr Glu Asp Glu Ala Ser
215 220 225
Leu Glu Asp Ala Leu Asp His Leu Ala Phe Ala Tyr Phe Arg Ala
230 235 240
Gly Asn Val Ser Cys Ala Leu Ser Leu Ser Arg Glu Phe Leu Leu
245 250 255
Tyr Ser Pro Asp Asn Lys Arg Met Ala Arg Asn Val Leu Lys Tyr
260 265 270
607
Glu Arg Leu Leu Ala Glu Ser Pro Asn His Val Val Ala Glu Ala 275 280 285
Val Ile Gin Arg Pro Asn Ile Pro His Leu Gin Thr Arg Asp Thr 290 295 300
Tyr Glu Gly Leu Cys Gin Thr Leu Gly Ser Gin Pro Thr Leu Tyr 305 310 315
Gin Ile Pro Ser Leu Tyr Cys Ser Tyr Glu Thr Asn Ser Asn Ala 320 325 330
Tyr Leu Leu Leu Gin Pro Ile Arg Lys Glu Val Ile His Leu Glu 335 340 345
Pro Tyr Ile Ala Leu Tyr His Asp Phe Val Ser Asp Ser Glu Ala 350 355 360
Gin Lys Ile Arg Glu Leu Ala Glu Pro Trp Leu Gin Arg Ser Val 365 370 375
Val Ala Ser Gly Glu Lys Gin Leu Gin Val Glu Tyr Arg Ile Ser 380 385 390
Lys Ser Ala Trp Leu Lys Asp Thr Val Asp Pro Lys Leu Val Thr 395 400 405
Leu Asn His Arg Ile Ala Ala Leu Thr Gly Leu Asp Val Arg Pro 410 415 420
Pro Tyr Ala Glu Tyr Leu Gin Val Val Asn Tyr Gly Ile Gly Gly 425 430 435
His Tyr Glu Pro His Phe Asp His Ala Thr Ser Pro Ser Ser Pro 440 445 450
Leu Tyr Arg Met Lys Ser Gly Asn Arg Val Ala Thr Phe Met Ile 455 460 465
Tyr Leu Ser Ser Val Glu Ala Gly Gly Ala Thr Ala Phe Ile Tyr 470 475 480
Ala Asn Leu Ser Val Pro Val Val Arg Asn Ala Ala Leu Phe Trp 485 490 495
Trp Asn Leu His Arg Ser Gly Glu Gly Asp Ser Asp Thr Leu His 500 505 510
Ala Gly Cys Pro Val Leu Val Gly Asp Lys Trp Val Ala Asn Lys 515 520 525
Trp Ile His Glu Tyr Gly Gin Glu Phe Arg Arg Pro Cys Ser Ser 530 535 540
Ser Pro Glu Asp
<210> 119 <211> 23 <212> DNA
<213> Artificial Sequence
608
<220>
<223> Synthetic oligonucleotide probe
<400> 119 cgggacagga gacccagaaa ggg 23
<210> 120 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 120 ggccaagtga tccaaggcat cttc 24
<210> 121 <211> 49 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 121
ctgcgggacc tgactagatt ctacgacaag gtactttctt tgcatgggg 49
<210> 122
<211> 1778
<212> DNA
<213> Homo sapiens
<400> 122
gagataggga gtctgggttt aagttcctgc tccatctcag gagcccctgc
50
tcccacccct aggaagccac cagactccac ggtgtggggc caatcaggtg
100
gaatcggccc tggcaggtgg ggccacgagc gctggctgag ggaccgagee
150
ggagagcccc ggagcccccg taacccgcgc ggggagegee caggatgccg
200
cgcggggact cggagcaggt gcgctactgc gcgcgcttct cctacctctg
250
gctcaagttt tcacttatca tctattccac cgtgttctgg ctgattgggg
300
ccctggtcct gtctgtgggc atetatgeag aggttgagcg gcagaaatat
350
aaaacccttg aaagtgcctt cctggctcca gccatcatcc tcatcctcct
400
gggcgtcgtc atgttcatgg tctccttcat tggtgtgctg gcgtccctcc
450
gtgacaacct ctcatcatgg gtaccttctc ageteattgg caagcattca tggcgtggtg tgtacatcct gccttgacct tgggatctgc tccggaacca
500 550
gaccattgac ttcctgaacg acaacattcg aagaggaatt gagaactact
600
atgatgatct ggacttcaaa aacatcatgg actttgttca gaaaaagttc
650
aagtgctgtg gcggggagga ctaccgagat tggagcaaga atcagtacca
700
cgactgcagt gcccctggac ccctggcctg tggggtgccc tacacctgct
750
609
gcatcaggaa cacgacagaa gttgtcaaca ccatgtgtgg ctacaaaact
800
atcgacaagg agcgtttcag tgtgcaggat gtcatctacg tgcggggctg
850
caccaacgcc gtgatcatct ggttcatgga caactacacc atcatggcgt
900
gcatcctcct gggcatcctg cttccccagt tcctgggggt gctgctgacg
950
ctgctgtaca tcacccgggt ggaggacatc atcatggagc actctgtcac
1000
tgatgggctc ctggggcccg gtgccaagcc cagcgtggag gcggcaggca
1050
cgggatgctg cttgtgctac cccaattagg gcccagcctg ccatggcagc
1100
tccaacaagg accgtctggg atagcacctc tcagtcaaca tcgtggggct
1150
ggacagggct gcggcccctc tgcccacact cagtactgac caaagccagg
1200
gctgtgtgtg cctgtgtgta ggtcccacgg cctctgcctc cccagggagc
1250
agagcctggg cctcccctaa gaggctttcc ccgaggcagc tctggaatct
1300
gtgcccacct ggggcctggg gaacaaggcc ctcctttctc caggcctggg
1350
ctacagggga gggagagcct gaggctctgc tcagggccca tttcatctct
1400
ggcagtgcct tggcggtggt attcaaggca gttttgtagc acctgtaatt
1450
ggggagaggg agtgtgcccc tcggggcagg agggaagggc atctggggaa
1500
gggcaggagg gaagagctgt ccatgcagcc acgcccatgg ccaggttggc
1550
ctcttctcag cctcccaggt gccttgagcc ctcttgcaag ggcggctgct
1600
tccttgagcc tagttttttt ttacgtgatt tttgtaacat tcattttttt
1650
gtacagataa caggagtttc tgactaatca aagctggtat ttccccgcat
1700
gtcttattct tgcccttccc ccaaccagtt tgttaatcaa acaataaaaa
1750
catgttttgt tttgttttta aaaaaaaa 1778
<210> 123 <211> 294 <212> PRT <213> Homo sapiens
<400> 123
Met Pro Arg Gly Asp Ser Glu Gin Val Arg Tyr Cys Ala Arg Phe
10 15
Ser Tyr Leu Trp Leu Lys Phe Ser Leu Ile Ile Tyr Ser Thr Val
25 30 Phe Trp Leu Ile Gly Ala Leu Val Leu Ser Val Gly Ile Tyr Ala
40 45
Glu Val Glu Arg Gin Lys Tyr Lys Thr Leu Glu Ser Ala Phe Leu
50 55 60
Ala Pro Ala Ile Ile Leu Ile Leu Leu Gly Val Val Met Phe Met
65 70 75
610
Val Ser Phe Ile Gly Val Leu Ala Ser Leu Arg Asp Asn Leu Tyr
80 85 90
Leu Leu Gin Ala Phe Met Tyr Ile Leu Gly Ile Cys Leu Ile Met
95 100 105
Glu Leu Ile Gly Gly Val Val Ala Leu Thr Phe Arg Asn Gin Thr
110 115 120
Ile Asp Phe Leu Asn Asp Asn Ile Arg Arg Gly Ile Glu Asn Tyr
125 130 135
Tyr Asp Asp Leu Asp Phe Lys Asn Ile Met Asp Phe Val Gin Lys
140 145 150
Lys Phe Lys Cys Cys Gly Gly Glu Asp Tyr Arg Asp Trp Ser Lys
155 160 165
Asn Gin Tyr His Asp Cys Ser Ala Pro Gly Pro Leu Ala Cys Gly
170 175 180
Val Pro Tyr Thr Cys Cys Ile Arg Asn Thr Thr Glu Val Val Asn
185 190 195
Thr Met Cys Gly Tyr Lys Thr Ile Asp Lys Glu Arg Phe Ser Val
200 205 210
Gin Asp Val Ile Tyr Val Arg Gly Cys Thr Asn Ala Val Ile Ile
215 220 225
Trp Phe Met Asp Asn Tyr Thr Ile Met Ala Cys Ile Leu Leu Gly
230 235 240
Ile Leu Leu Pro Gin Phe Leu Gly Val Leu Leu Thr Leu Leu Tyr
245 250 255
Ile Thr Arg Val Glu Asp Ile Ile Met Glu His Ser Val Thr Asp
260 265 270
Gly Leu Leu Gly Pro Gly Ala Lys Pro Ser Val Glu Ala Ala Gly
275 280 285
Thr Gly Cys Cys Leu Cys Tyr Pro Asn 290
<210> 124
<211> 25
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 124
atcatctatt ccaccgtgtt ctggc 25
<210> 125
<211> 25
<212> DNA
<213> Artificial Sequence
611
<220>
<223> Synthetic oligonucleotide probe
<400> 125 gacagagtgc tccatgatga tgtcc 25
<210> 126 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 126
cctgtctgtg ggcatctatg cagaggttga gcggcagaaa tataaaaccc 50
<210> 127
<211> 1636
<212> DNA
<213> Homo sapiens
<400> 127
gaggagcggg ccgaggactc cagcgtgccc aggtctggca tcctgcactt
50
gctgccctct gacacctggg aagatggccg gcccgtggac cttcaccctt
100
ctctgtggtt tgctggcagc caccttgatc caagccaccc tcagtcccac
150
tgcagttctc atcctcggcc caaaagtcat caaagaaaag ctgacacagg
200
agctgaagga ccacaacgcc accagcatcc tgcagcagct gccgctgctc
250
agtgccatgc gggaaaagcc agccggaggc atccctgtgc tgggcagcct
300
ggtgaacacc gtcctgaagc acatcatctg gctgaaggtc atcacagcta
350
acatcctcca gctgcaggtg aagccctcgg ccaatgacca ggagctgcta
400
gtcaagatcc ccctggacat ggtggctgga ttcaacacgc ccctggtcaa
450
gaccatcgtg gagttccaca tgacgactga ggcccaagcc accatccgca
500
tggacaccag tgcaagtggc cccacccgcc tggtcctcag tgactgtgcc
550
accagccatg ggagcctgcg catccaactg ctgtataagc tctccttcct
600
ggtgaacgcc ttagctaagc aggtcatgaa cctcctagtg ccatccctgc
650
ccaatctagt gaaaaaccag ctgtgtcccg tgatcgaggc ttccttcaat
700
ggcatgtatg cagacctcct gcagctggtg aaggtgccca tttccctcag
750
cattgaccgt ctggagtttg accttctgta tcctgccatc aagggtgaca
800
ccattcagct accaagtggt ctacctgggg tcaataactc gccaagttgt tgcagcttcc tggactcaca ctgacaatgc gggaaaggtg ccaccctgga
850 900
caacatcccg ttcagcctca tcgtgagtca ggacgtggtg aaagctgcag
950
tggctgctgt gctctctcca gaagaattca tggtcctgtt ggactctgtg
1000
612
cttcctgaga gtgcccatcg gctgaagtca agcatcgggc tgatcaatga
1050
aaaggctgca gataagctgg gatctaccca gatcgtgaag atcctaactc
1100
aggacactcc cgagtttttt atagaccaag gccatgccaa ggtggcccaa
1150
ctgatcgtgc tggaagtgtt tccctccagt gaagccctcc gccctttgtt
1200
caccctgggc atcgaagcca gctcggaagc tcagttttac accaaaggtg
1250
accaacttat actcaacttg aataacatca gctctgatcg gatccagctg
1300
atgaactctg ggattggctg gttccaacct gatgttctga aaaacatcat
1350
cactgagatc atccactcca tcctgctgcc gaaccagaat ggcaaattaa
1400
gatctggggt cccagtgtca ttggtgaagg ccttgggatt cgaggcagct
1450
gagtcctcac tgaccaagga tgcccttgtg cttactccag cctccttgtg
1500
gaaacccagc tctcctgtct cccagtgaag acttggatgg cagccatcag
1550
ggaaggctgg gtcccagctg ggagtatggg tgtgagctct atagaccatc
1600
cctctctgca atcaataaac acttgcctgt gaaaaa 1636
<210> 128 <211> 484 <212> PRT
<213> Homo sapiens <400> 128
Met Ala Gly Pro Trp Thr Phe Thr Leu Leu Cys Gly Leu Leu Ala 1 5 10 15
Ala Thr Leu Ile Gin Ala Thr Leu Ser Pro Thr Ala Val Leu Ile 20 25 30
Leu Gly Pro Lys Val Ile Lys Glu Lys Leu Thr Gin Glu Leu Lys 35 40 45
Asp His Asn Ala Thr Ser Ile Leu Gin Gin Leu Pro Leu Leu Ser 50 55 60
Ala Met Arg Glu Lys Pro Ala Gly Gly Ile Pro Val Leu Gly Ser 65 70 75
Leu Val Asn Thr Val Leu Lys His Ile Ile Trp Leu Lys Val Ile 80 85 90
Thr Ala Asn Ile Leu Gin Leu Gin Val Lys Pro Ser Ala Asn Asp 95 100 105
Gin Glu Leu Leu Val Lys Ile Pro Leu Asp Met Val Ala Gly Phe 110 115 120
Asn Thr Pro Leu Val Lys Thr Ile Val Glu Phe His Met Thr Thr 125 130 135
Glu Ala Gin Ala Thr Ile Arg Met Asp Thr Ser Ala Ser Gly Pro 140 145 150
613
Thr Arg Leu Val Leu Ser Asp Cys Ala Thr Ser His Gly Ser Leu 155 160 165
Arg Ile Gin Leu Leu Tyr Lys Leu Ser Phe Leu Val Asn Ala Leu 170 175 180
Ala Lys Gin Val Met Asn Leu Leu Val Pro Ser Leu Pro Asn Leu 185 190 195
Val Lys Asn Gin Leu Cys Pro Val Ile Glu Ala Ser Phe Asn Gly 200 205 210
Met Tyr Ala Asp Leu Leu Gin Leu Val Lys Val Pro Ile Ser Leu 215 220 225
Ser Ile Asp Arg Leu Glu Phe Asp Leu Leu Tyr Pro Ala Ile Lys 230 235 240
Gly Asp Thr Ile Gin Leu Tyr Leu Gly Ala Lys Leu Leu Asp Ser 245 250 255
Gin Gly Lys Val Thr Lys Trp Phe Asn Asn Ser Ala Ala Ser Leu 260 265 270
Thr Met Pro Thr Leu Asp Asn Ile Pro Phe Ser Leu Ile Val Ser 275 280 285
Gin Asp Val Val Lys Ala Ala Val Ala Ala Val Leu Ser Pro Glu 290 295 300
Glu Phe Met Val Leu Leu Asp Ser Val Leu Pro Glu Ser Ala His 305 310 315
Arg Leu Lys Ser Ser Ile Gly Leu Ile Asn Glu Lys Ala Ala Asp 320 325 330
Lys Leu Gly Ser Thr Gin Ile Val Lys Ile Leu Thr Gin Asp Thr 335 340 345
Pro Glu Phe Phe Ile Asp Gin Gly His Ala Lys Val Ala Gin Leu 350 355 360
lie Val Leu Glu Val Phe Pro Ser Ser Glu Ala Leu Arg Pro Leu 365 370 375
Phe Thr Leu Gly Ile Glu Ala Ser Ser Glu Ala Gin Phe Tyr Thr 380 385 390
Lys Gly Asp Gin Leu Ile Leu Asn Leu Asn Asn Ile Ser Ser Asp 395 400 405
Arg Ile Gin Leu Met Asn Ser Gly Ile Gly Trp Phe Gin Pro Asp 410 415 420
Val Leu Lys Asn Ile Ile Thr Glu Ile Ile His Ser Ile Leu Leu 425 430 435
Pro Asn Gin Asn Gly Lys Leu Arg Ser Gly Val Pro Val Ser Leu 440 445 450
Val Lys Ala Leu Gly Phe Glu Ala Ala Glu Ser Ser Leu Thr Lys 455 460 465
614
Asp Ala Leu Val Leu Thr Pro Ala Ser Leu Trp Lys Pro Ser Ser 470 475 480
Pro Val Ser Gin
<210> 129
<211> 2213
<212> DNA
<213> Homo sapiens
<400> 129
gagcgaacat ggcagcgcgt tggcggtttt ggtgtgtctc tgtgaccatg
50
gtggtggcgc tgctcatcgt ttgcgacgtt ccctcagcct ctgcccaaag
100
aaagaaggag atggtgttat ctgaaaaggt tagtcagctg atggaatgga
150
ctaacaaaag acctgtaata agaatgaatg gagacaagtt ccgtcgcctt
200
gtgaaagccc caccgagaaa ttactccgtt atcgtcatgt tcactgctct
250
ccaactgcat agacagtgtg tcgtttgcaa gcaagctgat gaagaattcc
300
agatcctggc aaactcctgg cgatactcca gtgcattcac caacaggata
350
ttttttgcca tggtggattt tgatgaaggc tctgatgtat ttcagatgct
400
aaacatgaat tcagctccaa ctttcatcaa ctttcctgca aaagggaaac
450
ccaaacgggg tgatacatat gagttacagg tgcggggttt ttcagctgag
500
cagattgccc ggtggatcgc cgacagaact gatgtcaata ttagagtgat
550
tagaccccca aattatgctg gtccccttat gttgggattg cttttggctg
600
ttattggtgg acttgtgtat cttcgaagaa gtaatatgga atttctcttt
650
aataaaactg gatgggcttt tgcagctttg tgttttgtgc ttgctatgac
700
atctggtcaa atgtggaacc atataagagg accaccatat gcccataaga
750
atccccacac gggacatgtg aattatatcc atggaagcag tcaagcccag
800
tttgtagctg aaacacacat tgttcttctg tttaatggtg gagttacctt
850
aggaatggtg cttttatgtg aagctgctac ctctgacatg gatattggaa
900
agcgaaagat aatgtgtgtg gctggtattg gacttgttgt attattcttc
950
agttggatgc tctctatttt tagatctaaa tatcatggct acccatacag
1000
ctttctgatg agttaaaaag gtcccagaga tatatagaca ctggagtact
1050
ggaaattgaa aaacgaaaat cgtgtgtgtt tgaaaagaag aatgcaactt
1100
gtatattttg tattacctct ttttttcaag tgatttaaat agttaatcat
1150
ttaaccaaag aagatgtgta gtgccttaac aagcaatcct ctgtcaaaat
1200
ctgaggtatt tgaaaataat tatcctctta accttctctt cccagtgaac
1250
615
tttatggaac atttaattta gtacaattaa gtatattata aaaattgtaa
1300
aactactact ttgttttagt tagaacaaag ctcaaaacta ctttagttaa
1350
cttggtcatc tgattttata ttgccttatc caaagatggg gaaagtaagt
1400
cctgaccagg tgttcccaca tatgcctgtt acagataact acattaggaa
1450
ttcattctta gcttcttcat ctttgtgtgg atgtgtatac tttacgcatc
1500
tttccttttg agtagagaaa ttatgtgtgt catgtggtct tctgaaaatg
1550
gaacaccatt cttcagagca cacgtctagc cctcagcaag acagttgttt
16 00
ctcctcctcc ttgcatattt cctactgcgc tccagcctga gtgatagagt
1650
gagactctgt ctcaaaaaaa agtatctcta aatacaggat tataatttct
1700
gcttgagtat ggtgttaact accttgtatt tagaaagatt tcagattcat
1750
tccatctcct tagttttctt ttaaggtgac ccatctgtga taaaaatata
1800
gcttagtgct aaaatcagtg taacttatac atggcctaaa atgtttctac
1850
aaattagagt ttgtcactta ttccatttgt acctaagaga aaaataggct
1900
cagttagaaa aggactccct ggccaggcgc agtgacttac gcctgtaatc
1950
tcagcacttt gggaggccaa ggcaggcaga tcacgaggtc aggagttcga
2000
gaccatcctg gccaacatgg tgaaaccccg tctctactaa aaatataaaa
2050
attagctggg tgtggtggca ggagcctgta atcccagcta cacaggaggc
2100
tgaggcacga gaatcacttg aactcaggag atggaggttt cagtgagccg
2150
agatcacgcc actgcactcc agcctggcaa cagagcgaga ctccatctca
2200
aaaaaaaaaa aaa 2213
<210> 130 <211> 335 <212> PRT <213> Homo sapiens
<400> 130
Met Ala Ala Arg Trp Arg Phe Trp Cys Val Ser Val Thr Met Val 15 10 15
Val Ala Leu Leu Ile Val Cys Asp Val Pro Ser Ala Ser Ala Gin 20 25 30
Arg Lys Lys Glu Met Val Leu Ser Glu Lys Val Ser Gin Leu Met 35 40 45
Glu Trp Thr Asn Lys Arg Pro Val Ile Arg Met Asn Gly Asp Lys 50 55 60
Phe Arg Arg Leu Val Lys Ala Pro Pro Arg Asn Tyr Ser Val Ile 65 70 75
616
Val Met Phe Thr Ala Leu Gin Leu His Arg Gin Cys Val Val Cys 80 85 90
Lys Gin Ala Asp Glu Glu Phe Gin Ile Leu Ala Asn Ser Trp Arg 95 100 105
Tyr Ser Ser Ala Phe Thr Asn Arg Ile Phe Phe Ala Met Val Asp 110 115 120
Phe Asp Glu Gly Ser Asp Val Phe Gin Met Leu Asn Met Asn Ser 125 130 135
Ala Pro Thr Phe Ile Asn Phe Pro Ala Lys Gly Lys Pro Lys Arg 140 145 150
Gly Asp Thr Tyr Glu Leu Gin Val Arg Gly Phe Ser Ala Glu Gin 155 160 165
Ile Ala Arg Trp Ile Ala Asp Arg Thr Asp Val Asn Ile Arg Val 170 175 180
Ile Arg Pro Pro Asn Tyr Ala Gly Pro Leu Met Leu Gly Leu Leu 185 190 195
Leu Ala Val Ile Gly Gly Leu Val Tyr Leu Arg Arg Ser Asn Met 200 205 210
Glu Phe Leu Phe Asn Lys Thr Gly Trp Ala Phe Ala Ala Leu Cys 215 220 225
Phe Val Leu Ala Met Thr Ser Gly Gin Met Trp Asn His Ile Arg 230 235 240
Gly Pro Pro Tyr Ala His Lys Asn Pro His Thr Gly His Val Asn 245 250 255
Tyr Ile His Gly Ser Ser Gin Ala Gin Phe Val Ala Glu Thr His 260 265 270
Ile Val Leu Leu Phe Asn Gly Gly Val Thr Leu Gly Met Val Leu 275 280 285
Leu Cys Glu Ala Ala Thr Ser Asp Met Asp Ile Gly Lys Arg Lys 290 295 300
Ile Met Cys Val Ala Gly Ile Gly Leu Val Val Leu Phe Phe Ser 305 310 315
Trp Met Leu Ser Ile Phe Arg Ser Lys Tyr His Gly Tyr Pro Tyr 320 325 330
Ser Phe Leu Met Ser 335
<210> 131 <211> 2476 <212> DNA
<213> Homo sapiens
<400> 131
aagcaaccaa actgcaagct ttgggagttg ttcgctgtcc ctgccctgct 50 ctgctaggga gagaacgcca gagggaggcg gctggcccgg cggcaggctc 100
617
tcagaaccgc taccggcgat gctactgctg tgggtgtcgg tggtcgcagc
150
cttggcgctg gcggtactgg cccccggagc aggggagcag aggcggagag
200
cagccaaagc gcccaatgtg gtgctggtcg tgagcgactc cttcgatgga
250
aggttaacat ttcatccagg aagtcaggta gtgaaacttc cttttatcaa
300
ctttatgaag acacgtggga cttcctttct gaatgcctac acaaactctc
350
caatttgttg cccatcacgc gcagcaatgt ggagtggcct cttcactcac
400
ttaacagaat cttggaataa ttttaagggt ctagatccaa attatacaac
450
atggatggat gtcatggaga ggcatggcta ccgaacacag aaatttggga
500
aactggacta tacttcagga catcactcca ttagtaatcg tgtggaagcg
550
tggacaagag atgttgcttt cttactcaga caagaaggca ggcccatggt
600
taatcttatc cgtaacagga ctaaagtcag agtgatggaa agggattggc
650
agaatacaga caaagcagta aactggttaa gaaaggaagc aattaattac
700
actgaaccat ttgttattta cttgggatta aatttaccac acccttaccc
750
ttcaccatct tctggagaaa attttggatc ttcaacattt cacacatctc
800
tttattggct tgaaaaagtg tctcatgatg ccatcaaaat cccaaagtgg
850
tcacctttgt cagaaatgca ccctgtagat tattactctt cttatacaaa
900
aaactgcact ggaagattta caaaaaaaga aattaagaat attagagcat
950
tttattatgc tatgtgtgct gagacagatg ccatgcttgg tgaaattatt
1000
ttggcccttc atcaattaga tcttcttcag aaaactattg tcatatactc
1050
ctcagaccat ggagagctgg ccatggaaca tcgacagttt tataaaatga
1100
gcatgtacga ggctagtgca catgttccgc ttttgatgat gggaccagga
1150
attaaagccg gcctacaagt atcaaatgtg gtttctcttg tggatattta
1200
ccctaccatg cttgatattg ctggaattcc tctgcctcag aacctgagtg
1250
gatactcttt gttgccgtta tcatcagaaa catttaagaa tgaacataaa
1300
gtcaaaaacc tgcatccacc ctggattctg agtgaattcc atggatgtaa
1350
tgtgaatgcc tccacctaca tgcttcgaac taaccactgg aaatatatag
1400
cctattcgga tggtgcatca atattgcctc aactctttga tctttcctcg
1450
gatccagatg aattaacaaa tgttgctgta aaatttccag aaattactta
1500
ttctttggat cagaagcttc attccattat aaactaccct aaagtttctg
1550
cttctgtcca ccagtataat aaagagcagt ttatcaagtg gaaacaaagt
1600
ataggacaga attattcaaa cgttatagca aatcttaggt ggcaccaaga
1650
ctggcagaag gaaccaagga agtatgaaaa
618
tgcaattgat cagtggctta
1700
aaacccatat gaatccaaga gcagtttgaa caaaaagttt aaaaatagtg
1750
ttctagagat acatataaat atattacaag atcataatta tgtattttaa
1800
atgaaacagt tttaataatt accaagtttt ggccgggcac agtggctcac
1850
acctgtaatc ccaggacttt gggaggctga ggaaagcaga tcacaaggtc
1900
aagagattga gaccatcctg gccaacatgg tgaaaccctg tctctactaa
1950
aaatacaaaa attagctggg cgcggtggtg cacacctata gtctcagcta
2000
ctcagaggct gaggcaggag gatcgcttga acccgggagg cagcagttgc
2050
agtgagctga gattgcgcca ctgtactcca gcctggcaac agagtgagac
2100
tgtgtcgcaa aaaaataaaa ataaaataat aataattacc aatttttcat
2150
tattttgtaa gaatgtagtg tattttaaga taaaatgcca atgattataa
2200
aatcacatat tttcaaaaat ggttattatt taggcctttg tacaatttct
2250
aacaatttag tggaagtatc aaaaggattg aagcaaatac tgtaacagtt
2300
atgttccttt aaataataga gaatataaaa tattgtaata atatgtatca
2350
taaaatagtt gtatgtgagc atttgatggt gaaaaaaaaa aaaaaaaaaa
2400
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
2450
aaaaaaaaaa aaaaaaaaaa aaaaaa 2476
<210> 132 <211> 536 <212> PRT <213> Homo sapiens
<400> 132
Met Leu Leu Leu Trp Val Ser Val Val Ala Ala Leu Ala Leu Ala 15 10 15
Val Leu Ala Pro Gly Ala Gly Glu Gin Arg Arg Arg Ala Ala Lys 20 25 30
Ala Pro Asn Val Val Leu Val Val Ser Asp Ser Phe Asp Gly Arg 35 40 45
Leu Thr Phe His Pro Gly Ser Gin Val Val Lys Leu Pro Phe Ile 50 55 60
Asn Phe Met Lys Thr Arg Gly Thr Ser Phe Leu Asn Ala Tyr Thr 65 70 75
Asn Ser Pro Ile Cys Cys Pro Ser Arg Ala Ala Met Trp Ser Gly 80 85 90
Leu Phe Thr His Leu Thr Glu Ser Trp Asn Asn Phe Lys Gly Leu 95 100 105
Asp Pro Asn Tyr Thr Thr Trp Met Asp Val Met Glu Arg His Gly
619
110 115 120
Tyr Arg Thr Gin Lys Phe Gly Lys Leu Asp Tyr Thr Ser Gly His 125 130 135
His Ser Ile Ser Asn Arg Val Glu Ala Trp Thr Arg Asp Val Ala 140 145 150
Phe Leu Leu Arg Gin Glu Gly Arg Pro Met Val Asn Leu Ile Arg 155 160 165
Asn Arg Thr Lys Val Arg Val Met Glu Arg Asp Trp Gin Asn Thr 170 175 180
Asp Lys Ala Val Asn Trp Leu Arg Lys Glu Ala Ile Asn Tyr Thr 185 190 195
Glu Pro Phe Val Ile Tyr Leu Gly Leu Asn Leu Pro His Pro Tyr 200 205 210
Pro Ser Pro Ser Ser Gly Glu Asn Phe Gly Ser Ser Thr Phe His 215 220 225
Thr Ser Leu Tyr Trp Leu Glu Lys Val Ser His Asp Ala Ile Lys 230 235 240
Ile Pro Lys Trp Ser Pro Leu Ser Glu Met His Pro Val Asp Tyr 245 250 255
Tyr Ser Ser Tyr Thr Lys Asn Cys Thr Gly Arg Phe Thr Lys Lys 260 265 270
Glu Ile Lys Asn Ile Arg Ala Phe Tyr Tyr Ala Met Cys Ala Glu 275 280 285
Thr Asp Ala Met Leu Gly Glu Ile Ile Leu Ala Leu His Gin Leu 290 295 300
Asp Leu Leu Gin Lys Thr Ile Val Ile Tyr Ser Ser Asp His Gly 305 310 315
Glu Leu Ala Met Glu His Arg Gin Phe Tyr Lys Met Ser Met Tyr 320 325 330
Glu Ala Ser Ala His Val Pro Leu Leu Met Met Gly Pro Gly Ile 335 340 345
Lys Ala Gly Leu Gin Val Ser Asn Val Val Ser Leu Val Asp Ile 350 355 360
Tyr Pro Thr Met Leu Asp Ile Ala Gly Ile Pro Leu Pro Gin Asn 365 370 375
Leu Ser Gly Tyr Ser Leu Leu Pro Leu Ser Ser Glu Thr Phe Lys 380 385 390
Asn Glu His Lys Val Lys Asn Leu His Pro Pro Trp Ile Leu Ser 395 400 405
Glu Phe His Gly Cys Asn Val Asn Ala Ser Thr Tyr Met Leu Arg 410 415 420
Thr Asn His Trp Lys Tyr Ile Ala Tyr Ser Asp Gly Ala Ser Ile
425
Leu Pro Gin Leu Phe Asp 440
Asn Val Ala Val Lys Phe 455
Lys Leu His Ser Ile Ile 470
His Gin Tyr Asn Lys Glu 485
Gly Gin Asn Tyr Ser Asn 500
Asp Trp Gin Lys Glu Pro 515
Trp Leu Lys Thr His Met 530
<210> 133
<211> 1475
<212> DNA
<213> Homo sapiens
<400> 133
gagagaagtc agcctggcag agagactctg aaatgaggga ttagaggtgt
50
tcaaggagca agagcttcag cctgaagaca agggagcagt ccctgaagac
100
gcttctactg agaggtctgc catggcctct cttggcctcc aacttgtggg
150
ctacatccta ggccttctgg ggcttttggg cacactggtt gccatgctgc
200
tccccagctg gaaaacaagt tcttatgtcg gtgccagcat tgtgacagca
250
gttggcttct ccaagggcct ctggatggaa tgtgccacac acagcacagg
300
catcacccag tgtgacatct atagcaccct tctgggcctg cccgctgaca
350
tccaggctgc ccaggccatg atggtgacat ccagtgcaat ctcctccctg
400
gcctgcatta tctctgtggt gggcatgaga tgcacagtct tctgccagga
450
atcccgagcc aaagacagag tggcggtagc aggtggagtc tttttcatcc
500
ttggaggcct cctgggattc attcctgttg cctggaatct tcatgggatc
550
ctacgggact tctactcacc actggtgcct gacagcatga aatttgagat
600
tggagaggct ctttacttgg gcattatttc ttccctgttc tccctgatag
650
ctggaatcat cctctgcttt tcctgctcat cccagagaaa tcgctccaac
700
tactacgatg cctaccaagc ccaacctctt gccacaagga gctctccaag
750
gcctggtcaa cctcccaaag tcaagagtga gttcaattcc tacagcctga
800
cagggtatgt gtgaagaacc aggggccaga gctggggggt ggctgggtct
850
620
430 435
Leu Ser Ser Asp Pro Asp Glu Leu Thr 445 450
Pro Glu Ile Thr Tyr Ser Leu Asp Gin 460 465
Asn Tyr Pro Lys Val Ser Ala Ser Val 475 480
Gin Phe Ile Lys Trp Lys Gin Ser Ile 490 495
Val Ile Ala Asn Leu Arg Trp His Gin 505 510
Arg Lys Tyr Glu Asn Ala Ile Asp Gin 520 525
Asn Pro Arg Ala Val 535
621
gtgaaaaaca gtggacagca ccccgagggc cacaggtgag ggacactacc 900
actggatcgt gtcagaaggt gctgctgagg atagactgac tttggccatt 950
ggattgagca aaggcagaaa tgggggctag tgtaacagca tgcaggttga 1000
attgccaagg atgctcgcca tgccagcctt tctgttttcc tcaccttgct 1050
gctcccctgc cctaagtccc caaccctcaa cttgaaaccc cattccctta 1100
agccaggact cagaggatcc ctttgccctc tggtttacct gggactccat 1150
ccccaaaccc actaatcaca tcccactgac tgaccctctg tgatcaaaga 1200
ccctctctct ggctgaggtt ggctcttagc tcattgctgg ggatgggaag 1250
gagaagcagt ggcttttgtg ggcattgctc taacctactt ctcaagcttc 1300
cctccaaaga aactgattgg ccctggaacc tccatcccac tcttgttatg 1350
actccacagt gtccagacta atttgtgcat gaactgaaat aaaaccatcc 1400
tacggtatcc agggaacaga aagcaggatg caggatggga ggacaggaag 1450
gcagcctggg acatttaaaa aaata 147 5
<210> 134 <211> 230 <212> PRT
<213> Homo sapiens <400> 134
Met Ala Ser.Leu Gly Leu Gin Leu Val Gly Tyr Ile Leu Gly Leu 15 10 15
Leu Gly Leu Leu Gly Thr Leu Val Ala Met Leu Leu Pro Ser Trp 20 25 30
Lys Thr Ser Ser Tyr Val Gly Ala Ser Ile Val Thr Ala Val Gly 35 40 45
Phe Ser Lys Gly Leu Trp Met Glu Cys Ala Thr His Ser Thr Gly 50 55 60
Ile Thr Gin Cys Asp Ile Tyr Ser Thr Leu Leu Gly Leu Pro Ala 65 70 75
Asp Ile Gin Ala Ala Gin Ala Met Met Val Thr Ser Ser Ala Ile 80 85 90
Ser Ser Leu Ala Cys Ile Ile Ser Val Val Gly Met Arg Cys Thr 95 100 105
Val Phe Cys Gin Glu Ser Arg Ala Lys Asp Arg Val Ala Val Ala 110 115 120
Gly Gly Val Phe Phe Ile Leu Gly Gly Leu Leu Gly Phe Ile Pro 125 130 135
Val Ala Trp Asn Leu His Gly Ile Leu Arg Asp Phe Tyr Ser Pro 140 145 150
622
Leu Val Pro Asp Ser Met 155
Leu Gly Ile Ile Ser Ser 170
Leu Cys Phe Ser Cys Ser 185
Asp Ala Tyr Gin Ala Gin 200
Pro Gly Gin Pro Pro Lys 215
Leu Thr Gly Tyr Val 230
<210> 135 <211> 610 <212> DNA <213> Homo sapiens
<400> 135
gcactgctgc tgtcccatca gctgctctga agctccatgg tgcccagaat
50
cttcgctcct gcttatgtgt cagtctgtct cctcctcttg tgtccaaggg
100
aagtcatcgc tcccgctggc tcagaaccat ggctgtgcca gccggcaccc
150
aggtgtggag acaagatcta caaccccttg gagcagtgct gttacaatga
200
cgccatcgtg tccctgagcg agacccgcca atgtggtccc ccctgcacct
250
tctggccctg ctttgagctc tgctgtcttg attcctttgg cctcacaaac
300
gattttgttg tgaagctgaa ggttcagggt gtgaattccc agtgccactc
350
atctcccatc tccagtaaat gtgaaagcag aagacgtttt ccctgagaag
400
acatagaaag aaaatcaact ttcactaagg catctcagaa acataggcta
450
aggtaatatg tgtaccagta gagaagcctg aggaatttac aaaatgatgc
500
agctccaagc cattgtatgg cccatgtggg agactgatgg gacatggaga
550
atgacagtag attatcagga aataaataaa gtggtttttc caatgtacac
600
acctgtaaaa
610
<210> 136 <211> 119 <212> PRT
<213> Homo sapiens <400> 136
Met Val Pro Arg Ile Phe Ala Pro Ala Tyr Val Ser Val Cys Leu 15 10 15
Leu Leu Leu Cys Pro Arg Glu Val Ile Ala Pro Ala Gly Ser Glu 20 25 30
Lys Phe Glu Ile Gly Glu Ala Leu Tyr 160 165
Leu Phe Ser Leu Ile Ala Gly Ile Ile 175 180
Ser Gin Arg Asn Arg Ser Asn Tyr Tyr 190 195
Pro Leu Ala Thr Arg Ser Ser Pro Arg 205 210
Val Lys Ser Glu Phe Asn Ser Tyr Ser 220 225
623
Pro Trp Leu Cys Gin Pro Ala Pro Arg Cys Gly Asp Lys lie Tyr 35 40 45
Asn Pro Leu Glu Gin Cys Cys Tyr Asn Asp Ala Ile Val Ser Leu 50 55 60
Ser Glu Thr Arg Gin Cys Gly Pro Pro Cys Thr Phe Trp Pro Cys 65 70 75
Phe Glu Leu Cys Cys Leu Asp Ser Phe Gly Leu Thr Asn Asp Phe 80 85 90
Val Val Lys Leu Lys Val Gin Gly Val Asn Ser Gin Cys His Ser 95 100 105
Ser Pro Ile Ser Ser Lys Cys Glu Ser Arg Arg Arg Phe Pro 110 115
<210> 137 <211> 771 <212> DNA <213> Homo sapiens
<400> 137
ctccactgca accacccaga gccatggctc cccgaggctg catcgtagct
50
gtctttgcca ttttctgcat ctccaggctc ctctgctcac acggagcccc
100
agtggccccc atgactcctt acctgatgct gtgccagcca cacaagagat
150
gtggggacaa gttctacgac cccctgcagc actgttgcta tgatgatgcc
200
gtcgtgccct tggccaggac ccagacgtgt ggaaactgca ccttcagagt
250
ctgctttgag cagtgctgcc cctggacctt catggtgaag ctgataaacc
300
agaactgcga ctcagcccgg acctcggatg acaggctttg tcgcagtgtc
350
agctaatgga acatcagggg aacgatgact cctggattct ccttcctggg
400
tgggcctgga gaaagaggct ggtgttacct gagatctggg atgctgagtg
450
gctgtttggg ggccagagaa acacacactc aactgcccac ttcattctgt
500
gacctgtctg aggcccaccc tgcagctgcc ctgaggaggc ccacaggtcc
550
ccttctagaa ttctggacag catgagatgc gtgtgctgat gggggcccag
600
ggactctgaa ccctcctgat gacccctatg gccaacatca acccggcacc
650
accccaaggc tggctgggga acccttcacc cttctgtgag attttccatc
700
atctcaagtt ctcttctatc caggagcaaa gcacaggatc ataataaatt
750
tatgtacttt ataaatgaaa a 771
<210> 138 <211> 110 <212> PRT
<213> Homo sapiens
624
<400> 138
Met Ala Pro Arg Gly Cys Ile Val Ala Val Phe Ala Ile Phe Cys 15 10 15
Ile Ser Arg Leu Leu Cys Ser His Gly Ala Pro Val Ala Pro Met 20 25 30
Thr Pro Tyr Leu Met Leu Cys Gin Pro His Lys Arg Cys Gly Asp 35 40 45
Lys Phe Tyr Asp Pro Leu Gin His Cys Cys Tyr Asp Asp Ala Val 50 55 60
Val Pro Leu Ala Arg Thr Gin Thr Cys Gly Asn Cys Thr Phe Arg 65 70 75
Val Cys Phe Glu Gin Cys Cys Pro Trp Thr Phe Met Val Lys Leu 80 85 90
Ile Asn Gin Asn Cys Asp Ser Ala Arg Thr Ser Asp Asp Arg Leu 95 100 105
Cys Arg Ser Val Ser 110
<210> 139
<211> 2044
<212> DNA
<213> Homo sapiens
<400> 139
gggggcgggt gcctggagca cggcgctggg gccgcccgca gcgctcactc
50
gctcgcactc agtcgcggga ggcttccccg cgccggccgc gtcccgcccg
100
ctccccggca ccagaagttc ctctgcgcgt ccgacggcga catgggcgtc
150
cccacggccc tggaggccgg cagctggcgc tggggatccc tgctcttcgc
200
tctcttcctg gctgcgtccc taggtccggt ggcagccttc aaggtcgcca
250
cgccgtattc cctgtatgtc tgtcccgagg ggcagaacgt caccctcacc
300
tgcaggctct tgggccctgt ggacaaaggg cacgatgtga ccttctacaa
350
gacgtggtac cgcagctcga ggggcgaggt gcagacctgc tcagagcgcc
400
ggcccatccg caacctcacg ttccaggacc ttcacctgca ccatggaggc
450
caccaggctg ccaacaccag ccacgacctg gctcagcgcc acgggctgga
500
gtcggcctcc gaccaccatg gcaacttctc catcaccatg cgcaacctga
550
ccctgctgga tagcggcctc tactgctgcc tggtggtgga gatcaggcac
600
caccactcgg agcacagggt ccatggtgcc atggagctgc aggtgcagac
650
aggcaaagat gcaccatcca actgtgtggt gtacccatcc tcctcccagg
700
atagtgaaaa catcacggct gcagccctgg ctacgggtgc ctgcatcgta
750
ggaatcctct gcctccccct catcctgctc ctggtctaca agcaaaggca
800
625
ggcagcctcc aaccgccgtg cccaggagct ggtgcggatg gacagcaaca
850
ttcaagggat tgaaaacccc ggctttgaag cctcaccacc tgcccagggg
900
atacccgagg ccaaagtcag gcaccccctg tcctatgtgg cccagcggca
950
gccttctgag ctcctccagg tctgggcggc ccccggagac atctgctttc gtcttcttcc ggagcccagc catccctgga acccccctgt ccctgtccct
1000 1050
gactctccaa actttgaggt catctagccc agctggggga cagtgggctg
1100
ttgtggctgg gtctggggca ggtgcatttg agccagggct ggctctgtga
1150
gtggcctcct tggcctcggc cctggttccc tccctcctgc tctgggctca
1200
gatactgtga catcccagaa gcccagcccc tcaacccctc tggatgctac
1250
atggggatgc tggacggctc agcccctgtt ccaaggattt tggggtgctg
1300
agattctccc ctagagacct gaaattcacc agctacagat gccaaatgac
1350
ttacatctta agaagtctca gaacgtccag cccttcagca gctctcgttc
1400
tgagacatga gccttgggat gtggcagcat cagtgggaca agatggacac
1450
tgggccaccc tcccaggcac cagacacagg gcacggtgga gagacttctc
1500
ccccgtggcc gccttggctc ccccgttttg cccgaggctg ctcttctgtc
1550
agacttcctc tttgtaccac agtggctctg gggccaggcc tgcctgccca
1600
ctggccatcg ccaccttccc cagctgcctc ctaccagcag tttctctgaa
1650
gatctgtcaa caggttaagt caatctgggg cttccactgc ctgcattcca
1700
gtccccagag cttggtggtc ccgaaacggg aagtacatat tggggcatgg
1750
tggcctccgt gagcaaatgg tgtcttgggc aatctgaggc caggacagat
1800
gttgccccac ccactggaga tggtgctgag ggaggtgggt ggggccttct
1850
gggaaggtga gtggagaggg gcacctgccc cccgccctcc ccatccccta
1900
ctcccactgc tcagcgcggg ccattgcaag ggtgccacac aatgtcttgt
1950
ccaccctggg acacttctga gtatgaagcg ggatgctatt aaaaactaca
2000
tggggaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaga 2 044
<210> 140 <211> 311 <212> PRT
<213> Homo sapiens <400> 140
Met Gly Val Pro Thr Ala Leu Glu Ala Gly Ser Trp Arg Trp Gly 1,5 10 15
Ser Leu Leu Phe Ala Leu Phe Leu Ala Ala Ser Leu Gly Pro Val 20 25 30
Ala Ala Phe Lys Val Ala Thr Pro Tyr Ser Leu Tyr Val Cys Pro
626
40 45
Glu Gly Gin Asn Val Thr Leu Thr Cys Arg Leu Leu Gly Pro Val 50 55 60
Asp Lys Gly His Asp Val Thr Phe Tyr Lys Thr Trp Tyr Arg Ser 65 70 75
Ser Arg Gly Glu Val Gin Thr Cys Ser Glu Arg Arg Pro Ile Arg 80 85 90
Asn Leu Thr Phe Gin Asp Leu His Leu His His Gly Gly His Gin 95 100 105
Ala Ala Asn Thr Ser His Asp Leu Ala Gin Arg His Gly Leu Glu 110 115 120
Ser Ala Ser Asp His His Gly Asn Phe Ser Ile Thr Met Arg Asn 125 130 135
Leu Thr Leu Leu Asp Ser Gly Leu Tyr Cys Cys Leu Val Val Glu 140 145 150
Ile Arg His His His Ser Glu His Arg Val His Gly Ala Met Glu 155 160 165
Leu Gin Val Gin Thr Gly Lys Asp Ala Pro Ser Asn Cys Val Val 170 175 180
Tyr Pro Ser Ser Ser Gin Asp Ser Glu Asn Ile Thr Ala Ala Ala 185 190 195
Leu Ala Thr Gly Ala Cys Ile Val Gly Ile Leu Cys Leu Pro Leu 200 205 210
Ile Leu Leu Leu Val Tyr Lys Gin Arg Gin Ala Ala Ser Asn Arg 215 220 225
Arg Ala Gin Glu Leu Val Arg Met Asp Ser Asn Ile Gin Gly Ile 230 235 240
Glu Asn Pro Gly Phe Glu Ala Ser Pro Pro Ala Gin Gly Ile Pro 245 250 255
Glu Ala Lys Val Arg His Pro Leu Ser Tyr Val Ala Gin Arg Gin 260 265 270
Pro Ser Glu Ser Gly Arg His Leu Leu Ser Glu Pro Ser Thr Pro 275 280 285
Leu Ser Pro Pro Gly Pro Gly Asp Val Phe Phe Pro Ser Leu Asp 290 295 300
Pro Val Pro Asp Ser Pro Asn Phe Glu Val Ile 305 310
<210> 141 <211> 1732 <212> DNA
<213> Homo sapiens
<400> 141
50
100
150
200
250
300
350
400
450
500
550
600
650
700
750
800
850
900
950
1000
1050
1100
1150
1200
1250
1300
1350
1400
1450
1500
1550
1600
cgcgcctctc ctttccccgc ccttcctgcc ccccgctcta ctgtgcctcc accagcggcc tcctccttgc tcgagcccgc ccggcgagag ctgcgctgta ctgtccgcct gtcccaagtg aagtcctgcc tgcccatgag gctgcacaga ggctgcccag agatgaggca atggggtgag agaggcccgg catccctcgc tcgtcctgaa tactcccacg gccctgcatc tgacct'gtcc aagtgctgca gatcagttct catcggtatc gaggcctcgg ggaaactgag agaatcttcc gaaagaggca ggaacgtctt ccttctgctg gttctctttc ctcctttcct gcagcagacc gtgtcctttt tgaccctggg tgggactcgc ccagacatgt ctggcacccc cctgctcaga gtccactgcc tgtggaacct agcacaacgg ctgttcccct gggccagatc cacccctccc agtgagcaat acatcctcag gcaccccagc cacttcagac ggagaaacat gggaggtgtg ctatgcacct caccgagtac agatttgccc accaggtgtc cccaagccca acttggtgga gctcagagag acttgactca cagcacttcc cctagcccag
627
gaccttcctt cacctttctc gcccaccgct tcctggggtc cgtctccctt gaaaggatgg gctgctctgg tctgcctttt tacttggagc gggcgcccat cccagcctgt cacactccct gaccatgtac cccgcctgcc tactgcggcc actgccagac cggatgaaga gatccatgtt ccccactggc ccaagggagc aagaaagcct gcacccggcc gtgaggatgg ccctgccgtc agaggacaaa ccaaggcacc gacaacctgc gatctacctc gtgaagtacc gatcaagaaa gactgctcgc accctggagc cgtctctcca ttcttcccac gcttcctggc tgtgggttga cctcccgact ttcccgaggt ttccccctgg ccatgggaag cacaaggcct gtgagttgtt gacggagcca ctggactccg caacacggag caaccagtgt tcacaacctg tcctgctgcc ggacagtgtg ccagtgatgc ctcagcgccc aggcagcaca gtgtgcatgg ttccgtgcct ccgccaggac accccgagaa gcagaccctg gggccgggtc gtcgctttgc tggaagctgg tggcccaagg gtcaggaagc tggcccccac tgaaggtcac
628
ggccagtcca gacaaagtga ccaagacata acaaagacct aacagttgca 1650
gatatgagct gtataattgt tgttattata tattaataaa taagaagttg 17 00
cattaccctc aaaaaaaaaa aaaaaaaaaa aa 1732
<210> 142 <211> 451 <212> PRT
<213> Homo sapiens <400> 142
Met Val Pro Glu Val Arg Val Leu Ser Ser Leu Leu Gly Leu Ala 15 10 15
Leu Leu Trp Phe Pro Leu Asp Ser His Ala Arg Ala Arg Pro Asp 20 25 30
Met Phe Cys Leu Phe His Gly Lys Arg Tyr Ser Pro Gly Glu Ser 35 40 45
Trp His Pro Tyr Leu Glu Pro Gin Gly Leu Met Tyr Cys Leu Arg 50 55 60
Cys Thr Cys Ser Glu Gly Ala His Val Ser Cys Tyr Arg Leu His 65 70 75
Cys Pro Pro Val His Cys Pro Gin Pro Val Thr Glu Pro Gin Gin 80 85 90
Cys Cys Pro Lys Cys Val Glu Pro His Thr Pro Ser Gly Leu Arg 95 100 105
Ala Pro Pro Lys Ser Cys Gin His Asn Gly Thr Met Tyr Gin His 110 115 120
Gly Glu Ile Phe Ser Ala His Glu Leu Phe Pro Ser Arg Leu Pro 125 130 135
Asn Gin Cys Val Leu Cys Ser Cys Thr Glu Gly Gin Ile Tyr Cys 140 145 150
Gly Leu Thr Thr Cys Pro Glu Pro Gly Cys Pro Ala Pro Leu Pro 155 160 165
Leu Pro Asp Ser Cys Cys Gin Ala Cys Lys Asp Glu Ala Ser Glu 170 175 180
Gin Ser Asp Glu Glu Asp Ser Val Gin Ser Leu His Gly Val Arg 185 190 195
His Pro Gin Asp Pro Cys Ser Ser Asp Ala Gly Arg Lys Arg Gly 200 205 210
Pro Gly Thr Pro Ala Pro Thr Gly Leu Ser Ala Pro Leu Ser Phe 215 220 225
Ile Pro Arg His Phe Arg Pro Lys Gly Ala Gly Ser Thr Thr Val 230 235 240
Lys Ile Val Leu Lys Glu Lys His Lys Lys Ala Cys Val His Gly
629
245 250 255
Gly Lys Thr Tyr Ser His Gly Glu Val Trp His Pro Ala Phe Arg 260 265 270
Ala Phe Gly Pro Leu Pro Cys Ile Leu Cys Thr Cys Glu Asp Gly 275 280 285
Arg Gin Asp Cys Gin Arg Val Thr Cys Pro Thr Glu Tyr Pro Cys 290 295 300
Arg His Pro Glu Lys Val Ala Gly Lys Cys Cys Lys Ile Cys Pro 305 310 315
Glu Asp Lys Ala Asp Pro Gly His Ser Glu Ile Ser Ser Thr Arg 320 325 330
Cys Pro Lys Ala Pro Gly Arg Val Leu Val His Thr Ser Val Ser 335 340 345
Pro Ser Pro Asp Asn Leu Arg Arg Phe Ala Leu Glu His Glu Ala 350 355 360
Ser Asp Leu Val Glu Ile Tyr Leu Trp Lys Leu Val Lys Asp Glu 365 370 375
Glu Thr Glu Ala Gin Arg Gly Glu Val Pro Gly Pro Arg Pro His 380 385 390
Ser Gin Asn Leu Pro Leu Asp Ser Asp Gin Glu Ser Gin Glu Ala 395 400 405
Arg Leu Pro Glu Arg Gly Thr Ala Leu Pro Thr Ala Arg Trp Pro 410 415 420
Pro Arg Arg Ser Leu Glu Arg Leu Pro Ser Pro Asp Pro Gly Ala 425 430 435
Glu Gly His Gly Gin Ser Arg Gin Ser Asp Gin Asp Ile Thr Lys 440 445 450
Thr
<210> 143 <211> 693 <212> DNA
<213> Homo sapiens <400> 143
ctagcctgcg ccaaggggta gtgagaccgc gcggcaacag cttgcggctg 50 cggggagctc ccgtgggcgc tccgctggct gtgcaggcgg ccatggattc 100 cttgcggaaa atgctgatct cagtcgcaat gctgggcgca ggggctggcg 150 tgggctacgc gctcctcgtt atcgtgaccc cgggagagcg gcggaagcag 2 00 gaaatgctaa aggagatgcc actgcaggac ccaaggagca gggaggaggc 2 50 ggccaggacc cagcagctat tgctggccac tctgcaggag gcagcgacca 300
cgcaggagaa cgtggcctgg aggaagaact
630
ggatggttgg cggcgaaggc
350
ggcgccagcg ggaggtcacc gtgagaccgg acttgcctcc gtgggcgccg
400
gaccttggct tgggcgcagg aatccgaggc agcctttctc cttcgtgggc
450
ccagcggaga gtccggaccg agataccatg ccaggactct ccggggtcct
500
gtgagctgcc gtcgggtgag cacgtttccc ccaaaccctg gactgactgc
550
tttaaggtcc gcaaggcggg ccagggccga gacgcgagtc ggatgtggtg
600
aactgaaaga accaataaaa tcatgttcct ccaaaaaaaa aaaaaaaaaa
650
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 693
<210> 144 <211> 93 <212> PRT
<213> Homo sapiens
<400> 144 Met Asp Ser Leu Arg 1 5
Ala Gly Ala Gly Val 20
Gly Glu Arg Arg Lys 35
Asp Pro Arg Ser Arg 50
Leu Ala Thr Leu Gin 65
Trp Arg Lys Asn Trp 80
Arg Ser Pro
<210> 145
<211> 1883
<212> DNA
<213> Homo sapiens
<400> 145
caggagagaa ggcaccgccc ccaccccgcc tccaaagcta accctcgggc
50
ttgaggggaa gaggctgact gtacgttcct tctactctgg caccactctc
100
caggctgcca tggggcccag cacccctctc ctcatcttgt tccttttgtc
150
atggtcggga cccctccaag gacagcagca ccaccttgtg gagtacatgg
200
aacgccgact agctgcttta gaggaacggc tggcccagtg ccaggaccag
250
agtagtcggc atgctgctga gctgcgggac ttcaagaaca agatgctgcc
300
actgctggag gtggcagaga aggagcggga ggcactcaga actgaggccg
350
Lys Met Leu Ile Ser Val Ala Met Leu Gly 10 15
Gly Tyr Ala Leu Leu Val Ile Val Thr Pro 25 30
Gin Glu Met Leu Lys Glu Met Pro Leu Gin 40 45
Glu Glu Ala Ala Arg Thr Gin Gin Leu Leu 55 60
Glu Ala Ala Thr Thr Gin Glu Asn Val Ala 70 75
Met Val Gly Gly Glu Gly Gly Ala Ser Gly 85 90
631
acaccatctc cgggagagtg gatcgtctgg agcgggaggt agactatctg
400
gagacccaga acccagctct gccctgtgta gagtttgatg agaaggtgac
450
tggaggccct gggaccaaag gcaagggaag aaggaatgag aagtacgata
500
tggtgacaga ctgtggctac acaatctctc aagtgagatc aatgaagatt
550
ctgaagcgat ttggtggccc agctggtcta tggaccaagg atccactggg
600
gcaaacagag aagatctacg tgttagatgg gacacagaat gacacagcct
650
ttgtcttccc aaggctgcgt gacttcaccc ttgccatggc tgcccggaaa
700
gcttcccgag tccgggtgcc cttcccctgg gtaggcacag ggcagctggt
750
atatggtggc tttctttatt ttgctcggag gcctcctgga agacctggtg
800
gaggtggtga gatggagaac actttgcagc taatcaaatt ccacctggca
850
aaccgaacag tggtggacag ctcagtattc ccagcagagg ggctgatccc
900
cccctacggc ttgacagcag acacctacat cgacctggta gctgatgagg
950
aaggtctttg ggctgtctat gccacccggg aggatgacag gcacttgtgt
1000
ctggccaagt tagatccaca gacactggac acagagcagc agtgggacac
1050
accatgtccc agagagaatg ctgaggctgc ctttgtcatc tgtgggaccc
1100
tctatgtcgt ctataacacc cgtcctgcca gtcgggcccg catccagtgc
1150
tcctttgatg ccagcggcac cctgacccct gaacgggcag cactccctta
1200
ttttccccgc agatatggtg cccatgccag cctccgctat aacccccgag
1250
aacgccagct ctatgcctgg gatgatggct accagattgt ctataagctg
1300
gagatgagga agaaagagga ggaggtttga ggagctagcc ttgttttttg
1350
catctttctc actcccatac atttatatta tatccccact aaatttcttg
1400
ttcctcattc ttcaaatgtg ggccagttgt ggctcaaatc ctctatattt
1450
ttagccaatg gcaatcaaat tctttcagct cctttgtttc atacggaact
1500
ccagatcctg agtaatcctt ttagagcccg aagagtcaaa accctcaatg
1550
ttccctcctg ctctcctgcc ccatgtcaac aaatttcagg ctaaggatgc
1600
cccagaccca gggctctaac cttgtatgcg ggcaggccca gggagcaggc
1650
agcagtgttc ttcccctcag agtgacttgg ggagggagaa ataggaggag
1700
acgtccagct ctgtcctctc ttcctcactc ctcccttcag tgtcctgagg
1750
aacaggactt tctccacatt gttttgtatt gcaacatttt gcattaaaag
1800
gaaaatccac aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
1850
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1883
632
<210> 146 <211> 406 <212> PRT
<213> Homo sapiens <400> 146
Met Gly Pro Ser Thr Pro Leu Leu Ile Leu Phe Leu Leu Ser Trp 15 10 15
Ser Gly Pro Leu Gin Gly Gin Gin His His Leu Val Glu Tyr Met 20 25 30
Glu Arg Arg Leu Ala Ala Leu Glu Glu Arg Leu Ala Gin Cys Gin 35 40 45
Asp Gin Ser Ser Arg His Ala Ala Glu Leu Arg Asp Phe Lys Asn 50 55 60
Lys Met Leu Pro Leu Leu Glu Val Ala Glu Lys Glu Arg Glu Ala 65 70 75
Leu Arg Thr Glu Ala Asp Thr Ile Ser Gly Arg Val Asp Arg Leu 80 85 90
Glu Arg Glu Val Asp Tyr Leu Glu Thr Gin Asn Pro Ala Leu Pro 95 100 105
Cys Val Glu Phe Asp Glu Lys Val Thr Gly Gly Pro Gly Thr Lys 110 115 120
Gly Lys Gly Arg Arg Asn Glu Lys Tyr Asp Met Val Thr Asp Cys 125 130 135
Gly Tyr Thr Ile Ser Gin Val Arg Ser Met Lys Ile Leu Lys Arg 140 145 150
Phe Gly Gly Pro Ala Gly Leu Trp Thr Lys Asp Pro Leu Gly Gin 155 160 165
Thr Glu Lys Ile Tyr Val Leu Asp Gly Thr Gin Asn Asp Thr Ala 170 175 180
Phe Val Phe Pro Arg Leu Arg Asp Phe Thr Leu Ala Met Ala Ala 185 190 195
Arg Lys Ala Ser Arg Val Arg Val Pro Phe Pro Trp Val Gly Thr 200 205 210
Gly Gin Leu Val Tyr Gly Gly Phe Leu Tyr Phe Ala Arg Arg Pro 215 220 225
Pro Gly Arg Pro Gly Gly Gly Gly Glu Met Glu Asn Thr Leu Gin 230 235 240
Leu Ile Lys Phe His Leu Ala Asn Arg Thr Val Val Asp Ser Ser 245 250 255
Val Phe Pro Ala Glu Gly Leu lie Pro Pro Tyr Gly Leu Thr Ala 260 265 270
Asp Thr Tyr Ile Asp Leu Val Ala Asp Glu Glu Gly Leu Trp Ala
275
633
280
285
Val Tyr Ala Thr Arg Glu Asp Asp Arg His Leu Cys Leu Ala Lys 290 295 300
Leu Asp Pro Gin Thr Leu Asp Thr Glu Gin Gin Trp Asp Thr Pro 305 310 315
Cys Pro Arg Glu Asn Ala Glu Ala Ala Phe Val Ile Cys Gly Thr 320 325 330
Leu Tyr Val Val Tyr Asn Thr Arg Pro Ala Ser Arg Ala Arg Ile 335 340 345
Gin Cys Ser Phe Asp Ala Ser Gly Thr Leu Thr Pro Glu Arg Ala 350 355 360
Ala Leu Pro Tyr Phe Pro Arg Arg Tyr Gly Ala His Ala Ser Leu 365 370 375
Arg Tyr Asn Pro Arg Glu Arg Gin Leu Tyr Ala Trp Asp Asp Gly 380 385 390
Tyr Gin Ile Val Tyr Lys Leu Glu Met Arg Lys Lys Glu Glu Glu 395 400 405
Val
<210> 147
<211> 2052
<212> DNA
<213> Homo sapiens
<400> 147
gacagctgtg tctcgatgga gtagactctc agaacagcgc agtttgccct
50
ccgctcacgc agagcctctc cgtggcttcc gcaccttgag cattaggcca
100
gttctcctct tctctctaat ccatccgtca cctctcctgt catccgtttc
150
catgccgtga ggtccattca cagaacacat ccatggctct catgctcagt
200
ttggttctga gtctcctcaa gctgggatca gggcagtggc aggtgtttgg
250
gccagacaag cctgtccagg ccttggtggg ggaggacgca gcattctcct
300
gtttcctgtc tcctaagacc aatgcagagg ccatggaagt gcggttcttc
350
aggggccagt tctctagcgt ggtccacctc tacagggacg ggaaggacca
400
gccatttatg cagatgccac agtatcaagg caggacaaaa ctggtgaagg
450
attctattgc ggaggggcgc atctctctga ggctggaaaa cattactgtg
500
ttggatgctg gcctctatgg gtgcaggatt agttcccagt cttactacca
550
gaaggccatc tgggagctac aggtgtcagc actgggctca gttcctctca
600
tttccatcac gggatatgtt gatagagaca tccagctact ctgtcagtcc
650
tcgggctggt tcccccggcc cacagcgaag tggaaaggtc cacaaggaca
700
634
ggatttgtcc acagactcca ggacaaacag agacatgcat ggcctgtttg
750
atgtggagat ctctctgacc gtccaagaga acgccgggag catatcctgt
800
tccatgcggc atgctcatct gagccgagag gtggaatcca gggtacagat
850
aggagatacc tttttcgagc ctatatcgtg gcacctggct accaaagtac
900
tgggaatact ctgctgtggc ctattttttg gcattgttgg actgaagatt
950
ttcttctcca aattccagtg gaaaatccag gcggaactgg actggagaag
1000
aaagcacgga caggcagaat tgagagacgc ccggaaacac gcagtggagg
1050
tgactctgga aaaactgtaa tccagagacg cccatagaaa gctcacccga agctccccag agctctgcgt gaggtgcctc ttctgatctg actctgagaa
1100 1150
gagatttaca aggaagagtg tggtggcttc tcagagtttc caagcaggga
1200
aacattactg ggaggtggac ggaggacaca ataaaaggtg gcgcgtggga
1250
gtgtgccggg atgatgtgga caggaggaag gagtacgtga ctttgtctcc
1300
cgatcatggg tactgggtcc teagactgaa tggagaacat ttgtatttca
1350
cattaaatcc ccgttttatc agcgtcttcc ccaggacccc acctacaaaa
1400
ataggggtct tcctggacta tgagtgtggg accatctcct tcttcaacat
1450
aaatgaccag tcccttattt ataccctgac atgtcggttt gaaggcttat
1500
tgaggcccta cattgagtat ccgtcctata atgagcaaaa tggaactccc
1550
atagtcatct gcccagtcac ccaggaatca gagaaagagg cctcttggca
1600
aagggcctct gcaatcccag agacaagcaa cagtgagtcc tcctcacagg
1650
caaccacgcc cttcctcccc aggggtgaaa tgtaggatga atcacatccc
1700
acattcttct ttagggatat taaggtctct ctcccagatc caaagtcccg
1750
cagcagccgg ccaaggtggc ttccagatga agggggactg gcctgtccac
1800
atgggagtca ggtgtcatgg ctgccctgag ctgggaggga agaaggctga
1850
cattacattt agtttgctct cactccatct ggctaagtga tcttgaaata
1900
ccacctctca ggtgaagaac cgtcaggaat tcccatctca caggctgtgg
1950
tgtagattaa gtagacaagg aatgtgaata atgcttagat cttattgatg
2000
acagagtgta tcctaatggt ttgttcatta tattacactt tcagtaaaaa
2050
aa 2052
<210> 148 <211> 500 <212> PRT
<213> Homo sapiens <400> 148
635
Met Ala Leu Met Leu Ser Leu Val Leu Ser Leu Leu Lys Leu Gly 15 10 15
Ser Gly Gin Trp Gin Val Phe Gly Pro Asp Lys Pro Val Gin Ala 20 25 30
Leu Val Gly Glu Asp Ala Ala Phe Ser Cys Phe Leu Ser Pro Lys 35 40 45
Thr Asn Ala Glu Ala Met Glu Val Arg Phe Phe Arg Gly Gin Phe 50 55 60
Ser Ser* Val Val His Leu Tyr Arg Asp Gly Lys Asp Gin Pro Phe 65 70 75
Met Gin Met Pro Gin Tyr Gin Gly Arg Thr Lys Leu Val Lys Asp
80 85 90
Ser Ile Ala Glu Gly Arg Ile Ser Leu Arg Leu Glu Asn Ile Thr
95 100 105
Val Leu Asp Ala Gly Leu Tyr Gly Cys Arg Ile Ser Ser Gin Ser 110 115 120
Tyr Tyr Gin Lys Ala Ile Trp Glu Leu Gin Val Ser Ala Leu Gly 125 130 135
Ser Val Pro Leu Ile Ser Ile Thr Gly Tyr Val Asp Arg Asp Ile 140 145 150
Gin Leu Leu Cys Gin Ser Ser Gly Trp Phe Pro Arg Pro Thr Ala 155 160 165
Lys Trp Lys Gly Pro Gin Gly Gin Asp Leu Ser Thr Asp Ser Arg 170 175 180
Thr Asn Arg Asp Met His Gly Leu Phe Asp Val Glu Ile Ser Leu 185 190 195
Thr Val Gin Glu Asn Ala Gly Ser Ile Ser Cys Ser Met Arg His 200 205 210
Ala His Leu Ser Arg Glu Val Glu Ser Arg Val Gin Ile Gly Asp 215 220 225
Thr Phe Phe Glu Pro Ile Ser Trp His Leu Ala Thr Lys Val Leu 230 235 240
Gly Ile Leu Cys Cys Gly Leu Phe Phe Gly Ile Val Gly Leu Lys 245 250 255
Ile Phe Phe Ser Lys Phe Gin Trp Lys Ile Gin Ala Glu Leu Asp 260 265 270
Trp Arg Arg Lys His Gly Gin Ala Glu Leu Arg Asp Ala Arg Lys 275 280 285
His Ala Val Glu Val Thr Leu Asp Pro Glu Thr Ala His Pro Lys 290 295 300
Leu Cys Val Ser Asp Leu Lys Thr Val Thr His Arg Lys Ala Pro 305 310 315
636
Gin Glu Val Pro His Ser Glu Lys Arg Phe Thr Arg Lys Ser Val 320 325 330
Val Ala Ser Gin Ser Phe Gin Ala Gly Lys His Tyr Trp Glu Val 335 340 345
Asp Gly Gly His Asn Lys Arg Trp Arg Val Gly Val Cys Arg Asp 350 355 360
Asp Val Asp Arg Arg Lys Glu Tyr Val Thr Leu Ser Pro Asp His 365 370 375
Gly Tyr Trp Val Leu Arg Leu Asn Gly Glu His Leu Tyr Phe Thr 380 385 390
Leu Asn Pro Arg Phe lie Ser Val Phe Pro Arg Thr Pro Pro Thr 395 400 405
Lys Ile Gly Val Phe Leu Asp Tyr Glu Cys Gly Thr Ile Ser Phe 410 415 420
Phe Asn Ile Asn Asp Gin Ser Leu Ile Tyr Thr Leu Thr Cys Arg 425 430 435
Phe Glu Gly Leu Leu Arg Pro Tyr Ile Glu Tyr Pro Ser Tyr Asn 440 445 450
Glu Gin Asn Gly Thr Pro Ile Val Ile Cys Pro Val Thr Gin Glu 455 460 465
Ser Glu Lys Glu Ala Ser Trp Gin Arg Ala Ser Ala Ile Pro Glu 470 475 480
Thr Ser Asn Ser Glu Ser Ser Ser Gin Ala Thr Thr Pro Phe Leu 485 490 495
Pro Arg Gly Glu Met 500
<210> 149
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 149
gcgtggtcca cctctacagg gacg 24
<210> 150
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 150
ggaactgacc cagtgctgac acc 23
<210> 151
637
<211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 151
gcagatgcca cagtatcaag gcaggacaaa actggtgaag gattc 45
<210> 152 <211> 2294 <212> DNA
<213> Homo sapiens <400> 152
gcgatggtgc gcccggtggc ggtggcggcg gcggttgcgg aggcttcctt 50
ggtcggattg caacgaggag aagatgactg accaaccgac tggctgaatg 100 aatgaatggc ggagccgagc gcgccatgag gagcctgccg agcctgggcg 150
gcctcgccct gttgtgctgc gccgccgccg ccgccgccgt cgcctcagcc 200
gcctcggcgg ggaatgtcac cggtggcggc ggggccgcgg ggcaggtgga 2 50
cgcgtcgccg ggccccgggt tgcggggcga gcccagccac cccttcccta 300
gggcgacggc tcccacggcc caggccccga ggaccgggcc cccgcgcgcc 3 50
accgtccacc gacccctggc tgcgacttct ccagcccagt ccccggagac 400
cacccctctt tgggcgactg ctggaccctc ttccaccacc tttcaggcgc 450
cgctcggccc ctcgccgacc acccctccgg cggcggaacg cacttcgacc 500
acctctcagg cgccgaccag acccgcgccg accacccttt cgacgaccac 550
tggcccggcg ccgaccaccc ctgtagcgac caccgtaccg gcgcccacga 600
ctccccggac cccgaccccc gatctcccca gcagcagcaa cagcagcgtc 650
ctccccaccc cacctgccac cgaggccccc tcttcgcctc ctccagagta 700
tgtatgtaac tgctctgtgg ttggaagcct gaatgtgaat cgctgcaacc 750
agaccacagg gcagtgtgag tgtcggccag gttatcaggg gcttcactgt 800
gaaacctgca aagagggctt ttacctaaat tacacttctg ggctctgtca 850
gccatgtgac tgtagtccac atggagctct cagcataccg tgcaacaggt 900
aagcaacaga gggtggaact gaagtttatt ttattttagc aagggaaaaa 950
aaaaggctgc tactctcaag gaccatactg gtttaaacaa aggaggatga 1000
gggtcataga tttacaaaat attttatata cttttattct cttactttat 1050
atgttatatt taatgtcagg atttaaaaac atctaattta ctgatttagt 1100
tcttcaaaag cactagagtc gccaattttt ctctgggata atttctgtaa 1150
atttcatggg aaaaaattat tgaagaataa atctgctttc tggaagggct 12 00
638
ttcaggcatg aaacctgcta ggaggtttag aaatgttctt atgtttatta
1250
atataccatt ggagtttgag gaaatttgtt gtttggttta tttttctctc
1300
taatcaaaat tctacatttg tttctttgga catctaaagc ttaacctggg
1350
ggtaccctaa tttatttaac tagtggtaag tagactggtt ttactctatt
1400
taccagtaca tttttgagac caaaagtaga ttaagcagga attatcttta
1450
aactattatg ttatttggag gtaatttaat ctagtggaat aatgtactgt
1500
tatctaagca tttgccttgt actgcactga aagtaattat tctttgacct
1550
tatgtgaggc acttggcttt ttgtggaccc caagtcaaaa aactgaagag
1600
acagtattaa ataatgaaaa aaataatgac aggttatact cagtgtaacc
1650
tgggtataac agtaatttcc ccaagatctg tttcactgag ctgccactta cttgtttctt cgagctgtgt ctcaaggttg tccttgggca ttgtgaagat
1700 1750
taaatgagtt gatatatata aaatgcctag cacatgtcac tcaataaatt
1800
ctggtttgtt ttaatttcaa aggaatatta tggactgaaa tgagagaaca
1850
tgttttaaga acttttagct ccttgacaaa gaagtgcttt atactttagc
1900
actaaatatt ttaaatgctt tataaatgat attatactgt tatggaatat
1950
tgtatcatat tgtagtttat taaaaatgta gaagaggctg ggcgcggtgg
2000
ctcacgcctg taatcctagc actttgggag gccaaggcgg gtggatcact
2050
tgaggccagg agttctagat gagcctggcc agcacagtga aaccccgtct
2100
ctactaaaaa tacaaacaaa ttagctgggc gtggtggcac acacctgtag
2150
tcccagctac tcgggaggct gaggcaggag aatcggttga acccgggagg
2200
tggaggttgc agtgagctga gatcgcgcca ctgcactcca gcctggtgag
2250
agagggagac tctgtcttaa aaaaaaaaaa aaaaaaaaaa aaaa 2294
<210> 153 <211> 258 <212> PRT
<213> Homo sapiens <400> 153
Met Arg Ser Leu Pro Ser Leu Gly Gly Leu Ala Leu Leu Cys Cys 15 10 15
Ala Ala Ala Ala Ala Ala Val Ala Ser Ala Ala Ser Ala Gly Asn 20 25 30
Val Thr Gly Gly Gly Gly Ala Ala Gly Gin Val Asp Ala Ser Pro 35 40 45
Gly Pro Gly Leu Arg Gly Glu Pro Ser His Pro Phe Pro Arg Ala 50 55 60
639
Thr Ala Pro Thr Ala Gin Ala Pro Arg Thr Gly Pro Pro Arg Ala 65 70 75
Thr Val His Arg Pro Leu Ala Ala Thr Ser Pro Ala Gin Ser Pro 80 85 90
Glu Thr Thr Pro Leu Trp Ala Thr Ala Gly Pro Ser Ser Thr Thr 95 100 105
Phe Gin Ala Pro Leu Gly Pro Ser Pro Thr Thr Pro Pro Ala Ala 110 115 120
Glu Arg Thr Ser Thr Thr Ser Gin Ala Pro Thr Arg Pro Ala Pro 125 130 135
Thr Thr Leu Ser Thr Thr Thr Gly Pro Ala Pro Thr Thr Pro Val 140 145 150
Ala Thr Thr Val Pro Ala Pro Thr Thr Pro Arg Thr Pro Thr Pro 155 160 165
Asp Leu Pro Ser Ser Ser Asn Ser Ser Val Leu Pro Thr Pro Pro 170 175 180
Ala Thr Glu Ala Pro Ser Ser Pro Pro Pro Glu Tyr Val Cys Asn 185 190 195
Cys Ser Val Val Gly Ser Leu Asn Val Asn Arg Cys Asn Gin Thr 200 205 210
Thr Gly Gin Cys Glu Cys Arg Pro Gly Tyr Gin Gly Leu His Cys 215 220 225
Glu Thr Cys Lys Glu Gly Phe Tyr Leu Asn Tyr Thr Ser Gly Leu 230 235 240
Cys Gin Pro Cys Asp Cys Ser Pro His Gly Ala Leu Ser Ile Pro 245 250 255
Cys Asn Arg
<210> 154 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 154 aactgctctg tggttggaag cctg 24
<210> 155 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 155
640
cagtcacatg gctgacagac ccac 24
<210> 156 <211> 38 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 156
aggttatcag gggcttcact gtgaaacctg caaagagg 38
<210> 157 <211> 689 <212> DNA
<213> Homo sapiens <400> 157
tgcggcgcag tgtagacctg ggaggatggg cggcctgctg ctggctgctt
50
ttctggcttt ctggaccctg ggtctcggtg agcagcttct cccagggccc tgggccctgg aggccgtgtg tacgtgcttg gttgggaaga cggtggcctc
100 150
ccgggaaaag ggctttgcca tggagaagga catgaagaac gtcgtggggg
200
tggtggtgac cctcactcca gaaaacaacc tgcggacgct gtcctctcag
250
cacgggctgg gagggtgtga ccagagtgtc atggacctga taaagcgaaa
300
ctccggatgg gtgtttgaga atccctcaat aggcgtgctg gagctctggg
350
tgctggccac caacttcaga gactatgcca tcatcttcac tcagctggag
400
ttcggggacg agcccttcaa caccgtggag ctgtacagtc tgacggagac
450
agccagccag gaggccatgg ggctcttcac caagtggagc aggagcctgg
500
gcttcctgtc acagtagcag gcccagctgc agaaggacct cacctgtgct
550
cacaagatcc ttctgtgagt gctgcgtccc cagtagggat ggcgcccaca
600
gggtcctgtg acctcggcca gtgtccaccc acctcgctca gcggctcccg
650
gggcccagca ccagctcaga ataaagcgat tccacagca i
689
<210> 158 <211> 163 <212> PRT
<213> Homo sapiens
<400> 158 Met Gly Gly Leu 1
Pro Arg Ala Gin
Leu Leu Gly Pro
Gly Phe Ala Met
Leu Leu Ala Ala Phe Leu Ala Leu Val Ser Val 5 10 15
Ala Val Trp Leu Gly Arg Leu Asp Pro Glu Gin 20 25 30
Trp Tyr Val Leu Ala Val Ala Ser Arg Glu Lys 35 40 45
Glu Lys Asp Met Lys Asn Val Val Gly Val Val
50
Val Thr Leu Thr Pro Glu 65
His Gly Leu Gly Gly Cys 80
Arg Asn Ser Gly Trp Val 95
Glu Leu Trp Val Leu Ala 110
Phe Thr Gin Leu Glu Phe 125
Leu Tyr Ser Leu Thr Glu 140
Phe Thr Lys Trp Ser Arg 155
<210> 159 <211> 1665 <212> DNA
<213> Homo sapiens <400> 159
aacagacgtt ccctcgcggc cctggcacct ctaaccccag acatgctgct
50
gctgctgctg cccctgctct gggggaggga gagggcggaa ggacagacaa
100
gtaaactgct gacgatgcag agttccgtga cggtgcagga aggcctgtgt
150
gtccatgtgc cctgctcctt ctcctacccc tcgcatggct ggatttaccc
200
tggcccagta gttcatggct actggttccg ggaaggggcc aatacagacc
250
aggatgctcc agtggccaca aacaacccag ctcgggcagt gtgggaggag
300
actcgggacc gattccacct ccttggggac ccacatacca agaattgcac
350
cctgagcatc agagatgcca gaagaagtga tgcggggaga tacttctttc
400
gtatggagaa aggaagtata aaatggaatt ataaacatca ccggctctct
450
gtgaatgtga cagccttgac ccacaggccc aacatcctca tcccaggcac
500
cctggagtcc ggctgccccc agaatctgac ctgctctgtg ccctgggcct
550
gtgagcaggg gacaccccct atgatctcct ggatagggac ctccgtgtcc
600
cccctggacc cctccaccac ccgctcctcg gtgctcaccc tcatcccaca
650
gccccaggac catggcacca gcctcacctg tcaggtgacc ttccctgggg
700
ccagcgtgac cacgaacaag accgtccatc tcaacgtgtc ctacccgcct
750
cagaacttga ccatgactgt cttccaagga gacggcacag tatccacagt
800
cttgggaaat ggctcatctc tgtcactccc agagggccag tctctgcgcc
850
641
55 60
Asn Asn Leu Arg Thr Leu Ser Ser Gin 70 75
Asp Gin Ser Val Met Asp Leu Ile Lys 85 90
Phe Glu Asn Pro Ser Ile Gly Val Leu 100 105
Thr Asn Phe Arg Asp Tyr Ala Ile Ile 115 120
Gly Asp Glu Pro Phe Asn Thr Val Glu 130 135
Thr Ala Ser Gin Glu Ala Met Gly Leu 145 150
Ser Leu Gly Phe Leu Ser Gin 160
642
tggtctgtgc agttgatgca gttgacagca atccccctgc caggctgagc 900
ctgagctgga gaggcctgac cctgtgcccc tcacagccct caaacccggg 950
ggtgctggag ctgccttggg tgcacctgag ggatgcagct gaattcacct 1000
gcagagctca gaaccctctc ggctctcagc aggtctacct gaacgtctcc 1050
ctgcagagca aagccacatc aggagtgact cagggggtgg tcgggggagc 1100
tggagccaca gccctggtct tcctgtcctt ctgcgtcatc ttcgttgtag 1150
tgaggtcctg caggaagaaa tcggcaaggc cagcagcggg cgtgggagat 12 00
acgggcatag aggatgcaaa cgctgtcagg ggttcagcct ctcaggggcc 1250
cctgactgaa ccttgggcag aagacagtcc cccagaccag cctcccccag 1300
cttctgcccg ctcctcagtg ggggaaggag agctccagta tgcatccctc 13 50
agcttccaga tggtgaagcc ttgggactcg cggggacagg aggccactga 1400
caccgagtac tcggagatca agatccacag atgagaaact gcagagactc 1450 accctgattg agggatcaca gcccctccag gcaagggaga agtcagaggc 1500
tgattcttgt agaattaaca gccctcaacg tgatgagcta tgataacact 1550
atgaattatg tgcagagtga aaagcacaca ggctttagag tcaaagtatc 1600
tcaaacctga atccacactg tgccctccct tttatttttt taactaaaag 1650
acagacaaat tccta 1665
<210> 160 <211> 463 <212> PRT
<213> Homo sapiens <400> 160
Met Leu Leu Leu Leu Leu Pro Leu Leu Trp Gly Arg Glu Arg Ala 15 10 15
Glu Gly Gin Thr Ser Lys Leu Leu Thr Met Gin Ser Ser Val Thr 20 25 30
Val Gin Glu Gly Leu Cys Val His Val Pro Cys Ser Phe Ser Tyr 35 40 45
Pro Ser His Gly Trp Ile Tyr Pro Gly Pro Val Val His Gly Tyr 50 55 60
Trp Phe Arg Glu Gly Ala Asn Thr Asp Gin Asp Ala Pro Val Ala 65 70 75
Thr Asn Asn Pro Ala Arg Ala Val Trp Glu Glu Thr Arg Asp Arg 80 85 90
Phe His Leu Leu Gly Asp Pro His Thr Lys Asn Cys Thr Leu Ser 95 100 105
Ile Arg Asp Ala Arg Arg Ser Asp Ala Gly Arg Tyr Phe Phe Arg
643
110 115 120
Met Glu Lys Gly Ser Ile Lys Trp Asn Tyr Lys His His Arg Leu 125 130 135
Ser Val Asn Val Thr Ala Leu Thr His Arg Pro Asn Ile Leu Ile 140 145 150
Pro Gly Thr Leu Glu Ser Gly Cys Pro Gin Asn Leu Thr Cys Ser 155 160 165
Val Pro Trp Ala Cys Glu Gin Gly Thr Pro Pro Met Ile Ser Trp 170 175 180
Ile Gly Thr Ser Val Ser Pro Leu Asp Pro Ser Thr Thr Arg Ser 185 190 195
Ser Val Leu Thr Leu Ile Pro Gin Pro Gin Asp His Gly Thr Ser 200 205 210
Leu Thr Cys Gin Val Thr Phe Pro Gly Ala Ser Val Thr Thr Asn 215 220 225
Lys Thr Val His Leu Asn Val Ser Tyr Pro Pro Gin Asn Leu Thr
230 235 240
Met Thr Val Phe Gin Gly Asp Gly Thr Val Ser Thr Val Leu Gly
245 250 255
Asn Gly Ser Ser Leu Ser Leu Pro Glu Gly Gin Ser Leu Arg Leu 260 265 270
Val Cys Ala Val Asp Ala Val Asp Ser Asn Pro Pro Ala Arg Leu 275 280 285
Ser Leu Ser Trp Arg Gly Leu Thr Leu Cys Pro Ser Gin Pro Ser 290 295 300
Asn Pro Gly Val Leu Glu Leu Pro Trp Val His Leu Arg Asp Ala 305 310 315
Ala Glu Phe Thr Cys Arg Ala Gin Asn Pro Leu Gly Ser Gin Gin 320 325 330
Val Tyr Leu Asn Val Ser Leu Gin Ser Lys Ala Thr Ser Gly Val 335 340 345
Thr Gin Gly Val Val Gly Gly Ala Gly Ala Thr Ala Leu Val Phe 350 355 360
Leu Ser Phe Cys Val Ile Phe Val Val Val Arg Ser Cys Arg Lys 365 370 375
Lys Ser Ala Arg Pro Ala Ala Gly Val Gly Asp Thr Gly Ile Glu 380 385 390
Asp Ala Asn Ala Val Arg Gly Ser Ala Ser Gin Gly Pro Leu Thr 395 400 405
Glu Pro Trp Ala Glu Asp Ser Pro Pro Asp Gin Pro Pro Pro Ala 410 415 420
Ser Ala Arg Ser Ser Val Gly Glu Gly Glu Leu Gin Tyr Ala Ser
644
425 430 435
Leu Ser Phe Gin Met Val Lys Pro Trp Asp Ser Arg Gly Gin Glu 440 445 450
Ala Thr Asp Thr Glu Tyr Ser Glu Ile Lys Ile His Arg 455 460
<210> 161 <211> 739 <212> DNA <213> Homo sapiens
<400> 161
gacgcccagt gacctgccga ggtcggcagc acagagctct ggagatgaag
50
accctgttcc tgggtgtcac gctcggcctg gccgctgccc tgtccttcac
100
cctggaggag gaggatatca cagggacctg gtacgtgaag gccatggtgg
150
tcgataagga ctttccggag gacaggaggc ccaggaaggt gtccccagtg
200
aaggtgacag ccctgggcgg tgggaagttg gaagccacgt tcaccttcat
250
gagggaggat cggtgcatcc agaagaaaat cctgatgcgg aagacggagg
300
agcctggcaa atacagcgcc tatgggggca ggaagctcat gtacctgcag
350
gagctgccca ggagggacca ctacatcttt tactgcaaag accagcacca
400
tgggggcctg ctccacatgg gaaagcttgt gggtaggaat tctgatacca
450
accgggaggc cctggaagaa tttaagaaat tggtgcagcg caagggactc
500
tcggaggagg acattttcac gcccctgcag acgggaagct gcgttcccga
550
acactaggca gcccccgggt ctgcacctcc agagcccacc ctaccaccag
600
acacagagcc cggaccacct ggacctaccc tccagccatg acccttccct
650
gctcccaccc acctgactcc aaataaagtc cttttccccc aaaaaaaaaa
700
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 739
<210> 162 <211> 170 <212> PRT
<213> Homo sapiens <400> 162
Met Lys Thr Leu Phe Leu Gly Val Thr Leu Gly Leu Ala Ala Ala 1 5 10 15
Leu Ser Phe Thr Leu Glu Glu Glu Asp Ile Thr Gly Thr Trp Tyr 20 25 30
Val Lys Ala Met Val Val Asp Lys Asp Phe Pro Glu Asp Arg Arg 35 40 45
Pro Arg Lys Val Ser Pro Val Lys Val Thr Ala Leu Gly Gly Gly 50 55 60
645
Lys Leu Glu Ala Thr Phe Thr Phe Met Arg Glu Asp Arg Cys Ile 65 70 75
Gin Lys Lys Ile Leu Met Arg Lys Thr Glu Glu Pro Gly Lys Tyr 80 85 . 90
Ser Ala Tyr Gly Gly Arg Lys Leu Met Tyr Leu Gin Glu Leu Pro 95 100 105
Arg Arg Asp His Tyr Ile Phe Tyr Cys Lys Asp Gin His His Gly 110 115 120
Gly Leu Leu His Met Gly Lys Leu Val Gly Arg Asn Ser Asp Thr 125 130 135
Asn Arg Glu Ala Leu Glu Glu Phe Lys Lys Leu Val Gin Arg Lys 140 145 150
Gly Leu Ser Glu Glu Asp Ile Phe Thr Pro Leu Gin Thr Gly Ser 155 160 165
Cys Val Pro Glu His 170
<210> 163 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 163 ggagatgaag accctgttcc tg 22
<210> 164 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 164 ggagatgaag accctgttcc tgggtg 26
<210> 165 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 165 gtcctccgga aagtccttat c 21
<210> 166 <211> 25 <212> DNA
<213> Artificial Sequence
646
<220>
<223> Synthetic oligonucleotide probe
<400> 166 gcctagtgtt cgggaacgca gcttc 25
<210> 167 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 167
cagggacctg gtacgtgaag gccatggtgg tcgataagga ctttccggag 50
<210> 168 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 168
ctgtccttca ccctggagga ggaggatatc acagggacct ggtac 45
<210> 169
<211> 1204
<212> DNA
<213> Homo sapiens
<400> 169
gttccgcaga tgcagaggtt gaggtggctg cgggactgga agtcatcggg
50
cagaggtctc acagcagcca aggaacctgg ggcccgctcc tcccccctcc
100
aggccatgag gattctgcag ttaatcctgc ttgctctggc aacagggctt
150
gtagggggag agaccaggat catcaagggg ttcgagtgca agcctcactc
200
ccagccctgg caggcagccc tgttcgagaa gacgcggcta ctctgtgggg
250
cgacgctcat cgcccccaga tggctcctga cagcagccca ctgcctcaag
300
ccccgctaca tagttcacct ggggcagcac aacctccaga aggaggaggg
350
ctgtgagcag acccggacag ccactgagtc cttcccccac cccggcttca
400
acaacagcct ccccaacaaa gaccaccgca atgacatcat gctggtgaag
450
atggcatcgc cagtctccat cacctgggct gtgcgacccc tcaccctctc
500
ctcacgctgt gtcactgctg gcaccagctg cctcatttcc ggctggggca
550
gcacgtccag cccccagtta cgcctgcctc acaccttgcg atgcgccaac
600
atcaccatca ttgagcacca gaagtgtgag aacgcctacc ccggcaacat
650
cacagacacc atggtgtgtg ccagcgtgca ggaagggggc aaggactcct
700
647
gccagggtga ctccgggggc cctctggtct gtaaccagtc tcttcaaggc 750
attatctcct ggggccagga tccgtgtgcg atcacccgaa agcctggtgt 800
ctacacgaaa gtctgcaaat atgtggactg gatccaggag acgatgaaga 850
acaattagac tggacccacc caccacagcc catcaccctc catttccact 900
tggtgtttgg ttcctgttca ctctgttaat aagaaaccct aagccaagac 950
cctctacgaa cattctttgg gcctcctgga ctacaggaga tgctgtcact 1000
taataatcaa cctggggttc gaaatcagtg agacctggat tcaaattctg 1050
ccttgaaata ttgtgactct gggaatgaca acacctggtt tgttctctgt 1100
tgtatcccca gccccaaaga cagctcctgg ccatatatca aggtttcaat 1150
aaatatttgc taaatgaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 12 00
aaaa 12 04
<210> 170 <211> 250 <212> PRT <213> Homo sapiens
<400> 170
Met Arg Ile Leu Gin Leu Ile Leu Leu Ala Leu Ala Thr Gly Leu 15 10 15
Val Gly Gly Glu Thr Arg Ile Ile Lys Gly Phe Glu Cys Lys Pro 20 25 30
His Ser Gin Pro Trp Gin Ala Ala Leu Phe Glu Lys Thr Arg Leu 35 40 45
Leu Cys Gly Ala Thr Leu Ile Ala Pro Arg Trp Leu Leu Thr Ala 50 55 60
Ala His Cys Leu Lys Pro Arg Tyr Ile Val His Leu Gly Gin His 65- 70 75
Asn Leu Gin Lys Glu Glu Gly Cys Glu Gin Thr Arg Thr Ala Thr 80 85 90
Glu Ser Phe Pro His Pro Gly Phe Asn Asn Ser Leu Pro Asn Lys 95 100 105
Asp His Arg Asn Asp Ile Met Leu Val Lys Met Ala Ser Pro Val 110 115 120
Ser Ile Thr Trp Ala Val Arg Pro Leu Thr Leu Ser Ser Arg Cys 125 130 135
Val Thr Ala Gly Thr Ser Cys Leu Ile Ser Gly Trp Gly Ser Thr 140 145 150
Ser Ser Pro Gin Leu Arg Leu Pro His Thr Leu Arg Cys Ala Asn 155 160 165
Ile Thr Ile Ile Glu His Gin Lys Cys Glu Asn Ala Tyr Pro Gly
170
648
175
180
Asn Ile Thr Asp Thr Met Val Cys Ala Ser Val Gin Glu Gly Gly 185 190 195
Lys Asp Ser Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Asn 200 205 210
Gin Ser Leu Gin Gly Ile Ile Ser Trp Gly Gin Asp Pro Cys Ala 215 220 225
Ile Thr Arg Lys Pro Gly Val Tyr Thr Lys Val Cys Lys Tyr Val 230 235 240
Asp Trp Ile Gin Glu Thr Met Lys Asn Asn 245 250
<210> 171 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 171 ggctgcggga ctggaagtca tcggg 25
<210> 172 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 172 ctccaggcca tgaggattct gcag 24
<210> 173 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 173 cctctggtct gtaaccag 18
<210> 174 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 174 tctgtgatgt tgccggggta ggcg 24
<210> 175
649
<211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 175 cgtgtagaca ccaggctttc gggtg 25
<210> 176 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 176 cccttgatga tcctggtc 18
<210> 177 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 177
aggccatgag gattctgcag ttaatcctgc ttgctctggc aacagggctt 50 <210> 178 <211> 43 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 178
gagagaccag gatcatcaag gggttcgagt gcaagcctca ctc 43
<210> 179 <211> 907 <212> DNA
<213> Homo sapiens <400> 179
gagcagtgtt ctgctggagc cgatgccaaa aaccatgcat ttcttattca 50 gattcattgt tttcttttat ctgtggggcc tttttactgc tcagagacaa 100 aagaaagagg agagcaccga agaagtgaaa atagaagttt tgcatcgtcc 150 agaaaactgc tctaagacaa gcaagaaggg agacctacta aatgcccatt 2 00 atgacggcta cctggctaaa gacggctcga aattctactg cagccggaca 2 50 caaaatgaag gccaccccaa atggtttgtt cttggtgttg ggcaagtcat 300
aaaaggccta gacattgcta tgacagatat gtgccctgga gaaaagcgaa 350
650
aagtagttat acccccttca tttgcatacg gaaaggaagg ctatgcagaa
400
ggcaagattc caccggatgc tacattgatt tttgagattg aactttatgc
450
tgtgaccaaa ggaccacgga gcattgagac atttaaacaa atagacatgg
500
acaatgacag gcagctctct aaagccgaga taaacctcta cttgcaaagg
550
gaatttgaaa aagatgagaa gccacgtgac aagtcatatc aggatgcagt
600
tttagaagat atttttaaga agaatgacca tgatggtgat ggcttcattt
650
ctcccaagga atacaatgta taccaacacg atgaactata gcatatttgt
700
atttctactt ttttttttta gctatttact gtactttatg tataaaacaa
750
agtcactttt ctccaagttg tatttgctat ttttccccta tgagaagata
800
ttttgatctc cccaatacat tgattttggt ataataaatg tgaggctgtt
850
ttgcaaactt aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
900
aaaaaaa 907
<210> 180 <211> 222 <212> PRT
<213> Homo sapiens <400> 180
Met Pro Lys Thr Met His Phe Leu Phe Arg Phe Ile Val Phe Phe 15 10 15
Tyr Leu Trp Gly Leu Phe Thr Ala Gin Arg Gin Lys Lys Glu Glu 20 25 30
Ser Thr Glu Glu Val Lys Ile Glu Val Leu His Arg Pro Glu Asn 35 40 45
Cys Ser Lys Thr Ser Lys Lys Gly Asp Leu Leu Asn Ala His Tyr 50 55 60
Asp Gly Tyr Leu Ala Lys Asp Gly Ser Lys Phe Tyr Cys Ser Arg 65 70 75
Thr Gin Asn Glu Gly His Pro Lys Trp Phe Val Leu Gly Val Gly 80 85 90
Gin Val Ile Lys Gly Leu Asp Ile Ala Met Thr Asp Met Cys Pro 95 100 105
Gly Glu Lys Arg Lys Val Val Ile Pro Pro Ser Phe Ala Tyr Gly 110 115 120
Lys Glu Gly Tyr Ala Glu Gly Lys Ile Pro Pro Asp Ala Thr Leu 125 130 135
Ile Phe Glu Ile Glu Leu Tyr Ala Val Thr Lys Gly Pro Arg Ser 140 145 150
Ile Glu Thr Phe Lys Gin Ile Asp Met Asp Asn Asp Arg Gin Leu 155 160 165
651
Ser Lys Ala
Asp Glu Lys
Asp Ile Phe
Pro Lys Glu
<210> 181 <211> 22 <212> DNA <213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 181 gtgttctgct ggagccgatg cc 22
<210> 182 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 182
gacatggaca atgacagg 18 <210> 183 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 183 cctttcagga tgtaggag 18
<210> 184 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 184 gatgtctgcc accccaag 18
<210> 185 <211> 27 <212> DNA
<213> Artificial Sequence
Glu Ile Asn Leu Tyr Leu Gin Arg Glu Phe Glu Lys 170 175 180
Pro Arg Asp Lys Se'r Tyr Gin Asp Ala Val Leu Glu 185 190 195
Lys Lys Asn Asp His Asp Gly Asp Gly Phe Ile Ser 200 205 210
Tyr Asn Val Tyr Gin His Asp Glu Leu 215 220
<220>
<223> Synthetic oligonucleotide probe
652
<400> 185 gcatcctgat atgacttgtc acgtggc 27
<210> 186 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 186 tacaagaggg aagaggagtt gcac 24
<210> 187 <211> 52 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 187
gcccattatg acggctacct ggctaaagac ggctcgaaat tctactgcag 50
cc 52
<210> 188 <211> 573 <212> DNA <213> Homo sapiens
<400> 188
cagaaatgca gggaccattg cttcttccag gcctctgctt tctgctgagc 50
ctctttggag ctgtgactca gaaaaccaaa acttcctgtg ctaagtgccc 100
cccaaatgct tcctgtgtca ataacactca ctgcacctgc aaccatggat 150
atacttctgg atctgggcag aaactattca cattcccctt ggagacatgt 200
aacgccaggc atggtggctc gcgcctgtaa tcccagttct ttgggaagcc 250
aaggcaggtg gatcacctga ggtcaggagt ttgagaccag cctggccaac 300
atagtgaaac cccgtgtcta ctaaaaatac aaaaatcagc cgggcgtggt 350
ggtgcatgcc tgcaatccca gttactcggg aggctgaggc aggagaatcg 400
cttgaactca ggaggcagaa gttgcagtga acccagatcc tgccattgca 450
ctccagcatg gatgacagag caagactccg tctcaaaaag aaaagatagt 500
ttcttgtttc atttcgcgac tgccctctca gtgtttcctg ggatcccctc 550
ccaaataaag tacttatatt ctc 573
<210> 189
<211> 74
<212> PRT
<213> Homo sapiens
653
<400> 189
Met Gin Gly Pro Leu Leu Leu Pro Gly Leu Cys Phe Leu Leu Ser 15 10 15
Leu Phe Gly Ala Val Thr Gin Lys Thr Lys Thr Ser Cys Ala Lys 20 25 30
Cys Pro Pro Asn Ala Ser Cys Val Asn Asn Thr His Cys Thr Cys 35 40 45
Asn His Gly Tyr Thr Ser Gly Ser Gly Gin Lys Leu Phe Thr Phe 50 55 60
Pro Leu Glu Thr Cys Asn Ala Arg His Gly Gly Ser Arg Leu 65 70
<210> 190 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 190 agggaccatt gcttcttcca ggcc 24
<210> 191 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 191 cgttacatgt ctccaagggg aatg 24
<210> 192 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 192
cctgtgctaa gtgcccccca aatgcttcct gtgtcaataa cactcactgc 50
<210> 193 <211> 1091 <212> DNA
<213> Homo sapiens <400> 193
caagcaggtc atccccttgg tgaccttcaa agagaagcag agagggcaga 50 ggtggggggc acagggaaag ggtgacctct gagattcccc ttttccccca 100 gactttggaa gtgacccacc atggggctca gcatcttttt gctcctgtgt 150
gttcttgggc tcagccaggc agccacaccg aagattttca atggcactga 200
654
gtgtgggcgt aactcacagc cgtggcaggt ggggctgttt gagggcacca 250
gcctgcgctg cgggggtgtc cttattgacc acaggtgggt cctcacagcg 300
gctcactgca gcggcagcag gtactgggtg cgcctggggg aacacagcct 350
cagccagctc gactggaccg agcagatccg gcacagcggc ttctctgtga 400
cccatcccgg ctacctggga gcctcgacga gccacgagca cgacctccgg 450
ctgctgcggc tgcgcctgcc cgtccgcgta accagcagcg ttcaacccct 500
gcccctgccc aatgactgtg caaccgctgg caccgagtgc cacgtctcag 550
gctggggcat caccaaccac ccacggaacc cattcccgga tctgctccag 600
tgcctcaacc tctccatcgt ctcccatgcc acctgccatg gtgtgtatcc 650
cgggagaatc acgagcaaca tggtgtgtgc aggcggcgtc ccggggcagg 700
atgcctgcca gggtgattct gggggccccc tggtgtgtgg gggagtcctt 750
caaggtctgg tgtcctgggg gtctgtgggg ccctgtggac aagatggcat 800
ccctggagtc tacacctata tttgcaagta tgtggactgg atccggatga 850
tcatgaggaa caactgacct gtttcctcca cctccacccc caccccttaa 900
cttgggtacc cctctggccc tcagagcacc aatatctcct ccatcacttc 950
ccctagctcc actcttgttg gcctgggaac ttcttggaac tttaactcct 1000
gccagccctt ctaagaccca cgagcggggt gagagaagtg tgcaatagtc 1050
tggaataaat ataaatgaag gaggggcaaa aaaaaaaaaa a 1091 <210> 194 <211> 248 <212> PRT
<213> Homo sapiens <400> 194
Met Gly Leu Ser Ile Phe Leu Leu Leu Cys Val Leu Gly Leu Ser 15 10 15
Gin Ala Ala Thr Pro Lys Ile Phe Asn Gly Thr Glu Cys Gly Arg 20 25 30
Asn Ser Gin Pro Trp Gin Val Gly Leu Phe Glu Gly Thr Ser Leu 35 40 45
Arg Cys Gly Gly Val Leu Ile Asp His Arg Trp Val Leu Thr Ala 50 55 60
Ala His Cys Ser Gly Ser Arg Tyr Trp Val Arg Leu Gly Glu His 65 70 75
Ser Leu Ser Gin Leu Asp Trp Thr Glu Gin Ile Arg His Ser Gly 80 85 90
Phe Ser Val Thr His Pro Gly Tyr Leu Gly Ala Ser Thr Ser His 95 100 105
655
Glu His Asp Leu Arg Leu Leu Arg Leu Arg Leu Pro Val Arg Val
110 115 120
Thr Ser Ser Val Gin Pro Leu Pro Leu Pro Asn Asp Cys Ala Thr
125 130 135
Ala Gly Thr Glu Cys His Val Ser Gly Trp Gly Ile Thr Asn His
140 145 150
Pro Arg Asn Pro Phe Pro Asp Leu Leu Gin Cys Leu Asn Leu Ser
155 160 165
Ile Val Ser His Ala Thr Cys His Gly Val Tyr Pro Gly Arg Ile
170 175 180
Thr Ser Asn Met Val Cys Ala Gly Gly Val Pro Gly Gin Asp Ala
185 190 195
Cys Gin Gly Asp Ser Gly Gly Pro Leu Val Cys Gly Gly Val Leu
200 205 210
Gin Gly Leu Val Ser Trp Gly Ser Val Gly Pro Cys Gly Gin Asp
215 220 225
Gly Ile Pro Gly Val Tyr Thr Tyr Ile Cys Lys Tyr Val Asp Trp
230 235 240
Ile Arg Met Ile Met Arg Asn Asn 245
<210> 195 <211> 1485 <212> DNA <213> Homo sapiens
<400> 195
gcggccacac gcagctagcc ggagcccgga ccaggcgcct gtgcctcctc
50
ctcgtccctc gccgcgtccg cgaagcctgg agccggcggg agccccgcgc
100
tcgccatgtc gggcgagctc agcaacaggt tccaaggagg gaaggcgttc
150
ggcttgctca aagcccggca ggagaggagg ctggccgaga tcaaccggga
200
gtttctgtgt gaccagaagt acagtgatga agagaacctt ccagaaaagc
250
tcacagcctt caaagagaag tacatggagt ttgacctgaa caatgaaggc
300
gagattgacc tgatgtcttt aaagaggatg atggagaagc ttggtgtccc
350
caagacccac ctggagatga agaagatgat ctcagaggtg acaggagggg
400
tcagtgacac tatatcctac cgagactttg tgaacatgat gctggggaaa
450
cggtcggctg tcctcaagtt agtcatgatg tttgaaggaa aagccaacga
500
gagcagcccc aagccagttg gcccccctcc agagagagac attgctagcc
550
tgccctgagg accccgcctg gactccccag ccttcccacc ccatacctcc
600
ctcccgatct tgctgccctt cttgacacac tgtgatctct ctctctctca
650
656
tttgtttggt cattgagggt ttgtttgtgt tttcatcaat gtctttgtaa 700
agcacaaatt atctgcctta aaggggctct gggtcgggga atcctgagcc 750
ttgggtcccc tccctctctt cttccctcct tccccgctcc ctgtgcagaa 800
gggctgatat caaaccaaaa actagagggg gcagggccag ggcagggagg 850
cttccagcct gtgttcccct cacttggagg aaccagcact ctccatcctt 900
tcagaaagtc tccaagccaa gttcaggctc actgacctgg ctctgacgag 950
gaccccaggc cactctgaga agaccttgga gtagggacaa ggctgcaggg 1000
cctctttcgg gtttccttgg acagtgccat ggttccagtg ctctggtgtc 1050
acccaggaca cagccactcg gggccccgct gccccagctg atccccactc 1100
attccacacc tcttctcatc ctcagtgatg tgaaggtggg aaggaaagga 1150
gcttggcatt gggagccctt caagaaggta ccagaaggaa ccctccagtc 12 00
ctgctctctg gccacacctg tgcaggcagc tgagaggcag cgtgcagccc 12 50
tactgtccct tactggggca gcagagggct tcggaggcag aagtgaggcc 13 00
tggggtttgg ggggaaaggt cagctcagtg ctgttccacc ttttagggag 1350
gatactgagg ggaccaggat gggagaatga ggagtaaaat gctcacggca 1400
aagtcagcag cactggtaag ccaagactga gaaatacaag gttgcttgtc 1450
tgaccccaat ctgcttgaaa aaaaaaaaaa aaaaa 1485
<210> 196 <211> 150 <212> PRT
<213> Homo sapiens <400> 196
Met Ser Gly Glu Leu Ser Asn Arg Phe Gin Gly Gly Lys Ala Phe 15 10 15
Gly Leu Leu Lys Ala Arg Gin Glu Arg Arg Leu Ala Glu Ile Asn 20 25 30
Arg Glu Phe Leu Cys Asp Gin Lys Tyr Ser Asp Glu Glu Asn Leu 35 40 45
Pro Glu Lys Leu Thr Ala Phe Lys Glu Lys Tyr Met Glu Phe Asp 50 55 60
Leu Asn Asn Glu Gly Glu Ile Asp Leu Met Ser Leu Lys Arg Met 65 70 75
Met Glu Lys Leu Gly Val Pro Lys Thr His Leu Glu Met Lys Lys 80 85 90
Met Ile Ser Glu Val Thr Gly Gly Val Ser Asp Thr Ile Ser Tyr 95 100 105
657
Arg Asp Phe Val Asn Met Met Leu Gly Lys Arg Ser Ala Val Leu 110 115 120
Lys Leu Val Met Met Phe Glu Gly Lys Ala Asn Glu Ser Ser Pro 125 130 135
Lys Pro Val Gly Pro Pro Pro Glu Arg Asp Ile Ala Ser Leu Pro 140 145 150
<210> 197 <211> 4842 <212> DNA
<213> Homo sapiens <400> 197
cgcgctcccc gcgcgcctcc tcgggctcca cgcgtcttgc cccgcagagg
50
cagcctcctc caggagcggg gccctgcaca ccatggcccc cgggtgggca
100
ggggtcggcg ccgccgtgcg cgcccgcctg gcgctggcct tggcgctggc
150
gagcgtcctg agtgggcctc cagccgtcgc ctgccccacc aagtgtacct
200
gctccgctgc cagcgtggac tgccacgggc tgggcctccg cgcggttcct
250
cggggcatcc cccgcaacgc tgagcgcctt gacctggaca gaaataatat
300
caccaggatc accaagatgg acttcgctgg gctcaagaac ctccgagtct
350
tgcatctgga agacaaccag gtcagcgtca tcgagagagg cgccttccag
400
gacctgaagc agctagagcg actgcgcctg aacaagaata agctgcaagt
450
ccttccagaa ttgcttttcc agagcacgcc gaagctcacc agactagatt
500
tgagtgaaaa ccagatccag gggatcccga ggaaggcgtt ccgcggcatc
550
accgatgtga agaacctgca actggacaac aaccacatca gctgcattga
600
agatggagcc ttccgagcgc tgcgcgattt ggagatcctt accctcaaca
650
acaacaacat cagtcgcatc ctggtcacca gcttcaacca catgccgaag
700
atccgaactc tgcgcctcca ctccaaccac ctctactgcg actgccacct
750
ggcctggctc tcggattggc tgcgacagcg acggacagtt ggccagttca
800
cactctgcat ggctcctgtg catttgaggg gcttcaacgt ggcggatgtg
850
cagaagaagg agtacgtgtg cccagccccc cactcggagc ccccatcctg
900
caatgccaac tccatctcct gcccttcgcc ctgcacgtgc agcaataaca
950
tcgtggactg tcgaggaaag ggcttgatgg agattcctgc caacttgccg
1000
gagggcatcg tcgaaatacg cctagaacag aactccatca aagccatccc
1050
tgcaggagcc ttcacccagt acaagaaact gaagcgaata gacatcagca
1100
agaatcagat atcggatatt gctccagatg ccttccaggg cctgaaatca
1150
ctcacatcgc tggtcctgta tgggaacaag atcaccgaga ttgccaaggg
1200
658
actgtttgat gggctggtgt ccctacagct gctcctcctc aatgccaaca
1250
agatcaactg cctgcgggtg aacacgtttc aggacctgca gaacctcaac
1300
ttgctctccc tgtatgacaa caagctgcag accatcagca aggggctctt
1350
cgcccctctg cagtccatcc agacactcca cttagcccaa aacccatttg
1400
tgtgcgactg ccacttgaag tggctggccg actacctcca ggacaacccc
1450
atcgagacaa gcggggcccg ctgcagcagc ccgcgccgac tcgccaacaa
1500
gcgcatcagc cagatcaaga gcaagaagtt ccgctgctca ggctccgagg
1550
attaccgcag caggttcagc agcgagtgct tcatggacct cgtgtgcccc
1600
gagaagtgtc gctgtgaggg cacgattgtg gactgctcca accagaagct
1650
ggtccgcatc ccaagccacc tccctgaata tgtcaccgac ctgcgactga
1700
atgacaatga ggtatctgtt ctggaggcca ctggcatctt caagaagttg
1750
cccaacctgc ggaaaataaa tctgagtaac aataagatca aggaggtgcg
1800
agagggagct ttcgatggag cagccagcgt gcaggagctg atgctgacag
1850
ggaaccagct ggagaccgtg cacgggcgcg tgttccgtgg cctcagtggc
1900
ctcaaaacct tgatgctgag gagtaacttg atcagctgtg tgagtaatga
1950
cacctttgcc ggcctgagtt cggtgagact gctgtccctc tatgacaatc
2000
ggatcaccac catcacccct ggggccttca ccacgcttgt ctccctgtcc
2050
accataaacc tcctgtccaa ccccttcaac tgcaactgcc acctggcctg
2100
gctcggcaag tggttgagga agaggcggat cgtcagtggg aaccctaggt
2150
gccagaagcc caggacttca atttttcctc cctgtgatgg aaggagattc caacgaggag ccatccagga agtagctgcc tgtggccatc agctgagccc
2200 2250
gcgctgcccg gagcagtgca cctgtatgga gacagtggtg cgatgcagca
2300
acaaggggct ccgcgccctc cccagaggca tgcccaagga tgtgaccgag
2350
ctgtacctgg aaggaaacca cctaacagcc gtgcccagag agctgtccgc
2400
cctccgacac ctgacgctta ttgacctgag caacaacagc atcagcatgc
2450
tgaccaatta caccttcagt aacatgtctc acctctccac tctgatcctg
2500
agctacaacc ggctgaggtg catccccgtc cacgccttca acgggctgcg
2550
gtccctgcga gtgctaaccc tccatggcaa tgacatttcc agcgttcctg
2600
aaggctcctt caacgacctc acatctcttt cccatctggc gctgggaacc
2650
aacccactcc actgtgactg cagtcttcgg tggctgtcgg agtgggtgaa
2700
ggcggggtac aaggagcctg gcatcgcccg ctgcagtagc cctgagccca
2750
659
tggctgacag gctcctgctc accaccccaa cccaccgctt ccagtgcaaa
2800
gggccagtgg acatcaacat tgtggccaaa tgcaatgcct gcctctccag
2850
cccgtgcaag aataacggga catgcaccca ggaccctgtg gagctgtacc
2900
gctgtgcctg cccctacagc tacaagggca aggactgcac tgtgcccatc
2950
aacacctgca tccagaaccc ctgtcagcat ggaggcacct gccacctgag
3000
tgacagccac aaggatgggt tcagctgctc ctgccctctg ggctttgagg
3050
ggcagcggtg tgagatcaac ccagatgact gtgaggacaa cgactgcgaa
3100
aacaatgcca cctgcgtgga cgggatcaac aactacgtgt gtatctgtcc
3150
gcctaactac acaggtgagc tatgcgacga ggtgattgac cactgtgtgc
3200
ctgagctgaa cctctgtcag catgaggcca agtgcatccc cctggacaaa
3250
ggattcagct gcgagtgtgt ccctggctac agcgggaagc tctgtgagac
3300
agacaatgat gactgtgtgg cccacaagtg ccgccacggg gcccagtgcg
3350
tggacacaat caatggctac acatgcacct gcccccaggg cttcagtgga
3400
cccttctgtg aacacccccc acccatggtc ctactgcaga ccagcccatg
3450
cgaccagtac gagtgccaga acggggccca gtgcatcgtg gtgcagcagg
3500
agcccacctg ccgctgccca ccaggcttcg ccggccccag atgcgagaag
3550
ctcatcactg tcaacttcgt gggcaaagac tcctacgtgg aactggcctc
3600
cgccaaggtc cgaccccagg ccaacatctc cctgcaggtg gccactgaca
3650
aggacaacgg catccttctc tacaaaggag acaatgaccc cctggcactg
3700
gagctgtacc agggccacgt gcggctggtc tatgacagcc tgagttcccc
3750
tccaaccaca gtgtggagct gtgtacagtg ggtgacgcta tggagacagt aaccagaccc gaatgatggg tgaacctagt cagtttcaca agtggacaaa
3800 3850
ggaactccaa agagcctggg gaagctccag aagcagccag cagtgggcat
3900
caacagcccc ctctaccttg gaggcatccc cacctccacc ggcctctccg
3950
ccttgcgcca gggcacggac cggcctctag gcggcttcca cggatgcatc
4000
catgaggtgc gcatcaacaa cgagctgcag gacttcaagg ccctcccacc
4050
acagtccctg ggggtgtcac caggctgcaa gtcctgcacc gtgtgcaagc
4100
acggcctgtg ccgctccgtg gagaaggaca gcgtggtgtg cgagtgccgc
4150
ccaggctgga ccggcccact ctgcgaccag gaggcccggg acccctgcct
4200
cggccacaga tgccaccatg gaaaatgtgt ggcaactggg acctcataca
4250
tgtgcaagtg tgccgagggc tatggagggg acttgtgtga caacaagaat
4300
gactctgcca atgcctgctc agccttcaag tgtcaccatg ggcagtgcca
4350
660
catctcagac caaggggagc cctactgcct gtgccagccc ggctttagcg 4400
gcgagcactg ccaacaagag aatccgtgcc tgggacaagt agtccgagag 4450
gtgatccgcc gccagaaagg ttatgcatca tgtgccacag cctccaaggt 4500
gcccatcatg gaatgtcgtg ggggctgtgg gccccagtgc tgccagccca 4550
cccgcagcaa gcggcggaaa tacgtcttcc agtgcacgga cggctcctcg 4600
tttgtagaag aggtggagag acacttagag tgcggctgcc tcgcgtgttc 4650
ctaagcccct gcccgcctgc ctgccacctc tcggactcca gcttgatgga 4700
gttgggacag ccatgtggga ccccctggtg attcagcatg aaggaaatga 4750
agctggagag gaaggtaaag aagaagagaa tattaagtat attgtaaaat 4800
aaacaaaaaa tagaacttaa aaaaaaaaaa aaaaaaaaaa aa 4842
<210> 198 <211> 1523 <212> PRT
<213> Homo sapiens <400> 198
Met Ala Pro Gly Trp Ala Gly Val Gly Ala Ala Val Arg Ala Arg 15 10 15
Leu Ala Leu Ala Leu Ala Leu Ala Ser Val Leu Ser Gly Pro Pro 20 25 30
Ala Val Ala Cys Pro Thr Lys Cys Thr Cys Ser Ala Ala Ser Val 35 40 45
Asp Cys His Gly Leu Gly Leu Arg Ala Val Pro Arg Gly Ile Pro 50 55 60
Arg Asn Ala Glu Arg Leu Asp Leu Asp Arg Asn Asn Ile Thr Arg 65 70 75
Ile Thr Lys Met Asp Phe Ala Gly Leu Lys Asn Leu Arg Val Leu 80 85 90
His Leu Glu Asp Asn Gin Val Ser Val Ile Glu Arg Gly Ala Phe 95 100 105
Gin Asp Leu Lys Gin Leu Glu Arg Leu Arg Leu Asn Lys Asn Lys 110 115 120
Leu Gin Val Leu Pro Glu Leu Leu Phe Gin Ser Thr Pro Lys Leu 125 130 135
Thr Arg Leu Asp Leu Ser Glu Asn Gin Ile Gin Gly Ile Pro Arg 140 145 150
Lys Ala Phe Arg Gly Ile Thr Asp Val Lys Asn Leu Gin Leu Asp 155 160 165
Asn Asn His Ile Ser Cys Ile Glu Asp Gly Ala Phe Arg Ala Leu 170 175 180
661
Arg Asp Leu Glu Ile Leu Thr Leu Asn Asn Asn Asn Ile Ser Arg 185 190 195
Ile Leu Val Thr Ser Phe Asn His Met Pro Lys Ile Arg Thr Leu 200 205 210
Arg Leu His Ser Asn His Leu Tyr Cys Asp Cys His Leu Ala Trp 215 220 225
Leu Ser Asp Trp Leu Arg Gin Arg Arg Thr Val Gly Gin Phe Thr 230 235 240
Leu Cys Met Ala Pro Val His Leu Arg Gly Phe Asn Val Ala Asp 245 250 255
Val Gin Lys Lys Glu Tyr Val Cys Pro Ala Pro His Ser Glu Pro 260 265 270
Pro Ser Cys Asn Ala Asn Ser Ile Ser Cys Pro Ser Pro Cys Thr 275 280 285
Cys Ser Asn Asn Ile Val Asp Cys Arg Gly Lys Gly Leu Met Glu 290 295 300
Ile Pro Ala Asn Leu Pro Glu Gly Ile Val Glu Ile Arg Leu Glu 305 310 315
Gin Asn Ser Ile Lys Ala Ile Pro Ala Gly Ala Phe Thr Gin Tyr 320 325 330
Lys Lys Leu Lys Arg Ile Asp Ile Ser Lys Asn Gin Ile Ser Asp 335 340 345
Ile Ala Pro Asp Ala Phe Gin Gly Leu Lys Ser Leu Thr Ser Leu 350 355 360
Val Leu Tyr Gly Asn Lys Ile Thr Glu Ile Ala Lys Gly Leu Phe 365 370 375
Asp Gly Leu Val Ser Leu Gin Leu Leu Leu Leu Asn Ala Asn Lys 380 385 390
Ile Asn Cys Leu Arg Val Asn Thr Phe Gin Asp Leu Gin Asn Leu 395 400 405
Asn Leu Leu Ser Leu Tyr Asp Asn Lys Leu Gin Thr Ile Ser Lys 410 415 420
Gly Leu Phe Ala Pro Leu Gin Ser Ile Gin Thr Leu His Leu Ala 425 430 435
Gin Asn Pro Phe Val Cys Asp Cys His Leu Lys Trp Leu Ala Asp 440 445 450
Tyr Leu Gin Asp Asn Pro Ile Glu Thr Ser Gly Ala Arg Cys Ser 455 460 465
Ser Pro Arg Arg Leu Ala Asn Lys Arg Ile Ser Gin Ile Lys Ser 470 475 480
Lys Lys Phe Arg Cys Ser Gly Ser Glu Asp Tyr Arg Ser Arg Phe
662
485 490 495
Ser Ser Glu Cys Phe Met Asp Leu Val Cys Pro Glu Lys Cys Arg
500 505 510
Cys Glu Gly Thr Ile Val Asp Cys Ser Asn Gin Lys Leu Val Arg
515 520 525
lie Pro Ser His Leu Pro Glu Tyr Val Thr Asp Leu Arg Leu Asn
530 535 540
Asp Asn Glu Val Ser Val Leu Glu Ala. Thr Gly Ile Phe Lys Lys
545 550 555
Leu Pro Asn Leu Arg Lys Ile Asn Leu Ser Asn Asn Lys Ile Lys
560 565 570
Glu Val Arg Glu Gly Ala Phe Asp Gly Ala Ala Ser Val Gin Glu
575 580 585
Leu Met Leu Thr Gly Asn Gin Leu Glu Thr Val His Gly Arg Val
590 595 600
Phe Arg Gly Leu Ser Gly Leu Lys Thr Leu Met Leu Arg Ser Asn
605 610 615
Leu Ile Ser Cys Val Ser Asn Asp Thr Phe Ala Gly Leu Ser Ser
620 625 630
Val Arg Leu Leu Ser Leu Tyr Asp Asn Arg Ile Thr Thr Ile Thr
635 640 645
Pro Gly Ala Phe Thr Thr Leu Val Ser Leu Ser Thr Ile Asn Leu
650 655 660
Leu Ser Asn Pro Phe Asn Cys Asn Cys His Leu Ala Trp Leu Gly
665 670 675
Lys Trp Leu Arg Lys Arg Arg Ile Val Ser Gly Asn Pro Arg Cys
680 685 690
Gin Lys Pro Phe Phe Leu Lys Glu Ile Pro Ile Gin Asp Val Ala
695 700 705
Ile Gin Asp Phe Thr Cys Asp Gly Asn Glu Glu Ser Ser Cys Gin
710 715 720
Leu Ser Pro Arg Cys Pro Glu Gin Cys Thr Cys Met Glu Thr Val
725 730 735
Val Arg Cys Ser Asn Lys Gly Leu Arg Ala Leu Pro Arg Gly Met
740 745 750
Pro Lys Asp Val Thr Glu Leu Tyr Leu Glu Gly Asn His Leu Thr
755 760 765
Ala Val Pro Arg Glu Leu Ser Ala Leu Arg His Leu Thr Leu Ile
770 775 780
Asp Leu Ser Asn Asn Ser Ile Ser Met Leu Thr Asn Tyr Thr Phe
785 790 795
Ser
Asn
Met
Ser
His 800
Leu
Ser
Thr
Leu
Ile 805
Leu
Ser
Tyr
Asn Arg 810
Leu
Arg
Cys
Ile
Pro 815
Val
His
Ala
Phe
Asn 820
Gly
Leu
Arg
Ser Leu 825
Arg
Val
Leu
Thr
Leu 830
His
Gly Asn Asp
Ile 835
Ser
Ser
Val
Pro Glu 840
Gly
Ser
Phe
Asn
Asp 845
Leu
Thr
Ser
Leu
Ser 850
His
Leu
Ala
Leu Gly 855
Thr
Asn
Pro
Leu
His 860
Cys
Asp
Cys
Ser
Leu 865
Arg
Trp
Leu
Ser Glu 870
Trp
Val
Lys
Ala
Gly 875
Tyr
Lys
Glu
Pro
Gly 880
Ile
Ala
Arg
Cys Ser 885
Ser
Pro
Glu
Pro
Met 890
Ala
Asp
Arg
Leu
Leu 895
Leu
Thr
Thr
Pro Thr 900
His
Arg
Phe
Gin
Cys 905
Lys
Gly
Pro
Val
Asp 910
Ile
Asn
Ile
Val Ala 915
Lys
Cys
Asn
Ala
Cys 920
Leu
Ser
Ser
Pro
Cys 925
Lys
Asn Asn Gly Thr 930
Cys
Thr
Gin
Asp
Pro 935
Val
Glu
Leu
Tyr
Arg 940
Cys
Ala
Cys
Pro Tyr 945
Ser
Tyr
Lys
Gly
Lys 950
Asp
Cys
Thr
Val
Pro 955
Ile
Asn
Thr
Cys Ile 960
Gin
Asn
Pro
Cys
Gin 965
His
Gly Gly Thr
Cys 970
His
Leu
Ser
Asp Ser 975
His
Lys
Asp
Gly
Phe 980
Ser
Cys
Ser
Cys
Pro 985
Leu
Gly
Phe
Glu Gly 990
Gin
Arg
Cys
Glu
Ile 995
Asn
Pro
Asp
Asp
Cys 1000
Glu
Asp
Asn
Asp Cys 1005
Glu
Asn
Asn
Ala Thr 1010
Cys
Val
Asp
Gly
Ile 1015
Asn
Asn
Tyr
Val Cys 1020
Ile
Cys
Pro
Pro Asn 1025
Tyr
Thr
Gly Glu
Leu 1030
Cys
Asp
Glu
Val Ile 1035
Asp
His
Cys
Val Pro 1040
Glu
Leu
Asn
Leu
Cys 1045
Gin
His
Glu
Ala Lys 1050
Cys
Ile
Pro
Leu Asp 1055
Lys
Gly
Phe
Ser
Cys 1060
Glu
Cys
Val
Pro Gly 1065
Tyr
Ser
Gly
Lys Leu 1070
Cys
Glu
Thr
Asp
Asn 1075
Asp
Asp
Cys
Val Ala 1080
His
Lys
Cys
Arg His 1085
Gly
Ala
Gin
Cys
Val 1090
Asp
Thr
Ile
Asn Gly 1095
Tyr
Thr
Cys
Thr Cys 1100
Pro
Gin Gly Phe
Ser 1105
Gly
Pro
Phe
Cys Glu 1110
664
His Pro Pro Pro Met Val Leu Leu Gin Thr Ser Pro - Cys Asp Gin 1115 1120 1125
Tyr Glu Cys Gin Asn Gly Ala Gin Cys Ile Val Val Gin Gin Glu 1130 1135 1140
Pro Thr Cys Arg Cys Pro Pro Gly Phe Ala Gly Pro Arg Cys Glu 1145 1150 1155
Lys Leu Ile Thr Val Asn Phe Val Gly Lys Asp Ser Tyr Val Glu 1160 1165 1170
Leu Ala Ser Ala Lys Val Arg Pro Gin Ala Asn Ile Ser Leu Gin 1175 1180 1185
Val Ala Thr Asp Lys Asp Asn Gly Ile Leu Leu Tyr Lys Gly Asp 1190 1195 1200
Asn Asp Pro Leu Ala Leu Glu Leu Tyr Gin Gly His Val Arg Leu 1205 1210 1215
Val Tyr Asp Ser Leu Ser Ser Pro Pro Thr Thr Val Tyr Ser Val 1220 1225 1230
Glu Thr Val Asn Asp Gly Gin Phe His Ser Val Glu Leu Val Thr 1235 1240 1245
Leu Asn Gin Thr Leu Asn Leu Val Val Asp Lys Gly Thr Pro Lys 1250 1255 1260
Ser Leu Gly Lys Leu Gin Lys Gin Pro Ala Val Gly Ile Asn Ser 1265 1270 1275
Pro Leu Tyr Leu Gly Gly Ile Pro Thr Ser Thr Gly Leu Ser Ala 1280 1285 1290
Leu Arg Gin Gly Thr Asp Arg Pro Leu Gly Gly Phe His Gly Cys 1295 1300 1305
Ile His Glu Val Arg Ile Asn Asn Glu Leu Gin Asp Phe Lys Ala 1310 1315 1320
Leu Pro Pro Gin Ser Leu Gly Val Ser Pro Gly Cys Lys Ser Cys 1325 1330 1335
Thr Val Cys Lys His Gly Leu Cys Arg Ser Val Glu Lys Asp Ser 1340 1345 1350
Val Val Cys Glu Cys Arg Pro Gly Trp Thr Gly Pro Leu Cys Asp 1355 1360 1365
Gin Glu Ala Arg Asp Pro Cys Leu Gly His Arg Cys His His Gly 1370 1375 1380
Lys Cys Val Ala Thr Gly Thr Ser Tyr Met Cys Lys Cys Ala Glu 1385 1390 1395
Gly Tyr Gly Gly Asp Leu Cys Asp Asn Lys Asn Asp Ser Ala Asn 1400 1405 1410
Ala Cys Ser Ala Phe Lys Cys His His Gly Gin Cys His Ile Ser
1415
665
1420
1425
Asp Gin Gly Glu Pro 1430
Glu His Cys Gin Gin 1445
Glu Val Ile Arg Arg 1460
Ser Lys Val Pro Ile 1475
Cys Cys Gin Pro Thr 1490
Cys Thr Asp Gly Ser 1505
Glu Cys Gly Cys Leu 1520
<210> 199 <211> 24 <212> DNA <213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 199 atggagattc ctgccaactt gccg 24
<210> 200 <211> 24 <212> DNA
<213> Artificial Service <220>
<221> Artificial Sequence <222> full
<223> Synthetic oligonucleotide probe
<400> 200 ttgttggcat tgaggaggag cagc 24
<210> 201 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 201
gagggcatcg tcgaaatacg cctagaacag aactccatca aagccatccc 50
<210> 202 <211> 753 <212> DNA <213> Homo sapiens
Tyr Cys Leu Cys Gin Pro Gly Phe Ser Gly 1435 1440
Glu Asn Pro Cys Leu Gly Gin Val Val Arg 1450 1455
Gin Lys Gly Tyr Ala Ser Cys Ala Thr Ala 1465 1470
Met Glu Cys Arg Gly Gly Cys Gly Pro Gin 1480 1485
Arg Ser Lys Arg Arg Lys Tyr Val Phe Gin 1495 1500
Ser Phe Val Glu Glu Val Glu Arg His Leu 1510 1515
Ala Cys Ser
666
<400> 202
ggatgcagga cgctcccctg agctgcctgt caccgactag gtggagcagt
50
gtttcttccg cagactcaac tgagaagtca gcctctgggg caggcaccag
100
gaatctgcct tttcagttct gtctccggca ggctttgagg atgaaggctg
150
cgggcattct gaccctcatt ggctgcctgg tcacaggcgc cgagtccaaa
200
atctacactc gttgcaaact ggcaaaaata ttetcgaggg ctggcctgga
250
caattactgg ggcttcagcc ttggaaactg gatctgcatg gcatattatg
300
agagcggcta caacaccaca gccccgacgg tcctggatga cggcagcatc
350
gactatggca tcttccagat caacagcttc gcgtggtgca gacgcggaaa
400
gctgaaggag aacaaccact gccatgtcgc ctgctcagcc ttgatcactg
450
atgacctcac agatgcaatt atctgtgcca ggaaaattgt taaagagaca
500
caaggaatga actattggca aggctggaag aaacattgtg agggcagaga
550
cctgtccgag tggaaaaaag gctgtgaggt ttcctaaact ggaactggac
600
ccaggatgct ttgcagcaac gccctaggat ttgcagtgaa tgtccaaatg
650
cctgtgtcat cttgtcccgt ttcctcccaa tattccttct caaacttgga
700
gagggaaaat taagctatac ttttaagaaa ataaatattt ccatttaaat
750
gtc 753
<210> 203 <211> 148 <212> PRT <213> Homo sapiens
<400> 203
Met Lys Ala Ala Gly Ile Leu Thr Leu Ile Gly Cys Leu Val Thr
10 15
Gly Ala Glu Ser Lys Ile Tyr Thr Arg Cys Lys Leu Ala Lys Ile
25 30
Phe Ser Arg Ala Gly Leu Asp Asn Tyr Trp Gly Phe Ser Leu Gly
40 45
Asn Trp Ile Cys Met Ala Tyr Tyr Glu Ser Gly Tyr Asn Thr Thr
50 55 60
Ala Pro Thr Val Leu Asp Asp Gly Ser Ile Asp Tyr Gly Ile Phe
65 70 75
Gin Ile Asn Ser Phe Ala Trp Cys Arg Arg Gly Lys Leu Lys Glu
80 85 90
Asn Asn His Cys His Val Ala Cys Ser Ala Leu Ile Thr Asp Asp
95 100 105
Leu Thr Asp Ala Ile Ile Cys Ala Arg Lys Ile Val Lys Glu Thr
110 115 120
667
Gin Gly Met Asn Tyr Trp Gin Gly Trp Lys Lys His Cys Glu Gly 125 130 135
Arg Asp Leu Ser Glu Trp Lys Lys Gly Cys Glu Val Ser 140 145
<210> 204 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 204 gcaggctttg aggatgaagg ctgc 24
<210> 205 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 205 ctcattggct gcctggtcac aggc 24
<210> 206 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 206 ccagtcggac aggtctctcc cctc 24
<210> 207 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 207 tcagtgacca aggctgagca ggcg 24
<210> 208 <211> 47 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 208
ctacactcgt tgcaaactgg caaaaatatt ctcgagggct ggcctgg 47
668
<210> 209
<211> 1648
<212> DNA
<213> Homo sapiens
<400> 209
caggccattt gcatcccact gtccttgtgt tcggagccag gccacaccgt
50
cctcagcagt gtcatgtgtt aaaaacgcca agctgaatat atcatgcccc
100
tattaaaact tgtacatggc tccccattgg tttttggaga aaagttcaag
150
ctttttacct tggtgtctgc ctgtatccca gtgttcaggc tggctagacg
200
gcggaagaag atcctatttt actgtcactt cccagatctg cttctcacca
250
agagagattc ttttcttaaa cgactataca gggccccaat tgactggata
300
gaggaataca ccacaggcat ggcagactgc atcttagtca acagccagtt
350
cacagctgct gtttttaagg aaacattcaa gtccctgtct cacatagacc
400
ctgatgtcct ctatccatct ctaaatgtca ccagctttga ctcagttgtt
450
cctgaaaagc tggatgacct agtccccaag gggaaaaaat tcctgctgct
500
ctccatcaac agatacgaaa ggaagaaaaa tctgactttg gcactggaag
550
ccctagtaca gctgcgtgga agattgacat cccaagattg ggagagggtt
600
catctgatcg tggcaggtgg ttatgacgag agagtcctgg agaatgtgga
650
acattatcag gaattgaaga aaatggtcca acagtccgac cttggccagt
700
atgtgacctt cttgaggtct ttctcagaca aacagaaaat ctccctcctc
750
cacagctgca cgtgtgtgct ttacacacca agcaatgagc actttggcat
800
tgtccctctg gaagccatgt acatgcagtg cccagtcatt gctgttaatt
850
cgggtggacc cttggagtcc attgaccaca gtgtcacagg gtttctgtgt
900
gagcctgacc cggtgcactt ctcagaagca atagaaaagt tcatccgtga
950
accttcctta aaagccacca tgggcctggc tggaagagcc agagtgaagg
1000
aaaaattttc ccctgaagca tttacagaac agctctaccg atatgttacc
1050
aaactgctgg tataatcaga ttgtttttaa gatctccatt aatgtcattt
1100
ttatggattg tagacccagt tttgaaacca aaaaagaaac ctagaatcta
1150
atgcagaaga gatcttttaa aaaataaact tgagtcttga atgtgagcca
1200
ctttcctata taccacacct ccctgtccac ttttcagaaa aaccatgtct
1250
tttatgctat aatcattcca aattttgcca gtgttaagtt acaaatgtgg
1300
tgtcattcca tgttcagcag agtattttaa ttatattttc tcgggattat
1350
tgctcttctg tctataaatt ttgaatgata ctgtgcctta attggttttc
1400
669
atagtttaag tgtgtatcat tatcaaagtt gattaatttg gcttcatagt 1450
ataatgagag cagggctatt gtagttccca gattcaatcc accgaagtgt 1500
tcactgtcat ctgttaggga atttttgttt gtcctgtctt tgcctggatc 1550
catagcgaga gtgctctgta ttttttttaa gataatttgt atttttgcac 1600
actgagatat aataaaaggt gtttatcata aaaaaaaaaa aaaaaaaa 1648
<210> 210 <211> 323 <212> PRT
<213> Homo sapiens <400> 210
Met Pro Leu Leu Lys Leu Val His Gly Ser Pro Leu Val Phe Gly 1 5 10 15
Glu Lys Phe Lys Leu Phe Thr Leu Val Ser Ala Cys Ile Pro Val 20 25 30
Phe Arg Leu Ala Arg Arg Arg Lys Lys Ile Leu Phe Tyr Cys His 35 40 45
Phe Pro Asp Leu Leu Leu Thr Lys Arg Asp Ser Phe Leu Lys Arg 50 55 60
Leu Tyr Arg Ala Pro Ile Asp Trp Ile Glu Glu Tyr Thr Thr Gly 65 70 75
Met Ala Asp Cys Ile Leu Val Asn Ser Gin Phe Thr Ala Ala Val 80 85 90
Phe Lys Glu Thr Phe Lys Ser Leu Ser His Ile Asp Pro Asp Val 95 100 105
Leu Tyr Pro Ser Leu Asn Val Thr Ser Phe Asp Ser Val Val Pro 110 115 120
Glu Lys Leu Asp Asp Leu Val Pro Lys Gly Lys Lys Phe Leu Leu 125 130 135
Leu Ser Ile Asn Arg Tyr Glu Arg Lys Lys Asn Leu Thr Leu Ala 140 145 150
Leu Glu Ala Leu Val Gin Leu Arg Gly Arg Leu Thr Ser Gin Asp 155 160 165
Trp Glu Arg Val His Leu Ile Val Ala Gly Gly Tyr Asp Glu Arg
170 175 180
Val Leu Glu Asn Val Glu His Tyr Gin Glu Leu Lys Lys Met Val
185 190 195
Gin Gin Ser Asp Leu Gly Gin Tyr Val Thr Phe Leu Arg Ser Phe 200 205 210
Ser Asp Lys Gin Lys Ile Ser Leu Leu His Ser Cys Thr Cys Val 215 220 225
Leu Tyr Thr Pro Ser Asn Glu His Phe Gly Ile Val Pro Leu Glu 230 235 240
670
Ala Met Tyr Met Gin Cys Pro Val Ile Ala Val Asn Ser Gly Gly 245 250 255
Pro Leu Glu Ser Ile Asp His Ser Val Thr Gly Phe Leu Cys Glu 260 265 270
Pro Asp Pro Val His Phe Ser Glu Ala Ile Glu Lys Phe Ile Arg 275 280 285
Glu Pro Ser Leu Lys Ala Thr Met Gly Leu Ala Gly Arg Ala Arg 290 295 300
Val Lys Glu Lys Phe Ser Pro Glu Ala Phe Thr Glu Gin Leu Tyr 305 310 315
Arg Tyr Val Thr Lys Leu Leu Val 320
<210> 211
<211> 1554
<212> DNA
<213> Homo sapiens
<400> 211
gactacgccg atccgagacg tggctccctg ggcggcagaa ccatgttgga
50
cttcgcgatc ttcgccgtta ccttcttgct ggcgttggtg ggagccgtgc
100
tctacctcta tccggcttcc agacaagctg caggaattcc agggattact
150
ccaactgaag aaaaagatgg taatcttcca gatattgtga atagtggaag
200
tttgcatgag ttcctggtta atttgcatga gagatatggg cctgtggtct
250
ccttctggtt tggcaggcgc ctcgtggtta gtttgggcac tgttgatgta
300
ctgaagcagc atatcaatcc caataagaca tcggaccctt ttgaaaccat
350
gctgaagtca ttattaaggt atcaatctgg tggtggcagt gtgagtgaaa
400
accacatgag gaaaaaattg tatgaaaatg gtgtgactga ttctctgaag
450
agtaactttg ccctcctcct aaagctttca gaagaattat tagataaatg
500
gctctcctac ccagagaccc agcacgtgcc cctcagccag catatgcttg
550
gttttgctat gaagtctgtt acacagatgg taatgggtag tacatttgaa
600
gatgatcagg aagtcattcg cttccagaag aatcatggca cagtttggtc
650
tgagattgga ggaaaaaaca aaaggctttc atatgaagat tagatgggtc gccctcatgc acttgataaa aactggagtc aacatgactc tgttttaagg
700 750
aacatcataa aagaacgaaa aggaaggaac ttcagtcaac atattttcat
800
tgactcctta gtacaaggga accttaatga ccaacagatc ctagaagaca
850
gtatgatatt ttctctggcc agttgcataa taactgcaaa attgtgtacc
900
tgggcaatct gttttttaac cacctctgaa gaagttcaaa aaaaattata
950
671
tgaagagata aaccaagttt ttggaaatgg tcctgttact ccagagaaaa
1000
ttgagcagct cagatattgt cagcatgtgc tttgtgaaac tgttcgaact
1050
gccaaactga ctccagtttc tgcccagctt caagatattg aaggaaaaat
1100
tgaccgattt attattccta gagagaccct cgtcctttat gcccttggtg
1150
tggtacttca ggatcctaat acttggccat ctccacacaa gtttgatcca
1200
gatcggtttg atgatgaatt agtaatgaaa actttttcct cacttggatt
1250
ctcaggcaca caggagtgtc cagagttgag gtttgcatat atggtgacca
1300
cagtacttct tagtgtattg gtgaagagac tgcacctact ttctgtggag
1350
ggacaggtta ttgaaacaaa gtatgaactg gtaacatcat caagggaaga
1400
agcttggatc actgtctcaa agagatatta aaattttata catttaaaat
1450
cattgttaaa ttgattgagg aaaacaacca tttaaaaaaa atctatgttg
1500
aatcctttta taaaccagta tcactttgta atataaacac ctatttgtac
1550
ttaa 1554
<210> 212 <211> 462 <212> PRT
<213> Homo sapiens <400> 212
Met Leu Asp Phe Ala Ile Phe Ala Val Thr Phe Leu Leu Ala Leu 15 10 15
Val Gly Ala Val Leu Tyr Leu Tyr Pro Ala Ser Arg Gin Ala Ala 20 25 30
Gly Ile Pro Gly Ile Thr Pro Thr Glu Glu Lys Asp Gly Asn Leu 35 40 45
Pro Asp Ile Val Asn Ser Gly Ser Leu His Glu Phe Leu Val Asn 50 55 60
Leu His Glu Arg Tyr Gly Pro Val Val Ser Phe Trp Phe Gly Arg 65 70 75
Arg Leu Val Val Ser Leu Gly Thr Val Asp Val Leu Lys Gin His 80 85 90
Ile Asn Pro Asn Lys Thr Ser Asp Pro Phe Glu Thr Met Leu Lys 95 100 105
Ser Leu Leu Arg Tyr Gin Ser Gly Gly Gly Ser Val Ser Glu Asn 110 115 120
His Met Arg Lys Lys Leu Tyr Glu Asn Gly Val Thr Asp Ser Leu 125 130 135
Lys Ser Asn Phe Ala Leu Leu Leu Lys Leu Ser Glu Glu Leu Leu 140 145 150
672
Asp Lys Trp Leu Ser Tyr Pro Glu Thr Gin His Val Pro Leu Ser 155 160 165
Gin His Met Leu Gly Phe Ala Met Lys Ser Val Thr Gin Met Val 170 175 180
Met Gly Ser Thr Phe Glu Asp Asp Gin Glu Val Ile Arg Phe Gin 185 190 195
Lys Asn His Gly Thr Val Trp Ser Glu Ile Gly Lys Gly Phe Leu 200 205 210
Asp Gly Ser Leu Asp Lys Asn Met Thr Arg Lys Lys Gin Tyr Glu 215 220 225
Asp Ala Leu Met Gin Leu Glu Ser Val Leu Arg Asn Ile Ile Lys 230 235 240
Glu Arg Lys Gly Arg Asn Phe Ser Gin His Ile Phe Ile Asp Ser 245 250 255
Leu Val Gin Gly Asn Leu Asn Asp Gin Gin Ile Leu Glu Asp Ser 260 265 270
Met Ile Phe Ser Leu Ala Ser Cys Ile Ile Thr Ala Lys Leu Cys 275 280 285
Thr Trp Ala Ile Cys Phe Leu Thr Thr Ser Glu Glu Val Gin Lys 290 295 300
Lys Leu Tyr Glu Glu Ile Asn Gin Val Phe Gly Asn Gly Pro Val 305 310 315
Thr Pro Glu Lys Ile Glu Gin Leu Arg Tyr Cys Gin His Val Leu 320 325 330
Cys Glu Thr Val Arg Thr Ala Lys Leu Thr Pro Val Ser Ala Gin 335 340 345
Leu Gin Asp Ile Glu Gly Lys Ile Asp Arg Phe Ile Ile Pro Arg 350 355 360
Glu Thr Leu Val Leu Tyr Ala Leu Gly Val Val Leu Gin Asp Pro 365 370 ' 375
Asn Thr Trp Pro Ser Pro His Lys Phe Asp Pro Asp Arg Phe Asp 380 385 390
Asp Glu Leu Val Met Lys Thr Phe Ser Ser Leu Gly Phe Ser Gly 395 400 405
Thr Gin Glu Cys Pro Glu Leu Arg Phe Ala Tyr Met Val Thr Thr 410 415 420
Val Leu Leu Ser Val Leu Val Lys Arg Leu His Leu Leu Ser Val 425 430 435
Glu Gly Gin Val Ile Glu Thr Lys Tyr Glu Leu Val Thr Ser Ser 440 445 450
Arg Glu Glu Ala Trp Ile Thr Val Ser Lys Arg Tyr 455 460
673
<210> 213 <211> 759 <212> DNA <213> Homo sapiens
<400> 213
ctagatttgt cggcttgcgg ggagacttca ggagtcgctg tctctgaact
50
tccagcctca gagaccgccg cccttgtccc cgagggccat gggccgggtc
100
tcagggcttg tgccctctcg cttcctgacg ctcctggcgc atctggtggt
150
cgtcatcacc ttattctggt cccgggacag caacatacag gcctgcctgc
200
ctctcacgtt cacccccgag gagtatgaca agcaggacat tcagctggtg
250
gccgcgctct ctgtcaccct gggcctcttt gcagtggagc tggccggttt
300
cctctcagga gtctccatgt tcaacagcac ccagagcctc atctccattg
350
gggctcactg tagtgcatcc gtggccctgt ccttcttcat attcgagcgt
400
tgggagtgca ctacgtattg gtacattttt gtcttctgca gtgcccttcc
450
agctgtcact gaaatggctt tattcgtcac cgtctttggg ctgaaaaaga
500
aacccttctg attaccttca tgacgggaac ctaaggacga agcctacagg
550
ggcaagggcc gcttcgtatt cctggaagaa ggaaggcata ggcttcggtt
600
ttcccctcgg aaactgcttc tgctggagga tatgtgttgg aataattacg
650
tcttgagtct gggattatcc gcattgtatt tagtgctttg taataaaata
700
tgttttgtag taacattaag acttatatac agttttaggg gacaattaaa
750
aaaaaaaaa '
759
<210> 214 <211> 140 <212> PRT
<213> Homo sapiens <400> 214
Met Gly Arg Val Ser Gly Leu Val Pro Ser Arg Phe Leu Thr Leu 1 5 10 15
Leu Ala His Leu Val Val Val Ile Thr Leu Phe Trp Ser Arg Asp 20 25 30
Ser Asn Ile Gin Ala Cys Leu Pro Leu Thr Phe Thr Pro Glu Glu 35 40 45
Tyr Asp Lys Gin Asp Ile Gin Leu Val Ala Ala Leu Ser Val Thr 50 55 60
Leu Gly Leu Phe Ala Val Glu Leu Ala Gly Phe Leu Ser Gly Val 65 70 75
Ser Met Phe Asn Ser Thr Gin Ser Leu Ile Ser Ile Gly Ala His 80 85 90
674
Cys Ser Ala Ser Val Ala Leu Ser Phe Phe Ile Phe Glu Arg Trp 95 100 105
Glu Cys Thr Thr Tyr Trp Tyr Ile Phe Val Phe Cys Ser Ala Leu 110 115 120
Pro Ala Val Thr Glu Met Ala Leu Phe Val Thr Val Phe Gly Leu 125 130 135
Lys Lys Lys Pro Phe 140
<210> 215 <211> 697 <212> DNA
<213> Homo sapiens <400> 215
tcccggaccc tgccgccctg ccactatgtc ccgccgctct atgctgcttg
50
cctgggctct ccccagcctc cttcgactcg gagcggctca ggagacagaa
100
gacccggcct gctgcagccc catagtgccc cggaacgagt ggaaggccct
150
ggcatcagag tgcgcccagc acctgagcct gcccttacgc tatgtggtgg
200
tatcgcacac ggcgggcagc agctgcaaca cccccgcctc gtgccagcag
250
caggcccgga atgtgcagca ctaccacatg aagacactgg gctggtgcga
300
cgtgggctac aacttcctga ttggagaaga cgggctcgta tacgagggcc
350
gtggctggaa cttcacgggt gcccactcag gtcacttatg gaaccccatg
400
tccattggca tcagcttcat gggcaactac atggatcggg tgcccacacc
450
ccaggccatc cgggcagccc agggtctact ggcctgcggt gtggctcagg
500
gagccctgag gtccaactat gtgctcaaag gacaccggga tgtgcagcgt
550
acactctctc caggcaacca gctctaccac ctcatccaga attggccaca
600
ctaccgctcc ccctgaggcc ctgctgatcc gcaccccatt cctcccctcc
650
catggccaaa aaccccactg tctccttctc caataaagat gtagctc 697
<210> 216 <211> 196 <212> PRT <213> Homo sapiens
<400> 216
Met Ser Arg Arg Ser Met Leu Leu Ala Trp Ala Leu Pro Ser Leu 15 10 15
Leu Arg Leu Gly Ala Ala Gin Glu Thr Glu Asp Pro Ala Cys Cys 20 25 30
Ser Pro Ile Val Pro Arg Asn Glu Trp Lys Ala Leu Ala Ser Glu 35 40 45
Cys Ala Gin
His Thr Ala
Gin Ala Arg
Cys Asp Val
Tyr Glu Gly
Leu Trp Asn
Met Asp Arg
Leu Leu Ala
Val Leu Lys
Asn Gin Leu
Pro
<210> 217 <211> 1871 <212> DNA <213> Homo sapiens
<400> 217
ctgggacccc gaaaagagaa ggggagagcg aggggacgag agcggaggag
50
gaagatgcaa ctgactcgct gctgcttcgt gttcctggtg cagggtagcc
100
tctatctggt catctgtggc caggatgatg gtcctcccgg ctcagaggac
150
cctgagcgtg atgaccacga gggccagccc cggccccggg tgcctcggaa
200
gcggggccac atctcaccta agtcccgccc catggccaat tccactctcc
250
tagggctgct ggccccgcct ggggaggctt ggggcattct tgggcagccc
300
cccaaccgcc cgaaccacag ccccccaccc tcagccaagg tgaagaaaat
350
ctttggctgg ggcgacttct actccaacat caagacggtg gccctgaacc
400
tgctcgtcac agggaagatt gtggaccatg gcaatgggac cttcagcgtc
450
cacttccaac acaatgccac aggccaggga aacatctcca tcagcctcgt
500
gccccccagt aaagctgtag agttccacca ggaacagcag atcttcatcg
550
aagccaaggc ctccaaaatc ttcaactgcc ggatggagtg ggagaaggta
600
675
His Leu Ser Leu Pro Leu Arg Tyr Val Val Val Ser 50 55 60
Gly Ser Ser Cys Asn Thr Pro Ala Ser Cys Gin Gin 65 70 75
Asn Val Gin His Tyr His Met Lys Thr Leu Gly Trp 80 85 90
Gly Tyr Asn Phe Leu Ile Gly Glu Asp Gly Leu Val 95 100 105
Arg Gly Trp Asn Phe Thr Gly Ala His Ser Gly His 110 115 120
Pro Met Ser Ile Gly Ile Ser Phe Met Gly Asn Tyr 125 130 135
Val Pro Thr Pro Gin Ala Ile Arg Ala Ala Gin Gly 140 145 150
Cys Gly Val Ala Gin Gly Ala Leu Arg Ser Asn Tyr 155 160 165
Gly His Arg Asp Val Gin Arg Thr Leu Ser Pro Gly 170 175 180
Tyr His Leu Ile Gin Asn Trp Pro His Tyr Arg Ser 185 190 195
676
gaacggggcc gccggacctc gctttgcacc cacgacccag ccaagatctg
650
ctcccgagac cacgctcaga gctcagccac ctggagctgc tcccagccct
700
tcaaagtcgt ctgtgtctac atcgccttct acagcacgga ctatcggctg
750
gtccagaagg tgtgcccaga ttacaactac catagtgata ccccctacta
800
cccatctggg tgacccgggg caggccacag aggccaggcc agggctggaa
850
ggacaggcct gcqcatgcag gagaccatct ggacaccggg cagggaaggg
900
gttgggcctc aggcagggag gggggtggag acgaggagat gccaagtggg
950
gccagggcca agtctcaagt ggcagagaaa gggtcccaag tgctggtccc
1000
aacctgaagc tgtggagtga ctagatcaca ggagcactgg aggaggagtg
1050
ggctctctgt gcagcctcac agggctttgc cacggagcca cagagagatg
1100
ctgggtcccc gaggcctgtg ggcaggccga tcagtgtggc cccagatcaa
1150
gtcatgggag gaagctaagc ccttggttct tgccatcctg aggaaagata
1200
gcaacaggga gggggagatt tcatcagtgt ggacagcctg tcaacttagg
1250
atggatggct gagagggctt cctaggagcc agtcagcagg gtggggtggg
1300
gccagaggag ctctccagcc ctgcctagtg ggcgccctga gccccttgtc
1350
gtgtgctgag catggcatga ggctgaagtg gcaaccctgg ggtctttgat
1400
gtcttgacag attgaccatc tgtctccagc caggccaccc ctttccaaaa
1450
ttccctcttc tgccagtact ccccctgtac cacccattgc tgatggcaca
1500
cccatcctta agctaagaca ggacgattgt ggtcctccca cactaaggcc
1550
acagcccatc cgcgtgctgt gtgtccctct tccaccccaa cccctgctgg
1600
ctcctctggg agcatccatg tcccggagag gggtccctca acagtcagcc
1650
tcacctgtca gaccggggtt ctcccggatc tggatggcgc cgccctctca
1700
gcagcgggca cgggtggggc ggggccgggc cgcagagcat gtgctggatc
1750
tgttctgtgt gtctgtctgt gggtgggggg aggggaggga agtcttgtga
1800
aaccgctgat tgctgacttt tgtgtgaaga atcgtgttct tggagcagga
1850
aataaagctt gccccggggc a 1871
<210> 218 <211> 252 <212> PRT
<213> Homo sapiens <400> 218
Met Gin Leu Thr Arg Cys Cys Phe Val Phe Leu Val Gin Gly Ser 15 10 15
677
Leu Tyr Leu Val Ile Cys Gly Gin Asp Asp Gly Pro Pro Gly Ser 20 25 30
Glu Asp Pro Glu Arg Asp Asp His Glu Gly Gin Pro Arg Pro Arg 35 40 45
Val Pro Arg Lys Arg Gly His Ile Ser Pro Lys Ser Arg Pro Met 50 55 60
Ala Asn Ser Thr Leu Leu Gly Leu Leu Ala Pro Pro Gly Glu Ala 65 70 75
Trp Gly Ile Leu Gly Gin Pro Pro Asn Arg Pro Asn His Ser Pro 80 85 90
Pro Pro Ser Ala Lys Val Lys Lys Ile Phe Gly Trp Gly Asp Phe 95 100 105
Tyr Ser Asn Ile Lys Thr Val Ala Leu Asn Leu Leu Val Thr Gly 110 115 120
Lys Ile Val Asp His Gly Asn Gly Thr Phe Ser Val His Phe Gin 125 130 135
His Asn Ala Thr Gly Gin Gly Asn Ile Ser Ile Ser Leu Val Pro 140 145 150
Pro Ser Lys Ala Val Glu Phe His Gin Glu Gin Gin Ile Phe Ile 155 160 165
Glu Ala Lys Ala Ser Lys Ile Phe Asn Cys Arg Met Glu Trp Glu 170 175 180
Lys Val Glu Arg Gly Arg Arg Thr Ser Leu Cys Thr His Asp Pro 185 190 195
Ala Lys Ile Cys Ser Arg Asp His Ala Gin Ser Ser Ala Thr Trp 200 205 210
Ser Cys Ser Gin Pro Phe Lys Val Val Cys Val Tyr Ile Ala Phe 215 220 225
Tyr Ser Thr Asp Tyr Arg Leu Val Gin Lys Val Cys Pro Asp Tyr 230 235 240
Asn Tyr His Ser Asp Thr Pro Tyr Tyr Pro Ser Gly 245 250
<210> 219
<211> 2065
<212> DNA
<213> Homo sapiens
<400> 219
gtgaatgtga gggtttgatg actttcagat gtctaggaac cagagtgggt 50
gcaggggccc caggcagggc tgattcttgg gcggaggaga gtagggtaaa 100
gggttctgca tgagctcctt aaaggacaaa ggtaacagag ccagcgagag 150
agctcgaggg gagactttga cttcaagcca cagaattggt ggaagtgtgc 200
gcgccgccgc cgccgtcgct cctgcagcgc tgtcgaccta gccgctagca 250
678
tcttcccgag caccgggatc ccggggtagg aggcgacgcg ggcgagcacc
300
agcgccagcc ggctgcggct gcccacacgg ctcaccatgg gctccgggcg
350
ccgggcgctg tccgcggtgc cggccgtgct gctggtcctc acgctgccgg
400
ggctgcccgt ctgggcacag aacgacacgg agcccatcgt gctggagggc
450
aagtgtctgg tggtgtgcga ctcgaacccg gccacggact ccaagggctc
500
ctcttcctcc ccgctgggga tatcggtccg ggcggccaac tccaaggtcg
550
ccttctcggc ggtgcggagc accaaccacg agccatccga gatgagcaac
600
aagacgcgca tcatttactt cgatcagatc ctggtgaatg tgggtaattt
650
tttcacattg gagtctgtct ttgtagcacc aagaaaagga atttacagtt
700
tcagttttca cgtgattaaa gtctaccaga gccaaactat ccaggttaac
750
ttgatgttaa atggaaaacc agtaatatct gcctttgcgg gggacaaaga
800
tgttactcgt gaagctgcca cgaatggtgt cctgctctac ctagataaag
850
aggataaggt ttacctaaaa ctggagaaag gtaatttggt tggaggctgg
900
cagtattcca cgttttctgg ctttctggtg ttccccctat aggattcaat
950
ttctccatga tgttcatcca ggtgagggat gacccactcc tgagttattg
1000
gaagatcatt ttttcatcat tggattgatg tcttttattg gtttctcatg
1050
ggtggatatg gattctaagg attctagcct gtctgaacca atacaaaatt
1100
tcacagatta tttgtgtgtg tctgtttcag tatatttgga ttgggactct
1150
aagcagataa tacctatgct taaatgtaac agtcaaaagc tgtctgcaag
1200
acttattctg aatttcattt cctgggatta ctgaattagt tacagatgtg
1250
gaattttatt tgtttagttt taaaagactg gcaaccaggt ctaaggatta
1300
gaaaactcta aagttctgac ttcaatcaac ggttagtgtg atactgccaa
1350
agaactgtat actgtgttaa tatattgatt atatttgttt ttattccttt
1400
ggaattagtt tgtttggttc ttgtaaaaaa cttggatttt ttttttcagt
1450
aactggtatt atgttttctc ttaaaataag gtaatgaatg gcttgcccac
1500
aaatttacct tgactacgat atcatcgaca tgacttctct caaaaaaaaa
1550
gaatgcttca tagttgtatt ttaattgtat atgtgaaaga gtcatatttt
1600
ccaagttata ttttctaaga agaagaatag atcataaatc tgacaaggaa
1650
aaagttgctt acccaaaatc taagtgctca atccctgagc ctcagcaaaa
1700
cagctcccct aatgattgat ccgagggaaa aataaccact tcttatactt ttattaaaaa tattgctcaa cctaaggttt ctttaattaa tttttttttc
1750 1800
679
cgtagacatg accactttat taactggtgg tgggatgctg ttgtttctaa 1850
ttatacctat ttttcaaggc ttctgttgta tttgaagtat catctggttt 1900
tgccttaact ctttaaattg tatatattta tctgtttagc taatattaaa 1950
ttcaaatatc ccatatctaa atttagtgca atatcttgtc ttttgtatag 2000
gtcatatgaa ttcataaaat tatttatgtc tgttatagaa taaagattaa 2050
tatatgttaa aaaaa 2065
<210> 220 <211> 201 <212> PRT <213> Homo sapiens
<400> 220
Met Gly Ser Gly Arg Arg Ala Leu Ser Ala Val Pro Ala Val Leu 15 10 15
Leu Val Leu Thr Leu Pro Gly Leu Pro Val Trp Ala Gin Asn Asp 20 25 30
Thr Glu Pro Ile Val Leu Glu Gly Lys Cys Leu Val Val Cys Asp 35 40 45
Ser Asn Pro Ala Thr Asp Ser Lys Gly Ser Ser Ser Ser Pro Leu 50 55 60
Gly Ile Ser Val Arg Ala Ala Asn Ser Lys Val Ala Phe Ser Ala 65 70 75
Val Arg Ser Thr Asn His Glu Pro Ser Glu Met Ser Asn Lys Thr 80 85 90
Arg Ile Ile Tyr Phe Asp Gin Ile Leu Val Asn Val Gly Asn Phe 95 100 105
Phe Thr Leu Glu Ser Val Phe Val Ala Pro Arg Lys Gly Ile Tyr 110 115 120
Ser Phe Ser Phe His Val Ile Lys Val Tyr Gin Ser Gin Thr Ile 125 130 135
Gin Val Asn Leu Met Leu Asn Gly Lys Pro Val Ile Ser Ala Phe 140 145 150
Ala Gly Asp Lys Asp Val Thr Arg Glu Ala Ala Thr Asn Gly Val 155 160 165
Leu Leu Tyr Leu Asp Lys Glu Asp Lys Val Tyr Leu Lys Leu Glu 170 175 180
Lys Gly Asn Leu Val Gly Gly Trp Gin Tyr Ser Thr Phe Ser Gly 185 190 195
Phe Leu Val Phe Pro Leu 200
<210> 221 <211> 20
680
<212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 221 acggctcacc atgggctccg 20
<210> 222 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 222 aggaagagga gcccttggag tccg 24
<210> 223 <211> 40 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 223
cgtgctggag ggcaagtgtc tggtggtgtg cgactcgaac 40
<210> 224 <211> 902 <212> DNA <213> Homo sapiens
<400> 224
cggtggccat gactgcggcc gtgttcttcg gctgcgcctt cattgccttc
50
gggcctgcgc tcgcccttta tgtcttcacc atcgccatcg agccgttgcg
100
tatcatcttc ctcatcgccg gagctttctt ctggttggtg tctctactga
150
tttcgtccct tgtttggttc atggcaagag tcattattga caacaaagat
200
ggaccaacac agaaatatct gctgatcttt ggagcgtttg tctctgtcta
250
tatccaagaa atgttccgat ttgcatatta taaactctta aaaaaagcca
300
gtgaaggttt gaagagtata aacccaggtg agacagcacc ctctatgcga
350
ctgctggcct atgtttctgg cttgggcttt ggaatcatga gtggagtatt
400
ttcctttgtg aataccctat ctgactcctt ggggccaggc acagtgggca
450
ttcatggaga ttctcctcaa ttcttccttt attcagcttt catgacgctg
500
gtcattatct tgctgcatgt attctggggc attgtatttt ttgatggctg
550
tgagaagaaa aagtggggca tcctccttat cgttctcctg acccacctgc
600
tggtgtcagc ccagaccttc ataagttctt attatggaat aaacctggcg
650
681
tcagcattta taatcctggt gctcatgggc acctgggcat tcttagctgc 700
gggaggcagc tgccgaagcc tgaaactctg cctgctctgc caagacaaga 750
actttcttct ttacaaccag cgctccagat aacctcaggg aaccagcact 800
tcccaaaccg cagactacat ctttagagga agcacaactg tgcctttttc 850
tgaaaatccc tttttctggt ggaattgaga aagaaataaa actatgcaga 900
ta 902
<210> 225 <211> 257 <212> PRT <213> Homo sapiens
<400> 225
Met Thr Ala Ala Val Phe Phe Gly Cys Ala Phe Ile Ala Phe Gly 1 5 10 15
Pro Ala Leu Ala Leu Tyr Val Phe Thr Ile Ala Ile Glu Pro Leu 20 25 30
Arg Ile Ile Phe Leu Ile Ala Gly Ala Phe Phe Trp Leu Val Ser 35 40 45
Leu Leu Ile Ser Ser Leu Val Trp Phe Met Ala Arg Val Ile Ile 50 55 60
Asp Asn Lys Asp Gly Pro Thr Gin Lys Tyr Leu Leu Ile Phe Gly 65 70 75
Ala Phe Val Ser Val Tyr lie Gin Glu Met Phe Arg Phe Ala Tyr 80 85 90
Tyr Lys Leu Leu Lys Lys Ala Ser Glu Gly Leu Lys Ser Ile Asn 95 100 105
Pro Gly Glu Thr Ala Pro Ser Met Arg Leu Leu Ala Tyr Val Ser 110 115 120
Gly Leu Gly Phe Gly Ile Met Ser Gly Val Phe Ser Phe Val Asn 125 130 _ 135
Thr Leu Ser Asp Ser Leu Gly Pro Gly Thr Val Gly Ile His Gly 140 145 150
Asp Ser Pro Gin Phe Phe Leu Tyr Ser Ala Phe Met Thr Leu Val 155 160 165
Ile Ile Leu Leu His Val Phe Trp Gly Ile Val Phe Phe Asp Gly 170 175 180
Cys Glu Lys Lys Lys Trp Gly Ile Leu Leu Ile Val Leu Leu Thr 185 190 195
His Leu Leu Val Ser Ala Gin Thr Phe Ile Ser Ser Tyr Tyr Gly 200 205 210
Ile Asn Leu Ala Ser Ala Phe Ile Ile Leu Val Leu Met Gly Thr 215 220 225
682
Trp Ala Phe Leu Ala Ala Gly Gly Ser Cys Arg Ser Leu Lys Leu 230 235 240
Cys Leu Leu Cys Gin Asp Lys Asn Phe Leu Leu Tyr Asn Gin Arg 245 250 255
Ser Arg
<210> 226
<211> 3939
<212> DNA
<213> Homo sapiens
<400> 226
cggcaaccag ccgccgccac caccgctgcc actgccgccc tgccggggcc
50
atgttcgctc tgggcttgcc cttcttggtg ctcttggtgg cctcggtcga
100
gagccatctg ggggttctgg ggcccaagaa cgtctcgcag aaagacgccg
150
agtttgagcg cacctacgtg gacgaggtca acagcgagct ggtcaacatc
200
tacaccttca accatactgt gacccgcaac aggacagagg gcgtgcgtgt
250
gtctgtgaac gtcctgaaca agcagaaggg ggcgccgttg ctgtttgtgg
300
tccgccagaa ggaggctgtg gtgtccttcc aggtgcccct aatcctgcga
350
gggatgtttc agcgcaagta cctctaccaa aaagtggaac gaaccctgtg
400
tcagcccccc accaagaatg agtcggagat tcagttcttc tacgtggatg
450
tgtccaccct gtcaccagtc aacaccacat accagctccg ggtcagccgc
500
atggacgatt ttgtgctcag gactggggag cagttcagct tcaataccac
550
agcagcacag ccccagtact tcaagtatga gttccctgaa ggcgtggact
600
cggtaattgt caaggtgacc tccaacaagg ccttcccctg ctcagtcatc
650
tccattcagg atgtgctgtg tcctgtctat gacctggaca acaacgtagc
700
cttcatcggc atgtaccaga cgatgaccaa gaaggcggcc atcaccgtac
750
agcgcaaaga cttccccagc aacagctttt atgtggtggt ggtggtgaag
800
accgaagacc aagcctgcgg gggctccctg cctttctacc ccttcgcaga
850
agatgaaccg gtcgatcaag ggcaccgcca gaaaaccctg tcagtgctgg
900
tgtctcaagc agtcacgtct gaggcatacg tcagtgggat gctcttttgc
950
ctgggtatat ttctctcctt ttacctgctg accgtcctcc tggcctgctg
1000
ggagaactgg aggcagaaga agaagaccct gctggtggcc attgaccgag
1050
cctgcccaga aagcggtcac cctcgagtcc tggctgattc ttttcctggc
1100
agttcccctt atgagggtta caactatggc tcctttgaga atgtttctgg
1150
683
atctaccgat ggtctggttg acagcgctgg cactggggac ctctcttacg
1200
gttaccaggg ccgctccttt gaacctgtag gtactcggcc ccgagtggac
1250
tccatgagct ctgtggagga ggatgactac gacacattga ccgacatcga
1300
ttccgacaag aatgtcattc gcaccaagca atacctctat gtggctgacc
1350
tggcacggaa ggacaagcgt gttctgcgga aaaagtacca gatctacttc
1400
tggaacattg ccaccattgc tgtcttctat gcccttcctg tggtgcagct
1450
ggtgatcacc taccagacgg tggtgaatgt cacagggaat caggacatct
1500
gctactacaa cttcctctgc gcccacccac tgggcaatct cagcgccttc
1550
aacaacatcc tcagcaacct ggggtacatc ctgctggggc tgcttttcct
1600
gctcatcatc ctgcaacggg agatcaacca caaccgggcc ctgctgcgca
1650
atgacctctg tgccctggaa tgtgggatcc ccaaacactt tgggcttttc
1700
tacgccatgg gcacagccct gatgatggag gggctgctca gtgcttgcta
1750
tcatgtgtgc cccaactata ccaatttcca gtttgacaca tcgttcatgt
1800
acatgatcgc cggactctgc atgctgaagc tctaccagaa gcggcacccg
1850
gacatcaacg ccagcgccta cagtgcctac gcctgcctgg ccattgtcat
1900
cttcttctct gtgctgggcg tggtctttgg caaagggaac acggcgttct
1950
ggatcgtctt ctccatcatt cacatcatcg ccaccctgct cctcagcacg
2000
cagctctatt acatgggccg gtggaaactg gactcgggga tcttccgccg
2050
catcctccac gtgctctaca cagactgcat ccggcagtgc agcgggccgc
2100
tctacgtgga ccgcatggtg ctgctggtca tgggcaacgt catcaactgg
2150
tcgctggctg cctatgggct tatcatgcgc cccaatgatt tcgcttccta
2200
cttgttggcc attggcatct gcaacctgct cctttacttc gccttctaca
2250
tcatcatgaa gctccggagt ggggagagga tcaagctcat ccccctgctc
2300
tgcatcgttt gcacctccgt ggtctggggc ttcgcgctct tcttcttctt
2350
ccagggactc agcacctggc agaaaacccc tgcagagtcg agggagcaca
2400
accgggactg catcctcctc gacttctttg acgaccacga catctggcac
2450
ttcctctcct ccatcgccat gttcgggtcc ttcctggtgt tgctgacact
2500
ggatgacgac ctggatactg tgcagcggga caagatctat gtcttctagc
2550
aggagctggg cccttcgctt cacctcaagg ggccctgagc tcctttgtgt
2600
catagaccgg tcactctgtc gtgctgtggg gatgagtccc agcaccgctg
2650
cccagcactg gatggcagca ggacagccag gtctagctta ggcttggcct
2700
684
gggacagcca tggggtggca tggaaccttg cagctgccct ctgccgagga
2750
gcaggcctgc ttctcagtgt tcccctggaa tggggccttc cccccagatg catgggcccc ttggccaaat tgtcctttgg tgctgctttc ctctccattt
2800 2850
gtccctttgc aagaggaagg atggaaggga caccctcccc atttcatgcc
2900
ttgcattttg cccgtcctcc tccccacaat gccccagcct gggacctaag
2950
gcctcttttt cctcccatac tcccactcca gggcctagtc tggggcctga
3000
atctctgtcc tgtatcaggg ccccagttct ctttgggctg tccctggctg
3050
ccatcactgc ccattccagt cagccaggat ggatgggggt atgagatttt
3100
gggggttggc cagctggtgc cagacttttg gtgctaaggc ctgcaagggg
3150
cctggggcag tgcgtattct cttccctctg acctgtgctc agggctggct
3200
ctttagcaat gcgctcagcc caatttgaga accgccttct gattcaagag
3250
gctgaattca gaggtcacct cttcatccca tcagctccca gactgatgcc
3300
agcaccagga ctggagggag aagcgcctca ccccttccct tccttctttc
3350
caggccctta gtcttgccaa accccagctg gtggcctttc agtgccattg
3400
acactgccca agaatgtcca ggggcaaagg agggatgata cagagttcag
3450
cccgttctgc ctccacagct gtgggcaccc cagtgcctac cttagaaagg
3500
ggcttcagga agggatgtgc tgtttccctc tacgtgccca gtcctagcct
3550
cgctctagga cccagggctg gcttctaagt ttccgtccag tcttcaggca
3600
agttctgtgt tagtcatgca cacacatacc tatgaaacct tggagtttac
3650
aaagaattgc cccagctctg ggcaccctgg ccaccctggt ccttggatcc
3700
ccttcgtccc acctggtcca ccccagatgc tgaggatggg ggagctcagg
3750
cggggcctct gctttgggga tgggaatgtg tttttctccc aaacttgttt
3800
ttatagctct gcttgaaggg ctgggagatg aggtgggtct ggatcttttc
3850
tcagagcgtc tccatgctat ggttgcattt ccgttttcta tgaatgaatt
3900
tgcattcaat aaacaaccag actcaaaaaa aaaaaaaaa 3 939
<210> 227 <211> 832 <212> PRT
<213> Homo sapiens <400> 227
Met Phe Ala Leu Gly Leu Pro Phe Leu Val Leu Leu Val Ala Ser 15 10 15
Val Glu Ser His Leu Gly Val Leu Gly Pro Lys Asn Val Ser Gin 20 25 30
Lys Asp Ala Glu Phe Glu Arg Thr Tyr Val Asp Glu Val Asn Ser
685
40 45
Glu Leu Val Asn Ile Tyr Thr Phe Asn His Thr Val Thr Arg Asn 50 55 60
Arg Thr Glu Gly Val Arg Val Ser Val Asn Val Leu Asn Lys Gin 65 70 75
Lys Gly Ala Pro Leu Leu Phe Val Val Arg Gin Lys Glu Ala Val 80 85 90
Val Ser Phe Gin Val Pro Leu Ile Leu Arg Gly Met Phe Gin Arg 95 100 105
Lys Tyr Leu Tyr Gin Lys Val Glu Arg Thr Leu Cys Gin Pro Pro 110 115 120
Thr Lys Asn Glu Ser Glu Ile Gin Phe Phe Tyr Val Asp Val Ser 125 130 135
Thr Leu Ser Pro Val Asn Thr Thr Tyr Gin Leu Arg Val Ser Arg 140 145 150
Met Asp Asp Phe Val Leu Arg Thr Gly Glu Gin Phe Ser Phe Asn 155 160 165
Thr Thr Ala Ala Gin Pro Gin Tyr Phe Lys Tyr Glu Phe Pro Glu 170 175 180
Gly Val Asp Ser Val Ile Val Lys Val Thr Ser Asn Lys Ala Phe 185 190 195
Pro Cys Ser Val Ile Ser Ile Gin Asp Val Leu Cys Pro Val Tyr 200 205 210
Asp Leu Asp Asn Asn Val Ala Phe Ile Gly Met Tyr Gin Thr Met 215 220 225
Thr Lys Lys Ala Ala Ile Thr Val Gin Arg Lys Asp Phe Pro Ser 230 235 240
Asn Ser Phe Tyr Val Val Val Val Val Lys Thr Glu Asp Gin Ala 245 250 255
Cys Gly Gly Ser Leu Pro Phe Tyr Pro Phe Ala Glu Asp Glu Pro 260 265 270
Val Asp Gin Gly His Arg Gin Lys Thr Leu Ser Val Leu Val Ser 275 280 285
Gin Ala Val Thr Ser Glu Ala Tyr Val Ser Gly Met Leu Phe Cys 290 295 300
Leu Gly Ile Phe Leu Ser Phe Tyr Leu Leu Thr Val Leu Leu Ala 305 310 315
Cys Trp Glu Asn Trp Arg Gin Lys Lys Lys Thr Leu Leu Val Ala 320 325 330
Ile Asp Arg Ala Cys Pro Glu Ser Gly His Pro Arg Val Leu Ala 335 340 345
Asp Ser Phe Pro Gly Ser Ser Pro Tyr Glu Gly Tyr Asn Tyr Gly
350
686
355
360
Ser Phe Glu Asn Val Ser Gly Ser Thr Asp Gly Leu Val Asp Ser
365 370 375
Ala Gly Thr Gly Asp Leu Ser Tyr Gly Tyr Gin Gly Arg Ser Phe
380 385 390
Glu Pro Val Gly Thr Arg Pro Arg Val Asp Ser Met Ser Ser Val
395 400 405
Glu Glu Asp Asp Tyr Asp Thr Leu Thr Asp Ile Asp Ser Asp Lys
410 415 420
Asn Val Ile Arg Thr Lys Gin Tyr Leu Tyr Val Ala Asp Leu Ala
425 430 435
Arg Lys Asp Lys Arg Val Leu Arg Lys Lys Tyr Gin Ile Tyr Phe
440 445 450
Trp Asn Ile Ala Thr Ile Ala Val Phe Tyr Ala Leu Pro Val Val
455 460 465
Gin Leu Val Ile Thr Tyr Gin Thr Val Val Asn Val Thr Gly Asn
470 475 480
Gin Asp Ile Cys Tyr Tyr Asn Phe Leu Cys Ala His Pro Leu Gly
485 490 495
Asn Leu Ser Ala Phe Asn Asn Ile Leu Ser Asn Leu Gly Tyr Ile
500 505 510
Leu Leu Gly Leu Leu Phe Leu Leu Ile Ile Leu Gin Arg Glu Ile
515 520 525
Asn His Asn Arg Ala Leu Leu Arg Asn Asp Leu Cys Ala Leu Glu
530 535 540
Cys Gly Ile Pro Lys His Phe Gly Leu Phe Tyr Ala Met Gly Thr
545 550 555
Ala Leu Met Met Glu Gly Leu Leu Ser Ala Cys Tyr His Val Cys
560 565 570
Pro Asn Tyr Thr Asn Phe Gin Phe Asp Thr Ser Phe Met Tyr Met
575 580 585
Ile Ala Gly Leu Cys Met Leu Lys Leu Tyr Gin Lys Arg His Pro
590 595 600
Asp Ile Asn Ala Ser Ala Tyr Ser Ala Tyr Ala Cys Leu Ala Ile
605 610 615
Val Ile Phe Phe Ser Val Leu Gly Val Val Phe Gly Lys Gly Asn
620 625 630
Thr Ala Phe Trp Ile Val Phe Ser Ile Ile His Ile Ile Ala Thr
635 640 645
Leu Leu Leu Ser Thr Gin Leu Tyr Tyr Met Gly Arg Trp Lys Leu
650 655 660
687
Asp Ser Gly Ile Phe Arg Arg Ile Leu His Val Leu Tyr Thr Asp 665 670 675
Cys Ile Arg Gin Cys Ser Gly Pro Leu Tyr Val Asp Arg Met Val 680 685 690
Leu Leu Val Met Gly Asn Val Ile Asn Trp Ser Leu Ala Ala Tyr 695 700 705
Gly Leu Ile Met Arg Pro Asn Asp Phe Ala Ser Tyr Leu Leu Ala 710 715 720
Ile Gly Ile Cys Asn Leu Leu Leu Tyr Phe Ala Phe Tyr Ile Ile 725 730 " 735
Met Lys Leu Arg Ser Gly Glu Arg Ile Lys Leu Ile Pro Leu Leu 740 745 750
Cys Ile Val Cys Thr Ser Val Val Trp Gly Phe Ala Leu Phe Phe 755 760 765
Phe Phe Gin Gly Leu Ser Thr Trp Gin Lys Thr Pro Ala Glu Ser 770 775 780
Arg Glu His Asn Arg Asp Cys Ile Leu Leu Asp Phe Phe Asp Asp 785 790 795
His Asp lie Trp His Phe Leu Ser Ser Ile Ala Met Phe Gly Ser 800 805 810
Phe Leu Val Leu Leu Thr Leu Asp Asp Asp Leu Asp Thr Val Gin 815 820 825
Arg Asp Lys Ile Tyr Val Phe 830
<210> 228
<211> 2848
<212> DNA
<213> Homo sapiens
<400> 228
gctcaagtgc cctgccttgc cccacccagc ccagcctggc cagagccccc
50
tggagaagga gctctcttct tgcttggcag ctggaccaag ggagccagtc
100
ttgggcgctg gagggcctgt cctgaccatg gtccctgcct ggctgtggct
150
gctttgtgtc tccgtccccc aggctctccc caaggcccag cctgcagagc
200
tgtctgtgga agttccagaa aactatggtg gaaatttccc tttatacctg
250
accaagttgc cgctgccccg tgagggggct gaaggccaga tcgtgctgtc
300
aggggactca ggcaaggcaa ctgagggccc atttgctatg gatccagatt
350
ctggcttcct gctggtgacc agggccctgg accgagagga gcaggcagag
400
taccagctac aggtcaccct ggagatgcag gatggacatg tcttgtgggg
450
tccacagcct gtgcttgtgc acgtgaagga tgagaatgac caggtgcccc
500
688
atttctctca agccatctac agagctcggc tgagccgggg taccaggcct
550
ggcatcccct tcctcttcct tgaggcttca gaccgggatg agccaggcac
600
agccaactcg gatcttcgat tccacatcct gagccaggct ccagcccagc
650
cttccccaga agccccaagg catgttccag ggagcaccag ctggagcctc ccttgaccac ggctgggggc gccctggaga tctggccctc ggacctacca
700 750
gctgttggta caggtcaagg acatgggtga ccaggcctca ggccaccagg
800
ccactgccac cgtggaagtc tccatcatag agagcacctg ggtgtcccta
850
gagcctatcc acctggcaga gaatctcaaa gtcctatacc cgcaccacat
900
ggcccaggta cactggagtg ggggtgatgt gcactatcac ctggagagcc
950
atcccccggg accctttgaa gtgaatgcag agggaaacct ctacgtgacc
1000
agagagctgg acagagaagc ccaggctgag tacctgctcc aggtgcgggc
1050
tcagaattcc catggcgagg actatgcggc ccctctggag ctgcacgtgc
1100
tggtgatgga tgagaatgac aacgtgccta tctgccctcc ccgtgacccc
1150
acagtcagca tccctgagct cagtccacca ggtactgaag tgactagact
1200
gtcagcagag gatgcagatg cccccggctc ccccaattcc cacgttgtgt
1250
atcagctcct gagccctgag cctgaggatg gggtagaggg gagagccttc
1300
caggtggacc ccacttcagg cagtgtgacg ctgggggtgc tcccactccg
1350
agcaggccag aacatcctgc ttctggtgct ggccatggac ctggcaggcg
1400
cagagggtgg cttcagcagc acgtgtgaag tcgaagtcgc agtcacagat
1450
atcaatgatc acgcccctga gttcatcact tcccagattg ggcctataag
1500
cctccctgag gatgtggagc ccgggactct ggtggccatg ctaacagcca
1550
ttgatgctga cctcgagccc gccttccgcc tcatggattt tgccattgag
1600
aggggagaca cagaagggac ttttggcctg gattgggagc cagactctgg
1650
gcatgttaga ctcagactct gcaagaacct cagttatgag gcagctccaa
1700
gtcatgaggt ggtggtggtg gtgcagagtg tggcgaagct ggtggggcca
1750
ggcccaggcc ctggagccac cgccacggtg actgtgctag tggagagagt
1800
gatgccaccc cccaagttgg accaggagag ctacgaggcc agtgtcccca
1850
tcagtgcccc agccggctct ttcctgctga ccatccagcc ctccgacccc
1900
atcagccgaa ccctcaggtt ctccctagtc aatgactcag agggctggct
1950
ctgcattgag aaattctccg gggaggtgca caccgcccag tccctgcagg
2000
gcgcccagcc tggggacacc tacacggtgc ttgtggaggc ccaggataca
2050
gccctgactc ttgcccctgt gccctcccaa tacctctgca caccccgcca
2100
689
agaccatggc ttgatcgtga gtggacccag caaggacccc gatctggcca
2150
gtgggcacgg tccctacagc ttcacccttg gtcccaaccc cacggtgcaa
2200
cgggattggc gcctccagac tctcaatggt tcccatgcct acctcacctt
2250
ggccctgcat gccacaatgc tgggtggagc ccagatgtgg cacgtgaaca cagctcctgg cataatcccc ttcgagtgat gtggtggtca cgtgtgtcgc
2300 2350
tgcaacgtgg aggggcagtg catgcgcaag gtgggccgca tgaagggcat
2400
gcccacgaag ctgtcggcag tgggcatcct tgtaggcacc ctggtagcaa
2450
taggaatctt cctcatcctc attttcaccc actggaccat gtcaaggaag
2500
aaggacccgg atcaaccagc agacagcgtg cccctgaagg cgactgtctg
2550
aatggcccag gcagctctag ctgggagctt ggcctctggc tccatctgag
2600
tcccctggga gagagcccag cacccaagat ccagcagggg acaggacaga
2650
gtagaagccc ctccatctgc cctggggtgg aggcaccatc accatcacca
2700
ggcatgtctg cagagcctgg acaccaactt tatggactgc ccatgggagt
2750
gctccaaatg tcagggtgtt tgcccaataa taaagcccca gagaactggg
2800
ctgggcccta tgggaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaag 2848
<210> 229 <211> 807 <212> PRT
<213> Homo sapiens <400> 229
Met Val Pro Ala Trp Leu Trp Leu Leu Cys Val Ser Val Pro Gin 1 5 10 15
Ala Leu Pro Lys Ala Gin Pro Ala Glu Leu Ser Val Glu Val Pro 20 25 30
Glu Asn Tyr Gly Gly Asn Phe Pro Leu Tyr Leu Thr Lys Leu Pro 35 40 45
Leu Pro Arg Glu Gly Ala Glu Gly Gin Ile Val Leu Ser Gly Asp 50 55 60
Ser Gly Lys Ala Thr Glu Gly Pro Phe Ala.Met Asp Pro Asp Ser 65 70 75
Gly Phe Leu Leu Val Thr Arg Ala Leu Asp Arg Glu Glu Gin Ala 80 85 90
Glu Tyr Gin Leu Gin Val Thr Leu Glu Met Gin Asp Gly His Val 95 100 105
Leu Trp Gly Pro Gin Pro Val Leu Val His Val Lys Asp Glu Asn 110 115 120
Asp Gin Val Pro His Phe Ser Gin Ala Ile Tyr Arg Ala Arg Leu 125 130 135
690
Ser Arg Gly Thr Arg Pro Gly Ile Pro Phe Leu Phe Leu Glu Ala 140 145 150
Ser Asp Arg Asp Glu Pro Gly Thr Ala Asn Ser Asp Leu Arg Phe 155 160 165
His Ile Leu Ser Gin Ala Pro Ala Gin Pro Ser Pro Asp Met Phe 170 175 180
Gin Leu Glu Pro Arg Leu Gly Ala Leu Ala Leu Ser Pro Lys Gly 185 190 195
Ser Thr Ser Leu Asp His Ala Leu Glu Arg Thr Tyr Gin Leu Leu 200 205 210
Val Gin Val Lys Asp Met Gly Asp Gin Ala Ser Gly His Gin Ala 215 220 225
Thr Ala Thr Val Glu Val Ser Ile Ile Glu Ser Thr Trp Val Ser 230 235 240
Leu Glu Pro Ile His Leu Ala Glu Asn Leu Lys Val Leu Tyr Pro 245 250 255
His His Met Ala Gin Val His Trp Ser Gly Gly Asp Val His Tyr 260 265 270
His Leu Glu Ser His Pro Pro Gly Pro Phe Glu Val Asn Ala Glu 275 280 285
Gly Asn Leu Tyr Val Thr Arg Glu Leu Asp Arg Glu Ala Gin Ala 290 295 300
Glu Tyr Leu Leu Gin Val Arg Ala Gin Asn Ser His Gly Glu Asp 305 310 315
Tyr Ala Ala Pro Leu Glu Leu His Val Leu Val Met Asp Glu Asn 320 325 330
Asp Asn Val Pro Ile Cys Pro Pro Arg Asp Pro Thr Val Ser Ile 335 340 345
Pro Glu Leu Ser Pro Pro Gly Thr Glu Val Thr Arg Leu Ser Ala 350 355 360
Glu Asp Ala Asp Ala Pro Gly Ser Pro Asn Ser His Val Val Tyr 365 370 375
Gin Leu Leu Ser Pro Glu Pro Glu Asp Gly Val Glu Gly Arg Ala 380 385 390
Phe Gin Val Asp Pro Thr Ser Gly Ser Val Thr Leu Gly Val Leu 395 400 405
Pro Leu Arg Ala Gly Gin Asn Ile Leu Leu Leu Val Leu Ala Met 410 415 420
Asp Leu Ala Gly Ala Glu Gly Gly Phe Ser Ser Thr Cys Glu Val 425 430 435
Glu Val Ala Val Thr Asp Ile Asn Asp His Ala Pro Glu Phe Ile 440 445 450
691
Thr Ser Gin Ile Gly Pro Ile Ser Leu Pro Glu Asp Val Glu Pro 455 460 465
Gly Thr Leu Val Ala Met Leu Thr Ala Ile Asp Ala Asp Leu Glu 470 475 480
Pro Ala Phe Arg Leu Met Asp Phe Ala Ile Glu Arg Gly Asp Thr 485 490 495
Glu Gly Thr Phe Gly Leu Asp Trp Glu Pro Asp Ser Gly His Val 500 505 510
Arg Leu Arg Leu Cys Lys Asn Leu Ser Tyr Glu Ala Ala Pro Ser 515 520 525
His Glu Val Val Val Val Val Gin Ser Val Ala Lys Leu Val Gly 530 535 540
Pro Gly Pro Gly Pro Gly Ala Thr Ala Thr Val Thr Val Leu Val 545 550 555
Glu Arg Val Met Pro Pro Pro Lys Leu Asp Gin Glu Ser Tyr Glu 560 565 570
Ala Ser Val Pro Ile Ser Ala Pro Ala Gly Ser Phe Leu Leu Thr 575 580 585
Ile Gin Pro Ser Asp Pro Ile Ser Arg Thr Leu Arg Phe Ser Leu 590 595 600
Val Asn Asp Ser Glu Gly Trp Leu Cys Ile Glu Lys Phe Ser Gly 605 610 615
Glu Val His Thr Ala Gin Ser Leu Gin Gly Ala Gin Pro Gly Asp 620 625 630
Thr Tyr Thr Val Leu Val Glu Ala Gin Asp Thr Ala Leu Thr Leu 635 640 645
Ala Pro Val Pro Ser Gin Tyr Leu Cys Thr Pro Arg Gin Asp His 650 655 660
Gly Leu Ile Val Ser Gly Pro Ser Lys Asp Pro Asp Leu Ala Ser 665 670 675
Gly His Gly Pro Tyr Ser Phe Thr Leu Gly Pro Asn Pro Thr Val 680 685 690
Gin Arg Asp Trp Arg Leu Gin Thr Leu Asn Gly Ser His Ala Tyr 695 700 705
Leu Thr Leu Ala Leu His Trp Val Glu Pro Arg Glu His Ile Ile 710 715 720
Pro Val Val Val Ser His Asn Ala Gin Met Trp Gin Leu Leu Val 725 730 735
Arg Val Ile Val Cys Arg Cys Asn Val Glu Gly Gin Cys Met Arg 740 745 750
Lys Val Gly Arg Met Lys Gly Met Pro Thr Lys Leu Ser Ala Val
692
755 760 765
Gly Ile Leu Val Gly Thr Leu Val Ala Ile Gly Ile Phe Leu Ile 770 775 780
Leu Ile Phe Thr His Trp Thr Met Ser Arg Lys Lys Asp Pro Asp 785 790 795
Gin Pro Ala Asp Ser Val Pro Leu Lys Ala Thr Val 800 805
<210> 230 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 230
cgccttaccg cgcagcccga agattcacta tggtgaaaat cgccttcaat 50
<210> 231
<211> 1257
<212> DNA
<213> Homo sapiens
<400> 231
ggagagaggc gcgcgggtga aaggcgcatt gatgcagcct gcggcggcct
50
cggagcgcgg cggagccaga cgctgaccac gttcctctcc tcggtctcct
100
ccgcctccag ctccgcgctg cccggcagcc gggagccatg cgaccccagg
150
gccccgccgc ctccccgcag cggctccgcg gcctcctgct gctcctgctg
200
ctgcagctgc ccgcgccgtc gagcgcctct gagatcccca aggggaagca
250
aaaggcgcag ctccggcaga gggaggtggt ggacctgtat aatggaatgt
300
gcttacaagg gccagcagga gtgcctggtc gagacgggag ccctggggcc
350
aatgttattc cgggtacacc tgggatccca ggtcgggatg gattcaaagg
400
agaaaagggg gaatgtctga gggaaagctt tgaggagtcc tggacaccca
450
actacaagca gtgttcatgg agttcattga attatggcat agatcttggg
500
aaaattgcgg agtgtacatt tacaaagatg cgttcaaata gtgctctaag
550
agttttgttc agtggctcac ttcggctaaa atgcagaaat gcatgctgtc
600
agcgttggta tttcacattc aatggagctg aatgttcagg acctcttccc
650
attgaagcta taatttattt ggaccaagga agccctgaaa tgaattcaac
700
aattaatatt catcgcactt cttctgtgga aggactttgt gaaggaattg
750
gtgctggatt agtggatgtt gctatctggg ttggcacttg ttcagattac
800
ccaaaaggag atgcttctac tggatggaat tcagtttctc gcatcattat
850
693
tgaagaacta ccaaaataaa tgctttaatt ttcatttgct acctcttttt 900
ttattatgcc ttggaatggt tcacttaaat gacattttaa ataagtttat 950
gtatacatct gaatgaaaag caaagctaaa tatgtttaca gaccaaagtg 1000
tgatttcaca ctgtttttaa atctagcatt attcattttg cttcaatcaa 1050
aagtggtttc aatatttttt ttagttggtt agaatacttt cttcatagtc 1100
acattctctc aacctataat ttggaatatt gttgtggtct tttgtttttt 1150
ctcttagtat agcattttta aaaaaatata aaagctacca atctttgtac 12 00
aatttgtaaa tgttaagaat tttttttata tctgttaaat aaaaattatt 12 50
tccaaca 1257
<210> 232 <211> 243 <212> PRT
<213> Homo sapiens <400> 232
Met Arg Pro Gin Gly Pro Ala Ala Ser Pro Gin Arg Leu Arg Gly 15 10 15
Leu Leu Leu Leu Leu Leu Leu Gin Leu Pro Ala Pro Ser Ser Ala 20 25 30
Ser Glu Ile Pro Lys Gly Lys Gin Lys Ala Gin Leu Arg Gin Arg 35 40 45
Glu Val Val Asp Leu Tyr Asn Gly Met Cys Leu Gin Gly Pro Ala 50 55 60
Gly Val Pro Gly Arg Asp Gly Ser Pro Gly Ala Asn Val Ile Pro 65 70 75
Gly Thr Pro Gly Ile Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys 80 85 90
Gly Glu Cys Leu Arg Glu Ser Phe Glu Glu Ser Trp Thr Pro Asn 95 100 105
Tyr Lys Gin Cys Ser Trp Ser Ser Leu Asn Tyr Gly Ile Asp Leu 110 115 120
Gly Lys Ile Ala Glu Cys Thr Phe Thr Lys Met Arg Ser Asn Ser 125 130 135
Ala Leu Arg Val Leu Phe Ser Gly Ser Leu Arg Leu Lys Cys Arg 140 145 150
Asn Ala Cys Cys Gin Arg Trp Tyr Phe Thr Phe Asn Gly Ala Glu 155 160 165
Cys Ser Gly Pro Leu Pro Ile Glu Ala Ile Ile Tyr Leu Asp Gin 170 175 180
Gly Ser Pro Glu Met Asn Ser Thr Ile Asn Ile His Arg Thr Ser 185 190 195
694
Ser Val Glu Gly Leu Cys Glu Gly Ile Gly Ala Gly Leu Val Asp 200 205 210
Val Ala Ile Trp Val Gly Thr Cys Ser Asp Tyr Pro Lys Gly Asp 215 220 225
Ala Ser Thr Gly Trp Asn Ser Val Ser Arg Ile Ile Ile Glu Glu 230 235 240
Leu Pro Lys
<210> 233
<211> 2786
<212> DNA
<213> Homo sapiens
<400> 233
ccggggacat gaggtggata ctgttcattg gggcccttat tgggtccagc
50
atctgtggcc aagaaaaatt ttttggggac caagttttga ggattaatgt
100
cagaaatgga gacgagatca gcaaattgag tcaactagtg aattcaaaca
150
acttgaagct caatttctgg aaatctccct cctccttcaa tcggcctgtg
200
gatgtcctgg tcccatctgt cagtctgcag gcatttaaat ccttcctgag
250
atcccagggc ttagagtacg cagtgacaat tgaggacctg caggcccttt
300
tagacaatga agatgatgaa atgcaacaca atgaagggca agaacggagc
350
agtaataact tcaactacgg ggcttaccat tccctggaag ctatttacca
400
cgagatggac aacattgccg cagactttcc tgacctggcg aggagggtga
450
agattggaca ttcgtttgaa aaccggccga tgtatgtact gaagttcagc
500
actgggaaag gcgtgaggcg gccggccgtt tggctgaatg caggcatcca
550
ttcccgagag tggatctccc aggccactgc aatctggacg gcaaggaaga
600
ttgtatctga ttaccagagg gatccagcta tcacctccat cttggagaaa
650
atggatattt tcttgttgcc tgtggccaat cctgatggat atgtgtatac
700
tcaaactcaa aaccgattat ggaggaagac gcggtcccga aatcctggaa
750
gctcctgcat tggtgctgac ccaaatagaa actggaacgc tagttttgca
800
ggaaagggag ccagcgacaa cccttgctcc gaagtgtacc atggacccca
850
cgccaattcg gaagtggagg tgaaatcagt ggtagatttc atccaaaaac
900
atgggaattt caagggcttc atcgacctgc acagctactc gcagctgctg
950
atgtatccat atgggtactc agtcaaaaag gccccagatg ccgaggaact
1000
cgacaaggtg gcgaggcttg cggccaaagc tctggcttct gtgtcgggca
1050
ctgagtacca agtgggtccc acctgcacca ctgtctatcc agctagcggg
1100
695
agcagcatcg actgggcgta tgacaacggc atcaaatttg cattcacatt
1150
tgagttgaga gataccggga cctatggctt cctcctgcca gctaaccaga
1200
tcatccccac tgcagaggag acgtggctgg ggctgaagac catcatggag
1250
catgtgcggg acaacctcta ctaggcgatg gctctgctct gtctacattt
1300
atttgtaccc acacgtgcac gcactgaggc cattgttaaa ggagctcttt
1350
cctacctgtg tgagtcagag ccctctgggt ttgtggagca cacaggcctg
1400
cccctctcca aagaactggt gccagctccc tctgccagcc tggagtcgtg tgctcaattt tgtcctggcg tggtcctgct gtgtccctgc gtttttgatg
1450 1500
agccttttgt ctgtttctcc ttccaccctg ctggctgggc ggctgcactc
1550
agcatcaccc cttcctgggt ggcatgtctc tctctacctc atttttagaa
1600
ccaaagaaca tctgagatga ttctctaccc tcatccacat ctagccaagc
1650
cagtgacctt gctctggtgg cactgtggga gacaccactt gtctttaggt
1700
gggtctcaaa gatgatgtag aatttccttt aatttctcgc agtcttcctg
1750
gaaaatattt tcctttgagc agcaaatctt gtagggatat cagtgaaggt
1800
ctctccctcc ctcctctcct gttttttttt tttttgagac agagttttgc
1850
tcttgttgcc caggctggag tgtgatggct cgatcttggc tcaccacaac
1900
ctctgcctcc tgggttcaag caattctcct gcctcagcct cttgagtagc
1950
ttggtttata ggcgcatgcc accatgcctg gctaattttg tgtttttagt
2000
agagacaggg tttctccatg ttggtcaggc tggtctcaaa ctcccaacct
2050
caggtgatct gccctccttg gcctcccaga gtgctgggat tacaggtgtg
2100
agccactgtg ccgggcccgt cccctccttt tttaggcctg aatacaaagt
2150
agaagatcac tttccttcac tgtgctgaga atttctagat actacagttc
2200
ttactcctct cttccctttg ttattcagtg tgaccaggat ggcgggaggg
2250
gatctgtgtc actgtaggta ctgtgcccag gaaggctggg tgaagtgacc
2300
atctaaattg caggatggtg aaattatccc catctgtcct aatgggctta
2350
cctcctcttt gccttttgaa ctcacttcaa agatetaggc ctcatcttac
2400
aggtcctaaa tcactcatct ggcctggata atctcactgc cctggcacat
2450
tcccatttgt gctgtggtgt atcctgtgtt tccttgtcct ggtttgtgtg
2500
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtt tgtgtgtgtg tgtctgtcta
2550
ttttgtatcc tggaccacaa gttcctaagt agagcaagaa ttcatcaacc
2600
agctgcctct tgtttcattt cacctcagca cgtaccatct gtccttttgt
2650
696
tgttgttgtt ttgtttttgt ttttttgctt ttaccaaaca tgtctgtaaa 2700
tcttaacctc ctgcctagga tttgtacagc atctggtgtg tgcttataag 2750
ccaataaata ttcaatgtga aaaaaaaaaa aaaaaa 2786
<210> 234 <211> 421 <212> PRT
<213> Homo sapiens <400> 234
Met Arg Trp Ile Leu Phe Ile Gly Ala Leu Ile Gly Ser Ser Ile 15 10 15
Cys Gly Gin Glu Lys Phe Phe Gly Asp Gin Val Leu Arg Ile Asn 20 25 30
Val Arg Asn Gly Asp Glu Ile Ser Lys Leu Ser Gin Leu Val Asn 35 40 45
Ser Asn Asn Leu Lys Leu Asn Phe Trp Lys Ser Pro Ser Ser Phe 50 55 . 60
Asn Arg Pro Val Asp Val Leu Val Pro Ser Val Ser Leu Gin Ala 65 70 75
Phe Lys Ser Phe Leu Arg Ser Gin Gly Leu Glu Tyr Ala Val Thr 80 85 90
Ile Glu Asp Leu Gin Ala Leu Leu Asp Asn Glu Asp Asp Glu Met 95 100 105
Gin His Asn Glu Gly Gin Glu Arg Ser Ser Asn Asn Phe Asn Tyr 110 115 120
Gly Ala Tyr His Ser Leu Glu Ala Ile Tyr His Glu Met Asp Asn 125 130 135
Ile Ala Ala Asp Phe Pro Asp Leu Ala Arg Arg Val Lys Ile Gly 140 145 150
His Ser Phe Glu Asn Arg Pro Met Tyr Val Leu Lys Phe Ser Thr 155 160 165
Gly Lys Gly Val Arg Arg Pro Ala Val Trp Leu Asn Ala Gly Ile 170 175 180
His Ser Arg Glu Trp Ile Ser Gin Ala Thr Ala Ile Trp Thr Ala 185 190 195
Arg Lys Ile Val Ser Asp Tyr Gin Arg Asp Pro Ala Ile Thr Ser 200 205 210
Ile Leu Glu Lys Met Asp Ile Phe Leu Leu Pro Val Ala Asn Pro 215 220 225
Asp Gly Tyr Val' Tyr Thr Gin Thr Gin Asn Arg Leu Trp Arg Lys 230 235 240
Thr Arg Ser Arg Asn Pro Gly Ser Ser Cys Ile Gly Ala Asp Pro 245 250 255
697
Asn Arg Asn Trp Asn Ala Ser Phe Ala Gly Lys Gly Ala Ser Asp 260 265 270
Asn Pro Cys Ser Glu Val Tyr His Gly Pro His Ala Asn Ser Glu 275 280 285
Val Glu Val Lys Ser Val Val Asp Phe Ile Gin Lys His Gly Asn 290 295 300
Phe Lys Gly Phe Ile Asp Leu His Ser Tyr Ser Gin Leu Leu Met 305 310 315
Tyr Pro Tyr Gly Tyr Ser Val Lys Lys Ala Pro Asp Ala Glu Glu 320 325 330
Leu Asp Lys Val Ala Arg Leu Ala Ala Lys Ala Leu Ala Ser Val 335 340 345
Ser Gly Thr Glu Tyr Gin Val Gly Pro Thr Cys Thr Thr Val Tyr 350 355 360
Pro Ala Ser Gly Ser Ser Ile Asp Trp Ala Tyr Asp Asn Gly Ile 365 370 375
Lys Phe Ala Phe Thr Phe Glu Leu Arg Asp Thr Gly Thr Tyr Gly 380 385 390
Phe Leu Leu Pro Ala Asn Gin Ile Ile Pro Thr Ala Glu Glu Thr 395 400 405
Trp Leu Gly Leu Lys Thr Ile Met Glu His Val Arg Asp Asn Leu 410 415 420
Tyr
<210> 235 <211> 1743 <212> DNA
<213> Homo sapiens <400> 235
caaccatgca aggacagggc aggagaagag gaacctgcaa agacatattt
50
tgttccaaaa tggcatctta cctttatgga gtactctttg ctgttggcct
100
ctgtgctcca atctactgtg tgtccccggc caatgccccc agtgcatacc
150
cccgcccttc ctccacaaag agcacccctg cctcacaggt gtattccctc
200
aacaccgact ttgccttccg cctataccgc aggctggttt tggagacccc
250
gagtcagaac atcttcttct cccctgtgag tgtctccact tccctggcca
300
tgctctccct tggggcccac tcagtcacca agacccagat tctccagggc
350
ctgggcttca acctcacaca cacaccagag tctgccatcc accagggctt
400
ccagcacctg gttcactcac tgactgttcc cagcaaagac ctgaccttga
450
agatgggaag tgccctcttc gtcaagaagg agctgcagct gcaggcaaat
500
698
ttcttgggca atgtcaagag gctgtatgaa gcagaagtct tttctacaga
550
tttctccaac ccctccattg cccaggcgag gatcaacagc catgtgaaaa
600
agaagaccca agggaaggtt gtagacataa tccaaggcct tgaccttctg
650
acggccatgg ttctggtgaa tcacattttc tttaaagcca agtgggagaa
700
gccctttcac cttgaatata caagaaagaa cttcccattc ctggtgggcg
750
agcaggtcac tgtgcaagtc cccatgatgc accagaaaga gcagttcgct
800
tttggggtgg atacagagct gaactgcttt gtgctgcaga tggattacaa
850
gggagatgcc gtggccttct ttgtcctccc tagcaagggc aagatgaggc
900
aactggaaca ggccttgtca gccagaacac tgataaagtg gagccactca
950
ctccagaaaa ggtggataga ggtgttcatc cccagatttt ccatttctgc
1000
ctcctacaat ctggaaacca tcctcccgaa gatgggcatc caaaatgcct
1050
ttgacaaaaa tgctgatttt tctggaattg caaagagaga ctccctgcag
1100
gtttctaaag caacccacaa ggctgtgctg gatgtcagtg aagagggcac
1150
tgaggccaca gcagctacca ccaccaagtt catagtccga tcgaaggatg
1200
gtccctctta cttcactgtc tccttcaata ggaccttcct gatgatgatt
1250
acaaataaag ccacagacgg tattctcttt ctagggaaag tggaaaatcc
1300
cactaaatcc taggtgggaa atggcctgtt aactgatggc acattgctaa
1350
tgcacaagaa ataacaaacc acatccctct ttctgttctg agggtgcatt
1400
tgaccccagt ggagctggat tcgctggcag ggatgccact tccaaggctc
1450
aatcaccaaa ccatcaacag ggaccccagt cacaagccaa cacccattaa
1500
ccccagtcag tgcccttttc cacaaattct cccaggtaac tagcttcatg
1550
ggatgttgct gggttaccat atttccattc cttggggctc ccaggaatgg
1600
aaatacgcca acccaggtta ggcacctcta ttgcagaatt acaataacac
1650
attcaataaa actaaaatat gaattcaaaa aaaaaaaaaa aaaaaaaaaa
1700
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa 1743
<210> 236 <211> 417 <212> PRT
<213> Homo sapiens <400> 236
Met Ala Ser Tyr Leu Tyr Gly Val Leu Phe Ala Val Gly Leu Cys 15 10 15
Ala Pro Ile Tyr Cys Val Ser Pro Ala Asn Ala Pro Ser Ala Tyr 20 25 30
699
Pro Arg Pro Ser Ser Thr Lys Ser Thr Pro Ala Ser Gin Val Tyr 35 40 45
Ser Leu Asn Thr Asp Phe Ala Phe Arg Leu Tyr Arg Arg Leu Val 50 55 60
Leu Glu Thr Pro Ser Gin Asn Ile Phe Phe Ser Pro Val Ser Val 65 70 75
Ser Thr Ser Leu Ala Met Leu Ser Leu Gly Ala His Ser Val Thr 80 85 90
Lys Thr Gin Ile Leu Gin Gly Leu Gly Phe Asn Leu Thr His Thr 95 100 105
Pro Glu Ser Ala Ile His Gin Gly Phe Gin His Leu Val His Ser 110 115 120
Leu Thr Val Pro Ser Lys Asp Leu Thr Leu Lys Met Gly Ser Ala 125 130 135
Leu Phe Val Lys Lys Glu Leu Gin Leu Gin Ala Asn Phe Leu Gly 140 145 150
Asn Val Lys Arg Leu Tyr Glu Ala Glu Val Phe Ser Thr Asp Phe 155 160 165
Ser Asn Pro Ser Ile Ala Gin Ala Arg Ile Asn Ser His Val Lys 170 175 180
Lys Lys Thr Gin Gly Lys Val Val Asp Ile Ile Gin Gly Leu Asp 185 190 195
Leu Leu Thr Ala Met Val Leu Val Asn His Ile Phe Phe Lys Ala 200 205 210
Lys Trp Glu Lys Pro Phe His Leu Glu Tyr Thr Arg Lys Asn Phe 215 220 225
Pro Phe Leu Val Gly Glu Gin Val Thr Val Gin Val Pro Met Met 230 235 240
His Gin Lys Glu Gin Phe Ala Phe Gly Val Asp Thr Glu Leu Asn 245 250 255
Cys Phe Val Leu Gin Met Asp Tyr Lys Gly Asp Ala Val Ala Phe 260 265 270
Phe Val Leu Pro Ser Lys Gly Lys Met Arg Gin Leu Glu Gin Ala 275 280 285
Leu Ser Ala Arg Thr Leu Ile Lys Trp Ser His Ser Leu Gin Lys 290 295 300
Arg Trp Ile Glu Val Phe Ile Pro Arg Phe Ser Ile Ser Ala Ser 305 310 315
Tyr Asn Leu Glu Thr Ile Leu Pro Lys Met Gly Ile Gin Asn Ala 320 325 330
Phe Asp Lys Asn Ala Asp Phe Ser Gly Ile Ala Lys Arg Asp Ser 335 340 345
700
Leu Gin Val
Glu Glu Gly
Val Arg Ser
Arg Thr Phe
Leu Phe Leu
<210> 237 <211> 23 <212> DNA <213> Artificial
<220>
<221> misc_feature <222> 1-23
<223> Synthetic construct.
<400> 237 caaccatgca aggacagggc agg 23
<210> 238 <211> 47 <212> DNA <213> Artificial
<220>
<221> misc_feature <222> 1-47
<223> Synthetic construct.
<400> 238
ctttgctgtt ggcctctgtg ctcccaacca tgcaaggaca gggcagg 47
<210> 239 <211> 24 <212> DNA <213> Artificial
<220>
<221> misc_feature <222> 1-24
<223> Synthetic construct.
<400> 239 tgactcgggg tctccaaaac cagc 24
<210> 240 <211> 24 <212> DNA <213> Artificial
<220>
<221> misc_feature
Ser Lys Ala Thr His Lys Ala Val Leu Asp Val Ser
350 355 360
Thr Glu Ala Thr Ala Ala Thr Thr Thr Lys Phe Ile
365 370 375
Lys Asp Gly Pro Ser Tyr Phe Thr Val Ser Phe Asn
380 385 390
Leu Met Met Ile Thr Asn Lys Ala Thr Asp Gly Ile
395 400 405
Gly Lys Val Glu Asn Pro Thr Lys Ser 410 415
701
<222> 1-24
<223> Synthetic construct.
<400> 240 ggtataggcg gaaggcaaag tcgg 24
<210> 241 <211> 48 <212> DNA <213> Artificial
<220>
<221> misc_feature <222> 1-48
<223> Synthetic construct.
<400> 241
ggcatcttac ctttatggag tactctttgc tgttggcctc tgtgctcc 48
<210> 242
<211> 2436
<212> DNA
<213> Homo sapiens
<400> 242
ggctgaccgt gctacattgc ctggaggaag cctaaggaac ccaggcatcc
50
agctgcccac gcctgagtcc aagattcttc ccaggaacac aaacgtagga
100
gacccacgct cctggaagca ccagccttta tctcttcacc ttcaagtccc
150
ctttctcaag aatcctctgt tctttgccct ctaaagtctt ggtacatcta
200
ggacccaggc atcttgcttt ccagccacaa agagacagat gaagatgcag
250
aaaggaaatg ttctccttat gtttggtcta ctattgcatt tagaagctgc
300
aacaaattcc aatgagacta gcacctctgc caacactgga tccagtgtga
350
tctccagtgg agccagcaca gccaccaact ctgggtccag tgtgacctcc
400
agtggggtca gcacagccac catctcaggg tccagcgtga cctccaatgg
450
ggtcagcata gtcaccaact ctgagttcca tacaacctcc agtgggatca
500
gcacagccac caactctgag ttcagcacag cgtccagtgg gatcagcata
550
gccaccaact ctgagtccag cacaacctcc agtggggcca gcacagccac
600
caactctgag tccagcacac cctccagtgg ggccagcaca gtcaccaact
650
ctgggtccag tgtgacctcc agtggagcca gcactgccac caactctgag
700
tccagcacag tgtccagtag ggccagcact gccaccaact ctgagtctag
750
cacactctcc agtggggcca gcacagccac caactctgac tccagcacaa
800
cctccagtgg ggctagcaca gccaccaact ctgagtccag cacaacctcc
850
agtggggcca gcacagccac caactctgag tccagcacag tgtccagtag
900
ggccagcact gccaccaact ctgagtccag cacaacctcc agtggggcca
950
702
gcacagccac caactctgag tccagaacga cctccaatgg ggctggcaca
1000
gccaccaact ctgagtccag cacgacctcc agtggggcca gcacagccac
1050
caactctgac tccagcacag tgtccagtgg ggccagcact gccaccaact
1100
ctgagtccag cacgacctcc agtggggcca gcacagccac caactctgag
1150
tccagcacga cctccagtgg ggctagcaca gccaccaact ctgactccag
1200
cacaacctcc agtggggccg gcacagccac caactctgag tccagcacag
1250
tgtccagtgg gatcagcaca gtcaccaatt ctgagtccag cacaccctcc
1300
agtggggcca acacagccac caactctgag tccagtacga cctccagtgg
1350
ggccaacaca gccaccaact ctgagtccag cacagtgtcc agtggggcca
1400
gcactgccac caactctgag tccagcacaa cctccagtgg ggtcagcaca
1450
gccaccaact ctgagtccag cacaacctcc agtggggcta gcacagccac
1500
caactctgac tccagcacaa cctccagtga ggccagcaca gccaccaact
1550
ctgagtctag cacagtgtcc agtgggatca gcacagtcac caattctgag
1600
tccagcacaa cctccagtgg ggccaacaca gccaccaact ctgggtccag
1650
tgtgacctct gcaggctctg gaacagcagc tctgactgga atgcacacaa
1700
cttcccatag tgcatctact gcagtgagtg aggcaaagcc tggtgggtcc
1750
ctggtgccgt gggaaatctt cctcatcacc ctggtctcgg ttgtggcggc
1800
cgtggggctc tttgctgggc tcttcttctg tgtgagaaac agcctgtccc
1850
tgagaaacac ctttaacaca gctgtctacc accctcatgg cctcaaccat
1900
ggccttggtc caggccctgg agggaatcat ggagcccccc acaggcccag
1950
gtggagtcct aactggttct ggaggagacc agtatcatcg atagccatgg
2000
agatgagcgg gaggaacagc gggccctgag cagccccgga agcaagtgcc
2050
gcattcttca ggaaggaaga gacctgggca cccaagaicct ggtttccttt
2100
cattcatccc aggagacccc tcccagcttt gtttgagatc ctgaaaatct
2150
tgaagaaggt attcctcacc tttcttgcct ttaccagaca ctggaaagag
2200
aatactatat tgctcattta gctaagaaat aaatacatct catctaacac
2250
acacgacaaa gagaagctgt gcttgccccg gggtgggtat ctagctctga
2300
gatgaactca gttataggag aaaacctcca tgctggactc catctggcat
2350
tcaaaatctc cacagtaaaa tccaaagacc tcaaaaaaaa aaaaaaaaaa
2400
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 2436
<210> 243
703
<211> 596 <212> PRT <213> Homo sapiens
<400> 243
Met Lys Met Gin Lys Gly Asn Val Leu Leu Met Phe Gly Leu Leu
10 15
Leu His Leu Glu Ala Ala Thr Asn Ser Asn Glu Thr Ser Thr Ser
25 30
Ala Asn Thr Gly Ser Ser Val Ile Ser Ser Gly Ala Ser Thr Ala
40 45
Thr Asn Ser Gly Ser Ser Val Thr Ser Ser Gly Val Ser Thr Ala
50 55 60
Thr Ile Ser Gly Ser Ser Val Thr Ser Asn Gly Val Ser Ile Val
65 70 75
Thr Asn Ser Glu Phe His Thr Thr Ser Ser Gly Ile Ser Thr Ala
80 85 90
Thr Asn Ser Glu Phe Ser Thr Ala Ser Ser Gly Ile Ser Ile Ala
95 100 105
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala
110 115 120
Thr Asn Ser Glu Ser Ser Thr Pro Ser Ser Gly Ala Ser Thr Val
125 130 135
Thr Asn Ser Gly Ser Ser Val Thr Ser Ser Gly Ala Ser Thr Ala
140 145 150
Thr Asn Ser Glu Ser Ser Thr Val Ser Ser Arg Ala Ser Thr Ala
155 160 165
Thr Asn Ser Glu Ser Ser Thr Leu Ser Ser Gly Ala Ser Thr Ala
170 175 180
Thr Asn Ser Asp Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala
185 190 195
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala
200 205 210
Thr Asn Ser Glu Ser Ser Thr Val Ser Ser Arg Ala Ser Thr Ala
215 220 225
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala
230 235 240
Thr Asn Ser Glu Ser Arg Thr Thr Ser Asn Gly Ala Gly Thr Ala
245 250 255
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala
260 265 270
Thr Asn Ser Asp Ser Ser Thr Val Ser Ser Gly Ala Ser Thr Ala
275 280 285
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala
704
290 295 300
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala 305 310 315
Thr Asn Ser Asp Ser Ser Thr Thr Ser Ser Gly Ala Gly Thr Ala 320 325 330
Thr Asn Ser Glu Ser Ser Thr Val Ser Ser Gly Ile Ser Thr Val 335 340 345
Thr Asn Ser Glu Ser Ser Thr Pro Ser Ser Gly Ala Asn Thr Ala 350 355 360
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Asn Thr Ala 365 370 375
Thr Asn Ser Glu Ser Ser Thr Val Ser Ser Gly Ala Ser Thr Ala 380 385 390
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Val Ser Thr Ala 395 400 405
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Ser Thr Ala 410 415 420
Thr Asn Ser Asp Ser Ser Thr Thr Ser Ser Glu Ala Ser Thr Ala 425 430 435
Thr Asn Ser Glu Ser Ser Thr Val Ser Ser Gly Ile Ser Thr Val 440 445 450
Thr Asn Ser Glu Ser Ser Thr Thr Ser Ser Gly Ala Asn Thr Ala 455 460 465
Thr Asn Ser Gly Ser Ser Val Thr Ser Ala Gly Ser Gly Thr Ala 470 475 480
Ala Leu Thr Gly Met His Thr Thr Ser His Ser Ala Ser Thr Ala 485 490 495
Val Ser Glu Ala Lys Pro Gly Gly Ser Leu Val Pro Trp Glu Ile 500 505 510
Phe Leu Ile Thr Leu Val Ser Val Val Ala Ala Val Gly Leu Phe 515 520 525
Ala Gly Leu Phe Phe Cys Val Arg Asn Ser Leu Ser Leu Arg Asn 530 535 540
Thr Phe Asn Thr Ala Val Tyr His Pro His Gly Leu Asn His Gly 545 550 555
Leu Gly Pro Gly Pro Gly Gly Asn His Gly Ala Pro His Arg Pro 560 565 570
Arg Trp Ser Pro Asn Trp Phe Trp Arg Arg Pro Val Ser Ser Ile 575 580 585
Ala Met Glu Met Ser Gly Arg Asn Ser Gly Pro 590 595
705
<210> 244 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 244 gaagcaccag cctttatctc ttcacc 26
<210> 245 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 245 gtcagagttg gtggctgtgc tagc 24
<210> 246 <211> 48 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 246
ggacccaggc atcttgcttt ccagccacaa agagacagat gaagatgc 48
<210> 247 <211> 957 <212> DNA <213> Homo sapiens
<400> 247
gggagagagg ataaatagca gcgtggcttc cctggctcct ctctgcatcc
50
ttcccgacct tcccagcaat atgcatcttg cacgtctggt cggctcctgc
100
tccctccttc tgctactggg ggccctgtct ggatgggcgg ccagcgatga
150
ccccattgag aaggtcattg aagggatcaa ccgagggctg agcaatgcag
200
agagagaggt gggcaaggcc ctggatggca tcaacagtgg aatcacgcat
250
gccggaaggg aagtggagaa ggttttcaac ggacttagca acatggggag
300
ccacaccggc aaggagttgg acaaaggcgt ccaggggctc aaccacggca
350
tggacaaggt tgcccatgag atcaaccatg gtattggaca agcaggaaag
400
gaagcagaga agcttggcca tggggtcaac aacgctgctg gacaggccgg
450
gaaggaagca gacaaagcgg tccaagggtt ccacactggg gtccaccagg
500
ctgggaagga agcagagaaa cttggccaag gggtcaacca tgctgctgac
550
caggctggaa aggaagtgga gaagcttggc caaggtgccc accatgctgc
600
706
tggccaggcc gggaaggagc tgcagaatgc tcataatggg gtcaaccaag 650
ccagcaagga ggccaaccag ctgctgaatg gcaaccatca aagcggatct 7 00
tccagccatc aaggaggggc cacaaccacg ccgttagcct ctggggcctc 750
agtcaacacg cctttcatca accttcccgc cctgtggagg agcgtcgcca 800
acatcatgcc ctaaactggc atccggcctt gctgggagaa taatgtcgcc 850
gttgtcacat cagctgacat gacctggagg ggttgggggt gggggacagg 900
tttctgaaat ccctgaaggg ggttgtactg ggatttgtga ataaacttga 950
tacacca 957
<210> 248 <211> 247 <212> PRT
<213> Homo sapiens <400> 248
Met His Leu Ala Arg Leu Val Gly Ser Cys Ser Leu Leu Leu Leu 15 10 15
Leu Gly Ala Leu Ser Gly Trp Ala Ala Ser Asp Asp Pro Ile Glu 20 25 30
Lys Val Ile Glu Gly Ile Asn Arg Gly Leu Ser Asn Ala Glu Arg 35 40 45
Glu Val Gly Lys Ala Leu Asp Gly Ile Asn Ser Gly Ile Thr His 50 55 60
Ala Gly Arg Glu Val Glu Lys Val Phe Asn Gly Leu Ser Asn Met 65 70 75
Gly Ser His Thr Gly Lys Glu Leu Asp Lys Gly Val Gin Gly Leu 80 85 90
Asn His Gly Met Asp Lys Val Ala His Glu Ile Asn His Gly Ile 95 100 105
Gly Gin Ala Gly Lys Glu Ala Glu Lys Leu Gly His Gly Val Asn 110 115 120
Asn Ala Ala Gly Gin Ala Gly Lys Glu Ala Asp Lys Ala Val Gin 125 • 130 135
Gly Phe His Thr Gly Val His Gin Ala Gly Lys Glu Ala Glu Lys 140 145 150
Leu Gly Gin Gly Val Asn His Ala Ala Asp Gin Ala Gly Lys Glu 155 160 165
Val Glu Lys Leu Gly Gin Gly Ala His His Ala Ala Gly Gin Ala 170 175 180
Gly Lys Glu Leu Gin Asn Ala His Asn Gly Val Asn Gin Ala Ser 185 190 195
Lys Glu Ala Asn Gin Leu Leu Asn Gly Asn His Gin Ser Gly Ser
200
707
205
210
Ser Ser His Gin Gly Gly Ala Thr Thr Thr Pro Leu Ala Ser Gly 215 220 225
Ala Ser Val Asn Thr Pro Phe Ile Asn Leu Pro Ala Leu Trp Arg 230 235 240
Ser Val Ala Asn Ile Met Pro 245
<210> 249 <211> 23 <212> DNA
<213> Artificial Sequence <22 0>
<223> Synthetic oligonucleotide probe
<400> 249 caatatgcat cttgcacgtc tgg 23
<210> 250 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 250 aagcttctct gcttcctttc ctgc 24
<210> 251 <211> 43 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 251
tgaccccatt gagaaggtca ttgaagggat caaccgaggg ctg 43
<210> 252
<211> 3781
<212> DNA
<213> Homo sapiens
<400> 252
ctccgggtcc ccaggggctg cgccgggccg gcctggcaag ggggacgagt 50 cagtggacac tccaggaaga gcggccccgc ggggggcgat gaccgtgcgc 100 tgaccctgac tcactccagg tccggaggcg ggggcccccg gggcgactcg 150 ggggcggacc gcggggcgga gctgccgccc gtgagtccgg ccgagccacc 200 tgagcccgag ccgcgggaca ccgtcgctcc tgctctccga atgctgcgca 250 ccgcgatggg cctgaggagc tggctcgccg ccccatgggg cgcgctgccg 300
cctcggccac cgctgctgct gctcctgctg ctgctgctcc tgctgcagcc 350
708
gccgcctccg acctgggcgc tcagcccccg gatcagcctg cctctgggct
400
ctgaagagcg gccattcctc agattcgaag ctgaacacat ctccaactac
450
acagcccttc tgctgagcag ggatggcagg accctgtacg tgggtgctcg
500
agaggccctc tttgcactca gtagcaacct cagcttcctg ccaggcgggg
550
agtaccagga gctgctttgg ggtgcagacg cagagaagaa acagcagtgc
600
agcttcaagg gcaaggaccc acagcgcgac tgtcaaaact acatcaagat
650
cctcctgccg ctcagcggca gtcacctgtt cacctgtggc acagcagcct
700
tcagccccat gtgtacctac atcaacatgg agaacttcac cctggcaagg
750
gacgagaagg ggaatgtcct cctggaagat ggcaagggcc gttgtccctt
800
cgacccgaat ttcaagtcca ctgccctggt ggttgatggc gagctctaca
850
ctggaacagt cagcagcttc caagggaatg acccggccat ctcgcggagc
900
caaagccttc gccccaccaa gaccgagagc tccctcaact ggctgcaaga
950
cccagctttt gtggcctcag cctacattcc tgagagcctg ggcagcttgc
1000
aaggcgatga tgacaagatc tactttttct tcagcgagac tggccaggaa
1050
tttgagttct ttgagaacac cattgtgtcc cgcattgccc gcatctgcaa
1100
gggcgatgag ggtggagagc gggtgctaca gcagcgctgg acctccttcc
1150
tcaaggccca gctgctgtgc tcacggcccg acgatggctt ccccttcaac
1200
gtgctgcagg atgtcttcac gctgagcccc agcccccagg actggcgtga
1250
cacccttttc tatggggtct tcacttccca gtggcacagg ggaactacag
1300
aaggctctgc cgtctgtgtc ttcacaatga aggatgtgca gagagtcttc
1350
agcggcctct acaaggaggt gaaccgtgag acacagcagt ggtacaccgt
1400
gacccacccg gtgcccacac cccggcctgg agcgtgcatc accaacagtg
1450
cccgggaaag gaagatcaac tcatccctgc agctcccaga ccgcgtgctg
1500
aacttcctca aggaccactt cctgatggac gggcaggtcc gaagccgcat
1550
gctgctgctg cagccccagg ctcgctacca gcgcgtggct gtacaccgcg
1600
tccctggcct gcaccacacc tacgatgtcc tcttcctggg cactggtgac
1650
ggccggctcc acaaggcagt gagcgtgggc ccccgggtgc acatcattga
1700
ggagctgcag atcttctcat cgggacagcc cgtgcagaat ctgctcctgg
1750
acacccacag ggggctgctg tatgcggcct cacactcggg cgtagtccag
1800
gtgcccatgg ccaactgcag cctgtaccgg agctgtgggg actgcctcct
1850
cgcccgggac ccctactgtg cttggagcgg ctccagctgc aagcacgtca
1900
709
gcctctacca gcctcagctg gccaccaggc cgtggatcca ggacatcgag
1950
ggagccagcg ccaaggacct ttgcagcgcg tcttcggttg tgtccccgtc
2000
ttttgtacca acaggggaga agccatgtga gcaagtccag ttccagccca
2050
acacagtgaa cactttggcc tgcccgctcc tctccaacct ggcgacccga
2100
ctctggctac gcaacggggc ccccgtcaat gcctcggcct cctgccacgt
2150
gctacccact ggggacctgc tgctggtggg cacccaacag ctgggggagt
2200
tccagtgctg gtcactagag gagggcttcc agcagctggt agccagctac
2250
tgcccagagg tggtggagga cggggtggca gaccaaacag atgagggtgg
2300
cagtgtaccc gtcattatca gcacatcgcg tgtgagtgca ccagctggtg
2350
gcaaggccag ctggggtgca gacaggtcct actggaagga gttcctggtg
2400
atgtgcacgc tctttgtgct ggccgtgctg ctcccagttt tattcttgct
2450
ctaccggcac cggaacagca tgaaagtctt cctgaagcag ggggaatgtg
2500
ccagcgtgca ccccaagacc tgccctgtgg tgctgccccc tgagacccgc
2550
ccactcaacg ccagtccctg gcctagggcc tcagacagcc ccctagcacc ccccgggggc ccgctcgatc ccgagtcttc accgagggta actgagtcag
2600 2650
agaagaggcc actcagcatc caagacagct tcgtggaggt atccccagtg
2700
tgcccccggc cccgggtccg ccttggctcg gagatccgtg actctgtggt
2750
gtgagagctg acttccagag gacgctgccc tggcttcagg ggctgtgaat
2800
gctcggagag ggtcaactgg acctcccctc cgctctgctc ttcgtggaac
2850
acgaccgtgg tgcccggccc ttgggagcct tggagccagc tggcctgctg
2900
ctctccagtc aagtagcgaa gctcctacca cccagacacc caaacagccg
2950
tggccccaga ggtcctggcc aaatatgggg gcctgcctag gttggtggaa
3000
cagtgctcct tatgtaaact gagccctttg tttaaaaaac aattccaaat
3050
gtgaaactag aatgagaggg aagagatagc atggcatgca gcacacacgg
3100
ctgctccagt tcatggcctc ccaggggtgc tggggatgca tccaaagtgg
3150
ttgtctgaga cagagttgga aaccctcacc aactggcctc ttcaccttcc
3200
acattatccc gctgccaccg gctgccctgt ctcactgcag attcaggacc
3250
agcttgggct gcgtgcgttc tgccttgcca gtcagccgag gatgtagttg
3300
ttgctgccgt cgtcccacca cctcagggac cagagggcta ggttggcact
3350
gcggccctca ccaggtcctg ggctcggacc caactcctgg acctttccag
3400
cctgtatcag gctgtggcca cacgagagga cagcgcgagc tcaggagaga
3450
710
tttcgtgaca atgtacgcct ttccctcaga attcagggaa gagactgtcg 3500
cctgccttcc tccgttgttg cgtgagaacc cgtgtgcccc ttcccaccat 3550
atccaccctc gctccatctt tgaactcaaa cacgaggaac taactgcacc 3600
ctggtcctct ccccagtccc cagttcaccc tccatccctc accttcctcc 3650
actctaaggg atatcaacac tgcccagcac aggggccctg aatttatgtg 3700
gtttttatac attttttaat aagatgcact ttatgtcatt ttttaataaa 3750
gtctgaagaa ttactgttta aaaaaaaaaa a 3781
<210> 253 <211> 837 <212> PRT
<213> Homo sapiens <400> 253
Met Leu Arg Thr Ala Met Gly Leu Arg Ser Trp Leu Ala Ala Pro 15 10 15
Trp Gly Ala Leu Pro Pro Arg Pro Pro Leu Leu Leu Leu Leu Leu 20 25 30
Leu Leu Leu Leu Leu Gin Pro Pro Pro Pro Thr Trp Ala Leu Ser 35 40 45
Pro Arg Ile Ser Leu Pro Leu Gly Ser Glu Glu Arg Pro Phe Leu 50 55 60
Arg Phe Glu Ala Glu His Ile Ser Asn Tyr Thr Ala Leu Leu Leu 65 70 75
Ser Arg Asp Gly Arg Thr Leu Tyr Val Gly Ala Arg Glu Ala Leu 80 85 90
Phe Ala Leu Ser Ser Asn Leu Ser Phe Leu Pro Gly Gly Glu Tyr 95 100 105
Gin Glu Leu Leu Trp Gly Ala Asp Ala Glu Lys Lys Gin Gin Cys 110 115 120
Ser Phe Lys Gly Lys Asp Pro Gin Arg Asp Cys Gin Asn Tyr Ile 125 130 135
Lys Ile Leu Leu Pro Leu Ser Gly Ser His Leu Phe Thr Cys Gly 140 145 150
Thr Ala Ala Phe Ser Pro Met Cys Thr Tyr Ile Asn Met Glu Asn 155 160 165
Phe Thr Leu Ala Arg Asp Glu Lys Gly Asn Val Leu Leu Glu Asp 170 175 • 180
Gly Lys Gly Arg Cys Pro Phe Asp Pro Asn Phe Lys Ser Thr Ala 185 190 195
Leu Val Val Asp Gly Glu Leu Tyr Thr Gly Thr Val Ser Ser Phe 200 205 210
711
Gin Gly Asn Asp Pro Ala Ile Ser Arg Ser Gin Ser Leu Arg Pro 215 220 225
Thr Lys Thr Glu Ser Ser Leu Asn Trp Leu Gin Asp Pro Ala Phe 230 235 240
Val Ala Ser Ala Tyr Ile Pro Glu Ser Leu Gly Ser Leu Gin Gly 245 250 255
Asp Asp Asp Lys Ile Tyr Phe Phe Phe Ser Glu Thr Gly Gin Glu 260 265 270
Phe Glu Phe Phe Glu Asn Thr Ile Val Ser Arg Ile Ala Arg Ile 275 280 285
Cys Lys Gly Asp Glu Gly Gly Glu Arg Val Leu Gin Gin Arg Trp 290 295 300
Thr Ser Phe Leu Lys Ala Gin Leu Leu Cys Ser Arg Pro Asp Asp 305 310 315
Gly Phe Pro Phe Asn Val Leu Gin Asp Val Phe Thr Leu Ser Pro 320 325 330
Ser Pro Gin Asp Trp Arg Asp Thr Leu Phe Tyr Gly Val Phe Thr 335 340 345
Ser Gin Trp His Arg Gly Thr Thr Glu Gly Ser Ala Val Cys Val 350 355 360
Phe Thr Met Lys Asp Val Gin Arg Val Phe Ser Gly Leu Tyr Lys 365 370 375
Glu Val Asn Arg Glu Thr Gin Gin Trp Tyr Thr Val Thr His Pro 380 385 390
Val Pro Thr Pro Arg Pro Gly Ala Cys Ile Thr Asn Ser Ala Arg 395 400 405
Glu Arg Lys Ile Asn Ser Ser Leu Gin Leu Pro Asp Arg Val Leu 410 415 420
Asn Phe Leu Lys Asp His Phe Leu Met Asp Gly Gin Val Arg Ser 425 430 435
Arg Met Leu Leu Leu Gin Pro Gin Ala Arg Tyr Gin Arg Val Ala 440 445 450
Val His Arg Val Pro Gly Leu His His Thr Tyr Asp Val Leu Phe 455 460 465
Leu Gly Thr Gly Asp Gly Arg Leu His Lys Ala Val Ser Val Gly 470 475 480
Pro Arg Val His lie Ile Glu Glu Leu Gin Ile Phe Ser Ser Gly 485 490 495
Gin Pro Val Gin Asn Leu Leu Leu Asp Thr His Arg Gly Leu Leu 500 505 510
Tyr Ala Ala Ser His Ser Gly Val Val Gin Val Pro Met Ala Asn 515 520 525
712
Cys Ser Leu Tyr Arg Ser Cys Gly Asp Cys Leu Leu Ala Arg Asp 530 535 540
Pro Tyr Cys Ala Trp Ser Gly Ser Ser Cys Lys His Val Ser Leu 545 550 555
Tyr Gin Pro Gin Leu Ala Thr Arg Pro Trp Ile Gin Asp Ile Glu 560 565 570
Gly Ala Ser Ala Lys Asp Leu Cys Ser Ala Ser Ser Val Val Ser 575 580 585
Pro Ser Phe Val Pro Thr Gly Glu Lys Pro Cys Glu Gin Val Gin 590 595 600
Phe Gin Pro Asn Thr Val Asn Thr Leu Ala Cys Pro Leu Leu Ser 605 610 615
Asn Leu Ala Thr Arg Leu Trp Leu Arg Asn Gly Ala Pro Val Asn 620 625 630
Ala Ser Ala Ser Cys His Val Leu Pro Thr Gly Asp Leu Leu Leu 635 640 645
Val Gly Thr Gin Gin Leu Gly Glu Phe Gin Cys Trp Ser Leu Glu 650 655 660
Glu Gly Phe Gin Gin Leu Val Ala Ser Tyr Cys Pro Glu Val Val 665 670 675
Glu Asp Gly Val Ala Asp Gin Thr Asp Glu Gly Gly Ser Val Pro 680 685 690
Val Ile Ile Ser Thr Ser Arg Val Ser Ala Pro Ala Gly Gly Lys 695 700 705
Ala Ser Trp Gly Ala Asp Arg Ser Tyr Trp Lys Glu Phe Leu Val 710 715 720
Met Cys Thr Leu Phe Val Leu Ala Val Leu Leu Pro Val Leu Phe 725 730 735
Leu Leu Tyr Arg His Arg Asn Ser Met Lys Val Phe Leu Lys Gin 740 745 750
Gly Glu Cys Ala Ser Val His Pro Lys Thr Cys Pro Val Val Leu 755 760 765
Pro Pro Glu Thr Arg Pro Leu Asn Gly Leu Gly Pro Pro Ser Thr 770 775 780
Pro Leu Asp His Arg Gly Tyr Gin Ser Leu Ser Asp Ser Pro Pro 785 790 795
Gly Ala Arg Val Phe Thr Glu Ser Glu Lys Arg Pro Leu Ser Ile 800 805 810
Gin Asp Ser Phe Val Glu Val Ser Pro Val Cys Pro Arg Pro Arg 815 820 825
Val Arg Leu Gly Ser Glu Ile Arg Asp Ser Val Val
713
830 835
<210> 254 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 254 agcccgtgca gaatctgctc ctgg 24
<210> 255 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 255 tgaagccagg gcagcgtcct ctgg 24
<210> 256 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 256 gtacaggctg cagttggc 18
<210> 257 <211> 41 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 257
agaagccatg tgagcaagtc cagttccagc ccaacacagt g 41
<210> 258 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 258
gagctgcaga tcttctcatc gggacagccc gtgcagaatc tgctc 45
<210> 259
<211> 4563
<212> DNA
<213> Homo sapiens
714
<220>
<221> unsure
<222> 3635
<223> unknown base
<400> 259
ctaagccgga ggatgtgcag ctgcggcggc ggcgccggct acgaagagga 50
cggggacagg cgccgtgcga accgagccca gccagccgga ggacgcgggc 100
agggcgggac gggagcccgg actcgtctgc cgccgccgtc gtcgccgtcg 150
tgccggcccc gcgtccccgc gcgcgagcgg gaggagccgc cgccacctcg 200
cgcccgagcc gccgctagcg cgcgccgggc atggtcccct cttaaaggcg 250
caggccgcgg cggcgggggc gggtgtgcgg aacaaagcgc cggcgcgggg 300
cctgcgggcg gctcgggggc cgcgatgggc gcggcgggcc cgcggcggcg 350
gcggcgctgc ccgggccggg cctcgcggcg ctagggcggg ctggcctccg 400
tgggcggggg cagcgggctg agggcgcgcg gagcctgcgg cggcggcggc 450
ggcggcggcg gcggcccggc gggcggagcg gcgcgggcat ggccgcgcgc 500
ggccggcgcg cctggctcag cgtgctgctc gggctcgtcc tgggcttcgt 550
gctggcctcg cggctcgtcc tgccccgggc ttccgagctg aagcgagcgg 600
gcccacggcg ccgcgccagc cccgagggct gccggtccgg gcaggcggcg 650
gcttcccagg ccggcggggc gcgcggcgat gcgcgcgggg cgcagctctg 700 gccgcccggc tcggacccag atggcggccc gcgcgacagg aactttctct 750
tcgtgggagt catgaccgcc cagaaatacc tgcagactcg ggccgtggcc 800
gcctacagaa catggtccaa gacaattcct gggaaagttc agttcttctc 850
aagtgagggt tctgacacat ctgtaccaat tccagtagtg ccactacggg 900
gtgtggacga ctcctacccg ccccagaaga agtccttcat gatgctcaag 950
tacatgcacg accactactt ggacaagtat gaatggttta tgagagcaga 1000
tgatgacgtg tacatcaaag gagaccgtct ggagaacttc ctgaggagtt 1050
tgaacagcag cgagcccctc tttcttgggc agacaggcct gggcaccacg 1100
gaagaaatgg gaaaactggc cctggagcct ggtgagaact tctgcatggg 1150
ggggcctggc gtgatcatga gccgggaggt gcttcggaga atggtgccgc 1200
acattggcaa gtgtctccgg gagatgtaca ccacccatga ggacgtggag 12 50
gtgggaaggt gtgtccggag gtttgcaggg gtgcagtgtg tctggtctta 1300
tgagatgcgg cagctttttt atgagaatta cgagcagaac aaaaaggggt 1350
acattagaga tctccataac agtaaaattc accaagctat cacattacac 1400
cccaacaaaa acccacccta ccagtacagg ctccacagct acatgctgag 1450
715
ccgcaagata tccgagctcc gccatcgcac aatacagctg caccgcgaaa
1500
ttgtcctgat gagcaaatac agcaacacag aaattcataa agaggacctc
1550
cagctgggaa tccctccctc cttcatgagg tttcagcccc gccagcgaga
1600
ggagattctg gaatgggagt ttctgactgg aaaatacttg tattcggcag
1650
ttgacggcca gccccctcga agaggaatgg actccgccca gagggaagcc
1700
ttggacgaca ttgtcatgca ggtcatggag atgatcaatg ccaacgccaa
1750
gaccagaggg cgcatcattg acttcaaaga gatccagtac ggctaccgcc
1800
gggtgaaccc catgtatggg gctgagtaca tcctggacct gctgcttctg
1850
tacaaaaagc acaaagggaa gaaaatgacg gtccctgtga ggaggcacgc
1900
gtatttacag cagactttca gcaaaatcca gtttgtggag catgaggagc
1950
tggatgcaca agagttggcc aagagaatca atcaggaatc tggatccttg
2000
tcctttctct caaactccct gaagaagctc gtcccctttc agctccctgg
2050
gtcgaagagt gagcacaaag aacccaaaga taaaaagata aacatactga
2100
ttcctttgtc tgggcgtttc gacatgtttg tgagatttat gggaaacttt
2150
gagaagacgt gtcttatccc caatcagaac gtcaagctcg tggttctgct
2200
tttcaattct gactccaacc ctgacaaggc caaacaagtt gaactgatga
2250
gagattaccg tctggagagt cattaagtac tttcaagagc cctaaagccg cctggccctg acatgcagat gaagtaggat tttgcctgtg cctcccagtt
2300 2350
taacaatgaa tctttgctct tcttctgcga cgtcgacctc gtgtttacta
2400
cagaattcct tcagcgatgt cgagcaaata cagttctggg ccaacaaata
2450
tattttccaa tcatcttcag ccagtatgac ccaaagattg tttatagtgg
2500
gaaagttccc agtgacaacc attttgcctt tactcagaaa actggcttct
2550
ggagaaacta tgggtttggc atcacgtgta tttataaggg agatcttgtc
2600
cgagtgggtg gctttgatgt ttccatccaa ggctgggggc tggaggatgt
2650
ggaccttttc aacaaggttg tccaggcagg tttgaagacg tttaggagcc
2700
aggaagtagg agtagtccac gtccaccatc ctgtcttttg tgatcccaat
2750
cttgacccca aacagtacaa aatgtgcttg gggtccaaag catcgaccta
2800
tgggtccacc cagcagctgg ctgagatgtg gctggaaaaa aatgatccaa
2850
gttacagtaa aagcagcaat aataatggct cagtgaggac agcctaatgt
2900
ccagctttgc tggaaaagac gtttttaatt atctaattta tttttcaaaa
2950
attttttgta tgatcagttt ttgaagtccg tatacaagga tatattttac
3000
716
aagtggtttt cttacatagg actcctttaa gattgagctt tctgaacaag
3050
aaggtgatca gtgtttgcct ttgaacacat cttcttgctg aacattatgt
3100
agcagacctg cttaactttg acttgaaatg tacctgatga acaaaacttt
3150
tttaaaaaaa tgttttcttt tgagaccctt tgctccagtc ctatggcaga
3200
aaacgtgaac attcctgcaa agtattattg taacaaaaca ctgtaactct
3250
ggtaaatgtt ctgttgtgat tgttaacatt ccacagattc taccttttgt
3300
gttttgtttt ttttttttac aattgtttta aagccatttc atgttccagt
3350
tgtaagataa ggaaatgtga taatagctgt ttcatcattg tcttcaggag
3400
agctttccag agttgatcat ttcctctcat ggtactctgc tcagcatggc
3450
cacgtaggtt ttttgtttgt tttgttttgt tctttttttg agacggagtc
3500
tcactctgtt acccaggctg gaatgcagtg gcgcaatctt ggctcacttt
3550
aacctccact tccctggttc aagcaattcc cctgcctttg cctcccgagt
3600
agctgggatt acaggcacac accaccacgc ccagntagtt tttttgtatt
3650
tttagtagag acggggtttc accatgcaag cccagctggc cacgtaggtt
3700
ttaaagcaag gggcgtgaag aaggcacagt gaggtatgtg gctgttctcg
3750
tggtagttca ttcggcctaa atagacctgg cattaaattt caagaaggat
3800
ttggcatttt ctcttcttga cccttctctt taaagggtaa aatattaatg
3850
tttagaatga gaaacataca caaagatgaa cacatacacc ttattacaat ctaatcaaaa aaatctgatg cgttggggaa tacacagact aaatgtattt
3900 3950
ggttttgttc ctttcatcct gtctgtgtta tgtgggtgga gatggttttc
4000
attctttcat tactgttttg ttttatcctt tgtatctgaa atacctttaa
4050
tttatttaat atctgttgtt cagagctctg ccatttcttg agtacctgtt
4100
agttagtatt atttatgtgt atcgggagtg tgtttagtct gttttatttg
4150
cagtaaaccg atctccaaag atttcctttt ggaaacgctt tttcccctcc
4200
ttaattttta tattccttac tgttttacta aatattaagt gttctttgac
4250
aattttggtg ctcatgtgtt ttggggacaa aagtgaaatg aatctgtcat
4300
tataccagaa agttaaattc tcagatcaaa tgtgccttaa taaatttgtt
4350
ttcatttaga tttcaaacag tgatagactt gccattttaa tacacgtcat
4400
tggagggctg cgtatttgta aatagcctga tgctcatttg gaaaaataaa
4450
ccagtgaaca atatttttct attgtacttt tcgaaccatt ttgtctcatt
4500
attcctgttt tagctgaaga attgtattac atttggagag taaaaaactt
4550
aaacacgaaa aaa 4563
717
<210> 260 <211> 802 <212> PRT
<213> Homo sapiens <400> 260
Met Ala Ala Arg Gly Arg Arg Ala Trp Leu Ser Val Leu Leu Gly 15 10 15
Leu Val Leu Gly Phe Val Leu Ala Ser Arg Leu Val Leu Pro Arg 20 25 30
Ala Ser Glu Leu Lys Arg Ala Gly Pro Arg Arg Arg Ala Ser Pro 35 40 45
Glu Gly Cys Arg Ser Gly Gin Ala Ala Ala Ser Gin Ala Gly Gly 50 55 60
Ala Arg Gly Asp Ala Arg Gly Ala Gin Leu Trp Pro Pro Gly Ser 65 70 75
Asp Pro Asp Gly Gly Pro Arg Asp Arg Asn Phe Leu Phe Val Gly 80 85 90
Val Met Thr Ala Gin Lys Tyr Leu Gin Thr Arg Ala Val Ala Ala 95 100 105
Tyr Arg Thr Trp Ser Lys Thr Ile Pro Gly Lys Val Gin Phe Phe 110 115 120
Ser Ser Glu Gly Ser Asp Thr Ser Val Pro Ile Pro Val Val Pro 125 130 135
Leu Arg Gly Val Asp Asp Ser Tyr Pro Pro Gin Lys Lys Ser Phe 140 145 150
Met Met Leu Lys Tyr Met His Asp His Tyr Leu Asp Lys Tyr Glu 155 160 165
Trp Phe Met Arg Ala Asp Asp Asp Val Tyr Ile Lys Gly Asp Arg 170 175 180
Leu Glu Asn Phe Leu Arg Ser Leu Asn Ser Ser Glu Pro Leu Phe 185 190 195
Leu Gly Gin Thr Gly Leu Gly Thr Thr Glu Glu Met Gly Lys Leu 200 205 210
Ala Leu Glu Pro Gly Glu Asn Phe Cys Met Gly Gly Pro Gly Val 215 220 225
Ile Met Ser Arg Glu Val Leu Arg Arg Met Val Pro His Ile Gly 230 235 240
Lys Cys Leu Arg Glu Met Tyr Thr Thr His Glu Asp Val Glu Val 245 250 255
Gly Arg Cys Val Arg Arg Phe Ala Gly Val Gin Cys Val Trp Ser 260 265 270
Tyr Glu Met Arg Gin Leu Phe Tyr Glu Asn Tyr Glu Gin Asn Lys 275 280 285
718
Lys Gly Tyr Ile Arg Asp Leu His Asn Ser Lys Ile His Gin Ala 290 295 300
Ile Thr Leu His Pro Asn Lys Asn Pro Pro Tyr Gin Tyr Arg Leu 305 310 315
His Ser Tyr Met Leu Ser Arg Lys Ile Ser Glu Leu Arg His Arg 320 325 330
Thr Ile Gin Leu His Arg Glu Ile Val Leu Met Ser Lys Tyr Ser 335 340 345
Asn Thr Glu Ile His Lys Glu Asp Leu Gin Leu Gly Ile Pro Pro 350 355 360
Ser Phe Met Arg Phe Gin Pro Arg Gin Arg Glu Glu Ile Leu Glu 365 370 375
Trp Glu Phe Leu Thr Gly Lys Tyr Leu Tyr Ser Ala Val Asp Gly 380 385 390
Gin Pro Pro Arg Arg Gly Met Asp Ser Ala Gin Arg Glu Ala Leu 395 400 405
Asp Asp Ile Val Met Gin Val Met Glu Met Ile Asn Ala Asn Ala 410 415 420
Lys Thr Arg Gly Arg Ile Ile Asp Phe Lys Glu Ile Gin Tyr Gly 425 430 435
Tyr Arg Arg Val Asn Pro Met Tyr Gly Ala Glu Tyr Ile Leu Asp 440 445 450
Leu Leu Leu Leu Tyr Lys Lys His Lys Gly Lys Lys Met Thr Val 455 460 465
Pro Val Arg Arg His Ala Tyr Leu Gin Gin Thr Phe Ser Lys Ile 470 475 480
Gin Phe Val Glu His Glu Glu Leu Asp Ala Gin Glu Leu Ala Lys 485 490 495
Arg Ile Asn Gin Glu Ser Gly Ser Leu Ser Phe Leu Ser Asn Ser 500 505 510
Leu Lys Lys Leu Val Pro Phe Gin Leu Pro Gly Ser Lys Ser Glu 515 520 525
His Lys Glu Pro Lys Asp Lys Lys Ile Asn Ile Leu Ile Pro Leu 530 535 540
Ser Gly Arg Phe Asp Met Phe Val Arg Phe Met Gly Asn Phe Glu 545 550 555
Lys Thr Cys Leu Ile Pro Asn Gin Asn Val Lys Leu Val Val Leu 560 565 570
Leu Phe Asn Ser Asp Ser Asn Pro Asp Lys Ala Lys Gin Val Glu 575 580 585
Leu Met Arg Asp Tyr Arg Ile Lys Tyr Pro Lys Ala Asp Met Gin
719
590 595 600
Ile Leu Pro Val Ser Gly Glu Phe Ser Arg Ala Leu Ala Leu Glu 605 610 615
Val Gly Ser Ser Gin Phe Asn Asn Glu Ser Leu Leu Phe Phe Cys 620 625 630
Asp Val Asp Leu Val Phe Thr Thr Glu Phe Leu Gin Arg Cys Arg 635 640 645
Ala Asn Thr Val Leu Gly Gin Gin Ile Tyr Phe Pro Ile Ile Phe 650 655 660
Ser Gin Tyr Asp Pro Lys Ile Val Tyr Ser Gly Lys Val Pro Ser 665 670 675
Asp Asn His Phe Ala Phe Thr Gin Lys Thr Gly Phe Trp Arg Asn 680 685 690
Tyr Gly Phe Gly Ile Thr Cys Ile Tyr Lys Gly Asp Leu Val Arg 695 700 705
Val Gly Gly Phe Asp Val Ser Ile Gin Gly Trp Gly Leu Glu Asp 710 715 720
Val Asp Leu Phe Asn Lys Val Val Gin Ala Gly Leu Lys Thr Phe 725 730 735
Arg Ser Gin Glu Val Gly Val Val His Val His His Pro Val Phe 740 745 750
Cys Asp Pro Asn Leu Asp Pro Lys Gin Tyr Lys Met Cys Leu Gly 755 760 765
Ser Lys Ala Ser Thr Tyr Gly Ser Thr Gin Gin Leu Ala Glu Met 770 775 780
Trp Leu Glu Lys Asn Asp Pro Ser Tyr Ser Lys Ser Ser Asn Asn 785 790 795
Asn Gly Ser Val Arg Thr Ala 800
<210> 261
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 261
gtgccactac ggggtgtgga cgac 24
<210> 262
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 262 tcccatttct tccgtggtgc ccag 24
<210> 263 <211> 46 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 263
ccagaagaag tccttcatga tgctcaagta catgcacgac cactac 46
<210> 264
<211> 1419
<212> DNA
<213> Homo sapiens
<400> 264
ggacaaccgt tgctgggtgt cccagggcct gaggcaggac ggtactccgc
50
tgacaccttc cctttcggcc ttgaggttcc cagcctggtg gccccaggac
100
gttccggtcg catggcagag tgctacggac gacgcctatg aagcccttag
150
tccttctagt tgcgcttttg ctatggcctt cgtctgtgcc ggcttatccg
200
agcataactg tgacacctga tgaagagcaa aacttgaatc attatataca
250
agttttagag aacctagtac gaagtgttcc ctctggggag ccaggtcgtg
300
agaaaaaatc taactctcca aaacatgttt attctatagc atcaaaggga
350
tcaaaattta aggagctagt tacacatgga gacgcttcaa ctgagaatga
400
tgttttaacc aatcctatca gtgaagaaac tacaactttc cctacaggag
450
gcttcacacc ggaaatagga aagaaaaaac acacggaaag taccccattc
500
tggtcgatca aaccaaacaa tgtttccatt gttttgcatg cagaggaacc
550 .
ttatattgaa aatgaagagc cagagccaga gccggagcca gctgcaaaac
600
aaactgaggc accaagaatg ttgccagttg ttactgaatc atctacaagt
650
ccatatgtta cctcatacaa gtcacctgtc accactttag ataagagcac
700
tggcattgag atctctacag aatcagaaga tgttcctcag ctctcaggtg
750
aaactgcgat agaaaaaccc gaagagtttg gaaagcaccc agagagttgg
800
aataatgatg acattttgaa aaaaatttta gatattaatt cacaagtgca
850
acaggcactt cttagtgaca ccagcaaccc agcatataga gaagatattg
900
aagcctctaa agatcaccta aaacgaagcc ttgctctagc agcagcagca
950
gaacataaat taaaaacaat gtataagtcc cagttattgc cagtaggacg
1000
aacaagtaat aaaattgatg acatcgaaac tgttattaac atgctgtgta
1050
721
attctagatc taaactctat gaatatttag atattaaatg tgttccacca 1100
gagatgagag aaaaagctgc tacagtattc aatacattaa aaaatatgtg 1150
tagatcaagg agagtcacag ccttattaaa agtttattaa acaataatat 1200
aaaaatttta aacctacttg atattccata acaaagctga tttaagcaaa 1250
ctgcattttt tcacaggaga aataatcata ttcgtaattt caaaagttgt 1300
ataaaaatat tttctattgt agttcaaatg tgccaacatc tttatgtgtc 1350
atgtgttatg aacaattttc atatgcacta aaaacctaat ttaaaataaa 1400
attttggttc aggaaaaaa 1419
<210> 265 <211> 350 <212> PRT
<213> Homo sapiens <400> 265
Met Lys Pro Leu Val Leu Leu Val Ala Leu Leu Leu Trp Pro Ser 15 10 15
Ser Val Pro Ala Tyr Pro Ser Ile Thr Val Thr Pro Asp Glu Glu 20 25 30
Gin Asn Leu Asn His Tyr Ile Gin Val Leu Glu Asn Leu Val Arg 35 40 45
Ser Val Pro Ser Gly Glu Pro Gly Arg Glu Lys Lys Ser Asn Ser 50 55 60
Pro Lys His Val Tyr Ser Ile Ala Ser Lys Gly Ser Lys Phe Lys 65 70 75
Glu Leu Val Thr His Gly Asp Ala Ser Thr Glu Asn Asp Val Leu 80 85 90
Thr Asn Pro Ile Ser Glu Glu Thr Thr Thr Phe Pro Thr Gly Gly 95 100 105
Phe Thr Pro Glu Ile Gly Lys Lys Lys His Thr Glu Ser Thr Pro 110 115 120
Phe Trp Ser Ile Lys Pro Asn Asn Val Ser Ile Val Leu His Ala 125 130 135
Glu Glu Pro Tyr Ile Glu Asn Glu Glu Pro Glu Pro Glu Pro Glu 140 145 150
Pro Ala Ala Lys Gin Thr Glu Ala Pro Arg Met Leu Pro Val Val 155 160 165
Thr Glu Ser Ser Thr Ser Pro Tyr Val Thr Ser Tyr Lys Ser Pro 170 175 180
Val Thr Thr Leu Asp Lys Ser Thr Gly lie Glu Ile Ser Thr Glu 185 190 195
Ser Glu Asp Val Pro Gin Leu Ser Gly Glu Thr Ala Ile Glu Lys
722
200 205 210
Pro Glu Glu Phe Gly Lys His Pro Glu Ser Trp Asn Asn Asp Asp
215 220 225
Ile Leu Lys Lys Ile Leu Asp Ile Asn Ser Gin Val Gin Gin Ala
230 235 240
Leu Leu Ser Asp Thr Ser Asn Pro Ala Tyr Arg Glu Asp Ile Glu
245 250 255
Ala Ser Lys Asp His Leu Lys Arg Ser Leu Ala Leu Ala Ala Ala
260 265 270
Ala Glu His Lys Leu Lys Thr Met Tyr Lys Ser Gin Leu Leu Pro
275 280 285
Val Gly Arg Thr Ser Asn Lys Ile Asp Asp Ile Glu Thr Val Ile
290 295 300
Asn Met Leu Cys Asn Ser Arg Ser Lys Leu Tyr Glu Tyr Leu Asp
305 310 315
Ile Lys Cys Val Pro Pro Glu Met Arg Glu Lys Ala Ala Thr Val
320 325 330
Phe Asn Thr Leu Lys Asn Met Cys Arg Ser Arg Arg Val Thr Ala
335 340 345
Leu Leu Lys Val Tyr 350
<210> 266 <211> 2403 <212> DNA
<213> Homo sapiens <400> 266
cggctcgagc ggctcgagtg aagagcctct ccacggctcc tgcgcctgag
50
acagctggcc ttcatagtgt tgacctccaa gagatcaacc atcatccatc cacaggaata cacccctgct tccatggctt gtcatctgtt ttgtgctcat
100 150
tttggttctc agtttctacg agctggtgtc aggacagtgg caagtcactg
200
gaccgggcaa gtttgtccag gccttggtgg gggaggacgc cgtgttctcc
250
tgctccctct ttcctgagac cagtgcagag gctatggaag tgcggttctt
300
caggaatcag ttccatgctg tggtccacct ctacagagat ggggaagact
350
gggaatctaa gcagatgcca cagtatcgag ggagaactga gtttgtgaag
400
gactccattg caggggggcg tgtctctcta aggctaaaaa acatcactcc
450
ctcggacatc ggcctgtatg ggtgctggtt cagttcccag atttacgatg
500
aggaggccac ctgggagctg cgggtggcag cactgggctc acttcctctc
550
atttccatcg tgggatatgt tgacggaggt atccagttac tctgcctgtc
600
ctcaggctgg ttcccccagc ccacagccaa gtggaaaggt ccacaaggac
650
723
aggatttgtc ttcagactcc agagcaaatg cagatgggta cagcctgtat
700
gatgtggaga tctccattat agtccaggaa aatgctggga gcatattgtg
750
ttccatccac cttgctgagc agagtcatga ggtggaatcc aaggtattga
800
taggagagac gtttttccag ccctcacctt ggcgcctggc ttctatttta
850
ctcgggttac tctgtggtgc cctgtgtggt gttgtcatgg ggatgataat
900
tgttttcttc aaatccaaag ggaaaatcca ggcggaactg gactggagaa
950
gaaagcacgg acaggcagaa ttgagagacg cccggaaaca cgcagtggag
1000
gtgactctgg atccagagac ggctcacccg aagctctgcg tttctgatct
1050
gaaaactgta acccatagaa aagctcccca ggaggtgcct cactctgaga
1100
agagatttac aaggaagagt gtggtggctt ctcagggttt ccaagcaggg
1150
agacattact gggaggtgga cgtgggacaa aatgtagggt ggtatgtggg
1200
agtgtgtcgg gatgacgtag acagggggaa gaacaatgtg actttgtctc
1250
ccaacaatgg gtattgggtc ctcagactga caacagaaca tttgtatttc
1300
acattcaatc cccattttat cagcctcccc cccagcaccc ctcctacacg
1350
agtaggggtc ttcctggact atgagggtgg gaccatctcc ttcttcaata
1400
caaatgacca gtcccttatt tataccctgc tgacatgtca gtttgaaggc
1450
ttgttgagac cctatatcca gcatgcgatg tatgacgagg aaaaggggac
1500
tcccatattc atatgtccag tgtcctgggg atgagacaga gaagaccctg
1550
cttaaagggc cccacaccac agacccagac acagccaagg gagagtgctc
1600
ccgacaggtg gccccagctt cctctccgga gcctgcgcac agagagtcac
1650
gccccccact agcagcggca ctcctttagg gtcacagctt gagctgaggt ccagatgagg tcttctgccc ggggattggc tgagccctgc ctgaccctgt
1700 1750
gggagtcaga agccatggct gccctgaagt ggggacggaa tagactcaca
1800
ttaggtttag tttgtgaaaa ctccatccag ctaagcgatc ttgaacaagt
1850
cacaacctcc caggctcctc atttgctagt cacggacagt gattcctgcc
1900
tcacaggtga agattaaaga gacaacgaat gtgaatcatg cttgcaggtt
1950
tgagggcaca gtgtttgcta atgatgtgtt tttatattat acattttccc
2000
accataaact ctgtttgctt attccacatt aatttacttt tctctatacc
2050
aaatcaccca tggaatagtt attgaacacc tgctttgtga ggctcaaaga
2100
ataaagagga ggtaggattt ttcactgatt ctataagccc agcattacct
2150
gataccaaaa ccaggcaaag aaaacagaag aagaggaagg aaaactacag
2200
724
gtccatatcc ctcattaaca cagacacaaa aattctaaat aaaattttaa 22 50
caaattaaac taaacaatat atttaaagat gatatataac tactcagtgt 2300
ggtttgtccc acaaatgcag agttggttta atatttaaat atcaaccagt 23 50
gtaattcagc acattaataa agtaaaaaag aaaaccataa aaaaaaaaaa 2400
aaa 2403
<210> 267 <211> 466 <212> PRT <213> Homo sapiens
<400> 267
Met Ala Phe Val Leu lie Leu Val Leu Ser Phe Tyr Glu Leu Val 15 10 15
Ser Gly Gin Trp Gin Val Thr Gly Pro Gly Lys Phe Val Gin Ala 20 25 30
Leu Val Gly Glu Asp Ala Val Phe Ser Cys Ser Leu Phe Pro Glu 35 40 45
Thr Ser Ala Glu Ala Met Glu Val Arg Phe Phe Arg Asn Gin Phe 50 55 60
His Ala Val Val His Leu Tyr Arg Asp Gly Glu Asp Trp Glu Ser 65 70 75
Lys Gin Met Pro Gin Tyr Arg Gly Arg Thr Glu Phe Val Lys Asp 80 85 90
Ser Ile Ala Gly Gly Arg Val Ser Leu Arg Leu Lys Asn Ile Thr 95 100 105
Pro Ser Asp Ile Gly Leu Tyr Gly Cys Trp Phe Ser Ser Gin Ile 110 115 120
Tyr Asp Glu Glu Ala Thr Trp Glu Leu Arg Val Ala Ala Leu Gly 125 130 135
Ser Leu Pro Leu Ile Ser Ile Val Gly Tyr Val Asp Gly Gly Ile 140 145 150
Gin Leu Leu Cys Leu Ser Ser Gly Trp Phe Pro Gin Pro Thr Ala 155 160 165
Lys Trp Lys Gly Pro Gin Gly Gin Asp Leu Ser Ser Asp Ser Arg 170 175 180
Ala Asn Ala Asp Gly Tyr Ser Leu Tyr Asp Val Glu Ile Ser Ile 185 190 195
Ile Val Gin Glu Asn Ala Gly Ser Ile Leu Cys Ser Ile His Leu 200 205 210
Ala Glu Gin Ser His Glu Val Glu Ser Lys Val Leu Ile Gly Glu 215 220 225
Thr Phe Phe Gin Pro Ser Pro Trp Arg Leu Ala Ser Ile Leu Leu
230
725
235
240
Gly Leu Leu Cys Gly Ala Leu Cys Gly Val Val Met Gly Met Ile 245 250 255
Ile Val Phe Phe Lys Ser Lys Gly Lys Ile Gin Ala Glu Leu Asp 260 265 270
Trp Arg Arg Lys His Gly Gin Ala Glu Leu Arg Asp Ala Arg Lys 275 280 285
His Ala Val Glu Val Thr Leu Asp Pro Glu Thr Ala His Pro Lys 290 295 300
Leu Cys Val Ser Asp Leu Lys Thr Val Thr His Arg Lys Ala Pro 305 310 315
Gin Glu Val Pro His Ser Glu Lys Arg Phe Thr Arg Lys Ser Val 320 325 330
Val Ala Ser Gin Gly Phe Gin Ala Gly Arg His Tyr Trp Glu Val 335 340 345
Asp Val Gly Gin Asn Val Gly Trp Tyr Val Gly Val Cys Arg Asp 350 355 360
Asp Val Asp Arg Gly Lys Asn Asn Val Thr Leu Ser Pro Asn Asn 365 370 375
Gly Tyr Trp Val Leu Arg Leu Thr Thr Glu His Leu Tyr Phe Thr 380 385 390
Phe Asn Pro His Phe Ile Ser Leu Pro Pro Ser Thr Pro Pro Thr 395 400 405
Arg Val Gly Val Phe Leu Asp Tyr Glu Gly Gly Thr Ile Ser Phe 410 415 420
Phe Asn Thr Asn Asp Gin Ser Leu Ile Tyr Thr Leu Leu Thr Cys 425 430 435
Gin Phe Glu Gly Leu Leu Arg'Pro Tyr Ile Gin His Ala Met Tyr 440 445 450
Asp Glu Glu Lys Gly Thr Pro Ile Phe Ile Cys Pro Val Ser Trp 455 460 465
Gly
<210> 268
<211> 2103
<212> DNA
<213> Homo sapiens
<400> 268
ccttcacagg actcttcatt gctggttggc aatgatgtat cggccagatg 50 tggtgagggc taggaaaaga gtttgttggg aaccctgggt tatcggcctc 100 gtcatcttca tatccctgat tgtcctggca gtgtgcattg gactcactgt 150
726
tcattatgtg agatataatc aaaagaagac ctacaattac tatagcacat
200
tgtcatttac aactgacaaa ctatatgctg agtttggcag agaggcttct
250
aacaatttta cagaaatgag ccagagactt gaatcaatgg tgaaaaatgc
300
attttataaa tctccattaa gggaagaatt tgtcaagtct caggttatca
350
agttcagtca acagaagcat ggagtgttgg ctcatatgct gttgatttgt
400
agatttcact ctactgagga tcctgaaact gtagataaaa ttgttcaact
450
tgttttacat gaaaagctgc aagatgctgt aggaccccct aaagtagatc
500
ctcactcagt taaaattaaa aaaatcaaca agacagaaac agacagctat
550
ctaaaccatt gctgcggaac acgaagaagt aaaactctag gtcagagtct
600
caggatcgtt ggtgggacag aagtagaaga gggtgaatgg ccctggcagg
650
ctagcctgca gtgggatggg agtcatcgct gtggagcaac cttaattaat
700
gccacatggc ttgtgagtgc tgctcactgt tttacaacat ataagaaccc
750
tgccagatgg actgcttcct ttggagtaac aataaaacct tcgaaaatga
800
aacggggtct ccggagaata attgtccatg aaaaatacaa acacccatca
850
catgactatg atatttctct tgcagagctt tctagccctg ttccctacac
900
aaatgcagta catagagttt gtctccctga tgcatcctat gagtttcaac
950
caggtgatgt gatgtttgtg acaggatttg gagcactgaa aaatgatggt
1000
tacagtcaaa atcatcttcg acaagcacag gtgactctca tagacgctac
1050
aacttgcaat gaacctcaag cttacaatga cgccataact cctagaatgt
1100
tatgtgctgg ctccttagaa ggaaaaacag atgcatgcca gggtgactct
1150
ggaggaccac tggttagttc agatgctaga gatatctggt accttgctgg
1200
aatagtgagc tggggagatg aatgtgcgaa acccaacaag cctggtgttt
1250
atactagagt taagagacaa tacggccttg aagcctcatg cgggactgga gaacagataa ttacttcaaa catttttttt aactggtatc tgttttttgg
1300 1350
gtgtggaggc catttttaga gatacagaat tggagaagac ttgcaaaaca
1400
gctagatttg actgatctca ataaactgtt tgcttgatgc atgtattttc
1450
ttcccagctc tgttccgcac gtaagcatcc tgcttctgcc agatcaactc
1500
tgtcatctgt gagcaatagt tgaaacttta tgtacataga gaaatagata
1550
atacaatatt acattacagc ctgtattcat ttgttctcta gaagttttgt
1600
cagaattttg acttgttgac ataaatttgt aatgcatata tacaatttga
1650
agcactcctt ttcttcagtt cctcagctcc tctcatttca gcaaatatcc
1700
attttcaagg tgcagaacaa ggagtgaaag aaaatataag aagaaaaaaa
1750
727
tcccctacat tttattggca cagaaaagta ttaggtgttt ttcttagtgg 1800
aatattagaa atgatcatat tcattatgaa aggtcaagca aagacagcag 1850
aataccaatc acttcatcat ttaggaagta tgggaactaa gttaaggaag 1900
tccagaaaga agccaagata tatccttatt ttcatttcca aacaactact 1950
atgataaatg tgaagaagat tctgtttttt tgtgacctat aataattata 2000
caaacttcat gcaatgtact tgttctaagc aaattaaagc aaatatttat 2050
ttaacattgt tactgaggat gtcaacatat aacaataaaa tataaatcac 2100
cca 2103
<210> 269 <211> 423 <212> PRT <213> Homo sapiens
<400> 269
Met Met Tyr Arg Pro Asp Val Val Arg Ala Arg Lys Arg Val Cys 15 10 15
Trp Glu Pro Trp Val Ile Gly Leu Val Ile Phe Ile Ser Leu Ile 20 25 30
Val Leu Ala Val Cys Ile Gly Leu Thr Val His Tyr Val Arg Tyr 35 40 45
Asn Gin Lys Lys Thr Tyr Asn Tyr Tyr Ser Thr Leu Ser Phe Thr 50 55 60
Thr Asp Lys Leu Tyr Ala Glu Phe Gly Arg Glu Ala Ser Asn Asn 65 70 75
Phe Thr Glu Met Ser Gin Arg Leu Glu Ser Met Val Lys Asn Ala 80 85 90
Phe Tyr Lys Ser Pro Leu Arg Glu Glu Phe Val Lys Ser Gin Val 95 100 105
Ile Lys Phe Ser Gin Gin Lys His Gly Val Leu Ala His Met Leu 110 115 120
Leu Ile Cys Arg Phe His Ser Thr Glu Asp Pro Glu Thr Val Asp 125 130 135
Lys Ile Val Gin Leu Val Leu His Glu Lys Leu Gin Asp Ala Val 140 145 150
Gly Pro Pro Lys Val Asp Pro His Ser Val Lys Ile Lys Lys Ile 155 160 165
Asn Lys Thr Glu Thr Asp Ser Tyr Leu Asn His Cys Cys Gly Thr 170 175 180
Arg Arg Ser Lys Thr Leu Gly Gin Ser Leu Arg Ile Val Gly Gly 185 190 195
Thr Glu Val Glu Glu Gly Glu Trp Pro Trp Gin Ala Ser Leu Gin
200
728
205
210
Trp Asp Gly Ser His Arg Cys Gly Ala Thr Leu Ile Asn Ala Thr 215 220 225
Trp Leu Val Ser Ala Ala His Cys Phe Thr Thr Tyr Lys Asn Pro 230 235 240
Ala Arg Trp Thr Ala Ser Phe Gly Val Thr Ile Lys Pro Ser Lys 245 250 255
Met Lys Arg Gly Leu Arg Arg Ile Ile Val His Glu Lys Tyr Lys 260 265 270
His Pro Ser His Asp Tyr Asp Ile Ser Leu Ala Glu Leu Ser Ser 275 280 285
Pro Val Pro Tyr Thr Asn Ala Val His Arg Val Cys Leu Pro Asp 290 295 300
Ala Ser Tyr Glu Phe Gin Pro Gly Asp Val Met Phe Val Thr Gly 305 310 315
Phe Gly Ala Leu Lys Asn Asp Gly Tyr Ser Gin Asn His Leu Arg 320 325 330
Gin Ala Gin Val Thr Leu Ile Asp Ala Thr Thr Cys Asn Glu Pro 335 340 345
Gin Ala Tyr Asn Asp Ala Ile Thr Pro Arg Met Leu Cys Ala Gly 350 355 360
Ser Leu Glu Gly Lys Thr Asp Ala Cys Gin Gly Asp Ser Gly Gly 365 370 375
Pro Leu Val Ser Ser Asp Ala Arg Asp Ile Trp Tyr Leu Ala Gly 380 385 390
Ile Val Ser Trp Gly Asp Glu Cys Ala Lys Pro Asn Lys Pro Gly 395 400 405
Val Tyr Thr Arg Val Thr Ala Leu Arg Asp Trp Ile Thr Ser Lys 410 415 420
Thr Gly Ile
<210> 270
<211> 1170
<212> DNA
<213> Homo sapiens
<400> 270
gtcgaaggtt ataaaagctt ccagccaaac ggcattgaag ttgaagatac 50 aacctgacag cacagcctga gatcttgggg atccctcagc ctaacaccca 100 cagacgtcag ctggtggatt cccgctgcat caaggcctac ccactgtctc 150 catgctgggc tctccctgcc ttctgtggct cctggccgtg accttcttgg 200 ttcccagagc tcagcccttg gcccctcaag actttgaaga agaggaggca 250
729
gatgagactg agacggcgtg gccgcctttg ccggctgtcc cctgcgacta 3 00
cgaccactgc cgacacctgc aggtgccctg caaggagcta cagagggtcg 350
ggccggcggc ctgcctgtgc ccaggactct ccagccccgc ccagccgccc 400
gacccgccgc gcatgggaga agtgcgcatt gcggccgaag agggccgcgc 450
agtggtccac tggtgtgccc ccttctcccc ggtcctccac tactggctgc 500
tgctttggga cggcagcgag gctgcgcaga aggggccccc gctgaacgct 550
acggtccgca gagccgaact gaaggggctg aagccagggg gcatttatgt 600
cgtttgcgta gtggccgcta acgaggccgg ggcaagccgc gtgccccagg 650
ctggaggaga gggcctcgag ggggccgaca tccctgcctt cgggccttgc 700
agccgccttg cggtgccgcc caacccccgc actctggtcc acgcggccgt 750
cggggtgggc acggccctgg ccctgctaag ctgtgccgcc ctggtgtggc 800
acttctgcct gcgcgatcgc tggggctgcc cgcgccgagc cgccgcccga 850
gccgcagggg cgctctgaaa ggggcctggg ggcatctcgg gcacagacag 900
ccccacctgg ggcgctcagc ctggcccccg ggaaagagga aaacccgctg 950
cctccaggga gggctggacg gcgagctggg agccagcccc aggctccagg 1000
gccacggcgg agtcatggtt ctcaggactg agcgcttgtt taggtccggt 1050
acttggcgct ttgtttcctg gctgaggtct gggaaggaat agaaaggggc 1100
ccccaatttt tttttaagcg gccagataat aaataatgta acctttgcgg 1150
ttaaaaaaaa aaaaaaaaaa 1170
<210> 271 <211> 238 <212> PRT
<213> Homo sapiens <400> 271
Met Leu Gly Ser Pro Cys Leu Leu Trp Leu Leu Ala Val Thr Phe 15 10 15
Leu Val Pro Arg Ala Gin Pro Leu Ala Pro Gin Asp Phe Glu Glu 20 25 30
Glu Glu Ala Asp Glu Thr Glu Thr Ala Trp Pro Pro Leu Pro Ala 35 40 45
Val Pro Cys Asp Tyr Asp His Cys Arg His Leu Gin Val Pro Cys 50 55 60
Lys Glu Leu Gin Arg Val Gly Pro Ala Ala Cys Leu Cys Pro Gly 65 70 75
Leu Ser Ser Pro Ala Gin Pro Pro Asp Pro Pro Arg Met Gly Glu 80 85 90
730
Val Arg Ile Ala Ala Glu Glu Gly Arg Ala Val Val His Trp Cys 95 100 • 105
Ala Pro Phe Ser Pro Val Leu His Tyr Trp Leu Leu Leu Trp Asp 110 115 120
Gly Ser Glu Ala Ala Gin Lys Gly Pro Pro Leu Asn Ala Thr Val 125 130 135
Arg Arg Ala Glu Leu Lys Gly Leu Lys Pro Gly Gly Ile Tyr Val 140 145 150
Val Cys Val Val Ala Ala Asn Glu Ala Gly Ala Ser Arg Val Pro 155 160 165
Gin Ala Gly Gly Glu Gly Leu Glu Gly Ala Asp Ile Pro Ala Phe 170 175 180
Gly Pro Cys Ser Arg Leu Ala Val Pro Pro Asn Pro Arg Thr Leu 185 190 195
Val His Ala Ala Val Gly Val Gly Thr Ala Leu Ala Leu Leu Ser 200 205 210
Cys Ala Ala Leu Val Trp His Phe Cys Leu Arg Asp Arg Trp Gly 215 220 225
Cys Pro Arg Arg Ala Ala Ala Arg Ala Ala Gly Ala Leu 230 235
<210> 272
<211> 2397
<212> DNA
<213> Homo sapiens
<400> 272
agagaaagaa gcgtctccag ctgaagccaa tgcagccctc cggctctccg
50
cgaagaagtt ccctgccccg atgagccccc gccgtgcgtc cccgactatc
100
cccaggcggg cgtggggcac cgggcccagc gccgacgatc gctgccgttt
150
tgcccttggg agtaggatgt ggtgaaagga tggggcttct cccttacggg
200
gctcacaatg gccagagaag attccgtgaa gtgtctgcgc tgcctgctct
250
acgccctcaa tctgctcttt tggttaatgt ccatcagtgt gttggcagtt
300
tctgcttgga tgagggacta cctaaataat gttctcactt taactgcaga
350
aacgagggta gaggaagcag tcattttgac ttactttcct gtggttcatc
400
cggtcatgat tgctgtttgc tgtttcctta tcattgtggg gatgttagga
450
tattgtggaa cggtgaaaag aaatctgttg cttcttgcat ggtactttgg
500
aagtttgctt gtcattttct gtgtagaact ggcttgtggc gtttggacat
550
atgaacagga acttatggtt ccagtacaat ggtcagatat ggtcactttg
600
aaagccagga tgacaaatta tggattacct agatatcggt ggcttactca
650
731
tgcttggaat ttttttcaga gagagtttaa gtgctgtgga gtagtatatt
700
tcactgactg gttggaaatg acagagatgg actggccccc agattcctgc
750
tgtgttagag aattcccagg atgttccaaa caggcccacc aggaagatct
800
cagtgacctt tatcaagagg gttgtgggaa gaaaatgtat tcctttttga
850
gaggaaccaa acaactgcag gtgctgaggt ttctgggaat ctccattggg
900
gtgacacaaa tcctggccat gattctcacc attactctgc tctgggctct
950
gtattatgat agaagggagc ctgggacaga ccaaatgatg tccttgaaga
1000
atgacaactc tcagcacctg tcatgtccct cagtagaact gttgaaacca
1050
agcctgtcaa gaatctttga acacacatcc atggcaaaca gctttaatac
1100
acactttgag atggaggagt tataaaaaga aatgtcacag aagaaaacca
1150
caaacttgtt ttattggact tgtgaatttt tgagtacata ctatgtgttt
1200
cagaaatatg tagaaataaa aatgttgcca taaaataaca cctaagcata
1250
tactattcta tgctttaaaa tgaggatgga aaagtttcat gtcataagtc
1300
accacctgga caataattga tgcccttaaa atgctgaaga cagatgtcat
1350
acccactgtg tagcctgtgt atgactttta ctgaacacag ttatgttttg
1400
aggcagcatg gtttgattag catttccgca tccatgcaaa cgagtcacat
1450
atggtgggac tggagccata gtaaaggttg atttacttct accaactagt
1500
atataaagta ctaattaaat gctaacatag gaagttagaa aatactaata
1550
acttttatta ctcagcgatc tattcttctg atgctaaata aattatatat
1600
cagaaaactt tcaatattgg tgactaccta aatgtgattt ttgctggtta
1650
ctaaaatatt cttaccactt aaaagagcaa gctaacacat tgtcttaagc
1700
tgatcaggga ttttttgtat ataagtctgt gttaaatctg tataattcag
1750
tcgatttcag ttctgataat gttaagaata accattatga aaaggaaaat
1800
ttgtcctgta tagcatcatt atttttagcc tttcctgtta ataaagcttt
1850
actattctgt taaccactaa cctgggctta ttttgaaaat tattacacat taccagtgtg ataactgtta atacatagga tttaaatact atcattattc
1900 1950
agaatgtagt ctggtcttta ggaagtatta ataagaaaat ttgcacataa
2000
cttagttgat tcagaaagga cttgtatgct gtttttctcc caaatgaaga
2050
ctctttttga cactaaacac tttttaaaaa gcttatcttt gccttctcca
2100
aacaagaagc aatagtctcc aagtcaatat aaattctaca gaaaatagtg
2150
ttctttttct ccagaaaaat gcttgtgaga atcattaaaa catgtgacaa
2200
732
tttagagatt ctttgtttta tttcactgat taatatactg tggcaaatta 22 50
cacagattat taaatttttt tacaagagta tagtatattt atttgaaatg 23 00 ggaaaagtgc attttactgt attttgtgta ttttgtttat ttctcagaat 2350 atggaaagaa aattaaaatg tgtcaataaa tattttctag agagtaa 2397
<210> 273 <211> 305 <212> PRT <213> Homo sapiens
<400> 273
Met Ala Arg Glu Asp Ser Val Lys Cys Leu Arg Cys Leu Leu Tyr 15 10 15
Ala Leu Asn Leu Leu Phe Trp Leu Met Ser Ile Ser Val Leu Ala 20 25 30
Val Ser Ala Trp Met Arg Asp Tyr Leu Asn Asn Val Leu Thr Leu 35 40 45
Thr Ala Glu Thr Arg Val Glu Glu Ala Val Ile Leu Thr Tyr Phe 50 55 60
Pro Val Val His Pro Val Met Ile Ala Val Cys Cys Phe Leu Ile 65 70 75
lie Val Gly Met Leu Gly Tyr Cys Gly Thr Val Lys Arg Asn Leu 80 85 90
Leu Leu Leu Ala Trp Tyr Phe Gly Ser Leu Leu Val Ile Phe Cys 95 100 105
Val Glu Leu Ala Cys Gly Val Trp Thr Tyr Glu Gin Glu Leu Met 110 115 120
Val Pro Val Gin Trp Ser Asp Met Val Thr Leu Lys Ala Arg Met 125 130 135
Thr Asn Tyr Gly Leu Pro Arg Tyr Arg Trp Leu Thr His Ala Trp 140 145 150
Asn Phe Phe Gin Arg Glu Phe Lys Cys Cys Gly Val Val Tyr Phe 155 160 165
Thr Asp Trp Leu Glu Met Thr Glu Met Asp Trp Pro Pro Asp Ser 170 175 180
Cys Cys Val Arg Glu Phe Pro Gly Cys Ser Lys Gin Ala His Gin 185 190 195
Glu Asp Leu Ser Asp Leu Tyr Gin Glu Gly Cys Gly Lys Lys Met 200 205 210
Tyr Ser Phe Leu Arg Gly Thr Lys Gin Leu Gin Val Leu Arg Phe 215 220 225
Leu Gly Ile Ser Ile Gly Val Thr Gin Ile Leu Ala Met Ile Leu 230 235 240
Thr Ile Thr Leu Leu 245
Gly Thr Asp Gin Met 260
Leu Ser Cys Pro Ser 275
Ile Phe Glu His Thr 290
Glu Met Glu Glu Leu 305
<210> 274
<211> 2063
<212> DNA
<213> Homo sapiens
<400> 274
gagagaggca gcagcttgct cagcggacaa ggatgctggg cgtgagggac
50
caaggcctgc cctgcactcg ggcctcctcc agccagtgct gaccagggac
100
ttctgacctg ctggccagcc aggacctgtg tggggaggcc ctcctgctgc
150
cttggggtga caatctcagc tccaggctac agggagaccg ggaggatcac
200
agagccagca tgttacagga tcctgacagt gatcaacctc tgaacagcct
250
cgatgtcaaa cccctgcgca aaccccgtat ccccatggag accttcagaa
300
aggtggggat ccccatcatc atagcactac tgagcctggc gagtatcatc
350
attgtggttg tcctcatcaa ggtgattctg gataaatact acttcctctg
400
cgggcagcct ctccacttca tcccgaggaa gcagctgtgt gacggagagc
450
tggactgtcc cttgggggag gacgaggagc actgtgtcaa gagcttcccc
500
gaagggcctg cagtggcagt ccgcctctcc aaggaccgat ccacactgca
550
ggtgctggac tcggccacag ggaactggtt ctctgcctgt ttcgacaact
600
tcacagaagc tctcgctgag acagcctgta ggcagatggg ctacagcaga
650
gctgtggaga ttggcccaga ccaggatctg gatgttgttg aaatcacaga
700
aaacagccag gagcttcgca tgcggaactc aagtgggccc tgtctctcag
750
gctccctggt ccccgtgtgg ctccctgcac tgggtgggga tgtcttgcct ggaggcctct gtgggaagag gtggattctt cctgaagacc ggccttggca
800 850
ggtcagcatc cagtacgaca aacagcacgt ctgtggaggg agcatcctgg
900
acccccactg ggtcctcacg gcagcccact gcttcaggaa acataccgat
950
gtgttcaact ggaaggtgcg ggcaggctca gacaaactgg gcagcttccc
1000
atccctggct gtggccaaga tcatcatcat tgaattcaac cccatgtacc
1050
733
Trp Ala Leu Tyr Tyr Asp Arg Arg Glu Pro 250 255
Met Ser Leu Lys Asn Asp Asn Ser Gin His 265 270
Val Glu Leu Leu Lys Pro Ser Leu Ser Arg 280 285
Ser Met Ala Asn Ser Phe Asn Thr His Phe
295
300
734
ccaaagacaa tgacatcgcc ctcatgaagc tgcagttccc actcactttc
1100
tcaggcacag tcaggcccat ctgtctgccc ttctttgatg aggagctcac
1150
tccagccacc ccactctgga tcattggatg gggctttacg aagcagaatg
1200
gagggaagat gtctgacata ctgctgcagg cgtcagtcca ggtcattgac
1250
agcacacggt gcaatgcaga cgatgcgtac cagggggaag tcaccgagaa
1300
gatgatgtgt gcaggcatcc cggaaggggg tgtggacacc tgccagggtg
1350
acagtggtgg gcccctgatg taccaatctg accagtggca tgtggtgggc
1400
atcgttagct ggggctatgg ctgcgggggc ccgagcaccc caggagtata
1450
caccaaggtc tcagcctatc tcaactggat ctacaatgtc tggaaggctg
1500
agctgtaatg ctgctgcccc tttgcagtgc tgggagccgc ttccttcctg
1550
ccctgcccac ctggggatcc cccaaagtca gacacagagc aagagtcccc
1600
ttgggtacac ccctctgccc acagcctcag catttcttgg agcagcaaag
1650
ggcctcaatt cctgtaagag accctcgcag cccagaggcg cccagaggaa
1700
gtcagcagcc ctagctcggc cacacttggt gctcccagca tcccagggag
1750
agacacagcc cactgaacaa ggtctcaggg gtattgctaa gccaagaagg
1800
aactttccca cactactgaa tggaagcagg ctgtcttgta aaagcccaga
1850
tcactgtggg ctggagagga gaaggaaagg gtctgcgcca gccctgtccg
1900
tcttcaccca tccccaagcc tactagagca agaaaccagt tgtaatataa
1950
aatgcactgc cctactgttg gtatgactac cgttacctac tgttgtcatt
2000
gttattacag ctatggccac tattattaaa gagctgtgta acatctctgg
2050
caaaaaaaaa aaa 2063
<210> 275 <211> 432 <212> PRT
<213> Homo sapiens <400> 275
Met Leu Gin Asp Pro Asp Ser Asp Gin Pro Leu Asn Ser Leu Asp 15 10 15
Val Lys Pro Leu Arg Lys Pro Arg Ile Pro Met Glu Thr Phe Arg 20 25 30
Lys Val Gly Ile Pro Ile Ile Ile Ala Leu Leu Ser Leu Ala Ser 35 40 45
Ile Ile Ile Val Val Val Leu Ile Lys Val Ile Leu Asp Lys Tyr 50 55 60
Tyr Phe Leu Cys Gly Gin Pro Leu His Phe Ile Pro Arg Lys Gin 65 70 75
735
Leu Cys Asp Gly Glu Leu Asp Cys Pro Leu Gly Glu Asp Glu Glu 80 85 90
His Cys Val Lys Ser Phe Pro Glu Gly Pro Ala Val Ala Val Arg 95 100 105
Leu Ser Lys Asp Arg Ser Thr Leu Gin Val Leu Asp Ser Ala Thr 110 115 120
Gly Asn Trp Phe Ser Ala Cys Phe Asp Asn Phe Thr Glu Ala Leu 125 130 135
Ala Glu Thr Ala Cys Arg Gin Met Gly Tyr Ser Arg Ala Val Glu 140 145 150
lie Gly Pro Asp Gin Asp Leu Asp Val Val Glu Ile Thr Glu Asn 155 160 165
Ser Gin Glu Leu Arg Met Arg Asn Ser Ser Gly Pro Cys Leu Ser 170 175 180
Gly Ser Leu Val Ser Leu His Cys Leu Ala Cys Gly Lys Ser Leu 185 190 195
Lys Thr Pro Arg Val Val Gly Gly Glu Glu Ala Ser Val Asp Ser 200 205 210
Trp Pro Trp Gin Val Ser Ile Gin Tyr Asp Lys Gin His Val Cys 215 220 225
Gly Gly Ser Ile Leu Asp Pro His Trp Val Leu Thr Ala Ala His 230 235 240
Cys Phe Arg Lys His Thr Asp Val Phe Asn Trp Lys Val Arg Ala 245 250 255
Gly Ser Asp Lys Leu Gly Ser Phe Pro Ser Leu Ala Val Ala Lys 260 265 270
Ile Ile Ile Ile Glu Phe Asn Pro Met Tyr Pro Lys Asp Asn Asp 275 280 285
Ile Ala Leu Met Lys Leu Gin Phe Pro Leu Thr Phe Ser Gly Thr 290 295 300
Val Arg Pro Ile Cys Leu Pro Phe Phe Asp Glu Glu Leu Thr Pro 305 310 315
Ala Thr Pro Leu Trp Ile Ile Gly Trp Gly Phe Thr Lys Gin Asn 320 325 330
Gly Gly Lys Met Ser Asp Ile Leu Leu Gin Ala Ser Val Gin Val 335 340 345
Ile Asp Ser Thr Arg Cys Asn Ala Asp Asp Ala Tyr Gin Gly Glu 350 355 360
Val Thr Glu Lys Met Met Cys Ala Gly Ile Pro Glu Gly Gly Val 365 370 375
Asp Thr Cys Gin Gly Asp Ser Gly Gly Pro Leu Met Tyr Gin Ser 380 385 390
736
Asp Gin Trp His Val Val Gly Ile Val Ser Trp Gly Tyr Gly Cys 395 400 405
Gly Gly Pro Ser Thr Pro Gly Val Tyr Thr Lys Val Ser Ala Tyr 410 415 420
Leu Asn Trp Ile Tyr Asn Val Trp Lys Ala Glu Leu 425 430
<210> 276
<211> 3143
<212> DNA
<213> Homo sapiens
<400> 276
gggctgaggc actgagagac cggaaagcct ggcattccag agggagggaa
50
acgcagcggc atccccaggc tccagagctc cctggtgaca gtctgtggct
100
gagcatggcc ctcccagccc tgggcctgga cccctggagc ctcctgggcc
150
ttttcctctt ccaactgctt cagctgctgc tgccgacgac gaccgcgggg
200
ggaggcgggc aggggcccat gcccagggtc agatactatg caggggatga
250
acgtagggca cttagcttct tccaccagaa gggcctccag gattttgaca
300
ctctgctcct gagtggtgat ggaaatactc tctacgtggg ggctcgagaa
350
gccattctgg ccttggatat ccaggatcca ggggtcccca ggctaaagaa
400
catgataccg tggccagcca gtgacagaaa aaagagtgaa tgtgccttta
450
agaagaagag caatgagaca cagtgtttca acttcatccg tgtcctggtt
500
tcttacaatg tcacccatct ctacacctgc ggcaccttcg ccttcagccc
550
tgcttgtacc ttcattgaac ttcaagattc ctacctgttg cccatctcgg
600
aggacaaggt catggaggga aaaggccaaa gcccctttga ccccgctcac
650
aagcatacgg ctgtcttggt ggatgggatg ctctattctg gtactatgaa
700
caacttcctg ggcagtgagc ccatcctgat gcgcacactg ggatcccagc
750
ctgtcctcaa gaccgacaac ttcctccgct ggctgcatca tgacgcctcc
800
tttgtggcag ccatcccttc gacccaggtc gtctacttct tcttcgagga
850
gacagccagc gagtttgact tctttgagag gctccacaca tcgcgggtgg
900
ctagagtctg caagaatgac gtgggcggcg aaaagctgct gcagaagaag
950
tggaccacct gcccttcaac tcctgaaggc gtcatccgcc ccagctgctc acgcggtcct tgcacccagc gctccccgcc cggggcagct gattctccca
1000 1050
cagctcccca catctacgca gtcttcacct cccagtggca ggttggcggg
1100
accaggagct ctgcggtttg tgccttctct ctcttggaca ttgaacgtgt
1150
ctttaagggg aaatacaaag agttgaacaa agaaacttca cgctggacta
1200
737
cttatagggg ccctgagacc aacccccggc caggcagttg ctcagtgggc
1250
ccctcctctg ataaggccct gaccttcatg aaggaccatt tcctgatgga
1300
tgagcaagtg gtggggacgc ccctgctggt gaaatctggc gtggagtata
1350
cacggcttgc agtggagaca gcccagggcc ttgatgggca cagccatctt
1400
gtcatgtacc tgggaaccac cacagggtcg ctccacaagg ctgtggtaag
1450
tggggacagc agtgctcatc tggtggaaga gattcagctg ttccctgacc
1500
ctgaacctgt tcgcaacctg cagctggccc ccacccaggg tgcagtgttt
1550
gtaggcttct caggaggtgt ctggagggtg ccccgagcca actgtagtgt
1600
ctatgagagc tgtgtggact gtgtccttgc ccgggacccc cactgtgcct
1650
gggaccctga gtcccgaacc tgttgcctcc tgtctgcccc caacctgaac
1700
tcctggaagc aggacatgga gcgggggaac ccagagtggg catgtgccag
1750
tggccccatg agcaggagcc ttcggcctca gagccgcccg caaatcatta
1800
aagaagtcct ggctgtcccc aactccatcc tggagctccc ctgcccccac
1850
ctgtcagcct tggcctctta ttattggagt catggcccag cagcagtccc
1900
agaagcctct tccactgtct acaatggctc cctcttgctg atagtgcagg
1950
atggagttgg gggtctctac cagtgctggg caactgagaa tggcttttca
2000
taccctgtga tctcctactg ggtggacagc caggaccaga ccctggccct
2050
ggatcctgaa ctggcaggca tcccccggga gcatgtgaag gtcccgttga
2100
ccagggtcag tggtggggcc gccctggctg cccagcagtc ctactggccc
2150
cactttgtca ctgtcactgt cctctttgcc ttagtgcttt caggagccct
2200
catcatcctc gtggcctccc cattgagagc actccgggct cggggcaagg
2250
ttcagggctg tgagaccctg cgccctgggg agaaggcccc gttaagcaga
2300
gagcaacacc tccagtctcc caaggaatgc aggacctctg ccagtgatgt
2350
ggacgctgac aacaactgcc taggcactga ggtagcttaa actctaggca
2400
caggccgggg ctgcggtgca ggcacctggc catgctggct gggcggccca
2450
agcacagccc tgactaggat gacagcagca caaaagacca cctttctccc
2500
ctgagaggag cttctgctac tctgcatcac tgatgacact cagcagggtg
2550
atgcacagca agctctctaa gtctgcctcc cagggtgggg cctatgggac gctaccccca tcccttctac gacctgctcc caagcacatg tacactgata
2600 2650
ttgaagaacc tggagaggat ccttcagttc tggccattcc agggaccctc
2700
cagaaacaca gtgtttcaag agaccctaaa aaacctgcct gtcccaggac
2750
738
cctatggtaa tgaacaccaa acatctaaac aatcatatgc taacatgcca
2800
ctcctggaaa ctccactctg aagctgccgc tttggacacc aacactccct
2850
tctcccaggg tcatgcaggg atctgctccc tcctgcttcc cttaccagtc
2900
gtgcaccgct gactcccagg aagtctttcc tgaagtctga ccacctttct
2950
tcttgcttca gttggggcag actctgatcc cttctgccct ggcagaatgg
3000
caggggtaat ctgagccttc ttcactcctt taccctagct gaccccttca
3050
cctctccccc tcccttttcc tttgttttgg gattcagaaa actgcttgtc
3100
agagactgtt tattttttat taaaaatata aggcttaaaa aaa 3143
<210> 277 <211> 761 <212> PRT
<213> Homo sapiens <400> 277
Met Ala Leu Pro Ala Leu Gly Leu Asp Pro Trp Ser Leu Leu Gly 15 10 15
Leu Phe Leu Phe Gin Leu Leu Gin Leu Leu Leu Pro Thr Thr Thr 20 25 30
Ala Gly Gly Gly Gly Gin Gly Pro Met Pro Arg Val Arg Tyr Tyr 35 40 45
Ala Gly Asp Glu Arg Arg Ala Leu Ser Phe Phe His Gin Lys Gly 50 55 60
Leu Gin Asp Phe Asp Thr Leu Leu Leu Ser Gly Asp Gly Asn Thr 65 70 75
Leu Tyr Val Gly Ala Arg Glu Ala Ile Leu Ala Leu Asp Ile Gin 80 85 90
Asp Pro Gly Val Pro Arg Leu Lys Asn Met Ile Pro Trp Pro Ala 95 100 105
Ser Asp Arg Lys Lys Ser Glu Cys Ala Phe Lys Lys Lys Ser Asn 110 115 120
Glu Thr Gin Cys Phe Asn Phe Ile Arg Val Leu Val Ser Tyr Asn 125 130 135
Val Thr His Leu Tyr Thr Cys Gly Thr Phe Ala Phe Ser Pro Ala 140 145 150
Cys Thr Phe Ile Glu Leu Gin Asp Ser Tyr Leu Leu Pro Ile Ser 155 160 165
Glu Asp Lys Val Met Glu Gly Lys Gly Gin Ser Pro Phe Asp Pro 170 175 180
Ala His Lys His Thr Ala Val Leu Val Asp Gly Met Leu Tyr Ser 185 190 195
Gly Thr Met Asn Asn Phe Leu Gly Ser Glu Pro Ile Leu Met Arg 200 205 210
739
Thr Leu Gly Ser Gin Pro Val Leu Lys Thr Asp Asn Phe Leu Arg 215 220 225
Trp Leu His His Asp Ala Ser Phe Val Ala Ala Ile Pro Ser Thr 230 235 240
Gin Val Val Tyr Phe Phe Phe Glu Glu Thr Ala Ser Glu Phe Asp 245 250 255
Phe Phe Glu Arg Leu His Thr Ser Arg Val Ala Arg Val Cys Lys 260 265 270
Asn Asp Val Gly Gly Glu Lys Leu Leu Gin Lys Lys Trp Thr Thr 275 280 285
Phe Leu Lys Ala Gin Leu Leu Cys Thr Gin Pro Gly Gin Leu Pro 290 295 300
Phe Asn Val Ile Arg His Ala Val Leu Leu Pro Ala Asp Ser Pro 305 310 315
Thr Ala Pro His Ile Tyr Ala Val Phe Thr Ser Gin Trp Gin Val 320 325 330
Gly Gly Thr Arg Ser Ser Ala Val Cys Ala Phe Ser Leu Leu Asp 335 340 345
Ile Glu Arg Val Phe Lys Gly Lys Tyr Lys Glu Leu Asn Lys Glu 350 355 360
Thr Ser Arg Trp Thr Thr Tyr Arg Gly Pro Glu Thr Asn Pro Arg 365 370 375
Pro Gly Ser Cys Ser Val Gly Pro Ser Ser Asp Lys Ala Leu Thr 380 385 390
Phe Met Lys Asp His Phe Leu Met Asp Glu Gin Val Val Gly Thr 395 400 405
Pro Leu Leu Val Lys Ser Gly Val Glu Tyr Thr Arg Leu Ala Val 410 415 420
Glu Thr Ala Gin Gly Leu Asp Gly His Ser His Leu Val Met Tyr 425 430 435
Leu Gly Thr Thr Thr Gly Ser Leu His Lys Ala Val Val Ser Gly 440 445 450
Asp Ser Ser Ala His Leu Val Glu Glu Ile Gin Leu Phe Pro Asp 455 460 465
Pro Glu Pro Val Arg Asn Leu Gin Leu Ala Pro Thr Gin Gly Ala 470 475 480
Val Phe Val Gly Phe Ser Gly Gly Val Trp Arg Val Pro Arg Ala 485 490 495
Asn Cys Ser Val Tyr Glu Ser Cys Val Asp Cys Val Leu Ala Arg 500 505 510
Asp Pro His Cys Ala Trp Asp Pro Glu Ser Arg Thr Cys Cys Leu
515
740
520
525
Leu Ser Ala Pro Asn Leu Asn Ser Trp Lys Gin Asp Met Glu Arg 530 535 540
Gly Asn Pro Glu Trp Ala Cys Ala Ser Gly Pro Met Ser Arg Ser 545 550 555
Leu Arg Pro Gin Ser Arg Pro Gin Ile Ile Lys Glu Val Leu Ala 560 565 570
Val Pro Asn Ser Ile Leu Glu Leu Pro Cys Pro His Leu Ser Ala 575 580 585
Leu Ala Ser Tyr Tyr Trp Ser His Gly Pro Ala Ala Val Pro Glu 590 595 600
Ala Ser Ser Thr Val Tyr Asn Gly Ser Leu Leu Leu Ile Val Gin 605 610 615
Asp Gly Val Gly Gly Leu Tyr Gin Cys Trp Ala Thr Glu Asn Gly 620 625 630
Phe Ser Tyr Pro Val Ile Ser Tyr Trp Val Asp Ser Gin Asp Gin 635 640 645
Thr Leu Ala Leu Asp Pro Glu Leu Ala Gly Ile Pro Arg Glu His 650 655 660
Val Lys Val Pro Leu Thr Arg Val Ser Gly Gly Ala Ala Leu Ala 665 670 675
Ala Gin Gin Ser Tyr Trp Pro His Phe Val Thr Val Thr Val Leu 680 685 690
Phe Ala Leu Val Leu Ser Gly Ala Leu Ile Ile Leu Val Ala Ser 695 700 705
Pro Leu Arg Ala Leu Arg Ala Arg Gly Lys Val Gin Gly Cys Glu 710 715 720
Thr Leu Arg Pro Gly Glu Lys Ala Pro Leu Ser Arg Glu Gin His 725 730 735
Leu Gin Ser Pro Lys Glu Cys Arg Thr Ser Ala Ser Asp Val Asp 740 745 750
Ala Asp Asn Asn Cys Leu Gly Thr Glu Val Ala 755 760
<210> 278 <211> 24 <212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 278 ctgctggtga aatctggcgt ggag 24
<210> 279
741
<211> 24 "
<212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 279 gtctggtcct ggctgtccac ccag 24
<210> 280 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 280
catcttgtca tgtacctggg aaccaccaca gggtcgctcc acaag 45
<210> 281
<211> 2320
<212> DNA
<213> Homo sapiens
<400> 281
agggtccctt agccgggcgc agggcgcgca gcccaggctg agatccgcgg
50
cttccgtaga agtgagcatg gctgggcagc gagtgcttct tctagtgggc
100
ttccttctcc ctggggtcct gctctcagag gctgccaaaa tcctgacaat
150
atctacagta ggtggaagcc attatctact gatggaccgg gtttctcaga
200
ttcttcaaga tcacggtcat aatgtcacca tgcttaacca caaaagaggt
250
ccttttatgc cagattttaa aaaggaagaa aaatcatatc aagttatcag
300
ttggcttgca cctgaagatc atcaaagaga atttaaaaag agttttgatt
350
tctttctgga agaaacttta ggtggcagag gaaaatttga aaacttatta
400
aatgttctag aatacttggc gttgcagtgc agtcattttt taaatagaaa
450
ggatatcatg gattccttaa agaatgagaa cttcgacatg gtgatagttg
500
aaacttttga ctactgtcct ttcctgattg ctgagaagct tgggaagcca
550
tttgtggcca ttctttccac ttcattcggc tctttggaat ttgggctacc
600
aatccccttg tcttatgttc cagtattccg ttccttgctg actgatcaca
650
tggacttctg gggccgagtg aagaattttc tgatgttctt tagtttctgc
700
aggaggcaac agcacatgca gtctacattt gacaacacca tcaaggaaca
750
tttcacagaa ggctctaggc cagttttgtc tcatcttcta ctgaaagcag
800
agttgtggtt cttcccaaca cattaactct ctgtttatgt gactttgcct tggaggcttg ttgattttgc atggaaaaac tcgacctctg ctattaaacc
850 900
742
agtaccacaa gacttggaga acttcattgc caagtttggg gactctggtt
950
ttgtccttgt gaccttgggc tccatggtga acacctgtca gaatccggaa
1000
atcttcaagg agatgaacaa tgcctttgct cacctacccc aaggggtgat
1050
atggaagtgt cagtgttctc attggcccaa agatgtccac ctggctgcaa
1100
atgtgaaaat tgtggactgg cttcctcaga gtgacctcct ggctcaccca
1150
agcatccgtc tgtttgtcac ccacggcggg cagaatagca taatggaggc
1200
catccagcat ggtgtgccca tggtggggat ccctctcttt ggagaccagc
1250
ctgaaaacat ggtccgagta gaagccaaaa agtttggtgt ttctattcag
1300
ttaaagaagc tcaaggcaga gacattggct cttaagatga aacaaatcat
1350
ggaagacaag agatacaagt ccgcggcagt ggctgccagt gtcatcctgc
1400
gctcccaccc gctcagcccc acacagcggc tggtgggctg gattgaccac
1450
gtcctccaga cagggggcgc gacgcacctc aagccctatg tctttcagca
1500
gccctggcat gagcagtacc tgttcgacgt ttttgtgttt ctgctggggc
1550
tcactctggg gactctatgg ctttgtggga agctgctggg catggctgtc
1600
tggtggctgc gtggggccag aaaggtgaag gagacataag gccaggtgca
1650
gccttggcgg ggtctgtttg gtgggcgatg tcaccatttc tagggagctt
1700
cccactagtt ctggcagccc cattctctag tccttctagt tatctcctgt
1750
tttcttgaag aacaggaaaa atggccaaaa atcatccttt ccacttgcta
1800
attttgctac aaattcatcc ttactagctc ctgcctgcta gcagaaatct
1850
ttccagtcct cttgtcctcc tttgtttgcc atcagcaagg gctatgctgt
1900
gattctgtct ctgagtgact tggaccactg accctcagat ttccagcctt
1950
aaaatccacc ttccttctca tgcgcctctc cgaatcacac cctgactctt
2000
ccagcctcca tgtccagacc tagtcagcct ctctcactcc tgcccctact
2050
atctatcatg gaataacatc caagaaagac accttgcata ttctttcagt
2100
ttctgttttg ttctcccaca tattctcttc aatgctcagg aagcctgccc
2150
tgtgcttgag agttcagggc cggacacagg ctcacaggtc tccacattgg
2200
gtccctgtct ctggtgccca cagtgagctc cttcttggct gagcaggcat
2250
ggagactgta ggtttccaga tttcctgaaa aataaaagtt tacagcgtta
2300
tctctcccca acctcactaa
2320
<210> 282 <211> 523 <212> PRT
<213> Homo sapiens
743
<400> 282
Met Ala Gly Gin Arg Val Leu Leu Leu Val Gly Phe Leu Leu Pro 15 10 15
Gly Val Leu Leu Ser Glu Ala Ala Lys Ile Leu Thr Ile Ser Thr 20 25 30
Val Gly Gly Ser His Tyr Leu Leu Met Asp Arg Val Ser Gin Ile 35 40 45
Leu Gin Asp His Gly His Asn Val Thr Met Leu Asn His Lys Arg 50 55 60
Gly Pro Phe Met Pro Asp Phe Lys Lys Glu Glu Lys Ser Tyr Gin 65 70 75
Val Ile Ser Trp Leu Ala Pro Glu Asp His Gin Arg Glu Phe Lys 80 85 90
Lys Ser Phe Asp Phe Phe Leu Glu Glu Thr Leu Gly Gly Arg Gly 95 100 105
Lys Phe Glu Asn Leu Leu Asn Val Leu Glu Tyr Leu Ala Leu Gin 110 115 120
Cys Ser His Phe Leu Asn Arg Lys Asp Ile Met Asp Ser Leu Lys 125 130 135
Asn Glu Asn Phe Asp Met Val Ile Val Glu Thr Phe Asp Tyr Cys 140 145 150
Pro Phe Leu Ile Ala Glu Lys Leu Gly Lys Pro Phe Val Ala Ile 155 160 165
Leu Ser Thr Ser Phe Gly Ser Leu Glu Phe Gly Leu Pro Ile Pro 170 175 180
Leu Ser Tyr Val Pro Val Phe Arg Ser Leu Leu Thr Asp His Met 185 190 195
Asp Phe Trp Gly Arg Val Lys Asn Phe Leu Met Phe Phe Ser Phe 200 205 210
Cys Arg Arg Gin Gin His Met Gin Ser Thr Phe Asp Asn Thr Ile 215 220 225
Lys Glu His Phe Thr Glu Gly Ser Arg Pro Val Leu Ser His Leu 230 235 240
Leu Leu Lys Ala Glu Leu Trp Phe Ile Asn Ser Asp Phe Ala Phe 245 250 255
Asp Phe Ala Arg Pro Leu Leu Pro Asn Thr Val Tyr Val Gly Gly 260 265 270
Leu Met Glu Lys Pro Ile Lys Pro Val Pro Gin Asp Leu Glu Asn 275 280 285
Phe Ile Ala Lys Phe Gly Asp Ser Gly Phe Val Leu Val Thr Leu 290 295 300
744
Gly Ser Met Val Asn Thr Cys Gin Asn Pro Glu Ile Phe Lys Glu 305 310 315
Met Asn Asn Ala Phe Ala His Leu Pro Gin Gly Val Ile Trp Lys 320 325 330
Cys Gin Cys Ser His Trp Pro Lys Asp Val His Leu Ala Ala Asn 335 340 345
Val Lys Ile Val Asp Trp Leu Pro Gin Ser Asp Leu Leu Ala His 350 355 360
Pro Ser Ile Arg Leu Phe Val Thr His Gly Gly Gin Asn Ser Ile 365 370 375
Met Glu Ala Ile Gin His Gly Val Pro Met Val Gly Ile Pro Leu 380 385 390
Phe Gly Asp Gin Pro Glu Asn Met Val Arg Val Glu Ala Lys Lys 395 400 405
Phe Gly Val Ser Ile Gin Leu Lys Lys Leu Lys Ala Glu Thr Leu 410 415 420
Ala Leu Lys Met Lys Gin Ile Met Glu Asp Lys Arg Tyr Lys Ser 425 430 435
Ala Ala Val Ala Ala Ser Val Ile Leu Arg Ser His Pro Leu Ser 440 445 450
Pro Thr Gin Arg Leu Val Gly Trp Ile Asp His Val Leu Gin Thr 455 460 465
Gly Gly Ala Thr His Leu Lys Pro Tyr Val Phe Gin Gin Pro Trp 470 475 480
His Glu Gin Tyr Leu Phe Asp Val Phe Val Phe Leu Leu Gly Leu 485 490 495
Thr Leu Gly Thr Leu Trp Leu Cys Gly Lys Leu Leu Gly Met Ala 500 505 510
Val Trp Trp Leu Arg Gly Ala Arg Lys Val Lys Glu Thr 515 520
<210> 283
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 283
tgcctttgct cacctacccc aagg 24
<210> 284
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
745
<223> Synthetic oligonucleotide probe <400> 284 tcaggctggt ctccaaagag aggg 24
<210> 285 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 285
cccaaagatg tccacctggc tgcaaatgtg aaaattgtgg actgg 45
<210> 286
<211> 2340
<212> DNA
<213> Homo sapiens
<400> 286
gggctgttga tttgtggggg attttgaaga gaggaggaat aggaggaagg
50
ggttgagggg ctgcctctgg catatgcaca cactcacaca ttctgtcaca
100
cccgtcacac acacatacca tgttctccat ccccccaggt ccagccctca
150
gtgctgtccc atccagcagg gctaccctga agctctggct gcagccctcc
200
cgtccagtgg gcaggcggct tcatccctcc tttctctccc aaagcccaac
250
tgctgtcact gcatgctctg ccaaggagga gggaactgca gtgacagcag
300
gagtaagagt gggaggcagg acagagctgg gacacaggta tggagagggg
350
gttcagcgag cctagagagg gcagactatc agggtgccgg cggtgagaat
400
ccagggagag gagcggaaac agaagagggg cagaagaccg gggcacttgt
450
gggttgcaga gcccctcagc catgttggga gccaagccac actggctacc
500
aggtccccta cacagtcccg ggctgccctt ggttctggtg cttctggccc
550
tgggggccgg gtgggcccag gaggggtcag agcccgtcct gctggagggg
600
gagtgcctgg tggtctgtga gcctggccga gctgctgcag gggggcccgg
650
gggagcagcc ctgggagagg caccccctgg gcgagtggca tttgctgcgg
700
tccgaagcca ccaccatgag ccagcagggg aaaccggcaa tggcaccagt
750
ggggccatct acttcgacca ggtcctggtg aacgagggcg gtggctttga
800
ccgggcctct ggctccttcg tagcccctgt ccggggtgtc tacagcttcc
850
ggttccatgt ggtgaaggtg tacaaccgcc aaactgtcca ggtgagcctg
900
atgctgaaca cgtggcctgt catctcagcc tttgccaatg atcctgacgt
950
gacccgggag gcagccacca gctctgtgct actgcccttg gaccctgggg
1000
accgagtgtc tctgcgcctg cgtcggggga atctactggg tggttggaaa
1050
746
tactcaagtt tttcaagcac tctctggctt aagaatccag cctcatcttc cccctgacaa cctctctgag ctttcttctg gacccaagtc ccctctcttg
1100 1150
ccccagaaac agcagaggca ggagagagac tccctctggc tcctatccca
1200
cctctttgca tgggaccctg tgccaaacac ccaagtttaa gagaagagta
1250
gagctgtggc atctccagac caggcctttc cacccaccca cccccagtta
1300
ccctcccagc cacctgctgc atctgttcct gcctgcagcc ctaggatcag
1350
ggcaaggttt ggcaagaagg aagatctgca ctactttgcg gcctctgctc
1400
ctccggttcc cccaccccag cttcctgctc aatgctgatc agggacaggt
1450
ggcgcaggtg agcctgacag gcccccacag gagcccagat ggacaagcct
1500
cagcgtaccc tgcaggcttc ttcctgtgag gaaagccagc atcacggatc
1550
tcagccagca ccgtcagaag ctgagccagc accgtatggg ctagggtggg
1600
aggctcagcc acaggcagaa gggtgggaag ggcctggagt ctgtggctgg
1650
tgaggaagga aggagggtgt attgtctaga ctgaacatgg tacacattct
1700
gcatgtatag cagagcagcc agcaggtagc aatcctggct gtccttctat
1750
gctggatccc agatggactc tggcccttac ctccccacct gagattaggg
1800
tgagtgtgtt tgctctggct gagagcagag ctgagagcag gtatacagag
1850
ctggaagtgg accatggaaa acatcgataa ccatgcatcc tcttgcttgg
1900
ccacctcctg aaactgctcc acctttgaag tttgaacttt agtccctcca
1950
cactctgact gctgcctcct tcctcccagc tctctcactg agttatcttc
2000
actgtacctg ttccagcata tccccactat ctctctttct cctgatctgt
2050
gctgtcttat tctcctcctt aggcttccta ttacctggga ttccatgatt
2100
cattccttca gaccctctcc tgccagtatg ctaaaccctc cctctctctt
2150
tcttatcccg ctgtcccatt ggcccagcct ggatgaatct atcaataaaa
2200
caactagaga atggtggtca gtgagacact atagaattac taaggagaag
2250
atgcctctgg agtttggatc gggtgttaca ggtacaagta ggtatgttgc
2300
agaggaaaat aaatatcaaa ctgtatacta aaattaaaaa
2340
<210> 287 <211> 205 <212> PRT
<213> Homo sapiens <400> 287
Met Leu Gly Ala Lys Pro His Trp Leu Pro Gly Pro Leu His Ser 15 10 15
Pro Gly Leu Pro Leu Val Leu Val Leu Leu Ala Leu Gly Ala Gly
747
25 30
Trp Ala Gin Glu Gly Ser Glu Pro Val Leu Leu Glu Gly Glu Cys 35 40 45
Leu Val Val Cys Glu Pro Gly Arg Ala Ala Ala Gly Gly Pro Gly 50 55 60
Gly Ala Ala Leu Gly Glu Ala Pro Pro Gly Arg Val Ala Phe Ala 65 70 75
Ala Val Arg Ser His His His Glu Pro Ala Gly Glu Thr Gly Asn 80 85 90
Gly Thr Ser Gly Ala Ile Tyr Phe Asp Gin Val Leu Val Asn Glu 95 100 105
Gly Gly Gly Phe Asp Arg Ala Ser Gly Ser Phe Val Ala Pro Val 110 115 120
Arg Gly Val Tyr Ser Phe Arg Phe His Val Val Lys Val Tyr Asn 125 130 135
Arg Gin Thr Val Gin Val Ser Leu Met Leu Asn Thr Trp Pro Val 140 145 150
Ile Ser Ala Phe Ala Asn Asp Pro Asp Val Thr Arg Glu Ala Ala 155 160 165
Thr Ser Ser Val Leu Leu Pro Leu Asp Pro Gly Asp Arg Val Ser 170 175 180
Leu Arg Leu Arg Arg Gly Asn Leu Leu Gly Gly Trp Lys Tyr Ser 185 190 195
Ser Phe Ser Gly Phe Leu Ile Phe Pro Leu 200 205
<210> 288 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 288 aggcagccac cagctctgtg ctac 24
<210> 289 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 289 cagagaggga agatgaggaa gccagag 27
<210> 290 <211> 42
748
<212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 290
ctgtgctact gcccttggac cctggggacc gagtgtctct gc 42
<210> 291 <211> 1570 <212> DNA <213> Homo sapiens
<400> 291
gctgtttctc tcgcgccacc actggccgcc ggccgcagct ccaggtgtcc
50
tagccgccca gcctcgacgc cgtcccggga cccctgtgct ctgcgcgaag
100
ccctggcccc gggggccggg gcatgggcca ggggcgcggg gtgaagcggc
150
ttcccgcggg gccgtgactg ggcgggcttc agccatgaag accctcatag
200
ccgcctactc cggggtcctg cgcggcgagc gtcaggccga ggctgaccgg
250
agccagcgct ctcacggagg acctgcgctg tcgcgcgagg ggtctgggag
300
atggggcact ggatccagca tcctctccgc cptccaggac ctcttctctg
350
tcacctggct caataggtcc aaggtggaaa agcagctaca ggtcatctca
400
gtgctccagt gggtcctgtc cttccttgta ctgggagtgg cctgcagtgc
450
catcctcatg tacatattct gcactgattg ctggctcatc gctgtgctct
500
acttcacttg gctggtgttt gactggaaca cacccaagaa aggtggcagg
550
aggtcacagt gggtccgaaa ctgggctgtg tggcgctact ttcgagacta
600
ctttcccatc cagctggtga agacacacaa cctgctgacc accaggaact
650
atatctttgg ataccacccc catggtatca tgggcctggg tgccttctgc
700
aacttcagca cagaggccac agaagtgagc aagaagttcc caggcatacg
750
gccttacctg gctacactgg caggcaactt ccgaatgcct gtgttgaggg
800
agtacctgat gtctggaggt atctgccctg tcagccggga caccatagac
850
tatttgcttt caaagaatgg gagtggcaat gctatcatca tcgtggtcgg
900
gggtgcggct gagtctctga gctccatgcc tggcaagaat gcagtcaccc
950
tgcggaaccg caagggcttt gtgaaactgg ccctgcgtca tggagctgac
1000
ctggttccca tctactcctt tggagagaat gaagtgtaca agcaggtgat
1050
cttcgaggag ggctcctggg gccgatgggt ccagaagaag ttccagaaat
1100
acattggttt cgccccatgc atcttccatg gtcgaggcct cttctcctcc
1150
gacacctggg ggctggtgcc ctactccaag cccatcacca ctgttgtggg
1200
749
agagcccatc accatcccca agctggagca cccaacccag caagacatcg 1250
acctgtacca caccatgtac atggaggccc tggtgaagct cttcgacaag 13 00
cacaagacca agttcggcct cccggagact gaggtcctgg aggtgaactg 13 50
agccagcctt cggggccaat tccctggagg aaccagctgc aaatcacttt 1400
tttgctctgt aaatttggaa gtgtcatggg tgtctgtggg ttatttaaaa 1450
gaaattataa caattttgct aaaccaaaaa aaaaaaaaaa aaaaaaaaaa 1500
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1550
aaaaaaaaaa aaaaaaaaaa 157 0
<210> 292 <211> 388 <212> PRT <213> Homo sapiens
<400> 292
Met Lys Thr Leu Ile Ala Ala Tyr Ser Gly Val Leu Arg Gly Glu 15 10 15
Arg Gin Ala Glu Ala Asp Arg Ser Gin Arg Ser His Gly Gly Pro 20 25 30
Ala Leu Ser Arg Glu Gly Ser Gly Arg Trp Gly Thr Gly Ser Ser 35 40 45
Ile Leu Ser Ala Leu Gin Asp Leu Phe Ser Val Thr Trp Leu Asn 50 55 60
Arg Ser Lys Val Glu Lys Gin Leu Gin Val Ile Ser Val Leu Gin 65 70 75
Trp Val Leu Ser Phe Leu Val Leu Gly Val Ala Cys Ser Ala Ile 80 85 90
Leu Met Tyr Ile Phe Cys Thr Asp Cys Trp Leu Ile Ala Val Leu 95 100 105
Tyr Phe Thr Trp Leu Val Phe Asp Trp Asn Thr E?ro Lys Lys Gly 110 115 120
Gly Arg Arg Ser Gin Trp Val Arg Asn Trp Ala Val Trp Arg Tyr 125 130 135
Phe Arg Asp Tyr Phe Pro Ile Gin Leu Val Lys Thr His Asn Leu 140 145 150
Leu Thr Thr Arg Asn Tyr Ile Phe Gly Tyr His Pro His Gly Ile 155 160 165
Met Gly Leu Gly Ala Phe Cys Asn Phe Ser Thr Glu Ala Thr Glu 170 175 180
Val Ser Lys Lys Phe Pro Gly Ile Arg Pro Tyr Leu Ala Thr Leu 185 190 195
750
Ala Gly Asn Phe Arg Met Pro Val Leu Arg Glu Tyr Leu Met Ser 200 205 210
Gly Gly Ile Cys Pro Val Ser Arg Asp Thr Ile Asp Tyr Leu Leu 215 220 225
Ser Lys Asn Gly Ser Gly Asn Ala Ile Ile Ile Val Val Gly Gly 230 235 240
Ala Ala Glu Ser Leu Ser Ser Met Pro Gly Lys Asn Ala Val Thr 245 250 255
Leu Arg Asn Arg Lys Gly Phe Val Lys Leu Ala Leu Arg His Gly 260 265 270
Ala Asp Leu Val Pro Ile Tyr Ser Phe Gly Glu Asn Glu Val Tyr 275 280 285
Lys Gin Val Ile Phe Glu Glu Gly Ser Trp Gly Arg Trp Val Gin 290 295 300
Lys Lys Phe Gin Lys Tyr Ile Gly Phe Ala Pro Cys Ile Phe His 305 310 315
Gly Arg Gly Leu Phe Ser Ser Asp Thr Trp Gly Leu Val Pro Tyr 320 325 330
Ser Lys Pro Ile Thr Thr Val Val Gly Glu Pro Ile Thr Ile Pro 335 340 345
Lys Leu Glu His Pro Thr Gin Gin Asp Ile Asp Leu Tyr His Thr 350 355 360
Met Tyr Met Glu Ala Leu Val Lys Leu Phe Asp Lys His Lys Thr 365 370 375
Lys Phe Gly Leu Pro Glu Thr Glu Val Leu Glu Val Asn 380 385
<210> 293
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 293
gctgacctgg ttcccatcta ctcc 24
<210> 294
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 294
cccacagaca cccatgacac ttcc 24
<210> 295
751
<211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 295
aagaatgaat tgtacaaagc aggtgatctt cgaggagggc tcctggggcc 50
<210> 296
<211> 3060
<212> DNA
<213> Homo sapiens
<400> 296
gggcggcggg atgggggccg ggggcggcgg gcgccgcact cgctgaggcc
50
ccgacgcagg gccgggccgg gcccagggcc gaggagcgcg gcggccagag
100
cggggccgcg gaggcgacgc cggggacgcc cgcgcgacga gcaggtggcg
150
gcggctgcag gcttgtccag ccggaagccc tgagggcagc tgttcccact
200
ggctctgctg accttgtgcc ttggacggct gtcctcagcg aggggccgtg
250
cacccgctcc tgagcagcgc catgggcctg ctggccttcc tgaagaccca
300
gttcgtgctg cacctgctgg tcggctttgt cttcgtggtg agtggtctgg
350
tcatcaactt cgtccagctg tgcacgctgg cgctctggcc ggtcagcaag
400
cagctctacc gccgcctcaa ctgccgcctc gcctactcac tctggagcca
450
actggtcatg ctgctggagt ggtggtcctg cacggagtgt acactgttca
500
cggaccaggc cacggtagag cgctttggga aggagcacgc agtcatcatc
550
ctcaaccaca acttcgagat cgacttcctc tgtgggtgga ccatgtgtga
600
gcgcttcgga gtgctgggga gctccaaggt cctcgctaag aaggagctgc
650
tctacgtgcc cctcatcggc tggacgtggt actttctgga gattgtgttc
700
tgcaagcgga agtgggagga ggaccgggac accgtggtcg aagggctgag
750
gcgcctgtcg gactaccccg agtacatgtg gtttctcctg tactgcgagg
800
ggacgcgctt cacggagacc aagcaccgcg ttagcatgga ggtggcggct
850
gctaaggggc ttcctgtcct caagtaccac ctgctgccgc ggaccaaggg
900
cttcaccacc gcagtcaagt gcctccgggg gacagtcgca gctgtctatg
950
atgtaaccct gaacttcaga ggaaacaaga acccgtccct gctggggatc
1000
ctctacggga agaagtacga ggcggacatg tgcgtgagga gatttcctct
1050
ggaagacatc ccgctggatg aaaaggaagc agctcagtgg cttcataaac
1100
tgtaccagga gaaggacgcg ctccaggaga tatataatca gaagggcatg
1150
752
tttccagggg agcagtttaa gcctgcccgg aggccgtgga ccctcctgaa
1200
cttcctgtcc tgggccacca ttctcctgtc tcccctcttc agttttgtct
1250
tgggcgtctt tgccagcgga tcacctctcc tgatcctgac tttcttgggg
1300
tttgtgggag cagcttcctt tggagttcgc agactgatag gagaatcgct
1350
tgaacctggg aggtggagat tgcagtgagc tgagatggca tcactgtact
1400
ccagcctagg caacagagca agactcagtc tcaaaaaaaa aaaaaaacaa
1450
aaaaacccca gaaattctgg agttgaactg tgtagttact gacatgaaaa
1500
attcactaga ggctgaacag cagatttgag caggcagaaa aaaatcagca
1550
agcttgaaga tggtaccttg agatttttca ggctaatgaa aaaagaatga
1600
aggaaaatta acagcctcag agacccatgg tgcaccgtca cacaaatcaa
1650
catatgcatg atgagagtcc cagaaggaga ggagagaaag ggtcagaaag
1700
aatggccaca agctgatgaa aaacagtaac ctacccactc aggaagctca
1750
gtgaactcca atgaggatga atatcagaga tccacaccta gatatttcat
1800
aatcaaagtg tcaaatgaca aagaatcttg aaagcagcaa gagatgagca
1850
acttatcttg ttcaaaggat ctttgatcag attaacagct catttctcct
1900
cagaaatcat gggagccagg agatagtggg atgaacactg ttgaaggcaa
1950
aaccttcaac tgtaattatt ggacttttga gtcttagatg gtcctgacct
2000
ctttgtcttc agggacagtt tttcaattta atccctaata acaattagtc
2050
aagcttcctt gacctgtagg aaggcctgtc tttaggccgg gcacagtggc
2100
ttacacctgt aatcccagca ctttgggagg cccagacggg tggatcattt
2150
ggggtcaggc tgatctcaaa ctcctgagtt caggtgatct gcccgcctca
2200
gcctcccaaa gtgttgtgat tgcaggcgtg agccactgcg cctggccgga
2250
atttcttttt aaggctgaat gatgggggcc aggcacgatg gctcacgcct
2300
gtgatcccaa gtagcttgga ttgtaaacat gcaccaccat gcctggctaa
2350
tttttgtatt tttagtagag acgtgttagc caggctggtc tcgatctcct
2400
gacctcaagt gaccacctgc ctcagcctcc caaagtactg ggattacagg
2450
cgtgagccac tgtgcctggc cttgagcatc ttgtgatgtg cttattggcc
2500
atttgtatat cttctatctt ctttggggaa atgtctgttc aagtcctttg
2550
cctttttaaa tttttattat ttatttattt atttattttg agacagggtc
2600
ttgttctgtt gcccaggctg gagtacagtg gcacagtctt ggctcactgc
2650
agcctcgacc tcctgggctg cagtgatcct cccacctcag cctcccttgt
2700
753
agctgtattt ttttgtattt tgtattttgt agctgtagtt tttgtatttt 2750
ttgtggagac agcatttcac catgatgccc aggctggtct tgaactcctg 2800
agctcaagtg atctgcctgc ttcagcctcc caaagtgctg ggattacaga 2850
catgagccac tgcacctggc aaactcccaa aattcaacac acacacacaa 2900
aaaaccacct gattcaaaat gggcagaggg gccgggtgtg gccccaacta 2950 ccagggagac tgaagtggga ggatcgcttg ggcatgagaa gtcgaggctg 3000
cagtgagtcg aggttgtgcg actgcattcc agcctggaca acagagtgag 3050
accctgtctc 3060
<210> 297 <211> 368 <212> PRT <213> Homo sapiens
<400> 297
Met Gly Leu Leu Ala Phe Leu Lys Thr Gin Phe Val Leu His Leu 15 10 15
Leu Val Gly Phe Val Phe Val Val Ser Gly Leu Val Ile Asn Phe 20 25 30
Val Gin Leu Cys Thr Leu Ala Leu Trp Pro Val Ser Lys Gin Leu 35 40 45
Tyr Arg Arg Leu Asn Cys Arg Leu Ala Tyr Ser Leu Trp Ser Gin 50 55 60
Leu Val Met Leu Leu Glu Trp Trp Ser Cys Thr Glu Cys Thr Leu 65 70 75
Phe Thr Asp Gin Ala Thr Val Glu Arg Phe Gly Lys Glu His Ala 80 85 90
Val Ile Ile Leu Asn His Asn Phe Glu Ile Asp Phe Leu Cys Gly 95 100 105
Trp Thr Met Cys Glu Arg Phe Gly Val Leu Gly Ser Ser Lys Val 110 115 120
Leu Ala Lys Lys Glu Leu Leu Tyr Val Pro Leu Ile Gly Trp Thr 125 130 135
Trp Tyr Phe Leu Glu Ile Val Phe Cys Lys Arg Lys Trp Glu Glu 140 145 150
Asp Arg Asp Thr Val Val Glu Gly Leu Arg Arg Leu Ser Asp Tyr 155 160 165
Pro Glu Tyr Met Trp Phe Leu Leu Tyr Cys Glu Gly Thr Arg Phe 170 175 180
Thr Glu Thr Lys His Arg Val Ser Met Glu Val Ala Ala Ala Lys 185 190 195
Gly Leu Pro Val Leu Lys Tyr His Leu Leu Pro Arg Thr Lys Gly 200 205 210
754
Phe Thr Thr Ala Val Lys Cys Leu Arg Gly Thr Val Ala Ala Val 215 220 225
Tyr Asp Val Thr Leu Asn Phe Arg Gly Asn Lys Asn Pro Ser Leu 230 235 240
Leu Gly Ile Leu Tyr Gly Lys Lys Tyr Glu Ala Asp Met Cys Val 245 250 255
Arg Arg Phe Pro Leu Glu Asp Ile Pro Leu Asp Glu Lys Glu Ala 260 265 270
Ala Gin Trp Leu His Lys Leu Tyr Gin Glu Lys Asp Ala Leu Gin 275 280 285
Glu Ile Tyr Asn Gin Lys Gly Met Phe Pro Gly Glu Gin Phe Lys 290 295 300
Pro Ala Arg Arg Pro Trp Thr Leu Leu Asn Phe Leu Ser Trp Ala 305 310 315
Thr Ile Leu Leu Ser Pro Leu Phe Ser Phe Val Leu Gly Val Phe 320 325 330
Ala Ser Gly Ser Pro Leu Leu Ile Leu Thr Phe Leu Gly Phe Val 335 340 345
Gly Ala Ala Ser Phe Gly Val Arg Arg Leu Ile Gly Glu Ser Leu 350 355 360
Glu Pro Gly Arg Trp Arg Leu Gin 365
<210> 298 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 298 cttcctctgt gggtggacca tgtg 24
<210> 299 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 299 gccacctcca tgctaacgcg g 21
<210> 300 <211> 45 <212> DNA
<213> Artificial Sequence <220>
755
<223> Synthetic oligonucleotide probe <400> 300
ccaaggtcct cgctaagaag gagctgctct acgtgcccct catcg 45
<210> 301 <211> 1334 <212> DNA <213> Homo sapiens
<400> 301
gatattcttt atttttaaga atctgaagta ctatgcatca ctccctccaa
50
tgtcctgggg cagccaccag gcatattcat ctttgtgtgt gtttttcttt
100
tgctttagca ctggggcact tcttgcttat ttctttggta ggaaaggggc
150
tcagtttgtc ttgtggggtt ggtggcaggc aggccggctt acgcctgata
200
cggccctggg ttagaaggga agggaagata aacttttata caaatgggga
250
tagctggggt ctgagacctg cttcctcagt aaaattcctg ggatctgcct
300
ataccttctt ttctctaacc tggcataccc tgcttaaagc ctctcagggc
350
ttctctctgt tcttaggatc aaagtattta gagctacaag agccctcatg
400
gtctggcccc tgcccccctg gccagcttca ttgtacatgt ggtgttctct
450
tgtcgttcct gtaatgtggt atgccatggg gtctttgcac aagcctttcc
500
tctttggctg gacactgttc cctgcccccc ccatactctt cctacttaat
550
atgtagtcat cctgcagatt tcaattctaa catcattttc tccagggatc
600
ctggcctgac agaatctcat cttgtttaat gctctcataa gaccacttgt
650
ttcccttttg cagcacttgc cactcagttg tatctttatg tgcgtttgtg
700
gttgtatggg ttgtgtctgt tccccagaat gcccagctct gagctgcgtg
750
agggtcaagg gcattgctgt gcctgccagg tatagtgcct acatgtggtg
800
ggtgctcatg ttttagagac taaatggagg aggagatgag gaaaagattg
850
aaatctctca gttcaccaga tggtgtaggg cccagcattg taaattcaca
900
cgttgactgt gcttgtgaat tatctgggga tgcaggtcct gattcagtag
950
gcccaggttg ggcatctcta acaaactccc acgtgatgct gatgctggtc
1000
ctatgaacta tactaaatag taagaatcta tggagccagg ctgggcatgg
1050
tggctcacac ctatgatccc agcactttgg gaggctgagg caggctgatc
1100
acctggagtc aggatttcaa gactagcctg gccaacatgg tggaacccca
1150
tctgtactaa aaatacacaa attagctggg catggtggca catgcctgta
1200
gtcccagcta cttgggaggc tgaagcaaga gaatcgcttg aacctgggag
1250
gcggaggttg cagtgagccg agatcaggcc actgtattcc aaccagggtg
1300
756
acagagtgag actctatgtc caaaaaaaaa aaaa 1334
<210> 302 <211> 143 <212> PRT <213> Homo sapiens
<400> 302
Met His His Ser Leu Gin Cys Pro Gly Ala Ala Thr Arg His Ile 15 10 15
His Leu Cys Val Cys Phe Ser Phe Ala Leu Ala Leu Gly His Phe 20 25 30
Leu Leu Ile Ser Leu Val Gly Lys Gly Leu Ser Leu Ser Cys Gly 35 40 45
Val Gly Gly Arg Gin Ala Gly Leu Arg Leu Ile Arg Pro Trp Val 50 55 60
Arg Arg Glu Gly Lys Ile Asn Phe Tyr Thr Asn Gly Asp Ser Trp 65 70 75
Gly Leu Arg Pro Ala Ser Ser Val Lys Phe Leu Gly Ser Ala Tyr 80 85 90
Thr Phe Phe Ser Leu Thr Trp His Thr Leu Leu Lys Ala Ser Gin 95 100 105
Gly Phe Ser Leu Phe Leu Gly Ser Lys Tyr Leu Glu Leu Gin Glu 110 115 120
Pro Ser Trp Ser Gly Pro Cys Pro Pro Gly Gin Leu His Cys Thr 125 130 135
Cys Gly Val Leu Leu Ser Phe Leu 140
<210> 303
<211> 1768
<212> DNA
<213> Homo sapiens
<400> 303
ggctggactg gaactcctgg tcccaagtga tccacccgcc tcagcctccc
50
aaggtgctgt gattataggt gtaagccacc gtgtctggcc tctgaacaac
100
tttttcagca actaaaaaag ccacaggagt tgaactgcta ggattctgac
150
tatgctgtgg tggctagtgc tcctactcct acctacatta aaatctgttt
200
tttgttctct tgtaactagc ctttaccttc ctaacacaga ggatctgtca
250
ctgtggctct ggcccaaacc tgaccttcac tctggaacga gaacagaggt
300
ttctacccac accgtcccct cgaagccggg gacagcctca ccttgctggc
350
ctctcgctgg agcagtgccc tcaccaactg tctcacgtct ggaggcactg
400
actcgggcag tgcaggtagc tgagcctctt ggtagctgcg gctttcaagg
450
757
tgggccttgc cctggccgta gaagggattg acaagcccga agatttcata
500
ggcgatggct cccactgccc aggcatcagc cttgctgtag tcaatcactg
550
ccctggggcc aggacgggcc gtggacacct gctcagaagc agtgggtgag
600
acatcacgct gcccgcccat ctaacctttt catgtcctgc acatcacctg
650
atccatgggc taatctgaac tctgtcccaa ggaacccaga gcttgagtga
700
gctgtggctc agacccagaa ggggtctgct tagaccacct ggtttatgtg
750
acaggacttg cattctcctg gaacatgagg gaacgccgga ggaaagcaaa
800
gtggcaggga aggaacttgt gccaaattat gggtcagaaa agatggaggt
850
gttgggttat cacaaggcat cgagtctcct gcattcagtg gacatgtggg
900
ggaagggctg ccgatggcgc atgacacact cgggactcac ctctggggcc
950
atcagacagc cgtttccgcc ccgatccacg taccagctgc tgaagggcaa
1000
ctgcaggccg atgctctcat cagccaggca gcagccaaaa tctgcgatca
1050
ccagccaggg gcagccgtct gggaaggagc aagcaaagtg accatttctc
1100
ctcccctcct tccctctgag aggccctcct atgtccctac taaagccacc
1150
agcaagacat agctgacagg ggctaatggc tcagtgttgg cccaggaggt
1200
cagcaaggcc tgagagctga tcagaagggc ctgctgtgcg aacacggaaa
1250
tgcctccagt aagcacaggc tgcaaaatcc ccaggcaaag gactgtgtgg
1300
ctcaatttaa atcatgttct agtaattgga gctgtcccca agaccaaagg
1350
agctagagct tggttcaaat gatctccaag ggcccttata ccccaggaga
1400
ctttgatttg aatttgaaac cccaaatcca aacctaagaa ccaggtgcat
1450
taagaatcag ttattgccgg gtgtggtggc ctgtaatgcc aacattttgg
1500
gaggccgagg cgggtagatc acctgaggtc aggagttcaa gaccagcctg
1550
gccaacatgg tgaaacccct gtctctacta aaaatacaaa aaaactagcc
1600
aggcatggtg gtgtgtgcct gtatcccagc tactcgggag gctgagacag
1650
gagaattact tgaacctggg aggtgaagga ggctgagaca ggagaatcac
1700
ttcagcctga gcaacacagc gagactctgt ctcagaaaaa ataaaaaaag
1750
aattatggtt atttgtaa 1768
<210> 304 <211> 109 <212> PRT <213> Homo sapiens
<400> 304
Met Leu Trp Trp Leu Val Leu Leu Leu Leu Pro Thr Leu Lys Ser
1 5
Val Phe Cys Ser Leu 20
Asp Leu Ser Leu Trp 35
Thr Arg Thr Glu Val 50
Thr Ala Ser Pro Cys 65
Thr Val Ser Arg Leu 80
Glu Pro Leu Gly Ser 95
Arg Arg Arg Asp
<210> 305 <211> 989 <212> DNA
<213> Homo sapiens <400> 305
gcgggcccgc gagtccgaga cctgtcccag gagctccagc tcacgtgacc
50
tgtcactgcc tcccgccgcc tcctgcccgc gccatgaccc agccggtgcc
100
ccggctctcc gtgcccgccg cgctggccct gggctcagcc gcactgggcg
150
ccgccttcgc cactggcctc ttcctgggga ggcggtgccc cccatggcga
200
ggccggcgag agcagtgcct gcttcccccc gaggacagcc gcctgtggca
250
gtatcttctg agccgctcca tgcgggagca cccggcgctg cgaagcctga
300
ggctgctgac cctggagcag ccgcaggggg attctatgat gacctgcgag
350
caggcccagc tcttggccaa cctggcgcgg ctcatccagg ccaagaaggc
400
gctggacctg ggcaccttca cgggctactc cgccctggcc ctggccctgg
450
cgctgcccgc ggacgggcgc gtggtgacct gcgaggtgga cgcgcagccc
500
ccggagctgg gacggcccct gtggaggcag gccgaggcgg agcacaagat
550
cgacctccgg ctgaagcccg ccttggagac cctggacgag ctgctggcgg
600
cgggcgaggc cggcaccttc gacgtggccg tggtggatgc ggacaaggag
650
aactgctccg cctactacga gcgctgcctg cagctgctgc gacccggagg
700
catcctcgcc gtcctcagag tcctgtggcg cgggaaggtg ctgcaacctc
750
cgaaagggga cgtggcggcc gagtgtgtgc gaaacctaaa cgaacgcatc
800
cggcgggacg tcagggtcta catcagcctc ctgcccctgg gcgatggact
850
758
Val Thr Ser Leu Tyr Leu Pro Asn Thr Glu 25 30
Leu Trp Pro Lys Pro Asp Leu His Ser Gly 40 45
Ser Thr His Thr Val Pro Ser Lys Pro Gly 55 60
Trp Pro Leu Ala Gly Ala Val Pro Ser Pro 70 75
Glu Ala Leu Thr Arg Ala Val Gin Val Ala 85 90
Cys Gly Phe Gin Gly Gly Pro Cys Pro Gly 100 105
759
caccttggcc ttcaagatct agggctggcc cctagtgagt gggctcgagg 900
gagggttgcc tgggaacccc aggaattgac cctgagtttt aaattcgaaa 950
ataaagtggg gctgggacac aaaaaaaaaa aaaaaaaaa 989
<210> 306 <211> 262 <212> PRT
<213> Homo sapiens <400> 306
Met Thr Gin Pro Val Pro Arg Leu Ser Val Pro Ala Ala Leu Ala 15 10 15
Leu Gly Ser Ala Ala Leu Gly Ala Ala Phe Ala Thr Gly Leu Phe 20 25 30
Leu Gly Arg Arg Cys Pro Pro Trp Arg Gly Arg Arg Glu Gin Cys 35 40 45
Leu Leu Pro Pro Glu Asp Ser Arg Leu Trp Gin Tyr Leu Leu Ser 50 55 60
Arg Ser Met Arg Glu His Pro Ala Leu Arg Ser Leu Arg Leu Leu 65 70 75
Thr Leu Glu Gin Pro Gin Gly Asp Ser Met Met Thr Cys Glu Gin 80 85 90
Ala Gin Leu Leu Ala Asn Leu Ala Arg Leu Ile Gin Ala Lys Lys 95 100 105
Ala Leu Asp Leu Gly Thr Phe Thr Gly Tyr Ser Ala Leu Ala Leu 110 115 120
Ala Leu Ala Leu Pro Ala Asp Gly Arg Val Val Thr Cys Glu Val 125 130 135
Asp Ala Gin Pro Pro Glu Leu Gly Arg Pro Leu Trp Arg Gin Ala 140 145 150
Glu Ala Glu His Lys Ile Asp Leu Arg Leu Lys Pro Ala Leu Glu 155 160 165
Thr Leu Asp Glu Leu Leu Ala Ala Gly Glu Ala Gly Thr Phe Asp 170 175 180
Val Ala Val Val Asp Ala Asp Lys Glu Asn Cys Ser Ala Tyr Tyr 185 190 195
Glu Arg Cys Leu Gin Leu Leu Arg Pro Gly Gly Ile Leu Ala Val 200 205 210
Leu Arg Val Leu Trp Arg Gly Lys Val Leu Gin Pro Pro Lys Gly 215 220 225
Asp Val Ala Ala Glu Cys Val Arg Asn Leu Asn Glu Arg Ile Arg 230 235 240
Arg Asp Val Arg Val Tyr Ile Ser Leu Leu Pro Leu Gly Asp Gly 245 250 255
760
Leu Thr Leu Ala Phe Lys Ile 260
<210> 307 <211> 2272 <212> DNA
<213> Homo sapiens <400> 307
ccgccgccgc agccgctacc gccgctgcag ccgctttccg cggcctgggc
50
ctctcgccgt cagcatgcca cacgccttca agcccgggga cttggtgttc
100
gctaagatga agggctaccc tcactggcct gccaggatcg acgacatcgc
150
ggatggcgcc gtgaagcccc cacccaacaa gtaccccatc tttttctttg
200
gcacacacga aacagccttc ctgggaccca aggacctgtt cccctacgac
250
aaatgtaaag acaagtacgg gaagcccaac aagaggaaag gcttcaatga
300
agggctgtgg gagatccaga acaaccccca cgccagctac agcgcccctc
350
cgccagtgag ctcctccgac agcgaggccc ccgaggccaa ccccgccgac
400
ggcagtgacg ctgacgagga cgatgaggac cggggggtca tggccgtcac
450
agcggtaacc gccacagctg ccagcgacag gatggagagc gactcagact
500
cagacaagag tagcgacaac agtggcctga agaggaagac gcctgcgcta
550
aagatgtcgg tctcgaaacg agcccgaaag gcctccagcg acctggatca
600
ggccagcgtg tccccatccg aagaggagaa ctcggaaagc tcatctgagt
650
cggagaagac cagcgaccag gacttcacac ctgagaagaa agcagcggtc
700
cgggcgccac ggaggggccc tctgggggga cggaaaaaaa agaaggcgcc
750
gtcagcctcc gactccgact ccaaggccga ttcggacggg gccaagcctg
800
agccggtggc catggcgcgg tcggcgtcct cctcctcctc ttcctcctcc
850
tcctccgact ccgatgtgtc tgtgaagaag cctccgaggg gcaggaagcc
900
agcggagaag cctctcccga agccgcgagg gcggaaaccg aagcctgaac
950
ggcctccgtc cagctccagc agtgacagtg acagcgacga ggtggaccgc
1000
atcagtgagt ggaagcggcg ggacgaggcg cggaggcgcg agctggaggc
1050
ccggcggcgg cgagagcagg aggaggagct gcggcgcctg cgggagcagg
1100
agaaggagga gaaggagcgg aggcgcgagc gggccgaccg cggggaggct
1150
gagcggggca gcggcggcag cagcggggac gagctcaggg aggacgatga
1200
gcccgtcaag aagcggggac gcaagggccg gggccggggt cccccgtcct
1250
cctctgactc cgagcccgag gccgagctgg agagagaggc caagaaatca
1300
761
gcgaagaagc cgcagtcctc aagcacagag cccgccagga aacctggcca
1350
gaaggagaag agagtgcggc ccgaggagaa gcaacaagcc aagcccgtga
1400
aggtggagcg gacccggaag cggtccgagg gcttctcgat ggacaggaag
1450
gtagagaaga agaaagagcc ctccgtggag gagaagctgc agaagctgca
1500
cagtgagatc aagtttgccc taaaggtcga cagcccggac gtgaagaggt
1550
gcctgaatgc cctagaggag ctgggaaccc tgcaggtgac ctctcagatc
1600
ctccagaaga acacagacgt ggtggccacc ttgaagaaga ttcgccgtta
1650
caaagcgaac aaggacgtaa tggagaaggc agcagaagtc tatacccggc
1700
tcaagtcgcg aaggctggga ggtcctcggc tggagaagga ccaaagatcg gaaggccgag aggcggtgca gagaagctgg gaaagtgaac ccggggagga
1750 1800
gctggccggg gaggaggccc cccaggagaa ggcggaggac aagcccagca
1850
ccgatctctc agccccagtg aatggcgagg ccacatcaca gaagggggag
1900
agcgcagagg acaaggagca cgaggagggt cgggactcgg aggaggggcc
1950
aaggtgtggc tcctctgaag acctgcacga cagcgtacgg gagggtcccg
2000
acctggacag gcctgggagc gaccggcagg agcgcgagag ggcacggggg
2050
gactcggagg ccctggacga ggagagctga gccgcgggca gccaggccca
2100
gcccccgccc gagctcaggc tgcccctctc cttccccggc tcgcaggaga
2150
gcagagcaga gaactgtggg gaacgctgtg ctgtttgtat ttgttccctt
2200
gggttttttt ttcctgccta atttctgtga tttccaacca acatgaaatg
2250
actataaacg gttttttaat ga 2272
<210> 308 <211> 671 <212> PRT
<213> Homo sapiens <400> 308
Met Pro His Ala Phe Lys Pro Gly Asp Leu Val Phe Ala Lys Met 15 10 15
Lys Gly Tyr Pro His Trp Pro Ala Arg Ile Asp Asp Ile Ala Asp 20 25 30
Gly Ala Val Lys Pro Pro Pro Asn Lys Tyr Pro Ile Phe Phe Phe 35 40 45
Gly Thr His Glu Thr Ala Phe Leu Gly Pro Lys Asp Leu Phe Pro 50 55 60
Tyr Asp Lys Cys Lys Asp Lys Tyr Gly Lys Pro Asn Lys Arg Lys 65 70 75
Gly Phe Asn Glu Gly Leu Trp Glu Ile Gin Asn Asn Pro His Ala 80 85 90
762
Ser Tyr Ser Ala Pro Pro Pro Val Ser Ser Ser Asp Ser Glu Ala 95 100 105
Pro Glu Ala Asn Pro Ala Asp Gly Ser Asp Ala Asp Glu Asp Asp 110 115 120
Glu Asp Arg Gly Val Met Ala Val Thr Ala Val Thr Ala Thr Ala 125 130 135
Ala Ser Asp Arg Met Glu Ser Asp Ser Asp Ser Asp Lys Ser Ser 140 145 150
Asp Asn Ser Gly Leu Lys Arg Lys Thr Pro Ala Leu Lys Met Ser 155 160 165
Val Ser Lys Arg Ala Arg Lys Ala Ser Ser Asp Leu Asp Gin Ala 170 175 180
Ser Val Ser Pro Ser Glu Glu Glu Asn Ser Glu Ser Ser Ser Glu 185 190 195
Ser Glu Lys Thr Ser Asp Gin Asp Phe Thr Pro Glu Lys Lys Ala 200 205 210
Ala Val Arg Ala Pro Arg Arg Gly Pro Leu Gly Gly Arg Lys Lys 215 220 225
Lys Lys Ala Pro Ser Ala Ser Asp Ser Asp Ser Lys Ala Asp Ser 230 235 240
Asp Gly Ala Lys Pro Glu Pro Val Ala Met Ala Arg Ser Ala Ser 245 250 255
Ser Ser Ser Ser Ser Ser Ser Ser Ser Asp Ser Asp Val Ser Val 260 265 270
Lys Lys Pro Pro Arg Gly Arg Lys Pro Ala Glu Lys Pro Leu Pro 275 280 285
Lys Pro Arg Gly Arg Lys Pro Lys Pro Glu Arg Pro Pro Ser Ser 290 295 300
Ser Ser Ser Asp Ser Asp Ser Asp Glu Val Asp Arg Ile Ser Glu 305 310 315
Trp Lys Arg Arg Asp Glu Ala Arg Arg Arg Glu Leu Glu Ala Arg 320 325 330
Arg Arg Arg Glu Gin Glu Glu Glu Leu Arg Arg Leu Arg Glu Gin 335 340 345
Glu Lys Glu Glu Lys Glu Arg Arg Arg Glu Arg Ala Asp Arg Gly 350 355 360
Glu Ala Glu Arg Gly Ser Gly Gly Ser Ser Gly Asp Glu Leu Arg 365 370 375
Glu Asp Asp Glu Pro Val Lys Lys Arg Gly Arg Lys Gly Arg Gly 380 385 390
Arg Gly Pro Pro Ser Ser Ser Asp Ser Glu Pro Glu Ala Glu Leu 395 400 405
763
Glu Arg Glu Ala Lys Lys Ser Ala Lys Lys Pro Gin Ser Ser Ser 410 415 420
Thr Glu Pro Ala Arg Lys Pro Gly Gin Lys Glu Lys Arg Val Arg 425 430 435
Pro Glu Glu Lys Gin Gin Ala Lys Pro Val Lys Val Glu Arg Thr 440 445 450
Arg Lys Arg Ser Glu Gly Phe Ser Met Asp Arg Lys Val Glu Lys 455 460 465
Lys Lys Glu Pro Ser Val Glu Glu Lys Leu Gin Lys Leu His Ser 470 475 480
Glu Ile Lys Phe Ala Leu Lys Val Asp Ser Pro Asp Val Lys Arg 485 490 495
Cys Leu Asn Ala Leu Glu Glu Leu Gly Thr Leu Gin Val Thr Ser 500 505 510
Gin Ile Leu Gin Lys Asn Thr Asp Val Val Ala Thr Leu Lys Lys 515 520 525
Ile Arg Arg Tyr Lys Ala Asn Lys Asp Val Met Glu Lys Ala Ala 530 535 540
Glu Val Tyr Thr Arg Leu Lys Ser Arg Val Leu Gly Pro Lys Ile 545 550 555
Glu Ala Val Gin Lys Val Asn Lys Ala Gly Met Glu Lys Glu Lys 560 565 570
Ala Glu Glu Lys Leu Ala Gly Glu Glu Leu Ala Gly Glu Glu Ala 575 580 585
Pro Gin Glu Lys Ala Glu Asp Lys Pro Ser Thr Asp Leu Ser Ala 590 595 600
Pro Val Asn Gly Glu Ala Thr Ser Gin Lys Gly Glu Ser Ala Glu 605 610 615
Asp Lys Glu His Glu Glu Gly Arg Asp Ser Glu Glu Gly Pro Arg 620 625 630
Cys Gly Ser Ser Glu Asp Leu His Asp Ser Val Arg Glu Gly Pro 635 640 645
Asp Leu Asp Arg Pro Gly Ser Asp Arg Gin Glu Arg Glu Arg Ala 650 655 660
Arg Gly Asp Ser Glu Ala Leu Asp Glu Glu Ser 665 670
<210> 309
<211> 3871
<212> DNA
<213> Homo sapiens
<400> 309
gttggttctc ctggatcttc accttaccaa ctgcagatct tgggactcat 50
764
cagcctcaat aattatatta aattaacacc atttgaaaga gaacattgtt
100
ttcatcatga atgctaataa agatgaaaga cttaaagcca gaagccaaga
150
ttttcacctt tttcctgctt tgatgatgct aagcatgacc atgttgtttc
200
ttccagtcac tggcactttg aagcaaaata ttccaagact caagctaacc
250
tacaaagact tgctgctttc aaatagctgt attccctttt tgggttcatc
300
agaaggactg gattttcaaa ctcttctctt agatgaggaa agaggcaggc
350
tgctcttggg agccaaagac cacatctttc tactcagtct ggttgactta
400
aacaaaaatt ttaagaagat ttattggcct gctgcaaagg aacgggtgga
450
attatgtaaa ttagctggga aagatgccaa tacagaatgt gcaaatttca
500
tcagagtact ggagcatttc tcagccctat atccaatatg aacaaaactc tgggtatatt acatatatgt gatcttggag gtgtggaact tctacaagga
550 600
ggatattata ttcaaactag acacacataa tttggagtct ggcagactga
650
aatgtccttt cgatcctcag cagccttttg cttcagtaat gacagatgag
700
tacctctact ctggaacagc ttctgatttc cttggcaaag atactgcatt
750
cactcgatcc cttgggccta ctcatgacca ccactacatc agaactgaca
800
tttcagagca ctactggctc aatggagcaa aatttattgg aactttcttc
850
ataccagaca cctacaatcc agatgatgat aaaatatatt tcttctttcg
900
tgaatcatct caagaaggca gtacctccga taaaaccatc ctttctcgag
950
ttggaagagt ttgtaagaat gatgtaggag gacaacgcag cctgataaac
1000
aagtggacga cttttcttaa ggccagactg atttgctcaa ttcctggaag
1050
tgatggggca gatacttact ttgatgagct tcaagatatt tatttactcc
1100
ccacaagaga tgaaagaaat cctgtagtat atggagtctt tactacaacc
1150
agctccatct tcaaaggctc tgctgtttgt gtgtatagca tggctgacat
1200
cagagcagtt tttaatggtc catatgctca taaggaaagt gcagaccatc
1250
gttgggtgca gtatgatggg agaattcctt atccacggcc tggtacatgt
1300
ccaagcaaaa cctatgaccc actgattaag tccacccgag attttccaga
1350
tgatgtcatc agtttcataa agcggcactc tgtgatgtat aagtccgtat
1400
acccagttgc aggaggacca acgttcaaga gaatcaatgt ggattacaga
1450
ctgacacaga tagtggtgga tcatgtcatt gcagaagatg gccagtacga
1500
tgtaatgttt cttggaacag acattggaac tgtcctcaaa gttgtcagca
1550
tttcaaagga aaagtggaat atggaagagg tagtgctgga ggagttgcag
1600
765
atattcaagc actcatcaat catcttgaac atggaattgt ctctgaagca
1650
gcaacaattg tacattggtt cccgagatgg attagttcag ctctccttgc
1700
acagatgcga cacttatggg aaagcttgcg cagactgttg tcttgccaga
1750
gacccctact gtgcctggga tggaaatgca tgctctcgat atgctcctac
1800
ttctaaaagg agagctagac gccaagatgt aaaatatggc gacccaatca
1850
cccagtgctg ggacatcgaa gacagcatta gtcatgaaac tgctgatgaa
1900
aaggtgattt ttggcattga atttaactca acctttctgg aatgtatacc
1950
taaatcccaa caagcaacta ttaaatggta tatccagagg tcaggggatg
2000
agcatcgaga ggagttgaag cccgatgaaa gaatcatcaa aacggaatat
2050
gggctactga ttcgaagttt gcagaagaag gattctggga tgtattactg
2100
caaagcccag atgtcattga gagcacactt gaatgaacag tcatccacac atggaaaata catagtgaag cccagagggc ctgactttga agagcatgag
2150 2200
gaggggcagg tcaaggatct attggctgag tcacggttga gatacaaaga
2250
ctacatccaa atccttagca gcccaaactt cagcctcgac cagtactgcg
2300
aacagatgtg gcacagggag aagcggagac agagaaacaa ggggggccca
2350
aagtggaagc acatgcagga aatgaagaag aaacgaaatc gaagacatca
2400
cagagacctg gatgagctcc ctagagctgt agccacgtag ttttctactt
2450
aatttaaaga aaagaattcc ttacctataa aaacattgcc ttctgttttg
2500
tatatccctt atagtaattc ataaatgctt cccatggagt tttgctaagg
2550
cacaagacaa taatctgaat aagacaatat gtgatgaata taagaaaggg
2600
caaaaaattc atttgaacca gttttccaag aacaaatctt gcacaagcaa
2650
agtataagaa ttatcctaaa aatagggggt ttacagttgt aaatgtttta
2700
tgttttgagt tttggaattt attgtcatgt aaatagttga gctaagcaag
2750
ccccgaattt gatagtgtat aaggtgcttt attccctcga atgtccatta
2800
agcatggaat ttaccatgca gttgtgctat gttcttatga acagatatat
2850
cattcctatt gagaaccagc taccttgtgg tagggaataa gaggtcagac
2900
acaaattaag acaactccca ttatcaacag gaactttctc agtgagccat
2950
tcactcctgg agaatggtat aggaatttgg agaggtgcat tatttctttc
3000
tggccactgg ggttaaattt agtgtactac aacattgatt tactgaaggg
3050
cactaatgtt tcccccagga tttctattga ctagtcagga gtaacaggtt
3100
cacagagaga agttggtgct tagttatgtg ttttttagag tatatactaa
3150
gctctacagg gacagaatgc ttaataaata ctttaataag atatgggaaa
3200
766
atattttaat aaaacaagga aaacataatg atgtataatg catcctgatg
3250
ggaaggcatg cagatgggat ttgttagaag acagaaggaa agacagccat
3300
aaattctggc tttggggaaa actcatatcc ccatgaaaag gaagaacaat
3350
cacaaataaa gtgagagtaa tgtaatggag ctcttttcac tagggtataa
3400
gtagctgcca atttgtaatt catctgttaa aaaaaatcta gattataaca
3450
aactgctagc aaaatctgag gaaacataaa ttcttctgaa gaatcatagg
3500
aagagtagac attttattta taaccaatga tatttcagta tatattttct
3550
ctcttttaaa aaatatttat catactctgt atattatttc tttttactgc
3600
ctttattctc tcctgtatat tggattttgt gattatattt gagtgaatag
3650
gagaaaacaa tatataacac acagagaatt aagaaaatga catttctggg
3700
gagtggggat aacggaaagg atatatttgt gttaaattaa tgaataacag ctctttgaca aacgagtgta tcttcactca aaattttaac accttttctc
3750 3800
attgctgagt taatctgttg taattgtagt attgtttttg taatttaaca
3850
ataaataagc ctgctacatg t 3871
<210> 310 <211> 777 <212> PRT
<213> Homo sapiens <400> 310
Met Asn Ala Asn Lys Asp Glu Arg Leu Lys Ala Arg Ser Gin Asp 15 10 15
Phe His Leu Phe Pro Ala Leu Met Met Leu Ser Met Thr Met Leu 20 25 30
Phe Leu Pro Val Thr Gly Thr Leu Lys Gin Asn Ile Pro Arg Leu 35 40 45
Lys Leu Thr Tyr Lys Asp Leu Leu Leu Ser Asn Ser Cys Ile Pro 50 55 60
Phe Leu Gly Ser Ser Glu Gly Leu Asp Phe Gin Thr Leu Leu Leu 65 70 75
Asp Glu Glu Arg Gly Arg Leu Leu Leu Gly Ala Lys Asp His Ile 80 85 90
Phe Leu Leu Ser Leu Val Asp Leu Asn Lys Asn Phe Lys Lys Ile 95 100 105
Tyr Trp Pro Ala Ala Lys Glu Arg Val Glu Leu Cys Lys Leu Ala 110 115 120
Gly Lys Asp Ala Asn Thr Glu Cys Ala Asn Phe Ile Arg Val Leu 125 130 135
Gin Pro Tyr Asn Lys Thr His Ile Tyr Val Cys Gly Thr Gly Ala
140
767
145
150
Phe His Pro Ile Cys Gly Tyr Ile Asp Leu Gly Val Tyr Lys Glu 155 160 165
Asp Ile Ile Phe Lys Leu Asp Thr His Asn Leu Glu Ser Gly Arg 170 175 180
Leu Lys Cys Pro Phe Asp Pro Gin Gin Pro Phe Ala Ser Val Met 185 190 195
Thr Asp Glu Tyr Leu Tyr Ser Gly Thr Ala Ser Asp Phe Leu Gly 200 205 210
Lys Asp Thr Ala Phe Thr Arg Ser Leu Gly Pro Thr His Asp His 215 220 225
His Tyr Ile Arg Thr Asp Ile Ser Glu His Tyr Trp Leu Asn Gly 230 235 240
Ala Lys Phe Ile Gly Thr Phe Phe Ile Pro Asp Thr Tyr Asn Pro 245 250 255
Asp Asp Asp Lys Ile Tyr Phe Phe Phe Arg Glu Ser Ser Gin Glu 260 265 270
Gly Ser Thr Ser Asp Lys Thr Ile Leu Ser Arg Val Gly Arg Val 275 280 285
Cys Lys Asn Asp Val Gly Gly Gin Arg Ser Leu Ile Asn Lys Trp 290 295 300
Thr Thr Phe Leu Lys Ala Arg Leu Ile Cys Ser Ile Pro Gly Ser 305 310 315
Asp Gly Ala Asp Thr Tyr Phe Asp Glu Leu Gin Asp Ile Tyr Leu 320 325 330
Leu Pro Thr Arg Asp Glu Arg Asn Pro Val Val Tyr Gly Val Phe 335 340 345
Thr Thr Thr Ser Ser Ile Phe Lys Gly Ser Ala Val Cys Val Tyr 350 355 360
Ser Met Ala Asp Ile Arg Ala Val Phe Asn Gly Pro Tyr Ala His 365 370 375
Lys Glu Ser Ala Asp His Arg Trp Val Gin Tyr Asp Gly Arg Ile 380 385 390
Pro Tyr Pro Arg Pro Gly Thr Cys Pro Ser Lys Thr Tyr Asp Pro 395 400 405
Leu Ile Lys Ser Thr Arg Asp Phe Pro Asp Asp Val Ile Ser Phe 410 415 420
Ile Lys Arg His Ser Val Met Tyr Lys Ser Val Tyr Pro Val Ala 425 430 435
Gly Gly Pro Thr Phe Lys Arg Ile Asn Val Asp Tyr Arg Leu Thr 440 445 450
768
Gin Ile Val Val Asp His Val Ile Ala Glu Asp Gly Gin Tyr Asp 455 460 465
Val Met Phe Leu Gly Thr Asp Ile Gly Thr Val Leu Lys Val Val 470 475 480
Ser Ile Ser Lys Glu Lys Trp Asn Met Glu Glu Val Val Leu Glu 485 490 495
Glu Leu Gin Ile Phe Lys His Ser Ser Ile Ile Leu Asn Met Glu 500 505 510
Leu Ser Leu Lys Gin Gin Gin Leu Tyr Ile Gly Ser Arg Asp Gly 515 520 525
Leu Val Gin Leu Ser Leu His Arg Cys Asp Thr Tyr Gly Lys Ala 530 535 540
Cys Ala Asp Cys Cys Leu Ala Arg Asp Pro Tyr Cys Ala Trp Asp 545 550 555
Gly Asn Ala Cys Ser Arg Tyr Ala Pro Thr Ser Lys Arg Arg Ala 560 565 570
Arg Arg Gin Asp Val Lys Tyr Gly Asp Pro Ile Thr Gin Cys Trp 575 580 585
Asp Ile Glu Asp Ser Ile Ser His Glu Thr Ala Asp Glu Lys Val 590 595 600
Ile Phe Gly Ile Glu Phe Asn Ser Thr Phe Leu Glu Cys Ile Pro 605 610 615
Lys Ser Gin Gin Ala Thr Ile Lys Trp Tyr Ile Gin Arg Ser Gly 620 625 630
Asp Glu His Arg Glu Glu Leu Lys Pro Asp Glu Arg Ile Ile Lys 635 640 645
Thr Glu Tyr Gly Leu Leu Ile Arg Ser Leu Gin Lys Lys Asp Ser 650 655 660
Gly Met Tyr Tyr Cys Lys Ala Gin Glu His Thr Phe Ile His Thr 665 670 675
Ile Val Lys Leu Thr Leu Asn Val Ile Glu Asn Glu Gin Met Glu 680 685 690
Asn Thr Gin Arg Ala Glu His Glu Glu Gly Gin Val Lys Asp Leu 695 700 705
Leu Ala Glu Ser Arg Leu Arg Tyr Lys Asp Tyr Ile Gin Ile Leu 710 715 720
Ser Ser Pro Asn Phe Ser Leu Asp Gin Tyr Cys Glu Gin Met Trp 725 730 735
His Arg Glu Lys Arg Arg Gin Arg Asn Lys Gly Gly Pro Lys Trp 740 745 750
Lys His Met Gin Glu Met Lys Lys Lys Arg Asn Arg Arg His His 755 760 765
769
Arg Asp Leu Asp Glu Leu Pro Arg Ala Val Ala Thr 770 775
<210> 311 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 311 caacgcagcc gtgataaaca agtgg 2 5
<210> 312 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 312
gcttggacat gtaccaggcc gtgg 24 <210> 313 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 313
ggccagactg atttgctcaa ttcctggaag tgatggggca gatac 45
<210> 314
<211> 3934
<212> DNA
<213> Homo sapiens
<400> 314
ccctgacctc cctgagccac actgagctgg aagccgcaga ggtcatcctg
50
gagcatgccc accgcgggga gcagacaacc tcccaggtaa gctgggagca
100
agacctgaag ctgtttcttc aggagcctgg tgtattttcc cccaccccac
150
ctcagcagtt tcagccagca gggactgatc aggtgtgtgt cctggagtgg
200
ggagcagaag gcgtggctgg caagagtggc ctggagaaag aggttcagcg
250
cttgaccagc cgagctgccc gtgactacaa gatccagaac catgggcatc
300
gggtgaggtg ggggggcaca ggtgtcatgt gcaccttctt gtctcagcaa
350
gaagagctga gagaggggat cttggagcca ttgagggtgt catggagcta
400
cagaggggag ggaaaggtat tttaaggtaa cagtgtggca caatagttaa
450
gagcacagtt tttggagcta gaccgacata ggttcaaatt ctcttctgtt
500
770
gcttcctagt tctgtagccc caggtaaggg agtgacttaa cctctctgga
550
cttcaatttc ctcatcacta aagtagggcc aataatagca cccacctcat
600
agggaagatt aaatgacata atgtatgtga tgcaactagc aaagtaccag
650
tcccatagta agtcatgccc cacagtattt ccacccaccc ctgttctctg
700
ccttcccaac caggtactgc aacgactgga gcagaggcgg cagcaggctt
750
cagagcggga ggctccaagc atagaacaga ggttacagga agtgcgagag
800
agcatccgcc gggcacaggt gagccaggtg aagggggctg cccggctggc
850
cctgctgcag ggggctggct tagatgtgga gcgctggctg aagccagcca
900
tgacccaggc ccaggatgag gtggagcagg agcggcggct cagtgaggct
950
cggctgtccc agagggacct ctctccaacc gctgaggatg ctgagctttc
1000
tgactttgag gaatgtgagg agacgggaga gctctttgag gagcctgccc
1050
cccaagccct ggccacgagg gccctcccct gccctgcaca cgtggtattt
1100
cgctatcagg cagggcgtga ggatgagctg acaatcacgg agggtgagtg
1150
gctggaggtc atagaggagg gagatgctga cgaatgggtc aaggctcgga
1200
accagcacgg cgaggtaggc tttgtccctg agcgatatct caacttcccg
1250
gacctctccc tcccagagag cagccaagac agtgacaatc cctgcggggc
1300
agagcccaca gcattcctgg cacaggccct gtacagctac accggacaga
1350
gtgcagagga gctgagcttc cctgaggggg cactcatccg tctgctgccc
1400
cgggcccaag atggagtaga tgacggcttc tggaggggag aatttggggg
1450
ccgtgttggg gtcttcccct ccctgctggt ggaagagctg cttggccccc
1500
cagggccacc tgaactctct gaccctgaac agatgctgcc gtccccttct
1550
cctcccagct tctccccacc tgcacctacc tctgtgttgg atgggccccc
1600
tgcacctgtc ctgcctgggg acaaagccct ggacttccct gggttcctgg
1650
acatgatggc acctcgactc aggccgatgc gtccaccacc tcccccgccg
1700
gctaaagccc cggatcctgg ccacccagat cccctcacct gaaggccagg
1750
gaagccttga cccccagtga tgctgctgtc cctatcttca agctgtcaga
1800
ccacaccatc aatgatccag agcaacacag ccaaaagctg gaatcgccct
1850
tatttccacc ctcacctcca agggtggaaa cttgcccctt cccatttcta
1900
gagctggaac ccactccttt ttttcccatt gttctatcat ctctaggacc
1950
ggaactacta ccttctcttc tgtcatgacc ctatctaggg tggtgaaatg
2000
cctgaaatct ctggggctgg aaaccatcca tcaaggtctc tagtagttct
2050
771
ggcccacctc tttccccacc ctggctccat gacccacccc actctggatg
2100
ccagggtcac tggggttggg ctggggagag gaacaggcct tgggaatcag
2150
gagctggagc caggatgcga agcagctgta atggtctgag cggatttatt
2200
gacaatgaat aaagggcacg aaggccaggc cagggcctgg gcctcttgtg
2250
ctaagagggc agggggccta cggtgctatt gctttagggg cccaccacgg
2300
gcaggggcct gctcccagct gccacgctct atcatatgga gcgaggtgtt
2350
ggggaaggcg gggcaggcag cctgttgcag gcaggggaag gagaagagac
2400
tgaggggctg tgacctctcc tgaggccccc agcctgagac tgtgcaactc
2450
caggtggaag tagagctggt ccctcagctg gggggcagtg ctgtccagtg
2500
gaggggaggg ctttcacgcc cacccacccc ctggccctgc cagctggtag
2550
tccatcagca caatgaagga gacttggaga agaggaagaa taacactgtt
2600
gcttcctgtt caagctgtgt ccagcttttc ccctggggct ccaggacctt
2650
ccctacctcc accaccaaac caagggattt atagcaaagg ctaagcctgc
2700
agtttactct gtgatgggaa gggggttcag gatgagatta ggagccgaaa cctcatttag ggcttaaata ggctcaggca gtttaagtag gactcacctc
2750 2800
acatactccc tgctccctgt ggtagagaca cctgagagaa aggggagggg
2850
tcaacaatga gagaccagga gtaggtccta tcagtgcccc ccagagtaga
2900
gagcaataag agcccagccc agtgcagtcc cggctgtgtt ttcctacctg
2950
gtgatcagaa gtgtctggtt tgcttggctg cccatttgcc tcttgagtgg
3000
gcagccctgg gcttgggccc ctccctccgg ccctcagtgt tggctctgca
3050
gaagctctgg ggttcccttc aagtgcacga ggggttaggc tgctgtccct
3100
gagtcctcca ttctgtactg gggggctggc taggacctgg ggctgtggcc
3150
tctcaggggg cagcctctcc atggcaggca tccctgcctt gggctgccct
3200
cccccagacc cctgaccacc ccctgggtcc tgtcccccac cagagcccca
3250
gctcctgtct gtgggggagc catcacggtg ttcgtgcagt ccatagcgct
3300
tctcaatgtg tgtcacccgg aacctgggag gggagggaac actggggttt
3350
aggaccacaa ctcagaggct gcttggccct cccctctgac cagggacatc
3400
ctgagtttgg tggctacttc cctctggcct aaggtagggg aggccttctc
3450
agattgtggg gcacattgtg tagcctgact tctgctggag ctcccagtcc
3500
aggaggaaag agccaaggcc cacttttggg atcaggtgcc tgatcactgg
3550
gccccctacc tcagcccccc tttccctgga gcacctgccc cacctgccca
3600
cagagaacac agtggtctcc cctgtccggg ggcggctttt tccttccttg
3650
772
gagcgtccct gacggacaag tggaggcctc ttgctgcggc tgcaatggat 3700
gcaaggggct gcagagccca ggtgcactgt gtgatgatgg gagggggctc 3750
cgtcctgcag gctggaggtg gcatccacac tggacagcag gaggagggga 3800
gtgagggtaa catttccatt tcccttcatg ttttgtttct tacgttcttt 3 850
cagcatgctc cttaaaaccc cagaagcccc aatttcccca agccccattt 3900
tttcttgtct ttatctaata aactcaatat taag 3934
<210> 315 <211> 370 <212> PRT
<213> Homo sapiens <400> 315
Met Gin Leu Ala Lys Tyr Gin Ser His Ser Lys Ser Cys Pro Thr 15 10 15
Val Phe Pro Pro Thr Pro Val Leu Cys Leu Pro Asn Gin Val Leu 20 25 30
Gin Arg Leu Glu Gin Arg Arg Gin Gin Ala Ser Glu Arg Glu Ala 35 40 45
Pro Ser Ile Glu Gin Arg Leu Gin Glu Val Arg Glu Ser Ile Arg 50 55 60
Arg Ala Gin Val Ser Gin Val Lys Gly Ala Ala Arg Leu Ala Leu 65 70 75
Leu Gin Gly Ala Gly Leu Asp Val Glu Arg Trp Leu Lys Pro Ala 80 85 90
Met Thr Gin Ala Gin Asp Glu Val Glu Gin Glu Arg Arg Leu Ser 95 100 105
Glu Ala Arg Leu Ser Gin Arg Asp Leu Ser Pro Thr Ala Glu Asp 110 115 120
Ala Glu Leu Ser Asp Phe Glu Glu Cys Glu Glu Thr Gly Glu Leu 125 130 135
Phe Glu Glu Pro Ala Pro Gin Ala Leu Ala Thr Arg Ala Leu Pro 140 145 150
Cys Pro Ala His Val Val Phe Arg Tyr Gin Ala Gly Arg Glu Asp 155 160 165
Glu Leu Thr Ile Thr Glu Gly Glu Trp Leu Glu Val Ile Glu Glu 170 175 180
Gly Asp Ala Asp Glu Trp Val Lys Ala Arg Asn Gin His Gly Glu 185 190 195
Val Gly Phe Val Pro Glu Arg Tyr Leu Asn Phe Pro Asp Leu Ser 200 205 210
Leu Pro Glu Ser Ser Gin Asp Ser Asp Asn Pro Cys Gly Ala Glu
773
215 220 225
Pro Thr Ala Phe Leu Ala Gin Ala Leu Tyr Ser Tyr Thr Gly Gin 230 235 240
Ser Ala Glu Glu Leu Ser Phe Pro Glu Gly Ala Leu Ile Arg Leu 245 250 255
Leu Pro Arg Ala Gin Asp Gly Val Asp Asp Gly Phe Trp Arg Gly 260 265 270
Glu Phe Gly Gly Arg Val Gly Val Phe Pro Ser Leu Leu Val Glu 275 280 285
Glu Leu Leu Gly Pro Pro Gly Pro Pro Glu Leu Ser Asp Pro Glu 290 295 300
Gin Met Leu Pro Ser Pro Ser Pro Pro Ser Phe Ser Pro Pro Ala 305 310 315
Pro Thr Ser Val Leu Asp Gly Pro Pro Ala Pro Val Leu Pro Gly 320 325 330
Asp Lys Ala Leu Asp Phe Pro Gly Phe Leu Asp Met Met Ala Pro 335 340 345
Arg Leu Arg Pro Met Arg Pro Pro Pro Pro Pro Pro Ala Lys Ala 350 355 360
Pro Asp Pro Gly His Pro Asp Pro Leu Thr 365 370
<210> 316
<211> 4407
<212> DNA
<213> Homo sapiens
<400> 316
cacagggaga cccacagaca catatgcacg agagagacag aggaggaaag
50
agacagagac aaaggcacag cggaagaagg cagagacagg gcaggcacag
100
aagcggccca gacagagtcc tacagaggga gaggccagag aagctgcaga
150
agacacaggc agggagagac aaagatccag gaaaggaggg ctcaggagga
200
gagtttggag aagccagacc cctgggcacc tctcccaagc ccaaggacta
250
agttttctcc atttccttta acggtcctca gcccttctga aaactttgcc
300
tctgaccttg gcaggagtcc aagcccccag gctacagaga ggagctttcc
350
aaagctaggg tgtggaggac ttggtgccct agacggcctc agtccctccc
400
agctgcagta ccagtgccat gtcccagaca ggctcgcatc ccgggagggg
450
cttggcaggg cgctggctgt ggggagccca accctgcctc ctgctcccca
500
ttgtgccgct ctcctggctg gtgtggctgc ttctgctact gctggcctct
550
ctcctgccct cagcccggct ggccagcccc ctcccccggg aggaggagat
600
774
cgtgtttcca gagaagctca acggcagcgt cctgcctggc tcgggcgccc
650
ctgccaggct gttgtgccgc ttgcaggcct ttggggagac gctgctacta
700
gagctggagc aggactccgg tgtgcaggtc gaggggctga cagtgcagta
750
cctgggccag gcgcctgagc tgctgggtgg agcagagcct ggcacctacc
800
tgactggcac catcaatgga gatccggagt cggtggcatc tctgcactgg
850
gatgggggag ccctgttagg cgtgttacaa tatcgggggg ctgaactcca
900
cctccagccc ctggagggag gcacccctaa ctctgctggg ggacctgggg
950
ctcacatcct acgccggaag agtcctgcca gcggtcaagg tcccatgtgc
1000
aacgtcaagg ctcctcttgg aagccccagc cccagacccc gaagagccaa
1050
gcgctttgct tcactgagta gatttgtgga gacactggtg gtggcagatg
1100
acaagatggc cgcattccac ggtgcggggc taaagcgcta cctgctaaca
1150
gtgatggcag cagcagccaa ggccttcaag cacccaagca tccgcaatcc
1200
tgtcagcttg gtggtgactc ggctagtgat cctggggtca ggcgaggagg
1250
ggccccaagt ggggcccagt gctgcccaga ccctgcgcag cttctgtgcc
1300
tggcagcggg gcctcaacac ccctgaggac tcgggccctg accactttga
1350
cacagccatt ctgtttaccc gtcaggacct gtgtggagtc tccacttgcg
1400
acacgctggg tatggctgat gtgggcaccg tctgtgaccc ggctcggagc
1450
tgtgccattg tggaggatga tgggctccag tcagccttca ctgctgctca
1500
tgaactgggt catgtcttca acatgctcca tgacaactcc aagccatgca
1550
tcagtttgaa tgggcctttg agcacctctc gccatgtcat ggcccctgtg
1600
atggctcatg tggatcctga ggagccctgg tccccctgca gtgcccgctt
1650
catcactgac ttcctggaca atggctatgg gcactgtctc ttagacaaac
1700
cagaggctcc attgcatctg cctgtgactt tccctggcaa ggactatgat
1750
gctgaccgcc agtgccagct gaccttcggg cccgactcac gccattgtcc
1800
acagctgccg ccgccctgtg ctgccctctg gtgctctggc cacctcaatg
1850
gccatgccat gtgccagacc aaacactcgc cctgggccga tggcacaccc
1900
tgcgggcccg cacaggcctg catgggtggt cgctgcctcc acatggacca
1950
gctccaggac ttcaatattc cacaggctgg tggctggggt ccttggggac
2000
catggggtga ctgctctcgg acctgtgggg gtggtgtcca gttctcctcc
2050
cgagactgca cgaggcctgt cccccggaat ggtggcaagt actgtgaggg
2100
ccgccgtacc cgcttccgct cctgcaacac tgaggactgc ccaactggct
2150
775
cagccctgac cttccgcgag gagcagtgtg ctgcctacaa ccaccgcacc
2200
gacctcttca agagcttccc agggcccatg gactgggttc ctcgctacac
2250
aggcgtggcc ccccaggacc agtgcaaact cacctgccag gcccgggcac
2300
tgggctacta ctatgtgctg gagccacggg tggtagatgg gaccccctgt
2350
tccccggaca gctcctcggt ctgtgtccag ggccgatgca tccatgctgg
2400
ctgtgatcgc atcattggct ccaagaagaa gtttgacaag tgcatggtgt
2450
gcggagggga cggttctggt tgcagcaagc agtcaggctc cttcaggaaa
2500
ttcaggtacg gatacaacaa tgtggtcact atccccgcgg gggccaccca
2550
cattcttgtc cggcagcagg gaaaccctgg ccaccggagc atctacttgg
2600
ccctgaagct gccagatggc tcctatgccc tcaatggtga atacacgctg
2650
atgccctccc ccacagatgt ggtactgcct ggggcagtca gcttgcgcta
2700
cagcggggcc actgcagcct cagagacact gtcaggccat gggccactgg
2750
cccagccttt gacactgcaa gtcctagtgg ctggcaaccc ccaggacaca
2800
cgcctccgat acagcttctt cgtgccccgg ccgacccctt caacgccacg
2850
ccccactccc caggactggc tgcaccgaag agcacagatt ctggagatcc
2900
ttcggcggcg cctttctggg cccctgggcg caccggggcc ggcaggaaat tcggacttag aacctcacta ctgggagaaa tcccggctgc gagagagctt
2950 3000
ctgttgctgc ctcatgctaa gactcagtgg ggaggggctg tgggcgtgag
3050
acctgcccct cctctctgcc ctaatgcgca ggctggccct gccctggttt
3100
cctgccctgg gaggcagtga tgggttagtg gatggaaggg gctgacagac
3150
agccctccat ctaaactgcc ccctctgccc tgcgggtcac aggagggagg
3200
gggaaggcag ggagggcctg ggccc'cagtt gtatttattt agtatttatt
3250
cacttttatt tagcaccagg gaaggggaca aggactaggg tcctggggaa
3300
cctgacccct gacccctcat agccctcacc ctggggctag gaaatccagg
3350
gtggtggtga taggtataag tggtgtgtgt atgcgtgtgt gtgtgtgtgt
3400
gaaaatgtgt gtgtgcttat gtatgaggta caacctgttc tgctttcctc
3450
ttcctgaatt ttattttttg ggaaaagaaa agtcaagggt agggtgggcc
3500
ttcagggagt gagggattat cttttttttt ttttctttct ttctttcttt
3550
tttttttttg agacagaatc tcgctctgtc gcccaggctg gagtgcaatg
3600
gcacaatctc ggctcactgc atcctccgcc tcccgggttc aagtgattct
3650
catgcctcag cctcctgagt agctgggatt acaggctcct gccaccacgc
3700
ccagctaatt tttgttttgt tttgtttgga gacagagtct cgctattgtc
3750
776
accagggctg gaatgatttc agctcactgc aaccttcgcc acctgggttc
3800
cagcaattct cctgcctcag cctcccgagt agctgagatt ataggcacct
3850
accaccacgc ccggctaatt tttgtatttt tagtagagac ggggtttcac
3900
catgttggcc aggctggtct cgaactcctg accttaggtg atccactcgc
3950
cttcatctcc caaagtgctg ggattacagg cgtgagccac cgtgcctggc
4000
cacgcccaac taatttttgt atttttagta gagacagggt ttcaccatgt
4050
tggccaggct gctcttgaac tcctgacctc aggtaatcga cctgcctcgg
4100
cctcccaaag tgctgggatt acaggtgtga gccaccacgc ccggtacata
4150
ttttttaaat tgaattctac tatttatgtg atccttttgg agtcagacag
4200
atgtggttgc atcctaactc catgtctctg agcattagat ttctcatttg
4250
ccaataataa tacctccctt agaagtttgt tgtgaggatt aaataatgta
4300
aataaagaac tagcataaca ctcaaaaaaa aaaaaaaaaa aaaaaaaaaa
4350
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
4400
aaggaaa 4407
<210> 317 <211> 837 <212> PRT
<213> Homo sapiens <400> 317
Met Ser Gin Thr Gly Ser His Pro Gly Arg Gly Leu Ala Gly Arg 15 10 15
Trp Leu Trp Gly Ala Gin Pro Cys Leu Leu Leu Pro Ile Val Pro 20 25 30
Leu Ser Trp Leu Val Trp Leu Leu Leu Leu Leu Leu Ala Ser Leu 35 40 45
Leu Pro Ser Ala Arg Leu Ala Ser Pro Leu Pro Arg Glu Glu Glu 50 55 60
Ile Val Phe Pro Glu Lys Leu Asn Gly Ser Val Leu Pro Gly Ser 65 70 75
Gly Ala Pro Ala Arg Leu Leu Cys Arg Leu Gin Ala Phe Gly Glu 80 85 90
Thr Leu Leu Leu Glu Leu Glu Gin Asp Ser Gly Val Gin Val Glu 95 100 105
Gly Leu Thr Val Gin Tyr Leu Gly Gin Ala Pro Glu Leu Leu Gly 110 115 120
Gly Ala Glu Pro Gly Thr Tyr Leu Thr Gly Thr Ile Asn Gly Asp 125 130 135
Ill
Pro Glu Ser Val Ala Ser Leu His Trp Asp Gly Gly Ala Leu Leu 140 145 150
Gly Val Leu Gin Tyr Arg Gly Ala Glu Leu His Leu Gin Pro Leu 155 160 165
Glu Gly Gly Thr Pro Asn Ser Ala Gly Gly Pro Gly Ala His lie 170 175 180
Leu Arg Arg Lys Ser Pro Ala Ser Gly Gin Gly Pro Met Cys Asn 185 190 195
Val Lys Ala Pro Leu Gly Ser Pro Ser Pro Arg Pro Arg Arg Ala 200 205 210
Lys Arg Phe Ala Ser Leu Ser Arg Phe Val Glu Thr Leu Val Val 215 220 225
Ala Asp Asp Lys Met Ala Ala Phe His Gly Ala Gly Leu Lys Arg 230 235 240
Tyr Leu Leu Thr Val Met Ala Ala Ala Ala Lys Ala Phe Lys His .245 250 255
Pro Ser Ile Arg Asn Pro Val Ser Leu Val Val Thr Arg Leu Val 260 265 270
Ile Leu Gly Ser Gly Glu Glu Gly Pro Gin Val Gly Pro Ser Ala 275 280 285
Ala Gin Thr Leu Arg Ser Phe Cys Ala Trp Gin Arg Gly Leu Asn 290 295 300
Thr Pro Glu Asp Ser Gly Pro Asp His Phe Asp Thr Ala Ile Leu 305 310 315
Phe Thr Arg Gin Asp Leu Cys Gly Val Ser Thr Cys Asp Thr Leu 320 325 330
Gly Met Ala Asp Val Gly Thr Val Cys Asp Pro Ala Arg Ser Cys 335 340 345
Ala Ile Val Glu Asp Asp Gly Leu Gin Ser Ala Phe Thr Ala Ala 350 355 360
His Glu Leu Gly His Val Phe Asn Met Leu His Asp Asn Ser Lys 365 370 375
Pro Cys Ile Ser Leu Asn Gly Pro Leu Ser Thr Ser Arg His Val 380 385 390
Met Ala Pro Val Met Ala His Val Asp Pro Glu Glu Pro Trp Ser 395 400 405
Pro Cys Ser Ala Arg Phe Ile Thr Asp Phe Leu Asp Asn Gly Tyr 410 415 420
Gly His Cys Leu Leu Asp Lys Pro Glu Ala Pro Leu His Leu Pro 425 430 435
Val Thr Phe Pro Gly Lys Asp Tyr Asp Ala Asp Arg Gin Cys Gin 440 445 450
778
Leu Thr Phe Gly Pro Asp Ser Arg His Cys Pro Gin Leu Pro Pro 455 460 465
Pro Cys Ala Ala Leu Trp Cys Ser Gly His Leu Asn Gly His Ala 470 475 480
Met Cys Gin Thr Lys His Ser Pro Trp Ala Asp Gly Thr Pro Cys 485 490 495
Gly Pro Ala Gin Ala Cys Met Gly Gly Arg Cys Leu His Met Asp 500 505 510
Gin Leu Gin Asp Phe Asn Ile Pro Gin Ala Gly Gly Trp Gly Pro 515 520 525
Trp Gly Pro Trp Gly Asp Cys Ser Arg Thr Cys Gly Gly Gly Val 530 535 540
Gin Phe Ser Ser Arg Asp Cys Thr Arg Pro Val Pro Arg Asn Gly 545 550 555
Gly Lys Tyr Cys Glu Gly Arg Arg Thr Arg Phe Arg Ser Cys Asn 560 565 570
Thr Glu Asp Cys Pro Thr Gly Ser Ala Leu Thr Phe Arg Glu Glu 575 580 585
Gin Cys Ala Ala Tyr Asn His Arg Thr Asp Leu Phe Lys Ser Phe 590 595 600
Pro Gly Pro Met Asp Trp Val Pro Arg Tyr Thr Gly Val Ala Pro 605 610 615
Gin Asp Gin Cys Lys Leu Thr Cys Gin Ala Arg Ala Leu Gly Tyr 620 625 630
Tyr Tyr Val Leu Glu Pro Arg Val Val Asp Gly Thr Pro Cys Ser 635 640 645
Pro Asp Ser Ser Ser Val Cys Val Gin Gly Arg Cys Ile His Ala 650 655 660
Gly Cys Asp Arg Ile Ile Gly Ser Lys Lys Lys Phe Asp Lys Cys 665 670 675
Met Val Cys Gly Gly Asp Gly Ser Gly Cys Ser Lys Gin Ser Gly 680 685 690
Ser Phe Arg Lys Phe Arg Tyr Gly Tyr Asn Asn Val Val Thr Ile 695 700 705
Pro Ala Gly Ala Thr His Ile Leu Val Arg Gin Gin Gly Asn Pro 710 715 720
Gly His Arg Ser Ile Tyr Leu Ala Leu Lys Leu Pro Asp Gly Ser 725 730 735
Tyr Ala Leu Asn Gly Glu Tyr Thr Leu Met Pro Ser Pro Thr Asp 740 745 750
Val Val Leu Pro Gly Ala Val Ser Leu Arg Tyr Ser Gly Ala Thr
755
779
760
765
Ala Ala Ser
Leu Thr Leu
Leu Arg Tyr
Arg Pro Thr
Glu Ile Leu
<210> 318 <211> 23 <212> DNA <213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 318 ccctgaagct gccagatggc tcc 23
<210> 319 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 319 ctgtgctctt cggtgcagcc agtc 24
<210> 320 <211> 43 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 320
ccacagatgt ggtactgcct ggggcagtca gcttgcgcta cag 43
<210> 321 <211> 1197 <212> DNA
<213> Homo sapiens <400> 321
cagcagtggt ctctcagtcc tctcaaagca aggaaagagt actgtgtgct 50 gagagaccat ggcaaagaat cctccagaga attgtgaaga ctgtcacatt 10 0 ctaaatgcag aagcttttaa atccaagaaa atatgtaaat cacttaagat 150 ttgtggactg gtgtttggta tcctggccct aactctaatt gtcctgtttt 200
Glu Thr Leu Ser Gly His Gly Pro Leu Ala Gin Pro 770 775 780
Gin Val Leu Val Ala Gly Asn Pro Gin Asp Thr Arg 785 790 795
Ser Phe Phe Val Pro Arg Pro Thr Pro Ser Thr Pro 800 805 810
Pro Gin Asp Trp Leu His Arg Arg Ala Gin Ile Leu 815 820 825
Arg Arg Arg Pro Trp Ala Gly Arg Lys 830 835
780
gggggagcaa gcacttctgg ccggaggtac ccaaaaaagc ctatgacatg 250
gagcacactt tctacagcaa tggagagaag aagaagattt acatggaaat 300
tgatcctgtg accagaactg aaatattcag aagcggaaat ggcactgatg 350
aaacattgga agtgcacgac tttaaaaacg gatacactgg catctacttc 400
gtgggtcttc aaaaatgttt tatcaaaact cagattaaag tgattcctga 450
attttctgaa ccagaagagg aaatagatga gaatgaagaa attaccacaa 500
ctttctttga acagtcagtg atttgggtcc cagcagaaaa gcctattgaa 550
aaccgagatt ttcttaaaaa ttccaaaatt ctggagattt gtgataacgt 600
gaccatgtat tggatcaatc ccactctaat atcagtttct gagttacaag 650
actttgagga ggagggagaa gatcttcact ttcctgccaa cgaaaaaaaa 700
gggattgaac aaaatgaaca gtgggtggtc cctcaagtga aagtagagaa 7 50
gacccgtcac gccagacaag caagtgagga agaacttcca ataaatgact 800
atactgaaaa tggaatagaa tttgatccca tgctggatga gagaggttat 850
tgttgtattt actgccgtcg aggcaaccgc tattgccgcc gcgtctgtga 900
acctttacta ggctactacc catatccata ctgctaccaa ggaggacgag 950
tcatctgtcg tgtcatcatg ccttgtaact ggtgggtggc ccgcatgctg 1000
gggagggtct aataggaggt ttgagctcaa atgcttaaac tgctggcaac 1050
atataataaa tgcatgctat tcaatgaatt tctgcctatg aggcatctgg 1100 cccctggtag ccagctctcc agaattactt gtaggtaatt cctctcttca 1150
tgttctaata aacttctaca ttatcaccaa aaaaaaaaaa aaaaaaa 1197
<210> 322 <211> 317 <212> PRT
<213> Homo sapiens <400> 322
Met Ala Lys Asn Pro Pro Glu Asn Cys Glu Asp Cys His Ile Leu 15 10 15
Asn Ala Glu Ala Phe Lys Ser Lys Lys Ile Cys Lys Ser Leu Lys 20 25 30
Ile Cys Gly Leu Val Phe Gly Ile Leu Ala Leu Thr Leu Ile Val 35 40 45
Leu Phe Trp Gly Ser Lys His Phe Trp Pro Glu Val Pro Lys Lys 50 55 60
Ala Tyr Asp Met Glu His Thr Phe Tyr Ser Asn Gly Glu Lys Lys 65 70 75
Lys Ile Tyr Met Glu Ile Asp Pro Val Thr Arg Thr Glu Ile Phe
781
80 85 90
Arg Ser Gly Asn Gly Thr Asp Glu Thr Leu Glu Val His Asp Phe 95 100 105
Lys Asn Gly Tyr Thr Gly Ile Tyr Phe Val Gly Leu Gin Lys Cys 110 115 120
Phe Ile Lys Thr Gin Ile Lys Val Ile Pro Glu Phe Ser Glu Pro 125 130 135
Glu Glu Glu Ile Asp Glu Asn Glu Glu Ile Thr Thr Thr Phe Phe 140 145 150
Glu Gin Ser Val Ile Trp Val Pro Ala Glu Lys Pro Ile Glu Asn 155 160 165
Arg Asp Phe Leu Lys Asn Ser Lys Ile Leu Glu Ile Cys Asp Asn 170 175 180
Val Thr Met Tyr Trp lie Asn Pro Thr Leu lie Ser Val Ser Glu 185 190 195
Leu Gin Asp Phe Glu Glu Glu Gly Glu Asp Leu His Phe Pro Ala 200 205 210
Asn Glu Lys Lys Gly Ile Glu Gin Asn Glu Gin Trp Val Val Pro 215 220 225
Gin Val Lys Val Glu Lys Thr Arg His Ala Arg Gin Ala Ser Glu 230 235 240
Glu Glu Leu Pro Ile Asn Asp Tyr Thr Glu Asn Gly Ile Glu Phe 245 250 255
Asp Pro Met Leu Asp Glu Arg Gly Tyr Cys Cys Ile Tyr Cys Arg 260 265 270
Arg Gly Asn Arg Tyr Cys Arg Arg Val Cys Glu Pro Leu Leu Gly 275 280 285
Tyr Tyr Pro Tyr Pro Tyr Cys Tyr Gin Gly Gly Arg Val Ile Cys 290 295 300
Arg Val Ile Met Pro Cys Asn Trp Trp Val Ala Arg Met Leu Gly 305 310 315
Arg Val
<210> 323 <211> 1174 <212> DNA <213> Homo sapiens
<400> 323
gcggaactgg ctccggctgg cacctgagga gcggcgtgac cccgagggcc 50 cagggagctg cccggctggc ctaggcaggc agccgcacca tggccagcac 100 ggccgtgcag cttctgggct tcctgctcag cttcctgggc atggtgggca 150 cgttgatcac caccatcctg ccgcactggc ggaggacagc gcacgtgggc 2 00
782
accaacatcc tcacggccgt gtcctacctg aaagggctct ggatggagtg 250
tgtgtggcac agcacaggca tctaccagtg ccagatctac cgatccctgc 300
tggcgctgcc ccaagacctc caggctgccc gcgccctcat ggtcatctcc 3 50
tgcctgctct cgggcatagc ctgcgcctgc gccgtcatcg ggatgaagtg 400
cacgcgctgc gccaagggca cacccgccaa gaccaccttt gccatcctcg 450
gcggcaccct cttcatcctg gccggcctcc tgtgcatggt ggccgtctcc 500
tggaccacca acgacgtggt gcagaacttc tacaacccgc tgctgcccag 550
cggcatgaag tttgagattg gccaggccct gtacctgggc ttcatctcct 600
cgtccctctc gctcattggt ggcaccctgc tttgcctgtc ctgccaggac 650
gaggcaccct acaggcccta ccaggccccg cccagggcca ccacgaccac 700
tgcaaacacc gcacctgcct accagccacc agctgcctac aaagacaatc 750
gggccccctc agtgacctcg gccacgcaca gcgggtacag gctgaacgac 800
tacgtgtgag tccccacagc ctgcttctcc cctgggctgc tgtgggctgg 850
gtccccggcg ggactgtcaa tggaggcagg ggttccagca caaagtttac 900
ttctgggcaa tttttgtatc caaggaaata atgtgaatgc gaggaaatgt 950
ctttagagca cagggacaga gggggaaata agaggaggag aaagctctct 1000
ataccaaaga ctgaaaaaaa aaatcctgtc tgtttttgta tttattatat 1050
atatttatgt gggtgatttg ataacaagtt taatataaag tgacttggga 1100
gtttggtcag tggggttggt ttgtgatcca ggaataaacc ttgcggatgt 1150 ggctgtttat gaaaaaaaaa aaaa 1174
<210> 324 <211> 239 <212> PRT
<213> Homo sapiens <400> 324
Met Ala Ser Thr Ala Val Gin Leu Leu Gly Phe Leu Leu Ser Phe 15 10 15
Leu Gly Met Val Gly Thr Leu Ile Thr Thr Ile Leu Pro His Trp 20 25 30
Arg Arg Thr Ala His Val Gly Thr Asn Ile Leu Thr Ala Val Ser 35 40 45
Tyr Leu Lys Gly Leu Trp Met Glu Cys Val Trp His Ser Thr Gly 50 55 60
Ile Tyr Gin Cys Gin Ile Tyr Arg Ser Leu Leu Ala Leu Pro Gin 65 70 75
Asp Leu Gin Ala Ala Arg Ala Leu Met Val Ile Ser Cys Leu Leu
783
80 85 90
Ser Gly Ile Ala Cys Ala Cys Ala Val Ile Gly Met Lys Cys Thr 95 100 105
Arg Cys Ala Lys Gly Thr Pro Ala Lys Thr Thr Phe Ala Ile Leu 110 115 120
Gly Gly Thr Leu Phe Ile Leu Ala Gly Leu Leu Cys Met Val Ala 125 130 135
Val Ser Trp Thr Thr Asn Asp Val Val Gin Asn Phe Tyr Asn Pro 140 145 150
Leu Leu Pro Ser Gly Met Lys Phe Glu Ile Gly Gin Ala Leu Tyr 155 160 165
Leu Gly Phe Ile Ser Ser Ser Leu Ser Leu Ile Gly Gly Thr Leu 170 175 180
Leu Cys Leu Ser Cys Gin Asp Glu Ala Pro Tyr Arg Pro Tyr Gin 185 190 195
Ala Pro Pro Arg Ala Thr Thr Thr Thr Ala Asn Thr Ala Pro Ala 200 205 210
Tyr Gin Pro Pro Ala Ala Tyr Lys Asp Asn Arg Ala Pro Ser Val 215 220 225
Thr Ser Ala Thr His Ser Gly Tyr Arg Leu Asn Asp Tyr Val 230 235
<210> 325 <211> 2121 <212> DNA
<213> Homo sapiens <400> 325
gagctcccct ggcagcttct caggagcgcg cgcaggcggc ttagcttcac agggcgggcg accttcggca gccaggatca gcaggagggc tgtccaccac
50 100
cacatgccaa gtggtggcgt tcctcctgtc catcctgggg ctggccggct
150
gcatcgcggc caccgggatg gacatgtgga gcacccagga cctgtacgac
200
aaccccgtca cctccgtgtt ccagtacgaa gggctctgga ggagctgcgt
250
gaggcagagt tcaggcttca ccgaatgcag gccctatttc accatcctgg
300
gacttccagc catgctgcag gcagtgcgag ccctgatgat cgtaggcatc
350
gtcctgggtg ccattggcct cctggtatcc atctttgccc tgaaatgcat
400
ccgcattggc agcatggagg actctgccaa agccaacatg acactgacct
450
ccgggatcat gttcattgtc tcaggtcttt gtgcaattgc tggagtgtct
500
gtgtttgcca acatgctggt gactaacttc tggatgtcca cagctaacat
550
gtacaccggc atgggtggga tggtgcagac tgttcagacc aggtacacat
600
ttggtgcggc tctgttcgtg ggctgggtcg ctggaggcct cacactaatt
650
784
gggggtgtga tgatgtgcat cgcctgccgg ggcctggcac cagaagaaac
700
caactacaaa gccgtttctt atcatgcctc aggccacagt gttgcctaca
750
agcctggagg cttcaaggcc agcactggct ttgggtccaa caccaaaaac
800
aagaagatat acgatggagg tgcccgcaca gaggacgagg tacaatctta
850
tccttccaag cacgactatg tgtaatgctc taagacctct cagcacgggc
900
ggaagaaact cccggagagc tcacccaaaa aacaaggaga tcccatctag
950
atttcttctt gcttttgact cacagctgga agttagaaaa gcctcgattt
1000
catctttgga gaggccaaat ggtcttagcc tcagtctctg tctctaaata
1050
ttccaccata aaacagctga gttatttatg aattagaggc tatagctcac
1100
attttcaatc ctctatttct ttttttaaat ataactttct actctgatga
1150
gagaatgtgg ttttaatctc tctctcacat tttgatgatt tagacagact
1200
ccccctcttc ctcctagtca ataaacccat tgatgatcta tttcccagct
1250
tatccccaag aaaacttttg aaaggaaaga gtagacccaa agatgttatt
1300
ttctgctgtt tgaattttgt ctccccaccc ccaacttggc tagtaataaa
1350
cacttactga agaagaagca ataagagaaa gatatttgta atctctccag
1400
cccatgatct cggttttctt acactgtgat cttaaaagtt accaaaccaa
1450
agtcattttc agtttgaggc aaccaaacct ttctactgct gttgacatct
1500
tcttattaca gcaacaccat tctaggagtt tcctgagctc tccactggag
1550
tcctctttct gtcgcgggtc agaaattgtc cctagatgaa tgagaaaatt
1600
atttttttta taaaatgata atttaagtcc cactatctct taaatatagt gtgaaatagc taaaataaat ctcaccccta aatgttttag catgtggata
1650 1700
gaaggaaatg aaaaaataat tgctttgaca ttgtctatat ggtactttgt
1750
aaagtcatgc ttaagtacaa attccatgaa aagctcacac ctgtaatcct
1800
agcactttgg gaggctgagg aggaaggatc acttgagccc agaagttcga
1850
gactagcctg ggcaacatgg agaagccctg tctctacaaa atacagagag
1900
aaaaaatcag ccagtcatgg tggcatacac ctgtagtccc agcattccgg
1950
gaggctgagg tgggaggatc acttgagccc agggaggttg gggctgcagt
2000
gagccatgat cacaccactg cactccagcc aggtgacata gcgagatcct
2050
gtctaaaaaa ataaaaaata aataatggaa cacagcaagt cctaggaagt
2100
aggttaaaac taattcttta a 2121
<210> 326 <211> 261
785
<212> PRT
<213> Homo sapiens
<400> 326
Met Ser Thr Thr Thr Cys Gin Val Val Ala Phe Leu Leu Ser Ile 15 10 15
Leu Gly Leu Ala Gly Cys Ile Ala Ala Thr Gly Met Asp Met Trp 20 25 30
Ser Thr Gin Asp Leu Tyr Asp Asn Pro Val Thr Ser Val Phe Gin 35 40 45
Tyr Glu Gly Leu Trp Arg Ser Cys Val Arg Gin Ser Ser Gly Phe 50 55 60
Thr Glu Cys Arg Pro Tyr Phe Thr Ile Leu Gly Leu Pro Ala Met 65 70 75
Leu Gin Ala Val Arg Ala Leu Met Ile Val Gly Ile Val Leu Gly 80 85 90
Ala Ile Gly Leu Leu Val Ser Ile Phe Ala Leu Lys Cys Ile Arg 95 100 105
Ile Gly Ser Met Glu Asp Ser Ala Lys Ala Asn Met Thr Leu Thr 110 115 120
Ser Gly Ile Met Phe Ile Val Ser Gly Leu Cys Ala Ile Ala Gly 125 130 ' 135
Val Ser Val Phe Ala Asn Met Leu Val Thr Asn Phe Trp Met Ser 140 145 150
Thr Ala Asn Met Tyr Thr Gly Met Gly Gly Met Val Gin Thr Val 155 160 165
Gin Thr Arg Tyr Thr Phe Gly Ala Ala Leu Phe Val Gly Trp Val 170 175 180
Ala Gly Gly Leu Thr Leu Ile Gly Gly Val Met Met Cys Ile Ala 185 190 195
Cys Arg Gly Leu Ala Pro Glu Glu Thr Asn Tyr Lys Ala Val Ser 200 205 210
Tyr His Ala Ser Gly His Ser Val Ala Tyr Lys Pro Gly Gly Phe 215 220 225
Lys Ala Ser Thr Gly Phe Gly Ser Asn Thr Lys Asn Lys Lys Ile 230 235 240
Tyr Asp Gly Gly Ala Arg Thr Glu Asp Glu Val Gin Ser Tyr Pro 245 250 255
Ser Lys His Asp Tyr Val 260
<210> 327 <211> 2010 <212> DNA <213> Homo sapiens
786
<400> 327
ggaaaaactg ttctcttctg tggcacagag aaccctgctt caaagcagaa
50
gtagcagttc cggagtccag ctggctaaaa ctcatcccag aggataatgg
100
caacccatgc cttagaaatc gctgggctgt ttcttggtgg tgttggaatg
150
gtgggcacag tggctgtcac tgtcatgcct cagtggagag tgtcggcctt
200
cattgaaaac aacatcgtgg tttttgaaaa cttctgggaa ggactgtgga
250
tgaattgcgt gaggcaggct aacatcagga tgcagtgcaa aatctatgat
300
tccctgctgg ctctttctcc ggacctacag gcagccagag gactgatgtg
350
tgctgcttcc gtgatgtcct tcttggcttt catgatggcc atccttggca
400
tgaaatgcac caggtgcacg ggggacaatg agaaggtgaa ggctcacatt
450
ctgctgacgg ctggaatcat cttcatcatc acgggcatgg tggtgctcat
500
ccctgtgagc tgggttgcca atgccatcat cagagatttc tataactcaa
550
tagtgaatgt tgcccaaaaa cgtgagcttg gagaagctct ctacttagga
600
tggaccacgg cactggtgct gattgttgga ggagctctgt tctgctgcgt
650
tttttgttgc aacgaaaaga gcagtagcta cagatactcg ataccttccc
700
atcgcacaac ccaaaaaagt tatcacaccg gaaagaagtc accgagcgtc
750
tactccagaa gtcagtatgt gtagttgtgt atgttttttt aactttacta
800
taaagccatg caaatgacaa aaatctatat tactttctca aaatggaccc
850
caaagaaact ttgatttact gttcttaact gcctaatctt aattacagga
900
actgtgcatc agctatttat gattctataa gctatttcag cagaatgaga
950
tattaaaccc aatgctttga ttgttctaga aagtatagta atttgttttc
1000
taaggtggtt caagcatcta ctctttttat catttacttc aaaatgacat
1050
tgctaaagac tgcattattt tactactgta atttctccac gacatagcat
1100
tatgtacata gatgagtgta acatttatat ctcacataga gacatgctta
1150
tatggtttta tttaaaatga aatgccagtc cattacactg aataaataga
1200
actcaactat tgcttttcag ggaaatcatg gatagggttg aagaaggtta
1250
ctattaattg tttaaaaaca gcttagggat taatgtcctc catttataat
1300
gaagattaaa atgaaggctt taatcagcat tgtaaaggaa attgaatggc
1350
tttctgatat gctgtttttt agcctaggag ttagaaatcc taacttcttt
1400
atcctcttct cccagaggct ttttttttct tgtgtattaa attaacattt
1450
ttaaaacgca gatattttgt caaggggctt tgcattcaaa ctgcttttcc
1500
agggctatac tcagaagaaa gataaaagtg
787
tgatctaaga aaaagtgatg
1550
gttttaggaa agtgaaaata tttttgtttt tgtatttgaa gaagaatgat
1600
gcattttgac aagaaatcat atatgtatgg atatatttta ataagtattt
1650
gagtacagac tttgaggttt catcaatata aataaaagag cagaaaaata
1700
tgtcttggtt ttcatttgct taccaaaaaa acaacaacaa aaaaagttgt
1750
cctttgagaa cttcacctgc tcctatgtgg gtacctgagt caaaattgtc
1800
atttttgttc tgtgaaaaat aaatttcctt cttgtaccat ttctgtttag
1850
ttttactaaa atctgtaaat actgtatttt tctgtttatt ccaaatttga
1900
tgaaactgac aatccaattt gaaagtttgt gtcgacgtct gtctagctta
1950
aatgaatgtg ttctatttgc tttatacatt tatattaata aattgtacat
2000
ttttctaatt 2010
<210> 328 <211> 225 <212> PRT <213> Homo sapiens
<400> 328
Met Ala Thr His Ala Leu Glu Ile Ala Gly Leu Phe Leu Gly Gly
10 15
Val Gly Met Val Gly Thr Val Ala Val Thr Val Met Pro Gin Trp
25 30
Arg Val Ser Ala Phe Ile Glu Asn Asn Ile Val Val Phe Glu Asn
40 45
Phe Trp Glu Gly Leu Trp Met Asn Cys Val Arg Gin Ala Asn Ile
50 55 60
Arg Met Gin Cys Lys Ile Tyr Asp Ser Leu Leu Ala Leu Ser Pro
65 70 75
Asp Leu Gin Ala Ala Arg Gly Leu Met Cys Ala Ala Ser Val Met
80 85 90
Ser Phe Leu Ala Phe Met Met Ala Ile Leu Gly Met Lys Cys Thr
95 100 105
Arg Cys Thr Gly Asp Asn Glu Lys Val Lys Ala His Ile Leu Leu
110 115 120
Thr Ala Gly Ile Ile Phe Ile Ile Thr Gly Met Val Val Leu Ile
125 130 135
Pro Val Ser Trp Val Ala Asn Ala Ile Ile Arg Asp Phe Tyr Asn
140 145 150
Ser Ile Val Asn Val Ala Gin Lys Arg Glu Leu Gly Glu Ala Leu
155 160 165
Tyr Leu Gly Trp Thr Thr Ala Leu Val Leu Ile Val Gly Gly Ala
788
170 175 180
Leu Phe Cys Cys Val Phe Cys Cys Asn Glu Lys Ser Ser Ser Tyr 185 190 195
Arg Tyr Ser Ile Pro Ser His Arg Thr Thr Gin Lys Ser Tyr His 200 205 210
Thr Gly Lys Lys Ser Pro Ser Val Tyr Ser Arg Ser Gin Tyr Val 215 220 225
<210> 329 <211> 1315 <212> DNA <213> Homo sapiens
<400> 329
tcgccatggc ctctgccgga atgcagatcc tgggagtcgt cctgacactg
50
ctgggctggg tgaatggcct ggtctcctgt gccctgccca tgtggaaggt
100
gaccgctttc atcggcaaca gcatcgtggt ggcccaggtg gtgtgggagg
150
gcctgtggat gtcctgcgtg gtgcagagca ccggccagat gcagtgcaag
200
gtgtacgact cactgctggc gctgccacag gacctgcagg ctgcacgtgc
250
cctctgtgtc atcgccctcc ttgtggccct gttcggcttg ctggtctacc
300
ttgctggggc caagtgtacc acctgtgtgg aggagaagga ttccaaggcc
350
cgcctggtgc tcacctctgg gattgtcttt gtcatctcag gggtcctgac
400
gctaatcccc gtgtgctgga cggcgcatgc catcatccgg gacttctata
450
accccctggt ggctgaggcc caaaagcggg agctgggggc ctccctctac
500
ttgggctggg cggcctcagg ccttttgttg ctgggtgggg ggttgctgtg
550
ctgcacttgc ccctcggggg ggtcccaggg ccccagccat tacatggccc
600
gctactcaac atctgcccct gccatctctc gggggccctc tgagtaccct
650
accaagaatt gagccatcca acgtctgacg gaagtggcag tggaggggaa tgcccaacag tgggggctcc ctttgggatg gctggcgcta ggttcgtacc
700 750
ttttgtttct gcctcctgct atttttcttt tgactgagga tatttaaaat
800
tcatttgaaa actgagccaa ggtgttgact cagactctca cttaggctct
850
gctgtttctc acccttggat gatggagcca aagaggggat gctttgagat
900
tctggatctt gacatgccca tcttagaagc cagtcaagct atggaactaa
950
tgcggaggct gcttgctgtg ctggctttgc aacaagacag actgtcccca
1000
agagttcctg ctgctgctgg gggctgggct tccctagatg tcactggaca
1050
gctgcccccc atcctactca ggtctctgga gctcctctct tcacccctgg
1100
aaaaacaaat catctgttaa caaaggactg cccacctccg gaacttctga
1150
789
cctctgtttc ctccgtcctg ataagacgtc caccccccag ggccaggtcc 1200
cagctatgta gacccccgcc cccacctcca acactgcacc cttctgccct 12 50
gcccccctcg tctcaccccc tttacactca catttttatc aaataaagca 1300
tgttttgtta gtgca 1315
<210> 330 <211> 220 <212> PRT <213> Homo sapiens
<400> 330
Met Ala Ser Ala Gly Met Gin lie Leu Gly Val Val Leu Thr Leu 15 10 15
Leu Gly Trp Val Asn Gly Leu Val Ser Cys Ala Leu Pro Met Trp 20 25 30
Lys Val Thr Ala Phe Ile Gly Asn Ser Ile Val Val Ala Gin Val 35 40 45
Val Trp Glu Gly Leu Trp Met Ser Cys Val Val Gin Ser Thr Gly 50 55 60
Gin Met Gin Cys Lys Val Tyr Asp Ser Leu Leu Ala Leu Pro Gin 65 70 75
Asp Leu Gin Ala Ala Arg Ala Leu Cys Val Ile Ala Leu Leu Val 80 85 90
Ala Leu Phe Gly Leu Leu Val Tyr Leu Ala Gly Ala Lys Cys Thr 95 100 105
Thr Cys Val Glu Glu Lys Asp Ser Lys Ala Arg Leu Val Leu Thr 110 115 120
Ser Gly Ile Val Phe Val Ile Ser Gly Val Leu Thr Leu Ile Pro 125 130 135
Val Cys Trp Thr Ala His Ala Ile Ile Arg Asp Phe Tyr Asn Pro 140 145 150
Leu Val Ala Glu Ala Gin Lys Arg Glu Leu Gly Ala Ser Leu Tyr 155 160 165
Leu Gly Trp Ala Ala Ser Gly Leu Leu Leu Leu Gly Gly Gly Leu 170 175 180
Leu Cys Cys Thr Cys Pro Ser Gly Gly Ser Gin Gly Pro Ser His 185 190 195
Tyr Met Ala Arg Tyr Ser Thr Ser Ala Pro Ala Ile Ser Arg Gly 200 205 210
Pro Ser Glu Tyr Pro Thr Lys Asn Tyr Val 215 220
<210> 331 <211> 1160 <212> DNA
790
<213> Homo sapiens
<400> 331
gccaaggaga acatcatcaa agacttctct agactcaaaa ggcttccacg
50
ttctacatct tgagcatctt ctaccactcc gaattgaacc agtcttcaaa
100
gtaaaggcaa tggcatttta tcccttgcaa attgctgggc tggttcttgg
150
gttccttggc atggtgggga ctcttgccac aacccttctg cctcagtggt
200
ggagtatcag cttttgttgg cagcaacatt attgtctttg agaggctctg
250
ggaagggctc tggatgaatt gcatccgaca agccagggtc cggttgcaat
300
gcaagttcta tagctccttg ttggctctcc cgcctgccct ggaaacagcc
350
cgggccctca tgtgtgtggc tgttgctctc tccttgatcg ccctgcttat
400
tggcatctgt ggcatgaagc aggtccagtg cacaggctct aacgagaggg
450
ccaaagcata ccttctggga acttcaggag tcctcttcat cctgacgggt
500
atcttcgttc tgattccggt gagctggaca gccaatataa tcatcagaga
550
tttctacaac ccagccatcc acataggtca gaaacgagag ctgggagcag
600
cacttttcct tggctgggca agcgctgctg tcctcttcat tggagggggt
650
ctgctttgtg gattttgctg ctgcaacaga aagaagcaag ggtacagata
700
tccagtgcct ggctaccgtg tgccacacac agataagcga agaaatacga
750
caatgcttag taagacctcc accagttatg tctaatgcct ccttttggct
800
ccaagtatgg actatggtca atgttttfcta taaagtcctg ctagaaactg
850
taagtatgtg aggcaggaga acttgcttta tgtctagatt tacattgata
900
cgaaagtttc aatttgttac tggtggtagg aatgaaaatg acttacttgg
950
acattctgac ttcaggtgta ttaaatgcat tgactattgt tggacccaat
1000
cgctgctcca attttcatat tctaaattca agtataccca taatcattag
1050
caagtgtaca ctgataagaa atgatggact tctaaagttg acttattact aaattgatat ttttgaccat tctataacaa catgtattat taaaacatat
1100 1150
acctattcta
1160
<210> 332 <211> 173 <212> PRT <213> Homo sapiens
<400> 332
Met Asn Cys Ile Arg Gin Ala Arg Val Arg Leu Gin Cys Lys Phe 1 5 10 15
Tyr Ser Ser Leu Leu Ala Leu Pro Pro Ala Leu Glu Thr Ala Arg 20 25 30
791
Ala Leu Met Cys Val Ala Val Ala Leu Ser Leu Ile Ala Leu Leu 35 40 45
Ile Gly Ile Cys Gly Met Lys Gin Val Gin Cys Thr Gly Ser Asn 50 55 60
Glu Arg Ala Lys Ala Tyr Leu Leu Gly Thr Ser Gly Val Leu Phe 65 70 75
Ile Leu Thr Gly Ile Phe Val Leu Ile Pro Val Ser Trp Thr Ala 80 85 90
Asn Ile Ile Ile Arg Asp Phe Tyr Asn Pro Ala Ile His Ile Gly 95 100 105
Gin Lys Arg Glu Leu Gly Ala Ala Leu Phe Leu Gly Trp Ala Ser 110 115 120
Ala Ala Val Leu Phe Ile Gly Gly Gly Leu Leu Cys Gly Phe Cys 125 130 135
Cys Cys Asn Arg Lys Lys Gin Gly Tyr Arg Tyr Pro Val Pro Gly 140 145 150
Tyr Arg Val Pro His Thr Asp Lys Arg Arg Asn Thr Thr Met Leu 155 160 165
Ser Lys Thr Ser Thr Ser Tyr Val 170
<210> 333 <211> 535 <212> DNA <213> Homo sapiens
<400> 333
agtgacaatc tcagagcagc ttctacacca cagccatttc cagcatgaag
50
atcactgggg gtctccttct gctctgtaca gtggtctatt tctgtagcag
100
ctcagaagct gctagtctgt ctccaaaaaa agtggactgc agcatttaca
150
agaagtatcc agtggtggcc atcccctgcc ccatcacata cctaccagtt
200
tgtggttctg actacatcac ctatgggaat gaatgtcact tgtgtaccga
250
gagcttgaaa aaattctcca agtaatggaa tggacataga gagttcagtt gagaaaggaa tcttcacgat tgatattctc ggaagttgct atcatcatct
300 350
tcatcatccc aggctctgac tgagtttctt tcagttttac tgatgttctg
400
ggtgggggac agagccagat tcagagtaat cttgactgaa tggagaaagt
450
ttctgtgcta cccctacaaa cccatgcctc actgacagac cagcattttt
500
tttttaacac gtcaataaaa aaataatctc ccaga 535
<210> 334 <211> 85 <212> PRT
<213> Homo sapiens
792
<400> 334
Met Lys Ile Thr Gly Gly Leu Leu Leu Leu Cys Thr Val Val Tyr 15 10 15
Phe Cys Ser Ser Ser Glu Ala Ala Ser Leu Ser Pro Lys Lys Val 20 25 30
Asp Cys Ser Ile Tyr Lys Lys Tyr Pro Val Val Ala Ile Pro Cys 35 40 45
Pro Ile Thr Tyr Leu Pro Val Cys Gly Ser Asp Tyr Ile Thr Tyr 50 55 60
Gly Asn Glu Cys His Leu Cys Thr Glu Ser Leu Lys Ser Asn Gly 65 70 75
Arg Val Gin Phe Leu His Asp Gly Ser Cys 80 85
<210> 335 <211> 742 <212> DNA <213> Homo sapiens
<400> 335
cccgcgcccg gttctccctc gcagcacctc gaagtgcgcc cctcgccctc
50
ctgctcgcgc cccgccgcca tggctgcctc ccccgcgcgg cctgctgtcc
100
tggccctgac cgggctggcg ctgctcctgc tcctgtgctg gggcccaggt
150
ggcataagtg gaaataaact caagctgatg cttcaaaaac gagaagcacc
200
tgttccaact aagactaaag tggccgttga tgagaataaa gccaaagaat
250
tccttggcag cctgaagcgc cagaagcggc agctgtggga ccggactcgg
300
cccgaggtgc agcagtggta ccagcagttt ctctacatgg gctttgatga
350
agcgaaattt gaagatgaca tcacctattg gcttaacaga gatcgaaatg
400
gacatgaata ctatggcgat tactaccaac gtcactatga tgaagactct
450
gcaattggtc cccggagccc ctacggcttt aggcatggag ccagcgtcaa
500
ctacgatgac tactaaccat gacttgccac acgctgtaca agaagcaaat
550
agcgattctc tttgctctat ttcatgtatc ttcagcagat tcctaatgcc cttttctacc ttacactact tactttgtgt tggtttctga gatcaaaaaa
600 650
gaagagttaa aacaacacat gtaaatgcct tttgatattt catgggaatg
700
cctctcattt aaaaatagaa ataaagcatt ttgttaaaaa ga 742
<210> 336 <211> 148 <212> PRT
<213> Homo sapiens <400> 336
Met Ala Ala Ser Pro Ala Arg Pro Ala Val Leu Ala Leu Thr Gly 15 10 15
793
Leu Ala Leu Leu Leu Leu Leu Cys Trp Gly Pro Gly Gly Ile Ser 20 25 30
Gly Asn Lys Leu Lys Leu Met Leu Gin Lys Arg Glu Ala Pro Val 35 40 45
Pro Thr Lys Thr Lys Val Ala Val Asp Glu Asn Lys Ala Lys Glu 50 55 60
Phe Leu Gly Ser Leu Lys Arg Gin Lys Arg Gin Leu Trp Asp Arg 65 70 75
Thr Arg Pro Glu Val Gin Gin Trp Tyr Gin Gin Phe Leu Tyr Met 80 85 90
Gly Phe Asp Glu Ala Lys Phe Glu Asp Asp Ile Thr Tyr Trp Leu 95 100 105
Asn Arg Asp Arg Asn Gly His Glu Tyr Tyr Gly Asp Tyr Tyr Gin 110 115 120
Arg His Tyr Asp Glu Asp Ser Ala Ile Gly Pro Arg Ser Pro Tyr 125 130 135
Gly Phe Arg His Gly Ala - Ser Val Asn Tyr Asp Asp Tyr 140 145
<210> 337
<211> 1310
<212> DNA
<213> Homo sapiens
<400> 337
cggctcgagc ccgcccggaa gtgcccgagg ggccgcgatg gagctggggg
50
agccgggcgc tcggtagcgc ggcgggcaag gcaggcgcca tgaccctgat
100
tgaaggggtg ggtgatgagg tgaccgtcct tttctcggtg cttgcctgcc
150
ttctggtgct ggcccttgcc tgggtctcaa cgcacaccgc tgagggcggg
200
gacccactgc cccagccgtc agggacccca acgccatccc agcccagcgc
250
agccatggca gctaccgaca gcatgagagg ggaggcccca ggggcagaga
300
cccccagcct gagacacaga ggtcaagctg cacagccaga gcccagcacg
350
gggttcacag gctacggctg caacaccgcc aaattcctca agccccggac atgattcaga tccccgcagg gcaggtggcc agcccctcgt agggcctggc
400 450
cccacgacac cattggctcc ttgaaaagga cccagtttcc cggccgggaa
500
cagcaggtgc gactcatcta ccaagggcag ctgctaggcg acgacaccca
550
gaccctgggc agccttcacc tccctcccaa ctgcgttctc cactgccacg
600
tgtccacgag agtcggtccc ccaaatcccc cctgcccgcc ggggtccgag
650
cccggcccct ccgggctgga aatcggcagc ctgctgctgc ccctgctgct
700
cctgctgttg ctgctgctct ggtactgcca gatccagtac cggcccttct
750
794
ttcccctgac cgccactctg ggcctggccg gcttcaccct gctcctcagt 800
ctcctggcct ttgccatgta ccgcccgtag tgcctccgcg ggcgcttggc 850
agcgtcgccg gcccctccgg accttgctcc ccgcgccgcg gcgggagctg 900
ctgcctgccc aggcccgcct ctccggcctg cctcttcccg ctgccctgga 950
gcccagccct gcgccgcaga ggactcccgg gactggcgga ggccccgccc 1000
tgcgaccgcc ggggctcggg gccacctccc ggggctgctg aacctcagcc 1050
cgcactggga gtgggctcct cggggtcggg catctgctgt cgctgcctcg 1100
gccccgggca gagccgggcc gccccggggg cccgtcttag tgttctgccg 1150
gaggacccag ccgcctccaa tccctgacag ctccttgggc tgagttgggg 12 00
acgccaggtc ggtgggaggc tggtgaaggg gagcggggag gggcagagga 1250
gttccccgga acccgtgcag attaaagtaa ctgtgaagtt ttaaaaaaaa 1300
aaaaaaaaaa 1310
<210> 338 <211> 246 <212> PRT
<213> Homo sapiens <400> 338
Met Thr Leu Ile Glu Gly Val Gly Asp Glu Val Thr Val Leu Phe 15 10 15
Ser Val Leu Ala Cys Leu Leu Val Leu Ala Leu Ala Trp Val Ser 20 25 30
Thr His Thr Ala Glu Gly Gly Asp Pro Leu Pro Gin Pro Ser Gly 35 40 45
Thr Pro Thr Pro Ser Gin Pro Ser Ala Ala Met Ala Ala Thr Asp 50 55 60
Ser Met Arg Gly Glu Ala Pro Gly Ala Glu Thr Pro Ser Leu Arg 65 70 75
His Arg Gly Gin Ala Ala Gin Pro Glu Pro Ser Thr Gly Phe Thr 80 85 90
Ala Thr Pro Pro Ala Pro Asp Ser Pro Gin Glu Pro Leu Val Leu 95 100 105
Arg Leu Lys Phe Leu Asn Asp Ser Glu Gin Val Ala Arg Ala Trp 110 115 120
Pro His Asp Thr Ile Gly Ser Leu Lys Arg Thr Gin Phe Pro Gly 125 130 135
Arg Glu Gin Gin Val Arg Leu Ile Tyr Gin Gly Gin Leu Leu Gly 140 145 150
Asp Asp Thr Gin Thr Leu Gly Ser Leu His Leu Pro Pro Asn Cys
795
155 160 165
Val Leu His Cys His Val Ser Thr Arg Val Gly Pro Pro Asn Pro 170 175 180
Pro Cys Pro Pro Gly Ser Glu Pro Gly Pro Ser Gly Leu Glu Ile 185 190 195
Gly Ser Leu Leu Leu Pro Leu Leu Leu Leu Leu Leu Leu Leu Leu 200 205 210
Trp Tyr Cys Gin Ile Gin Tyr Arg Pro Phe Phe Pro Leu Thr Ala 215 . 220 225
Thr Leu,Gly Leu Ala Gly Phe Thr Leu Leu Leu Ser Leu Leu Ala 230 235 240
Phe Ala Met Tyr Arg Pro 245
<210> 339 <211> 849 <212> DNA <213> Homo sapiens
<400> 339
gagattggaa acagccaggt tggagcagtg agtgagtaag gaaacctggc
50
tgccctctcc agattcccca ggctctcaga gaagatcagc agaaagtctg
100
caagacccta agaaccatca gccctcagct gcacctcctc ccctccaagg
150
atgacaaagg cgctactcat ctatttggtc agcagctttc ttgccctaaa
200
tcaggccagc ctcatcagtc gctgtgactt ggcccaggtg ctgcagctgg
250
aggacttgga tgggtttgag ggttactccc tgagtgactg gctgtgcctg
300
gcttttgtgg aaagcaagtt caacatatca aagataaatg aaaatgcgga
350
tggaagcttt gactatggcc tcttccagat caacagccac tactggtgca
400
acgattataa gagttactcg gaaaaccttt gccacgtaga ctgtcaagat
450
ctgctgaatc ccaaccttct tgcaggcatc cactgcgcaa aaaggattgt
500
gtccggagca cgggggatga acaactgggt agaatggagg ttgcactgtt
550
caggccggcc actctcctac tggctgacag gatgccgcct gagatgaaac
600
agggtgcggg ggattcttca tgcaccgtgg tttcttcttc agtcattcca ctactgcctc agactcctgt cacttcatgt cctcactcag tattttcttc
650 700
ccttcccatt tacaactaaa actgaccaga gccccaggaa taaatggttt
750
tcttggcttc ctccttactc ccatctggac ccagtcccct ggttcctgtc
800
tgttatttgt aaactgagga ccacaataaa gaaatcttta tatttatcg 849
<210> 340 <211> 148 <212> PRT
<213> Homo sapiens
796
<400> 340
Met Thr Lys Ala Leu Leu Ile Tyr Leu Val Ser Ser Phe Leu Ala 15 10 15
Leu Asn Gin Ala Ser Leu Ile Ser Arg Cys Asp Leu Ala Gin Val 20 25 30
Leu Gin Leu Glu Asp Leu Asp Gly Phe Glu Gly Tyr Ser Leu Ser 35 40 45
Asp Trp Leu Cys Leu Ala Phe Val Glu Ser Lys Phe Asn Ile Ser 50 55 60
Lys Ile Asn Glu Asn Ala Asp Gly Ser Phe Asp Tyr Gly Leu Phe 65 70 75
Gin Ile Asn Ser His Tyr Trp Cys Asn Asp Tyr Lys Ser Tyr Ser 80 85 90
Glu Asn Leu Cys His Val Asp Cys Gin Asp Leu Leu Asn Pro Asn 95 100 105
Leu Leu Ala Gly Ile His Cys Ala Lys Arg Ile Val Ser Gly Ala 110 115 120
Arg Gly Met Asn Asn Trp Val Glu Trp Arg Leu His Cys Ser Gly 125 130 135
Arg Pro Leu Ser Tyr Trp Leu Thr Gly Cys Arg Leu Arg 140 145
<210> 341 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 341 ccctccaagg atgacaaagg cgc 23
<210> 342 <211> 29 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 342 ggtcagcagc tttcttgccc taaatcagg 2 9
<210> 343 <211> 24 <212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
797
<400> 343 atctcaggcg gcatcctgtc agcc 24
<210> 344 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 344 gtggatgcct gcaagaaggt tggg 24
<210> 345 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 345
agctttcttg ccctaaatca ggccagcctc atcagtcgct gtgac 45
<210> 346 <211> 2575 <212> DNA <213> Homo sapiens
<400> 346
tctgacctga ctggaagcgt ccaaagaggg acggctgtca gccctgcttg
50
actgagaacc caccagctca tcccagacac ctcatagcaa cctatttata
100
caaaggggga aagaaacacc tgagcagaat ggaatcatta tttttttccc
150
aaggagaaaa ccggggtaaa gggagggaag caattcaatt tgaagtccct
200
gtgaatgggc tttcagaagg caattaaaga aatccactca gagaggactt
250
ggggtgaaac ttgggtcctg tggttttctg attgtaagtg gaagcaggtc
300
ttgcacacgc tgttggcaaa tgtcaggacc aggttaagtg actggcagaa
350
aaacttccag gtggaacaag caacccatgt tctgctgcaa gcttgaagga
400
gcctggagcg ggagaaagct aacttgaaca tgacctgttg catttggcaa
450
gttctagcaa catgctccta aggaagcgat acaggcacag accatgcaga
500
ctccagttcc ggcgatgttg tcctgctgct caccctcccc cctgatgctg accacaccct ggatgcgtcc gcaccagact tgatgatggt gtcacagccc
550 600
aagccagcaa gcacagccct gaagccaggt accgcctgga ctttggggaa
650
tcccaggatt gggtactgga agctgaggat gagggtgaag agtacagccc
700
tctggagggc ctgccaccct ttatctcact gcgggaggat cagctgctgg
750
tggccgtggc cttaccccag gccagaagga accagagcca gggcaggaga
800
798
ggtgggagct accgcctcat caagcagcca aggaggcagg ataaggaagc
850
cccaaagagg gactgggggg ctgatgagga cggggaggtg tctgaagaag
900
aggagttgac cccgttcagc ctggacccac gtggcctcca ggaggcactc
950
agtgcccgca tccccctcca gagggctctg cccgaggtgc ggcacccact
1000
gtgtctgcag cagcaccctc aggacagcct gcccacagcc agcgtcatcc
1050
tctgtttcca tgatgaggcc tggtccactc tcctgcggac tgtacacagc
1100
atcctcgaca cagtgcccag ggccttcctg aaggagatca tcctcgtgga
1150
cgacctcagc cagcaaggac aactcaagtc tgctctcagc gaatatgtgg
1200
ccaggctgga gggggtgaag ttactcagga gcaacaagag gctgggtgcc
1250
atcagggccc ggatgctggg ggccaccaga gccaccgggg atgtgctcgt
1300
cttcatggat gcccactgcg agtgccaccc aggctggctg gagcccctcc
1350
tcagcagaat agctggtgac aggagccgag tggtatctcc ggtgatagat
1400
gtgattgact ggaagacttt ccagtattac ccctcaaagg acctgcagcg
1450
tggggtgttg gactggaagc tggatttcca ctgggaacct ttgccagagc
1500
atgtgaggaa ggccctccag tcccccataa gccccatcag gagccctgtg
1550
gtgcccggag aggtggtggc catggacaga cattacttcc aaaacactgg
1600
agcgtatgac tctcttatgt cgctgcgagg tggtgaaaac ctcgaactgt
1650
ctttcaaggc ctggctctgt ggtggctctg ttgaaatcct tccctgctct
1700
cgggtaggac acatctacca aaatcaggat tcccattccc ccctcgacca
1750
ggaggccacc ctgaggaaca gggttcgcat tgctgagacc tggctggggt
1800
cattcaaaga aaccttctac aagcatagcc cagaggcctt ctccttgagc
1850
aaggctgaga agccagactg catggaacgc ttgcagctgc aaaggagact
1900
gggttgtcgg acattccact ggtttctggc taatgtctac cctgagctgt
1950
acccatctga acccaggccc agtttctctg gaaagctcca caacactgga
2000
cttgggctct gtgcagactg ccaggcagaa ggggacatcc tgggctgtcc
2050
catggtgttg gctccttgca gtgacagccg gcagcaacag tacctgcagc
2100
acaccagcag gctgtcaggc gaaggagatt aggagcaggt cactttggca gattcttcag gcccacagca aactgcacgg cctgtgcttt aggaaggcct
2150 2200
ggccatccac cagcagcact gggacttcca ggagaatggg atgattgtcc
2250
acattctttc tgggaaatgc atggaagctg tggtgcaaga aaacaataaa
2300
gatttgtacc tgcgtccgtg tgatggaaaa gcccgccagc agtggcgatt
2350
799
tgaccagata aatgctgtgg atgaacgatg aatgtcaatg tcagaaggaa 2400
aagagaattt tggccatcaa aatccagctc caagtgaacg taaagagctt 2450
atatatttca tgaagctgat ccttttgtgt gtgtgctcct tgtgttagga 2500
gagaaaaaag ctctatgaaa gaatatagga agtttctcct tttcacacct 2550
tatttcattg actgctggct gctta 2575
<210> 347 <211> 639 <212> PRT <213> Homo sapiens
<400> 347
Met Leu Leu Arg Lys Arg Tyr Arg His Arg Pro Cys Arg Leu Gin 15 10 15
Phe Leu Leu Leu Leu Leu Met Leu Gly Cys Val Leu Met Met Val 20 25 30
Ala Met Leu His Pro Pro His His Thr Leu His Gin Thr Val Thr 35 40 45
Ala Gin Ala Ser Lys His Ser Pro Glu Ala Arg Tyr Arg Leu Asp 50 55 60
Phe Gly Glu Ser Gin Asp Trp Val Leu Glu Ala Glu Asp Glu Gly 65 70 75
Glu Glu Tyr Ser Pro Leu Glu Gly Leu Pro Pro Phe Ile Ser Leu 80 85 90
Arg Glu Asp Gin Leu Leu Val Ala Val Ala Leu Pro Gin Ala Arg 95 100 105
Arg Asn Gin Ser Gin Gly Arg Arg Gly Gly Ser Tyr Arg Leu Ile 110 115 120
Lys Gin Pro Arg Arg Gin Asp Lys Glu Ala Pro Lys Arg Asp Trp 125 130 135
Gly Ala Asp Glu Asp Gly Glu Val Ser Glu Glu Glu Glu Leu Thr 140 145 150
Pro Phe Ser Leu Asp Pro Arg Gly Leu Gin Glu Ala Leu Ser Ala 155 160 165
Arg Ile Pro Leu Gin Arg Ala Leu Pro Glu Val Arg His Pro Leu 170 175 180
Cys Leu Gin Gin His Pro Gin Asp Ser Leu Pro Thr Ala Ser Val 185 190 195
Ile Leu Cys Phe His Asp Glu Ala Trp Ser Thr Leu Leu Arg Thr 200 205 210
Val His Ser Ile Leu Asp Thr Val Pro Arg Ala Phe Leu Lys Glu 215 220 225
Ile Ile Leu Val Asp Asp Leu Ser Gin Gin Gly Gin Leu Lys Ser
230
800
235
240
Ala Leu Ser Glu Tyr Val Ala Arg Leu Glu Gly Val Lys Leu Leu 245 250 255
Arg Ser Asn Lys Arg Leu Gly Ala Ile Arg Ala Arg Met Leu Gly 260 265 270
Ala Thr Arg Ala Thr Gly Asp Val Leu Val Phe Met Asp Ala His 275 280 285
Cys Glu Cys His Pro Gly Trp Leu Glu Pro Leu Leu Ser Arg Ile 290 295 300
Ala Gly Asp Arg Ser Arg Val Val Ser Pro Val Ile Asp Val Ile 305 310 315
Asp Trp Lys Thr Phe Gin Tyr Tyr Pro Ser Lys Asp Leu Gin Arg 320 325 330
Gly Val Leu Asp Trp Lys Leu Asp Phe His Trp Glu Pro Leu Pro 335 340 345
Glu His Val Arg Lys Ala Leu Gin Ser Pro Ile Ser Pro Ile Arg 350 355 360
Ser Pro Val Val Pro Gly Glu Val Val Ala Met Asp Arg His Tyr 365 370 375
Phe Gin Asn Thr Gly Ala Tyr Asp Ser Leu Met Ser Leu Arg Gly 380 385 390
Gly Glu Asn Leu Glu Leu Ser Phe Lys Ala Trp Leu Cys Gly Gly 395 400 405
Ser Val Glu Ile Leu Pro Cys Ser Arg Val Gly His Ile Tyr Gin 410 415 420
Asn Gin Asp Ser His Ser Pro Leu Asp Gin Glu Ala Thr Leu Arg 425 430 435
Asn Arg Val Arg Ile Ala Glu Thr Trp Leu Gly Ser Phe Lys Glu 440 445 450
Thr Phe Tyr Lys His Ser Pro Glu Ala Phe Ser Leu Ser Lys Ala 455 460 465
Glu Lys Pro Asp Cys Met Glu Arg Leu Gin Leu Gin Arg Arg Leu 470 475 480
Gly Cys Arg Thr Phe His Trp Phe Leu Ala Asn Val Tyr Pro Glu 485 490 495
Leu Tyr Pro Ser Glu Pro Arg Pro Ser Phe Ser Gly Lys Leu His 500 505 510
Asn Thr Gly Leu Gly Leu Cys Ala Asp Cys Gin Ala Glu Gly Asp 515 520 525
Ile Leu Gly Cys Pro Met Val Leu Ala Pro Cys Ser Asp Ser Arg 530 535 540
801
Gin Gin Gin Tyr Leu Gin His Thr Ser Arg Lys Glu Ile His Phe 545 550 555
Gly Ser Pro
Ile Leu Gin
His Trp Asp
Gly Lys Cys
Tyr Leu Arg
Asp Gin Ile
<210> 348 <211> 23 <212> DNA <213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 348 ggagaggtgg tggccatgga cag 23
<210> 349 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 349 ctgtcactgc aaggagccaa cacc 24
<210> 350 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 350
tatgtcgctg cgaggtggtg aaaacctcga actgtctttc aaggc 45
<210> 351 <211> 2524 <212> DNA
<213> Homo sapiens <400> 351
cgccaagcat gcagtaaagg ctgaaaatct gggtcacagc tgaggaagac 50
Gin His Leu Cys Phe Ala Val Arg Gin Glu Gin Val 560 565 570
Asn Cys Thr Glu Glu Gly Leu Ala Ile His Gin Gin 575 580 585
Phe Gin Glu Asn Gly Met Ile Val His Ile Leu Ser 590 595 600
Met Glu Ala Val Val Gin Glu Asn Asn Lys Asp Leu 605 610 615
Pro Cys Asp Gly Lys Ala Arg Gin Gin Trp Arg Phe 620 625 630
Asn Ala Val Asp Glu Arg 635
802
ctcagacatg gagtccagga tgtggcctgc gctgctgctg tcccacctcc
100
tccctctctg gccactgctg ttgctgcccc tcccaccgcc tgctcagggc
150
tcttcatcct cccctcgaac cccaccagcc ccagcccgcc ccccgtgtgc
200
caggggaggc ccctcggccc cacgtcatgt gtgcgtgtgg gagcgagcac
250
ctccaccaag ccgatctcct cgggtcccaa gatcacgtcg gcaagtcctg
300
cctggcactg cacccccagc caccccatca ggctttgagg aggggccgcc
350
ctcatcccaa tacccctggg ctatcgtgtg gggtcccacc gtgtctcgag
400
aggatggagg ggaccccaac tctgccaatc ccggatttct ggactatggt
450
tttgcagccc ctcatgggct cgcaacccca caccccaact cagactccat
500
gcgaggtgat ggagatgggc ttatccttgg agaggcacct gccaccctgc
550
ggccattcct gttcgggggc cgtggggaag gtgtggaccc ccagctctat
600
gtcacaatta ccatctccat catcattgtt ctcgtggcca ctggcatcat
650
cttcaagttc tgctgggacc gcagccagaa gcgacgcaga ccctcagggc
700
agcaaggtgc cctgaggcag gaggagagcc agcagccact gacagacctg
750
tccccggctg gagtcactgt gctgggggcc ttcggggact cacctacccc
800
cacccctgac catgaggagc cccgaggggg accccggcct gggatgcccc
850
accccaaggg ggctccagcc ttccagttga accggtgagg gcaggggcaa
900
tgggatggga gggcaaagag ggaaggcaac ttaggtcttc agagctgggg
950
tgggggtgcc ctctggatgg gtagtgagga ggcaggcgtg gcctcccaca
1000
gcccctggcc ctcccaaggg ggctggacca gctcctctct gggaggcacc
1050
cttccttctc ccagtctctc aggatctgtg tcctattctc tgctgcccat
1100
aactccaact ctgccctctt tggttttttc tcatgccacc ttgtctaaga
1150
caactctgcc ctcttaacct tgattccccc tctttgtctt gaacttcccc
1200
ttctattctg gcctacccct tggttcctga ctgtgccctt tccctcttcc
1250
tctcaggatt cccctggtga atctgtgatg cccccaatgt tggggtgcag
1300
ccaagcagga ggccaagggg ccggcacagc ccccatccca ctgagggtgg
1350
ggcagctgtg gggagctggg gccacagggg ctcctggctc ctgccccttg
1400
cacaccaccc ggaacactcc ccagccccac gggcaatcct atctgctcgc
1450
cctcctgcag gtgggggcct cacatatctg tgacttcggg tccctgtccc
1500
cacccttgtg cactcacatg aaagccttgc acactcacct ccaccttcac
1550
aggccatttg cacacgctcc tgcaccctct ccccgtccat accgctccgc
1600
803
tcagctgact ctcatgttct ctcgtctcac atttgcactc tctccttccc
1650
acattctgtg ctcagctcac tcagtggtca gcgtttcctg cacactttac
1700
ctctcatgtg cgtttcccgg cctgatgttg tggtggtgtg cggcgtgctc
1750
actctctccc tcatgaacac ccacccacct cgtttccgca gcccctgcgt
1800
gctgctccag aggtgggtgg gaggtgagct gggggctcct tgggccctca
1850
tcggtcatgg tctcgtccca ttccacacca tttgtttctc tgtctcccca
1900
tcctactcca aggatgccgg catcaccctg agggctcccc cttgggaatg
1950
gggtagtgag gccccagact tcacccccag cccactgcta aaatctgttt
2000
tctgacagat gggttttggg gagtcgcctg ctgcactaca tgagaaaggg
2050
actcccattt gcccttccct ttctcctaca gtcccttttg tcttgtctgt
2100
cctggctgtc tgtgtgtgtg ccattctctg gacttcagag ccccctgagc
2150
cagtcctccc ttcccagcct ccctttgggc ctccctaact ccacctaggc
2200
tgccagggac cggagtcagc tggttcaagg ccatcgggag ctctgcctcc
2250
aagtctaccc ttcccttccc ggactccctc ctgtcccctc ctttcctccc
2300
tccttccttc cactctcctt ccttttgctt ccctgccctt tccccctcct
2350
caggttcttc cctccttctc actggttttt ccaccttcct ccttcccttc
2400
ttccctggct cctaggctgt gatatatatt tttgtattat ctctttcttc
2450
ttcttgtggt gatcatcttg aattactgtg ggatgtaagt ttcaaaattt
2500
tcaaataaag cctttgcaag ataa 2524
<210> 352 <211> 243 <212> PRT <213> Homo sapiens
<400> 352
Met Arg Pro Gin Gly Pro Ala Ala Ser Pro Gin Arg Leu Arg Gly 15 10 15
Leu Leu Leu Leu Leu Leu Leu Gin Leu Pro Ala Pro Ser Ser Ala 20 25 30
Ser Glu Ile Pro Lys Gly Lys Gin Lys Ala Gin Leu Arg Gin Arg 35 40 45
Glu Val Val Asp Leu Tyr Asn Gly Met Cys Leu Gin Gly Pro Ala 50 55 60
Gly Val Pro Gly Arg Asp Gly Ser Pro Gly Ala Asn Val Ile Pro 65 70 75
Gly Thr Pro Gly Ile Pro Gly Arg Asp Gly Phe Lys Gly Glu Lys 80 85 90
804
Gly Glu Cys Leu Arg Glu Ser Phe Glu Glu Ser Trp Thr Pro Asn 95 100 105
Tyr Lys Gin
Gly Lys Ile
Ala Leu Arg
Asn Ala Cys
Cys Ser Gly
Gly Ser Pro
Ser Val Glu
Val Ala Ile
Ala Ser Thr
Leu Pro Lys
<210> 353 <211> 480 <212> DNA <213> Homo s<
<400> 353
gttaaccagc gcagtcctcc gtgcgtcccg cccgccgctg ccctcactcc
50
cggccaggat ggcatcctgt ctggccctgc gcatggcgct gctgctggtc
100
tccggggttc tggcccctgc ggtgctcaca gacgatgttc cacaggagcc
150
cgtgcccacg ctgtggaacg agccggccga gctgccgtcg ggagaaggcc
200
ccgtggagag caccagcccc ggccgggagc ccgtggacac cggtccccca
250
gcccccaccg tcgcgccagg acccgaggac agcaccgcgc aggagcggct
300
ggaccagggc ggcgggtcgc tggggcccgg cgctatcgcg gccatcgtga
350
tcgccgccct gctggccacc tgcgtggtgc tggcgctcgt ggtcgtcgcg
400
ctgagaaagt tttctgcctc ctgaagcgaa taaaggggcc gcgcccggcc
450
gcggcgcgac tcggcaaaaa aaaaaaaaaa
480
<210> 354 <211> 121 <212> PRT
Cys Ser Trp Ser Ser Leu Asn Tyr Gly Ile Asp Leu 110 115 120
Ala Glu Cys Thr Phe Thr Lys Met Arg Ser Asn Ser 125 130 135
Val Leu Phe Ser Gly Ser Leu Arg Leu Lys Cys Arg 140 145 150
Cys Gin Arg Trp Tyr Phe Thr Phe Asn Gly Ala Glu 155 160 165
Pro Leu Pro Ile Glu Ala Ile Ile Tyr Leu Asp Gin 170 175 180
Glu Met Asn Ser Thr Ile Asn Ile His Arg Thr Ser 185 190 195
Gly Leu Cys Glu Gly Ile Gly Ala Gly Leu Val Asp 200 205 210
Trp Val Gly Thr Cys Ser Asp Tyr Pro Lys Gly Asp 215 220 225
Gly Trp Asn Ser Val Ser Arg Ile Ile Ile Glu Glu 230 235 240
apiens
<213> Homo sapiens
<400> 354
Met Ala Ser Cys Leu 1 5
Gly Val Leu Ala Pro 20
Pro Val Pro Thr Leu 35
Glu Gly Pro Val Glu 50
Thr Gly Pro Pro Ala 65
Thr Ala Gin Glu Arg 80
Gly Ala Ile Ala Ala 95
Val Val Leu Ala Leu 110
Ser
<210> 355 <211> 2134 <212> DNA <213> Homo sapiens
<400> 355
ggccgttggt tggtgcgcgg ctgaagggtg tggcgcgagc agcgtcgttg 50
gttggccggc ggcgggccgg gacgggcatg gccctgctgc tgtgcctggt 100
gtgcctgacg gcggcgctgg cccacggctg tctgcactgc cacagcaact 150
tctccaagaa gttctccttc taccgccacc atgtgaactt caagtcctgg 200
tgggtgggcg acatccccgt gtcaggggcg ctgctcaccg actggagcga 2 50
cgacacgatg aaggagctgc acctggccat ccccgccaag atcacccggg 300
agaagctgga ccaagtggcg acagcagtgt accagatgat ggatcagctg 350
taccagggga agatgtactt ccccgggtat ttccccaacg agctgcgaaa 400
catcttccgg gagcaggtgc acctcatcca gaacgccatc atcgaaaggc 450
acctggcacc aggcagctgg ggaggagggc agctctccag ggagggaccc 500
agcctagcac ctgaaggatc aatgccatca ccccgcgggg acctccccta 550
agtagccccc agaggcgctg ggagtgttgc caccgccctc ccctgaagtt 600
tgctccatct cacgctgggg gtcaacctgg ggaccccttc cctccgggcc 650 atggacacac atacatgaaa accaggccgc atcgactgtc agcaccgctg 70 0
805
Ala Leu Arg Met Ala Leu Leu Leu Val Ser 10 15
Ala Val Leu Thr Asp Asp Val Pro Gin Glu 25 30
Trp Asn Glu Pro Ala Glu Leu Pro Ser Gly 40 45
Ser Thr Ser Pro Gly Arg Glu Pro Val Asp 55 60
Pro Thr Val Ala Pro Gly Pro Glu Asp Ser 70 75
Leu Asp Gin Gly Gly Gly Ser Leu Gly Pro 85 90
Ile Val Ile Ala Ala Leu Leu Ala Thr Cys 100 105
Val Val Val Ala Leu Arg Lys Phe Ser Ala 115 120
806
tggcatcttc cagtacgaga ccatctcctg caacaactgc acagactcgc
750
acgtcgcctg ctttggctat aactgcgagt agggctcagg catcacaccc
800
acccgtgcca gggccctact gtccctgggg tcccaggctc tccttggagg
850
gggctccccg ccttccacct ggctgtcatc gggtagggcg gggccgtggg
900
ttcaggggcg caccacttcc aagcctgtgt cccacaggtc ctcggcgcag
950
tggaagtcag ctgtccaggg cctcctgaac tacataaata actggcacaa
1000
gtaagtcccc tcctcaaacc aacacaggca gtgtgtgtat gtgagcacct
1050
cgtgggtgag tatgtgtggg gcacaggctg gctccctcag ctcccacgtc
1100
ctagaggggc tcccgaggag gtggaacctc aacccagctc tgcgcaggag
1150
gcggctgcag tccttttctc cctcaaaggt ctccgaccct cagctggagg
1200
cgggcatctt tcctaaaggg tccccatagg gtctggttcc accccatccc
1250
aggtctgtgg tcagagcctg ggagggttcc ctacgatggt taggggtgcc
1300
ccatggaggg gctgactgcc ccacattgcc tttcagacag gacacgagca
1350
tgaggtaagg ccgccctgac ctggacttca gggggagggg gtaaagggag
1400
agaggagggg ggctaggggg tcctctagat cagtgggggc actgcaggtg
1450
gggctctccc tatacctggg acacctgctg gatgtcacct ctgcaaccac
1500
acccatgtgg tggtttcatg aacagaccac gctcctctgc cttctcctgg
1550
cctgggacac acagagccac cccggccttg tgagtgaccc agagaaggga
1600
ggcctcggga gaaggggtgc tcgtaagcca acaccagcgt gccgcggcct
1650
gcacaccctt cggacatccc aggcacgagg gtgtcgtgga tgtggccaca
1700
cataggacca cacgtcccag ctgggaggag aggcctgggg cccccaggga
1750
gggaggcagg gggtggggga catggagagc tgaggcagcc tcgtctcccc
1800
gcagcctggt atcgccagcc ttaaggtgtc tggagccccc acacttggcc
1850
aacctgacct tggaagatgc tgctgagtgt ctcaagcagc actgacagca
1900
gctgggcctg ccccagggca acgtgggggc ggagactcag ctggacagcc
1950
cctgcctgtc actctggagc tgggctgctg ctgcctcagg accccctctc
2000
cgaccccgga cagagctgag ctggccaggg ccaggagggc gggagggagg
2050
gaatgggggt gggctgtgcg cagcatcagc gcctgggcag gtccgcagag
2100
ctgcgggatg tgattaaagt ccctgatgtt tctc 2134
<210> 356 <211> 157 <212> PRT
807
<213> Homo sapiens <400> 356
Met Ala Leu Leu Leu Cys Leu Val Cys Leu Thr Ala Ala Leu Ala 15 10 15
His Gly Cys Leu His Cys His Ser Asn Phe Ser Lys Lys Phe Ser 20 25 30
Phe Tyr Arg His His Val Asn Phe Lys Ser Trp Trp Val Gly Asp 35 40 45
Ile Pro Val Ser Gly Ala Leu Leu Thr Asp Trp Ser Asp Asp Thr 50 55 60
Met Lys Glu Leu His Leu Ala Ile Pro Ala Lys Ile Thr Arg Glu 65 70 75
Lys Leu Asp Gin Val Ala Thr Ala Val Tyr Gin Met Met Asp Gin 80 85 90
Leu Tyr Gin Gly Lys Met Tyr Phe Pro Gly Tyr Phe Pro Asn Glu 95 100 105
Leu Arg Asn Ile Phe Arg Glu Gin Val His Leu Ile Gin Asn Ala 110 115 120
Ile Ile Glu Arg His Leu Ala Pro Gly Ser Trp Gly Gly Gly Gin 125 130 135
Leu Ser Arg Glu Gly Pro Ser Leu Ala Pro Glu Gly Ser Met Pro 140 145 150
Ser Pro Arg Gly Asp Leu Pro 155
<210> 357 <211> 1536 <212> DNA
<213> Homo sapiens <400> 357
agcaggagca ggagagggac aatggaagct gccccgtcca ggttcatgtt
50
cctcttattt ctcctcacgt gtgagctggc tgcagaagtt gctgcagaag
100
ttgagaaatc ctcagatggt cctggtgctg cccaggaacc cacgtggctc
150
acagatgtcc cagctgccat ggaattcatt gctgccactg aggtggctgt
200
cataggcttc ttccaggatt tagaaatacc agcagtgccc atactccata
250
gcatggtgca aaaattccca ggcgtgtcat ttgggatcag cactgattct
300
gaggttctga cacactacaa catcactggg aacaccatct gcctctttcg
350
cctggtagac aatgaacaac tgaatttaga ggacgaagac attgaaagca
400
ttgatgccac caaattgagc cgtttcattg agatcaacag cctccacatg
450
gtgacagagt acaaccctgt gactgtgatt gggttattca acagcgtaat
500
808
tcagattcat agaacatgca ctcctcctga cagataccag taatgaacaa aaggcagcca ggcctcccca agctcttcca gagtatgaag ggggaagatt
550 600
ctctttattc tggtggacag tggtatgaaa gaaaatggga aggtgatatc
650
atttttcaaa ctaaaggagt ctcaactgcc agctttggca atttaccaga
700
ctctagatga cgagtgggat acactgccca cagcagaagt ttccgtagag
750
catgtgcaaa acttttgtga tggattccta agtggaaaat tgttgaaaga
800
aaatcgtgaa tcagaaggaa agactccaaa ggtggaactc tgacttctcc
850
ttggaactac atatggccaa gtatctactt tatgcaaagt aaaaaggcac
900
aactcaaatc tcagagacac taaacaacag gatcactagg cctgccaacc
950
acacacacac gcacgtgcac acacgcacgc acgcgtgcac acacacacgc
1000
gcacacacac acacacacag agcttcattt cctgtcttaa aatctcgttt
1050
tctcttcttc cttcttttaa atttcatatc ctcactccct atccaatttc
1100
cttcttatcg tgcattcata ctctgtaagc ccatctgtaa cacacctaga
1150
tcaaggcttt aagagactca ctgtgatgcc tctatgaaag agaggcattc
1200
ctagagaaag attgttccaa tttgtcattt aatatcaagt ttgtatactg
1250
cacatgactt acacacaaca tagttcctgc tcttttaagg ttacctaagg
1300
gttgaaactc taccttcttt cataagcaca tgtccgtctc tgactcagga
1350
tcaaaaacca aaggatggtt ttaaacacct ttgtgaaatt gtctttttgc
1400
cagaagttaa aggctgtctc caagtccctg aactcagcag aaatagacca
1450
tgtgaaaact ccatgcttgg ttagcatctc caactcccta tgtaaatcaa
1500
caacctgcat aataaataaa aggcaatcat gttata 1536
<210> 358 <211> 273 <212> PRT
<213> Homo sapiens <400> 358
Met Glu Ala Ala Pro Ser Arg Phe Met Phe Leu Leu Phe Leu Leu 15 10 15
Thr Cys Glu Leu Ala Ala Glu Val Ala Ala Glu Val Glu Lys Ser 20 25 30
Ser Asp Gly Pro Gly Ala Ala Gin Glu Pro Thr Trp Leu Thr Asp 35' 40 45
Val Pro Ala Ala Met Glu Phe Ile Ala Ala Thr Glu Val Ala Val 50 55 60
Ile Gly Phe Phe Gin Asp Leu Glu Ile Pro Ala Val Pro Ile Leu 65 70 75
809
His Ser Met Val Gin. Lys Phe Pro Gly Val Ser Phe Gly Ile Ser
80 85 90
Thr Asp Ser Glu Val Leu Thr His Tyr Asn Ile Thr Gly Asn Thr
95 100 105
Ile Cys Leu Phe Arg Leu Val Asp Asn Glu Gin Leu Asn Leu Glu 110 115 120
Asp Glu Asp lie Glu Ser Ile Asp Ala Thr Lys Leu Ser Arg Phe 125 130 135
Ile Glu Ile Asn Ser Leu His Met Val Thr Glu Tyr Asn Pro Val 140 145 150
Thr Val Ile Gly Leu Phe Asn Ser Val Ile Gin Ile His Leu Leu 155 160 165
Leu Ile Met Asn Lys Ala Ser Pro Glu Tyr Glu Glu Asn Met His 170 175 180
Arg Tyr Gin Lys Ala Ala Lys Leu Phe Gin Gly Lys Ile Leu Phe 185 190 195
Ile Leu Val Asp Ser Gly Met Lys Glu Asn Gly Lys Val Ile Ser 200 205 210
Phe Phe Lys Leu Lys Glu Ser Gin Leu Pro Ala Leu Ala Ile Tyr 215 220 225
Gin Thr Leu Asp Asp Glu Trp Asp Thr Leu Pro Thr Ala Glu Val 230 235 240
Ser Val Glu His Val Gin Asn Phe Cys Asp Gly Phe Leu Ser Gly 245 250 255
Lys Leu Leu Lys Glu Asn Arg Glu Ser Glu Gly Lys Thr Pro Lys 260 265 270
Val Glu Leu
<210> 359 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 359 ccagcagtgc ccatactcca tagc 24
<210> 360 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 360 tgacgagtgg gatacactgc 20
810
<210> 361 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 361 gctctacgga aacttctgct gtgg 24
<210> 362 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 362
attcccaggc gtgtcatttg ggatcagcac tgattctgag gttctgacac 50
<210> 363
<211> 1777
<212> DNA
<213> Homo sapiens
<400> 363
ggagagccgc ggctgggacc ggagtgggga gcgcggcgtg gaggtgccac
50
ccggcgcggg tggcggagag atcagaagcc tcttccccaa gccgagccaa
100
cctcagcggg gacccgggct cagggacgcg gcggcggcgg cggcgactgc
150
agtggctgga cgatggcagc gtccgccgga gccggggcgg tgattgcagc
200
cccagacagc cggcgctggc tgtggtcggt gctggcggcg gcgcttgggc
250
tcttgacagc tggagtatca gccttggaag tatatacgcc aaaagaaatc
300
ttcgtggcaa atggtacaca agggaagctg acctgcaagt tcaagtctac
350
tagtacgact ggcgggttga cctcagtctc ctggagcttc cagccagagg
400
gggccgacac tactgtgtcg tttttccact actcccaagg gcaagtgtac
450
cttgggaatt atccaccatt taaagacaga atcagctggg ctggagacct
500
tgacaagaaa gatgcatcaa tcaacataga aaatatgcag tttatacaca
550
atggcaccta tatctgtgat gtcaaaaacc ctcctgacat cgttgtccag
600
cctggacaca ttaggctcta tgtcgtagaa aaagagaatt tgcctgtgtt
650
tccagtttgg gtagtggtgg gcatagttac tgctgtggtc ctaggtctca
700
ctctgctcat cagcatgatt ctggctgtcc tctatagaag gaaaaactct
750
aaacgggatt acactggctg cagtacatca gagagtttgt caccagttaa
800
gcaggctcct cggaagtccc cctccgacac tgagggtctt gtaaagagtc
850
811
tgccttctgg atctcaccag ggcccagtca tatatgcaca gttagaccac
900
tccggcggac tgcggatatc atcacagtga cgaaagaatt caagattaac aagagaatac aagtcagagt ctagaacata ctgtggtgta tcctcagcaa
950 1000
gaaacaaaac caaactggac tctcgtgcag aaaatgtagc ccattaccac
1050
atgtagcctt ggagacccag gcaaggacaa gtacacgtgt actcacagag
1100
ggagagaaag atgtgtacaa aggatatgta taaatattct atttagtcat
1150
cctgatatga ggagccagtg ttgcatgatg aaaagatggt atgattctac
1200
atatgtaccc attgtcttgc tgtttttgta ctttcttttc aggtcattta
1250
caattgggag atttcagaaa cattcctttc accatcattt agaaatggtt
1300
tgccttaatg gagacaatag cagatcctgt agtatttcca gtagacatgg
1350
ccttttaatc taagggctta agactgatta gtcttagcat ttactgtagt
1400
tggaggatgg agatgctatg atggaagcat acccagggtg gcctttagca
1450
cagtatcagt accatttatt tgtctgccgc ttttaaaaaa tacccattgg
1500
ctatgccact tgaaaacaat ttgagaagtt tttttgaagt ttttctcact
1550
aaaatatggg gcaattgtta gccttacatg ttgtgtagac ttactttaag
1600
tttgcaccct tgaaatgtgt catatcaatt tctggattca taatagcaag
1650
attagcaaag gataaatgcc gaaggtcact tcattctgga cacagttgga
1700
tcaatactga ttaagtagaa aatccaagct ttgcttgaga acttttgtaa
1750
cgtggagagt aaaaagtatc g<
<210> 364 <211> 269 <212> PRT
<213> Homo sapiens
<400> 364 Met Ala Ala Ser Ala Gly 1 5
Ser Arg Arg Trp Leu Trp 20
Leu Thr Ala Gly Val Ser 35
Ile Phe Val Ala Asn Gly 50
Lys Ser Thr Ser Thr Thr 65
Phe Gin Pro Glu Gly Ala 80
gtttta 1777
Ala Gly Ala Val Ile Ala Ala Pro Asp 10 15
Ser Val Leu Ala Ala Ala Leu Gly Leu 25 30
Ala Leu Glu Val Tyr Thr Pro Lys Glu 40 45
Thr Gin Gly Lys Leu Thr Cys Lys Phe 55 60
Gly Gly Leu Thr Ser Val Ser Trp Ser 70 75
Asp Thr Thr Val Ser Phe Phe His Tyr 85 90
812
Ser Gin Gly Gin Val Tyr Leu Gly Asn Tyr Pro Pro Phe Lys Asp
95 100 105
Arg Ile Ser Trp Ala Gly Asp Leu Asp Lys Lys Asp Ala Ser Ile
110 115 120
Asn Ile Glu Asn Met Gin Phe Ile His Asn Gly Thr Tyr Ile Cys
125 130 135
Asp Val Lys Asn Pro Pro Asp Ile Val Val Gin Pro Gly His Ile
140 145 150
Arg Leu Tyr Val Val Glu Lys Glu Asn Leu Pro Val Phe Pro Val
155 160 165
Trp Val Val Val Gly Ile Val Thr Ala Val Val Leu Gly Leu Thr
170 175 180
Leu Leu Ile Ser Met Ile Leu Ala Val Leu Tyr Arg Arg Lys Asn
185 190 195
Ser Lys Arg Asp Tyr Thr Gly Cys Ser Thr Ser Glu Ser Leu Ser
200 205 210
Pro Val Lys Gin Ala Pro Arg Lys Ser Pro Ser Asp Thr Glu Gly
215 220 225
Leu Val Lys Ser Leu Pro Ser Gly Ser His Gin Gly Pro Val Ile
230 235 240
Tyr Ala Gin Leu Asp His Ser Gly Gly His His Ser Asp Lys Ile
245 250 255
Asn Lys Ser Glu Ser Val Val Tyr Ala Asp Ile Arg Lys Asn 260 265
<210> 365 <211> 1321 <212> DNA
<213> Homo sapiens <400> 365
gccggctgtg cagagacgcc atgtaccggc tcctgtcagc agtgactgcc
50
cgggctgccg cccccggggg cttggcctca agctgcggac gacgcggggt
100
ccatcagcgc gccgggctgc cgcctctcgg ccacggctgg gtcgggggcc
150
tcgggctggg gctggggctg gcgctcgggg tgaagctggc aggtgggctg
200
aggggcgcgg ccccggcgca gtcccccgcg gcccccgacc ctgaggcgtc
250
gcctctggcc gagccgccac aggagcagtc cctcgccccg tggtctccgc
300
agaccccggc gccgccctgc tccaggtgct tcgccagagc catcgagagc
350
agccgcgacc tgctgcacag gatcaaggat gaggtgggcg caccgggcat
400
agtggttgga gtttctgtag atggaaaaga agtctggtca gaaggtttag
450
gttatgctga tgttgagaac cgtgtaccat gtaaaccaga gacagttatg
500
cgaattgcta gcatcagcaa aagtctcacc atggttgctc ttgccaaatt
550
813
gtgggaagca gggaaactgg atcttgatat tccagtacaa cattatgttc
600
ccgaattccc agaaaaagaa tatgaaggtg aaaaggtttc tgtcacaaca
650
agattactga aaaaaaggtg tttcccattt aaagaagaga aagtggaatt aagcttataa cgtcattatg agccttgaag aaaaggacat atgatgaaag
700 750
agaatgttgc atttgagcaa gaaaaagaag gcaaaagtaa tgaaaagaat
800
gattttacta aatttaaaac agagcaggag aatgaagcca aatgccggaa
850
ttcaaaacct ggcaagaaaa agaatgattt tgaacaaggc gaattatatt
900
tgagagaaaa gtttgaaaat tcaattgaat ccctaagatt atttaaaaat
950
gatcctttgt tcttcaaacc tggtagtcag tttttgtatt caacttttgg
1000
ctatacccta ctggcagcca tagtagagag agcttcagga tgtaaatatt
1050
tggactatat gcagaaaata ttccatgact tggatatgct gacgactgtg
1100
caggaagaaa acgagccagt gatttacaat agagcaaggt aaatgaatac
1150
cttctgctgt gtctagctat atcgcatctt aacactattt tattaattaa
1200
aagtcaaatt ttctttgttt ccattccaaa atcaacctgc cacattttgg
1250
gagcttttct acatgtctgt tttctcatct gtaaagtgaa ggaagtaaaa
1300
catgtttata aagtaaaaaa a 1321
<210> 366 <211> 373 <212> PRT
<213> Homo sapiens <400> 366
Met Tyr Arg Leu Leu Ser Ala Val Thr Ala Arg Ala Ala Ala Pro 15 10 15
Gly Gly Leu Ala Ser Ser Cys Gly Arg Arg Gly Val His Gin Arg 20 25 30
Ala Gly Leu Pro Pro Leu Gly His Gly Trp Val Gly Gly Leu Gly 35 40 45
Leu Gly Leu Gly Leu Ala Leu Gly Val Lys Leu Ala Gly Gly Leu 50 55 60
Arg Gly Ala Ala Pro Ala Gin Ser Pro Ala Ala Pro Asp Pro Glu 65 70 75
Ala Ser Pro Leu Ala Glu Pro Pro Gin Glu Gin Ser Leu Ala Pro 80 85 90
Trp Ser Pro Gin Thr Pro Ala Pro Pro Cys Ser Arg Cys Phe Ala 95 100 105
Arg Ala Ile Glu Ser Ser Arg Asp Leu Leu His Arg Ile Lys Asp 110 115 120
814
Glu Val Gly Ala Pro Gly Ile Val Val Gly Val Ser Val Asp Gly 125 130 135
Lys Glu Val Trp Ser Glu Gly Leu Gly Tyr Ala Asp Val Glu Asn 140 145 150
Arg Val Pro Cys Lys Pro Glu Thr Val Met Arg Ile Ala Ser Ile 155 160 165
Ser Lys Ser Leu Thr Met Val Ala Leu Ala Lys Leu Trp Glu Ala 170 175 180
Gly Lys Leu Asp Leu Asp Ile Pro Val Gin His Tyr Val Pro Glu 185 190 195
Phe Pro Glu Lys Glu Tyr Glu Gly Glu Lys Val Ser Val Thr Thr 200 205 210
Arg Leu Leu Ile Ser His Leu Ser Gly Ile Arg His Tyr Glu Lys 215 220 225
Asp Ile Lys Lys Val Lys Glu Glu Lys Ala Tyr Lys Ala Leu Lys 230 235 240
Met Met Lys Glu Asn Val Ala Phe Glu Gin Glu Lys Glu Gly Lys 245 250 255
Ser Asn Glu Lys Asn Asp Phe Thr Lys Phe Lys Thr Glu Gin Glu 260 265 270
Asn Glu Ala Lys Cys Arg Asn Ser Lys Pro Gly Lys Lys Lys Asn 275 280 285
Asp Phe Glu Gin Gly Glu Leu Tyr Leu Arg Glu Lys Phe Glu Asn 290 295 300
Ser Ile Glu Ser Leu Arg Leu Phe Lys Asn Asp Pro Leu Phe Phe 305 310 315
Lys Pro Gly Ser Gin Phe Leu Tyr Ser Thr Phe Gly Tyr Thr Leu 320 325 330
Leu Ala Ala Ile Val Glu Arg Ala Ser Gly Cys Lys Tyr Leu Asp 335 340 345
Tyr Met Gin Lys Ile Phe His Asp Leu Asp Met Leu Thr Thr Val 350 355 360
Gin Glu Glu Asn Glu Pro Val Ile Tyr Asn Arg Ala Arg 365 370
<210> 367 <211> 30 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 367 tggaaaagaa gtctggtcag aaggtttagg 30
815
<210> 368 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 368 catttggctt cattctcctg ctctg 25
<210> 369 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 369 aaaacctcag aacaactcat tttgcacc 28
<210> 370 ,
<211> 41 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 370
gtctcaccat ggttgctctt gccaaattgt gggaagcagg g 41
<210> 371
<211> 1150
<212> DNA
<213> Homo sapiens
<400> 371
gtgacactat agaagagcta tgacgtcgca tgcacgcgta cgtaagctcg
50
gaattcggct cgaggctggt gggaagaagc cgagatggcg gcagccagcg
100
ctggggcaac ccggctgctc ctgctcttgc tgatggcggt agcagcgccc
150
agtcgagccc ggggcagcgg ctgccgggcc gggactggtg cgcgaggggc
200
tggggcggaa ggtcgagagg gcgaggcctg tggcacggtg gggctgctgc
250
tggagcactc atttgagatc gatgacagtg ccaacttccg gaagcggggc
300
tcactgctct ggaaccagca ggatggtacc ttgtccctgt cacagcggca
350
gctcagcgag gaggagcggg gccgactccg ggatgtggca gccctgaatg
400
gcctgtaccg ggtccggatc ccaaggcgac ccggggccct ggatggcctg
450
gaagctggtg gctatgtctc ctcctttgtc cctgcgtgct ccctggtgga
500
gtcgcacctg tcggaccagc tgaccctgca cgtggatgtg gccggcaacg
550
tggtgggcgt gtcggtggtg acgcaccccg ggggctgccg gggccatgag
600
816
gtggaggacg tggacctgga gctgttcaac acctcggtgc agctgcagcc 650
gcccaccaca gccccaggcc ctgagacggc ggccttcatt gagcgcctgg 700
agatggaaca ggcccagaag gccaagaacc cccaggagca gaagtccttc 750
ttcgccaaat actggatgta catcattccc gtcgtcctgt tcctcatgat 800
gtcaggagcg ccagacaccg ggggccaggg tgggggtggg ggtgggggtg 850
gtggtggggg tagtggcctt tgctgtgtgc caccctccct gtaagtctat 900
ttaaaaacat cgacgataca ttgaaatgtg tgaacgtttt gaaaagctac 950
agcttccagc agccaaaagc aactgttgtt ttggcaagac ggtcctgatg 1000
tacaagcttg attgaaattc actgctcact tgatacgtta ttcagaaacc 1050
caaggaatgg ctgtccccat cctcatgtgg ctgtgtggag ctcagctgtg 1100
ttgtgtggca gtttattaaa ctgtccccca gatcgacacg caaaaaaaaa 1150
<210> 372 <211> 269 <212> PRT
<213> Homo sapiens <400> 372
Met Ala Ala Ala Ser Ala Gly Ala Thr Arg Leu Leu Leu Leu Leu 15 10 15
Leu Met Ala Val Ala Ala Pro Ser Arg Ala Arg Gly Ser Gly Cys 20 25 30
Arg Ala Gly Thr Gly Ala Arg Gly Ala Gly Ala Glu Gly Arg Glu 35 40 45
Gly Glu Ala Cys Gly Thr Val Gly Leu Leu Leu Glu His Ser Phe 50 55 60
Glu Ile Asp Asp Ser Ala Asn Phe Arg Lys Arg Gly Ser Leu Leu 65 70 75
Trp Asn Gin Gin Asp Gly Thr Leu Ser Leu Ser Gin Arg Gin Leu 80 85 90
Ser Glu Glu Glu Arg Gly Arg Leu Arg Asp Val Ala Ala Leu Asn 95 100 105
Gly Leu Tyr Arg Val Arg Ile Pro Arg Arg Pro Gly Ala Leu Asp 110 115 120
Gly Leu Glu Ala Gly Gly Tyr Val Ser Ser Phe Val Pro Ala Cys 125 130 135
Ser Leu Val Glu Ser His Leu Ser Asp Gin Leu Thr Leu His Val 140 145 150
Asp Val Ala Gly Asn Val Val Gly Val Ser Val Val Thr His Pro 155 160 165
817
Gly Gly Cys Arg Gly His Glu Val Glu Asp Val Asp Leu Glu Leu 170 175 180
Phe Asn Thr Ser Val Gin Leu Gin Pro Pro Thr Thr Ala Pro Gly 185 190 195
Pro Glu Thr Ala Ala Phe Ile Glu Arg Leu Glu Met Glu Gin Ala 200 205 210
Gin Lys Ala Lys Asn Pro Gin Glu Gin Lys Ser Phe Phe Ala Lys 215 220 225
Tyr Trp Met Tyr Ile Ile Pro Val Val Leu Phe Leu Met Met Ser 230 • 235 240
Gly Ala Pro Asp Thr Gly Gly Gin Gly Gly Gly Gly Gly Gly Gly 245 250 255
Gly Gly Gly Gly Ser Gly Leu Cys Cys Val Pro Pro Ser Leu 260 265
<210> 373
<211> 1706
<212> DNA
<213> Homo sapiens
<400> 373
ggagcgctgc tggaacccga gccggagccg gagccacagc ggggagggtg
50
gcctggcggc ctggagccgg acgtgtccgg ggcgtccccg cagaccgggg
100
cagcaggtcg tccgggggcc caccatgctg gtgactgcct accttgcttt
150
tgtaggcctc ctggcctcct gcctggggct ggaactgtca agatgccggg
200
ctaaaccccc tggaagggcc tgcagcaatc cctccttcct tcggtttcaa
250
ctggacttct atcaggtcta cttcctggcc ctggcagctg attggcttca
300
ggccccctac ctctataaac tctaccagca ttactacttc ctggaaggtc
350
aaattgccat cctctatgtc tgtggccttg cctctacagt cctctttggc
400
ctagtggcct cctcccttgt ggattggctg ggtcgcaaga attcttgtgt
450
cctcttctcc ctgacttact cactatgctg cttaaccaaa ctctctcaag
500
actactttgt gctgctagtg gggcgagcac ttggtgggct gtccacagcc
550
ctgctcttct cagccttcga ggcctggtat atccatgagc acgtggaacg
600
gcatgacttc cctgctgagt ggatcccagc tacctttgct cgagctgcct
650
tctggaacca tgtgctggct gtagtggcag gtgtggcagc tgaggctgta
700
gccagctgga tagggctggg gcctgtagcg ccctttgtgg ctgccatccc
750
tctcctggct ctggcagggg ccttggccct tcgaaactgg ggggagaact
800
atgaccggca gcgtgccttc tcaaggacct gtgctggagg cctgcgctgc
850
ctcctgtcgg accgccgcgt gctgctgctg ggcaccatac aagctctatt
900
818
tgagagtgtc atcttcatct ttgtcttcct ctggacacct gtgctggacc
950
cacacggggc ccctctgggc attatcttct ccagcttcat ggcagccagc
1000
ctgcttggct cttccctgta ccgtatcgcc acctccaaga ggtaccacct
1050
tcagcccatg cacctgctgt cccttgctgt gctcatcgtc gtcttctctc
1100
tcttcatgtt gactttctct accagcccag gccaggagag tccggtggag
1150
tccttcatag cctttctact tattgagttg gcttgtggat tatactttcc
1200
cagcatgagc ttcctacgga gaaaggtgat ccctgagaca gagcaggctg
1250
gtgtactcaa ctggttccgg gtacctctgc actcactggc ttgcctaggg
1300
ctccttgtcc tccatgacag tgatcgaaaa acaggcactc ggaatatgtt
1350
cagcatttgc tctgctgtca tggtgatggc tctgctggca gtggtgggac
1400
tcttcaccgt ggtaaggcat gatgctgagc tgcgggtacc ttcacctact
1450
gaggagccct atgcccctga gctgtaaccc cactccagga caagatagct
1500
gggacagact cttgaattcc agctatccgg gattgtacag atctctctgt
1550
gactgacttt gtgactgtcc tgtggtttct cctgccattg ctttgtgttt
1600
gggaggacat gatgggggtg atggactgga aagaaggtgc caaaagttcc
1650
ctctgtgtta ctcccattta gaaaataaac acttttaaat gatcaaaaaa
1700
aaaaaa 1706
<210> 374 <211> 450 <212> PRT
<213> Homo sapiens <400> 374
Met Leu Val Thr Ala Tyr Leu Ala Phe Val Gly Leu Leu Ala Ser 15 10 15
Cys Leu Gly Leu Glu Leu Ser Arg Cys Arg Ala Lys Pro Pro Gly 20 25 30
Arg Ala Cys Ser Asn Pro Ser Phe Leu Arg Phe Gin Leu Asp Phe 35 40 45
Tyr Gin Val Tyr Phe Leu Ala Leu Ala Ala Asp Trp Leu Gin Ala 50 55 60
Pro Tyr Leu Tyr Lys Leu Tyr Gin His Tyr Tyr Phe Leu Glu Gly 65 70 75
Gin Ile Ala Ile Leu Tyr Val Cys Gly Leu Ala Ser Thr Val Leu 80 85 90
Phe Gly Leu Val Ala Ser Ser Leu Val Asp Trp Leu Gly Arg Lys 95 100 105
819
Asn Ser Cys Val Leu Phe Ser Leu Thr Tyr Ser Leu Cys Cys Leu 110 115 120
Thr Lys Leu Ser Gin Asp Tyr Phe Val Leu Leu Val Gly Arg Ala 125 130 135
Leu Gly Gly Leu Ser Thr Ala Leu Leu Phe Ser Ala Phe Glu Ala 140 145 150
Trp Tyr Ile His Glu His Val Glu Arg His Asp Phe Pro Ala Glu 155 160 165
Trp Ile Pro Ala Thr Phe Ala Arg Ala Ala Phe Trp Asn His Val 170 175 180
Leu Ala Val Val Ala Gly Val Ala Ala Glu Ala Val Ala Ser Trp 185 190 195
Ile Gly Leu Gly Pro Val Ala Pro Phe Val Ala Ala Ile Pro Leu 200 205 210
Leu Ala Leu Ala Gly Ala Leu Ala Leu Arg Asn Trp Gly Glu Asn 215 220 225
Tyr Asp Arg Gin Arg Ala Phe Ser Arg Thr Cys Ala Gly Gly Leu 230 235 240
Arg Cys Leu Leu Ser Asp Arg Arg Val Leu Leu Leu Gly Thr Ile 245 250 255
Gin Ala Leu Phe Glu Ser Val Ile Phe Ile Phe Val Phe Leu Trp 260 265 270
Thr Pro Val Leu Asp Pro His Gly Ala Pro Leu Gly Ile Ile Phe 275 280 285
Ser Ser Phe Met Ala Ala Ser Leu Leu Gly Ser Ser Leu Tyr Arg 290 295 300
Ile Ala Thr Ser Lys Arg Tyr His Leu Gin Pro Met His Leu Leu 305 310 315
Ser Leu Ala Val Leu Ile Val Val Phe Ser Leu Phe Met Leu Thr 320 325 330
Phe Ser Thr Ser Pro Gly Gin Glu Ser Pro Val Glu Ser Phe Ile 335 340 345
Ala Phe Leu Leu Ile Glu Leu Ala Cys Gly Leu Tyr Phe Pro Ser 350 355 360
Met Ser Phe Leu Arg Arg Lys Val Ile Pro Glu Thr Glu Gin Ala 365 370 375
Gly Val Leu Asn Trp Phe Arg Val Pro Leu His Ser Leu Ala Cys 380 385 390
Leu Gly Leu Leu Val Leu His Asp Ser Asp Arg Lys Thr Gly Thr 395 400 405
Arg Asn Met Phe Ser Ile Cys Ser Ala Val Met Val Met Ala Leu 410 415 420
820
Leu Ala Val Val Gly Leu Phe Thr Val Val Arg His Asp Ala Glu 425 430 435
Leu Arg Val Pro Ser Pro Thr Glu Glu Pro Tyr Ala Pro Glu Leu 440 445 450
<210> 375 <211> 1098 <212> DNA <213> Artificial
<400> 375
gcgacgcgcg gcggggcggc gagaggaaac gcggcgccgg gccgggcccg
50
gccctggaga tggtccccgg cgccgcgggc tggtgttgtc tcgtgctctg
100
gctccccgcg tgcgtcgcgg cccacggctt ccgtatccat gattatttgt
150
actttcaagt gctgagtcct ggggacattc gatacatctt cacagccaca
200
cctgccaagg actttggtgg tatctttcac acaaggtatg agcagattca
250
ccttgtcccc gctgaacctc cagaggcctg cggggaactc agcaacggtt
300
tcttcatcca ggaccagatt gctctggtgg agaggggggg ctgctccttc
350
ctctccaaga ctcgggtggt ccaggagcac ggcgggcggg cggtgatcat
400
ctctgacaac gcagttgaca atgacagctt ctacgtggag atgatccagg
450
acagtaccca gcgcacagct gacatccccg ccctcttcct gctcggccga
500
gacggctaca tgatccgccg ctctctggaa cagcatgggc tgccatgggc
550
catcatttcc atcccagtca atgtcaccag catccccacc tttgagctgc
600
tgcaaccgcc ctggaccttc tggtagaaga gtttgtccca cattccagcc
650
ataagtgact ctgagctggg aaggggaaac ccaggaattt tgctacttgg
700
aatttggaga tagcatctgg ggacaagtgg agccaggtag aggaaaaggg
750
tttgggcgtt gctaggctga aagggaagcc acaccactgg ccttcccttc
800
cccagggccc ccaagggtgt ctcatgctac aagaagaggc aagagacagg
850
ccccagggct tctggctaga acccgaaaca aaaggagctg aaggcaggtg
900
gcctgagagc catctgtgac ctgtcacact cacctggctc cagcctcccc
950
tacccagggt ctctgcacag tgaccttcac agcagttgtt ggagtggttt
1000
aaagagctgg tgtttgggga ctcaataaac cctcactgac tttttagcaa
1050
taaagcttct catcagggtt gcaaaaaaaa aaaaaaaaaa aaaaaaaa 1098
<210> 376 <211> 188 <212> PRT <213> Homo sapiens
821
<400> 376
Met Val Pro Gly Ala Ala Gly Trp Cys Cys Leu Val Leu Trp Leu 15 10 15
Pro Ala Cys Val Ala Ala His Gly Phe Arg Ile His Asp Tyr Leu 20 25 30
Tyr Phe Gin Val Leu Ser Pro Gly Asp Ile Arg Tyr Ile Phe Thr 35 40 45
Ala Thr Pro Ala Lys Asp Phe Gly Gly Ile Phe His Thr Arg Tyr 50 55 60
Glu Gin Ile His Leu Val Pro Ala Glu Pro Pro Glu Ala Cys Gly 65 70 75
Glu Leu Ser Asn Gly Phe Phe Ile Gin Asp Gin Ile Ala Leu Val 80 85 90
Glu Arg Gly Gly Cys Ser Phe Leu Ser Lys Thr Arg Val Val Gin 95 100 105
Glu His Gly Gly Arg Ala Val Ile Ile Ser Asp Asn Ala Val Asp 110 115 120
Asn Asp Ser Phe Tyr Val Glu Met Ile Gin Asp Ser Thr Gin Arg 125 130 135
Thr Ala Asp Ile Pro Ala Leu Phe Leu Leu Gly Arg Asp Gly Tyr 140 145 150
Met Ile Arg Arg Ser Leu Glu Gin His Gly Leu Pro Trp Ala lie 155 160 165
Ile Ser Ile Pro Val Asn Val Thr Ser lie Pro Thr Phe Glu Leu 170 175 180
Leu Gin Pro Pro Trp Thr Phe Trp 185
<210> 377 <211> 496 <212> DNA <213> Artificial
<220>
<221> unsure <222> 396
<223> unknown base <400> 377
tctgcctcca ctgctctgtg ctgggatcat ggaacttgca ctgctgtgtg 50 ggctggtggt gatggctggt gtgattccaa tccagggcgg gatcctgaac 100 ctgaacaaga tggtcaagca agtgactggg aaaatgccca tcctctccta 150 ctggccctac ggctgtcact gcggactagg tggcagaggc caacccaaag 2 00 atgccacgga ctggtgctgc cagacccatg actgctgcta tgaccacctg 250 aagacccagg ggtgcggcat ctacaaggac aacaacaaaa gcagcataca 300
822
ttgtatggat tctatctgga aaaaaaaaaa aaaaaaaaaa
<210> 378 <211> 116 <212> PRT <213> Homo sapiens
<400> 378
Met Glu Leu Ala Leu Leu Cys Gly Leu Val Val Met Ala Gly Val 15 10 15
Ile Pro Ile Gin Gly Gly Ile Leu Asn Leu Asn Lys Met Val Lys 20 25 30
Gin Val Thr Gly Lys Met Pro Ile Leu Ser Tyr Trp Pro Tyr Gly 35 40 45
Cys His Cys Gly Leu Gly Gly Arg Gly Gin Pro Lys Asp Ala Thr 50 55 60
Asp Trp Cys Cys Gin Thr His Asp Cys Cys Tyr Asp His Leu Lys 65 70 75
Thr Gin Gly Cys Gly Ile Tyr Lys Asp Asn Asn Lys Ser Ser Ile 80 85 90
His Cys Met Asp Leu Ser Gin Arg Tyr Cys Leu Met Ala Val Phe 95 100 105
Asn Val Ile Tyr Leu Glu Asn Glu Asp Ser Glu 110 115
<210> 379 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 379 ctgcctccac tgctctgtgc tggg 24
<210> 380 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 380 cagagcagtg gatgttcccc tggg 24
ttatctcaac gctattgttt aatggctgtg tttaatgtga 350
aaatgaggac tccgaataaa aagctattac tawttnaaaa 400
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 450 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 496
<210> 381
823
<211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 381
ctgaacaaga tggtcaagca agtgactggg aaaatgccca tcctc 45
<210> 382 <211> 764 <212> DNA <213> Homo sapiens
<400> 382
ctcgcttctt ccttctggat gggggcccag ggggcccagg agagtataaa
50
ggcgatgtgg agggtgcccg gcacaaccag acgcccagtc acaggcgaga
100
gccctgggat gcaccggcca gaggccatgc tgctgctgct cacgcttgcc
150
ctcctggggg gccccacctg ggcagggaag atgtatggcc ctggaggagg
200
caagtatttc agcaccactg aagactacga ccatgaaatc acagggctgc
250
gggtgtctgt aggtcttctc ctggtgaaaa gtgtccaggt gaaacttgga
300
gactcctggg acgtgaaact gggagcctta ggtgggaata cccaggaagt
350
caccctgcag ccaggcgaat acatcacaaa agtctttgtc gccttccaag
400
ctttcctccg gggtatggtc atgtacacca gcaaggaccg ctatttctat
450
tttgggaagc ttgatggcca gatctcctct gcctacccca gccaagaggg
500
gcaggtgctg gtgggcatct atggccagta tcaactcctt ggcatcaaga
550
gcattggctt tgaatggaat tatccactag aggagccgac cactgagcca
600
ccagttaatc tcacatactc agcaaactca cccgtgggtc gctagggtgg
650
ggtatggggc catccgagct gaggccatct gtgtggtggt ggctgatggt
700
actggagtaa ctgagtcggg acgctgaatc tgaatccacc aataaataaa
750
gcttctgcag aaaa 7 64
<210> 383 <211> 178 <212> PRT <213> Homo sapiens
<400> 383
Met His Arg Pro Glu Ala Met Leu Leu Leu Leu Thr Leu Ala Leu 15 10 15
Leu Gly Gly Pro Thr Trp Ala Gly Lys Met Tyr Gly Pro Gly Gly 20 25 30
Gly Lys Tyr Phe Ser Thr Thr Glu Asp Tyr Asp His Glu Ile Thr 35 40 45
824
Gly Leu Arg
Val Lys Leu
Gly Asn Thr
Lys Val Phe
Tyr Thr Ser
Gin Ile Ser
Gly Ile Tyr
Phe Glu Trp
Val Asn Leu
<210> 384 <211> 2379 <212> DNA <213> Homo s<
<400> 384
gctgagcgtg tgcgcggtac ggggctctcc tgccttctgg gctccaacgc 50
agctctgtgg ctgaactggg tgctcatcac gggaactgct gggctatgga 100
atacagatgt ggcagctcag gtagccccaa attgcctgga agaatacatc 150
atgtttttcg ataagaagaa attgtaggat ccagtttttt ttttaaccgc 200
cccctcccca ccccccaaaa aaactgtaaa gatgcaaaaa cgtaatatcc 2 50
atgaagatcc tattacctag gaagattttg atgttttgct gcgaatgcgg 300
tgttgggatt tatttgttct tggagtgttc tgcgtggctg gcaaagaata 350
atgttccaaa atcggtccat ctcccaaggg gtccaatttt tcttcctggg 400
tgtcagcgag ccctgactca ctacagtgca gctgacaggg gctgtcatgc 450
aactggcccc taagccaaag caaaagacct aaggacgacc tttgaacaat 500
acaaaggatg ggtttcaatg taattaggct actgagcgga tcagctgtag 550
cactggttat agcccccact gtcttactga caatgctttc ttctgccgaa 600
cgaggatgcc ctaagggctg taggtgtgaa ggcaaaatgg tatattgtga 650
atctcagaaa ttacaggaga taccctcaag tatatctgct ggttgcttag 700
Val Ser Val Gly Leu Leu Leu Val Lys Ser Val Gin 50 55 60
Gly Asp Ser Trp Asp Val Lys Leu Gly Ala Leu Gly 65 70 75
Gin Glu Val Thr Leu Gin Pro Gly Glu Tyr Ile Thr 80 85 90
Val Ala Phe Gin Ala Phe Leu Arg Gly Met Val Met 95 100 105
Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu Asp Gly 110 115 120
Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu Val 125 130 135
Gly Gin Tyr Gin Leu Leu Gly Ile Lys Ser Ile Gly 140 145 150
Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro 155 160 165
Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 170 175
apiens
825
gtttgtccct tcgctataac agccttcaaa aacttaagta taatcaattt
750
aaagggctca accagctcac ctggctatac cttgaccata accatatcag
800
caatattgac gaaaatgctt ttaatggaat acgcagactc aaagagctga
850
ttcttagttc caatagaatc tcctattttc ttaacaatac cttcagacct
900
gtgacaaatt tacggaactt ggatctgtcc tataatcagc tgcattctct
950
gggatctgaa cagtttcggg gcttgcggaa gctgctgagt ttacatttac
1000
ggtctaactc cctgagaacc atccctgtgc gaatattcca agactgccgc
1050
aacctggaac ttttggacct gggatataac cggatccgaa gtttagccag
1100
gaatgtcttt gctggcatga tcagactcaa agaacttcac ctggagcaca
1150
atcaattttc caagctcaac ctggcccttt ttccaaggtt ggtcagcctt
1200
cagaaccttt acttgcagtg gaataaaatc agtgtcatag gacagaccat
1250
gtcctggacc tggagctcct tacaaaggct tgatttatca ggcaatgaga
1300
tcgaagcttt cagtggaccc agtgttttcc agtgtgtccc gaatctgcag
1350
cgcctcaacc tggattccaa caagctcaca tttattggtc aagagatttt
1400
ggattcttgg atatccctca atgacatcag tcttgctggg aatatatggg
1450
aatgcagcag aaatatttgc tcccttgtaa actggctgaa aagttttaaa
1500
ggtctaaggg agaatacaat tatctgtgcc agtcccaaag agctgcaagg
1550
agtaaatgtg atcgatgcag tgaagaacta cagcatctgt ggcaaaagta
1600
ctacagagag gtttgatctg gccagggctc tcccaaagcc gacgtttaag
1650
cccaagctcc ccaggccgaa gcatgagagc aaaccccctt tgcccccgac
1700
ggtgggagcc acagagcccg gcccagagac cgatgctgac gccgagcaca
1750
tctctttcca taaaatcatc gcgggcagcg tggcgctttt cctgtccgtg
1800
ctcgtcatcc tgctggttat ctacgtgtca tggaagcggt accctgcgag
1850
catgaagcag ctgcagcagc gctccctcat gcgaaggcac aggaaaaaga
1900
aaagacagtc cctaaagcaa atgactccca gcacccagga attttatgta
1950
gattataaac ccaccaacac ggagaccagc gagatgctgc tgaatgggac
2000
gggaccctgc acctataaca aatcgggctc cagggagtgt gaggtatgaa
2050
ccattgtgat aaaaagagct cttaaaagct gggaaataag tggtgcttta
2100
ttgaactctg gtgactatca agggaacgcg atgccccccc tccccttccc
2150
tctccctctc actttggtgg caagatcctt ccttgtccgt tttagtgcat
2200
tcataatact ggtcattttc ctctcataca taatcaaccc attgaaattt
2250
826
aaataccaca atcaatgtga agcttgaact ccggtttaat ataataccta 2300
ttgtataaga ccctttactg attccattaa tgtcgcattt gttttaagat 23 50
aaaacttctt tcataggtaa aaaaaaaaa 2379
<210> 385 <211> 513 <212> PRT
<213> Homo sapiens <400> 385
Met Gly Phe Asn Val Ile Arg Leu Leu Ser Gly Ser Ala Val Ala 15 10 15
Leu Val Ile Ala Pro Thr Val Leu Leu Thr Met Leu Ser Ser Ala 20 25 30
Glu Arg Gly Cys Pro Lys Gly Cys Arg Cys Glu Gly Lys Met Val 35 40 45
Tyr Cys Glu Ser Gin Lys Leu Gin Glu Ile Pro Ser Ser Ile Ser 50 55 60
Ala Gly Cys Leu Gly Leu Ser Leu Arg Tyr Asn Ser Leu Gin Lys 65 70 75
Leu Lys Tyr Asn Gin Phe Lys Gly Leu Asn Gin Leu Thr Trp Leu 80 85 90
Tyr Leu Asp His Asn His Ile Ser Asn lie Asp Glu Asn Ala Phe 95 100 105
Asn Gly Ile Arg Arg Leu Lys Glu Leu Ile Leu Ser Ser Asn Arg 110 115 120
Ile Ser Tyr Phe Leu Asn Asn Thr Phe Arg Pro Val Thr Asn Leu 125 130 135
Arg Asn Leu Asp Leu Ser Tyr Asn Gin Leu His Ser Leu Gly Ser 140 145 150
Glu Gin Phe Arg Gly Leu Arg Lys Leu Leu Ser Leu His Leu Arg 155 160 165
Ser Asn Ser Leu Arg Thr Ile Pro Val Arg Ile Phe Gin Asp Cys 170 175 180
Arg Asn Leu Glu Leu Leu Asp Leu Gly Tyr Asn Arg Ile Arg Ser 185 190 195
Leu Ala Arg Asn Val Phe Ala Gly Met Ile Arg Leu Lys Glu Leu 200 205 210
His Leu Glu His Asn Gin Phe Ser Lys Leu Asn Leu Ala Leu Phe 215 220 225
Pro Arg Leu Val Ser Leu Gin Asn Leu Tyr Leu Gin Trp Asn Lys 230 235 240
Ile Ser Val Ile Gly Gin Thr Met Ser Trp Thr Trp Ser Ser Leu 245 250 255
827
Gin Arg Leu Asp Leu Ser Gly Asn Glu Ile Glu Ala Phe Ser Gly 260 265 270
Pro Ser Val Phe Gin Cys Val Pro Asn Leu Gin Arg Leu Asn Leu 275 280 285
Asp Ser Asn Lys Leu Thr Phe Ile Gly Gin Glu Ile Leu Asp Ser 290 295 300
Trp Ile Ser Leu Asn Asp Ile Ser Leu Ala Gly Asn Ile Trp Glu 305 310 315
Cys Ser Arg Asn Ile Cys Ser Leu Val Asn Trp Leu Lys Ser Phe 320 325 330
Lys Gly Leu Arg Glu Asn Thr Ile Ile Cys Ala Ser Pro Lys Glu 335 340 345
Leu Gin Gly Val Asn Val Ile Asp Ala Val Lys Asn Tyr Ser Ile 350 355 360
Cys Gly Lys Ser Thr Thr Glu Arg Phe Asp Leu Ala Arg Ala Leu 365 370 375
Pro Lys Pro Thr Phe Lys Pro Lys Leu Pro Arg Pro Lys His Glu 380 385 390
Ser Lys Pro Pro Leu Pro Pro Thr Val Gly Ala Thr Glu Pro Gly 395 400 405
Pro Glu Thr Asp Ala Asp Ala Glu His Ile Ser Phe His Lys Ile 410 415 420
Ile Ala Gly Ser Val Ala Leu Phe Leu Ser Val Leu Val Ile Leu 425 430 435
Leu Val Ile Tyr Val Ser Trp Lys Arg Tyr Pro Ala Ser Met Lys 440 445 450
Gin Leu Gin Gin Arg Ser Leu Met Arg Arg His Arg Lys Lys Lys 455 460 465
Arg Gin Ser Leu Lys Gin Met Thr Pro Ser Thr Gin Glu Phe Tyr 470 475 480
Val Asp Tyr Lys Pro Thr Asn Thr Glu Thr Ser Glu Met Leu Leu 485 490 495
Asn Gly Thr Gly Pro Cys Thr Tyr Asn Lys Ser Gly Ser Arg Glu 500 505 510
Cys Glu Val
<210> 386 <211> 24 <212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
828
<400> 386 ctgggatctg aacagtttcg gggc 24
<210> 387 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 387 ggtccccagg acatggtctg tccc 24
<210> 388 <211> 48 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 388
gctgagttta catttacggt ctaactccct gagaaccatc cctgtgcg 48
<210> 389
<211> 1449
<212> DNA
<213> Homo sapiens
<400> 389
agttctgaga aagaaggaaa taaacacagg caccaaacca ctatcctaag
50
ttgactgtcc tttaaatatg tcaagatcca gacttttcag tgtcacctca
100
gcgatctcaa cgatagggat cttgtgtttg ccgctattcc agttggtgct
150
ctcggaccta ccatgcgaag aagatgaaat gtgtgtaaat tataatgacc
200
aacaccctaa tggctggtat atctggatcc tcctgctgct ggttttggtg
250
gcagctcttc tctgtggagc tgtggtcctc tgcctccagt gctggctgag
300
gagaccccga attgattctc acaggcgcac catggcagtt tttgctgttg
350
gagacttgga ctctatttat gggacagaag cagctgtgag tccaactgtt
400
ggaattcacc ttcaaactca aacccctgac ctatatcctg ttcctgctcc
450
atgttttggc cctttaggct ccccacctcc atatgaagaa attgtaaaaa
500
caacctgatt ttaggtgtgg attatcaatt taaagtatta acgacatctg
550
taattccaaa acatcaaatt taggaatagt tatttcagtt gttggaaatg
600
tccagagatc tattcatata gtctgaggaa ggacaattcg acaaaagaat
650
ggatgttgga aaaaattttg gtcatggaga tgtttaaata gtaaagtagc
700
aggcttttga tgtgtcactg ctgtatcata cttttatgct acacaaccaa
750
attaatgctt ctccactagt atccaaacag
829
gcaacaatta ggtgctggaa
800
gtagtttcca tcacatttag gactccactg cagtatacag cacaccattt
850
tctgctttaa actctttcct agcatggggt ccataaaaat tattataatt
900
taacaatagc ccaagccgag aatccaacat gtccagaacc agaaccagaa
950
agatagtatt tgaatgaagg tgaggggaga gagtaggaaa aagaaaagtt
1000
tggagttgaa gggtaaagga taaatgaaga ggaaaaggaa aagattacaa
1050
gtctcagcaa aaacaagagg ttttatgccc caacctgaag aggaagaaat
1100
tgtagataga aggtgaagga gattgctgaa gatatagagc acatataatg
1150
ccaacacggg gagaaaagaa aatttcccct tttacagtaa tgaatgtggc
1200
ctccatagtc catagtgttt ctctggagcc tcagggcttg gcatttattg
1250
cagcatcatg ctaagaacct tcggcatagg tatctgttcc catgaggact
1300
gcagaagtag caatgagaca tcttcaagtg gcattttggc agtggccatc
1350
agcaggggga cagacaaaaa catccatcac agatgacata tgatcttcag
1400
ctgacaaatt tgttgaacaa aacaataaac atcaatagat atctaaaaa :
L449
<210> 390 <211> 146 <212> PRT
<213> Homo sapiens <400> 390
Met Ser Arg Ser Arg Leu Phe Ser Val Thr Ser Ala Ile Ser Thr 15 10 15
Ile Gly Ile Leu Cys Leu Pro Leu Phe Gin Leu Val Leu Ser Asp 20 25 30
Leu Pro Cys Glu Glu Asp Glu Met Cys Val Asn Tyr Asn Asp Gin 35 40 45
His Pro Asn Gly Trp Tyr Ile Trp Ile Leu Leu Leu Leu Val Leu 50 55 60
Val Ala Ala Leu Leu Cys Gly Ala Val Val Leu Cys Leu Gin Cys 65 70 75
Trp Leu Arg Arg Pro Arg Ile Asp Ser His Arg Arg Thr Met Ala 80 85 90
Val Phe Ala Val Gly Asp Leu Asp Ser Ile Tyr Gly Thr Glu Ala 95 100 105
Ala Val Ser Pro Thr Val Gly Ile His Leu Gin Thr Gin Thr Pro 110 115 120
Asp Leu Tyr Pro Val Pro Ala Pro Cys Phe Gly Pro Leu Gly Ser 125 130 135
Pro Pro Pro Tyr Glu Glu Ile Val Lys Thr Thr
140
830
145
<210> 391 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 391 cttttcagtg tcacctcagc gatctc 26
<210> 392 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 392
ccaaaacatg gagcaggaac agg 23 <210> 393 <211> 47 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 393
ccagttggtg ctctcggacc taccatgcga agaagatgaa atgtgtg 47
<210> 394
<211> 2340
<212> DNA
<213> Homo sapiens
<400> 394
gagcggagta aaatctccac aagctgggaa caaacctcgt cccaactccc
50
acccaccggc gtttctccag ctcgatctgg aggctgcttc gccagtgtgg
100
gacgcagctg acgcccgctt attagctctc gctgcgtcgc cccggctcag
150
aagctccgtg gcggcggcga ccgtgacgag aagcccacgg ccagctcagt
200
tctcttctac tttgggagag agagaaagtc agatgcccct tttaaactcc
250
ctcttcaaaa ctcatctcct gggtgactga gttaatagag tggatacaac
300
cttgctgaag atgaagaata tacaatattg aggatatttt tttctttttt
350
ttttcaagtc ttgatttgtg gcttacctca agttaccatt tttcagtcaa
400
gtctgtttgt ttgcttcttc agaaatgttt tttacaatct caagaaaaaa
450
tatgtcccag aaattgagtt tactgttgct tgtatttgga ctcatttggg
500
gattgatgtt actgcactat acttttcaac aaccaagaca tcaaagcagt
550
831
gtcaagttac gtgagcaaat actagactta agcaaaagat atgttaaagc
600
tctagcagag gaaaataaga acacagtgga tgtcgagaac ggtgcttcta
650
tggcaggata tgcggatctg aaaagaacaa ttgctgtcct tctggatgac
700
attttgcaac gattggtgaa gctggagaac aaagttgact atattgttgt
750
gaatggctca gcagccaaca ccaccaatgg tactagtggg aatttggtgc
800
cagtaaccac aaataaaaga acgaatgtct cgggcagtat cagatagcag
850
ttgaaaatca ccttgtgctg ctccatccac tgtggattat atcctatggc
900
agaaaagctt tataattgct ggcttaggac agagcaatac tttacaataa
950
aagctctaca cattttcaag gagtatgctg gattcatgga actctaattc
1000
tgtacataaa aattttaaag ttatttgttt gctttcaggc aagtctgttc
1050
aatgctgtac tatgtcctta aagagaattt ggtaacttgg ttgatgtggt
1100
aagcagatag gtgagttttg tataaatctt ttgtgtttga gatcaagctg
1150
aaatgaaaac actgaaaaac atggattcat ttctataaca catttattta
1200
agtatataac acgttttttg gacaagtgaa gaatgtttaa tcattctgtc
1250
atttgttctc aatagatgta actgttagac tacggctatt tgaaaaaatg
1300
tgcttattgt actatatttt gttattccaa ttatgagcag agaaaggaaa
1350
tataatgttg aaaataatgt tttgaaatca tgacccaaag aatgtattga
1400
tttgcactat ccttcagaat aactgaaggt taattattgt atatttttaa
1450
aaattacact tataagagta taatcttgaa atgggtagca gccactgtcc
1500
attacctatc gtaaacattg gggcaattta ataacagcat taaaatagtt
1550
gtaaactcta atcttatact tattgaagaa taaaagatat ttttatgatg
1600
agagtaacaa taaagtattc atgatttttc acatacatga atgttcattt
1650
aaaagtttaa tcctttgagt gtctatgcta tcaggaaagc acattatttc
1700
catatttggg ttaattttgc ttttattata ttggtctagg aggaagggac
1750
tttggagaat ggaactcttg aggactttag ccaggtgtat ataataaagg
1800
tacttttgtg ctgcattaaa ttgcttggaa agtgttaaca ttatattata
1850
taagagtatc ctttatgaaa ttttgaattt gtataacaga tgcattagat
1900
attcatttta tataatggcc acttaaaata agaacattta aaatataaac
1950
tatgaagatt gactatcttt tcaggaaaaa agctgtatat agcacaggga
2000
accctaatct tgggtaattc tagtataaaa caaattatac ttttatttaa
2050
atttcccttg tagcaaatct aattgccaca tggtgcccta tatttcatag
2100
832
tatttattct ctatagtaac tgcttaagtg cagctagctt ctagatttag 2150
actatataga atttagatat tgtattgttc gtcattataa tatgctacca 22 00
catgtagcaa taattacaat attttattaa aataaatatg tgaaatattg 2250
tttcatgaaa gacagatttc caaatctctc ttctcttctc tgtactgtct 2300
acctttatgt gaagaaatta attatatgcc attgccaggt 2340
<210> 395 <211> 140 <212> PRT <213> Homo sapiens
<400> 395
Met Phe Phe Thr Ile Ser Arg Lys Asn Met Ser Gin Lys Leu Ser 15 10 15
Leu Leu Leu Leu Val Phe Gly Leu Ile Trp Gly Leu Met Leu Leu 20 25 30
His Tyr Thr Phe Gin Gin Pro Arg His Gin Ser Ser Val Lys Leu 35 40 45
Arg Glu Gin Ile Leu Asp Leu Ser Lys Arg Tyr Val Lys Ala Leu 50 55 60
Ala Glu Glu Asn Lys Asn Thr Val Asp Val Glu Asn Gly Ala Ser 65 70 75
Met Ala Gly Tyr Ala Asp Leu Lys Arg Thr Ile Ala Val Leu Leu 80 85 90
Asp Asp Ile Leu Gin Arg Leu Val Lys Leu Glu Asn Lys Val Asp 95 100 105
Tyr Ile Val Val Asn Gly Ser Ala Ala Asn Thr Thr Asn Gly Thr 110 115 120
Ser Gly Asn Leu Val Pro Val Thr Thr Asn Lys Arg Thr Asn Val 125 130 135
Ser Gly Ser Ile Arg 140
<210> 396
<211> 2639
<212> DNA
<213> Homo sapiens
<400> 396
cgcggccggg ccgccggggt gagcgtgccg aggcggctgt ggcgcaggct 50 tccagccccc accatgccgt ggcccctgct gctgctgctg gccgtgagtg 100 gggcccagac aacccggcca tgcttccccg ggtgccaatg cgaggtggag 150 accttcggcc ttttcgacag cttcagcctg actcgggtgg attgtagcgg 200 cctgggcccc cacatcatgc cggtgcccat ccctctggac acagcccact 2 50
833
tggacctgtc ctccaaccgg ctggagatgg tgaatgagtc ggtgttggcg
300
gggccgggct acacgacgtt ggctggcctg gatctcagcc acaacctgct
350
caccagcatc tcacccactg ccttctcccg ccttcgctac ctggagtcgc
400
ttgacctcag ccacaatggc ctgacagccc tgccagccga gagcttcacc
450
agctcacccc tgagcgacgt gaaccttagc cacaaccagc tccgggaggt
500
ctcagtgtct gccttcacga cgcacagtca gggccgggca ctacacgtgg
550
acctctccca caacctcatt caccgcctcg tgccccaccc cacgagggcc
600
ggcctgcctg cgcccaccat tcagagcctg aacctggcct ggaaccggct
650
ccatgccgtg cccaacctcc gagacttgcc cctgcgctac ctgagcctgg
700
atgggaaccc tctagctgtc attggtccgg gtgccttcgc ggggctggga
750
ggccttacac acctgtctct ggccagcctg cagaggctcc ctgagctggc
800
gcccagtggc ttccgtgagc taccgggcct gcaggtcctg gacctgtcgg
850
gcaaccccaa gcttaactgg gcaggagctg aggtgttttc aggcctgagc
900
tccctgcagg agctggacct ttcgggcacc aacctggtgc ccctgcctga
950
ggcgctgctc ctccacctcc cggcactgca gagcgtcagc gtgggccagg
1000
atgtgcggtg ccggcgcctg gtgcgggagg gcacctaccc ccggaggcct
1050
ggctccagcc ccaaggtgcc cctgcactgc gtagacaccc gggaatctgc
1100
tgccaggggc cccaccatct tgtgacaaat ggtgtggccc agggccacat
1150
aacagactgc tgtcctgggc tgcctcaggt cccgagtaac ttatgttcaa
1200
tgtgccaaca ccagtgggga gcccgcaggc ctatgtggca gcgtcaccac
1250
aggagttgtg ggcctaggag aggctttgga cctgggagcc acacctagga
1300
gcaaagtctc acccctttgt ctacgttgct tccccaaacc atgagcagag
1350
ggacttcgat gccaaaccag actcgggtcc cctcctgctt cccttcccca
1400
cttatccccc aagtgccttc cctcatgcct gggccggcct gacccgcaat
1450
gggcagaggg tgggtgggac cccctgctgc agggcagagt tcaggtccac
1500
tgggctgagt gtccccttgg gcccatggcc cagtcactca ggggcgagtt
1550
tcttttctaa catagccctt tctttgccat gaggccatga ggcccgcttc
1600
atccttttct atttccctag aaccttaatg gtagaaggaa ttgcaaagaa
1650
tcaagtccac ccttctcatg tgacagatgg ggaaactgag gccttgagaa
1700
ggaaaaaggc taatctaagt tcctgcgggc agtggcatga ctggagcaca
1750
gcctcctgcc tcccagcccg gacccaatgc actttcttgt ctcctctaat
1800
aagccccacc ctccccgcct gggctcccct
834
tgctgccctt gcctgttccc
1850
cattagcaca ggagtagcag cagcaggaca ggcaagagcc tcacaagtgg
1900
gactctgggc ctctgaccag ctgtgcggca tgggctaagt cactctgccc
1950
ttcggagcct ctggaagctt agggcacatt ggttccagcc tagccagttt
2000
ctcaccctgg gttggggtcc cccagcatcc agactggaaa cctacccatt
2050
ttcccctgag catcctctag atgctgcccc aaggagttgc tgcagttctg
2100
gagcctcatc tggctgggat ctccaagggg cctcctggat tcagtcccca
2150
ctggccctga gcacgacagc ccttcttacc ctcccaggaa tgccgtgaaa
2200
ggagacaagg tctgcccgac ccatgtctat gctctacccc cagggcagca
2250
tctcagcttc cgaaccctgg gctgtttcct tagtcttcat tttataaaag
2300
ttgttgcctt tttaacggag tgtcactttc aaccggcctc ccctacccct
2350
gctggccggg gatggagaca tgtcatttgt aaaagcagaa aaaggttgca
2400
tttgttcact tttgtaatat tgtcctgggc ctgtgttggg gtgttggggg
2450
aagctgggca ccccacaggg tcagtggcca cagtgagctc catgggcatc tgtcttcccc aggggctggc cacctgccta cccacagaga gcccatcatc
2500 2550
tatctaaccg gtccttgatt taataaacac tataaaaggt ttaaaaaaaa
2600
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 263 9
<210> 397 <211> 353 <212> PRT <213> Homo sapiens
<400> 397
Met Pro Trp Pro Leu Leu Leu Leu Leu Ala Val Ser Gly Ala Gin 15 10 15
Thr Thr Arg Pro Cys Phe Pro Gly Cys Gin Cys Glu Val Glu Thr 20 25 30
Phe Gly Leu Phe Asp Ser Phe Ser Leu Thr Arg Val Asp Cys Ser 35 40 45
Gly Leu Gly Pro His Ile Met Pro Val Pro Ile Pro Leu Asp Thr 50 55 60
Ala His Leu Asp Leu Ser Ser Asn Arg Leu Glu Met Val Asn Glu 65 70 75
Ser Val Leu Ala Gly Pro Gly Tyr Thr Thr Leu Ala Gly Leu Asp 80 85 90
Leu Ser His Asn Leu Leu Thr Ser Ile Ser Pro Thr Ala Phe Ser 95 100 105
Arg Leu Arg Tyr
Leu Glu Ser Leu Asp Leu Ser 110 115
His Asn Gly Leu 120
835
Thr Ala Leu Pro Ala Glu Ser Phe Thr Ser Ser Pro Leu Ser Asp 125 130 135
Val Asn Leu Ser His Asn Gin Leu Arg Glu Val Ser Val Ser Ala 140 145 150
Phe Thr Thr His Ser Gin Gly Arg Ala Leu His Val Asp Leu Ser 155 160 165
His Asn Leu Ile His Arg Leu Val Pro His Pro Thr Arg Ala Gly 170 175 180
Leu Pro Ala Pro Thr Ile Gin Ser Leu Asn Leu Ala Trp Asn Arg 185 190 195
Leu His Ala Val Pro Asn Leu Arg Asp Leu Pro Leu Arg Tyr Leu 200 205 210
Ser Leu Asp Gly Asn Pro Leu Ala Val Ile Gly Pro Gly Ala Phe 215 220 225
Ala Gly Leu Gly Gly Leu Thr His Leu Ser Leu Ala Ser Leu Gin 230 235 240
Arg Leu Pro Glu Leu Ala Pro Ser Gly Phe Arg Glu Leu Pro Gly 245 250 255
Leu Gin Val Leu Asp Leu Ser Gly Asn Pro Lys Leu Asn Trp Ala 260 265 270
Gly Ala Glu Val Phe Ser Gly Leu Ser Ser Leu Gin Glu Leu Asp 275 280 285
Leu Ser Gly Thr Asn Leu Val Pro Leu Pro Glu Ala Leu Leu Leu 290 295 300
His Leu Pro Ala Leu Gin Ser Val Ser Val Gly Gin Asp Val Arg 305 310 315
Cys Arg Arg Leu Val Arg Glu Gly Thr Tyr Pro Arg Arg Pro Gly 320 325 330
Ser Ser Pro Lys Val Pro Leu His Cys Val Asp Thr Arg Glu Ser 335 340 345
Ala Ala Arg Gly Pro Thr Ile Leu 350
<210> 398 <211> 23 <212> DNA
<213> Artificial Sequence
<220>
<223> Synthetic oligonucleotide probe
<400> 398 ccctgccagc cgagagcttc acc 23
<210> 399 <211> 23
836
<212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 399 ggttggtgcc cgaaaggtcc age 23
<210> 400 <211> 44 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 400
caaccccaag cttaactggg caggagctga ggtgttttca ggee 44
<210> 401 <211> 1571 <212> DNA
<213> Homo sapiens <400> 401
gatggcgcag ccacagcttc tgtgagattc gatttctccc cagttcccct
50
gtgggtctga gaggctatat ggggaccaga gcgtcaattc agggtgagct cccaaaacaa acgttggctt gttttgacat tctggaaggg ttcccctgaa
100 150
atgtcattct ctatctattc actgcaagtg cctgctgttc caggccttac
200
ctgctgggca etaaeggegg agccaggatg gggacagaat aaaggagcca
250
cgacctgtgc caccaactcg cactcagact ctgaactcag acctgaaatc
300
ttctcttcac gggaggcttg gcagtttttc ttactcctgt ggtctccaga
350
tttcaggcct aagatgaaag cctctagtct tgccttcagc cttctctctg
400
ctgcgtttta tctcctatgg actccttcca ctggactgaa gacactcaat
450
ttgggaagct gtgtgatcgc cacaaacctt caggaaatac gaaatggatt
500
ttctgagata eggggcagtg tgcaagccaa agatggaaac attgacatca
550
gaatcttaag gaggactgag tetttgeaag acacaaagcc tgegaatega
600
tgctgcctcc tgcgccattt gctaagactc tatctggaca gggtatttaa
650
aaactaccag acccctgacc attatactct ccggaagatc agcagcctcg
700
ccaattcctt tcttaccatc aagaaggacc tccggctctc tcatgcccac
750
atgacatgcc attgtgggga ggaagcaatg aagaaataca gccagattct
800
gagtcacttt gaaaagctgg aacctcaggc agcagttgtg aaggctttgg
850
gggaactaga cattcttctg caatggatgg aggagacaga ataggaggaa
900
agtgatgctg ctgctaagaa tattcgaggt caagagctcc agtcttcaat
950
837
acctgcagag gaggcatgac cccaaaccac catctcttta ctgtactagt 1000
cttgtgctgg tcacagtgta tcttatttat gcattacttg cttccttgca 1050
tgattgtctt tatgcatccc caatcttaat tgagaccata cttgtataag 1100
atttttgtaa tatctttctg ctattggata tatttattag ttaatatatt 1150
tatttatttt ttgctattta atgtatttat ttttttactt ggacatgaaa 12 00
ctttaaaaaa attcacagat tatatttata acctgactag agcaggtgat 1250
gtatttttat acagtaaaaa aaaaaaacct tgtaaattct agaagagtgg 13 00
ctaggggggt tattcatttg tattcaacta aggacatatt tactcatgct 1350
gatgctctgt gagatatttg aaattgaacc aatgactact taggatgggt 1400
tgtggaataa gttttgatgt ggaattgcac atctacctta caattactga 1450
ccatccccag tagactcccc agtcccataa ttgtgtatct tccagccagg 1500
aatcctacac ggccagcatg tatttctaca aataaagttt tctttgcata 1550
ccaaaaaaaa aaaaaaaaaa a 1571
<210> 402 <211> 261 <212> PRT <213> Homo sapiens
<400> 402
Met Arg Gin Phe Pro Lys Thr Ser Phe Asp Ile Ser Pro Glu Met 1 5 10 15
Ser Phe Ser Ile Tyr Ser Leu Gin Val Pro Ala Val Pro Gly Leu 20 25 30
Thr Cys Trp Ala Leu Thr Ala Glu Pro Gly Trp Gly Gin Asn Lys 35 40 45
Gly Ala Thr Thr Cys Ala Thr Asn Ser His Ser Asp Ser Glu Leu 50 55 60
Arg Pro Glu Ile Phe Ser Ser Arg Glu Ala Trp Gin Phe Phe Leu 65 70 75
Leu Leu Trp Ser Pro Asp Phe Arg Pro Lys Met Lys Ala Ser Ser 80 85 90
Leu Ala Phe Ser Leu Leu Ser Ala Ala Phe Tyr Leu Leu Trp Thr 95 100 105
Pro Ser Thr Gly Leu Lys Thr Leu Asn Leu Gly Ser Cys Val Ile 110 115 120
Ala Thr Asn Leu Gin Glu Ile Arg Asn Gly Phe Ser Glu Ile Arg 125 130 135
Gly Ser Val Gin Ala Lys Asp Gly Asn Ile Asp Ile Arg Ile Leu 140 145 150
838
Arg Arg Thr Glu Ser Leu Gin Asp Thr Lys Pro Ala Asn Arg Cys 155 160 165
Cys Leu Leu Arg His Leu Leu Arg Leu Tyr Leu Asp Arg Val Phe 170 175 180
Lys Asn Tyr Gin Thr Pro Asp His Tyr Thr Leu Arg Lys Ile Ser 185 190 195
Ser Leu Ala Asn Ser Phe Leu Thr Ile Lys Lys Asp Leu Arg Leu 200 205 210
Ser His Ala His Met Thr Cys His Cys Gly Glu Glu Ala Met Lys 215 220 225
Lys Tyr Ser Gin Ile Leu Ser His Phe Glu Lys Leu Glu Pro Gin 230 235 240
Ala Ala Val Val Lys Ala Leu Gly Glu Leu Asp Ile Leu Leu Gin 245 250 255
Trp Met Glu Glu Thr Glu 260
<210> 403 <211> 28 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 403 ctcctgtggt ctccagattt caggccta 28
<210> 404 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 404 agtcctcctt aagattctga tgtcaa 2 6
<210> 405 <211> 998 <212> DNA
<213> Homo sapiens <400> 405
ccgttatcgt cttgcgctac tgctgaatgt ccgtcccgga ggaggaggag 50 aggcttttgc cgctgaccca gagatggccc cgagcgagca aattcctact 10 0 gtccggctgc gcggctaccg tggccgagct agcaaccttt cccctggatc 150 tcacaaaaac tcgactccaa atgcaaggag aagcagctct tgctcggttg 200
ggagacggtg caagagaatc tgccccctat aggggaatgg tgcgcacagc 250
839
cctagggatc attgaagagg aaggctttct aaagctttgg caaggagtga
300
cacccgccat ttacagacac gtagtgtatt ctggaggtcg aatggtcaca
350
tatgaacatc tccgagaggt tgtgtttggc aaaagtgaag atgagcatta
400
tcccctttgg aaatcagtca ttggagggat gatggctggt gttattggcc
450
agtttttagc caatccaact gacctagtga aggttcagat gcaaatggaa
500
ggaaaaagga aactggaagg aaaaccattg cgatttcgtg gtgtacatca
550
tgcatttgca aaaatcttag ctgaaggagg aatacgaggg ctttgggcag
600
gctgggtacc caatatacaa agagcagcac tggtgaatat gggagattta
650
accacttatg atacagtgaa acactacttg gtattgaata caccacttga
700
ggacaatatc atgactcacg gtttatcaag tttatgttct ggactggtag
750
cttctattct gggaacacca gccgatgtca tcaaaagcag aataatgaat
800
caaccacgag ataaacaagg aaggggactt ttgtataaat catcgactga
850
ctgcttgatt caggctgttc aaggtgaagg attcatgagt ctatataaag
900
gctttttacc atcttggctg agaatgaccc cttggtcaat ggtgttctgg
950
cttacttatg aaaaaatcag agagatgagt ggagtcagtc cattttaa 998
<210> 406 <211> 323 <212> PRT <2X3> Homo sapiens
<400> 406
Met Ser Val Pro Glu Glu Glu Glu Arg Leu Leu Pro Leu Thr Gin 15 10 15
Arg Trp Pro Arg Ala Ser Lys Phe Leu Leu Ser Gly Cys Ala Ala 20 25 30
Thr Val Ala Glu Leu Ala Thr Phe Pro Leu Asp Leu Thr Lys Thr 35 40 45
Arg Leu Gin Met Gin Gly Glu Ala Ala Leu Ala Arg Leu Gly Asp 50 55 60
Gly Ala Arg Glu Ser Ala Pro Tyr Arg Gly Met Val Arg Thr Ala 65 ' 70 75
Leu Gly Ile Ile Glu Glu Glu Gly Phe Leu Lys Leu Trp Gin Gly 80 85 90
Val Thr Pro Ala Ile Tyr Arg His Val Val Tyr Ser Gly Gly Arg 95 100 105
Met Val Thr Tyr Glu His Leu Arg Glu Val Val Phe Gly Lys Ser 110 115 120
Glu Asp Glu His Tyr Pro Leu Trp Lys Ser Val Ile Gly Gly Met
840
125 130 135
Met Ala Gly Val Ile Gly Gin Phe Leu Ala Asn Pro Thr Asp Leu 140 145 150
Val Lys Val Gin Met Gin Met Glu Gly Lys Arg Lys Leu Glu Gly 155 160 165
Lys Pro Leu Arg Phe Arg Gly Val His His Ala Phe Ala Lys Ile 170 175 180
Leu Ala Glu Gly Gly Ile Arg Gly Leu Trp Ala Gly Trp Val Pro 185 190 195
Asn Ile Gin Arg Ala Ala Leu Val Asn Met Gly Asp Leu Thr Thr 200 205 210
Tyr Asp Thr Val Lys His Tyr Leu Val Leu Asn Thr Pro Leu Glu 215 220 225
Asp Asn Ile Met Thr His Gly Leu Ser Ser Leu Cys Ser Gly Leu 230 235 240
Val Ala Ser Ile Leu Gly Thr Pro Ala Asp Val Ile Lys Ser Arg 245 250 255
Ile Met Asn Gin Pro Arg Asp Lys Gin Gly Arg Gly Leu Leu Tyr 260 265 270
Lys Ser Ser Thr Asp Cys Leu Ile Gin Ala Val Gin Gly Glu Gly 275 280 285
Phe Met Ser Leu Tyr Lys Gly Phe Leu Pro Ser Trp Leu Arg Met 290 295 300
Thr Pro Trp Ser Met Val Phe Trp Leu Thr Tyr Glu Lys Ile Arg 305 310 315
Glu Met Ser Gly Val Ser Pro Phe 320
<210> 407 <211> 31 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 407
cgcggatccc gttatcgtct tgcgctactg c 31
<210> 408 <211> 34 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 408
gcggaattct taaaatggac tgactccact catc 34
841
<210> 409 <211> 1487 <212> DNA
<213> Homo sapiens <400> 409
cggacgcgtg ggcgcgggac gccggcaggg ttgtggcgca gcagtctcct
50
tcctgcgcgc gcgcctgaag tcggcgtggg cgtttgagga agctgggata
100
cagcatttaa tgaaaaattt atgcttaaga agtaaaaatg gcaggcttcc
150
tagataattt tcgttggcca gaatgtgaat gtattgactg gagtgagaga
200
agaaatgctg tggcatctgt tgtcgcaggt atattgtttt ttacaggctg
250
gtggataatg attgatgcag ctgtggtgta tcctaagcca gaacagttga
300
accatgcctt tcacacatgt ggtgtatttt ccacattggc tttcttcatg
350
ataaatgctg tatccaatgc tcaggtgaga ggtgatagct atgaaagcgg
400
ctgtttagga agaacaggtg ctcgagtttg gcttttcatt ggtttcatgt
450
tgatgtttgg gtcacttatt gcttccatgt ggattctttt tggtgcatat
500
gttacccaaa atactgatgt ttatccggga ctagctgtgt tttttcaaaa
550
tgcacttata ttttttagca ctctgatcta caaatttgga agaaccgaag
600
agctatggac ctgagatcac ttcttaagtc acattttcct tttgttatat
650
tctgtttgta tagattgtac gataggtttt attaaatgtt ttatctctca ttgtttcttt gtacacattg acatttttat ccaaatggag gttctgagtt
700 750
ttgaaatagt tttatgaaat ttctttattt ttcattgcat agactgttaa
800
tatgtatata atacaagact atatgaattg gataatgagt atcagttttt
850
tattcctgag atttagaact tgatctactc cctgagccag ggttacatca
900
tcttgtcatt ttagaagtaa ccactcttgt ctctctggct gggcacggtg
950
gctcatgcct gtaatcccag cactttggga ggccgaggcg ggccgattgc
1000
ttgaggtcaa gtgtttgaga ccagcctggc caacatggcg aaaccccatc
1050
tactaaaaat acaaaaatta gccaggcatg gtggtgggtg cctgtaatcc
1100
cagctacctg ggaggctgag gcaggagaat cgcttgaacc cggggggcag
1150
aggttgcagt gagctgagtt tgcgccactg cactctagcc tgggggagaa
1200
agtgaaactc cctctcaaaa aaaagaccac tctcagtatc tctgatttct
1250
gaagatgtac aaaaaaatat agcttcatat atctggaatg agcactgagc
1300
cataaaaggt tttcagcaag ttgtaactta ttttggccta aaaatgaggt
1350
ttttttggta aagaaaaaat atttgttctt atgtattgaa gaagtgtact
1400
842
tttatataat gattttttaa atgcccaaag gactagtttg aaagcttctt 1450
ttaaaaagaa ttcctctaat atgactttat gtgagaa 1487
<210> 410 <211> 158 <212> PRT <213> Homo sapiens
<400> 410 Met Ala Gly Phe Leu Asp Asn Phe Arg 1 5
Ile Asp Trp Ser Glu Arg Arg Asn Ala 20
Gly Ile Leu Phe Phe Thr Gly Trp Trp 35
Val Val Tyr Pro Lys Pro Glu Gin Leu 50
Cys Gly Val Phe Ser Thr Leu Ala Phe 65
Ser Asn Ala Gin Val Arg Gly Asp Ser 80
Gly Arg Thr Gly Ala Arg Val Trp Leu 95
Met Phe Gly Ser Leu Ile Ala Ser Met 110
Tyr Val Thr Gin Asn Thr Asp Val Tyr 125
Phe Gin Asn Ala Leu Ile Phe Phe Ser 140
Gly Arg Thr Glu Glu Leu Trp Thr 155
<210> 411 <211> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 411 gtttgaggaa gctgggatac 2 0
<210> 412 <211> 20 <212> DNA
<213> Artificial Sequence
Trp Pro Glu Cys Glu Cys 10 15
Val Ala Ser Val Val Ala 25 30
Ile Met Ile Asp Ala Ala 40 45
Asn His Ala Phe His Thr 55 60
Phe Met Ile Asn Ala Val 70 75
Tyr Glu Ser Gly Cys Leu 85 90
Phe Ile Gly Phe Met Leu
100 105
Trp Ile Leu Phe Gly Ala
115 120
Pro Gly Leu Ala Val Phe
130 135
Thr Leu Ile Tyr Lys Phe 145 150
<220>
<223> Synthetic oligonucleotide probe
843
<400> 412 ccaaactcga gcacctgttc 20
<210> 413 <211> 40 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 413
atggcaggct tcctagataa ttttcgttgg ccagaatgtg 40
<210> 414
<211> 1337
<212> DNA
<213> Homo sapiens
<400> 414
gttgatggca aacttcctca aaggaggggc agagcctgcg cagggcagga
50
gcagctggcc cactggcggc ccgcaacact ccgtctcacc ctctgggccc
100
actgcatcta gaggagggcc gtctgtgagg ccactacccc tccagcaact
150
gggaggtggg actgtcagaa gctggcccag ggtggtggtc agctgggtca
200
gggacctacg gcacctgctg gaccacctcg ccttctccat cgaagcaggg
250
aagtgggagc ctcgagccct cgggtggaag ctgaccccaa gccacccttc
300
acctggacag gatgagagtg tcaggtgtgc ttcgcctcct ggccctcatc
350
tttgccatag tcacgacatg gatgtttatt cgaagctaca tgagcttcag
400
catgaaaacc atccgtctgc cacgctggct ggcagcctcg cccaccaagg
450
agatccaggt taaaaagtac aagtgtggcc tcatcaagcc ctgcccagcc
500
aactactttg cgtttaaaat ctgcagtggg gccgccaacg tcgtgggccc
550
tactatgtgc tttgaagacc gcatgatcat gagtcctgtg aaaaacaatg
600
tgggcagagg cctaaacatc gccctggtga atggaaccac gggagctgtg
650
ctgggacaga aggcatttga catgtactct ggagatgtta tgcacctagt
700
gaaattcctt aaagaaattc cggggggtgc actggtgctg gtggcctcct
750
acgacgatcc agggaccaaa atgaacgatg aaagcaggaa actcttctct
800
gacttgggga gttcctacgc aaaacaactg ggcttccggg acagctgggt
850
cttcatagga gccaaagacc tcaggggtaa aagccccttt gagcagttct
900
taaagaacag cccagacaca aacaaatacg agggatggcc agagctgctg
950
gagatggagg gctgcatgcc cccgaagcca ttttagggtg gctgtggctc
1000
ttcctcagcc aggggcctga agaagctcct gcctgactta ggagtcagag
1050
844
cccggcaggg gctgaggagg aggagcaggg ggtgctgcgt ggaaggtgct 1100
gcaggtcctt gcacgctgtg tcgcgcctct cctcctcgga aacagaaccc 1150
tcccacagca catcctaccc ggaagaccag cctcagaggg tccttctgga 1200
accagctgtc tgtggagaga atggggtgct ttcgtcaggg actgctgacg 1250
gctggtcctg aggaaggaca aactgcccag acttgagccc aattaaattt 1300
tatttttgct ggttttgaaa aaaaaaaaaa aaaaaaa 1337
<210> 415 <211> 224 <212> PRT
<213> Homo sapiens <400> 415
Met Arg Val Ser Gly Val Leu Arg Leu Leu Ala Leu Ile Phe Ala 15 10 15
Ile Val Thr Thr Trp Met Phe Ile Arg Ser Tyr Met Ser Phe Ser 20 25 30
Met Lys Thr Ile Arg Leu Pro Arg Trp Leu Ala Ala Ser Pro Thr 35 40 45
Lys Glu Ile Gin Val Lys Lys Tyr Lys Cys Gly Leu Ile Lys Pro 50 55 60
Cys Pro Ala Asn Tyr Phe Ala Phe Lys Ile Cys Ser Gly Ala Ala 65 70 75
Asn Val Val Gly Pro Thr Met Cys Phe Glu Asp Arg Met Ile Met 80 85 90
Ser Pro Val Lys Asn Asn Val Gly Arg Gly Leu Asn Ile Ala Leu 95 100 105
Val Asn Gly Thr Thr Gly Ala Val Leu Gly Gin Lys Ala Phe Asp 110 115 120
Met Tyr Ser Gly Asp Val Met His Leu Val Lys Phe Leu Lys Glu 125 130 135
Ile Pro Gly Gly Ala Leu Val Leu Val Ala Ser Tyr Asp Asp Pro 140 145 150
Gly Thr Lys Met Asn Asp Glu Ser Arg Lys Leu Phe Ser Asp Leu 155 160 165
Gly Ser Ser Tyr Ala Lys Gin Leu Gly Phe Arg Asp Ser Trp Val 170 175 180
Phe Ile Gly Ala Lys Asp Leu Arg Gly Lys Ser Pro Phe Glu Gin 185 190 195
Phe Leu Lys Asn Ser Pro Asp Thr Asn Lys Tyr Glu Gly Trp Pro 200 205 210
Glu Leu Leu Glu Met Glu Gly Cys Met Pro Pro Lys Pro Phe 215 220
845
<210> 416 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 416 gccatagtca cgacatggat g 21
<210> 417 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 417 ggatggccag agctgctg 18
<210> 418 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 418 aaagtacaag tgtggcctca tcaagc 2 6
<210> 419 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 419 tctgactcct aagtcaggca ggag 24
<210> 420 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 420 attctctcca cagacagctg gttc 24
<210> 421 <211> 46 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
846
<400> 421
gtacaagtgt ggcctcatca agccctgccc agccaactac tttgcg 46
<210> 422 <211> 1701 <212> DNA
<213> Homo sapiens <220>
<221> unsure
<222> 1528
<223> unknown base
<400> 422
gagactgcag agggagataa agagagaggg caaagaggca gcaagagatt
50
tgtcctgggg atccagaaac ccatgatacc ctactgaaca ccgaatcccc
100
tggaagccca cagagacaga gacagcaaga gaagcagaga taaatacact
150
cacgccagga gctcgctcgc tctctctctc tctctctcac tcctccctcc
200
ctctctctct gcctgtccta gtcctctagt cctcaaattc ccagtcccct
250
gcaccccttc ctgggacact atgttgttct ccgccctcct gctggaggtg
300
atttggatcc tggctgcaga tgggggtcaa cactggacgt atgagggccc
350
acatggtcag gaccattggc cagcctctta ccctgagtgt ggaaacaatg
400
cccagtcgcc catcgatatt cagacagaca gtgtgacatt tgaccctgat
450
ttgcctgctc tgcagcccca cggatatgac cagcctggca ccgagccttt
500
ggacctgcac aacaatggcc acacagtgca actctctctg ccctctaccc
550
tgtatctggg tggacttccc cgaaaatatg tagctgccca gctccacctg
600
cactggggtc agaaaggatc cccagggggg tcagaacacc agatcaacag
650
tgaagccaca tttgcagagc tccacattgt acattatgac tctgattcct
700
atgacagctt gagtgaggct gctgagaggc ctcagggcct ggctgtcctg
750
ggcatcctaa ttgaggtggg tgagactaag aatatagctt atgaacacat
800
tctgagtcac ttgcatgaag tcaggcataa agatcagaag acctcagtgc
850
ctcccttcaa cctaagagag ctgctcccca aacagctggg gcagtacttc
900
cgctacaatg gctcgctcac aactccccct tgctaccaga gtgtgctctg
950
gacagttttt tatagaaggt cccagatttc aatggaacag ctggaaaagc
1000
ttcaggggac attgttctcc acagaagagg agccctctaa gcttctggta
1050
cagaactacc gagcccttca gcctctcaat cagcgcatgg tctttgcttc
1100
tttcatccaa gcaggatcct cgtataccac aggtgaaatg ctgagtctag
1150
gtgtaggaat cttggttggc tgtctctgcc ttctcctggc tgtttatttc
1200
847
attgctagaa agattcggaa gaagaggctg gaaaaccgaa agagtgtggt 1250
cttcacctca gcacaagcca cgactgaggc ataaattcct tctcagatac 13 00
catggatgtg gatgacttcc cttcatgcct atcaggaagc ctctaaaatg 1350
gggtgtagga tctggccaga aacactgtag gagtagtaag cagatgtcct 1400
ccttcccctg gacatctctt agagaggaat ggacccaggc tgtcattcca 1450
ggaagaactg cagagccttc agcctctcca aacatgtagg aggaaatgag 1500
gaaatcgctg tgttgttaat gcagaganca aactctgttt agttgcaggg 1550
gaagtttggg atatacccca aagtcctcta ccccctcact tttatggccc 1600
tttccctaga tatactgcgg gatctctcct taggataaag agttgctgtt 1650
gaagttgtat atttttgatc aatatatttg gaaattaaag tttctgactt 17 00
t 1701
<210> 423 <211> 337 <212> PRT
<213> Homo sapiens <400> 423
Met Leu Phe Ser Ala Leu Leu Leu Glu Val Ile Trp Ile Leu Ala 1 5 10 15
Ala Asp Gly Gly Gin His Trp Thr Tyr Glu Gly Pro His Gly Gin 20 25 30
Asp His Trp Pro Ala Ser Tyr Pro Glu Cys Gly Asn Asn Ala Gin 35 40 45
Ser Pro Ile Asp Ile Gin Thr Asp Ser Val Thr Phe Asp Pro Asp
50 55 60 Leu Pro Ala Leu Gin Pro His Gly Tyr Asp Gin Pro Gly Thr Glu
65 70 75
Pro Leu Asp Leu His Asn Asn Gly His Thr Val Gin Leu Ser Leu 80 85 90
Pro Ser Thr Leu Tyr Leu Gly Gly Leu Pro Arg Lys Tyr Val Ala 95 100 105
Ala Gin Leu His Leu His Trp Gly Gin Lys Gly Ser Pro Gly Gly 110 115 120
Ser Glu His Gin Ile Asn Ser Glu Ala Thr Phe Ala Glu Leu His 125 130 135
Ile Val His Tyr Asp Ser Asp Ser Tyr Asp Ser Leu Ser Glu Ala 140 145 150
Ala Glu Arg Pro Gin Gly Leu Ala Val Leu Gly Ile Leu Ile Glu 155 160 165
Val Gly Glu Thr Lys Asn Ile Ala Tyr Glu His Ile Leu Ser His
848
170 175 180
Leu His Glu Val Arg His Lys Asp Gin Lys Thr Ser Val Pro Pro 185 190 195
Phe Asn Leu Arg Glu Leu Leu Pro Lys Gin Leu Gly Gin Tyr Phe 200 205 210
Arg Tyr Asn Gly Ser Leu Thr Thr Pro Pro Cys Tyr Gin Ser Val 215 220 225
Leu Trp Thr Val Phe Tyr Arg Arg Ser Gin Ile Ser Met Glu Gin 230 235 240
Leu Glu Lys Leu Gin Gly Thr Leu Phe Ser Thr Glu Glu Glu Pro 245 250 255
Ser Lys Leu Leu Val Gin Asn Tyr Arg Ala Leu Gin Pro Leu Asn 260 265 270
Gin Arg Met Val Phe Ala Ser Phe Ile Gin Ala Gly Ser Ser Tyr 275 280 285
Thr Thr Gly Glu Met Leu Ser Leu Gly Val Gly Ile Leu Val Gly 290 295 300
Cys Leu Cys Leu Leu Leu Ala Val Tyr Phe Ile Ala Arg Lys Ile 305 310 315
Arg Lys Lys Arg Leu Glu Asn Arg Lys Ser Val Val Phe Thr Ser 320 325 330
Ala Gin Ala Thr Thr Glu Ala 335
<210> 424 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 424 gtaaagtcgc tggccagc 18
<210> 425 <211> 18 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 425 cccgatctgc ctgctgta 18
<210> 426 <211> 24 <212> DNA
<213> Artificial Sequence
849
<220>
<223> Synthetic oligonucleotide probe
<400> 426 ctgcactgta tggccattat tgtg 24
<210> 427 <211> 45 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe <400> 427
cagaaaccca tgatacccta ctgaacaccg aatcccctgg aagcc 45
<210> 428 <211> 1073 <212> DNA
<213> Homo sapiens <400> 428
aatttttcac cagagtaaac ttgagaaacc aactggacct tgagtattgt
50
acattttgcc tcgtggaccc aaaggtagca atctgaaaca tgaggagtac
100
gattctactg ttttgtcttc taggatcaac tcggtcatta ccacagctca
150
aacctgcttt gggactccct cccacaaaac tggctccgga tcagggaaca
200
ctaccaaacc aacagcagtc aaatcaggtc tttccttctt taagtctgat
250
accattaaca cagatgctca cactggggcc agatctgcat ctgttaaatc
300
ctgctgcagg aatgacacct ggtacccaga cccacccatt gaccctggga
350
gggttgaatg tacaacagca actgcaccca catgtgttac caatttttgt
400
cacacaactt ggagcccagg gcactatcct aagctcagag gaattgccac
450
aaatcttcac cccaccagtc gagcctcatc aggcaggggc atccattcct taatccagat tgttcccggg gtccaggatg aggcatcctg gaagccttcc
500 550
agcaggagga gcaggtgtaa atcctgccac ccagggaacc ccagcaggcc
600
gcctcccaac tcccagtggc acagatgacg actttgcagt gaccacccct
650
gcaggcatcc aaaggagcac acatgccatc gaggaagcca ccacagaatc
700
agcaaatgga attcagtaag ctgtttcaaa ttttttcaac taagctgcct
750
cgaatttggt gatacatgtg aatctttatc attgattata ttatggaata
800
gattgagaca cattggatag tcttagaaga aattaattct taatttacct
850
gaaaatattc ttgaaatttc agaaaatatg ttctatgtag agaatcccaa
900
cttttaaaaa caataattca atggataaat ctgtctttga aatataacat
950
tatgctgcct ggatgatatg catattaaaa catatttgga aaactggaaa
1000
850
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1050
aaaaaaaaaa aaaaaaaaaa aaa 1073
<210> 429 <211> 209 <212> PRT
<213> Homo sapiens <400> 429
Met Arg Ser Thr Ile Leu Leu Phe Cys Leu Leu Gly Ser Thr Arg 15 10 15
Ser Leu Pro Gin Leu Lys Pro Ala Leu Gly Leu Pro Pro Thr Lys 20 25 30
Leu Ala Pro Asp Gin Gly Thr Leu Pro Asn Gin Gin Gin Ser Asn 35 40 45
Gin Val Phe Pro Ser Leu Ser Leu Ile Pro Leu Thr Gin Met Leu 50 55 60
Thr Leu Gly Pro Asp Leu His Leu Leu Asn Pro Ala Ala Gly Met 65 70 75
Thr Pro Gly Thr Gin Thr His Pro Leu Thr Leu Gly Gly Leu Asn 80 85 90
Val Gin Gin Gin Leu His Pro His Val Leu Pro Ile Phe Val Thr 95 100 105
Gin Leu Gly Ala Gin Gly Thr Ile Leu Ser Ser Glu Glu Leu Pro 110 115 120
Gin Ile Phe Thr Ser Leu Ile Ile His Ser Leu Phe Pro Gly Gly 125 130 135
Ile Leu Pro Thr Ser Gin Ala Gly Ala Asn Pro Asp Val Gin Asp 140 145 150
Gly Ser Leu Pro Ala Gly Gly Ala Gly Val Asn Pro Ala Thr Gin 155 160 165
Gly Thr Pro Ala Gly Arg Leu Pro Thr Pro Ser Gly Thr Asp Asp 170 175 180
Asp Phe Ala Val Thr Thr Pro Ala Gly Ile Gin Arg Ser Thr His 185 190 195
Ala Ile Glu Glu Ala Thr Thr Glu Ser Ala Asn Gly Ile Gin 200 205
<210> 430 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 430 tccaggtgga ccccacttca gg 22
851
<210> 431 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide Probe
<400> 431 gggaggctta taggcccaat ctgg 24
<210> 432 <211> 50 <212> DNA
<213> Artificial Sequence <220>
<223> Synthetic oligonucleotide probe
<400> 432
ggcttcagca gcaggtgtga agtcgaagtc gcagtcacag atatcaatga 50 1
Claims (34)
1. An isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that encodes a polypeptide comprising the amino acid sequence shown in Figure 32 (SEQ ID NO: 54).
2. A nucleic acid sequence of Claim 1, wherein said nucleotide sequence comprises the nucleotide sequence shown in Figure 31 (SEQ ID NO: 53).
3. A nucleic acid of Claim 1 or Claim 2, wherein said nucleotide sequence comprises die full-length coding 10 sequence of the sequence shown in Figure 31 (SEQ ID NO: 53).
4. An isolated nucleic acid which comprises the full-length coding sequence of the DNA deposited under accession number ATCC 203287. 15
5. A vector comprising a nucleic acid of any one of Claims 1 to 4.
6. A vector of Claim 5, wherein said nucleic acid is operably linked to control sequences recognised by a host cell transformed with the vector. 2 0
7. A non-human host cell comprising a vector of Claim 5 or Claim 6.
8. A non-human host cell of Claim 7, wherein said cell is a CHO cell, an E. coli all or a yeast cell.
9. A process for producing a polypeptide comprising culturing a host cell of Claim 7 or Claim 8 under 2 5 conditions suitable for expression of said polypeptide and recovering said polypeptide from the cell culture.
10. An isolated PR01295 polypeptide laving at least 80% sequence identity to the amino acid sequence shown in Figure 32 (SEQ ID NO: 54). 30
11. An isolated PR01295 polypeptide having at least 80% sequence identity to the amino acid sequence encoded by the nucleotide deposited under ATCC 203287.
12. An isolated polypeptide comprising a polypeptide sequence having at least about 80% amino acid sequence identity to the extracellular domain of a PR01295 polypeptide of SEQ ID NO: 54. 35
13. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO1295 polypeptide of SEQ ID NO: 54 lacking its associated signal peptide.
14. An isolated nucleic acid encoding a polypeptide of any one of Claims 10 to 13. 852 nKraSwr^fv 17 MAR 230*1 RECEIVED
15. IS. A chimeric molecule comprising a polypeptide according to any one of Claims 10 to 13 fused to a heterologous amino acid sequence. 5
16. A chimeric molecule of Claim 15, wherein said heterologous amino acid sequence is an epitope tag sequence.
17. A chimeric molecule of Claim IS, wherein said heterologous amino acid sequence is a Fc region of an immunoglobulin. 10
18. An antibody which specifically binds to a polypeptide according to any one of Claims 10 to 13.
19. An antibody of Claim 18, wherein said antibody is a humanised antibody. , 15
20. An antibody of Claim 18 or Claim 19, wherein said antibody is a monoclonal antibody.
21. An antibody of any one of claims 18 to 20, wherein said antibody is a chimeric antibody.
22. A pharmaceutical composition comprising a polypeptide of any one of claims 10 to 13 and a ^ ^ pharmaceuticaliy acceptable carrier, diluent or excipient.
23. A use of a polypeptide of any one of claims 10 to 13 in the manufacture of a medicament for treating tumor.
24. An isolated nucleic acid as defined in claim 1 substantially as herein described with reference to 2 5 any example thereof and with or without reference to the Figures.
25. A vector as defined in claim 5 substantially as herein described with reference to any example thereof and with or without reference to the Figures. 30
26. A non-human host cell as defined in claim 7 substantially as herein described with reference to any example thereof and with or without reference to the Figures. 35 40
27. A process for producing a polypeptide as claimed in claim 9 substantially as herein described with reference to any example thereof and with or without reference to the Figures.
28. An isolated PR01295 polypeptide as defined in claim 10 or claim 11 substantially as herein described with reference to any example thereof and with or without reference to the Figures. 853 ElLECUiAL PROPERTY QF^CE- OP N.Z 17 l.'.AR 23S1! received
29. An isolated polypeptide as defined in claim 12 or claim 13 substantially as herein described with reference to any example thereof and with or without reference to the Figures.
30. An isolated nucleic acid as claimed in claim 14 substantially as herein described with reference to any example thereof and with or without reference to the Figures.
31. A chimeric molecule as defined in claim 15 substantially as herein described with reference to any example thereof and with or without reference to the Figures.
32. An antibody as defined in claim 18 substantially as herein described with reference to any example thereof and with or without reference to the Figures.
33. A pharmaceutical composition as claimed in claim 22 substantially as herein described with reference to any example thereof and with or without reference to the Figures.
34. A use as claimed in claim 23 substantially as herein described with reference to any example thereof and with or without reference to the Figures. 854
Applications Claiming Priority (101)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US9871698P | 1998-09-01 | 1998-09-01 | |
US9875098P | 1998-09-01 | 1998-09-01 | |
US9874998P | 1998-09-01 | 1998-09-01 | |
US9882198P | 1998-09-02 | 1998-09-02 | |
US9880398P | 1998-09-02 | 1998-09-02 | |
US9884398P | 1998-09-02 | 1998-09-02 | |
US9964298P | 1998-09-09 | 1998-09-09 | |
US9959698P | 1998-09-09 | 1998-09-09 | |
US9953698P | 1998-09-09 | 1998-09-09 | |
US9959898P | 1998-09-09 | 1998-09-09 | |
US9960298P | 1998-09-09 | 1998-09-09 | |
US9975498P | 1998-09-10 | 1998-09-10 | |
US9981298P | 1998-09-10 | 1998-09-10 | |
US9979298P | 1998-09-10 | 1998-09-10 | |
US9980898P | 1998-09-10 | 1998-09-10 | |
US9981598P | 1998-09-10 | 1998-09-10 | |
US9974198P | 1998-09-10 | 1998-09-10 | |
US9976398P | 1998-09-10 | 1998-09-10 | |
US9981698P | 1998-09-10 | 1998-09-10 | |
US10038898P | 1998-09-15 | 1998-09-15 | |
US10038598P | 1998-09-15 | 1998-09-15 | |
US10039098P | 1998-09-15 | 1998-09-15 | |
US10066498P | 1998-09-16 | 1998-09-16 | |
US10062798P | 1998-09-16 | 1998-09-16 | |
US10066198P | 1998-09-16 | 1998-09-16 | |
US10058498P | 1998-09-16 | 1998-09-16 | |
US10066298P | 1998-09-16 | 1998-09-16 | |
US10071098P | 1998-09-17 | 1998-09-17 | |
US10068398P | 1998-09-17 | 1998-09-17 | |
US10093098P | 1998-09-17 | 1998-09-17 | |
US10071198P | 1998-09-17 | 1998-09-17 | |
US10068498P | 1998-09-17 | 1998-09-17 | |
US10091998P | 1998-09-17 | 1998-09-17 | |
US10084898P | 1998-09-18 | 1998-09-18 | |
US10084998P | 1998-09-18 | 1998-09-18 | |
US10106898P | 1998-09-18 | 1998-09-18 | |
US10107198P | 1998-09-18 | 1998-09-18 | |
US10101498P | 1998-09-18 | 1998-09-18 | |
US10127998P | 1998-09-22 | 1998-09-22 | |
US10147998P | 1998-09-23 | 1998-09-23 | |
US10147498P | 1998-09-23 | 1998-09-23 | |
US10147798P | 1998-09-23 | 1998-09-23 | |
US10147298P | 1998-09-23 | 1998-09-23 | |
US10147698P | 1998-09-23 | 1998-09-23 | |
US10147198P | 1998-09-23 | 1998-09-23 | |
US10147598P | 1998-09-23 | 1998-09-23 | |
US10174398P | 1998-09-24 | 1998-09-24 | |
US10173898P | 1998-09-24 | 1998-09-24 | |
US10191698P | 1998-09-24 | 1998-09-24 | |
US10174198P | 1998-09-24 | 1998-09-24 | |
US10191598P | 1998-09-24 | 1998-09-24 | |
US10220798P | 1998-09-29 | 1998-09-29 | |
US10230798P | 1998-09-29 | 1998-09-29 | |
US10233198P | 1998-09-29 | 1998-09-29 | |
US10233098P | 1998-09-29 | 1998-09-29 | |
US10224098P | 1998-09-29 | 1998-09-29 | |
US10248498P | 1998-09-30 | 1998-09-30 | |
US10257198P | 1998-09-30 | 1998-09-30 | |
US10248798P | 1998-09-30 | 1998-09-30 | |
US10257098P | 1998-09-30 | 1998-09-30 | |
US10268798P | 1998-10-01 | 1998-10-01 | |
US10268498P | 1998-10-01 | 1998-10-01 | |
US10296598P | 1998-10-02 | 1998-10-02 | |
US10325898P | 1998-10-06 | 1998-10-06 | |
US10344998P | 1998-10-06 | 1998-10-06 | |
US10339598P | 1998-10-07 | 1998-10-07 | |
US10332898P | 1998-10-07 | 1998-10-07 | |
US10331498P | 1998-10-07 | 1998-10-07 | |
US10340198P | 1998-10-07 | 1998-10-07 | |
US10331598P | 1998-10-07 | 1998-10-07 | |
US10339698P | 1998-10-07 | 1998-10-07 | |
US10371198P | 1998-10-08 | 1998-10-08 | |
US10363398P | 1998-10-08 | 1998-10-08 | |
US10367898P | 1998-10-08 | 1998-10-08 | |
US10367998P | 1998-10-08 | 1998-10-08 | |
US10425798P | 1998-10-14 | 1998-10-14 | |
US10498798P | 1998-10-20 | 1998-10-20 | |
US10500098P | 1998-10-20 | 1998-10-20 | |
US10500298P | 1998-10-20 | 1998-10-20 | |
US10510498P | 1998-10-21 | 1998-10-21 | |
US10516998P | 1998-10-22 | 1998-10-22 | |
US10526698P | 1998-10-22 | 1998-10-22 | |
US10569498P | 1998-10-26 | 1998-10-26 | |
US10569398P | 1998-10-26 | 1998-10-26 | |
US10588198P | 1998-10-27 | 1998-10-27 | |
US10606298P | 1998-10-27 | 1998-10-27 | |
US10588298P | 1998-10-27 | 1998-10-27 | |
US10580798P | 1998-10-27 | 1998-10-27 | |
US10603098P | 1998-10-28 | 1998-10-28 | |
US10602998P | 1998-10-28 | 1998-10-28 | |
US10603398P | 1998-10-28 | 1998-10-28 | |
US10617898P | 1998-10-28 | 1998-10-28 | |
US10602398P | 1998-10-28 | 1998-10-28 | |
US10603298P | 1998-10-28 | 1998-10-28 | |
US10850098P | 1998-10-29 | 1998-10-29 | |
US10624898P | 1998-10-29 | 1998-10-29 | |
US10638498P | 1998-10-29 | 1998-10-29 | |
US10646498P | 1998-10-30 | 1998-10-30 | |
US10685698P | 1998-11-03 | 1998-11-03 | |
US09/946,374 US20030073129A1 (en) | 1998-09-01 | 2001-09-04 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US10/015,480 US7074912B2 (en) | 1998-09-01 | 2001-12-11 | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Publications (1)
Publication Number | Publication Date |
---|---|
NZ510464A true NZ510464A (en) | 2004-05-28 |
Family
ID=56289939
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
NZ531664A NZ531664A (en) | 1998-09-01 | 1999-09-01 | Pro1317 polypeptides and sequences thereof with homology to the semaphorin B glycoprotein family |
NZ510464A NZ510464A (en) | 1998-09-01 | 1999-09-01 | Further pro polypeptides and sequences thereof |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
NZ531664A NZ531664A (en) | 1998-09-01 | 1999-09-01 | Pro1317 polypeptides and sequences thereof with homology to the semaphorin B glycoprotein family |
Country Status (3)
Country | Link |
---|---|
CA (1) | CA2339043A1 (en) |
NZ (2) | NZ531664A (en) |
WO (1) | WO2000012708A2 (en) |
Families Citing this family (126)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000029422A1 (en) * | 1998-11-12 | 2000-05-25 | Human Genome Sciences, Inc. | 31 human secreted proteins |
US20020137890A1 (en) | 1997-03-31 | 2002-09-26 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US7057017B2 (en) | 1997-04-16 | 2006-06-06 | Millennium Pharmaceuticals, Inc. | Human dickkopf-related protein and nucleic acid molecules and uses therefor |
US20050196832A1 (en) | 1997-09-18 | 2005-09-08 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US20030166105A1 (en) | 1997-09-18 | 2003-09-04 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
US20020192752A1 (en) | 1998-09-09 | 2002-12-19 | Genentech, Inc. | Compositions and methods for the treatment of immune related diseases |
WO2000017353A1 (en) * | 1998-09-22 | 2000-03-30 | Genentech, Inc. | Ucp4 |
US6573068B1 (en) | 1997-11-13 | 2003-06-03 | Genset, S. A. | Claudin-50 protein |
EP1046650A4 (en) | 1997-12-22 | 2001-12-19 | Ono Pharmaceutical Co | PEPTIDES, METHOD FOR ASSAYING HUMAN PEPSINOGEN II or HUMAN PEPSIN II, AND ASSAYING KIT |
JPH11187882A (en) * | 1997-12-26 | 1999-07-13 | Ono Pharmaceut Co Ltd | Novel polypeptide, its production, cdna coding for the same polypeptide, vector comprising the same cdna, host cell transformed by the same vector, antibody of the same polypeptide, and pharmaceutical composition containing the polypeptide of antibody |
US7446181B2 (en) | 1998-01-15 | 2008-11-04 | Millennium Pharmaceuticals, Inc. | Antibodies that bind human Dickkopf-1 proteins |
US7045333B1 (en) | 1998-01-16 | 2006-05-16 | Incyte Corporation | Human protease molecules |
US6203979B1 (en) | 1998-01-16 | 2001-03-20 | Incyte Pharmaceuticals, Inc. | Human protease molecules |
JP2002502589A (en) | 1998-02-09 | 2002-01-29 | ヒューマン ジノーム サイエンシーズ, インコーポレイテッド | 45 human secreted proteins |
US6245550B1 (en) * | 1998-03-20 | 2001-06-12 | Smithkline Beecham Corporation | Cytokine family member EF-7 |
US6365369B1 (en) | 1998-04-01 | 2002-04-02 | Human Genome Sciences, Inc. | Prostate specific secreted protein |
WO1999058668A1 (en) * | 1998-05-14 | 1999-11-18 | Ono Pharmaceutical Co., Ltd. | NOVEL POLYPEPTIDES, cDNAS ENCODING THE SAME AND UTILIZATION THEREOF |
US6949339B1 (en) | 1998-05-21 | 2005-09-27 | Diadexus, Inc. | Method of diagnosing, monitoring, and staging colon cancer |
US6914130B2 (en) | 1998-06-17 | 2005-07-05 | Genentech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
US20030166132A1 (en) | 1998-08-26 | 2003-09-04 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
WO2000012721A1 (en) * | 1998-08-31 | 2000-03-09 | Long Yu | A novel human lysozyme gene, its encoding polypeptide and the method for preparing them |
US20030114652A1 (en) * | 1998-09-01 | 2003-06-19 | Genentech, Inc. | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
JP2002523760A (en) | 1998-09-02 | 2002-07-30 | ダイアデクスアス・インコーポレーテッド | Novel methods for diagnosing, monitoring, staging, imaging and treating various cancers |
US7244817B2 (en) * | 1998-09-10 | 2007-07-17 | Genentech, Inc. | Pro1357 polypeptide |
US7244428B2 (en) * | 1998-09-10 | 2007-07-17 | Genentech, Inc. | PRO1357 antibodies |
US6344541B1 (en) | 1998-09-25 | 2002-02-05 | Amgen Inc. | DKR polypeptides |
US7414112B2 (en) | 1998-10-08 | 2008-08-19 | Genentech, Inc. | Antibodies to PRO1550 polypeptides |
WO2000028031A2 (en) | 1998-11-10 | 2000-05-18 | Emory University | Mitogenic regulators |
WO2000034486A1 (en) * | 1998-12-09 | 2000-06-15 | Shionogi & Co., Ltd. | Human secretory phospholipase a¿2? |
WO2000043512A1 (en) * | 1999-01-25 | 2000-07-27 | Zymogenetics, Inc. | Human polypeptide having multiple epidermal growth factor (egf) -like domains, zntr2 |
US7166427B1 (en) | 1999-02-26 | 2007-01-23 | The Ohio University Research Foundation | Detecting the expression of the DESC1 gene in squamous cell carcinoma |
AU3139800A (en) * | 1999-03-11 | 2000-09-28 | Mount Sinai Hospital Corporation | Novel human kallikrein-like genes |
US8410248B2 (en) | 1999-03-12 | 2013-04-02 | Human Genome Sciences Inc. | HWBAO62 polypeptides |
AU782985B2 (en) * | 1999-04-09 | 2005-09-15 | Curagen Corporation | Novel human proteins and polynucleotides encoding them |
CA2369384A1 (en) * | 1999-04-12 | 2000-10-19 | Agensys, Inc. | Prostate-restricted gene 30p3c8 expressed in prostate cancer |
AU5799500A (en) * | 1999-06-29 | 2001-01-22 | Mcgill University | Human alpha 1,2-mannosidase |
JP2003506017A (en) | 1999-07-12 | 2003-02-18 | メルク パテント ゲゼルシャフト ミット ベシュレンクテル ハフトング | Seripanklin |
WO2001008634A2 (en) * | 1999-08-03 | 2001-02-08 | Human Genome Sciences, Inc. | Polynucleotide encoding a human serine protease |
JP3988821B2 (en) * | 1999-08-31 | 2007-10-10 | ジェネンテック・インコーポレーテッド | Compositions and methods for treating immune related diseases |
US7576182B1 (en) | 1999-08-31 | 2009-08-18 | Genentech, Inc. | Compositions and methods for the treatment of immune related diseases |
DE60044138D1 (en) * | 1999-09-29 | 2010-05-20 | Teijin Ltd | POLYPEPTIDES AND FOR THESE ENCODING GENES |
JP2003529334A (en) * | 1999-10-13 | 2003-10-07 | キュラジェン コーポレイション | Proteins and polynucleotides encoded by them |
ATE402953T1 (en) * | 1999-10-18 | 2008-08-15 | Lexicon Pharmaceuticals Inc | HUMAN CHORDIN AND POLYNUCLEOTIDES THAT CODE FOR IT |
EP1224201A4 (en) * | 1999-10-29 | 2005-03-02 | Human Genome Sciences Inc | 32 human secreted proteins |
AU1437901A (en) * | 1999-10-29 | 2001-05-14 | Human Genome Sciences, Inc. | 26 human secreted proteins |
AU1174501A (en) * | 1999-11-10 | 2001-06-06 | Compugen Ltd. | Chordin-like homologs |
WO2001036604A2 (en) * | 1999-11-18 | 2001-05-25 | Corvas International, Inc. | Nucleic acids encoding endotheliases, endotheliases and uses thereof |
CA2388922A1 (en) * | 1999-11-19 | 2001-05-25 | Human Genome Sciences, Inc. | 23 human secreted proteins |
WO2001038505A2 (en) * | 1999-11-23 | 2001-05-31 | Zymogenetics, Inc. | Secreted protein, zalpha37 |
AU2225401A (en) * | 1999-12-27 | 2001-07-09 | Takeda Chemical Industries Ltd. | Novel protein and dna thereof |
WO2001053352A2 (en) * | 2000-01-19 | 2001-07-26 | University Of Virginia Patent Foundation | Sperm specific proteins |
MXPA02007119A (en) * | 2000-01-19 | 2003-04-14 | Amgen Inc | Chondromodulin i related peptide. |
WO2001055390A1 (en) * | 2000-01-28 | 2001-08-02 | Human Genome Sciences, Inc. | Human serpin polynucleotides, polypeptides, and antibodies |
AU2001237958A1 (en) * | 2000-01-31 | 2001-08-07 | Human Genome Sciences, Inc. | 22 human secreted proteins |
AU4196801A (en) * | 2000-03-02 | 2001-09-12 | Amgen Inc | Chordin-like-2 molecules and uses thereof |
AU2001255168A1 (en) * | 2000-03-03 | 2001-09-17 | Genentech Inc. | Compositions and methods for the treatment of immune related diseases |
US7301016B2 (en) | 2000-03-07 | 2007-11-27 | Millennium Pharmaceuticals, Inc. | Human transferase family members and uses thereof |
AU2001241080A1 (en) * | 2000-03-10 | 2001-09-17 | Toshio Kitamura | Mouse adipocyte-origin genes |
AU2001245746A1 (en) * | 2000-03-15 | 2001-09-24 | Fibrogen, Inc. | Alpha(iii) subunit of prolyl 4-hydroxylase |
US20020031801A1 (en) * | 2000-03-24 | 2002-03-14 | Millennium Pharmaceuticals, Inc. | 18806, a novel trypsin serine protease-like molecule and uses thereof |
US20030096952A1 (en) * | 2000-03-30 | 2003-05-22 | Kumud Majumder | Novel proteins and nucleic acids encoding same |
GB0009907D0 (en) * | 2000-04-20 | 2000-06-07 | Smithkline Beecham Biolog | Novel compounds |
WO2002000709A2 (en) * | 2000-06-23 | 2002-01-03 | Eli Lilly And Company | Cerebellin homologous polypeptides and therapeutic uses thereof |
AU2001271622A1 (en) * | 2000-06-30 | 2002-01-14 | Millennium Pharmaceuticals, Inc. | 32626, a human udp-glycosyltransferase and uses thereof |
US6750034B1 (en) * | 2000-06-30 | 2004-06-15 | Ortho-Mcneil Pharmaceutical, Inc. | DNA encoding human serine protease D-G |
WO2002008277A2 (en) * | 2000-07-19 | 2002-01-31 | Eli Lilly And Company | Nucleic acids, vectors, host cells, polypeptides, and uses thereof |
WO2002008260A2 (en) * | 2000-07-26 | 2002-01-31 | Stanford University | Bstp-ecg1 protein and related reagents and methods of use thereof |
ATE547525T1 (en) | 2000-08-08 | 2012-03-15 | Zymogenetics Inc | SOLUBLE ZCYCTOR 11 CYTOKINE RECEPTORS |
AU2001286785B2 (en) * | 2000-08-24 | 2006-08-17 | Genentech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
EP1445318A2 (en) * | 2000-08-24 | 2004-08-11 | Genetech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
ATE412009T1 (en) * | 2000-08-24 | 2008-11-15 | Genentech Inc | METHOD FOR INHIBITING IL-22 INDUCED PAP1 |
EP1311662A2 (en) * | 2000-08-24 | 2003-05-21 | Genentech, Inc. | Compositions and methods for the diagnosis and treatment of tumor |
ES2267814T3 (en) | 2000-09-15 | 2007-03-16 | Zymogenetics, Inc. | METHOD TO TREAT INFLAMMATION. |
JP4274792B2 (en) * | 2000-10-06 | 2009-06-10 | キリンホールディングス株式会社 | Novel N-acetylglucosaminyltransferase and polynucleotide encoding the same |
AU2002217838A1 (en) | 2000-11-14 | 2002-05-27 | Bristol-Myers Squibb Company | A human serpin secreted from lymphoid cells LSI-01 |
US7045326B2 (en) * | 2001-02-23 | 2006-05-16 | The Regents Of The University Of California | Mono- and diacylglycerol acyltransferases and methods of use thereof |
US20020119138A1 (en) | 2001-02-26 | 2002-08-29 | Sylvaine Cases | Diacylglycerol o-acyltransferase 2alpha (DGAT2alpha) |
EP1383531A2 (en) | 2001-02-28 | 2004-01-28 | Eli Lilly And Company | Use of lp82 to treat hematopoietic disorders |
US20040086908A1 (en) | 2002-03-07 | 2004-05-06 | Chandrasekher Yasmin A. | Soluble heterodimeric cytokine receptor |
US7034134B2 (en) | 2001-04-26 | 2006-04-25 | Bristol-Myers Squibb Company | Polynucleotide encoding a novel metalloprotease highly expressed in the testis, MMP-29 |
WO2002088363A2 (en) * | 2001-05-02 | 2002-11-07 | Bayer Healthcare Ag | Regulation of novel human prolyl 4-hydroxylases |
KR100576674B1 (en) * | 2001-06-20 | 2006-05-10 | 제넨테크, 인크. | Compositions and Methods for the Diagnosis and Treatment of Tumor |
JP4702819B2 (en) | 2001-08-01 | 2011-06-15 | 生化学工業株式会社 | Chondroitin synthase |
ATE516042T1 (en) | 2001-09-18 | 2011-07-15 | Genentech Inc | COMPOSITIONS AND METHODS FOR THE TREATMENT AND DIAGNOSIS OF TUMORS |
WO2003033652A2 (en) * | 2001-10-12 | 2003-04-24 | Amgen Inc | TUMOR ENDOTHELIAL MARKER 5α MOLECULES AND USES THEREOF |
PT1531843E (en) | 2001-12-17 | 2011-10-27 | Zymogenetics Inc | Method for treating cervical cancer |
KR20040101502A (en) * | 2002-04-16 | 2004-12-02 | 제넨테크, 인크. | Compositions and Methods for the Diagnosis and Treatment of Tumor |
EP2305710A3 (en) | 2002-06-03 | 2013-05-29 | Genentech, Inc. | Synthetic antibody phage libraries |
EP1546171A4 (en) * | 2002-08-14 | 2008-01-02 | Lg Life Sciences Ltd | Gene families associated with stomach cancer |
EP1562587A4 (en) * | 2002-09-11 | 2006-07-19 | Genentech Inc | Novel compositions and methods for the treatment of immune related diseases |
GB0224436D0 (en) * | 2002-10-21 | 2002-11-27 | Univ Cambridge Tech | Polypetides methods and means |
DE10254601A1 (en) | 2002-11-22 | 2004-06-03 | Ganymed Pharmaceuticals Ag | Gene products differentially expressed in tumors and their use |
EP2308968A1 (en) * | 2002-11-26 | 2011-04-13 | Genentech, Inc. | Compositions and methods for the treatment of immune related diseases |
CA2511522A1 (en) * | 2002-12-26 | 2004-07-15 | Takeda Pharmaceutical Company Limited | Novel proteins and use thereof |
US7452968B2 (en) * | 2003-01-02 | 2008-11-18 | University Of Southern California | Secreted protein factor and cell membrane-bound splice variant |
CN103864934A (en) | 2003-03-24 | 2014-06-18 | 津莫吉尼蒂克斯公司 | Anti-il-22ra antibodies and binding partners and methods of using in inflammation |
EP1633850A4 (en) * | 2003-05-16 | 2007-04-04 | Diadexus Inc | Ovr115 ANTIBODY COMPOSITION AND METHODS OF USE |
AU2004243569A1 (en) * | 2003-05-27 | 2004-12-09 | Laboratoires Serono Sa | Novel IL-8-like polypeptides |
EP1660537A2 (en) | 2003-08-14 | 2006-05-31 | Dyax Corp. | Endotheliase-2 ligands |
WO2005052001A2 (en) | 2003-11-21 | 2005-06-09 | Zymogenetics, Inc. | Anti-il-20 receptor antibodies and binding partners and methods of using in inflammation |
JPWO2005061704A1 (en) * | 2003-12-24 | 2007-07-12 | 武田薬品工業株式会社 | Cancer preventive / therapeutic agent |
US20050186577A1 (en) | 2004-02-20 | 2005-08-25 | Yixin Wang | Breast cancer prognostics |
DE102004024617A1 (en) | 2004-05-18 | 2005-12-29 | Ganymed Pharmaceuticals Ag | Differentially expressed in tumors gene products and their use |
WO2005118631A1 (en) | 2004-06-03 | 2005-12-15 | Takeda Pharmaceutical Company Limited | Novel protein complex and use thereof |
AU2005267722B2 (en) | 2004-08-04 | 2009-10-08 | Amgen Inc. | Antibodies to Dkk-1 |
UA96122C2 (en) | 2004-10-22 | 2011-10-10 | Займодженетикс, Инк | Anti-il-22ra antibody or antigen binding fragment thereof and methods of use |
EP1790664A1 (en) | 2005-11-24 | 2007-05-30 | Ganymed Pharmaceuticals AG | Monoclonal antibodies against claudin-18 for treatment of cancer |
US7655778B2 (en) | 2006-02-28 | 2010-02-02 | Curonix Co., Ltd. | SISP-1, a novel p53 target gene and use thereof |
KR100781753B1 (en) * | 2006-05-24 | 2007-12-04 | 연세대학교 산학협력단 | Methods to Promote Angiogenesis Including DX2 |
KR100954322B1 (en) * | 2006-06-14 | 2010-04-21 | 주식회사 엘지생명과학 | Novel LlFFL313 Gene Associated with Pancreatic Cancer |
WO2008014410A2 (en) * | 2006-07-26 | 2008-01-31 | Zymogenetics, Inc. | Zimlig2 polynucleotides and polypeptides and methods of use |
PT2398902T (en) | 2009-02-20 | 2023-12-05 | Astellas Pharma Inc | METHODS AND COMPOSITIONS FOR DIAGNOSIS AND TREATMENT OF CANCER |
UA117799C2 (en) | 2009-11-11 | 2018-10-10 | Ґанімед Фармасьютікалз Аґ | Monoclonal Antibody Specific to Claudin 6 (CLDN6) |
EP2404936A1 (en) | 2010-07-06 | 2012-01-11 | Ganymed Pharmaceuticals AG | Cancer therapy using CLDN6 target-directed antibodies in vivo |
NZ724296A (en) | 2011-05-13 | 2020-05-29 | Ganymed Pharmaceuticals Ag | Antibodies for treatment of cancer expressing claudin 6 |
WO2013167153A1 (en) | 2012-05-09 | 2013-11-14 | Ganymed Pharmaceuticals Ag | Antibodies useful in cancer diagnosis |
WO2013174404A1 (en) | 2012-05-23 | 2013-11-28 | Ganymed Pharmaceuticals Ag | Combination therapy involving antibodies against claudin 18.2 for treatment of cancer |
RU2678127C2 (en) | 2012-11-13 | 2019-01-23 | Бионтех Аг | Agents for treatment of claudin expressing cancer diseases |
WO2014127785A1 (en) | 2013-02-20 | 2014-08-28 | Ganymed Pharmaceuticals Ag | Combination therapy involving antibodies against claudin 18.2 for treatment of cancer |
WO2014146672A1 (en) | 2013-03-18 | 2014-09-25 | Ganymed Pharmaceuticals Ag | Therapy involving antibodies against claudin 18.2 for treatment of cancer |
CN105474016B (en) * | 2013-07-12 | 2018-09-07 | 勃拉姆斯有限公司 | AUGURIN immunity tests |
WO2015014376A1 (en) | 2013-07-31 | 2015-02-05 | Biontech Ag | Diagnosis and therapy of cancer involving cancer stem cells |
KR20230132603A (en) | 2017-01-11 | 2023-09-15 | 브리스톨-마이어스 스큅 컴퍼니 | Psgl-1 antagonists and uses thereof |
US11603406B2 (en) | 2017-03-14 | 2023-03-14 | Five Prime Therapeutics, Inc. | Antibodies binding to VISTA at acidic pH |
SG11202008593PA (en) | 2018-03-21 | 2020-10-29 | Five Prime Therapeutics Inc | ANTIBODIES BINDING TO VISTA AT ACIDIC pH |
CN112638948A (en) | 2018-07-11 | 2021-04-09 | 戊瑞治疗有限公司 | Antibodies that bind to VISTA at acidic pH |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0972022A2 (en) * | 1997-01-21 | 2000-01-19 | Human Genome Sciences, Inc. | Polynucleotides and polypeptides encoding receptors |
JP2002500037A (en) * | 1998-01-08 | 2002-01-08 | ジェネティックス・インスチチュート・インコーポレーテッド | Secreted proteins and polynucleotides encoding them |
-
1999
- 1999-09-01 CA CA002339043A patent/CA2339043A1/en not_active Abandoned
- 1999-09-01 WO PCT/US1999/020111 patent/WO2000012708A2/en not_active Application Discontinuation
- 1999-09-01 NZ NZ531664A patent/NZ531664A/en not_active IP Right Cessation
- 1999-09-01 NZ NZ510464A patent/NZ510464A/en not_active IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
CA2339043A1 (en) | 2000-03-09 |
WO2000012708A2 (en) | 2000-03-09 |
NZ531664A (en) | 2005-07-29 |
WO2000012708A3 (en) | 2001-10-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
NZ510464A (en) | Further pro polypeptides and sequences thereof | |
US7101970B2 (en) | PRO1570 polypeptides | |
CA2496312A1 (en) | Colon tumour marker pro4799 polypeptides and nucleic acids encoding the same | |
US7196173B2 (en) | PRO1315 polypeptide encoded by a nuclear acid highly expressed in esophageal tumors | |
US7319008B2 (en) | Nucleic acid underexpressed in melanoma | |
US20080286821A1 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
NZ531668A (en) | Pro1788 polypeptides and sequences thereof with homology to leucine-rich repeat proteins | |
NZ540169A (en) | Pro1303 polypeptides and sequences thereof with homology to a lysozyme like protein | |
NZ531665A (en) | PRO1558 polypeptides and sequences thereof with homology to methyltransferase enzymes | |
NZ531663A (en) | PRO1410 polypeptide isolated from a human brain meningioma tissue library polypeptides and sequences thereof | |
NZ531662A (en) | Pro 1293 polypeptides and sequences thereof with homology to an immunoglobulin heavy chain variable region protein | |
NZ531667A (en) | Pro1755 polypeptides and sequences thereof with homology to transmembrane proteins | |
NZ531666A (en) | PRO1555 polypeptides and sequences thereof with homology to transmembrane proteins | |
US7208321B2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same | |
US7423127B2 (en) | Antibodies to a polypeptide encoded by a nucleic acid overexpressed in melanoma | |
US7193057B2 (en) | Antibodies to a polypeptide encoded by a nucleic acid underexpressed in rectal tumor | |
EP1623992A2 (en) | Secreted and transmembrane polypeptides and nucleic acids encoding the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PSEA | Patent sealed | ||
RENW | Renewal (renewal fees accepted) | ||
RENW | Renewal (renewal fees accepted) | ||
RENW | Renewal (renewal fees accepted) | ||
LAPS | Patent lapsed |