CN1177049C - Human protein with function of suppressing cancer cell growth and its coding sequence - Google Patents
Human protein with function of suppressing cancer cell growth and its coding sequenceInfo
- Publication number
- CN1177049C CN1177049C CNB011267275A CN01126727A CN1177049C CN 1177049 C CN1177049 C CN 1177049C CN B011267275 A CNB011267275 A CN B011267275A CN 01126727 A CN01126727 A CN 01126727A CN 1177049 C CN1177049 C CN 1177049C
- Authority
- CN
- China
- Prior art keywords
- ctg
- ccc
- cag
- cca
- cct
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Peptides Or Proteins (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
Abstract
The present invention discloses a novel human protein with the function of inhibiting cancer, polynucleotide for encoding the polypeptide and a method for preparing the polypeptide by a recombinant technology. The present invention also discloses a method of using the polypeptide to treat various diseases, such as cancers. The present invention also discloses an antagonist of the polypeptide and a therapeutic effect thereof. The present invention also discloses the application of the polynucleotide for encoding the human protein with the function of inhibiting cancer.
Description
Technical field
The invention belongs to biological technical field, specifically, the present invention relates to new coding and have the proteic polynucleotide of people of cancer suppressing function and the polypeptide of this polynucleotide encoding.The invention still further relates to the purposes and the preparation of these polynucleotide and polypeptide.
Background technology
The research of people's gene group is international focus at present, removes human chromosome DNA large scale sequencing, outside the method for expressed sequence order-checking (EST), also lacks the screening that begins from function and has the high-throughout method of functional gene.
Cancer is one of principal disease of harm humans health.In order to treat effectively and prophylaxis of tumours, people more and more pay close attention to genetic treatment of tumor at present.Therefore, this area presses for people's albumen and the agonist/inhibitor thereof that development research has cancer suppressing function.
Summary of the invention
The purpose of this invention is to provide the new people's protein polypeptide of a class with cancer suppressing function with and fragment, analogue and derivative.
Another object of the present invention provides the polynucleotide of these polypeptide of coding.
Another object of the present invention provides the method for these polypeptide of production and the purposes of this polypeptide and encoding sequence.
In a first aspect of the present invention, novel isolated protein polypeptide with cancer suppressing function is provided, it comprises the polypeptide of the aminoacid sequence with the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35; Or its conservative property variation polypeptide or its active fragments or its reactive derivative.
Preferably, this polypeptide is the polypeptide with aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35.
In a second aspect of the present invention, a kind of isolating polynucleotide are provided, it comprises a nucleotide sequence, and this nucleotide sequence is shown at least 85% homogeny with a kind of nucleotides sequence that is selected from down group: the polynucleotide of the above-mentioned protein polypeptide with cancer suppressing function of (a) encoding; (b) with polynucleotide (a) complementary polynucleotide.Preferably, the polypeptide of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35.More preferably, the sequence of these polynucleotide is selected from down group: SEQ ID NO:3,6,9,12,15,18,21,24,27,30,33,36 coding region sequence or full length sequence.
In a third aspect of the present invention, the carrier that contains above-mentioned polynucleotide is provided, and has been transformed or host cell of transduceing or the host cell that is directly transformed or transduce by above-mentioned polynucleotide by this carrier.
In a fourth aspect of the present invention, the preparation method who prepares the polypeptide of the protein-active with cancer suppressing function is provided, this method comprises: (a) have under the proteic condition of cancer suppressing function suitable the expression, cultivate the above-mentioned host cell that is transformed or transduce; (b) from culture, isolate the polypeptide of protein-active with cancer suppressing function.
In a fifth aspect of the present invention, provide and above-mentioned protein polypeptide specificity bonded antibody with cancer suppressing function.The nucleic acid molecule that can be used for detecting also is provided, and it contains, and continuous 10 Nucleotide are to full length nucleotide in the above-mentioned polynucleotide, and preferably it contains the about 10-800 of a successive Nucleotide.
In a sixth aspect of the present invention, a kind of pharmaceutical composition is provided, it contains the protein polypeptide and the pharmaceutically acceptable carrier with cancer suppressing function of the present invention of safe and effective amount.These pharmaceutical compositions can be treated illnesss such as cancer and cellular abnormality propagation.
Others of the present invention are because the disclosure of this paper is conspicuous to those skilled in the art.
Embodiment
The present invention adopts large-scale cDNA clone transfection cancer cells, has on the basis of cancer suppressing action in acquisition, proves new gene through order-checking, further obtains full length cDNA clone.DNA transfection evidence, the albumen with cancer suppressing function of the present invention has the effect that suppresses clone's formation to cancer cells (liver cancer cell), and its inhibiting rate is more than 50% or 50%.
As used herein, " isolating " is meant that material separates (if natural substance, primal environment promptly is a natural surroundings) from its primal environment.Do not have separation and purification as polynucleotide under the native state in the active somatic cell and polypeptide, but same polynucleotide or polypeptide as from native state with in other materials that exist separately, then for separation and purification.
As used herein, " isolating albumen or polypeptide with cancer suppressing function " is meant that the protein polypeptide with cancer suppressing function is substantially free of natural relative other albumen, lipid, carbohydrate or other material.Those skilled in the art can have the albumen of cancer suppressing function with the purified technology of protein purifying of standard.Basically pure polypeptide can produce single master tape on non-reduced polyacrylamide gel.
Polypeptide of the present invention can be recombinant polypeptide, natural polypeptides, synthetic polypeptide, preferred recombinant polypeptide.Polypeptide of the present invention can be the product of natural purifying, or the product of chemosynthesis, or uses recombinant technology to produce from protokaryon or eucaryon host (for example, bacterium, yeast, higher plant, insect and mammalian cell).The host used according to the recombinant production scheme, polypeptide of the present invention can be glycosylated, maybe can be nonglycosylated.Polypeptide of the present invention also can comprise or not comprise initial methionine residues.
The present invention also comprises the proteic fragment of the people with cancer suppressing function, derivative and analogue.As used herein, term " fragment ", " derivative " are meant with " analogue " and keep natural identical biological function or the active polypeptide of people's albumen with cancer suppressing function of the present invention basically.Polypeptide fragment of the present invention, derivative or analogue can be that (i) has one or more conservative or substituted polypeptide of non-conservation amino-acid residue (preferred conservative amino acid residue), and the amino-acid residue of such replacement can be also can not encoded by genetic code, or (ii) in one or more amino-acid residues, has a polypeptide of substituted radical, or (iii) mature polypeptide and another compound (such as the compound that prolongs the polypeptide transformation period, polyoxyethylene glycol for example) merge formed polypeptide, or (iv) additional aminoacid sequence is fused to this peptide sequence and the polypeptide that forms (as leader sequence or secretion sequence or be used for the sequence or the proteinogen sequence of this polypeptide of purifying).According to the instruction of this paper, these fragments, derivative and analogue belong to the known scope of those skilled in the art.
Polynucleotide of the present invention can be dna form or rna form.Dna form comprises the DNA of cDNA, genomic dna or synthetic.DNA can be strand or double-stranded.DNA can be coding strand or noncoding strand.Be example with PP10187 albumen (in this application, its clone numbering is adopted in proteinic name), the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:3 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that in the present invention coding has the protein of SEQ ID NO:2, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ IDNO:3.Be example with PP12104 albumen (in this application, its clone numbering is adopted in proteinic name) again, the coding region sequence of encoding mature polypeptide can be identical with the coding region sequence shown in the SEQ ID NO:6 or the varient of degeneracy.As used herein, " varient of degeneracy " is meant that in the present invention coding has the protein of SEQ ID NO:5, but with the differentiated nucleotide sequence of coding region sequence shown in the SEQ ID NO:6.Have the albumen of cancer suppressing function for of the present invention other, can the rest may be inferred.
The polynucleotide of encoding mature polypeptide comprise: the encoding sequence of an encoding mature polypeptide; The encoding sequence of mature polypeptide and various additional code sequence; Encoding sequence of mature polypeptide (with optional additional code sequence) and non-coding sequence.
Term " polynucleotide of coded polypeptide " can be the polynucleotide that comprise this polypeptide of encoding, and also can be the polynucleotide that also comprise additional code and/or non-coding sequence.
The invention still further relates to the varient of above-mentioned polynucleotide, its coding has the polypeptide of identical aminoacid sequence or fragment, analogue and the derivative of polypeptide with the present invention.The varient of these polynucleotide can be the allelic variant of natural generation or the varient that non-natural takes place.These nucleotide diversity bodies comprise and replace varient, deletion mutation body and insert varient.As known in the art, allelic variant is the replacement form of polynucleotide, and it may be replacement, disappearance or the insertion of one or more Nucleotide, but can be from not changing the function of its encoded polypeptides in fact.
The invention still further relates to and above-mentioned sequence hybridization and two sequences between have at least 50%, preferably at least 70%, the polynucleotide of at least 80% homogeny more preferably.The present invention be more particularly directed under stringent condition and the interfertile polynucleotide of polynucleotide of the present invention.In the present invention, " stringent condition " is meant: (1) than hybridization under low ionic strength and the comparatively high temps and wash-out, as 0.2 * SSC, and 0.1%SDS, 60 ℃; Or (2) hybridization the time is added with denaturing agent, as 50% (v/v) methane amide, 0.1% calf serum/0.1%Ficoll, 42 ℃ etc.; Or (3) only at the homogeny between the two sequences at least more than 95%, be more preferably 97% and just hybridize when above.And the polypeptide of interfertile polynucleotide encoding has identical biological function (is example with PP10187 albumen) and activity with the mature polypeptide shown in the SEQ IDNO:2.
The invention still further relates to nucleic acid fragment with above-mentioned sequence hybridization.As used herein, the length of " nucleic acid fragment " contains 15 Nucleotide at least, better is at least 30 Nucleotide, is more preferably at least 50 Nucleotide, preferably more than at least 100 Nucleotide.The amplification technique (as PCR) that nucleic acid fragment can be used for nucleic acid has the proteic polynucleotide of cancer suppressing function to determine and/or to separate to encode.
Polypeptide among the present invention and polynucleotide preferably provide with isolating form, more preferably are purified to homogeneous.
Dna sequence dna of the present invention can obtain with several method.For example, with hybridization technique DNA isolation well known in the art.These technology including, but not limited to: 1) with probe and genome or the hybridization of cDNA library to detect homology nucleotide sequence and 2) antibody screening of expression library to be to detect the dna fragmentation of the clone with common structure feature.
The proteic specific DNA fragment sequence that coding has cancer suppressing function produces also and can obtain with following method: 1) separate double chain DNA sequence from genomic dna; 2) the chemical synthesising DNA sequence is to obtain the double-stranded DNA of required polypeptide.
In the above-mentioned method of mentioning, isolation of genomic DNA is least commonly used.When the whole aminoacid sequence of the polypeptide product of needs was known, the direct chemical of dna sequence dna is synthetic to be the method for often selecting for use.When if required amino acid whose whole sequence is not known, the direct chemical of dna sequence dna is synthetic to be impossible, and the method for selecting for use is the separation of cDNA sequence.The standard method that separates interested cDNA is from the donorcells separating mRNA of this gene of high expression level and carries out reverse transcription, forms plasmid or phage cDNA library.Extract the existing multiple proven technique of method of mRNA, test kit also can obtain (Qiagene) from commercial channels.And the construction cDNA library also is usual method (Sambrook, et al., Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory.New York, 1989).Also can obtain the cDNA library of commercial offers, as the different cDNA library of Clontech company.When being used in combination the polymeric enzyme reaction technology, even few expression product also can be cloned.
Available ordinary method is screened gene of the present invention from these cDNA libraries.These methods include, but is not limited to: (1) DNA-DNA or DNA-RNA hybridization; (2) function of marker gene occurs or forfeiture; (3) mensuration has the level of the proteic transcript of cancer suppressing function; (4), detect the protein product of genetic expression by immunological technique or mensuration biologic activity.Aforesaid method can singly be used, but also several different methods combined utilization.
In (1) kind method, hybridizing used probe is and any a part of homology of polynucleotide of the present invention that at least 15 Nucleotide of its length better are at least 30 Nucleotide, are more preferably at least 50 Nucleotide, preferably at least 100 Nucleotide.In addition, the length of probe within 2kb, preferably is within the 1kb usually.Probe used herein is the dna sequence dna of chemosynthesis on the basis of gene DNA sequence information of the present invention normally.Gene of the present invention itself or fragment are certainly as probe.The mark of dna probe can be used radio isotope, fluorescein or enzyme (as alkaline phosphatase) etc.
In (4) kind method, detect the protein product of protein gene expression and can use immunological technique such as Western blotting, radioimmunoprecipitation, enzyme-linked immunosorbent assay (ELISA) etc. with cancer suppressing function.
Use method (Saiki, the et al.Science 1985 of round pcr DNA amplification/RNA; 230:1350-1354) be optimized for acquisition gene of the present invention.When particularly being difficult to obtain the cDNA of total length from the library, can preferably use RACE method (the terminal rapid amplifying method of RACE-cDNA), the primer that is used for PCR can suitably be selected according to sequence information of the present invention disclosed herein, and available ordinary method is synthetic.Available ordinary method is as the DNA/RNA fragment by gel electrophoresis separation and purifying amplification.
The gene of the present invention that obtains as mentioned above, perhaps the available ordinary method of mensuration of the nucleotide sequence of various dna fragmentations etc. such as dideoxy chain termination (Sanger et al.PNAS, 1977,74:5463-5467).This class nucleotide sequencing is available commercial sequencing kit etc. also.In order to obtain the cDNA sequence of total length, order-checking need be carried out repeatedly.Sometimes need to measure a plurality of clones' cDNA sequence, just can be spliced into the cDNA sequence of total length.
The present invention also relates to comprise the carrier of polynucleotide of the present invention, and the host cell that produces through genetically engineered with carrier of the present invention or albumen coded sequence with cancer suppressing function, and the method that produces polypeptide of the present invention through recombinant technology.
Recombinant DNA technology (Science, 1984 by routine; 224:1431), can utilize polymerized nucleoside acid sequence of the present invention to can be used to express or produce the protein polypeptide with cancer suppressing function of reorganization.In general following steps are arranged:
(1). have the proteic polynucleotide of people (or varient) of cancer suppressing function with coding of the present invention, or transform or the transduction proper host cell with the recombinant expression vector that contains these polynucleotide;
(2). the host cell of in suitable medium, cultivating;
(3). separation, protein purification from substratum or cell.
Among the present invention, the people's albumen polynucleotide sequence with cancer suppressing function can be inserted in the recombinant expression vector.Term " recombinant expression vector " refers to that bacterial plasmid well known in the art, phage, yeast plasmid, vegetable cell virus, mammalian cell virus are as adenovirus, retrovirus or other carriers.The carrier of Shi Yonging includes but not limited in the present invention: and the expression vector based on T7 of in bacterium, expressing (Rosenberg, et al.Gene, 1987,56:125); The pMSXND expression vector of in mammalian cell, expressing (Lee and Nathans, J Bio Chem.263:3521,1988) and at the carrier that derives from baculovirus of expressed in insect cells.In a word, as long as can duplicate in host and stablize, any plasmid and carrier can be used.A key character of expression vector is to contain replication orgin, promotor, marker gene and translation controlling elements usually.
Method well-known to those having ordinary skill in the art can be used to make up and contains people's encoding histone dna sequence dna with cancer suppressing function and suitable transcribing/the translate expression vector of control signal.These methods comprise (Sambroook, et al.) such as extracorporeal recombinant DNA technology, DNA synthetic technology, the interior recombinant technologys of body.Described dna sequence dna can effectively be connected on the suitable promotor in the expression vector, and is synthetic to instruct mRNA.The representative example of these promotors has: colibacillary lac or trp promotor; Lambda particles phage P
LPromotor; Eukaryotic promoter comprises LTRs and some other known may command gene expression promoter in protokaryon or eukaryotic cell or its virus of CMV immediate early promoter, early stage and late period SV40 promotor, retrovirus.Expression vector also comprises ribosome bind site and the transcription terminator that translation initiation is used.
In addition, expression vector preferably comprises one or more selected markers, to be provided for selecting the phenotypic character of transformed host cells, cultivate Tetrahydrofolate dehydrogenase, neomycin resistance and the green fluorescent protein (GFP) of usefulness as eukaryotic cell, or be used for colibacillary tsiklomitsin or amicillin resistance.
Comprise the carrier of above-mentioned suitable dna sequence dna and suitable promotor or control sequence, can be used to transform appropriate host cell, so that it can marking protein.
Host cell can be a prokaryotic cell prokaryocyte, as bacterial cell; Or eukaryotic cell such as low, as yeast cell; Or higher eucaryotic cells, as mammalian cell.Representative example has: intestinal bacteria, streptomyces; The bacterial cell of Salmonella typhimurium; Fungal cell such as yeast; Vegetable cell; The insect cell of fruit bat S2 or Sf9; The zooblast of CHO, COS or Bowes melanoma cells etc.
When polynucleotide of the present invention are expressed in higher eucaryotic cells, be enhanced if will make to transcribe when in carrier, inserting enhancer sequence.Enhanser is the cis acting factor of DNA, and nearly 10 to 300 base pairs act on promotor transcribing with enhancing gene usually.Can for example be included in the SV40 enhanser of 100 to 270 base pairs of replication origin side in late period one, at the polyoma enhanser of replication origin side in late period one and adenovirus enhanser etc.
Persons skilled in the art all know how to select appropriate carriers, promotor, enhanser and host cell.
Can carry out with routine techniques well known to those skilled in the art with the recombinant DNA transformed host cell.When the host was prokaryotic organism such as intestinal bacteria, the competent cell that can absorb DNA can be used CaCl in exponential growth after date results
2Method is handled, and used step is well-known in this area.Alternative is to use MgCl
2If desired, transforming also the method for available electroporation carries out.When the host is an eukaryote, can select following DNA transfection method for use: coprecipitation of calcium phosphate method, conventional mechanical method such as microinjection, electroporation, liposome packing etc.
The transformant that obtains can be cultivated with ordinary method, expresses the polypeptide of coded by said gene of the present invention.According to used host cell, used substratum can be selected from various conventional substratum in the cultivation.Under the condition that is suitable for the host cell growth, cultivate.After host cell grows into suitable cell density, induce the promotor of selection with suitable method (as temperature transition or chemical induction), cell is cultivated for some time again.
Recombinant polypeptide in the above methods can wrap by in cell, extracellular or on cytolemma, express or be secreted into the extracellular.If desired, can utilize its physics, the separating by various separation methods with other characteristic and the albumen of purification of Recombinant of chemistry.These methods are well-known to those skilled in the art.The example of these methods includes, but are not limited to: conventional renaturation handles, with protein precipitant handle (salt analysis method), centrifugal, the broken bacterium of infiltration, superly handle, the combination of super centrifugal, sieve chromatography (gel-filtration), adsorption chromatography, ion exchange chromatography, high performance liquid chromatography (HPLC) and other various liquid chromatography (LC) technology and these methods.
The people's albumen or the polypeptide with cancer suppressing function of reorganization are of use in many ways.These purposes include, but is not limited to: directly have the disease (as cancer) due to the low or forfeiture of the protein function of cancer suppressing function as pharmacological agent and be used to screen and promote or antagonism has antibody, polypeptide or other part of the protein function of cancer suppressing function.For example, antibody can be used for activating or suppressing to have the proteic function of people of cancer suppressing function.The people's protein screening peptide library that has a cancer suppressing function with the reorganization of expressing can be used for seeking the peptide molecule that can suppress or stimulate the people's protein function with cancer suppressing function of therapeutic value.
The present invention also provides screening of medicaments to improve (agonist) or check the method that (antagonist) has the proteic medicament of people of cancer suppressing function to identify.Agonist improves the biological function such as stimulate cellular proliferation of the people's albumen with cancer suppressing function, and antagonist prevention disorder such as the various cancer relevant with cell hyperproliferation with treatment.For example, can be in the presence of medicine, the proteic film preparation of people that mammalian cell or expression is had cancer suppressing function is cultivated with the people's albumen with cancer suppressing function of mark.Measure the medicine raising then or check this interactional ability.
The proteic antagonist of people with cancer suppressing function comprises antibody, compound, acceptor disappearance thing and the analogue etc. that filter out.Described antagonist can and be eliminated its function with the people's protein binding with cancer suppressing function, or suppresses to have the proteic generation of people of cancer suppressing function, or combines with the avtive spot of polypeptide and to make polypeptide can not bring into play biological function.The proteic antagonist of people with cancer suppressing function can be used for therepic use.
In screening during as the compound of antagonist, albumen of the present invention can be added during bioanalysis measures, determine by measuring albumen and the interaction between its acceptor that compounds affect has cancer suppressing function whether compound is antagonist.With the same quadrat method of above-mentioned SCREENED COMPOUND, can filter out the acceptor disappearance thing and the analogue of antagonist action.
Polypeptide of the present invention can be directly used in disease treatment, for example, and various malignant tumours and cellular abnormality propagation etc.
Polypeptide of the present invention, and fragment, derivative, analogue or their cell can be used as antigen to produce antibody.These antibody can be polyclone or monoclonal antibody.Polyclonal antibody can obtain by the method with this polypeptide direct injection animal.The technology of preparation monoclonal antibody comprises hybridoma technology, three knurl technology, people B-quadroma technology, EBV-hybridoma technology etc.
Can be with polypeptide of the present invention and antagonist and suitable pharmaceutical carrier combination back use.These carriers can be water, glucose, ethanol, salt, damping fluid, glycerine and their combination.Composition comprises the polypeptide or the antagonist of safe and effective amount and carrier and the vehicle that does not influence effect of drugs.These compositions can be used as medicine and are used for disease treatment.
The present invention also provides medicine box or the test kit that contains one or more containers, and one or more medicinal compositions compositions of the present invention are housed in the container.With these containers, can have by the given indicative prompting of government authorities of making, using or selling medicine or biological products, the government authorities that this prompting reflects production, uses or sells permits it to use on human body.In addition, polypeptide of the present invention can be used in combination with other treatment compound.
Pharmaceutical composition can be with mode administration easily, as by in part, intravenously, intraperitoneal, intramuscular, subcutaneous, the nose or the route of administration of intracutaneous.Albumen with cancer suppressing function comes administration with the amount that treats and/or prevents concrete indication effectively.The proteic amount with cancer suppressing function and the dosage range that are applied to the patient will depend on many factors, as administering mode, person's to be treated healthiness condition and diagnostician's judgement.
The proteic polynucleotide of people with cancer suppressing function also can be used for multiple therapeutic purpose.Gene therapy technology can be used for treating since have that the proteic nothing of cancer suppressing function is expressed or the proteic expression with cancer suppressing function of unusual/non-activity due to cell proliferation, growth or metabolic disturbance.The gene therapy vector of reorganization can be used for treating the protein expression with cancer suppressing function or the disease of active caused by abnormal.Deriving from the expression vector of virus such as protein gene that retrovirus, adenovirus, adeno-associated virus (AAV), hsv, parvovirus etc. can be used for having cancer suppressing function is transferred in the cell.The method that structure carries the recombinant viral vector of the protein gene with cancer suppressing function be found in existing document (Sambrook, etal.).The people protein gene of reorganization with cancer suppressing function can be packaged in the liposome and be transferred in the cell in addition.
Suppress to have cancer suppressing function people's protein mRNA oligonucleotide (comprising sense-rna and DNA) and ribozyme also within the scope of the invention.Ribozyme is the enzyme sample RNA molecule that a kind of energy specificity is decomposed specific RNA, and its mechanism of action is to carry out the endonuclease effect after ribozyme molecule and the hybridization of complementary target RNA-specific.The RNA of antisense and DNA and ribozyme can obtain with existing any RNA or DNA synthetic technology, as the technology widespread use of solid phase phosphoamide chemical synthesis synthetic oligonucleotide.Antisense rna molecule can be transcribed acquisition by the dna sequence dna of this RNA that encodes in external or body.This dna sequence dna has been incorporated into the downstream of rna polymerase promoter of carrier.In order to increase the stability of nucleic acid molecule, available several different methods is modified it, and as increasing the sequence length of both sides, the connection between the ribonucleoside is used phosphoric acid thioester bond or peptide bond but not phosphodiester bond.
Polynucleotide imports tissue or intracellular method comprises: directly be injected into polynucleotide in the in-vivo tissue; Or external by carrier (as virus, phage or plasmid etc.) earlier with the polynucleotide transfered cell in, again cell is transplanted in the body etc.
Polypeptide of the present invention also can be used as the peptide spectrum analysis, for example, the polypeptide available physical, chemistry or enzyme carry out the specificity cutting, and carry out the two-dimentional or three-dimensional gel electrophoresis analysis of one dimension.
The present invention also provides the antibody at the people's proteantigen determinant with cancer suppressing function.These antibody include, but is not limited to: the fragment that polyclonal antibody, monoclonal antibody, chimeric antibody, single-chain antibody, Fab fragment and Fab expression library produce.These antibody can prepare with ordinary method.The anti-proteic antibody of people with cancer suppressing function can be used in the immunohistochemistry technology, detects the people's albumen with cancer suppressing function in the biopsy specimen.
With the also available labelled with radioisotope of the protein bound monoclonal antibody of the people with cancer suppressing function, inject in the body and can follow the tracks of its position and distribution.Antibody among the present invention can be used for treating or prevents and the relevant disease of people's albumen with cancer suppressing function.The antibody that gives suitable dosage can stimulate or block proteic generation of the people with cancer suppressing function or activity.
Antibody also can be used for designing the immunotoxin at a certain privileged sites in the body.As have cancer suppressing function people's albumen high-affinity monoclonal antibody can with bacterium or plant poison (as diphtheria toxin, ricin, abrine etc.) covalent attachment.
Available people's albumen or the polypeptide immune animal of the production of polyclonal antibody with cancer suppressing function, as rabbit, mouse, rat etc.Multiple adjuvant can be used for the enhancing immunity reaction, includes but not limited to freund's adjuvant etc.
Have cancer suppressing function people's protein monoclonal antibody can with hybridoma technology production (Kohler and Milstein.Nature, 1975,256:495-497).With the variable region bonded chimeric antibody in human constant region and inhuman source can with existing technology production (Morrison et al, PNAS, 1985,81:6851).And the technology of existing manufacture order chain antibody (U.S.PatNo.4946778) also can be used for producing the anti-proteic single-chain antibody of people with cancer suppressing function.
Can be incorporated into the rondom polypeptide storehouse that solid formation forms by the various amino acid that may make up by screening with the protein bound peptide molecule of the present invention obtains.During screening, must carry out mark to people's protein molecular with cancer suppressing function.
The invention still further relates to quantitatively and detection and localization has the diagnostic testing process of people's protein level of cancer suppressing function.These tests are known in the art, and comprise that FISH measures and radioimmunoassay.The people's protein level that is detected in the test with cancer suppressing function, the disease that can have the importance of people's albumen in various diseases of cancer suppressing function with laying down a definition and be used to diagnose albumen to work with cancer suppressing function.
Proteic polynucleotide with cancer suppressing function can be used for having the diagnosis and the treatment of the protein related diseases of cancer suppressing function.Aspect diagnosis, the proteic polynucleotide with cancer suppressing function can be used for detecting have cancer suppressing function proteic expression whether or under morbid state, have an abnormal exprssion of cancer suppressing function.As the protein D NA sequence with cancer suppressing function can be used for the hybridization of biopsy specimen is had with judgement the proteic abnormal expression of cancer suppressing function.Hybridization technique comprises the Southern blotting, Northern blotting, in situ hybridization etc.These technological methods all are disclosed mature technologies, and relevant test kit all can obtain from commercial channels.Part or all of polynucleotide of the present invention can be used as probe stationary on microarray (Microarray) or DNA chip (being called " gene chip " again), is used for analyzing the differential expression analysis and the gene diagnosis of tissue gene.Carry out RNA-polymerase chain reaction (RT-PCR) amplification in vitro with the special primer of the albumen with cancer suppressing function and also can detect proteic transcription product with cancer suppressing function.
The sudden change that detection has the protein gene of cancer suppressing function also can be used for diagnosing the relevant disease of albumen with cancer suppressing function.Form with protein mutation of cancer suppressing function comprises that to have point mutation that the protein D NA sequence of cancer suppressing function compares, transposition, disappearance, reorganization and other any unusual etc. with normal wild type.Available existing technology such as Southern blotting, dna sequence analysis, PCR and in situ hybridization detect sudden change.In addition, sudden change might influence proteic expression, therefore can judge indirectly that with Northern blotting, Western blotting gene has or not sudden change.
Sequence of the present invention identifies it also is valuable to karyomit(e).These sequences can be specifically at certain bar human chromosome particular location and and can with its hybridization.At present, need to identify the concrete site of each gene on the karyomit(e).Yet have only chromosomal marker thing seldom to can be used for the marker chromosomes position now based on actual sequence data (repetition polymorphism).For these sequences are associated with disease related gene.The first step is positioned dna sequence dna of the present invention on the karyomit(e) exactly.
In brief, prepare PCR primer (preferred 15-35bp), sequence can be positioned on the karyomit(e) according to cDNA.Then, these primers are used for the somatocyte hybrid cell that the PCR screening contains each bar human chromosome.Have only those hybrid cells that contain corresponding to the people's gene of primer can produce the fragment of amplification.
The PCR localization method of somatocyte hybrid cell is that DNA is navigated to concrete chromosomal quick method.Use Oligonucleolide primers of the present invention,, can utilize one group to realize inferior location from specific chromosomal fragment or a large amount of genomic clone by similar approach.Other the similar strategy that can be used for chromosomal localization comprises in situ hybridization, uses the karyomit(e) prescreen and the hybridization preliminary election of the airflow classification of mark, thereby makes up the special cDNA storehouse of karyomit(e).
The cDNA clone is carried out fluorescence in situ hybridization (FISH) with Metaphase Chromosome, can in a step, accurately carry out chromosomal localization.The summary of this technology is referring to Verma etc., Human Chromosomes:a Manual of BasicTechniques, Pergamon Press, New York (1988).
In case sequence is positioned to chromosome position accurately, the physical location of this sequence on karyomit(e) just can be associated with the gene map data.These data for example are found in, V.Mckusick, Mendelian Inheritance in Man (can by with the online acquisition of Johns Hopkins University Welch Medical Library).Can pass through linkage analysis then, determine gene and navigated to relation between the disease on the chromosomal region already.
Then, need to measure ill and not cDNA between diseased individuals or genome sequence difference.If observe certain sudden change in some or all of diseased individuals, and this sudden change is not observed in any normal individual, then this sudden change may be the cause of disease of disease.More ill and diseased individuals not is usually directed at first seek the variation of structure in the karyomit(e), as from the horizontal visible of karyomit(e) or use based on detectable disappearance of the PCR of cDNA sequence or transposition.
Pyrenoids thuja acid full length sequence or its fragment with cancer suppressing function of the present invention can obtain with the method for pcr amplification method, recombination method or synthetic usually.For the pcr amplification method, can be disclosed according to the present invention about nucleotide sequence, especially open reading frame sequence designs primer, and with commercially available cDNA storehouse or by the prepared cDNA storehouse of ordinary method well known by persons skilled in the art as template, amplification and must relevant sequence.When sequence is longer, usually needs to carry out twice or pcr amplification repeatedly, and then the fragment that each time amplifies is stitched together by proper order.
In case obtained relevant sequence, just can obtain relevant sequence in large quantity with recombination method.This normally is cloned into carrier with it, changes cell again over to, separates obtaining relevant sequence then from the host cell after the propagation by ordinary method.
In addition, also the method for available synthetic is synthesized relevant sequence, especially fragment length more in short-term.Usually, by first synthetic a plurality of small segments, and then connect and to obtain the very long fragment of sequence.
At present, can be fully come the dna sequence dna of code book invention albumen (or its fragment, or derivatives thereof) by chemosynthesis.This dna sequence dna can be introduced then in the various dna moleculars (as carrier) and cell in this area.In addition, also can will suddenly change and introduce in the protein sequence of the present invention by chemosynthesis.
In addition, because the albumen with cancer suppressing function of the present invention has the natural acid sequence that is derived from the people, therefore, compare with the albumen of the same clan that derives from other species, estimate to have higher active and/or lower side effect (for example in the intravital immunogenicity of people lower or do not have) being applied to man-hour.
Below in conjunction with specific embodiment, further set forth the present invention.Should be understood that these embodiment only to be used to the present invention is described and be not used in and limit the scope of the invention.The experimental technique of unreceipted actual conditions in the following example, usually according to people such as normal condition such as Sambrook, molecular cloning: laboratory manual (New York:Cold Spring Harbor LaboratoryPress, 1989) condition described in, or the condition of advising according to manufacturer.
The acquisition of embodiment 1:cDNA gene and the restraining effect that the cancer cells clone is formed
PP10187, PP12104, PP12585, PP14397, PP14434 and PP14630 obtain by making up the human placenta cDNA library with ordinary method; FP248, FP291, FP633, FP782, FP793 and FP944 obtain by make up human fetal cDNA library with ordinary method.Get the placenta tissue (PP clone) or the fetal tissue (FP clone) at 3,6,9 monthly ages, (GIBCO BRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.Make up the cDNA library of above-mentioned mRNA with pCMV-script TMXR cDNA library construction test kit (Stratagene company).Wherein ThermoScript II is used MMLV-RT-Superscript II (GIBCO BRL) instead, and reverse transcription reaction carries out at 42 ℃.Transform XL 10-Gold recipient cell, obtained 1 * 10
6The cDNA library of cfu/ μ g cDNA titre.The first round is picking cDNA clone at random, is probe with high abundance cDNA clone with the cDNA clone who has proved cancer inhibitor cell growth function thereafter, screening by hybridization cDNA library, weak positive and negative clone of picking.With Qiagen 96 orifice plate plasmid extraction test kits, carry out the extraction of plasmid DNA by shop instruction.Plasmid DNA and empty carrier transfection simultaneously hepatoma cell line 7721.After the 100ng DNA alcohol precipitation drying, add 6 μ l H
2Transfection is treated in the O dissolving.Add 0.74 μ l liposome and 9.3 μ l serum-free mediums in every part of DNA sample, behind the mixing, room temperature was placed 10 minutes.Add 150 μ l serum-free mediums in every pipe, divide equally and add 3 holes and grow in 7721 cells of 96 orifice plates, placed 2 hours for 37 ℃, every hole adds 50 μ l serum-free mediums again, 37 ℃ 24 hours.Every hole is changed 100 μ l and is trained liquid entirely, 37 ℃ 24 hours, change the full training liquid 100 μ l that contain G418,37 ℃ 24-48 hour, the limit is observed, the training liquid that G418 concentration does not wait is changed on the limit.After about 2-3 time, there is the clone to form up to the microscopy cell, counting.Find that above-mentioned clone has anticancer clone formation effect, the result is as shown in the table.
CDNA clone's transfectional cell (7721) clone formation situation
CDNA clones title | CDNA clones number (three repetitions) | Empty carrier clone number (three repetitions) | ||||
PP10187 | 12 | 9 | 7 | 15 | 18 | 13 |
PP12104 | 5 | 8 | 7 | 15 | 18 | 13 |
PP12585 | 8 | 12 | 10 | 15 | 18 | 13 |
PP14397 | 8 | 17 | 10 | 15 | 18 | 13 |
PP14434 | 2 | 6 | 3 | 15 | 18 | 13 |
PP14630 | 12 | 12 | 3 | 15 | 18 | 13 |
FP248 | 3 | 1 | 2 | 15 | 18 | 13 |
FP291 | 2 | 5 | 7 | 15 | 18 | 13 |
FP633 | 8 | 7 | 4 | 15 | 18 | 13 |
FP782 | 8 | 10 | 2 | 15 | 18 | 13 |
FP793 | 8 | 12 | 8 | 15 | 18 | 13 |
FP944 | 7 | 9 | 8 | 15 | 18 | 13 |
The cDNA clone is adopted two deoxidation cessation method, on the ABI377 automatic dna sequencer, measure the nucleotide sequence of the nearly 500bp of one end.After the analysis, be defined as novel gene cloning, carry out the other end order-checking, do not obtain full length cDNA sequence yet, the design primer checks order once more, up to obtaining full length sequence (SEQ ID NO:1,4,7,10,13,16,19,22,25,28,31,34).
Embodiment 2: PCR obtains full-length gene from placenta or fetus cDNA:
Get the placenta tissue (PP clone) or the fetal tissue (FP clone) at 3,6,9 monthly ages, (GIBCOBRL company) extracts total RNA by manufacturer's specification sheets with Trizol reagent, extracts mRNA with the mRNA test kit (Pharmacia company) of purifying.With MMLV-RT-Superscript II (GIBCO BRL), ThermoScript II is carried out reverse transcription reaction at 42 ℃, obtains placenta or fetus cDNA.Utilize the special primer (as shown in the table) of each gene, by 97 ℃ of 3 ' 1 circulation.94 ℃ 30 " 60 ℃ 30 " 72 ℃ of 1 ' 35 circulation, pcr amplification is carried out in 72 ℃ of 10 ' 1 circulation, and acquisition contains the amplified production of each protein gene of complete open reading frame sequence.Amplified production is through sequence verification, and the sequence that records with embodiment 1 conforms to, and changes amplified production over to host cell with routine techniques subsequently, obtains recombinant protein (SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35).
Gene specific primer
Clone's title | Special primer 1 (5 ' → 3 ') | Special primer 2 (5 ' → 3 ') |
PP10187 | (69)GGGTCGAGCATGTAGCGG | AACGGAACAGGTGGTCCC(1706) |
PP12104 | (173)GGCCTTCTCAGTCATCATCC | GGTTTCTCCTCGTTCCGT(700) |
PP12585 | (97)CGTGGCTTCCGCTTCTAT | GGTTGGTCTGGGAGTCGT(1842) |
PP14397 | (41)CTGGCGGCTGTGGCTCTA | GTGTAAAGGACGAACGGT(1904) |
PP14434 | (283)TTACTACACCACGGAAGGC | GACAGAACGACTCGGGAG(3445) |
PP14630 | (1592)GGGGTCAAGGCGGAGAAGA | GGGAGGTGGAAACGGGTT(3065) |
FP248 | (300)GGTCTTGATGTGGCAGGAGT | GATTGACCCGAAACAGGTGG(2549) |
FP291 | (38)TTAGGAGGTCTGAAAGCAA | AAGGAGGAATAGGGAGGA(2177) |
FP633 | (19)TCACCTGGATTTTATGGC | AAGTAGGAACAGTACGAG(2271) |
FP782 | (163)GAATCCTCGGGGCGGTCA | CGCTGACGGAACCGGAGA(2418) |
FP793 | (1)GCCCTCCCTGACATTGCT | AGACCCGCTAAGACTCGG(1533) |
FP944 | (38)ATAGCTCCTCCAGTCCAC | TCTTCTTTACGGACATCC(2115) |
Embodiment 3:cDNA cloned sequence is analyzed
1.PP10187
A: nucleotide sequence (SEQ ID NO:1) length: 1902 bases
1 GCGCGGCCGG GGCGGAGACT TCGGGCCCGG CTGGCGGGCG GCGCCGGGAG CGCGGGGGCG
61 GCGGGCCCGG GTCGAGCATG TAGCGGCTGC TGGCGGCGGG GCTCCCGGGG CGGGCCGGGC
121 GGGCCGCGGG AGCCGCACGC GGCGATATGG AAGAGGAGGG CAAGAAGGGC AAGAAGCCTG
181 GAATTGTCTC GCCATTTAAA CGAGTATTCC TAAAAGGTGA AAAGAGTAGA GATAAGAAAG
241 CCCATGAGAA GGTGACAGAG AGGCGCCCTC TGCACACTGT GGTGTTGTCA TTGCCTGAGC
301 GCGTCGAGCC AGACAGACTG CTGAGCGACT ATATTGAGAA GGAGGTAAAG TATTTAGGTC
361 AGTTAACGTC CATACCAGGA TACCTGAATC CCTCCAGTAG GACTGAAATC CTGCATTTCA
421 TAGACAATGC AAAGAGAGCC CACCAGCTTC CGGGACACTT GACTCAGGAG CACGATGCTG
481 TGCTCAGCCT GTCTGCGTAC AACGTCAAGC TGGCCTGGAG GGACGGGGAG GATATCATCC
541 TCAGGGTGCC CATCCATGAC ATCGCCGCCG TCTCCTATGT TCGGGATGAC GCTGCACACC
601 TGGTGGTCCT GAAGACAGCC CAGGACCCAG GGATCTCCCC CAGCCAGAGT CTGTGTGCGG
661 AAAGTTCCAG AGGCCTCAGT GCAGGCTCCC TGTCGGAGAG TGCAGTTGGG CCCGTGGAGG
721 CATGCTGCCT GGTCATCCTG GCTGCAGAGA GCAAGGTCGC TGCGGAGGAG CTTTGCTGTC
781 TGCTAGGCCA GGTCTTCCAG GTTGTTTACA CGGAGTCCAC CATCGACTTT CTGGACAGAG
841 CGATATTTGA TGGGGCCTCT ACCCCGACCC ACCACCTGTC CCTGCACAGC GATGACTCTT
901 CTACAAAAGT GGACATTAAG GAGACCTACG AGGTGGAAGC CAGCACTTTC TGCTTCCCTG
961 AATCTGTGGA TGTGGGTGGT GCATCACCCC ACAGCAAGAC CATCAGTGAG AGCGAGCTGA
1021 GCGCCAGCGC CACTGAGCTG CTGCAGGACT ACATGCTGAC GCTGCGCACC AAGCTGTCAT
1081 CACAGGAGAT CCAGCAGTTT GCAGCACTGC TGCACGAGTA CCGCAATGGG GCCTCTATCC
1141 ACGAGTTCTG CATCAACCTG CGGCAGCTCT ACGGGGACAG CCGCAAGTTC CTGCTGCTTG
1201 GTCTGAGGCC CTTCATCCCT GAGAAGGACA GCCAGCACTT CGAGAACTTC CTGGAGACCA
1261 TTGGCGTGAA GGATGGCCGC GGCATCATCA CTGACAGCTT TGGCAGGCAC CGGCGGGCCC
1321 TGAGCACCAC ATCCAGTTCC ACCACCAATG GGAACAGGGC CACGGGCAGC TCTGATGACC
1381 GGTCGGCACC CTCAGAGGGG GATGAGTGGG ACCGCATGAT CTCGGACATC AGCAGCGACA
1441 TTGAGGCGCT GGGCTGCAGC ATGGCCCAGG ACTCAGCATT ATGGACAGTG GATGGGGGGG
1501 CACCCACACC TTCCGCGCAG TTGTCATAGG CCTTCCCAAA AGGAGCTGCC CAGACCTGGG
1561 TGTCAGCCCT TGGTGGTGGC CAGGGAAAGG CGCCCGGTGC AAATGGCCCC GGGCGGCCCA
1621 GGTCCTTTAC TGTGAAGGAG CAGGGAGCTG CCGAGGGACA CGAGCCTCAG TGCGGGGTGG
1681 AAGGCTTTTT GCCTTGTCCA CCAGGGGTCA GCCAAGCCCT GCAGTGTGTC CCCGCTCGGG
1741 GAGGGCCCGG CCGAGCGGGC AGGGAAAACC AGTCCTGTGG GCTGGGCCCT TGGACGGCTG
1801 TCAGTTTTGC ACATGATGTT CCTATTGTAA CTCTCAGAGA CCTTAAAAAA AAGTTTACTG
1861 CAATGTTAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA
B: nucleotide sequence (SEQ ID NO:2) length: 460 amino acid
1 MEEEGKKGKK PGIVSPFKRV FLKGEKSRDK KAHEKVTERR PLHTVVLSLP ERVEPDRLLS
61 DYIEKEVKYL GQLTSIPGYL NPSSRTEILH FIDNAKRAHQ LPGHLTQEHD AVLSLSAYNV
121 KLAWRDGEDI ILRVPIHDIA AVSYYRDDAA HLVVLKTAQD PGISPSQSLC AESSRGLSAG
181 SLSESAVGPV EACCLVILAA ESKVAAEELC CLLGQVFQVV YTESTIDFLD RAIFDGASTP
241 THHLSLHSDD SSTKVDIKET YEVEASTFCF PESVDVGGAS PHSKTISESE LSASATELLQ
301 DYMLTLRTKL SSQEIQQFAA LLHEYRNGAS IHEFCINLRQ LYGDSRKFLL LGLRPFIPEK
361 DSQHFENFLE TIGVKDGRGI ITDSFGRHRR ALSTTSSSTT NGNRATGSSD DRSAPSEGDE
421 WDRMISDISS DIEALGCSMA QDSALWTVDG GAPTPSAQLS
C. Nucleotide and amino acid composite sequence (SEQ ID NO:3) clone number: PP10187
Start code: 147ATG stops coding: 1527TAG protein molecular weight: 50371.67
(annotate: what (1) provided is the position that initial sum stops first Nucleotide of coding, and (2) molecular weight unit is dalton)
1 GC GCG GCC GGG GCG GAG ACT TCG GGC CCG GCT GGC GGG CGG CGC CGG 47
48 GAG CGC GGG GGC GGC GGG CCC GGG TCG AGC ATG TAG CGG CTG CTG GCG 95
96 GCG GGG CTC CCG GGG CGG GCC GGG CGG GCC GCG GGA GCC GCA CGC GGC 143
144 GAT ATG GAA GAG GAG GGC AAG AAG GGC AAG AAG CCT GGA ATT GTC TCG 191
1 Met Glu Glu Glu Gly LysLys Gly Lys Lys Pro Gly Ile Val Ser 15
192 CCA TTT AAA CGA GTA TTC CTA AAA GGT GAA AAG AGT AGA GAT AAG AAA 239
16 Pro Phe Lys Arg Val Phe Leu Lys Gly Glu Lys Ser Arg Asp Lys Lys 31
240 GCC CAT GAG AAG GTG ACA GAG AGG CGC CCT CTG CAC ACT GTG GTG TTG 287
32 Ala His Glu Lys Val Thr Glu Arg Arg Pro Leu His Thr Val Val Leu 47
288 TCA TTG CCT GAG CGC GTC GAG CCA GAC AGA CTG CTG AGC GAC TAT ATT 335
48 Ser Leu Pro Glu Arg Val Glu Pro Asp Arg Leu Leu Ser Asp Tyr Ile 63
336 GAG AAG GAG GTA AAG TAT TTA GGT CAG TTA ACG TCC ATA CCA GGA TAC 383
64 Glu Lys Glu Val Lys Tyr Leu Gly Gln Leu Thr Ser Ile Pro Gly Tyr 79
384 CTG AAT CCC TCC AGT AGG ACT GAA ATC CTG CAT TTC ATA GAC AAT GCA 431
80 Leu Asn Pro Ser Ser Arg Thr Glu Ile Leu His Phe Ile Asp Asn Ala 95
432 AAG AGA GCC CAC CAG CTT CCG GGA CAC TTG ACT CAG GAG CAC GAT GCT 479
96 Lys Arg Ala His Gln Leu Pro Gly His Leu Thr Gln Glu His Asp Ala 111
480 GTG CTC AGC CTG TCT GCG TAC AAC GTC AAG CTG GCC TGG AGG GAC GGG 527
112 Val Leu Ser Leu Ser Ala Tyr Asn Val Lys Leu Ala Trp Arg Asp Gly 127
528 GAG GAT ATC ATC CTC AGG GTG CCC ATC CAT GAC ATC GCC GCC GTC TCC 575
128 Glu Asp Ile Ile Leu Arg Val Pro Ile His Asp Ile Ala Ala Val Ser 143
576 TAT GTT CGG GAT GAC GCT GCA CAC CTG GTG GTC CTG AAG ACA GCC CAG 623
144 Tyr Val Arg Asp Asp Ala Ala His Leu Val Val Leu Lys Thr Ala Gln 159
624 GAC CCA GGG ATC TCC CCC AGC CAG AGT CTG TGT GCG GAA AGT TCC AGA 671
160 Asp Pro Gly Ile Ser Pro Ser Gln Ser Leu Cys Ala Glu Ser Ser Arg 175
672 GGC CTC AGT GCA GGC TCC CTG TCG GAG AGT GCA GTT GGG CCC GTG GAG 719
176 Gly Leu Ser Ala Gly Ser Leu Ser Glu Ser Ala Val Gly Pro Val Glu 191
720 GCA TGC TGC CTG GTC ATC CTG GCT GCA GAG AGC AAG GTC GCT GCG GAG 767
192 Ala Cys Cys Leu Val Ile Leu Ala Ala Glu Ser Lys Val Ala Ala Glu 207
768 GAG CTT TGC TGT CTG CTA GGC CAG GTC TTC CAG GTT GTT TAC ACG GAG 815
208 Glu Leu Cys Cys Leu Leu Gly Gln Val Phe Gln Val Val Tyr Thr Glu 223
816 TCC ACC ATC GAC TTT CTG GAC AGA GCG ATA TTT GAT GGG GCC TCT ACC 863
224 Ser Thr Ile Asp Phe Leu Asp Arg Ala Ile Phe Asp Gly Ala Ser Thr 239
864 CCG ACC CAC CAC CTG TCC CTG CAC AGC GAT GAC TCT TCT ACA AAA GTG 911
240 Pro Thr His His Leu Ser Leu His Ser Asp Asp Ser Ser Thr Lys Val 255
912 GAC ATT AAG GAG ACC TAC GAG GTG GAA GCC AGC ACT TTC TGC TTC CCT 959
256 Asp Ile Lys Glu Thr Tyr Glu Val Glu Ala Ser Thr Phe Cys Phe Pro 271
960 GAA TCT GTG GAT GTG GGT GGT GCA TCA CCC CAC AGC AAG ACC ATC AGT 1007
272 Glu Ser Val Asp Val Gly Gly Ala Ser Pro His Ser Lys Thr Ile Ser 287
1008 GAG AGC GAG CTG AGC GCC AGC GCC ACT GAG CTG CTG CAG GAC TAC ATG 1055
288 Glu Ser Glu Leu Ser Ala Ser Ala Thr Glu Leu Leu Gln Asp Tyr Met 303
1056 CTG ACG CTG CGC ACC AAG CTG TCA TCA CAG GAG ATC CAG CAG TTT GCA 1103
304 Leu Thr Leu Arg Thr Lys Leu Ser Ser Gln Glu Ile Gln Gln Phe Ala 319
1104 GCA CTG CTG CAC GAG TAC CGC AAT GGG GCC TCT ATC CAC GAG TTC TGC 1151
320 Ala Leu Leu His Glu Tyr Arg Asn Gly Ala Ser Ile His Glu Phe Cys 335
1152 ATC AAC CTG CGG CAG CTC TAC GGG GAC AGC CGC AAG TTC CTG CTG CTT 1199
336 Ile Asn Leu Arg Gln Leu Tyr Gly Asp Ser Arg Lys Phe Leu Leu Leu 351
1200 GGT CTG AGG CCC TTC ATC CCT GAG AAG GAC AGC CAG CAC TTC GAG AAC 1247
352 Gly Leu Arg Pro Phe Ile Pro Glu Lys Asp Ser Gln His Phe Glu Asn 367
1248 TTC CTG GAG ACC ATT GGC GTG AAG GAT GGC CGC GGC ATC ATC ACT GAC 1295
368 Phe Leu Glu Thr Ile Gly Val Lys Asp Gly Arg Gly Ile Ile Thr Asp 383
1296 AGC TTT GGC AGG CAC CGG CGG GCC CTG AGC ACC ACA TCC AGT TCC ACC 1343
384 Ser Phe Gly Arg His Arg Arg Ala Leu Ser Thr Thr Ser Ser Ser Thr 399
1344 ACC AAT GGG AAC AGG GCC ACG GGC AGC TCT GAT GAC CGG TCG GCA CCC 1391
400 Thr Asn Gly Asn Arg Ala Thr Gly Ser Ser Asp Asp Arg Ser Ala Pro 415
1392 TCA GAG GGG GAT GAG TGG GAC CGC ATG ATC TCG GAC ATC AGC AGC GAC 1439
416 Ser Glu Gly Asp Glu Trp Asp Arg Met Ile Ser Asp Ile Ser Ser Asp 431
1440 ATT GAG GCG CTG GGC TGC AGC ATG GCC CAG GAC TCA GCA TTA TGG ACA 1487
432 Ile Glu Ala Leu Gly Cys Ser Met Ala Gln Asp Ser Ala Leu Trp Thr 447
1488 GTG GAT GGG GGG GCA CCC ACA CCT TCC GCG CAG TTG TCA TAG GCC TTC 1535
448 Val Asp Gly Gly Ala Pro Thr Pro Ser Ala Gln Leu Ser *** 461
1536 CCA AAA GGA GCT GCC CAG ACC TGG GTG TCA GCC CTT GGT GGT GGC CAG 1583
1584 GGA AAG GCG CCC GGT GCA AAT GGC CCC GGG CGG CCC AGG TCC TTT ACT 1631
1632 GTG AAG GAG CAG GGA GCT GCC GAG GGA CAC GAG CCT CAG TGC GGG GTG 1679
1680 GAA GGC TTT TTG CCT TGT CCA CCA GGG GTC AGC CAA GCC CTG CAG TGT 1727
1728 GTC CCC GCT CGG GGA GGG CCC GGC CGA GCG GGC AGG GAA AAC CAG TCC 1775
1776 TGT CGG CTG GGC CCT TGG ACG GCT GTC AGT TTT GCA CAT GAT GTT CCT 1823
1824 ATT GTA ACT CTC AGA GAC CTT AAA AAA AAG TTT ACT GCA ATG TTA AAA 1871
1872 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 1902
2.PPl2104
A: nucleotide sequence (SEQ ID NO:4) length: 884 bases
l GGCCGTGGAG GCTCCAGGTG TTCTCTCTGC CCCAGCAGAG CCCGGCAGGA GCCCCAACAG
61 GAAGCCAGCG CGGCATGGCT GCCACCGACT TCGTGCAGGA GATGCGCGCC GTGGGCGAGA
121 GGCTGCTGCT CAAGCTGCAG AGACTGCCCC AGGCTGAGCC CGTGGAGATC GTGGCCTTCT
181 CAGTCATCAT CCTTTTCACA GCTACTGTTC TGCTGTTGCT GCTGATAGCC TGCAGCTGCT
241 GCTGCACTCA CTGCTGCTGC CCTGAGCGGA GAGGCAGGAA GGTCCAGGTG CAGCCGACAC
301 CACCATGACG GACGGGCGAT GGCTGAGGAG AAGCTGGAGA GGAGATGGCC AATGCCATGA
361 CACAGGCCAT CAGCCTGGCC CTGCAGCCCT TACCCCTCAA GACCAGGCTC CCCTGGCCCC
421 AGCTCTGGCC CAGCCCAGGT ACCTGGACAC TGACAACTTG AGCCCTACCA AGGAAACAAG
481 GGCTGGTATA GGTGCAAACC TCTCATCTGC CAGTGGACAC TGGGTGCTGG GGAGTCAGCT
541 GTTTCAAAGA CTGGGTCAAC TGCCTGGGCT TCTTCGCCTA CCTGCACTTT TTAACAAAAC
601 AAGGAAGTAG GGGTCCCCAT ACCTTGATGG AGAACAGTCC CCACCTGTGG GCAATTGGCC
661 CTTGGGGCTC TGCTGATACA TGCCAAAGAG GAGCAAGGCA ATCAGAGGGG CTTTGTGCAA
721 TAGCTTCTGC ATCCGAGCTC CCGCCAGAGC GTGAGCATGT CAGTATTCTA GTCCAGTATT
781 TGCCAGTTTC CAAGTAAAAG CTTTTGTGTT ACGTGAAAAA AAAAAAAAAA AAAAAAAAAA
841 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA
B: nucleotide sequence (SEQ ID NO:5) length: 101 amino acid
1 MTDGRWLRRS WRGDGQCHDT GHQPGPAALT PQDQAPLAPA LAQPRYLDTD NLSPTKETRA
61 GIGANLSSAS GHWVLGSQLF QRLGQLPGLL RLPALFNKTR K
C. Nucleotide and amino acid composite sequence (SEQ ID NO:6) clone number: PP12104
Start code: 305ATG stops coding: 608TAG protein molecular weight: 11008.83
1 G GCC GTG GAG GCT CCA GGT GTT CTC TCT GCC CCA GCA GAG CCC GGC 46
47 AGG AGC CCC AAC AGG AAG CCA GCG CGG CAT GGC TGC CAC CGA CTT CGT 94
95 GCA GGA GAT GCG CGC CGT GGG CGA GAG GCT GCT GCT CAA GCT GCA GAG 142
143 ACT GCC CCA GGC TGA GCC CGT GGA GAT CGT GGC CTT CTC AGT CAT CAT 190
191 CCT TTT CAC AGC TAC TGT TCT GCT GTT GCT GCT GAT AGC CTG CAG CTG 238
239 CTG CTG CAC TCA CTG CTG CTG CCC TGA GCG GAG AGG CAG GAA GGT CCA 286
287 GGT GCA GCC GAC ACC ACC ATG ACG GAC GGG CGA TGG CTG AGG AGA AGC 334
1 Met Thr Asp Gly Arg Trp Leu Arg Arg Ser 10
335 TGG AGA GGA GAT GGC CAA TGC CAT GAC ACA GGC CAT CAG CCT GGC CCT 382
11 Trp Arg Gly Asp Gly Gln Cys His Asp Thr Gly His Gln Pro Gly Pro 26
383 GCA GCC CTT ACC CCT CAA GAC CAG GCT CCC CTG GCC CCA GCT CTG GCC 430
27 Ala Ala Leu Thr Pro Gln Asp Gln Ala Pro Leu Ala Pro Ala Leu Ala 42
431 CAG CCC AGG TAC CTG GAC ACT GAC AAC TTG AGC CCT ACC AAG GAA ACA 478
43 Gln Pro Arg Tyr Leu Asp Thr Asp Asn Leu Ser Pro Thr Lys Glu Thr 58
479 AGG GCT GGT ATA GGT GCA AAC CTC TCA TCT GCC AGT GGA CAC TGG GTG 526
59 Arg Ala Gly Ile Gly Ala Asn Leu Ser Ser Ala Ser Gly His Trp Val 74
527 CTG GGG AGT CAG CTG TTT CAA AGA CTG GGT CAA CTG CCT GGG CTT CTT 574
75 Leu Gly Ser Gln Leu Phe Gln Arg Leu Gly Gln Leu Pro Gly Leu Leu 90
575 CGC CTA CCT GCA CTT TTT AAC AAA ACA AGG AAG TAG GGG TCC CCA TAC 622
91 Arg Leu Pro Ala Leu Phe Asn Lys Thr Arg Lys *** 102
623 CTT GAT GGA GAA CAG TCC CCA CCT GTG GGC AAT TGG CCC TTG GGG CTC 670
671 TGC TGA TAC ATG CCA AAG AGG AGC AAG GCA ATC AGA GGG GCT TTG TGC 718
719 AAT AGC TTC TGC ATC CGA GCT CCC GCC AGA GCG TGA GCA TGT CAG TAT 766
767 TCT AGT CCA GTA TTT GCC AGT TTC CAA GTA AAA GCT TTT GTG TTA CGT 814
815 GAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 862
863 AAA AAA AAA AAA AAA AAA AAA A 884
3.PP12585
A: nucleotide sequence (SEQ ID NO:7) length: 2855 bases
1 GCCTTTAACT CCCAGGAATT CATGGGCCAG CTGTATCAGT GGGAGCCCTT CACTGAAGTC
61 CAGGGCTCCC AGCGCTGTGA ACTGAACTGC CGGCCCCGTG GCTTCCGCTT CTATGTCCGT
121 CACACTGAAA AGGTCCAGGA TGGGACCCTG TGTCAGCCTG GAGCCCCTGA CATCTGTGTG
181 GCTGGACGCT GTCTGAGCCC CGGCTGTGAT GGGATCCTTG GCTCTGGCAG GCGTCCTGAT
241 GGCTGTGGAG TCTGTGGGGG TGATGATTCT ACCTGTCGCC TTGTTTCGGG GAACCTCACT
301 GACCGAGGGG GCCCCCTGGG CTATCAGAAG ATCTTGTGGA TTCCAGCGGG AGCCTTGCGG
361 CTCCAGATTG CCCAGCTCCG GCCTAGCTCC AACTACCTGG CACTTCGTGG CCCTGGGGGC
421 CCGGTCCATC ATCAATGGGA ACTGGGCTGT GGATCCCCCT GGGTCCTACA GGGCCGGCGG
481 GACCGTCTTT CGATATAACC GTCCTCCCAG GGAGGAGGGC AAAGGGGAGA GTCTGTCGGC
541 TGAAGGCCCC ACCACCCAGC CTGTGGATGT CTATATGATC TTTCAGGAGG AAAACCCAGG
601 CGTTTTTTAT CAGTATGTCA TCTCTTCACC TCCTCCAATC CTTGAGAACC CCACCCCAGA
661 GCCCCCTGTC CCCCAGCTTC AGCCGGAGAT TCTGAGGGTG GAGCCCCCAC TTGCTCCGGC
721 ACCCCGCCCA GCCCGGACCC CAGGCACCCT CCAGCGTCAG GTGCGGATCC CCCAGATGCC
781 CGCCCCGCCC CATCCCAGGA CACCCCTGGG GTCTCCAGCT GCGTACTGGA AACGAGTGGG
841 ACACTCTGCA TGCTCAGCGT CCTGCGGGAA AGGTGTCTGG CGCCCCATTT TCCTCTGCAT
901 CTCCCGTGAG TCGGGAGAGG AACTGGATGA ACGCAGCTGT GCCGCGGGTG CCAGGCCCCC
961 AGCCTCCCCT GAACCCTGCC ACGGCACCCC ATGCCCCCCA TACTGGGAGG CTGGCGAGTG
1021 GACATCCTGC AGCCGCTCCT GTGGCCCCGG CACCCAGCAC CGCCAGCTGC AGTGCCGGCA
1081 GGAATTTGGG GGGGGTGGCT CCTCGGTGCC CCCGGAGCGC TGTGGACATC TCCCCCGGCC
1141 CAACATCACC CAGTCTTGCC AGCTGCGCCT CTGTGGCCAT TGGGAAGTTG GCTCTCCTTG
1201 GAGCCAGTGC TCCGTGCGGT GCGGCCGGGG CCAGAGAAGC CGGCAGGTTC GCTGTGTTGG
1261 GAACAACGGT GATGAAGTGA GCGAGCAGGA GTGTGCGTCA GGCCCCCCAC AGCCCCCCAG
1321 CAGAGAGGCC TGTGACATGG GGCCCTGTAC TACTGCCTGG TTCCACAGCG ACTGGAGCTC
1381 CAAGTGCTCA ACCGAGTGTG GGACGGGAAT CCAGCGGCGC TCTGTGGTCT GCCTTGGGAG
1441 TGGGGCAGCC CTCGGGCCAG GCCAGGGGGA AGCAGGAGCA GGAACTGGGC AGAGCTGTCC
1501 AACAGGGAAG CCGGCCCCCT GACATGCGCG CCTGCAGCCT GGGGCCCTGT GAGAGAACTT
1561 GGCGCTGGTA CACAGGGCCC TGGGGTGAGT GCTCCTCCGA ATGTGGCTCT GGCACACAGC
1621 GTAGAGACAT CATCTGTGTA TCCAAACTGG GGACGGAGTT CAACGTGACT TCTCCGAGCA
1681 ACTGTTCTCA CCTCCCCAGG CCCCCTGCCC TGCAGCCCTG TCAAGGGCAG GCCTGCCAGG
1741 ACCGATGGTT TTCCACGCCC TGGAGCCCAT GTTCTCGCTC CTGCCAAGGG GGAACGCAGA
1801 CACGGGAGGT CCAGTGCCTG AGCACCAACC AGACCCTCAG CACCCGATGC CCTCCTCAAC
1861 TGCGGCCCTC CAGGAAGCGC CCCTGTAACA GCCAACCCTG CAGCCAGCGC CCTGATGATC
1921 AATGCAAGGA CAGCTCTCCA CATTGCCCCC TGGTGGTACA GGCCCGGCTC TGCGTCTACC
1981 CCTACTACAC AGCCACCTGT TGCCGCTCTT GCGCACATGT CCTGGAGCGG TCTCCCCAGG
2041 ATCCCTCCTG AAAGGGGTCC GGGGCACCTT CACGGTTTTC TGTGCCACCA TCGGTCACCC
2101 ATTGATCGGC CCACTCTGAA CCCCCTGGCT CTCCAGCCTG TCCCAGTCTC AGCAGGGATG
2161 TCCTCCAGGT GACAGAGGGT GGCAAGGTGA CTGACACAAA GTGACTTTCA GGGCTGTGGT
2221 CAGGCCCATG TGGTGGTGTG ATGGGTGTGT GCACATATGC CTCAGGTGTG CTTTTGGGAC
2281 TGCATGGATA TGTGTGTGCT CAAACGTGTA TCACTTTTCA AAAAGAGGTT ACACAGACTG
2341 AGAAGGACAA GACCTGTTTC CTTGAGACTT TCCTAGGTGG AAAGGAAAGC AAGTCTGCAG
2401 TTCCTTGCTA ATCTGAGCTA CTTAGAGTGT GGTCTCCCCA CCAACTCCAG TTTTGTGCCC
2461 TAAGCCTCAT TTCTCATGTT CAGACCTCAC ATCTTCTAAG CCGCCCTGTG TCTCTGACCC
2521 CTTCTCATTT GCCTAGTATC TCTGCCCCTG CCTCCCTAAT TAGCTAGGGC TGGGGTCAGC
2581 CACTGCCAAT CCTGCCTTAC TCAGGAAGGC AGGAGGAAAG AGACTGCCTC TCCAGAGCAA
2641 GGCCCAGCTG GGCAGAGGGT GAAAAAGAGA AATGTGAGCA TTCGTTCCCC CACCACCCCG
2701 CCCAGCCCCT AGCCCCACTC CCTGCCTCCT GAAATGGTTC CCACCCAGAA CTAATTTATT
2761 TTTTATTAAA GATGGTCATG ACAAATGGGA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
2821 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA
B: nucleotide sequence (SEQ ID NO:8) length: 437 amino acid
1 MGSLALAGVL MAVESVGVMI LPVALFRGTS LTEGAPWAIR RSCGFQREPC GSRLPSSGLA
61 PTTWHFVALG ARSIINGNWA VDPPGSYRAG GTVFRYNRPP REEGKGESLS AEGPTTQPVD
121 VYMIFQEENP GVFYQYVISS PPPILENPTP EPPVPQLQPE ILRVEPPLAP APRPARTPGT
181 LQRQVRIPQM PAPPHPRTPL GSPAAYWKRV GHSACSASCG KGVWRPIFLC ISRESGEELD
241 ERSCAAGARP PASPEPCHGT PCPPYWEAGE WTSCSRSCGP GTQHRQLQCR QEFGGGGSSV
301 PPERCGHLPR PNITQSCQLR LCGHWEVGSP WSQCSVRCGR GQRSRQVRCV GNNGDEVSEQ
361 ECASGPPQPP SREACDMGPC TTAWFHSDWS SKCSTECGTG IQRRSVVCLG SGAALGPGQG
421 EAGAGTGQSC PTGKPAP
C. Nucleotide and amino acid composite sequence (SEQ ID NO:9) clone number: PP12585
Start code: 209ATG stops coding: 1520TGA protein molecular weight: 46499.12
1 G CCT TTA ACT CCC AGG AAT TCA TGG GCC AGC TGT ATC AGT GGG AGC 46
47 CCT TCA CTG AAG TCC AGG GCT CCC AGC GCT GTG AAC TGA ACT GCC GGC 94
95 CCC GTG GCT TCC GCT TCT ATG TCC GTC ACA CTG AAA AGG TCC AGG ATG 142
143 GGA CCC TGT GTC AGC CTG GAG CCC CTG ACA TCT GTG TGG CTG GAC GCT 190
191 GTC TGA GCC CCG GCT GTG ATG GGA TCC TTG GCT CTG GCA GGC GTC CTG 238
1 Met Gly Ser Leu Ala Leu Ala Gly Val Leu 10
239 ATG GCT GTG GAG TCT GTG GGG GTG ATG ATT CTA CCT GTC GCC TTG TTT 286
11 Met Ala Val Glu Ser Val Gly Val Met Ile Leu Pro Val Ala Leu Phe 26
287 CGG GGA ACC TCA CTG ACC GAG GGG GCC CCC TGG GCT ATC AGA AGA TCT 334
27 Arg Gly Thr Ser Leu Thr Glu Gly Ala Pro Trp Ala Ile Arg Arg Ser 42
335 TGT GGA TTC CAG CGG GAG CCT TGC GGC TCC AGA TTG CCC AGC TCC GGC 382
43 Cys Gly Phe Gln Arg Glu Pro Cys Gly Ser Arg Leu Pro Ser Ser Gly 58
383 CTA GCT CCA ACT ACC TGG CAC TTC GTG GCC CTG GGG GCC CGG TCC ATC 430
59 Leu Ala Pro Thr Thr Trp His Phe Val Ala Leu Gly Ala Arg Ser Ile 74
431 ATC AAT GGG AAC TGG GCT GTG GAT CCC CCT GGG TCC TAC AGG GCC GGC 478
75 Ile Asn Gly Asn Trp Ala Val Asp Pro Pro Gly Ser Tyr Arg Ala Gly 90
479 GGG ACC GTC TTT CGA TAT AAC CGT CCT CCC AGG GAG GAG GGC AAA GGG 526
91 Gly Thr Val Phe Arg Tyr Asn Arg Pro Pro Arg Glu Glu Gly Lys Gly 106
527 GAG AGT CTG TCG GCT GAA GGC CCC ACC ACC CAG CCT GTG GAT GTC TAT 574
107 Glu Ser Leu Ser Ala Glu Gly Pro Thr Thr Gln Pro Val Asp Val Tyr 122
575 ATG ATC TTT CAG GAG GAA AAC CCA GGC GTT TTT TAT CAG TAT GTC ATC 622
123 Met Ile Phe Gln Glu Glu Asn Pro Gly Val Phe Tyr Gln Tyr Val Ile 138
623 TCT TCA CCT CCT CCA ATC CTT GAG AAC CCC ACC CCA GAG CCC CCT GTC 670
139 Ser Ser Pro Pro Pro Ile Leu Glu Asn Pro Thr Pro Glu Pro Pro Val 154
671 CCC CAG CTT CAG CCG GAG ATT CTG AGG GTG GAG CCC CCA CTT GCT CCG 718
155 Pro Gln Leu Gln Pro Glu Ile Leu Arg Val Glu Pro Pro Leu Ala Pro 170
719 GCA CCC CGC CCA GCC CGG ACC CCA GGC ACC CTC CAG CGT CAG GTG CGG 766
171 Ala Pro Arg Pro Ala Arg Thr Pro Gly Thr Leu Gln Arg Gln Val Arg 186
767 ATC CCC CAG ATG CCC GCC CCG CCC CAT CCC AGG ACA CCC CTG GGG TCT 814
187 Ile Pro Gln Met Pro Ala Pro Pro His Pro Arg Thr Pro Leu Gly Ser 202
815 CCA GCT GCG TAC TGG AAA CGA GTG GGA CAC TCT GCA TGC TCA GCG TCC 862
203 Pro Ala Ala Tyr Trp Lys Arg Val Gly His Ser Ala Cys Ser Ala Ser 218
863 TGC GGG AAA GGT GTC TGG CGC CCC ATT TTC CTC TGC ATC TCC CGT GAG 910
219 Cys Gly Lys Gly Val Trp Arg Pro Ile Phe Leu Cys Ile Ser Arg Glu 234
911 TCG GGA GAG GAA CTG GAT GAA CGC AGC TGT GCC GCG GGT GCC AGG CCC 958
235 Ser Gly Glu Glu Leu Asp Glu Arg Ser Cys Ala Ala Gly Ala Arg Pro 250
959 CCA GCC TCC CCT GAA CCC TGC CAC GGC ACC CCA TGC CCC CCA TAC TGG 1006
251 Pro Ala Ser Pro Glu Pro Cys His Gly Thr Pro Cys Pro Pro Tyr Trp 266
1007 GAG GCT GGC GAG TGG ACA TCC TGC AGC CGC TCC TGT GGC CCC GGC ACC 1054
267 Glu Ala Gly Glu Trp Thr Ser Cys Ser Arg Ser Cys Gly Pro Gly Thr 282
1055 CAG CAC CGC CAG CTG CAG TGC CGG CAG GAA TTT GGG GGG GGT GGC TCC 1102
283 Gln His Arg Gln Leu Gln Cys Arg Gln Glu Phe Gly Gly Gly Gly Ser 298
1103 TCG GTG CCC CCG GAG CGC TGT GGA CAT CTC CCC CGG CCC AAC ATC ACC 1150
299 Ser Val Pro Pro Glu Arg Cys Gly His Leu Pro Arg Pro Asn Ile Thr 314
1151 CAG TCT TGC CAG CTG CGC CTC TGT GGC CAT TGG GAA GTT GGC TCT CCT 1198
315 Gln Ser Cys Gln Leu Arg Leu Cys Gly His Trp Glu Val Gly Ser Pro 330
1199 TGG AGC CAG TGC TCC GTG CGG TGC GGC CGG GGC CAG AGA AGC CGG CAG 1246
331 Trp Ser Gln Cys Ser Val Arg Cys Gly Arg Gly Gln Arg Ser Arg Gln 346
1247 GTT CGC TGT GTT GGG AAC AAC GGT GAT GAA GTG AGC GAG CAG GAG TGT 1294
347 Val Arg Cys Val Gly Asn Asn Gly Asp Glu Val Ser Glu Gln Glu Cys 362
1295 GCG TCA GGC CCC CCA CAG CCC CCC AGC AGA GAG GCC TGT GAC ATG GGG 1342
363 Ala Ser Gly Pro Pro Gln Pro Pro Ser Arg Glu Ala Cys Asp Met Gly 378
1343 CCC TGT ACT ACT GCC TGG TTC CAC AGC GAC TGG AGC TCC AAG TGC TCA 1390
379 Pro Cys Thr Thr Ala Trp Phe His Ser Asp Trp Ser Ser Lys Cys Ser 394
1391 ACC GAG TGT GGG ACG GGA ATC CAG CGG CGC TCT GTG GTC TGC CTT GGG 1438
395 Thr Glu Cys Gly Thr Gly Ile Gln Arg Arg Ser Val Val Cys Leu Gly 410
1439 AGT GGG GCA GCC CTC GGG CCA GGC CAG GGG GAA GCA GGA GCA GGA ACT 1486
411 Ser Gly Ala Ala Leu Gly Pro Gly Gln Gly Glu Ala Gly Ala Gly Thr 426
1487 GGG CAG AGC TGT CCA ACA GGG AAG CCG GCC CCC TGA CAT GCG CGC CTG 1534
427 Gly Gln Ser Cys Pro Thr Gly Lys Pro Ala Pro *** 438
1535 CAG CCT GGG GCC CTG TGA GAG AAC TTG GCG CTG GTA CAC AGG GCC CTG 1582
1583 GGG TGA GTG CTC CTC CGA ATG TGG CTC TGG CAC ACA GCG TAG AGA CAT 1630
1631 CAT CTG TGT ATC CAA ACT GGG GAC GGA GTT CAA CGT GAC TTC TCC GAG 1678
1679 CAA CTG TTC TCA CCT CCC CAG GCC CCC TGC CCT GCA GCC CTG TCA AGG 1726
1727 GCA GGC CTG CCA GGA CCG ATG GTT TTC CAC GCC CTG GAG CCC ATG TTC 1774
1775 TCG CTC CTG CCA AGG GGG AAC GCA GAC ACG GGA GGT CCA GTG CCT GAG 1822
1823 CAC CAA CCA GAC CCT CAG CAC CCG ATG CCC TCC TCA ACT GCG GCC CTC 1870
1871 CAG GAA GCG CCC CTG TAA CAG CCA ACC CTG CAG CCA GCG CCC TGA TGA 1918
1919 TCA ATG CAA GGA CAG CTC TCC ACA TTG CCC CCT GGT GGT ACA GGC CCG 1966
1967 GCT CTG CGT CTA CCC CTA CTA CAC AGC CAC CTG TTG CCG CTC TTG CGC 2014
2015 ACA TGT CCT GGA GCG GTC TCC CCA GGA TCC CTC CTG AAA GGG GTC CGG 2062
2063 GGC ACC TTC ACG GTT TTC TGT GCC ACC ATC GGT CAC CCA TTG ATC GGC 2110
2111 CCA CTC TGA ACC CCC TGG CTC TCC AGC CTG TCC CAG TCT CAG CAG GGA 2158
2159 TGT CCT CCA GGT GAC AGA GGG TGG CAA GGT GAC TGA CAC AAA GTG ACT 2206
2207 TTC AGG GCT GTG GTC AGG CCC ATG TGG TGG TGT GAT GGG TGT GTG CAC 2254
2255 ATA TGC CTC AGG TGT GCT TTT GGG ACT GCA TGG ATA TGT GTG TGC TCA 2302
2303 AAC GTG TAT CAC TTT TCA AAA AGA GGT TAC ACA GAC TGA GAA GGA CAA 2350
2351 GAC CTG TTT CCT TGA GAC TTT CCT AGG TGG AAA GGA AAG CAA GTC TGC 2398
2399 AGT TCC TTG CTA ATC TGA GCT ACT TAG AGT GTG GTC TCC CCA CCA ACT 2446
2447 CCA GTT TTG TGC CCT AAG CCT CAT TTC TCA TGT TCA GAC CTC ACA TCT 2494
2495 TCT AAG CCG CCC TGT GTC TCT GAC CCC TTC TCA TTT GCC TAG TAT CTC 2542
2543 TGC CCC TGC CTC CCT AAT TAG CTA GGG CTG GGG TCA GCC ACT GCC AAT 2590
2591 CCT GCC TTA CTC AGG AAG GCA GGA GGA AAG AGA CTG CCT CTC CAG AGC 2638
2639 AAG GCC CAG CTG GGC AGA GGG TGA AAA AGA GAA ATG TGA GCA TTC GTT 2686
2687 CCC CCA CCA CCC CGC CCA GCC CCT AGC CCC ACT CCC TGC CTC CTG AAA 2734
2735 TGG TTC CCA CCC AGA ACT AAT TTA TTT TTT ATT AAA GAT GGT CAT GAC 2782
2783 AAA TGG GAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 2830
2831 AAA AAA AAA AAA AAA AAA AAA AAA A 2855
4.PP14397
A: nucleotide sequence (SEQ ID NO:10) length: 1936 bases
1 GCTCATGGCG CCGGCGTCGC GGTTGCTCGC GCTCTGGGCG CTGGCGGCTG TGGCTCTACC
61 CGGCTCCGGG GCGGAGGGCG ACGGCGGGTG GCGCCCGGGC GGGCCGGGGG CCGTGGCGGA
121 GGAGGAGCGC TGCACGGTGG AGCGTCGGGC CGACCTCACC TACGCGGAGT TCGTGCAGCA
181 GTACGCCTTC GTCAGGCCCG TCATCCTGCA GGGACTCACG GACAACTCGA GGTTCCGGGC
241 CCTGTGCTCC CGCGACAGGT TGCTGGCTTC GTTTGGGGAC AGAGTGGTCC GGCTGAGCAC
301 CGCCAACACC TACTCCTACC ACAAAGTGGA CTTGCCCTTC CAGGAGTATG TGGAGCAGCT
361 GCTGCACCCC CAGGACCCCA CCTCCCTGGG CAATGACACC CTGTACTTCT TCGGGGACAA
421 CAACTTCACC GAGTGGGCCT CTCTCTTTCG GCACTACTCC CCACCCCCAT TTGGCCTGCT
481 GGGAACCGCT CCAGCTTACA GCTTTGGAAT CGCAGGAGCT GGCTCGGGGG TGCCCTTCCA
541 CTGGCATGGA CCCGGGTACT CAGAAGTGAT CTACGGTCGT AAGCGCTGGT TCCTTTACCC
601 ACCTGAGAAG ACGCCAGAGT TCCACCCCAA CAAGACCACG CTGGCCTGGC TCCGGGACAC
661 ATACCCAGCC CTGCCACCGT CTGCACGGCC CCTGGAGTGT ACCATCCGGG CTGGTGAGGT
721 GCTGTACTTC CCCGACCGCT GGTGGCATGC TACGCTCAAC CTTGACACCA GCGTCTTCAT
781 CTCCACCTTC CTCGGCTAGC CAAAACAGCT GGCAGGACTG CCGGTCACAC ACCAGCACGT
841 CCCACCTCGT GCTCACGGAT TTTATTACAC AGATAGTGGC GGCAATGGCC TCAGCCCAGC
901 CCACCCTCAC CTGCTTTTCC AGCCCACAAA GGGGGACGAT CACGGCCCAG CAAAAGCGAT
961 GCTGAGAGGG GAAACAGTCC AGAGTCCAAC AGCAGAACTT GGGGGAAGCG GTCGGGGTGG
1021 CCAGGAACAT AAACTATGTA TAGGGGCCGG GGGCTTCTGC CCAGGGCTCC CCTGGACCAG
1081 GACGCCAGGT AGGGCAGGGA ACCTCAGTAG TCCTCCACCC AGCCATTCTC AGAGATGAAT
1141 GCGTCAATAA CCTCCTTCAT AGCCAAGTTG GGGATGAGCT GTTCCTGGGT CAGGGGGCTC
1201 CGGGTCACGG GGTCAAAATG ACCCACACGC TGCAGTGACA AGAAGGGCAG AGGGCAGTCA
1261 TGGGGCCCAG GACCATGCCA CTGGCCCTGC TCCCCCAGCC GCAGCCTCAC CTGCAGGTGC
1321 TCCTCGATGT CCTTGCGGTC GTAGGTGATG CCACTGGGCG TGATGCACGG CTCCCGCATC
1381 AGCTCAAAGC TGATCTTGCC ACACAGGTAG TCGGGGATGT CTCGCTTCTG TGGCACAGGG
1441 GCACACGGTC AGAGGCTGAA AAGGGGCACT GCACGAGCAC CTGCCAGCCA TCGGCAGCAA
1501 GCGACACACA CTCACCTTCC TCTTCTCATC CACCTGAGAA AAAAGCTCGT CCATGTCCGC
1561 CATGTACTTG TCCTGTGAAG AGTTGAGTGC TGTGCTTGGG GGAGACACCC CACCTCCCTC
1621 CTCCATGGGG CACAGACCCA ACACAAGGCG GGGATGCTCC CACGCCACGT GCACACACAC
1681 AGACCCACAT GTGGGTGGGG GGCACCCTCA CGTGCTTGGC CTCAATGCAG GCCTGCTGGG
1741 CCCGGACGTG GCTGTCGTCC TCATCACCCT CGTGGTTTCG CTGGCACTCT TCCAGCTCCC
1801 TGGGGGTTGA CCAGGAGCCG GTCAGAGATG GACCTGGCCA GATGTCTGAC CACACCCCAA
1861 TCTCAGAGCT AACATCCACA CTTCCCCACA TTTCCTGCTT GCCAGTAAAG CCTTCGATAA
1921 AAAAAAAAAA AAAAAA
B: nucleotide sequence (SEQ ID NO:11) length: 264 amino acid
1 MAPASRLLAL WALAAVALPG SGAEGDGGWR PGGPGAVAEE ERCTVERRAD LTYAEFVQQY
61 AFVRPVILQG LTDNSRFRAL CSRDRLLASF GDRVVRLSTA NTYSYHKVDL PFQEYVEQLL
121 HPQDPTSLGN DTLYFFGDNN FTEWASLFRH YSPPPFGLLG TAPAYSFGIA GAGSGVPFHW
181 HGPGYSEVIY GRKRWFLYPP EKTPEFHPNK TTLAWLRDTY PALPPSARPL ECTIRAGEVL
241 YFPDRWWHAT LNLDTSVFIS TFLG
C. Nucleotide and amino acid composite sequence (SEQ ID NO:12) clone number: PP14397
Start code: 5ATG stops coding: 797TAG protein molecular weight: 29507.63
1 G CTC ATG GCG CCG GCG TCG CGG TTG GTC GCG CTC TGG GCG CTG GCG 46
1 Met Ala Pro Ala Ser Arg Leu Leu Ala Leu Trp Ala Leu Ala 14
47 GCT GTG GCT CTA CCC GGC TCC GGG GCG GAG GGC GAC GGC GGG TGG CGC 94
15 Ala Val Ala Leu Pro Gly Ser Gly Ala Glu Gly Asp Gly Gly Trp Arg 30
95 CCG GGC GGG CCG GGG GCC GTG GCG GAG GAG GAG CGC TGC ACG GTG GAG 142
31 Pro Gly Gly Pro Gly Ala Val Ala Glu Glu Glu Arg Cys Thr Val Glu 46
143 CGT CGG GCC GAC CTC ACC TAC GCG GAG TTC GTG CAG CAG TAC GCC TTC 190
47 Arg Arg Ala Asp Leu Thr Tyr Ala Glu Phe Val Gln Gln Tyr Ala Phe 62
191 GTC AGC CCC GTC ATC CTG CAG GGA CTC ACG GAC AAC TCG AGG TTC CGG 238
63 Val Arg Pro Val Ile Leu Gln Gly Leu Thr Asp Asn Ser Arg Phe Arg 78
239 GCC CTG TGC TCC CGC GAC AGG TTG CTG GCT TCG TTT GGG GAC AGA GTG 286
79 Ala Leu Cys Ser Arg Asp Arg Leu Leu Ala Ser Phe Gly Asp Arg Val 94
287 GTC CGG CTG AGC ACC GCC AAC ACC TAC TCC TAC CAC AAA GTG GAC TTG 334
95 Val Arg Leu Ser Thr Ala Asn Thr Tyr Ser Tyr His Lys Val Asp Leu 110
335 CCC TTC CAG GAG TAT GTG GAG CAG CTG CTG CAC CCC CAG GAC CCC ACC 382
111 Pro Phe Gln Glu Tyr Val Glu Gln Leu Leu His Pro Gln Asp Pro Thr 126
383 TCC CTG GGC AAT GAC ACC CTG TAC TTC TTC GGG GAC AAC AAC TTC ACC 430
127 Ser Leu Gly Asn Asp Thr Leu Tyr Phe Phe Gly Asp Asn Asn Phe Thr 142
431 GAG TGG GCC TCT CTC TTT CGG CAC TAC TCC CCA CCC CCA TTT GGC CTG 478
143 Glu Trp Ala Ser Leu Phe Arg His Tyr Ser Pro Pro Pro Phe Gly Leu 158
479 CTG GGA ACC GCT CCA GCT TAC AGC TTT GGA ATC GCA GGA GCT GGC TCG 526
159 Leu Gly Thr Ala Pro Ala Tyr Ser Phe Gly Ile Ala Gly Ala Gly Ser 174
527 GGG GTG CCC TTC CAC TGG CAT GGA CCC GGG TAC TCA GAA GTG ATC TAC 574
175 Gly Val Pro Phe His Trp His Gly Pro Gly Tyr Ser Glu Val Ile Tyr 190
575 GGT CGT AAG CGC TGG TTC CTT TAC CCA CCT GAG AAG ACG CCA GAG TTC 622
191 Gly Arg Lys Arg Trp Phe Leu Tyr Pro Pro Glu Lys Thr Pro Glu Phe 206
623 CAC CCC AAC AAG ACC ACG CTG GCC TGG CTC CGG GAC ACA TAC CCA GCC 670
207 His Pro Asn Lys Thr Thr Leu Ala Trp Leu Arg Asp Thr Tyr Pro Ala 222
671 CTG CCA CCG TCT GCA CGG CCC CTG GAG TGT ACC ATC CGG GCT GGT GAG 718
223 Leu Pro Pro Ser Ala Arg Pro Leu Glu Cys Thr Ile Arg Ala Gly Glu 238
719 GTG CTG TAC TTC CCC GAC CGC TGG TGG CAT GCT ACG CTC AAC CTT GAC 766
239 Val Leu Tyr Phe Pro Asp Arg Trp Trp His Ala Thr Leu Asn Leu Asp 254
767 ACC AGC GTC TTC ATC TCC ACC TTC CTC GGC TAG CCA AAA CAG CTG GCA 814
255 Thr Ser Val Phe Ile Ser Thr Phe Leu Gly *** 265
815 GGA CTG CCG GTC ACA CAC CAG CAC GTC CCA CCT CGT GCT CAC GGA TTT 862
863 TAT TAC ACA GAT AGT GGC GGC AAT GGC CTC AGC CCA GCC CAC CCT CAC 910
911 CTG CTT TTC CAG CCC ACA AAG GGG GAC GAT CAC GGC CCA GCA AAA GCG 958
959 ATG CTG AGA GGG GAA ACA GTC CAG AGT CCA ACA GCA GAA CTT GGG GGA 1006
1007 AGC GGT CGG GGT GGC CAG GAA CAT AAA CTA TGT ATA GGG GCC GGG GGC 1054
1055 TTC TGC CCA GGG CTC CCC TGG ACC AGG ACG CCA GGT AGG GCA GGG AAC 1102
1103 CTC AGT AGT CCT CCA CCC AGC CAT TCT CAG AGA TGA ATG CGT CAA TAA 1150
1151 CCT CCT TCA TAG CCA AGT TGG GGA TGA GCT GTT CCT GGG TCA GGG GGC 1198
1199 TCC GGG TCA CGG GGT CAA AAT GAC CCA CAC GCT GCA GTG ACA AGA AGG 1246
1247 GCA GAG GGC AGT CAT GGG GCC CAG GAC CAT GCC ACT GGC CCT GCT CCC 1294
1295 CCA GCC GCA GCC TCA CCT GCA GGT GCT CCT CGA TGT CCT TGC GGT CGT 1342
1343 AGG TGA TGC CAC TGG GCG TGA TGC ACG GCT CCC GCA TCA GCT CAA AGC 1390
1391 TGA TCT TGC CAC ACA GGT AGT CGG GGA TGT CTC GCT TCT GTG GCA CAG 1438
1439 GGG CAC ACG GTC AGA GGC TGA AAA GGG GCA CTG CAC GAG CAC CTG CCA 1486
1487 GCC ATC GGC AGC AAG CGA CAC ACA CTC ACC TTC CTC TTC TCA TCC ACC 1534
1535 TGA GAA AAA AGC TCG TCC ATG TCC GCC ATG TAC TTG TCC TGT GAA GAG 1582
1583 TTG AGT GCT GTG CTT GGG GGA GAC ACC CCA CCT CCC TCC TCC ATG GGG 1630
1631 CAC AGA CCC AAC ACA AGG CGG GGA TGC TCC CAC GCC ACG TGC ACA CAC 1678
1679 ACA GAC CCA CAT GTG GGT GGG GGG CAC CCT CAC GTG CTT GGC CTC AAT 1726
1727 GCA GGC CTG CTG GGC CCG GAC GTG GCT GTC GTC CTC ATC ACC CTC GTG 1774
1775 GTT TCG CTG GCA CTC TTC CAG CTC CCT GGG GGT TGA CCA GGA GCC GGT 1822
1823 CAG AGA TGG ACC TGG CCA GAT GTC TGA CCA CAC CCC AAT CTC AGA GCT 1870
1871 AAC ATC CAC ACT TCC CCA CAT TTC CTG CTT GCC AGT AAA GCC TTC GAT 1918
1919 AAA AAA AAA AAA AAA AAA 1936
5.PP14434
A: nucleotide sequence (SEQ ID NO:13) length: 4130 bases
1 GCTGAGTCGT CTTGGTCCCA GGAGCCAGTA GTGAAGGCAA CAGTCTGCCC ACCTGTGGAC
61 ACCAGATCCT GGGAGCTCCT GGTTAGCAAG TGAGATCTCT GGGATGTCAG TGAGGCTGGT
121 TGAAGACCAG AGGTAAACTG CAGAGGTCAC CACCCCCACC ATGTCCCAGG TGATGTCCAG
181 CCCACTGCTG GCAGGAGGCC ATGCTGTCAG CTTGGCGCCT TGTGATGAGC CCAGGAGGAC
241 CCTGCACCCA GCACCCAGCC CCAGCCTGCC ACCCCAGTGT TCTTACTACA CCACGGAAGG
301 CTGGGGAGCC CAGGCCCTGA TGGCCCCCGT GCCCTGCATG GGGCCCCCTG GCCGACTCCA
361 GCAAGCCCCA CAGGTGGAGG CCAAAACCAC CTGCTTCCTG CCGTCCCCTG GTGAGAAAGG
421 CCTTGGGGAC CCCAGAGGAC CTTGACTCCT ACATTGACTT CTCACTGGAG AGCCTCAATC
481 AGATGATCCT GGAACTGGAC CCCACCTTCC AGCTGCTTCC CCCAGGGACT GGGGGCTCCC
541 AGGCTGAGCT GGCCCAGAGC ACCATGTCAA TGAGAAAGAA GGAGGAATCT GAAGCCTTGG
601 ACATAAAGTA CATCGAGGTG ACCTCCGCCA GATCAAGGTG CCACGATGGC CCCCAGCACT
661 GCTCCAGCCC CTCTGTCACC CCGCCCTTCG GCTCCCTTCG CAGTGGTGGC CTCCTCCTTT
721 CCAGAGACGT CCCCCGAGAG ACACGAAGCA GCAGTGAGAG CCTCATCTTC TCTGGGAACC
781 AGGGCAGGGG GCACCAGCGC CCTCTGCCCC CCTCAGAGGG TCTCTCCCCT CGACCCCCAA
841 ATTCCCCCAG CATCTCAATC CCTTGCATGG GGAGCAAGGC CTCGAGCCCC CATGGTTTGG
901 GCTCCCCGCT GGTGGCTTCT CCAAGACTGG AGAAGCGGCT GGGAGGCCTG GCCCCACAGC
961 GGGGCAGCAG GATCTCTGTG CTGTCAGCCA GCCCAGTGTC TGATGTCAGC TATATGTTTG
1021 GAAGCAGCCA GTCCCTTCTG CACTCAGCAA CTCCAGCCAT CAGTCATCTT CCAGATCCTT
1081 GGAAAGTCCA GCCAACTCTT CCTCCAGCCT CCACAGCCTT GGCTCAGTGT CCCTGTGTAC
1141 AAGACCCAGT GACTTCCAGG CTCCCAGAAA CCCCACCCTA ACCATGGGCC AACCCAGAAC
1201 ACCCCACTCT CCACCACTGG CCAAAGAACA TGCCAGCAGC TGCCCCCCAT CCATCACCAA
1261 CTCCATGGTG GACATACCCA TTGTGCTGAT CAACGGCTGG CCAGAACCAG GGTCTTCTCC
1321 ACCCCAGCGG ACCCCAGGAC ACCAGAACTC CGTTCAACCT GGAGCTGCTT CTCCCAGCAA
1381 CCCCTGTCCA GCCACCAGGA GCAACAGCCA GACCCTGTCA GATGCCCCCT TTACCACATG
1441 CCCAGAGGGT CCCGCCAGGG ACATGCAGCC CACCATGAAG TTCGTGATGG ACACATCTAA
1501 ATACTGGTTT AAGCCAAACA TCACCCGAGA GCAAGCAATC GAGCTGCTGA GGAAGGAGGA
1561 GCCAGGGGCT TTTTGTCATA AGGGACAGCT CTTCATACCG AGGCTCCTTC GGCCTGGCCC
1621 TGAAGGTGCA GGAGGTTCCC GCGTCTGCTC AGAGTCGACC AGGTGAGGAC AGCAATGACC
1681 TCATCCGACA CTTCCTCATC GAGTCGTCTG CCAAAGGAGT GCATCTCAAA GGAGCAGATG
1741 AGGAGCCCTA CTTTGGGAGC CTCTCTGCCT TCGTGTGCCA GCATTCCATC ATGGCCCTGG
1801 CCCTGCCCTG CAAACTCACC ATCCCACAGA GAGAACTGGG AGGTGCAGAT GGGGCCTCGG
1861 ACTCTACAGA CAGCCCAGCC TCCTGCCAGA AGAAATCTGC GGGCTGCCAC ACCCTGTACC
1921 TGAGCTCAGT GAGCGTGGAG ACCCTGACTG GAGCCCTGGC CGTCAGAAAG CCATCTCCAC
1981 CACCTTTGAG AGGGACATCC TCCCCACGCC CACCGTGGTC CACTTCAAAG TCACAGAGCA
2041 GGGCATCACT CTGACTGATG TCCAGAGGAA GGTGTTTTTC CGGCGCCATT ACCCACTCAC
2101 CACCCTCCGC TTCTGTGGTA TGGACCCTGA GCAACGGAAG TGGCAGAAGT ACTGCAAACC
2161 CTCCTGGATC TTTGGGTTTG TGGCCAAGAG CCAGACAGAG CCTCAGGAGA ACGTATGCCA
2221 CCTCTTTGCG GAGTATGACA TGGTCCAGCC AGCCTCGCAG GTCATCGGCC TGGTGACTGC
2281 TCTGCTGCAG GACGCAGAAA GGATGTAGGG GAGAGACTGC CTGTGCACCT AACCAACACC
2341 TCCAGGGGCT CGCTAAGGAG CCCCCCTCCA CCCCCTGAAT GGGTGTGGCT TGTGGCATAT
2401 TGACAGACCA ATCTATGGGA CTAGGGGGAT TGGCATCAAG TTGACACCCT TGAACCTGCT
2461 ATGGCCTTCA GCAGTCACCA TCATCCAGAC CCCCCGGGCC TCAGTTTCCT CAATCATAGA
2521 AGAAGACCAA TAGACAAGAT CAGCTGTTCT TAGATGCTGG TGGGCATTTG AACATGCTCC
2581 TCCATGATTC TGAAGCATGC ACACCTCTGA AGACCCCTGC ATGAAAATAA CCTCCAAGGA
2641 CCCTCTGACC CCATCGACCT GGGCCCTGCC ACACAACAGT CTGAGCAAGA GACCTGCAGC
2701 CCCTGTTTCG TGGCAGACAG CAGGTGCCTG GCGGTGACCC ACGGGGCTCC TGGCTTGCAG
2761 CTGGTGATGG TCAAGAACTG ACTACAAAAC AGGAATGGAT AGACTCTATT TCCTTCCATA
2821 TCTGTTCCTC TGTTCCTTTT CCCACTTTCT GGGTGGCTTT TTGGGTCCAC CCAGCCAGGA
2881 TGCTGCAGGC CAAGCTGGGT GTGGTATTTA GGGCAGCTCA GCAGGGGGAA CTTGTCCCCA
2941 TGGTCAGAGG AGACCCAGCT GTCCTGCACC CCCTTGCAGA TGAGTATCAC CCCATCTTTT
3001 CTTTCCACTT GGTTTTTATT TTTATTTTTT TTTGAGACAG AGTCTCACTG TCACCCAGGC
3061 TGAACTGCAG TGGTGTGATC TAGGCTCACT GCAACCTCCA CCTCCCAGGT TCAAGCAATT
3121 ATCCTGCCTC AGGCTCCCAA GTAGCTGGGA TTACAGGCAT GTGCAACTCA CCCAGCTAAT
3181 TTTGTATTTT TAGTAGAGAC AGGGTTTCAC CATGTTGGCC AGGCTGGTCT TGAACTCCTG
3241 ACCGCAGTAA TCCACCTGCT TCGGCCTCCC AAAGTGCTGG GATTACAGGC GCAACGACCC
3301 AGCCCAGCTT CTTTCCATTC CTTGATAGGC GAGTATTCCA AAGCTGGTAT CGTAGCTGCC
3361 CTAATGTTGC ATATTAGGCG GCGGGGGCAG AGATAAGGGC CATCTCTCTG TGATTCTGCC
3421 TCAGCTCCTG TCTTGCTGAG CCCTCCCCCA ACCCACGCTC CAACACACAC ACACACACAC
3481 ACACACACAC ACACACACAC ACACACACAC ACACACACGC CCCTCTACTG CTATGTGGCT
3541 TCAACCAGCC TCACAGCCAC ACGGGGGAAG CAGAGAGTCA AGAATGCAAA GAGGCCGCTT
3601 CCCTAAGAGG CTTGGAGGAG CTGGGCTCTA TCCCACACCC ACCCCCACCC CACCCCCACC
3661 CAGCCTCCAG AAGCTGGAAC CATTTCTCCC GCAGCCTGAG TTCCTAAGGA AACCACCCTA
3721 CCGGGGTGGA AGGGAGGGTC AGGGAAGAAA CCCACTCTTG CTCTACGAGG AGCAAGTGCC
3781 TGCCCCCTCC CAGCAGCCAG CCCTGCCAAA GTTGCATTAT CTTTGGCCAA GGCTGGGCCT
3841 GACGGTTATG ATTTCAGCCC TGGGCCTGCA GGAGAGGCTG AGACCAGCCC ACCCAGCCAG
3901 TGGTCGAGCA CTGCCCCGCC GCCAAAGTCT GCAGAATGTG AGATGAGGTT CTCAAGGTCA
3961 CAGGCCCCAG TCCCAGCCTG GGGGCTGCAG AGGCCCCCAT ATACTCTGCT ACAGCTCCTA
4021 TCATGAAAAA TAAAATGTTT GTCTTTGCAA AACAGTAAAA AAAAAAAAAA AAAAAAAAAA
4081 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
B: nucleotide sequence (SEQ ID NO:14) length: 232 amino acid
1 MILELDPTFQ LLPPGTGGSQ AELAQSTMSM RKKEESEALD IKYIEVTSAR SRCHDGPQHC
61 SSPSVTPPFG SLRSGGLLLS RDVPRETRSS SESLIFSGNQ GRGHQRPLPP SEGLSPRPPN
121 SPSISIPCMG SKASSPHGLG SPLVASPRLE KRLGGLAPQR GSRISVLSAS PVSDVSYMFG
181 SSQSLLHSAT PAISHLPDPW KVQPTLPPAS TALAQCPCVQ DPVTSRLPET PP
C. Nucleotide and amino acid composite sequence (SEQ ID NO:15) clone number: PP14434
Start code: 483ATG stops coding: 1179TAA protein molecular weight: 24467.33
1 GC TGA GTC GTC TTG GTC CCA GGA GCC AGT AGT GAA GGC AAC AGT CTG 47
48 CCC ACC TGT GGA CAC CAG ATC CTG GGA GCT CCT GGT TAG CAA GTG AGA 95
96 TCT CTG GGA TGT CAG TGA GGC TGG TTG AAG ACC AGA GGT AAA CTG CAG 143
144 AGG TCA CCA CCC CCA CCA TGT CCC AGG TGA TGT CCA GCC CAC TGC TGG 191
192 CAG GAG GCC ATG CTG TCA GCT TGG CGC CTT GTG ATG AGC CCA GGA GGA 239
240 CCC TGC ACC CAG CAC CCA GCC CCA GCC TGC CAC CCC AGT GTT CTT ACT 287
288 ACA CCA CGG AAG GCT GGG GAG CCC AGG CCC TGA TGG CCC CCG TGC CCT 335
336 GCA TGG GGC CCC CTG GCC GAC TCC AGC AAG CCC CAC AGG TGG AGG CCA 383
384 AAA CCA CCT GCT TCC TGC CGT CCC CTG GTG AGA AAG GCC TTG GGG ACC 431
432 CCA GAG GAC CTT GAC TCC TAC ATT GAC TTC TCA CTG GAG AGC CTC AAT 479
480 CAG ATG ATC CTG GAA CTG GAC CCC ACC TTC CAG CTG CTT CCC CCA GGG 527
1 Met Ile Leu Glu Leu Asp Pro Thr Phe Gln Leu Leu Pro Pro Gly 15
528 ACT GGG GGC TCC CAG GCT GAG CTG GCC CAG AGC ACC ATG TCA ATG AGA 575
16 Thr Gly Gly Ser Gln Ala Glu Leu Ala Gln Ser Thr Met Ser Met Arg 31
576 AAG AAG GAG GAA TCT GAA GCC TTG GAC ATA AAG TAC ATC GAG GTG ACC 623
32 Lys Lys Glu Glu Ser Glu Ala Leu Asp Ile Lys Tyr Ile Glu Val Thr 47
624 TCC GCC AGA TCA AGG TGC CAC GAT GGC CCC CAG CAC TGC TCC AGC CCC 671
48 Ser Ala Arg Ser Arg Cys His Asp Gly Pro Gln His Cys Ser Ser Pro 63
672 TCT GTC ACC CCG CCC TTC GGC TCC CTT CGC AGT GGT GGC CTC CTC CTT 719
64 Ser Val Thr Pro Pro Phe Gly Ser Leu Arg Ser Gly Gly Leu Leu Leu 79
720 TCC AGA GAC GTC CCC CGA GAG ACA CGA AGC AGC AGT GAG AGC CTC ATC 767
80 Ser Arg Asp Val Pro Arg Glu Thr Arg Ser Ser Ser Glu Ser Leu Ile 95
768 TTC TCT GGG AAC CAG GGC AGG GGG CAC CAG CGC CCT CTG CCC CCC TCA 815
96 Phe Ser Gly Asn Gln Gly Arg Gly His Gln Arg Pro Leu Pro Pro Ser 111
816 GAG GGT CTC TCC CCT CGA CCC CCA AAT TCC CCC AGC ATC TCA ATC CCT 863
112 Glu Gly Leu Ser Pro Arg Pro Pro Asn Ser Pro Ser Ile Ser Ile Pro 127
864 TGC ATG GGG AGC AAG GCC TCG AGC CCC CAT GGT TTG GGC TCC CCG CTG 911
128 Cys Met Gly Ser Lys Ala Ser Ser Pro His Gly Leu Gly Ser Pro Leu 143
912 GTG GCT TCT CCA AGA CTG GAG AAG CGG CTG GGA GGC CTG GCC CCA CAG 959
144 Val Ala Ser Pro Arg Leu Glu Lys Arg Leu Gly Gly Leu Ala Pro Gln 159
960 CGG GGC AGC AGG ATC TCT GTG CTG TCA GCC AGC CCA GTG TCT GAT GTC 1007
160 Arg Gly Ser Arg Ile Ser Val Leu Ser Ala Ser Pro Val Ser Asp Val 175
1008 AGC TAT ATG TTT GGA AGC AGC CAG TCC CTT CTG CAC TCA GCA ACT CCA 1055
176 Ser Tyr Met Phe Gly Ser Ser Gln Ser Leu Leu His Ser Ala Thr Pro 191
1056 GCC ATC AGT CAT CTT CCA GAT CCT TGG AAA GTC CAG CCA ACT CTT CCT 1103
192 Ala Ile Ser His Leu Pro Asp Pro Trp Lys Val Gln Pro Thr Leu Pro 207
1104 CCA GCC TCC ACA GCC TTG GCT CAG TGT CCC TGT GTA CAA GAC CCA GTG 1151
208 Pro Ala Ser Thr Ala Leu Ala Gln Cys Pro Cys Val Gln Asp Pro Val 223
1152 ACT TCC AGG CTC CCA GAA ACC CCA CCC TAA CCA TGG GCC AAC CCA GAA 1199
224 Thr Ser Arg Leu Pro Glu Thr Pro Pro *** 233
1200 CAC CCC ACT CTC CAC CAC TGG CCA AAG AAC ATG CCA GCA GCT GCC CCC 1247
1248 CAT CCA TCA CCA ACT CCA TGG TGG ACA TAC CCA TTG TGC TGA TCA ACG 1295
1296 GCT GGC CAG AAC CAG GGT CTT CTC CAC CCC AGC GGA CCC CAG GAC ACC 1343
1344 AGA ACT CCG TTC AAC CTG GAG CTG CTT CTC CCA GCA ACC CCT GTC CAG 1391
1392 CCA CCA GGA GCA ACA GCC AGA CCC TGT CAG ATG CCC CCT TTA CCA CAT 1439
1440 GCC CAG AGG GTC CCG CCA GGG ACA TGC AGC CCA CCA TGA AGT TCG TGA 1487
1488 TGG ACA CAT CTA AAT ACT GGT TTA AGC CAA ACA TCA CCC GAG AGC AAG 1535
1536 CAA TCG AGC TGC TGA GGA AGG AGG AGC CAG GGG CTT TTT GTC ATA AGG 1583
1584 GAC AGC TCT TCA TAC CGA GGC TCC TTC GGC CTG GCC CTG AAG GTG CAG 1631
1632 GAG GTT CCC GCG TCT GCT CAG AGT CGA CCA GGT GAG GAC AGC AAT GAC 1679
1680 CTC ATC CGA CAC TTC CTC ATC GAG TCG TCT GCC AAA GGA GTG CAT CTC 1727
1728 AAA GGA GCA GAT GAG GAG CCC TAC TTT GGG AGC CTC TCT GCC TTC GTG 1775
1776 TGC CAG CAT TCC ATC ATG GCC CTG GCC CTG CCC TGC AAA CTC ACC ATC 1823
1824 CCA CAG AGA GAA CTG GGA GGT GCA GAT GGG GCC TCG GAC TCT ACA GAC 1871
1872 AGC CCA GCC TCC TGC CAG AAG AAA TCT GCG GGC TGC CAC ACC CTG TAC 1919
1920 CTG AGC TCA GTG AGC GTG GAG ACC CTG ACT GGA GCC CTG GCC GTC AGA 1967
1968 AAG CCA TCT CCA CCA CCT TTG AGA GGG ACA TCC TCC CCA CGC CCA CCG 2015
2016 TGG TCC ACT TCA AAG TCA CAG AGC AGG GCA TCA CTC TGA CTG ATG TCC 2063
2064 AGA GGA AGG TGT TTT TCC GGC GCC ATT ACC CAC TCA CCA CCC TCC GCT 2111
2112 TCT GTG GTA TGG ACC CTG AGC AAC GGA AGT GGC AGA AGT ACT GCA AAC 2159
2160 CCT CCT GGA TCT TTG GGT TTG TGG CCA AGA GCC AGA CAG AGC CTC AGG 2207
2208 AGA ACG TAT GCC ACC TCT TTG CGG AGT ATG ACA TGG TCC AGC CAG CCT 2255
2256 CGC AGG TCA TCG GCC TGG TGA CTG CTC TGC TGC AGG ACG CAG AAA GGA 2303
2304 TGT AGG GGA GAG ACT GCC TGT GCA CCT AAC CAA CAC CTC CAG GGG CTC 2351
2352 GCT AAG GAG CCC CCC TCC ACC CCC TGA ATG GGT GTG GCT TGT GGC ATA 2399
2400 TTG ACA GAC CAA TCT ATG GGA CTA GGG GGA TTG GCA TCA AGT TGA CAC 2447
2448 CCT TGA ACC TGC TAT GGC CTT CAG CAG TCA CCA TCA TCC AGA CCC CCC 2495
2496 GGG CCT CAG TTT CCT CAA TCA TAG AAG AAG ACC AAT AGA CAA GAT CAG 2543
2544 CTG TTC TTA GAT GCT GGT GGG CAT TTG AAC ATG CTC CTC CAT GAT TCT 2591
2592 GAA GCA TGC ACA CCT CTG AAG ACC CCT GCA TGA AAA TAA CCT CCA AGG 2639
2640 ACC CTC TGA CCC CAT CGA CCT GGG CCC TGC CAC ACA ACA GTC TGA GCA 2687
2688 AGA GAC CTG CAG CCC CTG TTT CGT GGC AGA CAG CAG GTG CCT GGC GGT 2735
2736 GAC CCA CGG GGC TCC TGG CTT GCA GCT GGT GAT GGT CAA GAA CTG ACT 2783
2784 ACA AAA CAG GAA TGG ATA GAC TCT ATT TCC TTC CAT ATC TGT TCC TCT 2831
2832 GTT CCT TTT CCC ACT TTC TGG GTG GCT TTT TGG GTC CAC CCA GCC AGG 2879
2880 ATG CTG CAG GCC AAG CTG GGT GTG GTA TTT AGG GCA GCT CAG CAG GGG 2927
2928 GAA CTT GTC CCC ATG GTC AGA GGA GAC CCA GCT GTC CTG CAC CCC CTT 2975
2976 GCA GAT GAG TAT CAC CCC ATC TTT TCT TTC CAC TTG GTTT TT ATT TTT 3023
3024 ATT TTT TTT TGA GAC AGA GTC TCA CTG TCA CCC AGG CTG AAC TGC AGT 3071
3072 GGT GTG ATC TAG GCT CAC TGC AAC CTC CAC CTC CCA GGT TCA AGC AAT 3119
3120 TAT CCT GCC TCA GGC TCC CAA GTA GCT GGG ATT ACA GGC ATG TGC AAC 3167
3168 TCA CCC AGC TAA TTT TGT ATT TTT AGT AGA GAC AGG GTT TCA CCA TGT 3215
3216 TGG CCA GGC TGG TCT TGA ACT CCT GAC CGC AGT AAT CCA CCT GCT TCG 3263
3264 GCC TCC CAA AGT GCT GGG ATT ACA GGC GCA ACG ACC CAG CCC AGC TTC 3311
3312 TTT CCA TTC CTT GAT AGG CGA GTA TTC CAA AGC TGG TAT CGT AGC TGC 3359
3360 CCT AAT GTT GCA TAT TAG GCG GCG GGG GCA GAG ATA AGG GCC ATC TCT 3407
3408 CTG TGA TTC TGC CTC AGC TCC TGT CTT GCT GAG CCC TCC CCC AAC CCA 3455
3456 CGC TCC AAC ACA CAC ACA CAC ACA CAC ACA CAC ACA CAC ACA CAC ACA 3503
3504 CAC ACA CAC ACA CAC GCC CCT CTA CTG CTA TGT GGC TTC AAC CAG CCT 3551
3552 CAC AGC CAC ACG GGG GAA GCA GAG AGT CAA GAA TGC AAA GAG GCC GCT 3599
3600 TCC CTA AGA GGC TTG GAG GAG CTG GGC TCT ATC CCA CAC CCA CCC CCA 3647
3648 CCC CAC CCC CAC CCA GCC TCC AGA AGC TGG AAC CAT TTC TCC CGC AGC 3695
3696 CTG AGT TCC TAA GGA AAC CAC CCT ACC GGG GTG GAA GGG AGG GTC AGG 3743
3744 GAA GAA ACC CAC TCT TGC TCT ACG AGG AGC AAG TGC CTG CCC CCT CCC 3791
3792 AGC AGC CAG CCC TGC CAA AGT TGC ATT ATC TTT GGC CAA GGC TGG GCC 3839
3840 TGA CGG TTA TGA TTT CAG CCC TGG GCC TGC AGG AGA GGC TGA GAC CAG 3887
3888 CCC ACC CAG CCA GTG GTC GAG CAC TGC CCC GCC GCC AAA GTC TGC AGA 3935
3936 ATG TGA GAT GAG GTT CTC AAG GTC ACA GGC CCC AGT CCC AGC CTG GGG 3983
3984 GCT GCA GAG GCC CCC ATA TAC TCT GCT ACA GCT CCT ATC ATG AAA AAT 4031
4032 AAA ATG TTT GTC TTT GCA AAA CAG TAA AAA AAA AAA AAA AAA AAA AAA 4079
4080 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA 4127
4128 AAA 4130
6.PP14630
A: nucleotide sequence (SEQ ID NO:16) length: 3128 bases
1 GTTGACCGCA GCCCAGGAGC CCATGCCGAA GAGAGCCACC AGCAGGGCCA GACACGCTTT
61 GGCTCTCCTG GGAAGTGAAG ACTTCTTCTA GCTTGGAAGA AACAGAGAAA CAGATAGGAT
121 GCAGAGCCGG AGTCAGAACA GGGTGAGACC AGGGGAACTG GGGGAGCCCC GGGCAGGAGC
181 GCAGATGGGG AGGATTGGCT AAAACGTACC CAGCGGGACC GCCGGCAGGC AGGGAAGCAG
241 GAAGCCGGCC CGAGTCGCTG CGGCCCCAGG GGAGGAGGCC GAAGGAGCGA GTGGAGCCCT
301 CCTCTCCAGG GAGAGGCGGG GCTGGAGTCA ACGCCACCAC CACCCCAAGC CGGACGCCGG
361 GAAAGGGCGC CTACCAGGAG GAGCCACGCA CACCCCAGGG CTTCACGGTG CCCGCGACCG
421 CAAGAGCCAA GTCAGATCCC AGGGGAGTGG CCGGGACGGC TCTGCTCGGA GGGGCGCCGC
481 CCTCTCGGTA CCGTCAGGTG GGCGACTCGG GAGCCGTCCG GTCCCTGCGC CTCCCGGTCC
541 CGCGCCCAGT GCGCTCCCGC TCCAGTGCCG GCTTCCCGCC ACGCCGGGAC ACCGCCGTCC
601 GGCCCAGGGA CGCCACCTCA GTCCGGAGCG CGCAGGACCC GGCAGCAGCC ACGACCCGAC
661 TCACCCGTGG CGGTTCCGGG CCCGCCCCCG GCATGGCCCA CGACCGGAAG TCCCGCCCCG
721 GAAGCTGGTC GGCGGGCGGG CGGGCGTTGA GAGGAACCGG GGGGTTGTCG GTCTATAAGC
781 CTCGCCCGTT CCGCTCCCTG GGGCTTCCCC GAGCGCCGTC GGTGGTCATG GCTGCCCCAG
841 CCTCCCGGCA GGTCCGACGC AGAGCCCGGG CAGCGCCGCG GGCCCCGCTC GGCCGAGGAC
901 TGGTGGTGGG ACCGGCTGGC GCCGAGGGGC TCGGGGTACC ACCTGCTGCA GTCCGACAGC
961 ATGCTGCTGG TGCTGTCCGA ACCCGGCCCC GCCCGGCCCC GCGCACAGCG CGCGCTTCCC
1021 GCCGCACTCC CCGGCAGCCG CCCCGGGGCC CCAGCGCCGC GGCCAAGCCC AAGGCCGGGC
1081 TCAGGTCCGA GGCGGCGGCC GCGCCCGCAC CCGCACCGGC ACCCACGCCC ACGCCCGAGG
1141 AAGGGCCCGA CGCGGGCTGG GGAGACCGCA TTCCCTTGGA AATCCTGGTG CAGATTTTCG
1201 GGTTGTTGGT GGCGGCGGAC GGCCCCATGC CCTTCCTGGG CAGGTAACGC TGGTGCCGGG
1261 CCCGCCGCCG AGCGTAGCGG CTTGGGCCAG ACGTGGTCCG AGCGGTGGCC CGGGCGGGGG
1321 CGGAGGGCGA AAGCATCGGA GCGCGCACCG CTCAGTCCGA GAGCGCAGCC CCTTAGGTGC
1381 CCAGTTGGAG TCCCAGGAGC CCGGCTTTGA GCCGGGGTGT CTACTGCGCT GCGAGAGGGG
1441 GGCATCGCCT ACGGAGGGGC CGGCACCCTC AGCACGCTGT CCTCCCAGGG CTGCGCGCGT
1501 GTGCCGCCGC TGGCAGGAGG CCGCTTCCCA ACCCGCGCTC TGGCACACCG TGACCCTGTC
1561 GTCCCCGCTG GTCGGCCGGC CTGCCAAGGG CGGGGTCAAG GCGGAGAAGA AGCTCCTTGC
1621 TTCCCTGGAG TGGCTTATGC CCAATCGGTT TTCACAGCTC CAGAGGCTGA CCCTCATCCA
1681 CTGGAAGTCT CAGGTACACC CCGTGTTGAA GGTGAGAGCT CCAGGCTGTC CTGCACATCA
1741 GCTGTGACAC TCTGGGACTG TTCAGTACTC TAGGAAGTGG GTCAGGCACC TTGGGGGCCC
1801 AACGCTGCTC CGTGGGGATG TCTGCCTGCC TGCCTGGTTC TCTTTTCCTG CTGTTTCCTC
1861 CAGCAGGGAG GTATCAGAGG CGGGGACACC CAAGTAGGCC TGGCATGGGC AGAAAGGAGG
1921 TCACAGCTAA GGCGGTAGAG TGGGGTTGGC ACCAGCCACT TGTCTGTTTC CCTTGTGGAT
1981 CTTAGCCTGT CGTCTCCCAA CCCCAGCTGC CCCTCTGTCT CCCCGCAGCT GGTAGGTGAG
2041 TGCTGTCCTC GGCTCACTTT CCTCAAGCTC TCCGGCTGCC ACGGTGTGAC TGCTGACGCT
2101 CTGGTCATGC TAGCCAAAGC CTGCTGCCAG CTCCATAGCC TGGACCTACA GCACTCCATG
2161 GTGGAGTCCA CAGCTGTGGT GAGCTTCTTG GAGGAGGCAG GGTCCCGAAT GCGCAAGTTG
2221 TGGCTGACCT ACAGCTCCCA GACGACAGCC ATCCTGGGCG CACTGCTGGG CAGCTGCTGC
2281 CCCCAGCTCC AGGTCCTGGA GGTGAGCACC GGCATCAACC GTAATAGCAT TCCCCTTCAG
2341 CTGCCTGTCG AGGCTCTGCA GAAAGGCTGC CCTCAGCTCC AGGTGCTGCG GCTGTTGAAC
2401 CTGATGTGGC TGCCCAAGCC TCCGGGACGA GGGGTGGCTC CCGGACCAGG CTTCCCTAGC
2461 CTAGAGGAGC TCTGCCTGGC GAGCTCAACC TGCAACTTTG TGAGCAACGA GGTCCTGGGC
2521 CGCCTACTCC ACGGCTCTCC CAACCTGCGC TTACTGGATC TTCGTGGCTG TGCGCGCATC
2581 ACGCCGGCTG GCCTTCAGGA TCTGCCATGT CGGGAGCTGG AGCAGCTTCA TCTGGGCCTG
2641 TATGGCACGT CAGACCGGCT GACTCTAGCC AAGGAGGGCA GCCCCTTTTT GACCCAGAAG
2701 TGGTGCCATA CACTGCGAGA ACTGGACTTG AGTGGCCAGG GGTTCAGTGA GAAGGACCTG
2761 GAGCAGGCCC TGGCTGCCTT CTTAAGCACC CCTGGGGGCT CACACCCAGC CCTGTGCTCT
2821 CTTAACCTCA GGGGCACCCG GGTCACACCA AGCACTGTCA GCTCTGTGAT CAGCAGCTGC
2881 CCGGGCCTGC TCTACCTCAA CCTGGAGTCC TGCCGCTGCC TTCCCCGGGG TCTGAAGCGG
2941 GCCTACCGGG GCCTGGAGGA AGTCCAGTGG TGTCTGGAGC AGCTGCTCAC CAGCCCCTCA
3001 CCCAGCTAGG CAGCCACAGA CCTGGGACAC CTCAGCCAGC TTGCCCACCC TCCACCTTTG
3061 CCCAATTTCA GATATTTGAG CATTTTGTTA AAATAAAACA TTTTTAGGAA AAAAAAAAAA
3121 AAAAAAAA
B: nucleotide sequence (SEQ ID N0:17) length: 300 amino acid
1 MLAKACCQLH SLDLQHSMVE STAVVSFLEE AGSRMRKLWL TYSSQTTAIL GALLGSCCPQ
61 LQVLEVSTGI NRNSIPLQLP VEALQKGCPQ LQVLRLLNLM WLPKPPGRGV APGPGFPSLE
121 ELCLASSTCN FVSNEVLGRL LHGSPNLRLL DLRGCARITP AGLQDLPCRE LEQLHLGLYG
181 TSDRLTLAKE GSPFLTQKWC HTLRELDLSG QGFSEKDLEQ ALAAFLSTPG GSHPALCSLN
241 LRGTRVTPST VSSVISSCPG LLYLNLESCR CLPRGLKRAY RGLEEVQWCL EQLLTSPSPS
C. Nucleotide and amino acid composite sequence (SEQ ID NO:18) clone number: PP14630
Start code: 2107 ATG stop coding: 3007 TAG protein molecular weights: 32570.05
1 GTT GAC CGC AGC CCA GGA GCC CAT GCC GAA GAG AGC CAC CAG CAG GGC 48
49 CAG ACA CGC TTT GGC TCT CCT GGG AAG TGA AGA CTT CTT CTA GCT TGG 96
97 AAG AAA CAG AGA AAC AGA TAG GAT GCA GAG CCG GAG TCA GAA CAG GGT 144
145 GAG ACC AGG GGA ACT GGG GGA GCC CCG GGC AGG AGC GCA GAT GGG GAG 192
193 GAT TGG CTA AAA CGT ACC CAG CGG GAC CGC CGG CAG GCA GGG AAG CAG 240
241 GAA GCC GGC CCG AGT CGC TGC GGC CCC AGG GGA GGA GGC CGA AGG AGC 288
289 GAG TGG AGC CCT CCT CTC CAG GGA GAG GCG GGG CTG GAG TCA ACG CCA 336
337 CCA CCA CCC CAA GCC GGA CGC CGG GAA AGG GCG CCT ACC AGG AGG AGC 384
385 CAC GCA CAC CCC AGG GCT TCA CGG TGC CCG CGA CCG CAA GAG CCA AGT 432
433 CAG ATC CCA GGG GAG TGG CCG GGA CGG CTC TGC TCG GAG GGG CGC CGC 480
481 CCT CTC GGT ACC GTC AGG TGG GCG ACT CGG GAG CCG TCC GGT CCC TGC 528
529 GCC TCC CGG TCC CGC GCC CAG TGC GCT CCC GCT CCA GTG CCG GCT TCC 576
577 CGC CAC GCC GGG ACA CCG CCG TCC GGC CCA GGG ACG CCA CCT CAG TCC 624
625 GGA GCG CGC AGG ACC CGG CAG CAG CCA CGA CCC GAC TCA CCC GTG GCG 672
673 GTT CCG GGC CCG CCC CCG GCA TGG CCC ACG ACC GGA AGT CCC GCC CCG 720
721 GAA GCT GGT CGG CGG GCG GGC GGG CGT TGA GAG GAA CCG GGG GGT TGT 768
769 CGG TCT ATA AGC CTC GCC CGT TCC GCT CCC TGG GGC TTC CCC GAG CGC 816
817 CGT CGG TGG TCA TGG CTG CCC CAG CCT CCC GGC AGG TCC GAC GCA GAG 864
865 CCC GGG CAG CGC CGC GGG CCC CGC TCG GCC GAG GAC TGG TGG TGG GAC 912
913 CGG CTG GCG CCG AGG GGC TCG GGG TAC CAC CTG CTG CAG TCC GAC AGC 960
961 ATG CTG CTG GTG CTG TCC GAA CCC GGC CCC GCC CGG CCC CGC GCA CAG 1008
1009 CGC GCG CTT CCC GCC GCA CTC CCC GGC AGC CGC CCC GGG GCC CCA GCG 1056
1057 CCG CGG CCA AGC CCA AGG CCG GGC TCA GGT CCG AGG CGG CGG CCG CGC 1104
1105 CCG CAC CCG CAC CGG CAC CCA CGC CCA CGC CCG AGG AAG GGC CCG ACG 1152
1153 CGG GCT GGG GAG ACC GCA TTC CCT TGG AAA TCC TGG TGC AGA TTT TCG 1200
1201 GGT TGT TGG TGG CGG CGG ACG GCC CCA TGC CCT TCC TGG GCA GGT AAC 1248
1249 GCT GGT GCC GGG CCC GCC GCC GAG CGT AGC GGC TTG GGC CAG ACG TGG 1296
1297 TCC GAG CGG TGG CCC GGG CGG GGG CGG AGG GCG AAA GCA TCG GAG CGC 1344
1345 GCA CCG CTC AGT CCG AGA GCG CAG CCC CTT AGG TGC CCA GTT GGA GTC 1392
1393 CCA GGA GCC CGG CTT TGA GCC GGG GTG TCT ACT GCG CTG CGA GAG GGG 1440
1441 GGC ATC GCC TAC GGA GGG GCC GGC ACC CTC AGC ACG CTG TCC TCC CAG 1488
1489 GGC TGC GCG CGT GTG CCG CCG CTG GCA GGA GGC CGC TTC CCA ACC CGC 1536
1537 GCT CTG GCA CAC CGT GAC CCT GTC GTC CCC GCT GGT CGG CCG GCC TGC 1584
1585 CAA GGG CGG GGT CAA GGC GGA GAA GAA GCT CCT TGC TTC CCT GGA GTG 1632
1633 GCT TAT GCC CAA TCG GTT TTC ACA GCT CCA GAG GCT GAC CCT CAT CCA 1680
1681 CTG GAA GTC TCA GGT ACA CCC CGT GTT GAA GGT GAG AGC TCC AGG CTG 1728
1729 TCC TGC ACA TCA GCT GTG ACA CTC TGG GAC TGT TCA GTA CTC TAG GAA 1776
1777 GTG GGT CAG GCA CCT TGG GGG CCC AAC GCT GCT CCG TGG GGA TGT CTG 1824
1825 CCT GCC TGC CTG GTT CTC TTT TCC TGC TGT TTC CTC CAG CAG GGA GGT 1872
1873 ATC AGA GGC GGG GAC ACC CAA GTA GGC CTG GCA TGG GCA GAA AGG AGG 1920
1921 TCA CAG CTA AGG CGG TAG AGT GGG GTT GGC ACC AGC CAC TTG TCT GTT 1968
1969 TCC CTT GTG GAT CTT AGC CTG TCG TCT CCC AAC CCC AGC TGC CCC TCT 2016
2017 GTC TCC CCG CAG CTG GTA GGT GAG TGC TGT CCT CGG CTC ACT TTC CTC 2064
2065 AAG CTC TCC GGC TGC CAC GGT GTG ACT GCT GAC GCT CTG GTC ATG CTA 2112
1 Met Leu 2
2113 GCC AAA GCC TGC TGC CAG CTC CAT AGC CTG GAC CTA CAG CAC TCC ATG 2160
3 Ala Lys AAa Cys Cys Gln Leu His Ser Leu Asp Leu Gln His Ser Met 18
2161 GTG GAG TCC ACA GCT GTG GTG AGC TTC TTG GAG GAG GCA GGG TCC CGA 2208
19 Val Glu Ser Thr Ala Val Val Ser Phe Leu Glu Glu Ala Gly Ser Arg 34
2209 ATG CGC AAG TTG TGG CTG ACC TAC AGC TCC CAG ACG ACA GCC ATC CTG 2256
35 Met Arg Lys Leu Trp Leu Thr Tyr Ser Ser Gln Thr Thr Ala Ile Leu 50
2257 GGC GCA CTG CTG GGC AGC TGC TGC CCC CAG CTC CAG GTC CTG GAG GTG 2304
51 Gly Ala Leu Leu Gly Ser Cys Cys Pro Gln Leu Gln Val Leu Glu Val 66
2305 AGC ACC GGC ATC AAC CGT AAT AGC ATT CCC CTT CAG CTG CCT GTC GAG 2352
67 Ser Thr Gly Ile Asn Arg Asn Ser Ile Pro Leu Gln Leu Pro Val Glu 82
2353 GCT CTG CAG AAA GGC TGC CCT CAG CTC CAG GTG CTG CGG CTG TTG AAC 2400
83 Ala Leu Gln Lys Gly Cys Pro Gln Leu Gln Val Leu Arg Leu Leu Asn 98
2401 CTG ATG TGG CTG CCC AAG CCT CCG GGA CGA GGG GTG GCT CCC GGA CCA 2448
99 Leu Met Trp Leu Pro Lys Pro Pro Gly Arg Gly Val Ala Pro Gly Pro 114
2449 GGC TTC CCT AGC CTA GAG GAG CTC TGC CTG GCG AGC TCA ACC TGC AAC 2496
115 Gly Phe Pro Ser Leu Glu Glu Leu Cys Leu Ala Ser Ser Thr Cys Asn 130
2497 TTT GTG AGC AAC GAG GTC CTG GGC CGC CTA CTC CAC GGC TCT CCC AAC 2544
131 Phe Val Ser Asn Glu Val Leu Gly Arg Leu Leu His Gly Ser Pro Asn 146
2545 CTG CGC TTA CTG GAT CTT CGT GGC TGT GCG CGC ATC ACG CCG GCT GGC 2592
147 Leu Arg Leu Leu Asp Leu Arg Gly Cys Ala Arg Ile Thr Pro Ala Gly 162
2593 CTT CAG GAT CTG CCA TGT CGG GAG CTG GAG CAG CTT CAT CTG GGC CTG 2640
163 Leu Gln Asp Leu Pro Cys Arg Glu Leu Glu Gln Leu His Leu Gly Leu 178
2641 TAT GGC ACG TCA GAC CGG CTG ACT CTA GCC AAG GAG GGC AGC CCC TTT 2688
179 Tyr Gly Thr Ser Asp Arg Leu Thr Leu Ala Lys Glu Gly Ser Pro Phe 194
2689 TTG ACC CAG AAG TGG TGC CAT ACA CTG CGA GAA CTG GAC TTG AGT GGC 2736
195 Leu Thr Gln Lys Trp Cys His Thr Leu Arg Glu Leu Asp Leu Ser Gly 210
2737 CAG GGG TTC AGT GAG AAG GAC CTG GAG CAG GCC CTG GCT GCC TTC TTA 2784
211 Gln Gly Phe Ser Glu Lys Asp Leu Glu Gln Ala Leu Ala Ala Phe Leu 226
2785 AGC ACC CCT GGG GGC TCA CAC CCA GCC CTG TGC TCT CTT AAC CTC AGG 2832
227 Ser Thr Pro Gly Gly Ser His Pro Ala Leu Cys Ser Leu Asn Leu Arg 242
2833 GGC ACC CGG GTC ACA CCA AGC ACT GTC AGC TCT GTG ATC AGC AGC TGC 2880
243 Gly Thr Arg Val Thr Pro Ser Thr Val Ser Ser Val Ile Ser Ser Cys 258
2881 CCG GGC CTG CTC TAC CTC AAC CTG GAG TCC TGC CGC TGC CTT CCC CGG 2928
259 Pro Gly Leu Leu Tyr Leu Asn Leu Glu Ser Cys Arg Cys Leu Pro Arg 274
2929 GGT CTG AAG CGG GCC TAC CGG GGC CTG GAG GAA GTC CAG TGG TGT CTG 2976
275 Gly Leu Lys Arg Ala Tyr Arg Gly Leu Glu Glu Val Gln Trp Cys Leu 290
2977 GAG CAG CTG CTC ACC AGC CCC TCA CCC AGC TAG GCA GCC ACA GAC CTG 3024
291 Glu Gln Leu Leu Thr Ser Pro Ser Pro Ser *** 301
3025 GGA CAC CTC AGC CAG CTT GCC CAC CCT CCA CCT TTG CCC AAT TTC AGA 3072
3073 TAT TTG AGC ATT TTG TTA AAA TAA AAC ATT TTT AGG AAA AAA AAA AAA 3120
3121 AAA AAA AA 3128
7.FP248
A: nucleotide sequence (SEQ ID NO:19) length: 2978 bases
1 GGCTTACTGC GGAACAGCAG GTTGGTGTCC ACAGCATTCA GCATCTGGAA CACGGCCTGG
61 GGGTGGTAGC AGATGGTGTG GCCTGTGTAA CAGACGGGCA GCTGGTGGCT CAGGGCTCCC
121 TGTCTCCTGG GGGGCTGTAG CCCACCCCTG CCCCTCTTCA GCCCCAAGCT CCTCCTCATC
181 CACCCCCTTT CCTCTCTTGT GGGTGGTGAG TCAGGGAGGG TCAAGGGTGC TTGGGTGGGG
241 GCAGGCCTGG GGGTCTTTAA GCCCGGCCTC GCCCTCCCAC CTATGAAGAG TGTCACCCAG
301 GTCTTGATGT GGCAGGAGTG GCTGTGCAGG TAGCTGAGGG CCTGTGACAA GTGGATGCTG
361 AATTCCTCCT GGCTGCGGGT CATCTGGCCG GAGGAGAGCA CACCTGGCTG GGACGGGCTG
421 GGGGCCCTGG GCATACCAGG GTCCTCCAGC CCCAGTTTGC CTTGCCCAGC CCCTGGAGCT
481 GCTATAGCAG CTGTGTGTGA GACGGGAGTG AATGGGGGAG CCGCAAGCCT GGTCTCCCCC
541 ACTCCCTGGC CCTGTTCCTC CCCACCGATC CCAGGCCTCT CACCAGGCAG GTCCAGAGGA
601 AGTGGCGGGC GCTGAGGCCC CTCTCCCAGG CCAGCGTGCA GAAGAGGGTG TGCAGTAGCC
661 GCCAGCGGAG CAGCACAGCA CAGCGGTAGA AGGTGAACTT GGCCTGCTGC AGGGACAGGC
721 GTGGAGGGTC ACCCACCGCC TATGTCTGAG CCCTTTCCCC TAGAGGCCAC AGGCCTCCAT
781 GGGCACTGGA GGCAGCCTGC TGTGCGGGTG TTCCCTGGTT CTGTCCTGCT TGTGCCCCTC
841 GGCAAGCCTC CCCGACCCTG GCAACAGCAC CTGGCCCCCC GGAGCCCAGG CTGGTCTGCC
901 CGCGGCCCTG GCCCTGCCCC TTCCACGGTT GCCCCGTATC CTTTTCCCCA TGGCAGGGAG
961 ACCCGCGAGG CCAAGCTCTG ACTTCGTGGG CTGTGCACAG GGCATGTGCT GCCACGGCAG
1021 GCAGGGCACA GTCCATATTC ACACAAGCTC TGTGAGCTGC TGGACCCCCT GCCCCGTTAC
1081 AGGGACTGGG GGCACAGCAG TGAGCAGAAA AGACCGGGTC CTGCCGCATC GACGGCAAGT
1141 CTCGCTCGCA TGCGTGTGCG CAGTGGGGGA GCGGGCAGGC CAACTGTGGT CACAGAAACC
1201 AGTGCAGATG GCCAGACCCT CTGCCCGCCA CTTGCTGCCC CGTGGGAGCT CCCCCAACTC
1261 TCAGGCAGTG CTGCTGCCAT CCGTCTGCCC CGTACCCTGG CCTCCTGTGG GTCCCAGCCC
1321 TGGCCAAGGT GAAGGCCTAT CACCTGCCTT TCCTGGGGTG GGCACTGACC GTGGCGACAG
1381 CTGGGCATTG GTCCTTCAGG TGTAGGAGCA GGGGCACCAT GCTCTGGTGC ACCTGGGTCC
1441 GCAGCCGCTC AGCTCCCTGT CTGCCATGGC CGCCACCAGG TCCCCGAACA GTGCCATGGC
1501 TGCCGCCCGA ATCCCGTCCG CTCCTGCAAG GCAGAGGCTC AGAGGCACGG CCAGACCTGT
1561 CCAGGGGTCC CAGCTTTGGT CCAAGTTGGG ACCCCACACC TTGTAGGGGT ACAACTCAGT
1621 TCTTTCTGCA GGATTCCTTC ACCCAGGCTC TCGGCTCCCG GGGAAGGGAA GGGCTGGGGC
1681 TCCCCCCTCA CACAGGGCTC TCTGGTGTCC CTGAGGATGT AAGAATCACA TCTCCCTTCT
1741 ACCCACAACT GCATCCTGGC AGCCCAGGCC TCATGAATGC ATTTGAGGGG CGCCTATCCC
1801 CCAGATTCCC CAGTGAAGGA AAAACAGCCA CGCTAACCGT GCTGACATGT CAGAAAGCAA
1861 ATCTGGGTAG GCTGTCGTGG GGCCGGAGAC CACTGCACTT GGAGACTGAA CGTGAGAACT
1921 AGTTCAGCCC TGTGGAGAAG GGGTGAGCGC CAGGGGCCCA CCCAGCCCAC CTGCTCAGTG
1981 CCCACCTCAA CCGAGGGCCA CAAGCGGGCA GCTCCCCAGG GCCTGGGCCT CCCTGCTTGC
2041 AGCTTGGGGT TCCCCCTTTG GCAGAGGAAG GGAGCTGGGT TGGGAGGAGG GTCCTGGAGG
2101 GCGTGAGCAG GAGGTAGCCC CGCCCTGCCC CAGTGCTCAC GTCATTAAAG AAGGAGCGTG
2161 TGCTGATGGC AACGCCGAGG CTCTGACTCC CTGTGCCCTG CGCGCCCAGG CGGTGCAGCG
2221 TGTCTGACAC GGTGCCCATG ATGCACACGA TCACCTGGTC GCTGCTCTGG AAGAAGCCGT
2281 CGAGCAAGGG CTGCAGCTGT CCCTGGAGCA GGCTTCCCTG GGGGTGGTGG CGGCTGGTGG
2341 GCGGAGAGCA CTAGGACCCG CTGGCCCCAT GCCCCCGCCT CGTTCTCCCA CTTGACCCCC
2401 TCACCCCATC TGCACTGGGT GGGGGGGTGG GGGGTGGTGA TTCAGAGCTG TGGGTCCCGG
2461 CCTGGCCCTT CCCGCCATGG GGAGCTGTGG GTCCCGGCCT GGCCCTGCCC ACCGTGGGGA
2521 GCTGTGCCCC TAACTGGGCT TTGTCCACCA GGAGCTGCGT GGTCTGGACA GGGTGGCCTC
2581 TTTTCTTTTT TGAGACAGGG TCTCACACTG TCACCCAGGC TGGAGTACAG TGGTGTGACC
2641 ATAGCTTACT GCAGCCTTGA CCTCCCAGGC TCAAGCAATC CTCCCAACAT AACCTCCTGA
2701 GTAGCTGGGT CTACAGCCAG TTCCGTCTTC TCCACTTGAT TGCGTGTTTG CAGTGAGTTC
2761 CTGAGTTATC GGGATGAATG GTCAGAAAGC AGTGTGCCCA CCAATGGATG TTTCCTGCCA
2821 TTGGGCCCCT TAGCAGTGAC TTGGGTGCAA ACCAGGGTGA TTTTTACAAC TTTTAAACAA
2881 GTTTAAAGCA ACTTTGTTGG CTTTCTTAGC TGTTTGCTTT GAAAATATTA AATTCATTTT
2941 CAGCAAGTGA GTCATTCAAC AAAAAAAAAA AAAAAAAA
B: nucleotide sequence (SEQ ID NO:20) length: 208 amino acid
1 MGTGGSLLCG CSLVLSCLCP SASLPDPGNS TWPPGAQAGL PAALALPLPR LPRILFPMAG
61 RPARPSSDFV GCAQGMCCHG RQGTVHIHTS SVSCWTPCPV TGTGGTAVSR KDRVLPHRRQ
121 VSLACVCAVG ERAGQLWSQK PVQMARPSAR HLLPRGSSPN SQAVLLPSVC PVPWPPVGPS
181 PGQGEGLSPA FPGVGTDRGD SWALVLQV
C. Nucleotide and amino acid composite sequence (SEQ ID NO:21) clone number: FP248
Start code: 779ATG stops coding: 1403TAG protein molecular weight: 21374.62
1 G GCT TAC TGC GGA ACA GCA GGT TGG TGT CCA CAG CAT TCA GCA TCT 46
47 GGA ACA CGG CCT GGG GGT GGT AGC AGA TGG TGT GGC CTG TGT AAC AGA 94
95 CGG GCA GCT GGT GGC TCA GGG CTC CCT GTC TCC TGG GGG GCT GTA GCC 142
143 CAC CCC TGC CCC TCT TCA GCC CCA AGC TCC TCC TCA TCC ACC CCC TTT 190
191 CCT CTC TTG TGG GTG GTG AGT CAG GGA GGG TCA AGG GTG CTT GGG TGG 238
239 GGG CAG GCC TGG GGG TCT TTA AGC CCG GCC TCG CCC TCC CAC CTA TGA 286
287 AGA GTG TCA CCC AGG TCT TGA TGT GGC AGG AGT GGC TGT GCA GGT AGC 334
335 TGA GGG CCT GTG ACA AGT GGA TGC TGA ATT CCT CCT GGC TGC GGG TCA 382
383 TCT GGC CGG AGG AGA GCA CAC CTG GCT GGG ACG GGC TGG GGG CCC TGG 430
431 GCA TAC CAG GGT CCT CCA GCC CCA GTT TGC CTT GCC CAG CCC CTG GAG 478
479 CTG CTA TAG CAG CTG TGT GTG AGA CGG GAG TGA ATG GGG GAG CCG CAA 526
527 GCC TGG TCT CCC CCA CTC CCT GGC CCT GTT CCT CCC CAC CGA TCC CAG 574
575 GCC TCT CAC CAG GCA GGT CCA GAG GAA GTG GCG GGC GCT GAG GCC CCT 622
623 CTC CCA GGC CAG CGT GCA GAA GAG GGT GTG CAG TAG CCG CCA GCG GAG 670
671 CAG CAC AGC ACA GCG GTA GAA GGT GAA CTT GGC CTG CTG CAG GGA CAG 718
719 GCG TGG AGG GTC ACC CAC CGC CTA TGT CTG AGC CCT TTC CCC TAG AGG 766
767 CCA CAG GCC TCC ATG GGC ACT GGA GGC AGC CTG CTG TGC GGG TGT TCC 814
1 Met Gly Thr Gly Gly Ser Leu Leu Cys Gly Cys Ser 12
815 CTG GTT CTG TCC TGC TTG TGC CCC TCG GCA AGC CTC CCC GAC CCT GGC 862
13 Leu Val Leu Ser Cys Leu Cys Pro Ser Ala Ser Leu Pro Asp Pro Gly 28
863 AAC AGC ACC TGG CCC CCC GGA GCC CAG GCT GGT CTG CCC GCG GCC CTG 910
29 Asn Ser Thr Trp Pro Pro Gly Ala Gln Ala Gly Leu Pro Ala Ala Leu 44
911 GCC CTG CCC CTT CCA CGG TTG CCC CGT ATC CTT TTC CCC ATG GCA GGG 958
45 Ala Leu Pro Leu Pro Arg Leu Pro Arg Ile Leu Phe Pro Met Ala Gly 60
959 AGA CCC GCG AGG CCA AGC TCT GAC TTC GTG GGC TGT GCA CAG GGC ATG 1006
61 Arg Pro Ala Arg Pro Ser Ser Asp Phe Val Gly Cys Ala Gln Gly Met 76
1007 TGC TGC CAC GGC AGG CAG GGC ACA GTC CAT ATT CAC ACA AGC TCT GTG 1054
77 Cys Cys His Gly Arg Gln Gly Thr Val His Ile His Thr Ser Ser Val 92
1055 AGC TGC TGG ACC CCC TGC CCC GTT ACA GGG ACT GGG GGC ACA GCA GTG 1102
93 Ser Cys Trp Thr Pro Cys Pro Val Thr Gly Thr Gly Gly Thr Ala Val 108
1103 AGC AGA AAA GAC CGG GTC CTG CCG CAT CGA CGG CAA GTC TCG CTC GCA 1150
109 Ser Arg Lys Asp Arg Val Leu Pro His Arg Arg Gln Val Ser Leu Ala 124
1151 TGC GTG TGC GCA GTG GGG GAG CGG GCA GGC CAA CTG TGG TCA CAG AAA 1198
125 Cys Val Cys Ala Val Gly Glu Arg Ala Gly Gln Leu Trp Ser Gln Lys 140
1199 CCA GTG CAG ATG GCC AGA CCC TCT GCC CGC CAC TTG CTG CCC CGT GGG 1246
141 Pro Val Gln Met Ala Arg Pro Ser Ala Arg His Leu Leu Pro Arg Gly 156
1247 AGC TCC CCC AAC TCT CAG GCA GTG CTG CTG CCA TCC GTC TGC CCC GTA 1294
157 Ser Ser Pro Asn Ser Gln Ala Val Leu Leu Pro Ser Val Cys Pro Val 172
1295 CCC TGG CCT CCT GTG GGT CCC AGC CCT GGC CAA GGT GAA GGC CTA TCA 1342
173 Pro Trp Pro Pro Val Gly Pro Ser Pro Gly Gln Gly Glu Gly Leu Ser 188
1343 CCT GCC TTT CCT GGG GTG GGC ACT GAC CGT GGC GAC AGC TGG GCA TTG 1390
189 Pro Ala Phe Pro Gly Val Gly Thr Asp Arg Gly Asp Ser Trp Ala Leu 204
1391 GTC CTT CAG GTG TAG GAG CAG GGG CAC CAT GCT CTG GTG CAC CTG GGT 1438
205 Val Leu Gln Val *** 209
1439 CCG CAG CCG CTC AGC TCC CTG TCT GCC ATG GCC GCC ACC AGG TCC CCG 1486
1487 AAC AGT GCC ATG GCT GCC GCC CGA ATC CCG TCC GCT CCT GCA AGG CAG 1534
1535 AGG CTC AGA GGC ACG GCC AGA CCT GTC CAG GGG TCC CAG CTT TGG TCC 1582
1583 AAG TTG GGA CCC CAC ACC TTG TAG GGG TAC AAC TCA GTT CTT TCT GCA 1630
1631 GGA TTC CTT CAC CCA GGC TCT CGG CTC CCG GGG AAG GGA AGG GCT GGG 1678
1679 GCT CCC CCC TCA CAC AGG GCT CTC TGG TGT CCC TGA GGA TGT AAG AAT 1726
1727 CAC ATC TCC CTT CTA CCC ACA ACT GCA TCC TGG CAG CCC AGG CCT CAT 1774
1775 GAA TGC ATT TGA GGG GCG CCT ATC CCC CAG ATT CCC CAG TGA AGG AAA 1822
1823 AAC AGC CAC GCT AAC CGT GCT GAC ATG TCA GAA AGC AAA TCT GGG TAG 1870
1871 GCT GTC GTG GGG CCG GAG ACC ACT GCA CTT GGA GAC TGA ACG TGA GAA 1918
1919 CTA GTT CAG CCC TGT GGA GAA GGG GTG AGC GCC AGG GGC CCA CCC AGC 1966
1967 CCA CCT GCT CAG TGC CCA CCT CAA CCG AGG GCC ACA AGC GGG CAG CTC 2014
2015 CCC AGG GCC TGG GCC TCC CTG CTT GCA GCT TGG GGT TCC CCC TTT GGC 2062
2063 AGA GGA AGG GAG CTG GGT TGG GAG GAG GGT CCT GGA GGG CGT GAG CAG 2110
2111 GAG GTA GCC CCG CCC TGC CCC AGT GCT CAC GTC ATT AAA GAA GGA GCG 2158
2159 TGT GCT GAT GGC AAC GCC GAG GCT CTG ACT CCC TGT GCC CTG CGC GCC 2206
2207 CAG GCG GTG CAG CGT GTC TGA CAC GGT GCC CAT GAT GCA CAC GAT CAC 2254
2255 CTG GTC GCT GCT CTG GAA GAA GCC GTC GAG CAA GGG CTG CAG CTG TCC 2302
2303 CTG GAG CAG GCT TCC CTG GGG GTG GTG GCG GCT GGT GGG CGG AGA GCA 2350
2351 CTA GGA CCC GCT GGC CCC ATG CCC CCG CCT CGT TCT CCC ACT TGA CCC 2398
2399 CCT CAC CCC ATC TGC ACT GGG TGG GGG GGT GGG GGG TGG TGA TTC AGA 2446
2447 GCT GTG GGT CCC GGC CTG GCC CTT CCC GCC ATG GGG AGC TGT GGG TCC 2494
2495 CGG CCT GGC CCT GCC CAC CGT GGG GAG CTG TGC CCC TAA CTG GGC TTT 2542
2543 GTC CAC CAG GAG CTG CGT GGT CTG GAC AGG GTG GCC TCT TTT CTT TTT 2590
2591 TGA GAC AGG GTC TCA CAC TGT CAC CCA GGC TGG AGT ACA GTG GTG TGA 2638
2639 CCA TAG CTT ACT GCA GCC TTG ACC TCC CAG GCT CAA GCA ATC CTC CCA 2686
2687 ACA TAA CCT CCT GAG TAG CTG GGT CTA CAG CCA GTT CCG TCT TCT CCA 2734
2735 CTT GAT TGC GTG TTT GCA GTG AGT TCC TGA GTT ATC GGG ATG AAT GGT 2782
2783 CAG AAA GCA GTG TGC CCA CCA ATG GAT GTT TCC TGC CAT TGG GCC CCT 2830
2831 TAG CAG TGA CTT GGG TGC AAA CCA GGG TGA TTT TTA CAA CTT TTA AAC 2878
2879 AAG TTT AAA GCA ACT TTG TTG GCT TTC TTA GCT GTT TGC TTT GAA AAT 2926
2927 ATT AAA TTC ATT TTC AGC AAG TGA GTC ATT CAA CAA AAA AAA AAA AAA 2974
2975 AAA A 2978
8.FP291
A: nucleotide sequence (SEQ ID NO:22) length: 2350 bases
1 GCTTTGCCCT GCAAGTGGAG TGGGCTCTGT TCTGGTCTTA GGAGGTCTGA AAGCAATACC
61 AGCCCCTGCG CGTCTCACGC AGGCCCCAGC GTTAATTAAC CAGCAGTCGC TCCTGACAGC
121 CAAGCCCCTG TATCATCTCC CAGGTCCAGG ACCCTGTGGG AGAAAGGGTC CGGCAAGTCA
181 CAGACCCAGC CAGCGCAAAG GGCGATCGAG GAGAACCTTC CAGGAACCGT CCCAGATCCT
241 AAACGTGAGG GTCTGCTGAG ATGCCCATCA TGGTAGATGA CATCATGAGG CTTCTGTGCT
301 CCATTTCCCA GGAGAGGAAG ATGAAGGCGG CCGTCAAGCA CTCTGGAAAG GGTGCCATGG
361 TTGCGGGGGC AATGGCCTTC GTCGGTGGCT TGGTTGGTGG CCCACCAGGA ATAGCTGTTG
421 GGGGGACTGT TGGGGGCCTG TTAGGTGCCT GGATGACGAG TGGACAGTTC AAGCCCGTTC
481 CTCAGATCTT AATGGAGCTG CCCCCCGCTG AACAGCGGAA ACTTGTTAAC GAAGCCATGG
541 CCATCATCGG GAACTTGGAC TGGACAGACG CTGTGCAGCT GACTGCTCTG GTCATGAGCA
601 ACCAGGCCAT GCAGCAGAGG CTCTTGGCCA TGCTAACGAC CTATGTCACC AAAGAGCTGC
661 AGGCTGAGAT CCGCTATGAG GACTAGGCCC CACCCCCTGG GGAGTGGGGC TCTCATTTTA
721 GATGATTCTA TGGCTCATTG AAGGTGACCA GGGAATGGGG GAAGCCTGAC TTCTGGGAAA
781 CTTGTCTATT ACCAGCATTG ATTGTAAGCC GCCATGCTAG TGCATTGTTG GACTGTCTTG
841 CTGGAGAGGC ATCAGCCATG CCTCTTGGGC TGTGTGTGCC ATGGAAAGGT ACAGGCGTGC
901 CTGCTGTGGG CATTCACGGA CAGCTTGGAG TCCTCCCGAT GACCATGTGG GTCTAAAAGG
961 TTCTCATATT GTTGGCTGGC CGACCATAGC AGATGCTTCT TTTTCTATGA AGTGTACCTC
1021 CAGGTACCTG TCACAGGATC GAATCCCCTT AGGCTCTGGG GTCTTGGGGT TCAGAGCCAG
1081 CTCTTGTCAG GATTGTGCTA ACCTGCTGAG TGTTTCCTTC CCTTGGCTAA GGACTTCCCA
1141 GTGGTCTCAT GGTGACAACT GTCCCTCCTT TGAGTGAAGC CACAGATGCC AGGCAGGCTC
1201 TATGCTCTGC TACAGGCTAT GCCCTAGACG CTCTTGAGTA ACTTTAGTTT AGCTTGTAGG
1261 AGCTTCTTCT GCCTCCTTAA GAAAAATCAT GGCACTTCAT TATTTTTTAT TTTCCATGTC
1321 GATTCTGGAT TCTTTTTGTT CTCTACACTT TCGGAGCCTA AATTCATTTA GCTCTGTTTT
1381 CTTTTTCTTC TTACTATGTT TCCTTGGGCT AGCCTTGAAG TCCTGGGCTC AAGCTACATA
1441 GCTGGGACTG TAGGCTTGTG CCACCATACC TAGCTTTACC TCAAAGACTG TTGTCCACTC
1501 TACTTTTCCT AAGCCCTCCC GTGCTCTCCT GTCTGCTGGC TGCAATCTAT AATTCATTTG
1561 TGTGAAAAGA CAAAATTACC AATGACTTGA GTTCAGAGAT CCTAACATAG CTAGGTGCTG
1621 TGGTGCAGAA CCATACCCAC GGTCCCAGAT AATTGAAAAG CCGAAGGGGG AACATCCTTG
1681 GGCCCAGGAA TCCAAGGACA ACCGGACATT GTGGAAGCCA GGAGATAGGT GGGGACACAC
1741 ACATCTTCTT AGTTGGCTTT TATTGCAGTC ACAGAATCAA GAGGCAGCAA GTGCCCTGGT
1801 GAGACGGAGA GAGAGGGTGG ACAAGGGTGA GCAAGGGAAA GTGGAAACAG CAAGAAAACT
1861 AGATTAACCT CCCGTGACAC TGGCTGTTTA TGGATTGGGC TAGCTTCGCC TCCATTTCCC
1921 TAAGTGACAG AGGCTCAAAT CTGACATTGG CTTGTTTGTT TTCCTTGTGT TGTGGATGGA
1981 AGCTAGGGCC TTGCATAGGC TAAGTGCCAC CCCTGAGCGC TGCTCCTAGT CCTCCTCTCT
2041 GCTGATTCCC GCCCAGCCTG GTCCCAGCCC TTGAGTGACT CAGCTTTGGT CTGGTCGGTC
2101 GGTCCCAGTG CTGGAGCTCA ACATCTAACA TTGTATCCAT TTGATTAAGT ATTTGATAAT
2161 TCCTCCTTAT CCCTCCTTGG AGGTGCTCAC TCACGGAGGG CAGGAGCTAT GTGTCTCGGT
2221 CACTTCTCCT GCCTCCAGCT TGCTCCTGCC TGGCACCATG GCTTTTGTGG GGTAGACACT
2281 CAATAAATAT ATATTTTTAA AGCTAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA
2341 AAAAAAAAAA
B: nucleotide sequence (SEQ ID NO:23) length: 141 amino acid
1 MPIMVDDIMR LLCSISQERK MKAAVKHSGK GAMVAGAMAF VGGLVGGPPG IAVGGTVGGL
61 LGAWMTSGQF KPVPQILMEL PPAEQRKLVN EAMAIIGNLD WTDAVQLTAL VMSNQAMQQR
121 LLAMLTTYVT KELQAEIRYE D
C. Nucleotide and amino acid composite sequence (SEQ ID NO:24) clone number: FP291
Start code: 261ATG stops coding: 684TAG protein molecular weight: 15086.09
1 GC TTT GCC CTG CAA GTG GAG TGG GCT CTG TTC TGG TCT TAG GAG GTC 47
48 TGA AAG CAA TAC CAG CCC CTG CGC GTC TCA CGC AGG CCC CAG CGT TAA 95
96 TTA ACC AGC AGT CGC TCC TGA CAG CCA AGC CCC TGT ATC ATC TCC CAG 143
144 GTC CAG GAC CCT GTG GGA GAA AGG GTC CGG CAA GTC ACA GAC CCA GCC 191
192 AGC GCA AAG GGC GAT CGA GGA GAA CCT TCC AGG AAC CGT CCC AGA TCC 239
240 TAA ACG TGA GGG TCT GCT GAG ATG CCC ATC ATG GTA GAT GAC ATC ATG 287
1 Met Pro Ile Met Val Asp Asp Ile Met 9
288 AGG CTT CTG TGC TCC ATT TCC CAG GAG AGG AAG ATG AAG GCG GCC GTC 335
10 Arg Leu Leu Cys Ser Ile Ser Gln Glu Arg Lys Met Lys Ala Ala Val 25
336 AAG CAC TCT GGA AAG GGT GCC ATG GTT GCG GGG GCA ATG GCC TTC GTC 383
26 Lys His Ser Gly Lys Gly Ala Met Val Ala Gly Ala Met Ala Phe Val 41
384 GGT GGC TTG GTT GGT GGC CCA CCA GGA ATA GCT GTT GGG GGG ACT GTT 431
42 Gly Gly Leu Val Gly Gly Pro Pro Gly Ile Ala Val Gly Gly Thr Val 57
432 GGG GGC CTG TTA GGT GCC TGG ATG ACG AGT GGA CAG TTC AAG CCC GTT 479
58 Gly Gly Leu Leu Gly Ala Trp Met Thr Ser Gly Gln Phe Lys Pro Val 73
480 CCT CAG ATC TTA ATG GAG CTG CCC CCC GCT GAA CAG CGG AAA CTT GTT 527
74 Pro Gln Ile Leu Met Glu Leu Pro Pro Ala Glu Gln Arg Lys Leu Val 89
528 AAC GAA GCC ATG GCC ATC ATC GGG AAC TTG GAC TGG ACA GAC GCT GTG 575
90 Asn Glu Ala Met Ala Ile Ile Gly Asn Leu Asp Trp Thr Asp Ala Val 105
576 CAG CTG ACT GCT CTG GTC ATG AGC AAC CAG GCC ATG CAG CAG AGG CTC 623
106 Gln Leu Thr Ala Leu Val Met Ser Asn Gln Ala Met Gln Gln Arg Leu 121
624 TTG GCC ATG CTA ACG ACC TAT GTC ACC AAA GAG CTG CAG GCT GAG ATC 671
122 Leu Ala Met Leu Thr Thr Tyr Val Thr Lys Glu Leu Gln Ala Glu Ile 137
672 CGC TAT GAG GAC TAG GCC CCA CCC CCT GGG GAG TGG GGC TCT CAT TTT 719
138 Arg Tyr Glu Asp *** 142
720 AGA TGA TTC TAT GGC TCA TTG AAG GTG ACC AGG GAA TGG GGG AAG CCT 767
768 GAC TTC TGG GAA ACT TGT CTA TTA CCA GCA TTG ATT GTA AGC CGC CAT 815
816 GCT AGT GCA TTG TTG GAC TGT CTT GCT GGA GAG GCA TCA GCC ATG CCT 863
864 CTT GGG CTG TGT GTG CCA TGG AAA GGT ACA GGC GTG CCT GCT GTG GGC 911
912 ATT CAC GGA CAG CTT GGA GTC CTC CCG ATG ACC ATG TGG GTC TAA AAG 959
960 GTT CTC ATA TTG TTG GCT GGC CGA CCA TAG CAG ATG CTT CTT TTT CTA 1007
1008 TGA AGT GTA CCT CCA GGT ACC TGT CAC AGG ATC GAA TCC CCT TAG GCT 1055
1056 CTG GGG TCT TGG GGT TCA GAG CCA GCT CTT GTC AGG ATT GTG CTA ACC 1103
1104 TGC TGA GTG TTT CCT TCC CTT GGC TAA GGA CTT CCC AGT GGT CTC ATG 1151
1152 GTG ACA ACT GTC CCT CCT TTG AGT GAA GCC ACA GAT GCC AGG CAG GCT 1199
1200 CTA TGC TCT GCT ACA GGC TAT GCC CTA GAC GCT CTT GAG TAA CTT TAG 1247
1248 TTT AGC TTG TAG GAG CTT CTT CTG CCT CCT TAA GAA AAA TCA TGG CAC 1295
1296 TTC ATT ATT TTT TAT TTT CCA TGT CGA TTC TGG ATT CTT TTT GTT CTC 1343
1344 TAC ACT TTC GGA GCC TAA ATT CAT TTA GCT CTG TTT TCT TTT TCT TCT 1391
1392 TAC TAT GTT TCC TTG GGC TAG CCT TGA AGT CCT GGG CTC AAG CTA CAT 1439
1440 AGC TGG GAC TGT AGG CTT GTG CCA CCA TAC CTA GCT TTA CCT CAA AGA 1487
1488 CTG TTG TCC ACT CTA CTT TTC CTA AGC CCT CCC GTG CTC TCC TGT CTG 1535
1536 CTG GCT GCA ATC TAT AAT TCA TTT GTG TGA AAA GAC AAA ATT ACC AAT 1583
1584 GAC TTG AGT TCA GAG ATC CTA ACA TAG CTA GGT GCT GTG GTG CAG AAC 1631
1632 CAT ACC CAC GGT CCC AGA TAA TTG AAA AGC CGA AGG GGG AAC ATC CTT 1679
1680 GGG CCC AGG AAT CCA AGG ACA ACC GGA CAT TGT GGA AGC CAG GAG ATA 1727
1728 GGT GGG GAC ACA CAC ATC TTC TTA GTT GGC TTT TAT TGC AGT CAC AGA 1775
1776 ATC AAG AGG CAG CAA GTG CCC TGG TGA GAC GGA GAG AGA GGG TGG ACA 1823
1824 AGG GTG AGC AAG GGA AAG TGG AAA CAG CAA GAA AAC TAG ATT AAC CTC 1871
1872 CCG TGA CAC TGG CTG TTT ATG GAT TGG GCT AGC TTC GCC TCC ATT TCC 1919
1920 CTA AGT GAC AGA GGC TCA AAT CTG ACA TTG GCT TGT TTG TTT TCC TTG 1967
1968 TGT TGT GGA TGG AAG CTA GGG CCT TGC ATA GGC TAA GTG CCA CCC CTG 2015
2016 AGC GCT GCT CCT AGT CCT CCT CTC TGC TGA TTC CCG CCC AGC CTG GTC 2063
2064 CCA GCC CTT GAG TGA CTC AGC TTT GGT CTG GTC GGT CGG TCC CAG TGC 2111
2112 TGG AGC TCA ACA TCT AAC ATT GTA TCC ATT TGA TTA AGT ATT TGA TAA 2159
2160 TTC CTC CTT ATC CCT CCT TGG AGG TGC TCA CTC ACG GAG GGC AGG AGC 2207
2208 TAT GTG TCT CGG TCA CTT CTC CTG CCT CCA GCT TGC TCC TGC CTG GCA 2255
2256 CCA TGG CTT TTG TGG GGT AGA CAC TCA ATA AAT ATA TAT TTT TAA AGC 2303
2304 TAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AA 2350
9.FP633
A: nucleotide sequence (SEQ ID NO:25) length: 2375 bases
1 GCCAAAGGAG ACAAAGGATC ACCTGGATTT TATGGCAAAA AGGGTGCAAA AGGTGAAAAG
61 GGGAATGCTG GCTTCCCTGG CCTCCCTGGA CCTGCTGGAG AACCAGGAAG ACATGGAAAG
121 GATGGATTAA TGGGTAGTCC CGGTTTCAAG GGAGAAGCAG GATCCCCTGG TGCTCCGGGG
181 CAGGATGGAA CACGGGGAGA GCCTACATGT GCATATGAGT TCAGCCACGT GAAAGAGTGA
241 CCTGTACTTC TTTATGTCTC TTCACAAACT GTGGTCTTCA TATTCTTCAT GATTGCATCA
301 GGAATCCCAG GATTTCCTGG AAACCGAGGA TTAATGGGCC AAAAGGGAGA AATTGGGCCT
361 CCAGGACAGC AAGGAAAAAA AGGAGCCCCA GGGATGCCTG GTTTAATGGG AAGCAATGGC
421 TCACCAGGCC AGCCTGGAAC ACCGGGATCT AAGGGAAGCA AAGGTGAACC TGGAATTCAA
481 GGGATGCCTG GGGCTTCTGG GCTCAAGGGA GAACCAGGAG CAACGGGTTC CCCAGGAGAA
541 CCAGGATACA TGGGTTTACC CGGGATTCAA GGAAAAAAGG GGGACAAAGG AAATCAAGGT
601 GAAAAAGGTA TTCAGGGTCA AAAGGGAGAA AATGGAAGAC AGGGAATTCC AGGGCAACAG
661 GGAATTCAAG GCCATCATGG TGCAAAAGGA GAGAGAGGTG AAAAGGGAGA ACCTGGTGTC
721 CGAGGTGCCA TTGGATCAAA AGGAGAATCT GGGGTGGATG GCTTGATGGG GCCCGCAGGT
781 CCTAAGGGGC AACCTGGGGA TCCAGGTCCT CAGGGACCCC CAGGTTTGGA TGGGAAGCCC
841 GGAAGAGAGT TTTCAGAACA ATTTATTCGA CAAGTTTGCA CAGATGTAAT AAGAGGCCCA
901 GAGGGTCCCA GAGGATTACC TGGTTTGCCA GGAAGAGATG GTGTTCCTGG ATTAGTGGGT
961 GTCCCTGGAC GTCCAGGTGT CAGAGGATTA AAAGGCCTAC CAGGAAGAAA TGGGGAAAAA
1021 GGGAGCCAAG GGTTTGGGTA TCCTGGAGAA CAAGGTCCTC CTGGTCCCCC AGGTCCAGAG
1081 GGCCCTCCTG GAATAAGCAA AGAAGGTCCT CCAGGAGACC CAGGTCTCCC TGGCAAAGAT
1141 GGAGACCATG GAAAACCTGG AATCCAAGGG CAACCAGGCC CCCCAGGCAT CTGCGACCCA
1201 TCACTATGTT TTAGTGTAAT TGCCAGAAGA GATCCGTTCA GAAAAGGACC AAACTATTAG
1261 TGTCTGATGC CTCATTCAGC AGCCTAGGCA TGGTGCTTTT TCTGTGGTCT TTTGCATCTC
1321 AGGAAGATAA CCAACAGTAA TCCCTTGAAA AGAAACTTAA GTACCTCGGT GTTTTTATTT
1381 TTTTTTTCTT ATGGAAAAAA ATATAAAAGA TCACATATAC TGATTTTAAA GGCTCCTCAG
1441 TCATTTGGAG CCCTTGGATT AGCAGCATTA ATTAAATCTC AAGGGTTTCT TGTAAAGTCC
1501 ATTTATGTTA ATCAAAGTTG AATATAAAAA TCCACCATTG CCTGTTAGCC AGTCAGTTTT
1561 AGTCACTGTG AAATATTTCA CATTCAGCCT CCATGCAGTA GAGATTTGAG TTTAATTTCA
1621 TGTCCATGTG ACTTTCATGT TTCCTATCTC ATAGCTCATG CTACTACATA AGCCAAAACA
1681 TGTATCTCAT CATTGGAAGT AAGATCAGGG CTGATATTCA CCTGGGATAG ACAGTATTGG
1741 TGAACTACTC ATTTACTACA GTGTCTCAGC CTTGATAAGG GGCAGTGGAT TGCCTGTTGT
1801 TCGGTGTTGT GAATAGCACC TCTGAATAAG ATTAGAGTGT TTCTTAATTC ATTTCAAACT
1861 CTAAAATTAG ATTAATGGTG GTGCTAAGAA AGAGTATTAA TTACTTTGGG AATGGTCAAA
1921 ATTAACATTA AAAACATTTT AGACAAAAAG TTTCATTGTA CATTCAAAGA AAATGTAAGT
1981 TTGGAAGTAC TAAAAGACTA TTTTATACTT GTTGATTAAT CGGAATGTTT GTTGTATGCC
2041 TTCATTTTCC ATTTCACTTA TATGTGTATG TCCATATATG TTAATTTTCA TTGTAGCAAA
2101 GCTAATGGAA ATAAAGCTAA TGCTCTAGTT GAAAGAAAAG GAAAACTCCT GAAATCCTAG
2161 AATGTCTTGT TATTTTTAGC TGACTGTAAA ATATTATGAA CAGTCTTTGT GTATTGTGCT
2221 TAATGCTTTT GTAAGAAACA GAATTTGAAA TATTTCATCC TTGTCATGCT CAAAATTTTG
2281 TTACATGCTT GTTATTCAGA GTATAATAAA GTTTTGTACA GGCCTGAAAA AAAAAAAAAA
2341 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA
B: nucleotide sequence (SEQ ID NO:26) length: 323 amino acid
1 MIASGIPGFP GNRGLMGQKG EIGPPGQQGK KGAPGMPGLM GSNGSPGQPG TPGSKGSKGE
61 PGIQGMPGAS GLKGEPGATG SPGEPGYMGL PGIQGKKGDK GNQGEKGIQG QKGENGRQGI
121 PGQQGIQGHH GAKGERGEKG EPGVRGAIGS KGESGVDGLM GPAGPKGQPG DPGPQGPPGL
181 DGKPGREFSE QFIRQVCTDV IRGPEGPRGL PGLPGRDGVP GLVGVPGRPG VRGLKGLPGR
241 NGEKGSQGFG YPGEQGPPGP PGPEGPPGIS KEGPPGDPGL PGKDGDHGKP GIQGQPGPPG
301 ICDPSLCFSV IARRDPFRKG PNY
C. Nucleotide and amino acid composite sequence (SEQ ID NO:27) clone number: FP633
Start code: 289 ATG stop coding: 1258 TAG protein molecular weights: 31648.92
1 GCC AAA GGA GAC AAA GGA TCA CCT GGA TTT TAT GGC AAA AAG GGT GCA 48
49 AAA GGT GAA AAG GGG AAT GCT GGC TTC CCT GGC CTC CCT GGA CCT GCT 96
97 GGA GAA CCA GGA AGA CAT GGA AAG GAT GGA TTA ATG GGT AGT CCC GGT 144
145 TTC AAG GGA GAA GCA GGA TCC CCT GGT GCT CCG GGG CAG GAT GGA ACA 192
193 CGG GGA GAG CCT ACA TGT GCA TAT GAG TTC AGC CAC GTG AAA GAG TGA 240
241 CCT GTA CTT CTT TAT GTC TCT TCA CAA ACT GTG GTC TTC ATA TTC TTC 288
289 ATG ATT GCA TCA GGA ATC CCA GGA TTT CCT GGA AAC CGA GGA TTA ATG 336
1 Met Ile Ala Ser Gly Ile Pro Gly Phe Pro Gly Asn Arg Gly Leu Met 16
337 GGC CAA AAG GGA GAA ATT GGG CCT CCA GGA CAG CAA GGA AAA AAA GGA 384
17 Gly Gln Lys Gly Glu Ile Gly Pro Pro Gly Gln Gln Gly Lys Lys Gly 32
385 GCC CCA GGG ATG CCT GGT TTA ATG GGA AGC AAT GGC TCA CCA GGC CAG 432
33 Ala Pro Gly Met Pro Gly Leu Met Gly Ser Asn Gly Ser Pro Gly Gln 48
433 CCT GGA ACA CCG GGA TCT AAG GGA AGC AAA GGT GAA CCT GGA ATT CAA 480
49 Pro Gly Thr Pro Gly Ser Lys Gly Ser Lys Gly Glu Pro Gly Ile Gln 64
481 GGG ATG CCT GGG GCT TCT GGG CTC AAG GGA GAA CCA GGA GCA ACG GGT 528
65 Gly Met Pro Gly Ala Ser Gly Leu Lys Gly Glu Pro Gly Ala Thr Gly 80
529 TCC CCA GGA GAA CCA GGA TAC ATG GGT TTA CCC GGG ATT CAA GGA AAA 576
81 Ser Pro Gly Glu Pro Gly Tyr Met Gly Leu Pro Gly Ile Gln Gly Lys 96
577 AAG GGG GAC AAA GGA AAT CAA GGT GAA AAA GGT ATT CAG GGT CAA AAG 624
97 Lys Gly Asp Lys Gly Asn Gln Gly Glu Lys Gly Ile Gln Gly Gln Lys 112
625 GGA GAA AAT GGA AGA CAG GGA ATT CCA GGG CAA CAG GGA ATT CAA GGC 672
113 Gly Glu Asn Gly Arg Gln Gly Ile Pro Gly Gln Gln Gly Ile Gln Gly 128
673 CAT CAT GGT GCA AAA GGA GAG AGA GGT GAA AAG GGA GAA CCT GGT GTC 720
129 His His Gly Ala Lys Gly Glu Arg Gly Glu Lys Gly Glu Pro Gly Val 144
721 CGA GGT GCC ATT GGA TCA AAA GGA GAA TCT GGG GTG GAT GGC TTG ATG 768
145 Arg Gly Ala Ile Gly Ser Lys Gly Glu Ser Gly Val Asp Gly Leu Met 160
769 GGG CCC GCA GGT CCT AAG GGG CAA CCT GGG GAT CCA GGT CCT CAG GGA 816
161 Gly Pro Ala Gly Pro Lys Gly Gln Pro Gly Asp Pro Gly Pro Gln Gly 176
817 CCC CCA GGT TTG GAT GGG AAG CCC GGA AGA GAG TTT TCA GAA CAA TTT 864
177 Pro Pro Gly Leu Asp Gly Lys Pro Gly Arg Glu Phe Ser Glu Gln Phe 192
865 ATT CGA CAA GTT TGC ACA GAT GTA ATA AGA GGC CCA GAG GGT CCC AGA 912
193 Ile Arg Gln Val Cys Thr Asp Val Ile Arg Gly Pro Glu Gly Pro Arg 208
913 GGA TTA CCT GGT TTG CCA GGA AGA GAT GGT GTT CCT GGA TTA GTG GGT 960
209 Gly Leu Pro Gly Leu Pro Gly Arg Asp Gly Val Pro Gly Leu Val Gly 224
961 GTC CCT GGA CGT CCA GGT GTC AGA GGA TTA AAA GGC CTA CCA GGA AGA 1008
225 Val Pro Gly Arg Pro Gly Val Arg Gly Leu Lys Gly Leu Pro Gly Arg 240
1009 AAT GGG GAA AAA GGG AGC CAA GGG TTT GGG TAT CCT GGA GAA CAA GGT 1056
241 Asn Gly Glu Lys Gly Ser Gln Gly Phe Gly Tyr Pro Gly Glu Gln Gly 256
1057 CCT CCT GGT CCC CCA GGT CCA GAG GGC CCT CCT GGA ATA AGC AAA GAA 1104
257 Pro Pro Gly Pro Pro Gly Pro Glu Gly Pro Pro Gly Ile Ser Lys Glu 272
1105 GGT CCT CCA GGA GAC CCA GGT CTC CCT GGC AAA GAT GGA GAC CAT GGA 1152
273 Gly Pro Pro Gly Asp Pro Gly Leu Pro Gly Lys Asp Gly Asp His Gly 288
1153 AAA CCT GGA ATC CAA GGG CAA CCA GGC CCC CCA GGC ATC TGC GAC CCA 1200
289 Lys Pro Gly Ile Gln Gly Gln Pro Gly Pro Pro Gly Ile Cys Asp Pro 304
1201 TCA CTA TGT TTT AGT GTA ATT GCC AGA AGA GAT CCG TTC AGA AAA GGA 1248
305 Ser Leu Cys Phe Ser Val Ile Ala Arg Arg Asp Pro Phe Arg Lys Gly 320
1249 CCA AAC TAT TAG TGT CTG ATG CCT CAT TCA GCA GCC TAG GCA TGG TGC 1296
321 Pro Asn Tyr *** 324
1297 TTT TTC TGT GGT CTT TTG CAT CTC AGG AAG ATA ACC AAC AGT AAT CCC 1344
1345 TTG AAA AGA AAC TTA AGT ACC TCG GTG TTT TTA TTT TTT TTT TCT TAT 1392
1393 GGA AAA AAA TAT AAA AGA TCA CAT ATA CTG ATT TTA AAG GCT CCT CAG 1440
1441 TCA TTT GGA GCC CTT GGA TTA GCA GCA TTA ATT AAA TCT CAA GGG TTT 1488
1489 CTT GTA AAG TCC ATT TAT GTT AAT CAA AGT TGA ATA TAA AAA TCC ACC 1536
1537 ATT GCC TGT TAG CCA GTC AGT TTT AGT CAC TGT GAA ATA TTT CAC ATT 1584
1585 CAG CCT CCA TGC AGT AGA GAT TTG AGT TTA ATT TCA TGT CCA TGT GAG 1632
1633 TTT CAT GTT TCC TAT CTC ATA GCT CAT GCT ACT ACA TAA GCC AAA ACA 1680
1681 TGT ATC TCA TCA TTG GAA GTA AGA TCA GGG CTG ATA TTC ACC TGG GAT 1728
1729 AGA CAG TAT TGG TGA ACT ACT CAT TTA CTA CAG TGT CTC AGC CTT GAT 1776
1777 AAG GGG CAG TGG ATT GCC TGT TGT TCG GTG TTG TGA ATA GCA CCT CTG 1824
1825 AAT AAG ATT AGA GTG TTT CTT AAT TCA TTT CAA ACT CTA AAA TTA GAT 1872
1873 TAA TGG TGG TGC TAA GAA AGA GTA TTA ATT ACT TTG GGA ATG GTC AAA 1920
1921 ATT AAC ATT AAA AAC ATT TTA GAC AAA AAG TTT CAT TGT ACA TTC AAA 1968
1969 GAA AAT GTA AGT TTG GAA GTA CTA AAA GAC TAT TTT ATA CTT GTT GAT 2016
2017 TAA TCG GAA TGT TTG TTG TAT GCC TTC ATT TTC CAT TTC ACT TAT ATG 2064
2065 TGT ATG TCC ATA TAT GTT AAT TTT CAT TGT AGC AAA GCT AAT GGA AAT 2112
2113 AAA GCT AAT GCT CTA GTT GAA AGA AAA GGA AAA CTC CTG AAA TCC TAG 2160
2161 AAT GTC TTG TTA TTT TTA GCT GAC TGT AAA ATA TTA TGA ACA GTC TTT 2208
2209 GTG TAT TGT GCT TAA TGC TTT TGT AAG AAA CAG AAT TTG AAA TAT TTC 2256
2257 ATC CTT GTC ATG CTC AAA ATT TTG TTA CAT GCT TGT TAT TCA GAG TAT 2304
2305 AAT AAA GTT TTG TAC AGG CCT GAA AAA AAA AAA AAA AAA AAA AAA AAA 2352
2353 AAA AAA AAA AAA AAA AAA AAA AA 2375
10.FP782
A: nucleotide sequence (SEQ ID NO:28) length: 2630 bases
1 GGTTTTTATT CTGTTTATTA TATGTCTCTG TCTCTCTCTA TTGTGTGTGT GTGTGTGTGT
61 GTGTGTGTGT GTGTGTGTGT GTGGTGCAGA AGTGCCACCC CCAGGGCCCT GTCAACCTCT
121 CTTTTCTCCT CCATGGCTGT CTGCCTGCGT ATCTGTCTCT GAGAATCCTC GGGGCGGTCA
181 GGGGATGTCA GGAGGGGAAG GAACCGCCCT CCCTATCTTG CTGCTCCTCT TGGCACTCAG
241 GGGCACCTTC CATGGAGCCA GACCGGGTGG AGGGGCTTCT GGGATTTGGT GTCTGCTGCT
301 GCCAGAGCAG GAACCCCCAG TCTAGGACTT GGGCATTTTA ACAGGGAGAA AGTAGTGGCT
361 TCCCTTTTCT CTCTCTCCTC CTTTTTCCCT TTAAGCCCAC AGATTCAGGT CATGCCAAAA
421 GCTCTCTGGT TGTAACCTGG AGACATGTGG AGGGGAATGG CGATGGGATT ATAGGACTCT
481 CCCCATCTCG GGCCCTGACC CTGACCCTTG CCACCAACCC AAAGACAGCT GGTGGGTTTC
541 CCCTTGGAGA CAATCCTGCG TTTGCCTGGG CCGGCCCTGG CTGCCCTCAG CTTTCGCTGA
601 TCTGCCCGGC CTGGAGCCTC CCATCACCCC GCTTCTTGTT GGGCCTCAGG CACTGGTTAC
661 CAGAAGGGGG TCTGGGTCTG CTCAGGATCA TGTTTTGTAG CACCTCCTGT TGGAGGGGTG
721 GAGGGATGTT CCCCTGAGCC AGGCTGAGAC TAGAACCCCA TCTTCCCTGA GCCAGGCTGA
781 GACTAGAACC CCATCTTCCC CACCACGCCA CCCCTGTGGC TGCTACAGGA GCACAGTAGT
841 GAAGGCCTGA GCTCCAGGTT TGAAAGACCC AACTGGAGCG TGGGGCGGGC AGGCAGGGGT
901 TAGTGAAAGG ACACTTCCAG GGTTAGGACA GAGCATTTAG CCTTCTGGAA GAACCCCTGC
961 CTGGGGTGGG ACTGTGCAGG CCAGAGAAGG TGGCATGGGC CTGAACCCAC CTGGACTGAC
1021 TTCTGCACTG AAGCCACAGA TGGAGGGTAG GCTGGTGGGT GGGGGTGGTT CGTTCTCTAG
1081 CCGGGGCAGA CACCCAGCTG GCTGGGTCCT TCCTCAGCCT TGCCTCCTCC TGTCCCCAAC
1141 CCTTTCCTTT CCTCCTGCTT GCGGACTGCT TGGTCCCCTC TCCTTCCCTC CTTCCAGCTG
1201 TTTCCTAGTT ACCACCTACC CCTGGCCGTG GACTGATCAG ACCAGCATTC AAAATAAAAG
1261 TTTGTTCCAA GTTGACAGTG TGGTGCTCCC TGCCCAGCCC TCCAGGTGGA GGTGCTGCCA
1321 CGGGAACGCA GTTCGTCTGC CTGCCCTGGG GCCCCTGGCG ACAGCTGGGA GCAGGGCAGT
1381 GCTGTGAGGA GCCCAGCTTT CCCAGTCAGG CAGGCATGGC TTCCGTGTTC AGGCTCCCTC
1441 ACCAGCTGGT GACACGGGAC AAGCTTACAA ACCTTCTCTG AACCTCAGTT TTCTCATTTA
1501 CAAGAGCAAA GCCATCCATC ACCTTGTGTG GATTCAGAGA ATGTGAGGCC CTGGGGTGTC
1561 CTACACAAGG GAAAGGCTTG CTCAGTGAGC GGTCTGCACA CCGTTAGCCA CCCTGCCACC
1621 TCTGTGCCCT GGGCATGCTC CAAAGGAAAG CTCTGGCTGG GACTGCCAGG AGTCTCACAC
1681 GCTCCTGTTG ACATTCCCAG CAGCCGCCCC TGAGGTCGAT GTTTGTTCTG TTTTTCTTTT
1741 TCTTTTTTGA GACGGAGTCT CGCTGTGTTG CCAGGCTGGA GTGCAGTGGT GTGATCTCTG
1801 CTCACTGCAA CCTCCGCCTG CCAGTTTCAA GTGATTCTCT GCCTCAGCCT TCTGAGTAGC
1861 TGGGACTACA GGTGCACGCC ACCACGCCCA GCTAACTTTT TGTATTTTAG TAGAGACAGG
1921 GTTTCGCCAT GTCGGCCAGG GTGGTCTTGA TCTCCTGACC TCATGATCCA CCCGCCTCAG
1981 CCTCCCAAAG TGCTGGGATT ACAGGTATGA GCCACCGCAC CGGGCCTGTT CTATTTTTCT
2041 AGTTAAGGGA ACTGAAGCTC AGAGAGGTGT CACCAGCAGG TGTTCATTCC CATGCCAGCC
2101 TTGCCCCCCG GCTTTTCCCA GGCAGGCTCC TGCGTGCCCA CTGGCTCCAG CCTGGTCCTC
2161 TGTCTCTTGG CTGCTTCACT CCTGCTCTTT GTCCCGACTC TGGCCCTGCT TACAGGGGCC
2221 ACTACCTGCT GGTGCCTCCA TAACAAGCGT CTGGCGTTGA GACCCCTGGC ATGGCAGGGG
2281 CTTTGGGGTC TGGTTTCCAC AAGGCTTAGC CATGGCAGAA CCTCGTTTTA TTTTAACTCT
2341 TTGCCCCTAC AAACAAACAG CAGTACTTGC CAGAACCATT CTTGGGATTC AGGAGCTCGG
2401 GCGACTGCCT TGGCCTCTGG CCGCACCCAG GAGGGTGGGG TTGGATCTGT GTAGTTGCCA
2461 GGCCCACACC TGCCAGCAGG GGGCTGACTG GATCCATGCT TTACTGTGTT TAATGGGGGT
2521 AACAGGGGTC CCTACAGCCC TCCCAGCTAA ACATTTGGAA CAAAACACCA GCCCTTTTGT
2581 AGTGGATGCA GAATAAAATT GTTAATCCAA TCAAAAAAAA AAAAAAAAAA
B: nucleotide sequence (SEQ ID NO:29) length: 130 amino acid
1 MGLNPPGLTS ALKPQMEGRL VGGGGSFSSR GRHPAGWVLP QPCLLLSPTL SFPPACGLLG
61 PLSFPPSSCF LVTTYPWPWT DQTSIQNKSL FQVDSVVLPA QPSRWRCCHG NAVRLPALGP
121 LATAGSRAVL
C. Nucleotide and amino acid composite sequence (SEQ ID NO:30) clone number: FP782
Start code: 995ATG stops coding: 1385TGA protein molecular weight: 13703.18
1 G GTT TTT ATT CTG TTT ATT ATA TGT CTC TGT CTC TCT CTA TTG TGT 46
47 GTG TGT GTG TGT GTG TGT GTG TGT GTG TGT GTG TGT GGT GCA GAA GTG 94
95 CCA CCC CCA GGG CCC TGT CAA CCT CTC TTT TCT CCT CCA TGG CTG TCT 142
143 GCC TGC GTA TCT GTC TCT GAG AAT CCT CGG GGC GGT CAG GGG ATG TCA 190
191 GGA GGG GAA GGA ACC GCC CTC CCT ATC TTG CTG CTC CTC TTG GCA CTC 238
239 AGG GGC ACC TTC CAT GGA GCC AGA CCG GGT GGA GGG GCT TCT GGG ATT 286
287 TGG TGT CTG CTG CTG CCA GAG CAG GAA CCC CCA GTC TAG GAC TTG GGC 334
335 ATT TTA ACA GGG AGA AAG TAG TGG CTT CCC TTT TCT CTC TCT CCT CCT 382
383 TTT TCC CTT TAA GCC CAC AGA TTC AGG TCA TGC CAA AAG CTC TCT GGT 430
431 TGT AAC CTG GAG ACA TGT GGA GGG GAA TGG CGA TGG GAT TAT AGG ACT 478
479 CTC CCC ATC TCG GGC CCT GAC CCT GAC CCT TGC CAC CAA CCC AAA GAC 526
527 AGC TGG TGG GTT TCC CCT TGG AGA CAA TCC TGC GTT TGC CTG GGC CGG 574
575 CCC TGG CTG CCC TCA GCT TTC GCT GAT CTG CCC GGC CTG GAG CCT CCC 622
623 ATC ACC CCG CTT CTT GTT GGG CCT CAG GCA CTG GTT ACC AGA AGG GGG 670
671 TCT GGG TCT GCT CAG GAT CAT GTT TTG TAG CAC CTC CTG TTG GAG GGG 718
719 TGG AGG GAT GTT CCC CTG AGC CAG GCT GAG ACT AGA ACC CCA TCT TCC 766
767 CTG AGC CAG GCT GAG ACT AGA ACC CCA TCT TCC CCA CCA CGC CAC CCC 814
815 TGT GGC TGC TAC AGG AGC ACA GTA GTG AAG GCC TGA GCT CCA GGT TTG 862
863 AAA GAC CCA ACT GGA GCG TGG GGC GGG CAG GCA GGG GTT AGT GAA AGG 910
911 ACA CTT CCA GGG TTA GGA CAG AGC ATT TAG CCT TCT GGA AGA ACC CCT 958
959 GCC TGG GGT GGG ACT GTG CAG GCC AGA GAA GGT GGC ATG GGC CTG AAC 1006
1 Met Gly Leu Asn 4
1007 CCA CCT GGA CTG ACT TCT GCA CTG AAG CCA CAG ATG GAG GGT AGG CTG 1054
5 Pro Pro Gly Leu Thr Ser Ala Leu Lys Pro Gln Met Glu Gly Arg Leu 20
1055 GTG GGT GGG GGT GGT TCG TTC TCT AGC CGG GGC AGA CAC CCA GCT GGC 1102
21 Val Gly Gly Gly Gly Ser Phe Ser Ser Arg Gly Arg His Pro Ala Gly 36
1103 TGG GTC CTT CCT CAG CCT TGC CTC CTC CTG TCC CCA ACC CTT TCC TTT 1150
37 Trp Val Leu Pro Gln Pro Cys Leu Leu Leu Ser Pro Thr Leu Ser Phe 52
1151 CCT CCT GCT TGC GGA CTG CTT GGT CCC CTC TCC TTC CCT CCT TCC AGC 1198
53 Pro Pro Ala Cys Gly Leu Leu Gly Pro Leu Ser Phe Pro Pro Ser Ser 68
1199 TGT TTC CTA GTT ACC ACC TAC CCC TGG CCG TGG ACT GAT CAG ACC AGC 1246
69 Cys Phe Leu Val Thr Thr Tyr Pro Trp Pro Trp Thr Asp Gln Thr Ser 84
1247 ATT CAA AAT AAA AGT TTG TTC CAA GTT GAC AGT GTG GTG CTC CCT GCC 1294
85 Ile Gln Asn Lys Ser Leu Phe Gln Val Asp Ser Val Val Leu Pro Ala 100
1295 CAG CCC TCC AGG TGG AGG TGC TGC CAC GGG AAC GCA GTT CGT CTG CCT 1342
101 Gln Pro Ser Arg Trp Arg Cys Cys His Gly Asn Ala Val Arg Leu Pro 116
1343 GCC CTG GGG CCC CTG GCG ACA GCT GGG AGC AGG GCA GTG CTG TGA GGA 1390
117 Ala Leu Gly Pro Leu Ala Thr Ala Gly Ser Arg Ala Val Leu *** 131
1391 GCC CAG CTT TCC CAG TCA GGC AGG CAT GGC TTC CGT GTT CAG GCT CCC 1438
1439 TCA CCA GCT GGT GAC ACG GGA CAA GCT TAC AAA CCT TCT CTG AAC CTC 1486
1487 AGT TTT CTC ATT TAC AAG AGC AAA GCC ATC CAT CAC CTT GTG TGG ATT 1534
1535 CAG AGA ATG TGA GGC CCT GGG GTG TCC TAC ACA AGG GAA AGG CTT GCT 1582
1583 CAG TGA GCG GTC TGC ACA CCG TTA GCC ACC CTG CCA CCT CTG TGC CCT 1630
1631 GGG CAT GCT CCA AAG GAA AGC TCT GGC TGG GAC TGC CAG GAG TCT CAC 1678
1679 ACG CTC CTG TTG ACA TTC CCA GCA GCC GCC CCT GAG GTC GAT GTT TGT 1726
1727 TCT GTT TTT CTT TTT CTT TTT TGA GAC GGA GTC TCG CTG TGT TGC CAG 1774
1775 GCT GGA GTG CAG TGG TGT GAT CTC TGC TCA CTG CAA CCT CCG CCT GCC 1822
1823 AGT TTC AAG TGA TTC TCT GCC TCA GCC TTC TGA GTA GCT GGG ACT ACA 1870
1871 GGT GCA CGC CAC CAC GCC CAG CTA ACT TTT TGT ATT TTA GTA GAG ACA 1918
1919 GGG TTT CGC CAT GTC GGC CAG GGT GGT CTT GAT CTC CTG ACC TCA TGA 1966
1967 TCC ACC CGC CTC AGC CTC CCA AAG TGC TGG GAT TAC AGG TAT GAG CCA 2014
2015 CCG CAC CGG GCC TGT TCT ATT TTT CTA GTT AAG GGA ACT GAA GCT CAG 2062
2063 AGA GGT GTC ACC AGC AGG TGT TCA TTC CCA TGC CAG CCT TGC CCC CCG 2110
2111 GCT TTT CCC AGG CAG GCT CCT GCG TGC CCA CTG GCT CCA GCC TGG TCC 2158
2159 TCT GTC TCT TGG CTG CTT CAC TCC TGC TCT TTG TCC CGA CTC TGG CCC 2206
2207 TGC TTA CAG GGG CCA CTA CCT GCT GGT GCC TCC ATA ACA AGC GTC TGG 2254
2255 CGT TGA GAC CCC TGG CAT GGC AGG GGC TTT GGG GTC TGG TTT CCA CAA 2302
2303 GGC TTA GCC ATG GCA GAA CCT CGT TTT ATT TTA ACT CTT TGC CCC TAC 2350
2351 AAA CAA ACA GCA GTA CTT GCC AGA ACC ATT CTT GGG ATT CAG GAG CTC 2398
2399 GGG CGA CTG CCT TGG CCT CTG GCC GCA CCC AGG AGG GTG GGG TTG GAT 2446
2447 CTG TGT AGT TGC CAG GCC CAC ACC TGC CAG CAG GGG GCT GAC TGG ATC 2494
2495 CAT GCT TTA CTG TGT TTA ATG GGG GTA ACA GGG GTC CCT ACA GCC CTC 2542
2543 CCA GCT AAA CAT TTG GAA CAA AAC ACC AGC CCT TTT GTA GTG GAT GCA 2590
2591 GAA TAA AAT TGT TAA TCC AAT CAA AAA AAA AAA AAA AAA A 2630
11.FP793
A: nucleotide sequence (SEQ ID NO:31) length: 1694 bases
1 GCCCTCCCTG ACATTGCTGG CACACAGGCA TGATTGCTGG TTCCATTGGA TATGGATCTG
61 GATTGTCAGC GTATTTGAAC AGTATTAAGG GAGGGTTTTG CAGTAGAGAC AGAAGGTGAC
121 GTAAGGTGGT TCCTAAGGTG GGTTCGGTCA GTCCCCTAAT AGTTTGTTTT CAGTGTATTG
181 AGACCAGGAG AATTGATTTA CTTCAGACCT ATTTTTTGAA ACTCCCTGTT GGTCTGGTGT
241 CATGCGTTTA GAGAGACTCT GAACTTCGTT TTCTGTTTTC CTTAGATAAT GATAGCACTT
301 CATGAGCACC ATATGTGAAG CACACATACG TTCTGTCTGA CTCTCAGAAC AGTGCCACAT
361 TGGCATTGTC ATCATTTACA GATGACATCT GAGAGCTAGG GTGACTTGCC CAAGCCCAAG
421 CAGCTGGTAG CAGAGTTGGG ATCTTAACCC TCACCACACC GGTGTTTTCA AGAGAGAACT
481 TGGTCGTGGA GAACTGGAGA ACAGATACTG CCTAAGTTAC ATTTAGTTTT AGACTAATTT
541 AATACTCCAT TTAAGAAACA AACAGAACAG CTGTGCAGTT AGGAAAGTGG AGGGGATTTG
601 AATGAGGAGG AGGCGGGTTA GAGTTAGAAC CGTCTTTGCT GAACAAGTAG TGTGTTTGCA
661 GAAAGGGAAG TTGAAGTTAC GTTGTCCTAA AATACAGCAA GGAAGAGGAG GTGCTTAATA
721 CCACGGAAAA AAGTCTGTGT CATGTTTCTG TGAGGTTAAA AGTAATTTGA TAGTGTATGT
781 CATGCTGCTG ACATATTCCA TGTGTTTGAT ATCTTCCCAG CAAAATAATC AGCTCTCATT
841 TTCCCTTACA GAGTCAACTG TTCATTTTAT TTCAAAATTG GAGCATGTCG TCATGGAGAC
901 AGGTGCTCTC GGTTGCACAA TAAACCGACG TTTAGCCAGA CCATTGCCCT CTTGAACATT
961 TACCGTAACC CTCAAAACTC TTCCCAGTCT GCTGACGGTT TGCGCTGTGC CGTGAGCGAT
1021 GTGGAGATGC AGGAACACTA TGATGAGTTT TTTGAGGAGG TTTTTACAGA AATGGAGGAG
1081 AAGTATGGGG AAGTAGAGGA GATGAACGTC TGTGACAACC TGGGAGACCA CCTGGTGGGG
1141 AACGTGTACG TCAAGTTTCG CCGTGAGGAA GATGCGGAAA AGGCTGTGAT TGACTTGAAT
1201 AACCGTTGGT TTAATGGACA GCCGATCCAC GCCGAGCTGT CACCCGTGAC GGACTTCAGA
1261 GAAGCCTGCT GCCGTCAGTA TGAGATGGGA GAATGCACAC GAGGCGGCTT CTGCAACTTC
1321 ATGCATTTGA AGCCCATTTC CAGAGAGCTG CGGCGGGAGC TGTATGGCCG CCGTCGCAAG
1381 AAGCATAGAT CAAGATCCCG ATCCCGGGAG CGTCGTTCTC GGTCTAGAGA CCGTGGTCGT
1441 GGCGGTGGCG GTGGCGGTGG TGGAGGTGGC GGCGGACGGG AGCGTGACAG GAGGCGGTCG
1501 AGAGATCGTG AAAGATCTGG GCGATTCTGA GCCATGCCAT TTTTACCTTA TGTCTGCTAG
1561 AAAGTGTTGT AGTTGATTGA CCAAACCAGT TCATAAGGGG AATTTTTTTT AAAAAACAAC
1621 AAAAAAAAAA ACATACAAAG ATGGGTTTCT GAATAAAATT TGTAGTGATA ACAGTAAAAA
1681 AAAAAAAAAA AAAA
B: nucleotide sequence (SEQ ID NO:32) length: 167 amino acid
1 MQEHYDEFFE EVFTEMEEKY GEVEEMNVCD NLGDHLVGNV YVKFRREEDA EKAVIDLNNR
61 WFNGQPIHAE LSPVTDFREA CCRQYEMGEC TRGGFCNFMH LKPISRELRR ELYGRRRKKH
121 RSRSRSRERR SRSRDRGRGG GGGGGGGGGG RERDRRRSRD RERSGRF
C. Nucleotide and amino acid composite sequence (SEQ ID NO:33) clone number: FP793
Start code: 1027ATG stops coding: 1528TGA protein molecular weight: 19713.96
1 GCC CTC CCT GAC ATT GCT GGC ACA CAG GCA TGA TTG CTG GTT CCA TTG 48
49 GAT ATG GAT CTG GAT TGT CAG CGT ATT TGA ACA GTA TTA AGG GAG GGT 96
97 TTT GCA GTA GAG ACA GAA GGT GAC GTA AGG TGG TTC CTA AGG TGG GTT 144
145 CGG TCA GTC CCC TAA TAG TTT GTT TTC AGT GTA TTG AGA CCA GGA GAA 192
193 TTG ATT TAC TTC AGA CCT ATT TTT TGA AAC TCC CTG TTG GTC TGG TGT 240
241 CAT GCG TTT AGA GAG ACT CTG AAC TTC GTT TTC TGT TTT CCT TAG ATA 288
289 ATG ATA GCA CTT CAT GAG CAC CAT ATG TGA AGC ACA CAT ACG TTC TGT 336
337 CTG ACT CTC AGA ACA GTG CCA CAT TGG CAT TGT CAT CAT TTA CAG ATG 384
385 ACA TCT GAG AGC TAG GGT GAC TTG CCC AAG CCC AAG CAG CTG GTA GCA 432
433 GAG TTG GGA TCT TAA CCC TCA CCA CAC CGG TGT TTT CAA GAG AGA ACT 480
481 TGG TCG TGG AGA ACT GGA GAA CAG ATA CTG CCT AAG TTA CAT TTA GTT 528
529 TTA GAC TAA TTT AAT ACT CCA TTT AAG AAA CAA ACA GAA CAG CTG TGC 576
577 AGT TAG GAA AGT GGA GGG GAT TTG AAT GAG GAG GAG GCG GGT TAG AGT 624
625 TAG AAC CGT CTT TGC TGA ACA AGT AGT GTG TTT GCA GAA AGG GAA GTT 672
673 GAA GTT ACG TTG TCC TAA AAT ACA GCA AGG AAG AGG AGG TGC TTA ATA 720
721 CCA CGG AAA AAA GTC TGT GTC ATG TTT CTG TGA GGT TAA AAG TAA TTT 768
769 GAT AGT GTA TGT CAT GCT GCT GAC ATA TTC CAT GTG TTT GAT ATC TTC 816
817 CCA GCA AAA TAA TCA GCT CTC ATT TTC CCT TAC AGA GTC AAC TGT TCA 864
865 TTT TAT TTC AAA ATT GGA GCA TGT CGT CAT GGA GAC AGG TGC TCT CGG 912
913 TTG CAC AAT AAA CCG ACG TTT AGC CAG ACC ATT GCC CTC TTG AAC ATT 960
961 TAC CGT AAC CCT CAA AAC TCT TCC CAG TCT GCT GAC GGT TTG CGC TGT 1008
1009 GCC GTG AGC GAT GTG GAG ATG CAG GAA CAC TAT GAT GAG TTT TTT GAG 1056
1 Met Gln Glu His Tyr Asp Glu Phe Phe Glu 10
1057 GAG GTT TTT ACA GAA ATG GAG GAG AAG TAT GGG GAA GTA GAG GAG ATG 1104
11 Glu Val Phe Thr Glu Met Glu Glu Lys Tyr Gly Glu Val Glu Glu Met 26
1105 AAC GTC TGT GAC AAC CTG GGA GAC CAC CTG GTG GGG AAC GTG TAC GTC 1152
27 Asn Val Cys Asp Asn Leu Gly Asp His Leu Val Gly Asn Val Tyr Val 42
1153 AAG TTT CGC CGT GAG GAA GAT GCG GAA AAG GCT GTG ATT GAC TTG AAT 1200
43 Lys Phe Arg Arg Glu Glu Asp Ala Glu Lys Ala Val Ile Asp Leu Asn 58
1201 AAC CGT TGG TTT AAT GGA CAG CCG ATC CAC GCC GAG CTG TCA CCC GTG 1248
59 Asn Arg Trp Phe Asn Gly Gln Pro Ile His Ala Glu Leu Ser Pro Val 74
1249 ACG GAC TTC AGA GAA GCC TGC TGC CGT CAG TAT GAG ATG GGA GAA TGC 1296
75 Thr Asp Phe Arg Glu Ala Cys Cys Arg Gln Tyr Glu Met Gly Glu Cys 90
1297 ACA CGA GGC GGC TTC TGC AAC TTC ATG CAT TTG AAG CCC ATT TCC AGA 1344
91 Thr Arg Gly Gly Phe Cys Asn Phe Met His Leu Lys Pro Ile Ser Arg 106
1345 GAG CTG CGG CGG GAG CTG TAT GGC CGC CGT CGC AAG AAG CAT AGA TCA 1392
107 Glu Leu Arg Arg Glu Leu Tyr Gly Arg Arg Arg Lys Lys His Arg Ser 122
1393 AGA TCC CGA TCC CGG GAG CGT CGT TCT CGG TCT AGA GAC CGT GGT CGT 1440
123 Arg Ser Arg Ser Arg Glu Arg Arg Ser Arg Ser Arg Asp Arg Gly Arg 138
1441 GGC GGT GGC GGT GGC GGT GGT GGA GGT GGC GGC GGA CGG GAG CGT GAC 1488
139 Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Arg Glu Arg Asp 154
1489 AGG AGG CGG TCG AGA GAT CGT GAA AGA TCT GGG CGA TTC TGA GCC ATG 1536
155 Arg Arg Arg Ser Arg Asp Arg Glu Arg Ser Gly Arg Phe *** 168
1537 CCA TTT TTA CCT TAT GTC TGC TAG AAA GTG TTG TAG TTG ATT GAC CAA 1584
1585 ACC AGT TCA TAA GGG GAA TTT TTT TTA AAA AAC AAC AAA AAA AAA AAC 1632
1633 ATA CAA AGA TGG GTT TCT GAA TAA AAT TTG TAG TGA TAA CAG TAA AAA 1680
1681 AAA AAA AAA AAA AA 1694
12.FP944
A: nucleotide sequence (SEQ ID NO:34) length: 2244 bases
1 GCCCAGTCTT GTGGCTTCAA ATGCTATCTG TGTGTGGATA GCTCCTCCAG TCCACACCTC
61 TTTTCTGAAT CTTTTTTCAA ATTCACACAT GTGAATTTGA ATTCCAGGCT CATCACATCC
121 AGGTGTGTTC CCCACACCTC CACCTGAATG TCTATTAGAA ATCTCACACC CCTCTTGTCC
181 AAAACTGAGC TCTTGATCTT TAACCAGAAA CTTTATCTCA GCTATTGGCA ACTCCAGGTT
241 TCTGTTTTGC TCAGGCCACA GACTTCAGAG TCATTCTCAC ACTGCCTTTT CTCTCATACA
301 CCATTATGTA CCTGTCAGCA GGTCCCAAAT ATCTAGAATT TTGATTGCTT CACATCCCTA
361 TAGCAACTCT CCTGGTTCAA GCCCATCATC ATCTGGATTT TTGGGAAGTG GTCTTAACTG
421 GTGTGCTGCC CTTCCCCTCG GCTCTCAACC CTATTTTCAA CATAGCAGTC AGAATAATCT
481 AAAATATAAA CCAGATCATA TCACTCCTCT GCTCAACACC CCCTCCCAAT ACCTTCATGT
541 CATCCTCAGA GTTATTGTCC AAGCCTCTAA CTGTGGTCTA CAAGTCTTTA CATGATCTGT
601 CCTGTTACAT TTTCCACTCT CGCTCCAGAC CTGACCTAAG AGGCTTTTCT CTGAACTTGC
661 TAAACATGCT CTTACCTGAG GGTCCATTCC CACTGCCTCT GCCTGGAATG CTCTTCCCCA
721 GACATGCATC TGCGTGGATC ATTTCCATGC CTCCTCAGGT CTTTGCCCTC ATGAAGCCTT
781 CCCTGACAAC CCTATTTAAG ATTGTAATCC CCTCCCACAG TGGCCCCTAT CTGCTAGAGA
841 CTGAATGTTT TTGTCCCCCC CAAAATTCCT ATGTTGAAGC CTTAATCTCA ACATAGGAGG
901 TTTGGCCTTT GTGAGTTAGT TTTAGATGAG GTCATGTGGG TAATAGAATT AGTGCATTTA
961 TAAGGGCGTG AAGAGCCCTC ACTGTGAGGA AACCAGAATG TAAGCCTTCA CCAGAACTTC
1021 TTCATGCTGG CACCCTGGTC TTAGACTTCC CAGCCACTAG ATCTGTGAGA AAGAAATGTT
1081 TGTTGTTTAA GCCACCCTGT CTATGGTACT CTTGTTACAG CAACCTGAAC TAAGACACTA
1141 TCCCTCTCCC CTGCTTTCTT TTTCTGTTAT CACCATAATT AATGTTTTAC TTATATATGA
1201 TTTGTTAATG TATGTCTCTT CCCACTAGAA ACCGAATTCT ATGACAGCAA GGGTTTTTGT
1261 TTTCATTATT GTCATATTAC CTGTGCCTGG CATGTGATAA GCCCTAAGTA CTTGTTGACT
1321 GAATTAACAT ATGAGATCTG TGAGTCTGAA AAAAGAAGTA CATCATTCCT TCACATGTCA
1381 TTTTGAGCCA CTGTTAACAG ATTACATAAC TGTTATTGGA AGTGCCTCAC TACTCCATTT
1441 GGAAAGTATT CCTGGAATAC ACGATGAATG ACTCAGTAGA AGTTCTGGCT ATGGGGGAGT
1501 GCCATCAATA AATAAAAATA ATACCATTGC CTGTTGGTTA TAAGTTGCTG TCACAGACTA
1561 TGAAGGTATT TGAATTGTAT TTTCAGGAAG AGAAACAGAT ATGCAACCAA TCTGATTTGG
1621 TTCCACACCC TGCCACAGAA GTCCCAAAGG AGAAGCAAAT GCCACTTGTA GACTCAAACT
1681 CTGGACTTCA TTCTCAGCAC CATCTCTTAA CCTACTGAGT AAATCCTATG AACCTGACTA
1741 GGGTGGCAAC TGAAGTTGTA GTACTTTCTG TCTTCAAAGG AGGGTGAACC TGGAGTATGT
1801 TGGAAGAAGT GTTAGGTTCA TTAGAGACAC CTCTTTCATA ACTGCTGTCA CCTAGAATGT
1861 CTATTCTCTA AGCACCTTCT TGAAAGAAGC ACATTCCCCC AGGAGGAGGA AATGGTTTGT
1921 TTTCTTTAGT GACAGTTGGC CCAGGATCCT AAGGTAAAGG AGTATGGGCA CTAGACAAAA
1981 TGAGTTTTAT AATAGTCGAT TGGAAGGAAT GGAAGTTTTT GAATCACAGG CTTCTGCCTT
2041 GAAAGGGCAA GTCCGTGGTG GTCATGAACC ACCCAGAGTG TGTACAAGGC TCCTGCCAGA
2101 AGAAATGCCT GTAGGATTAA AAGCAAAACA GCAGAATCAA AAAGGAAGGG CACGATGATG
2161 CCCAGGTGCA GCCTATATTA TTAACAGACA AGGAAAGAAT CCAAATAAAA GTTCTGTAAA
2221 CCAAAAAAAA AAAAAAAAAA AAAA
B: nucleotide sequence (SEQ ID NO:35) length: 119 amino acid
1 MSSSELLSKP LTVVYKSLHD LSCYIFHSRS RPDLRGFSLN LLNMLLPEGP FPLPLPGMLF
61 PRHASAWIIS MPPQVFALMK PSLTTLFKIV IPSHSGPYLL ETECFCPPQN SYVEALIST
C. Nucleotide and amino acid composite sequence (SEQ ID NO:36) clone number: FP944
Start code: 537ATG stops coding: 894TAG protein molecular weight: 13317.99
1 GC CCA GTC TTG TGG CTT CAA ATG CTA TCT GTG TGT GGA TAG CTC CTC 47
48 CAG TCC ACA CCT CTT TTC TGA ATC TTT TTT CAA ATT CAC ACA TGT GAA 95
96 TTT GAA TTC CAG GCT CAT CAC ATC CAG GTG TGT TCC CCA CAC CTC CAC 143
144 CTG AAT GTC TAT TAG AAA TCT CAC ACC CCT CTT GTC CAA AAC TGA GCT 191
192 CTT GAT CTT TAA CCA GAA ACT TTA TCT CAG CTA TTG GCA ACT CCA GGT 239
240 TTC TGT TTT GCT CAG GCC ACA GAC TTC AGA GTC ATT CTC ACA CTG CCT 287
288 TTT CTC TCA TAC ACC ATT ATG TAC CTG TCA GCA GGT CCC AAA TAT CTA 335
336 GAA TTT TGA TTG CTT CAC ATC CCT ATA GCA ACT CTC CTG GTT CAA GCC 383
384 CAT CAT CAT CTG GAT TTT TGG GAA GTG GTC TTA ACT GGT GTG CTG CCC 431
432 TTC CCC TCG GCT CTC AAC CCT ATT TTC AAC ATA GCA GTC AGA ATA ATC 479
480 TAA AAT ATA AAC CAG ATC ATA TCA CTC CTC TGC TCA ACA CCC CCT CCC 527
528 AAT ACC TTC ATG TCA TCC TCA GAG TTA TTG TCC AAG CCT CTA ACT GTG 575
1 Met Ser Ser Ser Glu Leu Leu Ser Lys Pro Leu Thr Val 13
576 GTC TAC AAG TCT TTA CAT GAT CTG TCC TGT TAC ATT TTC CAC TCT CGC 623
14 Val Tyr Lys Ser Leu His Asp Leu Ser Cys Tyr Ile Phe His Ser Arg 29
624 TCC AGA CCT GAC CTA AGA GGC TTT TCT CTG AAC TTG CTA AAC ATG CTC 671
30 Ser Arg Pro Asp Leu Arg Gly Phe Ser Leu Asn Leu Leu Asn Met Leu 45
672 TTA CCT GAG GGT CCA TTC CCA CTG CCT CTG CCT GGA ATG CTC TTC CCC 719
46 Leu Pro Glu Gly Pro Phe Pro Leu Pro Leu Pro Gly Met Leu Phe Pro 61
720 AGA CAT GCA TCT GCG TGG ATC ATT TCC ATG CCT CCT CAG GTC TTT GCC 767
62 Arg His Ala Ser Ala Trp Ile Ile Ser Met Pro Pro Gln Val Phe Ala 77
768 CTC ATG AAG CCT TCC CTG ACA ACC CTA TTT AAG ATT GTA ATC CCC TCC 815
78 Leu Met Lys Pro Ser Leu Thr Thr Leu Phe Lys Ile Val Ile Pro Ser 93
816 CAC AGT GGC CCC TAT CTG CTA GAG ACT GAA TGT TTT TGT CCC CCC CAA 863
94 His Ser Gly Pro Tyr Leu Leu Glu Thr Glu Cys Phe Cys Pro Pro Gln 109
864 AAT TCC TAT GTT GAA GCC TTA ATC TCA ACA TAG GAG GTT TGG CCT TTG 911
110 Asn Ser Tyr Val Glu Ala Leu Ile Ser Thr *** 120
912 TGA GTT AGT TTT AGA TGA GGT CAT GTG GGT AAT AGA ATT AGT GCA TTT 959
960 ATA AGG GCG TGA AGA GCC CTC ACT GTG AGG AAA CCA GAA TGT AAG CCT 1007
1008 TCA CCA GAA CTT CTT CAT GCT GGC ACC CTG GTC TTA GAC TTC CCA GCC 1055
1056 ACT AGA TCT GTG AGA AAG AAA TGT TTG TTG TTT AAG CCA CCC TGT CTA 1103
1104 TGG TAC TCT TGT TAC AGC AAC CTG AAC TAA GAC ACT ATC CCT CTC CCC 1151
1152 TGC TTT CTT TTT CTG TTA TCA CCA TAA TTA ATG TTT TAC TTA TAT ATG 1199
1200 ATT TGT TAA TGT ATG TCT CTT CCC ACT AGA AAC CGA ATT CTA TGA CAG 1247
1248 CAA GGG TTT TTG TTT TCA TTA TTG TCA TAT TAC CTG TGC CTG GCA TGT 1295
1296 GAT AAG CCC TAA GTA CTT GTT GAC TGA ATT AAC ATA TGA GAT CTG TGA 1343
1344 GTC TGA AAA AAG AAG TAC ATC ATT CCT TCA CAT GTC ATT TTG AGC CAC 1391
1392 TGT TAA CAG ATT ACA TAA CTG TTA TTG GAA GTG CCT CAC TAC TCC ATT 1439
1440 TGG AAA GTA TTC CTG GAA TAC ACG ATG AAT GAC TCA GTA GAA GTT CTG 1487
1488 GCT ATG GGG GAG TGC CAT CAA TAA ATA AAA ATA ATA CCA TTG CCT GTT 1535
1536 GGT TAT AAG TTG CTG TCA CAG ACT ATG AAG GTA TTT GAA TTG TAT TTT 1583
1584 CAG GAA GAG AAA CAG ATA TGC AAC CAA TCT GAT TTG GTT CCA CAC CCT 1631
1632 GCC ACA GAA GTC CCA AAG GAG AAG CAA ATG CCA CTT GTA GAC TCA AAC 1679
1680 TCT GGA CTT CAT TCT CAG CAC CAT CTC TTA ACC TAC TGA GTA AAT CCT 1727
1728 ATG AAC CTG ACT AGG GTG GCA ACT GAA GTT GTA GTA CTT TCT GTC TTC 1775
1776 AAA GGA GGG TGA ACC TGG AGT ATG TTG GAA GAA GTG TTA GGT TCA TTA 1823
1824 GAG ACA CCT CTT TCA TAA CTG CTG TCA CCT AGA ATG TCT ATT CTC TAA 1871
1872 GCA CCT TCT TGA AAG AAG CAC ATT CCC CCA GGA GGA GGA AAT GGT TTG 1919
1920 TTT TCT TTA GTG ACA GTT GGC CCA GGA TCC TAA GGT AAA GGA GTA TGG 1967
1968 GCA CTA GAC AAA ATG AGT TTT ATA ATA GTC GAT TGG AAG GAA TGG AAG 2015
2016 TTT TTG AAT CAC AGG CTT CTG CCT TGA AAG GGC AAG TCC GTG GTG GTC 2063
2064 ATG AAC CAC CCA GAG TGT GTA CAA GGC TCC TGC CAG AAG AAA TGC CTG 2111
2112 TAG GAT TAA AAG CAA AAC AGC AGA ATC AAA AAG GAA GGG CAC GAT GAT 2159
2160 GCC CAG GTG CAG CCT ATA TTA TTA ACA GAC AAG GAA AGA ATC CAA ATA 2207
2208 AAA GTT CTG TAA ACC AAA AAA AAA AAA AAA AAA AAA A 2244
Claims (5)
1. isolating polynucleotide is characterized in that, it is selected from down group:
(a) polynucleotide of the following polypeptide of coding, described polypeptide has the aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35;
(b) with polynucleotide (a) complementary polynucleotide.
2. polynucleotide as claimed in claim 1 is characterized in that, the polypeptide of this polynucleotide encoding has the aminoacid sequence of the group of being selected from down: SEQ ID NO:2,5,8,11,14,17,20,23,26,29,32,35.
3. polynucleotide as claimed in claim 1 is characterized in that, the sequence of these polynucleotide is selected from down group:
SEQ ID NO:3,6,9,12,15,18,21,24,27,30,33,36 coding region sequence or full length sequence.
4. a carrier is characterized in that, it contains the described polynucleotide of claim 1.
5. a genetically engineered host cell is characterized in that, it is a kind of host cell that is selected from down group:
(a) host cell that transforms or transduce with the described carrier of claim 4;
(b) host cell that transforms or transduce with the described polynucleotide of claim 1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB011267275A CN1177049C (en) | 2001-09-12 | 2001-09-12 | Human protein with function of suppressing cancer cell growth and its coding sequence |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB011267275A CN1177049C (en) | 2001-09-12 | 2001-09-12 | Human protein with function of suppressing cancer cell growth and its coding sequence |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1403479A CN1403479A (en) | 2003-03-19 |
CN1177049C true CN1177049C (en) | 2004-11-24 |
Family
ID=4666734
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB011267275A Expired - Fee Related CN1177049C (en) | 2001-09-12 | 2001-09-12 | Human protein with function of suppressing cancer cell growth and its coding sequence |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1177049C (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11180541B2 (en) * | 2017-09-28 | 2021-11-23 | Geltor, Inc. | Recombinant collagen and elastin molecules and uses thereof |
AU2020270920A1 (en) | 2019-04-12 | 2021-11-04 | Geltor, Inc. | Recombinant elastin and production thereof |
-
2001
- 2001-09-12 CN CNB011267275A patent/CN1177049C/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN1403479A (en) | 2003-03-19 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1177049C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
CN1170850C (en) | Human angiogenin-like protein and coding sequence and application thereof | |
CN1169954C (en) | Human protein able to suppress growth of cancer cells and its coding sequence | |
CN1177864C (en) | Novel human protein with expression difference in liver cancer tissue and its code sequence | |
CN1209373C (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
CN1177048C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
CN1177050C (en) | Human protein with function of suppressing cancer cell growth and its coding sequence | |
CN1199998C (en) | Human protein with suppression to cancer cell growth and its coding sequence | |
CN1155615C (en) | Human protein with cancer cell growth suppressing function and its coding sequence | |
CN1199996C (en) | New human protein having cancer cell growth in hibiting function and its code sequence | |
CN1199994C (en) | New human protein with cancer cell growth inhibiting function and its coding sequence | |
CN1169955C (en) | Human protein able to suppress growth of cancer cells and its coding sequence | |
CN1199997C (en) | New human protein having mouse NIH/3T3 cell conversion promoting function and its code sequence | |
CN1169831C (en) | Human protein with cancer call growth suppressing function and its coding sequence | |
CN1205225C (en) | Human protein with cancer inhibiting function and its coding sequence | |
CN1230445C (en) | Novel human protein with function for promoting mouse NIH/313 cell transformation and coding sequence thereof | |
CN1194989C (en) | Novel human protein able to suppress cancer cell growth and its coding sequence | |
CN1193041C (en) | New human protein with the function of inhibiting cancer cell growth and its encoding sequence | |
CN1222616C (en) | Novel human protein with cancer-inhibiting function and coding sequence thereof | |
CN1209374C (en) | Human Protein for promoting transform of 3T3 cell and its coding sequence | |
CN1194010C (en) | New human protein with the function of inhibiting cancer cell growth and its coding sequence | |
CN1169956C (en) | Human protein able to suppress growth of cancer cells and its coding sequence | |
CN1209370C (en) | New human protein with cancer inhibiting function and its encoding sequence | |
CN1155616C (en) | Human protein with cancer cell growth promoting function and its coding sequence | |
CN1169958C (en) | Human protein able to suppress growth of cancer cells and its coding sequence |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C19 | Lapse of patent right due to non-payment of the annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |