WO2004090526A1 - 試料解析方法及び試料解析プログラム - Google Patents
試料解析方法及び試料解析プログラム Download PDFInfo
- Publication number
- WO2004090526A1 WO2004090526A1 PCT/JP2004/004621 JP2004004621W WO2004090526A1 WO 2004090526 A1 WO2004090526 A1 WO 2004090526A1 JP 2004004621 W JP2004004621 W JP 2004004621W WO 2004090526 A1 WO2004090526 A1 WO 2004090526A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sample
- data
- sample analysis
- profiles
- corrected
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/86—Signal analysis
- G01N30/8675—Evaluation, i.e. decoding of the signal into analytical information
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16C—COMPUTATIONAL CHEMISTRY; CHEMOINFORMATICS; COMPUTATIONAL MATERIALS SCIENCE
- G16C20/00—Chemoinformatics, i.e. ICT specially adapted for the handling of physicochemical or structural data of chemical particles, elements, compounds or mixtures
- G16C20/20—Identification of molecular entities, parts thereof or of chemical compositions
-
- H—ELECTRICITY
- H01—ELECTRIC ELEMENTS
- H01J—ELECTRIC DISCHARGE TUBES OR DISCHARGE LAMPS
- H01J49/00—Particle spectrometers or separator tubes
- H01J49/02—Details
- H01J49/04—Arrangements for introducing or extracting samples to be analysed, e.g. vacuum locks; Arrangements for external adjustment of electron- or ion-optical components
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/04—Preparation or injection of sample to be analysed
- G01N2030/042—Standards
- G01N2030/045—Standards internal
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/62—Detectors specially adapted therefor
- G01N30/72—Mass spectrometers
- G01N30/7233—Mass spectrometers interfaced to liquid or supercritical fluid chromatograph
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/86—Signal analysis
- G01N30/8624—Detection of slopes or peaks; baseline correction
- G01N30/8641—Baseline
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N30/00—Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
- G01N30/02—Column chromatography
- G01N30/86—Signal analysis
- G01N30/8665—Signal analysis for calibrating the measuring apparatus
Definitions
- the present invention relates to a sample analysis method and a sample analysis program using multidimensional data obtained as a result of sample analysis.
- LC-MS liquid chromatography mass spectrometry
- MS mass spectrometry
- the mass Z charge is plotted on the horizontal axis.
- the spectrum data can be obtained in two dimensions as a graph with the ratio (hereinafter abbreviated as m / z) and the ion intensity on the vertical axis.
- the role of the LC is to simply fractionate the sample to adapt to the throughput of the MS.
- the time axis is corrected and superimposed so that multiple chromatographic results can be compared with each other in order to use the chromatographic information not only as a fraction but also as information indicating the characteristics of the sample ( align) methods have been proposed.
- Typical examples are Dynamic Time Warping (hereinafter abbreviated as DTW) and Correlation Optimized Warping (hereinafter abbreviated as COW). Both are one implementation based on the dynamic programming algorithm.
- the Euclidean distance or correlation is used as an indicator of the distance or similarity of two chromatographies (V. Pravdova, B. Walczak, DL Massart, A comparison of two algorithms for warping of analytical signals, Anal. Chim. Acta 456: 77-92 (2002)).
- these methods are applied to chromatograms represented in two dimensions, the time axis of the chromatography and the signal intensity, they correct at least one-dimensional parameters in multidimensional data. is not.
- Such a superposition method is based on the premise that the chromatograms to be compared are similar to each other in the spectrum graphs. In fact, whether DTW or COW, overlaying is performed with the aim of minimizing the distance or maximizing the correlation between the profiles to be compared. There is a good possibility that no match can be obtained.
- a method based on such high commonality is expected to have many factors that are expected to fluctuate, as in actual disease state analysis and drug responsiveness analysis, and the amount of each fluctuation is minute, and individual differences Inappropriate when it is likely to be mixed with measurement errors.
- an object of the present invention is to provide a sample analysis method and a sample analysis program capable of achieving excellent analytical performance when analyzing components contained in a sample in view of the above-described situation. I do. Disclosure of the invention
- the present invention that has achieved the above objects includes the following.
- the multidimensional data is three-dimensional data obtained from the results of chromatography mass analysis, which consists of a parameter indicating the mass / charge ratio, a parameter indicating the ionic strength, and a parameter indicating the retention time. Can be mentioned. At this time, it is preferable to correct the parameter indicating the holding time in the step a.
- a profile related to parameters excluding the parameter to be corrected is defined as a reference profile, and an evaluation function serving as a measure of arrangement similarity regarding a plurality of reference profiles among a plurality of samples can be given.
- the allocation of each profile is performed as an optimal search problem that optimizes the value of the evaluation function. Can be requested.
- the evaluation function is defined by one or more terms selected from the group consisting of:
- ⁇ A term about the degree of agreement or disagreement of the signal derived from the reference material between the profiles to be compared
- a dynamic programming algorithm can be used when optimizing the value of the evaluation function as an optimal solution search problem for the parameter to be corrected.
- the correspondence between the data points derived from the reference material is determined. Is preferably set to improve the score. Furthermore, in this case, it is preferable that the constraint on the correspondence between the data points derived from the reference material be that they always correspond at specified points.
- the constraint on the correspondence between the data points derived from the reference material be that they always correspond at specified points.
- the sample analysis method according to the above (1) in particular, in the above-mentioned step a, by using information derived from the standard substance added in advance, the accuracy of the analysis can be further improved, and the ability of the correction processing can be improved. .
- a method having such features is named an internal standard guided optimal profile alignment (z-OPAL) method.
- the above-described sample analysis method includes an input unit having a function of inputting various data, an arithmetic processing unit having a function of executing arithmetic processing according to a program, and a function of displaying a result of the arithmetic processing and the like.
- This can be realized as a program to be executed by a computer including a display unit having
- the sample analysis method can detect and identify substances having different amounts between different types of samples. Specifically, multi-dimensional data for multiple samples 3D data consisting of a parameter indicating the mass / charge ratio obtained as a result of chromatography-mass spectrometry, a parameter indicating the ionic strength, and a parameter indicating the retention time were measured. By comparing dimensional data, signals with significantly different ionic strengths can be detected and identified. Further analysis is performed on a substance that has generated a signal having characteristics of these signals, that is, properties that are sufficiently close to the mass-to-charge ratio and the retention time, so that the substance can be identified. .
- a substance having a significantly different abundance between the disease group and the healthy group can be detected and identified.
- the substances identified in this way can be used as biomass resources.
- the results of biomarker detection and identification can be used to diagnose diseases and select treatment methods.
- FIG. 1 is a diagram showing an example of three-dimensional spectrum data obtained by a sample analysis method and a sample analysis program according to the present invention.
- FIG. 2 is a diagram illustrating an example of three-dimensional data.
- FIG. 3 is a diagram showing an example of another three-dimensional data set for searching for a correspondence relationship with the three-dimensional data shown in FIG.
- FIG. 4 is a diagram showing an optimal correspondence arrangement between the three-dimensional data shown in FIG. 2 and the three-dimensional data shown in FIG.
- FIG. 5 is a diagram showing a concept of searching for an optimal correspondence arrangement between the three-dimensional data shown in FIG. 2 and the three-dimensional data shown in FIG. Fig. 6 is a diagram showing that if the route is limited using information from the reference material in the optimal arrangement search shown in Fig. 5, the gray part of the search space no longer needs to be searched.
- FIG. 7 is a diagram showing that if more information from the reference material is used to increase the constraint conditions on the route, the space that does not need to be searched increases, and the search efficiency is further improved.
- FIG. 8 is a diagram illustrating a result obtained by superimposing waveforms that are different on the time axis in five measurement results of the same type and adding signals by using the sample analysis program according to the present invention. .
- FIG. 9 shows that the sample analysis program according to the present invention superimposes profiles obtained from seven different types of sample measurements on the same time axis so that different samples can be compared with each other at each time point.
- FIG. 10 is a diagram showing a calculated difference spectrum between two different types of samples by the sample analysis program according to the present invention.
- FIG. 11 is a diagram showing signals selected by the sample analysis program according to the present invention as signals having a significant quantitative change between sample groups.
- FIG. 12 is a diagram showing that a sample analysis program according to the present invention is applied to a marker search using an actual clinical specimen, and that signals can be classified according to grooving based on different pathological diagnosis results.
- FIG. 13 is a diagram showing a signal obtained by performing a statistical test on the results shown in FIG. 12 and picking up signals that change quantitatively according to different pathological diagnosis results.
- FIG. 14 is a diagram showing the results of associating each signal of the results shown in FIG. 13 with protein identification by MSZMS.
- FIG. 15 shows that among the proteins associated with known proteins in the form shown in FIG. 14, in particular, those proteins known to be associated with cancer metastasis were found by the sample analysis system according to the present invention.
- 6 is a table showing some of the results obtained.
- a sample to be analyzed is collected.
- the sample to be analyzed is not particularly limited, and examples thereof include a tissue section of an organ derived from an animal individual, a body fluid component such as plasma and lymph, an organ such as a green leaf and a petal of a plant, an environmental soil and a water component.
- the analytes contained in these samples are not particularly limited to, but include, for example, organic compounds, inorganic compounds, organometallic compounds, metal ions, peptides, proteins, metalloproteins, and post-translational modifications including phosphorylation.
- Examples include a peptide that has undergone post-translational modification including phosphorylation, a nucleic acid, a carbohydrate, a lipid, and the like. Particularly preferred are a peptide, a protein, a metalloprotein, and a peptide or protein that has undergone post-translational modification.
- the collected sample is preferably subjected to various treatments as necessary according to the purpose of the analysis and the characteristics of the collected sample. For example, (a) separation or fractionation of a group of proteins, (ii) enzymatic and Z- or chemical cleavage of a group of proteins, (ii) separation or fractionation of a peptide mixture generated by cleavage, and (D) It is preferable to perform a pre-analytical preparation in which all or some of the components of the standard are added.
- (a) separation or fractionation of proteins refers to one-dimensional sodium dodecyl sulfate (SDS) electrophoresis, two-dimensional electrophoresis, capillary electrophoresis, ion-exchange chromatography, It can be carried out by gel filtration chromatography, normal phase chromatography, reverse phase chromatography, affinity chromatography, or multidimensional separation and fractionation by a combination thereof.
- “(i) enzymatic and / or chemical cleavage of proteins” refers to trypsin digestion, chymotrypsin digestion, Lys-C digestion, Asp-N digestion, Glu-C digestion, cyanogen bromide digestion or any of these. The cutting can be performed by a combination or the like.
- ( ⁇ ) Separation or fractionation of the peptide mixture generated by cleavage refers to one-dimensional sodium dodecyl sulfate (SDS) electrophoresis, two-dimensional electrophoresis, capillary electrophoresis, ion-exchange chromatography, Gel filtration chromatography 1. Normal phase chromatography, reverse phase chromatography, affinity chromatography, or multidimensional separation and fractionation using a combination thereof. be able to.
- SDS sodium dodecyl sulfate
- the standard substance is one that can be ionized by the selected ionization method and elutes within the LC retention time of the measurement. It is preferable to select one having high reproducibility of the ion intensity.
- preferred standards include, for example, organic compounds, inorganic compounds, organometallic compounds, metal ions, peptides, proteins, metalloproteins, post-translationally modified peptides including phosphorylation, and phosphorylation Examples include proteins, nucleic acids, carbohydrates, lipids, and the like containing post-translational modifications, and more preferably, commercially available peptides, proteins, naturally occurring substances, or synthesized substances.
- the various processes before the analysis shown in (a) to (e) are, for example, “in order of ⁇ , e, i, ⁇ ”, “in order of e, i, ⁇ ”, “i, e, ⁇ ”.
- the order can be performed in the order of, in the order of e and a, in the order of e and a, in the order of a and e, or only e.
- multidimensional data on the sample is obtained by analyzing the sample. Specifically, a sample is analyzed by LC-MS, and multidimensional data consisting of m / " ⁇ ionic strength and retention time is measured.
- analysis by LC-MS means that the sample is Separation or fractionation according to the principle, and then the components contained in the separated or fractionated sample are measured by the principle of mass spectrometry.
- the retention time means that the sample is separated or fractionated according to the principle of chromatography. MZz and ionic strength are measured as a result of mass spectrometry.
- chromatography is not particularly limited, but various types of chromatography such as reverse phase chromatography, capillary electrophoresis, affinity chromatography, chromatofocusing, isoelectric focusing, gel filtration chromatography, etc. Principle can be applied.
- LC in the present specification means not only liquid chromatography but also broad and general chromatography.
- LC-MS chromatography provides reproducible elution profiles, high resolution, and MS It is preferable that a molecular ion can be directly introduced into the polymer.
- preferable conditions in liquid chromatography are as follows.
- reverse phase liquid chromatography using a C18 column using an eluent containing a strong acid such as formic acid at a low concentration in a water-acetonitrile solution is preferable.
- reverse-phase liquid chromatography using a C4 column using an eluent containing a strong acid such as formic acid at a low concentration in a water-acetonitrile solution is preferred.
- the mass spectrometry is not particularly limited, but includes a magnetic field mass spectrometer, a time-of-flight mass spectrometer, a quadrupole mass spectrometer, an ion trap mass spectrometer, a Fourier transform mass spectrometer, or a hybrid or tandem mass spectrometer thereof. Can be performed.
- a magnetic field mass spectrometer More preferably, a magnetic field mass spectrometer, a time-of-flight mass spectrometer, a quadrupole mass spectrometer, an ion trap mass spectrometer, a Fourier transform mass spectrometer, or a combination thereof that can be combined with electrospray ionization or nanoelectrospray ionization It is preferable to perform mass spectrometry using a hybrid or tandem mass spectrometer.
- the analysis result of the sample can be obtained as three-dimensional data.
- data on retention time, signal on mZz and data on ionic strength are input to a computer via input means, and are processed by arithmetic processing means in accordance with the algorithm described in detail below.
- arithmetic processing means in accordance with the algorithm described in detail below.
- This algorithm can be installed in computer software. By installing the software on a computer, the algorithm can be realized on a computer by arithmetic processing means such as a CPU. Therefore, the three-dimensional data as shown in FIG. 1 can be displayed on a display device of a computer.
- the analysis result of the sample can be obtained as a profile plotted in a three-dimensional space, so that the analysis capability of the sample can be dramatically improved. Cut off.
- data can be acquired as a superposition of a large number of spectra having a spread in the direction of the axis indicating the retention time, and the data can be obtained by comparing with the conventional analysis method.
- spectrum-based identification can be performed. For this reason, for example, the component analysis of each sample can be performed more strictly by comparing the multidimensional data obtained for a plurality of samples.
- the holding time measured as described above can be corrected by the algorithm according to the present invention under the control of the arithmetic processing means.
- the retention time often fluctuates non-linearly because factors such as the composition of the mobile phase in LC, the flow rate, and the column temperature cause minute changes with time. Therefore, regarding the three-dimensional data obtained by the analysis method according to the present invention, when the analysis is performed on a plurality of samples, the axis indicating the retention time between the samples may vary non-linearly. Conceivable. Therefore, in the algorithm according to the present invention, the holding time is corrected (hereinafter also referred to as time axis correction).
- the time axis correction targeted by the algorithm according to the present invention is expressed in a two-dimensional space having a holding time and one signal strength, such as the time axis correction of a chromatogram by the DTW algorithm or the like in the conventional method. It is not a one-dimensional profile correction.
- the data targeted by the present invention is one in which the profile to be corrected with respect to the time axis is expressed in at least two or more dimensions.
- the algorithm is not limited to the correction of the retention time, and can be widely applied to the case where at least one-dimensional parameter is corrected when a multi-dimensional parameter is obtained.
- the algorithm is based on the multidimensional parameters (eg, cubic This can be applied when correcting at least one-dimensional parameters in (original parameters). Therefore, in the following description, an algorithm in the case of acquiring + 9- dimensional measurement data will be described.
- x and y are column vectors having the dimension N of the number of data points.
- a data point is a + 9-dimensional vector that constitutes one row of the profile matrix (Z), and represents a set of measurement parameters and values for one element to be measured.
- Z profile matrix
- s means IDS is the number of reference points.
- any value of Z () must fall within a range in which the value taken by each reference point can be estimated.
- each data point of x “> and ⁇ 2) ie, ie ⁇ l, ..., p ⁇
- Z (1) and Z (2) can be different, not all data points correspond one-to-one, but also include data points that have no corresponding partner.
- the evaluation score E of the association in the entire profile is calculated using the following evaluation function, and the evaluation score is a higher score as a measure of similarity. It is also possible to define it as a measure of distance, or as a “score” if it is smaller. In the following, Explained in the definition,
- x represents the value of the r-th parameter at the i-th data point
- NN 2 is the total number of data points in the first and second profiles, respectively.
- the function / is a function that gives a distance of the degree of similarity of a corresponding point, and examples thereof include the following functions.
- the first item on the right side is the parameter to be corrected.
- the second item is how much the measured parameter to be adjusted is shifted after correction. Is the penalty according to the distance on the parameter measure
- the third item is a score given as a bonus that two points match in all parameters by parameter correction, and conversely the fourth item is on the parameter axis to be corrected This is equivalent to the penalty score for the two points not matching.
- the fifth item is a section for evaluating a signal match with the reference material as a bonus, as described later.
- ⁇ , ⁇ ⁇ and ⁇ are coefficients in terms including each of them, and are values that can be set as appropriate.
- ⁇ can be set to 1.0
- ] 3 can be set to 0.1
- ⁇ can be set to 0 if the points match by parameter correction
- ⁇ can be set to 100 if they do not match.
- the function ', zo is a function that gives 1 if the value of the parameter r of interest corresponds to the data point specified by /, zo, and 0 if it does not, and conversely, (, _ /) Is a function that corresponds to 0 if it corresponds and 1 if it does not.
- the second item indicates a measure of the arrangement similarity between the samples with respect to the profile (reference profile) excluding the correction symmetry parameter.
- the expression that gives the penalty for two points of disagreement is Although an example has been shown in which a constant is obtained depending on the response, a value calculated by a predetermined function may be used.
- the fourth item can be calculated by a function that considers whether adjacent data points correspond, the length of a column in which uncorresponding data points appear, and the like.
- represents a distance in a general vector space, and is not necessarily limited to the Euclidean distance.
- the value shall be replaced with 0 (or an appropriate alternative to missing value).
- the evaluation function is not limited to the function represented by the above formula (I).
- the evaluation function is not limited to the function represented by the above formula (I).
- the evaluation function is not limited to the above formula (I), and it is also possible to define a function that can be a measure of the arrangement similarity of the reference profile between samples.
- the following special score is given, for example, depending on whether the corresponding point is a reference point derived from a reference material or not.
- the evaluation function in this case, the distance, ie, Definition
- 6 /, b) -.
- the parameter indicating the retention time is corrected for the three-dimensional data acquired in “2. I can.
- the optimization algorithm is applied to the three-dimensional data obtained in “2. Sample analysis” above, it can be explained according to the following procedures (a) to (d).
- the operation to correct the retention time is realized by comparing two three-dimensional parameter sets, rather than targeting a single three-dimensional parameter set consisting of mZz, ionic strength, and correction time.
- the three-dimensional parameter aggregate is such that in a matrix in which mZz and retention time are taken in rows and columns, respectively, the ion intensity enters the matrix element at the position corresponding to m / z and retention time. It is expressed in a simple form.
- the operation of correcting the retention time is two matrices in Z (1) and Z (2) : This is nothing more than an operation to determine the correspondence between columns corresponding to the retention time axis (hereinafter referred to as “search for corresponding arrangement”).
- search for corresponding arrangement an operation to determine the correspondence between columns corresponding to the retention time axis.
- FIG. 5 shows all possible correspondences of the retention time with respect to the three-dimensional parameter aggregates Z (1) and Z (2) shown in FIGS. 2 and 3.
- the retention time of z (1) is indicated in the horizontal direction and the retention time of z (2) is indicated in the vertical direction.
- A There are cases where there are corresponding retention times in z (1) and z (2) , respectively.
- the score for judging the quality of the corresponding arrangement with respect to the holding time can be defined as follows, for example.
- the score at the top left point of iL that is, the score at the point where the correspondence is not yet determined at all, is set to 0.
- a score can be set for each of (a), (mouth) and (c) as follows.
- the score to be added is set to a value that reflects how similar or apart the raZz parameter and the ion intensity parameter are between z (1) and z (2). can do.
- a score is defined as the similarity. For example, if the ion intensity is detected under the specified mZz in Z (1) but the ion intensity is not detected under the same mZz in Z (2) , or vice versa
- the score can be set to reduce a certain value (penalty score).
- a value calculated by multiplying the absolute value of the difference between the two ion intensities by a predetermined coefficient (Penalty score) can be set.
- the score may be calculated by a function such that the greater the difference between the two ion intensities, the smaller the score.
- the deviation of the retention time in z (1) and z (2) can also be reflected in the score.
- the score can be set so as to reduce the value (penalty score) calculated by multiplying the absolute value of the difference between the retention times in Z (1) and Z (2) by a predetermined coefficient.
- the score may be calculated by a function such that the larger the difference between the retention times in (1) and (2), the smaller the score.
- the signals derived from the reference material correspond to Z (1) and Z (2) . It is preferable to take special measures in calculating the score in addition to devising the calculation method. In particular, since it is strongly desired that these points are matched between z (1) ⁇ Pi z (2), when associated with z (1) and z (2) as a monitor reference material derived signal A big score is given, and a big score is given when only one signal derived from the standard is found.
- the score is calculated stepwise from the upper left corner to the lower right corner of the grid in FIG. 5, and the score at the time when the grid finally reaches the lower right corner is obtained. Will be the score corresponding to.
- the method described as the procedures (a) to (d) can be rephrased as the optimal solution search method based on the dynamic programming, but the algorithm applicable in the present invention is limited to the dynamic programming. It is not done. In other words, it can be implemented using other optimal search algorithms by treating it as a more general search problem that optimizes the objective evaluation function.
- Such an algorithm can be implemented by, for example, an A * algorithm, a genetic algorithm (GA), a simulated annealing (SA), a non-linear programming method using a steepest descent method, or the like.
- the method described as steps (d) is a method based on so-called dynamic programming, and is similar to the DTW and COW methods in that it is based on dynamic programming.
- DTW and COW use the Euclidean distance or correlation as the evaluation function for the form and calculation method of the evaluation function, and compare the time-series data point sequence as it is or segment it at time intervals at fixed intervals, and compare each section.
- the method is limited to a method in which the search is performed under the same global constraint, starting at time 0 of two profiles and using the end time of each profile as the goal.
- the method using COW or COW is basically a non-linear time axis for a time series profile represented as two-dimensional data, that is, a data set represented by the time axis and signal intensity axis. Those Conform superimposed profile of the intensity by performing contraction.
- methods using DTW or COW require (1) one or more cutting planes that take specific values for a particular axis, or (2) all values along a particular axis. Making the superposition operation by aggregation is easily considered as a natural extension of these methods.
- three-dimensional data consisting of the retention time, mZ z, and ionic strength obtained by LC-MS analysis can also be limited to some specific m / z.
- the time axis can be corrected by adding all the ion intensities along the retention time axis, such as a total ion chromatogram (TIC).
- TIC total ion chromatogram
- steps (a) to (d) differs from the method in which DTW and C ⁇ W are expanded, in that the multidimensional profile excluding the dimension to be corrected (retention time axis) is used.
- the dimensions of the correction target can be expanded or contracted to achieve superposition of multidimensional profiles.
- the methods that extend D TW and C OW are as follows: (1) There is no guarantee that the same result as overlaying the entire profile while maintaining accuracy will be obtained if the method limited to specific sections is used.
- Some reference materials can be used as landmarks when adjusting parameters to be corrected (time axis, etc.)
- the calculation method can be modified as follows. That is, in the above-described algorithm, it can be modified so that the peak portion of the signal derived from the standard substance is treated as a point that must pass.
- the path that can be the solution is always this point.
- There is a constraint that you must pass through By setting such a constraint condition, the path passing through the lower left and upper right subspaces of the search space divided by the line passing through the column 15 and the row 13 is excluded. The space that would otherwise be required can be reduced (Figure 6).
- the sample analysis program according to the present invention can more accurately perform the superposition of profiles, Processing efficiency can be greatly improved.
- the search space becomes more limited, so that the accuracy of overlaying the profiles can be improved and the efficiency can be improved.
- the search space becomes more limited, so that the accuracy of overlaying the profiles can be improved and the efficiency can be improved.
- the search space is reduced to a maximum of 1 if the division is performed at equal intervals in the best case.
- the search space can be maximized by selecting the standard material so that the signal derived from the reference material is evenly and widely distributed. .
- the method for modifying the algorithm for limiting the search space to increase the search efficiency is as follows: before and after the diagonal line from the upper left starting point to the lower right destination point in the search space shown in Fig. 5, If it is limited to a space with a width, any constraint condition can be considered. However, in this case, certain prerequisite knowledge of how much should be limited May not generally be obtained. Furthermore, in this case, if the starting point and the arrival point are greatly shifted for each multidimensional data to be compared, there is a possibility that the optimum route to be obtained may protrude from the limited space. For example, the elution start time of chromatography can fluctuate greatly, so if it is not possible to reliably observe this time, it is not appropriate to limit the search to a space with a predetermined width before and after the diagonal line. I want to.
- the search space can be reduced to a maximum of close to ⁇ ", which is excellent both in terms of reliability and efficiency.
- the optimal route search by performing the optimal route search only in one or several subspaces limited by the signal derived from the reference material, it is possible to obtain a partial optimal profile superposition.
- the similarity (or distance) between the profiles can be measured by using the value of the aforementioned evaluation function as an index of the degree of the profile superposition.
- the main signals appear intensively in a limited time domain, so the optimal path search is performed only in the subspace and the value of the evaluation function is obtained, so that the files and their profiles are obtained.
- the similarity (or distance) between the samples that caused the turbulence can be determined efficiently.
- a new corrected value is generated for the corrected parameter.
- the retention time of the chromatogram determines the retention time after collection.
- one of two superimposed two-dimensional data is set as reference data, and the retention time of the other multidimensional data is made to coincide with the retention time of the reference data. (Asymmetric type) and a method of correcting both two superimposed multidimensional data (symmetric type). In particular, it is preferable to obtain the holding time after correction in a symmetric type.
- the retention time of the reference data is used as it is for the coincident point in order to match the retention time axis of the reference data. If the corresponding point in the data is not obtained, the retention time after correction can be determined by interpolation using the points that match before and after that point.
- the corrected retention time is obtained by interpolation from the set of the corrected retention times of the closest corresponding points before and after that. If it is not possible to perform the correction using the internal data, the corrected retention time is calculated by extrapolation using the average time scale of the entire data set as a coefficient, based on the corrected retention time of the nearest corresponding point. It is possible to obtain
- Which output method is selected can be appropriately selected according to the purpose of use of the sample analysis method according to the present invention. For example, if the purpose is to obtain an average from the results of multiple measurements of the same sample to offset measurement errors, or to obtain a representative profile from measurements of multiple samples under very similar conditions If you want, the output method of (2) above is effective. According to the output method (2), the output profile is limited to the common part, so that the data size can be reduced and the processing efficiency can be increased.
- the output method (1) when detecting a difference between groups of different sample groups, the output method (1) must be used.
- the data size generally increases, but no information loss occurs.
- the output method (1) it is also possible to superimpose the common profile with a higher weight.
- a new point is to correspond to the corresponding point in the previous superposition process, a new term to improve the score of the evaluation function is provided in the evaluation function, and the same point is set as much as possible. It is also possible to adjust so that they overlap. That is, for example, the evaluation score is calculated using a new evaluation function in which a new term such as-,) is added to the end of the evaluation function given by the above equation (I).
- ⁇ , __ / is set to 1 if the previous superimposed point can be handled, and set to 0 otherwise.
- the output of the sample analysis program according to the present invention has the following format. ⁇ Information on points newly obtained by superposition processing
- the output is such that this information is repeated as many times as the number of data points obtained as a result of the superposition process. However, if there is no corresponding point, There is no information for tuset 1 or 2. In this way, the output also includes information on the points of the corresponding input data set, so that each point of the finally obtained overlay profile is converted to the original multidimensional It is possible to determine which of the data comes from. In addition to the above information, additional information can be added and output if necessary.
- Aggregation or quantization processing may be performed on some of the parameters obtained from the superimposed profiles obtained as described above, if necessary. For example, especially when all points are output as described in (1) above, the time axis resolution may be too detailed beyond the required level. In this case, it is better to further consolidate points that are extremely close on the time axis into a single point for later processing.
- the intensity of the aggregated points can be replaced by the sum of the individual point intensities before aggregation. Similarly, points that are closer than the required resolution on the m / Z axis can be aggregated. However, this operation may be performed each time the overlay processing is performed, or may be performed only once after performing the required overlay first.
- the sample analysis method it is preferable to normalize the measured ionic strength prior to the above “3. Data analysis”.
- the normalization of ionic strength will be described below, but the method of normalizing ionic strength is not limited at all. Specifically, first, the RAW file obtained as a result of the LC-MS analysis is converted into a text file using, for example, Xcalibur TM utility software. Next, the following series of data processing is applied by a program written in C language and Perl language.
- ionic strength predetermined value e.g. 10 2 or less
- the mZz value and the retention time value of the original data are rounded so that m / z is in increments of 1 and the retention time is in increments of 0.2, and have the same value (mZz, retention time Data points in) are added and counted.
- signals having mZz values near 715 and 877 can be used as standard signals.
- m / z was within ⁇ 1 range, and for the retention time, the signal of ra / z 715 (715 ⁇ 1) was obtained from the early elution signal group.
- the signal derived from the standard substance can be searched.
- further correction is performed when the signal intensity derived from the standard is corrected to 10 7 .
- the peptide itself for example, peptide T (Ala- Ser- Thr -When Thr-Asn-Tyr-Thr) and j3 force somorphin ⁇ (Tyr-Pro-Phe-Pro-Gly-Prolie) were used as standard substances, signals with mZz values around 859 and around 791 respectively It can be a signal.
- the former peptide is relatively hydrophilic, and the latter is hydrophobic.
- the former has a lower retention time value and the latter has a higher retention time value.
- the retention time of peptides from most samples lies between the retention times of the two peptides.
- the value of m-noz is in the range of ⁇ 1 and the retention time can be roughly estimated from the chromatogram obtained by measuring only the reference material in advance. Therefore, by searching within a certain range before and after that, a signal derived from the reference material can be found.
- the intensity of peptide ion signals other than those used as the standard substance among peptide ion signals derived from the protein can be obtained. It is desirable to be as low as possible.
- the measured ionic strength values can be normalized, and quantitative comparison of ionic strength among a plurality of samples can be performed. Note that normalization of the measured ion intensity value should be performed prior to the above-described correction of the retention time.
- the sample analysis method uses a three-dimensional data consisting of mZz, normalized ionic strength, and corrected retention time by the sample analysis method according to the present invention.
- analysis of various components such as a protein group contained in the sample is performed on a computer. It can be carried out.
- the component analysis includes (a) an addition method and (b) a subtraction method.
- the correspondence between data points can be accurately obtained in a plurality of three-dimensional data obtained by the sample analysis method according to the present invention. Therefore, the difference between the normalized ionic strength values of the data points can be obtained.
- the correspondence between data points can be accurately obtained among a plurality of acquired three-dimensional data. Can be. Therefore, all the three-dimensional data can be added according to the above-described addition method. Then, the arithmetic mean can be obtained by dividing the obtained sum of the three-dimensional data by the number of samples. If necessary, a weight can be set for each sample, and a weighted average reflecting the weight can be calculated.
- a representative value of the range can be obtained.
- the component analysis approaches described in (1) to (3) above may use a database storing a plurality of three-dimensional spectrum data obtained by the sample analysis method according to the present invention, or may use the database. This may be performed using the stored data and the data actually obtained. In any case, the component analysis approaches described in (1) to (3) above can be easily realized using a computer.
- the protein group from which the signal is derived is determined by tandem MS analysis in which the range is limited to the obtained signal region. Can be identified. That is, in the sample analysis method according to the present invention, when a peptide molecule ion having a specific mZz value is detected when the sample is analyzed by LC-MS, the
- the CID spectrum can be measured.
- the obtained CID vector is input to a computer, and a database search software is used to search for a protein sequence obtained from a primary protein structure database, a genomic sequence database, or a cDNA sequence database.
- a database search software is used to search for a protein sequence obtained from a primary protein structure database, a genomic sequence database, or a cDNA sequence database.
- information such as protein or amino acid sequence registered in the database can be obtained, and the obtained information is applied to the obtained CID spectrum. Can be associated.
- Example 1 a peptide sample obtained by mixing a protease digest of a protein whose amino acid sequence is already known was measured by LC-MS, and the retention time, ra / The algorithm according to the present invention was applied to a three-dimensional profile consisting of the z-value and the ionic strength, and the measured peptide sample was quantitatively characterized.
- Example 1 as a model experiment for comparative quantification, several peptide samples in which protease digests of proteins whose amino acid sequences were already known were mixed were each measured by LC-MS, and the sample analysis of the present invention was performed. By applying the method and comparing each 3D open file, it was shown that differences in the types of proteins contained in each peptide sample were detected.
- Tryptic digests of the 24 proteins listed below were prepared as peptide samples in this example.
- subtilis alfaamylase (17) maglutathione S-transferase, (18) sigglutamin Acid dehydrogenase, I. (19) ⁇ Shirak topoperoxidase, (20) Koji power Biamylo d'arcosidase, (21) ⁇ Sagi phosphoryla Ze 8, (22) Ushibetagara click Toshidaze, (23) Usagi lactate dehydrogenase, (24) Niwatori egg white Rizochi one arm. These digests were purchased from Michrom BioResources. Tryptic digests of each of these 24 proteins were mixed as shown below to prepare a total of three peptide samples (Groups A to C).
- Group A Trypsin digest of 20 types of proteins (1), (2), (7) to (24). The proteins that characterize group A are (1) and (2).
- Group B Trypsin digest of 20 types of proteins (3), (4), (7) to (24). The proteins that characterize group B are (3) and (4).
- Group C Trypsin digests of 20 proteins from (5) to (24). The proteins that characterize group C are (5) and (6). Three samples of each group were prepared.
- the peptide sample was analyzed by the following apparatus and operation (Kawakami, T. et al, Jpn. J. Electrophoresis 44: 185-190 (2000)).
- the peptide sample concentrated under reduced pressure was dissolved in 45 ⁇ l of a solvent having a mixing ratio of trifluoroacetic acid, acetonitrile and water of 0.1: 2: 98. This is the solution.
- the concentration of mobile phase B was increased linearly from 5% to 85%, and peptide fragments were eluted continuously.
- the flow rate at this time was about 1 ⁇ 1 / min.
- the LC eluate was directly introduced into the ion source of an LCQ TM ion trap mass spectrometer (ThermoQuest) through a New Objective PicoChip TM needle (20 m ID).
- the position of the NanoESI needle allows for fine adjustment of the distance to the heating capillary.
- the spray voltage was not $ 21, and the eluent was charged directly. No gas was used for the mist and the spray current was 3.0 mA.
- the file containing the three-dimensional parameter aggregate was converted to a text file using the Xcal ibur TM utility software.
- the following data processing (1) to (4) were executed by a program written in C language and Perl language.
- signals with m / z values near 715 and 877 derived from chicken egg white lysozyme are used as standard signals.
- m / z In the range of soil 1 before and after the search, the retention time was searched in the range of 6 to 16 minutes for the signal of mZz 715, and in the range of 13 to 23 minutes for the signal of mZz 877.
- the retention time axis was linearly transformed so that the peak positions of the mZz 715 signal and the mZz 877 signal were 10 minutes and 20 minutes, respectively, with respect to the retention time.
- the representative points of three-dimensional profiles obtained from a sample of each of the three groups A, B and C were determined. That is, as described above, samples belonging to the same group were aggregated. The ion intensity at the point where m / z and the retention time overlap was added and tabulated. The higher the score used in this example, the better the score.
- the coefficients in the formula are as follows. The ionic strength difference is calculated as the absolute value of the difference between the common logarithms. The coefficient multiplied by 1 was used. The difference between the retention times was obtained by multiplying the absolute value of the difference by a factor of 1000. When the signals at the corresponding data points in each group were both derived from the standard substance, the addition point was set to 50,000. If there was no corresponding retention time point in one group, the score was 5000 points. In the present embodiment, these are simply added to obtain a score.
- Group A 495, 524, 546, 560, 671, 696, 779, 845, 871, 908, 962, etc.
- Group B 451, 464, 509, 513, 546, 555, 583, 585, 626, 635, 649, 653, 701, 720, 723, 740, 741, 753, 768, 789, 819, 821, 847, 873 , 886, 922, 928, 952, 966, 973, 978, 1057, 1230, etc.
- Group C 636, —670, 674, 679, 683, 718, 734, 735, 770, 824, 870, 918, etc.
- each sample was subjected to LC-MSZMS analysis in order to obtain a CID spectrum of a peptide molecule ion detected as a specific signal.
- the analysis conditions were as described above, except for the following operations. In other words, when performing LC-MSZMS analysis, the measurement conditions of the ion trap mass spectrometer are changed, and when a peptide molecular ion having the m / z value listed above is detected, the CID of the ion must be performed.
- the sample was measured with the measurement conditions set as follows.
- Example 2 a sample obtained by mixing another protein sample having a different concentration in a protein mixture having a predetermined concentration composition was subjected to protease digestion, and measurement was performed by LC-MS.
- the method according to the present invention to the three-dimensional data consisting of the retention time, m / z value, and ionic strength, and comparing the three-dimensional data obtained by measuring samples with different concentrations, Signal was detected. This demonstrates that the method can detect substances that change quantitatively.
- Tryptic digests of the six proteins listed below were prepared as peptide samples in this example.
- These proteins were purchased from Sigma.
- Tryptic digests of each of these six types of proteins were mixed as shown below to prepare a total of seven types of peptide samples.
- peptidyl Bok sample analyzed by the device and the following procedures (Kawakarai, T. et al, Jpn J. Electrophoresi s 44:. 185-190 (2000)) 0 first, the peptide sample was concentrated under reduced pressure, Torifuruoro acetate, mixing ratio of ⁇ Se Tonitoriru and water 0.1: 2: dissolved in a solvent 45 mu 1 of 98. This is used as the lysis solution.
- the peptide fragment was continuously eluted at a flow rate of about 1 ⁇ l / min.
- the eluate of the LC was passed through a New Objective Pi coChip TM needle (20 ⁇ inside diameter).
- the position of the dollar allows fine adjustment of the distance to the heating capillary
- the spray voltage is charged directly to the eluent instead of the needle
- the gas is not used for spraying and the spray current was set to 3.0 mA
- the Turbo Scan method was applied to reduce the number of scans in the mass spectrometer, and this measurement was performed five times in each group to determine the three-dimensional parameters corresponding to each sample. A total of 35 groups were obtained in 7 groups, and Fig. 1 shows an example of the profile obtained.
- Files containing 3D data were converted to text files using Xcal ibur TM utility software.
- the following data processing (1) to (4) was executed using programs written in C, C ++ and Perl languages. To remove the data of 1 noise level, ionic strength was removed by dividing the 10 2 following signals.
- the standard-derived signals used were mZ z 858.9, retention time 9 minutes, and two signals near m / z 791.0, retention time 25 minutes.
- all the signal intensities derived from the standard substance were added together, and the value was normalized to be 10 9 .
- two points were selected from the two standard substance signals described above, one point each giving an intensity peak.
- Fig. 8 shows the chromatogram of the profile obtained from five measurements of a sample with a BSA concentration of 500 fmol, near m / z 620.0 and a residence time of 15 to 19 minutes. Five gray waveforms that are slightly shifted on the time axis
- the output options of the superimposition profile were all points including mismatch.
- the points that satisfy the following conditions have been consolidated into one. That is, all data points in the above range are checked in order of signal intensity, and those that are determined to fall within the range approximated by a Gaussian distribution with the peak signal at the top are aggregated. .
- Figure 9 shows an example of a port file after time axis correction and aggregation for each of the seven samples with different BSA concentrations.
- a section cut at a specific mZz value (752 in this example) is shown as a chromatogram in which intensity is plotted along the time axis.
- the aggregated signal around 17 minutes and ⁇ 19 minutes shows the highest peak for the 06 sample with the highest BSA concentration (shown as “DS: Spl 06-Ave” in the figure; 5 pmoles of BSA). , Spl 04, ..., these signals can be determined to be BSA-derived signals.
- the gentle peak around 25 minutes appears in all samples, it can be determined that the peak is derived from a common substance other than BSA or the background.
- Figure 10 shows a sample with a BSA concentration of 500 4 shows a difference profile of femtomoles with sample 1. The line extending above the mZz-retention time plane is sample 5, and the line extending below is the signal strongly observed in sample 1.
- Fig. 11 The signals selected under the above conditions are shown in Fig. 11 for the case of sample I.
- 127 signals remained as meeting the above conditions.
- the size of the plot mark indicates the signal intensity in the profile of Sample II.
- those with the plot mark ⁇ indicate those that were associated with the BSA signal in the process described below, and those with X did not.
- the signals selected under the same conditions described above were matched with BSA-derived signals. , 75, 81 (described above), 76, and 48% of the signals were determined to be BSA-derived signals. Note that the correct answer rate of the last sample (BSA concentration: 5 picomoles) decreased, but this was due to a change in the profile threshold due to the presence of many strong signals derived from high concentrations of BSA. This is probably due to the increase in signals. In fact, about the same as samples of other concentrations Adjusting the selection condition (2) to 3 ⁇ 10 6 so that a number of signals were selected, the correct answer rate was 75%.
- One of the ideas according to the present invention is to save the search space of the dynamic programming.
- the effect was evaluated by measuring the CPU time. For example, a 43-45% reduction in CPU time was obtained.
- two kinds of signals derived from the standard are used, so if the signals are completely evenly distributed, the time reduction of lZ3 can be expected.However, many signals are actually between the two standard signals. ,
- the search space is divided unevenly. Taking this into account, the reduction of about 45% is almost as expected, and is considered to be sufficiently effective for practical use.
- Example 3 using a real patient-derived tissue sample, a signal derived from a protein that significantly fluctuated among several disease state groups was determined, and based on that signal, MS ZMS analysis was performed. By identifying such a protein, it was shown that the method was effective, especially in search of biomarkers.
- lung adenocarcinoma using surgically excised tissue, protein was extracted from the tissue by the method described below and measured. The obtained profiles were divided into groups that were determined to have lymph node metastasis and those that were not determined by pathological diagnosis at a later date.Signals that fluctuated significantly between the two groups were detected, and MS ZM S Analysis was performed to identify the protein.
- the samples used were surgically resected lung sections from 36 different lung cancer patients. Pathological diagnosis divides these patients into four groups: a group with large and small tumors, and a group with or without metastasis to regional lymph nodes.
- sample buffer for sodium dodecyl sulfate (SDS) -polyacryl / reamide gel electrophoresis (PAGE).
- SDS sodium dodecyl sulfate
- PAGE reamide gel electrophoresis
- the composition of the sample buffer is as follows. 62.5 mM Tris-HCl (pH 6.8), 2% w / v SDS, 5% v / v 2 -Mercaptoethanol, 10% v / v glycerin, 0.0025 ° /. w / v bromophenol blue. This suspension was shaken at room temperature for 30 minutes, and then centrifuged into a supernatant and a precipitate. The protein concentration of the supernatant was determined by a modification of the Lowry method.
- a sample buffer solution of the same composition was added to the sample supernatant for 100 g of the protein to reduce the total volume to 50 mL.
- a 1 M aqueous Tris solution was added to adjust the pH to 8.8.
- 2 L of 400 mM dithiothreitol was applied and the mixture was incubated at 60 ° C. for 30 minutes.
- 10 L of a 400 mM odoacetamide solution was added, and the mixture was allowed to stand at room temperature in the dark for 60 minutes.
- About 5 ⁇ 5 of 1.0N hydrochloric acid was added to return the pH to 6.8. This solution was subjected to Laemmli SDS-PAGE.
- the polyacrylamide gel used in this case consisted of a discontinuous buffer system, ie, concentrated gel (pH 6.8) at the top and separation gel (pH 8.8) at the bottom.
- the polyacrylamide gel concentrations were 4% and 12.5%, respectively, and the overall size was 14 cm wide, 14 cm high, and lmm thick.
- the current during electrophoresis was a constant 10 mA.
- the electrophoresis was stopped when the swimming front of the dye promophenol reached from the interface between the concentrated gel and the separation gel to 48 flats of the separation gel.
- the polyacrylamide gel was shaken in an aqueous solution of 40% methanol and 10% acetic acid to fix the proteins separated in the polyacrylamide gel. Thereafter, the polyacrylamide gel was washed twice with water.
- the washed polyacryl midgel was cut into 24 gel pieces per sample and fractionated. That is, the sample was cut out in a ladder shape at an equal width of 2 mm in the direction perpendicular to the electrophoresis direction, and each section was further divided into dice having a side of about lmm.
- the internal standard protein was added to each sample gel fraction while immobilized in the gel.
- a cut-out section of the gel containing a fixed amount of the standard protein shown above was added to each sample gel fraction.
- the gel pieces were washed with a sufficient amount of water and then dehydrated with acetonitrile.
- the water and acetonitrile remaining in the gel pieces were distilled off under reduced pressure, and then an aqueous trypsin solution was added to such an extent that all the gel pieces were immersed, and the mixture was left on ice for 45 minutes.
- An aqueous solution that did not permeate into the gel was removed, and a 50 mM aqueous solution of ammonium bicarbonate was added to such an extent that all the gel pieces were immersed.
- the mixture was kept at 37 ° C for 16 hours to perform a digestion reaction.
- the peptide samples were separated by the following equipment and operation (Kawakami, T. et al, Jpn. J. Electrophoresis 44: 185-190 (2000 )).
- the peptide sample concentrated under reduced pressure was dissolved in 45 ⁇ l of a solvent having a mixing ratio of trifluoroacetic acid, acetonitrile and water of 0.1: 2: 98. This is used as the solution.
- a MAGICMS TM C18 capillary column 20 ⁇ l of the lysis solution was introduced into a MAGICMS TM C18 capillary column (0.2 mm ID, 50 mm length, 5 ⁇ particle size, 200 ⁇ pore size) manufactured by BioResources. Elution of the peptides was performed using a MAGIC 2002 TM HPLC system (Michrom BioResources).
- the HPLC mobile phase A was a solvent in which formic acid, acetonitrile and water were mixed at a volume ratio of 0.1: 2: 98, while the mixing ratio of the mobile phase B was 0.1: 90: 10.
- the concentration of mobile phase B was increased linearly from 5% to 85%, and peptide fragments were eluted continuously.
- the flow rate at this time was about 1 ⁇ 1 / min.
- the eluate of the LC is passed through a New Objective PicoChip TM needle (20 im inner diameter), and the LCQ TM ion trap mass spectrometer is used. (ThermoQuest) was introduced directly into the ion source. The position of the anoESI needle allows for fine adjustment of the distance to the heating capillary.
- the spray voltage was charged directly to the eluent instead of the needle. No gas was used for spraying, and the spray current was 3.0 mA.
- the obtained LC-MS profile data consists of 36 samples x 24 bands, totaling 864.
- analysis was performed using programs created in C, C ++ and Perl languages in the following procedure.
- signals with m / z values near 715 and 877 derived from chicken egg white lysozyme are used as standard signals, and when searching for signals derived from these standard substances from sample measurement data, m / z The search was performed in the range of ⁇ 1; the retention time was searched in the range of 10 minutes ⁇ 5 minutes for the signal of m / z 715; and the signal of m / z 877 was searched in the range of 18 minutes ⁇ 5 minutes.
- the absolute intensity of the obtained signal derived from the standard the relative intensity among all signals, and If one of the intensity ratios of the two standard-derived signals is far from that of the other, check the profile plot individually and search for the peak of the signal group that is considered to be the standard-derived signal. After adjusting it to be at the center point of the time parameter, it was taken again.
- the signal intensity derived from the standard substance was corrected to 10 7 by dividing the ionic strength of each signal by the obtained total ion intensity value of the signal derived from the standard substance and multiplying the obtained value by 10 7 .
- the retention time axis was linearly transformed so that the peak positions of the 111 / ⁇ 715 signal and the ⁇ 877 signal were 10 minutes and 20 minutes, respectively, with respect to the retention time.
- the profiles superimposed on all bands were treated as the profile of each sample. Specifically, the profiles between adjacent bands were sequentially superimposed and added and counted using the profile superimposition function of the sample analysis program according to the present invention. That is, the band first
- the penalty of the difference (absolute value) on the time axis ⁇ 1.0
- the penalty of the difference of signal intensity i3 1.0 (however, the absolute value of the difference after converting the signal intensity into a common logarithm) )
- a bonus point ⁇ 100 for a point match
- a penalty ⁇ 10 for a mismatch point
- a bonus point ⁇ (i, J) S m 1000 for a signal from a standard.
- the output options of the superimposition profile were all points including mismatch. Each time the overlaying process was completed, data points were aggregated so that the retention time and m / z were 1.0 and 1.0, respectively.
- the profiles are superimposed within the group and aggregated.
- a file was obtained, and then a profile overlapping operation was similarly performed between the groups.
- the parameters for the superimposition process at this time were the same as in the above-mentioned inter-band superposition process.
- the order of superposition within the group, superposition was performed sequentially from the closest one based on the evaluation function score of the superposition processing under the same parameters that was previously performed by brute force.
- the two groups, the difference in tumor size within the group with lymph node metastasis, and the difference in the tumor size within the group without lymph node metastasis were overlapped. Groups with and without metastases were overlapped.
- Fig. 12 shows the final superposition profile, with the signal appearing in the lymph node metastasis positive group plotted upward and the signal present in the negative group negative plotted downward.
- FIG. 13 shows the same plot as in FIG. 12 at the point where the p-value was less than 0.005 in the above test. At this stage, 5,889 signals were obtained.
- sample analysis method and the sample analysis program according to the present invention when analyzing components contained in a sample, excellent analytical performance can be achieved. Therefore, according to the present invention, it is possible to provide a sample analysis method and a sample analysis program which are very effective and useful when comprehensively analyzing a large number of components contained in a sample to be analyzed.
- sample analysis method and sample analysis program according to the present invention are very effective for the purpose of searching for substances related to a difference in the state of any disease using actual clinical specimens. Its usefulness is extremely high in that it can be used to search for and develop diagnostic methods.
Landscapes
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Immunology (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Pathology (AREA)
- Health & Medical Sciences (AREA)
- Crystallography & Structural Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
- Analysing Materials By The Use Of Radiation (AREA)
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/551,148 US20060194329A1 (en) | 2003-03-31 | 2004-03-31 | Sample analyzing method and sample analyzing program |
EP04724777A EP1626274A4 (en) | 2003-03-31 | 2004-03-31 | SAMPLE ANALYSIS PROCEDURE AND SAMPLE ANALYSIS PROGRAM |
JP2005505220A JP4185933B2 (ja) | 2003-03-31 | 2004-03-31 | 試料解析方法及び試料解析プログラム |
CA002521108A CA2521108A1 (en) | 2003-03-31 | 2004-03-31 | Sample analyzing method and sample analyzing program |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2003095732 | 2003-03-31 | ||
JP2003-095732 | 2003-03-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2004090526A1 true WO2004090526A1 (ja) | 2004-10-21 |
Family
ID=33156638
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2004/004621 WO2004090526A1 (ja) | 2003-03-31 | 2004-03-31 | 試料解析方法及び試料解析プログラム |
Country Status (5)
Country | Link |
---|---|
US (1) | US20060194329A1 (ja) |
EP (1) | EP1626274A4 (ja) |
JP (2) | JP4185933B2 (ja) |
CA (1) | CA2521108A1 (ja) |
WO (1) | WO2004090526A1 (ja) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007023876A1 (ja) * | 2005-08-25 | 2007-03-01 | Mcbi, Inc. | 多数試料中に含まれる物質の存在量を比較解析する方法 |
JP2007108060A (ja) * | 2005-10-14 | 2007-04-26 | Kao Corp | 肌質の評価方法 |
JP2007147459A (ja) * | 2005-11-28 | 2007-06-14 | Kazusa Dna Kenkyusho | 情報処理装置、プログラム、及びコンピュータ読み取り可能な記録媒体 |
JP2007535672A (ja) * | 2004-04-30 | 2007-12-06 | マイクロマス ユーケー リミテッド | 質量分析計 |
JP2008249440A (ja) * | 2007-03-30 | 2008-10-16 | Hitachi High-Technologies Corp | クロマトグラフィー質量分析方法、及びクロマトグラフ質量分析装置 |
JP2008261741A (ja) * | 2007-04-12 | 2008-10-30 | Kao Corp | 肌質判定方法 |
JP2008541095A (ja) * | 2005-05-12 | 2008-11-20 | ウオーターズ・インベストメンツ・リミテツド | 化学分析データの視覚化 |
JP2008542690A (ja) * | 2005-05-06 | 2008-11-27 | エクソンモービル リサーチ アンド エンジニアリング カンパニー | 2次元(多次元)分離概念が例として適用された2次元(多次元)分離ガスクロマトグラフィーx質量分析(GCxMS)技術のための新規のデータ処理/視覚化の方法 |
JP2008545973A (ja) * | 2005-06-03 | 2008-12-18 | ウオーターズ・インベストメンツ・リミテツド | 化学分析のためのポリペプチド関連情報のカタログの発生および使用 |
JP2009500617A (ja) * | 2005-07-08 | 2009-01-08 | メタノミクス ゲーエムベーハー | 化学試料を特徴づけるシステムおよび方法 |
JP2009503480A (ja) * | 2005-07-25 | 2009-01-29 | メタノミクス ゲーエムベーハー | クロマトグラフィー/マススペクトロメトリーを用いてサンプルを分析する手段と方法 |
JP2009080010A (ja) * | 2007-09-26 | 2009-04-16 | Hitachi High-Technologies Corp | 質量分析システム |
WO2009051277A1 (ja) | 2007-10-18 | 2009-04-23 | Medical Proteoscope Co., Ltd. | 手術後の予後を推定する方法及び診断キット |
JP2009156722A (ja) * | 2007-12-27 | 2009-07-16 | Hitachi High-Technologies Corp | 質量分析スペクトル分析方法 |
JP2009170238A (ja) * | 2008-01-16 | 2009-07-30 | Hitachi Ltd | 質量分析計及び質量分析方法 |
JP2009294231A (ja) * | 2009-09-24 | 2009-12-17 | Hitachi High-Technologies Corp | クロマトグラフィー質量分析方法、及びクロマトグラフ質量分析装置 |
JP2011220907A (ja) * | 2010-04-13 | 2011-11-04 | Shimadzu Corp | クロマトグラムデータ処理方法及び装置 |
WO2012118209A1 (ja) * | 2011-03-02 | 2012-09-07 | 株式会社堀場エステック | ガスクロマトグラフ用データ処理装置及びそれに用いられるデータ処理プログラム |
JP2014134504A (ja) * | 2013-01-11 | 2014-07-24 | Toyama Univ | 試料解析装置、試料解析方法、及び試料解析プログラム |
WO2014128912A1 (ja) * | 2013-02-22 | 2014-08-28 | 株式会社島津製作所 | データ処理装置及びデータ処理方法 |
US9500629B2 (en) | 2012-11-05 | 2016-11-22 | Shimadzu Corporation | Chromatogram data processing method and device |
WO2018134998A1 (ja) * | 2017-01-23 | 2018-07-26 | 株式会社島津製作所 | クロマトグラムデータ処理装置 |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060045207A1 (en) * | 2004-08-25 | 2006-03-02 | Jie Cheng | Peak detection in mass spectroscopy data analysis |
US7897405B2 (en) * | 2008-02-11 | 2011-03-01 | Thermo Finnigan Llc | Method for identifying the elution time of an analyte |
US8589081B2 (en) * | 2009-07-24 | 2013-11-19 | Bio-Rad Laboratories, Inc. | System and method to determine sigma of a clinical diagnostic process |
US20130204539A1 (en) * | 2011-06-01 | 2013-08-08 | Tsumura & Co. | Feature value preparing method, feature value preparing program, and feature value preparing device for pattern or fp |
EP2837934A4 (en) * | 2012-04-10 | 2015-08-19 | Univ Gifu | METHOD FOR IDENTIFYING AND QUANTIFYING ANIMAL HAIR SPECIES |
JP6020314B2 (ja) | 2013-04-04 | 2016-11-02 | 株式会社島津製作所 | クロマトグラフ質量分析データ処理装置 |
JP6060793B2 (ja) * | 2013-04-19 | 2017-01-18 | 株式会社島津製作所 | ピーク検出装置 |
US10309940B2 (en) * | 2013-08-05 | 2019-06-04 | Shimadzu Corporation | Data processing device for chromatograph and data processing method for chromatograph |
US10416134B2 (en) * | 2014-09-03 | 2019-09-17 | Shimadzu Corporation | Chromatogram data processing method and chromatogram data processing apparatus |
GB201511508D0 (en) * | 2015-07-01 | 2015-08-12 | Ge Healthcare Bio Sciences Ab | Method for determining a size of biomolecules |
JP6654119B2 (ja) * | 2016-09-20 | 2020-02-26 | 株式会社日立製作所 | プラントデータ表示処理装置及びプラント制御システム |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS63108260A (ja) * | 1986-10-24 | 1988-05-13 | Jeol Ltd | 質量分析装置を用いた定性分析方法 |
JPH07198703A (ja) * | 1993-12-28 | 1995-08-01 | Tokai Rubber Ind Ltd | クロマトグラフ分析結果に基づくポリマーの自動解析装置 |
JPH11344482A (ja) * | 1998-06-02 | 1999-12-14 | Jeol Ltd | 質量分析システム |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2904061B2 (ja) * | 1995-06-07 | 1999-06-14 | 株式会社島津製作所 | 液体クロマトグラフ質量分析装置 |
US5885841A (en) * | 1996-09-11 | 1999-03-23 | Eli Lilly And Company | System and methods for qualitatively and quantitatively comparing complex admixtures using single ion chromatograms derived from spectroscopic analysis of such admixtures |
CA2466837A1 (en) * | 2001-11-13 | 2003-05-22 | Caprion Pharmaceuticals Inc. | Mass intensity profiling system and uses thereof |
US6989100B2 (en) * | 2002-05-09 | 2006-01-24 | Ppd Biomarker Discovery Sciences, Llc | Methods for time-alignment of liquid chromatography-mass spectrometry data |
-
2004
- 2004-03-31 JP JP2005505220A patent/JP4185933B2/ja not_active Expired - Lifetime
- 2004-03-31 WO PCT/JP2004/004621 patent/WO2004090526A1/ja active Application Filing
- 2004-03-31 US US10/551,148 patent/US20060194329A1/en not_active Abandoned
- 2004-03-31 EP EP04724777A patent/EP1626274A4/en not_active Withdrawn
- 2004-03-31 CA CA002521108A patent/CA2521108A1/en not_active Abandoned
-
2008
- 2008-05-26 JP JP2008136968A patent/JP2008241721A/ja active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS63108260A (ja) * | 1986-10-24 | 1988-05-13 | Jeol Ltd | 質量分析装置を用いた定性分析方法 |
JPH07198703A (ja) * | 1993-12-28 | 1995-08-01 | Tokai Rubber Ind Ltd | クロマトグラフ分析結果に基づくポリマーの自動解析装置 |
JPH11344482A (ja) * | 1998-06-02 | 1999-12-14 | Jeol Ltd | 質量分析システム |
Non-Patent Citations (2)
Title |
---|
PINKSTON J D, ET AL: "Characterization of low molecular weight alkoxylated polymers using long column SFC/MS and an image analysis based quantitation approach", JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY, vol. 13, no. 10, 10 October 2002 (2002-10-10), pages 1195 - 1208, XP004383143 * |
See also references of EP1626274A4 * |
Cited By (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2007535672A (ja) * | 2004-04-30 | 2007-12-06 | マイクロマス ユーケー リミテッド | 質量分析計 |
JP2008542690A (ja) * | 2005-05-06 | 2008-11-27 | エクソンモービル リサーチ アンド エンジニアリング カンパニー | 2次元(多次元)分離概念が例として適用された2次元(多次元)分離ガスクロマトグラフィーx質量分析(GCxMS)技術のための新規のデータ処理/視覚化の方法 |
US8323575B2 (en) | 2005-05-12 | 2012-12-04 | Waters Technologies Corporation | Visualization of chemical-analysis data |
JP2008541095A (ja) * | 2005-05-12 | 2008-11-20 | ウオーターズ・インベストメンツ・リミテツド | 化学分析データの視覚化 |
JP2008545973A (ja) * | 2005-06-03 | 2008-12-18 | ウオーターズ・インベストメンツ・リミテツド | 化学分析のためのポリペプチド関連情報のカタログの発生および使用 |
JP2009500617A (ja) * | 2005-07-08 | 2009-01-08 | メタノミクス ゲーエムベーハー | 化学試料を特徴づけるシステムおよび方法 |
JP2009503480A (ja) * | 2005-07-25 | 2009-01-29 | メタノミクス ゲーエムベーハー | クロマトグラフィー/マススペクトロメトリーを用いてサンプルを分析する手段と方法 |
JP2011203262A (ja) * | 2005-07-25 | 2011-10-13 | Metanomics Gmbh | クロマトグラフィー/マススペクトロメトリーを用いてサンプルを分析する手段と方法 |
WO2007023876A1 (ja) * | 2005-08-25 | 2007-03-01 | Mcbi, Inc. | 多数試料中に含まれる物質の存在量を比較解析する方法 |
JP2007108060A (ja) * | 2005-10-14 | 2007-04-26 | Kao Corp | 肌質の評価方法 |
JP2007147459A (ja) * | 2005-11-28 | 2007-06-14 | Kazusa Dna Kenkyusho | 情報処理装置、プログラム、及びコンピュータ読み取り可能な記録媒体 |
JP2008249440A (ja) * | 2007-03-30 | 2008-10-16 | Hitachi High-Technologies Corp | クロマトグラフィー質量分析方法、及びクロマトグラフ質量分析装置 |
US7742879B2 (en) | 2007-03-30 | 2010-06-22 | Hitachi High-Technologies Corporation | Method and apparatus for chromatography mass spectrometry |
JP4602374B2 (ja) * | 2007-03-30 | 2010-12-22 | 株式会社日立ハイテクノロジーズ | クロマトグラフィー質量分析方法、及びクロマトグラフ質量分析装置 |
JP2008261741A (ja) * | 2007-04-12 | 2008-10-30 | Kao Corp | 肌質判定方法 |
JP2009080010A (ja) * | 2007-09-26 | 2009-04-16 | Hitachi High-Technologies Corp | 質量分析システム |
WO2009051277A1 (ja) | 2007-10-18 | 2009-04-23 | Medical Proteoscope Co., Ltd. | 手術後の予後を推定する方法及び診断キット |
JP2009156722A (ja) * | 2007-12-27 | 2009-07-16 | Hitachi High-Technologies Corp | 質量分析スペクトル分析方法 |
JP2009170238A (ja) * | 2008-01-16 | 2009-07-30 | Hitachi Ltd | 質量分析計及び質量分析方法 |
JP2009294231A (ja) * | 2009-09-24 | 2009-12-17 | Hitachi High-Technologies Corp | クロマトグラフィー質量分析方法、及びクロマトグラフ質量分析装置 |
JP2011220907A (ja) * | 2010-04-13 | 2011-11-04 | Shimadzu Corp | クロマトグラムデータ処理方法及び装置 |
WO2012118209A1 (ja) * | 2011-03-02 | 2012-09-07 | 株式会社堀場エステック | ガスクロマトグラフ用データ処理装置及びそれに用いられるデータ処理プログラム |
US9222923B2 (en) | 2011-03-02 | 2015-12-29 | Horiba Stec, Co., Ltd. | Data processing device for gas chromatograph and data processing program used in same |
US9500629B2 (en) | 2012-11-05 | 2016-11-22 | Shimadzu Corporation | Chromatogram data processing method and device |
JP2014134504A (ja) * | 2013-01-11 | 2014-07-24 | Toyama Univ | 試料解析装置、試料解析方法、及び試料解析プログラム |
WO2014128912A1 (ja) * | 2013-02-22 | 2014-08-28 | 株式会社島津製作所 | データ処理装置及びデータ処理方法 |
CN105008908A (zh) * | 2013-02-22 | 2015-10-28 | 株式会社岛津制作所 | 数据处理装置以及数据处理方法 |
JP6004080B2 (ja) * | 2013-02-22 | 2016-10-05 | 株式会社島津製作所 | データ処理装置及びデータ処理方法 |
JPWO2014128912A1 (ja) * | 2013-02-22 | 2017-02-02 | 株式会社島津製作所 | データ処理装置及びデータ処理方法 |
WO2018134998A1 (ja) * | 2017-01-23 | 2018-07-26 | 株式会社島津製作所 | クロマトグラムデータ処理装置 |
JPWO2018134998A1 (ja) * | 2017-01-23 | 2019-06-27 | 株式会社島津製作所 | クロマトグラムデータ処理装置 |
Also Published As
Publication number | Publication date |
---|---|
EP1626274A1 (en) | 2006-02-15 |
JP2008241721A (ja) | 2008-10-09 |
CA2521108A1 (en) | 2004-10-21 |
US20060194329A1 (en) | 2006-08-31 |
JPWO2004090526A1 (ja) | 2006-07-06 |
JP4185933B2 (ja) | 2008-11-26 |
EP1626274A4 (en) | 2009-08-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2004090526A1 (ja) | 試料解析方法及び試料解析プログラム | |
Cannon et al. | Ultraviolet photodissociation for characterization of whole proteins on a chromatographic time scale | |
Kolch et al. | Capillary electrophoresis–mass spectrometry as a powerful tool in clinical diagnosis and biomarker discovery | |
Pontillo et al. | CE‐MS‐based proteomics in biomarker discovery and clinical application | |
US7498568B2 (en) | Real-time analysis of mass spectrometry data for identifying peptidic data of interest | |
Pomastowski et al. | Two-dimensional gel electrophoresis in the light of new developments | |
Vlahou et al. | Proteomic approaches in the search for disease biomarkers | |
US20200144040A1 (en) | System and method of data-dependent acquisition by mass spectrometry | |
JP4160558B2 (ja) | 結果主導型ストラテジーを利用して生体分子を特徴付けるための方法 | |
US9472386B2 (en) | Chromatograph mass spectrometry data processing apparatus | |
EP2761288B1 (en) | Method for analysis of samples in targeted proteomics applications, computer program product and set of reference peptides | |
US20050116159A1 (en) | Mass spectrometic quantification of chemical mixture components | |
JP2006522340A (ja) | 質量分析データの分析法 | |
JP2009540319A (ja) | 質量分析バイオマーカーアッセイ | |
Gao et al. | Guidelines for the routine application of the peptide hits technique | |
JP2004506216A (ja) | 混合物の化学成分を同定および定量するための方法およびシステム | |
Lombard-Banek et al. | A microanalytical capillary electrophoresis mass spectrometry assay for quantifying angiotensin peptides in the brain | |
JP2005522713A (ja) | 生物学的分子の定量 | |
JP7431933B2 (ja) | 質量分析を使用する低存在量のポリペプチドの絶対定量のための方法 | |
US12072322B2 (en) | Method for matrix effect correction in quantitative mass spectrometric analysis of analytes in complex matrices | |
Kilpatrick et al. | Optimizing high-resolution mass spectrometry for the identification of low-abundance post-translational modifications of intact proteins | |
Salzano et al. | Mass spectrometry for protein identification and the study of post translational modifications | |
Štěpánová et al. | CE–MS Approaches for Peptidomics | |
Müller et al. | Molecular scanner experiment with human plasma: improving protein identification by using intensity distributions of matching peptide masses | |
Lu et al. | A new approach for sequencing human IRS1 phosphotyrosine‐containing peptides using CapLC‐Q‐TOFmicro |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2005505220 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006194329 Country of ref document: US Ref document number: 2521108 Country of ref document: CA Ref document number: 10551148 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2004724777 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2004724777 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 10551148 Country of ref document: US |