WO2013068781A1 - Methods and apparatus for image analysis using profile weighted intensity features - Google Patents
Methods and apparatus for image analysis using profile weighted intensity features
- Publication number
- WO2013068781A1 (PCT/IB2011/002971)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cell
- image
- profile
- border
- weighted
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/69—Microscopic objects, e.g. biological cells or cellular parts
- G06V20/695—Preprocessing, e.g. image segmentation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/42—Global feature extraction by analysis of the whole pattern, e.g. using frequency domain transformations or autocorrelation
Definitions
- This invention relates generally to image processing techniques for identifying and characterizing objects within digital images. More particularly, in certain embodiments, the invention relates to methods and apparatus for characterizing a new morphological feature, referred to herein as a profile weighted intensity feature, of cells in a digital image, and applying this feature in the automated classification of cell phenotype, for example, in image analysis software.
- An accurate and efficient automated cell phenotype classification method requires identifying morphological characteristics of individual cells, as pictured in a digital image, which are useful for distinguishing different cell phenotypes.
- image processing techniques to perform cell phenotype classification
- a cell type having a unique size may be identified by evaluating the sizes of the cells in the image.
- a cell type having a particular characteristic shape or color may be identified by evaluating the shapes and colors of the cells in the image. The more a morphological feature (e.g., size, shape, or color) varies from one cell type to the next, the more useful that feature is for distinguishing different types of cells during cell phenotype classification.
- Automated image processing techniques are useful to standardize the classification process for improved accuracy and speed of cell classification.
- existing automated image processing techniques are often incapable of distinguishing among the different cell phenotypes in an image.
- Existing image processing techniques can also be overly complicated, difficult to describe and implement, and computationally intensive.
- profile weighted intensity features are a type of weighted mean or sum intensity of one or more objects (e.g., cells) in an image.
- a computationally efficient and more accurate tool is provided for identifying and classifying objects in images. For example, it has been discovered that automated cell phenotype characterization is improved by determining a mean intensity that more heavily weights pixel intensity near the borders of the cell (e.g., the outer cell border and/or the border of the cell nucleus), and using this mean intensity value in the algorithm to characterize cell phenotype. It has also been discovered that relative (e.g., normalized) intensities may be more favorable than absolute intensities for cell classification because large cell-to-cell variations in cellular samples are cancelled out or reduced when divided by another intensity of the same cell. In one embodiment, the profile weighted intensity features described herein are normalized.
- a weighted mean intensity may be divided by a corresponding unweighted mean intensity, yielding a relative (unitless) feature.
- advances in the efficiency of certain computational steps, such as the use of a new sliding parabola erosion operation in the determination of a distance image, facilitate the efficient use of this new class of morphological features in the classification of objects (e.g., cells) in a digital image.
- the invention relates to a method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell.
- the method includes the steps of: (a) detecting a cell in an image; (b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell; and (c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
- the method includes the step of classifying cell phenotype using the computed profile weighted intensity feature.
- the border may be, for example, an outer border of the cell.
- the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized.
- the distance image is determined via sliding parabola erosion operation performed on a cell mask image.
- the calculated features include at least one of unnormalized intensities and normalized intensities.
- step (c) includes computing a profile weighted intensity feature for the cell from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
- the invention in another aspect, relates to a method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell.
- the method includes the steps of: (a) detecting a cell and detecting a nucleus of the cell from one or more images of the cell; (b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of both (i) the closest distance from the pixel to an outer border of the cell and (ii) the closest distance from the pixel to an outer border of a nucleus of the cell; and (c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
- the profile weighted intensity feature for the cell is computed from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
- the invention in another aspect, relates to a method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell.
- the method includes the steps of: (a) detecting a plurality of borders of a cell from one or more images of the cell; (b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the nearest distances from the pixel to each of the plurality of detected borders of the cell; and (c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
- the profile weighted intensity feature for the cell is computed from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
- the invention in another aspect, relates to an apparatus for determination of a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell.
- the apparatus includes a memory for storing a code defining a set of instructions, and a processor for executing the set of instructions.
- the code includes a profile weighted intensity module configured to: (i) detect a cell in an image; (ii) compute a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell (e.g., an outer border of the cell); and (iii) compute a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
- the profile weighted intensity module is configured to classify cell phenotype using the computed profile weighted intensity feature.
- the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized.
- the distance image is determined via sliding parabola erosion operation performed on a cell mask image.
- FIG. 1 is an image of a cell acquired using a detector that is sensitive to light emitted by a specific fluorophore, according to an illustrative embodiment of the present invention.
- FIG. 2 is an image of the cell of FIG. 1 acquired using a detector that is sensitive to a nucleus stain, according to an illustrative embodiment of the present invention.
- FIG. 3 is an image of the cell of FIG. 1 in which the cell and the cell nucleus have been identified, according to an illustrative embodiment of the present invention.
- FIG. 4 is a distance image for the cell of FIG. 1 in which pixel intensity is a function of distance from an outer border of the cell, according to an illustrative embodiment of the present invention.
- FIG. 5 is a profile image calculated from the distance image of FIG. 4, according to an illustrative embodiment of the present invention.
- FIG. 6 is a distance image for the cell of FIG. 1 in which pixel intensity is a function of distance from a nucleus border, according to an illustrative embodiment of the present invention.
- FIG. 7 is a profile image calculated from the distance image of FIG. 6, according to an illustrative embodiment of the present invention.
- FIG. 8 is an image obtained by applying a texture energy filter to the image of FIG. 1, according to an illustrative embodiment of the present invention.
- FIG. 9 is a flowchart of a method for determining profile weighted intensity features of cells useful for classifying cell phenotype, according to an illustrative embodiment of the present invention.
- FIGS. 10a and 10b include images from two different samples of a cell-based assay representing two different cell classes, according to an illustrative embodiment of the present invention.
- FIGS. 11a and 11b include images from the same cell-based assay depicted in FIGS. 10a and 10b, respectively, with the detected cells masked (i.e., pixels outside the cells are black) and cell and nucleus borders highlighted, according to an illustrative embodiment of the present invention.
- FIG. 12 is an x,y-plot of a best pair of features for separating the two classes of objects depicted in FIGS. 11a and 11b, according to an illustrative embodiment of the present invention.
- apparatus, systems, methods, and processes of the claimed invention encompass variations and adaptations developed using information from the embodiments described herein. Adaptation and/or modification of the apparatus, systems, methods, and processes described herein may be performed by those of ordinary skill in the relevant art.
- a cell phenotype classification process begins with the acquisition of an image or images depicting one or more cells.
- the next step is typically a segmentation step to detect one or more cells or portions of cells (e.g., a cell nucleus) in the image or images.
- a set of features is extracted from the identified cells and used to characterize and classify the cells.
- one or more image filters are applied as part of the feature extraction step.
- the filters may include one or more texture energy filters, which are a special subset of image filters with non-negative output.
- the same filter that is used for extraction of texture features is additionally used for extraction of morphology features.
- unweighted mean intensity of a texture-filtered image is a texture feature
- profile weighted mean intensity of the same texture-filtered image is a morphology feature.
- a distance image is an image in which pixel intensity is proportional to a distance between the pixel and some other location in the image.
- the intensity at each pixel within the cell may depend on the distance between the pixel and an edge of the cell, such as the outer edge of the cell or the edge of the nucleus.
- the relationship between intensity and distance may be linear or non-linear.
- Weighted sum calculation and weighted mean calculation are examples of image processing techniques. Weighted sum calculation is determined by applying weights to the pixel intensities in an image (e.g., using another image or a set of data to provide the weights) and then adding the weighted pixel intensities.
- weighted mean calculation is determined by applying weights to the pixel intensities in an image and then averaging the weighted pixel intensities.
- the resulting value obtained for both weighted sum calculation and weighted mean calculation is influenced more by some pixels (i.e., those that are weighted heavily) and less by other pixels (i.e., those that are not weighted heavily).
- the methods and apparatus described herein provide a new family of morphological features for characterizing and/or classifying one or more objects (e.g., biological cells) in an input image.
- the new family of morphological features is obtained by applying a filter (e.g., a texture energy filter) to the image.
- a distance image is calculated in which pixel intensity is a function of distance to the closest border pixel.
- the distance image is advantageously calculated using the sliding parabola erosion technique described in U.S. Patent Application No. 13/230,377, filed September 12, 2011, entitled "Methods and Apparatus for Fast Identification of Relevant Features for Classification or Regression," the disclosure of which is hereby incorporated by reference herein in its entirety.
- the profile image is then used as a weight image for calculating the mean or sum intensity of another image, such as the input image.
- FIGS. 1 through 8 are images of a cell 100, in accordance with certain embodiments of the invention.
- FIG. 1 is an image of the cell 100 acquired with a detector that is sensitive to light emitted by a specific fluorophore.
- Morphological features (e.g., numerical properties) within the cell 100 are indicated by variations in intensity within the image.
- the intensity variations may be quantified to identify and characterize morphological features of interest.
- FIG. 2 is another image of the cell 100 acquired with a detector that is sensitive to a nucleus stain. This type of image may be useful for nucleus detection.
- FIG. 3 is a similar image of the cell 100 after the cell 100, a cell nucleus 302, and cytoplasm 304 have been identified during segmentation. Pixels not belonging to the cell 100 are black. Pixels belonging to a border 306 of the nucleus 302 are highlighted.
- one input image is acquired for nuclei detection and another image is used for feature calculation or detection. In one embodiment, more than one image may be used for calculation of features.
- FIG. 4 is a distance image for the cell 100 in which pixels outside of the cell 100 are black and each pixel within the cell 100 has an intensity that is equal to or a function of (e.g., a linear function, a polynomial function, an exponential function, and/or a trigonometric function) the distance between the pixel and an outer border 402 of the cell 100.
- FIG. 5 is a profile image for the cell 100 calculated from the distance image of FIG. 4. The profile image characterizes or emphasizes the part of cytoplasm 304 closest to the outer border 402 (e.g., a cell membrane).
- FIG. 6 is another distance image for the cell 100 in which each pixel within cytoplasm 304 of the cell 100 has an intensity equal to a function of the distance from the nucleus border 306. Pixels outside of the cell 100 and pixels within the nucleus 302 are black.
- FIG. 7 is a profile image for the cell 100 calculated from the distance image of FIG. 6. The profile image characterizes or emphasizes a part of cytoplasm 304 closest to the nucleus 302.
- a profile image is a function of a single distance, even if there is more than one border (e.g., the nucleus border 306 and the outer border 402), as shown in FIGS. 5 and 7.
- the profile image is equal to or a function of two distances.
- a distance image may be generated in which the intensity of the distance image is a function of distance from a nucleus border and an outer cell border.
- a profile image may then be generated from the distance image such that the intensities in the profile image are a function of the distances from the two borders. The essence of the step of calculating a profile image does not depend on the number of borders considered for the calculation.
- each profile image is calculated from a given distance image.
- each profile image may be obtained using a different mathematical relationship between pixel intensity in the distance image and pixel intensity in the profile image.
- FIG. 8 is an image of the cell 100 obtained by applying a texture energy filter to the image of FIG. 1. As depicted, the texture energy filter detects and visualizes ridge-like structures 802 within the cell 100. In certain embodiments, morphological features characterizing the cell 100 are derived from this image in addition to or instead of the image of FIG. 1.
- an object includes more than one border.
- a cell may have a border at its outer edge (i.e., a cell border) and a border at the edge of its nucleus (i.e., a nucleus border).
- the intensity of a pixel within the cell may be a function of the distance to the cell border and the distance to the nucleus border.
- intensities computed with the profile function (i.e., the function used to calculate intensities within a cell in a profile image) may be defined as a function of both distances in such a case.
- a profile function is a function of a distance, or of several distances if there are multiple borders. For example, the profile function may be profile = exp(-distance^2 / (2*w^2)), where profile is the pixel intensity within a cell of the profile image, distance is the distance from the pixel to a border, and w is a width parameter.
- w is the width of the region near the border that will be characterized by the given intensity profile feature.
- the width w may be specified by a user and/or may have a default value.
- the profile function is high near the border of the cell and low when distance to the border is much higher than parameter w.
- the profile image must be calculated individually for each cell, since each cell has its own border shape.
- the profile image is used as a weight function to calculate the weighted average intensity of the input image for each cell.
- each cell is characterized by a single numeric attribute, which may be referred to as the profile weighted intensity feature.
- the profile weighted intensity feature may be calculated for each combination of input and profile images.
- FIG. 9 is a flowchart of a method 900 for determining profile weighted intensity features of cells useful for classifying cell phenotype, in accordance with one embodiment of the invention.
- the features are determined from one or more images of the cells.
- the method 900 includes detecting a cell in an image.
- the method 900 includes computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell.
- the border may be, for example, a cell border and/or a nucleus border. In one embodiment, the border is identified by applying an image filter and/or mask to the input image.
- the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized.
- the method 900 includes computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity.
- the method 900 may also include classifying cell phenotype using at least the computed profile weighted intensity.
- the method 900 determines a distance image by performing a sliding parabola erosion operation on a mask image of the cell, thereby producing an eroded mask image that is related to or a known function of the distance image.
- the parabola-eroded mask image may then be used to determine or reconstruct the distance image.
- the profile image is not necessarily equal to or proportional to the distance image.
- Other functions of the distance image may be utilized to determine the profile image, for example exponential, Gaussian, or step functions.
- sliding parabola erosion may be a preferable method for determining the distance image
- other operations or techniques may also be used.
- the distance image may be determined using rolling ball erosion and/or binary mask erosion/dilation operations, instead of sliding parabola erosion.
- the distance image is determined by calculating the distance between each pixel and one or more borders.
- sliding parabola erosion is a preferred technique because it is inexpensive (i.e., not computationally intensive) and accurate; a sketch of the underlying parabolic-envelope computation is given below.
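The fast sliding parabola erosion itself is described in the incorporated application and is not reproduced here; the sketch below uses the closely related separable lower-envelope-of-parabolas computation (the Felzenszwalb-Huttenlocher squared distance transform), in which a parabola is effectively slid along each row and then each column of the mask. It is offered only as an illustration of why the operation is cheap (two linear passes over the image), not as the patented algorithm; all names are illustrative.

```python
import numpy as np

BIG = 1e12  # finite stand-in for "infinity" on foreground pixels

def _parabola_envelope_1d(f):
    """One 1-D pass: lower envelope of the parabolas q -> f[q] + (x - q)^2."""
    n = len(f)
    d = np.empty(n)
    v = np.zeros(n, dtype=int)       # indices of parabolas forming the envelope
    z = np.full(n + 1, np.inf)       # boundaries between neighbouring parabolas
    z[0] = -np.inf
    k = 0
    for q in range(1, n):
        s = ((f[q] + q * q) - (f[v[k]] + v[k] * v[k])) / (2 * q - 2 * v[k])
        while s <= z[k]:
            k -= 1
            s = ((f[q] + q * q) - (f[v[k]] + v[k] * v[k])) / (2 * q - 2 * v[k])
        k += 1
        v[k] = q
        z[k] = s
        z[k + 1] = np.inf
    k = 0
    for q in range(n):
        while z[k + 1] < q:
            k += 1
        d[q] = (q - v[k]) ** 2 + f[v[k]]
    return d

def distance_image(cell_mask):
    """Distance (in pixels) from each cell pixel to the nearest background pixel."""
    f = np.where(cell_mask, BIG, 0.0)
    f = np.apply_along_axis(_parabola_envelope_1d, 1, f)   # slide along rows
    f = np.apply_along_axis(_parabola_envelope_1d, 0, f)   # slide along columns
    return np.sqrt(np.minimum(f, BIG)) * cell_mask
```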
- the method 900 may also include calculating a profile weighted intensity for the cell.
- image intensity is weighted by one or more profile functions, each calculated from one or more distance images.
- the pixel intensity of a profile image is a function of both (i) the closest distance from the pixel to an outer border of the cell and (ii) the closest distance from the pixel to an outer border of a nucleus of the cell.
- the pixel intensity of a profile image is a function of the nearest distances from the pixel to each of a plurality of detected borders of the cell.
- the profile weighted intensity feature for the cell may be calculated using the profile image of the cell as a weight image.
- calculation of the profile weighted intensity involves (i) filtering the image, (ii) calculating a distance image R(x,y), or a series of distance images R1(x,y), R2(x,y), ..., RN(x,y), and (iii) calculating a profile image using a profile function W(R) or W(R1, R2, ..., RN), or a series of profile functions W1(R), W2(R), ..., WM(R), or W1(R1, R2, ..., RN), W2(R1, R2, ..., RN), ..., WM(R1, R2, ..., RN).
- Calculation of a distance image R(x,y) may take into consideration one or more borders, such as the cell border and the nucleus border.
- the distance from border may be measured in two different directions, with one of the directions considered to be a negative direction.
- sliding parabola erosion may be used to calculate a distance image R(x,y).
- Calculation of a profile function, or a series of profile functions also takes into consideration the distance from one or more borders. If there is more than one distance function (e.g., a function for the distance from cell border and a function for the distance from the nucleus border), then the set of profile functions may be diversified.
- the profile image is obtained by evaluating the profile function at each pixel: W(x, y) = W(R(x, y)).
- a non-normalized feature is the mean intensity of an input image weighted by a profile function. For example, if the profile image is denoted by W and the input image is denoted by I, then the non-normalized profile feature F_u is expressed as F_u = Σ W(x,y) I(x,y) / Σ W(x,y), where the sums run over the pixels of the cell.
- the normalized profile feature F_n is obtained by dividing F_u by the corresponding unweighted mean intensity of the same cell, F_n = F_u / ((1/N) Σ I(x,y)), where N is the number of pixels of the cell, yielding a relative (unitless) feature (see the sketch below).
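A direct NumPy transcription of the two features, assuming the input image I, the profile image W, and a boolean cell mask are available as arrays of equal shape (names are illustrative):

```python
import numpy as np

def profile_features(I, W, cell_mask):
    """F_u = sum(W*I)/sum(W) over the cell; F_n = F_u divided by the unweighted mean."""
    w = W[cell_mask]
    i = I[cell_mask]
    f_u = np.sum(w * i) / np.sum(w)   # non-normalized: profile weighted mean intensity
    f_n = f_u / np.mean(i)            # normalized: relative, unitless feature
    return f_u, f_n
```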
- Filtering the input image is an optional step.
- the filtering may be performed using one or more texture energy filters, for example.
- profile images are created using several different methods. For example, a single distance image may be used to create a plurality of profile images. A different mathematical function may be used for each profile image to relate intensity in the profile image to intensity in the distance image.
- profile images are created from distance images having multiple borders, such as the distance image in FIG. 6. Compared to a single border, two or more borders allow a higher diversity of profile images to be created. The number of features may be further diversified by using not only the original image but also a filtered image or even a set of filtered images. For each filtered image, additional profile images may be calculated and, as described here, used to identify and characterize morphological features. In certain embodiments, the calculated features may be unnormalized or normalized intensities.
- FIGS. 10a through 12 illustrate the use of profile weighted intensity features for the classification of cell phenotype, in accordance with certain embodiments of the invention.
- FIGS. 10a and 10b include a pair of images of cells 1000 from two different samples of a cell-based assay.
- FIGS. 11a and 11b include images of the same cells 1000 in FIGS. 10a and 10b, respectively, but the detected cells are masked (i.e., pixels outside the cells are turned black) and cell borders and nucleus borders are highlighted. Comparing the pair of figures, one may notice that the cell class depicted in FIG. 11b includes bright spotty regions 1100 around the nuclei of the cells 1000. These spotty regions 1100 are less noticeable and located more randomly in the cells 1000 of the other cell class depicted in FIG. 11a.
- FIG. 12 includes an x,y-plot of the best pair of features for separating the two classes of objects depicted in FIGS. 11a and 11b.
- the best pair of features for separating the two classes has been automatically selected by the computer among 165 features routinely calculated for each cell.
- Each data point represents a cell, with the white and black circles corresponding to two classes of cells, each from different control samples.
- the x-axis in this figure is a profile weighted intensity feature calculated from a dark-filtered image. This feature characterizes texture in a region of cytoplasm near the nucleus. Indeed, in this region, the cells of the two classes differ the most.
- the grouping or separation of the black and white circles in this figure indicates that the weighted profile feature (represented by the x-axis) is useful for distinguishing between different cell phenotypes.
- the methods and apparatus described herein are applicable universally to data or images of any dimensionality. For example, the equations presented above can be easily applied to a 3-D image, in which R and W are functions of x, y, and z. Because 3-D images generally have more voxels than 2-D images, calculation times are generally longer for 3-D images.
- the methods described above are implemented on a computer using computer software.
- the software includes three building blocks to calculate intensity properties, calculate texture properties, and calculate morphology properties.
- the software implements a method for extracting many morphology features in parallel, thereby enabling features from several (e.g., five) different families to be combined.
- One of the families is profile weighted intensity.
- to calculate profile weighted intensity features, a user applies the building block for calculating morphology properties. The user then ensures that the family of profile weighted intensity features is selected (by default, it is selected).
- a user may select one or more filters (e.g., an energy filter) to apply and specify input parameters for the filter(s).
- a wide set of features is calculated automatically whenever the user requests a classification or regression task. Later, when the tuning is completed (i.e. the relevant features have been identified), only the relevant features will be calculated.
- the most expensive step in the computations may be the calculation of the distance image (or more than one distance image if different borders are involved).
- sliding parabola erosion may be employed.
- the calculation time for performing sliding parabola erosion on a 1360 x 1024 pixel image is about 0.17 seconds, using a Dell Latitude 630 (2.2 GHz laptop). All other operations are relatively cheap (i.e., faster).
- once a distance image has been computed, the whole set of profile features derived from it is calculated at little or no additional cost. By comparison, when there are several distance functions in a set, each distance function requires approximately the same amount of computation time.
- embodiments of the present invention may be provided as one or more computer-readable programs embodied on or in one or more articles of manufacture.
- the article of manufacture may be any suitable hardware apparatus, such as, for example, a floppy disk, a hard disk, a CD-ROM, a CD-RW, a CD-R, a DVD-ROM, a DVD-RW, a DVD-R, a flash memory card, a PROM, a RAM, a ROM, or a magnetic tape.
- the computer-readable programs may be implemented in any programming language. Some examples of languages that may be used include C, C++, or JAVA.
- the software programs may be further translated into machine language or virtual machine instructions and stored in a program file in that form. The program file may then be stored on or in one or more of the articles of manufacture.
- a computer hardware apparatus may be used in carrying out any of the methods described herein.
- the apparatus may include, for example, a general purpose computer, an embedded computer, a laptop or desktop computer, or any other type of computer that is capable of running software, issuing suitable control commands, receiving graphical user input, and recording information.
- the computer typically includes one or more central processing units for executing the instructions contained in software code that embraces one or more of the methods described herein.
- the software may include one or more modules recorded on machine-readable media, where the term machine-readable media encompasses software, hardwired logic, firmware, object code, and the like. Additionally, communication buses and I/O ports may be provided to link any or all of the hardware components together and permit communication with other computers and computer networks, including the internet, as desired.
- the computer may include a memory or register for storing data.
- the modules described herein may be software code or portions of software code.
- a module may be a single subroutine, more than one subroutine, and/or portions of one or more subroutines.
- the module may also reside on more than one machine or computer.
- a module defines data by creating the data, receiving the data, and/or providing the data.
- the module may reside on a local computer, or may be accessed via network, such as the Internet. Modules may overlap - for example, one module may contain code that is part of another module, or is a subset of another module.
- the computer can be a general purpose computer, such as a commercially available personal computer that includes a CPU, one or more memories, one or more storage media, one or more output devices, such as a display, and one or more input devices, such as a keyboard.
- the computer operates using any commercially available operating system, such as any version of the Windows™ operating systems from Microsoft Corporation of Redmond, Wash., or the Linux™ operating system from Red Hat Software of Research Triangle Park, N.C.
- the computer is programmed with software including commands that, when operating, direct the computer in the performance of the methods of the invention.
- commands can be provided in the form of software, in the form of programmable hardware such as flash memory, ROM, or programmable gate arrays (PGAs), in the form of hard-wired circuitry, or in some combination of two or more of software, programmed hardware, or hard-wired circuitry.
- Commands that control the operation of a computer are often grouped into units that perform a particular action, such as receiving information, processing information or data, and providing information to a user.
- Such a unit can comprise any number of instructions, from a single command, such as a single machine language instruction, to a plurality of commands, such as a plurality of lines of code written in a higher level programming language such as C++.
- Such units of commands are referred to generally as modules, whether the commands include software, programmed hardware, hard-wired circuitry, or a combination thereof.
- the computer and/or the software includes modules that accept input from input devices, that provide output signals to output devices, and that maintain the orderly operation of the computer.
- the computer also includes at least one module that renders images and text on the display.
- the computer is a laptop computer, a minicomputer, a mainframe computer, an embedded computer, or a handheld computer.
- the memory is any conventional memory such as, but not limited to, semiconductor memory, optical memory, or magnetic memory.
- the storage medium is any conventional machine-readable storage medium such as, but not limited to, floppy disk, hard disk, CD-ROM, and/or magnetic tape.
- the display is any conventional display such as, but not limited to, a video monitor, a printer, a speaker, or an alphanumeric display.
- the input device is any conventional input device such as, but not limited to, a keyboard, a mouse, a touch screen, a microphone, and/or a remote control.
- the computer can be a stand-alone computer or interconnected with at least one other computer by way of a network. This may be an internet connection.
- an "image" - for example, an image of one or more cells - includes any visual representation, such as a photo, a video frame, streaming video, as well as any electronic, digital or mathematical analogue of a photo, video frame, or streaming video.
- the methods and apparatus described herein are used for cell phenotype classification and may include the feature selection module described in U.S. Patent Application No. 13/230,377, filed September 12, 2011, entitled "Methods and Apparatus for Fast Identification of Relevant Features for Classification or Regression," the disclosure of which is hereby incorporated by reference herein in its entirety.
- the methods and apparatus described herein utilize sliding parabola erosion.
- a sliding parabola erosion procedure is described in U.S. Patent Application No. 13/230,433, filed September 12, 2011, entitled "Methods and Apparatus for Image Analysis and Modification Using Fast Sliding Parabola Erosion," the disclosure of which is hereby incorporated by reference herein in its entirety.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
Abstract
A new morphological feature referred to herein as profile weighted intensity feature is provided, useful for automated classification of objects, such as cells, that are depicted in digital images. In certain embodiments, the profile weighted intensity feature is determined by automatically identifying a border of a cell in an input image, determining a distance image for the cell, computing a profile function for the cell, and computing a mean intensity of at least a portion of the input image weighted by the profile function for the cell.
Description
METHODS AND APPARATUS FOR IMAGE ANALYSIS USING PROFILE
WEIGHTED INTENSITY FEATURES
Technical Field
[0001] This invention relates generally to image processing techniques for identifying and characterizing objects within digital images. More particularly, in certain embodiments, the invention relates to methods and apparatus for characterizing a new morphological feature, referred to herein as a profile weighted intensity feature, of cells in a digital image, and applying this feature in the automated classification of cell phenotype, for example, in image analysis software.
Background
[0002] The ability to automatically classify objects into categories of interest has applications across a wide range of industries and scientific fields, including biology, social sciences, and finance. One particular application of interest is the classification of biological cells according to cell phenotype.
[0003] An accurate and efficient automated cell phenotype classification method requires identifying morphological characteristics of individual cells, as pictured in a digital image, which are useful for distinguishing different cell phenotypes. Thus, when using image processing techniques to perform cell phenotype classification, it is desired to identify morphological features that vary according to the different cell types in an image and are characteristic of those cell types. A cell type having a unique size, for example, may be identified by evaluating the sizes of the cells in the image. Likewise, a cell type having a particular characteristic shape or color may be identified by evaluating the shapes and colors of the cells in the image. The more a morphological feature (e.g., size, shape, or color) varies from one cell type to the next, the more useful that feature is for distinguishing different types of cells during cell phenotype classification.
[0004] Automated image processing techniques are useful to standardize the classification process for improved accuracy and speed of cell classification. However, existing automated image processing techniques are often incapable of distinguishing among the different cell
phenotypes in an image. Existing image processing techniques can also be overly complicated, difficult to describe and implement, and computationally intensive.
[0005] There is a need for more accurate and efficient image processing techniques for identifying different types of objects in an image. In particular, there is a need for new morphological features that may be used to characterize cells in an image, for the purpose of automated cell phenotype classification.
Summary of the Invention
[0006] The methods and apparatus described herein are capable of robust and efficient identification and characterization of morphological features of objects within an image. A new family of morphological features, referred to herein as profile weighted intensity features, is provided. These profile weighted intensity features are a type of weighted mean or sum intensity of one or more objects (e.g., cells) in an image.
[0007] By utilizing these new morphological features, a computationally efficient and more accurate tool is provided for identifying and classifying objects in images. For example, it has been discovered that automated cell phenotype characterization is improved by determining a mean intensity that more heavily weights pixel intensity near the borders of the cell (e.g., the outer cell border and/or the border of the cell nucleus), and using this mean intensity value in the algorithm to characterize cell phenotype. It has also been discovered that relative (e.g., normalized) intensities may be more favorable than absolute intensities for cell classification because large cell-to-cell variations in cellular samples are cancelled out or reduced when divided by another intensity of the same cell. In one embodiment, the profile weighted intensity features described herein are normalized. For example, for a given cell, a weighted mean intensity may be divided by a corresponding unweighted mean intensity, yielding a relative (unitless) feature. Furthermore, advances in the efficiency of certain computational steps - for example, the use of a new sliding parabola erosion operation in the determination of a distance image - facilitate the efficient use of this new class of morphological features in the classification of objects (e.g., cells) in a digital image.
[0008] In one aspect, the invention relates to a method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell. The method includes the steps of: (a) detecting a cell in an image; (b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell;
and (c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
[0009] In certain embodiments, the method includes the step of classifying cell phenotype using the computed profile weighted intensity feature. The border may be, for example, an outer border of the cell. In one embodiment, the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized. In certain embodiments, the distance image is determined via sliding parabola erosion operation performed on a cell mask image. In one embodiment, the calculated features include at least one of unnormalized intensities and normalized intensities. In certain embodiments, step (c) includes computing a profile weighted intensity feature for the cell from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
[0010] In another aspect, the invention relates to a method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell. The method includes the steps of: (a) detecting a cell and detecting a nucleus of the cell from one or more images of the cell; (b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of both (i) the closest distance from the pixel to an outer border of the cell and (ii) the closest distance from the pixel to an outer border of a nucleus of the cell; and (c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell. In certain embodiments, the profile weighted intensity feature for the cell is computed from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
[0011] In another aspect, the invention relates to a method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell. The method includes the steps of: (a) detecting a plurality of borders of a cell from one or more images of the cell; (b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the nearest distances from the pixel to each of the plurality of detected borders of the cell; and (c) computing a profile weighted intensity feature for the cell using the profile image of the cell as
a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell. In certain embodiments, the profile weighted intensity feature for the cell is computed from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
[0012] In another aspect, the invention relates to an apparatus for determination of a profile weighted intensity feature for a cell useful for classifying cell phenotype, the profile weighted intensity feature determined from one or more images of the cell. The apparatus includes a memory for storing a code defining a set of instructions, and a processor for executing the set of instructions. The code includes a profile weighted intensity module configured to: (i) detect a cell in an image; (ii) compute a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell (e.g., an outer border of the cell); and (iii) compute a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
[0013] In certain embodiments, the profile weighted intensity module is configured to classify cell phenotype using the computed profile weighted intensity feature. In certain embodiments, the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized. In certain embodiments, the distance image is determined via sliding parabola erosion operation performed on a cell mask image.
[0014] Elements of embodiments described with respect to a given aspect of the invention may be used in various embodiments of another aspect of the invention. For example, it is contemplated that features of dependent claims depending from one independent claim can be used in apparatus and/or methods of any of the other independent claims.
Brief Description of the Drawings
[0015] The objects and features of the invention can be better understood with reference to the drawing described below, and the claims.
[0016] FIG. 1 is an image of a cell acquired using a detector that is sensitive to light emitted by a specific fluorophore, according to an illustrative embodiment of the present invention.
[0017] FIG. 2 is an image of the cell of FIG. 1 acquired using a detector that is sensitive to a nucleus stain, according to an illustrative embodiment of the present invention.
[0018] FIG. 3 is an image of the cell of FIG. 1 in which the cell and the cell nucleus have been identified, according to an illustrative embodiment of the present invention.
[0019] FIG. 4 is a distance image for the cell of FIG. 1 in which pixel intensity is a function of distance from an outer border of the cell, according to an illustrative embodiment of the present invention.
[0020] FIG. 5 is a profile image calculated from the distance image of FIG. 4, according to an illustrative embodiment of the present invention.
[0021] FIG. 6 is a distance image for the cell of FIG. 1 in which pixel intensity is a function of distance from a nucleus border, according to an illustrative embodiment of the present invention.
[0022] FIG. 7 is a profile image calculated from the distance image of FIG. 6, according to an illustrative embodiment of the present invention.
[0023] FIG. 8 is an image obtained by applying a texture energy filter to the image of FIG. 1, according to an illustrative embodiment of the present invention.
[0024] FIG. 9 is a flowchart of a method for determining profile weighted intensity features of cells useful for classifying cell phenotype, according to an illustrative embodiment of the present invention.
[0025] FIGS. 10a and 10b include images from two different samples of a cell-based assay representing two different cell classes, according to an illustrative embodiment of the present invention.
[0026] FIGS. 11a and 11b include images from the same cell-based assay depicted in FIGS. 10a and 10b, respectively, with the detected cells masked (i.e., pixels outside the cells are black) and cell and nucleus borders highlighted, according to an illustrative embodiment of the present invention.
[0027] FIG. 12 is an x,y-plot of a best pair of features for separating the two classes of objects depicted in FIGS. 11a and 11b, according to an illustrative embodiment of the present invention.
Description
[0028] It is contemplated that apparatus, systems, methods, and processes of the claimed invention encompass variations and adaptations developed using information from the embodiments described herein. Adaptation and/or modification of the apparatus, systems,
methods, and processes described herein may be performed by those of ordinary skill in the relevant art.
[0029] Throughout the description, where systems are described as having, including, or comprising specific components, or where processes and methods are described as having, including, or comprising specific steps, it is contemplated that, additionally, there are systems of the present invention that consist essentially of, or consist of, the recited components, and that there are processes and methods according to the present invention that consist essentially of, or consist of, the recited processing steps.
[0030] It should be understood that the order of steps or order for performing certain actions is immaterial so long as the invention remains operable. Moreover, two or more steps or actions may be conducted simultaneously.
[0031] The mention herein of any publication, for example, in the Background section, is not an admission that the publication serves as prior art with respect to any of the claims presented herein. The Background section is presented for purposes of clarity and is not meant as a description of prior art with respect to any claim.
[0032] When using image processing techniques, a cell phenotype classification process begins with the acquisition of an image or images depicting one or more cells. The next step is typically a segmentation step to detect one or more cells or portions of cells (e.g., a cell nucleus) in the image or images. Next, a set of features is extracted from the identified cells and used to characterize and classify the cells. In certain embodiments, one or more image filters are applied as part of the feature extraction step. The filters may include one or more texture energy filters, which are a special subset of image filters with non-negative output. In certain embodiments, the same filter that is used for extraction of texture features is additionally used for extraction of morphology features. In one embodiment, unweighted mean intensity of a texture-filtered image is a texture feature, and profile weighted mean intensity of the same texture-filtered image is a morphology feature.
[0033] Among the different types of images, a distance image is an image in which pixel intensity is proportional to a distance between the pixel and some other location in the image. For example, in a distance image of a biological cell, the intensity at each pixel within the cell may depend on the distance between the pixel and an edge of the cell, such as the outer edge of the cell or the edge of the nucleus. The relationship between intensity and distance may be linear or non-linear.
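For readers who want to experiment, the sketch below builds a toy distance image with NumPy and SciPy. The patent computes its distance images with sliding parabola erosion; here SciPy's standard Euclidean distance transform is used purely as a stand-in to illustrate what a distance image contains, and the toy mask and names are illustrative assumptions.

```python
import numpy as np
from scipy import ndimage as ndi

# Toy "cell": a 5x5 square of foreground pixels inside a 9x9 image.
cell_mask = np.zeros((9, 9), dtype=bool)
cell_mask[2:7, 2:7] = True

# distance_transform_edt assigns every nonzero pixel its Euclidean distance to
# the nearest zero pixel, i.e. the distance to the cell's outer border region.
dist_to_border = ndi.distance_transform_edt(cell_mask)
print(dist_to_border[4, 4])  # centre pixel: 3.0 (three pixels from the background)
```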
[0034] Weighted sum calculation and weighted mean calculation are examples of image processing techniques. Weighted sum calculation is determined by applying weights to the pixel intensities in an image (e.g., using another image or a set of data to provide the weights) and then adding the weighted pixel intensities. Similarly, weighted mean calculation is determined by applying weights to the pixel intensities in an image and then averaging the weighted pixel intensities. The resulting value obtained for both weighted sum calculation and weighted mean calculation is influenced more by some pixels (i.e., those that are weighted heavily) and less by other pixels (i.e., those that are not weighted heavily).
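As a concrete illustration of the two calculations, the following NumPy sketch computes a weighted sum and a weighted mean of a small intensity patch; the numbers are arbitrary and only show how the heavily weighted pixels dominate the result.

```python
import numpy as np

intensity = np.array([[10., 20., 30.],
                      [40., 50., 60.],
                      [70., 80., 90.]])
weights = np.array([[0.0, 0.5, 1.0],
                    [0.0, 0.5, 1.0],
                    [0.0, 0.5, 1.0]])   # e.g. supplied by another image

weighted_sum = np.sum(weights * intensity)       # heavily weighted pixels dominate
weighted_mean = weighted_sum / np.sum(weights)   # weighted average intensity
print(weighted_sum, weighted_mean)               # 255.0, ~56.7
```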
[0035] The methods and apparatus described herein provide a new family of morphological features for characterizing and/or classifying one or more objects (e.g., biological cells) in an input image. In one embodiment, the new family of morphological features is obtained by applying a filter (e.g., a texture energy filter) to the image. A distance image is calculated in which pixel intensity is a function of distance to the closest border pixel. For example, the distance image is advantageously calculated using the sliding parabola erosion technique described in U.S. Patent Application No. 13/230,377, filed September 12, 2011, entitled "Methods and Apparatus for Fast Identification of Relevant Features for Classification or Regression," the disclosure of which is hereby incorporated by reference herein in its entirety. A profile image computed from the distance image is then used as a weight image for calculating the mean or sum intensity of another image, such as the input image.
[0036] FIGS. 1 through 8 are images of a cell 100, in accordance with certain embodiments of the invention. FIG. 1 is an image of the cell 100 acquired with a detector that is sensitive to light emitted by a specific fluorophore. Morphological features (e.g., numerical properties) within the cell 100 are indicated by variations in intensity within the image. The intensity variations may be quantified to identify and characterize morphological features of interest.
[0037] FIG. 2 is another image of the cell 100 acquired with a detector that is sensitive to a nucleus stain. This type of image may be useful for nucleus detection. For example, FIG. 3 is a similar image of the cell 100 after the cell 100, a cell nucleus 302, and cytoplasm 304 have been identified during segmentation. Pixels not belonging to the cell 100 are black. Pixels belonging to a border 306 of the nucleus 302 are highlighted. In certain embodiments, one input image is acquired for nuclei detection and another image is used for feature calculation or detection. In one embodiment, more than one image may be used for calculation of features.
[0038] FIG. 4 is a distance image for the cell 100 in which pixels outside of the cell 100 are black and each pixel within the cell 100 has an intensity that is equal to or a function of (e.g., a linear function, a polynomial function, an exponential function, and/or a trigonometric function) the distance between the pixel and an outer border 402 of the cell 100. FIG. 5 is a profile image for the cell 100 calculated from the distance image of FIG. 4. The profile image characterizes or emphasizes the part of cytoplasm 304 closest to the outer border 402 (e.g., a cell membrane).
[0039] FIG. 6 is another distance image for the cell 100 in which each pixel within cytoplasm 304 of the cell 100 has an intensity equal to a function of the distance from the nucleus border 306. Pixels outside of the cell 100 and pixels within the nucleus 302 are black. FIG. 7 is a profile image for the cell 100 calculated from the distance image of FIG. 6. The profile image characterizes or emphasizes a part of cytoplasm 304 closest to the nucleus 302.
[0040] In certain embodiments, a profile image is a function of a single distance, even if there is more than one border (e.g., the nucleus border 306 and the outer border 402), as shown in FIGS. 5 and 7. In alternative embodiments, the profile image is equal to or a function of two distances. For example, a distance image may be generated in which the intensity of the distance image is a function of distance from a nucleus border and an outer cell border. A profile image may then be generated from the distance image such that the intensities in the profile image are a function of the distances from the two borders. The essence of the step of calculating a profile image does not depend on the number of borders considered for the calculation.
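The paragraph leaves the exact combination rule for two distances open; the sketch below shows one plausible choice, a sum of two Gaussian-shaped terms (one per border), purely as an illustration. The function name, the width parameter w, and the combination itself are assumptions, not the patented formula.

```python
import numpy as np

def two_border_profile(d_cell, d_nucleus, w=5.0):
    """Illustrative profile that is high near either the outer cell border or the
    nucleus border; d_cell and d_nucleus are distance images of equal shape."""
    return np.exp(-d_cell**2 / (2 * w**2)) + np.exp(-d_nucleus**2 / (2 * w**2))
```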
[0041] In certain embodiments, more than one profile image is calculated from a given distance image. For example, each profile image may be obtained using a different mathematical relationship between pixel intensity in the distance image and pixel intensity in the profile image.
[0042] FIG. 8 is an image of the cell 100 obtained by applying a texture energy filter to the image of FIG. 1. As depicted, the texture energy filter detects and visualizes ridge-like structures 802 within the cell 100. In certain embodiments, morphological features characterizing the cell 100 are derived from this image in addition to or instead of the image of FIG. 1.
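The patent does not specify which texture energy filter is applied; the sketch below shows one common Laws-style construction (convolution with a small band-pass kernel followed by a local mean of the squared response), which guarantees the non-negative output mentioned earlier. The kernel, window size, and names are illustrative assumptions.

```python
import numpy as np
from scipy import ndimage as ndi

def texture_energy(image, size=7):
    # Separable Laws-style mask: a smoothing profile in one direction and a
    # band-pass ("spot"-like) profile in the other.
    kernel = np.outer([1, 4, 6, 4, 1], [-1, 0, 2, 0, -1]) / 16.0
    response = ndi.convolve(image.astype(float), kernel, mode="reflect")
    return ndi.uniform_filter(response**2, size=size)   # local energy, always >= 0
```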
[0043] As mentioned above with respect to FIGS. 5 and 7, in certain embodiments, an object includes more than one border. For example, a cell may have a border at its outer edge (i.e., a
cell border) and a border at the edge of its nucleus (i.e., a nucleus border). When two borders are identified for a single cell, there are two distances to consider for calculation of the distance image and/or the profile image. For example, the intensity of a pixel within the cell may be a function of the distance to the cell border and the distance to the nucleus border. In other words, the profile function (i.e., the function used to calculate intensities within a cell in a profile image) may be defined as a function of both distances in such a case.
[0044] By convention, distances on opposite sides of a border are given opposite signs; that is, the distance is treated as a signed coordinate. From the viewpoint of the present methods and apparatus, this convention is useful because, for example, the profile function must be defined on both sides of the border between the nucleus and the cytoplasm.
[0045] A profile function is a function of a distance, or of several distances if there are several borders. For example, the profile function may be profile = exp(-distance*distance/(2*w^2)), where profile is pixel intensity within a cell of the profile image, distance is the distance from a pixel to a border (or from a pixel to more than one border), and w is a width parameter. In certain embodiments, w is the width of the region near the border that will be characterized by the given intensity profile feature. The width w may be specified by a user and/or may have a default value. In this particular example, the profile function is high near the border of the cell and low when the distance to the border is much larger than the parameter w. The profile image must be calculated individually for each cell, since each cell has an individually shaped border line.
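A minimal sketch of this Gaussian profile function, evaluated pixel-wise on a distance image; the default width and the optional cell mask (used to give zero weight outside the cell) are illustrative assumptions.

```python
import numpy as np

def gaussian_profile(distance_image, w=5.0, cell_mask=None):
    """profile = exp(-distance^2 / (2*w^2)), evaluated per pixel.

    distance_image: per-pixel distance to the border of interest.
    w:              width of the near-border region emphasised by the profile.
    cell_mask:      optional boolean array; pixels outside the cell get zero weight.
    """
    profile = np.exp(-(distance_image ** 2) / (2.0 * w ** 2))
    if cell_mask is not None:
        profile = profile * cell_mask
    return profile
```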
[0046] After the profile function has been used to calculate pixel intensities within the profile image, the profile image is used as a weight function to calculate the weighted average intensity of the input image for each cell. With this feature, each cell is characterized by a single numeric attribute, which may be referred to as the profile weighted intensity feature. When more than one input image and/or more than one profile image have been obtained, the profile weighted intensity feature may be calculated for each combination of input and profile images.
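In discrete form, this weighted average is simply a profile-weighted mean over the pixels of one cell; a brief sketch (the array names are assumptions):

```python
import numpy as np

def profile_weighted_intensity(input_image, profile_image, cell_mask):
    """Profile weighted intensity feature of one cell: the mean intensity of the
    input image over the cell's pixels, weighted by the profile image."""
    w = profile_image[cell_mask]
    i = input_image[cell_mask]
    return float(np.sum(w * i) / np.sum(w))
```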
[0047] FIG. 9 is a flowchart of a method 900 for determining profile weighted intensity features of cells useful for classifying cell phenotype, in accordance with one embodiment of the invention. The features are determined from one or more images of the cells. At step 902, the method 900 includes detecting a cell in an image. At step 904, the method 900 includes computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell. The border may be, for example, a
cell border and/or a nucleus border. In one embodiment, the border is identified by applying an image filter and/or mask to the input image. In certain embodiments, the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized. At step 906, the method 900 includes computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity. The method 900 may also include classifying cell phenotype using at least the computed profile weighted intensity.
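Putting the steps of method 900 together, one hedged end-to-end sketch is shown below; connected-component labelling stands in for whatever cell detection is actually used (step 902), and a Euclidean distance transform stands in for the distance-image computation, in place of the sliding parabola erosion described next.

```python
import numpy as np
from scipy import ndimage

def profile_weighted_features(input_image, cell_mask, w=5.0):
    """Return one profile weighted intensity feature per detected cell.

    input_image: 2-D intensity image.
    cell_mask:   2-D boolean array marking cell pixels (from segmentation).
    w:           width parameter of the Gaussian profile function.
    """
    labels, n_cells = ndimage.label(cell_mask)                # step 902: detect cells
    features = []
    for cell_id in range(1, n_cells + 1):
        cell = labels == cell_id
        distance = ndimage.distance_transform_edt(cell)        # distance to outer border
        profile = np.exp(-(distance ** 2) / (2.0 * w ** 2)) * cell    # step 904
        feature = np.sum(profile * input_image) / np.sum(profile)     # step 906
        features.append(float(feature))
    return features
```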
[0048] In certain embodiments, the method 900 determines a distance image by performing a sliding parabola erosion operation on a mask image of the cell, thereby producing an eroded mask image that is related to or a known function of the distance image. The parabola-eroded mask image may then be used to determine or reconstruct the distance image. However, the profile image is not necessarily equal to or proportional to the distance image. Other functions of the distance image may be utilized to determine the profile image, for example exponential, Gaussian, or step functions.
[0049] While sliding parabola erosion may be a preferable method for determining the distance image, other operations or techniques may also be used. For example, the distance image may be determined using rolling ball erosion and/or binary mask erosion/dilation operations, instead of sliding parabola erosion. In certain embodiments, the distance image is determined by calculating the distance between each pixel and one or more borders. In one embodiment, sliding parabola erosion is a preferred technique because it is inexpensive (i.e., not computationally intensive) and accurate.
[0050] The method 900 may also include calculating a profile weighted intensity for the cell. When calculating a weighted mean intensity or intensities, image intensity is weighted by one or more profile functions, each calculated from one or more distance images.
[0051] Alternatively, in certain embodiments, the pixel intensity of a profile image is a function of both (i) the closest distance from the pixel to an outer border of the cell and (ii) the closest distance from the pixel to an outer border of a nucleus of the cell. In another embodiment, the pixel intensity of a profile image is a function of the nearest distances from the pixel to each of a plurality of detected borders of the cell. When used for cell phenotype classification, the profile weighted intensity feature for the cell may be calculated using the profile image of the cell as a weight image.
[0052] In certain embodiments, calculation of the profile weighted intensity involves (i) filtering the image, (ii) calculating a distance image R(x,y), or a series of distance images R1(x,y), R2(x,y), . . . , RN(x,y), and (iii) calculating a profile image using a profile function W(R) or W(R1, R2, . . . , RN), or a series of profile functions W1(R), W2(R), . . . , WM(R), or W1(R1, R2, . . . , RN), W2(R1, R2, . . . , RN), . . . , WM(R1, R2, . . . , RN).
[0053] Calculation of a distance image R(x,y) may take into consideration one or more borders, such as the cell border and the nucleus border. In the case of an internal border (e.g., the nucleus border between a nucleus and a cytoplasm), the distance from the border may be measured in two different directions, with one of the directions considered to be a negative direction. As mentioned, sliding parabola erosion may be used to calculate a distance image R(x,y).
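One way to realise this signed-distance convention for an internal border such as the nucleus border, again sketched with a Euclidean distance transform rather than the sliding parabola erosion named in the text:

```python
import numpy as np
from scipy import ndimage

def signed_distance_to_nucleus(nucleus_mask, cell_mask):
    """Signed distance to the nucleus border: positive in the cytoplasm,
    negative inside the nucleus, zero outside the cell (by the convention
    chosen for this sketch)."""
    outward = ndimage.distance_transform_edt(~nucleus_mask)  # cytoplasm pixels: distance to nucleus
    inward = ndimage.distance_transform_edt(nucleus_mask)    # nucleus pixels: distance to cytoplasm
    return (outward - inward) * cell_mask
```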
[0054] Calculation of a profile function, or a series of profile functions, also takes into consideration the distance from one or more borders. If there is more than one distance function (e.g., a function for the distance from cell border and a function for the distance from the nucleus border), then the set of profile functions may be diversified. In one embodiment, the profile function is a function of the distance, R, i.e., W=W(R). The profile image is then
W(x, y) = W(R(x, y)) .
[0055] When calculating a feature characterizing an object, in certain embodiments, a non-normalized feature is the mean intensity of an input image weighted by a profile function. For example, if the profile image is denoted by W and the input image is denoted by I, then the non-normalized profile feature Fu is expressed as
Fu = [ ∫ W(x,y) I(x,y) dx dy ] / [ ∫ W(x,y) dx dy ]
In some embodiments, it is desirable to normalize the profile feature by dividing it by another intensity. This normalized profile feature Fn is expressed as
Fn = [ ∫ W(x,y) I(x,y) dx dy / ∫ W(x,y) dx dy ] · [ ∫ dx dy / ∫ I(x,y) dx dy ]
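Assuming this reconstruction of the formulas, the discrete analogue replaces the integrals by sums over the pixels of one cell; the unnormalized feature Fu is the weighted mean sketched earlier, and Fn divides it by the plain mean intensity of the same cell:

```python
import numpy as np

def normalized_feature(I, W, cell_mask):
    """Fn: the profile weighted mean intensity (Fu) of one cell divided by
    the unweighted mean intensity of the same cell."""
    i, w = I[cell_mask], W[cell_mask]
    fu = np.sum(w * i) / np.sum(w)   # weighted mean intensity (Fu)
    mean_i = np.mean(i)              # plain mean intensity of the cell
    return float(fu / mean_i)
```

Because the cell's own mean intensity appears in the denominator, the resulting ratio does not depend on the absolute intensity scale of the image.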
[0056] Filtering the input image is an optional step. The filtering may be performed using one or more texture energy filters, for example.
[0057] In certain embodiments, profile images are created using several different methods. For example, a single distance image may be used to create a plurality of profile images. A different mathematical function may be used for each profile image to relate intensity in the profile image to intensity in the distance image. In another approach, profile images are created from distance images having multiple borders, such as the distance image in FIG. 6. Compared to a single border, two or more borders allow a higher diversity of profile images to be created. The number of features may be further diversified by using not only the original image but also a filtered image or even a set of filtered images. For each filtered image, additional profile images may be calculated and, as described here, used to identify and characterize morphological features. In certain embodiments, the calculated features may be unnormalized or normalized intensities.
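For instance, a single distance image can already yield a small family of features by varying only the width parameter of the profile function; a brief sketch, with the particular widths chosen arbitrarily:

```python
import numpy as np

def profile_feature_family(input_image, distance_image, cell_mask, widths=(2.0, 5.0, 10.0)):
    """One profile weighted intensity feature per width parameter, all derived
    from the same distance image."""
    i = input_image[cell_mask]
    r = distance_image[cell_mask]
    features = []
    for w in widths:
        weights = np.exp(-(r ** 2) / (2.0 * w ** 2))
        features.append(float(np.sum(weights * i) / np.sum(weights)))
    return features
```

Applying the same loop to each filtered image (and to distance images from other borders) multiplies the number of features accordingly.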
[0058] FIGS. 10a through 12 illustrate the use of profile weighted intensity features for the classification of cell phenotype, in accordance with certain embodiments of the invention.
FIGS. 10a and 10b include a pair of images of cells 1000 from two different samples of a cell-based assay. FIGS. 11a and 11b include images of the same cells 1000 in FIGS. 10a and 10b, respectively, but the detected cells are masked (i.e., pixels outside the cells are turned black) and cell borders and nucleus borders are highlighted. Comparing the pair of figures, one may notice that the cell class depicted in FIG. 11b includes bright spotty regions 1100 around the nuclei of the cells 1000. These spotty regions 1100 are less noticeable and located more randomly in the cells 1000 of the other cell class depicted in FIG. 11a.
[0059] FIG. 12 includes an x,y-plot of the best pair of features for separating the two classes of objects depicted in FIGS. 11a and 11b. The best pair of features for separating the two classes has been automatically selected by the computer from among 165 features routinely calculated for each cell. Each data point represents a cell, with the white and black circles corresponding to two classes of cells, each from different control samples. The x-axis in this figure is a profile weighted intensity feature calculated from a dark-filtered image. This feature characterizes texture in a region of cytoplasm near the nucleus. Indeed, in this region, the cells of the two classes differ the most. The grouping or separation of the black and white circles in this figure indicates that the profile weighted intensity feature (represented by the x-axis) is useful for distinguishing between different cell phenotypes.
[0060] The methods and apparatus described herein are applicable universally to data or images of any dimensionality. For example, the equations presented above can be easily applied to a 3-D image, in which R and W are functions of x, y, and z. Because 3-D images generally have more voxels than 2-D images, calculation times are generally longer for 3-D images.
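As a small illustration of this point, the distance-transform-based sketch used above carries over to a 3-D stack without modification, because the transform accepts arrays of any dimensionality (again, this is an alternative to the sliding parabola erosion used elsewhere in this description):

```python
import numpy as np
from scipy import ndimage

# cell_mask_3d: boolean voxel mask of one cell, shape (z, y, x); synthetic here.
cell_mask_3d = np.zeros((20, 64, 64), dtype=bool)
cell_mask_3d[5:15, 16:48, 16:48] = True

# The distance transform handles n-dimensional input directly, so R and the
# profile image W become functions of (x, y, z).
R = ndimage.distance_transform_edt(cell_mask_3d)
W = np.exp(-(R ** 2) / (2.0 * 5.0 ** 2)) * cell_mask_3d
```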
[0061] In certain embodiments, the methods described above are implemented on a computer using computer software. In one specific embodiment, to perform feature extraction, the software includes three building blocks to calculate intensity properties, calculate texture properties, and calculate morphology properties. In one embodiment, the software implements a method for extracting many morphology features in parallel, thereby enabling features from several (e.g., five) different families to be combined. One of the families is profile weighted intensity. To calculate profile weighted intensity features, a user applies the building block for calculating morphology properties. The user then ensures that the family of profile weighted intensity features is selected (by default, it is selected). Next, a user may select one or more filters (e.g., an energy filter) to apply and specify input parameters for the filter(s). In another embodiment, a wide set of features is calculated automatically whenever the user requests a classification or regression task. Later, when the tuning is completed (i.e. the relevant features have been identified), only the relevant features will be calculated.
[0062] Regarding computation times, the most expensive step in the computations may be the calculation of the distance image (or more than one distance image if different borders are involved). For calculation of a distance image, sliding parabola erosion may be employed. In one embodiment, the calculation time for performing sliding parabola erosion on a 1360 x 1024 pixel image is about 0.17 seconds, using a Dell Latitude 630 (2.2 GHz laptop). All other operations are relatively cheap (i.e., faster). In another embodiment, when a series of profile functions are all based on the same distance function, the whole set is calculated at little or no additional cost. By comparison, when there are several distance functions in a set, each distance function requires approximately the same amount of computation time.
[0063] It should be noted that embodiments of the present invention may be provided as one or more computer-readable programs embodied on or in one or more articles of manufacture. The article of manufacture may be any suitable hardware apparatus, such as, for example, a floppy disk, a hard disk, a CD-ROM, a CD-RW, a CD-R, a DVD-ROM, a DVD-RW, a DVD-R, a flash memory card, a PROM, a RAM, a ROM, or a magnetic tape. In general, the
computer-readable programs may be implemented in any programming language. Some examples of languages that may be used include C, C++, or JAVA. The software programs may be further translated into machine language or virtual machine instructions and stored in a program file in that form. The program file may then be stored on or in one or more of the articles of manufacture.
[0064] A computer hardware apparatus may be used in carrying out any of the methods described herein. The apparatus may include, for example, a general purpose computer, an embedded computer, a laptop or desktop computer, or any other type of computer that is capable of running software, issuing suitable control commands, receiving graphical user input, and recording information. The computer typically includes one or more central processing units for executing the instructions contained in software code that embraces one or more of the methods described herein. The software may include one or more modules recorded on machine-readable media, where the term machine-readable media encompasses software, hardwired logic, firmware, object code, and the like. Additionally, communication buses and I/O ports may be provided to link any or all of the hardware components together and permit communication with other computers and computer networks, including the internet, as desired. The computer may include a memory or register for storing data.
[0065] In certain embodiments, the modules described herein may be software code or portions of software code. For example, a module may be a single subroutine, more than one subroutine, and/or portions of one or more subroutines. The module may also reside on more than one machine or computer. In certain embodiments, a module defines data by creating the data, receiving the data, and/or providing the data. The module may reside on a local computer, or may be accessed via a network, such as the Internet. Modules may overlap - for example, one module may contain code that is part of another module, or is a subset of another module.
[0066] The computer can be a general purpose computer, such as a commercially available personal computer that includes a CPU, one or more memories, one or more storage media, one or more output devices, such as a display, and one or more input devices, such as a keyboard. The computer operates using any commercially available operating system, such as any version of the Windows™ operating systems from Microsoft Corporation of Redmond, Wash., or the Linux™ operating system from Red Hat Software of Research Triangle Park, N.C. The computer is programmed with software including commands that, when operating, direct the
computer in the performance of the methods of the invention. Those of skill in the
programming arts will recognize that some or all of the commands can be provided in the form of software, in the form of programmable hardware such as flash memory, ROM, or programmable gate arrays (PGAs), in the form of hard-wired circuitry, or in some combination of two or more of software, programmed hardware, or hard-wired circuitry. Commands that control the operation of a computer are often grouped into units that perform a particular action, such as receiving information, processing information or data, and providing information to a user. Such a unit can comprise any number of instructions, from a single command, such as a single machine language instruction, to a plurality of commands, such as a plurality of lines of code written in a higher level programming language such as C++. Such units of commands are referred to generally as modules, whether the commands include software, programmed hardware, hard-wired circuitry, or a combination thereof. The computer and/or the software includes modules that accept input from input devices, that provide output signals to output devices, and that maintain the orderly operation of the computer. The computer also includes at least one module that renders images and text on the display. In alternative embodiments, the computer is a laptop computer, a minicomputer, a mainframe computer, an embedded computer, or a handheld computer. The memory is any conventional memory such as, but not limited to, semiconductor memory, optical memory, or magnetic memory. The storage medium is any conventional machine-readable storage medium such as, but not limited to, floppy disk, hard disk, CD-ROM, and/or magnetic tape. The display is any conventional display such as, but not limited to, a video monitor, a printer, a speaker, an alphanumeric display. The input device is any conventional input device such as, but not limited to, a keyboard, a mouse, a touch screen, a microphone, and/or a remote control. The computer can be a stand-alone computer or interconnected with at least one other computer by way of a network. This may be an internet connection.
[0067] As used herein, an "image" - for example, an image of one or more cells - includes any visual representation, such as a photo, a video frame, streaming video, as well as any electronic, digital or mathematical analogue of a photo, video frame, or streaming video. Any apparatus described herein, in certain embodiments, includes a display for displaying an image or any other result produced by the processor. Any method described herein, in certain embodiments, includes a step of displaying an image or any other result produced via the method.
[0068] In certain embodiments, the methods and apparatus described herein are used for cell phenotype classification and may include the feature selection module described in U.S. Patent Application No. 13/230,377, filed September 12, 2011, entitled "Methods and Apparatus for Fast Identification of Relevant Features for Classification or Regression," the disclosure of which is hereby incorporated by reference herein in its entirety.
[0069] In certain embodiments, the methods and apparatus described herein utilize sliding parabola erosion. A sliding parabola erosion procedure is described in U.S. Patent Application No. 13/230,433, filed September 12, 2011, entitled "Methods and Apparatus for Image Analysis and Modification Using Fast Sliding Parabola Erosion," the disclosure of which is hereby incorporated by reference herein in its entirety.
Equivalents
[0070] While the invention has been particularly shown and described with reference to specific preferred embodiments, it should be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
[0071] What is claimed is:
Claims
1. A method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, said profile weighted intensity feature determined from one or more images of the cell, the method comprising the steps of:
(a) detecting a cell in an image;
(b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell; and
(c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
2. The method of claim 1, further comprising the step of classifying cell phenotype using the computed profile weighted intensity feature.
3. The method of claim 1 or 2, wherein the border is an outer border of the cell.
4. The method of any one of the preceding claims, wherein the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized.
5. The method of any one of the preceding claims, wherein the distance image is determined via a sliding parabola erosion operation performed on a cell mask image.
6. The method of any one of the preceding claims, wherein the profile weighted intensity feature includes at least one of unnormalized intensities and normalized intensities.
7. The method of any one of the preceding claims, wherein step (c) comprises computing a profile weighted intensity feature for the cell from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
8. A method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, said profile weighted intensity feature determined from one or more images of the cell, the method comprising the steps of:
(a) detecting a cell and detecting a nucleus of the cell from one or more images of the cell;
(b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of both (i) the closest distance from the pixel to an outer border of the cell and (ii) the closest distance from the pixel to an outer border of a nucleus of the cell; and
(c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
9. The method of claim 8, wherein the profile weighted intensity feature for the cell is computed from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
10. A method for determining a profile weighted intensity feature for a cell useful for classifying cell phenotype, said profile weighted intensity feature determined from one or more images of the cell, the method comprising the steps of:
(a) detecting a plurality of borders of a cell from one or more images of the cell;
(b) computing a profile image for the cell, wherein pixel intensity of the profile image is a function of the nearest distances from the pixel to each of the plurality of detected borders of the cell; and
(c) computing a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
11. The method of claim 10, wherein the profile weighted intensity feature for the cell is computed from a filtered image of an acquired input image of the cell using the profile image of the cell as a weight image.
12. An apparatus for determination of a profile weighted intensity feature for a cell useful for classifying cell phenotype, said profile weighted intensity feature determined from one or more images of the cell, the apparatus comprising:
(a) a memory for storing a code defining a set of instructions; and
(b) a processor for executing the set of instructions,
wherein the code comprises a profile weighted intensity module configured to:
(i) detect a cell in an image;
(ii) compute a profile image for the cell, wherein pixel intensity of the profile image is a function of the closest distance from the pixel to a border of the cell; and
(iii) compute a profile weighted intensity feature for the cell using the profile image of the cell as a weight image, wherein the profile weighted intensity feature for the cell is a weighted mean intensity of pixels of the cell.
13. The apparatus of claim 12, wherein the profile weighted intensity module is configured to classify cell phenotype using the computed profile weighted intensity feature.
14. The apparatus of claim 12 or 13, wherein the border is the outer border of the cell.
15. The apparatus of any one of claims 12 to 14, wherein the profile image is computed such that pixels near the border of the cell are emphasized and pixels a given distance away from the border of the cell are deemphasized.
16. The apparatus of any one of claims 12 to 15, wherein the distance image is determined via a sliding parabola erosion operation performed on a cell mask image.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP11811118.6A EP2776975A1 (en) | 2011-11-08 | 2011-11-08 | Methods and apparatus for image analysis using profile weighted intensity features |
PCT/IB2011/002971 WO2013068781A1 (en) | 2011-11-08 | 2011-11-08 | Methods and apparatus for image analysis using profile weighted intensity features |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/IB2011/002971 WO2013068781A1 (en) | 2011-11-08 | 2011-11-08 | Methods and apparatus for image analysis using profile weighted intensity features |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013068781A1 true WO2013068781A1 (en) | 2013-05-16 |
Family
ID=45509546
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2011/002971 WO2013068781A1 (en) | 2011-11-08 | 2011-11-08 | Methods and apparatus for image analysis using profile weighted intensity features |
Country Status (2)
Country | Link |
---|---|
EP (1) | EP2776975A1 (en) |
WO (1) | WO2013068781A1 (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8705834B2 (en) | 2011-11-08 | 2014-04-22 | Perkinelmer Cellular Technologies Germany Gmbh | Methods and apparatus for image analysis using threshold compactness features |
US8942459B2 (en) | 2011-09-12 | 2015-01-27 | Perkinelmer Cellular Technologies Germany Gmbh | Methods and apparatus for fast identification of relevant features for classification or regression |
WO2015065697A1 (en) * | 2013-10-28 | 2015-05-07 | Molecular Devices, Llc | Method and system for classifying and identifying individual cells in a microscopy image |
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2012254A1 (en) * | 2007-07-06 | 2009-01-07 | Evotec Technologies GmbH | Method for quantifying an underlying property of a multitude of samples |
Non-Patent Citations (6)
Title |
---|
D. W. KNOWLES: "Automated local bright feature image analysis of nuclear protein distribution identifies changes in tissue phenotype", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, vol. 103, no. 12, 1 January 2006 (2006-01-01), pages 4445 - 4450, XP055025472, ISSN: 0027-8424, DOI: 10.1073/pnas.0509944102 * |
GANG LIN ET AL: "AUTOMATED 3-D QUANTIFICATION OF BRAIN TISSUE AT THE CELLULAR SCALE FROM MULTI-PARAMETER CONFOCAL MICROSCOPY IMAGES", BIOMEDICAL IMAGING: FROM NANO TO MACRO, 2007. ISBI 2007. 4TH IEEE INTE RNATIONAL SYMPOSIUM ON, IEEE, PI, 1 April 2007 (2007-04-01), pages 1040 - 1043, XP031084455, ISBN: 978-1-4244-0671-5 * |
ILKER ERSOY ET AL: "Multi-feature contour evolution for automatic live cell segmentation in time lapse imagery", ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, 2008. EMBS 2008. 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE, IEEE, PISCATAWAY, NJ, USA, 20 August 2008 (2008-08-20), pages 371 - 374, XP031507968, ISBN: 978-1-4244-1814-5 * |
JOAKIM LINDBLAD ET AL: "Image analysis for automatic segmentation of cytoplasms and classification of Rac1 activation", CYTOMETRY, vol. 57A, no. 1, 1 January 2004 (2004-01-01), pages 22 - 33, XP055025469, ISSN: 0196-4763, DOI: 10.1002/cyto.a.10107 * |
M. WANG: "Feature extraction, selection and classifier design in automated time-lapse fluorescence microscope image analysis", MICROSCOPY: SCIENCE, TECHNOLOGY, APPLICATIONS AND EDUCATION, 1 January 2010 (2010-01-01), XP055025485, Retrieved from the Internet <URL:http://www.formatex.info/microscopy4/1378-1388.pdf> [retrieved on 20120424] * |
NILUFAR S ET AL: "Automatic blood cell classification based on joint histogram based feature and bhattacharya kernel", SIGNALS, SYSTEMS AND COMPUTERS, 2008 42ND ASILOMAR CONFERENCE ON, IEEE, PISCATAWAY, NJ, USA, 26 October 2008 (2008-10-26), pages 1915 - 1918, XP031475638, ISBN: 978-1-4244-2940-0 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8942459B2 (en) | 2011-09-12 | 2015-01-27 | Perkinelmer Cellular Technologies Germany Gmbh | Methods and apparatus for fast identification of relevant features for classification or regression |
US8705834B2 (en) | 2011-11-08 | 2014-04-22 | Perkinelmer Cellular Technologies Germany Gmbh | Methods and apparatus for image analysis using threshold compactness features |
US9443129B2 (en) | 2011-11-08 | 2016-09-13 | Perkinelmer Cellular Technologies Germany Gmbh | Methods and apparatus for image analysis using threshold compactness features |
WO2015065697A1 (en) * | 2013-10-28 | 2015-05-07 | Molecular Devices, Llc | Method and system for classifying and identifying individual cells in a microscopy image |
US9830502B2 (en) | 2013-10-28 | 2017-11-28 | Dh Technologies Development Pte. Ltd. | Method and system for classifying and identifying individual cells in a microscopy image |
Also Published As
Publication number | Publication date |
---|---|
EP2776975A1 (en) | 2014-09-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8942459B2 (en) | Methods and apparatus for fast identification of relevant features for classification or regression | |
US9443129B2 (en) | Methods and apparatus for image analysis using threshold compactness features | |
JP6710135B2 (en) | Cell image automatic analysis method and system | |
CN105894036B (en) | A kind of characteristics of image template matching method applied to mobile phone screen defects detection | |
JP6503382B2 (en) | Digital Holographic Microscopy Data Analysis for Hematology | |
Wang et al. | Automated blob detection using iterative Laplacian of Gaussian filtering and unilateral second-order Gaussian kernels | |
EP2377073B1 (en) | Multi-nucleated cell classification and micronuclei scoring | |
CN107918931A (en) | Image processing method and system | |
Medina-Carnicer et al. | A novel histogram transformation to improve the performance of thresholding methods in edge detection | |
CN112669275A (en) | PCB surface defect detection method and device based on YOLOv3 algorithm | |
CN116664565A (en) | Hidden crack detection method and system for photovoltaic solar cell | |
Lee et al. | HiComet: a high-throughput comet analysis tool for large-scale DNA damage assessment | |
JP7348588B2 (en) | Anomaly detection system and anomaly detection program | |
CN115880520A (en) | Defect grade classification method and system based on template matching and defect segmentation | |
CN114140669A (en) | Welding defect recognition model training method and device and computer terminal | |
Peng et al. | PombeX: robust cell segmentation for fission yeast transillumination images | |
CN113780287A (en) | Optimal selection method and system for multi-depth learning model | |
Ahmad et al. | A geometric-based method for recognizing overlapping polygonal-shaped and semi-transparent particles in gray tone images | |
EP2776975A1 (en) | Methods and apparatus for image analysis using profile weighted intensity features | |
US8600144B2 (en) | Methods and apparatus for image analysis using profile weighted intensity features | |
CN117593420A (en) | Plane drawing labeling method, device, medium and equipment based on image processing | |
Mosaliganti et al. | Tensor classification of N-point correlation function features for histology tissue segmentation | |
EP2756451B1 (en) | Methods and apparatus for fast identification of relevant features for classification or regression | |
EP2776974B1 (en) | Methods and apparatus for image analysis using threshold compactness features | |
EP3289566A1 (en) | Method and apparatus for determining temporal behaviour of an object |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 11811118; Country of ref document: EP; Kind code of ref document: A1 |
 | NENP | Non-entry into the national phase | Ref country code: DE |
 | WWE | Wipo information: entry into national phase | Ref document number: 2011811118; Country of ref document: EP |