Open Access

Peer-reviewed

Research Article

Texture Descriptors Ensembles Enable Image-Based Classification of Maturation of Human Stem Cell-Derived Retinal Pigmented Epithelium

Loris Nanni
Affiliation Department of Information Engineering, University of Padua, Padua, Italy

Michelangelo Paci
Affiliation Department of Electronics and Communications Engineering, Tampere University of Technology, BioMediTech, Tampere, Finland

Florentino Luciano Caetano dos Santos
Affiliation Department of Electronics and Communications Engineering, Tampere University of Technology, BioMediTech, Tampere, Finland

Heli Skottman
Affiliation University of Tampere, BioMediTech, Tampere, Finland

Kati Juuti-Uusitalo
Affiliation University of Tampere, BioMediTech, Tampere, Finland

Jari Hyttinen
Affiliation Department of Electronics and Communications Engineering, Tampere University of Technology, BioMediTech, Tampere, Finland

Contributed equally to this work with: Loris Nanni, Michelangelo Paci

* E-mail: loris.nanni@unipd.it (LN); michelangelo.paci@tut.fi (MP)

Published: February 19, 2016
https://doi.org/10.1371/journal.pone.0149399

Abstract

Aims

A fast, non-invasive and observer-independent method to analyze the homogeneity and maturity of human pluripotent stem cell (hPSC) derived retinal pigment epithelial (RPE) cells is warranted to assess the suitability of hPSC-RPE cells for implantation or in vitro use. The aim of this work was to develop and validate methods to create ensembles of state-of-the-art texture descriptors and to provide a robust classification tool to separate three different maturation stages of RPE cells by using phase contrast microscopy images. The same methods were also validated on a wide variety of biological image classification problems, such as histological or virus image classification.

Methods

For image classification we used different texture descriptors, descriptor ensembles and preprocessing techniques. In addition, three new methods were tested. The first was an ensemble of preprocessing methods, used to create an additional set of images. The second was the region-based approach, where saliency detection and wavelet decomposition divide each image into two different regions, from which features were extracted through different descriptors. The third was an ensemble of Binarized Statistical Image Features, based on different sizes and thresholds. A Support Vector Machine (SVM) was trained for each descriptor histogram and the set of SVMs was combined by sum rule. The accuracy of the computer vision tool was verified in classifying the hPSC-RPE cell maturation level.

Dataset and Results

The RPE dataset contains 1862 subwindows from 195 phase contrast images. The final descriptor ensemble outperformed the most recent stand-alone texture descriptors, obtaining, for the RPE dataset, an area under ROC curve (AUC) of 86.49% with the 10-fold cross validation and 91.98% with the leave-one-image-out protocol. The generality of the three proposed approaches was ascertained with 10 more biological image datasets, obtaining an average AUC greater than 97%.

Conclusions

Here we showed that the developed ensembles of texture descriptors are able to classify the RPE cell maturation stage. Moreover, we proved that preprocessing and region-based decomposition improve the accuracy of many descriptors in biological dataset classification. Finally, we built the first public dataset of stem cell-derived RPE cells, which is available to the scientific community for classification studies. The proposed tool is available at https://www.dei.unipd.it/node/2357 and the RPE dataset at http://www.biomeditech.fi/data/RPE_dataset/. Both are available at https://figshare.com/s/d6fb591f1beb4f8efa6f.

Citation: Nanni L, Paci M, Caetano dos Santos FL, Skottman H, Juuti-Uusitalo K, Hyttinen J (2016) Texture Descriptors Ensembles Enable Image-Based Classification of Maturation of Human Stem Cell-Derived Retinal Pigmented Epithelium. PLoS ONE 11(2): e0149399. https://doi.org/10.1371/journal.pone.0149399

Editor: Qinghui Zhang, North Shore Long Island Jewish Health System, UNITED STATES

Received: June 16, 2015; Accepted: February 1, 2016; Published: February 19, 2016

Copyright: © 2016 Nanni et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The RPE classification tool is available at https://www.dei.unipd.it/node/2357. The RPE dataset is available at http://www.biomeditech.fi/data/RPE_dataset. Data have also been uploaded to Figshare: https://dx.doi.org/10.6084/m9.figshare.2070109.

Funding: This work was financially supported by the Finnish Funding Agency for Technology and Innovation (www.tekes.fi, Human Spare Parts project, MP and FLCS) and the Academy of Finland (www.aka.fi, grant numbers 252225 JH, 218050 HS and 137801 KJU). The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Abbreviations: All, fusion by sum rule among Saliency, Edge and Wavelet; AUC, area under the ROC curve; BR, mammography dataset; Bsif, binarized statistical image features; CHO, Chinese hamster ovary cell dataset; CLBP, completed LBP; CoALBP, multi-scale co-occurrence of adjacent LBP; COLORS, set of statistical descriptors computed on the RPE color images; Comb, fusion by sum rule among Wa, OR and Ga, with preprocessing applied to the original image and the two MRS images; DENSE, multi-scale densely sampled complete LBP; DENSEri, multi-scale densely sampled complete rotation invariant LBP; DoG, difference of Gaussians; dPG, degree of pigmentation; Edge, Edge variant of a texture descriptor; F, ensemble of the best methods tested on the additional datasets (see section 3.2); Full_Bsif, ensemble based on Bsif with variable filter sizes and thresholds; GA, Gabor filter applied to the original image; HeLa, 2D HeLa dataset; hESC, human embryonic stem cells; HI, histopathology dataset; hiPSC, human induced pluripotent stem cells; HOG, histogram of oriented gradients; hPSC, human pluripotent stem cells; LBP, local binary pattern; LBP-HF, multi-scale LBP histogram Fourier features; LBPri, rotation invariant LBP; LCP, local configuration pattern; LCPri, rotation invariant LCP; LE, locate endogenous mouse sub-cellular organelles dataset; LQP, local quinary pattern; LT, locate transfected mouse sub-cellular organelles dataset; LTP, local ternary pattern; LTPri, rotation invariant LTP; MLCP, multi-quinary coding version of LCP; MLPQ, multi-ternary coding version of LPQ; MLPQens, ensemble of MLPQ; MLQP, multi-threshold version of LQP; Morph, Strandmark morphological features; MRS, Gaussian scale space representation; NTLBP, multi-scale noise tolerant LBP; O, features extracted only from the original image; OR, features extracted from the three orientation images obtained from the original image; PAP, Pap test dataset; PR, protein dataset; RICLBP, multi-scale rotation invariant co-occurrence of adjacent LBP; RNAi, RNA dataset; RPE, retinal pigmented epithelium; Saliency, saliency map; SFFS, sequential forward floating selection; SFS, sequential forward selection; Size_Bsif, ensemble based on Bsif with variable filter sizes; SVM, support vector machine; TEO, Finnish National Authority for Medicolegal Affairs; Wa, features extracted from the four images obtained by wavelet decomposition; WaH, features extracted from the images obtained by a two-level wavelet decomposition; Wavelet, wavelet decomposition into horizontal, vertical and diagonal coefficients; WLD, Weber law descriptor; VIR, virus dataset

Introduction

The retinal pigment epithelial (RPE) cells reside in the back of the eye between the photoreceptor cells and the choroid. The RPE monolayer is vitally important for vision, as RPE cells form a diffusion barrier that protects photoreceptor cells from humoral substances and also maintain the viability of photoreceptor cells [1]. RPE cell differentiation and maturation is a slow process, modulated by culturing environmental trophic factors [2,3]. The morphology changes during maturation [4]: from the elongated, so-called “fusiform morphology”, of immature RPE; via “epithelioid morphology”, i.e. rounder but still without pigmentation (after one to two weeks of culture); to “cobblestone morphology” (after approximately a month), when the cells have condensed and become heavily pigmented [4]. This phenomenon can be seen both in primary RPE [4] and in human pluripotent stem cell (hPSC) derived RPE cell maturation [5]. Recently, in the first human embryonic stem cell (hESC) RPE transplantations to humans, it was demonstrated that less pigmented cells integrated better than heavily pigmented cells [6]. Furthermore, new serial plating methods to expand the hPSC-RPE cell number [7,8] need a quality and purity evaluation after every plating step [8]. These applications would benefit from a non-invasive and reliable method to assess the maturity development of hPSC-RPE cells. The benefits of cell morphology analysis for both RPE tissue [9] and hPSC-RPE cell cultures [10] have already been shown. However, this has been mainly done by manual examination and is therefore affected by inter- and intra-operator variability, making it less suitable for clinical application. In particular, Jiang et al. [9] recently published a computer vision approach for RPE tissue explants, discriminating between age (young, <61 days old, vs old, ≥100 or 180 days old) and genotype (control vs rd10, considered to be a model for autosomal recessive retinitis pigmentosa). The analysis was based on 21 morphological features of the cells, including aspect ratio and area, by means of principal component analysis. In [11], three degrees of pigmentation were considered as a good maturation marker. A manual approach was chosen, where two observers subjectively classified the cell pigmentation levels and an objective pigmentation measurement was inferred from Photoshop's Info Palette for a set of manually selected points.

In this paper, we focused on the specific problem of classifying the maturation level of hPSC-RPE cells by means of a different approach: texture analysis. With the increasing availability of advanced and accurate image acquisition techniques, texture analysis has become a common processing approach for medical and biological images. Its versatility makes it applicable to images acquired with diverse modalities, from medical imaging to microscopy [12–15].

Despite recent progress in texture analysis, textural information in medical images is still often assessed by conventional features such as first order statistics (e.g. variance, kurtosis and skewness), second order statistics (Haralick features extracted from the grey level co-occurrence matrix [16]) or wavelet features. In [17], wavelets and a subset of the Haralick features were extracted from X-ray images to diagnose the presence of osteosarcomas. First order statistics and features extracted from the grey level co-occurrence matrix and run-length matrix proved their utility in colorectal polyp identification in colonoscopy [18], classification of intracardiac masses (thrombi, malignant, and benign tumors) for cardiac tumor detection [19] and breast cancer malignancy classification in histological images [20].

More recent techniques, such as local binary pattern (LBP) [21] or texture descriptors derived from it (e.g. local ternary pattern (LTP) [22], local quinary pattern (LQP) [13], etc.), were applied to medical imaging for the examination of Pap test samples [12] and for the analysis of endoscopy images of healthy and celiac disease duodenal tissue [14]. Another important research area where texture descriptors are commonly used is cell classification. Owing to the availability of many datasets (2D HeLa dataset (HeLa) [23], Chinese hamster ovary cells (CHO) [24], etc.), this field is very productive, both for specific classification tasks and for the development of increasingly accurate texture descriptors. In [13], the multi-threshold approach was applied to LTP and LQP and tested by classifying six different datasets of cellular and subcellular organelles. In [15], a new variant of LBP, the rotation invariant co-occurrence among adjacent LBP (RICLBP), obtained outstanding results on the MIVIA HEp-2 dataset. This also suggests that clinical tests, such as the antinuclear antibody test, can benefit from more accurate texture descriptors.

Despite the efforts made in recent years to improve the discriminant power of texture descriptors, preprocessing has not received the same attention. Recently published preprocessing approaches exploit the separation of the texture image into two different regions or maps, e.g. textural information extracted from edge information. In [25], the Difference of Gaussians (DoG) filter was used to compute from a given image two maps representing the “positive” and the “negative” sides of the image edges, resulting in a classification accuracy improvement. A similar approach was used in [26] (details in section 2.1.3) for the extraction, through Sobel filtering, of an edge and a non-edge region from a texture image, in order to compute LBP, LTP, etc. on the original image masked by each map. The technique is interesting since it can be combined with many state-of-the-art texture descriptors. To distinguish them from canonical preprocessing, we named the descriptors combined with this approach region-based descriptors. In addition, well-established preprocessing algorithms were tested, e.g. wavelet [27] and Gabor filters [28]. We paid particular attention to preprocessing techniques in this study, to improve the classification power of the descriptors.

The main aim of this paper was to develop three simple but effective methods to create ensembles of texture descriptors: the ensemble of preprocessing, the region-based approach and the ensemble of Binarized Statistical Image Features (Bsif). We validated them on a wide range of biological image datasets, with particular focus on the quantification of hPSC-RPE cell maturation stages, which enables a user-independent method to analyze the cell cultures before their use in implantation or as in vitro cell models. To find the most suitable ensemble of descriptors for the classification of the three developmental stages, we tested combinations of large sets of both preprocessing methods and texture descriptors. With a view to using hPSC-RPE cells for drug tests or transplantation, we used phase-contrast microscopy, which is a noninvasive assessment method. Our work resulted in a methodological core for a software tool that quantitatively assesses the level of development of hPSC-RPE cells. The same ensembling methods also proved effective for other classification problems, ranging from medical diagnostics to virus images.

The pipeline consisted of the following steps. First, we considered many state-of-the-art stand-alone texture descriptors in order to select the best ones. Second, they were combined with one another and with techniques that augment and enhance the features extracted from each image, to improve classification performance. Finally, the best performing feature sets resulting from the previous step were combined, resulting in ensembles that significantly improved on the performance of the pre-existing stand-alone descriptors and of the ensembles based on a single descriptor. For classification, we used Support Vector Machines (SVM).

In detail, we propose the following novelties. First, an ensemble of preprocessing approaches (based on wavelet decomposition, Gabor filtering, orientation image and a multi-scale approach by Gaussian filtering) was applied to create a set of images to be used together with the original one. A different descriptor was extracted from each processed image and the set of SVMs was combined by sum rule. Second, saliency detection and wavelet decomposition were tested for the region-based descriptors. Each image was divided into two regions from which histograms were extracted by different texture descriptors. From each histogram, a specific SVM was trained [29]. Finally, the partial scores obtained by the different SVMs were combined by sum rule. Third, the original stand-alone version of Bsif [30] was improved by combining different Bsif sets. They were obtained by (i) varying the size of the filter and (ii) introducing a threshold while building the Bsif image. We constructed a new ensemble, using a set of sizes and thresholds, which greatly outperformed the stand-alone version.

Materials and Methods

2.1 Proposed approaches

In this work, we developed and validated three methods for ensembling texture descriptors and techniques such as preprocessing or region-based feature extraction, to describe images with an augmented feature set and ultimately improve their classification. The first method was a combination of preprocessing methods, used to create additional images from the original one and then extract texture information from each of them, enhancing the feature set describing the original image. The second approach worked differently, augmenting the feature set by means of the region-based approach. Saliency detection and wavelet decomposition divide each image into two different regions, from which features were extracted through different descriptors. Again, the original image was described by an enhanced feature set. Different texture descriptors were tested with the first two approaches. The third method augmented the feature set describing the original image by combining feature vectors extracted by Bsif, based on different filter sizes and binarization thresholds. The most effective feature sets extracted according to these approaches were then combined into ensembles. To test and optimize our approaches for the hPSC-RPE classification problem, we built a new RPE image dataset (Section 2.2): 1862 subwindows were extracted from 195 phase contrast images of maturing hPSC-RPE cells. Finally, we analyzed how the three approaches performed independently on the analyzed datasets: to assess their generality, 10 large datasets of medical and biological images were used (Section 2.3).

The remainder of this section is organized as follows: Section 2.1.1 briefly explains the basic texture descriptors used in this paper; Section 2.1.2 is dedicated to preprocessing techniques and their ensemble; Section 2.1.3 presents the region-based approach using saliency maps and wavelet maps; Section 2.1.4 explains the ensemble of Bsif; and, finally, Section 2.1.5 details the new multi-quinary coding tests.

2.1.1 Texture descriptors.

The standard texture descriptors used in this work are summarized in Table 1, together with the chosen parameters. For the LBP-based approaches, we tested both uniform and rotation invariant uniform bins (see section 3 for details).

Table 1. Texture descriptors and their parameter sets.

https://doi.org/10.1371/journal.pone.0149399.t001

2.1.2 Preprocessing.

One of the aims was to improve the performance of texture descriptors by using a set of different preprocessing methods before feature extraction. When using a given preprocessing approach, a new set of images was produced to be then processed by an ensemble of descriptors. Classification was performed separately for each descriptor using SVMs, with both linear and radial basis function kernels, as the base classifier. For each dataset, the best kernel and set of parameters were chosen using a 5-fold cross-validation approach on the training data. SVMs were implemented using the tool LibSVM (available at http://www.csie.ntu.edu.tw/~cjlin/libsvm/) and combined by sum rule.
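The sum rule mentioned above can be illustrated with a minimal sketch. It assumes that each SVM returns one score per class for every sample; the function name `sum_rule` and the toy scores are hypothetical and are not taken from the authors' tool.

```python
def sum_rule(score_sets):
    """Fuse classifiers by summing their per-class scores.

    score_sets: one entry per classifier, each a list (one item per sample)
    of per-class scores. Returns the fused scores and the predicted labels.
    """
    n_samples = len(score_sets[0])
    n_classes = len(score_sets[0][0])
    fused = [[0.0] * n_classes for _ in range(n_samples)]
    for scores in score_sets:
        for i, row in enumerate(scores):
            for c, value in enumerate(row):
                fused[i][c] += value
    # The predicted class is the one with the largest summed score.
    labels = [row.index(max(row)) for row in fused]
    return fused, labels


# Toy scores from two hypothetical SVMs: two samples, three classes.
svm_a = [[0.7, 0.2, 0.1], [0.2, 0.5, 0.3]]
svm_b = [[0.5, 0.3, 0.2], [0.1, 0.3, 0.6]]
fused, labels = sum_rule([svm_a, svm_b])
```

In this toy case the second sample is assigned to the third class only after fusion, since the first classifier alone would have chosen the second class.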

The image preprocessing phase included the testing of the following four methods: decomposition by wavelets, multi-resolution by Gaussian filters, orientation image and Gabor filters. Of note, only the training data is used for finding the parameters of the different approaches while the test set is blind. The flowchart of the preprocessing is reported in Fig 1.

Fig 1. Flowchart of the preprocessing.

https://doi.org/10.1371/journal.pone.0149399.g001

Wavelet transform is frequently used in many computer vision problems related to the detection and recognition of objects of interest. To be used for 2D decomposition, the wavelet transform [27] requires a 2D scaling function, φ(x,y), and three 2D wavelet functions, ψⁱ(x,y), where i = {H,V,D} represents the three possible intensity variations along horizontal, vertical and diagonal edges.

As both functions are separable, the scaled and translated basis functions are defined as:

\[ \varphi_{j,m,n}(x,y) = 2^{j/2}\,\varphi(2^{j}x - m,\; 2^{j}y - n) \]

\[ \psi^{i}_{j,m,n}(x,y) = 2^{j/2}\,\psi^{i}(2^{j}x - m,\; 2^{j}y - n), \quad i = \{H,V,D\} \]

For the discrete wavelet transform (W^H, W^V and W^D for horizontal, vertical and diagonal details respectively) of an M x N function f(x,y), the formulation used is:

\[ W_{\varphi}(j_0,m,n) = \frac{1}{\sqrt{MN}} \sum_{x=0}^{M-1} \sum_{y=0}^{N-1} f(x,y)\,\varphi_{j_0,m,n}(x,y) \]

\[ W^{i}_{\psi}(j,m,n) = \frac{1}{\sqrt{MN}} \sum_{x=0}^{M-1} \sum_{y=0}^{N-1} f(x,y)\,\psi^{i}_{j,m,n}(x,y), \quad i = \{H,V,D\} \]

where j₀ is an arbitrary starting scale and the W_φ(j₀,m,n) coefficients represent an approximation (on the initial scale j₀) of f(x,y). The W^i_ψ(j,m,n) coefficients represent the three directional (horizontal, vertical, and diagonal) details for scales j ≥ j₀.

In our experiments, we used the Daubechies wavelet family (Wa) with four vanishing moments. An example of Wa processing is shown in Fig 2.

Fig 2. Preprocessing by Wa.

Rows represent the three classes fusiform, epithelioid and cobblestone. Left: original image; right: horizontal, vertical and diagonal details.

https://doi.org/10.1371/journal.pone.0149399.g002

The second preprocessing technique was a multi-scale approach by means of Gaussian scale-space representation (MRS). The original image was filtered to obtain two smoothed versions by using a 2D symmetric Gaussian lowpass filter of size k pixels (here we use k = 3 and k = 5) with standard deviation 1. Illustrative results of MRS preprocessing are shown in Fig 3.
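The MRS step can be sketched as follows, assuming NumPy is available. The kernel sizes (3 and 5) and the standard deviation (1) come from the text; the edge-replicated border handling, the function names and the random toy image are assumptions made only for illustration.

```python
import numpy as np


def gaussian_kernel(k, sigma=1.0):
    """k x k symmetric Gaussian lowpass kernel, normalized to sum to 1."""
    ax = np.arange(k) - (k - 1) / 2.0
    xx, yy = np.meshgrid(ax, ax)
    kernel = np.exp(-(xx ** 2 + yy ** 2) / (2.0 * sigma ** 2))
    return kernel / kernel.sum()


def filter2d(image, kernel):
    """Same-size 2D filtering with edge-replicated borders (an assumption)."""
    k = kernel.shape[0]
    pad = k // 2
    rows, cols = image.shape
    padded = np.pad(image.astype(float), pad, mode="edge")
    out = np.zeros((rows, cols), dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += kernel[dy, dx] * padded[dy:dy + rows, dx:dx + cols]
    return out


def mrs_images(image, sizes=(3, 5), sigma=1.0):
    """Return the original image plus one smoothed copy per kernel size."""
    smoothed = [filter2d(image, gaussian_kernel(k, sigma)) for k in sizes]
    return [image.astype(float)] + smoothed


rng = np.random.default_rng(0)
toy = rng.random((32, 32))      # stand-in for a phase contrast subwindow
original, smooth3, smooth5 = mrs_images(toy)
```

Each of the three returned images would then be passed to the texture descriptors in the same way as the original image.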

Fig 3. Preprocessing by MRS.

Rows represent the three classes fusiform, epithelioid and cobblestone. Left: original image; center: image filtered by a lowpass filter k = 3; right: image filtered by a lowpass filter k = 5.

https://doi.org/10.1371/journal.pone.0149399.g003

The third preprocessing technique was the Orientation image (OR). In the Orientation image the image gradient is soft quantized [43] using d orientations (here d = 3), thereby producing d processed images. This method is used to reduce the noise or other forms of degradation.

In detail, OR computation is organized in 3 steps:

for each pixel p, the gradient magnitude m(p) and orientation θ(p) are computed. θ(p) is then discretized over [0, 2π]. The pixel label is a d-dimensional vector with just one non-null element: the i-th element is equal to m(p) if the discretized θ(p) corresponds to the i-th bin. A more refined approach, the soft decomposition of the magnitude, consists in splitting m(p) into two parts to be assigned to the two nearest directions. Details about soft decomposition are reported in [43];
to include in each pixel p information from its neighborhood as well, a local histogram of orientations is computed over all the pixels contained in a square image patch (Cell) of size w centered in p. At pixel p, the new feature vector thus collects, for each of the d orientations, the magnitudes accumulated over the cell (see [43] for the formal definition);
finally, a self-similarity measurement is computed over n cells centered in the pixels c_j surrounding p (this topological structure is a circular block of radius L centered in p). The measurement uses a threshold τ slightly greater than 0 to make the mapping stronger in near-uniform regions (see [43] for the formal definition).

This produces d = 3 different orientation images (for details refer to [43]); an example of OR preprocessing is shown in Fig 4.
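Only the first of the three steps, in its hard-quantized form, is sketched below, assuming NumPy is available. The finite-difference gradient, the function name and the toy image are assumptions; the soft decomposition, the cell accumulation and the self-similarity step of [43] are not reproduced.

```python
import numpy as np


def orientation_images(image, d=3):
    """Hard-quantized gradient decomposition into d orientation images.

    The gradient magnitude of each pixel is assigned to the single bin
    that contains its orientation, discretized over [0, 2*pi).
    """
    gy, gx = np.gradient(image.astype(float))     # derivatives along rows, columns
    magnitude = np.hypot(gx, gy)
    theta = np.mod(np.arctan2(gy, gx), 2.0 * np.pi)
    # Clip guards against theta rounding up to exactly 2*pi.
    bins = np.minimum((theta / (2.0 * np.pi) * d).astype(int), d - 1)
    images = [np.where(bins == i, magnitude, 0.0) for i in range(d)]
    return images, magnitude


rng = np.random.default_rng(0)
toy = rng.random((32, 32))      # stand-in for a phase contrast subwindow
or_images, grad_magnitude = orientation_images(toy, d=3)
```

Because each pixel contributes to exactly one bin, the d orientation images sum back to the gradient magnitude image.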

Fig 4. Preprocessing by OR.

Rows represent the three classes fusiform, epithelioid and cobblestone. Left: original image; right: the three oriented images.

https://doi.org/10.1371/journal.pone.0149399.g004

The final preprocessing technique was Gabor filters. A 2D Gabor filter is a Gaussian kernel function modulated by a sinusoidal plane wave that is able to detect frequencies in various scales and directions. Gabor wavelets are derived from the convolution of the input image with a family of Gabor kernels. This family, or bank, of Gabor filters is created by dilating and rotating a specific function. Gabor wavelets approximate, to a certain level, the perception in the primary human visual cortex [28]. In this study four scales {1, 2, 3, 4} and four directions {0°, 45°, 90°, 135°} were implemented, thus 16 images are obtained. Choosing a specific frequency and direction allows creating a map containing the local frequency and orientation information for each pixel in an image.

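A symmetric Gabor filter has the following general form in the spatial domain:

\[ G(x,y;\nu,\theta,\sigma) = \exp\left\{ -\frac{x'^{2} + y'^{2}}{2\sigma^{2}} \right\} \cos(2\pi\nu x'), \quad x' = x\cos\theta + y\sin\theta, \quad y' = -x\sin\theta + y\cos\theta \]

where ν is the frequency of the sinusoidal wave, θ is the orientation, and σ is the standard deviation of the Gaussian envelope [44]. An example of Gabor filter and convolved image is shown in Fig 5.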
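A bank of 16 kernels (four scales, four directions) can be sketched as follows, assuming NumPy is available and a symmetric Gabor filter built as an isotropic Gaussian envelope times a cosine wave. The mapping from the scale index to the frequency ν and the standard deviation σ, the kernel size and all names are hypothetical, since the text does not state them.

```python
import numpy as np


def gabor_kernel(nu, theta, sigma, size=31):
    """Symmetric (even) Gabor kernel: isotropic Gaussian times a cosine wave."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1].astype(float)
    x_rot = x * np.cos(theta) + y * np.sin(theta)   # coordinate along the wave
    envelope = np.exp(-(x ** 2 + y ** 2) / (2.0 * sigma ** 2))
    return envelope * np.cos(2.0 * np.pi * nu * x_rot)


scales = [1, 2, 3, 4]
directions_deg = [0, 45, 90, 135]
# Hypothetical mapping from scale index to (nu, sigma); not given in the text.
bank = {
    (s, a): gabor_kernel(nu=0.4 / s, theta=np.deg2rad(a), sigma=1.5 * s)
    for s in scales
    for a in directions_deg
}
```

Convolving the input image with each kernel of the bank would give the 16 filtered images from which the descriptors are extracted.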

Fig 5. Preprocessing by Gabor filters.

Rows represent the three classes fusiform, epithelioid and cobblestone. Left: original image; right: convolved images at scale 4.

https://doi.org/10.1371/journal.pone.0149399.g005

The various preprocessing techniques, and their ensembles, used in Section 3 are summarized in Table 2.

Table 2. Preprocessing approaches.

https://doi.org/10.1371/journal.pone.0149399.t002

2.1.3 Region-based descriptors.

This idea was mainly inspired by [26], where the edge-based LBP variant (Edge) is proposed. It is based on the evidence that, when an observer fixates on an image, the locations most likely to be perceived are those that present the highest spatial frequency edge information [45].

The Edge descriptor is computed as follows:

applying LBP to an image to obtain the LBP image (LBPI);
detecting the edges in the original image by means of Sobel filter. Two binary maps are created from the edge information: the edge map (E, where edge pixels are set to 1 and non-edge pixels to 0), and the non-edge map (NE, where edge pixels are set to 0 and non-edge pixels to 1);
combining LBPI with the E and NE masks, to obtain two histograms (H_E for edge pixels and H_NE for non-edge pixels), see [26] for details;
mounting the final histogram (weighted concatenation):

\[ H = [\, w_{E} H_{E},\; w_{NE} H_{NE} \,] \]

where w_E and w_NE represent the empirically determined weights that express the greater relevance of edge regions in capturing the viewer’s visual attention.

Unlike in [26], in this study the two histograms, H_E and H_NE, were not combined into one feature vector but they were used separately to train two different SVMs that are then combined by sum rule.

Two methods for extracting the two maps were tested: the former based on saliency and the latter on wavelet decomposition. It should be noted that for both approaches, as in Edge, the descriptor was extracted initially from the original image and the two regions were used only to calculate the two histograms. The flowchart of the region-based approach is reported in Fig 6.

Fig 6. Flowchart of the region-based approach.

https://doi.org/10.1371/journal.pone.0149399.g006

In detail:

the chosen descriptors (namely LBP, LTP, LPQ, RICLBP and WLD) were applied to the texture image to get the labeled image DescI;
two maps, Map⁺ and Map⁻, were computed according to saliency or wavelet (details in the next sections);
two histograms, H⁺ and H⁻, were computed by combining DescI with Map⁺ and Map⁻, respectively;
H⁺ and H⁻ were used to train two different SVMs that were then combined by sum rule.
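The histogram step of the list above can be sketched as follows, assuming NumPy is available. The labeled image, the score map and the mean-value threshold used to split it are random or illustrative stand-ins; the function name is hypothetical and no histogram normalization is applied, since the text does not specify one.

```python
import numpy as np


def region_histograms(label_image, positive_map, n_bins):
    """Histograms of descriptor labels inside and outside a binary map."""
    mask = positive_map.astype(bool)
    h_pos = np.bincount(label_image[mask].ravel(), minlength=n_bins)
    h_neg = np.bincount(label_image[~mask].ravel(), minlength=n_bins)
    return h_pos, h_neg


rng = np.random.default_rng(0)
desc_i = rng.integers(0, 256, size=(64, 64))    # stand-in for the labeled image DescI
score_map = rng.random((64, 64))                # stand-in for a saliency or wavelet map
map_plus = score_map > score_map.mean()         # the complement plays the role of the second map
h_plus, h_minus = region_histograms(desc_i, map_plus, n_bins=256)
```

The two histograms would then feed two separate SVMs whose scores are summed, rather than being concatenated into a single feature vector.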

Saliency: We used the method proposed in [46] to extract a saliency map from the image. Given an image x, the signature is defined as

\[ \mathrm{ImageSignature}(x) = \mathrm{sign}(\mathrm{DCT}(x)) \]

where sign() represents the sign operator and DCT() is the Discrete Cosine Transform. Hou et al. [46] demonstrated analytically and experimentally that the support of the foreground of an image can be approximated by the reconstructed image \bar{x}, obtained by transforming the image signature back to the spatial domain, as follows:

\[ \bar{x} = \mathrm{IDCT}(\mathrm{sign}(\mathrm{DCT}(x))) \]

where IDCT() is the inverse Discrete Cosine Transform. For images whose foreground is evident compared to their background, the saliency map m is defined as

\[ m = g * (\bar{x} \circ \bar{x}) \]

where g is a Gaussian kernel aimed to blur the noise induced by the sign quantization, * denotes convolution and ∘ is the entrywise matrix product operator. To build the saliency map the standard deviation of the Gaussian kernel was set to 2.

For each image two regions were extracted. The first contained the pixels with saliency higher than a prefixed threshold and the second contained the pixels with saliency lower than it. Two different thresholds were applied to the saliency map: 0.5 and 0.7. Hence, for each image, two saliency maps and four histograms were extracted.
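A compact sketch of the image-signature saliency map and of the two thresholded regions follows, assuming NumPy is available. The DCT is built explicitly as an orthonormal matrix to keep the example self-contained; the rescaling of the map to [0, 1] before thresholding, the border handling of the blur, the function names and the toy image are assumptions not stated in the text.

```python
import numpy as np


def dct_matrix(n):
    """Orthonormal DCT-II matrix of size n x n."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    c = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2.0 * n))
    c[0, :] /= np.sqrt(2.0)
    return c


def signature_saliency(image, sigma=2.0):
    """Blur the squared inverse DCT of sign(DCT(image))."""
    rows, cols = image.shape
    c_r, c_c = dct_matrix(rows), dct_matrix(cols)
    signature = np.sign(c_r @ image.astype(float) @ c_c.T)   # sign of the 2D DCT
    x_bar = c_r.T @ signature @ c_c                          # inverse 2D DCT
    squared = x_bar * x_bar                                  # entrywise product
    # Separable Gaussian blur with standard deviation sigma.
    radius = int(3 * sigma)
    ax = np.arange(-radius, radius + 1)
    g = np.exp(-(ax ** 2) / (2.0 * sigma ** 2))
    g /= g.sum()

    def blur(v):
        return np.convolve(np.pad(v, radius, mode="edge"), g, mode="valid")

    m = np.apply_along_axis(blur, 0, np.apply_along_axis(blur, 1, squared))
    return (m - m.min()) / (m.max() - m.min())   # assumed rescaling to [0, 1]


rng = np.random.default_rng(0)
toy = rng.random((48, 64))
toy[16:28, 20:36] += 2.0                         # bright blob as a toy foreground
saliency = signature_saliency(toy)
regions = {th: (saliency > th, saliency <= th) for th in (0.5, 0.7)}
```

Each threshold yields one pair of complementary regions, so the two thresholds give the four regions from which the four histograms are computed.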

Wavelet: The wavelet decomposition (see section 2.1.2) used four wavelets, and the horizontal, vertical and diagonal coefficient matrices were considered. These matrices were resized to the size of the original image and then the mean value of each matrix was calculated. Each image was divided into two regions whose pixels were respectively greater and smaller than the mean value.

The region-based methods, as well as the baselines, are summarized in the following Table 3.

Table 3. Region-based methods and baselines (BAS) used for comparison.

https://doi.org/10.1371/journal.pone.0149399.t003

2.1.4 Binarized Statistical Image Features.

The Bsif descriptor assigns an n-bit label to each pixel of a given image by exploiting a set of n linear filters. Given a neighborhood of l x l pixels and a set of n linear filters of the same size, the n-bit label to be assigned to the central pixel of the neighborhood is obtained by binarizing

\[ s = Wx \]

where x is the l² x 1 vector notation of the l x l neighborhood and W is an n x l² matrix representing the stack of the vector notations of the filters. In detail, the i-th digit of s is a function of the i-th linear filter w_i and it is expressed as

\[ s_i = w_i^{T} x \]

thus each bit of the Bsif code can be obtained as

\[ b_i = \begin{cases} 1 & \text{if } s_i > 0 \\ 0 & \text{otherwise} \end{cases} \]

The set of filters w_i is estimated by maximizing, through independent component analysis, the statistical independence of the filter responses s_i on a set of patches from natural images. In the original Bsif, the binarized feature b_i was obtained by setting b_i = 1 if s_i > th and b_i = 0 otherwise, where th = 0. We improved the stand-alone version of Bsif by combining different Bsif in two ensembles, Size_Bsif and Full_Bsif. Size_Bsif was obtained by varying the filter size = {3, 5, 7, 9, 11} (i.e. we used 5 different filter sizes). The second ensemble, Full_Bsif, was derived from Size_Bsif by also varying the threshold th used to binarize the image. In detail, we used the thresholds th = {-9, -6, -3, 0, 3, 6, 9} for each filter size, and the 35 SVMs trained with these Bsif-based descriptors (a different SVM was trained for each pair of size and threshold) were combined by sum rule.
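The construction of the 35 Full_Bsif descriptors can be sketched as follows, assuming NumPy is available. Random zero-mean filters stand in for the ICA-learned Bsif filters, and the code length n = 8, the border handling and the function names are assumptions; only the filter sizes and thresholds come from the text.

```python
from itertools import product

import numpy as np


def bsif_codes(image, filters, th=0.0):
    """n-bit code per pixel: bit i is 1 where the i-th filter response exceeds th."""
    n, l, _ = filters.shape
    pad = l // 2
    rows, cols = image.shape
    padded = np.pad(image.astype(float), pad, mode="edge")
    codes = np.zeros((rows, cols), dtype=np.int64)
    for i in range(n):
        s_i = np.zeros((rows, cols), dtype=float)
        for dy in range(l):
            for dx in range(l):
                s_i += filters[i, dy, dx] * padded[dy:dy + rows, dx:dx + cols]
        codes += (s_i > th).astype(np.int64) << i
    return codes


def bsif_histogram(image, filters, th=0.0):
    """Histogram of the Bsif codes, with one bin per possible n-bit label."""
    codes = bsif_codes(image, filters, th)
    return np.bincount(codes.ravel(), minlength=2 ** filters.shape[0])


rng = np.random.default_rng(0)
toy = rng.random((32, 32)) * 255        # stand-in for a grey level subwindow
n_bits = 8                              # assumed code length; the text only says n
sizes = [3, 5, 7, 9, 11]
thresholds = [-9, -6, -3, 0, 3, 6, 9]
filter_bank = {}
for l in sizes:
    f = rng.standard_normal((n_bits, l, l))   # random stand-ins, not ICA-learned
    filter_bank[l] = f - f.mean(axis=(1, 2), keepdims=True)
full_bsif = {(l, th): bsif_histogram(toy, filter_bank[l], th)
             for l, th in product(sizes, thresholds)}
```

Restricting the dictionary to the entries with th = 0 would correspond to the five descriptors of Size_Bsif.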

2.1.5 Multi-quinary coding.

Variants of the original LBP descriptor were proposed, based on modifications on the binarizing function s(x), originally defined in [21] as: where x = q_p −q_c, q_c represents the central pixel in the neighborhood and q_p each of the surrounding pixels.

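In [22], LTP was defined by encoding the same difference x with 3 values, by means of the threshold τ:

\[ s(x) = \begin{cases} 1 & \text{if } x \geq \tau \\ 0 & \text{if } -\tau \leq x < \tau \\ -1 & \text{if } x < -\tau \end{cases} \]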

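In [13], this approach was extended to LQP by introducing two thresholds τ₁ and τ₂ (τ₁ < τ₂), thus getting the quinary coding:

\[ s(x) = \begin{cases} 2 & \text{if } x \geq \tau_2 \\ 1 & \text{if } \tau_1 \leq x < \tau_2 \\ 0 & \text{if } -\tau_1 \leq x < \tau_1 \\ -1 & \text{if } -\tau_2 \leq x < -\tau_1 \\ -2 & \text{if } x < -\tau_2 \end{cases} \]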

These variants of the binary coding allow a lower sensitivity to noise, especially in near-uniform regions, and a higher level of granularity that allows capturing more textural features with respect to the original version. However, to compensate for the increased verbosity of the ternary and quinary codings, the ternary patterns are split into one positive and one negative binary pattern, according to the sign of their components, while the quinary patterns are split into four binary patterns, according to b_c(d):

\[ b_c(d) = \begin{cases} 1 & \text{if } d = c \\ 0 & \text{otherwise} \end{cases} \]

where c ∈ {-2, -1, 1, 2} and d represents a single digit of the quinary pattern. For instance, the first binary pattern results from c = 2, the second one from c = 1 and so on for c = -1 and c = -2.

After computing one histogram for each binary pattern, the partial histograms (two for LTP, four for LQP) are concatenated into a final histogram.
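The quinary coding and its split into four binary patterns can be sketched as follows. This is a minimal example, not the authors' code: the function names are illustrative, and the interval boundaries (which side of each threshold is inclusive) are an assumption, since conventions differ slightly between publications.

```python
def s_quinary(x, tau1, tau2):
    """Quinary code of the difference x = q_p - q_c (boundary convention assumed)."""
    if x >= tau2:
        return 2
    if x >= tau1:
        return 1
    if x >= -tau1:
        return 0
    if x >= -tau2:
        return -1
    return -2


def quinary_pattern(center, neighbors, tau1, tau2):
    """Quinary pattern of one neighborhood: one digit per surrounding pixel."""
    return [s_quinary(q_p - center, tau1, tau2) for q_p in neighbors]


def split_quinary(pattern):
    """Split a quinary pattern into four binary patterns, b_c(d) = 1 if d == c."""
    return {c: [1 if d == c else 0 for d in pattern] for c in (2, 1, -1, -2)}


def binary_code(bits):
    """Integer label of a binary pattern (first neighbor = least significant bit)."""
    return sum(bit << p for p, bit in enumerate(bits))


# Toy neighborhood: central pixel 10 and 8 surrounding pixels.
pattern = quinary_pattern(10, [13, 10, 4, 25, 9, 11, 0, 18], tau1=2, tau2=6)
codes = {c: binary_code(bits) for c, bits in split_quinary(pattern).items()}
print(pattern)  # [1, 0, -1, 2, 0, 0, -2, 2]
print(codes)    # {2: 136, 1: 1, -1: 4, -2: 64}
```

Each of the four integer labels feeds its own histogram, so one quinary pattern with 8 neighbors is described by four histograms of 256 bins rather than a single histogram of 5⁸ bins.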

Moreover, in [13], a multi-threshold version of LQP, namely multi-threshold LQP (MLQP), was proposed, using a set of 25 couples of thresholds (τ₁ = {1,3,5,7,9} and τ₂ = {τ₁+2, τ₁+4,…, τ₁+11}) and combining the 25 SVMs trained with the histograms. Usually in LBP, and in its variants, a circular neighborhood allows obtaining a rotation invariant descriptor. However, in some problems, anisotropy is an important source of information. To use the anisotropic structural information, several neighborhood shapes (such as parabola, ellipse and hyperbola) were used in [12] (Table 4).

Table 4. Loci of points defining the different neighborhood topologies.

For each geometric locus defined in [12], its formal definition and parameters are reported.

https://doi.org/10.1371/journal.pone.0149399.t004

In MLQP, the threshold selection is a critical task: in [13], we set the thresholds manually to get good performance on the studied datasets. The proposed thresholds proved stable enough for the RPE classification problem as well. The performance of MLQP was enhanced by building a large set of LQP descriptors, coupling the set of thresholds with the geometric loci presented in [12] and summarized in Table 4.

All the loci of points (with the exception of the circle) were rotated by β = {0°, 45°, 90°, 135°} to catch the anisotropic structural information according to different orientations as in Fig 7. The flowchart of the quinary coding and the use of the geometric loci is reported in Fig 8.

Fig 7. The different neighborhood topologies.

From left to right in row 1: circle, ellipse, parabola, hyperbola and spiral. We represented the central pixel of the neighborhood (green) and the points forming the neighborhood (red). In rows 2, 3 and 4 the different rotation angles β are represented.

https://doi.org/10.1371/journal.pone.0149399.g007

Fig 8. Flowchart of the quinary coding and usage of the geometric loci.

https://doi.org/10.1371/journal.pone.0149399.g008

Afterwards, the Sequential Forward Floating Selection (SFFS) was applied, using the training data for selecting only a single subset of the MLQP descriptors.

SFFS and its predecessor Sequential Forward Selection (SFS) are bottom-up searches that sequentially select a subset of features from the original set of candidates in order to find an optimal subset.

Starting from the empty subset S₀, SFS sequentially adds the k-th feature, i.e. the one maximizing the objective function when combined with the subset S_{k−1} of the previously selected k−1 features, thus getting the current subset S_k. However, the main drawback is that the selected features cannot be reevaluated and discarded after the addition of a new feature.

SFFS [47] improves SFS by carrying out backward steps after the inclusion of a new feature, as long as the objective function rises. For instance, after the k-th step forward, i.e. the selection of the k-th feature, each feature in S_k is removed in turn from the subset to get a smaller subset, whose performance is compared with that of S_{k−1}. If the smaller subset results in a greater objective function than S_{k−1}, then it replaces S_{k−1}.

We used SFFS as a feature selection method, where each feature was assigned a couple of thresholds and a geometric locus, to find the most useful (thresholds, locus) sets for MLQP. Therefore, we selected a set of MLQP descriptors, where each descriptor was used to train a given SVM: the objective function of SFFS was the maximization of the area under the ROC curve (obtained combining by sum rule the set of SVMs) using an internal 10-fold cross validation in the training data. The same procedures (thresholds, geometric loci and feature selection) were used also to extend the Local Configuration Pattern (LCP) descriptor to its multi-threshold quinary version MLCP.
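The forward and floating backward steps described above can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the code used in the experiments: the objective is an arbitrary callable (in this work it would be the cross-validated AUC of the sum-rule fusion of the SVMs, which is not reproduced here), and the candidate names and the toy objective are hypothetical.

```python
def sffs(candidates, objective, k_max):
    """Sequential Forward Floating Selection (simplified sketch, k_max >= 1).

    candidates : list of distinct items, e.g. (thresholds, locus) descriptors
    objective  : callable taking a list of items and returning a score to maximize
    Returns the (score, subset) pair with the highest score found.
    """
    selected = []
    best = {}  # best[k] = (score, subset) for the best subset of size k so far
    while len(selected) < k_max:
        remaining = [c for c in candidates if c not in selected]
        if not remaining:
            break
        # Forward step: add the candidate that maximizes the objective.
        added = max(remaining, key=lambda c: objective(selected + [c]))
        selected = selected + [added]
        score = objective(selected)
        if len(selected) not in best or score > best[len(selected)][0]:
            best[len(selected)] = (score, list(selected))
        # Floating backward steps: drop a feature while the reduced subset
        # beats the best subset of the same size found so far.
        while len(selected) > 2:
            dropped = max(
                selected,
                key=lambda c: objective([x for x in selected if x != c]))
            reduced = [x for x in selected if x != dropped]
            reduced_score = objective(reduced)
            if reduced_score > best[len(reduced)][0]:
                selected = reduced
                best[len(reduced)] = (reduced_score, list(reduced))
            else:
                break
    return max(best.values(), key=lambda item: item[0])


# Toy objective: "a" is the best single item but interacts badly with "b" and "c".
GAIN = {"a": 50, "b": 40, "c": 40, "d": 5}


def toy_objective(subset):
    score = sum(GAIN[x] for x in subset)
    if "a" in subset:
        score -= 30 * sum(1 for x in subset if x in ("b", "c"))
    return score


best_score, best_subset = sffs(list(GAIN), toy_objective, k_max=3)
print(best_score, best_subset)  # 85 ['b', 'c', 'd']
```

In the toy run, "a" is selected first and later discarded by a backward step once "b" and "c" are both in the subset, which plain SFS could not do.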

2.1.6 Color descriptors.

For the RPE dataset only (Section 2.2), we used an additional set of descriptors based not on texture but on color (COLORS). COLORS consists of the concatenation of statistics computed on the three channels of a color image: mean, homogeneity, standard deviation, third, fourth and fifth moments, and the marginal histograms (8 bins for each channel) [48,49].
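A rough Python sketch of such a per-channel color descriptor is given below. It is only indicative: homogeneity is left out because its definition is not given in this section, the moments are assumed to be central moments, the histograms are assumed to be normalized, and all function names are illustrative.

```python
import math


def channel_stats(values, bins=8, max_level=255):
    """Mean, standard deviation, 3rd-5th central moments and an 8-bin histogram."""
    n = len(values)
    mean = sum(values) / n

    def central(k):
        return sum((v - mean) ** k for v in values) / n

    std = math.sqrt(central(2))
    hist = [0] * bins
    for v in values:
        hist[min(int(v * bins / (max_level + 1)), bins - 1)] += 1
    hist = [h / n for h in hist]  # normalized marginal histogram (assumption)
    return [mean, std, central(3), central(4), central(5)] + hist


def colors_descriptor(pixels):
    """Concatenate the statistics of the three channels; pixels are (R, G, B) tuples."""
    features = []
    for channel in range(3):
        features += channel_stats([p[channel] for p in pixels])
    return features


features = colors_descriptor([(0, 0, 0), (255, 255, 255)])
print(len(features))  # 39 = 3 channels x (5 statistics + 8 bins)
```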

2.2 The RPE dataset

2.2.1 Cell culture.

Two hESC lines (Regea 08/023; 46,XY, Regea 08/017; 46,XX) [50] and one human induced pluripotent stem cell (hiPSC) line (UTA.04511.WTS 46,XY) [51] were used for this study. Cell lines were cultured on top of mitomycin-treated (10 μg/ml, Sigma-Aldrich), i.e. mitotically inactivated, human foreskin fibroblast feeder cells (CRL-2429™, ATCC, Manassas, VA, USA). The undifferentiated cells were cultured as in Sorkio et al. [52] and, after one week of culture, the differentiation was induced by reducing the KO-SR concentration to 15%, removing the bFGF and commencing the floating culture as previously described in Vaajasaari et al. [53]. Floating aggregates were fed thrice a week and grown for 70–195 days. The pigmented areas of floating aggregates were manually dissected, dissociated with 1x Trypsin-EDTA and replated on collagen IV from human placenta (5 μg/cm², Sigma-Aldrich). Adherently cultured cells were imaged for the fusiform morphology after 8 days (range 6–9 days), for the epithelioid morphology after 9 days (range 8–9) and for the cobblestone morphology after 19 days (range 17–24) of culturing.

2.2.2 Ethical issues.

The National Authority for Medicolegal Affairs Finland has approved our research with human embryos (Decision number 1426/32/300/05). We also have a supportive statement from the local ethics committee of the Pirkanmaa Hospital District, Finland, to derive and expand hESC lines for research purposes (R05116). The local ethics committee of the Pirkanmaa Hospital District has given a supportive statement to generate iPSC lines for research purposes (R11028) and to use them for ophthalmic research (R14023). No new hESC or hiPSC lines were generated in this study.

2.2.3 Image acquisition.

The cell culture images were acquired for analysis with the same settings (25–125 ms exposure time, 2560 x 1920 pixels, dynamic contrast and autowhite balance) from cell cultures using a Nikon Eclipse TE200S phase-contrast microscope (Nikon Instruments Europe B.V., Amstelveen, Netherlands) with the 20x objective and Ph1 phase contrast. Cell imaging parameters are described in Table 5.

Table 5. Image acquisition parameters.

https://doi.org/10.1371/journal.pone.0149399.t005

2.2.4 Building the RPE dataset.

Each acquired image was divided into 16 subwindows, which were manually labeled into 4 classes by two trained operators; samples that were particularly difficult to label were inspected by a specialist. Subwindows containing clutter, out-of-focus elements or just background were discarded. The inclusion criteria and examples are shown for each class in Table 6 and in Fig 9.

Fig 9. Illustrative images of the RPE maturation stages (classes).

From left to right: fusiform, epithelioid, cobblestone and mixed (Fusiform and Epithelioid).

https://doi.org/10.1371/journal.pone.0149399.g009

Table 6. Class properties used for building the ground truth.

https://doi.org/10.1371/journal.pone.0149399.t006

In Fig 9, the four RPE classes are represented from left to right: the fusiform cell type, the epithelioid with its characteristic globular shape, the final maturation stage cobblestone and a mixed class example with an epithelioid cell in the middle of the image and, in the left side, a cluster of fusiform cells. The final dataset includes a total of 1862 subwindows: the number of subwindows per class is reported in Table 6. Before using the RPE images, they were converted to gray scale by means of the standard MATLAB (The MathWorks, Inc., Natick, Massachusetts, United States) function rgb2gray.
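The dataset preparation steps can be sketched in Python as follows. The luminance weights are those documented for the MATLAB rgb2gray function; the 4 × 4 layout of the 16 subwindows is an assumption (the text only states their number), and the function names are illustrative.

```python
def rgb_to_gray(pixel):
    """Weighted sum used by MATLAB's rgb2gray (before rounding to the image type)."""
    r, g, b = pixel
    return 0.2989 * r + 0.5870 * g + 0.1140 * b


def subwindow_boxes(width=2560, height=1920, cols=4, rows=4):
    """Boxes (left, top, right, bottom) of a regular grid; the 4 x 4 layout is assumed."""
    w, h = width // cols, height // rows
    return [(c * w, r * h, (c + 1) * w, (r + 1) * h)
            for r in range(rows) for c in range(cols)]


boxes = subwindow_boxes()
print(len(boxes), boxes[0], boxes[-1])
# 16 (0, 0, 640, 480) (1920, 1440, 2560, 1920)
```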

2.3 Validation in other datasets

To validate some of the proposed variants of texture descriptors and the system for RPE classification, we also ran several comparisons on other datasets. As the testing protocol, we used 5-fold cross validation, except for the VIR dataset, for which the 10-fold validation protocol was provided by the original authors.

The following datasets were used:

PAP: this dataset [54] contains 917 images unevenly distributed among 7 classes of cells, acquired during Pap tests for the diagnosis of cervical cancer. The dataset is available upon request to Loris Nanni [nanni@dei.unipd.it];
VIR: this dataset [55] contains 1500 images, evenly divided into 10 classes, of viruses extracted using negative stain transmission electron microscopy. The 10-fold validation protocol shared by the authors was used. The mask for background subtraction was not used and the features were extracted from the whole images. The dataset is available at http://www.cb.uu.se/~gustaf/virustexture/;
HI: this Histopathology dataset [56] is composed of 2828 images from different organs, unevenly distributed among 4 classes, representative of the four fundamental tissues (connective, epithelial, muscular, and nervous). The dataset is available upon request to Loris Nanni [nanni@dei.unipd.it];
BR: this dataset [57] is a subset of the digital database for screening mammography [58] and contains 1394 images of breast tissue, 810 control, 273 malignant and 311 benign breast cancers. The dataset is available upon request to Geraldo Braz Junior [ge.braz@gmail.com];
PR: this dataset, reported in [59], contains 349 proteins, divided into DNA-binding (118 samples) and non-DNA-binding (231 samples). From the 3D tertiary structure of each protein, its 2D distance matrix was computed (considering only atoms that belong to the protein backbone) and used to extract texture features. The dataset is available upon request to Loris Nanni [nanni@dei.unipd.it];
CHO: this cell dataset [24] contains 327 fluorescence microscopy images of Chinese Hamster Ovary cells, distributed among 5 classes. The dataset is available at http://ome.grc.nia.nih.gov/iicbu2008/hela/index.html#cho;
HeLa: the 2D HeLa dataset [23] consists of 862 single-cell images, divided into 10 staining classes, from fluorescence microscope acquisitions on HeLa cells. The dataset is available at http://ome.grc.nia.nih.gov/iicbu2008/hela/index.html;
LE: the LOCATE ENDOGENOUS mouse sub-cellular organelles dataset [60] contains 502 images, unevenly distributed among 10 classes, of endogenous proteins or features of specific organelles. The dataset is available at http://locate.imb.uq.edu.au/;
LT: the LOCATE TRANSFECTED mouse sub-cellular organelles dataset [60] contains 553 images, unevenly distributed in 11 classes, of fluorescence- or epitope-tagged protein transiently expressed in specific organelles. The dataset is available at http://locate.imb.uq.edu.au/;
RNAi: this dataset contains 200 fluorescence microscopy images, evenly distributed among 10 classes, of fly cells subjected to a set of gene-knockdowns using RNAi and stained with DAPI to visualize their nuclei. The dataset is available at http://ome.grc.nia.nih.gov/iicbu2008/rnai/index.html.

Results and Discussion

3.1 Experimental results for the RPE dataset

The chosen performance indicator was the area under the ROC curve (AUC), as it is more reliable than accuracy. AUC summarizes the ROC curve in a single scalar value. In multi-class problems, the one-versus-all approach was used: each of the m classes was considered in turn as “positive” and the remaining m−1 classes as “negative”, thus obtaining m partial AUCs. Finally, the global AUC was computed as the average of the partial AUCs.
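The one-versus-all averaging can be illustrated with the following Python sketch. It is a minimal example with made-up scores, and it computes each partial AUC through its rank interpretation (the probability that a positive sample receives a higher score than a negative one, with ties counted as one half); the function names are illustrative.

```python
def auc(scores, labels):
    """AUC of a two-class problem (labels: 1 = positive, 0 = negative)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def one_vs_all_auc(score_matrix, true_classes, m):
    """Global AUC as the average of the m partial one-versus-all AUCs."""
    partial = []
    for c in range(m):
        labels = [1 if t == c else 0 for t in true_classes]
        partial.append(auc([row[c] for row in score_matrix], labels))
    return sum(partial) / m, partial


# Four samples, three classes: one row of classifier scores per sample.
scores = [[0.8, 0.1, 0.1], [0.2, 0.7, 0.1], [0.3, 0.3, 0.4], [0.6, 0.3, 0.1]]
global_auc, partial_aucs = one_vs_all_auc(scores, [0, 1, 2, 1], m=3)
print(partial_aucs)  # [1.0, 0.875, 1.0]
```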

As the testing protocol, a 10-fold cross validation was used. Of note, the 10-fold split was applied at image level, so all the sub-windows of a given image belonged either to the training set or to the test set.
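A split applied at image level can be sketched as follows. The round-robin assignment of images to folds is an assumption made to keep the example deterministic (the actual assignment used in the experiments is not described here), and the identifiers are hypothetical.

```python
def image_level_folds(image_ids, n_folds=10):
    """Fold index of each subwindow, decided by its parent image.

    image_ids : one image identifier per subwindow
    """
    unique_images = sorted(set(image_ids))
    fold_of_image = {img: i % n_folds for i, img in enumerate(unique_images)}
    return [fold_of_image[img] for img in image_ids]


# 20 hypothetical images, 16 subwindows each.
image_ids = ["img%02d" % (i // 16) for i in range(20 * 16)]
folds = image_level_folds(image_ids)

# Subwindows of the same image never end up in different folds.
for img in set(image_ids):
    assert len({f for f, i in zip(folds, image_ids) if i == img}) == 1
```

Splitting at subwindow level instead would let neighboring regions of the same image appear in both training and test sets, which tends to inflate the measured performance.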

Of note, when referring to a texture descriptor combined with preprocessing/region-based approaches, we use the notation preprocessing(descriptor). For example, Comb(RICLBP) means RICLBP combined with the various preprocessing Wa, OR and Ga applied to the original image and the two images obtained by MRS (see Table 2).

In the first test, reported in Table 7, several texture descriptors and five different ensembles (the last five rows) were compared. The methods named A+B are the fusions by sum rule between the methods A and B. The LBP-based approaches use uniform bins, except in the presence of the suffix -ri, which denotes rotation invariant bins. From the results reported in Table 7, it is clear that the best performances were obtained using descriptor ensembles. The best performance was reached by the ensemble RICLBP+MLPQens+MLCP, while the best stand-alone approach was RICLBP. We used an already published method [61] for assessing the difference between two approaches on the same dataset. MLCP, i.e. the multi-threshold quinary ensemble of LCP built according to [13] (see section 2.1.5), outperforms RICLBP with a probability of 80% using a one-sided test of significance with p = 0.05. However, the proposed ensembles RICLBP + MLPQens + MLCP and Comb(RICLBP) + Full_Bsif + MLPQens + MLCP outperform each stand-alone approach with a probability of 95%, using a one-sided test of significance with a p-value of 0.05.
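The fusion by sum rule can be sketched in Python as follows. This is a minimal illustration: each classifier is assumed to output one score per class for every sample, on comparable scales (any score normalization applied before the fusion is not covered here), and the function names and scores are made up.

```python
def sum_rule(score_sets):
    """Fuse classifiers by summing their per-class scores, sample by sample.

    score_sets : one score matrix per classifier, each with one row per sample
                 and one column per class
    """
    n_samples = len(score_sets[0])
    n_classes = len(score_sets[0][0])
    return [[sum(scores[i][c] for scores in score_sets)
             for c in range(n_classes)]
            for i in range(n_samples)]


def predict(fused_scores):
    """Predicted class of each sample: the one with the highest fused score."""
    return [max(range(len(row)), key=lambda c: row[c]) for row in fused_scores]


# Two hypothetical classifiers A and B, one sample, two classes.
scores_a = [[0.5, 0.25]]
scores_b = [[0.125, 0.5]]
fused = sum_rule([scores_a, scores_b])
print(fused, predict(fused))  # [[0.625, 0.75]] [1]
```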

Table 7. Performance (AUC) comparison among different texture descriptors.

https://doi.org/10.1371/journal.pone.0149399.t007

We also tested the effectiveness of COLORS, obtaining an AUC of 89.10%.

In Table 8, different approaches based on Bsif were compared. The method named Baseline represents the standard stand-alone Bsif with size = 7. Moreover, Full_Bsif was coupled with the best previous ensemble (i.e. RICLBP+MLPQens+MLCP), slightly increasing the performance.

Table 8. Performance (AUC) comparison among different Bsif-based approaches.

https://doi.org/10.1371/journal.pone.0149399.t008

The performance of the best descriptors was presented in Table 9, coupled with the preprocessing approaches detailed in section 2.2 and Table 2.

Table 9. Performance (AUC) obtained coupling the best texture descriptors with different preprocessing methods.

https://doi.org/10.1371/journal.pone.0149399.t009

Notice that all the preprocessing approaches are coupled only with stand-alone texture descriptors due to the high computation time. The performances of all the descriptors improved when the features were extracted from Comb, compared with the performance obtained from O.

The region-based methods, proposed in section 2.3 and Table 3, were compared in Table 10.

Table 10. Performances (AUC) of the region-based approaches.

https://doi.org/10.1371/journal.pone.0149399.t010

It is clear that a descriptor applied to a set of processed images drastically outperformed the same descriptor obtained using only the original image. However, only some methods’ performances benefited from the different preprocessing methods. The baseline multi-quinary approach (i.e. all the descriptors extracted from the circle neighborhood) and the effect of SFFS are reported in Table 11. SFFS supervised selection improved MLQP but had no positive effect on MLCP.

Table 11. Performance (AUC) of the multi-quinary approaches.

https://doi.org/10.1371/journal.pone.0149399.t011

The final ensemble was created by sum rule among Comb(RICLBP), Full_Bsif, MLPQens and MLCP. It obtained an AUC of 86.49%. As shown in Table 6, the RPE dataset was unbalanced towards the cobblestone class; consequently, a single 10-fold cross validation risks creating a training set that is not representative of the classes with fewer subwindows. Therefore, we ran a higher-performance but more computationally demanding protocol: the leave-one-image-out, which consists in leaving out one full image (i.e. all the 16 subwindows of that image) in each round. We tested this protocol on our best ensemble Comb(RICLBP) + MLPQens + MLCP + Full_Bsif, whose performance increased from 86.49% to 91.98%. A further improvement was obtained by the fusion by sum rule between this ensemble and COLORS, obtaining an AUC of 95.00%.
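The leave-one-image-out protocol can be sketched as a generator of train/test index lists, one round per image. This is a minimal Python example with hypothetical image identifiers; in practice some subwindows of an image may have been discarded, so the held-out group can contain fewer than 16 elements.

```python
def leave_one_image_out(image_ids):
    """Yield (held_out_image, train_indices, test_indices), one round per image."""
    for held_out in sorted(set(image_ids)):
        test = [i for i, img in enumerate(image_ids) if img == held_out]
        train = [i for i, img in enumerate(image_ids) if img != held_out]
        yield held_out, train, test


# Three hypothetical images with 16 subwindows each.
image_ids = [i // 16 for i in range(48)]
rounds = list(leave_one_image_out(image_ids))
print(len(rounds))  # 3
```

The number of rounds equals the number of images, which explains the higher computational cost compared with a 10-fold split.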

3.2 Results with other datasets

Afterwards, the proposed ensemble of Bsif, the preprocessing applied before feature extraction and the region-based descriptors were validated with the other datasets.

Other tests, e.g. coupling MLCP with the selection of the geometric loci, were not performed due to their huge computational time.

As in the previous tests, the performance indicator was the AUC. Moreover, the experiments were statistically validated with the Wilcoxon signed rank test and the Bonferroni-Holm method.
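The Bonferroni-Holm step-down correction can be sketched in Python as follows. The p-values in the example are made up; in practice each would come from a Wilcoxon signed rank test on the paired per-dataset AUCs of two methods (for instance through a statistics library), a step that is not reproduced here.

```python
def holm_bonferroni(p_values, alpha=0.05):
    """Holm step-down procedure: one reject (True) / accept (False) flag per test."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, i in enumerate(order):
        # The smallest p-value is compared with alpha/m, the next with alpha/(m-1), ...
        if p_values[i] <= alpha / (m - rank):
            reject[i] = True
        else:
            break  # once one test fails, all larger p-values are accepted
    return reject


print(holm_bonferroni([0.01, 0.04, 0.03, 0.005]))
# [True, False, False, True]
```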

The performances of Bsif and of standard texture descriptors, as baseline, were compared in Table 12. The three best baseline methods were CLBP, RICLBP and LTP. LTP outperformed all the other baseline approaches (except RICLBP and CLBP) with a p-value of 0.05. Furthermore, there was no difference between the performance of RICLBP and LTP. Moreover, Full_Bsif outperformed Bsif, Size_Bsif and all the baseline approaches, including LTP, with a p-value of 0.05.

Table 12. Comparison of the performance (AUC) of standard texture descriptors and Bsif coding.

https://doi.org/10.1371/journal.pone.0149399.t012

In order to avoid reporting a massive amount of results, in the following we summarized only the results from the best performing baseline descriptors, i.e. RICLBP, LPQ and LTP.

The effect of preprocessing on the additional datasets was reported in Table 13. We also tested the ensemble of preprocessing O+Wa+OR, i.e. the sum rule among the preprocessing approaches O, Wa and OR applied to the original image and to the two images obtained by MRS. We can conclude that, among the stand-alone preprocessing, O is the best one and that the best approach is the ensemble O+Wa+OR, which outperformed the baseline approach O with a p-value of 0.05.

Table 13. AUC obtained using the preprocessing approaches and LTP, RICLBP and LPQ.

https://doi.org/10.1371/journal.pone.0149399.t013

The results reported in Table 14 showed that the region-based approaches outperformed, with a p-value of 0.05, the standard application of texture descriptors. In particular, compared to the baseline O, All+O improved the AUC (or did not perform worse) for all the datasets and for the three tested descriptors with a p-value of 0.05.

Table 14. AUC obtained using the region-based approaches and LTP, RICLBP and LPQ.

https://doi.org/10.1371/journal.pone.0149399.t014

Finally, to summarize the best techniques presented in this section, we created a further ensemble F where we combine Full_Bsif, O+Wa+OR(LTP), O+Wa+OR(RICLBP), O+Wa+OR(LPQ), All+O(LTP), All+O(RICLBP) and All+O(LPQ). F was compared in Table 15 to Full_Bsif: F performed better than Full_Bsif, or at least equally, on all the additional datasets with a p-value of 0.05.

Table 15. Comparison of Full_Bsif and the ensemble F of the best methods investigated in this section (AUC is reported).

https://doi.org/10.1371/journal.pone.0149399.t015

Conclusions

In this work, we assembled and tested many state-of-the-art texture descriptors and a large set of preprocessing methods for demanding image classification tasks, and applied them to a new and very specific biological problem, i.e. the automatic assessment of the maturation level of hPSC-RPE cells. This is a well-warranted problem, as RPE cells are planned to be used for implantation and as in vitro cell models for drug and disease modeling. In all these applications, the prerequisite for the maturation assessment is non-invasiveness, and consequently there is a need for label-free methods. Thus, analysis methods based only on phase contrast microscopy images are welcome.

The first aim of this work consisted in applying three new methods to create ensembles of texture descriptors (based on combinations of preprocessing techniques, region-based approaches and Bsif with different filter sizes and binarization thresholds) to find the most suitable descriptors for the texture-based classification of the considered datasets, in particular of the RPE dataset. A combination of different preprocessing techniques (i.e. Wa, OR and Ga applied at three different scales of representation obtained by MRS) boosted all the best performing descriptors (see Table 9), with the exception of WLD, for which MRS was not necessary. It is interesting to note that, while each descriptor obtained its best performance with a different preprocessing, the fusions Wa+OR+Ga and Comb improved on the single best preprocessing for all the descriptors. Similar improvements were obtained by the region-based methods, in particular by combining the region selection by Edge, Wavelet and Saliency (see Table 10). However, a global combination of preprocessing and region-based feature extraction did not provide significant improvements compared to using the two approaches individually (see Table 10, last column). Therefore, we chose to exploit only preprocessing, hence obtaining the new ensemble Comb(RICLBP) + Full_Bsif + MLPQens + MLCP. This approach reached an AUC of 86.49%, which is the best result observed on the RPE dataset with the 10-fold cross validation protocol.

The second aim of this work was to provide the methodological core for a software tool to quantitatively assess the level of development of hPSC-RPE cells, compared to the classification provided in [4,62]. Our study primarily shows that a computer vision system is able to classify the RPE cell maturation stage and, secondly, that it enables a correct and repeatable estimation of the maturation level on new images, a necessary step before using these specific cells and tissues, e.g. for drug testing. The most accurate ensemble, Comb(RICLBP) + Full_Bsif + MLPQens + MLCP, reached an AUC of 86.49%, confirming that image processing methods can be employed to classify the maturity of RPE in microscopy images. Moreover, we showed that with the higher-performance, but slower to validate, leave-one-image-out protocol, the same ensemble obtained an AUC over 91%.

The only related studies are [9] and [11]. Jiang et al. [9] reported a correlation between two specific morphological features (cell area and aspect ratio), two maturation stages (young, <61 days old vs old, ≥100 or 180 days old) and two genotypes (control vs rd10). However, the cell source was different (mouse eyes vs RPE cells derived from hPSCs), as were the maturation stages, since their focus was only on the cobblestone stage. In Kamao et al. [11], the degree of pigmentation (objective dPG, in the original publication) was assessed as the main marker of hPSC-RPE maturation, by means of a manual assessment of the RGB values from Photoshop's Info Palette, for single cells (obtained by manual segmentation) and cell groups. In spite of the findings, in particular that the objective dPG correlated with the RPE function, the technique of Kamao et al. [11] required user interaction, which might not be objective and is not suitable for a large number of images.

To prove the feasibility of the proposed methods not only on RPE images, but also for a wider range of biological applications, additional tests were performed on a selection of 10 datasets, ranging from diagnostic to microscopy images (transmission electron microscopy as well as fluorescence imaging). We observed that the best-performing configurations of the three new proposed approaches (namely region-based descriptors, the ensemble of preprocessing algorithms and the improved Bsif) provided good classification results, obtaining an average AUC greater than 95%. In particular, we compared all the tested/proposed ensemble approaches, and the best method turned out to be Full_Bsif, which outperformed all the other approaches with a p-value of 0.05. This result highlights the key role of these methods in improving the accuracy of many texture descriptors in the classification of different kinds of biological images. Of note, the effectiveness of Full_Bsif can be seen across all the tested datasets (see Table 12). By changing the filter size and the binarization threshold we obtain Size_Bsif and Full_Bsif, which work better than the baseline Bsif on all the tested datasets (see Tables 8 and 12). On the RPE dataset, Full_Bsif by itself obtains a lower result than RICLBP+MLPQens+MLCP (82.79% vs 84.60%); nevertheless, Full_Bsif obtains statistically higher performance on the additional datasets, thus being the best ensemble based on a single descriptor. Finally, we tested on the additional datasets a last ensemble F, built by gathering the best techniques of the aforementioned three new approaches: Full_Bsif, O+Wa+OR(LTP), O+Wa+OR(RICLBP), O+Wa+OR(LPQ), All+O(LTP), All+O(RICLBP) and All+O(LPQ). F outperformed Full_Bsif in six datasets and obtained an equal AUC in the other four. Due to the variety of the additional datasets, F represents our proposed ensemble to process a generic dataset, as well as a suggestion to other researchers for further studies on this topic.

Of note, the approaches investigated in section 3.2 showed better performances on the additional datasets than on the RPE dataset. This is due to the nature of each dataset, e.g. staining, microscopy technique, etc. The RPE dataset was acquired by directly imaging the cell cultures through a phase-contrast microscope. On the other hand, the additional datasets included more complex imaging techniques, such as transmission electron microscopy (VIR) or fluorescence imaging (CHO, HeLa, LE, LT, RNAi). Such techniques involve sample treatment (e.g. ultra-thin samples for electron transmission, staining with antibodies for immunofluorescence or other stains for histology images). Although the resulting image quality is better, which in turn improves the classification performance, such processing necessarily alters the samples and is time-consuming. Moreover, many datasets provided pre-segmented images, thus excluding the textural information from the background.

The main limitation of the RPE study is the strict standards we had to define and fulfill for the image acquisition, which are necessary to build a reliable dataset but demanding to implement in laboratory practice.

As future work, we will develop new methods to build region-based approaches for the classification of biological image datasets. Furthermore, we aim to use a heterogeneous ensemble of classifiers, instead of a stand-alone SVM, to improve the performance. Finally, to remove potentially confounding patterns, we plan to design an automatic method to remove the background from samples such as fusiform and epithelioid images.

In conclusion, in this paper we presented three methods to develop ensembles of texture descriptors, proving that specific preprocessing and ensembling techniques improve the performance of many state-of-the-art texture descriptors. Moreover, we automated the classification of the maturation stages of RPE cells by means of an ensemble of texture descriptors. Such methods were finally validated on a wide set of general biological image analysis problems.

Acknowledgments

Outi Heikkilä, Outi Melin, and Hanna Pekkanen are thanked for the skillful technical assistance.

Author Contributions

Conceived and designed the experiments: LN MP HS KJU JH. Performed the experiments: LN MP FLCS KJU. Analyzed the data: LN MP FLCS KJU. Contributed reagents/materials/analysis tools: HS KJU. Wrote the paper: LN MP FLCS HS KJU JH.

References

1. Strauss O. The Retinal Pigment Epithelium in Visual Function. Physiol Rev. 2005;85: 845–881. pmid:15987797
2. Burke JM, Skumatz CM, Irving PE, McKay BS. Phenotypic heterogeneity of retinal pigment epithelial cells in vitro and in situ. Exp Eye Res. 1996;62: 63–73. pmid:8674514
3. Rowland TJ, Blaschke AJ, Buchholz DE, Hikita ST, Johnson LV, Clegg DO. Differentiation of human pluripotent stem cells to retinal pigmented epithelium in defined conditions using purified extracellular matrix proteins. J Tissue Eng Regen Med. 2013;7: 642–653. pmid:22514096
4. McKay BS, Burke JM. Separation of Phenotypically Distinct Subpopulations of Cultured Human Retinal Pigment Epithelial Cells. Exp Cell Res. 1994;213: 85–92. pmid:7517370
5. Juuti-Uusitalo K, Delporte C, Grégoire F, Perret J, Huhtala H, Savolainen V, et al. Aquaporin expression and function in human pluripotent stem cell-derived retinal pigmented epithelial cells. Invest Ophthalmol Vis Sci. 2013;54: 3510–3519. pmid:23687169
6. Schwartz SD, Hubschman J-P, Heilwell G, Franco-Cardenas V, Pan CK, Ostrick RM, et al. Embryonic stem cell trials for macular degeneration: a preliminary report. Lancet. 2012;379: 713–720. pmid:22281388
7. Croze RH, Buchholz DE, Radeke MJ, Thi WJ, Hu Q, Coffey PJ, et al. ROCK Inhibition Extends Passage of Pluripotent Stem Cell-Derived Retinal Pigmented Epithelium. Stem Cells Transl Med. 2014;3: 1066–1078. pmid:25069775
8. Singh R, Phillips MJ, Kuai D, Meyer J, Martin JM, Smith MA, et al. Functional analysis of serially expanded human iPS cell-derived RPE cultures. Investig Ophthalmol Vis Sci. 2013;54: 6767–6778.
9. Jiang Y, Qi X, Chrenek MA, Gardner C, Boatright JH, Grossniklaus HE, et al. Functional principal component analysis reveals discriminating categories of retinal pigment epithelial morphology in mice. Invest Ophthalmol Vis Sci. 2013;54: 7274–7283. pmid:24114543
10. Vugler A, Carr A-J, Lawrence J, Chen LL, Burrell K, Wright A, et al. Elucidating the phenomenon of HESC-derived RPE: anatomy of cell genesis, expansion and retinal transplantation. Exp Neurol. 2008;214: 347–361. pmid:18926821
11. Kamao H, Mandai M, Wakamiya S, Ishida J, Goto K, Ono T, et al. Objective Evaluation of the Degree of Pigmentation in Human Induced Pluripotent Stem Cell-Derived RPE. Invest Ophthalmol Vis Sci. 2014;55: 8309–8318. pmid:25389202
12. Nanni L, Lumini A, Brahnam S. Local binary patterns variants as texture descriptors for medical image analysis. Artif Intell Med. 2010;49: 117–125. pmid:20338737
13. Paci M, Nanni L, Lahti A, Aalto-Setala K, Hyttinen J, Severi S. Non-Binary Coding for Texture Descriptors in Sub-Cellular and Stem Cell Image Classification. Curr Bioinform. 2013;8: 208–219.
14. Vécsei A, Amann G, Hegenbart S, Liedlgruber M, Uhl A. Automated Marsh-like classification of celiac disease in children using local texture operators. Comput Biol Med. 2011;41: 313–325. pmid:21513927
15. Nosaka R, Fukui K. HEp-2 cell classification using rotation invariant co-occurrence among local binary patterns. Pattern Recognit. Elsevier; 2014;47: 2428–2436.
16. Haralick R, Shanmugam K, Dinstein I. Textural features for image classification. IEEE Trans Syst Man Cybern. 1973;SMC-3: 610–621.
17. Hu S, Xu C, Guan W, Tang Y, Liu Y. Texture feature extraction based on wavelet transform and gray-level co-occurrence matrices applied to osteosarcoma diagnosis. Biomed Mater Eng. 2014;24: 129–143. pmid:24211892
18. Fu JJC, Yu Y-W, Lin H-M, Chai J-W, Chen CC-C. Feature extraction and pattern classification of colorectal polyps in colonoscopic imaging. Comput Med Imaging Graph. Elsevier Ltd; 2014;38: 267–275.
19. Strzelecki M, Materka A, Drozdz J, Krzeminska-Pakula M, Kasprzak JD. Classification and segmentation of intracardiac masses in cardiac tumor echocardiograms. Comput Med Imaging Graph. 2006;30: 95–107. pmid:16476535
20. Loukas C, Kostopoulos S, Tanoglidi A, Glotsos D, Sfikas C, Cavouras D. Breast cancer characterization based on image classification of tissue sections visualized under low magnification. Comput Math Methods Med. 2013;2013: 829461. pmid:24069067
21. Ojala T, Pietikäinen M, Mäenpää T. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Trans Pattern Anal Mach Intell. Los Alamitos, CA, USA: IEEE Computer Society; 2002;24: 971–987.
22. Tan X, Triggs B. Enhanced local texture feature sets for face recognition under difficult lighting conditions. IEEE Trans Image Process. 2010;19: 1635–1650.
23. Boland MV, Murphy RF. A neural network classifier capable of recognizing the patterns of all major subcellular structures in fluorescence microscope images of HeLa cells. Bioinformatics. 2001;17: 1213–1223. pmid:11751230
24. Boland MV, Markey MK, Murphy RF. Automated recognition of patterns characteristic of subcellular structures in fluorescence microscopy images. Cytometry. 1998;33: 366–375. pmid:9822349
25. Vu N, Nguyen T, Garcia C. Improving texture categorization with biologically-inspired filtering. Image Vis Comput. 2014;32: 424–436.
26. Abdesselam A. Improving Local Binary Patterns Techniques by Using Edge Information. Lect Notes Softw Eng. 2013;1: 360–363.
27. Mallat S. A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Mach Intell. 1989;11: 674–693.
28. Daugman J. Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. J Opt Soc Am A. 1985;2: 1160–1169. pmid:4020513
29. dos Santos FLC, Paci M, Nanni L, Brahnam S, Hyttinen J. Computer vision for virus image classification. Biosyst Eng. 2015;
30. Kannala J, Rahtu E. BSIF: Binarized statistical image features. Pattern Recognition (ICPR), 2012 21st International Conference on. 2012. pp. 1363–1366.
31. Zhao G, Ahonen T, Matas J, Pietikäinen M. Rotation-invariant image and video description with local binary pattern features. IEEE Trans Image Process. 2012;21: 1465–1477. pmid:22086501
32. Ojansivu V, Heikkilä J. Blur Insensitive Texture Classification Using Local Phase Quantization. Lecture Notes in Computer Science. Springer Berlin Heidelberg; 2008. pp. 236–243.
33. Dalal N, Triggs B. Histograms of Oriented Gradients for Human Detection. 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05). IEEE; 2005. pp. 886–893.
34. Nanni L, Brahnam S, Lumini A. Combining different local binary pattern variants to boost performance. Expert Syst Appl. 2011;38: 6209–6216.
35. Strandmark P, Ulen J, Kahl F. HEp-2 staining pattern classification. Pattern Recognition (ICPR), 2012 21st International Conference on. 2012. pp. 33–36.
36. Guo Y, Zhao G, Pietikäinen M. Texture Classification using a Linear Configuration Model based Descriptor. Proceedings of the British Machine Vision Conference 2011. British Machine Vision Association; 2011. pp. 119.1–119.10.
37. Fathi A, Naghsh-Nilchi AR. Noise tolerant local binary pattern operator for efficient texture analysis. Pattern Recognit Lett. Elsevier B.V.; 2012;33: 1093–1100.
38. Ylioinas J, Hadid A, Guo Y, Pietikäinen M. Efficient image appearance description using dense sampling based local binary patterns. Computer Vision—ACCV 2012 Lecture Notes in Computer Science. 2013. pp. 375–388.
39. Guo Z, Zhang L, Zhang D. A Completed Modeling of Local Binary Pattern Operator for Texture Classification. IEEE Trans Image Process. 2010;19: 1657–1663.
40. Nosaka R, Ohkawa Y, Fukui K. Feature Extraction Based on Co-occurrence of Adjacent Local Binary Patterns. Proceedings of the 5th Pacific Rim Conference on Advances in Image and Video Technology—Volume Part II. Berlin, Heidelberg: Springer-Verlag; 2012. pp. 82–91.
41. Chen J, Shan S, He C, Zhao G, Pietikäinen M, Chen X, et al. WLD: a robust local image descriptor. IEEE Trans Pattern Anal Mach Intell. 2010;32: 1705–1720. pmid:20634562
42. Nanni L, Brahnam S, Lumini A, Barrier T. Ensemble of Local Phase Quantization Variants with Ternary Encoding. In: Brahnam S, Jain LC, Nanni L, Lumini A, editors. Local binary patterns: New variants and new applications. Springer; 2014. pp. 177–188.
43. Vu N. Exploring patterns of gradient orientations and magnitudes for face recognition. IEEE Trans Inf Forensics Secur. 2013;8: 295–304.
44. Manthalkar R, Biswas P, Chatterji B. Rotation invariant texture classification using even symmetric Gabor filters. Pattern Recognit Lett. 2003;24: 2061–2068.
45. Baddeley RJ, Tatler BW. High frequency edges (but not contrast) predict where we fixate: A Bayesian system identification analysis. Vision Res. 2006;46: 2824–2833. pmid:16647742
46. Hou X, Harel J, Koch C. Image Signature: Highlighting Sparse Salient Regions. IEEE Trans Pattern Anal Mach Intell. 2011;34: 194–201. pmid:21788665
47. Pudil P, Novovičová J, Kittler J. Floating search methods in feature selection. Pattern Recognit Lett. 1994;15: 1119–1125.
48. Bianconi F, Fernández A, González E, Saetta SA. Performance analysis of colour descriptors for parquet sorting. Expert Syst Appl. Elsevier Ltd; 2012;40: 1636–1644.
49. Nanni L, Munaro M, Ghidoni S, Menegatti E, Brahnam S. Ensemble of different approaches for a reliable person re-identification system. Appl Comput Informatics. King Saud University; 2015;
50. Skottman H. Derivation and characterization of three new human embryonic stem cell lines in Finland. In Vitro Cell Dev Biol Anim. 2010;46: 206–209. pmid:20177999
51. Toivonen S, Ojala M, Hyysalo A, Ilmarinen T, Rajala K, Pekkanen-Mattila M, et al. Comparative analysis of targeted differentiation of human induced pluripotent stem cells (hiPSCs) and human embryonic stem cells reveals variability associated with incomplete transgene silencing in retrovirally derived hiPSC lines. Stem Cells Transl Med. 2013;2: 83–93. pmid:23341440
52. Sorkio A, Hongisto H, Kaarniranta K, Uusitalo H, Juuti-Uusitalo K, Skottman H. Structure and barrier properties of human embryonic stem cell-derived retinal pigment epithelial cells are affected by extracellular matrix protein coating. Tissue Eng Part A. 2014;20: 622–634. pmid:24044751
53. Vaajasaari H, Ilmarinen T, Juuti-Uusitalo K, Rajala K, Onnela N, Narkilahti S, et al. Toward the defined and xeno-free differentiation of functional human pluripotent stem cell-derived retinal pigment epithelial cells. Mol Vis. 2011;17: 558–575. pmid:21364903
54. Jantzen J, Norup J, Dounias G, Bjerregaard B. Pap-smear benchmark data for pattern classification. Nature inspired Smart Information Systems (NiSIS), EU co-ordination action Albufeira, Portugal: NiSIS. 2005. pp. 1–9.
55. Kylberg G, Uppström M, Sintorn I. Virus texture analysis using local binary patterns and radial density profiles. 18th Iberoamerican Congress on Pattern Recognition (CIARP). Martin S, Kim S-W; 2011. pp. 573–580.
56. Cruz-Roa A, Caicedo JC, González FA. Visual pattern mining in histology image collections using bag of features. Artif Intell Med. Elsevier B.V.; 2011;52: 91–106.
57. Braz G Junior, Cardoso de Paiva A, Corrêa Silva A, Cesar Muniz de Oliveira A. Classification of breast tissues using Moran’s index and Geary's coefficient as texture signatures and SVM. Comput Biol Med. Elsevier; 2009;39: 1063–1072.
58. Heath M, Bowyer K, Kopans D, Kegelmeyer P Jr, Moore R, Chang K, et al. Current Status of the Digital Database for Screening Mammography. In: Karssemeijer N, Thijssen M, Hendriks J, Erning L, editors. Digital Mammography SE—75. Springer Netherlands; 1998. pp. 457–460.
59. Nanni L, Shi J-Y, Brahnam S, Lumini A. Protein classification using texture descriptors extracted from the protein backbone image. J Theor Biol. Elsevier; 2010;264: 1024–1032.
60. Fink JL, Aturaliya RN, Davis MJ, Zhang F, Hanson K, Teasdale MS, et al. LOCATE: a mouse protein subcellular localization database. Nucleic Acids Res. 2006;34: D213–D217. pmid:16381849
61. Hanley JA, McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. Radiological Society of North America; 1982;143: 29–36.
62. Juuti-Uusitalo K, Vaajasaari H, Ryhänen T, Narkilahti S, Suuronen R, Mannermaa E, et al. Efflux protein expression in human stem cell-derived retinal pigment epithelial cells. PLoS One. 2012;7: e30089. pmid:22272278
