Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Characteristic Gene Selection Based on Robust Graph Regularized Non-Negative Matrix Factorization

Published: 01 November 2016 Publication History

Abstract

Many methods have been considered for gene selection and analysis of gene expression data. Nonetheless, there still exists the considerable space for improving the explicitness and reliability of gene selection. To this end, this paper proposes a novel method named robust graph regularized non-negative matrix factorization for characteristic gene selection using gene expression data, which mainly contains two aspects: Firstly, enforcing L21-norm minimization on error function which is robust to outliers and noises in data points. Secondly, it considers that the samples lie in low-dimensional manifold which embeds in a high-dimensional ambient space, and reveals the data geometric structure embedded in the original data. To demonstrate the validity of the proposed method, we apply it to gene expression data sets involving various human normal and tumor tissue samples and the results demonstrate that the method is effective and feasible.

References

[1]
H. B. Saber and M. Elloumi, "DNA microarray data analysis: A new survey on biclustering," Int. J. Comput. Biol., vol. 4, no. 1, pp. 21-37, 2015.
[2]
Y. Li and Z. Zhang, "Computational Biology in microRNA," Wiley Interdisciplinary Rev., RNA, vol. 6, pp. 435-452, 2015.
[3]
B. Bayar, N. Bouaynaya, and R. Shterenberg, "Probabilistic non-negative matrix factorization: Theory and application to microarray data analysis," J. Bioinformat. Comput. Biol., vol. 12, no. 01, pp. 1450001, 2014.
[4]
C.-H. Zheng, L. Zhang, T.-Y. Ng, C. K. Shiu, and D.-S. Huang, "Metasample-based sparse representation for tumor classification," IEEE/ACM Trans. Comput. Biol. Bioinformat., vol. 8, no. 5, pp. 1273-1282, Sep./Oct. 2011.
[5]
J.-X. Liu, Y.-L. Gao, Y. Xu, C.-H. Zheng, and J. You, "Differential expression analysis on RNA-Seq count data based on penalized matrix decomposition," IEEE Trans. NanoBiosci., vol. 13, no. 1, pp. 12-18, Mar. 2014.
[6]
J.-X. Liu, C.-H. Zheng, and Y. Xu, "Extracting plants core genes responding to abiotic stresses by penalized matrix decomposition," Comput. Biol. Med., vol. 42, no. 5, pp. 582-589, 2012.
[7]
J.-X. Liu, Y. Xu, C.-H. Zheng, H. Kong, and Z.-H. Lai, "RPCA-based tumor classification using gene expression data," IEEE/ ACM Trans. Comput. Biol. Bioinformat., vol. 12, no. 4, pp. 964-970, Jul./Aug. 2015.
[8]
M. Journée, Y. Nesterov, P. Richtárik, and R. Sepulchre, "Generalized power method for sparse principal component analysis," J. Mach. Learning Res., vol. 11, pp. 517-553, 2010.
[9]
P. K. Yalavarthy, B. W. Pogue, H. Dehghani, and K. D. Paulsen, "Weight-matrix structured regularization provides optimal generalized least-squares estimate in diffuse optical tomography," Med. Phys., vol. 34, no. 6, pp. 2085-2098, 2007.
[10]
P. O. Hoyer, "Non-negative matrix factorization with sparseness constraints," J. Mach. Learning Res., vol. 5, pp. 1457-1469, 2004.
[11]
K. Devarajan, "Nonnegative matrix factorization: An analytical and interpretive tool in computational biology," PLoS Comput. Biol., vol. 4, no. 7, pp. e1000029, 2008.
[12]
J.-P. Brunet, P. Tamayo, T. R. Golub, and J. P. Mesirov, "Metagenes and molecular pattern discovery using matrix factorization," Proc. Natl. Acad. Sci., vol. 101, no. 12, pp. 4164-4169, 2004.
[13]
R. Giancarlo and F. Utro, "Speeding up the consensus clustering methodology for microarray data analysis," Algorithms Molecular Biol., vol. 6, no. 1, p. 1, 2011.
[14]
D. Cai, X. He, J. Han, and T. S. Huang, "Graph regularized nonnegative matrix factorization for data representation," IEEE Trans. Pattern Anal. Mach. Intell., vol. 33, no. 8, pp. 1548-1560, Aug. 2011.
[15]
D. Wang, Y.-L. Gao, J.-X. Liu, J.-G. Yu, and C.-G. Wen, "Application of graph regularized non-negative matrix factorization in characteristic gene selection," in Intelligent Computing Theories and Methodologies. New York, NY, USA: Springer, 2015, pp. 601-611.
[16]
J.-X. Liu, Y. Xu, Y.-L. Gao, C.-H. Zheng, D. Wang, and Q. Zhu, "A class-information-based sparse component analysis method to identify differentially expressed genes on RNA-Seq data," May 2015, (in press).
[17]
J. Huang, F. Nie, H. Huang, and C. Ding, "Robust manifold nonnegative matrix factorization," ACM Trans. Knowl. Discovery Data, vol. 8, no. 3, pp. 11, 2014.
[18]
X. Long, H. Lu, Y. Peng, and W. Li, "Graph regularized discriminative non-negative matrix factorization for face recognition," Multimedia Tools Appl., vol. 72, no. 3, pp. 2679-2699, 2014.
[19]
B. Geng, D. Tao, C. Xu, Y. Yang, and X.-S. Hua, "Ensemble manifold regularization," IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 6, pp. 1227-1233, Jun. 2012.
[20]
F. Nie, H. Huang, X. Cai, and C. H. Ding, "Efficient and robust feature selection via joint '2, 1-norms minimization," in Proc. Adv. Neural Inf. Process. Syst., 2010, pp. 1813-1821.
[21]
D. Kong, C. Ding, and H. Huang, "Robust nonnegative matrix factorization using l21-norm," in Proc. 20th ACM Int. Conf. Inf. Knowl. Manage., 2011, pp. 673-682.
[22]
S. Yang, C. Hou, C. Zhang, Y. Wu, and S. Weng, "Robust non-negative matrix factorization via joint sparse and graph regularization," in Proc. Int. Joint Conf. Neural Netw., 2013, pp. 1-5.
[23]
N. Guan, D. Tao, and Z. Luo, "Manifold regularized discriminative nonnegative matrix factorization with fast gradient descent," IEEE Trans. Image Process., vol. 20, no. 7, pp. 2030-2048, Jun. 2011.
[24]
J. Liu, J.-X. Liu, Y.-L. Gao, X.-Z. Kong, X.-S. Wang, and D. Wang, "A P-norm robust feature extraction method for identifying differentially expressed genes," PloS One, vol. 10, no. 7, pp. e0133124, 2015.
[25]
S. Ortega-Martorell, P. J. Lisboa, A. Vellido, M. Julià-Sapé, and C. Arús, "Non-negative matrix factorisation methods for the spectral decomposition of MRS data from human brain tumours," BMC Bioinformat., vol. 13, no. 1, p. 38, 2012.
[26]
Z. Zhang, Y. Xu, J. Yang, X. Li, and D. Zhang, "A survey of sparse representation: Algorithms and applications," IEEE Access, vol. 3, pp. 490-530, 2015.
[27]
M. Belkin and P. Niyogi, "Laplacian eigenmaps and spectral techniques for embedding and clustering," in Proc. Adv. Neural Inf. Process. Syst., 2001, pp. 585-591.
[28]
C. Deng, H. Xiaofei, H. Jiawei, and S. H. Thomas, "Graph regularized nonnegative matrix factorization for data representation," IEEE Trans. Pattern Anal. Mach. Intell., vol. 33, no. 8, pp. 1548- 1560, Aug. 2011.
[29]
X. Liu, D. Zhai, D. Zhao, G. Zhai, and W. Gao, "Progressive image denoising through hybrid graph Laplacian regularization: A unified framework," IEEE Trans. Image Process., vol. 23, no. 4, pp. 1491-1503, Apr. 2014.
[30]
G.-C. Wu and D. Baleanu, "Variational iteration method for the Burgers' flow with fractional derivatives--New Lagrange multipliers," Appl. Math. Model., vol. 37, no. 9, pp. 6183-6190, 2013.
[31]
F. Facchinei, C. Kanzow, and S. Sagratella, "Solving quasi-variational inequalities via their KKT conditions," Math. Program., vol. 144, nos. 1/2, pp. 369-412, 2014.
[32]
D. D. Lee and H. S. Seung, "Algorithms for non-negative matrix factorization," in Proc. Adv. Neural Inf. Process. Syst., 2001, pp. 556-562.
[33]
Y. Zhang and Z.-C. Mu, "Ear recognition based on improved NMFSC," J. Comput. Appl., vol. 4, pp. 010, 2006.
[34]
G. Nyamundanda, I. C. Gormley, and L. Brennan, "A dynamic probabilistic principal components model for the analysis of longitudinal metabolomics data," J. Royal Statist. Soc., Ser. C (Appl. Statist.), vol. 63, pp. 763-782, 2014.
[35]
D. M. Witten, R. Tibshirani, and T. Hastie, "A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis," Biostatistics, vol. 10, no. 3, pp. 515-534, 2009.
[36]
C.-H. Zheng, D.-S. Huang, L. Zhang, and K. Xiang-Zhen, "Tumor clustering using nonnegative matrix factorization with gene selection," IEEE Trans. Inf. Technol. Biomed., vol. 13, no. 4, pp. 599-607, Jul. 2009.
[37]
Y. Gao and G. Church, "Improving molecular cancer class discovery through sparse non-negative matrix factorization," Bioinformatics, vol. 21, no. 21, pp. 3970-3975, 2005.
[38]
M. A. Shipp, N. Ross Ken, and P. Tamayo, "Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning," Nature Med., vol. 8, no. 1, pp. 68-74, 2002.
[39]
A. Bhattacharjee, W. G. Richards, and J. Staunton, "Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses," Proc. Natl. Acad. Sci. USA, vol. 98, no. 24, pp. 13790-13795, 2001.
[40]
Y. Shizhun, H. Chenping, and Z. Changshui, "Robust nonnegative matrix factorization via joint sparse and graph regularization for transfer learning," Neural Comput. Appl., vol. 23, no. 2, pp. 541-559, 2013.
[41]
M.-Y. Wu, D.-Q. Dai, X.-F. Zhang, and Y. Zhu, "Cancer subtype discovery and biomarker identification via a new robust network clustering algorithm," PloS One, vol. 8, no. 6, p. e66256, 2013.
[42]
R. Siegel, E. Ward, O. Brawley, and A. Jemal, "Cancer statistics, 2011," CA: Cancer J. Clin., vol. 61, no. 4, pp. 212-236, 2011.
[43]
S. L. Pomeroy, P. Tamayo, M. Gaasenbeek, L. M. Sturla, M. Angelo, M. E. McLaughlin, J. Y. H. Kim, L. C. Goumnerova, P. M. Black, and C. Lau, "Prediction of central nervous system embryonal tumour outcome based on gene expression," Nature, vol. 415, no. 6870, pp. 436-442, 2002.
[44]
B. J. Druker and M. Talpaz, "Efficacy and safety of a specific inhibitor of the BCR-ABL tyrosine kinase in chronic myeloid leukemia," New Engl. J. Med., vol. 344, no. 14, pp. 1031-1037, 2001.
[45]
A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Royal Statist. Soc. Ser. B (Methodol.), vol. 39, pp. 1-38, 1977.

Cited By

View all
  • (2023)Label consistency-based deep semisupervised NMF for tumor recognitionEngineering Applications of Artificial Intelligence10.1016/j.engappai.2022.105511117:PAOnline publication date: 1-Jan-2023
  • (2023)Dynamic scaling factor based differential evolution with multi-layer perceptron for gene selection from pathway information of microarray dataMultimedia Tools and Applications10.1007/s11042-022-13964-z82:9(13453-13478)Online publication date: 1-Apr-2023
  • (2023)Robust graph regularization nonnegative matrix factorization for link prediction in attributed networksMultimedia Tools and Applications10.1007/s11042-022-12943-882:3(3745-3768)Online publication date: 1-Jan-2023
  • Show More Cited By
  1. Characteristic Gene Selection Based on Robust Graph Regularized Non-Negative Matrix Factorization

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image IEEE/ACM Transactions on Computational Biology and Bioinformatics
      IEEE/ACM Transactions on Computational Biology and Bioinformatics  Volume 13, Issue 6
      November 2016
      198 pages

      Publisher

      IEEE Computer Society Press

      Washington, DC, United States

      Publication History

      Published: 01 November 2016
      Published in TCBB Volume 13, Issue 6

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 20 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2023)Label consistency-based deep semisupervised NMF for tumor recognitionEngineering Applications of Artificial Intelligence10.1016/j.engappai.2022.105511117:PAOnline publication date: 1-Jan-2023
      • (2023)Dynamic scaling factor based differential evolution with multi-layer perceptron for gene selection from pathway information of microarray dataMultimedia Tools and Applications10.1007/s11042-022-13964-z82:9(13453-13478)Online publication date: 1-Apr-2023
      • (2023)Robust graph regularization nonnegative matrix factorization for link prediction in attributed networksMultimedia Tools and Applications10.1007/s11042-022-12943-882:3(3745-3768)Online publication date: 1-Jan-2023
      • (2023)Graph non-negative matrix factorization with alternative smoothed regularizationsNeural Computing and Applications10.1007/s00521-022-07200-w35:14(9995-10009)Online publication date: 1-May-2023
      • (2018)Regularized Non-Negative Matrix Factorization for Identifying Differentially Expressed Genes and Clustering SamplesIEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)10.1109/TCBB.2017.266555715:3(974-987)Online publication date: 1-May-2018
      • (2018)Performance Analysis of Non-negative Matrix Factorization Methods on TCGA DataIntelligent Computing Theories and Application10.1007/978-3-319-95933-7_50(407-418)Online publication date: 15-Aug-2018

      View Options

      Login options

      Full Access

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media