Abstract
In this paper, we propose a new method for finding similarity of effects based on quantile regression models. Clustering of effects curves (CEC) techniques are applied to quantile regression coefficients, which are one-to-one functions of the order of the quantile. We adopt the quantile regression coefficients modeling (QRCM) framework to describe the functional form of the coefficient functions by means of parametric models. The proposed method can be utilized to cluster the effect of covariates with a univariate response variable, or to cluster a multivariate outcome. We report simulation results, comparing our approach with the existing techniques. The idea of combining CEC with QRCM permits simplifying computation and interpretation of the results, and may improve the ability to identify clusters. We illustrate a variety of applications, highlighting the advantages and the usefulness of the described method.
Similar content being viewed by others
References
Abramowitz M, Stegun I (1964) Handbook of mathematical functions: with formulas, graphs, and mathematical tables, vol 55. Courier Corporation, Chelmsford
Adelfio G, Chiodi M, D’Alessandro A, Luzio D (2011) Fpca algorithm for waveform clustering. J Commun Comput 8(6):494–502
Adelfio G, Chiodi M, D’Alessandro A, Luzio D, D’Anna G, Mangano G (2012) Simultaneous seismic wave clustering and registration. Comput Geosci 44:60–69
Adelfio G, Di Salvo F, Chiodi M (2016) Space-time FPCA algorithm for clustering of multidimensional curves. In: Proceeding of the 48th scientific meeting of the Italian Statistical Society, Salerno
Bouveyron C, Brunet-Saumard C (2014) Model-based clustering of high-dimensional data: a review. Comput Stat Data Anal 71:52–78
Clogg C, Petkova E, Haritou A (1995) Statistical methods for comparing regression coefficients between models. Am J Sociol 100(5):1261–1293
Fisher R (1936) The use of multiple measurements in taxonomic problems. Ann Hum Genet 7(2):179–188
Frumento P (2017) QRCM: quantile regression coefficients modeling. https://CRAN.R-project.org/package=qrcm, r package version 2.1
Frumento P, Bottai M (2016) Parametric modeling of quantile regression coefficient functions. Biometrics 72(1):74–84
Garcia-Escudero L, Gordaliza A (2005) A proposal for robust curve clustering. J Classif 22(2):185–201
Gower J (1975) Generalized procrustes analysis. Psychometrika 40(1):33–51
Hotelling H (1933) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24(6):417
Jacques J, Preda C (2014) Functional data clustering: a survey. Adv Data Anal Classif 8(3):231–255
James G (2007) Curve alignment by moments. Ann Appl Stat 1:480–501
Kneip A, Gasser T (1992) Statistical tools to analyze data representing a sample of curves. Ann Stat 20:1266–1305
Koenker R (2005) Quantile regression, vol 38. Cambridge University Press, Cambridge
Koenker R, Bassett G Jr (1978) Regression quantiles. Econom J Econom Soc 46:33–50
Pearson K (1901) On lines and planes of closest fit to systems of points in space. In: Proceedings of of the 17th ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems (SIGMOD)
Ramsay J (2006) Functional data analysis. Wiley, New York
Ramsay J, Li X (1998) Curve registration. J R Stat Soc Ser B 60(2):351–363
Sangalli L, Secchi P, Vantini S, Veneziani A (2009) A case study in exploratory functional data analysis: geometrical features of the internal carotid artery. J Am Stat Assoc 104(485):37–48
Silverman B (1995) Incorporating parametric effects into functional principal components analysis. J R Stat Soc Ser B 57:673–689
Sottile G, Adelfio G (2017) clustEff: clusters of effect curves in quantile regression models. R package version 0.1.1. https://CRAN.R-project.org/package=clustEff
Vichi M, Saporta G (2009) Clustering and disjoint principal component analysis. Comput Stat Data Anal 53(8):3194–3208
Wang K, Gasser T (1997) Alignment of curves by dynamic time warping. Ann Stat 25(3):1251–1276
Acknowledgements
We would like to thank the two anonymous reviewers for their suggestions and comments, that allowed us to considerably improve the manuscript.
Funding This paper has been partially supported by the national grant of the Italian Ministry of Education University and Research (MIUR) for the PRIN-2015 program (Progetti di ricerca di Rilevante Interesse Nazionale), “Prot. 20157PRZC4 - Research Project Title Complex space-time modelling and functional analysis for probabilistic forecast of seismic events. PI: Giada Adelfio”.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Rights and permissions
About this article
Cite this article
Sottile, G., Adelfio, G. Clusters of effects curves in quantile regression models. Comput Stat 34, 551–569 (2019). https://doi.org/10.1007/s00180-018-0817-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-018-0817-8