Abstract
We consider a regression modeling of the quantiles of residual life, remaining lifetime at a specific time. We propose a smoothed induced version of the existing non-smooth estimating equations approaches for estimating regression parameters. The proposed estimating equations are smooth in regression parameters, so solutions can be readily obtained via standard numerical algorithms. Moreover, the smoothness in the proposed estimating equations enables one to obtain a robust sandwich-type covariance estimator of regression estimators aided by an efficient resampling method. To handle data subject to right censoring, the inverse probability of censoring distribution is used as a weight. The consistency and asymptotic normality of the proposed estimator are established. Extensive simulation studies are conducted to validate the proposed estimator’s performance in various finite samples settings. We apply the proposed method to dental study data evaluating the longevity of dental restorations.
Similar content being viewed by others
Data availability statement
The code used in the simulation study is included in the supplementary information. For confidentiality reasons, the dental restoration dataset is available upon request.
References
Bai F, Huang J, Zhou Y (2016) Semiparametric inference for the proportional mean residual life model with right-censored length-biased data. Stat Sin 46:1129–1158
Bai F, Chen X, Chen Y, Huang T (2019) A general quantile residual life model for length-biased right-censored data. Scand J Stat 46(4):1191–1205
Bang H, Tsiatis AA (2002) Median regression with censored cost data. Biometrics 58(3):643–649
Brown B, Wang YG (2005) Standard errors and covariance matrices for smoothed rank estimators. Biometrika 92(1):149–158
Brown B, Wang YG (2007) Induced smoothing for rank regression with censored survival times. Stat Med 26(4):828–836
Cai J, He H, Song X, Sun L (2017) An additive-multiplicative mean residual life model for right-censored data. Biom J 59(3):579–592
Caplan D, Li Y, Wang W, Kang S, Marchini L, Cowen H, Yan J (2019) Dental restoration longevity among geriatric and special needs patients. JDR Clin Transl Res 4(1):41–48
Chen YQ (2007) Additive expectancy regression. J Am Stat Assoc 102(477):153–166
Chen YQ, Cheng S (2006) Linear life expectancy regression with censored data. Biometrika 93(2):303–313
Chen Y, Jewell N, Lei X, Cheng S (2005) Semiparametric estimation of proportional mean residual life model in presence of censoring. Biometrics 61(1):170–178
Cheng YJ, Huang CY (2014) Combined estimating equation approaches for semiparametric transformation models with length-biased survival data. Biometrics 70(3):608–618
Chiou SH, Kang S, Yan J (2014a) Fast accelerated failure time modeling for case-cohort data. Stat Comput 24(4):559–568
Chiou SH, Kang S, Yan J et al (2014b) Fitting accelerated failure time models in routine survival analysis with r package aftgee. J Stat Softw 61(11):1–23
Chiou S, Kang S, Yan J (2015a) Rank-based estimating equations with general weight for accelerated failure time models: an induced smoothing approach. Stat Med 34(9):1495–1510
Chiou SH, Kang S, Yan J (2015b) Semiparametric accelerated failure time modeling for clustered failure times from stratified sampling. J Am Stat Assoc 110(510):621–629
Choi S, Kang S, Huang X (2018) Smoothed quantile regression analysis of competing risks. Biom J 60(5):934–946
Fleming TR, Harrington DP (2011) Counting processes and survival analysis, vol 169. Wiley, Hoboken
Fu L, Wang YG, Bai Z (2010) Rank regression for analysis of clustered data: a natural induced smoothing approach. Comput Anal Data Anal 54(4):1036–1050
Huang Y (2010) Quantile calculus and censored regression. Ann Stat 38(3):1607
Johnson LM, Strawderman RL (2009) Induced smoothing for the semiparametric accelerated failure time model: asymptotics and extensions to clustered data. Biometrika 96(3):577–590
Jung SH (1996) Quasi-likelihood for median regression models. J Am Stat Assoc 91(433):251–257
Jung SH, Jeong JH, Bandos H (2009) Regression on quantile residual life. Biometrics 65(4):1203–1212
Kang S (2017) Fitting semiparametric accelerated failure time models for nested case-control data. J Stat Comput Simul 87(4):652–663
Karatzas I, Shreve S (1988) Brownian motion and stochastic calculus. Springer, New York
Kassebaum N, Bernabé E, Dahiya M, Bhandari B, Murray C, Marcenes W (2015) Global burden of untreated caries: a systematic review and metaregression. J Dent Res 94(5):650–658
Kim MO, Zhou M, Jeong JH (2012) Censored quantile regression for residual lifetimes. Lifetime Data Anal 18(2):177–194
Koenker R (2005) Quantile regression. Econometric society monographs. Cambridge University Press, Cambridge
Koenker R, Bassett G (1978) Regression quantiles. Econometrica 46(1):33–50
Koenker R, Portnoy S, Ng PT, Zeileis A, Grosjean P, Ripley BD (2012) Package ‘quantreg’
Li R, Huang X, Cortes JE (2016) Quantile residual life regression with longitudinal biomarker measurements for dynamic prediction. J R Stat Soc Ser C Appl Stat 65(5):755–773
Lin D (2000) On fitting cox’s proportional hazards models to survey data. Biometrika 87(1):37–47
Liu S, Ghosh SK (2008) Regression analysis of mean residual life function. Tech. rep., North Carolina State University. Dept. of Statistics
Maguluri G, Zhang CH (1994) Estimation in the mean residual life regression model. J R Stat Soc Ser B (Methodological) 56(3):477–489
Mansourvar Z, Martinussen T, Scheike TH (2016) An additive-multiplicative restricted mean residual life model. Scand J Stat 43(2):487–504
Oakes D, Dasu T (2003) Inference for the proportional mean residual life model. Lecture Notes-Monogr Ser 43:105–116
Pang L, Lu W, Wang HJ (2012) Variance estimation in censored quantile regression via induced smoothing. Comput Stat Data Anal 56(4):785–796
Peng L (2021) Quantile regression for survival data. Annu Rev Stat Appl 8:413–437
Peng L, Huang Y (2008) Survival analysis with quantile regression models. J Am Stat Assoc 103(482):637–649
Portnoy S (2003) Censored regression quantiles. J Am Stat Assoc 98(464):1001–1012
Portnoy S, Lin G (2010) Asymptotics for censored regression quantiles. J Nonparametric Stat 22(1):115–130
Portnoy S, Koenker R et al (1997) The gaussian hare and the Laplacian tortoise: computability of squared-error versus absolute-error estimators. Stat Sci 12(4):279–300
R Core Team (2021) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. Accessed 1 Dec 2021
Shorack GR, Wellner JA (2009) Empirical processes with applications to statistics. SIAM, Philadelphia
Sun L, Zhang Z (2009) A class of transformed mean residual life models with censored survival data. J Am Stat Assoc 104(486):803–815
Sun L, Song X, Zhang Z (2012) Mean residual life models with time-dependent coefficients under right censoring. Biometrika 99(1):185–197
Wang HJ, Wang L (2009) Locally weighted censored quantile regression. J Am Stat Assoc 104(487):1117–1128
Wei Y, Pere A, Koenker R, He X (2006) Quantile regression methods for reference growth charts. Stat Med 25(8):1369–1382
Whang YJ (2006) Smoothed empirical likelihood methods for quantile regression models. Econom Theory 22:173–205
Ying Z, Sit T (2017) Survival analysis: a quantile perspective. In: Koenker R, Chernozhukov V, He X, Peng L (eds) Handbook of quantile regression. Chapman and Hall/CRC, London, pp 69–87. https://www.taylorfrancis.com/chapters/edit/10.1201/9781315120256-6/survival-analysis-quantile-perspective-zhiliang-ying-tony-sit
Ying Z, Jung SH, Wei LJ (1995) Survival analysis with median regression models. J Am Stat Assoc 90(429):178–184
Zhang Z, Zhao X, Sun L (2010) Goodness-of-fit tests for additive mean residual life model under right censoring. Lifetime Data Anal 16(3):385–408
Acknowledgements
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government(MSIT) (2020R1A2C1A0101313911)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Appendix 1. Proof of Theorem 1
Appendix 1. Proof of Theorem 1
In this Appendix, we provide a proof of Theorem 1: consistency and asymptotic normality of the proposed induced smoothed estimator.
First, we establish the consistency of the proposed estimator \(\hat{\beta }_{IS}\). The consistency of the non-smooth counterpart, \(\hat{\beta }_{NS}\), is shown in (Li et al. 2016). Based on this consistency result, it suffices to we prove that, as \(n \rightarrow \infty\), the difference between \(\tilde{U}_{t_0}(\beta , \tau , H)\) and \(U_{t_0}(\beta , \tau )\) scaled by \(n^{1/2}\) converges uniformly to zero in probability for \(\beta\) in the compact neighborhood of \(\beta _0\).
Let \(\sigma _i=(X_i^\top H X_i)^{1/2}\), \(\epsilon _i(\beta ) = X_i\beta - \log (Z_i-t_0)\) and \(d_i(\beta ) = \mathop {\mathrm{sign}}\nolimits (\epsilon _i^\beta )\Phi (-|\epsilon _i^\beta /\sigma _i|)\). Then,
To show \(\Vert D^{(1)}_n(\beta ) \Vert \xrightarrow {p} 0\) as \(n \rightarrow \infty\), we first note that
Let \(\omega _{1i}^*\) be the line segment lying between \(X_i^{\top } (\beta -\beta _0)\) and \(X_i^{\top } (\beta -\beta _0)+\sigma _i t\). Then,
It follows from Conditions C1 and C3 that \(\sup _i g_{T_i - t_0} \{\sigma _i t + X_i^{\top } (\beta -\beta _0)\}<\infty\). Since \(\int _{-\infty }^{\infty } \Phi (-|t |)\{2I[t>0]-1\}dt=0\), we have
Again, by Condition C1,
Thus,
Note that \(\sum _{i=1}^{n} \sigma _i^2 = \text{ tr }(X H X^{\top }) = \text{ tr }(H X^{\top }X)\) is bounded by \(H=O(n^{-1})\) and Condition C2. Then, \(\sum _{i=1}^{n} |E\{d_i(\beta )\} |\le M \sum _{i=1}^{n}\sigma _i^2 /2\) is also bounded. Therefore,
By applying Condition C3, we have
It follows from the arguments similar to evaluating \(\mathop {\mathrm{E}}\nolimits \{d_i(\beta ) | T_i \ge t_0\}\) combining with Conditions C1 and C2, we have, as \(n \rightarrow \infty\), \(\Vert \mathop {\mathrm{E}}\nolimits \{d_i^2(\beta ) | T_i \ge t_0 \} \Vert \rightarrow 0\). This implies \(\Vert \mathop {\mathrm{Var}}\nolimits \{D_n^{(1)}(\beta )\} \Vert \rightarrow 0\). Then, by the Weak Law of Large Numbers,
for \(\beta\) in a compact neighborhood of \(\beta _0\).
To show \(\Vert D^{(2)}_n(\beta ) \Vert \xrightarrow {p} 0\) as \(n \rightarrow \infty\), we use the martingale representation of the Kaplan–Meier estimator (Fleming and Harrington 2011). Specifically, \(\hat{G}(t)\) can be represented as
where \(M_i^C(u) = N_i^C(u)-\int _{0}^{t}Y_i(u)d\Lambda ^C(s)\), \(N_i^C(u)=(1-\delta _i)I[Z_i\le u], \Lambda ^C(u) = -\log \{G(u)\}\), \(Y(u) = \sum _{i=1}^n Y_i(u)\), and \(Y_i(u) = I[Z_i \ge u]\). By combining this with an application of the functional delta method and the uniform convergence result of \(\hat{G}(\cdot )\) to \(G(\cdot )\), we have
where
Using the arguments similar to those used to establish \(\Vert D_n^{(1)}(\beta ) \Vert \xrightarrow {p} 0\), as \(n\rightarrow \infty\), it can be shown that \(\mathop {\mathrm{E}}\nolimits \left\{ \left| Y_i(u)d_i(\beta ) \right| |\ T_i \ge t_0 \right\} = O(n^{-1/2})\). Thus,
It then follows that, as \(n \rightarrow \infty\)
uniformly in \(\beta\) for \(\beta\) in the compact neighborhood of \(\beta _0\). By applying the martingale central limit theorem and the Kolmogorov–Centsov Theorem (Karatzas and Shreve 1988, p53),
By combining these results, it follows from Lemma 1 in Lin (2000) that
By combining (11) and (12), we have
Note that both \(\tilde{U}_{t_0}(\beta , \tau , H)\) and \(U_{t_0}(\beta , \tau )\) are monotone functions, thus the point-wise convergence could be strengthened to uniform convergence Shorack and Wellner (2009).
To establish the asymptotic equivalence of \(n^{1/2}(\hat{\beta }_{IS} - \beta _0)\) and \(n^{1/2}(\hat{\beta }_{NS} - \beta _0)\), it suffices to show that the following two convergence results hold: As \(n \rightarrow \infty\),
Note that Eq. (13) implies (ii). Thus, we prove (i). For any vectors \(a, b \in R^p\),
It follows from the variable transformation \(t = \epsilon (\beta _0)/\sigma _i\) and the Taylor expansion at 0 that
where \(\omega _{2i}^*\) is some value lying between 0 and \(\sigma _i t\).
By Condition C1, we have
Since \(\int _{-\infty }^{\infty } \phi \left( -t \right) g_{T_i - t_0}(0)dt = 0\), we have \(\mathop {\mathrm{E}}\nolimits \left\{ \phi \Big (-\frac{\epsilon _i(\beta _0)}{\sigma _i}\Big )\Big (\frac{1}{\sigma _i}\Big ) \bigg |T_i > t_0, X_i \right\} \rightarrow g_{T_i - t_0}(0)\) and, therefore,
By Condition C1 and applying the arguments in Pang et al. (2012, p 795, Appendix), it can be shown that
Then,
By combining the results in Eqs. (14) and (15), we have \(\left\Vert \hat{A}(\beta _0, H) - A\right\Vert \rightarrow 0\) as \(n \rightarrow \infty\). This completes the proof of (i).
Rights and permissions
About this article
Cite this article
Kim, K.H., Caplan, D.J. & Kang, S. Smoothed quantile regression for censored residual life. Comput Stat 38, 1001–1022 (2023). https://doi.org/10.1007/s00180-022-01262-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-022-01262-z