Abstract
In this work we propose a low rank approximation of areal, particularly three dimensional, data utilizing additional weights. This way we enable effective compression if additional information indicates that parts of the data are of higher interest than others. The guiding example are high fidelity finite element simulations of an abdominal aortic aneurysm, i.e. a deformed blood vessel. The additional weights encapsulate the areas of high stress, which we assume indicates the rupture risk of the aorta. The stress values on the grid are modeled as a Gaussian Markov random field and we define our approximation as a basis of vectors that solve a series of optimization problems. Each of these problems describes the minimization of an expected weighted quadratic loss. We provide an effective numerical heuristic to compute the basis under general conditions, which relies on the sparsity of the precision matrix to ensure acceptable computing time even for large grids. We explicitly explore two such bases on the surface of a high fidelity finite element grid and show their efficiency for compression. Finally, we utilize the approach as part of a larger model to predict the van Mises stress in areas of interest using low and high fidelity simulations.
Similar content being viewed by others
References
Bank D, Koenigstein N, Giryes R (2020) Autoencoders. CoRR arXiv:abs/2003.05991
Biehler J, Gee MW, Wall WA (2014) Towards efficient uncertainty quantification in complex and large-scale biomechanical problems based on a Bayesian multi-fidelity scheme. Biomech Model Mechanobiol 14(3):489–513
Biehler J, Kehl S, Gee MW, Schmies F, Pelisek J, Maier A, Reeps C, Eckstein HH, Wall WA (2016) Probabilistic noninvasive prediction of wall properties of abdominal aortic aneurysms using Bayesian regression. Biomech Model Mechanobiol
Davis TA (2006) Direct methods for sparse linear systems. Soc Ind Appl Math 10(1137/1):9780898718881
Davis TA (2021) Suitesparse. https://github.com/DrTimothyAldenDavis/SuiteSparse Accessed 22 Sep 2021
Gabriel KR, Zamir S (1979) Lower rank approximation of matrices by least squares with any choice of weights. Technometrics 21(4):489–498
Ghojogh B, Karray F, Crowley M (2019) Eigenvalue and generalized eigenvalue problems: tutorial. arXiv:1903.11240
Golub GH, Heath M, Wahba G (1979) Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics 21(2):215–223. https://doi.org/10.1080/00401706.1979.10489751
Hristopulos DT, Agou VD (2020) Stochastic local interaction model with sparse precision matrix for space-time interpolation. Spatial Stat 40:100403
Li KC (1986) Asymptotic optimality of cl and generalized cross-validation in ridge regression with application to spline smoothing. Ann Stat 1101–1112
Ma C, Wang K, Chi Y, Chen Y (2019) Implicit regularization in nonconvex statistical estimation: Gradient descent converges linearly for phase retrieval, matrix completion, and blind deconvolution. Found Comput Math 20(3):451–632. https://doi.org/10.1007/s10208-019-09429-9
Maharani M (1808) Saputro DRS (2021) Generalized cross validation (gcv) in smoothing spline nonparametric regression models. J Phys Conf Ser 1:012053. https://doi.org/10.1088/1742-6596/1808/1/012053
Nocedal J, Wright SJ (2006) Numerical optimization, 2nd edn. Springer, New York
Rue H, Held L (2005) Gaussian Markov random fields : theory and applications. Chapman & Hall/CRC, London
Singh TP, Moxon JV, Gasser TC, Golledge J (2021) Systematic review and meta-analysis of peak wall stress and peak wall rupture index in ruptured and asymptomatic intact abdominal aortic aneurysms. J Am Heart Assoc 10(8):e019772. https://doi.org/10.1161/JAHA.120.019772
Soltanpour S, Boufama B, Jonathan WuQ (2017) A survey of local feature methods for 3d face recognition. Pattern Recognit 72:391–406 https://doi.org/10.1016/j.patcog.2017.08.003, ’www.sciencedirect.com/science/article/pii/S0031320317303072’
Speelman L, Schurink GWH, Bosboom EMH, Buth J, Breeuwer M, van de Vosse FN, Jacobs MH (2010) The mechanical role of thrombus on the growth rate of an abdominal aortic aneurysm. J Vasc Surg 51(1):19–26
Striegel C, Biehler J, Wall W, Kauermann G (2022) A multifidelity function-on-function model applied to an abdominal aortic aneurysm. Technometrics pp 1–26, https://doi.org/10.1080/00401706.2021.2024453
Tamuz O, Mazeh T, Zucker S (2005) Correcting systematic effects in a large set of photometric light curves. Mon Not R Astron Soc 356:1466–1470. https://doi.org/10.1111/j.1365-2966.2004.08585.x
Williams CK, Rasmussen CE (2006) Gaussian processes for machine learning, vol 2. MIT Press, Cambridge
Xuan Y (2016) Rspectra. https://github.com/yixuan/RSpectra/ Accessed 9 Aug 2018
Zebari R, Abdulazeez A, Zeebaree D, Zebari D, Saeed J (2020) A comprehensive review of dimensionality reduction techniques for feature selection and feature extraction. J Appl Sci Technol Trends 1(2):56–70
Funding
No funds, grants, or other support was received.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Code and Data
full code and data to reproduce the results is available at: https://drive.google.com/drive/folders/1c68u1bATuOZEIAXuOYzIFb3urSNQwUav?usp=sharing.
Reformulation as Tensor Decomposition
reformulation_tensor_decomposition.pdf contains a reformulation of the first basis vector problem as a tensor decomposition.
Simulation Study
simulation_study.pdf contains as small simulation study with sampled weight vectors.
Histogram of the number of neighbors for the high fidelity grid points
neighbors_count.pdf.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
1.1 Miscellaneous numerics
There are a number of crucial numerical concerns that need to be addressed in order to make the basis computation problem laid out in the previous parts solvable in practice. Firstly, we only know the precision matrix, \(\varvec{Q}\), but not its inverse the GMRF covariance matrix \(\varvec{\varSigma }\), as seen in (8). \(\varvec{Q}\), the standard GMRF precision matrix (Rue and Held 2005) is sparse. Moreover, this matrix is not invertible in a strict sense, as the constant vector is an eigenvector with eigenvalue 0. We can however fix this by adding an identity matrix, i.e. in this work we use
where \(\epsilon \) is a small constant, i.e. \(10^{-4}\) and \(\varvec{I}_{m_{y}}\) the \(m_{y}\) dimensional identity matrix.
It is not useful nor common to actually invert \({\tilde{\varvec{Q}}}\) as this is computationally expensive and results in a dense matrix which consumes a large amount of memory. Instead we compute a sparse Cholesky decomposition which is enough to rapidly compute the matrix / vector operations involving \(\varvec{\varSigma }\) in the formulae for the function value and gradient, like (8) and (9). The decomposition is not sufficient to actually compute the full Hessian matrix as seen in (10). However, this is not necessary as the ability to compute the product of the Hessian with a given vector is sufficient in our case.
There are specialized routines for the case of GMRF type precision matrices which do not only build on the highly sparse nature but also the (up to permutation) band like shape of the matrix in order to achieve a computational cost of up to \(O(n^{3/2})\). This results in a large advantage with regard to computation time in comparison to the ordinary Cholesky decomposition routines which also do not result in sparse matrices. We refer to Davis (2006) as a comprehensive reference. For the computation we use the R package Matrix, which is based on the CHOLMOD (Davis 2021) library. On our machine with 64 GB of RAM and 12 core AMD Ryzen \(9 \quad 3900\)X processor the computation of the decomposition for our 80,000 dimensional precision matrix takes just 40 s.
1.2 Low fidelity compression error
See Fig. 11.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Striegel, C., Biehler, J. & Kauermann, G. Weighted high dimensional data reduction of finite element features: an application on high pressure of an abdominal aortic aneurysm. Comput Stat 39, 2771–2789 (2024). https://doi.org/10.1007/s00180-023-01388-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-023-01388-8