Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article
Public Access

Spatial Auto-regressive Dependency Interpretable Learning Based on Spatial Topological Constraints

Published: 12 August 2019 Publication History

Abstract

Spatial regression models are widely used in numerous areas, including detecting and predicting traffic volume, air pollution, and housing prices. Unlike conventional regression models, which commonly assume independent and identical distributions among observations, existing spatial regression requires the prior knowledge of spatial dependency among the observations in different spatial locations. Such a spatial dependency is typically predefined by domain experts or heuristics. However, without sufficient consideration on the context of the specific prediction task, it is prohibitively difficult for one to pre-define the numerical values of the spatial dependency without bias. More importantly, in many situations, the existing techniques are insufficient to sense the complete connectivity and topological patterns among spatial locations (e.g., in underground water networks and human brain networks). Until now, these issues have been extremely difficult to address and little attention has been paid to the automatic optimization of spatial dependency in relation to a prediction task, due to three challenges: (1) necessity and complexity of modeling the spatial topological constraints; (2) incomplete prior spatial knowledge; and (3) difficulty in optimizing under spatial topological constraints that are usually discrete or nonconvex. To address these challenges, this article proposes a novel convex framework that automatically jointly learns the prediction mapping and spatial dependency based on spatial topological constraints. There are two different scenarios to be addressed. First, when the prior knowledge on existence of conditional independence among spatial locations is known (e.g., via spatial contiguity), we propose the first model named Spatial-Autoregressive Dependency Learning I (SADL-I) to further quantify such spatial dependency. However, when the knowledge on the conditional independence is unknown or incomplete, our second model named Spatial-Autoregressive Dependency Learning II (SADL-II) is proposed to automatically learn the conditional independence pattern as well as quantify the numerical values of the spatial dependency based on spatial topological constraints. Topological constraints are usually discrete and nonconvex, which makes them extremely difficult to be optimized together with continuous optimization problems of spatial regression. To address this, we propose a convex and continuous equivalence of the original discrete topological constraints with a theoretical guarantee. The proposed models are then transferred to convex problems that can be iteratively optimized by our new efficient algorithms until convergence to a global optimal solution. Extensive experimentation using several real-world datasets demonstrates the outstanding performance of the proposed models. The code of our SADL framework is available at: http://mason.gmu.edu/∼lzhao9/materials/codes/SADL.

References

[1]
Achim Ahrens and Arnab Bhattacharjee. 2015. Two-step lasso estimation of the spatial weights matrix. Econometrics 3, 1 (2015), 128--155.
[2]
Onureena Banerjee, Laurent El Ghaoui, and Alexandre d’Aspremont. 2008. Model selection through sparse maximum likelihood estimation for multivariate Gaussian or binary data. J. Machine Learn. Res. 9, Mar (2008), 485--516.
[3]
Sudipto Banerjee, Bradley P. Carlin, and Alan E. Gelfand. 2014. Hierarchical Modeling and Analysis for Spatial Data. CRC Press.
[4]
Arnab Bhattacharjee and Chris Jensen-Butler. 2013. Estimation of the spatial weights matrix under structural constraints. Region. Sci. Urban Econ. 43, 4 (2013), 617--634.
[5]
Christopher M. Bishop. 2006. Pattern Recognition and Machine Learning. Springer.
[6]
Stephen Boyd, Neal Parikh, Eric Chu, Borja Peleato, and Jonathan Eckstein. 2011. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends® Machine Learn. 3, 1 (2011), 1--122.
[7]
Chris Brunsdon, A. Stewart Fotheringham, and Martin E. Charlton. 1996. Geographically weighted regression: A method for exploring spatial nonstationarity. Geogr. Anal. 28, 4 (1996), 281--298.
[8]
Mete Celik, Baris M. Kazar, Shashi Shekhar, and Daniel Boley. 2006. Parameter estimation for the spatial autoregression model: A rigorous approach. In Proceedings of the 2nd NASA Data Mining Workshop: Issues and Applications in Earth Science with the 38th Symposium on the Interface of Computing Science, Statistics and Applications.
[9]
Turgay Celik. 2014. Spatial entropy-based global and local image contrast enhancement. IEEE Trans. Image Proc. 23, 12 (2014), 5298--5308.
[10]
Feng Chen, Baojian Zhou, Adil Alim, and Liang Zhao. 2017. A generic framework for interesting subspace cluster detection in multi-attributed networks. In Proceedings of the International Conference on Data Mining. IEEE, 41--50.
[11]
Noel Cressie and Christopher K. Wikle. 2015. Statistics for Spatio-temporal Data. John Wiley 8 Sons.
[12]
Samuel I. Daitch, Jonathan A. Kelner, and Daniel A. Spielman. 2009. Fitting a graph to vector data. In Proceedings of the 26th International Conference on Machine Learning. ACM, 201--208.
[13]
Patrick Danaher, Pei Wang, and Daniela M. Witten. 2014. The joint graphical lasso for inverse covariance estimation across multiple classes. J. Royal Statist. Soc.: Series B (Statist. Methodol.) 76, 2 (2014), 373--397.
[14]
Xiaowen Dong, Dorina Thanou, Pascal Frossard, and Pierre Vandergheynst. 2016. Learning Laplacian matrix in smooth graph signal representations. IEEE Trans. Sig. Proc. 64, 23 (2016), 6160--6173.
[15]
Steven Farber, Antonio Páez, and Erik Volz. 2009. Topology and dependency tests in spatial and network autoregressive models. Geogr. Anal. 41, 2 (2009), 158--180.
[16]
Jerome Friedman, Trevor Hastie, and Robert Tibshirani. 2008. Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9, 3 (2008), 432--441.
[17]
Tian Gao, Ziheng Wang, and Qiang Ji. 2015. Structured feature selection. In Proceedings of the IEEE International Conference on Computer Vision (ICCV’15). 4256--4264.
[18]
Yuyang Gao and Liang Zhao. 2018. Incomplete Label Multi-Task Ordinal Regression for Spatial Event Scale Forecasting. Retrieved from: https://aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16942.
[19]
Virgilio Gomez-Rubio, Roger S. Bivand, and Håvard Rue. 2017. Estimating spatial econometrics models with integrated nested Laplace approximation. Retrieved from: arXiv preprint arXiv:1703.01273.
[20]
B. D. Greenshields, W. S. Channing, H. H. Miller et al. 1935. A study of traffic capacity. In Highw. Res. Board Proc., Vol. 1935. National Research Council (USA), Highway Research Board.
[21]
David Hallac, Youngsuk Park, Stephen Boyd, and Jure Leskovec. 2017. Network inference via the time-varying graphical lasso. In Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 205--213.
[22]
Chinmay Hegde, Piotr Indyk, and Ludwig Schmidt. 2015. A nearly-linear time framework for graph-structured sparsity. In Proceedings of the International Conference on Machine Learning. 928--937.
[23]
Harry H. Kelejian and Ingmar R. Prucha. 2010. Specification and estimation of spatial autoregressive models with autoregressive and heteroskedastic disturbances. J. Economet. 157, 1 (2010), 53--67.
[24]
Brenden Lake and Joshua Tenenbaum. 2010. Discovering structure by learning sparse graphs. In Proceedings of the Meeting of the Cognitive Science Society (CogSci’10).
[25]
Clifford Lam and Pedro C. L. Souza. 2016. Detection and estimation of block structure in spatial weight matrix. Economet. Rev. 35, 8–10 (2016), 1347--1376.
[26]
Clifford Lam, Pedro C. L. Souza et al. 2014. Regularization for Spatial Panel Time Series Using the Adaptive lasso. LSE, STICERD.
[27]
Sangsoo Lee and Daniel Fambro. 1999. Application of subset autoregressive integrated moving average model for short-term freeway traffic volume forecasting. Transport. Res. Rec.: J. Transport. Res. Board 1678 (1999), 179--188.
[28]
Hongfei Li, Catherine A. Calder, and Noel Cressie. 2012. One-step estimation of spatial dependence parameters: Properties and extensions of the APLE statistic. J. Multivar. Anal. 105, 1 (2012), 68--84.
[29]
Po-Ling Loh and Martin J. Wainwright. 2012. Structure estimation for discrete graphical models: Generalized covariance matrices and their inverses. In Proceedings of the Conference on Advances in Neural Information Processing Systems. 2087--2095.
[30]
Jesús Mur and Ana Angulo. 2006. The spatial Durbin model and the common factor tests. Spat. Econ. Anal. 1, 2 (2006), 207--226.
[31]
Nasser M. Nasrabadi. 2007. Pattern recognition and machine learning. J. Electron. Imag. 16, 4 (2007), 049901.
[32]
E. Oktavia, Widyawan, and I. W. Mustika. 2016. Inverse distance weighting and Kriging spatial interpolation for data center thermal monitoring. In Proceedings of the 1st International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE’16). 69--74.
[33]
Iwao Okutani and Yorgos J. Stephanedes. 1984. Dynamic prediction of traffic volume through Kalman filtering theory. Transport. Res. Part B: Methodol. 18, 1 (1984), 1--11.
[34]
Philipp Otto and Rick Steinert. 2018. Estimation of the spatial weighting matrix for spatiotemporal data under the presence of structural breaks. Retrieved from: arXiv preprint arXiv:1810.06940.
[35]
R. Kelley Pace and Ronald Barry. 1997. Sparse spatial autoregressions. Stat. Prob. Lett. 33, 3 (1997), 291--297.
[36]
Michael J. Paul and Mark Dredze. 2012. A model for mining public health topics from Twitter. Health 11 (2012), 16--6.
[37]
Xi Qu and Lung-fei Lee. 2015. Estimating a spatial autoregressive model with an endogenous spatial weight matrix. J. Economet. 184, 2 (2015), 209--232.
[38]
Somwrita Sarkar and Sanjay Chawla. 2016. Inferring the contiguity matrix for spatial autoregressive analysis with applications to house price prediction. Retrieved from: arXiv preprint arXiv:1607.01999.
[39]
Shashi Shekhar, Zhe Jiang, Reem Y. Ali, Emre Eftelioglu, Xun Tang, Venkata Gunturi, and Xun Zhou. 2015. Spatiotemporal data mining: A computational perspective. ISPRS Int. J. Geo-Inform. 4, 4 (2015), 2306--2338.
[40]
Wei Shi and Lung-fei Lee. 2017. Spatial dynamic panel data models with interactive fixed effects. J. Economet. 197, 2 (2017), 323--347.
[41]
Brian L. Smith, Billy M. Williams, and R. Keith Oswald. 2002. Comparison of parametric and nonparametric models for traffic flow forecasting. Transport. Res. Part C: Emerg. Technol. 10, 4 (2002), 303--321.
[42]
David S. Stoffer. 1986. Estimation and identification of space-time ARMAX models in the presence of missing data. J. Amer. Statist. Assoc. 81, 395 (1986), 762--772.
[43]
Martin Sundin, Arun Venkitaraman, Magnus Jansson, and Saikat Chatterjee. 2017. A connectedness constraint for learning sparse graphs. In Proceedings of the Signal Processing Conference (EUSIPCO’17). IEEE, 151--155.
[44]
Waldo R. Tobler. 1970. A computer movie simulating urban growth in the Detroit region. Econ. Geogr. 46, sup1 (1970), 234--240.
[45]
Sen Yang, Qian Sun, Shuiwang Ji, Peter Wonka, Ian Davidson, and Jieping Ye. 2015. Structural graphical lasso for learning mouse brain connectivity. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1385--1394.
[46]
Yao Zhang, Yun Xiong, Xinyue Liu, Xiangnan Kong, and Yangyong Zhu. 2017. Meta-path graphical lasso for learning heterogeneous connectivities. In Proceedings of the SIAM International Conference on Data Mining. SIAM, 642--650.
[47]
Liang Zhao, Amir Alipour-Fanid, Martin Slawski, and Kai Zeng. 2018. Prediction-time efficient classification using feature computational dependencies. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery 8 Data Mining. ACM, 2787--2796.
[48]
Liang Zhao, Feng Chen, Jing Dai, Ting Hua, Chang-Tien Lu, and Naren Ramakrishnan. 2014. Unsupervised spatial event detection in targeted domains with applications to civil unrest modeling. PloS one 9, 10 (2014), e110206.
[49]
Liang Zhao, Feng Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2016. Multi-resolution spatial event forecasting in social media. In Proceedings of the 16th IEEE International Conference on Data Mining (ICDM’16). IEEE, 689--698.
[50]
Liang Zhao, Feng Chen, and Yanfang Ye. 2019. Efficient learning with exponentially-many conjunctive precursors for interpretable spatial event forecasting. IEEE Trans. Knowl. Data Eng. IEEE, 1--13.
[51]
Liang Zhao, Qian Sun, Jieping Ye, Feng Chen, Chang-Tien Lu, and Naren Ramakrishnan. 2015. Multi-task learning for spatio-temporal event forecasting. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1503--1512.
[52]
Ali Ziat, Edouard Delasalles, Ludovic Denoyer, and Patrick Gallinari. 2017. Spatio-temporal neural networks for space-time series forecasting and relations discovery. In Proceedings of the IEEE International Conference on Data Mining (ICDM’17). IEEE, 705--714.

Cited By

View all
  • (2024)A Generalization of the Convolution Theorem and Its Connections to Non-Stationarity and the Graph Frequency DomainIEEE Transactions on Signal Processing10.1109/TSP.2024.342343272(3424-3437)Online publication date: 1-Jan-2024
  • (2024)Short-Term Multistep Prediction of Process Industry Product Quality With CNN-BiLSTM Network Based on Parallel Attention MechanismsIEEE Sensors Journal10.1109/JSEN.2024.341399024:15(25070-25081)Online publication date: 1-Aug-2024
  • (2024)Explainable Stacking-Based Learning Model for Traffic ForecastingJournal of Transportation Engineering, Part A: Systems10.1061/JTEPBS.TEENG-8208150:4Online publication date: Apr-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Spatial Algorithms and Systems
ACM Transactions on Spatial Algorithms and Systems  Volume 5, Issue 3
September 2019
189 pages
ISSN:2374-0353
EISSN:2374-0361
DOI:10.1145/3356873
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 August 2019
Accepted: 01 June 2019
Revised: 01 May 2019
Received: 01 October 2018
Published in TSAS Volume 5, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Spatial auto-regressive
  2. alternating direction method of multipliers
  3. graphical LASSO
  4. spatial topological constraints

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1,068
  • Downloads (Last 6 weeks)156
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Generalization of the Convolution Theorem and Its Connections to Non-Stationarity and the Graph Frequency DomainIEEE Transactions on Signal Processing10.1109/TSP.2024.342343272(3424-3437)Online publication date: 1-Jan-2024
  • (2024)Short-Term Multistep Prediction of Process Industry Product Quality With CNN-BiLSTM Network Based on Parallel Attention MechanismsIEEE Sensors Journal10.1109/JSEN.2024.341399024:15(25070-25081)Online publication date: 1-Aug-2024
  • (2024)Explainable Stacking-Based Learning Model for Traffic ForecastingJournal of Transportation Engineering, Part A: Systems10.1061/JTEPBS.TEENG-8208150:4Online publication date: Apr-2024
  • (2024)Water quality prediction using LSTM with combined normalizer for efficient water managementDesalination and Water Treatment10.1016/j.dwt.2024.100183317(100183)Online publication date: Jan-2024
  • (2024)Graph-based active semi-supervised learning: Case study in water quality monitoringAdvanced Engineering Informatics10.1016/j.aei.2024.10290262(102902)Online publication date: Oct-2024
  • (2024)A real-valued label noise cleaning method based on ensemble iterative filtering with noise scoreInternational Journal of Machine Learning and Cybernetics10.1007/s13042-024-02137-z15:9(4093-4118)Online publication date: 29-Apr-2024
  • (2024)Production-Environment-Oriented Innovative Approach for Next-Generation Traffic Flow Prediction Using Machine Learning and Deep LearningComputing and Machine Learning10.1007/978-981-97-6588-1_29(383-401)Online publication date: 23-Oct-2024
  • (2023)IWQP4Net: An Efficient Convolution Neural Network for Irrigation Water Quality PredictionWater10.3390/w1509165715:9(1657)Online publication date: 24-Apr-2023
  • (2023)Exploring and Visualizing Research Progress and Emerging Trends of Event Prediction: A SurveyApplied Sciences10.3390/app13241334613:24(13346)Online publication date: 18-Dec-2023
  • (2023)A survey on applications of machine learning algorithms in water quality assessment and water supply and managementWater Supply10.2166/ws.2023.03323:2(895-922)Online publication date: 10-Feb-2023
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media