Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2345316.2345369acmotherconferencesArticle/Chapter ViewAbstractPublication Pagescom-geoConference Proceedingsconference-collections
research-article

Modeling spatial dependencies and semantic concepts in data mining

Published: 01 July 2012 Publication History

Abstract

Data mining is the process of discovering new patterns and relationships in large datasets. However, several studies have shown that general data mining techniques often fail to extract meaningful patterns and relationships from the spatial data owing to the violation of fundamental geospatial principles. In this tutorial, we introduce basic principles behind explicit modeling of spatial and semantic concepts in data mining. In particular, we focus on modeling these concepts in the widely used classification, clustering, and prediction algorithms. Classification is the process of learning a structure or model (from user given inputs) and applying the known model to the new data. Clustering is the process of discovering groups and structures in the data that are "similar," without applying any known structures in the data. Prediction is the process of finding a function that models (explains) the data with least error. One common assumption among all these methods is that the data is independent and identically distributed. Such assumptions do not hold well in spatial data, where spatial dependency and spatial heterogeneity are a norm. In addition, spatial semantics are often ignored by the data mining algorithms. In this tutorial we cover recent advances in explicitly modeling of spatial dependencies and semantic concepts in data mining.

References

[1]
J. Aldstadt and A. Getis. Using amoeba to create a spatial weights matrix and identify spatial clusters. Geographical Analysis, 38(4):327--343, 2006.
[2]
C. Ambroise and G. Govaert. Convergence of an em-type algorithm for spatial clustering. Pattern Recogn. Lett., 19:919--927, August 1998.
[3]
L. Anselin. Local indicators of spatial associationÑlisa. Geographical Analysis, 27(2):93--115, 1995.
[4]
J. Bilmes. A gentle tutorial on the em algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. Technical Report, University of Berkeley, ICSI-TR-97-021, 1997., 1997.
[5]
D. M. Blei, A. Y. Ng, and M. I. Jordan. Latent dirichlet allocation. J. Mach. Learn. Res., 3:993--1022, Mar. 2003.
[6]
P. Congdon. Modelling Spatial Dependencies, pages 297--332. John Wiley & Sons, Ltd, 2007.
[7]
D.A. and Griffith. Spatially autoregressive models. In E. in Chief:ÊÊRob Kitchin and N. Thrift, editors, International Encyclopedia of Human Geography, pages 396--402. Elsevier, Oxford, 2009.
[8]
A. S. Fotheringham. Òthe problem of spatial autocorrelationÓ and local spatial statistics. Geographical Analysis, 41(4):398--403, 2009.
[9]
P. J. Green. On use of the em algorithm for penalized likelihood estimation. Journal of the Royal Statistical Society, Series B, 52(3):443--452, 1990.
[10]
Y. Jhung and P. H. Swain. Bayesian Contextual Classification Based on Modified M-Estimates and Markov Random Fields. IEEE Transaction on Pattern Analysis and Machine Intelligence, 34(1):67--75, 1996.
[11]
G. Jun, R. R. Vatsavai, and J. Ghosh. Spatially adaptive classification and active learning of multispectral data with gaussian processes. In ICDM Workshop on Spatial and Spatiotemporal Data Mining (SSTDM), pages 597--603, 2009.
[12]
B. M. Kazar, S. Shekhar, D. J. Lilja, R. R. Vatsavai, and R. K. Pace. Comparing exact and approximate spatial auto-regression model solutions for spatial data analysis. In GIScience, pages 140--161, 2004.
[13]
S. Z. Li. Markov random field modeling in image analysis. Springer-Verlag New York, Inc., Secaucus, NJ, USA, 2010.
[14]
H. J. Miller and J. Han. Geographic Data Mining and Knowledge Discovery. CRC Press, Boca Raton, FL, USA, 2009.
[15]
C. E. Rasmussen and C. K. I. Williams. Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning). The MIT Press, 2005.
[16]
S. Shekhar, P. Schrater, R. Vatsavai, W. Wu, and S. Chawla. Spatial contextual classification and prediction models for mining geospatial data. IEEE Transaction on Multimedia, 4(2):174--188, 2002.
[17]
A. H. Solberg, T. Taxt, and A. K. Jain. A Markov Random Field Model for Classification of Multisource Satellite Imagery. IEEE Transaction on Geoscience and Remote Sensing, 34(1):100--113, 1996.
[18]
S. Tadjudin and D. A. Landgrebe. Covariance estimation with limited training samples. IEEE Trans. Geosciences and Remote Sensing., 37(4):2113--2118, 1999.
[19]
A. K. H. Tung, J. Hou, and J. Han. Spatial clustering in the presence of obstacles. In In Proc. 2001 Intl. Conf. On Data Engineering, pages 359--367, 2001.
[20]
R. R. Vatsavai, A. Cheriyadat, and S. Gleason. Supervised semantic classification for nuclear proliferation monitoring. IEEE Applied Imagery Pattern Recognition Workshop. Proceedings, pages 1--10, 10 2010.
[21]
R. R. Vatsavai, A. Cheriyadat, and S. S. Gleason. Unsupervised semantic labeling framework for identification of complex facilities in high-resolution remote sensing images. In ICDM Workshops, pages 273--280, 2010.
[22]
R. R. Vatsavai, S. Shekhar, and T. E. Burk. A spatial semi-supervised learning method for classification of multi-spectral remote sensing imagery. In Seventh International Workshop on Multimedia Data Mining, MDM/KDD, 2006.
[23]
C. E. Warrender and M. F. Augusteijn. Fusion of image classifications using Bayesian techniques with Markov rand fields. International Journal of Remote Sensing, 20(10):1987--2002, 1999.
[24]
L. Xu and M. I. Jordan. On convergence properties of the EM algorithm for gaussian mixtures. Neural Computation, 8(1):129--151, 1996.
  1. Modeling spatial dependencies and semantic concepts in data mining

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    COM.Geo '12: Proceedings of the 3rd International Conference on Computing for Geospatial Research and Applications
    July 2012
    212 pages
    ISBN:9781450311137
    DOI:10.1145/2345316

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 July 2012

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article

    Conference

    COM.Geo '12

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 158
      Total Downloads
    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 12 Nov 2024

    Other Metrics

    Citations

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media