Synonyms
Data analysis; Knowledge discovery from data; Pattern discovery
Definition
Data miningis the process of discovering knowledge or patterns from massive amounts of data. As a young research field, data mining represents the confluence of a number of research fields, including database systems, machine learning, statistics, pattern recognition, high-performance computing, and specific application fields, such as WWW, multimedia, and bioinformatics, with broad applications. As an interdisciplinary field, data mining has several major research themes based on its mining tasks, including pattern-mining and analysis, classification and predictive modeling, cluster and outlier analysis, and multidimensional (OLAP) analysis. Data mining can also be categorized based on the kinds of data to be analyzed, such as multi-relational data mining, text mining, stream mining, web mining, multimedia (or image, video) mining, spatiotemporal data mining, information network analysis, biological...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Duda RO, Hart PE, Stork DG. Pattern classification. 2nd ed. New York: Wiley; 2001.
Han J, Kamber M. Data mining: concepts and techniques. 2nd ed. Amsterdam: Morgan Kaufmann; 2006.
Hastie T, Tibshirani R, Friedman J. The elements of statistical learning: data mining, inference, and prediction. New York: Springer; 2001.
Tan P, Steinbach M, Kumar V. Introduction to data mining. Boston: Addison Wesley; 2005.
Witten IH, Frank E. Data mining: practical machine learning tools and techniques. 2nd ed. Amsterdam: Morgan Kaufmann; 2005.
Dasu T, Johnson T. Exploratory data mining and data cleaning. New York: Wiley; 2003.
Chakrabarti S. Mining the web: statistical analysis of hypertex and semi-structured data. Morgan Kaufmann; 2002.
Liu B. Web data mining: exploring hyperlinks, contents, and usage data. New York: Springer; 2006.
Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases; 1994. p. 487–99.
Han J, Cheng H, Xin D, Yan X. Frequent pattern mining: current status and future directions. Data Min Knowl Disc. 2007;15(1):55–86.
Mitchell TM. Machine learning. New York: McGraw-Hill; 1997.
Cheng H, Yan X, Han J, Yu PS. Direct discriminative pattern mining for effective classification. In: Proceedings of the 24th International Conference on Data Engineering; 2008. p. 169–78.
Zhang T, Ramakrishnan R, Livny M. BIRCH: an efficient data clustering method for very large databases. In: Proceedings of the ACM-SIGMOD International Conference on Management of Data; 1996.p. 103–14.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Han, J. (2018). Data Mining. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_104
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_104
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering