Synonyms
Definition
Text mining is the art of data mining from text data collections. The goal is to discover knowledge (or information, patterns) from text data, which are unstructured or semi-structured. It is a subfield of Data Mining (DM), which is also known as Knowledge Discovery in Databases (KDD). KDD is to discover knowledge from various data sources, including text data, relational databases, Web data, user log data, etc. Text Mining is also related to other research fields, including Machine Learning (ML), Information Retrieval (IR), Natural Language Processing (NLP), Information Extraction (IE), Statistics, Pattern Recognition (PR), Artificial Intelligence (AI), etc.
Historical Background
The phrase of Knowledge Discovery in Databases (KDD) was first used at 1st KDD workshop in 1989. Marti Hearst [4] first used the term of text data mining (TDM) and differentiated it with other concepts such as information retrieval and natural language...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Andreas H., Andreas N., and Gerhard P. A brief survey of text mining. J. Computat. Linguistics Lang. Technol., 20(1):19–62, 2005.
Bing L. Web Data Mining: Exploring Hyperlinks, Contents and Usage Data. Springer, Berlin, pp. 411–447.2007,
Dipanjan D. and Martins A.F.T. A Survey on Automatic Text Summarization. Literature Survey for the Language and Statistics II course at Carnegie Mellon University, November, 2007.
Hearst M. Untangling text data mining. In Proc. 27th Annual Meeting of the Assoc. for Computational Linguistics, 1999.
Informative and indicative summarization. Available at: http://www1.cs.columbia.edu/~min/papers/sigirDuc01/node2.html
Liebman M. Bioinformatics: an editorial perspective. Available at: (http://www.netsci.org/Science/Bioinform/feature01.html)
Usama F., Gregory P.-S., and Padhraic S. From data mining to knowledge discovery in databases. AI Mag., 17(3):37–54, 1996.
Wayne C.L. Multilingual topic detection and tracking: successful research enabled by corpora and evaluation. In Proc. Conf. on Language Resources and Evaluation, 2000.
Witten I.H. Text mining. In Practical Handbook of Internet Computing, M.P. Singh (eds.). Chapman and Hall/CRC Press, Boca Raton, FL, 2005, pp. 14-1–14-22.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this entry
Cite this entry
Cai, Y., Sun, JT. (2009). Text Mining. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_418
Download citation
DOI: https://doi.org/10.1007/978-0-387-39940-9_418
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-35544-3
Online ISBN: 978-0-387-39940-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering