Abstract
The discovery of new and potentially meaningful relationships between concepts in the biomedical literature has attracted the attention of a lot of researchers in text mining. The main motivation is found in the increasing availability of the biomedical literature which makes it difficult for researchers in biomedicine to keep up with research progresses without the help of automatic knowledge discovery techniques. More than 14 million abstracts of this literature are contained in the Medline collection and are available online. In this paper we present the application of an association rule mining method to Medline abstracts in order to detect associations between concepts as indication of the existence of a biomedical relation among them. The discovery process fully exploits the MeSH (Medical Subject Headings) taxonomy, that is, a set of hierarchically related biomedical terms which permits to express associations at different levels of abstraction (generalized association rules). We report experimental results on a collection of abstracts obtained by querying Medline on a specific disease and we show the effectiveness of some filtering and browsing techniques designed to manage the huge amount of generalized associations that may be generated on real data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Srikant, R.: Fast Algorithms for Mining Association Rules. In: Proceedings of the Twentieth International Conference on Very Large Databases, Santiago, Cile (1994)
Aronson, A.R., Rindflesch, T.C.: Query expansion using the UMLS Metathesaurus. In: Proceedings of AMIA, Annual American Medical Informatics Association Conference, Nashville, TN, pp. 485–489 (1997)
Berardi, M., Lapi, M., Leo, P., Malerba, D., Marinelli, C., Scioscia, G.: A data mining approach to PubMed query refinement. In: 2nd International Workshop on Biological Data Management in conjunction with DEXA 2004, Zaragoza, Spain (2004)
Ferrucci, D., Lally, A.: UIMA: An Architectural Approach to Unstructured Information Processing in the Corporate Research Environment. Natural Language Engineering 10(3-4), 327–348 (2004)
Hristovski, D., Stare, J., Peterlin, B., Dzeroski, S.: Supporting discovery in medicine by association rule mining in Medline and UMLS. In: Proceedings of MedInfo Conference, London, England, vol. 10(2), pp. 1344–1348 (2001)
Kahng, J., Liao, W.-H.K., McLeod, D.: Mining Generalized Term Associations: Count Propagation Algorithm. In: KDD, pp. 203–206 (1997)
Klemettinen, M., Mannila, H., Ronkainen, P., Toivonen, H., Verkamo, A.I.: Finding Interesting Rules from Large Sets of Discovered Association Rules. In: Proceedings of the 3rd International Conference on Information and Knowledge Management, Gaithersburg, Maryland, pp. 401–407 (1994)
Kodratoff, Y., Azé, J.: Rating the Interest of Rules Induced from Data within Texts. In: Proceedings of Twelfth International Conference On Database and Expert Systems Application, Munich, Germany (2001)
Lindsay, R.K., Gordon, M.D.: Literature-based discovery by lexical statistics. Journal of the American Society for Information Science 50(7), 574–587 (1999)
Perutz, M.F.: Will biomedicine outgrow support? Nature 399, 299–301 (1999)
Pratt, W., Yetisgen-Yildiz, M.: LitLinker: Capturing Connections across the Biomedical Literature. In: Proceedings of the International Conference on Knowledge Capture (K-Cap 2003), Florida (2003)
Srikant, R., Agrawal, R.: Mining Generalized Association Rules. In: Proceedings of the 21st International Conference on Very Large Databases, Zurich, Switzerland (1995)
Srinivasan, P.: Query expansion and MEDLINE. Information Processing and Management 32(4), 431–443 (1996)
Srinivasan, P.: Text Mining: Generating Hypotheses from Medline. Journal of the American Society for Information Science 55(4), 396–413 (2004)
Swanson, D.R.: Fish oil, Raynaud’s syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine 30, 7–18 (1986)
Toivonen, H., Klemetinen, M., Ronkainen, P., Hatonen, K., Mannila, H.: Pruning and grouping discovered association rules. In: MLnet Workshop on Statistics, Machine Learning, and Discovery in Databases, pp. 47–52 (1995)
Weeber, M., Klein, H., Berg, L., Vos, R.: Using concepts in literature-based discovery: simulating Swanson’s Raynaud-Fish Oil and Migraine-Magnesium discoveries. Journal of the American Society for Information Science 52(7), 548–557 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Berardi, M., Lapi, M., Leo, P., Loglisci, C. (2005). Mining Generalized Association Rules on Biomedical Literature. In: Ali, M., Esposito, F. (eds) Innovations in Applied Artificial Intelligence. IEA/AIE 2005. Lecture Notes in Computer Science(), vol 3533. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11504894_68
Download citation
DOI: https://doi.org/10.1007/11504894_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26551-1
Online ISBN: 978-3-540-31893-4
eBook Packages: Computer ScienceComputer Science (R0)