Abstract
Annotation of MEDLINE citations with controlled vocabulary terms improves the quality of retrieval results. Due to variety in descriptions of similar clinical phenomena and abundance of negation and uncertainty, annotation of clinical radiology reports for subsequent indexing and retrieval with a search engine is even more important. Provided with an opportunity to add about 4,000 radiology reports to collections indexed with NLM image retrieval engine Open-i, we needed to assure good retrieval quality. To accomplish this, we explored automatic and manual approaches to annotation, as well as developed a small controlled vocabulary of chest x-ray indexing terms and guidelines for manual annotation. Manual annotation captured the most salient findings in the reports and normalized the sparse distinct descriptions of similar findings to one controlled vocabulary term. This paper presents the vocabulary and the manual annotation process, as well as an evaluation of the automatic annotation of the reports.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Rogers, F.B.: Medical subject headings. Bull. Med. Libr. Assoc. 51, 114–116 (1963)
Haynesm, R.B., Wilczynski, N., McKibbon, K.A., Walker, C.J., Sinclair, J.C.: Developing optimal search strategies for detecting clinically sound studies in MEDLINE. J. Am. Med. Inform. Assoc. 1(6), 447–458 (1994)
Hersh, W., Voorhees, E.: TREC genomics special issue overview. Inf. Retr. 12(1), 1–15 (2009)
Darmoni, S.J., Soualmia, L.F., Letord, C., Jaulent, M.C., Griffon, N., Thirion, B., Névéol, A.: Improving information retrieval using medical subject headings concepts: a test case on rare and chronic diseases. J. Med. Libr. Assoc. 100(3), 176–183 (2012)
Mork, J.G., Jimeno-Yepes, A.J., Aronson, A.R.: The NLM Medical Text Indexer System for Indexing Biomedical Literature. BioASQ Workshop, Valencia, Spain, 27 September 2013
MeSH on Demand. http://www.nlm.nih.gov/mesh/MeSHonDemand.html
Demner-Fushman, D., Antani, S., Simpson, M.S., Thoma, G.R.: Design and development of a multimodal biomedical information retrieval system. JCSE 6(2), 68–177 (2012)
Langlotz, C.P.: RadLex: a new method for indexing online educational materials. Radiographics 26(6), 1595–1597 (2006)
RadLex. http://www.radlex.org/
Humphreys, B.L., Lindberg, D.A.B.: Building the unified medical language system. In: Proceedings Annual Symposium on Computer Application in Medical Care, pp. 475–480 (1989)
Goodman, L.R., Felson, B.: Felson’s principles of chest roentgenology?: a programmed text. Saunders, Philadelphia (1999)
Daffner, R.H.: Clinical radiology: the essentials, 2nd edn. Williams & Wilkins, Baltimore (1999)
Hersh, W.R., Müller, H., Jensen, J.R., Yang, J., Gorman, P.N., Ruch, P.: Advancing biomedical image retrieval: development and analysis of a test collection. J. Am. Med. Inform. Assoc. 13(5), 488–496 (2006)
The ImageCLEF campaign. http://www.imageclef.org/2013/medical
Aronson, A.R., Lang, F.M.: An overview of MetaMap: historical perspective and recent advances. J. Am. Med. Inform. Assoc. 17(3), 229–236 (2010)
Smith, B., Ashburner, M., Rosse, C., Bard, J., Bug, W., Ceusters, W., et al.: The OBO foundry: coordinated evolution of ontologies to support biomedical data integration. Nat. Biotech. 25(11), 1251–1255 (2007)
Kuranz, J., Gilles, B.: Indexing electronic medical records using a taxonomy. Bull. Am. Soc. Inf. Sci. Technol. 39(2), 30–33 (2013)
Call, B.: Indexing electronic medical records using a taxonomy. In: Proceedings of the 2013 International Workshop on Data Management & Analytics for Healthcare, pp. 5–8. ACM (2013)
McCray, A.T., Burgun, A., Bodenreider, O.: Aggregating UMLS semantic types for reducing conceptual complexity. Stud. Health Technol. Inform. (Proc Medinfo) 84((Pt 1)), 216–220 (2001)
Zaharee, M.: Building controlled vocabularies for metadata harmonization. Bull. Am. Soc. Inf. Sci. Technol. 39(2), 39–42 (2013)
RSNA radiology reporting initiative. http://www.radreport.org/template/0000102
Lin, J., Wilbur, W.J.: PubMed related articles: a probabilistic topic-based model for content similarity. BMC Bioinf. 8(1), 423 (2007)
Chapman, W.W., Bridewell, W., Hanbury, P., Cooper, G.F., Buchanan, B.G.: A simple algorithm for identifying negated findings and diseases in discharge summaries. J. Biomed. Inform. 34(5), 301–310 (2001)
MEDLINE Indexing Online Training Course. http://www.nlm.nih.gov/bsd/indexing/training/USE_010.htm
Siegel, S., Castellan, N.: Nonparametric Statistics for the Behavioral Sciences, 2nd edn. McGraw-Hill, New York (1988)
Khare, R., Good, B.M., Leaman, R., Su, A.I., Lu, Z.: Crowdsourcing in biomedicine: challenges and opportunities. Brief Bioinform, 17 April 2015
Bekhuis, T., Demner-Fushman, D., Crowley, R.S.: Comparative effectiveness research designs: an analysis of terms and coverage in medical subject headings (MeSH) and emtree. J. Med. Libr. Assoc. 101(2), 92–100 (2013)
Acknowledgments
This work was supported by the intramural research program of the U. S. National Library of Medicine, National Institutes of Health.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Demner-Fushman, D., Shooshan, S.E., Rodriguez, L., Antani, S., Thoma, G.R. (2015). Annotation of Chest Radiology Reports for Indexing and Retrieval. In: Müller, H., Jimenez del Toro, O., Hanbury, A., Langs, G., Foncubierta Rodriguez, A. (eds) Multimodal Retrieval in the Medical Domain. MRDM 2015. Lecture Notes in Computer Science(), vol 9059. Springer, Cham. https://doi.org/10.1007/978-3-319-24471-6_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-24471-6_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24470-9
Online ISBN: 978-3-319-24471-6
eBook Packages: Computer ScienceComputer Science (R0)