Abstract
The CCKS 2018 presented a named entity recognition (NER) task focusing on Chinese electronic medical records (EMR). The Knowledge Engineering Group of Tsinghua University and Yidu Cloud Beijing Technology Co., Ltd. provided an annotated dataset for this task, which is the only publicly available dataset in the field of Chinese EMR. Using this dataset, 69 systems were developed for the task. The performance of the systems showed that the traditional CRF and Bi-LSTM model were the most popular models for the task. The system achieved the highest performance by combining CRF or Bi-LSTM model with complex feature engineering, indicating that feature engineering is still indispensable. These results also showed that the performance of the task could be augmented with rule-based systems to determine clinical named entities.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
The annotated dataset has not been deposited in a public repository but is available to the research community under data use agreements from the corresponding author on request.
References
de Bruijn, B., Cherry, C., Kiritchenko, S., Martin, J., Zhu, X.: Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010. J. Am. Med. Inf. Assoc. 18(5), 557 (2011)
Jiang, M., et al.: A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries. JAMIA 18, 601–606 (2011)
Kundeti, S.R., Vijayananda, J., Mujjiga, S., Kalyan, M.: Clinical named entity recognition: challenges and opportunities. In: IEEE International Conference on Big Data, pp. 1937–1945 (2016)
Luo, L., Li, N., Li, S.S., Yang, Z., Lin, H.: Dutir at the ccks-2018 task1: a neural network ensemble approach for chinese clinical named entity recognition. In: CCKS Tasks (2018)
Meystre, S.M., Savova, G.K., Kipper-Schuler, K.C., Hurdle, J.F.: Extracting information from textual documents in the electronic health record: a review of recent research. In: Yearbook of Medical Informatics, pp. 128–144, January 2008
Pradhan, S., Elhadad, N., Chapman, W.W., Manandhar, S., Savova, G.: Semeval-2014 task 7: analysis of clinical text. In: SemEval@COLING, pp. 54–62 (2014)
Qiu, W., Chen, M., Ding, R., Xie, P.: Heiheihahei at ccks clinical entity recognition task: a neural-based ensemble approach. In: CCKS Tasks (2018)
Ratinov, L., Roth, D.: Design challenges and misconceptions in named entity recognition. In: CoNLL, June 2009
Settles, B.: Biomedical named entity recognition using conditional random fields and rich feature sets. In: JNLPBA, pp. 104–107 (2004)
Suominen, H., et al.: Overview of the ShARe/CLEF eHealth evaluation lab 2013. In: Forner, P., Müller, H., Paredes, R., Rosso, P., Stein, B. (eds.) CLEF 2013. LNCS, vol. 8138, pp. 212–231. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40802-1_24
Uzuner, O., South, B.R., Shen, S., DuVall, S.L.: 2010 i2b2/va challenge on concepts, assertions, and relations in clinical text. J. Am. Med. Inf. Assoc. 18(5), 552 (2011)
Yang, X., Huang, W.: A conditional random fields approach to clinical name entity recognition. In: CCKS Tasks (2018)
Zhang, J., et al.: Category multi-representation: a unified solution for named entity recognition in clinical texts. In: PAKDD, pp. 275–287 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, J., Li, J., Jiao, Z., Yan, J. (2019). Overview of CCKS 2018 Task 1: Named Entity Recognition in Chinese Electronic Medical Records. In: Zhu, X., Qin, B., Zhu, X., Liu, M., Qian, L. (eds) Knowledge Graph and Semantic Computing: Knowledge Computing and Language Understanding. CCKS 2019. Communications in Computer and Information Science, vol 1134. Springer, Singapore. https://doi.org/10.1007/978-981-15-1956-7_14
Download citation
DOI: https://doi.org/10.1007/978-981-15-1956-7_14
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1955-0
Online ISBN: 978-981-15-1956-7
eBook Packages: Computer ScienceComputer Science (R0)