A New Benchmark and Evaluation Schema for Chinese Typo Detection and Correction

Authors

  • Dingmin Wang Tsinghua University
  • Gabriel Pui Cheong Fung The Chinese University of Hong Kong
  • Maxime Debosschere The Chinese University of Hong Kong
  • Shichao Dong The Chinese University of Hong Kong
  • Jia Zhu South China Normal University
  • Kam-Fai Wong The Chinese University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v32i1.12173

Keywords:

NLP

Abstract

Despite the vast amount of research related to Chinese typo detection, we still lack a publicly available benchmark dataset for evaluation. Furthermore, no precise evaluation schema for Chinese typo detection has been defined. In response to these problems: (1) we release a benchmark dataset to assist research on Chinese typo correction; (2) we present an evaluation schema which was adopted in our NLPTEA 2017 Shared Task on Chinese Spelling Check; and (3) we report new improvements to our Chinese typo detection system ACT.

Downloads

Published

2018-04-29

How to Cite

Wang, D., Fung, G. P. C., Debosschere, M., Dong, S., Zhu, J., & Wong, K.-F. (2018). A New Benchmark and Evaluation Schema for Chinese Typo Detection and Correction. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12173