A New Benchmark and Evaluation Schema for Chinese Typo Detection and Correction

Authors

Dingmin Wang Tsinghua University
Gabriel Pui Cheong Fung The Chinese University of Hong Kong
Maxime Debosschere The Chinese University of Hong Kong
Shichao Dong The Chinese University of Hong Kong
Jia Zhu South China Normal University
Kam-Fai Wong The Chinese University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v32i1.12173

Keywords:

NLP

Abstract

Despite the vast amount of research related to Chinese typo detection, we still lack a publicly available benchmark dataset for evaluation. Furthermore, no precise evaluation schema for Chinese typo detection has been defined. In response to these problems: (1) we release a benchmark dataset to assist research on Chinese typo correction; (2) we present an evaluation schema which was adopted in our NLPTEA 2017 Shared Task on Chinese Spelling Check; and (3) we report new improvements to our Chinese typo detection system ACT.

Downloads

Published

2018-04-29

How to Cite

Wang, D., Fung, G. P. C., Debosschere, M., Dong, S., Zhu, J., & Wong, K.-F. (2018). A New Benchmark and Evaluation Schema for Chinese Typo Detection and Correction. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12173

Download Citation

Issue

Vol. 32 No. 1 (2018): Thirty-Second AAAI Conference on Artificial Intelligence

Section

Student Abstract Track