Abstract
Predictive text entry systems on computers like kana-to-kanji conversion provide a mechanism that enables users to select among possible words for a given input. Mistakes in selection are relatively common, and they introduce real-word errors. A proofreading system is thus needed to detect and correct real-word errors on a computer without imposing troublesome operations on users. To this end, a practical proofreading system for Japanese text is proposed. The system automatically detects possible real-word homonym errors, and for each detected word, suggests substitution candidates of the same pronunciation. The user can either choose the most appropriate one or leave the original untouched. The system uses an algorithm based on the Naïve Bayesian method. Although the proofreading system was implemented for homonym errors in Japanese text, its design concept and algorithm are also applicable to other languages. The client program of the proofreading system is implemented on the Emacs text editor and works in real time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Brown, P.F., Pietra, S.A.D., Pietra, V.J.D., Mercer, R.L.: Word-sense disambiguation using statistical methods. In: Proceedings of the 29th Annual Meeting of the Association of Computational Linguistics ACL 1991, pp. 264–270 (1991)
Gale, W.A., Church, K.W., Yarowsky, D.: A method for disambiguating word sense in a large corpus. Computers and the Humanities 26(5-6), 415–439 (1992)
Golding, A.R.: A Bayesian hybrid method for context-sensitive spelling correction. In: Proceedings of the 3rd Annual Workshop on Very Large Corpora, pp. 39–53 (1995)
Ide, N., Veronis, J.: Introduction to the special issue on word sense disambiguation: the state of the art. Computational Linguistics 24(1), 1–40 (1998)
James, C.L., Reischel, K.M.: Text input for mobile devices: comparing model prediction to Actual Performance. In: Proceedings of the Annual SIGCHI Conference, pp. 365–371 (2001)
Kukich, K.: Techniques for automatically correcting words in text. Computing Surveys 24(4), 377–439 (1992)
Oku, M., Matsuoka, K.: A method for detecting Japanese homophone errors in compound nouns based on character cooccurrence and its evaluation. Natural Language Processing 4(3), 83–99 (1997) (in Japanese)
Shinnou, H.: Japanese homophone disambiguation using a decision list given added weight to evidences on compounds. IPSJ Journal 39(12), 3200–3206 (1998) (in Japanese)
Shinnou, H.: Detection of Japanese homophone errors by a decision list including a written word as a default evidence. IPSJ Journal 41(4), 1046–1053 (2000) (in Japanese)
Yarowsky, D.: Decision lists for lexical ambiguity resolution: application to accent restration in Spanish and French. In: Proceedings of the 32nd Annual Meeting of the Association of Computational Linguistics (ACL 1994), pp. 88–95 (1994)
Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd Annual Meeting of the Association of Computational Linguistics (ACL 1995), pp. 189–196 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Iwasaki, H., Tanaka-Ishii, K. (2005). An Interactive Proofreading System for Inappropriately Selected Words on Using Predictive Text Entry. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science(), vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_80
Download citation
DOI: https://doi.org/10.1007/978-3-540-30211-7_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer ScienceComputer Science (R0)