An Interactive Proofreading System for Inappropriately Selected Words on Using Predictive Text Entry

Hideya Iwasaki & Kumiko Tanaka-Ishii

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3248)

Included in the following conference series:

International Conference on Natural Language Processing

Abstract

Predictive text entry systems on computers, such as kana-to-kanji conversion, provide a mechanism that lets users select among possible words for a given input. Mistakes in selection are relatively common, and they introduce real-word errors. A proofreading system is thus needed to detect and correct real-word errors on a computer without imposing troublesome operations on users. To this end, a practical proofreading system for Japanese text is proposed. The system automatically detects possible real-word homonym errors and, for each detected word, suggests substitution candidates with the same pronunciation. The user can either choose the most appropriate candidate or leave the original untouched. The system uses an algorithm based on the Naïve Bayesian method. Although the proofreading system was implemented for homonym errors in Japanese text, its design concept and algorithm are also applicable to other languages. The client program of the proofreading system is implemented in the Emacs text editor and works in real time.
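
The detect-and-suggest loop described in the abstract can be illustrated with a minimal sketch. It assumes pre-tokenized text, a hand-supplied table of homonym groups, bag-of-words context features within a fixed window, and add-alpha smoothing. None of these details are given in the abstract, and the class name HomonymChecker, its parameters, and the English stand-in homophones (their/there) are hypothetical choices for illustration, not the authors' implementation.

```python
import math
from collections import Counter, defaultdict


class HomonymChecker:
    """Naive Bayes sketch: rank same-pronunciation candidates by context."""

    def __init__(self, homonym_groups, alpha=1.0, window=2):
        self.alpha = alpha      # add-alpha smoothing constant (assumption)
        self.window = window    # context words taken on each side (assumption)
        # Map each word to the full set of words sharing its pronunciation.
        self.group_of = {w: frozenset(g) for g in homonym_groups for w in g}
        self.word_count = Counter()
        self.context_count = defaultdict(Counter)
        self.vocab = set()

    def _context(self, tokens, i):
        # Neighbouring words only; the target word itself is excluded.
        return tokens[max(0, i - self.window):i] + tokens[i + 1:i + 1 + self.window]

    def train(self, sentences):
        for tokens in sentences:
            for i, w in enumerate(tokens):
                if w in self.group_of:
                    self.word_count[w] += 1
                    for c in self._context(tokens, i):
                        self.context_count[w][c] += 1
                        self.vocab.add(c)

    def _log_score(self, w, context):
        # log P(w) + sum over context words of log P(c | w), both smoothed.
        group = self.group_of[w]
        total = sum(self.word_count[g] for g in group)
        score = math.log((self.word_count[w] + self.alpha)
                         / (total + self.alpha * len(group)))
        denom = (sum(self.context_count[w].values())
                 + self.alpha * (len(self.vocab) + 1))
        for c in context:
            score += math.log((self.context_count[w][c] + self.alpha) / denom)
        return score

    def check(self, tokens, margin=0.0):
        """Return (position, written word, ranked candidates) for suspect words."""
        suggestions = []
        for i, w in enumerate(tokens):
            if w not in self.group_of:
                continue
            ctx = self._context(tokens, i)
            scores = {g: self._log_score(g, ctx) for g in self.group_of[w]}
            ranked = sorted(scores, key=scores.get, reverse=True)
            # Flag only when a different candidate clearly beats the written word.
            if ranked[0] != w and scores[ranked[0]] - scores[w] > margin:
                suggestions.append((i, w, ranked))
        return suggestions


# Toy training data; a real system would need a large corpus.
corpus = [s.split() for s in [
    "they lost their keys",
    "their house is big",
    "put it over there now",
    "there is a cat",
    "i saw their dog",
    "go there and wait",
]]
checker = HomonymChecker([{"their", "there"}])
checker.train(corpus)
suggestions = checker.check("they lost there keys".split())
```

In this sketch a word is flagged only when another member of its homonym group scores higher by more than margin. The written word always stays in the ranked candidate list, which mirrors the user's option to leave the original untouched.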

Author information

Authors and Affiliations

The University of Electro-Communications, Chofu-shi, Tokyo, 182-8585
Hideya Iwasaki
The University of Tokyo, Bunkyo-ku, Tokyo, 113-8658
Kumiko Tanaka-Ishii

Editor information

Editors and Affiliations

Behavior Design Corporation, IV Science-Based Industrial Park Hsinchu, 2F, No.5, Industry E. Rd, Taiwan
Keh-Yih Su
University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033; JST CREST, Honcho 4-1-8, Kawaguchi-shi, Saitama 332-0012
Jun’ichi Tsujii
Pohang University of Science and Technology (POSTECH), AITrc, Republic of Korea
Jong-Hyeok Lee
Language Information Sciences Research Centre, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Oi Yee Kwong

About this paper

Cite this paper

Iwasaki, H., Tanaka-Ishii, K. (2005). An Interactive Proofreading System for Inappropriately Selected Words on Using Predictive Text Entry. In: Su, KY., Tsujii, J., Lee, JH., Kwong, O.Y. (eds) Natural Language Processing – IJCNLP 2004. IJCNLP 2004. Lecture Notes in Computer Science, vol 3248. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30211-7_80

DOI: https://doi.org/10.1007/978-3-540-30211-7_80
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24475-2
Online ISBN: 978-3-540-30211-7
eBook Packages: Computer Science, Computer Science (R0)
