Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/1118637.1118642dlproceedingsArticle/Chapter ViewAbstractPublication PagessemiticConference Proceedingsconference-collections
Article
Free access

Machine transliteration of names in Arabic text

Published: 11 July 2002 Publication History

Abstract

We present a transliteration algorithm based on sound and spelling mappings using finite state machines. The transliteration models can be trained on relatively small lists of names. We introduce a new spelling-based model that is much more accurate than state-of-the-art phonetic-based models and can be trained on easier-to-obtain training data. We apply our transliteration algorithm to the transliteration of names from Arabic into English. We report on the accuracy of our algorithm based on exact-matching criterion and based on human-subjective evaluation. We also compare the accuracy of our system to the accuracy of human translators.

References

[1]
Mansur Arbabi, Scott M. Fischthal, Vincent C. Cheng, and Elizabeth Bart. 1994. Algorithms for Arabic Names Transliteration. IBM Journal of Research and Development, 38(2).
[2]
Asanee Kawtrakul, Amarin Deemagarn, Chalathip Thumkanon, Navapat Khantonthong, and Paul McFetridge. 1998. Backward Transliteration for Thai Document Retrieval. In Proceedings of the 1998 IEEE Asia-Pacific Conference on Circuits and Systems (APCCAS), pages 563--566.
[3]
Kevin Knight and Jonathan Graehl. 1997. Machine Transliteration. In Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, pages 128--135. Morgan Kaufmann.
[4]
Klaus Lagally. 1999. ArabTEX: A System for Typesetting Arabic, User Manual Version 3.09. Technical Report 1998/09, Universitat Stuttgart, Fakultät Informatik, Breitwiesenstraße 20--22, 70565 Stuttgart, Germany.
[5]
Bonnie G. Stalls and Kevin Knight. 1998. Translating Names and Technical Terms in Arabic Text. In Proceedings of the COLING/ACL Workshop on Computational Approaches to Semitic Languages.

Cited By

View all
  • (2019)A Rule-Based Kurdish Text Transliteration SystemACM Transactions on Asian and Low-Resource Language Information Processing10.1145/327862318:2(1-8)Online publication date: 18-Jan-2019
  • (2019)Phonology-Augmented Statistical Framework for Machine Transliteration Using Limited Linguistic ResourcesIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2018.287526927:1(199-211)Online publication date: 1-Jan-2019
  • (2012)Report of NEWS 2012 machine transliteration shared taskProceedings of the 4th Named Entity Workshop10.5555/2392777.2392779(10-20)Online publication date: 12-Jul-2012
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
SEMITIC '02: Proceedings of the ACL-02 workshop on Computational approaches to semitic languages
July 2002
85 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 11 July 2002

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 12 of 21 submissions, 57%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)57
  • Downloads (Last 6 weeks)9
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)A Rule-Based Kurdish Text Transliteration SystemACM Transactions on Asian and Low-Resource Language Information Processing10.1145/327862318:2(1-8)Online publication date: 18-Jan-2019
  • (2019)Phonology-Augmented Statistical Framework for Machine Transliteration Using Limited Linguistic ResourcesIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2018.287526927:1(199-211)Online publication date: 1-Jan-2019
  • (2012)Report of NEWS 2012 machine transliteration shared taskProceedings of the 4th Named Entity Workshop10.5555/2392777.2392779(10-20)Online publication date: 12-Jul-2012
  • (2012)Regularized interlingual projectionsProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2390951(12-23)Online publication date: 12-Jul-2012
  • (2012)Leveraging supplemental representations for sequential transductionProceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies10.5555/2382029.2382085(396-406)Online publication date: 3-Jun-2012
  • (2011)Mining named entities with temporally correlated bursts from multilingual web news streamsProceedings of the fourth ACM international conference on Web search and data mining10.1145/1935826.1935870(237-246)Online publication date: 9-Feb-2011
  • (2011)Machine transliteration surveyACM Computing Surveys10.1145/1922649.192265443:3(1-46)Online publication date: 29-Apr-2011
  • (2010)Finite-state scriptural translationProceedings of the 23rd International Conference on Computational Linguistics: Posters10.5555/1944566.1944657(791-800)Online publication date: 23-Aug-2010
  • (2010)Language independent transliteration mining system using finite state automata frameworkProceedings of the 2010 Named Entities Workshop10.5555/1870457.1870465(57-61)Online publication date: 16-Jul-2010
  • (2010)Report of NEWS 2010 transliteration generation shared taskProceedings of the 2010 Named Entities Workshop10.5555/1870457.1870458(1-11)Online publication date: 16-Jul-2010
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media