research-article

Machine Translation Errors: English and Iraqi Arabic

Authors:

M. AwadAuthors Info & Claims

ACM Transactions on Asian Language Information Processing (TALIP), Volume 10, Issue 1

Article No.: 2, Pages 1 - 19

https://doi.org/10.1145/1929908.1929910

Published: 01 March 2011 Publication History

Abstract

Errors in machine translations of English-Iraqi Arabic dialogues were analyzed using the methods developed for the Human Translation Error Rate measure (HTER). Human annotations were used to refine the Translation Error Rate (TER) annotations. The analyses were performed on approximately 100 translations into each language from four translation systems. Results include high frequencies of pronoun errors and errors involving the copula in translations to English. High frequencies of errors in subject/person inflection and closed-word classes characterized translations to Iraqi Arabic. There were similar frequencies of word order errors in both translation directions and low frequencies of polarity errors. The problems associated with many errors can be predicted from structural differences between the two languages. Also problematic is the need to insert lexemes not present in the source or vice versa. Some problems associated with deictic elements like pronouns will require knowledge of the discourse context to resolve.

References

[1]

Abels, K. 2005. Expletive negation in Russian: A conspiracy theory. J. Slavic Linguist. 13, 1, 5--74.

[2]

Al-Ajlouny, M. 2007. Contrastive analysis and diglossia. Int. J. Arabic-English Stud. 8, 151--158.

[3]

Badawi, el-S. and Hinds, M. 1986. A Dictionary of Egyptian Arabic: Arabic-English. Librairie du Liban, Beirut.

[4]

Banerjee, S. and Lavie, A. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/or Summarization (MTSE’05). 65--73.

[5]

Condon, S., Phillips, J., Doran, C., Aberdeen, J., Parvaz, D., Oshika, B., Sanders, G., and Schlenoff, C. 2008. Applying automated metrics to speech translation dialogs. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC’08).

[6]

Espinal, M. T. 2000. Expletive negation, negative concord and feature checking. In Catalan Working Papers in Linguistics 8, 47--69.

[7]

Haywood, J. A. and Nahmad, H. M. 1965. A New Arabic Grammar of the Written Language. Lund Humphries, London.

[8]

Krifka, M. Forthcoming. How to interpret “expletive” negation under bevor in German. http://amor.cms.hu-berlin.de/~h2816i3x/.

[9]

Llitjós, A., Carbonell, J., and Lavie, A. 2005. A framework for interactive and automatic refinement of transfer-based machine translation. In Proceedings of the 10th Annual Conference of the European Association for Machine Translation (EAMT’05).

[10]

Matusov, E., Zens, R., Vilar, D., Mauser, A., Popović, M., Hasan, S., and Ney, H. 2006. The RWTH machine translation system. In Proceedings of the TC-STAR Workshop on Speech-to-Speech Translation (SST’06). 31--36.

[11]

McCarus, E. N. 1979. A Course in Levantine Arabic. International Book Centre.

[12]

NIST. 2007. Post editing guidelines for GALE machine translation evaluation version 3.0.2. http://projects.ldc.upenn.edu/gale/Translation/Editors/GALEpostedit_guidelines-3.0.2.pdf.

[13]

NIST. 2008. MTPostEditor_V1.2.0.jar. V1.2.2. Available at Linguistic Data Consortium (LDC). GALE: Machine translation post-editing resource page for current post-editors. http://projects.ldc.upenn.edu/gale/Translation/Editors/.

[14]

Noor, H. 1996. EEnglish syntactic errors by Arabic speaking learners: Reviewed. ERIC Document ID#ED423660. http://www.eric.ed.gov/PDFS/ED423660.pdf.

[15]

Popović, M. and Ney, H. 2007. Word error rates: Decomposition over POS classes and applications for error analysis. In Proceedings of the 2nd Workshop on Statistical Machine Translation (SMT’07). 48--55.

Digital Library

[16]

Przybocki, M., Peterson, K., and Bronsart, S. 2008. Official results of the NIST 2008 “Metrics for MAchine TRanslation” challenge (MetricsMATR’08). http://nist.gov/speech/tests/metricsmatr/2008/results/.

[17]

Sanders, G., Bronsart, S., Condon, S., and Schlenoff, C. 2008. Odds of successful transfer of low-level concepts: A key metric for bidirectional speech-to-speech machine translation in DARPA’s TRANSTAC program. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC’08).

[18]

Stallard, D., Kao, C., Krstovski, K., Liu, D., Natarajan, P., Prasad, R., Saleem, S., and Subramanian, K. 2008. Recent improvements and performance analysis of ASR and MT in a speech-to-speech translation system. In Proceedings of International Conference on Acoustics, Speech, and Signal Processing (ICASSP’08). 4973--4976.

[19]

Snover, M., Dorr, B., Schwartz, R., Micciula, L., and Makhoul, J. 2006. A study of translation error rate with targeted human annotation. In Proceedings of the Association for Machine Translation in the Americas (AMTA’06). 223--231.

[20]

Tillmann, C., Vogel, S., Ney, H., Zubiaga, A., and Sawaf, H. 1997. Accelerated DP based search for statistical translation. In Proceedings of the European Conference on Speech Communication and Technology (EUROSPEECH’97). 2667--2670.

[21]

Vilar, D., Xu, J., D’Haro, L., and Ney, H. 2006. Error analysis of statistical machine translation output. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC’06). 697--702.

[22]

Watson, J. 1993. A Syntax of San’ani Arabic. Harrassowitz, Wiesbaden, Germany.

[23]

Weiss, B., Schlenoff, C., Sanders, G., Steves, M., Condon, S., Phillips, J., and Parvaz, D. 2008. Performance evaluation of speech translation systems. In Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC’08).

Cited By

Chatterji S(2021)Hindi Correspondence of Bengali Nominal SuffixesJournal of Multimedia Information System10.33851/JMIS.2021.8.4.2218:4(221-232)Online publication date: 31-Dec-2021
https://doi.org/10.33851/JMIS.2021.8.4.221

Index Terms

Machine Translation Errors: English and Iraqi Arabic
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Machine translation

Recommendations

Evaluation of English to Arabic Machine Translation Systems using BLEU and GTM
ICETC '17: Proceedings of the 9th International Conference on Education Technology and Computers

The aim of this research study is to compare the effectiveness of three systems: Google Translator, Bing Translator and Golden Alwafi that are used to translate the corpus sentences from English language to Arabic language and then evaluate these ...
Discriminative Phrase-Based Models for Arabic Machine Translation

A design for an Arabic-to-English translation system is presented. The core of the system implements a standard phrase-based statistical machine translation architecture, but it is extended by incorporating a local discriminative phrase selection model ...
Syntactic discriminative language model rerankers for statistical machine translation

This article describes a method that successfully exploits syntactic features for n-best translation candidate reranking using perceptrons. We motivate the utility of syntax by demonstrating the superior performance of parsers over n-gram language ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Asian Language Information Processing

ACM Transactions on Asian Language Information Processing Volume 10, Issue 1

March 2011

88 pages

ISSN:1530-0226

EISSN:1558-3430

DOI:10.1145/1929908

Issue’s Table of Contents

Copyright © 2011 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 March 2011

Accepted: 01 November 2010

Revised: 01 August 2010

Received: 01 June 2010

Published in TALIP Volume 10, Issue 1

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
499
Total Downloads

Downloads (Last 12 months)5
Downloads (Last 6 weeks)0

Reflects downloads up to 25 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Chatterji S(2021)Hindi Correspondence of Bengali Nominal SuffixesJournal of Multimedia Information System10.33851/JMIS.2021.8.4.2218:4(221-232)Online publication date: 31-Dec-2021
https://doi.org/10.33851/JMIS.2021.8.4.221

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Issue’s Table of Contents