Machine Translation Errors: English and Iraqi Arabic

Published: 01 March 2011

Abstract

Errors in machine translations of English-Iraqi Arabic dialogues were analyzed using the methods developed for the Human Translation Error Rate measure (HTER). Human annotations were used to refine the Translation Error Rate (TER) annotations. The analyses were performed on approximately 100 translations into each language from four translation systems. Results include high frequencies of pronoun errors and errors involving the copula in translations to English. High frequencies of errors in subject/person inflection and closed-word classes characterized translations to Iraqi Arabic. There were similar frequencies of word order errors in both translation directions and low frequencies of polarity errors. The problems associated with many errors can be predicted from structural differences between the two languages. Also problematic is the need to insert lexemes not present in the source or vice versa. Some problems associated with deictic elements like pronouns will require knowledge of the discourse context to resolve.

    Published In

    ACM Transactions on Asian Language Information Processing, Volume 10, Issue 1
    March 2011
    88 pages
    ISSN:1530-0226
    EISSN:1558-3430
    DOI:10.1145/1929908
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 March 2011
    Accepted: 01 November 2010
    Revised: 01 August 2010
    Received: 01 June 2010
    Published in TALIP Volume 10, Issue 1

    Author Tags

    1. Arabic
    2. English
    3. error analysis
    4. evaluation
    5. statistical machine translation

    Qualifiers

    • Research-article
    • Research
    • Refereed
