Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/976909.979654dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

A DP based search using monotone alignments in statistical translation

Published: 07 July 1997 Publication History

Abstract

In this paper, we describe a Dynamic Programming (DP) based search algorithm for statistical translation and present experimental results. The statistical translation uses two sources of information: a translation model and a language model. The language model used is a standard bigram model. For the translation model, the alignment probabilities are made dependent on the differences in the alignment positions rather than on the absolute positions. Thus, the approach amounts to a first-order Hidden Markov model (HMM) as they are used successfully in speech recognition for the time alignment problem. Under the assumption that the alignment is monotone with respect to the word order in both languages, an efficient search strategy for translation can be formulated. The details of the search algorithm are described. Experiments on the EuTrans corpus produced a word error rate of 5.1%.

References

[1]
A. L. Berger. P. F. Brown. S. A. Della Pietra. V. J. Della Pietra. J. R. Gillett. J. D. Lafferty. R. L. Mercer. H. Printz. and L. Ures. 1994. "The Candide System for Machine Translation". In Proc. of ARPA Human Language Technology Workshop. pp. 152--157. Plainsboro. NJ. Morgan Kaufmann Publishers. San Mateo. CA. March.
[2]
P. F. Brown, V. J. Della Pietra. S. A. Della Pietra. and R. L. Mercer. 1993. "The Mathematics of Statistical Machine Translation: Parameter Estimation". Computational Linguistics. Vol. 19. No. 2. pp. 263--311.
[3]
I. Dagan. K. W. Church. and W. A. Gale. 1993. "Robust Bilingual Word Alignment for Machine Aided Translation". In Proc. of the Workshop on very Large Corpora. pp. 1--8. Columbus, OH.
[4]
P. Fung. and K. W. Church. 1994. "K-vec: A New Approach for Aligning Parallel Texts". In Proc. of the 15th Int. Conf. on Computational Linguistics, pp. 1096--1102. Kyoto.
[5]
F. Jelinek. 1976. "Speech Recognition by Statistical Methods". Proc. of the IEEE. Vol. 64. pp. 532--556. April.
[6]
M. Kay. and M. Röscheisen. 1993. "Text-Translation Alignment". Computational Linguistics. Vol. 19. No. 2. pp. 121--142.
[7]
H. Ney, D. Mergel, A. Noll, A. Paeseler. 1992. "Data Driven Search Organization for Continuous Speech Recognition". IEEE Trans. on Signal Processing, Vol. SP-40. No. 2. pp. 272--281. February.
[8]
E. Vidal. 1996. "Final report of Esprit Research Project 20268 (EuTrans): Example-Based Understanding and Translation Systems". Universidad Politécnica de Valencia. Instituto Tecnológio de Informática, October.
[9]
E. Vidal. 1997. "Finite-State Speech-to-Speech Translation". In Proc. of the Int. Conf. on Acoustics, Speech and Signal Processing. Munich. April.
[10]
S. Vogel, H. Ney, and C. Tillmann. 1996. "HMM Based Word Alignment in Statistical Translation". In Proc. of the 16th Int. Conf. on Computational Linguistics. pp. 836--841. Copenhagen. August.
[11]
D. Wu. 1996. "A Polynomial-Time Algorithm for Statistical Machine Translation". In Proc. of the 34th Annual Conf. of the Association for Computational Linguistics, pp. 152--158. Santa Cruz, CA. June.

Cited By

View all
  • (2012)Document-wide decoding for phrase-based statistical machine translationProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2391081(1179-1190)Online publication date: 12-Jul-2012
  • (2011)Enriching document representation via translation for improved monolingual information retrievalProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2010030(853-862)Online publication date: 24-Jul-2011
  • (2010)Unsupervised cleansing of noisy textProceedings of the 23rd International Conference on Computational Linguistics: Posters10.5555/1944566.1944588(189-196)Online publication date: 23-Aug-2010
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '98/EACL '98: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
July 1997
543 pages

Sponsors

  • Directorate General XIII (European Commission)
  • Universidad Complutense de Madrid
  • Universidad Autónoma de Madrid
  • Universidad Nacional de Educación a Distancia
  • Universidad Politécnica de Madrid

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 07 July 1997

Qualifiers

  • Article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)41
  • Downloads (Last 6 weeks)10
Reflects downloads up to 22 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2012)Document-wide decoding for phrase-based statistical machine translationProceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning10.5555/2390948.2391081(1179-1190)Online publication date: 12-Jul-2012
  • (2011)Enriching document representation via translation for improved monolingual information retrievalProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2010030(853-862)Online publication date: 24-Jul-2011
  • (2010)Unsupervised cleansing of noisy textProceedings of the 23rd International Conference on Computational Linguistics: Posters10.5555/1944566.1944588(189-196)Online publication date: 23-Aug-2010
  • (2009)A beam-search extraction algorithm for comparable dataProceedings of the ACL-IJCNLP 2009 Conference Short Papers10.5555/1667583.1667653(225-228)Online publication date: 4-Aug-2009
  • (2008)Statistical machine translationACM Computing Surveys10.1145/1380584.138058640:3(1-49)Online publication date: 13-Aug-2008
  • (2007)Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translationProceedings of the NAACL-HLT 2007/AMTA Workshop on Syntax and Structure in Statistical Translation10.5555/1626281.1626282(1-8)Online publication date: 26-Apr-2007
  • (2007)Combination of statistical word alignments based on multiple preprocessing schemesHuman Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers10.5555/1614108.1614115(25-28)Online publication date: 22-Apr-2007
  • (2006)Efficient dynamic programming search algorithms for phrase-based SMTProceedings of the Workshop on Computationally Hard Problems and Joint Inference in Speech and Language Processing10.5555/1631828.1631830(9-16)Online publication date: 9-Jun-2006
  • (2006)Distortion models for statistical machine translationProceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics10.3115/1220175.1220242(529-536)Online publication date: 17-Jul-2006
  • (2006)Stemming to improve translation lexicon creation form bitextsInformation Processing and Management: an International Journal10.1016/j.ipm.2005.07.00242:4(1003-1016)Online publication date: 1-Jul-2006
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media