Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2390524.2390648dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
research-article
Free access

Modeling the translation of predicate-argument structure for SMT

Published: 08 July 2012 Publication History

Abstract

Predicate-argument structure contains rich semantic information of which statistical machine translation hasn't taken full advantage. In this paper, we propose two discriminative, feature-based models to exploit predicate-argument structures for statistical machine translation: 1) a predicate translation model and 2) an argument reordering model. The predicate translation model explores lexical and semantic contexts surrounding a verbal predicate to select desirable translations for the predicate. The argument reordering model automatically predicts the moving direction of an argument relative to its predicate after translation using semantic features. The two models are integrated into a state-of-the-art phrase-based machine translation system and evaluated on Chinese-to-English translation tasks with large-scale training data. Experimental results demonstrate that the two models significantly improve translation accuracy.

References

[1]
Wilker Aziz, Miguel Rios, and Lucia Specia. 2011. Shallow semantic trees for smt. In Proceedings of the Sixth Workshop on Statistical Machine Translation, pages 316--322, Edinburgh, Scotland, July. Association for Computational Linguistics.
[2]
Adam L. Berger, Stephen A. Della Pietra, and Vincent J. Della Pietra. 1996. A maximum entropy approach to natural language processing. Computational Linguistics, 22(1): 39--71.
[3]
David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics, 33(2): 201--228.
[4]
Pascale Fung, Wu Zhaojun, Yang Yongsheng, and Dekai Wu. 2006. Automatic learning of chinese english semantic structure mapping. In IEEE/ACL 2006 Workshop on Spoken Language Technology (SLT 2006), Aruba, December.
[5]
Philipp Koehn, Franz Joseph Och, and Daniel Marcu. 2003. Statistical phrase-based translation. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pages 58--54, Edmonton, Canada, May-June.
[6]
Philipp Koehn. 2004. Statistical significance tests for machine translation evaluation. In Proceedings of EMNLP 2004, pages 388--395, Barcelona, Spain, July.
[7]
Mamoru Komachi and Yuji Matsumoto. 2006. Phrase reordering for statistical machine translation based on predicate-argument structure. In In Proceedings of the International Workshop on Spoken Language Translation: Evaluation Campaign on Spoken Language Translation, pages 77--82.
[8]
Junhui Li, Guodong Zhou, and Hwee Tou Ng. 2010. Joint syntactic and semantic parsing of chinese. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 1108--1117, Uppsala, Sweden, July. Association for Computational Linguistics.
[9]
Ding Liu and Daniel Gildea. 2010. Semantic role features for machine translation. In Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010), pages 716--724, Beijing, China, August. Coling 2010 Organizing Committee.
[10]
Arne Mauser, Saša Hasan, and Hermann Ney. 2009. Extending statistical machine translation with discriminative and trigger-based lexicon models. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 210--218, Singapore, August. Association for Computational Linguistics.
[11]
Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of 40th Annual Meeting of the Association for Computational Linguistics, pages 311--318, Philadelphia, Pennsylvania, USA, July.
[12]
Slav Petrov, Leon Barrett, Romain Thibaux, and Dan Klein. 2006. Learning accurate, compact, and interpretable tree annotation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 433--440, Sydney, Australia, July. Association for Computational Linguistics.
[13]
Andreas Stolcke. 2002. Srilm--an extensible language modeling toolkit. In Proceedings of the 7th International Conference on Spoken Language Processing, pages 901--904, Denver, Colorado, USA, September.
[14]
Sriram Venkatapathy and Srinivas Bangalore. 2007. Three models for discriminative machine translation using global lexical selection and sentence reconstruction. In Proceedings of SSST, NAACL-HLT 2007/AMTA Workshop on Syntax and Structure in Statistical Translation, pages 96--102, Rochester, New York, April. Association for Computational Linguistics.
[15]
Dekai Wu and Pascale Fung. 2009a. Can semantic role labeling improve smt. In Proceedings of the 13th Annual Conference of the EAMT, pages 218--225, Barcelona, May.
[16]
Dekai Wu and Pascale Fung. 2009b. Semantic roles for smt: A hybrid two-pass model. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers, pages 13--16, Boulder, Colorado, June. Association for Computational Linguistics.
[17]
Xianchao Wu, Katsuhito Sudoh, Kevin Duh, Hajime Tsukada, and Masaaki Nagata. 2011. Extracting pre-ordering rules from predicate-argument structures. In Proceedings of 5th International Joint Conference on Natural Language Processing, pages 29--37, Chiang Mai, Thailand, November. Asian Federation of Natural Language Processing.
[18]
Dekai Wu. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3): 377--403.
[19]
Deyi Xiong, Qun Liu, and Shouxun Lin. 2006. Maximum entropy based phrase reordering model for statistical machine translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, pages 521--528, Sydney, Australia, July. Association for Computational Linguistics.
[20]
Deyi Xiong, Min Zhang, and Haizhou Li. 2011. A maximum-entropy segmentation model for statistical machine translation. IEEE Transactions on Audio, Speech and Language Processing, 19(8): 2494--2505.
[21]
Nianwen Xue. 2008. Labeling chinese predicates with semantic roles. Computational Linguistics, 34(2): 225--255.

Cited By

View all
  • (2016)Learning Generalized Features for Semantic Role LabelingACM Transactions on Asian and Low-Resource Language Information Processing10.1145/289049615:4(1-16)Online publication date: 12-Apr-2016
  • (2016)Bilingual Semantic Role Labeling Inference via Dual DecompositionACM Transactions on Asian and Low-Resource Language Information Processing10.1145/283549315:3(1-21)Online publication date: 9-Jan-2016
  • (2016)Inter-, Intra-, and Extra-Chunk Pre-Ordering for Statistical Japanese-to-English Machine TranslationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/281838115:3(1-28)Online publication date: 9-Jan-2016
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
ACL '12: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
July 2012
1100 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 08 July 2012

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 85 of 443 submissions, 19%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)10
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Learning Generalized Features for Semantic Role LabelingACM Transactions on Asian and Low-Resource Language Information Processing10.1145/289049615:4(1-16)Online publication date: 12-Apr-2016
  • (2016)Bilingual Semantic Role Labeling Inference via Dual DecompositionACM Transactions on Asian and Low-Resource Language Information Processing10.1145/283549315:3(1-21)Online publication date: 9-Jan-2016
  • (2016)Inter-, Intra-, and Extra-Chunk Pre-Ordering for Statistical Japanese-to-English Machine TranslationACM Transactions on Asian and Low-Resource Language Information Processing10.1145/281838115:3(1-28)Online publication date: 9-Jan-2016
  • (2014)Distortion Model Based on Word Sequence Labeling for Statistical Machine TranslationACM Transactions on Asian Language Information Processing10.1145/253712813:1(1-21)Online publication date: 1-Feb-2014

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media