Nothing Special   »   [go: up one dir, main page]

skip to main content
article
Free access

Machine translation divergences: a formal description and proposed solution

Published: 01 December 1994 Publication History

Abstract

There are many cases in which the natural translation of one language into another results in a very different form than that of the original. The existence of translation divergences (i.e., crosslinguistic distinctions) makes the straightforward transfer from source structures into target structures impractical. Many existing translation systems have mechanisms for handling divergent structures but do not provide a general procedure that takes advantage of takes advantage of the systematic relation between lexical-semantic structure and syntactic structure. This paper demonstrates that a systematic solution to the divergence problem can be derived from the formalization of two types of information: (1) the linguistically grounded classes upon which lexical-semantic divergences are based; and (2) the techniques by which lexical-semantic divergences are resolved. This formalization is advantageous in that it facilitates the design and implementation of the system, allows one to make an evaluation of the status of the system, and provides a basis for proving certain important properties about the system.

References

[1]
Abeillé, Anne; Schabes, Yves; and Joshi, Aravind K. (1990). "Using lexicalized tags for machine translation." In Proceedings, 13th International Conference on Computational Linguistics, Helsinki, Finland, 1--6.]]
[2]
Abney, S. (1989). "A computational model of human parsing." Journal of Psycholinguistic Research 18:129--144.]]
[3]
Allen, James F. (1983). "Maintaining knowledge about temporal intervals." Communications of the ACM 26(11):832--843.]]
[4]
Allen, James F. (1984). "Towards a general theory of action and time." Artificial Intelligence 23(2):123--160.]]
[5]
Alonso, Juan Alberto. (1990). "Transfer interstructure: designing an 'interlingua' for transfer-based MT systems." In Proceedings, Third International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages, Linguistics Research Center, The University of Texas, Austin, Texas, 189--201.]]
[6]
Arnold, Doug, and des Tombe, Louis. (1987). "Basic theory and methodology in Eurotra." In Machine Translation: Theoretical and Methodological Issues, edited by Sergei Nirenburg, 114--135. Cambridge: Cambridge University Press.]]
[7]
Bernett, Jim; Mani, Inderjeet; Martin, Paul; and Rich, Elaine (1991a). "Reversible machine translation: What to do when the languages don't line up." In Proceedings, Workshop on Reversible Grammars in Natural Language Processing, ACL-91, University of California, Berkeley, California, 61--70.]]
[8]
Barnett, Jim; Mani, Inderjeet; Rich, Elaine; Aone, Chinatsu; Knight, Kevin; and Martinez, Juan C. (1991b). "Capturing language-specific semantic distinctions in interlingua-based MT." In Proceedings, Machine Translation Summit, Washington, DC, 25--32.]]
[9]
Beaven, John (1992a). "Lexicalist unification-based machine translation." Doctoral dissertation, University of Edinburgh, Edinburgh, UK.]]
[10]
Beaven, John (1992b). "Shake and bake machine translation." In Proceedings, 14th International Conference on Computational Linguistics, Nantes, France, 603--609.]]
[11]
Boguraev, Branimir, and Briscoe, Ted (1989). Computational Lexicography for Natural Language Processing. London: Longman.]]
[12]
Boguarev, Branimir, and Pustejovsky, James (1990). "Lexical ambiguity and the role of knowledge representation in lexical design." In Proceedings, 13th International Conference on Computational Linguistics, Helsinki, Finland, 36--41.]]
[13]
Boitet, Christian (1987). "Research and development on MT and related techniques at Grenoble University (GETA)." In Machine Translation: The State of the Art, edited by Margaret King, 133--153. Edinburgh University Press, Edinburgh.]]
[14]
Brent, Michael (1993). "From grammar to lexicon: Unsupervised learning of lexical syntax." Computational Linguistics 19(2):243--262.]]
[15]
Briscoe, E. J., and Copestake, A. A. (1990). "Enjoy the paper: Lexical semantics via lexicology." In Proceedings, 13th International Conference on Computational Linguistics, Helsinki, Finland, 42--47.]]
[16]
Byrd, Roy J., Calzoari, Nicoletta; Chodorow, Martin S.; Klavans, Judith L.; Neff, Mary S.; and Rizk, Omneya A. (1987). "Tools and methods for computational linguistics." Computational Linguistics 13(3-4):219--240.]]
[17]
Carbonell, Jaime G., and Tomita, Masaru (1987). "Knowledge-based machine translation, the CMU approach." In Machine Translation: Theoretical and Methodological Issues, edited by Sergei Nirenburg, 68--89. Cambridge: Cambridge University Press.]]
[18]
Chomsky, Noam A. (1981). Lectures on Government and Binding. Dordrecht, Holland: Foris Publications.]]
[19]
Chomsky, Noam A. (1982). Some Concepts and Consequences of the Theory of Government and Binding. Cambridge: MIT Press.]]
[20]
Chomsky, Noam A. (1986a). Barriers. Cambridge: MIT Press.]]
[21]
Chomsky, Noam A. (1986b). Knowledge of Language: Its Nature, Origin and Use. Cambridge: MIT Press.]]
[22]
Copeland, C.; Durand, J.; Krauwer, S.; and Maegaard, B. (1991). "The Eurotra linguistic specifications." In Studies in Machine Translation and Natural Language Processing, Volume 1, edited by Erwin Valentini. Brussels: Commission of the European Communities.]]
[23]
Derr, M., and McKeown, K. (1984). "Using focus to generate complex and simple sentences." In Proceedings, Tenth International Conference on Computational Linguistics, Stanford, California, 319--326.]]
[24]
Dorr, Bonnie J. (1990a). "A cross-linguistic approach to machine translation." In Proceedings, Third International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages, Linguistics Research Center, The University of Texas, Austin, Texas, 13--32.]]
[25]
Dorr, Bonnie J. (1990b). "Solving thematic divergences in machine translation." In Proceedings, 28th Annual Conference of the Association for Computational Linguistics, University of Pittsburgh, Pittsburgh, Pennsylvania, 127--134.]]
[26]
Dorr, Bonnie J. (1992a). "Lexical semantics for interlingual machine translation." Machine Translation 7(3).]]
[27]
Dorr, Bonnie J. (1992b). "A parameterized approach to integrating aspect with lexical-semantics for machine translation." In Proceedings, 30th Annual Conference of the Association of Computational Linguistics, University of Delaware, Newark Delaware, 257--264.]]
[28]
Dorr, Bonnie J. (1993a). "Interlingual machine translation: A parameterized approach." Artificial Intelligence 63(1, 2).]]
[29]
Dorr, Bonnie J. (1993b). Machine Translation: A View from the Lexicon. Cambridge: MIT Press.]]
[30]
Dorr, Bonnie J., and Gaasterland, Terry (1992). "Reflecting time in generated text: Tense, aspect and temporal connecting words." Technical report UMIACS TR 92-92, CS TR 2950, Department of Computer Science, University of Maryland, College Park, Maryland.]]
[31]
Dorr, Bonnie J., and Voss, Clare R. (1993a). "Constraints on the space of MT divergences." In Building Lexicons for Machine Translation, Papers from the 1993 Spring Symposium, Technical report SS-93-02, Stanford University, Stanford, California.]]
[32]
Dorr, Bonnie J., and Voss, Clare R. (1993b). Machine translation of spatial expressions: Defining the relation between an interlingua and a knowledge representation system." In Proceedings, Twelfth Conference of the American Association for Artificial Intelligence, Washington, DC.]]
[33]
Farwell, David; Guthrie, Louise; and Wilks, Yorick (1992). "The automatic creation of lexical entries for a multilingual MT system." In Proceedings, 14th International Conference on Computational Linguistics, Nantes, France, 532--538.]]
[34]
Goldman, Neil M. (1975). "Conceptual memory and inference." In Conceptual Information Processing, edited by Roger C. Schank, 289--371. Amsterdam, Holland: Elsevier Science Publishers.]]
[35]
Granville, Robert (1983). "Cohesion in computer text generation: Lexical substitution." Technical Report, LCS Technical Report 310, Massachusetts Institute of Technology, Cambridge, Massachusetts.]]
[36]
Hale, Kenneth, and Keyser, S. Jay (1986a). "Some transitivity alternations in English." Technical report, Lexicon Project Working Paper 7, Center for Cognitive Science, Massachusetts Institute of Technology, Cambridge, Massachusetts.]]
[37]
Hale, Kenneth, and Keyser, S. Jay (1986b). "A view from the middle." Technical report, Lexicon Project Working Paper 10, Center for Cognitive Science, Massachusetts Institute of Technology, Cambridge, Massachusetts.]]
[38]
Hale, Kenneth, and Keyser, S. Jay (1989). "On some syntactic rules in the lexicon." Technical Report, Center for Cognitive Science, Massachusetts Institute of Technology, Cambridge, Massachusetts.]]
[39]
Hale, Kenneth, and Laughren, Mary (1983). "Warlpiri lexicon project: Warlpiri dictionary entries." Technical report, Warlpiri Lexicon Project, Massachusetts Institute of Technology, Cambridge, Massachusetts.]]
[40]
Hornstein, Norbert (1990). As Time Goes By. Cambridge: MIT Press.]]
[41]
Hovey, Eduard (1988). Generating Natural Language Under Pragmatic Constraints. New Jersey: Lawrence Erlbaum Associates.]]
[42]
Isabelle, Pierre (1987). "Machine translation at the TAUM group." In Machine Translation: The State of the Art, edited by Margaret King, 247--277. Edinburgh: Edinburgh University Press.]]
[43]
Jackendoff, Ray S. (1983). Semantics and Cognition. Cambridge: MIT Press.]]
[44]
Jackendoff, Ray S. (1990). Semantic Structures. Cambridge: MIT Press.]]
[45]
Jacobs, Paul S. (1985). "PHRED: A generator for natural language interfaces." Computational Linguistics 11(4):219--242.]]
[46]
Johnson, Rod; King, Maghi; and des Tombe, Louis (1985). "Eurotra: A multilingual system under development." Computational Linguistics 11(2, 3):155--169.]]
[47]
Kameyama, Megumi; Ochitani, Ryo; Peters, Stanley; and Sirai, Hidetoshi (1991). "Resolving translation mismatches with information flow." In Proceedings, 29th Annual Meeting of the Association for Computational Linguistics, University of California, Berkeley, California, 193--200.]]
[48]
Kaplan, Ronald M., and Bresnan, Joan (1982). "Lexical-functional grammar: A formal system for grammatical representation." In The Mental Representation of Grammatical Relations, edited by Joan Bresnan, 173--281. Cambridge: MIT Press.]]
[49]
Kaplan, Ronald M.; Netter, Klaus; Wedekind, Jurgen; and Zaenen, Annie (1989). "Translation by structural correspondences." In Proceedings, Fourth Conference of the European Chapter of the Association for Computational Linguistics, Manchester, UK, 272--281.]]
[50]
Kay, Martin (1984). "Functional unification grammar: A formalism for machine translation." In Proceedings, 10th International Conference on Computational Linguistics, Stanford University, Stanford, California, 75--78.]]
[51]
Kinoshita, Satoshi; Phillips, John; and Tsujii, Jun-ichi (1992). Interaction between structural changes in machine translation." In Proceedings, 14th International Conference on Computational Linguistics, Nantes, France, 679--685.]]
[52]
Kittredge, Richard I.; Iordanskaja, Lidija; and Polguère, Alain (1988). "Multi-lingual text generation and the meaning-text theory." In Proceedings, Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages, Carnegie Mellon University, Pittsburgh, Pennsylvania.]]
[53]
Levin, Beth, and Rappaport, Malka (1986). "The formation of adjectival passives." Linguistic Inquiry 17:623--662.]]
[54]
Lindop, Jeremy, and Tsujii, Jun-ichi (1991). "Complex transfer in MT: A survey of examples." Technical report, CCL/UMIST Report 91/5, Center for Computational Linguistics, UMIST, Manchester, UK.]]
[55]
Marantz, Alec (1992). "The way-constructions and the semantics of direct arguments in English: A reply to Jackendoff." Technical report, Department of Linguistics and Philosophy, Massachusetts Institute of Technology, Cambridge, Massachusetts.]]
[56]
McCord, Michael C. (1989). "Design of LMT: A prolog-based machine translation system." Computational Linguistics 15(1):33--52.]]
[57]
McKeown, Kathleen (1985). Text generation: Using discourse strategies and focus constraints to generate natural language text. Cambridge: Cambridge University Press.]]
[58]
Melby, A. K. (1986). "Lexical transfer: Missing element in linguistic theories." In Proceedings, 11th International Conference on Computational Linguistics, Bonn, Germany.]]
[59]
Meyer, Ingrid; Onyshkevych, Boyan; and Carlson, Lynn (1990). "Lexicographic principles and design for knowledge-based machine translation." Technical report, CMU CMT Technical Report 90--118, Carnegie Mellon University, Pittsburgh, Pennsylvania.]]
[60]
Montemagni, Simonetta; and Vanderwende, Lucy (1992). "Structural patterns vs. string patterns for extracting semantic information from dictionaries." In Proceedings, 14th International Conference on Computational Linguistics, Nantes, France, 546--552.]]
[61]
Nirenburg, Sergei; Carbonell, Jaime; Tomita, Masaru; and Goodman, Kenneth (1992). Machine Translation: A Knowledge-Based Approach. San Mateo, California: Morgan Kaufmann.]]
[62]
Nirenburg, Sergei, and Goodman, Kenneth (1990). "Treatment of meaning in MT systems." In Proceedings, Third International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages, Linguistics Research Center, The University of Texas, Austin, Texas, 171--187.]]
[63]
Nirenburg, Sergei, and Levin, Lori (1989). "Knowledge representation support." Machine Translation 4(1):25--52.]]
[64]
Nirenburg, Sergei, and Nirenburg, Irene (1988). "A framework for lexical selection in natural language generation." In Proceedings, 12th International Conference on Computational Linguistics, Budapest, Hungary, 471--475.]]
[65]
Nirenburg, Sergei; Raskin, Victor; and Tucker, Allen B. (1987). "The structure of interlingua in translator." In Machine Translation: Theoretical and Methodological Issues, edited by Sergei Nirenburg, 90--113. Cambridge: Cambridge University Press.]]
[66]
Perlmutter, David M. (1983). Studies in Relational Grammar 1. Chicago: The University of Chicago Press.]]
[67]
Proctor, P. (1978). Longman Dictionary of Contemporary English. London: Longman.]]
[68]
Pustejovsky, James (1987). "On the acquisition of lexical entries: The perceptual origin of thematic relations." In Proceedings, 25th Annual Conference of the Association for Computational Linguistics, Stanford University, Stanford, California, 172--178.]]
[69]
Pustejovsy, James, and Bergler, Sabine (1987). "The acquisition of conceptual structure for the lexicon." In Proceedings, Sixth Conference of the American Association of Artificial Intelligence, Seattle, Washington, 566--570.]]
[70]
Pustejovsky, James; Bergler, Sabine; and Anick, Peter (1993). "Lexical semantic techniques for corpus analysis." Computational Linguistics 19(2):331--358.]]
[71]
Reichenbach, H. (1947). Elements of Symbolic Logic. London: Macmillan.]]
[72]
Reiter, Ehud (1990). "Generating appropriate natural language object descriptions." Doctoral dissertation, Harvard University, Cambridge, MA.]]
[73]
Sadler, Louisa, and Thompson, Henry S. (1991). "Structural non-correspondence in translation." In Proceedings, Fifth Conference of the European Chapter of the Association for Computational Linguistics, Berlin, Germany, 293--298.]]
[74]
Shieber, Stuart M.; van Noord, Gertjan; Moore, Robert C.; and Pereira, Fernando C. N. (1989). "A semantic-head-driven generation algorithm for unification-based formalisms." In Proceedings, 27th Annual Conference of the Association for Computational Linguistics, University of British Columbia, Vancouver, British Columbia, Canada, 7--17.]]
[75]
Shieber, Stuart M.; van Noord, Gertjan; Moore, Robert C.; and Pereira, Fernando C. N. (1990). "Semantic-head-driven generation." Computational Linguistics 16(1).]]
[76]
Sondheimer, N.; Cumming, S.; and Albano, R. (1990). "How to realize a concept: Lexical selection and the conceptual network in text generation." Machine Translation 5(1):57--78.]]
[77]
Thurmair, Gregor (1990). "Complex lexical transfer in metal." In Proceedings, Third International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages, Linguistics Research Center, The University of Texas, Austin, Texas, 91--107.]]
[78]
Trujillo, Arturo (1992). "Locations in the machine translation of prepositional phrases." In Proceedings, Fourth International Conference on Theoretical and Methodological Issues in Machine Translation of Natural Languages, Montreal, Canada, 13--20.]]
[79]
Tsujii, Jun-ichi, and Fujita, Kimikazu (1991). "Lexical transfer based on bilingual signs: Towards interaction during transfer." In Proceedings, European Chapter of the Association for Computational Linguistics, Berlin, Germany, 275--280.]]
[80]
Vauquois, Bernard, and Boitet, Christian (1985). "Automated translation at Grenoble University." Computational Linguistics 11(1):28--36.]]
[81]
Werner, P., and Nirenburg, S. (1988). "A specification language that supports the realization of intersentential anaphora." In Proceedings, AAAI Workshop on Natural Language Generation, Saint Paul, Minnesota.]]
[82]
Whitelock, Pete (1992). "Shake-and-bake translation." In Proceedings, 14th International Conference on Computational Linguistics, Nantes, France, 784--791.]]
[83]
Wilks, Yorick (1973). "An artificial intelligence approach to machine translation." In Computer Models of Thought and Language, edited by Roger C. Schank and K. M. Colby, 114--151. San Francisco: Freeman.]]
[84]
Zubizarreta, Maria Luisa (1982). "On the relationship of the lexicon to syntax." Doctoral dissertation, Department, of Linguistics and Philosophy, Massachusetts Institute of Technology, Cambridge, Massachusetts.]]
[85]
Zubizarreta, Maria Luisa (1987). Levels of Representation in the Lexicon and in the Syntax. Dordrecht, Holland: Foris Publications.]]

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computational Linguistics
Computational Linguistics  Volume 20, Issue 4
December 1994
170 pages
ISSN:0891-2017
EISSN:1530-9312
Issue’s Table of Contents

Publisher

MIT Press

Cambridge, MA, United States

Publication History

Published: 01 December 1994
Published in COLI Volume 20, Issue 4

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)83
  • Downloads (Last 6 weeks)1
Reflects downloads up to 25 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2020)Abstract Syntax as InterlinguaComputational Linguistics10.1162/coli_a_0037846:2(425-486)Online publication date: 1-Jun-2020
  • (2017)Role of Morphology Injection in SMTACM Transactions on Asian and Low-Resource Language Information Processing10.1145/312920817:1(1-31)Online publication date: 15-Sep-2017
  • (2017)Reassessing the value of resources for cross-lingual transfer of POS tagging modelsLanguage Resources and Evaluation10.1007/s10579-016-9362-751:4(927-960)Online publication date: 1-Dec-2017
  • (2016)Enriching a massively multilingual database of interlinear glossed textLanguage Resources and Evaluation10.1007/s10579-015-9325-450:2(321-349)Online publication date: 1-Jun-2016
  • (2014)Topic-based dissimilarity and sensitivity models for translation rule selectionJournal of Artificial Intelligence Research10.5555/2693068.269306950:1(1-30)Online publication date: 1-May-2014
  • (2014)Phrase dependency machine translation with quasi-synchronous tree-to-tree featuresComputational Linguistics10.1162/COLI_a_0017540:2(349-401)Online publication date: 1-Jun-2014
  • (2014)Linguistic analysis of non-ITG word reordering between language pairs with different word order typologiesACM Transactions on Asian Language Information Processing10.1145/264481013:3(1-12)Online publication date: 3-Oct-2014
  • (2014)Capturing divergence in dependency trees to improve syntactic projectionLanguage Resources and Evaluation10.1007/s10579-014-9273-448:4(709-739)Online publication date: 1-Dec-2014
  • (2012)Divergence patterns in machine translation between Malayalam and EnglishProceedings of the International Conference on Advances in Computing, Communications and Informatics10.1145/2345396.2345525(788-794)Online publication date: 3-Aug-2012
  • (2012)David Bellos (ed)Machine Translation10.1007/s10590-012-9129-x26:3(255-269)Online publication date: 1-Sep-2012
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Full Access

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media