Nothing Special   »   [go: up one dir, main page]

skip to main content
article
Free access

DATR: a language for lexical knowledge representation

Published: 01 June 1996 Publication History

Abstract

Much recent research on the design of natural language lexicons has made use of nonmonotonic inheritance networks as originally developed for general knowledge representation purposes in Artificial Intelligence. DATR is a simple, spartan language for defining nonmonotonic inheritance networks with path/value equations, one that has been designed specifically for lexical knowledge representation. In keeping with its intendedly minimalist character, it lacks many of the constructs embodied either in general-purpose knowledge representation languages or in contemporary grammar formalisms. The present paper shows that the language is nonetheless sufficiently expressive to represent concisely the structure of lexical information at a variety of levels of linguistic analysis. The paper provides an informal example-based introduction to DATR and to techniques for its use, including finite-state transduction, the encoding of DAGs and lexical rules, and the representation of ambiguity and alternation. Sample analysis of phenomena such as inflectional syncretism and verbal subcategorization are given that show how the language can be used to squeeze out redundancy from lexical descriptions.

References

[1]
Andry, Francois, Norman Fraser, Scott McGlashan, Simon Thornton, and Nick Youd. 1992 Making DATR work for speech: Lexicon compilation in SUNDIAL. Computational Linguistics 18:245--267.
[2]
Barg, Petra. 1994. Automatic acquisition of DATR theories from observations. Theories des Lexicons: Arbeiten des Sonderforschungsbereichs 282, Heinrich-Heine University of Duesseldorf.
[3]
Bleiching, Doris. 1992. Prosodisches Wissen in Lexicon. In G. Goerz, editor, Proceedings of KONVENS-92, Berlin: Springer-Verlag, pages 59--68.
[4]
Bleiching, Doris. 1994. Integration von Morphophonologie und Prosodie in ein hierarchisches Lexicon. In Harald Trost, editor, Proceedings of KONVENS-94, pages 32--41, Vienna: Oesterreichische Gesellschaft fuer Artificial Intelligence.
[5]
Bouma, Gosse. 1993. Nonmonotonicity and Categorial Unification Grammar. Proefschrift, Rijksuniversiteit Groningen.
[6]
Bouma, Gosse and John Nerbonne. 1994. Lexicons for feature-based systems. In Harald Trost, editor, Proceedings of KONVENS-94, pages 42--51, Vienna: Oesterreichische Gesellschaft fuer Artificial Intelligence.
[7]
Briscoe, Ted and Ann Copestake. 1991. Sense extensions as lexical rules. In D. Fass, E. Hinkelman & J. Martin, editors. Computational approaches to Non-Literal Language, Proceedings of the IJCAI Workshop, pages 12--20, Sydney.
[8]
Briscoe, Ted, Ann Copestake, and Alex Lascarides. 1995. Blocking. In Patrick Saint-Dizier & Evelyne Viegas, editors. Computational Lexical Semantics. Cambridge: Cambridge University Press, pages 272--302.
[9]
Briscoe, Ted, Valeria de Paiva, and Ann Copestake, editors. 1993. Inheritance, Defaults, and the Lexicon, Cambridge: Cambridge University Press.
[10]
Brown, Dunstan and Andrew Hippisley. 1994. Conflict in Russian genitive plural assignment: A solution represented in DATR. Journal of Slavic Linguistics, 2(1):48--76.
[11]
Cahill, Lynne. 1993a. Some reflections on the conversion of the TIC lexicon into DATR. In Ted Briscoe, Valeria de Paiva, and Ann Copestake, editors. Inheritance, Defaults, and the Lexicon. Cambridge: Cambridge University Press, pages 47--57.
[12]
Cahill, Lynne. 1993b. Morphonology in the lexicon. Sixth Conference of the European Chapter of the Association for Computational Linguistics, pages 87--96.
[13]
Cahill, Lynne. 1994. An inheritance-based lexicon for message understanding systems. Fourth ACL Conference on Applied Natural Language Processing, pages 211--212.
[14]
Cahill, Lynne and Roger Evans. 1990. An application of DATR: The TIC lexicon. In Proceedings of the 9th European Conference on Artificial Intelligence, pages 120--125, Stockholm.
[15]
Calder, Jo. 1994. Feature-value logics: Some limits on the role of defaults. In C. J. Rupp, M. A. Rosner, & R. L. Johnson, editors. Constraints, Language and Computation. London: Academic Press, pages 205--222.
[16]
Carpenter, Bob. 1991. The generative power of categorial grammars and head-driven phrase structure grammars with lexical rules. Computational Linguistics 17:301--313.
[17]
Carpenter, Bob. 1992. Categorial grammars, lexical rules, and the English predicative. In Robert Levine, editor. Formal Grammar: Theory and Implementation. New York: Oxford University Press, pages 168--242.
[18]
Copestake, Ann. 1992. The representation of lexical semantic information. Ph.D. dissertation, University of Sussex, Cognitive Science Research Paper CSRP 280.
[19]
Copestake, Ann and Ted Briscoe. 1992. Lexical operations in a unification based framework. In James Pustejovsky & Sabine Bergler, editors. Lexical Semantics and Knowledge Representation. Berlin: Springer-Verlag, pages 101--119.
[20]
Copestake, Ann and Ted Briscoe. 1995. Regular polysemy and semi-productive sense extension. Journal of Semantics 12:15--67.
[21]
Corbett, Greville and Norman Fraser. 1993. Network Morphology: A DATR account of Russian nominal inflection. Journal of Linguistics 29:113--142.
[22]
Daelemans, Walter. 1994. Review of Inheritance, Defaults, and the Lexicon, by Ted Briscoe, Valeria de Paiva & Ann Copestake, editors. Computational Linguistics 20(4):661--664.
[23]
Daelemans, Walter and Koenraad De Smedt. 1994. Inheritance in an object-oriented representation of linguistic categories. International Journal of Human-Computer Studies 41(1/2):149--177.
[24]
Daelemans, Walter, Koenraad De Smedt, and Gerald Gazdar. 1992. Inheritance in natural language processing. Computational Linguistics 18(2):205--218.
[25]
Daelemans, Walter and Gerald Gazdar, editors. 1992. Computational Linguistics 18(2) and 18(3), special issues on inheritance.
[26]
Daelemans, Walter and Erik-Jan van der Linden. 1992. Evaluation of lexical representation formalisms. In Jan van Eijck & Wilfried Meyer, editors. Computational Linguistics in the Netherlands: Papers from the Second CLIN Meeting, pages 54--67, Utrecht: OTS.
[27]
Domenig, Marc and Pius ten Hacken. 1992. Word Manager: A System for Morphological Dictionaries. Hidesheim: Georg Olms Verlag.
[28]
Duda, Markus and Gunter Gebhardi. 1994. DUTR---A DATR-PATR interface formalism. In Harald Trost, editor. Proceedings of KONVENS-94, pages 411--414, Vienna: Oesterreichische Gesellschaft fuer Artificial Intelligence.
[29]
Evans, Roger and Gerald Gazdar. 1989a. Inference in DATR. Fourth Conference of the European Chapter of the Association for Computational Linguistics, pages 66--71.
[30]
Evans, Roger and Gerald Gazdar. 1989b. The semantics of DATR. In Anthony G. Cohn, editor. Proceedings of the Seventh Conference of the Society for the Study of Artificial Intelligence and Simulation of Behaviour, pages 79--87, London: Pitman/Morgan Kaufmann.
[31]
Evans, Roger, Gerald Gazdar, and Lionel Moser. 1993. Prioritised multiple inheritance in DATR. In Ted Briscoe, Valeria de Paiva, and Ann Copestake, editors. Inheritance, Defaults, and the Lexicon. Cambridge: Cambridge University Press, pages 38--46.
[32]
Evans, Roger, Gerald Gazdar, and David Weir. 1995. Encoding lexicalized tree adjoining grammars with a nonmonotonic inheritance hierarchy. 33rd Annual Meeting of the Association for Computational Linguistics, pages 77--84.
[33]
Flickinger, Daniel P. 1987. Lexical Rules in the Hierarchical Lexicon. Ph.D. dissertation, Stanford University.
[34]
Fraser, Norman and Greville Corbett. 1995. Gender, animacy, and declensional class assignment: A unified account for Russian. In Geert Booij & Jaap van Marle, editors. Year Book of Morphology 1994. Dordrecht: Kluwer, pages 123--150.
[35]
Fraser, Norman and Greville Corbett. In press. Gender assignment in Arapesh: A Network Morphology analysis. Lingua.
[36]
Fraser, Norman and Richard Hudson. 1990. Word Grammar: An inheritance-based theory of language. In Walter Daelemans & Gerald Gazdar, editors. Proceedings of the Workshop on Inheritance in Natural Language Processing, pages 58--64, Tilburg: Institute for Language Technology.
[37]
Gazdar, Gerald. 1992. Paradigm function morphology in DATR. In Lynne Cahill & Richard Coates, editors. Sussex Papers in General and Computational Linguistics. Brighton, University of Sussex, Cognitive Science Research Paper CSRP 239, pages 43--53.
[38]
Gibbon, Dafydd. 1990. Prosodic association by template inheritance. In Walter Daelemans & Gerald Gazdar, editors. Proceedings of the Workshop on Inheritance in Natural Language Processing, pages 65--81, Tilburg: Institute for Language Technology.
[39]
Gibbon, Dafydd. 1992. ILEX: A linguistic approach to computational lexica. In Ursula Klenk, editor. Computatio Linguae: Aufsaze zur algorithmischen und quantitativen Analyse der Sprache (Zeitschrift fur Dialektologie und Linguistik, Beiheft 73), Stuttgart: Franz Steiner Verlag, pages 32--53.
[40]
Gibbon, Dafydd. 1993. Generalized DATR for flexible lexical access: PROLOG specification. Bielefeld: Verbmobil Report 2.
[41]
Gibbon, Dafydd and Doris Bleiching. 1991. An ILEX model for German compound stress in DATR. Proceedings of the FORWISS-ASL Workshop on Prosody in Man-Machine Communication, pages 1--6.
[42]
Ide, Nancy, Jacques Le Maitre, and Jean Véronis. 1994. Outline of a model for lexical databases. In Antonio Zampolli, Nicoletta Calzolari, and Martha Palmer, editors. Current Issues in Computational Linguistics: In Honour of Don Walker. Pisa: Kluwer, pages 283--320.
[43]
Kaplan, Ronald M. and Martin Kay. 1994. Regular models of phonological rule systems. Computational Linguistics 20(3):331--378.
[44]
Keller, William. 1995. DATR theories and DATR models. 33rd Annual Meeting of the Association for Computational Linguistics, pages 55--62.
[45]
Kilbury, James. 1993. Strict inheritance and the taxonomy of lexical types in DATR. Unpublished manuscript, University of Duesseldorf.
[46]
Kilbury, James, Petra {Barg} Naerger, and Ingrid Renz. 1991. DATR as a lexical component for PATR. Fifth Conference of the European Chapter of the Association for Computational Linguistics, pages 137--142.
[47]
Kilbury, James, Petra {Barg} Naerger, and Ingrid Renz. 1994. Simulation lexicalischen Erwerbs. In Sascha W. Felix, Christopher Habel, and Gert Rickheit Kognitive Linguistik: Repraesentation und Prozesse. Opladen: Westdeutscher Verlag, pages 251--271.
[48]
Kilgarriff, Adam. 1993. Inheriting verb alternations. Sixth Conference of the European Chapter of the Association for Computational Linguistics, pages 213--221.
[49]
Kilgarriff, Adam. 1995. Inheriting polysemy. In Patrick Saint-Dizier & Evelyne Viegas, editors. Computational Lexical Semantics. Cambridge: Cambridge University Press.
[50]
Kilgarriff, Adam and Gerald Gazdar. 1995. Polysemous relations. In F. R. Palmer, editor. Grammar and Meaning: Essays in Honour of Sir John Lyons. Cambridge: Cambridge University Press, pages 1--25.
[51]
Krieger, Hans-Ulrich. 1994. Derivation without lexical rules. In C. J. Rupp, M. A. Rosner, and R. L. Johnson, editors. Constraints, Language and Computation. London: Academic Press, pages 277--313.
[52]
Krieger, Hans-Ulrich and John Nerbonne. 1993. Feature-based inheritance networks for computational lexicons. In Ted Briscoe, Valeria de Paiva, and Ann Copestake, editors. Inheritance, Defaults, and the Lexicon. Cambridge: Cambridge University Press, pages 90--136.
[53]
Krieger, Hans-Ulrich, Hannes Pirker, and John Nerbonne. 1993. Feature-based allomorphy. 31st Annual Meeting of the Association for Computational Linguistics, pages 140--147.
[54]
Langer, Hagen. 1994. Reverse queries in DATR. COLING-94, pages 1089--1095.
[55]
Langer, Hagen and Dafydd Gibbon. 1992. DATR as a graph representation language for ILEX speech oriented lexica. Technical Report ASL-TR-43-92/UBI, University of Bielefeld.
[56]
Lascarides, Alex, Nicholas Asher, Ted Briscoe, and Ann Copestake. Forthcoming. Order independent and persistent typed default unification. Linguistics & Philosophy 19(1):1--89.
[57]
Light, Marc. 1994. Classification in feature-based default inheritance hierarchies. In Harald Trost, editor. Proceedings of KONVENS-94, pages 220--229, Vienna: Oesterreichische Gesellschaft fuer Artificial Intelligence.
[58]
Light, Marc, Sabine Reinhard, and Marie Boyle-Hinrichs. 1993. INSYST: An automatic inserter system for hierarchical lexica. Sixth Conference of the European Chapter of the Association for Computational Linguistics, page 471.
[59]
McFetridge, Paul and Aline Villavicencio. 1995. A hierarchical description of the Portuguese verb. Proceedings of the XIIth Brazilian Symposium on Artificial Intelligence, pages 302--311.
[60]
Mellish, Chris and Ehud Reiter. 1993. Using classification as a programming language. IJCAI-93, pages 696--701.
[61]
Mitamura, Teruko and Eric H. Nyberg III. 1992. Hierarchical lexical structure and interpretive mapping in machine translation. COLING-92 Vol. IV, pages 1254--1258.
[62]
Nerbonne, John. 1992. Feature-based lexicons---an example and a comparison to DATR. In Dorothee Reimann, editor. Beitrage des ASL-Lexicon-Workshops. Wandtlitz, pages 36--49.
[63]
Ostler, Nicholas and B. T. S. Atkins. 1992. Predictable meaning shift: Some linguistic properties of lexical implication rules. In James Pustejovsky & Sabine Bergler, editors. Lexical Semantics and Knowledge Representation. Berlin: Springer-Verlag, pages 87--100.
[64]
Penn, Gerald and Richmond Thomason. 1994. Default finite state machines and finite state phonology. Computational Phonology: Proceedings of the 1st Meeting of the ACL Special Interest Group in Computational Phonology, pages 33--42.
[65]
Pulman, Stephen G. Forthcoming. Unification encodings of grammatical notations. To appear in Computational Linguistics.
[66]
Pustejovsky, James. 1991. The generative lexicon. Computational Linguistics 17(4):409--441.
[67]
Pustjovsky, James and Branimir Boguraev. 1993. Lexical knowledge representation and natural language processing. Artificial Intelligence 63(1--2):193--223.
[68]
Reinhard, Sabine. 1990. Verarbeitungsprobleme nichtlinearer Morphologien: Umlautbeschreibung in einem hierarchischen Lexikon. In Burghard Rieger & Burkhard Schaeder Lexikon und Lexikographie. Hildesheim: Olms Verlag, 45--61.
[69]
Reinhard, Sabine and Dafydd Gibbon. 1991. Prosodic inheritance and morphological generalisations. Fifth Conference of the European Chapter of the Association for Computational Linguistics, pages 131--136.
[70]
Reiter, Ehud and Chris Mellish. 1992. Using classification to generate text. 30th Annual Meeting of the Association for Computational Linguistics, pages 265--272.
[71]
Ritchie, Graeme D., Graham J. Russell, Alan W. Black, and Stephen G. Pulman. 1992. Computational Morphology. Cambridge, MA: MIT Press.
[72]
Russell, Graham. 1993. Review of Word Manager: A System for Morphological Dictionaries, by Marc Domenig & Pius ten Hacken. Computational Linguistics 19(4):699--700.
[73]
Russell, Graham, Afzal Ballim, John Carroll, and Susan Warwick-Armstrong. 1992. A practical approach to multiple default inheritance for unification-based lexicons. Computational Linguistics 183:311--337.
[74]
Sacks, Harvey. 1973. On some puns with some intimations. In Roger W. Shuy, editor. Report of the 23rd Annual Roundtable Meeting on Linguistics and Language Studies. Washington D.C.: Georgetown University Press, pages 135--144.
[75]
Shieber, Stuart M. 1986. An Introduction to Unification Approaches to Grammar. Stanford: CSLI/Chicago University Press.
[76]
Stump, Greg. 1992. On the theoretical status of position class restrictions on inflectional affixes. In Geert Booij & Jaap van Marle, editors. Year Book of Morphology 1991. Dordrecht: Kluwer, pages 211--241.
[77]
Touretzky, David S. 1986. The Mathematics of Inheritance Systems. London/Los Altos: Pitman/Morgan Kaufmann.
[78]
Young, Mark A. 1992. Nonmonotonic sorts for feature structures. AAAI-92, pages 596--601.
[79]
Young, Mark A. and Bill Rounds. 1993. A logical semantics for nonmonotonic sorts. Proceedings of the 31st Annual Meeting of the ACL, pages 209--215.

Cited By

View all
  • (2016)XMG 2: Describing Description LanguagesLogical Aspects of Computational Linguistics. Celebrating 20 Years of LACL (1996–2016)10.1007/978-3-662-53826-5_16(255-272)Online publication date: 5-Dec-2016
  • (2012)A rule-based approach to unknown word recognition in ArabicProceedings of the Twelfth Meeting of the Special Interest Group on Computational Morphology and Phonology10.5555/2390930.2390935(35-41)Online publication date: 7-Jun-2012
  • (2010)The geometry of languageProceedings of the 14th WSEAS international conference on Computers: part of the 14th WSEAS CSCC multiconference - Volume II10.5555/1984366.1984418(721-725)Online publication date: 23-Jul-2010
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computational Linguistics
Computational Linguistics  Volume 22, Issue 2
June 1996
131 pages
ISSN:0891-2017
EISSN:1530-9312
Issue’s Table of Contents

Publisher

MIT Press

Cambridge, MA, United States

Publication History

Published: 01 June 1996
Published in COLI Volume 22, Issue 2

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)11
Reflects downloads up to 26 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2016)XMG 2: Describing Description LanguagesLogical Aspects of Computational Linguistics. Celebrating 20 Years of LACL (1996–2016)10.1007/978-3-662-53826-5_16(255-272)Online publication date: 5-Dec-2016
  • (2012)A rule-based approach to unknown word recognition in ArabicProceedings of the Twelfth Meeting of the Special Interest Group on Computational Morphology and Phonology10.5555/2390930.2390935(35-41)Online publication date: 7-Jun-2012
  • (2010)The geometry of languageProceedings of the 14th WSEAS international conference on Computers: part of the 14th WSEAS CSCC multiconference - Volume II10.5555/1984366.1984418(721-725)Online publication date: 23-Jul-2010
  • (2010)Representing lexical knowledge for Bulgarian inflectional morphology in DATRProceedings of the 14th WSEAS international conference on Computers: part of the 14th WSEAS CSCC multiconference - Volume II10.5555/1984366.1984400(612-616)Online publication date: 23-Jul-2010
  • (2010)A framework for representing lexical resourcesProceedings of the 23rd International Conference on Computational Linguistics: Posters10.5555/1944566.1944622(490-497)Online publication date: 23-Aug-2010
  • (2010)Three learnable models for the description of languageProceedings of the 4th international conference on Language and Automata Theory and Applications10.1007/978-3-642-13089-2_2(16-31)Online publication date: 24-May-2010
  • (2006)Learning probabilistic paradigms for morphology in a latent class modelProceedings of the Eighth Meeting of the ACL Special Interest Group on Computational Phonology and Morphology10.5555/1622165.1622174(69-78)Online publication date: 8-Jun-2006
  • (2005)The head-modifier principle and multilingual term extractionNatural Language Engineering10.1017/S135132490400353511:2(129-157)Online publication date: 1-Jun-2005
  • (2004)Automatic acquisition of feature-based phonotactic resourcesProceedings of the 7th Meeting of the ACL Special Interest Group in Computational Phonology: Current Themes in Computational Phonology and Morphology10.5555/1622153.1622157(27-34)Online publication date: 26-Jul-2004
  • (2003)A large-scale inheritance-based morphological lexicon for RussianProceedings of the 2003 EACL Workshop on Morphological Processing of Slavic Languages10.5555/1613200.1613202(9-16)Online publication date: 13-Apr-2003
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Full Access

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media