Determining similarity and inferring relations in a lexical knowledge base

January 1997

Author:
Stephen D. Richardson

Publisher:

City University of New York
New York, NY
United States

Order Number:UMI Order No. GAX97-20134

Bibliometrics

Abstract

This dissertation describes the creation of a large-scale, richly structured lexical knowledge base (LKB) from complex structures of labeled semantic relations. These structures were automatically extracted using a natural language parser from the definitions and example sentences contained in two machine readable dictionaries. The structures were then completely inverted and propagated across all of the relevant headwords in the dictionaries to create the LKB.

A method is described for efficiently accessing salient paths of semantic relations between words in the LKB using weights assigned to those paths. The weights are based on a unique computation called averaged vertex probability. Extended paths, created by joining sub-paths from two different semantic relation structures, are allowed in order to increase the coverage of the information in the LKB.

A novel procedure is used to determine the similarity between words in the LKB based on the patterns of the semantic relation paths connecting those words. The patterns were obtained by extensive training using word pairs from an online thesaurus and a specially created anti-thesaurus.

The similarity procedure and the path accessing mechanism are used in a procedure to infer semantic relations that are not explicitly stored in the LKB. In particular, the utility of such inferences is discussed in the context of disambiguating phrasal attachments in a natural language understanding system.

Quantitative results indicate that the size and coverage of the LKB created in this research and the effectiveness of the methods for accessing explicit and implicit information contained therein represent significant progress toward the development of a truly broad-coverage semantic component for natural language processing.

Cited By

Contributors

Stephen D Richardson
The University of Western Australia
- Publication Years1988 - 2013
- Publication counts10
- Citation count103
- Available for Download5
- Downloads (cumulative)2,269
- Downloads (12 months)237
- Downloads (6 weeks)32
- Average Downloads per Article454
- Average Citation per Article10
View Full Profile

Index Terms

Determining similarity and inferring relations in a lexical knowledge base
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
2. Information systems
  1. Information retrieval
    1. Document representation
      1. Dictionaries

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

A morphological analyzer using hash tables in main memory (MAHT) and a lexical knowledge base
CICLing'12: Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I

This paper presents a morphological analyzer for the Spanish language (MAHT). This system is mainly based on the storage of words and its morphological information, leading to a lexical knowledge base that has almost five million words. The lexical ...
Role of Semantic Relations in Hindi Word Sense Disambiguation
Abstract
Semantic relations play an important role in resolving the ambiguity of a polysemous word. This paper investigates the role of hypernym, hyponym, holonym and meronym relations in Hindi Word Sense Disambiguation. In this work, we have considered ...
Unsupervised learning of semantic relations of a morphologically rich language

Use of semantic concepts and relations for NLP applications including information retrieval and web search is a major area of research. In this context, semantic relation extraction from open domain web documents is important not only for English but ...

Browse Theses

Sections

Cited By

Index Terms

A morphological analyzer using hash tables in main memory (MAHT) and a lexical knowledge base

Role of Semantic Relations in Hindi Word Sense Disambiguation

Unsupervised learning of semantic relations of a morphologically rich language

Sections

Cited By

Save to Binder

Index Terms

Recommendations

A morphological analyzer using hash tables in main memory (MAHT) and a lexical knowledge base

Role of Semantic Relations in Hindi Word Sense Disambiguation

Unsupervised learning of semantic relations of a morphologically rich language