[PDF][PDF] Automatic retrieval and clustering of similar words
D Lin - 36th Annual Meeting of the Association for …, 1998 - aclanthology.org
36th Annual Meeting of the Association for Computational Linguistics …, 1998•aclanthology.org
Bootstrapping semantics from text is one of the greatest challenges in natural language
learning. We first define a word similarity measure based on the distributional pattern of
words. The similarity measure allows us to construct a thesaurus using a parsed corpus. We
then present a new evaluation methodology for the automatically constructed thesaurus. The
evaluation results show that the thesaurns is significantly closer to WordNet than Roget
Thesaurus is.
learning. We first define a word similarity measure based on the distributional pattern of
words. The similarity measure allows us to construct a thesaurus using a parsed corpus. We
then present a new evaluation methodology for the automatically constructed thesaurus. The
evaluation results show that the thesaurns is significantly closer to WordNet than Roget
Thesaurus is.
Abstract
Bootstrapping semantics from text is one of the greatest challenges in natural language learning. We first define a word similarity measure based on the distributional pattern of words. The similarity measure allows us to construct a thesaurus using a parsed corpus. We then present a new evaluation methodology for the automatically constructed thesaurus. The evaluation results show that the thesaurns is significantly closer to WordNet than Roget Thesaurus is.
aclanthology.org