Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation

Eneko Agirre; German Rigau; Jordi Atserias

arxiv: cmp-lg/9704007 · v1 · submitted 1997-04-21 · cmp-lg · cs.CL

Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation

German Rigau , Jordi Atserias , Eneko Agirre This is my paper

classification cmp-lg cs.CL

keywords techniqueswordbeencombinedictionariesdisambiguategenuslexical

0 comments

read the original abstract

This paper presents a method to combine a set of unsupervised algorithms that can accurately disambiguate word senses in a large, completely untagged corpus. Although most of the techniques for word sense resolution have been presented as stand-alone, it is our belief that full-fledged lexical ambiguity resolution should combine several information sources and techniques. The set of techniques have been applied in a combined way to disambiguate the genus terms of two machine-readable dictionaries (MRD), enabling us to construct complete taxonomies for Spanish and French. Tested accuracy is above 80% overall and 95% for two-way ambiguous genus terms, showing that taxonomy building is not limited to structured dictionaries such as LDOCE.

This paper has not been read by Pith yet.

Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation

discussion (0)