pith. sign in

arxiv: 0910.5410 · v1 · submitted 2009-10-28 · 💻 cs.CL · cs.AI

The Uned systems at Senseval-2

classification 💻 cs.CL cs.AI
keywords lexicalsamplewordssystemunsupervisedfirstsensesenseval-2
0
0 comments X
read the original abstract

We have participated in the SENSEVAL-2 English tasks (all words and lexical sample) with an unsupervised system based on mutual information measured over a large corpus (277 million words) and some additional heuristics. A supervised extension of the system was also presented to the lexical sample task. Our system scored first among unsupervised systems in both tasks: 56.9% recall in all words, 40.2% in lexical sample. This is slightly worse than the first sense heuristic for all words and 3.6% better for the lexical sample, a strong indication that unsupervised Word Sense Disambiguation remains being a strong challenge.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.