The Uned systems at Senseval-2

David Fernandez-Amoros; Felisa Verdejo; Julio Gonzalo

arxiv: 0910.5410 · v1 · submitted 2009-10-28 · 💻 cs.CL · cs.AI



keywords: lexical · sample · words · system · unsupervised · first · sense · senseval-2


We participated in the SENSEVAL-2 English tasks (all words and lexical sample) with an unsupervised system based on mutual information measured over a large corpus (277 million words), plus some additional heuristics. A supervised extension of the system was also submitted to the lexical sample task. Our system scored first among unsupervised systems in both tasks: 56.9% recall in all words and 40.2% in lexical sample. This is slightly worse than the first-sense heuristic for all words and 3.6% better for the lexical sample, a strong indication that unsupervised Word Sense Disambiguation remains a difficult challenge.
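The abstract does not say how mutual information is computed or how it feeds into sense selection. As a rough illustration only, the sketch below assumes pointwise mutual information (PMI) over sentence-level co-occurrence and picks the sense whose related words have the highest total PMI with the context. The function names (`pmi_table`, `best_sense`), the toy corpus, and the sense labels are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_table(sentences):
    """Pointwise mutual information for word pairs that co-occur in a sentence.

    Assumption: co-occurrence is counted once per sentence; the paper's
    actual window and estimator are not stated in the abstract.
    """
    word_counts = Counter()
    pair_counts = Counter()
    for sentence in sentences:
        words = set(sentence)
        word_counts.update(words)
        pair_counts.update(frozenset(p) for p in combinations(sorted(words), 2))
    n = len(sentences)
    table = {}
    for pair, count in pair_counts.items():
        a, b = tuple(pair)
        p_ab = count / n
        p_a = word_counts[a] / n
        p_b = word_counts[b] / n
        table[pair] = math.log2(p_ab / (p_a * p_b))
    return table


def best_sense(context, sense_words, pmi):
    """Return the sense whose related words share the most PMI with the context.

    Pairs never seen together contribute 0 (a simplifying assumption).
    """
    def score(related):
        return sum(pmi.get(frozenset((c, r)), 0.0) for c in context for r in related)

    return max(sense_words, key=lambda sense: score(sense_words[sense]))


# Hypothetical toy data, standing in for the 277-million-word corpus.
sentences = [
    ["bank", "money", "loan"],
    ["bank", "river", "water"],
    ["money", "loan", "interest"],
    ["river", "water", "fish"],
]
sense_words = {
    "bank/finance": ["money", "loan"],
    "bank/river": ["river", "water"],
}
chosen = best_sense(["interest", "loan"], sense_words, pmi_table(sentences))
```

With this toy data, the context words "interest" and "loan" co-occur only with the finance-related words, so `chosen` ends up as the finance sense. A real system would need smoothing for rare pairs and a fallback such as the first-sense heuristic mentioned above.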
