280 Birds with One Stone: Inducing Multilingual Taxonomies from Wikipedia using Character-level Classification
classification
💻 cs.CL
cs.AIcs.IR
keywords
approachlanguagesmultilingualtaxonomieswikipediacharacter-levelinducingaccurate
read the original abstract
We propose a simple, yet effective, approach towards inducing multilingual taxonomies from Wikipedia. Given an English taxonomy, our approach leverages the interlanguage links of Wikipedia followed by character-level classifiers to induce high-precision, high-coverage taxonomies in other languages. Through experiments, we demonstrate that our approach significantly outperforms the state-of-the-art, heuristics-heavy approaches for six languages. As a consequence of our work, we release presumably the largest and the most accurate multilingual taxonomic resource spanning over 280 languages.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.