pith. sign in

arxiv: 1704.07624 · v2 · pith:CNHPI6QKnew · submitted 2017-04-25 · 💻 cs.CL · cs.AI· cs.IR

280 Birds with One Stone: Inducing Multilingual Taxonomies from Wikipedia using Character-level Classification

classification 💻 cs.CL cs.AIcs.IR
keywords approachlanguagesmultilingualtaxonomieswikipediacharacter-levelinducingaccurate
0
0 comments X
read the original abstract

We propose a simple, yet effective, approach towards inducing multilingual taxonomies from Wikipedia. Given an English taxonomy, our approach leverages the interlanguage links of Wikipedia followed by character-level classifiers to induce high-precision, high-coverage taxonomies in other languages. Through experiments, we demonstrate that our approach significantly outperforms the state-of-the-art, heuristics-heavy approaches for six languages. As a consequence of our work, we release presumably the largest and the most accurate multilingual taxonomic resource spanning over 280 languages.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.