arxiv: 1904.01689 · v1 · pith:JMXA5752new · submitted 2019-04-02 · 💻 cs.CL · cs.HC

The Tower of Babel Meets Web 2.0: User-Generated Content and its Applications in a Multilingual Context

classification 💻 cs.CL cs.HC
keywords applications, diversity, knowledge, concepts, content, language, user-generated, wikipedia

This study explores language's fragmenting effect on user-generated content by examining the diversity of knowledge representations across 25 different Wikipedia language editions. This diversity is measured at two levels: the concepts that are included in each edition and the ways in which these concepts are described. We demonstrate that the diversity present is greater than has been presumed in the literature and has a significant influence on applications that use Wikipedia as a source of world knowledge. We close by explicating how knowledge diversity can be beneficially leveraged to create "culturally-aware applications" and "hyperlingual applications".
