pith. sign in

arxiv: 1508.06669 · v1 · pith:ELDZ2M4Snew · submitted 2015-08-26 · 💻 cs.CL

Component-Enhanced Chinese Character Embeddings

classification 💻 cs.CL
keywords chinesemodelswordcharactercomponent-enhancedembeddingsenglishsemantic
0
0 comments X
read the original abstract

Distributed word representations are very useful for capturing semantic information and have been successfully applied in a variety of NLP tasks, especially on English. In this work, we innovatively develop two component-enhanced Chinese character embedding models and their bigram extensions. Distinguished from English word embeddings, our models explore the compositions of Chinese characters, which often serve as semantic indictors inherently. The evaluations on both word similarity and text classification demonstrate the effectiveness of our models.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.