VCWE: Visual Character-Enhanced Word Embeddings

Chi Sun; Xipeng Qiu; Xuanjing Huang

arxiv: 1902.08795 · v2 · pith:GU3O2IZXnew · submitted 2019-02-23 · 💻 cs.CL



keywords word · chinese · embeddings · character · information · model · network · neural


Chinese is a logographic writing system, and the shapes of Chinese characters contain rich syntactic and semantic information. In this paper, we propose a model to learn Chinese word embeddings via three-level composition: (1) a convolutional neural network to extract the intra-character compositionality from the visual shape of a character; (2) a recurrent neural network with self-attention to compose character representations into word embeddings; (3) the Skip-Gram framework to capture non-compositionality directly from the contextual information. Evaluations demonstrate the superior performance of our model on four tasks: word similarity, sentiment analysis, named entity recognition, and part-of-speech tagging.
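The three-level composition described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: every function name, the 3x3 bitmaps, the kernels, and the query vector are hypothetical, the "CNN" is a single convolution with ReLU and global max pooling, the recurrent layer of step (2) is omitted so that only attention pooling remains, and step (3) is shown as a standard Skip-Gram negative-sampling loss, which is an assumption about the training objective.

```python
import math

def conv2d_maxpool(bitmap, kernel):
    """Valid 2D cross-correlation, ReLU, then global max pooling (one scalar)."""
    kh, kw = len(kernel), len(kernel[0])
    best = 0.0  # ReLU floor: negative responses are clipped to 0
    for i in range(len(bitmap) - kh + 1):
        for j in range(len(bitmap[0]) - kw + 1):
            s = sum(bitmap[i + a][j + b] * kernel[a][b]
                    for a in range(kh) for b in range(kw))
            best = max(best, s)
    return best

def char_vector(bitmap, kernels):
    """Level 1 (simplified): one feature per kernel from the character image."""
    return [conv2d_maxpool(bitmap, k) for k in kernels]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def attention_pool(char_vecs, query):
    """Level 2 (simplified): softmax-weighted sum of character vectors.
    The recurrent encoder is left out; `query` is a hypothetical parameter."""
    scores = [dot(c, query) for c in char_vecs]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    dim = len(char_vecs[0])
    pooled = [sum(w * c[d] for w, c in zip(weights, char_vecs))
              for d in range(dim)]
    return pooled, weights

def log_sigmoid(x):
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))

def skipgram_neg_loss(word_vec, context_vec, negative_vecs):
    """Level 3 (assumed form): Skip-Gram loss with negative sampling."""
    loss = -log_sigmoid(dot(word_vec, context_vec))
    for n in negative_vecs:
        loss -= log_sigmoid(-dot(word_vec, n))
    return loss

# Toy usage: a two-character "word" with made-up 3x3 glyph bitmaps.
bitmaps = [
    [[1, 1, 0], [1, 1, 0], [0, 0, 0]],
    [[0, 0, 0], [0, 1, 1], [0, 1, 1]],
]
kernels = [[[1, 0], [0, 1]], [[0, 1], [1, 0]]]
chars = [char_vector(b, kernels) for b in bitmaps]
word, weights = attention_pool(chars, query=[1.0, 0.0])
loss = skipgram_neg_loss(word, [0.5, 0.5], [[-0.5, 0.1]])
```

In a trainable version, the kernels, the query, and the context vectors would all be learned jointly by minimizing the loss over observed word and context pairs; the sketch only shows how the three levels feed into one another.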
