pith. sign in

arxiv: 1709.04820 · v1 · pith:EGPW5C22new · submitted 2017-09-14 · 💻 cs.CL

Synapse at CAp 2017 NER challenge: Fasttext CRF

classification 💻 cs.CL
keywords systemchallengefasttextfirstfrenchtweetsbestdata
0
0 comments X
read the original abstract

We present our system for the CAp 2017 NER challenge which is about named entity recognition on French tweets. Our system leverages unsupervised learning on a larger dataset of French tweets to learn features feeding a CRF model. It was ranked first without using any gazetteer or structured external data, with an F-measure of 58.89\%. To the best of our knowledge, it is the first system to use fasttext embeddings (which include subword representations) and an embedding-based sentence representation for NER.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.