A backtranslation-based synthetic data pipeline produces 1.7 million French tweets to train reasoners that reach 77-79% accuracy on human-annotated distress detection, matching or beating proprietary LLMs.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Model in Distress: Sentiment Analysis on French Synthetic Social Media
A backtranslation-based synthetic data pipeline produces 1.7 million French tweets to train reasoners that reach 77-79% accuracy on human-annotated distress detection, matching or beating proprietary LLMs.