Answers to Research Questions. RQ1: Value of unlabelled web data. OWS continued pre-training reliably improves BERT-family models, especially in multilingual low-data settings.


1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Toward Generalized Cross-Lingual Hateful Language Detection with Web-Scale Data and Ensemble LLM Annotations

cs.CL · 2026-03-18 · unverdicted · novelty 4.0

Continued pre-training on web data and LLM-ensemble synthetic labels improve multilingual hate speech detection, with gains up to 11% for small models in low-resource settings.

citing papers explorer

Showing 1 of 1 citing paper.

Toward Generalized Cross-Lingual Hateful Language Detection with Web-Scale Data and Ensemble LLM Annotations

cs.CL · 2026-03-18 · unverdicted · none · ref 15
