For transformer based mono and multi-lingual models performance are pre- sented in Table 13

Traditional Machine Learning Model The performance benchmark of the instruction-tuned llms are shown in the Table 11

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

MultiSoc-4D: A Benchmark for Diagnosing Instruction-Induced Label Collapse in Closed-Set LLM Annotation of Bengali Social Media

cs.CL · 2026-05-07 · unverdicted · novelty 7.0

MultiSoc-4D benchmark shows LLMs annotating Bengali social media exhibit instruction-induced label collapse, preferring fallback labels and missing 79% of hate speech and 75% of sarcasm instances despite high agreement but near-zero kappa.

citing papers explorer

Showing 1 of 1 citing paper.

MultiSoc-4D: A Benchmark for Diagnosing Instruction-Induced Label Collapse in Closed-Set LLM Annotation of Bengali Social Media cs.CL · 2026-05-07 · unverdicted · none · ref 12
MultiSoc-4D benchmark shows LLMs annotating Bengali social media exhibit instruction-induced label collapse, preferring fallback labels and missing 79% of hate speech and 75% of sarcasm instances despite high agreement but near-zero kappa.

For transformer based mono and multi-lingual models performance are pre- sented in Table 13

fields

years

verdicts

representative citing papers

citing papers explorer