Exploring the ability of emerging large language models to detect cyberbullying in social posts through new prompt-based classification approaches

Stefano Cirillo, Domenico Desiato, Giuseppe Polese, Giandomenico Solimando, Vijayan Sugumaran · 2025 · arXiv 2024.104043

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Understanding helpfulness and harmless tension in reward models

cs.LG · 2026-06-11 · unverdicted · novelty 6.0

Mixed-objective reward models underperform single-objective ones because shared neurons support one objective while negatively affecting the other, creating alignment tension.

citing papers explorer

Showing 1 of 1 citing paper.

Understanding helpfulness and harmless tension in reward models · cs.LG · 2026-06-11 · unverdicted · none · ref 32
