We Can’t Understand AI Using Our Existing Vocabulary
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED: 3
citing papers explorer
- ToxiREX: A Dataset on Toxic REasoning in ConteXt
ToxiREX is a new dataset of 128k Reddit comments in six languages with hierarchical annotations for implicit toxicity in conversational context based on an existing reasoning schema.
- LLMs Should Express Uncertainty Explicitly
Training LLMs to verbalize uncertainty explicitly at the end or during reasoning reduces overconfident errors and improves answer quality on factual tasks while enabling RAG triggers.
- CHiQPM: Calibrated Hierarchical Interpretable Image Classification
CHiQPM is a hierarchical interpretable image classifier that maintains 99% of non-interpretable model accuracy while supplying contrastive global explanations, human-like hierarchical paths, and calibrated interpretable set predictions via conformal prediction.