RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models

Barikeri, Soumya, Lauscher, Anne, Glavaš, Goran · 2021 · DOI 10.18653/v1/2021.acl-long.151

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Debiasing Without Protected Attributes: Latent Concept Erasure from Textual Profiles

cs.CL · 2026-06-10 · unverdicted · novelty 6.0

H-SAL erases latent concepts from text profiles using self-descriptions as implicit debiasing signals and shows competitive performance on a new multi-domain Stack Exchange helpfulness benchmark.

AI Safety Landscape for Large Language Models: Taxonomy, State-of-the-art, and Future Directions

cs.AI · 2024-08-23 · unverdicted · novelty 4.0

The paper introduces a taxonomy of AI safety for LLMs organized into Trustworthy AI, Responsible AI, and Safe AI perspectives, accompanied by a review of state-of-the-art methods, challenges, and future directions.

citing papers explorer

Showing 2 of 2 citing papers.

Debiasing Without Protected Attributes: Latent Concept Erasure from Textual Profiles · cs.CL · 2026-06-10 · unverdicted · none · ref 23
AI Safety Landscape for Large Language Models: Taxonomy, State-of-the-art, and Future Directions · cs.AI · 2024-08-23 · unverdicted · none · ref 45
