In: ICML (2024)

Chen, H · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.CL · 2026-03-25 · unverdicted · novelty 7.0

Alignment reduces expressed gender bias in LLM outputs but does not remove the underlying encoded gender associations in internal representations.

Showing 1 of 1 citing paper.

Alignment Reduces Expressed but Not Encoded Gender Bias: A Unified Framework and Study cs.CL · 2026-03-25 · unverdicted · none · ref 6
Alignment reduces expressed gender bias in LLM outputs but does not remove the underlying encoded gender associations in internal representations.