emnlp-main.307/

URL https://aclanthology · 2024 · DOI 10.18653/v1/2022.acl-long.230

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Going PLACES: Participatory Localized Red Teaming for Text-to-Image Safety in the Global South

cs.CY · 2026-05-18 · unverdicted · novelty 6.0

A participatory red-teaming project in the Global South created the PLACES dataset of 26k T2I failure examples that reveal unique cultural and linguistic harms missed by existing safety frameworks.

Discovering Implicit Large Language Model Alignment Objectives

cs.LG · 2026-02-17 · unverdicted · novelty 6.0

Obj-Disco decomposes LLM alignment reward signals into sparse weighted combinations of interpretable natural language objectives via iterative analysis of behavioral changes across checkpoints, capturing over 90% of observed reward behavior.

citing papers explorer

Showing 2 of 2 citing papers.

Going PLACES: Participatory Localized Red Teaming for Text-to-Image Safety in the Global South cs.CY · 2026-05-18 · unverdicted · none · ref 81
A participatory red-teaming project in the Global South created the PLACES dataset of 26k T2I failure examples that reveal unique cultural and linguistic harms missed by existing safety frameworks.
Discovering Implicit Large Language Model Alignment Objectives cs.LG · 2026-02-17 · unverdicted · none · ref 4
Obj-Disco decomposes LLM alignment reward signals into sparse weighted combinations of interpretable natural language objectives via iterative analysis of behavioral changes across checkpoints, capturing over 90% of observed reward behavior.

emnlp-main.307/

fields

years

verdicts

representative citing papers

citing papers explorer