Title resolution pending

Lee, B · 2025 · arXiv 2506.22806

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

SafeDiffusion-R1: Online Reward Steering for Safe Diffusion Post-Training

cs.CV · 2026-05-18 · unverdicted · novelty 6.0

SafeDiffusion-R1 uses online GRPO with CLIP embedding steering to cut inappropriate content from 48.9% to 18.07% and nudity detections from 646 to 15 in diffusion models while raising GenEval scores from 42.08% to 47.83% and generalizing across seven harm categories without supervised pairs or extra

BARRIER: Bounded Activation Regions for Robust Information Erasure

cs.CV · 2026-05-15 · unverdicted · novelty 5.0

BARRIER applies interval arithmetic to SVD-based activation projections to create bounded forget regions that enable aggressive unlearning while providing formal protection for retain distributions via tail bounds on functional drift.

GrOCE:Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models

cs.CV · 2025-11-17 · unverdicted · novelty 5.0

GrOCE uses dynamic semantic graphs for online, training-free erasure of target concepts from diffusion model prompts via cluster identification and selective severing.

citing papers explorer

Showing 3 of 3 citing papers.

SafeDiffusion-R1: Online Reward Steering for Safe Diffusion Post-Training cs.CV · 2026-05-18 · unverdicted · none · ref 78
SafeDiffusion-R1 uses online GRPO with CLIP embedding steering to cut inappropriate content from 48.9% to 18.07% and nudity detections from 646 to 15 in diffusion models while raising GenEval scores from 42.08% to 47.83% and generalizing across seven harm categories without supervised pairs or extra
BARRIER: Bounded Activation Regions for Robust Information Erasure cs.CV · 2026-05-15 · unverdicted · none · ref 35
BARRIER applies interval arithmetic to SVD-based activation projections to create bounded forget regions that enable aggressive unlearning while providing formal protection for retain distributions via tail bounds on functional drift.
GrOCE:Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models cs.CV · 2025-11-17 · unverdicted · none · ref 17
GrOCE uses dynamic semantic graphs for online, training-free erasure of target concepts from diffusion model prompts via cluster identification and selective severing.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer