Title resolution pending

Gemma Team , year=

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

CAP: Controllable Alignment Prompting for Unlearning in LLMs

cs.LG · 2026-04-23 · unverdicted · novelty 6.0

CAP is a reinforcement-learning-driven prompt optimization framework that suppresses target knowledge in LLMs while preserving general capabilities, enabling reversible unlearning without any parameter updates.

VLM-AR3L: Vision-Language Models for Absolute and Relative Rewards in Reinforcement Learning

cs.RO · 2026-07-01 · unverdicted · novelty 5.0

VLM-AR3L learns absolute and relative reward models from VLM preference labels to improve RL on control, manipulation, and Minecraft tasks.

To Intervene or Not: Guiding Inference-time Alignment with Probabilistic Model Blending

cs.LG · 2026-04-22 · conditional · novelty 5.0

BlendIn replaces binary guidance acceptance with confidence-weighted distribution blending between base and guidance models, mitigating cascading failures in inference-time LLM alignment.

citing papers explorer

Showing 3 of 3 citing papers.

CAP: Controllable Alignment Prompting for Unlearning in LLMs cs.LG · 2026-04-23 · unverdicted · none · ref 31
CAP is a reinforcement-learning-driven prompt optimization framework that suppresses target knowledge in LLMs while preserving general capabilities, enabling reversible unlearning without any parameter updates.
VLM-AR3L: Vision-Language Models for Absolute and Relative Rewards in Reinforcement Learning cs.RO · 2026-07-01 · unverdicted · none · ref 51
VLM-AR3L learns absolute and relative reward models from VLM preference labels to improve RL on control, manipulation, and Minecraft tasks.
To Intervene or Not: Guiding Inference-time Alignment with Probabilistic Model Blending cs.LG · 2026-04-22 · conditional · none · ref 20
BlendIn replaces binary guidance acceptance with confidence-weighted distribution blending between base and guidance models, mitigating cascading failures in inference-time LLM alignment.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer