Haven Kim, Zachary Novack, Weihan Xu, Julian McAuley, and Hao-Wen Dong

2025 · arXiv 2504.04479

3 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Steering Autoregressive Music Generation with Recursive Feature Machines

cs.LG · 2025-10-21 · unverdicted · novelty 7.0

MusicRFM discovers interpretable concept directions in music model hidden states using RFM probes and injects them at inference to steer generation toward desired musical properties without retraining.

Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

cs.SD · 2026-06-05 · unverdicted · novelty 6.0

Hallucination information is linearly separable in Whisper activations and SAE latents; SAE steering reduces hallucination rates from 72.63% to 14.11% (small) and 86.88% to 27.33% (large-v3) on non-speech audio with small WER impact.

Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation

cs.SD · 2026-05-29 · unverdicted · novelty 5.0

Activation steering with Gram-Schmidt orthogonalization enables disentangled, deterministic control of pitch and duration attributes in the Multitrack Music Transformer without retraining.

citing papers explorer

Steering Autoregressive Music Generation with Recursive Feature Machines · cs.LG · 2025-10-21 · unverdicted · none · ref 4
Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders · cs.SD · 2026-06-05 · unverdicted · none · ref 38
Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation · cs.SD · 2026-05-29 · unverdicted · none · ref 25
