SteeringDiffusion: A Bottlenecked Activation Control Interface for Diffusion Models

Brian Summa; Fangzheng Wu

arxiv: 2605.01653 · v1 · submitted 2026-05-03 · 💻 cs.CV


Pith reviewed 2026-05-08 19:36 UTC · model grok-4.3

classification 💻 cs.CV

keywords control · diffusion · bottlenecked · interface · steeringdiffusion · surface · content–style · frozen


The pith

SteeringDiffusion supplies a bottlenecked, prompt-conditioned activation interface for frozen diffusion models that delivers smooth monotonic content-style control via one runtime scalar and timestep gating.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Diffusion models create images by starting from noise and removing it step by step. The challenge is steering the output toward more artistic style or more faithful content without retraining the entire model. SteeringDiffusion keeps the main U-Net frozen and instead learns a compact latent code that is conditioned on the text prompt. This code is turned into modulation signals that adjust the model's internal activations using FiLM or AdaGN layers. The design starts at zero so the model behaves exactly like the original when the control is off. A gating mechanism limits the modulation to later denoising steps where style and content decisions are more settled. At generation time the user simply turns one number up or down to slide along a continuous trade-off curve. The authors test the approach on Stable Diffusion 1.5 and SDXL across several artistic styles and report that the control surface stays smooth and monotonic. They also compare it to LoRA, ControlNet, and rank-1 adapters under similar parameter counts and claim better controllability and stability. An additional diagnostic based on DDIM inversion is introduced to probe how stable the generated trajectories remain after intervention.
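The mechanism described above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the class `SteeringLayer`, the sigmoid gate `late_step_gate`, the linear projections `w_gamma` and `w_beta`, and the additive update `h + delta` are all hypothetical choices made here. Only the overall shape (a scalar `s`, a timestep gate, and FiLM/AdaGN-style scale and shift applied to group-normalized activations) follows the description.

```python
import numpy as np

def group_norm(h, num_groups, eps=1e-5):
    """Normalize a (C, H, W) activation within channel groups (no affine)."""
    c, height, width = h.shape
    g = h.reshape(num_groups, c // num_groups, height, width)
    mean = g.mean(axis=(1, 2, 3), keepdims=True)
    var = g.var(axis=(1, 2, 3), keepdims=True)
    return ((g - mean) / np.sqrt(var + eps)).reshape(c, height, width)

def late_step_gate(t, t_gate=0.5, sharpness=20.0):
    """Hypothetical gate f(t) for t in [0, 1], where t = 1 is pure noise.
    Near 1 for late (low-noise) steps t < t_gate, near 0 for early steps."""
    return 1.0 / (1.0 + np.exp(sharpness * (t - t_gate)))

class SteeringLayer:
    """Assumed form: delta = s * f(t) * (gamma(v) * GN(h) + beta(v))."""

    def __init__(self, code_dim, channels, num_groups=4):
        # Zero-initialized projections, so an untrained layer is a no-op.
        self.w_gamma = np.zeros((channels, code_dim))
        self.w_beta = np.zeros((channels, code_dim))
        self.num_groups = num_groups

    def delta(self, h, v, t, s):
        gamma = (self.w_gamma @ v)[:, None, None]  # per-channel scale
        beta = (self.w_beta @ v)[:, None, None]    # per-channel shift
        modulated = gamma * group_norm(h, self.num_groups) + beta
        return s * late_step_gate(t) * modulated

    def __call__(self, h, v, t, s):
        # Assumption: the modulation is added to the frozen activation.
        return h + self.delta(h, v, t, s)

rng = np.random.default_rng(0)
h = rng.normal(size=(8, 4, 4))   # one activation map: (channels, height, width)
v = rng.normal(size=16)          # stand-in for the prompt-conditioned latent code
layer = SteeringLayer(code_dim=16, channels=8)
untrained = layer(h, v, t=0.1, s=1.0)            # zero init: equals h

layer.w_gamma = 0.1 * rng.normal(size=(8, 16))   # stand-in for trained weights
layer.w_beta = 0.1 * rng.normal(size=(8, 16))
off = layer(h, v, t=0.1, s=0.0)                  # zero scale: equals h
```

In this sketch the modulation is linear in `s`, so doubling the scalar doubles the activation change at a given step. How that translates into image-level metrics depends on the frozen backbone and is not something the toy can show.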

Core claim

Across experiments on Stable Diffusion 1.5 and SDXL covering multiple artistic styles, we show that SteeringDiffusion produces smooth and monotonic content–style trade-offs. Under matched parameter budgets, it outperforms LoRA in controllability and stability, while ControlNet and rank-1 adapters do not expose a comparable control surface.

Load-bearing premise

That a small prompt-conditioned latent code projected into FiLM/AdaGN modulation parameters, combined with zero initialization and timestep gating, can deliver independent, stable control over the content-style trade-off without degrading base model quality or introducing artifacts.

Figures

Figures reproduced from arXiv: 2605.01653 by Brian Summa, Fangzheng Wu.

**Figure 1.** SteeringDiffusion exposes a smooth runtime control manifold. Conceptual illustration (not to scale). Varying a single inference-time scalar traverses a continuous, monotonic content–style trade-off (blue), while alternative control mechanisms can exhibit fragmented traversal (red) or off-manifold, more invasive behavior (orange). Quantitative trade-off curves and matched-strength comparisons are provided in …

**Figure 2.** Taxonomy of parameter-efficient adaptation and steering mechanisms. Methods are organized by whether control is implicit in weight space or explicit via external signals (x-axis), and by the dimensionality of the control bottleneck (y-axis). SteeringDiffusion belongs to bottlenecked explicit control (S-BEC). Activation steering in language models. Recent work [14, 15, 16] shows that adding learned vectors …

**Figure 3.** Content–style trade-off on Art Nouveau (class 0). At matched CLIP-I, steering achieves …

**Figure 4.** Content–style trade-off across four ArtBench styles. Increasing the steering scale s consistently produces monotonic trade-offs across all styles, confirming a style-invariant control surface.

| Style | ρ(s, Style) | ρ(s, CLIP-I) | Style Viol. | CLIP-I Viol. |
|---|---|---|---|---|
| Art Nouveau | +1.000∗ | −1.000∗ | 0/6 | 0/6 |
| Impressionism | +1.000∗ | −1.000∗ | 0/6 | 0/6 |
| Renaissance | +1.000∗ | −1.000∗ | 0/6 | 0/6 |
| Ukiyo-e | +1.000∗ | −1.000∗ | 0/6 | 0/6 |
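The ρ and violation figures reported with Figure 4 read like a rank correlation between the steering scale s and each metric, plus a count of adjacent steps that move in the wrong direction. The sketch below shows one way such numbers could be computed. It assumes that ρ is Spearman's rank correlation, that "0/6" means zero violations over six adjacent pairs (so seven scale values), and that only strictly wrong-direction steps count as violations. The scale grid and scores are made-up illustrative values, not data from the paper.

```python
import numpy as np

def ranks(values):
    """Ranks 1..n for a sequence without ties (ties are not handled here)."""
    order = np.argsort(values)
    r = np.empty(len(values), dtype=float)
    r[order] = np.arange(1, len(values) + 1)
    return r

def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation of the two rank vectors."""
    return float(np.corrcoef(ranks(x), ranks(y))[0, 1])

def monotonicity_violations(values, increasing=True):
    """Count adjacent steps that move against the expected direction.
    Returns (violations, number_of_adjacent_pairs)."""
    diffs = np.diff(np.asarray(values, dtype=float))
    bad = diffs < 0 if increasing else diffs > 0
    return int(bad.sum()), len(diffs)

# Hypothetical sweep: seven scale values give six adjacent pairs.
scales = np.array([0.0, 0.25, 0.5, 0.75, 1.0, 1.25, 1.5])
style_score = np.array([0.21, 0.24, 0.29, 0.33, 0.38, 0.41, 0.45])  # illustrative
clip_i = np.array([0.92, 0.90, 0.87, 0.83, 0.80, 0.76, 0.71])       # illustrative

rho_style = spearman_rho(scales, style_score)
rho_clip = spearman_rho(scales, clip_i)
style_viol = monotonicity_violations(style_score, increasing=True)
clip_viol = monotonicity_violations(clip_i, increasing=False)
```

A perfectly monotonic sweep gives a rank correlation of magnitude one regardless of how evenly the metric moves, so a check like this says nothing about smoothness; that would need a separate look at the step sizes.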

Original abstract

We introduce SteeringDiffusion, a bottlenecked activation-level control interface for diffusion models that exposes a smooth, monotonic, and runtime-adjustable control surface over the content–style trade-off. Our method keeps the U-Net backbone frozen and learns a small, prompt-conditioned latent code projected to FiLM/AdaGN-style modulation parameters. A zero-initialized design guarantees exact equivalence to the base model at zero scale, while timestep-aware gating restricts modulation to later denoising stages. A single scalar at inference continuously traverses the control surface without retraining. Across experiments on Stable Diffusion 1.5 and SDXL covering multiple artistic styles, we show that SteeringDiffusion produces smooth and monotonic content–style trade-offs. Under matched parameter budgets, it outperforms LoRA in controllability and stability, while ControlNet and rank-1 adapters do not expose a comparable control surface. We further introduce an inversion-stability diagnostic based on DDIM inversion, used as a post-hoc trajectory probe, which reveals strong correlations with intervention magnitude. These results position Steering Bottlenecked Explicit Control (S-BEC) as a practical, general-purpose control interface for frozen diffusion backbones.
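The abstract does not spell out how the inversion-stability diagnostic is computed. One plausible reading is a round trip: invert a latent to noise with deterministic DDIM steps, sample back, and measure how far the result lands from the start. The sketch below illustrates that idea with a toy noise predictor. The schedule in `make_alpha_bar`, the predictor in `toy_eps`, and the relative-error metric are all assumptions made here for illustration; only the DDIM update itself is standard.

```python
import numpy as np

def make_alpha_bar(num_steps):
    """Hypothetical cumulative signal schedule: index 0 is nearly clean,
    index num_steps is mostly noise."""
    return np.linspace(0.999, 0.05, num_steps + 1)

def ddim_move(x, eps, a_from, a_to):
    """Deterministic DDIM update between two noise levels, given a noise estimate."""
    x0_pred = (x - np.sqrt(1.0 - a_from) * eps) / np.sqrt(a_from)
    return np.sqrt(a_to) * x0_pred + np.sqrt(1.0 - a_to) * eps

def round_trip_error(x0, eps_fn, alpha_bar):
    """Invert x0 to the noisiest level, sample back, return relative L2 error."""
    num_steps = len(alpha_bar) - 1
    x = x0.copy()
    for t in range(num_steps):             # inversion: level t -> t + 1
        x = ddim_move(x, eps_fn(x, t), alpha_bar[t], alpha_bar[t + 1])
    for t in range(num_steps, 0, -1):      # sampling: level t -> t - 1
        x = ddim_move(x, eps_fn(x, t), alpha_bar[t], alpha_bar[t - 1])
    return float(np.linalg.norm(x - x0) / np.linalg.norm(x0))

def toy_eps(s, direction, weight):
    """Toy stand-in for a steered noise predictor; s scales a fixed offset."""
    def eps_fn(x, t):
        return np.tanh(weight @ x) + s * direction
    return eps_fn

rng = np.random.default_rng(0)
x0 = rng.normal(size=6)
weight = 0.3 * rng.normal(size=(6, 6))
direction = rng.normal(size=6)
direction /= np.linalg.norm(direction)
alpha_bar = make_alpha_bar(20)

# Round-trip error at a few hypothetical steering scales.
errors = {s: round_trip_error(x0, toy_eps(s, direction, weight), alpha_bar)
          for s in (0.0, 0.5, 1.0)}
```

The round trip is exact only when the noise estimate is the same in both directions, for example a predictor that ignores its inputs. Any dependence on `x` or `t` makes inversion approximate, which is why the error can serve as a probe. Whether it grows with `s` depends on the predictor, and this toy makes no claim about the trend reported in the paper.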

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Axiom & Free-Parameter Ledger

1 free parameter · 1 axiom · 0 invented entities

Abstract provides insufficient detail to enumerate all free parameters or axioms; the method assumes modulation via FiLM/AdaGN on a frozen backbone is expressive enough for the desired control.

free parameters (1)

latent code dimension
Size of the prompt-conditioned latent code is a design choice that determines capacity of the control interface.

axioms (1)

domain assumption: Modulation of U-Net activations via learned FiLM/AdaGN parameters is sufficient to steer the content–style trade-off without backbone updates.
Core premise that allows the backbone to remain frozen.



Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

Cost / FunctionalEquation (J(x)=½(x+x⁻¹)−1) washburn_uniqueness_aczel unclear


unclear
Relation between the paper passage and the cited Recognition theorem.

∆_ℓ(h_ℓ, c, t) = s f(t)(γ_ℓ(v) ⊙ GN(h_ℓ) + β_ℓ(v))

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

50 extracted references

[1] Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models, 2020

[2] Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, and Ben Poole. Score-based generative modeling through stochastic differential equations, 2021

[3] Prafulla Dhariwal and Alex Nichol. Diffusion models beat GANs on image synthesis, 2021

[4] Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. High-resolution image synthesis with latent diffusion models, 2022

[5] Tero Karras, Miika Aittala, Timo Aila, and Samuli Laine. Elucidating the design space of diffusion-based generative models, 2022

[6] Edward J. Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, and Weizhu Chen. LoRA: Low-rank adaptation of large language models, 2021

[7] Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, and Zhenguo Li. DiffFit: Unlocking transferability of large diffusion models via simple parameter-efficient fine-tuning, 2023

[8] Zeju Qiu, Weiyang Liu, Haiwen Feng, Yuxuan Xue, Yao Feng, Zhen Liu, Dan Zhang, Adrian Weller, and Bernhard Schölkopf. Controlling text-to-image diffusion by orthogonal finetuning, 2024

[9] Nataniel Ruiz, Yuanzhen Li, Varun Jampani, Yael Pritch, Michael Rubinstein, and Kfir Aberman. DreamBooth: Fine tuning text-to-image diffusion models for subject-driven generation, 2023

[10] Rinon Gal, Yuval Alaluf, Yuval Atzmon, Or Patashnik, Amit H. Bermano, Gal Chechik, and Daniel Cohen-Or. An image is worth one word: Personalizing text-to-image generation using textual inversion, 2022

[11] Nupur Kumari, Bingliang Zhang, Richard Zhang, Eli Shechtman, and Jun-Yan Zhu. Multi-concept customization of text-to-image diffusion, 2023

[12] Ligong Han, Yinxiao Li, Han Zhang, Peyman Milanfar, Dimitris Metaxas, and Feng Yang. SVDiff: Compact parameter space for diffusion fine-tuning, 2023

[13] Reza Bayat, Ali Rahimi-Kalahroudi, Mohammad Pezeshki, Sarath Chandar, and Pascal Vincent. Steering large language model activations in sparse spaces, 2025

[14] Viacheslav Sinii, Alexey Gorbatovski, Artem Cherepanov, Boris Shaposhnikov, Nikita Balagansky, and Daniil Gavrilov. Steering LLM reasoning through bias-only adaptation, 2025

[15] Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, and Dan Hendrycks. Representation engineering: A top-down approach to a..., 2025

[16] Kenneth Li, Oam Patel, Fernanda Viégas, Hanspeter Pfister, and Martin Wattenberg. Inference-time intervention: Eliciting truthful answers from a language model, 2024

[17] Nishant Subramani, Nivedita Suresh, and Matthew E. Peters. Extracting latent steering vectors from pretrained language models, 2022

[18] Nina Panickssery, Nick Gabrieli, Julian Schulz, Meg Tong, Evan Hubinger, and Alexander Matt Turner. Steering Llama 2 via contrastive activation addition, 2024

[19] Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, and David Bau. Function vectors in large language models, 2024

[20] Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, and David Bau. Linearity of relation decoding in transformer language models, 2024

[21] Kiho Park, Yo Joong Choe, and Victor Veitch. The linear representation hypothesis and the geometry of large language models, 2024

[22] Ethan Perez, Florian Strub, Harm de Vries, Vincent Dumoulin, and Aaron Courville. FiLM: Visual reasoning with a general conditioning layer, 2017

[23] Xun Huang and Serge Belongie. Arbitrary style transfer in real-time with adaptive instance normalization, 2017

[24] Peiyuan Liao, Xiuyu Li, Xihui Liu, and Kurt Keutzer. The ArtBench dataset: Benchmarking generative models with artworks, 2022

[25] Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models, 2023

[26] Mengyao Lyu, Yuhong Yang, Haiwen Hong, Hui Chen, Xuan Jin, Yuan He, Hui Xue, Jungong Han, and Guiguang Ding. One-dimensional adapter to rule them all: Concepts, diffusion models and erasing applications, 2024

[27] Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models, 2022

[28] Rohit Gandikota, Joanna Materzynska, Jaden Fiotto-Kaufman, and David Bau. Erasing concepts from diffusion models, 2023

[29] Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzyńska, and David Bau. Unified concept editing in diffusion models, 2024

[30] Shyam Marjit, Harshit Singh, Nityanand Mathur, Sayak Paul, Chia-Mu Yu, and Pin-Yu Chen. DiffuseKronA: A parameter efficient fine-tuning method for personalized diffusion models, 2024

[31] David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, and Antonio Torralba. Network dissection: Quantifying interpretability of deep visual representations, 2017

[32] Kevin Meng, David Bau, Alex Andonian, and Yonatan Belinkov. Locating and editing factual associations in GPT, 2023

[33] Jonathan Ho and Tim Salimans. Classifier-free diffusion guidance, 2022

[34] Amir Hertz, Ron Mokady, Jay Tenenbaum, Kfir Aberman, Yael Pritch, and Daniel Cohen-Or. Prompt-to-prompt image editing with cross attention control, 2022

[35] Narek Tumanyan, Michal Geyer, Shai Bagon, and Tali Dekel. Plug-and-play diffusion features for text-driven image-to-image translation, 2022

[36] Hu Ye, Jun Zhang, Sibo Liu, Xiao Han, and Wei Yang. IP-Adapter: Text compatible image prompt adapter for text-to-image diffusion models, 2023

[37] Chong Mou, Xintao Wang, Liangbin Xie, Yanze Wu, Jian Zhang, Zhongang Qi, Ying Shan, and Xiaohu Qie. T2I-Adapter: Learning adapters to dig out more controllable ability for text-to-image diffusion models, 2023

[38] Tim Brooks, Aleksander Holynski, and Alexei A. Efros. InstructPix2Pix: Learning to follow image editing instructions, 2023

[39] Bahjat Kawar, Shiran Zada, Oran Lang, Omer Tov, Huiwen Chang, Tali Dekel, Inbar Mosseri, and Michal Irani. Imagic: Text-based real image editing with diffusion models, 2023

[40] Chenlin Meng, Yutong He, Yang Song, Jiaming Song, Jiajun Wu, Jun-Yan Zhu, and Stefano Ermon. SDEdit: Guided image synthesis and editing with stochastic differential equations, 2022

[41] Zhizhong Wang, Lei Zhao, and Wei Xing. StyleDiffusion: Controllable disentangled style transfer via diffusion models, 2023

[42] Gwanghyun Kim, Taesung Kwon, and Jong Chul Ye. DiffusionCLIP: Text-guided diffusion models for robust image manipulation, 2022

[43] Or Patashnik, Zongze Wu, Eli Shechtman, Daniel Cohen-Or, and Dani Lischinski. StyleCLIP: Text-driven manipulation of StyleGAN imagery, 2021

[44] Taesung Park, Ming-Yu Liu, Ting-Chun Wang, and Jun-Yan Zhu. Semantic image synthesis with spatially-adaptive normalization, 2019

[45] Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. Learning transferable visual models from natural language supervision, 2021

[46] Zipeng Qi, Lichen Bai, Haoyi Xiong, and Zeke Xie. Not all noises are created equally: Diffusion noise selection and optimization, 2024

[47] Ron Mokady, Amir Hertz, Kfir Aberman, Yael Pritch, and Daniel Cohen-Or. Null-text inversion for editing real images using guided diffusion models, 2022

[48] Damjan Kalajdzievski. A rank stabilization scaling factor for fine-tuning with LoRA, 2023

[49] Dustin Podell, Zion English, Kyle Lacey, Andreas Blattmann, Tim Dockhorn, Jonas Müller, Joe Penna, and Robin Rombach. SDXL: Improving latent diffusion models for high-resolution image synthesis, 2023

[50] William Peebles and Saining Xie. Scalable diffusion models with transformers, 2023
