UnHype: CLIP-Guided Hypernetworks for Dynamic LoRA Unlearning

Maciej Zieba; Maksym Petrenko; Piotr Wójcik; Przemysław Spurek; Wojciech Gromski

arxiv: 2602.03410 · v2 · pith:DY3AVV7Vnew · submitted 2026-02-03 · 💻 cs.CV

keywords: lora, unhype, unlearning, concepts, effective, models, across, concept

Recent advances in large-scale diffusion models have intensified concerns about their potential misuse, particularly in generating realistic yet harmful or socially disruptive content. This challenge has spurred growing interest in effective machine unlearning, the process of selectively removing specific knowledge or concepts from a model without compromising its overall generative capabilities. Among various approaches, Low-Rank Adaptation (LoRA) has emerged as an effective and efficient method for fine-tuning models toward targeted unlearning. However, LoRA-based methods often exhibit limited adaptability to concept semantics and struggle to balance removing closely related concepts with maintaining generalization across broader meanings. Moreover, these methods face scalability challenges when multiple concepts must be erased simultaneously. To address these limitations, we introduce UnHype, a framework that incorporates hypernetworks into single- and multi-concept LoRA training. The proposed architecture can be directly plugged into Stable Diffusion as well as modern flow-based text-to-image models, where it demonstrates stable training behavior and effective concept control. During inference, the hypernetwork dynamically generates adaptive LoRA weights based on the CLIP embedding, enabling more context-aware, scalable unlearning. We evaluate UnHype across several challenging tasks, including object erasure, celebrity erasure, and explicit content removal, demonstrating its effectiveness and versatility. See the code on GitHub: https://github.com/gmum/UnHype.
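
The core idea in the abstract, a hypernetwork that turns a text embedding into LoRA weights at inference time, can be illustrated with a small sketch. This is not the authors' architecture: the class name `LoRAHyperNet`, the single tanh hidden layer, the layer sizes, the zero initialization of the `B` head, and the helper `adapted_linear` are all assumptions made for illustration, and a random vector stands in for a real CLIP embedding.

```python
import numpy as np


class LoRAHyperNet:
    """Toy hypernetwork (hypothetical): maps an embedding to LoRA factors (A, B)."""

    def __init__(self, embed_dim, in_dim, out_dim, rank, hidden=32, seed=0):
        rng = np.random.default_rng(seed)
        self.in_dim, self.out_dim, self.rank = in_dim, out_dim, rank
        # Shared trunk followed by two heads, one per LoRA factor.
        self.w1 = rng.normal(0.0, 0.1, size=(embed_dim, hidden))
        self.w_a = rng.normal(0.0, 0.1, size=(hidden, rank * in_dim))
        # Assumption: the B head starts at zero so the initial update is zero,
        # mirroring the usual LoRA initialization.
        self.w_b = np.zeros((hidden, out_dim * rank))

    def __call__(self, embedding):
        h = np.tanh(embedding @ self.w1)
        a = (h @ self.w_a).reshape(self.rank, self.in_dim)   # A: rank x in_dim
        b = (h @ self.w_b).reshape(self.out_dim, self.rank)  # B: out_dim x rank
        return a, b


def adapted_linear(x, base_weight, a, b, scale=1.0):
    """Apply a frozen linear layer plus the low-rank update scale * B @ A."""
    return x @ base_weight.T + scale * (x @ a.T) @ b.T


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    net = LoRAHyperNet(embed_dim=8, in_dim=6, out_dim=4, rank=2)
    fake_embedding = rng.normal(size=8)   # stand-in for a CLIP text embedding
    a, b = net(fake_embedding)
    base_weight = rng.normal(size=(4, 6))  # frozen layer weight
    x = rng.normal(size=(3, 6))            # batch of layer inputs
    print(adapted_linear(x, base_weight, a, b).shape)
```

Because the factors are produced per embedding rather than stored per concept, one set of hypernetwork parameters can in principle serve many concepts, which is the scalability argument the abstract makes.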

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

BARRIER: Bounded Activation Regions for Robust Information Erasure
cs.CV 2026-05 unverdicted novelty 5.0

BARRIER applies interval arithmetic to SVD-based activation projections to create bounded forget regions that enable aggressive unlearning while providing formal protection for retain distributions via tail bounds on ...