hub

Flux.1 kontext: Flow matching for in-context image generation and editing in latent space,

Black Forest Labs, Stephen Batifol, Andreas Blattmann, Frederic Boesel, Saksham Consul, Cyril Diagne, Tim Dockhorn, Jack English, Zion English, Patrick Esser, Sumith Kulal, Kyl

12 Pith papers cite this work. Polarity classification is still indexing.

12 Pith papers citing it

browse 12 citing papers

hub tools

JSON dossier citing papers JSON

representative citing papers

ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control

cs.LG · 2026-04-22 · unverdicted · novelty 7.0

ParetoSlider conditions diffusion models on continuous preference weights to approximate the full Pareto front, providing dynamic control over multi-objective rewards at inference time.

ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control

cs.CV · 2026-03-15 · unverdicted · novelty 7.0

ChArtist generates pictorial charts via a Diffusion Transformer using skeleton-based spatial control and reference-image subject control, supported by a new 30,000-triplet dataset and data accuracy metric.

LooseRoPE: Content-aware Attention Manipulation for Semantic Harmonization

cs.GR · 2026-01-08 · unverdicted · novelty 7.0

LooseRoPE modulates RoPE in diffusion attention maps to continuously trade off between preserving a pasted object's identity and harmonizing it with its new surroundings.

Do-Undo Bench: Reversibility for Action Understanding in Image Generation

cs.CV · 2025-12-15 · unverdicted · novelty 7.0

Do-Undo Bench is a new evaluation task and dataset that forces models to simulate forward action effects and then undo them to measure genuine action understanding in image generation.

MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition

cs.CV · 2025-12-08 · unverdicted · novelty 7.0

MICo-150K is a new 150K-image dataset with 7 tasks, a De&Re real-image subset, MICo-Bench, and Weighted-Ref-VIEScore metric that improves AI models for generating consistent composites from arbitrary numbers of reference images.

A Framework for Evaluating Zero-Shot Image Generation in Concept-based Explainability

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

The paper introduces a framework of four complementary analyses to evaluate the faithfulness of synthetic concept images from zero-shot T2I models versus real images for concept-based XAI.

Stepper: Stepwise Immersive Scene Generation with Multiview Panoramas

cs.CV · 2026-03-30 · unverdicted · novelty 6.0

Stepper uses stepwise panoramic expansion with a multi-view 360-degree diffusion model and geometry reconstruction to produce high-fidelity, structurally consistent immersive 3D scenes from text.

RenderFlow: Single-Step Neural Rendering via Flow Matching

cs.CV · 2026-01-11 · unverdicted · novelty 6.0

RenderFlow replaces iterative diffusion with flow matching for deterministic single-step neural rendering that achieves near real-time photorealistic quality and extends to inverse rendering via an adapter module.

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

cs.CV · 2025-12-14 · conditional · novelty 6.0

Scone unifies subject understanding and generation in a two-stage trained model to improve both composition and distinction in multi-subject image generation, outperforming prior open-source models on new benchmarks.

Refracting Reality: Generating Images with Realistic Transparent Objects

cs.CV · 2025-11-21 · unverdicted · novelty 6.0

The method warps pixels inside object boundaries with Snell's Law during generation and synchronizes with a second panorama image to produce optically plausible refraction in text-to-image outputs.

SkyReels-Text: Fine-Grained Font-Controllable Text Editing for Poster Design

cs.CV · 2025-11-17 · unverdicted · novelty 6.0

SkyReels-Text enables simultaneous fine-grained editing of multiple text regions in posters using arbitrary glyph patches for font control without labels or test-time fine-tuning.

GrOCE:Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models

cs.CV · 2025-11-17 · unverdicted · novelty 5.0

GrOCE uses dynamic semantic graphs for online, training-free erasure of target concepts from diffusion model prompts via cluster identification and selective severing.

citing papers explorer

Showing 12 of 12 citing papers.

ParetoSlider: Diffusion Models Post-Training for Continuous Reward Control cs.LG · 2026-04-22 · unverdicted · none · ref 23
ParetoSlider conditions diffusion models on continuous preference weights to approximate the full Pareto front, providing dynamic control over multi-objective rewards at inference time.
ChArtist: Generating Pictorial Charts with Unified Spatial and Subject Control cs.CV · 2026-03-15 · unverdicted · none · ref 24
ChArtist generates pictorial charts via a Diffusion Transformer using skeleton-based spatial control and reference-image subject control, supported by a new 30,000-triplet dataset and data accuracy metric.
LooseRoPE: Content-aware Attention Manipulation for Semantic Harmonization cs.GR · 2026-01-08 · unverdicted · none · ref 22
LooseRoPE modulates RoPE in diffusion attention maps to continuously trade off between preserving a pasted object's identity and harmonizing it with its new surroundings.
Do-Undo Bench: Reversibility for Action Understanding in Image Generation cs.CV · 2025-12-15 · unverdicted · none · ref 17
Do-Undo Bench is a new evaluation task and dataset that forces models to simulate forward action effects and then undo them to measure genuine action understanding in image generation.
MICo-150K: A Comprehensive Dataset Advancing Multi-Image Composition cs.CV · 2025-12-08 · unverdicted · none · ref 38
MICo-150K is a new 150K-image dataset with 7 tasks, a De&Re real-image subset, MICo-Bench, and Weighted-Ref-VIEScore metric that improves AI models for generating consistent composites from arbitrary numbers of reference images.
A Framework for Evaluating Zero-Shot Image Generation in Concept-based Explainability cs.CV · 2026-05-19 · unverdicted · none · ref 10
The paper introduces a framework of four complementary analyses to evaluate the faithfulness of synthetic concept images from zero-shot T2I models versus real images for concept-based XAI.
Stepper: Stepwise Immersive Scene Generation with Multiview Panoramas cs.CV · 2026-03-30 · unverdicted · none · ref 27
Stepper uses stepwise panoramic expansion with a multi-view 360-degree diffusion model and geometry reconstruction to produce high-fidelity, structurally consistent immersive 3D scenes from text.
RenderFlow: Single-Step Neural Rendering via Flow Matching cs.CV · 2026-01-11 · unverdicted · none · ref 14
RenderFlow replaces iterative diffusion with flow matching for deterministic single-step neural rendering that achieves near real-time photorealistic quality and extends to inverse rendering via an adapter module.
Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling cs.CV · 2025-12-14 · conditional · none · ref 13
Scone unifies subject understanding and generation in a two-stage trained model to improve both composition and distinction in multi-subject image generation, outperforming prior open-source models on new benchmarks.
Refracting Reality: Generating Images with Realistic Transparent Objects cs.CV · 2025-11-21 · unverdicted · none · ref 20
The method warps pixels inside object boundaries with Snell's Law during generation and synchronizes with a second panorama image to produce optically plausible refraction in text-to-image outputs.
SkyReels-Text: Fine-Grained Font-Controllable Text Editing for Poster Design cs.CV · 2025-11-17 · unverdicted · none · ref 18
SkyReels-Text enables simultaneous fine-grained editing of multiple text regions in posters using arbitrary glyph patches for font control without labels or test-time fine-tuning.
GrOCE:Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models cs.CV · 2025-11-17 · unverdicted · none · ref 16
GrOCE uses dynamic semantic graphs for online, training-free erasure of target concepts from diffusion model prompts via cluster identification and selective severing.

Flux.1 kontext: Flow matching for in-context image generation and editing in latent space,

hub tools

fields

years

verdicts

representative citing papers

citing papers explorer