Physctrl: Generative physics for controllable and physics-grounded video generation

Chen Wang, Chuhao Chen, Yiming Huang, Zhiyang Dou, Yuan Liu, Jiatao Gu, Lingjie Liu · 2025 · arXiv 2509.20358

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

cs.LG · 2026-04-10 · unverdicted · novelty 6.0

DACO curates a 15,000-concept dictionary from 400K image-caption pairs and uses it to initialize an SAE that enables granular, concept-specific steering of MLLM activations, raising safety scores on MM-SafetyBench and JailBreakV while preserving general capabilities.

Lighting-grounded Video Generation with Renderer-based Agent Reasoning

cs.CV · 2026-04-09 · unverdicted · novelty 6.0

LiVER conditions video diffusion models on renderer-derived 3D control signals for disentangled, editable control over object layout, lighting, and camera trajectory.

Pretrained Video Models as Differentiable Physics Simulators for Urban Wind Flows

cs.LG · 2026-03-22 · unverdicted · novelty 6.0

WinDiNet repurposes a 2B-parameter video diffusion model as a differentiable surrogate that generates 112-frame urban wind flow rollouts in under one second and enables direct gradient optimization of building positions.

Next-Scale Autoregressive Models for Text-to-Motion Generation

cs.CV · 2026-04-04

citing papers explorer

Showing 4 of 4 citing papers.

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs cs.LG · 2026-04-10 · unverdicted · none · ref 102
DACO curates a 15,000-concept dictionary from 400K image-caption pairs and uses it to initialize an SAE that enables granular, concept-specific steering of MLLM activations, raising safety scores on MM-SafetyBench and JailBreakV while preserving general capabilities.
Lighting-grounded Video Generation with Renderer-based Agent Reasoning cs.CV · 2026-04-09 · unverdicted · none · ref 40
LiVER conditions video diffusion models on renderer-derived 3D control signals for disentangled, editable control over object layout, lighting, and camera trajectory.
Pretrained Video Models as Differentiable Physics Simulators for Urban Wind Flows cs.LG · 2026-03-22 · unverdicted · none · ref 55
WinDiNet repurposes a 2B-parameter video diffusion model as a differentiable surrogate that generates 112-frame urban wind flow rollouts in under one second and enables direct gradient optimization of building positions.
Next-Scale Autoregressive Models for Text-to-Motion Generation cs.CV · 2026-04-04 · unreviewed · ref 49

Physctrl: Generative physics for controllable and physics-grounded video generation

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer