Safer Trajectory Planning with CBF-guided Diffusion Model for Unmanned Aerial Vehicles

Jiahao Hu; Peiwen Yang; Shiyu Bai; Weisong Wen; Yixin Gao

arxiv: 2604.17527 · v2 · submitted 2026-04-19 · 💻 cs.RO

Safer Trajectory Planning with CBF-guided Diffusion Model for Unmanned Aerial Vehicles

Peiwen Yang , Shiyu Bai , Weisong Wen , Yixin Gao , Jiahao Hu This is my paper

Pith reviewed 2026-05-10 05:22 UTC · model grok-4.3

classification 💻 cs.RO

keywords UAV trajectory planningdiffusion modelscontrol barrier functionssafe motion planningaerial roboticsgenerative modelsobstacle avoidanceaerobatic maneuvers

0 comments

The pith

Integrating control barrier function gradients into diffusion sampling generates collision-free UAV trajectories without needing fully safety-verified training data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that diffusion models can generate agile UAV trajectories for complex aerobatic maneuvers by guiding the reverse sampling process with control barrier function gradients that enforce safety constraints. This matters because standard diffusion outputs are often unpredictable and can violate safety, while collecting large sets of verified safe demonstrations is costly and difficult. The guidance works by adding safety gradients directly to the model's score function at each inference step, steering generations away from collisions while preserving diversity. Simulation tests in multi-obstacle settings show the guided outputs maintain high agility across 14 maneuver types.

Core claim

The central claim is that CBF-guided sampling during the reverse diffusion process ensures collision-free trajectories by seamlessly integrating safety constraint gradients with the diffusion model's score function. This enables the generation of smooth, highly agile trajectories across 14 distinct aerobatic maneuvers using an obstacle-aware diffusion transformer with multi-modal conditioning on history, obstacles, styles, and goals, all trained on 2,000 expert demonstrations and evaluated in simulated multi-obstacle environments.

What carries the argument

The CBF-guided sampling mechanism, which adds gradients from control barrier functions that enforce safety constraints directly to the diffusion score function at each reverse process step to steer outputs toward collision-free paths.

If this is right

The method supports training on standard expert data rather than requiring costly safety-verified datasets while still producing safe outputs at runtime.
It enables generation of trajectories for 14 aerobatic maneuvers that remain agile and diverse under multi-obstacle conditions.
Safety enforcement occurs during sampling without retraining the underlying diffusion transformer.
The approach reduces collision rates by 94.7 percent relative to unguided diffusion baselines in simulation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same gradient-steering idea during sampling could extend to other generative planners in robotics where full constraint satisfaction during training is impractical.
By shifting safety work to inference, the framework may lower data collection costs for deploying similar models on new vehicle types or environments.
If the simulation results transfer, the method could support more aggressive autonomous maneuvers in real settings by relaxing the need for perfectly curated safe demonstrations.

Load-bearing premise

Adding the CBF gradients at inference time will reliably enforce safety without distorting the learned distribution from the training data or requiring that data to already be extensively safety-verified, and that simulation gains will hold on physical UAVs.

What would settle it

Running the guided model on held-out multi-obstacle scenarios and measuring whether collision rates stay near the unguided baseline level would directly test the claim; sustained high collision rates would falsify effective safety enforcement.

read the original abstract

Safe and agile trajectory planning is essential for autonomous systems, especially during complex aerobatic maneuvers. Motivated by the recent success of diffusion models in generative tasks, this paper introduces AeroTrajGen, a novel framework for diffusion-based trajectory generation that incorporates control barrier function (CBF)-guided sampling during inference, specifically designed for unmanned aerial vehicles (UAVs). The proposed CBF-guided sampling addresses two critical challenges: (1) mitigating the inherent unpredictability and potential safety violations of diffusion models, and (2) reducing reliance on extensively safety-verified training data. During the reverse diffusion process, CBF-based guidance ensures collision-free trajectories by seamlessly integrating safety constraint gradients with the diffusion model's score function. The model features an obstacle-aware diffusion transformer architecture with multi-modal conditioning, including trajectory history, obstacles, maneuver styles, and goal, enabling the generation of smooth, highly agile trajectories across 14 distinct aerobatic maneuvers. Trained on a dataset of 2,000 expert demonstrations, AeroTrajGen is rigorously evaluated in simulation under multi-obstacle environments. Simulation results demonstrate that CBF-guided sampling reduces collision rates by 94.7% compared to unguided diffusion baselines, while preserving trajectory agility and diversity. Our code is open-sourced at https://github.com/RoboticsPolyu/CBF-DMP.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The abstract claims a 94.7% collision drop from CBF-guided diffusion sampling for UAV aerobatics, but without the actual guidance equation or schedule it is impossible to tell whether the safety comes from the method or from the training data.

read the letter

The paper's main contribution is a diffusion-based planner called AeroTrajGen that adds control barrier function guidance during the reverse process to produce collision-free trajectories for UAVs across 14 aerobatic maneuvers. It conditions a transformer on obstacles, history, maneuver style, and goal, trains on 2000 expert demonstrations, and reports that the guided version cuts collisions sharply while keeping agility and diversity. The code is open-sourced, which helps anyone who wants to inspect the implementation later.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces AeroTrajGen, a diffusion-based trajectory generation framework for UAVs that incorporates CBF-guided sampling during the reverse diffusion process. It uses an obstacle-aware diffusion transformer conditioned on trajectory history, obstacles, maneuver styles, and goals to generate trajectories for 14 aerobatic maneuvers. Trained on 2,000 expert demonstrations, the method is claimed to reduce collision rates by 94.7% in simulation compared to unguided baselines while preserving agility and diversity. The code is open-sourced.

Significance. Should the integration of CBF guidance prove effective as described, this work has the potential to enhance safety in generative models for robotic planning without requiring fully safe training datasets. The open-sourcing of the implementation is a notable strength that facilitates reproducibility and extension by the community.

major comments (2)

[Abstract] The central mechanism is described as 'CBF-based guidance ensures collision-free trajectories by seamlessly integrating safety constraint gradients with the diffusion model's score function,' yet no equation, guidance formulation, or schedule is provided. This omission is load-bearing for the claim of a 94.7% collision reduction, as it leaves unclear whether the CBF term reliably enforces constraints or can be dominated by the learned score.
[Abstract] The simulation results are summarized as a 94.7% collision rate reduction, but the abstract supplies no information on the evaluation protocol, including environment details, baseline implementations, number of trials, or statistical tests. This prevents assessment of whether the result supports the broader claims about safety and data efficiency.

minor comments (1)

[Abstract] The abstract uses 'CBF' without spelling out 'Control Barrier Function' on first use, although this is common terminology in the field.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their valuable comments, which highlight opportunities to strengthen the abstract. We will make revisions to address the concerns regarding the description of the CBF guidance and the evaluation details. Our point-by-point responses are as follows.

read point-by-point responses

Referee: [Abstract] The central mechanism is described as 'CBF-based guidance ensures collision-free trajectories by seamlessly integrating safety constraint gradients with the diffusion model's score function,' yet no equation, guidance formulation, or schedule is provided. This omission is load-bearing for the claim of a 94.7% collision reduction, as it leaves unclear whether the CBF term reliably enforces constraints or can be dominated by the learned score.

Authors: We agree with the referee that the abstract would be improved by including more details on the CBF guidance mechanism to support the safety claims. The full paper provides the detailed formulation of the CBF-guided sampling process, but to make the abstract self-contained, we will add a brief description of the guidance, such as how the safety constraint gradients are incorporated into the score function during sampling, and mention the guidance schedule used. This revision will clarify the integration and help substantiate the reported collision reduction. revision: yes
Referee: [Abstract] The simulation results are summarized as a 94.7% collision rate reduction, but the abstract supplies no information on the evaluation protocol, including environment details, baseline implementations, number of trials, or statistical tests. This prevents assessment of whether the result supports the broader claims about safety and data efficiency.

Authors: We concur that additional context on the evaluation protocol is necessary in the abstract. We will revise the abstract to include key aspects of the simulation setup, such as the multi-obstacle environments, the unguided diffusion baselines used for comparison, the scope across 14 aerobatic maneuvers, and that the results are based on extensive trials in simulation. This will provide readers with sufficient information to evaluate the 94.7% reduction claim. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical performance claims independent of inputs

full rationale

The abstract presents AeroTrajGen as a diffusion model trained on 2,000 expert demonstrations, with CBF-guided sampling added at inference to enforce safety. The central result (94.7% collision-rate reduction) is obtained by direct simulation comparison against unguided baselines. No equations, derivations, fitted parameters renamed as predictions, or self-citations appear in the provided text. The safety claim is framed as an outcome of the guidance mechanism rather than a tautology or reduction to the training distribution by construction. The contribution is therefore self-contained as an empirical method evaluation.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 1 invented entities

The approach relies on standard assumptions from diffusion models and control theory; no explicit free parameters or invented physical entities are detailed in the abstract.

axioms (2)

domain assumption Expert demonstrations provide sufficient coverage for learning agile UAV maneuvers
Training uses 2,000 expert demonstrations without further qualification in the abstract.
domain assumption CBF constraints can be expressed as differentiable gradients compatible with score-based sampling
The guidance mechanism assumes seamless integration of safety gradients.

invented entities (1)

AeroTrajGen framework no independent evidence
purpose: Diffusion-based trajectory generator with CBF safety guidance
New named system introduced in the paper.

pith-pipeline@v0.9.0 · 5514 in / 1312 out tokens · 49382 ms · 2026-05-10T05:22:02.653650+00:00 · methodology

Safer Trajectory Planning with CBF-guided Diffusion Model for Unmanned Aerial Vehicles

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)