Key Words for our project
Our project introduces a reinforcement-learning–based noise-steering approach that controls the behavior of diffusion driving models without modifying diffusion weights, enabling adaptive, style-aware, and efficient autonomous driving.
Autonomous Driving
Building reliable, adaptive driving systems that can handle diverse road conditions and human behaviors.
Diffusion Policy
A generative policy framework that produces smooth, multi-mode action sequences through iterative denoising..
Reinforcement Learning
Learning to make better decisions over time by optimizing rewards from driving performance and safety.
Noise Steering
Controlling the latent noise of diffusion models to shape driving behaviors without modifying model weights.
DSRL / Latent Actor-Critic
An efficient RL method operating directly in latent space, enabling stable noise selection and fine-grained control.
Driving Style Adaptation
Adapting the driving policy to calm, aggressive, or city-specific styles without retraining the full diffusion model.
Key insights of our project
We shift reinforcement learning from the action space to the latent noise space, enabling efficient, stable, and lightweight fine-tuning without modifying the pretrained diffusion-based driving model weights.
Reinforcement Learning can help
- No deep-nested backprogation
- Lower computation
- Better performance


From Action Space to Noise Space
- No nested backprop through diffusion
- No modification to diffusion weights
- Lightweight latent-space policy only
“Driving the world
toward Physical AI.”
Junhong Zhou
MSCV 25′ @ CMU RI
