We build on NVIDIA’s Cosmos Diffusion-based World Foundation Models, with the following architecture:




Latent Diffusion Models for 3D-aware Multi-modal Video Generation
Authors: Sheldon Liang, Yinghao Zhang; Advisor: John Galeotti, Ethan He
We build on NVIDIA’s Cosmos Diffusion-based World Foundation Models, with the following architecture:


