Ayaan Naveed Malik

about me.

I am a senior at Stanford studying Computer Science and Philosophy. Previously, I was a research intern at NVIDIA GEAR, advised by Jim Fan and Yuke Zhu, where we scaled world models for embodied AI and invented WAMs. Before that, I was a research intern at Together AI working on omni-model training and optimization. Feel free to reach out at ayaan04 [at] stanford [dot] edu.

research philosophy

understanding through imagination.

work.

DreamZero: World Action Models are Zero-shot Policies

NVIDIA GEAR·ICLR 2026 Workshop on World ModelsOral

A World Action Model built on a pretrained video diffusion backbone that jointly predicts future world states and actions, achieving over 2x improvement in generalization to new tasks and environments compared to state-of-the-art VLAs in real-robot experiments, with real-time closed-loop control at 7Hz.

DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos

NVIDIA GEAR·ICML 2026Spotlight

A foundation world model pretrained on 44k hours of diverse human egocentric videos—the largest dataset to date for world model pretraining—that demonstrates strong generalization to diverse objects and environments, with stable real-time interactions at 10 FPS for over 1 minute after distillation.