This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
This is the role at the center of the thesis. Luma already trains the strongest generative video models in the industry; the next step is turning those models into world models — interactive, controllable, physically faithful, and useful as a substrate for embodied reasoning. As a Research Scientist on the World Models team, you'll work on the next generation of generative models that can be rolled out as worlds.
Job Responsibility
Invent next-generation world model architectures — diffusion, transformer, autoregressive, or hybrid — with a particular focus on controllability and physical consistency
Develop controllability mechanisms that let an agent step into the world: action conditioning, view conditioning, long-horizon rollouts
Define and own the metrics: physical fidelity, long-horizon coherence, action-following, and downstream usefulness for policy training
Run scaling studies that tell us where compute, data, and architecture pay off
Publish at the frontier
contribute to the open-source release that is the long-term deliverable
Requirements
PhD or equivalent research record in ML, computer vision, robotics, or related fields
Deep expertise in at least one of: large-scale generative modeling (video/3D/world), self-supervised representation learning, model-based RL
Strong PyTorch and large-scale training experience — you've trained models that hit the limits of a multi-node cluster
A research record the field knows (top-venue publications and/or widely-used open releases)
Nice to have
Prior work on world models, model-based RL, generative video, neural simulation, or 4D scene representations
Experience using generative models for downstream embodied tasks (planning, control, evaluation)