The Meta Reality Labs Research Team is composed of world-class researchers, developers, and engineers dedicated to shaping the future of AR/VR and machine perception. The Surreal Vision group at RL Research is seeking exceptional Research Scientist interns to contribute to the development of an egocentric AI system. This system will form the foundation for contextual-AI-enabled AR devices and humanoid robots. As a research intern, you will tackle cutting-edge research challenges, innovating novel computer vision and machine learning techniques.
Your research project may cover or relate to the following topics:
- Egocentric vision language model for long-context 3D scene understanding
- Utilizing memory for more consistent and accurate future state prediction using visual language action models or world models
- Exploring novel learning strategies to improve the quality and generalization of visual language action models or world models with egocentric data
- 4D generation & reconstruction of dynamic scenes
Our internships are twelve (12) to twenty-four (24) weeks long and we have various start dates throughout the year. Some projects may require a minimum of 24 consecutive weeks.
Job Responsibilities:
Plan and execute cutting-edge research and development to advance the state-of-the-art in machine perception, future prediction, 4D scene understanding & reconstruction, and robotics
Collaborate with other researchers and engineers across machine perception teams at Meta to develop experiments, prototypes, and concepts that advance the state-of-the-art in AR/VR and AI systems
Work with the team to help design, set up, and run practical experiments and prototype systems related to large-scale, high-quality sensing and machine reasoning
Requirements:
Currently has, or is in the process of obtaining, a PhD degree in computer vision, machine learning, robotics, or computer graphics
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Knowledge of and hands-on experience in 3D computer vision
Hands-on experience implementing large foundation models and generative models, such as LLMs, VLMs, video diffusion models, LRMs, world models, and VLAs, as well as reinforcement learning
Experience working in Python with frameworks such as PyTorch
Experience working in a Unix environment
Nice to have:
Ability to work 24 consecutive weeks
Proven track record of achieving significant results, as demonstrated by grants, fellowships, patents, or first-authored publications at leading workshops or conferences such as CVPR/ECCV/ICCV, ICLR, NeurIPS, CoRL/RSS/ICRA/IROS, SIGGRAPH/SIGGRAPH Asia, etc.
Strong track record of published research in the fields of generative modeling, large foundation models, robotics, neural reconstruction, and neural rendering
Strong programming experience using Python and PyTorch
Demonstrated software engineering experience via an internship, work experience, coding competitions, or widely used contributions to open-source repositories (e.g., GitHub)
Intent to return to a degree program after the completion of the internship
Experience working and communicating cross-functionally in a team environment