AI Research Scientist, Multimodal Generation Job at Meta

Research Scientist Intern, AI Research - Multimodal Pretraining

Meta is seeking Research Scientist Interns in the multimodal pretraining team in...

Location

United States , Menlo Park

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Machine Learning, Computer Vision, Artificial Intelligence, or relevant technical field
Past projects/publications in the general domain of neural scaling laws, model architectures, image/text modeling, vision-language modeling
Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Experience in PyTorch, Triton, or other related programming languages
Experience building systems based on machine learning and/or deep learning methods

Job Responsibility

Perform research to advance the frontiers of multimodal (images, video, text, audio, and other modalities) pretraining, to develop the next generation of multimodal architectures
Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
Publish research results and contribute to research that can be applied to Meta product development

Ai Research Scientist, Video Generation And Post Training, Fair

Meta is seeking a Research Scientist to join the Fundamental AI Research (FAIR) ...

Location

United States , Menlo Park

Salary:

154000.00 - 217000.00 USD / Year

Meta

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
PhD or equivalent experience in Computer Science, Electrical Engineering, or a related field
Demonstrated expertise in video generation, computer vision, or multimodal AI
Experience with large-scale model training, post-training optimization techniques, and data curation
Publication record in relevant fields

Job Responsibility

Conduct fundamental and applied research in video generation, including generative models, video synthesis, and multimodal learning
Develop and optimize post-training paradigms for large-scale video and multimodal models, improving their performance, robustness, and generalization
Collaborate with teams across Meta to build perceptual foundations for real-time embodied agents and conversational AI
Contribute to the development and deployment of frontier models (e.g., Llama, LMMs) and push the boundaries of video and media generation

What we offer

bonus
equity
benefits

Fulltime

Research Scientist Intern, Multimodal AI (PhD)

The Meta Reality Labs Research Team brings together a world-class team of resear...

Location

United States , Redmond

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Artificial Intelligence, Generative AI, Transformer Models, Machine Learning, Signal Processing or Computer vision
3+ years experience with Python, Matlab, or similar
3+ years experience with machine learning software platforms such as PyTorch, TensorFlow, etc
Experience building novel audio computational models and LLM
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Job Responsibility

Design, implement, and maintain comprehensive evaluation protocols for large language models, including both automated and human-in-the-loop assessments
Develop and curate high-quality datasets and benchmarks to measure model performance, safety, fairness, and robustness across a variety of tasks and modalities
Analyze model outputs to identify strengths, weaknesses, and failure modes, and provide actionable insights to research and engineering teams
Design and implementation of novel algorithms to solve audio research problems
Collaboration with teams building Meta’s language AI products
Collaborate with researchers, engineers, and cross-functional partners to define evaluation goals, communicate findings, and drive improvements in model quality
Develop tools and infrastructure to streamline and scale evaluation processes, including dashboards, annotation platforms, and reporting systems
Stay up-to-date with the latest research in audio LLM evaluation, benchmarking, and responsible AI, and incorporate best practices into Meta’s workflows
Disseminate evaluation results through internal reports, presentations, and, when appropriate, external publications

What we offer

benefits

Fulltime

Research Scientist Intern, Multimodal AI

The Meta Reality Labs Research Team brings together a world-class team of resear...

Location

United States , Redmond

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Artificial Intelligence, Generative AI, Transformer Models, Machine Learning, Signal Processing or Computer vision
3+ years experience with Python, Matlab, or similar
3+ years experience with machine learning software platforms such as PyTorch, TensorFlow, etc
Experience building novel audio computational models and LLM
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Job Responsibility

Design, implement, and maintain comprehensive evaluation protocols for large language models, including both automated and human-in-the-loop assessments
Develop and curate high-quality datasets and benchmarks to measure model performance, safety, fairness, and robustness across a variety of tasks and modalities
Analyze model outputs to identify strengths, weaknesses, and failure modes, and provide actionable insights to research and engineering teams
Design and implementation of novel algorithms to solve audio research problems
Collaboration with teams building Meta’s language AI products.. Collaborate with researchers, engineers, and cross-functional partners to define evaluation goals, communicate findings, and drive improvements in model quality
Develop tools and infrastructure to streamline and scale evaluation processes, including dashboards, annotation platforms, and reporting systems
Stay up-to-date with the latest research in audio LLM evaluation, benchmarking, and responsible AI, and incorporate best practices into Meta’s workflows
Disseminate evaluation results through internal reports, presentations, and, when appropriate, external publications

AI Research Scientist (Technical Leadership), Multimodal - Monetization GenAI

The Monetization GenAI Video Gen & Visual Search group, part of the Ads pillar, ...

Location

United States , Menlo Park, CA

Salary:

219000.00 - 301000.00 USD / Year

Meta

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
Has obtained a PhD in Computer Science, AI/ML, or a relevant technical field
Experience as a technical lead, driving major technical initiatives with cross-functional impact and influencing strategy across multiple teams
4+ years of experience training large language and/or vision models, with extensive and recent experience training multimodal LLMs
Research expertise in video generation/understanding, multimodal learning, or diffusion models
Demonstrated significant industry influence in the field of AI and/or recently published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV)

Job Responsibility

Lead end-to-end AI research and model development for video-centric generative AI across Meta's advertising surfaces
Drive advancements in video generation & enhancement
Develop video-to-video & audio generation capabilities
Advance video & visual understanding through novel research
Conduct foundation model research to support generative AI innovation
Define research agendas and pioneer new directions in video/audio generation and multimodal understanding

What we offer

bonus
equity
benefits

Fulltime

Research Scientist Intern, Multimodal Generative AI and Robotics

The research intern will work on cutting edge research problems to innovate nove...

Location

United States , Redmond

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining a PhD degree in the domain of computer-vision, computer graphics, 3D machine perception or deep learning
Knowledge in deep learning, computer vision, graphics, generative modeling, LLMs and VLMs
Hands-on experience with implementing deep learning algorithms, large-scale training, benchmark and evaluation
Experience working within Python environments such as pytorch
Experience working in a Unix environment
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Job Responsibility

Plan and execute cutting-edge research and development to advance the state-of-the-art in machine learning and large-scale training
Collaborate with other researchers and engineers across machine perception teams at Meta to develop experiments, prototypes, and concepts that advance the state-of-the-art contextual AI and robotic systems
Work with the team to help design, setup, and run practical experiments and prototype systems related to large-scale high-quality sensing and machine reasoning

Fulltime

Research Scientist Intern, AI Research - CoreML - World Models

Meta is seeking Research Interns to join the SAM team in the Multimedia Percepti...

Location

United States , Menlo Park

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining, a PhD degree in Computer Vision, Machine Learning, Artificial Intelligence, or relevant technical field
Research and/or work experience in Generative Modeling and Computer Vision. In particular: video generation, 3D/4D reconstruction, video and image understanding, vision-language foundation models, representation learning, and related areas
Research and/or work experience in Machine Learning or Deep Learning with applications to perception
Experience in Python, C++, or other related languages
Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment

Job Responsibility

Perform research to advance the science and technology of generative AI
Perform research that enables learning to predict and condition on multimodal data (video, 3D structures, primarily images, text, and other modalities like audio)
Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem
Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions
Contribute to ongoing research projects and impactful technology releases
Draft and polish research publications
Present research outcomes to internal and/or external audiences

Research Scientist Intern, AI Research - World Models

Meta is seeking Research Interns to join the SAM team in the Multimedia Percepti...

Location

United States , Menlo Park

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining, a PhD degree in Computer Vision, Machine Learning, Artificial Intelligence, or relevant technical field
Research and/or work experience in Generative Modeling and Computer Vision. In particular: video generation, 3D/4D reconstruction, video and image understanding, vision-language foundation models, representation learning, and related areas
Research and/or work experience in Machine Learning or Deep Learning with applications to perception
Experience in Python, C++, or other related languages
Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment

Job Responsibility

Perform research to advance the science and technology of generative AI
Perform research that enables learning to predict and condition on multimodal data (video, 3D structures, primarily images, text, and other modalities like audio)
Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem
Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions
Contribute to ongoing research projects and impactful technology releases
Draft and polish research publications
Present research outcomes to internal and/or external audiences

Select Country

AI Research Scientist, Multimodal Generation

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?