Research Scientist, AI & Systems Co-design (PhD) Job at Meta (Menlo Park)

Research Scientist, HW/SW Co-Design (PhD)

Our teams’ mission is to explore, develop and help productionize high performanc...

Location

United States , Menlo Park

Salary:

122000.00 - 181000.00 USD / Year

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining a PhD degree in Computer Science, Electrical Engineering, Applied Mathematics, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
Theoretical background and practical experience with AI models (e.g., CNNs, Transformers, LLMs, Diffusion Models)
Research experience in one or more of the following areas: hardware-aware model enablement, performance modeling of AI systems or prevailing accelerators/silicon architectures
Experience in system-level performance analysis, profiling, and benchmarking of AI workloads
Hands-on proficiency with end-to-end AI hardware architecture or on-device mapping algorithm development, encompassing logic, architecture, and optimizations for performance, power, and area (PPA)
In-depth experience of Python and experience with at least one major AI framework
Track record of publishing research in peer-reviewed venues, with experience communicating technical results to both technical and non-technical stakeholders
Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta

Job Responsibility

Pioneer hardware-software co-design efforts for Meta's custom AI silicon, focusing on programmability, performance, and power efficiency
Integrate new silicon and system technologies into Meta's custom AI accelerator roadmap based on workload analysis and future model/GenAI requirements
Build system performance models and simulators to analyze options for Meta's custom datacenter infrastructure
Co-optimize deep learning kernels and primitives with hardware architects and internal compiler teams for maximum efficiency on Meta's hardware platform
Influence the hardware roadmap of Meta's custom AI accelerators
Architect and implement advanced frameworks and tooling to facilitate comprehensive comparative analyses across diverse system architectures
Lead cross-functional initiatives spanning multiple engineering organizations to drive high-impact technical milestones
Publish research results in recognized conferences (e.g., NeurIPS, ICML, ICLR, ASPLOS, ISCA, HPCA, MLSys, Micro)

What we offer

bonus
equity
benefits

Fulltime

Research Scientist Intern, AI & System Co-Design

The AI System SW/HW Co-design team’s mission is to explore, develop, and help pr...

Location

United States , Menlo Park

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining a PhD degree in the field of Computer Science or a related STEM field
Knowledge of Hardware Architecture and Distributed systems with interest in one or more of High Performance Computing, Numerics, Performance, and AI hardware including compute, networking, and storage
2+ years experience in one or more of High Performance Computing, Numerics, Performance and AI hardware including compute, networking and storage
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Job Responsibility

Lead and support research that accelerates ML applications over one or more of software, system and accelerator architectures, optimizing training and/or inference of next generation AI workloads here at Meta
Work towards long-term ambitious research goals, while identifying intermediate milestones
Lead and collaborate on research projects with other researchers and engineers across diverse disciplines
Communicate research agenda, progress and results
Influence progress of relevant research communities by producing publications

Research Scientist Intern, Multimodal Contextual AI (PhD)

At Reality Labs, our team brings novel experiences to life on Meta’s AR devices....

Location

United States , Redmond

Salary:

7313.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining a PhD in Computer Science, Electrical Engineering, or a related field
Programming and simulation experience with languages such as C/C++ and Python
Experience with computer architecture and HW/SW co-design and co-optimization
Must obtain work authorization in the country of employment at the time of hire, and maintain on-going work authorization during employment

Job Responsibility

Build and characterize experimental HW+SW systems on AR devices and device prototypes
Develop embedded firmware and software in RTOS and mobile operating systems, e.g. AOSP
Collaborate with other researchers and engineers across various disciplines

What we offer

Benefits

Fulltime

AI Research Scientist, CoreML - Monetization AI

We are the Monetization Ranking AI Research organization, dedicated to deliverin...

Location

United States , Sunnyvale

Salary:

122000.00 - 181000.00 USD / Year

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
Has obtained a PhD in Computer Science, Computer Engineering, Artificial Intelligence, Machine Learning, or relevant technical field
Experience holding an industry, faculty, or government researcher position
Research experience in natural language processing, large language modeling, deep learning, reinforcement learning, recommendations, ranking, search, or related areas
Publications in machine learning, artificial intelligence, or related field
Programming experience in Python and hands-on experience with frameworks such as PyTorch
Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment

Job Responsibility

Develop and implement large-scale model architectures, leveraging model scaling and transfer learning techniques
Prioritize training scalability and signal scaling to optimize model performance, efficiency, and reliability
Develop and apply NextGen sequence learning techniques to drive advancements in natural language processing and understanding
Design and implement generative modeling solutions for data augmentation
Research and develop graph-aware large language models
Develop and deploy AutoML pipelines
Apply Reinforcement Learning (RL) techniques, including long-term value optimization, RLHF, and RL4Reason
Use causal learning to identify and understand the cause and effect of relationships across data
Collaborate with cross-functional teams to design and optimize ML systems, leveraging expertise in hardware-software co-design, including quantization, compression, and resource-efficient AI, to drive performance improvements and efficiency gains
Develop and implement innovative solutions for data-related challenges, utilizing knowledge of semi/self-supervised learning, generative techniques, sampling, debiasing, domain adaptation, continual learning, data augmentation, cold-start, content understanding, and large language models

What we offer

bonus
equity
benefits

Research Scientist Intern, Multimodal Contextual AI

At Reality Labs, our team brings novel experiences to life on Meta’s AR devices....

Location

United States , Sunnyvale

Salary:

7313.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining a PhD in Computer Science, Electrical Engineering, or a related field
Programming and simulation experience with languages such as C/C++ and Python
Experience with computer architecture and HW/SW co-design and co-optimization
Must obtain work authorization in the country of employment at the time of hire, and maintain on-going work authorization during employment

Job Responsibility

Build and characterize experimental HW+SW systems on AR devices and device prototypes
Develop embedded firmware and software in RTOS and mobile operating systems, e.g. AOSP
Collaborate with other researchers and engineers across various disciplines

Research Scientist / Engineer – Foundation Model: Core Research

This is a rare and foundational opportunity to define the future of multimodal A...

Location

United States , Palo Alto

Salary:

250000.00 - 450000.00 USD / Year

Luma AI

Expiration Date

Until further notice

Requirements

A Bachelor's, Master's, or PhD degree in Computer Science, Machine Learning, Physics, or Mathematics is essential
A 'first-principles' intuition for scaling
Fluent in the language of frontier AI
Proven ability to design and rigorously analyze experiments and to articulate complex technical concepts effectively
Practical experience with distributed or high-performance computing environments, particularly managing and optimizing training runs on large-scale GPU clusters

Job Responsibility

Unified Modeling & Efficiency Drive the core research that powers all of Luma's products — co-designing multimodal representations, advancing core algorithms for long-context training, and establishing rigorous scaling laws to predict performance across compute budgets
Alignment & Evaluation Close the gap between training loss and user experience. Develop proxy tasks and automated metrics that serve as the compass for research decisions — ensuring our models optimize for what actually matters to users, not just benchmarks
Research Infrastructure Build the engine for high-velocity research. Maintain production-research parity, ensure reproducibility, and design systems for rapid experimentation — so that novel ideas go from hypothesis to validated result as fast as possible

Fulltime

Research Scientist Intern, MSL Infra Kernels & Optimizations

Meta’s Meta SuperIntelligence Labs (MSL) Infra Kernels & Optimizations (K&O) tea...

Location

United Kingdom , London

Salary:

Not provided

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Computer Vision, Generative AI, NLP, relevant technical field, or equivalent practical experience
Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Specialized experience in one or more of the following areas: Accelerators/GPU architectures, High Performance Computing (HPC), Machine Learning Compilers, Training/Inference ML Systems, Model Compression, Communication Collectives, ML Kernels/Operator optimizations, Machine learning frameworks (e.g. PyTorch) and SW/HW co-design
Experience developing AI-System infrastructure or AI algorithms in C/C++ or Python

Job Responsibility

Explore, prototype and productionize highly optimized ML kernels to unlock full potential of current and future accelerators for Meta’s AI workloads. Open source SOTA implementations as applicable
Explore, co-design and optimize parallelisms, compute efficiency, distributed training/inference paradigms and algorithms to improve the scalability, efficiency and reliability of inference and large-scale training systems
Optimize inference and training communications performance at scale and investigate improvements to algorithms, tooling, and interfaces, working across multiple accelerator types and HPC collective communication libraries such as NCCL, RCCL, UCC and MPI
Innovate and co-design novel model architectures for sustained scaling and hardware efficiency during training and inference
Benchmark, analyze, model and project the performance of AI workloads against a wide range of what-if scenarios and provide early input to the design of future hardware, models and runtime, giving crucial feedback to the architecture, compiler, kernel, modeling and runtime teams
Explore, co-design and productionize model compression techniques such as Quantization, Pruning, Distillation and Sparsity to improve training and inference efficiency
Collaborate with AI & Systems Co-design to guide Meta’s AI HW strategy

Research Scientist Intern, MSL Infra Kernels & Optimizations

Meta’s Meta SuperIntelligence Labs (MSL) Infra Kernels & Optimizations (K&O) tea...

Location

United States , Menlo Park

Salary:

7650.00 - 12134.00 USD / Month

Meta

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Computer Vision, Generative AI, NLP, relevant technical field, or equivalent practical experience
Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Specialized experience in one or more of the following areas: Accelerators/GPU architectures, High Performance Computing (HPC), Machine Learning Compilers, Training/Inference ML Systems, Model Compression, Communication Collectives, ML Kernels/Operator optimizations, Machine learning frameworks (e.g. PyTorch) and SW/HW co-design
Experience developing AI-System infrastructure or AI algorithms in C/C++ or Python

Job Responsibility

Explore, prototype and productionize highly optimized ML kernels to unlock full potential of current and future accelerators for Meta’s AI workloads. Open source SOTA implementations as applicable
Explore, co-design and optimize parallelisms, compute efficiency, distributed training/inference paradigms and algorithms to improve the scalability, efficiency and reliability of inference and large-scale training systems
Optimize inference and training communications performance at scale and investigate improvements to algorithms, tooling, and interfaces, working across multiple accelerator types and HPC collective communication libraries such as NCCL, RCCL, UCC and MPI
Innovate and co-design novel model architectures for sustained scaling and hardware efficiency during training and inference
Benchmark, analyze, model and project the performance of AI workloads against a wide range of what-if scenarios and provide early input to the design of future hardware, models and runtime, giving crucial feedback to the architecture, compiler, kernel, modeling and runtime teams
Explore, co-design and productionize model compression techniques such as Quantization, Pruning, Distillation and Sparsity to improve training and inference efficiency
Collaborate with AI & Systems Co-design to guide Meta’s AI HW strategy

Select Country

Research Scientist, AI & Systems Co-design (PhD)

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?