CrawlJobs Logo

Research Scientist, AI & Systems Co-design (PhD)

United States, Menlo Park Employment contract 122000.00 - 181000.00 USD / Year · Job Posted June 15, 2026
Apply Position
Job Link Share

Job Description

Our teams' mission is to explore, develop and help productionize high performance software & hardware technologies for AI at datacenter scale. We achieve this via concurrent design and optimization of many aspects of the system from models and runtime all the way to the AI hardware, optimizing across compute, network and storage. The team invests significantly into model optimization on existing accelerator systems and guiding the future of models and AI HW at Meta. This drives improved performance, new model architectures and reduces cost of ownership for all key AI services at Meta: Recommendations and Generative AI. This is an exciting space that spans exploration and productionization, coupled with close collaborations with industry, academia, Meta's Infrastructure and Product groups. Collaborating closely with product teams, the team's mode of operation is going from ideation and rapid prototyping, all the way to assisting productization of high leverage ideas, working with many partner teams to bring learnings from prototype into production. In addition to the real-world impact on billions of users of the Meta products, our team members have won Best Paper Awards at prestigious conferences such as ISCA, ASPLOS, SOSP, and OSDI, with multiple papers selected for IEEE Micro Top Picks. We regularly publish in ICML, NeurIPS, SC, HPCA, NSDI, VLDB, MLSys, and more. Overall, our work largely corresponds to the research communities of systems in general and especially systems for ML (MLSys, SOSP, OSDI, SIGCOMM, NSDI), hardware architecture (ISCA, ASPLOS), ML (NeurIPS, ICML, ICLR) and supercomputing (SC, ICS).

Job Responsibility

  • Explore, co-design and optimize parallelisms, compute efficiency, distributed training/inference paradigms and algorithms to improve the scalability, efficiency, and reliability of GenAI systems
  • Innovate and co-design novel model deployment techniques for sustained scaling and hardware efficiency during GenAI serving
  • Benchmark, analyze, model, and project the performance of AI workloads against a wide range of what-if scenarios and provide early input to the design of future hardware, models, and runtime, giving crucial feedback to the architecture, compiler, kernel, modeling, and runtime teams
  • Explore, prototype and productionize highly optimized ML kernels to unlock full potential of current and future accelerators for Meta's AI workloads
  • Influence the hardware roadmap of Meta's custom AI accelerators
  • Lead cross-functional initiatives spanning multiple engineering organizations to drive high-impact technical milestones
  • Guide Meta's AI HW requirements and design focusing on performance at System and Silicon levels. Co-design and optimize our AI HW and related software stack for Meta's future workloads, with technology pathfinding and evaluation of cutting-edge AI systems

Requirements

  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • PhD in Computer Science, Electrical Engineering, Applied Mathematics, or a related technical field, OR a Master's degree with 3+ years of relevant industry experience
  • Proven research experience in one or more of the following areas: hardware-aware model enablement, performance modeling of AI systems or prevailing accelerators/silicon architectures
  • Hands-on proficiency with end-to-end AI hardware architecture or on-device mapping algorithm development, encompassing logic, architecture, and optimizations for performance, power, and area (Power, Performance, and Area) (PPA)
  • Theoretical background and practical experience with AI models (e.g., CNNs, Transformers, LLMs, Diffusion models)
  • Experience in system-level performance analysis, profiling, and benchmarking of AI workloads
  • In-depth experience of Python and experience with at least one major AI framework
  • Track record of publishing research papers at peer-reviewed conferences or journals, and experience communicating technical results to cross-functional stakeholders

Nice to have

  • Experience with deploying AI agents/prevalining techniques for increased efficiency
  • Experience or knowledge of training/inference of large-scale deep learning models
  • Familiarity with low-level programming for specialized hardware (e.g., CUDA, HIP, Triton) or hardware description languages (HDL)
  • Experience or knowledge of distributed ML systems and algorithm development
  • Experience or knowledge of either Generative AI models such as LLMs/LDMs or Ranking & Recommendation models such as DLRM or equivalent

What we offer

  • bonus
  • equity
  • benefits

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Research Scientist, AI & Systems Co-design (PhD)

8 matching positions

Research Scientist, HW/SW Co-Design (PhD)

Our teams’ mission is to explore, develop and help productionize high performanc...
Location
Location
United States , Menlo Park
Salary
Salary:
122000.00 - 181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a PhD degree in Computer Science, Electrical Engineering, Applied Mathematics, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Theoretical background and practical experience with AI models (e.g., CNNs, Transformers, LLMs, Diffusion Models)
  • Research experience in one or more of the following areas: hardware-aware model enablement, performance modeling of AI systems or prevailing accelerators/silicon architectures
  • Experience in system-level performance analysis, profiling, and benchmarking of AI workloads
  • Hands-on proficiency with end-to-end AI hardware architecture or on-device mapping algorithm development, encompassing logic, architecture, and optimizations for performance, power, and area (PPA)
  • In-depth experience of Python and experience with at least one major AI framework
  • Track record of publishing research in peer-reviewed venues, with experience communicating technical results to both technical and non-technical stakeholders
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
Job Responsibility
Job Responsibility
  • Pioneer hardware-software co-design efforts for Meta's custom AI silicon, focusing on programmability, performance, and power efficiency
  • Integrate new silicon and system technologies into Meta's custom AI accelerator roadmap based on workload analysis and future model/GenAI requirements
  • Build system performance models and simulators to analyze options for Meta's custom datacenter infrastructure
  • Co-optimize deep learning kernels and primitives with hardware architects and internal compiler teams for maximum efficiency on Meta's hardware platform
  • Influence the hardware roadmap of Meta's custom AI accelerators
  • Architect and implement advanced frameworks and tooling to facilitate comprehensive comparative analyses across diverse system architectures
  • Lead cross-functional initiatives spanning multiple engineering organizations to drive high-impact technical milestones
  • Publish research results in recognized conferences (e.g., NeurIPS, ICML, ICLR, ASPLOS, ISCA, HPCA, MLSys, Micro)
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, AI & System Co-Design

The AI System SW/HW Co-design team’s mission is to explore, develop, and help pr...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a PhD degree in the field of Computer Science or a related STEM field
  • Knowledge of Hardware Architecture and Distributed systems with interest in one or more of High Performance Computing, Numerics, Performance, and AI hardware including compute, networking, and storage
  • 2+ years experience in one or more of High Performance Computing, Numerics, Performance and AI hardware including compute, networking and storage
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Lead and support research that accelerates ML applications over one or more of software, system and accelerator architectures, optimizing training and/or inference of next generation AI workloads here at Meta
  • Work towards long-term ambitious research goals, while identifying intermediate milestones
  • Lead and collaborate on research projects with other researchers and engineers across diverse disciplines
  • Communicate research agenda, progress and results
  • Influence progress of relevant research communities by producing publications
Read More
Arrow Right

Research Scientist Intern, Multimodal Contextual AI (PhD)

At Reality Labs, our team brings novel experiences to life on Meta’s AR devices....
Location
Location
United States , Redmond
Salary
Salary:
7313.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a PhD in Computer Science, Electrical Engineering, or a related field
  • Programming and simulation experience with languages such as C/C++ and Python
  • Experience with computer architecture and HW/SW co-design and co-optimization
  • Must obtain work authorization in the country of employment at the time of hire, and maintain on-going work authorization during employment
Job Responsibility
Job Responsibility
  • Build and characterize experimental HW+SW systems on AR devices and device prototypes
  • Develop embedded firmware and software in RTOS and mobile operating systems, e.g. AOSP
  • Collaborate with other researchers and engineers across various disciplines
What we offer
What we offer
  • Benefits
  • Fulltime
Read More
Arrow Right

AI Research Scientist, CoreML - Monetization AI

We are the Monetization Ranking AI Research organization, dedicated to deliverin...
Location
Location
United States , Sunnyvale
Salary
Salary:
122000.00 - 181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Has obtained a PhD in Computer Science, Computer Engineering, Artificial Intelligence, Machine Learning, or relevant technical field
  • Experience holding an industry, faculty, or government researcher position
  • Research experience in natural language processing, large language modeling, deep learning, reinforcement learning, recommendations, ranking, search, or related areas
  • Publications in machine learning, artificial intelligence, or related field
  • Programming experience in Python and hands-on experience with frameworks such as PyTorch
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Develop and implement large-scale model architectures, leveraging model scaling and transfer learning techniques
  • Prioritize training scalability and signal scaling to optimize model performance, efficiency, and reliability
  • Develop and apply NextGen sequence learning techniques to drive advancements in natural language processing and understanding
  • Design and implement generative modeling solutions for data augmentation
  • Research and develop graph-aware large language models
  • Develop and deploy AutoML pipelines
  • Apply Reinforcement Learning (RL) techniques, including long-term value optimization, RLHF, and RL4Reason
  • Use causal learning to identify and understand the cause and effect of relationships across data
  • Collaborate with cross-functional teams to design and optimize ML systems, leveraging expertise in hardware-software co-design, including quantization, compression, and resource-efficient AI, to drive performance improvements and efficiency gains
  • Develop and implement innovative solutions for data-related challenges, utilizing knowledge of semi/self-supervised learning, generative techniques, sampling, debiasing, domain adaptation, continual learning, data augmentation, cold-start, content understanding, and large language models
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Research Scientist Intern, Multimodal Contextual AI

At Reality Labs, our team brings novel experiences to life on Meta’s AR devices....
Location
Location
United States , Sunnyvale
Salary
Salary:
7313.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a PhD in Computer Science, Electrical Engineering, or a related field
  • Programming and simulation experience with languages such as C/C++ and Python
  • Experience with computer architecture and HW/SW co-design and co-optimization
  • Must obtain work authorization in the country of employment at the time of hire, and maintain on-going work authorization during employment
Job Responsibility
Job Responsibility
  • Build and characterize experimental HW+SW systems on AR devices and device prototypes
  • Develop embedded firmware and software in RTOS and mobile operating systems, e.g. AOSP
  • Collaborate with other researchers and engineers across various disciplines
Read More
Arrow Right

Research Scientist / Engineer – Foundation Model: Core Research

This is a rare and foundational opportunity to define the future of multimodal A...
Location
Location
United States , Palo Alto
Salary
Salary:
250000.00 - 450000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A Bachelor's, Master's, or PhD degree in Computer Science, Machine Learning, Physics, or Mathematics is essential
  • A 'first-principles' intuition for scaling
  • Fluent in the language of frontier AI
  • Proven ability to design and rigorously analyze experiments and to articulate complex technical concepts effectively
  • Practical experience with distributed or high-performance computing environments, particularly managing and optimizing training runs on large-scale GPU clusters
Job Responsibility
Job Responsibility
  • Unified Modeling & Efficiency Drive the core research that powers all of Luma's products — co-designing multimodal representations, advancing core algorithms for long-context training, and establishing rigorous scaling laws to predict performance across compute budgets
  • Alignment & Evaluation Close the gap between training loss and user experience. Develop proxy tasks and automated metrics that serve as the compass for research decisions — ensuring our models optimize for what actually matters to users, not just benchmarks
  • Research Infrastructure Build the engine for high-velocity research. Maintain production-research parity, ensure reproducibility, and design systems for rapid experimentation — so that novel ideas go from hypothesis to validated result as fast as possible
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, MSL Infra Kernels & Optimizations

Meta’s Meta SuperIntelligence Labs (MSL) Infra Kernels & Optimizations (K&O) tea...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Computer Vision, Generative AI, NLP, relevant technical field, or equivalent practical experience
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
  • Specialized experience in one or more of the following areas: Accelerators/GPU architectures, High Performance Computing (HPC), Machine Learning Compilers, Training/Inference ML Systems, Model Compression, Communication Collectives, ML Kernels/Operator optimizations, Machine learning frameworks (e.g. PyTorch) and SW/HW co-design
  • Experience developing AI-System infrastructure or AI algorithms in C/C++ or Python
Job Responsibility
Job Responsibility
  • Explore, prototype and productionize highly optimized ML kernels to unlock full potential of current and future accelerators for Meta’s AI workloads. Open source SOTA implementations as applicable
  • Explore, co-design and optimize parallelisms, compute efficiency, distributed training/inference paradigms and algorithms to improve the scalability, efficiency and reliability of inference and large-scale training systems
  • Optimize inference and training communications performance at scale and investigate improvements to algorithms, tooling, and interfaces, working across multiple accelerator types and HPC collective communication libraries such as NCCL, RCCL, UCC and MPI
  • Innovate and co-design novel model architectures for sustained scaling and hardware efficiency during training and inference
  • Benchmark, analyze, model and project the performance of AI workloads against a wide range of what-if scenarios and provide early input to the design of future hardware, models and runtime, giving crucial feedback to the architecture, compiler, kernel, modeling and runtime teams
  • Explore, co-design and productionize model compression techniques such as Quantization, Pruning, Distillation and Sparsity to improve training and inference efficiency
  • Collaborate with AI & Systems Co-design to guide Meta’s AI HW strategy
Read More
Arrow Right

Research Scientist Intern, MSL Infra Kernels & Optimizations

Meta’s Meta SuperIntelligence Labs (MSL) Infra Kernels & Optimizations (K&O) tea...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Computer Vision, Generative AI, NLP, relevant technical field, or equivalent practical experience
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
  • Specialized experience in one or more of the following areas: Accelerators/GPU architectures, High Performance Computing (HPC), Machine Learning Compilers, Training/Inference ML Systems, Model Compression, Communication Collectives, ML Kernels/Operator optimizations, Machine learning frameworks (e.g. PyTorch) and SW/HW co-design
  • Experience developing AI-System infrastructure or AI algorithms in C/C++ or Python
Job Responsibility
Job Responsibility
  • Explore, prototype and productionize highly optimized ML kernels to unlock full potential of current and future accelerators for Meta’s AI workloads. Open source SOTA implementations as applicable
  • Explore, co-design and optimize parallelisms, compute efficiency, distributed training/inference paradigms and algorithms to improve the scalability, efficiency and reliability of inference and large-scale training systems
  • Optimize inference and training communications performance at scale and investigate improvements to algorithms, tooling, and interfaces, working across multiple accelerator types and HPC collective communication libraries such as NCCL, RCCL, UCC and MPI
  • Innovate and co-design novel model architectures for sustained scaling and hardware efficiency during training and inference
  • Benchmark, analyze, model and project the performance of AI workloads against a wide range of what-if scenarios and provide early input to the design of future hardware, models and runtime, giving crucial feedback to the architecture, compiler, kernel, modeling and runtime teams
  • Explore, co-design and productionize model compression techniques such as Quantization, Pruning, Distillation and Sparsity to improve training and inference efficiency
  • Collaborate with AI & Systems Co-design to guide Meta’s AI HW strategy
Read More
Arrow Right