Research Scientist / Engineer – Performance Optimization

Location: United States, Palo Alto
Salary: 187500.00 - 395000.00 USD / Year
Job Posted: January 13, 2026

Job Description

The Performance Optimization team at Luma is dedicated to maximizing the efficiency and performance of our AI models. Working closely with both research and engineering teams, this group ensures that our cutting-edge multimodal models can be trained efficiently and deployed at scale while maintaining the highest quality standards.

Job Responsibility

  • Profile and optimize GPU/CPU/Accelerator code for maximum utilization and minimal latency
  • Write high-performance PyTorch, Triton, and CUDA code, resorting to custom PyTorch operations if necessary
  • Develop fused kernels and leverage tensor cores and other modern hardware features for optimal utilization across different hardware platforms
  • Optimize model architectures and implementations for distributed multi-node production deployment
  • Build performance monitoring and analysis tools and automation
  • Research and implement cutting-edge optimization techniques for transformer models

Requirements

  • Expert-level proficiency in Triton/CUDA programming and GPU optimization
  • Strong PyTorch skills
  • Experience with PyTorch kernel development and custom operations
  • Proficiency with profiling tools (NVIDIA Nsight, torch profiler, custom tooling)
  • Deep understanding of transformer architectures and attention mechanisms

Nice to have

  • Experience with compilers/exporters such as torch.compile, TensorRT, ONNX, XLA
  • Experience optimizing inference workloads for latency and throughput
  • Experience with Triton compiler and kernel fusion techniques
  • Knowledge of warp-level intrinsics and advanced CUDA optimization

Similar Jobs for

Research Scientist / Engineer – Performance Optimization

8 matching positions

Research Engineer / Research Scientist - Foundations Retrieval Lead

The Foundations Research team works on high-risk, high-reward ideas that could s...
Location: United States, San Francisco
Salary: 445000.00 - 555000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Requirements
  • Proven experience leading high-performance teams of researchers or engineers in ML infrastructure or foundational research
  • Deep technical expertise in representation learning, embedding models, or vector retrieval systems
  • Familiarity with transformer-based LLMs and how embedding spaces can interact with language model objectives
  • Research experience in areas such as contrastive learning, supervised or unsupervised embedding learning, or metric learning
  • A track record of building or scaling large machine learning systems, particularly embedding pipelines in production or research contexts
  • A first-principles mindset for challenging assumptions about how retrieval and memory should work for large models
Job Responsibility
  • Lead research into embedding models and retrieval systems optimized for grounding, relevance, and adaptive reasoning
  • Manage a team of researchers and engineers building end-to-end infrastructure for training, evaluating, and integrating embeddings into frontier models
  • Drive innovation in dense, sparse, and hybrid representation techniques, metric learning, and learning-to-retrieve systems
  • Collaborate closely with Pretraining, Inference, and other Research teams to integrate retrieval throughout the model lifecycle
  • Contribute to OpenAI’s long-term vision of AI systems with memory and knowledge access capabilities rooted in learned representations
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
Employment type: Full-time

Research Engineer / Research Scientist, Post-Training

The Post-Training team is responsible for training and improving pre-trained mod...
Location: United States, San Francisco
Salary: 295000.00 - 555000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Requirements
  • Deep understanding of machine learning and machine learning applications
  • Working knowledge of relevant models, and building evaluations for model capability improvement
  • Comfortable diving into a large ML codebase to debug
  • Thrive in a dynamic and technically complex environment
  • Strong ML engineering skills and research experience, especially with novel and highly capable models
  • Passionate about product-driven research
Job Responsibility
  • Own and pursue a research agenda to improve model capability and performance
  • Collaborate closely with the other research and product teams, allowing customers to optimize their own models
  • Build robust evaluations for tracking modeling improvements
  • Design, implement, test, and debug code across our research stack
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
Employment type: Full-time

Research Scientist / Engineer – Foundation Model: Core Research

This is a rare and foundational opportunity to define the future of multimodal A...
Location: United States, Palo Alto
Salary: 250000.00 - 450000.00 USD / Year
Company: Luma AI
Expiration Date: Until further notice
Requirements
  • A Bachelor's, Master's, or PhD degree in Computer Science, Machine Learning, Physics, or Mathematics is essential
  • A 'first-principles' intuition for scaling
  • Fluent in the language of frontier AI
  • Proven ability to design and rigorously analyze experiments and to articulate complex technical concepts effectively
  • Practical experience with distributed or high-performance computing environments, particularly managing and optimizing training runs on large-scale GPU clusters
Job Responsibility
  • Unified Modeling & Efficiency: Drive the core research that powers all of Luma's products by co-designing multimodal representations, advancing core algorithms for long-context training, and establishing rigorous scaling laws to predict performance across compute budgets
  • Alignment & Evaluation: Close the gap between training loss and user experience. Develop proxy tasks and automated metrics that serve as the compass for research decisions, ensuring our models optimize for what actually matters to users, not just benchmarks
  • Research Infrastructure: Build the engine for high-velocity research. Maintain production-research parity, ensure reproducibility, and design systems for rapid experimentation, so that novel ideas go from hypothesis to validated result as fast as possible
Employment type: Full-time

Research Engineer / Scientist - Post-training

At Luma, the Post-training team is responsible for unlocking creative control in...
Location: United States, Palo Alto
Salary: 187500.00 - 395000.00 USD / Year
Company: Luma AI
Expiration Date: Until further notice
Requirements
  • Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologies
  • Demonstrated ability to do independent research in Academic or Industry settings
  • Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architectures
  • Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization
  • Strong orientation toward applied AI implementations with emphasis on translating product requirements into technical solutions, coupled with exceptional visual discrimination and dedicated focus on enhancing visual fidelity and aesthetic quality of generated content
  • Proficiency in accelerated prototyping and demonstration development for emerging features, facilitating efficient iteration cycles and comprehensive stakeholder evaluation prior to production implementation
  • Established track record of effective cross-functional teamwork, including successful partnerships with teams spanning Product, Design, Evaluation, Applied, and creative specialists
Job Responsibility
  • Optimize Luma's image and video generative models through targeted fine-tuning to improve visual quality, instruction adherence, and overall performance metrics
  • Implement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standards
  • Partner closely with the Applied Research team to identify product requirements, understand diverse use cases across Luma's platforms, and execute targeted fine-tuning initiatives to address performance gaps and enhance user-facing capabilities
  • Conduct comprehensive side-by-side evaluations comparing model performance against leading market competitors, systematically analyzing the impact of post-training techniques on downstream performance metrics and identifying areas for improvement
  • Develop advanced post-training capabilities for Luma’s video models including Camera control, Object & character Reference, Image & Video Editing, Human Performance & Motion Transfer Approaches
  • Architect data processing pipelines for large-scale video and image datasets, implementing filtering, balancing, and captioning systems to ensure training data quality across diverse content categories
  • Research and deploy cutting-edge diffusion sampling methodologies and hyperparameter optimization strategies to achieve superior performance on established visual quality benchmarks
  • Research emerging post-training methodologies in generative AI, evaluate their applicability to Luma's product ecosystem, and integrate promising techniques into our Post-training recipe
Employment type: Full-time

Research Scientist Intern, Silicon Performance Architecture

Reality Labs (RL) focuses on delivering Meta's vision through AI-first devices t...
Location: United States, Redmond
Salary: 7313.00 - 12134.00 USD / Month
Company: Meta
Expiration Date: Until further notice
Requirements
  • Currently has, or is in the process of obtaining, a PhD in Computer Science, Electrical Engineering or related field
  • Experience with programming in C/C++ and Python (scripting)
  • Experience in computer architecture (NoC, DRAM, Cache, MMU)
  • Understanding of how to leverage performance modeling to support architectural exploration, with exposure to heterogeneous hardware architectures
  • Understanding of HW power, performance, and area trade-offs
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
  • Build APIs and/or custom wrappers to integrate various performance modeling components, and develop test suites to formally verify the functionality and performance of the integrated models
  • Identify areas of optimization to increase the speed of simulation infrastructure either through code refactoring or code restructuring
  • Conduct performance & power explorations of various architectural components using built performance models (including blocks such as NoC, DRAM, MMUs, etc)

Research Scientist, Sensors and Systems Research

We are creating world-class consumer wearable experiences - as a member of Meta’...
Location: United States, Redmond
Salary: 122000.00 - 181000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Currently has, or is in the process of obtaining a PhD degree in Computer Science, Electrical Engineering, Mechanical Engineering, Systems Engineering, Optical Sciences, Physics, Wireless Communications, and/or relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Experience with modeling and analysis
  • C++ or Python programming experience in scientific computing and algorithm development and deployment
  • 1+ years experience in developing machine learning models at scale from inception to generating business impact
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
  • Pioneer fundamental “0 to 1” research in physical sciences (hardware, systems, materials, processes etc)
  • Drive long-term research roadmaps (5–10+ years), establish feasibility of breakthrough technologies, and create initial enablers
  • Conceive, design, and prototype advanced concepts for next-generation Hardware products
  • Develop and optimize hardware, software and/or ML models to predict and enhance performance across multiple HW domains
  • Collaborate with cross-functional teams to advance the entire product pipeline (hardware, software, integration, infrastructure, and applications)
  • Architect and integrate complex hardware/software systems
  • Develop and refine algorithms while resolving discrepancies between model predictions and measured performance data across all technology domains
  • Prototype and characterize new technologies across a variety of HW platforms
  • Utilize scientific programming languages for modeling, simulation, algorithm development, and data analysis
  • Analyze experimental and simulation data to inform design decisions and optimize system performance
What we offer
  • bonus
  • equity
  • benefits
Employment type: Full-time

Research Scientist, HW/SW Co-Design (PhD)

Our teams’ mission is to explore, develop and help productionize high performanc...
Location: United States, Menlo Park
Salary: 122000.00 - 181000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Currently has, or is in the process of obtaining a PhD degree in Computer Science, Electrical Engineering, Applied Mathematics, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Theoretical background and practical experience with AI models (e.g., CNNs, Transformers, LLMs, Diffusion Models)
  • Research experience in one or more of the following areas: hardware-aware model enablement, performance modeling of AI systems or prevailing accelerators/silicon architectures
  • Experience in system-level performance analysis, profiling, and benchmarking of AI workloads
  • Hands-on proficiency with end-to-end AI hardware architecture or on-device mapping algorithm development, encompassing logic, architecture, and optimizations for performance, power, and area (PPA)
  • In-depth experience of Python and experience with at least one major AI framework
  • Track record of publishing research in peer-reviewed venues, with experience communicating technical results to both technical and non-technical stakeholders
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
Job Responsibility
  • Pioneer hardware-software co-design efforts for Meta's custom AI silicon, focusing on programmability, performance, and power efficiency
  • Integrate new silicon and system technologies into Meta's custom AI accelerator roadmap based on workload analysis and future model/GenAI requirements
  • Build system performance models and simulators to analyze options for Meta's custom datacenter infrastructure
  • Co-optimize deep learning kernels and primitives with hardware architects and internal compiler teams for maximum efficiency on Meta's hardware platform
  • Influence the hardware roadmap of Meta's custom AI accelerators
  • Architect and implement advanced frameworks and tooling to facilitate comprehensive comparative analyses across diverse system architectures
  • Lead cross-functional initiatives spanning multiple engineering organizations to drive high-impact technical milestones
  • Publish research results in recognized conferences (e.g., NeurIPS, ICML, ICLR, ASPLOS, ISCA, HPCA, MLSys, Micro)
What we offer
  • bonus
  • equity
  • benefits
Employment type: Full-time

Senior Data Scientist - Optimization

NTT DATA is seeking a client-facing senior level Data Scientist with deep expert...
Location: United States, Remote
Salary: 144975.00 - 241265.00 USD / Year
Company: NTT DATA
Expiration Date: Until further notice
Requirements
  • 7+ years of experience supporting data science projects in a consulting or professional services environment
  • 7+ years of experience across one or more of the following areas: Predictive Analytics, Data Design, Statistics, AI / Machine Learning, MLOps
  • 2+ years of hands-on experience in Operations Research, Optimization, and Mathematical Modeling
  • 3+ years of experience using Python, R, and/or C# to analyze disparate datasets and develop analytical solutions
  • Ability to travel up to 25%
Job Responsibility
  • Lead and deliver client-facing data science and optimization engagements, ensuring high satisfaction and measurable business outcomes
  • Define project objectives, scope, timelines, and success metrics in collaboration with client stakeholders and internal teams
  • Establish and maintain strong executive-level client relationships, gaining a deep understanding of business challenges and operational constraints
  • Communicate complex mathematical, optimization, and technical concepts clearly and credibly to non-technical and senior audiences
  • Prepare and deliver executive-ready updates, insights, and recommendations, including sensitivity analyses and scenario-based findings
  • Conduct market research, develop informed perspectives, and communicate thought leadership to clients and internal stakeholders
  • Lead the design, formulation, and delivery of operations research and optimization models for large, complex enterprises
  • Translate ambiguous business problems into rigorous mathematical formulations and scalable optimization solutions
  • Develop and refine models across domains such as: Resource allocation, Scheduling and workforce optimization, Cost and network optimization, Vehicle routing, Strategic planning (e.g., facility opening/closing, multi-period network modeling)
  • Apply deep expertise in optimization solvers (e.g., Gurobi preferred, CPLEX or similar) to deliver robust, production-ready solutions
Employment type: Full-time