Research Scientist / Engineer – Performance Optimization

Luma AI

Location:
United States, Palo Alto

Contract Type:
Not provided

Salary:
187500.00 - 395000.00 USD / Year

Job Description:

The Performance Optimization team at Luma is dedicated to maximizing the efficiency and performance of our AI models. Working closely with both research and engineering teams, this group ensures that our cutting-edge multimodal models can be trained efficiently and deployed at scale while maintaining the highest quality standards.

Job Responsibilities:

  • Profile and optimize GPU/CPU/Accelerator code for maximum utilization and minimal latency
  • Write high-performance PyTorch, Triton, and CUDA code, falling back to custom PyTorch operations where necessary
  • Develop fused kernels and leverage tensor cores and modern hardware features for optimal hardware utilization on different hardware platforms
  • Optimize model architectures and implementations for distributed multi-node production deployment
  • Build performance monitoring and analysis tools and automation
  • Research and implement cutting-edge optimization techniques for transformer models

Requirements:

  • Expert-level proficiency in Triton/CUDA programming and GPU optimization
  • Strong PyTorch skills
  • Experience with PyTorch kernel development and custom operations
  • Proficiency with profiling tools (NVIDIA Nsight, torch profiler, custom tooling)
  • Deep understanding of transformer architectures and attention mechanisms

Nice to have:

  • Experience with compilers/exporters such as torch.compile, TensorRT, ONNX, XLA
  • Experience optimizing inference workloads for latency and throughput
  • Experience with Triton compiler and kernel fusion techniques
  • Knowledge of warp-level intrinsics and advanced CUDA optimization

Additional Information:

Job Posted:
January 13, 2026

Employment Type:
Full-time
Work Type:
Remote work

Similar Jobs for Research Scientist / Engineer – Performance Optimization

Senior Computer Vision and Machine Learning Research Scientist

We are seeking a skilled and innovative Senior Research Scientist to join one of...
Location:
United States, Seattle
Salary:
159750.00 - 234300.00 USD / Year
Axon
Expiration Date:
Until further notice
Requirements:
  • PhD and 3+ years experience in Computer Science or a related field with a focus on computer vision, machine learning, artificial intelligence or related technical fields
  • Proven track record of research in machine learning, computer vision or related fields
  • Experience driving the ML development lifecycle and leveraging state-of-the-art research to deliver high quality models at scale
  • Strong computer vision fundamentals such as image processing, feature extraction, object detection, semantic segmentation, video analysis, or action recognition
  • Excellent problem-solving skills, analytical thinking, and the ability to work independently as well as collaboratively in a team environment
  • Proficiency in Python and frameworks such as PyTorch, TensorFlow or Keras
  • Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical audiences
Job Responsibilities:
  • Collaborate with other scientists, engineers and product managers to build proof-of-concepts to shape the Axon of tomorrow
  • Lead end-to-end research efforts in advanced computer vision, machine learning and gen-AI techniques for cloud and devices from multimodal data sources, including scene understanding, action recognition and anomaly detection
  • Design and implement responsible, privacy-preserving, efficient and scalable models for inference and analysis of visual data
  • Develop performance and quality metrics for CVML models and systems, and validate their effectiveness in real-world settings
  • Optimize algorithms for performance, memory footprint, and energy efficiency to meet the requirements of resource-constrained devices
  • Stay up-to-date with the latest research and advances in CVML and translate relevant findings into shipping Axon products
  • Contribute to academic publications, technical documentation, and patent disclosures to share insights and findings with the broader community
  • Coach and mentor junior scientists
What we offer:
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
Employment Type:
Full-time

Machine Learning Research Scientist

This role focuses on cutting-edge research and development in Artificial Intelli...
Location:
United States, Milpitas
Salary:
117500.00 - 270000.00 USD / Year
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • PhD in Computer Science, Electrical Engineering, or a related field, with a dissertation focused on Machine Learning
  • Extensive experience in deep learning research, preferably in Large Language Models or Reinforcement Learning
  • Experience developing applications with deep learning frameworks such as PyTorch, with a high level of software proficiency
  • Strong programming skills in Python, data structures, and algorithms are required
  • Experience with ML model optimization, GPU acceleration, heterogeneous computation, system software, and performance optimization is desired
  • Experience with Python web frameworks (Django, Flask) is a plus but not required
Job Responsibilities:
  • Conducting research, developing solutions, and creating intellectual property in emerging fields such as reinforcement learning, LLMs, digital twins, clean energy, data center optimization, and sustainability
  • Developing advanced technologies for analysis, optimization, time series forecasting, uncertainty quantification, and control
  • Providing thought leadership, collaborating internally and externally, and contributing to HPE’s strategy by identifying emerging technologies
  • Publishing in top conferences such as NeurIPS, AAAI, and ACL
  • Developing patent applications
  • Contributing software development, GPU acceleration, model optimization, and real-time data streaming to create robust AI solutions for real-world use cases
What we offer:
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development
  • Health and wellbeing programs
  • Personal and professional development programs
  • Diversity, inclusion, and belonging initiatives
Employment Type:
Full-time

Senior/Principal Computer Scientist - Optimization & Operations Researcher

The Discrete Math and Optimization Department advances mathematical and computat...
Location:
United States, Albuquerque
Salary:
117500.00 - 235700.00 USD / Year
Sandia National Laboratories
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in a relevant discipline and five (5) years of directly relevant experience, or an equivalent combination of directly relevant education and engineering or scientific experience that demonstrates the knowledge, skills, and ability to perform independent research and development
  • Ability to obtain and maintain a DOE Q clearance
  • Graduate degree in Computer Science or a highly related field where an independent research project was a graduation requirement
  • Strong verbal and written communication skills, ability to work effectively in multidisciplinary teams, and a passion to improve and expand technical skills
  • PhD in Applied Mathematics, Computer Science, Engineering, Operations Research, or a closely related quantitative field, along with a Bachelor's degree in a STEM discipline
  • Demonstrated experience with major optimization paradigms, including Linear Programming (LP), Mixed-Integer Programming (MIP), Stochastic Programming, and/or Nonlinear Programming (NLP), or advanced Constraint Programming
  • Proficiency in Python and experience with modern software development and engineering practices, including version control (e.g., GitLab), testing, and collaborative coding workflows
  • A record of peer-reviewed publications and presentations at major scientific conferences demonstrating research leadership, combined with a history of successful project delivery, implementation, and/or technology transfer
  • Proven experience as a constructive and inclusive team lead or member within a diverse, interdisciplinary research environment, demonstrating responsibility and responsiveness in project execution
  • Expertise in theoretical and computational aspects of Mixed Integer Programming (MIP), Stochastic Programming, and other algorithms for discrete optimization
Job Responsibilities:
  • Designing and implementing novel optimization algorithms and advanced operations research methods
  • Applying mathematical modeling to critical domains, including power grid modeling and analysis, cybersecurity, process systems engineering, and national security logistics
  • Developing robust, scalable software tools that empower mission partners and advance the capabilities of the global scientific community
  • Collaborating across disciplines to bridge the gap between theoretical mathematics and practical, real-world applications and solutions
  • Sharing results via high-impact publications and presentations to funding agencies, stakeholders, and the broader research community
What we offer:
  • Challenging work with amazing impact that contributes to security, peace, and freedom worldwide
  • Extraordinary co-workers
  • Some of the best tools, equipment, and research facilities in the world
  • Career advancement and enrichment opportunities
  • Flexible work arrangements for many positions include 9/80 (work 80 hours every two weeks, with every other Friday off) and 4/10 (work 4 ten-hour days each week) compressed workweeks, part-time work, and telecommuting (a mix of onsite work and working from home)
  • Generous vacation, strong medical and other benefits, competitive 401k, learning opportunities, relocation assistance and amenities aimed at creating a solid work/life balance
Employment Type:
Full-time

Research Engineer / Research Scientist - Foundations Retrieval Lead

The Foundations Research team works on high-risk, high-reward ideas that could s...
Location:
United States, San Francisco
Salary:
445000.00 - 555000.00 USD / Year
OpenAI
Expiration Date:
Until further notice
Requirements:
  • Proven experience leading high-performance teams of researchers or engineers in ML infrastructure or foundational research
  • Deep technical expertise in representation learning, embedding models, or vector retrieval systems
  • Familiarity with transformer-based LLMs and how embedding spaces can interact with language model objectives
  • Research experience in areas such as contrastive learning, supervised or unsupervised embedding learning, or metric learning
  • A track record of building or scaling large machine learning systems, particularly embedding pipelines in production or research contexts
  • A first-principles mindset for challenging assumptions about how retrieval and memory should work for large models
Job Responsibilities:
  • Lead research into embedding models and retrieval systems optimized for grounding, relevance, and adaptive reasoning
  • Manage a team of researchers and engineers building end-to-end infrastructure for training, evaluating, and integrating embeddings into frontier models
  • Drive innovation in dense, sparse, and hybrid representation techniques, metric learning, and learning-to-retrieve systems
  • Collaborate closely with Pretraining, Inference, and other Research teams to integrate retrieval throughout the model lifecycle
  • Contribute to OpenAI’s long-term vision of AI systems with memory and knowledge access capabilities rooted in learned representations
What we offer:
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
Employment Type:
Full-time

Research Engineering Manager

Meta is seeking a hands-on Research Engineering Manager to join the Meta SuperInte...
Location:
United States, Menlo Park
Salary:
184000.00 - 257000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Master's degree or PhD in Computer Science, Electrical Engineering, or a related field
  • 8+ years of experience in research and development in natural language processing, computer vision, generative AI, or related media technologies
  • 2+ years of experience managing technical teams, including performance management
  • Proven track record of leading research teams and delivering impactful results
  • Experience with large-scale systems and productization of research
  • Experience in LLM post-training, evaluation and optimization
Job Responsibilities:
  • Lead and mentor a team of research engineers and scientists working on cutting-edge LLM technologies
  • Drive the strategy and execution of research initiatives in LLM response quality improvement
  • Collaborate with cross-functional teams to translate research breakthroughs into scalable products and solutions
  • Lead the development of new algorithms and systems for LLM post-training, evaluation and efficiency
  • Stay abreast of the latest advancements in AI and large language modeling, and apply them to Meta’s products
What we offer:
  • bonus
  • equity
  • benefits

Senior Machine Learning Engineer (Health)

WHOOP is an advanced health and fitness wearable, on a mission to unlock human p...
Location:
United States, Boston
Salary:
150000.00 - 210000.00 USD / Year
Whoop
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s Degree in Computer Science, Data Science, Applied Mathematics, or a related field. Master’s preferred
  • 5+ years of professional experience as a Machine Learning Engineer or Software Engineer with focus on ML systems
  • Proven expertise working with time series data (wearable, physiological, or high-frequency sensor data strongly preferred)
  • Experience designing and deploying ML inference systems at scale: both real-time streaming and large-scale batch pipelines
  • Strong coding skills in Python (scientific stack) and SQL, with a track record of writing clean, production-quality code
  • Strong communication skills to collaborate across engineering, research, and product teams
  • Proven experience deploying and maintaining ML systems on cloud platforms (AWS or GCP)
  • Working familiarity with MLOps best practices: model versioning, CI/CD for ML, observability, and monitoring for inference systems
  • Ability to reason about and design for performance trade-offs (latency vs. throughput vs. cost) when building ML inference systems
  • Strong understanding of backend service development (APIs and service reliability) as it applies to serving ML models at scale
Job Responsibilities:
  • Create, improve, and maintain production services that provide analysis for health features in collaboration with Data Scientists and MLOps Engineers
  • Collaborate with Data Engineers to improve ML data pipelines, tooling, and validation systems that support robust model performance
  • Work alongside data scientists to translate research prototypes into production ML systems optimized for scale, latency, and cost efficiency
  • Collaborate with researchers and product teams to align model development with health insights and member impact
  • Participate in on-call rotations for data science services, ensuring uptime and performance in production environments
What we offer:
  • equity
  • benefits
Employment Type:
Full-time

Research Engineer II

As a Research Engineer II at Microsoft you will apply both software engineering ...
Location:
India, Bangalore
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
  • 2+ years of professional experience working with generative artificial intelligence, large language models, or agent-based systems
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibilities:
  • Design and develop highly usable, scalable application capabilities, integrating AI models and enhancing existing features to meet evolving customer needs
  • Build and debug production-grade code in distributed systems
  • Translate business requirements into AI solutions, collaborating with data scientists, research scientists, product managers, and engineering teams to ensure alignment and impact
  • Optimize AI model performance and reliability in production environments, including retraining, evaluation, and continuous monitoring
  • Own deployment, quality and operation of AI systems, including automated evals, CI/CD pipelines, deployment, and monitoring with strong MLOps and DevOps practices
  • Troubleshoot live site issues as part of both product development and live site support rotations, ensuring rapid resolution and learning
Employment Type:
Full-time