CrawlJobs Logo

ML Compiler and Performance Engineer

meta.com Logo

Meta

Location Icon

Location:
United States , Bellevue

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

217000.00 USD / Year

Job Description:

Meta's Training and Inference Accelerators (MTIA) team is developing novel HW to enable efficient execution of AI training and inference workloads. In this role, you will have end-to-end responsibility for the performance of in-production AI models in their transition from stock HW to MTIA chips, with a focus on models that require multi-node compute.

Job Responsibility:

  • Identifying bottlenecks and quantifying opportunities for improving performance
  • In-depth, end-to-end performance analysis and reporting
  • Developing optimizations to address identified bottlenecks
  • Optimizing compute/communication overlap
  • Work closely with other compiler teams as well as client teams (Recommendation Systems, Generative AI, etc)

Requirements:

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience developing and deploying optimizations at the level of PyTorch/Aten or comparable stacks
  • A Masters degree and 4+ years in-domain experience
  • A PhD degree and 2+ years in-domain experience
  • Experience optimizing multi-node distributed compute
  • Experience optimizing runtimes and/or kernels for accelerator platforms
What we offer:
  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
February 20, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for ML Compiler and Performance Engineer

New

Software Engineer, Triton Compiler

As a Software Engineer, you will help build AI systems that achieve levels of pe...
Location
Location
United States , San Francisco
Salary
Salary:
266000.00 - 445000.00 USD / Year
openai.com Logo
OpenAI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of relevant engineering experience, ideally in systems, compilers, ML frameworks, or performance engineering
  • Owning problems end-to-end, including learning new hardware and software domains as needed
Job Responsibility
Job Responsibility
  • help build AI systems that achieve levels of performance that were previously impossible
  • designing and optimizing core ML systems
  • writing highly reliable low-level code
  • advancing the algorithms and infrastructure that power our models
  • design and build the compilers, languages, and high-performance kernels that allow researchers to fully exploit our first-party accelerators
  • advancing Triton and its backend
  • developing new compiler passes
  • creating the tooling needed to write fast, correct, and deeply optimized kernels for brand-new hardware
  • partner closely with the hardware team to unlock new capabilities and ensure our custom silicon can support the next generation of frontier models
What we offer
What we offer
  • Offers Equity
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Fulltime
Read More
Arrow Right
New

Software Engineer - Performance Tools

Join our team as a Software Engineer - Performance Tools and take the lead in il...
Location
Location
United States , San Jose
Salary
Salary:
150000.00 - 275000.00 USD / Year
etched.com Logo
Etched
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong proficiency in C++ or Rust
  • Proficiency in Python is a plus
  • Deep understanding of computer architecture (CPU, GPU, accelerators), memory hierarchies (caches, DRAM), and interconnects (especially PCIe)
  • Proven experience in low-level performance analysis, profiling, and bottleneck identification on complex hardware systems (GPUs, CPUs, FPGAs, or custom ASICs)
  • Experience with performance analysis tools (e.g., NVIDIA Nsight, AMD uProf, Intel VTune, perf, Tracy, ETW)
  • Experience working close to hardware, potentially reading performance counters or interacting directly with device drivers
Job Responsibility
Job Responsibility
  • Tool Architecture & Design: Lead the design and architecture of a comprehensive performance analysis suite, including data collection mechanisms, data processing pipelines, analysis engines, and user interfaces (CLI and/or GUI)
  • Low-Level Data Collection: Develop robust methods to capture performance data directly from our custom ML accelerator hardware (e.g., hardware performance counters, execution unit status, memory access patterns) via driver interfaces or other mechanisms
  • Host & System Tracing: Implement tracing for host-side API calls (runtime libraries, driver interactions) and system-level events (CPU activity, PCIe traffic, memory usage, network contention) related to Sohu workloads
  • Data Correlation & Synchronization: Design and implement techniques to accurately correlate performance events across the host CPU, device driver, PCIe bus, multiple accelerators, and multiple hosts, ensuring precise time synchronization
  • Performance Analysis Engine: Build analysis modules to automatically interpret collected trace and counter data, identifying key performance limiters (e.g., compute-bound, memory bandwidth-bound, latency-bound, PCIe-bound, specific hardware bottlenecks)
  • Visualization & Reporting: Develop intuitive visualizations (timelines, dependency graphs, resource utilization charts, statistical summaries) to clearly communicate performance characteristics and bottlenecks to users
  • Collaboration & Support: Work closely with hardware architects, firmware engineers, driver developers, compiler engineers, and ML application engineers to understand their needs, define tool requirements, and provide expert guidance on performance analysis and optimization using the tool
What we offer
What we offer
  • Medical, dental, and vision packages with generous premium coverage
  • $500 per month credit for waiving medical benefits
  • Housing subsidy of $2k per month for those living within walking distance of the office
  • Relocation support for those moving to San Jose (Santana Row)
  • Various wellness benefits covering fitness, mental health, and more
  • Daily lunch + dinner in our office
  • Fulltime
Read More
Arrow Right

LLM Inference Performance & Evals Engineer

Join the inference model team dedicated to bring up the state-of-the-art models,...
Location
Location
Canada , Toronto
Salary
Salary:
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years building high-performance ML or systems software
  • Solid grounding in Transformer math—attention scaling, KV-cache, quantisation—or clear evidence you learn this material rapidly
  • Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.
  • Strong debugging skills across performance, numerical accuracy, and runtime integration
  • Prior experience in modeling, compilers or crafting benchmarks or performance studies
  • not just black-box QA tests
  • Strong passion to leverage AI agents or workflow orchestration tools to boost personal productivity
Job Responsibility
Job Responsibility
  • Prototype and benchmark cutting-edge ideas: new attentions, MoE, speculative decoding, and many more innovations as they emerge
  • Develop agent-driven automation that designs experiments, schedules runs, triages regressions, and drafts pull-requests
  • Work closely with compiler, runtime, and silicon teams: unique opportunity to experience the full stack of software/hardware innovation
  • Keep pace with the latest open- and closed-source models
  • run them first on wafer scale to expose new optimization opportunities
What we offer
What we offer
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Read More
Arrow Right

Software Engineering Manager, Programming Languages and Runtimes (PL&R) Compilers

Meta’s Server LLVM team owns the C++ optimizing compiler that builds the majorit...
Location
Location
United States , Bellevue
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with compiler architecture and development, particularly ML compilers or DSLs or static/dynamic languages compilers
  • Demonstrated experience recruiting, building, structuring, leading technical organizations, including performance management
  • 2+ years of experience in managing a team of compiler engineers of varied skill levels
Job Responsibility
Job Responsibility
  • Grow a team of compiler domain experts, supporting career growth and matching project opportunities
  • Communicate, collaborate, and build relationships with peer teams to facilitate cross-functional projects
  • Develop strategy and help set direction for the team
  • Remain up-to-date on ongoing software development activities in the team, help work through technical challenges, and be involved in design decisions
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Research Scientist Intern, PyTorch Compiler

Our team makes PyTorch run faster and more resource-efficient without sacrificin...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a PhD degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience in ML compiler, Distributed Training, ML systems, or similar
  • Proficient in Python or Cuda programming
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Develop new techniques in TorchDynamo, TorchInductor, PyTorch core, PyTorch Distributed
  • Explore the intersection of PyTorch compiler and PyTorch Distributed
  • Optimize Generative AI models across the stack (pre-training, fine-tuning, and inference)
  • Improve general PyTorch performance
  • Conduct cutting-edge research on ML compiler and ML distributed technologies
  • Collaborate with users of PyTorch to enable new use cases for the framework both inside and outside Meta
Read More
Arrow Right

Software Engineer, Infra PyTorch (PhD)

This role is about developing the core PyTorch 2.0 technologies, innovating and ...
Location
Location
United States , Menlo Park
Salary
Salary:
181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Currently has or is in the process of obtaining a PhD degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Research or industry experience in developing compilers, ML systems, ML accelerators, GPU performance, and similar
  • Advanced in Python or C++ programming
Job Responsibility
Job Responsibility
  • Develop the PT2 compiler (e.g., TorchDynamo, TorchInductor, PyTorch Distributed, PyTorch Core)
  • Improve PyTorch performance via systematic solutions for the entire community
  • Explore the intersection of the PyTorch compiler and PyTorch distributed
  • Optimize Generative AI models across the stack (pre-training, fine-tuning, and inference)
  • Collaborate with users of PyTorch to enable new use cases of PT2 technologies both inside and outside Meta
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Software Engineer, Systems ML - Compilers / Backend

We are seeking a software engineer to support the development of the compiler to...
Location
Location
United States , Sunnyvale
Salary
Salary:
217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 2+ years experience developing compilers, toolchains, runtime, or similar code optimization software
  • Experience in software design and programming experience in Python and/or C/C++ for development, debugging, testing and performance analysis
  • Experience in AI framework development or accelerating models on hardware architectures (GPU, TPU, custom AI ASICs)
Job Responsibility
Job Responsibility
  • Analyze and design effective compiler passes and optimizations. Implement and/or enhance code generation targeting machine learning accelerators
  • Work with algorithm research teams to map ML graphs to hardware implementations, model data-flows, create cost-benefit analysis and estimate silicon power and performance
  • Work with hardware architects to co-design hardware features that maximize performance, power efficiency and programmability
  • Contribute to the development of machine-learning libraries, intermediate representations, export formats, and analysis tools
  • Analyze and improve the efficiency, scalability, and stability of our toolchains. Optimize and tune kernels and compiled code to achieve latency targets for ML inference
  • Conduct design and code reviews. Evaluate code performance, debug, diagnose and drive resolution of compiler and cross-disciplinary system issues
  • Interface with other compiler-focused teams to evaluate and incorporate their innovations and vice versa
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Performance Engineer - Inference

Engineers on the inference performance team operate at the intersection of hardw...
Location
Location
Canada , Toronto
Salary
Salary:
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors / Masters / PhD in Electrical Engineering or Computer Science
  • Strong background in computer architecture
  • Exposure to and understanding of low-level deep learning / LLM math
  • Strong analytical and problem-solving mindset
  • 3+ years of experience in a relevant domain (Computer Architecture, CPU/GPU Performance, Kernel Optimization, HPC)
  • Experience working on CPU/GPU simulators
  • Exposure to performance profiling and debug on any system pipeline
  • Comfort with C++ and Python
Job Responsibility
Job Responsibility
  • Build performance models (kernel-level, end-to-end) to estimate the performance of state of the art and customer ML models
  • Optimize and debug our kernel micro code and compiler algorithms to elevate ML model inference speed, throughput and compute utilization on the Cerebras WSE
  • Debug and understand runtime performance on the system and cluster
  • Develop tools and infrastructure to help visualize performance data collected from the Wafer Scale Engine and our compute cluster
What we offer
What we offer
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Read More
Arrow Right