Software Engineer, Systems ML - Frameworks / Compilers / Kernels

Meta

Location:
United States, Menlo Park

Contract Type:
Not provided

Salary:

181000.00 USD / Year

Job Description:

In this role, you will be a member of the MTIA (Meta Training & Inference Accelerator) Software team, part of the larger industry-leading PyTorch AI framework organization. The MTIA Software team has been developing a comprehensive AI compiler strategy that delivers a highly flexible platform to train and serve new DL/ML model architectures, combined with auto-tuned high performance for production environments across specialized hardware architectures. The compiler stack, DL graph optimizations, and kernel authoring for specific hardware directly impact the performance and deployment velocity of both AI training and inference platforms at Meta. You will work in one of the core areas, such as PyTorch framework components, the AI compiler and runtime, or high-performance kernels and tooling, to accelerate machine learning workloads on the current and next generation of MTIA AI hardware platforms. You will work closely with AI researchers to analyze deep learning models and lower them efficiently onto MTIA hardware. You will also partner with hardware design teams to develop compiler optimizations for high performance. You will apply software development best practices to design features, optimizations, and performance tuning techniques. You will gain valuable experience in developing machine learning compiler frameworks and will help drive next-generation hardware-software co-design for AI domain-specific problems.

Job Responsibility:

  • Develop the software stack in one of the following core focus areas: AI frameworks, the compiler stack, or high-performance kernel development and acceleration on next-generation hardware architectures.
  • Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state-of-the-art inference and training AI hardware accelerators and optimize their performance.
  • Analyze deep learning networks, and develop and implement compiler optimization algorithms.
  • Collaborate with AI research scientists to accelerate the next generation of deep learning models, such as recommendation systems, generative AI, computer vision, and NLP.
  • Tune and optimize the performance of deep learning framework and software components.

Requirements:

  • Proven C/C++ programming skills
  • Currently has, or is in the process of obtaining, a Bachelor's degree in Computer Science, Computer Engineering, or a relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta.
  • Experience in AI framework development or accelerating deep learning models on hardware architectures.

Nice to have:

  • A Bachelor's degree in Computer Science, Computer Engineering, or a relevant technical field and 4+ years of experience in AI framework development or accelerating deep learning models on hardware architectures; OR a Master's degree in Computer Science, Computer Engineering, or a relevant technical field and 2+ years of experience in AI framework development or accelerating deep learning models on hardware architectures; OR a PhD in Computer Science, Computer Engineering, or a relevant technical field.
  • Knowledge of GPU, CPU, or AI hardware accelerator architectures.
  • Experience working with frameworks like PyTorch, Caffe2, TensorFlow, ONNX, or TensorRT, OR experience in one of the following areas:
  • AI high-performance kernels: Experience with CUDA programming, OpenMP / OpenCL programming, or AI hardware accelerator kernel programming. Experience in accelerating libraries on AI hardware, similar to cuBLAS, cuDNN, CUTLASS, HIP, ROCm, etc.
  • AI compiler: Experience with compiler optimizations such as loop optimizations, vectorization, parallelization, and hardware-specific optimizations such as SIMD. Experience with MLIR, LLVM, IREE, XLA, TVM, or Halide is a plus.
  • AI frameworks: Experience in developing training and inference framework components. Experience in system performance optimizations such as runtime analysis of latency, memory bandwidth, I/O access, and compute utilization, and in associated tooling development.

What we offer:

  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
January 23, 2026

Similar Jobs for Software Engineer, Systems ML - Frameworks / Compilers / Kernels

Software Engineer, Triton Compiler

As a Software Engineer, you will help build AI systems that achieve levels of pe...
Location:
United States, San Francisco
Salary:
266000.00 - 445000.00 USD / Year
OpenAI
Expiration Date:
Until further notice
Requirements:
  • 3+ years of relevant engineering experience, ideally in systems, compilers, ML frameworks, or performance engineering
  • Owning problems end-to-end, including learning new hardware and software domains as needed
Job Responsibility:
  • help build AI systems that achieve levels of performance that were previously impossible
  • designing and optimizing core ML systems
  • writing highly reliable low-level code
  • advancing the algorithms and infrastructure that power our models
  • design and build the compilers, languages, and high-performance kernels that allow researchers to fully exploit our first-party accelerators
  • advancing Triton and its backend
  • developing new compiler passes
  • creating the tooling needed to write fast, correct, and deeply optimized kernels for brand-new hardware
  • partner closely with the hardware team to unlock new capabilities and ensure our custom silicon can support the next generation of frontier models
What we offer:
  • Offers Equity
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Fulltime

Software Engineer, Systems ML - Frameworks / Compilers / Kernels (PhD)

In this role, you will be a member of the MTIA (Meta Training & Inference Accele...
Location:
United States, Bellevue
Salary:
181000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Proven C/C++ programming skills
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Experience in AI framework development or accelerating deep learning models on hardware architectures
Job Responsibility:
  • Development of SW stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next generation of hardware architectures
  • Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state of the art inference and training AI hardware accelerators and optimize their performance
  • Analyze deep learning networks, develop & implement compiler optimization algorithms
  • Collaborating with AI research scientists to accelerate the next generation of deep learning models such as Recommendation systems, Generative AI, Computer vision, NLP etc
  • Performance tuning and optimizations of deep learning framework & software components
What we offer:
  • bonus
  • equity
  • benefits

Software Engineer, Hardware

As a software engineer on the Scaling team, you’ll help build and optimize the l...
Location:
United States, San Francisco
Salary:
266000.00 - 455000.00 USD / Year
OpenAI
Expiration Date:
Until further notice
Requirements:
  • Proficient in systems programming (e.g., Rust, C++) and scripting languages like Python
  • Experience in one or more of the following areas: compiler development, kernel authoring, accelerator programming, runtime systems, distributed systems, or high-performance simulation
  • Deep curiosity for how large-scale systems work and enjoy making them faster, simpler, and more reliable
  • Excited to work in a fast-paced, highly collaborative environment with evolving hardware and ML system demands
  • Value engineering excellence, technical leadership, and thoughtful system design
Job Responsibility:
  • Design and build APIs and runtime components to orchestrate computation and data movement across heterogeneous ML workloads
  • Contribute to compiler infrastructure, including the development of optimizations and compiler passes to support evolving hardware
  • Engineer and optimize compute and data kernels, ensuring correctness, high performance, and portability across simulation and production environments
  • Profile and optimize system bottlenecks, especially around I/O, memory hierarchy, and interconnects, at both local and distributed scales
  • Develop simulation infrastructure to validate runtime behaviors, test training stack changes, and support early-stage hardware and system development
  • Rapidly deploy runtime and compiler updates to new supercomputing builds in close collaboration with hardware and research teams
  • Work across a diverse stack, primarily using Rust and Python, with opportunities to influence architecture decisions across the training framework
What we offer:
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime

Member of Technical Staff, Software Co-Design AI HPC Systems

Our team’s mission is to architect, co-design, and productionize next-generation...
Location:
United States, Mountain View
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Strong background in one or more of the following areas: AI accelerator or GPU architectures; distributed systems and large-scale AI training/inference; high-performance computing (HPC) and collective communications; ML systems, runtimes, or compilers; performance modeling, benchmarking, and systems analysis; hardware–software co-design for AI workloads.
  • Proficiency in systems-level programming (e.g., C/C++, CUDA, Python) and performance-critical software development.
  • Proven ability to work across organizational boundaries and influence technical decisions involving multiple stakeholders.
Job Responsibility:
  • Lead the co-design of AI systems across hardware and software boundaries, spanning accelerators, interconnects, memory systems, storage, runtimes, and distributed training/inference frameworks.
  • Drive architectural decisions by analyzing real workloads, identifying bottlenecks across compute, communication, and data movement, and translating findings into actionable system and hardware requirements.
  • Co-design and optimize parallelism strategies, execution models, and distributed algorithms to improve scalability, utilization, reliability, and cost efficiency of large-scale AI systems.
  • Develop and evaluate what-if performance models to project system behavior under future workloads, model architectures, and hardware generations, providing early guidance to hardware and platform roadmaps.
  • Partner with compiler, kernel, and runtime teams to unlock the full performance of current and next-generation accelerators, including custom kernels, scheduling strategies, and memory optimizations.
  • Influence and guide AI hardware design at system and silicon levels, including accelerator microarchitecture, interconnect topology, memory hierarchy, and system integration trade-offs.
  • Lead cross-functional efforts to prototype, validate, and productionize high-impact co-design ideas, working across infrastructure, hardware, and product teams.
  • Mentor senior engineers and researchers, set technical direction, and raise the overall bar for systems rigor, performance engineering, and co-design thinking across the organization.
  • Fulltime

Software Engineer, Systems ML - Compilers / Backend

We are seeking a software engineer to support the development of the compiler to...
Location:
United States, Sunnyvale
Salary:
217000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 2+ years experience developing compilers, toolchains, runtime, or similar code optimization software
  • Experience in software design and programming experience in Python and/or C/C++ for development, debugging, testing and performance analysis
  • Experience in AI framework development or accelerating models on hardware architectures (GPU, TPU, custom AI ASICs)
Job Responsibility:
  • Analyze and design effective compiler passes and optimizations. Implement and/or enhance code generation targeting machine learning accelerators
  • Work with algorithm research teams to map ML graphs to hardware implementations, model data-flows, create cost-benefit analysis and estimate silicon power and performance
  • Work with hardware architects to co-design hardware features that maximize performance, power efficiency and programmability
  • Contribute to the development of machine-learning libraries, intermediate representations, export formats, and analysis tools
  • Analyze and improve the efficiency, scalability, and stability of our toolchains. Optimize and tune kernels and compiled code to achieve latency targets for ML inference
  • Conduct design and code reviews. Evaluate code performance, debug, diagnose and drive resolution of compiler and cross-disciplinary system issues
  • Interface with other compiler-focused teams to evaluate and incorporate their innovations and vice versa
What we offer:
  • bonus
  • equity
  • benefits

Software Engineer, Systems ML - Compilers / Backend

We are seeking a software engineer to support the development of the compiler to...
Location:
United States, Sunnyvale
Salary:
181000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Experience in software design and programming experience in Python and/or C/C++ for development, debugging, testing and performance analysis
  • Experience in AI framework development or accelerating models on hardware architectures (GPU, TPU, custom AI ASICs)
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility:
  • Analyze and design effective compiler passes and optimizations. Implement and/or enhance code generation targeting machine learning accelerators
  • Work with algorithm research teams to support the co-design of hardware features mapping ML graphs to hardware implementations, modeling data-flows, creating cost-benefit analysis and estimating silicon power and performance
  • Work with hardware architects to co-design hardware features that maximize performance, power efficiency and programmability
  • Contribute to the development of machine-learning libraries, intermediate representations, export formats, and analysis tools
  • Collaborate with the team to enhance the efficiency, scalability, and stability of our toolchains by focusing on kernel optimization and tuning
  • Conduct design and code reviews. Evaluate code performance, debug, diagnose and drive resolution of compiler and cross-disciplinary system issues
  • Interface with other compiler-focused teams to evaluate and incorporate their innovations and vice versa
What we offer:
  • bonus
  • equity
  • benefits

Senior Machine Learning Engineer

As a Machine Learning Engineer at Dedrone, you’ll play a pivotal role in advanci...
Location:
United States, Sterling
Salary:
Not provided
Axon
Expiration Date:
Until further notice
Requirements:
  • 5+ years of professional experience in modern C++ (C++14/17 or later), with strong object-oriented and generic programming skills
  • Deep understanding of multithreading and concurrency (threads, thread pools, locks, lock-free structures, atomics, futures, async patterns) and experience building robust, concurrent systems
  • Hands-on experience with parallel processing frameworks or patterns (SIMD, task-based parallelism, GPU offload, or similar) for real-time or high-throughput applications
  • Strong command of data structures and algorithms, and the ability to choose and implement the right structures for performance-critical, memory-constrained environments
  • Proven experience with memory management and performance optimization in C++ (stack vs heap, custom allocators, cache-aware design, avoiding fragmentation, RAII, move semantics)
  • Practical experience with CUDA (or similar GPU programming frameworks): writing kernels, managing GPU memory, optimizing for occupancy and bandwidth, and integrating with C++ codebases
  • Familiarity with Linux-based development (build systems like CMake, unit testing frameworks, containerization and/or cross-compilation for edge devices)
  • Strong debugging and profiling skills across CPU and GPU, and a methodical approach to benchmarking and regression testing
  • Excellent collaboration and communication skills, with a track record of working closely with research or ML teams to move algorithms from prototype to production
Job Responsibility:
  • Design and implement high-performance C++ software that runs computer vision and tracking algorithms in real time on edge devices
  • Work closely with computer vision / self-supervised learning engineers to integrate their models into production pipelines, including pre/post-processing, I/O, and system orchestration
  • Build and optimize multithreaded and parallel processing pipelines for ingesting, synchronizing, and processing data from a networked system of cameras
  • Implement and tune CUDA kernels and GPU-accelerated components to maximize throughput and minimize latency for inference, tracking, and search
  • Design robust data structures and memory management strategies for handling large volumes of video, sensor, and metadata streams under tight compute and power constraints
  • Profile and optimize code using tools such as perf, valgrind, nvprof / Nsight, and similar to identify bottlenecks and improve CPU/GPU utilization
  • Collaborate with simulation and CV teams to deploy and evaluate algorithms in realistic test scenarios, including fault handling and performance monitoring
  • Develop clean, well-tested, and well-documented C++ libraries and services that can be reused across products and future airspace applications
  • Contribute to system-level architecture decisions, including inter-process communication, scheduling, resource allocation, and deployment strategies on edge platforms
What we offer:
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime

Healthcare Assistant & Lead Healthcare Assistant

Are you passionate about supporting older people and making a meaningful differe...
Location:
Ireland, Cork
Salary:
Not provided
360 Resourcing Solutions
Expiration Date:
Until further notice
Requirements:
  • Experience in elderly care is desirable but not essential (training may be provided)
  • QQI Level 5 in Healthcare Support or equivalent is an advantage
  • A caring, patient and respectful approach to supporting older people
  • Ability to work flexible shifts including days, nights, and weekends
  • Strong commitment to resident safety, dignity and wellbeing
  • Applicants must have legal entitlement to work in Ireland
Job Responsibility:
  • Deliver person-centred care to residents with dignity and respect
  • Support residents with activities of daily living where required
  • Encourage independence and social engagement
  • Maintain a clean, safe and comfortable living environment
  • Build positive relationships with residents, families, and colleagues
  • Follow all health, safety and safeguarding procedures
  • Assist with compliance processes, inspections, and quality standards
  • Lead Healthcare Assistants will also assist in guiding care teams, supporting best practice, and helping maintain high standards of care delivery
What we offer:
  • Flexible working hours to support work–life balance
  • Local employment opportunities within the Douglas, Cork area
  • Competitive rates of pay in line with experience and role
  • Supportive team environment with ongoing training and development
  • Employee wellbeing initiatives
  • Uniform provided
  • Free on-site parking
  • Meals provided while on duty