CrawlJobs Logo

Staff ML Infrastructure Engineer - Embodied AI

gm.com Logo

General Motors

Location Icon

Location:
United States , Sunnyvale

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

189300.00 - 290700.00 USD / Year

Job Description:

At General Motors, our product teams are redefining mobility. Through a human-centered design process, we create vehicles and experiences that are designed not just to be seen, but to be felt. We’re turning today’s impossible into tomorrow’s standard —from breakthrough hardware and battery systems to intuitive design, intelligent software, and next-generation safety and entertainment features. Every day, our products move millions of people as we aim to make driving safer, smarter, and more connected, shaping the future of transportation on a global scale. Join the Embodied AI team at General Motors. Our team is developing and deploying machine learning solutions that support safe and reliable autonomous vehicle behavior across real-world scenarios. As a Staff ML Infra Engineer, you will drive the development of core systems that enable rapid dataset generation, training, evaluation, and iteration of our most advanced Autonomous Driving models. From enabling large foundational driving models to distilling multi-stage production deployed models, your goal will be to dramatically accelerate the machine learning development cycle from one modeling hypothesis to next. You will deliver model training pipelines that are performant, easy to use, and exceptionally reliable. Your success will be measured by the velocity and impact of the ML models that rely on the scalable, intuitive, and high‑performance training platforms you help create.

Job Responsibility:

  • Lead the design, implementation, and deployment of scalable platforms and tools that drive machine learning model training and evaluation workflows across GM
  • Own complex technical projects end-to-end, making key architectural decisions and technical trade-offs
  • Take a holistic view of projects, considering their impact across multiple teams, and across a longer timeline
  • Proactively drive technical prioritization
  • Collaborate closely with partner teams to ensure maximum benefit from the systems we build
  • Help shape our team through technical interviewing with high, well-calibrated standards, and play an essential role in recruiting
  • Mentor and onboard junior engineers and interns, helping them grow their careers

Requirements:

  • 5+ years of experience building large-scale distributed systems, applications, or advanced ML systems
  • Proven track record of designing robust frameworks with high-quality, durable APIs
  • Deep understanding of machine learning algorithms with hands‑on application
  • Expertise in building reliable, high-performance, and cost-efficient systems on modern cloud infrastructure
  • End-to-end experience across the ML development lifecycle, including MLOps practices
  • Strong cross functional collaboration skills across teams and organizations
  • Exceptional coding skills in Python or C++
  • Strong interest in autonomous driving and its transformative potential
  • BS, MS, or PhD in Computer Science, Mathematics, or equivalent practical experience

Nice to have:

  • Experience with distributed training methodologies
  • Experience scaling ML training across large GPU/CPU clusters or other accelerators
  • Familiarity with deep learning frameworks (e.g., PyTorch, TensorFlow)
  • Experience with performance profiling and state-of-the-art training optimization techniques, including their impact on model performance
  • Experience with advanced build systems (e.g., Bazel, Buck, Blaze, CMake)
  • Proficiency with containerization and orchestration technologies (e.g., Docker, Kubernetes)
What we offer:
  • Medical
  • Dental
  • Vision
  • Health Savings Account
  • Flexible Spending Accounts
  • Retirement savings plan
  • Sickness and accident benefits
  • Life insurance
  • Paid vacation & holidays
  • Tuition assistance programs
  • Employee assistance program
  • GM vehicle discounts
  • Relocation benefits
  • Company vehicle evaluation program

Additional Information:

Job Posted:
March 25, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Staff ML Infrastructure Engineer - Embodied AI

Staff ML Engineer - Embodied AI Scaling Foundations

At General Motors, our product teams are redefining mobility. Through a human-ce...
Location
Location
United States , Sunnyvale, California
Salary
Salary:
189000.00 - 300000.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s, Master’s, or PhD in Computer Science, Robotics, Machine Learning, or related field
  • Experience working with large-scale foundation models and alignment methods applied to real-world systems
  • Demonstrated ability to deliver applied ML solutions under real-world constraints and timelines
  • Proficiency in PyTorch and Python
  • Experience building and scaling model training pipelines enabling efficient iteration across teams
  • Strong data processing skills using tools such as NumPy, Pandas, and Apache Spark
  • Strong communication skills enabling effective collaboration across engineering teams
  • Experience deploying ML models into production environments and understanding end-to-end deployment workflows
Job Responsibility
Job Responsibility
  • Design and implement ML solutions aligned with GM’s autonomous driving objectives
  • Apply techniques such as unsupervised pre-training, imitation learning, reinforcement learning, model scaling/selection, foundation modeling, to solve problems in object detection/tracking/classification, trajectory generation, and safe AI
  • Collaborate with cross-functional teams to deploy models and algorithms into onboard driving systems
  • Contribute to applied research efforts and remain current with advancements in ML frameworks and methods
  • Design and build efficient infrastructure, pipelines, and tooling to facilitate fast-pace model iterations
  • Drive technical execution from prototyping through production deployment, documenting learnings and best practices
  • Support and mentor engineers through technical collaboration and code reviews, fostering knowledge sharing and engineering excellence
What we offer
What we offer
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Capacity & Efficiency Infrastructure

Microsoft AI is looking for a Member of Technical Staff – Capacity & Efficiency ...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Deep understanding of the fundamentals of GPU architectures and DL/LLM architectures
  • Deep experience in profiling and analyzing performance in large-scale distributed computing systems
  • Deep experience in profiling and analyzing performance in ML models especially GenAI models
  • Experience with low-level GPU programming (CUDA, Triton, NCCL) and frameworks such as PyTorch or JAX
  • Experience in leading technical projects and supporting architectural decisions with data
  • Experience building infrastructure for large-scale machine learning or generative AI workloads
  • Experience in networking (InfiniBand, NVLink), storage systems, or distributed training parallelisms
  • Track record of contributing to high-performance computing or large-scale AI infrastructure projects
Job Responsibility
Job Responsibility
  • Design, implement, test, and optimize distributed training infrastructure in Python and C++ for large-scale GPU clusters
  • Build and evolve telemetry systems to provide visibility into infrastructure & ML model performance, utilization, and cost related metrics
  • Profile, benchmark, and debug performance bottlenecks across compute, memory, networking, and storage subsystems
  • Drive architectural improvements across various ML services which deliver measurable efficiency improvements
  • Build and evolve tools to automatically provide insights and recommendations to improve fleet-wide efficiency
  • Optimize collective communication libraries (e.g., NCCL) for emerging NVLink and InfiniBand topologies
  • Partner with ML researchers and infrastructure engineers to understand their plans and future needs and develop plans to balance growth with efficiency
  • Collaborate with hardware teams to optimize for next-generation accelerators (NVIDIA, MAIA, and beyond)
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Staff ML Engineer - Embodied AI Scaling Foundations

Are you passionate about accelerating the future of autonomous driving? Join the...
Location
Location
United States , Mountain View
Salary
Salary:
189000.00 - 280000.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience building large-scale distributed systems, applications, or advanced ML systems
  • Proven track record of designing robust frameworks with high-quality, durable APIs
  • Deep understanding of machine learning algorithms with hands‑on application
  • Expertise in building reliable, high-performance, and cost-efficient systems on modern cloud infrastructure
  • End-to-end experience across the ML development lifecycle, including MLOps practices
  • Strong cross functional collaboration skills across teams and organizations
  • Proficiency with containerization and orchestration technologies (e.g., Docker, Kubernetes)
  • Exceptional coding skills in Python or C++
  • Strong interest in autonomous driving and its transformative potential
  • BS, MS, or PhD in Computer Science, Mathematics, or equivalent practical experience.
Job Responsibility
Job Responsibility
  • Lead the design, implementation, and deployment of scalable platforms and tools that drive machine learning model training and evaluation workflows across GM
  • Own complex technical projects end-to-end, making key architectural decisions and technical trade-offs. You will be a core contributor to team planning, design reviews, and code quality
  • Take a holistic view of projects, considering their impact across multiple teams, and Proactively drive technical prioritization. Collaborate closely with partner teams to ensure maximum benefit from the systems we build
  • Help shape our team through technical interviewing with high, well-calibrated standards, and play an essential role in recruiting. Mentor and onboard junior engineers and interns, helping them grow their careers
What we offer
What we offer
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Principal Engineering Manager

As Microsoft continues to push the boundaries of AI, we are on the lookout for s...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, Javascript, or Python OR equivalent experience
  • Demonstrated track record of building and scaling engineering organizations (hiring teams from scratch, structuring orgs, growing managers)
  • Experience delivering large-scale software systems in AI, machine learning, or related fields
  • Experience managing organizations of 30+ engineers across multiple teams and workstreams
  • Deep expertise in LLM evaluation, AI quality measurement, or ML infrastructure at scale
  • Track record of partnering with senior leadership (VP/CVP level) to set strategy and drive cross-organizational programs
  • Experience recruiting and developing senior engineering talent (principal engineers, engineering managers) in a competitive market
  • Proven ability to operate effectively in fast-paced, ambiguous environments — comfortable making decisions with incomplete information and course-correcting quickly
  • Strong technical judgment: ability to evaluate architectural tradeoffs, assess technical risk, and guide teams toward sound engineering decisions without needing to write the code yourself
  • Experience leading distributed or multi-site engineering teams.
Job Responsibility
Job Responsibility
  • Build and lead a multi-team engineering organization (30+ engineers across multiple teams), including hiring and developing engineering managers who lead their own teams
  • Set the technical and organizational strategy for Copilot AI Evaluation and response quality, aligning with MAI's broader product and engineering vision
  • Partner with senior Eng and Product leadership (Partner+ level) to define priorities, influence roadmaps, and drive cross-organizational initiatives
  • Own end-to-end delivery of evaluation platforms, novel evaluation techniques, and agentic solutions for measuring and improving Copilot quality at scale
  • Recruit, develop, and retain world-class engineering talent — building a culture of technical excellence, accountability, and continuous learning
  • Drive operational rigor: establish engineering processes, quality bars, and delivery cadences that enable predictable, high-quality execution across multiple concurrent workstreams
  • Navigate ambiguity and make high-judgment tradeoff decisions on technology, staffing, and investment priorities in a fast-moving AI landscape
  • Foster a diverse, inclusive team culture where engineers at all levels can do their best work and grow their careers
  • Embody our Culture and Values.
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Developer Experience - MAI Superintelligence Team

Microsoft AI is looking for a Member of Technical Staff, Developer Experience to...
Location
Location
United States , Mountain View
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Job Responsibility
Job Responsibility
  • Design, implement, and optimize CI/CD pipelines for large-scale ML training workloads.
  • Build developer tools and automation to simplify training and evaluation workflows.
  • Improve and maintain core infrastructure across multi-cloud environments.
  • Deploy and manage model hosting systems for inference and data generation.
  • Collaborate with cross-functional teams to drive best practices in reliability, testability, and performance.
  • Care deeply about conversational AI and its deployment.
  • Actively contribute to the development of AI models that are powering our innovative products.
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively.
  • Enjoy working in a fast-paced, design-driven, product development cycle.
  • Embody our Culture and Values.
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Full Stack - ML Efficiency & Observability

Microsoft AI is looking for a Member of Technical Staff - Full Stack Engineer, M...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master’s Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ year(s) experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience.
  • Experience with Capacity Management, Efficiency Management, ML Training and/or Inference
  • Solid expertise in JavaScript / TypeScript, React, HTML, CSS and browser internals
  • Solid understanding of web performance, accessibility, and cross‑browser compatibility
  • Experience with Development & Debugging with dev environments like Visual Studio or Visual Studio Code
  • Software development experience with Generative AI tools
  • Experience in leading technical projects and supporting architectural decisions with data.
Job Responsibility
Job Responsibility
  • Design and develop features for our capacity management portal
  • Design and develop features to provide visibility into model performance and quality across our fleet
  • Partner with ML researchers and PMs to translate functional requirements into highly functional, intuitive and appealing interfaces
  • Integrate with backend APIs from schedulers to training frameworks to build visibility across the training life cycle
  • Explore, develop, and adapt new innovations to the software development process
  • Contribute to the development of internal tooling and infrastructure
  • Implement best software development practices to ensure code quality. Hold a high quality bar.
  • Embody our culture and values.
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Machine Learning Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 3+ years of experience building and deploying ML models in production environments
  • Strong coding skills in Python and experience with ML frameworks (e.g., PyTorch, TensorFlow)
  • Familiarity with data processing tools (e.g., Spark, Pandas) and cloud platforms (e.g., Azure, AWS)
  • Experience with classification, recommendation, or personalization systems
  • Experience using large language models (LLMs) for machine learning and AI applications
  • Hands-on experience in growth engineering, driving improvements in user acquisition, engagement, and retention
  • Hands-on experience with machine learning frameworks such as TensorFlow, PyTorch, or Scikit-learn
  • Expertise in personalization strategies and user behavior modeling
  • Strong problem-solving skills and the ability to independently design solutions to complex challenges
Job Responsibility
Job Responsibility
  • Develop and Deploy Models: Design, develop, and implement machine learning models for high-performance recommendation systems and personalized feeds
  • Large Language Model Expertise: Leverage large language models (LLMs) to create scalable, intelligent solutions for content understanding, user engagement, and relevance ranking
  • Experimentation and Analysis: Drive data-driven experimentation using A/B testing, advanced analytics, and statistical techniques to identify growth opportunities and refine algorithms
  • Infrastructure Optimization: Develop and optimize pipelines, tools, and infrastructure to support real-time decision-making, personalization, and predictive analytics
  • Technical Leadership: Mentor team members and foster collaboration within cross-functional teams, including engineers, product managers, and designers
  • Continuous Innovation: Stay informed on emerging trends in AI and machine learning, and integrate them to drive innovation and improve product offerings
  • Cross-functional Collaboration: Articulate findings and recommendations to technical and non-technical audiences, influencing decisions across teams and leadership
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right
New

Senior Commercial Manager

Egis UK is seeking a highly experienced Senior Commercial Manager to lead commer...
Location
Location
United Kingdom , Birmingham
Salary
Salary:
Not provided
egis-group.com Logo
Egis in the UK
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree qualified, or equivalent
  • Experience of the development and implementation of procedures
  • Civil engineering works experience, including utilities
  • Knowledge of a wide range of contract conditions
  • Strong contractual experience
  • Excellent IT skills and knowledge of relevant software
  • Ability to solve problems efficiently
  • Excellent interpersonal and networking skills, with the ability to communicate effectively with all levels of personnel
  • Ability to effectively prioritise a busy workload and meet project deadlines
  • Evidence new ways of thinking which deliver benefits
Job Responsibility
Job Responsibility
  • Provide executive-level oversight of commercial operations across multi-disciplinary programmes
  • Shape and implement procurement and contract strategies that align with client objectives and regulatory frameworks
  • Act as a trusted advisor to senior stakeholders, regulators, and delivery partners
  • Champion commercial innovation, risk mitigation, and continuous improvement across the portfolio
  • Define and execute commercial strategy across multi-workstream programmes
  • Lead contract negotiation, supplier performance, and risk management
  • Ensure compliance with NEC3/4, FIDIC, UK law, and sector-specific regulations
  • Support work winning through commercial insight and bid strategy
  • Drive commercial governance and reporting across all phases
  • Oversee cost planning, benchmarking, and value engineering
  • Fulltime
Read More
Arrow Right