Senior ML Ops Engineer - Architecture & Strategy

BMW

Location:
Germany, Munich

Contract Type:
Not provided

Salary:
Not provided

Job Description:

We own the platform blueprint for our ML infrastructure: we design systems that integrate with a data mesh of domain-owned data products, leverage Qualcomm Cloud AI 100 and NVIDIA GPU clusters for training at petabyte scale, and produce optimised model artefacts ready for deployment to vehicle hardware. We set technical direction, make build-vs-buy decisions, and ensure the platform scales to hundreds of engineers.

Job Responsibility:

  • You design the reference architecture for the ML platform end-to-end: data ingestion, PB-scale data lake, heterogeneous training clusters, model registry, and deployment-ready artefacts
  • You design the data-format backbone, setting standards for data flows, ingestion, cataloguing, transcoding, and partitioning at PB scale, integrated with dataset management tooling
  • You define the platform component topology and integration contracts for pipeline orchestration, experiment tracking, hyperparameter optimisation, dataset management, observability, and metadata
  • You establish model lifecycle governance, including experiment tracking, approval gates, validation criteria, and clear handoff contracts to deployment teams
  • You drive cost governance at PB scale, including accelerator spot strategies, S3 tiering, cross-AZ traffic reduction, and Kubernetes cluster right-sizing
  • You partner with Security, Legal, and Functional-Safety teams on ISO 26262, ISO 8800, and data-protection compliance

Requirements:

  • University degree in Computer Science, Computer/Electrical Engineering or related subjects
  • 5–8+ years in ML platform or infrastructure engineering, with at least two years in a tech lead or architect role
  • Deep expertise in AWS, Azure, or Google Cloud, ideally with multi-region or multi-account setups
  • Proven track record of designing systems for PB-scale data and hundreds of concurrent training jobs, as well as an understanding of large vision models and the challenges of compressing them for automotive-grade SoCs
  • Strong knowledge of Kubernetes platform design, GitOps, and infrastructure-as-code
  • Excellent communication skills to align ML researchers, embedded engineers, data teams, and executives
  • Familiarity with edge model compilation toolchains for Qualcomm (QNN, AIMET) and/or NVIDIA (TensorRT, Triton), and experience with automotive data at scale, such as MDF4, MCAP, ROS bags, and multi-sensor synchronisation

What we offer:

  • Challenging projects through which we shape the mobility of tomorrow together
  • Wide range of personal and professional development opportunities
  • Attractive, fair and performance-related remuneration
  • High level of job security
  • Annual special payments such as vacation pay, Christmas bonus, and profit sharing
  • Flexible working hours, six weeks of annual leave, and overtime compensation
  • Discounted BMW & MINI conditions
  • Many other benefits at bmw.jobs/benefits

Additional Information:

Job Posted:
March 21, 2026

Employment Type:
Full-time

Similar Jobs for Senior ML Ops Engineer - Architecture & Strategy

Senior Software Engineer - ML Infrastructure

We build simple yet innovative consumer products and developer APIs that shape h...
Location:
United States, San Francisco
Salary:
180000.00 - 270000.00 USD / Year
Plaid
Expiration Date:
Until further notice
Requirements:
  • 5+ years of industry experience as a software engineer, with strong focus on ML/AI infrastructure or large-scale distributed systems
  • Hands-on expertise in building and operating ML platforms (e.g., feature stores, data pipelines, training/inference frameworks)
  • Proven experience delivering reliable and scalable infrastructure in production
  • Solid understanding of ML Ops concepts and tooling, as well as best practices for observability, security, and reliability
  • Strong communication skills and ability to collaborate across teams
Job Responsibility:
  • Design and implement large-scale ML infrastructure, including feature stores, pipelines, deployment tooling, and inference systems
  • Drive the rollout of Plaid’s next-generation feature store to improve reliability and velocity of model development
  • Help define and evangelize an ML Ops “golden path” for secure, scalable model training, deployment, and monitoring
  • Ensure operational excellence of ML pipelines and services, including reliability, scalability, performance, and cost efficiency
  • Collaborate with ML product teams to understand requirements and deliver solutions that accelerate experimentation and iteration
  • Contribute to technical strategy and architecture discussions within the team
  • Mentor and support other engineers through code reviews, design discussions, and technical guidance
What we offer:
  • medical, dental, vision, and 401(k)
Employment Type:
Full-time

Senior Principal Technical Program Manager - ML Platform

Salary:
231300.00 - 301975.00 USD / Year
Atlassian
Expiration Date:
Until further notice
Requirements:
  • 8+ years of experience on software teams as Development Manager, Technical Product Manager or TPM leading technical platforms areas
  • Deep domain experience in AI and/or Search. Example: Model Inference, Model Evaluation, Model Training, LLM Ops, Semantic Search, Search Relevance, etc.
  • Partner with Engineering in defining direction, strategy and execution at Platform level
  • Strategic thinking and ability to understand business objectives to translate them into technical problems and programs.
  • Technical understanding of systems involved. Willingness to develop domain expertise in the area they operate - storage, networking, authentication, capacity management, service deployments, etc.
  • TPMs are not expected to write or read code, but are expected to understand system flows, block architectures, APIs and such.
  • Experience defining and running end-to-end complex technical programs
  • Strong leadership, organizational, and communication skills
Job Responsibility:
  • Understand and stay up-to-date on latest innovations in AI and Search. Partner closely with engineering teams to translate these into practical platform evolution for Atlassian bringing value to our customers.
  • Analyze business objectives, customer needs, product adoption inhibitors and opportunities, industry trends, and based on these, in close collaboration with your stakeholders, define a long-term strategy and roadmap for your platform and product components.
  • Understand business objectives and translate them into technical systems problems that need to be prioritized and solved in the current business environment.
  • Define specific systems programs and create a plan of action for realizing those programs. Such programs could be around capacity planning, migration efforts, high availability, network architecture, performance optimization, reliability improvements and more.
  • Use your technical understanding of Atlassian and related systems to partner with and influence engineers and architects in making progress on these problems.
  • Responsible for taking a systematic approach to engineering problems. This includes: prioritizing tasks, scoping out the project, defining objectives, and making consistent progress against each of these.
  • Be accountable for the success of these technical programs by managing the entire lifecycle from initiation to forecasting, budgeting, scheduling, etc.
  • Manage complex dependencies and projects with a broad scope across the company
What we offer:
  • health and wellbeing resources
  • paid volunteer days

Principal Engineer

The Principal AI/ML Operations Engineer leads the architecture, automation, and ...
Location:
United States, Pleasanton, California
Salary:
251000.00 - 314500.00 USD / Year
BlackLine
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field
  • 10+ years in ML infrastructure, DevOps, and software system architecture
  • 4+ years in leading MLOps or AI Ops platforms
  • Strong programming skills in languages such as Python, Java, or Scala
  • Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
  • Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
  • Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
  • Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
  • Hands-on experience with observability stacks (Prometheus, Grafana, New Relic) for model and agent performance tracking
  • Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
Job Responsibility:
  • Define enterprise-level standards and reference architectures for ML-Ops and AIOps systems
  • Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
  • Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
  • Lead incident response and reliability strategies for ML/AI systems
  • Lead the deployment of AI models and systems in various environments
  • Collaborate with development teams to integrate AI solutions into existing workflows and applications
  • Ensure seamless integration with different platforms and technologies
  • Define and manage MCP Registry for agentic component onboarding, lifecycle versioning, and dependency governance
  • Build CI/CD pipelines automating LLM agent deployment, policy validation, and prompt evaluation of workflows
  • Develop and operationalize experimentation frameworks for agent evaluations, scenario regression, and performance analytics
What we offer:
  • short-term and long-term incentive programs
  • robust offering of benefit and wellness plans
Employment Type:
Full-time

Senior Manager, AI Engineering

By leading the strategic adoption and scaling of AI across the organisation this...
Location:
United Kingdom, London or Newbury
Salary:
Not provided
Vodafone
Expiration Date:
Until further notice
Requirements:
  • Proven experience in AI strategy, delivery, and enablement
  • Strong understanding of GenAI, ML Ops, and AI governance
  • Familiarity with infrastructure provisioning and model lifecycle
  • Ability to influence cross-functional teams and stakeholders
  • Experience in training, consulting, and change management
  • Knowledge of privacy, security, and ethical AI practices
Job Responsibility:
  • Define and deliver the AI strategy and roadmap
  • Build and maintain self-service AI environments and infrastructure
  • Implement use cases to demonstrate business value
  • Operate and monitor AI models for accuracy and performance
  • Collaborate with architecture, governance, and security teams
  • Establish best practice and enable reuse across solutions
  • Drive AI enablement through training and consulting
  • Evangelise AI adoption across internal and customer-facing teams
  • Monitor industry trends and pilot emerging opportunities
  • Measure and report on efficiency gains and impact
What we offer:
  • Great pay, bonuses, up to 28 days off plus bank holidays, and paid time for charity work
  • Personalise benefits for you and your family, like discounts, vouchers, a pension plan and loads more
  • Amazing learning tools and top-notch parental leave policies
Employment Type:
Full-time

Tech Specialist 3

We are seeking a Technical Specialist 3 to join our Security and Electronic Syst...
Location:
United Kingdom
Salary:
Not provided
M.C. Dean, Inc
Expiration Date:
Until further notice
Requirements:
  • Requires 10 or more years of experience with installation, troubleshooting and/or testing of EPMS, BMS, Industrial instrumentation and/or SCADA systems
  • 10+ years of experience with a High School Diploma
  • 8+ years of experience with an Associate's degree
  • Requires reliable attendance at customer site during work hours required by customer
  • Ability to travel up to 25%
  • Preferred Industry Certifications: CCST certification, Security+ certification
Job Responsibility:
  • Analyze system performance and recommend improvements
  • Provide support to the system administration team
  • Use advanced knowledge of networking principles and system administration skills to troubleshoot systems and operating systems
  • Communicate with customers, manufacturers, vendors and system administrators
  • Train system users and other system support personnel
  • Analyze and modify preventive maintenance checklists for system changes
  • Troubleshoot system performance issues and implement corrective actions
  • Organize and prepare detailed documentation of system performance, including service request records and analysis
  • Perform desktop hardware and operating system set-up, imaging software loading, and antivirus updates
  • Conduct in-depth research and evaluate the research of Technical Specialists in order to evaluate existing and future systems
What we offer:
  • A collaborative team inspired by the way engineering and innovation enhance customer outcomes, improve lives, and change the world for the better
  • An opportunity to lead and build a business with the support of an industry-leading firm that has been in business for 75 years
  • Investment in your skills and expertise through a combination of professional and technical training programs, including leadership training and tuition reimbursement
  • Open and transparent communication with senior leadership as well as local office management
Employment Type:
Full-time

Property Accountant

A well-established nonprofit organization focused on affordable housing is seeki...
Location:
United States, Los Angeles
Salary:
Not provided
Robert Half
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Accounting, Finance, or related field preferred
  • 3+ years of property accounting experience, preferably within affordable housing, nonprofit, or HUD-funded environments
  • Strong understanding of HUD compliance, affordable housing programs, or nonprofit accounting practices
  • Experience with general ledger, reconciliations, and financial reporting
  • Proficiency in Excel and accounting software systems
  • Strong attention to detail, organization, and ability to manage multiple properties or projects
  • Excellent communication and collaboration skills
Job Responsibility:
  • Manage the full-cycle accounting for assigned affordable housing properties
  • Prepare and analyze monthly, quarterly, and annual financial statements
  • Maintain and reconcile general ledger accounts and bank reconciliations
  • Ensure compliance with HUD regulations and nonprofit financial reporting requirements
  • Prepare and submit financial reports required for HUD, grants, and funding agencies
  • Assist with budget preparation, variance analysis, and forecasting
  • Support year-end audits and coordinate with external auditors
  • Monitor property operating expenses and work with property management teams to ensure financial accuracy
  • Maintain accurate financial records and documentation in accordance with nonprofit accounting standards
  • Assist with grant tracking, restricted funds, and program-based accounting
What we offer:
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan

Embedded Technical Specialist

We are seeking an Embedded Technical Specialist to join our Security and Electro...
Location:
United Kingdom, London
Salary:
Not provided
M.C. Dean, Inc
Expiration Date:
Until further notice
Requirements:
  • Requires 10 or more years of experience with installation, troubleshooting and/or testing of EPMS, BMS, Industrial instrumentation and/or SCADA systems
  • 10+ years of experience with a High School Diploma
  • 8+ years of experience with an Associate's
  • Requires reliable attendance at customer site during work hours required by customer
Job Responsibility:
  • Assume localized ownership of the hardware layer for physical security systems
  • Ensure the stability, resilience, and lifecycle management of critical physical security field hardware infrastructure within the assigned region (London, UK)
  • Maintain the operational availability, resilience, and lifecycle management of field hardware devices, including Network Video Recorders (NVRs), access control panels, IP cameras, and regional visitor management solutions
  • Perform commissioning, firmware upgrades, vulnerability patching, replacements, and advanced hardware troubleshooting for field hardware
  • Provide after-hours support when necessary for critical incidents or regional operational needs
  • Maintain 100% accuracy of regional asset data in SAP’s financial system (ISP), asset management system (CCIR), and privileged account/password management tool (Thycotic)
  • Implement ITIL-based processes for incident, change, and configuration management to improve regional service performance metrics and reduce hardware downtime
  • Serve as the dedicated regional technical liaison for Operations Site Leads and integration partners
  • Provide insight and support for onboarding, access provisioning, and visitor management hardware
  • Ensure the field hardware remains aligned with enterprise IT and compliance requirements
Employment Type:
Full-time

Assistant Manager-Restaurant

Areas of responsibility include Restaurants/Bars and Room Service, if applicable...
Location:
India, Kochi
Salary:
Not provided
Marriott Bonvoy
Expiration Date:
Until further notice
Requirements:
  • High school diploma or GED
  • 4 years experience in the food and beverage, culinary, or related professional area
  • OR 2-year degree from an accredited university in Food Service Management, Hotel and Restaurant Management, Hospitality, Business Administration, or related major
  • 2 years experience in the food and beverage, culinary, or related professional area
Job Responsibility:
  • Assists in the daily supervision of restaurant operations
  • Assists with menu planning
  • Maintains sanitation standards
  • Assists servers and hosts on the floor during peak meal periods
  • Handles employee questions and concerns
  • Monitors employees to ensure performance expectations are met
  • Provides feedback to employees based on observation of service behaviors
  • Assists in supervising daily shift operations
  • Supervises restaurant and all related areas in the absence of the Director of Restaurants or Restaurant Manager
  • Participates in department meetings by communicating a clear and consistent message regarding the departmental goals to produce desired results
Employment Type:
Full-time