CrawlJobs Logo

Big Data Engineer - ML Analytics & Search

Germany, Munich · Job Posted March 21, 2026
Apply Position
Job Link Share

Job Description

We train models on petabyte-scale automotive sensor data, but training is only half the story. Before a single GPU cycle is spent, engineers need to find, filter, evaluate, and understand the data. We build the analytics and search infrastructure that makes petabytes of measurements and recordings queryable in seconds, enabling rapid dataset assembly, quality analysis, and model evaluation at scale.

Job Responsibility

  • Design and build high-performance search and query pipelines over PB-scale MDF4 and MCAP data lakes, enabling ML engineers to find relevant driving scenarios, sensor conditions, and edge cases across billions of records in seconds
  • Build and operate indexing and cataloguing systems for automotive sensor data, including metadata extraction, signal-level indexing, scene tagging, and embedding-based similarity search
  • Implement distributed compute pipelines for large-scale data evaluation, such as batch statistics, distribution analysis, annotation coverage reports, and data-quality scoring
  • Build fast analytical queries that enable interactive exploration on top of raw data
  • Develop dataset assembly pipelines that automatically assemble, version, and register training and evaluation datasets
  • Optimise for cost and performance through intelligent partitioning, tiered storage, caching strategies, and query pushdown to minimise scan volumes over PB-scale data
  • Operate observability stacks for data pipelines, including query latency dashboards, pipeline health, and data freshness monitors

Requirements

  • University degree in Computer Science, Engineering, or a related field
  • 3–5 years of experience in big data or data engineering with a focus on analytics and search over very large datasets
  • Strong Python and SQL skills, with experience in at least one distributed compute framework
  • Experience with columnar or analytical storage and query optimisation at PB scale
  • Familiarity with search and indexing technologies, including full-text search, vector/embedding search or metadata catalogues
  • Production experience with Kubernetes and AWS / Azure / Google Cloud, as well as hands-on experience with infrastructure-as-code
  • Experience with automotive measurement data (MDF4/ASAM MDF or MCAP) as well as with embedding-based retrieval, dataset management tools, stream processing, or graph-based metadata systems

What we offer

  • Challenging projects with which we shape the mobility of tomorrow together
  • Wide range of personal and professional development opportunities
  • Attractive, fair and performance-related remuneration
  • High level of job security
  • Annual special payments such as vacation pay, Christmas bonus, and profit sharing
  • Flexible working hours including six weeks annual leave and overtime compensation
  • Discounted BMW & MINI conditions

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Big Data Engineer - ML Analytics & Search

8 matching positions

Cloud Solution Architect - Data Analytics

With more than 45,000 employees and partners worldwide, the Customer Experience ...
Location
Location
United States , Multiple Locations
Salary
Salary:
106400.00 - 203600.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field AND 4+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting
  • OR equivalent experience.
  • 4+ years experience working in a customer-facing role (e.g., internal and/or external).
  • 4+ years experience working on technical projects.
  • Technical Certification in Cloud (e.g., Azure, Amazon Web Services, Google, security certifications).
  • Technical depth in one of the following Data Analytics and AI Platform Cloud solutions: NoSQL Databases including OSS (Maria, Mongo etc), CosmosDB
  • Big Data including Azure Synapse, Snowflake, Big Query, Redshift
  • Machine Learning including Azure ML, ML Server
  • Artificial Intelligence including BOT framework, Cognitive Services, Cognitive Search
  • Expertise in data estate workloads like HDInsight, Hadoop, Cloudera, Spark, Python
Job Responsibility
Job Responsibility
  • Plan and deliver proactive and reactive support including onsite presence as needed (post Covid restrictions).
  • Work within a larger virtual account team to strengthen customer relationships and work on mobile-first, cloud-first strategies for immediate and long-term service delivery plans.
  • Identify and manage goals and opportunities across Big Data platform to improve the health, performance, and availability.
  • Drive and participate in proactive delivery management, spot performance issues, analyze problems, and provide solutions to meet customer needs.
  • Work with support teams, account teams, product engineering teams and other stakeholders to ensure a streamlined customer experience.
  • Apply lessons learned for continuous process and delivery improvement for the customer.
  • Engage in meetings with customers and account teams to articulate service offerings.
  • Share and gain knowledge through communities.
  • Contribute to on-call rotations to ensure a high quality of service for critical incidents created by Support for Mission Critical customers.
  • Fulltime
Read More
Arrow Right

Software Engineer II (Search Quality)

Bloomreach is seeking a Backend Software Engineer to join our Search Quality tea...
Location
Location
India
Salary
Salary:
Not provided
bloomreach.com Logo
Bloomreach
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS/MS in Computer Science or a related field
  • 2+ years of professional software engineering experience building backend systems using Java or Python
  • Strong grasp of computer science fundamentals including algorithms, data structures, and distributed systems
  • Experience working with cloud environments (AWS or GCP) and containerized deployments (e.g., Docker, Kubernetes)
  • Proven experience with distributed systems, microservices architecture, and large-scale data pipelines
  • Experience with big data technologies such as Hadoop, Spark, Kafka, and data lakes
  • Strong analytical and debugging skills
  • passion for clean code and sustainable software practices
  • Interest in or exposure to machine learning technologies in real-world applications
Job Responsibility
Job Responsibility
  • Design, develop, and maintain backend services and distributed systems powering search at scale
  • Collaborate with applied scientists and ML engineers to bring research prototypes into production
  • Work across the full stack of our AI Search architecture, from ingest and indexing to query-time ranking and retrieval
  • Integrate big data and real-time streaming systems (e.g., Kafka, Spark) to process and learn from user behavior at scale
  • Optimize for low-latency and high-availability performance across hundreds of millions of queries per day
  • Operate in a fast-paced, collaborative environment, where your work will directly influence customer success
What we offer
What we offer
  • A great deal of freedom and trust
  • flexible working hours
  • work virtual-first
  • company events
  • 5 paid days off to volunteer
  • People Development Program
  • communication coach available
  • Leader Development Program
  • $1,500 professional education budget annually
  • Employee Assistance Program with counselors
  • Fulltime
Read More
Arrow Right

Senior Principal Data Platform Software Engineer

We’re looking for a Sr Principal Data Platform Software Engineer (P70) to be a k...
Location
Location
Salary
Salary:
239400.00 - 312550.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years in Data Engineering, Software Engineering, or related roles, with substantial exposure to big data ecosystems
  • Demonstrated experience building and operating data platforms or large‑scale data services in production
  • Proven track record of building services from the ground up (requirements → design → implementation → deployment → ongoing ownership)
  • Hands‑on experience with AWS, GCP (e.g., compute, storage, data, and streaming services) and cloud‑native architectures
  • Practical experience with big data technologies, such as Databricks, Apache Spark, AWS EMR, Apache Flink, or StarRocks
  • Strong programming skills in one or more of: Kotlin, Scala, Java, Python
  • Experience leading cross‑team technical initiatives and influencing senior stakeholders
  • Experience mentoring Staff/Principal engineers and lifting the technical bar for a team or org
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
Job Responsibility
Job Responsibility
  • Design, develop and own delivery of high quality big data and analytical platform solutions aiming to solve Atlassian’s needs to support millions of users with optimal cost, minimal latency and maximum reliability
  • Improve and operate large‑scale distributed data systems in the cloud (primarily AWS, with increasing integration with GCP and Kubernetes‑based microservices)
  • Drive the evolution of our high-performance analytical databases and its integrations with products, cloud infrastructures (AWS and GCP) and isolated cloud environments
  • Help define and uplift engineering and operational standards for petabyte scale data platforms, with sub‑second analytic queries and multi‑region availability (coding guidelines, code review practices, observability, incident response, SLIs/SLOs)
  • Partner across multiple product and platform teams (including Analytics, Marketplace/Ecosystem, Core Data Platform, ML Platform, Search, and Oasis/FedRAMP) to deliver company‑wide initiatives that depend on reliable, high‑quality data
  • Act as a technical mentor and multiplier, raising the bar on design quality, code quality, and operational excellence across the broader team
  • Design and implement self‑healing, resilient data platforms with strong observability, fault tolerance, and recovery characteristics
  • Own the long‑term architecture and technical direction of Atlassian’s product data platform with projects that are directly tied to Atlassian’s company-level OKRs
  • Be accountable for the reliability, cost efficiency, and strategic direction of Atlassian’s product analytical data platform
  • Partner with executives and influence senior leaders to align engineering efforts with Atlassian’s long-term business objectives
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
  • Fulltime
Read More
Arrow Right

AI Machine Learning Scientist

Location
Location
United States of America , INDIANAPOLIS
Salary
Salary:
Not provided
elevancehealth.com Logo
Elevance Health
Expiration Date
July 15, 2026
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in a highly quantitative field: Computer Science, Machine Learning, Operational Research, Analytics, Statistics, Mathematics, or a related field of study
  • Four (4) years of Information Technology (IT) experience, or related experience
  • Experience with Machine Learning techniques including supervised learning (linear regression and classification) and unsupervised learning (clustering)
  • Experience with Deep Learning techniques including convolutional neural networks, recurrent neural networks, and reinforcement learning
  • Experience using Python programming language
  • Experience with big data technologies including Hadoop, Apache Spark, and AWS
  • Experience with relational database management and SQL
  • Experience with Natural Language Processing (NLP) techniques including sentiment analysis and text classification
  • Experience with statistical analysis and application of statistical methods
  • Employer will accept a Master’s degree in a highly quantitative field: Computer Science, Machine Learning, Operational Research, Analytics, Statistics, Mathematics, or a related field of study and Two (2) years of Information Technology (IT) experience, or related. Must have the skills listed above.
Job Responsibility
Job Responsibility
  • Leverage Artificial Intelligence (AI) scientific and statistical methods to assist with product creation, development, and improvement
  • Engage with product teams and business stakeholders to align on project objectives and ensure AI models meet business goals
  • Lead initiatives for developing Machine Learning (ML), Natural Language Processing (NLP), and Large Language Models (LLM) LLM models
  • Play a critical role in steering the strategic direction for ML, NLP, LLM, and algorithm development within the company’s AI/ML team, collaborating with distinguished experts in AI/ML modeling, ML engineering, data science, and data engineering
  • Define and articulate roadmaps for AI/ML model development, acting as a key figure in AI-driven transformation to deliver value internally and to customers
  • Design and develop customized ML, GenAI, NLP, and LLM models for both batch and stream processing-based AI/ML pipelines. This includes handling data ingestion, preprocessing, search and retrieval, Retrieval Augmented Generation (RAG), and ensuring that complete solutions meet all technical and business requirements, as well as Service Level Agreement (SLA) specifications
  • Work closely with MLOps, machine learning engineers, and software engineers to ensure seamless integration of machine learning models into production systems
  • Collaborate with the MLOps team to create and maintain strong evaluation solutions and tools that assess model performance, accuracy, consistency, and reliability during development and UAT
  • Mentor junior team members and influence the strategic direction of our ML and data science projects.
What we offer
What we offer
  • merit increases
  • paid holidays
  • Paid Time Off
  • incentive bonus programs
  • medical
  • dental
  • vision
  • short and long term disability benefits
  • 401(k) +match
  • stock purchase plan
  • Fulltime
Read More
Arrow Right

Data & AI Consultant

Microsoft Industry Solutions - Global Center for Innovation and Delivery Center ...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4 - 6 years of experience
  • Bachelor's degree in computer science engineering or equivalent work experience
  • Knowledge of solution design, planning, development and deployment of complex solutions
  • Hands-on experience in Data Engineering across cloud, on-prem, and hybrid environments
  • Strong foundational experience with Azure Data Services and data platform modernization initiatives
  • Handson exposure to Data Warehouse and analytics using platforms like Microsoft Fabric, and Azure Synapse Analytics
  • Experience/knowledge of one or more SQL and NoSQL database systems
  • Hands-on experience building AI-powered data pipelines using ETL/ELT tools like: Azure Data Factory (ADF), SSIS, Talend, Informatica, Airflow
  • Exposure to data migrations, platform upgrades, and modernization efforts
  • Understanding of multitenant data platform designs, basic security hardening, and access control concepts
Job Responsibility
Job Responsibility
  • Works as an Individual contributor and key member of the Data and AI team and helps in timely execution of assigned deliverables with accurate estimates, work priorities, and accommodates project changes and trade-offs necessary for a successful release
  • Applies technical experience and industry-specific knowledge to develop solutions, based on an analysis of how the proposed approach affects the business objectives of customers and partners
  • Works to accelerate the value proposition of customer/partner engagements by helping to design, develop, and deploy solutions on Microsoft technologies and methodologies
  • Contributes to the overall efficacy and quality of a project team’s technical delivery within assigned engagements
  • Defines dependencies and risks that go beyond the immediate scope and timeframe for a complex project
  • Develops contingency plans, risk-mitigation implementation criteria, and alternative strategies to manage short- and long-term risks and manages technical escalations
  • Drives opportunities to expand or accelerate the adoption and consumption of cloud and Microsoft technologies. Collaborates, as appropriate, with peers and other teams (e.g., Sales, account-aligned team) to scale the business with existing high-stake or strategic customers, by articulating/developing value propositions of strategic Microsoft products and services
  • Align with innovation and digital transformation initiatives. Ensures the use of existing intellectual property (IP) and delivers value to customers
  • Responsible for implementing the technology strategy with support from Senior peers
  • Applies information-compliance and assurance policies to ensure stakeholder confidence
  • Fulltime
Read More
Arrow Right

Senior Consultant Data

Microsoft Industry Solutions - Global Center for Innovation and Delivery Center ...
Location
Location
India
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience
  • Bachelor's degree in computer science engineering or equivalent work experience
  • Knowledge of solution design, planning, development and deployment of complex solutions
  • Hands-on experience in Data Engineering across cloud, on-prem, and hybrid environments
  • Strong foundational experience with Azure Data Services and data platform modernization initiatives
  • Handson exposure to Data Warehouse and analytics using platforms like Microsoft Fabric, and Azure Synapse Analytics
  • Experience/knowledge of one or more SQL and NoSQL database systems
  • Hands-on experience building AI-powered data pipelines using ETL/ELT tools like: Azure Data Factory (ADF), SSIS, Talend, Informatica, Airflow
  • Exposure to data migrations, platform upgrades, and modernization efforts
  • Understanding of multitenant data platform designs, basic security hardening, and access control concepts
Job Responsibility
Job Responsibility
  • Works as an Individual contributor and key member of the Data and AI team and helps in timely execution of assigned deliverables with accurate estimates, work priorities, and accommodates project changes and trade-offs necessary for a successful release
  • Applies technical experience and industry-specific knowledge to develop solutions, based on an analysis of how the proposed approach affects the business objectives of customers and partners
  • Works to accelerate the value proposition of customer/partner engagements by helping to design, develop, and deploy solutions on Microsoft technologies and methodologies
  • Contributes to the overall efficacy and quality of a project team's technical delivery within assigned engagements
  • Defines dependencies and risks that go beyond the immediate scope and timeframe for a complex project
  • Drives opportunities to expand or accelerate the adoption and consumption of cloud and Microsoft technologies
  • Align with innovation and digital transformation initiatives
  • Responsible for implementing the technology strategy with support from Senior peers
  • Applies information-compliance and assurance policies to ensure stakeholder confidence
  • Drives new ways of thinking, across the division and subsidiary, to improve quality, engineering productivity, and responsiveness to feedback and changing priorities
  • Fulltime
Read More
Arrow Right

Consultant A2 - Data & AI

Microsoft Industry Solutions - Global Center for Innovation and Delivery Center ...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4 - 6 years of experience
  • Bachelor's degree in computer science engineering or equivalent work experience
  • Knowledge of solution design, planning, development and deployment of complex solutions
  • Hands-on experience in Data Engineering across cloud, on-prem, and hybrid environments
  • Strong foundational experience with Azure Data Services and data platform modernization initiatives
  • Handson exposure to Data Warehouse and analytics using platforms like Microsoft Fabric, and Azure Synapse Analytics
  • Experience/knowledge of one or more SQL and NoSQL database systems
  • Hands-on experience building AI-powered data pipelines using ETL/ELT tools like Azure Data Factory (ADF), SSIS, Talend, Informatica, Airflow
  • Exposure to data migrations, platform upgrades, and modernization efforts
  • Understanding of multitenant data platform designs, basic security hardening, and access control concepts
Job Responsibility
Job Responsibility
  • Works as an Individual contributor and key member of the Data and AI team and helps in timely execution of assigned deliverables with accurate estimates, work priorities, and accommodates project changes and trade-offs necessary for a successful release
  • Applies technical experience and industry-specific knowledge to develop solutions, based on an analysis of how the proposed approach affects the business objectives of customers and partners
  • Works to accelerate the value proposition of customer/partner engagements by helping to design, develop, and deploy solutions on Microsoft technologies and methodologies
  • Contributes to the overall efficacy and quality of a project team’s technical delivery within assigned engagements
  • Defines dependencies and risks that go beyond the immediate scope and timeframe for a complex project. Develops contingency plans, risk-mitigation implementation criteria, and alternative strategies to manage short- and long-term risks and manages technical escalations
  • Drives opportunities to expand or accelerate the adoption and consumption of cloud and Microsoft technologies. Collaborates, as appropriate, with peers and other teams (e.g., Sales, account-aligned team) to scale the business with existing high-stake or strategic customers, by articulating/developing value propositions of strategic Microsoft products and services
  • Align with innovation and digital transformation initiatives. Ensures the use of existing intellectual property (IP) and delivers value to customers
  • Responsible for implementing the technology strategy with support from Senior peers
  • Applies information-compliance and assurance policies to ensure stakeholder confidence
  • Drives new ways of thinking, across the division and subsidiary, to improve quality, engineering productivity, and responsiveness to feedback and changing priorities
  • Fulltime
Read More
Arrow Right

Senior Software Engineer

Microsoft is a company where passionate innovators come to collaborate, envision...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor and/or Graduate degree in computer science, engineering or equivalent
  • Experience: 7+ years of professional development
  • Technical Stack: C++, C#, big data, ML, NLP, and search technologies
  • Focus Areas: Strong analytical skills, customer engagement, and cross-functional collaboration
  • Familiarity with Microsoft Cloud Services (Azure, Entra, O365)
  • Security awareness (penetration testing, threat analysis)
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Collaborates with appropriate stakeholders to determine user requirements for a scenario
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI)
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
  • Fulltime
Read More
Arrow Right