Big Data Engineer - ML Analytics & Search

BMW

Location:
Germany, Munich

Contract Type:
Not provided

Salary:
Not provided

Job Description:

We train models on petabyte-scale automotive sensor data, but training is only half the story. Before a single GPU cycle is spent, engineers need to find, filter, evaluate, and understand the data. We build the analytics and search infrastructure that makes petabytes of measurements and recordings queryable in seconds, enabling rapid dataset assembly, quality analysis, and model evaluation at scale.

Job Responsibility:

  • Design and build high-performance search and query pipelines over PB-scale MDF4 and MCAP data lakes, enabling ML engineers to find relevant driving scenarios, sensor conditions, and edge cases across billions of records in seconds
  • Build and operate indexing and cataloguing systems for automotive sensor data, including metadata extraction, signal-level indexing, scene tagging, and embedding-based similarity search
  • Implement distributed compute pipelines for large-scale data evaluation, such as batch statistics, distribution analysis, annotation coverage reports, and data-quality scoring
  • Build fast analytical queries that enable interactive exploration on top of raw data
  • Develop dataset assembly pipelines that automatically assemble, version, and register training and evaluation datasets
  • Optimise for cost and performance through intelligent partitioning, tiered storage, caching strategies, and query pushdown to minimise scan volumes over PB-scale data
  • Operate observability stacks for data pipelines, including query latency dashboards, pipeline health, and data freshness monitors

Requirements:

  • University degree in Computer Science, Engineering, or a related field
  • 3–5 years of experience in big data or data engineering with a focus on analytics and search over very large datasets
  • Strong Python and SQL skills, with experience in at least one distributed compute framework
  • Experience with columnar or analytical storage and query optimisation at PB scale
  • Familiarity with search and indexing technologies, including full-text search, vector/embedding search or metadata catalogues
  • Production experience with Kubernetes and AWS / Azure / Google Cloud, as well as hands-on experience with infrastructure-as-code
  • Experience with automotive measurement data (MDF4/ASAM MDF or MCAP) as well as with embedding-based retrieval, dataset management tools, stream processing, or graph-based metadata systems

What we offer:

  • Challenging projects with which we shape the mobility of tomorrow together
  • Wide range of personal and professional development opportunities
  • Attractive, fair and performance-related remuneration
  • High level of job security
  • Annual special payments such as vacation pay, Christmas bonus, and profit sharing
  • Flexible working hours, six weeks of annual leave, and overtime compensation
  • Discounted BMW & MINI conditions

Additional Information:

Job Posted:
March 21, 2026

Similar Jobs for Big Data Engineer - ML Analytics & Search

Senior Principal Data Platform Software Engineer

We’re looking for a Sr Principal Data Platform Software Engineer (P70) to be a k...
Salary:
239400.00 - 312550.00 USD / Year
Atlassian
Expiration Date
Until further notice
Requirements:
  • 15+ years in Data Engineering, Software Engineering, or related roles, with substantial exposure to big data ecosystems
  • Demonstrated experience building and operating data platforms or large‑scale data services in production
  • Proven track record of building services from the ground up (requirements → design → implementation → deployment → ongoing ownership)
  • Hands‑on experience with AWS, GCP (e.g., compute, storage, data, and streaming services) and cloud‑native architectures
  • Practical experience with big data technologies, such as Databricks, Apache Spark, AWS EMR, Apache Flink, or StarRocks
  • Strong programming skills in one or more of: Kotlin, Scala, Java, Python
  • Experience leading cross‑team technical initiatives and influencing senior stakeholders
  • Experience mentoring Staff/Principal engineers and lifting the technical bar for a team or org
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
Job Responsibility:
  • Design, develop and own delivery of high quality big data and analytical platform solutions aiming to solve Atlassian’s needs to support millions of users with optimal cost, minimal latency and maximum reliability
  • Improve and operate large‑scale distributed data systems in the cloud (primarily AWS, with increasing integration with GCP and Kubernetes‑based microservices)
  • Drive the evolution of our high-performance analytical databases and their integrations with products, cloud infrastructures (AWS and GCP) and isolated cloud environments
  • Help define and uplift engineering and operational standards for petabyte scale data platforms, with sub‑second analytic queries and multi‑region availability (coding guidelines, code review practices, observability, incident response, SLIs/SLOs)
  • Partner across multiple product and platform teams (including Analytics, Marketplace/Ecosystem, Core Data Platform, ML Platform, Search, and Oasis/FedRAMP) to deliver company‑wide initiatives that depend on reliable, high‑quality data
  • Act as a technical mentor and multiplier, raising the bar on design quality, code quality, and operational excellence across the broader team
  • Design and implement self‑healing, resilient data platforms with strong observability, fault tolerance, and recovery characteristics
  • Own the long‑term architecture and technical direction of Atlassian’s product data platform with projects that are directly tied to Atlassian’s company-level OKRs
  • Be accountable for the reliability, cost efficiency, and strategic direction of Atlassian’s product analytical data platform
  • Partner with executives and influence senior leaders to align engineering efforts with Atlassian’s long-term business objectives
What we offer:
  • health and wellbeing resources
  • paid volunteer days
  • Fulltime

Cloud Solution Architect - Data Analytics

With more than 45,000 employees and partners worldwide, the Customer Experience ...
Location:
United States, Multiple Locations
Salary:
106400.00 - 203600.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field AND 4+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting
  • OR equivalent experience.
  • 4+ years experience working in a customer-facing role (e.g., internal and/or external).
  • 4+ years experience working on technical projects.
  • Technical Certification in Cloud (e.g., Azure, Amazon Web Services, Google, security certifications).
  • Technical depth in one of the following Data Analytics and AI Platform Cloud solutions: NoSQL Databases including OSS (Maria, Mongo etc), CosmosDB
  • Big Data including Azure Synapse, Snowflake, Big Query, Redshift
  • Machine Learning including Azure ML, ML Server
  • Artificial Intelligence including BOT framework, Cognitive Services, Cognitive Search
  • Expertise in data estate workloads like HDInsight, Hadoop, Cloudera, Spark, Python
Job Responsibility:
  • Plan and deliver proactive and reactive support including onsite presence as needed (post Covid restrictions).
  • Work within a larger virtual account team to strengthen customer relationships and work on mobile-first, cloud-first strategies for immediate and long-term service delivery plans.
  • Identify and manage goals and opportunities across Big Data platform to improve the health, performance, and availability.
  • Drive and participate in proactive delivery management, spot performance issues, analyze problems, and provide solutions to meet customer needs.
  • Work with support teams, account teams, product engineering teams and other stakeholders to ensure a streamlined customer experience.
  • Apply lessons learned for continuous process and delivery improvement for the customer.
  • Engage in meetings with customers and account teams to articulate service offerings.
  • Share and gain knowledge through communities.
  • Contribute to on-call rotations to ensure a high quality of service for critical incidents created by Support for Mission Critical customers.
  • Fulltime

Consultant - Data & AI

Microsoft Industry Solutions - Global Center for Innovation and Delivery Center ...
Location:
India, Hyderabad
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements:
  • 4 - 10 years of experience
  • Bachelor's degree in computer science engineering or equivalent work experience
  • Higher relevant education is preferred
  • Knowledge of solution design, planning, development and deployment of complex solutions
  • One or more of the following certifications, or an equivalent industry certification is a plus: Microsoft Certified: Azure Data Engineer Associate (DP-600) / Microsoft Certified: Azure AI Engineer Associate (AI102) / Microsoft Certified: Azure Solution Architect Expert (AZ-305)
  • Core Data Engineering & Platform Skills
  • Hands-on experience in Data Engineering across cloud, on-prem, and hybrid environments
  • Strong foundational experience with Azure Data Services and data platform modernization initiatives
  • Hands-on exposure to Data Warehouse and analytics using platforms like Microsoft Fabric and Azure Synapse Analytics
  • Azure Databricks is a plus
Job Responsibility:
  • Works as an Individual contributor and key member of the Data and AI team and helps in timely execution of assigned deliverables with accurate estimates, work priorities, and accommodates project changes and trade-offs necessary for a successful release
  • Applies technical experience and industry-specific knowledge to develop solutions, based on an analysis of how the proposed approach affects the business objectives of customers and partners
  • Works to accelerate the value proposition of customer/partner engagements by helping to design, develop, and deploy solutions on Microsoft technologies and methodologies
  • Contributes to the overall efficacy and quality of a project team’s technical delivery within assigned engagements
  • Defines dependencies and risks that go beyond the immediate scope and timeframe for a complex project
  • Develops contingency plans, risk-mitigation implementation criteria, and alternative strategies to manage short- and long-term risks and manages technical escalations
  • Drives opportunities to expand or accelerate the adoption and consumption of cloud and Microsoft technologies
  • Collaborates, as appropriate, with peers and other teams (e.g., Sales, account-aligned team) to scale the business with existing high-stake or strategic customers, by articulating/developing value propositions of strategic Microsoft products and services
  • Align with innovation and digital transformation initiatives
  • Ensures the use of existing intellectual property (IP) and delivers value to customers
  • Fulltime

Senior Applied Scientist - Image & Video Search Algorithm

Are you interested in solving large‑scale search problems and building next‑gene...
Location:
China, Beijing
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements:
  • Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 5+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
  • OR equivalent experience
  • Proven track record of impact in search engines or recommendation systems
  • Solid understanding of search engine fundamentals, including query, user, and content understanding, retrieval / recall architectures, ranking and relevance optimization
  • Experience building production machine learning systems, from data modeling to online deployment
  • Hands‑on experience in one or more of the following areas: Information retrieval, Recommendation systems, Computer vision or video understanding, Deep learning, NLP, or multimodal learning
  • Proficiency in Python and modern ML frameworks (e.g., PyTorch, TensorFlow)
  • Solid communication and collaboration skills in cross‑functional, global teams
  • MS or PhD in Computer Science or a related field
Job Responsibility:
  • Design, develop, and improve large‑scale image and video search algorithms, with a focus on retrieval, recall, ranking, and relevance optimization
  • Build and operate multi‑stage search pipelines that power production image and video search experiences
  • Develop LLM‑powered search APIs and applications, integrating large language models with search, retrieval, and ranking systems to enable new developer and user experiences
  • Apply machine learning, deep learning, multimodal models, and LLM techniques to improve search quality, engagement, and long‑term product growth
  • Own the end‑to‑end machine learning lifecycle: Problem formulation and metric definition, Offline training, evaluation, and optimization, Online experimentation (A/B testing), Production deployment and monitoring
  • Build billion‑scale, low‑latency ML systems integrated into Bing and Windows search runtimes
  • Collaborate with researchers and partner teams to explore and productionize state‑of‑the‑art techniques for image, video, and LLM‑driven search scenarios

Senior Software Engineer

Microsoft is a company where passionate innovators come to collaborate, envision...
Location:
India, Hyderabad
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements:
  • Bachelor and/or Graduate degree in computer science, engineering or equivalent
  • Experience: 7+ years of professional development
  • Technical Stack: C++, C#, big data, ML, NLP, and search technologies
  • Focus Areas: Strong analytical skills, customer engagement, and cross-functional collaboration
  • Familiarity with Microsoft Cloud Services (Azure, Entra, O365)
  • Security awareness (penetration testing, threat analysis)
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility:
  • Collaborates with appropriate stakeholders to determine user requirements for a scenario
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI)
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
  • Fulltime

Consultant A2 - Data & AI

Microsoft Industry Solutions - Global Center for Innovation and Delivery Center ...
Location:
India, Hyderabad
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements:
  • 4 - 10 years of experience
  • Bachelor's degree in computer science engineering or equivalent work experience
  • Knowledge of solution design, planning, development and deployment of complex solutions
  • Core Data Engineering & Platform Skills
  • Hands-on experience in Data Engineering across cloud, on-prem, and hybrid environments
  • Strong foundational experience with Azure Data Services and data platform modernization initiatives
  • Hands-on exposure to Data Warehouse and analytics using platforms like Microsoft Fabric and Azure Synapse Analytics
  • Experience/knowledge of one or more SQL and NoSQL database systems
  • Hands-on experience building AI-powered data pipelines using ETL/ELT tools like: Azure Data Factory (ADF), SSIS, Talend, Informatica, Airflow
  • Exposure to data migrations, platform upgrades, and modernization efforts
Job Responsibility:
  • Works as an Individual contributor and key member of the Data and AI team and helps in timely execution of assigned deliverables with accurate estimates, work priorities, and accommodates project changes and trade-offs necessary for a successful release
  • Applies technical experience and industry-specific knowledge to develop solutions, based on an analysis of how the proposed approach affects the business objectives of customers and partners
  • Works to accelerate the value proposition of customer/partner engagements by helping to design, develop, and deploy solutions on Microsoft technologies and methodologies
  • Contributes to the overall efficacy and quality of a project team’s technical delivery within assigned engagements
  • Defines dependencies and risks that go beyond the immediate scope and timeframe for a complex project
  • Develops contingency plans, risk-mitigation implementation criteria, and alternative strategies to manage short- and long-term risks and manages technical escalations
  • Drives opportunities to expand or accelerate the adoption and consumption of cloud and Microsoft technologies
  • Collaborates, as appropriate, with peers and other teams (e.g., Sales, account-aligned team) to scale the business with existing high-stake or strategic customers, by articulating/developing value propositions of strategic Microsoft products and services
  • Align with innovation and digital transformation initiatives
  • Ensures the use of existing intellectual property (IP) and delivers value to customers
  • Fulltime

Software Engineer II (Search Quality)

Bloomreach is seeking a Backend Software Engineer to join our Search Quality tea...
Location:
India
Salary:
Not provided
Bloomreach
Expiration Date
Until further notice
Requirements:
  • BS/MS in Computer Science or a related field
  • 2+ years of professional software engineering experience building backend systems using Java or Python
  • Strong grasp of computer science fundamentals including algorithms, data structures, and distributed systems
  • Experience working with cloud environments (AWS or GCP) and containerized deployments (e.g., Docker, Kubernetes)
  • Proven experience with distributed systems, microservices architecture, and large-scale data pipelines
  • Experience with big data technologies such as Hadoop, Spark, Kafka, and data lakes
  • Strong analytical and debugging skills
  • Passion for clean code and sustainable software practices
  • Interest in or exposure to machine learning technologies in real-world applications
Job Responsibility:
  • Design, develop, and maintain backend services and distributed systems powering search at scale
  • Collaborate with applied scientists and ML engineers to bring research prototypes into production
  • Work across the full stack of our AI Search architecture, from ingest and indexing to query-time ranking and retrieval
  • Integrate big data and real-time streaming systems (e.g., Kafka, Spark) to process and learn from user behavior at scale
  • Optimize for low-latency and high-availability performance across hundreds of millions of queries per day
  • Operate in a fast-paced, collaborative environment, where your work will directly influence customer success
What we offer:
  • A great deal of freedom and trust
  • flexible working hours
  • work virtual-first
  • company events
  • 5 paid days off to volunteer
  • People Development Program
  • communication coach available
  • Leader Development Program
  • $1,500 professional education budget annually
  • Employee Assistance Program with counselors
  • Fulltime