CrawlJobs Logo

Engineering Manager - Datasets Enrichment

wayve.ai Logo

Wayve

Location Icon

Location:
United Kingdom , London

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are hiring an Engineering Manager (M4) to lead the team responsible for both semantic enrichment pipelines and the final silver and gold layers of the Wayve Corpus. This team transforms multimodal driving data and perception model outputs into reliable, high quality data products used across autonomy, evaluation, simulation, and research. The role combines high scale ML in the loop enrichment pipelines such as semantic segmentation, cuboid annotation, embeddings, and BC and ODD signals with production grade data engineering ownership including schema governance, table interfaces, quality gates, lineage, and SLO based operations. You will lead a team of up to 10 engineers across ML engineering, perception, and data engineering. You will own a multi quarter roadmap that scales enrichment throughput, improves data quality, and hardens corpus tables used across Wayve. You will partner with application, model training, and evaluation, teams to ensure alignment on requirements and interfaces. This role requires a leader comfortable at the intersection of ML systems and data engineering who can provide clear direction, reliable delivery, and strong people leadership during a period of significant technical and organizational scaling.

Job Responsibility:

  • Lead, coach, and grow a team of up to 10 engineers across ML engineering, perception, and data engineering
  • Define team structure, roles, leveling, hiring needs, and long term growth plans
  • Own and scale semantic enrichment pipelines including semantic segmentation, cuboids, embeddings, scenario, and ODD classification
  • Integrate ML assisted labeling, validation, and automated quality checks into enrichment workflows
  • Own the silver and gold layers of the Wayve Corpus including schema evolution, versioning, documentation, lineage, observability, and SLO backed operations
  • Establish data quality gates and quality metrics for enriched and corpus level data
  • Deliver a multi quarter roadmap spanning enrichment and corpus systems with predictable execution
  • Lead architecture decisions to improve efficiency, maintainability, and reliability
  • Partner with Data Platform on distributed compute systems including Spark, Databricks, Ray, and Flyte
  • Align with autonomy, evaluation, and research teams on corpus requirements, interfaces, and lifecycle

Requirements:

  • 2+ years managing engineering teams in ML systems, perception, or large scale data infrastructure
  • Experience delivering ML in production or perception pipelines, or strong experience in production data engineering systems. Ideally exposure to both
  • Proven ownership of production data tables such as Delta Lake, Spark, Hive, or BigQuery including schema evolution and multi team consumers
  • Experience with distributed compute systems such as Spark, Databricks, Ray, or Flyte
  • Experience building observable, high throughput pipelines
  • Ability to lead multi quarter delivery, manage dependencies, and align with multiple stakeholders
  • Strong communication and cross functional collaboration skills

Nice to have:

  • Experience with multimodal perception data such as images, video, and LiDAR
  • Experience with annotation workflows or ML assisted labeling systems
  • Experience with embeddings, feature stores, or ML data layers
  • Familiarity with data quality frameworks and operational analytics
  • Experience in autonomous vehicles, robotics, or large scale computer vision systems

Additional Information:

Job Posted:
January 01, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Engineering Manager - Datasets Enrichment

Senior Platform Engineer, ML Data Systems

We’re looking for an ML Data Engineer to evolve our eval dataset tools to meet t...
Location
Location
United States , Mountain View
Salary
Salary:
137871.00 - 172339.00 USD / Year
khanacademy.org Logo
Khan Academy
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field
  • 5 years of Software Engineering experience with 3+ of those years working with large ML datasets, especially those in open-source repositories such as Hugging Face
  • Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect)
  • Experience with data versioning tools (e.g., DVC, LakeFS) and cloud storage systems
  • Familiarity with machine learning workflows — from training data preparation to evaluation
  • Familiarity with the architecture and operation of large language models, and a nuanced understanding of their capabilities and limitations
  • Attention to detail and an obsession with data quality and reproducibility
  • Motivated by the Khan Academy mission “to provide a free world-class education for anyone, anywhere.”
  • Proven cross-cultural competency skills demonstrating self-awareness, awareness of other, and the ability to adopt inclusive perspectives, attitudes, and behaviors to drive inclusion and belonging throughout the organization.
Job Responsibility
Job Responsibility
  • Evolve and maintain pipelines for transforming raw trace data into ML-ready datasets
  • Clean, normalize, and enrich data while preserving semantic meaning and consistency
  • Prepare and format datasets for human labeling, and integrate results into ML datasets
  • Develop and maintain scalable ETL pipelines using Airflow, DBT, Go, and Python running on GCP
  • Implement automated tests and validation to detect data drift or labeling inconsistencies
  • Collaborate with AI engineers, platform developers, and product teams to define data strategies in support of continuously improving the quality of Khan’s AI-based tutoring
  • Contribute to shared tools and documentation for dataset management and AI evaluation
  • Inform our data governance strategies for proper data retention, PII controls/scrubbing, and isolation of particularly sensitive data such as offensive test imagery.
What we offer
What we offer
  • Competitive salaries
  • Ample paid time off as needed
  • 8 pre-scheduled Wellness Days in 2026 occurring on a Monday or a Friday for a 3-day weekend boost
  • Remote-first culture - that caters to your time zone, with open flexibility as needed, at times
  • Generous parental leave
  • An exceptional team that trusts you and gives you the freedom to do your best
  • The chance to put your talents towards a deeply meaningful mission and the opportunity to work on high-impact products that are already defining the future of education
  • Opportunities to connect through affinity, ally, and social groups
  • 401(k) + 4% matching & comprehensive insurance, including medical, dental, vision, and life.
  • Fulltime
Read More
Arrow Right

Data Product Specialist

Join our Global Digital & Loyalty team as a Data Product Specialist who lives an...
Location
Location
Poland , Warszawa
Salary
Salary:
Not provided
https://www.circlek.com Logo
Circle K
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Responsive to evolving business needs
  • Analytical thinker who balances creativity with pragmatism
  • Excellent communicator in English, able to translate complex concepts for technical and non-technical stakeholders
  • Customer-obsessed problem solver who collaborates across geographies and levels
  • Curious lifelong learner with a passion for retail, loyalty programs and emerging analytics techniques
  • Higher degree in Engineering, Data Science, Statistics or a related analytical discipline
  • 3+ years of experience in one or more of: data engineering, data analytics or data stewardship
  • Strong awareness of data governance frameworks and privacy regulations (e.g., GDPR)
  • Hands-on with analytical tools such as Power BI, Excel and cloud data warehouses
  • Familiarity with mobile analytics platforms (e.g., Google Analytics) and loyalty program data is a plus
Job Responsibility
Job Responsibility
  • Partner with the Group Product Manager to maintain the data product vision, roadmap and backlog with clear acceptance criteria and measurable outcomes
  • Translate business questions into metric definitions, data models and scalable self-service data products
  • Collaborate with Digital Analytics to ensure accurate tagging of key user events in mobile apps (e.g., GA4)
  • Coordinate with Data Engineering to integrate mobile apps and retail loyalty datasets for a unified end-to-end customer journey
  • Monitor data pipelines, investigate quality issues and implement automated validation, alerting and remediation processes
  • Operationalize data governance, privacy and security requirements
  • Maintain the data catalog and lineage documentation to support discovery
  • Define and enforce data contracts, SLAs and schema-change processes with source teams
  • Liaise with Product Performance Analytics to deliver certified datasets on time for monthly leadership dashboards and ad-hoc reporting
  • Conduct stakeholder interviews to confirm data readiness for key insights and dashboards
What we offer
What we offer
  • Contract of employment
  • Annual bonus
  • Private medical care
  • Cafeteria Platform/Multisport
  • English lessons subsidized by the company
  • Group insurance
  • Attractive discounts for products and services at our stations
  • Employee stock purchase plan
  • Employee Assistance Program (Lyra)
  • Modern and convenient office
  • Fulltime
Read More
Arrow Right
New

Data Analyst

Part of the Technology and Transformation Analytics team, the Data Analyst is an...
Location
Location
United Kingdom
Salary
Salary:
37000.00 GBP / Year
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
February 20, 2026
Flip Icon
Requirements
Requirements
  • Experience collating and analysing management information and performance metrics
  • Strong SQL skills with experience querying relational databases (e.g. SQL Server)
  • Exposure to data modelling, enrichment and transformation techniques
  • Ability to work with complex datasets and apply business logic to analytical outputs
  • Excellent communication skills with the ability to explain insights to technical and non‑technical audiences in clear written English
  • This post is subject to a Disclosure and Barring Service (DBS) check
  • This post is subject to a Counter Terrorism Check (CTC)
  • Be able to provide a valid passport eg. 10 year full British passport, EU or non-EU Passport with indefinite leave to remain
  • Be able to provide continuous UK address history for the previous 5 years
  • Provide full employment history for the previous 3 years and/or suitable documentation to cover any gaps in employment
Job Responsibility
Job Responsibility
  • Design, build and maintain key performance indicators and management reports, delivering weekly and monthly insights, while ensuring accurate integration into strategic reporting cycles
  • Develop scalable Power BI dashboards and data visualisation outputs, underpinned by well governed models, and documented data pipelines
  • Partner with stakeholders across the organisation to define metrics and deliver analysis, translating findings into clear, actionable recommendations and compelling data stories
  • Engineer and manage robust ETL pipelines (including APIs) to integrate diverse data sources, enabling the creation of impactful, real-time reporting solutions
  • Design and maintain fit‑for‑purpose data models, views and transformations that enable reliable, near real‑time reporting solutions
  • Document data lineage and definitions to support transparency and reuse
  • Ensure data quality and integrity through rigorous validation, cleaning and reconciliation processes, fostering trust and reliability in all reporting outputs
  • Contribute to standardised metrics and data dictionaries to promote consistency and clarity across the organisation
What we offer
What we offer
  • Our working week is 35 hours per week offering flexibility and work life balance
  • Enhanced family friendly provisions
  • Employees will gain an extra day annual leave per year to a maximum of 39 days, including bank holidays (pro-rata)
  • Option to buy or sell up to 5 days of annual leave
  • Access to Perkbox, an employee rewards and benefits platform with over 9,000 deals and discounts, a range of free perks, employee wellbeing support and other additional employee benefits and recognitions
  • Wellbeing support
  • Migrant Help offers employees a non-contributory pension scheme Migrant Help pays 8% worth of employee salary into the pension scheme
  • Fulltime
Read More
Arrow Right
New

Data Analyst

Part of the Technology and Transformation Analytics team, the Data Analyst is an...
Location
Location
United Kingdom , Home based
Salary
Salary:
37000.00 GBP / Year
migranthelpuk.org Logo
Migrant Help
Expiration Date
February 20, 2026
Flip Icon
Requirements
Requirements
  • Experience collating and analysing management information and performance metrics
  • Strong SQL skills with experience querying relational databases (e.g. SQL Server)
  • Exposure to data modelling, enrichment and transformation techniques
  • Ability to work with complex datasets and apply business logic to analytical outputs
  • Excellent communication skills with the ability to explain insights to technical and non‑technical audiences in clear written English
  • This post is subject to a Disclosure and Barring Service (DBS) check
  • This post is subject to a Counter Terrorism Check (CTC)
  • Be able to provide a valid passport eg. 10 year full British passport, EU or non-EU Passport with indefinite leave to remain
  • Be able to provide continuous UK address history for the previous 5 years
  • Provide full employment history for the previous 3 years and/or suitable documentation to cover any gaps in employment
Job Responsibility
Job Responsibility
  • Design, build and maintain key performance indicators and management reports, delivering weekly and monthly insights, while ensuring accurate integration into strategic reporting cycles
  • Develop scalable Power BI dashboards and data visualisation outputs, underpinned by well governed models, and documented data pipelines
  • Partner with stakeholders across the organisation to define metrics and deliver analysis, translating findings into clear, actionable recommendations and compelling data stories
  • Engineer and manage robust ETL pipelines (including APIs) to integrate diverse data sources, enabling the creation of impactful, real-time reporting solutions
  • Design and maintain fit‑for‑purpose data models, views and transformations that enable reliable, near real‑time reporting solutions
  • Document data lineage and definitions to support transparency and reuse
  • Ensure data quality and integrity through rigorous validation, cleaning and reconciliation processes, fostering trust and reliability in all reporting outputs
  • Contribute to standardised metrics and data dictionaries to promote consistency and clarity across the organisation
  • Support the implementation of the Technology and Transformation Strategy, contributing to platform upgrades, migrations and scalable architecture improvements
  • Collaborate with teams and stakeholders across Migrant Help to identify strategic growth opportunities, understand analytical needs and translate requirements into deliverables
What we offer
What we offer
  • Our working week is 35 hours per week offering flexibility and work life balance
  • Enhanced family friendly provisions
  • Employees will gain an extra day annual leave per year to a maximum of 39 days, including bank holidays (pro-rata)
  • Option to buy or sell up to 5 days of annual leave
  • Access to Perkbox, an employee rewards and benefits platform with over 9,000 deals and discounts, a range of free perks, employee wellbeing support and other additional employee benefits and recognitions
  • Wellbeing support
  • Migrant Help offers employees a non-contributory pension scheme Migrant Help pays 8% worth of employee salary into the pension scheme
  • Fulltime
Read More
Arrow Right

Stibo MDM Architect

The Stibo MDM Architect will lead the design and implementation of enterprise Ma...
Location
Location
India , Coimbatore; Chennai; Bangalore
Salary
Salary:
Not provided
augustahitech.com Logo
Augusta Hitech Soft Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years Stibo STEP / Stibo MDM
  • Expertise in MDM, PIM, taxonomy design, data modeling, workflows, and business rules
  • Strong experience with STEP configuration, XSLT, XML, SQL, APIs, and UI customizations
  • Experience architecting enterprise-level MDM systems
  • Experience with large product catalogs, supplier data, and asset management
  • Good understanding of eCommerce, retail, distribution, or manufacturing domains
  • Qualification: Any Degree
Job Responsibility
Job Responsibility
  • Design the end-to-end Stibo MDM / STEP architecture for Product, Supplier, Customer, and other master data domains
  • Define data models, hierarchies, taxonomies, workflows, governance rules, and integration patterns
  • Architect multidomain MDM solutions that support enterprise business processes
  • Develop data lineage, metadata models, versioning strategies, and governance frameworks
  • Build high-quality architecture that supports large SKU volumes and complex catalog structures
  • Lead Stibo MDM solution design and provide overall technical direction
  • Convert business requirements into detailed technical specifications
  • Guide developers, data engineers, functional consultants, and integration teams
  • Drive data quality initiatives, validation rules, match/merge logic, and governance setup
  • Oversee configuration, customization, and performance optimization of STEP
Read More
Arrow Right

Data Engineer

Unique opportunity to join a team that provides metrics and analytical products ...
Location
Location
Poland
Salary
Salary:
Not provided
https://www.hsbc.com Logo
HSBC
Expiration Date
February 18, 2026
Flip Icon
Requirements
Requirements
  • Proven experience building and operating cloud data platforms, ideally on Google Cloud Platform with BigQuery
  • Strong background in data engineering, including ETL pipeline development and orchestration using Airflow or Cloud Composer
  • Advanced SQL development skills, with at least 7 years of hands-on experience designing and optimizing queries and schemas
  • Experience in Shell scripting, Git, and with data build tools (DBT)
  • Experience with BI Tools – ideally Looker, Looker Studio
  • Experience working in a DevOps environment with continuous integration, continuous delivery
  • Experience collaborating with technical teams and project managers to deliver efficient, controlled solutions
  • Comfortable working independently and proactively in fast-paced, ambiguous situations, driving improvements and resolving issues as they arise.
Job Responsibility
Job Responsibility
  • Design, develop, test, and deploy data transformation pipelines and ETL processes for cloud-based analytics solutions
  • Collaborate with multi-disciplinary teams to deliver enriched datasets, dashboards, and actionable metrics for DevOps improvement
  • Ensure data quality, integrity, and reliability throughout the data lifecycle, from ingestion to presentation
  • Operate and iterate on our cloud data platform capabilities, supporting both greenfield and existing ecosystem components
  • Partner with business stakeholders to understand requirements and translate them into scalable data solutions
  • Contribute to continuous improvement by experimenting with new technologies, tools, and approaches
  • Take ownership of tasks and proactively resolve issues to ensure timely delivery of software and analytics products.
What we offer
What we offer
  • Competitive salary
  • Annual performance-based bonus
  • Additional bonuses for recognition awards
  • Multisport card
  • Private medical care
  • Life insurance
  • One-time reimbursement of home office set-up (up to 800 PLN)
  • Corporate parties & events
  • CSR initiatives
  • Nursery and kindergarten discounts
  • Fulltime
Read More
Arrow Right
New

Technical Pricing Manager

The purpose of this role is to build high-quality risk models that estimate risk...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
emerald-group.com Logo
The Emerald Group Ltd, Search and Selection
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong general insurance pricing toolkit: GLMs (Poisson/NB/Tweedie), GAMs, credibility/hierarchical methods
  • experience with tree-based ML (GBM/XGBoost/CatBoost) and regularisation
  • Proficient in R and Python, with strong SQL
  • comfortable in Git-based workflows and “in the engine room” with proprietary rating systems
  • Hands-on experience taking models from concept to live in rating engines
  • robust validation, change control and post-live monitoring
  • Familiarity with peril/exposure enrichment relevant to home insurance (e.g. flood and subsidence datasets) and geospatial modelling considerations
  • Awareness of reserving concepts, claims inflation and their interaction with technical pricing
  • Knowledge of model risk management, documentation standards and governance under UK regulation (Consumer Duty, Fair Value Assessments, GIPP)
Job Responsibility
Job Responsibility
  • Own the end-to-end modelling lifecycle: problem framing, data build, feature engineering, model development, validation, documentation
  • Build and maintain risk and price models using GLMs and machine learning
  • Translate models into implementable rating structures
  • Strong governance: change control, champion–challenger/shadow runs, rollback plans, and clear approvals and audit trails
  • Fulltime
Read More
Arrow Right
New

Data Engineer

Location
Location
Vietnam , Hà Nội
Salary
Salary:
Not provided
cmcglobal.com.vn Logo
CMC Global Company Limited.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Data Engineer with strong Hadoop / Spark / Talend experience
  • Experience building and operating large-scale data lakes and data warehouses
  • Experience with Hadoop ecosystem and big data tools, including Spark and Kafka
  • Experience with Master Data Management (MDM) tools and platforms such as Informatica MDM, Talend Data Catalog, Semarchy xDM, IBM PIM & IKC, or Profisee
  • Familiarity with MDM processes such as golden record creation, survivorship, reconciliation, enrichment, and quality
  • Experience in data governance, including data quality management, data profiling, data remediation, and automated data lineage
  • Experience with stream-processing systems including Spark-Streaming
  • Experience working with Cloud services using one or more Cloud providers such as Azure, GCP, or AWS
  • Experience with Delta Lake and Databricks
  • Advanced working experience with relational SQL and NoSQL databases, including Hive, HBase, and Postgres
Job Responsibility
Job Responsibility
  • Create and manage a single master record for each business entity, ensuring data consistency, accuracy, and reliability
  • Implement data governance processes, including data quality management, data profiling, data remediation, and automated data lineage
  • Create and maintain multiple robust and high-performance data processing pipelines within Cloud, Private Data Centre, and Hybrid data ecosystems
  • Assemble large, complex data sets from a wide variety of data sources
  • Collaborate with Data Scientists, Machine Learning Engineers, Business Analysts, and Business users to derive actionable insights and reliable foresights into customer acquisition, operational efficiency, and other key business performance metrics
  • Develop, deploy, and maintain multiple microservices, REST APIs, and reporting services
  • Design and implement internal processes to automate manual workflows, optimize data delivery, and re-design infrastructure for greater scalability
  • Establish expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Support and work with cross-functional teams in a dynamic environment
What we offer
What we offer
  • Attractive compensation package: 14-month salary scheme plus annual bonus and additional allowances
  • Annual bonus package tailored based on performance and contribution
  • Young, open, and dynamic working environment that promotes innovation and creativity
  • Ongoing learning and development with regular professional training and opportunities to enhance both technical and soft skills
  • Exposure to cutting-edge technologies and diverse real-world enterprise projects
  • Vibrant company culture with regular team-building activities, sports tournaments, arts events, Family Day, and more
  • Full compliance with Vietnamese labor laws, plus additional internal perks such as annual company trips, special holidays, and other corporate benefits
  • Fulltime
Read More
Arrow Right