We are looking for a Data Engineer with strong knowledge of computer vision workflows. You will design, build, and optimize data pipelines that power our AR-focused 2D/3D vision and diffusion models. You’ll ensure our data is reliable, scalable, and optimized for both research and production.
Job Responsibilities:
Build and maintain scalable data pipelines for large-scale video and 3D datasets
Support post-training workflows (e.g., dataset augmentation, evaluation data preparation)
Optimize data flows for inference pipelines to ensure real-time performance
Collaborate with CV researchers and engineers to ensure datasets meet model needs
Implement best practices for data quality, versioning, and reproducibility
Requirements:
Strong experience in Python, SQL, and distributed systems (Dask or Ray)
Hands-on experience with data pipelines for ML/CV (e.g., TFRecords, WebDataset, HDF5)
Familiarity with cloud platforms (AWS/GCP) and GPU-based data workflows
Understanding of computer vision data requirements (video, 3D assets, images)
Nice to have:
Experience with data labeling pipelines, synthetic data, or AR/VR datasets