CrawlJobs Logo

Research Engineer – Synthetic Data for Vision

sesame.com Logo

Sesame

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

175000.00 - 280000.00 USD / Year

Job Description:

Vision understanding is a critical addition to conversational AI, bridging the gap between speech and the physical world. We’re looking for a skilled engineer or researcher to build high-value synthetic data pipelines that accelerate vision model development. The ideal candidate will be fluent in classical computer vision techniques while also comfortable leveraging modern machine learning tools across the stack: from neural rendering and diffusion-based image synthesis to transfer learning, domain adaptation, and data-centric evaluation. You’ll collaborate with research, hardware, and product teams to build capture, generation, and rendering systems that combine physical accuracy with visual realism—delivering datasets and simulators that measurably improve downstream computer vision tasks.

Job Responsibility:

  • Build and maintain synthetic data generation pipelines (e.g., neural rendering, diffusion/score-based models, controllable generative priors, procedural assets) with levers for pose, expression, illumination, materials, and sensor characteristics
  • Apply transfer learning and domain adaptation (self-supervised pretraining, style/appearance transfer, sim-to-real) to bridge distribution gaps between synthetic and real data
  • Integrate off-the-shelf and open-source components where practical
  • fine-tune or distill models to meet latency, memory, and quality targets on target hardware
  • Stand up end-to-end systems—from capture and calibration to generation, data curation, quality gates, rendering/evaluation suites, and deployment
  • Define dataset and model evaluation frameworks (coverage, bias, sim-to-real gap, task-level KPIs such as gaze error) and iterate based on quantitative results
  • Survey literature across graphics, vision, and generative ML
  • prototype, adapt, and, where needed, invent new approaches that push facial reconstruction, appearance modeling, and synthetic data quality forward

Requirements:

  • Demonstrated experience with 3D reconstruction, photorealistic rendering, appearance modeling, or synthetic data generation for vision tasks
  • Ability to navigate and deliver results in high-ambiguity, open-ended problem spaces
  • Familiarity with large-scale, multi-camera datasets and the practicalities of curation, annotation, and evaluation
  • Excellent communication skills and the ability to work collaboratively across disciplines
  • Bachelor’s degree or higher in computer graphics, vision, imaging, machine learning, or a related field

Nice to have:

  • Master’s or Ph.D. in a relevant discipline
  • Hands-on experience training or adapting neural rendering models (e.g., NeRF/3DGS variants, relighting, inverse rendering) and modern generative models (e.g., diffusion/latent diffusion, controllable text-to-image/video, inpainting/outpainting)
  • Proficiency in PyTorch, JAX, or other modern ML frameworks
What we offer:
  • 401k matching
  • 100% employer-paid health, vision, and dental benefits
  • Unlimited PTO and sick time
  • Flexible spending account matching (medical FSA)

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Engineer – Synthetic Data for Vision

Senior Research Engineer

iProov has continued to scale rapidly this year and is currently looking for a S...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
iproov.com Logo
iProov
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 2-3 years experience delivering machine learning or computer vision systems in an industry environment
  • A Masters or PhD in a numerate discipline such as Computer Science, Engineering, Computational Neuroscience
  • Proven experience of high-quality delivery in machine learning and computer vision
  • Strong mathematical ability
  • Proficiency in Python and modern deep learning frameworks
  • Strong problem-solving skills and out-of-box thinking
Job Responsibility
Job Responsibility
  • Lead research projects in the areas of computer vision, machine learning and biometrics
  • Develop and train deep learning models for face verification, liveness and attack detection
  • Take the lead on shaping data and evaluation strategies, including synthetic and adversarial data
  • Investigate failure modes and improve generalisation, robustness and fairness
  • Work with platform and optimisation engineers to turn models into production-quality services
  • Communicate results and recommendations clearly with stakeholders across the business
  • Contribute to the broader research strategy of the company
What we offer
What we offer
  • 25 days Annual Leave, plus 8 Bank Holidays
  • Growth Shares allocated after passing probation
  • Salary sacrifice schemes including: Pension, Cycle To Work and Electric Car Scheme
  • Nursery Sacrifice Scheme
  • Work Overseas Perk - Work globally for up to 2 weeks
  • Life Assurance
  • SmartHealth Access to private GP, Psychologist, Nutritionist along with tailored fitness plans for both you and your family
  • Benefit from personalized 1:1 career coaching with our in-house Occupational Psychologist
  • Award winning L&D platform with personal allocated training budgets
  • Enhanced paid family leave
  • Fulltime
Read More
Arrow Right
New

Research Engineering Manager, Post-Training

Meta is seeking a Research Engineering Manager to lead the Post-Training team wi...
Location
Location
United States , Menlo Park
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 4+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • 3+ years of experience managing or leading technical teams, including hiring, mentoring, and performance management
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Proven track record of leading medium to large-scale technical projects (specifically data pipelines or ML infrastructure) from conception to deployment
  • Software engineering practices including version control, testing, code review, and system design
  • Demonstrated ability to balance hands-on technical work with people management and strategic planning
  • Great communication skills with the ability to influence cross-functional stakeholders
Job Responsibility
Job Responsibility
  • Build, mentor, and grow a team of research engineers focused on full-stack post-training data infrastructure
  • Conduct performance reviews, career development conversations, and provide technical mentorship to team members
  • Foster a Culture of Engineering Excellence, data rigor, and rapid iteration within the team
  • Partner with recruiting to hire world-class research engineering talent
  • Oversee the development and scaling of data collection pipelines for high-value domains (STEM, GDP-valuable tasks, finance, legal, health) and complex agentic workflows (deep research, computer use, shopping agents)
  • Establish and manage partnerships with external data vendors to source and securely prepare expert-level post-training datasets
  • Influence the technical roadmap for data infrastructure in collaboration with the MSL Infra team
  • Translate the strategic vision of research scientists into actionable engineering plans for synthetic data generation, SFT, and RLHF pipelines
  • Partner with research scientists, product teams, and model training teams to align data collection priorities with organizational capability goals
  • Build robust, reusable data pipelines that can rapidly deliver high-quality datasets to multiple model lines
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

AI Research Scientist, Media Data Research

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
Location
United States , Menlo Park
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science or a related technical field
  • 2+ years of industry research experience in LLM/NLP, computer vision, or related AI/ML models
  • Experience as a formal technical lead, leading major technical initiatives with cross-functional impact, and/or influencing strategy across multiple teams
  • Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
  • Lead complex technical projects end-to-end
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right
New

AI Research Scientist, Media Data Research - MSL FAIR

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science or a related technical field
  • 1+ year of industry research experience in LLM/LMM, computer vision, or related AI/ML models
  • Experience owning and/or driving complex technical projects from end-to-end
  • Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Fundamentally improve our data velocity across workflows and projects by contributing to quality in data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
  • Lead complex technical projects end-to-end
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

AI Research Scientist, Media Data Research - MSL FAIR

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
Location
United States , Bellevue
Salary
Salary:
122000.00 - 181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • PhD in Computer Science or a related technical field, plus 1+ years of industry research experience in LLM/LMM, computer vision, or related AI/ML models
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
  • Experience owning and/or driving complex technical projects from end-to-end
  • Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Fundamentally improve our data velocity across workflows and projects by contributing to innovation in data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
  • Lead complex technical projects end-to-end
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Senior Software Engineer, Machine Learning and Artificial Intelligence

Mashgin is looking for a smart, driven engineer who’s fascinated by the latest d...
Location
Location
United States , Palo Alto
Salary
Salary:
200000.00 - 300000.00 USD / Year
mashgin.com Logo
Mashgin
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years relevant coding experience
  • B.S. or higher in Computer Science or related field
  • Strong background in Machine Learning or Computer Vision
  • Excellent knowledge of either Python or C/C++
Job Responsibility
Job Responsibility
  • Developing solutions for real-world computer vision problems
  • Working with the product team to come up with innovative ways to collect large data sets for training AI systems or generating equivalent synthetic data
  • Finding the optimal balance between doing longer term research and applying research results to production code
  • Researching and building state-of-the-art ML/CV algorithms to analyze 2D/3D image data
What we offer
What we offer
  • Excellent health, dental and vision insurance for you and your dependents
  • 401k plan
  • Flexible PTO policy
  • Catered lunch in office with fully stocked snacks and beverages
  • Pet insurance for your fur babies
  • Voluntary life insurance plan
  • Competitive salary and options in a small, rapidly scaling company
  • Fulltime
Read More
Arrow Right

Staff Research Engineer, MetaAI Assistant Measurement

Meta Superintelligence Labs is seeking a Staff Research Engineer to provide tech...
Location
Location
United States , Bellevue
Salary
Salary:
257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Track record of driving technical strategy and landing large-scale research or product impacts in a time-sensitive environment
  • Proven technical vision regarding the future trajectory of Generative AI, specifically in how model performance translates to user utility
  • Expertise in designing and implementing online and offline measurement systems, benchmark building, and data synthesis techniques
  • Experience leading complex, cross-functional technical initiatives, driving consensus across engineering, research, and product boundaries
  • Proficiency in Python and deep learning frameworks (e.g., PyTorch), with the ability to prototype and implement complex methodologies
Job Responsibility
Job Responsibility
  • Architect Scientific Strategy: Define and lead the execution of the scientific roadmap for AI Assistant measurement, ensuring methodologies are rigorous, scalable, and aligned with product goals
  • Innovate & Build: Spearhead the research and development of novel offline and online evaluation metrics, automated benchmarks, and synthetic data generation pipelines to close the loop between model training and deployment
  • Cross-Functional Technical Leadership: Serve as the primary scientific liaison to pre-training, post-training, and product teams, ensuring that measurement insights directly influence model architecture and training recipes (the "evaluation flywheel")
  • Mentorship & Influence: Provide technical mentorship and guidance to senior research engineers and applied scientists, fostering a culture of scientific rigor and code without direct management responsibilities
  • Hands-on Contribution: Remain hands-on in code and research, building prototypes for new evaluation frameworks and validating novel measurement theories
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right
New

Member of Technical Staff - ML Research Engineer, Data

Our Data team powers Liquid Foundation Models across pre-training, vision, audio...
Location
Location
United States , San Francisco; Boston
Salary
Salary:
Not provided
Liquid AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Python skills with the ability to quickly comprehend problems and translate them into clean, working code
  • Solid ML fundamentals: experience training, evaluating, and iterating on models (PyTorch preferred)
  • Track record of learning new technical domains quickly
  • 3+ years relevant experience with an M.S., or 1+ year with a Ph.D. (5+ years with a B.S.)
Job Responsibility
Job Responsibility
  • Build and maintain data processing, filtering, and selection pipelines at scale
  • Create pipelines for pretraining, midtraining, SFT, and preference optimization datasets
  • Design synthetic data generation systems using LLMs, structured prompting, and domain-specific generators
  • Design and run evaluations and ablations to measure dataset's impact on model performance
  • Monitor public datasets across text, vision, and audio domains
  • Collaborate with pre-training, vision, and audio teams on modality-specific data needs
What we offer
What we offer
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
  • Fulltime
Read More
Arrow Right