Research Engineer, Vision

United States, Palo Alto · Job Posted January 13, 2026

Job Description

You will be a core contributor on Zyphra’s Vision Team, building the next generation of vision-language models that can understand natural scenes, with a focus on web, desktop, and mobile UIs. You will be deeply involved in the entire model training process, from data gathering and processing to designing novel architectures and training methodologies.

Job Responsibility

  • Build the next generation of vision-language models that can understand natural scenes, with a focus on web, desktop, and mobile UIs
  • Take part in the entire model training process, from data gathering and processing to designing novel architectures and training methodologies
  • Work across: large-scale vision encoder and vision-language training runs; performance optimization of our training stack; image and video dataset collection, processing, and evaluation; architecture and training methodology ablations and improvements

Requirements

  • Strong research taste and intuition
  • Strong implementation and prototyping ability
  • The ability to work well and cooperate with others in a fast-paced research setting
  • Willingness to work in person at our office in Palo Alto
  • US authorization to work

Nice to have

  • Experience with training and evaluating vision language models
  • Experience with creating and collecting large scale machine learning datasets, especially in the visual modality
  • Experience with training vision encoders using contrastive learning or other methods
  • Experience with supervised finetuning and preference learning methods as well as reinforcement learning methods
  • A good intuitive ability to understand model behaviours and correct them through iterative finetuning
  • Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation
  • Postgraduate degree in a scientific subject (Computer Science, Mathematics, Physics, Machine Learning, etc.)
  • Previously published machine learning research in well-respected venues
  • High proficiency with PyTorch and Python
  • Excitement about, and ability to, rapidly learn new fields and implement new ideas
  • Excellent communication and collaboration skills, and the ability to work effectively on both research and engineering implementation at scale

What we offer

  • Medical, dental, vision and FSA plans
  • Competitive salary, equity and 401(k)
  • Relocation and immigration support on a case-by-case basis
  • On-site meals prepared by a dedicated culinary team
  • Thursday Happy Hours

Similar Jobs for Research Engineer, Vision

8 matching positions

Research Engineer - Computer Vision ML

Vision understanding is a critical addition to conversational AI, bridging the g...
Location: United States, San Francisco
Salary: 190000.00 - 320000.00 USD / Year
Company: Sesame
Expiration Date: Until further notice
Requirements
  • Experience working with a high degree of autonomy in ambiguous environments
  • Proven experience in developing machine learning and computer vision models
  • Familiarity with the state of the art in computer vision
  • Strong proficiency in deep learning frameworks such as PyTorch or JAX
  • Familiarity with large-scale dataset handling, including multi-camera datasets
  • Excellent communication skills and the ability to work collaboratively across disciplines
  • Bachelor’s degree or higher in computer science, computer vision, applied mathematics, machine learning, or a related field
Job Responsibility
  • Contribute to the development of our ML models across various flavors of 3D computer vision problems
  • Work across the ML stack, including model architectures, data capture, data curation, model evaluation, training & inference infrastructure, research, and experimentation
  • Collaborate with firmware and hardware engineers to deploy models onto embedded devices
  • Pick promising approaches from the literature to bet on, and create new approaches where necessary to achieve our unique goals
What we offer
  • 401k matching
  • 100% employer-paid health, vision, and dental benefits
  • Unlimited PTO and sick time
  • Flexible spending account matching (medical FSA)
  • Full-time

AI Research Engineer, VLLM (vision large language models) - Generative AI

Meta is seeking a Research Engineer to join our Llama Large Language Model (LLM)...
Location: United States, Menlo Park
Salary: 217000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Master's degree in AI, computer science, or related technical fields
  • 3+ years of experience holding an industry, faculty, or government researcher position
  • Publications in machine learning, computer vision, NLP, or audio
  • Experience writing software and executing complex experiments involving large AI models and datasets
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
  • Lead, collaborate, and execute on developing scalable and effective data curation, model development and eval systems that push forward the state of the art in multimodal reasoning and generation research
  • Work towards long-term ambitious research/development goals, while identifying intermediate milestones
  • Directly contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results
  • Work with a large team
  • Mentor other team members. Play a significant role in healthy cross-functional collaboration
  • Prioritize research and development that can be applied to Meta's product development
What we offer
  • bonus
  • equity
  • benefits

Research Engineer, Computer Vision & AI

The Surreal Spatial AI group is seeking high-performing research engineers to bu...
Location: United Kingdom, London
Salary: Not provided
Company: Meta
Expiration Date: Until further notice
Requirements
  • Two or more years of experience in one or more of the following areas: Deep Learning, Computer Vision, AR/VR, 3D Vision, Robotics, Machine Learning or artificial intelligence
  • One or more years of experience in C/C++ or Rust
  • Bachelor's degree in Computer Science, Computer Vision, Robotics or a related technical field
  • Experience developing computer vision algorithms or computer vision infrastructure in C/C++, Python or Rust
Job Responsibility
  • Implement and prototype advanced research systems and technologies spanning device and cloud, in the domain of AI and machine perception
  • Collaborate with team members throughout the lifetime of a project, from early research through technology and experience prototyping
  • Play a critical role in the definition and execution of system research roadmaps in partnership with cross-functional organizations in computer vision, machine learning, graphics, sensors, optics, and silicon
  • Collaborate with cross-functional engineering and research teams from Reality Labs and FAIR in computer vision, machine learning, and graphics

Research Engineer - Computer Vision and Robotics

The Surreal Team at Meta’s Reality Lab Research is a multi-disciplinary, vertica...
Location: United States, Redmond
Salary: 181000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • 5+ years of experience with any of the following research areas: computer vision, robotics, 3D reconstruction, computational imaging, image / video understanding, motion planning, embodied AI, human-robot interaction, sim-to-real transfer, learning from demonstration, reinforcement learning, dexterous manipulation
  • 5+ years of experience with C/C++ and Python programming
  • 5+ years of experience with deep learning frameworks: PyTorch or TensorFlow
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
  • Plan and execute engineering development to advance the state of the art in machine perception and robotics tasks such as 3D reconstruction, computational imaging, image / video understanding, navigation, dexterous manipulation, whole-body control, long-horizon reasoning, etc.
  • Build data collection / generation platforms and develop algorithms based on state-of-the-art machine learning and neural network methodologies
  • Invent/improve novel data-driven paradigms for robotics intelligence, including AR, VR, machine perception, and robotics, leveraging a variety of modalities (images, video, text, audio, tactile, etc.)
  • Investigate paradigms that can deliver a spectrum of robotics behaviors - from simulated characters to real robots, and from short-horizon, low-level to long-horizon, high-level intelligence
  • Define, build and benchmark new technologies needed for the next generation of AI
What we offer
  • bonus
  • equity
  • benefits

Research Engineer – Synthetic Data for Vision

Vision understanding is a critical addition to conversational AI, bridging the g...
Location: United States, San Francisco
Salary: 175000.00 - 280000.00 USD / Year
Company: Sesame
Expiration Date: Until further notice
Requirements
  • Demonstrated experience with 3D reconstruction, photorealistic rendering, appearance modeling, or synthetic data generation for vision tasks
  • Ability to navigate and deliver results in high-ambiguity, open-ended problem spaces
  • Familiarity with large-scale, multi-camera datasets and the practicalities of curation, annotation, and evaluation
  • Excellent communication skills and the ability to work collaboratively across disciplines
  • Bachelor’s degree or higher in computer graphics, vision, imaging, machine learning, or a related field
Job Responsibility
  • Build and maintain synthetic data generation pipelines (e.g., neural rendering, diffusion/score-based models, controllable generative priors, procedural assets) with levers for pose, expression, illumination, materials, and sensor characteristics
  • Apply transfer learning and domain adaptation (self-supervised pretraining, style/appearance transfer, sim-to-real) to bridge distribution gaps between synthetic and real data
  • Integrate off-the-shelf and open-source components where practical; fine-tune or distill models to meet latency, memory, and quality targets on target hardware
  • Stand up end-to-end systems—from capture and calibration to generation, data curation, quality gates, rendering/evaluation suites, and deployment
  • Define dataset and model evaluation frameworks (coverage, bias, sim-to-real gap, task-level KPIs such as gaze error) and iterate based on quantitative results
  • Survey literature across graphics, vision, and generative ML; prototype, adapt, and, where needed, invent new approaches that push facial reconstruction, appearance modeling, and synthetic data quality forward
What we offer
  • 401k matching
  • 100% employer-paid health, vision, and dental benefits
  • Unlimited PTO and sick time
  • Flexible spending account matching (medical FSA)
  • Full-time

Research Engineer, ML, AI & Computer Vision

The aim of this role is to develop, advance and integrate ML, computer vision mo...
Location: United States, Redmond
Salary: 217000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience developing and designing Computer Vision and Perception for Robotics or smart device technologies and systems
  • 5+ years of experience with a mastery of modern features in C++
  • 3+ years of Python experience, including appropriate ML frameworks
  • Interpersonal experience: cross-group and cross-functional collaboration
Job Responsibility
  • Build/integrate real-time prototypes for advanced, real-time 3D machine perception systems as part of a fast-moving research and research engineering team
  • Research, develop, and iterate on the data collection/generation pipeline along with state-of-the-art ML algorithms/models for dynamic object detection and tracking, and for estimation and understanding of user motion, actions, and activities
  • Collaborate with team members throughout the lifetime of a project, from early research through technology and experience prototyping
  • Play a critical role in the definition and execution of system research roadmaps in partnership with cross-functional organizations in computer vision, machine learning, graphics, sensors, optics, and silicon
What we offer
  • bonus
  • equity
  • benefits

Research Engineer / Research Scientist - Foundations Retrieval Lead

The Foundations Research team works on high-risk, high-reward ideas that could s...
Location: United States, San Francisco
Salary: 445000.00 - 555000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Requirements
  • Proven experience leading high-performance teams of researchers or engineers in ML infrastructure or foundational research
  • Deep technical expertise in representation learning, embedding models, or vector retrieval systems
  • Familiarity with transformer-based LLMs and how embedding spaces can interact with language model objectives
  • Research experience in areas such as contrastive learning, supervised or unsupervised embedding learning, or metric learning
  • A track record of building or scaling large machine learning systems, particularly embedding pipelines in production or research contexts
  • A first-principles mindset for challenging assumptions about how retrieval and memory should work for large models
Job Responsibility
  • Lead research into embedding models and retrieval systems optimized for grounding, relevance, and adaptive reasoning
  • Manage a team of researchers and engineers building end-to-end infrastructure for training, evaluating, and integrating embeddings into frontier models
  • Drive innovation in dense, sparse, and hybrid representation techniques, metric learning, and learning-to-retrieve systems
  • Collaborate closely with Pretraining, Inference, and other Research teams to integrate retrieval throughout the model lifecycle
  • Contribute to OpenAI’s long-term vision of AI systems with memory and knowledge access capabilities rooted in learned representations
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Full-time

Research Engineer, Media Data Research - MSL FAIR

Meta is seeking AI research engineers to help us build the data foundation for M...
Location: United States, Menlo Park
Salary: 217000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 1+ year of industry research experience in LLM/LMM, computer vision, or related AI/ML models
  • Experience owning and/or driving complex technical projects from end-to-end
  • Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
  • Demonstrated data infrastructure and software background, and experience building data tooling and services
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Architect efficient and scalable data curation systems and pipelines
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Execute on high-priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
  • Lead complex technical projects end-to-end
What we offer
  • bonus
  • equity
  • benefits