Research Engineer, Vision Job at Zyphra (Palo Alto)

Job Description

You will be a core contributor on Zyphra’s Vision Team building the next generation of vision-language models which can understand natural scenes with a focus on web, desktop, and mobile UIs. You will be deeply involved in the entire model training process from data gathering and processing to designing novel architectures and training methodologies.

Job Responsibility

Building the next generation of vision-language models which can understand natural scenes with a focus on web, desktop, and mobile UIs
Deeply involved in the entire model training process from data gathering and processing to designing novel architectures and training methodologies
Work across: Large-scale vision encoder and vision language training runs
Performance optimization of our training stack
Image and video dataset collection, processing, and evaluation
Architecture and training methodology ablations and improvements

Requirements

Strong research taste and intuition
Strong implementation and prototyping ability
The ability to work well and cooperate with others in a high-paced research setting
Willing to be in-person in our office in Palo Alto
US authorization to work

Nice to have

Experience with training and evaluating vision language models
Experience with creating and collecting large scale machine learning datasets, especially in the visual modality
Experience with training vision encoders using contrastive learning or other methods
Experience with supervised finetuning and preference learning methods as well as reinforcement learning methods
A good intuitive ability to understand model behaviours and correct them through iterative finetuning
Interest in grappling in detail with data and spending significant time involved in data engineering and synthetic data generation
Postgraduate degree in scientific subject (Computer Science, Mathematics, Physics, Machine Learning, etc)
Previously published machine learning research in well-respected venues
Highly proficient with Pytorch and Python
Are excited and able to rapidly learn new fields and implement new ideas
Excellent communication and collaboration skills and can work effectively on both research and engineering implementation at scale

What we offer

Medical, dental, vision and FSA plans
Competitive salary, equity and 401(k)
Relocation and immigration support on a case-by-case basis
On-site meals prepared by a dedicated culinary team
Thursday Happy Hours

Zyphra - All Job Offers

Select Country

Research Engineer, Vision

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?

Research Engineer, Vision

Research Engineer - Computer Vision ML

AI Research Engineer, VLLM (vision large language models) - Generative AI

Research Engineer, Computer Vision & AI

Research Engineer - Computer Vision and Robotics

Research Engineer – Synthetic Data for Vision

Research Engineer, ML, AI & Computer Vision

Research Engineer / Research Scientist - Foundations Retrieval Lead

Research Engineer, Media Data Research - MSL FAIR

Our AI answers in your language