AI Research Scientist, VLM (vision language models)

United States, Menlo Park · 184000.00 - 257000.00 USD / Year · Job Posted January 23, 2026

Job Description

About Meta: Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

Job Responsibility

  • Lead, collaborate, and execute on research that pushes forward the state of the art in multimodal reasoning and generation
  • Work towards long-term ambitious research goals, while identifying intermediate milestones
  • Directly contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results
  • Work with a large team
  • Contribute to publications and open-sourcing efforts
  • Mentor other team members
  • Play a significant role in healthy cross-functional collaboration
  • Prioritize research that can be applied to Meta's product development
  • Push the state of the art in multimodal generative AI
  • Explore new techniques for advanced reasoning and multimodal understanding for AI Assistants
  • Mentor and work with AI/ML engineers to find a path from research to production

Requirements

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • A PhD in AI, computer science, or related technical fields
  • Publications in machine learning, computer vision, NLP, or speech
  • Experience writing software and executing complex experiments involving large AI models and datasets
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Nice to have

  • First (joint) author publication experience at peer-reviewed AI conferences (e.g., NeurIPS, CVPR, ICML, ICLR, ICCV, and ACL)
  • Direct experience in generative AI and LLM research
  • Fluent in Python and PyTorch (or equivalent)

What we offer

  • bonus
  • equity
  • benefits

Similar Jobs for AI Research Scientist, VLM (vision language models)

8 matching positions

AI Research Scientist, VLM (vision language models)

Meta builds technologies that help people connect, find communities, and grow bu...
Location: United States, Bellevue
Salary: 154000.00 - 217000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • A PhD in AI, computer science, or related technical fields
  • Publications in machine learning, computer vision, NLP, or speech
  • Experience writing software and executing complex experiments involving large AI models and datasets
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
  • Lead, collaborate, and execute on research that pushes forward the state of the art in multimodal reasoning and generation
  • Work towards long-term ambitious research goals, while identifying intermediate milestones
  • Directly contribute to experiments, including designing experimental details, writing reusable code, running evaluations, and organizing results
  • Work with a large team
  • Contribute to publications and open-sourcing efforts
  • Mentor other team members. Play a significant role in healthy cross-functional collaboration
  • Prioritize research that can be applied to Meta's product development
  • Push the state of the art in multimodal generative AI
  • Explore new techniques for advanced reasoning and multimodal understanding for AI Assistants
  • Mentor and work with AI/ML engineers to find a path from research to production
What we offer
  • bonus
  • equity
  • benefits
Employment type: Full-time

Staff Research Scientist - VLM / VLA

At General Motors, our product teams are redefining mobility. Through a human-ce...
Location: United States, Mountain View
Salary: 218800.00 - 335300.00 USD / Year
Company: General Motors
Expiration Date: Until further notice
Requirements
  • Ph.D. in Machine Learning, Robotics, Computer Science, Electrical Engineering, or a related technical field
  • 5+ years of experience in AI/ML research and applied development
  • Deep expertise in modern ML architectures (transformers, generative AI, multimodal systems)
  • Strong programming skills in Python
  • Excellent communication, collaboration, and mentoring abilities, comfortable influencing technical strategy and guiding ML excellence across the organization
Job Responsibility
  • Research, design, and prototype advanced Vision-Language Models and Vision-Language-Action foundational models tailored for real-time semantic understanding and behavioral prediction in autonomous driving
  • Drive the technical strategy for onboard model optimization, leading initiatives in model quantization, pruning, knowledge distillation, and compilation to ensure high-parameter models execute with ultra-low latency on vehicle edge hardware
  • Advance multimodal alignment techniques, ensuring seamless integration of camera, radar, LiDAR, and textual/logical prompts into unified foundational architectures
  • Influence technical roadmaps and shape strategic machine learning priorities that align with safety requirements, core product milestones, and next-generation vehicle launches
  • Provide technical mentorship and long-term vision to a multidisciplinary group of machine learning engineers, software developers, and hardware specialists
  • Foster internal innovation by collaborating closely with perception, planning, and infrastructure teams to integrate foundational models into the core autonomous software stack
  • Represent the company externally to the global scientific community by publishing original research, securing patents, and presenting at top-tier artificial intelligence and robotics conferences
What we offer
  • Medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
Employment type: Full-time

AI Research Scientist, Robotics

The ideal Research Scientist candidate will use their skills in system design an...
Location: United States, Redmond
Salary: 154000.00 - 217000.00 USD / Year
Company: Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Currently has or is in the process of obtaining a PhD degree in the field of Artificial Intelligence, Robotics, Computer Vision, Machine Learning, Language, a related field, or equivalent practical experience
  • Experience with any of the following research areas: robotics, motion planning, embodied AI, human-robot interaction, sim-to-real transfer, learning from demonstration, reinforcement learning, dexterous manipulation, digital agents, vision language models, computer vision, egocentric perception, and/or LLMs
  • Experience in relevant robotics-related research areas, such as: VLM, robot learning, reinforcement learning, imitation learning, action-conditioned world models, task and motion planning, sim-to-real transfer, robotic control, manipulation, navigation, or generally embodied AI
Job Responsibility
  • Perform fundamental and applied research to push the scientific and technological frontiers of embodied artificial intelligence
  • Invent/improve novel data-driven paradigms for robotics, leveraging a variety of modalities (images, video, text, audio, tactile, etc.)
  • Investigate paradigms that can deliver a spectrum of embodied behaviors - from simulated characters to real robots, and from short-horizon, low-level to long-horizon, high-level intelligence
  • Develop algorithms based on state-of-the-art machine learning and neural network methodologies
  • Define, build and benchmark new functionalities needed for the next generation of AI
  • Conduct research towards long-term product goals while identifying intermediate milestones
  • Plan and execute novel research based on long-term objectives of the organization
What we offer
  • bonus
  • equity
  • benefits

Vision-Language Model (VLM) Engineer

Join as Vision-Language Model Engineer to shape multimodal AI — design, train, a...
Location: Turkey, Istanbul
Salary: Not provided
Company: Wide and Wise
Expiration Date: Until further notice
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, or a related field
  • Strong experience with Python and deep learning frameworks (e.g., PyTorch or TensorFlow)
  • Solid understanding of machine learning, computer vision, and NLP concepts
  • Experience with multimodal models or related architectures (e.g., transformers)
  • Familiarity with handling large datasets and distributed training
Job Responsibility
  • Design and implement vision-language models for tasks such as image captioning, visual question answering, and cross-modal retrieval
  • Train, fine-tune, and evaluate multimodal models using large-scale datasets
  • Optimize model performance for scalability and real-world deployment
  • Collaborate with cross-functional teams including data scientists, software engineers, and product managers
  • Stay up to date with the latest research in multimodal AI and apply it to production systems
Employment type: Full-time

Machine Learning Scientist II - Gen AI

We are seeking a highly motivated and experienced Machine Learning Scientist to ...
Location: United States, Boston
Salary: 127300.00 - 186700.00 USD / Year
Company: SimpliSafe
Expiration Date: Until further notice
Requirements
  • MS or PhD in Computer Science, Artificial Intelligence, or a related field
  • Experience training or fine-tuning large language models (LLMs) using modern frameworks
  • Strong grasp of deep learning, particularly transformer architectures and foundational model training techniques for text and vision modalities
  • Proficient in Python and relevant ML libraries (e.g., PyTorch, TensorFlow, HuggingFace Transformers)
  • Hands-on experience in developing and deploying LLM- or VLM-powered applications
  • Familiarity with prompt engineering, retrieval-augmented generation (RAG), MCP (Model Context Protocol), agentic AI, and evaluation of generative models
  • Understanding of MLOps practices and how to scale experiments into production-grade solutions
  • Strong communication and documentation skills
  • Collaborative mindset with the ability to thrive in a fast-paced, interdisciplinary environment
Job Responsibility
  • Develop and fine-tune large language models (LLMs) and vision-language models (VLMs) to address real-world challenges in the home security space
  • Work with key stakeholders to identify key research initiatives that can have impact on business outcomes
  • Take research initiatives from idea generation to production
  • Collaborate with engineers and product managers to integrate capabilities into our existing systems
  • Stay up-to-date on the latest advancements in LLMs, VLMs, and multimodal systems. Evaluate new techniques for potential adoption and improvement of internal capabilities
What we offer
  • A mission- and values-driven culture and a safe, inclusive environment where you can build, grow and thrive
  • A comprehensive total rewards package that supports your wellness and provides security for SimpliSafers and their families
  • Free SimpliSafe system and professional monitoring for your home
  • Employee Resource Groups (ERGs) that bring people together, give opportunities to network, mentor and develop, and advocate for change
  • Participation in our annual bonus program, equity, and other forms of compensation, in addition to a full range of medical, retirement, and lifestyle benefits
Employment type: Full-time

Applied Scientist II

Applied Scientist II - PowerPoint ML Team, Office Product Group. Are you an appl...
Location: United States, Mountain View; Redmond
Salary: 100600.00 - 199000.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Statistics, Computer Science, Electrical or Computer Engineering, or related field AND 2+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Master's Degree in Statistics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience
  • OR Doctorate in Statistics, Computer Science, Electrical or Computer Engineering, or related field
  • OR equivalent experience
  • 1+ year(s) experience creating publications (e.g., patents, peer-reviewed academic papers)
  • 2+ years of experience demonstrating proficiency in Python and relevant Machine Learning (ML) libraries (e.g., PyTorch)
  • 2+ years of experience with LLM/VLM, including but not limited to: GPT, Claude, Gemini, DeepSeek-R1, Qwen, GPT OSS, Kimi-K2, Grok
  • Experience either shipping applied research to production with coding and AI model development skills, OR working with LLM deployment, orchestration frameworks, or agent systems
  • Experience in coding and design, specifically in the development of AI models for scaled production services
  • Experience in evaluating ML solutions and production A/B flights
Job Responsibility
  • Work on machine learning (ML) projects across domains such as natural language processing (NLP) and vision, and harness LLMs, VLMs, and agentic models to deliver visual AI solutions for our customers
  • Work in a fast-paced environment developing algorithms and techniques that leverage text and images to analyze and transform content, building solutions that have the potential to transform people’s lives
  • Work with engineering partner teams on the model integration/flight/maintenance
  • May contribute to building scalable LLM deployment pipelines, integrating orchestration frameworks, and enabling agent-based user experiences in production environments
Employment type: Full-time

Social Worker – Fostering - Family and Friends Team

Are you passionate about working with Family and Friends Carers and improving th...
Location: United Kingdom, Aylesbury
Salary: 40109.00 - 47629.00 GBP / Year
Company: Buckinghamshire Council
Expiration Date: July 20, 2026
Requirements
  • Social work qualification
  • Active SWE registration
  • Completed ASYE
  • Working knowledge of legislation and guidance which applies to family and friends work
  • Experience of working in looked after children or with foster carers
  • Knowledge of research, inquiries and recent studies affecting this sector of service and an ability to monitor its application in practice
  • Strong written communication skills
  • Minimum of 2 years post qualifying experience
  • Experience and competence in working with the courts and providing written and verbal evidence
Job Responsibility
  • Deliver targeted and specialist social work services in the area of Kinship Care
  • Clarify eligibility and undertake statutory social work assessments
  • Promote the safety and well-being of children and young people
  • Assess and support prospective Family and Friends carers in line with Fostering Regulations and court proceedings
  • Hold a manageable caseload of SG and Reg24 assessments
  • Provide carer support
  • Work within statutory guidelines and fostering regulations
  • Assist with duty with opportunities to deliver training, run support groups and be involved in the organisation of carer events
  • Support the development and growth of the service
What we offer
  • Market premium of £2,750 per annum for Grade 7
  • Golden Hello payment after one year's service (£1,000 for Grade 6, £2,125 for Grade 7)
  • Competitive salary with a market increment
  • Annual leave up to 30 days per year
  • Opportunity to buy further leave
  • Up to 15 days per year training, learning and development offer
  • Free parking across all office sites
  • Relocation packages available
  • Generous employer pension contribution
  • Discounts on cafés, restaurants and shops
Employment type: Full-time

Spanish Speaking Caregiver

Join Our Team as a Caregiver in Lake Ariel, PA! *Earn Up to $14hr + Extra Cash T...
Location: United States, Lake Ariel
Salary: 14.00 USD / Hour
Company: CareGivers America
Expiration Date: Until further notice
Requirements
  • Must be willing to work in Lake Ariel
  • Fluent in English, bilingual English/Spanish preferred
  • Availability to work Wednesday, Thursday, Saturday, and Sunday 6am-6pm
  • Reliable transportation required
  • Must be able to travel up to 25 miles to client locations
Job Responsibility
  • Bring comfort and companionship to clients in their homes
  • Help with daily care like bathing, dressing, toileting, and meals
  • Keep living spaces clean, safe, and welcoming
  • Offer medication reminders and support healthy routines
  • Communicate concerns and escalate safety issues as needed
  • Be a friendly face and a steady presence
What we offer
  • Paid Orientation
  • Weekly Pay
  • Flexible Schedules
  • Earn up to $375 for referring a friend
  • Caregiver Rewards Program
  • Premium Holiday Pay
  • Paid Time Off + Benefits including medical, dental, vision, and retirement
  • Free Employee Assistance Program
  • Discount Perks
  • Career Growth