CrawlJobs Logo

Senior Researcher - Multimodal AI

United States, Redmond 119800.00 - 234700.00 USD / Year · Job Posted April 16, 2026
Apply Position
Job Link Share

Job Description

Microsoft Research AI Frontiers lab is seeking applications for the position of Senior Researcher to join their team in Redmond, WA or New York City, NY. The mission of the AI Frontiers Lab is to expand the pareto frontier of Artificial Intelligence (AI) capabilities, efficiency, and safety through innovations in foundation models and learning agent platforms. Some of our projects include work on Small Language Models (e.g. Phi, Orca) and agentic AI systems (e.g. AutoGen, MagenticOne, OmniParser). We are seeking a Senior Researcher to join our team and lead efforts on the advancement of Generative AI and Multimodal Model (MLM) technologies. As a Senior Researcher, you will play a crucial role in developing, improving, and exploring the capabilities of Multimodal AI models. Your work will have a significant impact on the development of cutting-edge technologies, advancing state-of-the-art and providing practical solutions to real-world problems. Our ongoing research areas encompass but are not limited to: Reasoning method for multimodal models; New multimodal model architectures and training methods; Action models for automating web and computer tasks; Orchestration and multi-agent systems: automated orchestration between multiple agents incorporating human feedback and oversight; Evaluation and Understanding of model and agent capabilities

Job Responsibility

  • Perform cutting-edge research in collaboration with other researchers, engineers, and product groups
  • Be a part of research breakthroughs in the field and will be given an opportunity to realize your ideas in products and services used worldwide
  • Embody our culture and values

Requirements

  • Doctorate (or currently pursuing) in Computer Science or relevant field OR equivalent experience
  • Experience in Computer Vision and related fields
  • Research program demonstrated by publications at the following conferences: NeurIPS, ICML, ICLR, ACL, NAACL, CVPR, COLT, ECCV, ICCV, EMNLP

Nice to have

  • Doctorate in Computer Science or relevant field AND 2+ years related research experience OR equivalent experience
  • Experience publishing academic papers as a lead author or essential contributor in the field of Artificial Intelligence
  • Experience participating in a top conference in relevant research domain
  • Demonstrable ability to define an ambitious, original research agenda
  • Ability to collaborate, communicate effectively, and work as part of a multi-disciplinary team
  • Keen interest in real-world applications and impact

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior Researcher - Multimodal AI

8 matching positions

Senior AI Engineer

This role will be tasked with applying machine learning/deep learning to the aut...
Location
Location
United States , Belmont
Salary
Salary:
170000.00 - 210000.00 USD / Year
https://www.volkswagen-group.com Logo
Volkswagen AG
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-8 years of professional experience post graduate degree preferred
  • 4+ years' Deep Learning experience post graduate degree preferred
  • Master's Degree in Computer Science or equivalent
  • PhD Strongly Preferred
  • Strong knowledge of different machine learning algorithms
  • Proficiency in deep learning techniques and frameworks
  • Strong understanding of traditional machine learning algorithms and their applications
  • Expertise in computer vision, including object detection, image segmentation, and image recognition
  • Proficiency in NLP techniques, including sentiment analysis, text generation, and language understanding models
  • Experience with multimodal language modeling and applications
Job Responsibility
Job Responsibility
  • Applying machine learning/deep learning to the automotive industry
  • Maintaining and enhancing existing machine learning modules for autonomous vehicles
  • Designing and implementing new machine learning based approaches based on existing frameworks
  • Keeping up to speed with the state of the art of academic research and technology in the industry
  • Coordinating with engineers at the ICC and in Germany on the development of autonomous driving software
  • Transferring technologies and solutions to Volkswagen Group development divisions
  • Developing technical specifications and documentation
  • Representing Volkswagen Group in the technical community, such as at conferences
  • Fulltime
Read More
Arrow Right

Senior Computer Vision and Machine Learning Research Scientist

We are seeking a skilled and innovative Senior Research Scientist to join one of...
Location
Location
United States , Seattle
Salary
Salary:
159750.00 - 234300.00 USD / Year
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD and 3+ years experience in Computer Science or a related field with a focus on computer vision, machine learning, artificial intelligence or related technical fields
  • Proven track record of research in machine learning, computer vision or related fields
  • Experience driving the ML development lifecycle and leveraging state-of-the-art research to deliver high quality models at scale
  • Strong computer vision fundamentals such as image processing, feature extractions, object detection, semantic segmentation, video analysis or action recognition
  • Excellent problem-solving skills, analytical thinking, and the ability to work independently as well as collaboratively in a team environment
  • Proficiency in Python and frameworks such as PyTorch, TensorFlow or Keras
  • Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Collaborate with other scientists, engineers and product managers to build proof-of-concepts to shape the Axon of tomorrow
  • Lead end-to-end research efforts in advanced computer vision, machine learning and gen-AI techniques for cloud and devices from multimodal data sources, including scene understanding, action recognition and anomaly detection
  • Design and implement responsible, privacy-preserving, efficient and scalable models for inference and analysis of visual data
  • Develop performance and quality metrics for CVML models and systems, and validate their effectiveness in real-world settings
  • Optimize algorithms for performance, memory footprint, and energy efficiency to meet the requirements of resource-constrained devices
  • Stay up-to-date with the latest research and advances in CVML and translate relevant findings into shipping Axon products
  • Contribute to academic publications, technical documentation, and patent disclosures to share insights and findings with the broader community
  • Coach and mentor junior scientists
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Senior Computer Vision and Machine Learning Research Scientist

Join Axon and be a Force for Good. At Axon, we’re on a mission to Protect Life. ...
Location
Location
United States , Seattle
Salary
Salary:
159750.00 - 234300.00 USD / Year
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD and 3+ years experience in Computer Science or a related field with a focus on computer vision, machine learning, artificial intelligence or related technical fields
  • Proven track record of research in machine learning, computer vision or related fields
  • Experience driving the ML development lifecycle and leveraging state-of-the-art research to deliver high quality models at scale
  • Strong computer vision fundamentals such as image processing, feature extractions, object detection, semantic segmentation, video analysis or action recognition
  • Excellent problem-solving skills, analytical thinking, and the ability to work independently as well as collaboratively in a team environment
  • Proficiency in Python and frameworks such as PyTorch, TensorFlow or Keras
  • Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Collaborate with other scientists, engineers and product managers to build proof-of-concepts to shape the Axon of tomorrow
  • Lead end-to-end research efforts in advanced computer vision, machine learning and gen-AI techniques for cloud and devices from multimodal data sources, including scene understanding, action recognition and anomaly detection
  • Design and implement responsible, privacy-preserving, efficient and scalable models for inference and analysis of visual data
  • Develop performance and quality metrics for CVML models and systems, and validate their effectiveness in real-world settings
  • Optimize algorithms for performance, memory footprint, and energy efficiency to meet the requirements of resource-constrained devices
  • Stay up-to-date with the latest research and advances in CVML and translate relevant findings into shipping Axon products
  • Contribute to academic publications, technical documentation, and patent disclosures to share insights and findings with the broader community
  • Coach and mentor junior scientists
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Senior AI Researcher

We’re hiring a Senior AI Researcher to lead foundational research at the interse...
Location
Location
United States , San Francisco
Salary
Salary:
Not provided
tavus.io Logo
Tavus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems
  • Previous experience leading research efforts or mentoring teams
  • Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks
  • Experience with large-scale model training and optimization for performance and real-time generation
  • Proven ability to translate research ideas into production-grade systems
  • Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM)
  • Strong PyTorch skills and comfort moving fluidly between research and engineering
Job Responsibility
Job Responsibility
  • Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language
  • Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data (not static frames)
  • Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars
  • Partner with the Applied ML team to bring research into real-world use cases
  • Mentor other researchers and drive excellence across the team
What we offer
What we offer
  • Flexible work schedule
  • Unlimited PTO
  • Competitive healthcare
  • Gear stipends
  • Fulltime
Read More
Arrow Right

Director, Digital Ecosystem Applications

This position is responsible for the Software Platforms group at the Innovation ...
Location
Location
United States , Belmont
Salary
Salary:
240000.00 - 285000.00 USD / Year
https://www.volkswagen-group.com Logo
Volkswagen AG
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years with 2+ years in a technical leadership role
  • CS, EE, M.S. Engineering (or equivalent) REQUIRED
  • M.S. Engineering (or equivalent) or PhD PREFERRED
  • Analytical and conceptual thinking – using logic and reason, creative and strategic
  • Communication skills – interpersonal, presentation and written
  • Managing interdisciplinary teams on individual projects
  • Integration – joining people, processes or systems
  • Influencing and negotiation skills
  • Problem solving
  • Resource management
Job Responsibility
Job Responsibility
  • Define the technical mission, architecture strategy, and long‑term platform vision for the In‑Vehicle Computing & Digital Ecosystem Applications team, spanning Android Automotive OS (AAOS), in‑vehicle compute platforms, Software‑Defined Vehicle (SDV) architecture, and AI‑driven cockpit intelligence
  • Provide technical leadership across the full software stack, including Android Framework, System Services, HAL layers, middleware, connectivity stacks, media/audio frameworks, HMI toolchains, and cloud‑connected AI runtimes within an SDV‑aligned architecture
  • Lead and mentor engineering teams in platform bring‑up, system integration, performance optimization, and development of AI‑agentic features, multimodal interaction models, and next‑generation speech technologies
  • Manage multi‑year budgets for platform development, AI integration, SDV‑aligned compute evolution, SoC evaluations, cloud services, and prototype programs
  • Deliver executive‑level technical reporting on architecture decisions, platform readiness, SDV integration milestones, AI progress, risks, and strategic recommendations
  • Drive strategic planning for ICC’s infotainment and cockpit portfolio, including AAOS evolution, hybrid cloud/edge AI pipelines, intelligent mobile agent technologies, and SDV‑centric software and compute roadmaps
  • Align technical roadmaps with global VW Group Innovation teams across infotainment, connectivity, AI/ML, vehicle architecture, cloud services, and SDV platform strategy, ensuring cross‑platform consistency and shared component reuse
  • Build strategic relationships with SoC vendors, Tier‑1 suppliers, cloud providers, and AI technology partners to influence cockpit compute and SDV platform evolution
  • Maintain partnerships with Silicon Valley companies specializing in AI runtimes, LLMs, speech, multimodal interaction, and automotive‑grade SDV‑compatible software frameworks
  • Collaborate with academic and research institutions on AI‑agentic systems, embedded ML, HMI, and in‑vehicle compute architectures aligned with SDV principles
What we offer
What we offer
  • Eligibility for annual performance bonus
  • Healthcare benefits
  • 401(k), with company match
  • Defined contribution retirement program
  • Tuition reimbursement
  • Company lease car program
  • Paid time off
  • Fulltime
Read More
Arrow Right

Senior Conversational AI Designer

We are looking for a talented, curious, and highly motivated Senior Conversation...
Location
Location
United States
Salary
Salary:
106000.00 - 163000.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years creating conversational design experiences
  • Ability to combine vision with strong creative design execution
  • Strong portfolio with work samples showcasing expertise, craft, aesthetics, and depth of thought
  • Experience designing for Generative or Agentive AI-powered experiences
  • Understanding of machine learning & natural language processing (NLP) fundamentals
  • Experience incorporating research into the design process
  • Subject-matter expert. Ability to work independently, be organized, self-motivated, and have attention to detail
  • Excellent written, visual, and verbal communication skills
  • Proficient in common design tools (e.g.Figma, VoiceFlow, ProtoPie, etc)
  • Demonstrated experience documenting for voice-only, voice-forward, and multi-modal experiences
Job Responsibility
Job Responsibility
  • Lead design vision and execution for AI-based conversational and multimodal experiences, advocating usability and user-centered design principles
  • Develop and promote conversation design best practice standards, reusable interactional patterns, and processes
  • Deliver effective design documentation to communicate experience requirements, design principles, best practices, prototypes, and multimodal flows to convey voice forward interactions that leverage AI
  • Identify opportunities for usability testing and research efforts by leading and supporting the creation of prototypes, participating in clinic sessions, analyzing, brainstorming, ideating etc
  • Build strong relationships with internal stakeholders to understand priorities, collaborate on actions to implement product and service solutions, and ensure a connected end-to-end user experience
  • Identify, track, and monitor industry/experience trends and drivers of customer needs and satisfaction
  • Lead visualization of experiences using storyboards, customer journeys, user flows, low-fidelity wireframes and voice prototypes to illustrate vision and influence impactful experiences in AI
  • Balance user needs, technical constraints, and product objectives to solve problems effectively, crafting world-class multimodal user experiences
What we offer
What we offer
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime
Read More
Arrow Right

Senior Research Scientist

Senior Research Scientist at Cohere Labs, the dedicated research arm of Cohere. ...
Location
Location
United States; Canada; United Kingdom; France , London; Toronto; New York; San Francisco; Paris
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD (or equivalent research experience) with a strong track record of publications in ML, AI, statistics, or related fields
  • Expertise or deep curiosity in areas such as: efficiency, optimization, reasoning, multimodality, agentic systems, human–AI collaboration
  • Solid machine learning and NLP fundamentals, with experience in shaping and executing original research
  • Enjoy collaborating across disciplines and geographies
  • Care about mentoring and supporting the next generation of researchers
Job Responsibility
Job Responsibility
  • Drive an independent, high-impact research agenda on frontier topics in AI
  • Have the freedom to ask ambitious questions, explore the unknown, and publish findings openly
  • Work that advances science and engages with the societal implications of machine learning
  • Thrive at the intersection of leading and learning: advancing frontier research with global collaborations, guiding new researchers, and opening doors for broader participation in ML research
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Senior AI Researcher - AV

GM Israel (Herzliya) takes a significant part in introducing sophisticated softw...
Location
Location
Israel , Herzliya
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Electrical Engineering, Robotics, or a related field (Excellent M.Sc. graduates will be considered)
  • Over 3 years of research experience in computer vision, machine learning, autonomous perception, or related areas
  • Strong publication record at top-tier AI/ML conferences and journals
  • Excellent coding skills and familiarity with modern AI frameworks
  • Hands-on experience with large-scale training, 3D data, multimodal perception, or foundation models is highly desirable
Job Responsibility
Job Responsibility
  • Drive downstream KPI lift for the autonomous driving agent
  • Participate in AI research projects in the areas of VLMs / world modeling, computer vision, 3D perception, multimodal sensor fusion, and others
  • Design, build, train, and evaluate foundation models and large-scale deep learning architectures designed for autonomous driving
  • Collaborate with engineering teams to translate state-of-the-art research into scalable production solutions
  • Work towards external publications in top-tier conferences / journals
  • Track emerging trends in your field
  • Incubate cutting-edge technologies aimed at impacting our L3 autonomous driving technology
  • Build and maintain collaborations with top universities, research labs, and industry experts
  • Fulltime
Read More
Arrow Right