Senior Researcher - Multimodal AI Job at Microsoft Corporation (Redmond)

Senior AI Researcher - AV

GM Israel (Herzliya) takes a significant part in introducing sophisticated softw...

Location

Israel , Herzliya

Salary:

Not provided

General Motors

Expiration Date

Until further notice

Requirements

PhD in Computer Science, Electrical Engineering, Robotics, or a related field (Excellent M.Sc. graduates will be considered)
Over 3 years of research experience in computer vision, machine learning, autonomous perception, or related areas
Strong publication record at top-tier AI/ML conferences and journals
Excellent coding skills and familiarity with modern AI frameworks
Hands-on experience with large-scale training, 3D data, multimodal perception, or foundation models is highly desirable

Job Responsibility

Drive downstream KPI lift for the autonomous driving agent
Participate in AI research projects in the areas of VLMs / world modeling, computer vision, 3D perception, multimodal sensor fusion, and others
Design, build, train, and evaluate foundation models and large-scale deep learning architectures designed for autonomous driving
Collaborate with engineering teams to translate state-of-the-art research into scalable production solutions
Work towards external publications in top-tier conferences / journals
Track emerging trends in your field
Incubate cutting-edge technologies aimed at impacting our L3 autonomous driving technology
Build and maintain collaborations with top universities, research labs, and industry experts

Fulltime

Senior AI Researcher

We’re hiring a Senior AI Researcher to lead foundational research at the interse...

Location

United States , San Francisco

Salary:

Not provided

Tavus

Expiration Date

Until further notice

Requirements

A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems
Previous experience leading research efforts or mentoring teams
Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks
Experience with large-scale model training and optimization for performance and real-time generation
Proven ability to translate research ideas into production-grade systems
Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM)
Strong PyTorch skills and comfort moving fluidly between research and engineering

Job Responsibility

Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language
Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data (not static frames)
Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars
Partner with the Applied ML team to bring research into real-world use cases
Mentor other researchers and drive excellence across the team

What we offer

Flexible work schedule
Unlimited PTO
Competitive healthcare
Gear stipends

Fulltime

Senior Applied Ai Engineer - Multimodal Transformers

Kodiak Robotics, Inc. was founded in 2018 and has become a leader in autonomous ...

Location

United States , Mountain View

Salary:

200000.00 - 260000.00 USD / Year

Kodiak Robotics

Expiration Date

Until further notice

Requirements

BS, MS, or PhD in AI, Computer Science, or a related field
4+ years experience with transformer architectures, particularly in multimodal or multi-stream settings
Familiarity with cross-attention, token fusion, or modality alignment techniques
Proficiency in Python and deep learning frameworks like PyTorch or TensorFlow
Strong understanding of scalable training for large models, including distributed training and mixed-precision optimization
Passion for building AI that reasons over the full breadth of sensory input to operate safely in the real world

Job Responsibility

Design and develop multimodal transformer architectures that fuse camera, LiDAR, and radar into unified representations
Research and implement cross-modal attention mechanisms, token fusion strategies, and efficient multi-stream tokenization
Build scalable training pipelines for large-scale multimodal transformers across massive real-world datasets
Explore self-supervised and contrastive pretraining objectives that learn transferable multimodal representations
Optimize transformer models for real-time inference under latency and compute constraints

What we offer

Competitive compensation package including equity and annual bonuses
Excellent Medical, Dental, and Vision plans through Kaiser Permanente, Cigna, and MetLife (including a medical plan with infertility benefits)
MetLife Legal Services, Identity & Fraud Protection, Hospital Indemnity Insurance, Accident Insurance, & Critical Illness Insurance
Flexible PTO, 10 paid holidays, and generous parental leave policies
Office perks: dog-friendly, free catered lunch, a fully stocked kitchen, and free EV charging
Long Term Disability, Short Term Disability, Life Insurance
Wellbeing Benefits - Headspace through Cigna, Calm through Kaiser, One Medical, Gympass, Spring Health through Cigna, Rula (mental health navigation)
Fidelity 401(k)
Commuter, FSA, Dependent Care FSA, HSA
Various incentive programs (referral bonuses, patent bonuses, etc.)

Fulltime

Senior Researcher - Foundations of Generative AI

Microsoft Research AI Frontiers lab is seeking applications for the position of ...

Location

United States , New York

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate (or currently pursuing) in Computer Science or relevant field OR equivalent experience
Doctorate in Computer Science or relevant field AND 2+ years related research experience OR equivalent experience
Research program demonstrated by public artifacts like models, tools, code in the AI space or publications at conferences: NeurIPS, ICML, ICLR, ACL, NAACL, CVPR, COLT, ECCV, ICCV, EMNLP
2+ years of academic or industry experience in developing, applying, and/or implementing algorithms for machine learning/statistics, using common ML engineering programming languages and platforms such as Python, Python numerical libraries, PyTorch, TensorFlow and/or HuggingFace
Experience publishing academic papers as a lead author or essential contributor in a top AI conference or journal
Deep understanding of frontier model architectures, especially transformers and state space models
Hands-on experience building and working with Large Language Models (LLMs) or multimodal models (VLMs, VLAs), including pre-training, fine-tuning, and inference
2+ years of industry or academic experience with building, debugging and optimizing large-scale ML training pipelines
Demonstrated software engineering excellence building and deploying prototypes, applications, or open-source (OSS) technologies
Ability to work independently and ramp-up quickly on complex projects or unfamiliar code

Job Responsibility

Apply research and engineering skills to develop, prototype, and evaluate cutting-edge research ideas
Work closely with other researchers and engineers to rapidly prototype and test new research ideas, driving a high-impact agenda and publishing results where appropriate
Collaborate hands-on with other researchers, engineers, and internal and external product groups to deliver high-impact solutions to real-world problems
Embody our culture and values

Fulltime

Senior Member of Technical Staff, Multimodal AI

At Cohere, we believe in the power of multimodal AI to revolutionise the way we ...

Location

Salary:

Not provided

Cohere

Expiration Date

Until further notice

Requirements

Exceptional software engineering skills with a proven track record of building robust and scalable systems
Strong command of Python and well-versed in popular deep learning frameworks like JAX, PyTorch, and TensorFlow, with an understanding of their multimodal capabilities
Knowledge of distributed training strategies, especially for large-scale multimodal models
Familiarity with autoregressive models, particularly their application in multimodal tasks such as image or video captioning, speech-to-text generation

Job Responsibility

Design and develop cutting-edge multimodal AI systems, integrating various modalities such as text, speech, and vision
Conduct research and experiments on our advanced compute infrastructure, exploring novel ideas in multimodal representation learning, transfer learning, and more
Collaborate closely with our world-class teams, learning from and contributing to their expertise in the field

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Fulltime

Senior Dinstinguished Ai Engineer (Agentic Ai Platform)

At Capital One, we are creating responsible and reliable AI systems, changing ba...

Location

United States , San Francisco; McLean; New York; San Jose; Cambridge

Salary:

314800.00 - 392000.00 USD / Year

Capital One

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Engineering, or AI plus at least 10 years of experience developing AI and ML algorithms or technologies, or Master's degree plus at least 8 years of experience developing AI and ML algorithms or technologies
At least 10 years of experience programming with Python, Go, Scala, or Java

Job Responsibility

Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
Contribute to the north star platform architecture, continuously publishing and refining living diagrams and canonical APIs
Standardizing and automating agentic workflows: evaluate agentic frameworks such LangGraph, AutoGen, Semantic Kernal, CrewAI and LlamaIndex and then harden / blend patterns
Contribute to crafting an end to end GenAI SDK, CLI and starter kits
Help bring together a vision of central guardrail services - prompt firewalls, content-filter hooks, red team harnesses and audit APIs
Collaborate with cross organization architects to drive end to end performance by optimizing orchestration - level batching, retrieval caching, heuristic tuning
Accelerate innovation by incubating proof of concepts and driving RFCs such as hierarchical agent memory, multimodal guardrails, multimodal RAG
Own central Helm charts, operators and CRDs that auto scale agents to hit tenant SLAs
Coach and evangelize - hosting architecture office hours, mentoring Staff, Principal and Senior engineers, authoring technical design documents and blogs and representing Capital One at Tier1 AI conferences

What we offer

Health, financial and other benefits that support your total well-being
Performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)

Fulltime

Senior AI Engineer for the Infotainment Platform

With our team you collaborate with experienced AI architects and engineers to bu...

Location

Germany , Munich

Salary:

Not provided

BMW

Expiration Date

Until further notice

Requirements

Master's or PhD in Computer Science, AI, Machine Learning, Data Science, or a related field, with hands-on experience in AI and data architecture beyond prototypes
In-depth knowledge of Android (AOSP / Android Automotive) as a production platform, including its constraints and evolution
Strong experience with AI architecture onboard and offboard, as well as implementation of Small Language Models, Vision Language Models, or autonomous driving models
Knowledge of multi-agent systems orchestration, multi-turn dialogues, and related open standards (e.g., A2A, MCP)
Experience with AI training toolchains and workflows including pruning, fine-tuning, and multimodal encoding
Proficiency in Rust, C++, Kotlin, and modern, API-driven system architectures
Proven technical leadership experience in agile, cross-functional teams

Job Responsibility

Design and own end-to-end AI and data architectures – from data ingestion and feature pipelines to model deployment and monitoring – with a strong focus on production-grade edge AI inside the vehicle
Build scalable platform architectures that serve multiple feature teams across the Android ecosystem (e.g. digital cockpits, infotainment features), enabling them to deliver AI-powered customer experiences reliably and at scale
Provide technical leadership for Android Automotive and its ecosystem, ensuring that AI platform decisions align with Android evolution, AOSP constraints, and long-term platform scalability
Define and evolve clear interfaces between AI platforms, Android infotainment, and application development – staying compatible with new Android versions while keeping the AI infrastructure adaptable and future-proof
Translate emerging AI requirements into production-ready platform capabilities, bridging applied research, platform engineering, and product needs
Continuously drive the AI strategy for infotainment platforms, tracking key technology trends (e.g. on-device LLMs/VLMs, multimodal interaction, agentic AI) and collaborating with leading external technology partners

What we offer

Challenging projects with which we are shaping the mobility of tomorrow together
Wide range of personal and professional development opportunities
Attractive, fair and performance-related remuneration
High level of job security
Annual special payments such as vacation pay, Christmas bonus, and profit sharing
Flexible working hours including 6 weeks annual leave and overtime compensation
Discounted BMW & MINI conditions

Fulltime

Senior AI Engineer

Teradata is building the next generation of AI-native analytics, enabling custom...

Location

India , Hyderabad; Pune; Bangalore

Salary:

Not provided

Teradata

Expiration Date

Until further notice

Requirements

BS/MS/PhD in Computer Science, AI/ML, or a related field
3+ years of software engineering experience with a strong focus on backend systems
Hands-on experience with vector databases or vector search systems
Practical experience building LLM-powered applications, especially RAG systems
Strong understanding of embeddings and similarity search, data chunking and context optimization, dense vs sparse vs hybrid retrieval, semantic search and relevance ranking
Proficiency in Python (and/or Java)
experience with production-grade systems
Experience working with large-scale data and performance-sensitive systems.

Job Responsibility

Design and implement vector store capabilities integrated with Teradata’s analytics platform, including indexing, storage, retrieval, and query optimization
Build end-to-end RAG pipelines, including data ingestion and chunking strategies, embedding generation and lifecycle management, retrieval (dense, sparse, and hybrid search), context assembly and prompt orchestration
Develop and optimize semantic search algorithms and ranking strategies for enterprise workloads
Enable multimodal RAG (text, structured data, images, etc.) and agent-based workflows
Design agentic AI patterns, including tool calling, planning, memory, and orchestration
Implement guardrails for safety, reliability, and governance (hallucination mitigation, rounding, policy enforcement)
Build and maintain RAG evaluation frameworks, including relevance, faithfulness, accuracy, and cost metrics
Collaborate with product, research, and platform teams to translate customer use cases into scalable features
Benchmark Teradata’s vector store and RAG capabilities against industry alternatives (e.g., cloud and open-source solutions)
Contribute to technical design reviews, architecture decisions, and long-term AI platform strategy.

What we offer

People-first culture
Flexible work model
Focus on well-being
Inclusive environment.

Fulltime

Select Country

Senior Researcher - Multimodal AI

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?