Senior Researcher

Senior AI Researcher

We’re hiring a Senior AI Researcher to lead foundational research at the interse...

Location

United States , San Francisco

Salary:

Not provided

Tavus

Expiration Date

Until further notice

Requirements

A PhD plus 2–3+ years working hands-on with LLMs, VLMs, or multimodal systems
Previous experience leading research efforts or mentoring teams
Expertise in sequence modeling across video, audio, and text — with strong understanding of autoregressive, predictive, and diffusion frameworks
Experience with large-scale model training and optimization for performance and real-time generation
Proven ability to translate research ideas into production-grade systems
Publications in top-tier venues (CVPR, ICCV, NeurIPS, ECCV, ACMMM)
Strong PyTorch skills and comfort moving fluidly between research and engineering

Job Responsibility

Lead research on Foundational Multimodal Models for Conversational Avatars — systems that can perceive, reason, and generate across video, audio, and language
Build and train models using Autoregressive, Predictive (e.g., V-JEPA), and Diffusion-based architectures with a deep focus on temporal and sequential data (not static frames)
Design and execute experiments to predict and control the visual, auditory, and linguistic responses of avatars
Partner with the Applied ML team to bring research into real-world use cases
Mentor other researchers and drive excellence across the team

What we offer

Flexible work schedule
Unlimited PTO
Competitive healthcare
Gear stipends

Fulltime

Senior Generative AI Engineer

At Velvetech we’re innovating at the intersection of retail operations, sales, l...

Location

United States

Salary:

Not provided

Velvetech

Expiration Date

Until further notice

Requirements

Expertise in AI/ML, NLP, and computer vision, and prompt engineering
Experience with cloud services, big data technologies
Expertise in GANs, VAEs, TensorFlow, PyTorch, and deep learning architectures
Proven experience in deploying scalable AI models in cloud environments
Strong programming skills in Python and advanced experience with AI development tools
Advanced analytics and excellent problem-solving abilities
Strong foundation in mathematics, statistics, and data preprocessing

Job Responsibility

Design and implement advanced generative AI models to address key project challenges
Develop generative AI models for enhancing customer experience, optimizing cross-channel sales, order fulfillment, and logistics
Integrate generative AI solutions with various sales and logistics platforms
Innovate in AI-driven customer experience management
Architect and develop advanced AI models using TensorFlow, PyTorch, GANs, VAEs, and Transformers
Implement computer vision algorithms for product assessment and NLP for processing textual data
Design dynamic pricing models that adapt to real-time data
Integrate the AI engine with existing systems via RESTful APIs
Implement a continuous improvement culture through a feedback loop mechanism
Stay abreast of the latest AI research and technologies to continuously improve our solutions

What we offer

Groundbreaking AI project work
Competitive salary
Career growth opportunities in a fast-paced tech-driven company
Result oriented culture, collaborative and highly innovative work environment

Fulltime

Senior Researcher - Efficient AI

Generative AI is transforming how people create, collaborate, and communicate—re...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in relevant field OR Master's Degree in relevant field AND 3+ years related research experience OR Bachelor's Degree in relevant field AND 4+ years related research experience OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
Demonstrated experience in designing and optimizing efficient inference systems, combining foundations in algorithmic optimization, parallel computing, and request orchestration under strict SLO constraints with deep knowledge of attention and KV‑cache optimizations, batching and scheduling strategies, and cost‑aware deployment
3+ years of experience with machine learning frameworks (e.g., PyTorch, TensorFlow) and inference serving frameworks (e.g., vLLM, Triton Inference Server, TensorRT-LLM, ONNX Runtime, Ray Serve, DeepSpeed-MII)
3+ years of experience in GPU programming and optimization, with expert knowledge of CUDA, ROCm, Triton, PTX, CUTLASS, or similar GPU programming frameworks
Proficiency in C++ and Python for high-performance systems, with code quality and profiling/debugging skills
Research impact through publications and/or patents, coupled with hands‑on experience taking research ideas through execution and delivery in production

Job Responsibility

Formulate, develop, and evaluate new algorithmic and system-level approaches for end-to-end AI serving, using analytical modeling and large-scale measurement to study token-level latency, tail latency (p95/p99), throughput-per-dollar, cold-start behavior, warm pool strategies, and capacity planning under multi-tenant SLOs and variable sequence lengths
Design and experimentally evaluate endpoint configuration and execution policies, including batching, routing, and scheduling strategies, tensor and pipeline parallelism, quantization and precision profiles, speculative decoding, and chunked or streaming generation, and drive the most promising approaches through robust rollout and validation into production
Perform hardware- and kernel-aware optimization by collaborating closely with model, kernel, compiler, and hardware teams to align serving algorithms with attention/KV innovations and accelerator capabilities
Build and benchmark experimental prototypes and large-scale measurements to validate research ideas and drive them toward production readiness
produce clear technical documentation, design reviews, and operational playbooks
Publish research results, file patents, and, where appropriate, contribute to open-source systems and serving frameworks

Fulltime

Senior Manager, Data Science - AI Foundations

Senior Manager, Data Science - AI Foundations. Data is at the center of everythi...

Location

United States , McLean; New York; San Jose

Salary:

229900.00 - 286200.00 USD / Year

Capital One

Expiration Date

Until further notice

Requirements

Currently has, or is in the process of obtaining a Bachelor's Degree in a quantitative field plus 7 years of experience performing data analytics
Currently has, or is in the process of obtaining a Master's Degree in a quantitative field or an MBA with a quantitative concentration plus 5 years of experience performing data analytics
Currently has, or is in the process of obtaining a PHD in a quantitative field plus 2 years of experience performing data analytics
At least 2 years of experience leveraging open source programming languages for large scale data analysis
At least 2 years of experience working with machine learning
At least 2 years of experience utilizing relational databases

Job Responsibility

Partner with a cross-functional team to deliver AI powered products that change how developers write software
Lead cutting-edge research and development in Generative AI (GenAI) to enhance conversational AI capabilities and build scalable, futuristic digital assistant solutions
Fine-tune advanced Large Language Models (LLMs) for domain-specific conversational applications, inference optimization, and multi-agentic workflows
Leverage a broad stack of technologies — Python, AWS, Pyspark, LangChain, LangGraph, HuggingFace Transformers, vLLM and VectorDBs, and more
Be the expert in Natural Language Processing (NLP) to harness the power of Large Language Models (LLMs), adapt and finetune them for business specific applications and features
Drive innovation by designing, training, evaluating, and deploying state-of-the-art NLP models, partnering with engineering teams to integrate them into scalable and resilient production systems
Translate complex AI/ML research into tangible business outcomes, improving customer experience through real-time, intelligent digital assistance

What we offer

performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being

Fulltime

Senior AI Research Engineer

Adyen is building a world-class AI team to redefine what intelligent systems can...

Location

Spain , Madrid

Salary:

Not provided

Adyen

Expiration Date

Until further notice

Requirements

6+ years of hands-on experience in applied AI/ML research or engineering, with a clear track record of shipping AI systems, including agentic or LLM-powered systems, in production environments
Deep expertise in language models and Generative AI, with hands-on depth across several of: architecture, post-training (fine-tuning, RLHF), inference optimization, context engineering, and failure modes at scale
Proven experience designing and operating agentic systems at scale, multi-agent orchestration, tool use, memory and context management, state handling for long-running workflows, and human-in-the-loop design
Rigorous and systematic about evaluation
Strong foundation in classical machine learning: supervised learning, ensemble methods, optimization, probabilistic modeling, and statistics
Write clean, well-structured, production-ready code, primarily Python
Hands-on experience with at least one production-grade agentic framework

Job Responsibility

Design and Deploy AI Agents for Complex Tasks
Own Evaluation and Benchmarking
Provide AI Expertise Across the Organization
Raise the Bar

Head of AI Data Science, Intelligence Ventures

The Head of AI Data Science serves as the head of AI research and leader of data...

Location

United States , New York

Salary:

263200.00 - 393800.00 USD / Year

Spectrum

Expiration Date

Until further notice

Requirements

Deep expertise in transformer-based sequence modeling and its application to behavioral or interaction data at consumer scale — including architecture design, training methodology, fine-tuning, and embedding quality evaluation
Proven track record developing and deploying household- or user-level embedding models applied to real-world use cases in media, marketing, commerce, and/or customer intelligence — not just research environments. Demonstrated understanding of the unique characteristics of behavioral sequence data: sparsity, temporal dynamics, multi-entity structure, and the signal differences between behavioral intent and explicit interaction
Strong command of the full data science lifecycle in production settings — from exploratory data analysis and feature engineering through model training, validation, deployment, monitoring, and iteration — at large dataset scale (billions, even trillions of records)
Hands-on proficiency with Python, PyTorch or TensorFlow, and distributed ML training frameworks
experience running ML workloads on cloud platforms (AWS SageMaker, Snowflake Cortex, Databricks, or equivalent)
Experience designing and operationalizing feature stores and predictive modeling pipelines that serve downstream intelligence products, audiences, or decision systems in production environments
Ability to communicate complex AI/ML concepts clearly to non-technical executive audiences, product stakeholders, and external partners
comfort operating as an external-facing technical spokesperson for the platform's modeling capabilities and intelligence differentiation
Track record of leading and growing high-performing data science teams
experience recruiting and developing senior ML talent in competitive markets

Job Responsibility

Direct the research, design, and training of the platform's proprietary transformer-based behavioral embedding model — a multi-entity architecture that encodes household behavior across multiple signal sources into dense, privacy-safe vector representations. Own the full model development lifecycle from architecture decisions and training methodology through validation, deployment, and ongoing iteration as new signal sources and use cases are introduced
Lead the design and build of the platform's Feature Store — translating embedding representations into interpretable, actionable behavioral signals including purchase propensities, category interest intensities, lifestyle affinities, and behavior velocity signals. Oversee the outcome anchoring methodology that trains predictive models against external third-party datasets to produce validated, commercially relevant intelligence outputs across target verticals
Partner with the Head of Technology and external development partners to ensure the AI/ML architecture is production-grade, built for household-scale throughput, and integrated cleanly into the platform's cloud-native infrastructure. Establish model evaluation frameworks, quality benchmarks, and MLOps practices that enforce a strong bias toward production-deployed, commercially validated outputs — not just research-quality results
Serve as the platform's primary AI research voice in external partner conversations — including technical engagements with cloud AI platforms, frontier model teams, and enterprise data partners — articulating the platform's embedding architecture, signal differentiation, and model enrichment value proposition to sophisticated technical counterparts. Contribute to the development of packaged intelligence products such as behavioral demand indices, persona clusters, and predictive propensity scores
Establish the platform's responsible AI framework — including bias testing protocols for behavioral embeddings, model documentation standards, and privacy-preserving ML techniques — ensuring all intelligence products meet ethical and regulatory standards for consumer behavioral data
Build and lead a team of data scientists and ML researchers capable of competing with talent from the world's leading AI research and applied ML organizations. Establish the team's research agenda, hiring priorities, and culture of rigorous experimentation — maintaining a clear bias toward applied, production-oriented work while preserving the intellectual ambition required to stay ahead of a rapidly evolving AI landscape

Fulltime

Senior AI Engineer

In this role you will lead a critical and highly visible function within Teradat...

Location

India , Hyderabad; Pune; Bengaluru

Salary:

Not provided

Teradata

Expiration Date

Until further notice

Requirements

5+ years of hands-on experience in backend development, distributed systems, or AI infrastructure, with a proven track record of delivering in high-scale environments
Expertise in building and deploying AI-integrated software, particularly with LLMs and frameworks like LangChain, AutoGen, CrewAI, Semantic Kernel, or custom orchestrators
Strong development skills in Python (preferred), Go, Java, or similar languages used in intelligent system design
Practical knowledge of agentic AI principles — including task decomposition, autonomous decision-making, memory/context management, and multi-agent collaboration
Experience implementing or integrating the Model Context Protocol (MCP) to facilitate standardized agent context management and interoperability across tools
Extensive experience with Cloud Service Providers (AWS, Azure, GCP) including cloud-native infrastructure, container orchestration (Docker, Kubernetes), and infrastructure-as-code tools (Terraform, Ansible)
Familiarity with vector databases (Pinecone, Weaviate, FAISS) and embedding models for semantic search and retrieval-augmented generation (RAG)
Demonstrated ability to design clean APIs, modular microservices, and resilient, maintainable backend systems
Clear communicator with the ability to simplify complex AI system behaviors into actionable architecture
Passion for AI and a hunger to build systems that push the boundaries of autonomous software

Job Responsibility

Design, develop, and scale intelligent software systems that power autonomous AI agents capable of reasoning, planning, acting, and learning in real-world environments
Lead the implementation of core Agentic AI components — including agent memory, context-aware planning, multi-step tool use, and self-reflective behavior loops
Architect robust, cloud-native backends that support high-throughput agent pipelines across major Cloud Service Providers (AWS, Azure, GCP), ensuring best-in-class observability, fault tolerance, and scalability
Build seamless integrations with large language models (LLMs) such as GPT-4, Claude, Gemini, or open-source models — using advanced techniques like function calling, dynamic prompting, and multi-agent orchestration
Design and implement standardized context management and sharing using the Model Context Protocol (MCP) to enable consistent, interoperable agent and tool interactions
Develop scalable APIs and services to connect agents with internal tools, vector databases, RAG pipelines, and external APIs
Own technical delivery of major agent-related features, leading design reviews, code quality standards, and engineering best practices
Collaborate cross-functionally with researchers, ML engineers, product managers, and UX teams to translate ideas into intelligent, performant, and production-ready systems
Define and implement testing strategies to validate agentic behavior in both deterministic and probabilistic conditions
Guide junior engineers and peers by mentoring, unblocking challenges, and championing a culture of technical excellence

What we offer

Flexible work model
Focus on well-being
Inclusive environment

Fulltime

As a Senior Researcher in Applied Sciences Group you will play a pivotal role in...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research)
OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research)
OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research)
OR equivalent experience
6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
1+ years of experience with generative AI OR LLM/ML algorithms
1+ years of experience working with Generative AI models and ML stacks

Job Responsibility

Build collaborative relationships with product and business groups to deliver AI-driven impact
Research and implement state-of-the-art using foundation models, prompt engineering, graphs, multi-agent architectures, as well as classical machine learning techniques
Fine-tune foundation models using domain-specific datasets. Evaluate model behavior on relevance, bias, hallucination, and response quality via offline evaluations, shadow experiments, online experiments
Build rapid AI solution prototypes, contribute to production deployment of these solutions and debug production code
Contribute to papers, patents, and conference presentations. Translate research into production-ready solutions and measure their impact through A/B testing and telemetry that address customer needs
Ability to use data to identify gaps in AI quality, uncover insights and implement proof of concepts

Fulltime

Select Country

Senior Researcher - Foundations of Generative AI

Job Description

Job Responsibility

Requirements

Looking for more opportunities?

Senior Researcher - Foundations of Generative AI

Senior AI Researcher

Senior Generative AI Engineer

Senior Researcher - Efficient AI

Senior Manager, Data Science - AI Foundations

Senior AI Research Engineer

Head of AI Data Science, Intelligence Ventures

Senior AI Engineer

Senior Researcher

Our AI answers in your language