CrawlJobs Logo

Member of technical staff - Research - Agent

hcompany.ai Logo

H Company

Location Icon

Location:
France; United Kingdom , Paris

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential. H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning, and collaboration, where everyone has something to contribute. About the Team: The Agent team defines new learning algorithms and agent paradigms to push the frontiers of agentic systems. We build upon foundation models and reinforcement learning to develop new approaches to train artificial general agents and work closely with the LLM/VLM and Safety teams to explore new directions. This is a heavily engineering-focused role embedded within the research team. You will be responsible for defining the architecture and building the robust, scalable systems that underpin our research efforts. Your work will translate cutting-edge research concepts into high-performance, production-quality platforms, enabling the next generation of agentic AI.

Job Responsibility:

  • Research & Leadership: Design and develop new agents, proposing new research directions, e.g., combining state-of-the-art RL with foundation models (LLMs/VLMs)
  • Algorithm & Systems Design: Design, implement, and scale complex, high-performance systems for training large-scale agents. This includes both the foundational infrastructure and the novel algorithms, reward models, and sophisticated training environments
  • Research-to-Production: Collaborate closely with researchers and engineers to implement, test, and productionize new agent logics, learning algorithms, and system architectures
  • Evaluation & Reliability: Create, manage, and scale massive benchmarks and evaluation systems to rigorously track agent capabilities. You will own system reliability, scalability, and observability for our entire research infrastructure
  • Mentorship & Standards: Mentor and guide other engineers and researchers on the team, fostering technical excellence. You will establish and enforce engineering standards, tooling, and best practices for both code and research design
  • Innovation: Conduct thorough code and design reviews, champion technical innovation, and proactively address technical debt to accelerate the R&D lifecycle

Requirements:

  • Senior Experience: Previous demonstrable role(s) as a Staff, Principal, or Senior Engineer (or equivalent Research Scientist) in a Frontier AI Lab with a proven track record of leading complex, end-to-end AI/ML projects from conception to production
  • Education / Publication: Preferably PhD (or equivalent research experience) in Machine Learning, Computer Science, or a related field, preferably with a strong publication record (e.g., NeurIPS, ICML, ICLR) in Computer Science
  • Core Expertise: Deep theoretical and practical expertise in Agentic AI and proven experience building, scaling, and shipping solutions involving foundation models (LLMs/VLMs)
  • Soft Skills: Collaborative: Enjoys collaboration and thrives in a teamwork-oriented, fast-paced research environment
  • High-Impact Communicator: Possesses impactful communication skills, with the ability to bridge the gap between research and engineering and articulate complex ideas clearly
  • Mission-Driven: Genuinely eager to explore and solve the new engineering and research challenges at the frontier of agentic AI

Nice to have:

  • Practical experience applying Reinforcement Learning to systems built on Large Language Models (LLMs)
  • Experience with distributed systems or cloud computing, preferably in AWS
  • Familiarity with building complex simulation environments for agent training
  • Experience with LLM training or fine-tuning
  • Experience developing large-scale evaluation and benchmarking systems for AI models
  • Experience in an agentic framework (e.g., LangChain, AutoGen, CrewAI, OpenAI SDK)
  • Expertise in system architecture, instrumentation, observability, and monitoring for complex, high-performance systems
What we offer:
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic, and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:
PREMIUM
More languages and countries
+ Unlock 31698 hidden job offers
Languages
English Čeština Deutsch Ελληνικά Español Français +15
Countries
United States United Kingdom India Canada Australia +
See plans
Plans from $2.99 / month

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Member of technical staff - Research - Agent

Member of Technical Staff – Backend

As a backend engineer at Inflection, you will own the platforms, systems, and se...
Location
Location
United States , Palo Alto
Salary
Salary:
175000.00 - 350000.00 USD / Year
inflection.ai Logo
Inflection AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience building and scaling backend systems for high-throughput applications
  • Fluent in building distributed systems with Python, Go, Rust, or similar languages
  • Comfortable with cloud-native architectures (e.g., Kubernetes, gRPC, Postgres, Redis, Kafka)
  • Owned backend services end-to-end—from design and implementation to deployment, monitoring, and debugging
  • Thrive in fast-paced environments where you can move quickly without sacrificing engineering rigor
  • Proactively improve tooling and infrastructure to support teammates’ workflows and reliability goals
  • Communicate clearly across disciplines and take pride in solving user-facing problems with clean backend solutions
  • Have a bachelor’s degree or equivalent in a related field to the offered position requirements
Job Responsibility
Job Responsibility
  • Design and implement scalable backend systems and APIs that power production LLM experiences, including agentic workflows, memory systems, and tool integrations
  • Build and operate high-availability infrastructure to support real-time inference, retrieval, and conversation pipelines
  • Develop internal platforms to improve engineering productivity—CI/CD pipelines, service templates, observability frameworks, and rollout tooling
  • Collaborate closely with applied research and frontend teams to rapidly prototype, ship, and iterate on end-user features
  • Ensure systems meet our high bar for security, uptime, and latency—through incident response, load testing, monitoring, and automation
  • Participate in on-call rotations to maintain the reliability of the services you build
What we offer
What we offer
  • Diverse medical, dental and vision options
  • 401k matching program
  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Support of country-specific visa needs for international employees living in the Bay Area
  • Competitive stock options
Read More
Arrow Right

Member of Technical Staff, Agents Modeling

We’re looking for an experienced machine learning researcher / engineer who can ...
Location
Location
United States , New York
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Have a PhD in computer science or related field or similar industry research experience
  • Strong software engineering skills
  • Proficiency in Python and experience with ML-related code (e.g., pytorch, numpy, etc.)
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines
Job Responsibility
Job Responsibility
  • Design and develop novel agentic solutions
  • Improve upon SOTA on hard agentic tasks
  • Research the next-generation of on-line learning-from-experience self-improvement
  • Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve performance of agentic system
  • Work with an amazing team of researchers and engineers pushing the boundaries
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Next Generation Agents

Agentic LLM systems are being deployed widely across enterprise companies includ...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong software engineering skills
  • Proficiency in Python and have some experience with ML-related code (e.g., pytorch, numpy, etc.)
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines
Job Responsibility
Job Responsibility
  • Design and develop novel agentic solutions
  • Improve upon SOTA on hard agentic tasks
  • Research the next-generation of on-line learning-from-experience self-improvement
  • Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve performance of agentic system
  • Work with an amazing team of researchers and engineers pushing the boundaries
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Agent Code

Code-generating LLMs and autonomous agents are revolutionizing how software is b...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Machine Learning, or a related field, with publications in top-tier venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP)
  • Deep expertise in code LLMs and agent systems, with a strong understanding of the latest research and trends
  • Hands-on experience with frontier LLMs and their applications in code generation or automation
  • Strong software engineering skills, with proficiency in Python and PyTorch, TensorFlow, or similar frameworks
  • Experience with distributed systems, cloud infrastructure, and scalable architectures
  • A proactive, self-motivated mindset, with a passion for solving ambitious, open-ended problems
Job Responsibility
Job Responsibility
  • Stay up-to-date with the latest research in code LLMs, agents, and related fields, implementing novel ideas into our systems
  • Design and implement scalable strategies to train code models, and deploy agent frameworks for inference and sampling
  • Hillclimb on existing benchmarks and design new ones that reflect the needs of our enterprise users
  • Lead experiments on our state-of-the-art compute infrastructure, pushing the boundaries of what’s possible with frontier LLMs
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Member of technical staff - Research - Model

H exists to push the boundaries of superintelligence with agentic AI. By automat...
Location
Location
France; United Kingdom , Paris; London
Salary
Salary:
Not provided
hcompany.ai Logo
H Company
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming skills (Python, Git)
  • Expertise in deep learning frameworks (PyTorch, JAX, TensorFlow)
  • Experience with large-scale distributed training of LLMs and VLMs
  • Hands-on experience with LLM training, alignment, and reinforcement learning
  • Knowledge of multimodal architectures and applications
  • Publications in top-tier AI conferences (e.g., NeurIPS, ICML, CVPR, ACL, ICCV)
  • Advanced degree (PhD or MSc) in a relevant field (e.g., ML, DL, NLP, CV)
  • Excellent communication and presentation skills
  • Strong collaboration and teamwork skills
  • Passion for AI and problem-solving
Job Responsibility
Job Responsibility
  • Develop and train advanced LLMs and VLMs, including multimodal architectures
  • Research and implement training methods for enhanced capabilities like instruction following and tool use
  • Design and optimize data pipelines and training systems for large-scale distributed training
  • Collaborate with cross-functional teams to integrate models into agentic AI systems
  • Evaluate model performance and communicate findings to stakeholders
  • Stay current with advancements in LLMs, VLMs, and related fields
What we offer
What we offer
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Integration/RL Team (Research Engineer)

The integration team is responsible for developing and scaling machine learning ...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extremely strong software engineering skills
  • Value test-driven development methods, clean code, and strive to reduce technical debts at all levels
  • Proficiency in Python and related ML frameworks such as JAX, Pytorch and/or XLA/MLIR
  • Experience using and debugging large-scale distributed training strategies (memory/speed profiling)
  • [Bonus] Experience with distributed training infrastructures (Kubernetes) and associated frameworks (Ray)
  • [Bonus] Hands-on experience with the post-training phase of model training, with a strong emphasis on scalability and performance
  • [Bonus] Experience in ML, LLM and RL academic research
Job Responsibility
Job Responsibility
  • Design and write high-performing and scalable software for training models
  • Develop new tools to support and accelerate research and LLM training
  • Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem
  • Craft and implement techniques to improve performance and speed up our training cycles, both on SFT, offline preference, and the RL regime
  • Research, implement, and experiment with ideas on our cluster and data infrastructure
  • Collaborate, Collaborate, and Collaborate with other scientists, engineers, and teams!
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Member of Technical Staff

Adyen is building a top-tier AI engineering team in San Francisco to drive our n...
Location
Location
United States , San Francisco
Salary
Salary:
227500.00 - 401000.00 USD / Year
adyen.com Logo
Adyen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • You are deeply embedded in the scientific AI research community and have a strong understanding of the latest SOTA advancements
  • You have significant experience and a strong understanding of Generative AI (GenAI) and Large Language Models (LLMs)
  • You demonstrate a strong engineering mindset with a track record of writing clean, efficient, and scalable code suitable for production environments
  • You have demonstrated experience taking cutting-edge AI research papers and implementing them into production-quality code
  • You demonstrate the ability to think critically and deliver simple and elegant solutions to complex, cross-team problems, influencing strategic direction and fostering innovation across the organization
  • You excel at translating complex technical concepts into clear, understandable terms for diverse audiences, including engineers, executives, and during public events. You adapt your communication style to effectively engage with diverse audiences
  • You thrive in leveraging empathy, influence, negotiation, relationship building, and conflict resolution to foster strong, trust-based collaborations.
Job Responsibility
Job Responsibility
  • Innovate and Deploy: Drive the execution of Adyen's AI strategy, focusing on the practical application of Generative AI (GenAI) and other AI methodologies in finance. This includes contributing to Adyen's efforts in key research areas such as AI agents for data analysis and operational workflows, human-in-the-loop for integrity risk, and development of foundation models
  • Build Production-grade Applications: Bridge the gap between cutting-edge AI research and production by implementing research papers into robust, scalable, and production-ready code. Reduce complexity and dependencies across teams by championing engineering and scientific alignment by setting high quality standards
  • Optimize and Scale: Contribute to defining the long-term vision for AI at Adyen, specifically how AI will interact with humans and finance, including consumers, merchants, and financial institutions. This also includes understanding regulation and advocating for safe innovation in the field
  • Think Outside the Box: Drive innovation by challenging the status quo, introducing transformative ideas and implementing creative solutions to solve real-world problems. Carry out flexible, value-driven assignments, proactively unblocking teams to maximize organizational impact and drive strategic initiatives
  • Force Multiplier: Provide mentorship and horizontal sponsorship across the organization, fostering collaboration to share knowledge and best practices, and cultivating a culture of continuous improvement. This includes deeply engaging them in problem-solving processes and guiding them through execution, fostering their growth through hands-on involvement
  • Team Player: Actively pair with other engineering teams to solve deep-rooted technical challenges and be fully capable of being hands-on with the code, whether creating proof-of-concepts or fixing critical performance issues
  • Learn and Lead: Connect with the broader AI community (including startups, VCs, and AI labs) to stay informed of the latest advancements and identify potential partnership opportunities.
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Machine Learning

As a Member of Technical Staff - Machine Learning, you will work to create LLM m...
Location
Location
United States , Mountain View
Salary
Salary:
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's degree in Computer Science or related technical field AND 1+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Doctorate in Computer Science, Machine Learning, Human-Centered AI or related field and experience in (e.g., finetuning models with supervision or reinforcement learning, understanding and fixing data quality and curation, working with collaborators on creating new products)
  • Experience in machine learning, software engineering
  • Effective communicator and great teammate
  • Takes the initiative, is user-centered and enjoys building world-class AI experiences and products in a fast-paced environment
Job Responsibility
Job Responsibility
  • Own and pursue a research agenda to improve model capability and performance for agentive application
  • Collaborate closely with the other research and product teams, from pretraining to model hosting to unlock new model capabilities
  • Build robust evaluations for tracking modeling improvements
  • Design, implement, test, and debug code across our research stack
  • Work to create LLM models for general purpose capabilities and for products
  • Developing new methods to train core LLM capabilities (including agentive), collecting data, evaluating LLMs, creating data flywheels, tooling for LLM training/evals, writing production quality code, and creating new user-facing features
  • Creating Reinforcement Learning data, fine tuning, or training classifiers or engineering prompts to create SOTA foundation models and support Microsoft products and the Cloud API
  • Fulltime
Read More
Arrow Right