CrawlJobs Logo

Member of technical staff - Research - Agent

hcompany.ai Logo

H Company

Location Icon

Location:
France; United Kingdom , Paris

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential. H is hiring the world’s best AI talent, seeking those who are dedicated as much to building safely and responsibly as to advancing disruptive agentic capabilities. We promote a mindset of openness, learning, and collaboration, where everyone has something to contribute. About the Team: The Agent team defines new learning algorithms and agent paradigms to push the frontiers of agentic systems. We build upon foundation models and reinforcement learning to develop new approaches to train artificial general agents and work closely with the LLM/VLM and Safety teams to explore new directions. This is a heavily engineering-focused role embedded within the research team. You will be responsible for defining the architecture and building the robust, scalable systems that underpin our research efforts. Your work will translate cutting-edge research concepts into high-performance, production-quality platforms, enabling the next generation of agentic AI.

Job Responsibility:

  • Research & Leadership: Design and develop new agents, proposing new research directions, e.g., combining state-of-the-art RL with foundation models (LLMs/VLMs)
  • Algorithm & Systems Design: Design, implement, and scale complex, high-performance systems for training large-scale agents. This includes both the foundational infrastructure and the novel algorithms, reward models, and sophisticated training environments
  • Research-to-Production: Collaborate closely with researchers and engineers to implement, test, and productionize new agent logics, learning algorithms, and system architectures
  • Evaluation & Reliability: Create, manage, and scale massive benchmarks and evaluation systems to rigorously track agent capabilities. You will own system reliability, scalability, and observability for our entire research infrastructure
  • Mentorship & Standards: Mentor and guide other engineers and researchers on the team, fostering technical excellence. You will establish and enforce engineering standards, tooling, and best practices for both code and research design
  • Innovation: Conduct thorough code and design reviews, champion technical innovation, and proactively address technical debt to accelerate the R&D lifecycle

Requirements:

  • Senior Experience: Previous demonstrable role(s) as a Staff, Principal, or Senior Engineer (or equivalent Research Scientist) in a Frontier AI Lab with a proven track record of leading complex, end-to-end AI/ML projects from conception to production
  • Education / Publication: Preferably PhD (or equivalent research experience) in Machine Learning, Computer Science, or a related field, preferably with a strong publication record (e.g., NeurIPS, ICML, ICLR) in Computer Science
  • Core Expertise: Deep theoretical and practical expertise in Agentic AI and proven experience building, scaling, and shipping solutions involving foundation models (LLMs/VLMs)
  • Soft Skills: Collaborative: Enjoys collaboration and thrives in a teamwork-oriented, fast-paced research environment
  • High-Impact Communicator: Possesses impactful communication skills, with the ability to bridge the gap between research and engineering and articulate complex ideas clearly
  • Mission-Driven: Genuinely eager to explore and solve the new engineering and research challenges at the frontier of agentic AI

Nice to have:

  • Practical experience applying Reinforcement Learning to systems built on Large Language Models (LLMs)
  • Experience with distributed systems or cloud computing, preferably in AWS
  • Familiarity with building complex simulation environments for agent training
  • Experience with LLM training or fine-tuning
  • Experience developing large-scale evaluation and benchmarking systems for AI models
  • Experience in an agentic framework (e.g., LangChain, AutoGen, CrewAI, OpenAI SDK)
  • Expertise in system architecture, instrumentation, observability, and monitoring for complex, high-performance systems
What we offer:
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic, and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Member of technical staff - Research - Agent

Member of Technical Staff – Backend

As a backend engineer at Inflection, you will own the platforms, systems, and se...
Location
Location
United States , Palo Alto
Salary
Salary:
175000.00 - 350000.00 USD / Year
inflection.ai Logo
Inflection AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience building and scaling backend systems for high-throughput applications
  • Fluent in building distributed systems with Python, Go, Rust, or similar languages
  • Comfortable with cloud-native architectures (e.g., Kubernetes, gRPC, Postgres, Redis, Kafka)
  • Owned backend services end-to-end—from design and implementation to deployment, monitoring, and debugging
  • Thrive in fast-paced environments where you can move quickly without sacrificing engineering rigor
  • Proactively improve tooling and infrastructure to support teammates’ workflows and reliability goals
  • Communicate clearly across disciplines and take pride in solving user-facing problems with clean backend solutions
  • Have a bachelor’s degree or equivalent in a related field to the offered position requirements
Job Responsibility
Job Responsibility
  • Design and implement scalable backend systems and APIs that power production LLM experiences, including agentic workflows, memory systems, and tool integrations
  • Build and operate high-availability infrastructure to support real-time inference, retrieval, and conversation pipelines
  • Develop internal platforms to improve engineering productivity—CI/CD pipelines, service templates, observability frameworks, and rollout tooling
  • Collaborate closely with applied research and frontend teams to rapidly prototype, ship, and iterate on end-user features
  • Ensure systems meet our high bar for security, uptime, and latency—through incident response, load testing, monitoring, and automation
  • Participate in on-call rotations to maintain the reliability of the services you build
What we offer
What we offer
  • Diverse medical, dental and vision options
  • 401k matching program
  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Support of country-specific visa needs for international employees living in the Bay Area
  • Competitive stock options
Read More
Arrow Right
New

Member of Technical Staff, Agents Modeling

We’re looking for an experienced machine learning researcher / engineer who can ...
Location
Location
United States , New York
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Have a PhD in computer science or related field or similar industry research experience
  • Strong software engineering skills
  • Proficiency in Python and experience with ML-related code (e.g., pytorch, numpy, etc.)
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines
Job Responsibility
Job Responsibility
  • Design and develop novel agentic solutions
  • Improve upon SOTA on hard agentic tasks
  • Research the next-generation of on-line learning-from-experience self-improvement
  • Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve performance of agentic system
  • Work with an amazing team of researchers and engineers pushing the boundaries
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff, Next Generation Agents

Agentic LLM systems are being deployed widely across enterprise companies includ...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong software engineering skills
  • Proficiency in Python and have some experience with ML-related code (e.g., pytorch, numpy, etc.)
  • Experience with LLMs and agentic frameworks
  • Experience with post-training LLMs (SFT, PEFT, or RL*)
  • Experience with building synthetic data generation pipelines
Job Responsibility
Job Responsibility
  • Design and develop novel agentic solutions
  • Improve upon SOTA on hard agentic tasks
  • Research the next-generation of on-line learning-from-experience self-improvement
  • Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve performance of agentic system
  • Work with an amazing team of researchers and engineers pushing the boundaries
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff, Agent Code

Code-generating LLMs and autonomous agents are revolutionizing how software is b...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Machine Learning, or a related field, with publications in top-tier venues (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP)
  • Deep expertise in code LLMs and agent systems, with a strong understanding of the latest research and trends
  • Hands-on experience with frontier LLMs and their applications in code generation or automation
  • Strong software engineering skills, with proficiency in Python and PyTorch, TensorFlow, or similar frameworks
  • Experience with distributed systems, cloud infrastructure, and scalable architectures
  • A proactive, self-motivated mindset, with a passion for solving ambitious, open-ended problems
Job Responsibility
Job Responsibility
  • Stay up-to-date with the latest research in code LLMs, agents, and related fields, implementing novel ideas into our systems
  • Design and implement scalable strategies to train code models, and deploy agent frameworks for inference and sampling
  • Hillclimb on existing benchmarks and design new ones that reflect the needs of our enterprise users
  • Lead experiments on our state-of-the-art compute infrastructure, pushing the boundaries of what’s possible with frontier LLMs
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right
New

Member of technical staff - Research - Model

H exists to push the boundaries of superintelligence with agentic AI. By automat...
Location
Location
France; United Kingdom , Paris; London
Salary
Salary:
Not provided
hcompany.ai Logo
H Company
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming skills (Python, Git)
  • Expertise in deep learning frameworks (PyTorch, JAX, TensorFlow)
  • Experience with large-scale distributed training of LLMs and VLMs
  • Hands-on experience with LLM training, alignment, and reinforcement learning
  • Knowledge of multimodal architectures and applications
  • Publications in top-tier AI conferences (e.g., NeurIPS, ICML, CVPR, ACL, ICCV)
  • Advanced degree (PhD or MSc) in a relevant field (e.g., ML, DL, NLP, CV)
  • Excellent communication and presentation skills
  • Strong collaboration and teamwork skills
  • Passion for AI and problem-solving
Job Responsibility
Job Responsibility
  • Develop and train advanced LLMs and VLMs, including multimodal architectures
  • Research and implement training methods for enhanced capabilities like instruction following and tool use
  • Design and optimize data pipelines and training systems for large-scale distributed training
  • Collaborate with cross-functional teams to integrate models into agentic AI systems
  • Evaluate model performance and communicate findings to stakeholders
  • Stay current with advancements in LLMs, VLMs, and related fields
What we offer
What we offer
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff, Integration/RL Team (Research Engineer)

The integration team is responsible for developing and scaling machine learning ...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extremely strong software engineering skills
  • Value test-driven development methods, clean code, and strive to reduce technical debts at all levels
  • Proficiency in Python and related ML frameworks such as JAX, Pytorch and/or XLA/MLIR
  • Experience using and debugging large-scale distributed training strategies (memory/speed profiling)
  • [Bonus] Experience with distributed training infrastructures (Kubernetes) and associated frameworks (Ray)
  • [Bonus] Hands-on experience with the post-training phase of model training, with a strong emphasis on scalability and performance
  • [Bonus] Experience in ML, LLM and RL academic research
Job Responsibility
Job Responsibility
  • Design and write high-performing and scalable software for training models
  • Develop new tools to support and accelerate research and LLM training
  • Coordinate with other engineering teams (Infrastructure, Efficiency, Serving) and the scientific teams (Agent, Multimodal, Multilingual, etc.) to create a strong and integrated post-training ecosystem
  • Craft and implement techniques to improve performance and speed up our training cycles, both on SFT, offline preference, and the RL regime
  • Research, implement, and experiment with ideas on our cluster and data infrastructure
  • Collaborate, Collaborate, and Collaborate with other scientists, engineers, and teams!
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff, MLE

At Cohere, our Members of Technical Staff are at the forefront of defining and s...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extremely strong software engineering skills
  • Proficiency in Python and related ML frameworks such as Tensorflow, TF-Serving, JAX, and XLA/MLIR
  • Deep experience in building and leading a product-centric organisation
  • Direct experience working as part of a team building Large Language Models
  • Released multiple features with several iterations
  • Strong track record of creating and curating large-scale datasets
  • Experience using large-scale distributed training strategies
  • Familiarity with autoregressive sequence models, such as Transformers
  • Ability to collaborate effectively with human annotators and cross-functional teams
  • Paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP)
Job Responsibility
Job Responsibility
  • Join a small, diverse team of engineers in designing, building, and scaling AI systems that underpin our suite of dev-centric enterprise products
  • Work directly on North, Cohere’s all-in-one secure AI workspace platform. Here you will drive agent development in RAG, tool use, and language agents embedded in North
  • Quickly research and experiment with novel ideas on our supercomputer and data infrastructure, ensuring our products remain at the forefront of the industry
  • Collaborate with top researchers, engineers, and annotators to create and evaluate data for post-training LLMs, ensuring our products are of the highest quality and performance
  • Engage with the latest AI and deep learning research, staying up to date with leading conferences such as NeurIPS, ICLR, and AAAI
  • Leverage product data to understand usage patterns and identify areas for improvement, ensuring our products remain relevant and competitive
  • Work closely with leadership to shape company strategy and goals, ensuring our product vision is aligned with our overall business objectives
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff, Senior/Staff MLE

This is not a typical “Applied Scientist” or “ML Engineer” role. As a Member of ...
Location
Location
United States; Canada , San Francisco; New York; Toronto; Montreal
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong ML fundamentals and the ability to frame complex, ambiguous problems as ML solutions
  • Fluency with Python and core ML/LLM frameworks
  • Experience working with large-scale datasets and distributed training or inference pipelines
  • Understanding of LLM architectures, tuning techniques (CPT, post-training), and evaluation methodologies
  • Demonstrated ability to meaningfully shape LLM performance
  • Experience engaging directly with customers or stakeholders to design and deliver ML-powered solutions
  • A track record of technical leadership at a team level
  • A broad view of the ML research landscape and a desire to push the state of the art
  • Bias toward action, high ownership, and comfort with ambiguity
  • Humility and strong collaboration instincts
Job Responsibility
Job Responsibility
  • Lead the design and delivery of custom LLM solutions for enterprise customers
  • Translate ambiguous business problems into well-framed ML problems with clear success criteria and evaluation methodologies
  • Build custom models using Cohere’s foundation model stack, CPT recipes, post-training pipelines (including RLVR), and data assets
  • Develop SOTA modeling techniques that directly enhance model performance for customer use-cases
  • Contribute improvements back to the foundation-model stack — including new capabilities, tuning strategies, and evaluation frameworks
  • Work closely with enterprise customers to identify high-value opportunities where LLMs can unlock transformative impact
  • Provide technical leadership across discovery, scoping, modeling, deployment, agent workflows, and post-deployment iteration
  • Establish evaluation frameworks and success metrics for custom modeling engagements
  • Mentor engineers across distributed teams
  • Drive clarity in ambiguous situations, build alignment, and raise engineering and modeling quality across the organization
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right