CrawlJobs Logo

Research Scientist, Agent Robustness

scale.com Logo

Scale

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

197400.00 - 246750.00 USD / Year

Job Description:

As a Research Scientist working on Agent Robustness you will work on the fundamental challenges of building AI agents that are safe and aligned with humans.

Job Responsibility:

  • Research the science of AI agent capabilities with a focus on safety, risk factors, and benchmarking methodologies
  • Design and build harnesses to test AI agents’ tendency to take harmful actions
  • Design and build exploits and mitigations for new failure modes
  • Characterize and design mitigations for potential failure modes of systems involving multiple interacting AI agents

Requirements:

  • Commitment to mission of promoting safe, secure, and trustworthy AI deployments
  • Practical experience conducting technical research collaboratively
  • Experience building and leveraging agent scaffolding, designing evaluation harnesses, and quickly turning new ideas into working prototypes
  • Experience with post-training and RL techniques such as RLHF, DPO, GRPO
  • A track record of published research in machine learning, particularly in generative AI
  • At least three years of experience addressing sophisticated ML problems
  • Strong written and verbal communication skills

Nice to have:

  • Hands-on experience with agent evaluation frameworks such as SWE-bench, WebArena, OSWorld, Inspect
  • Experience with red-teaming, prompt injection, or adversarial testing of AI systems
What we offer:
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Commuter stipend
  • Equity grant

Additional Information:

Job Posted:
March 22, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Scientist, Agent Robustness

Gen AI Developer

The Gen AI Developer is a senior-level position responsible for designing, devel...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of professional experience in software development with a focus on AI, machine learning, or agent-based systems
  • Strong proficiency in Python, SQL
  • Java is a plus
  • Solid understanding of core AI concepts, including knowledge representation, automated planning, decision-making under uncertainty, and multi-agent systems
  • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch) and relevant libraries (e.g., Scikit-Learn, NumPy, Pandas)
  • Familiarity with large language models (LLMs) and their application in agentic systems
  • Familiarity with specific agent frameworks (e.g., LangChain, AutoGen, CrewAI, RAG) or research in multi-agent reinforcement learning
  • Experience in designing and implementing APIs for AI services
  • Experience with software development best practices, including version control (Git), CI/CD pipelines, testing, and code reviews
  • Excellent analytical and problem-solving skills with a creative approach to complex challenges
Job Responsibility
Job Responsibility
  • Agent Design and Development: Design and implement intelligent agents, including their perception, reasoning, planning, and action execution modules
  • System Architecture: Develop scalable and robust architectures for agentic systems, ensuring high performance, reliability, and security
  • Machine Learning Integration: Integrate various machine learning models (e.g., LLMs, reinforcement learning, predictive models) to enhance agent capabilities and decision-making
  • Task Automation: Develop agents that can automate complex tasks, optimize workflows, and solve real-world problems across various domains
  • Framework and Tooling: Utilize and contribute to agentic AI frameworks and development tools
  • Evaluation and Optimization: Design and implement metrics and evaluation strategies for agent performance, continuously optimizing and improving agent behavior
  • Research and Innovation: Stay abreast of the latest advancements in AI, particularly in agent-based systems, autonomous AI, and related fields, and propose innovative solutions
  • Collaboration: Work closely with cross-functional teams including AI researchers, data scientists, product managers, and software engineers to integrate agentic solutions into broader products and services
  • Documentation: Create comprehensive technical documentation for agent designs, implementations, and operational procedures
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
What we offer
What we offer
  • Private Medical Care Program
  • Life Insurance Program
  • Pension Plan contribution (PPE Program)
  • Employee Assistance Program
  • Paid Parental Leave Program (maternity and paternity leave)
  • Sport Card
  • Holidays Allowance
  • Sport and team recreation activities
  • Special offers and discounts for employees
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right

Senior Gen AI Developer

The Senior Gen AI Developer is a senior-level position responsible for designing...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of professional experience in software development with a focus on AI, machine learning, or agent-based systems
  • Strong proficiency in Python, SQL
  • Java is a plus
  • Solid understanding of core AI concepts, including knowledge representation, automated planning, decision-making under uncertainty, and multi-agent systems
  • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch) and relevant libraries (e.g., Scikit-Learn, NumPy, Pandas)
  • Familiarity with large language models (LLMs) and their application in agentic systems
  • Familiarity with specific agent frameworks (e.g., LangChain, AutoGen, CrewAI, RAG) or research in multi-agent reinforcement learning
  • Experience in designing and implementing APIs for AI services
  • Experience with software development best practices, including version control (Git), CI/CD pipelines, testing, and code reviews
  • Excellent analytical and problem-solving skills with a creative approach to complex challenges
Job Responsibility
Job Responsibility
  • Agent Design and Development: Design and implement intelligent agents, including their perception, reasoning, planning, and action execution modules
  • System Architecture: Develop scalable and robust architectures for agentic systems, ensuring high performance, reliability, and security
  • Machine Learning Integration: Integrate various machine learning models (e.g., LLMs, reinforcement learning, predictive models) to enhance agent capabilities and decision-making
  • Task Automation: Develop agents that can automate complex tasks, optimize workflows, and solve real-world problems across various domains
  • Framework and Tooling: Utilize and contribute to agentic AI frameworks and development tools
  • Evaluation and Optimization: Design and implement metrics and evaluation strategies for agent performance, continuously optimizing and improving agent behavior
  • Research and Innovation: Stay abreast of the latest advancements in AI, particularly in agent-based systems, autonomous AI, and related fields, and propose innovative solutions
  • Collaboration: Work closely with cross-functional teams including AI researchers, data scientists, product managers, and software engineers to integrate agentic solutions into broader products and services
  • Documentation: Create comprehensive technical documentation for agent designs, implementations, and operational procedures
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
What we offer
What we offer
  • Private Medical Care Program
  • Life Insurance Program
  • Pension Plan contribution (PPE Program)
  • Employee Assistance Program
  • Paid Parental Leave Program (maternity and paternity leave)
  • Sport Card
  • Holidays Allowance
  • Sport and team recreation activities
  • Special offers and discounts for employees
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right

Senior AI Software Engineer

The Senior AI Software Engineer (Applications Development Technology Lead Analys...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of professional experience in software development with a focus on AI, machine learning, or agent-based systems
  • Strong proficiency in Python, SQL
  • Java is a plus
  • Solid understanding of core AI concepts, including knowledge representation, automated planning, decision-making under uncertainty, and multi-agent systems
  • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch) and relevant libraries (e.g., Scikit-Learn, NumPy, Pandas)
  • Familiarity with large language models (LLMs) and their application in agentic systems
  • Familiarity with specific agent frameworks (e.g., LangChain, AutoGen, CrewAI, RAG) or research in multi-agent reinforcement learning
  • Experience in designing and implementing APIs for AI services
  • Experience with software development best practices, including version control (Git), CI/CD pipelines, testing, and code reviews
  • Excellent analytical and problem-solving skills with a creative approach to complex challenges
Job Responsibility
Job Responsibility
  • Design and implement intelligent agents, including their perception, reasoning, planning, and action execution modules
  • Develop scalable and robust architectures for agentic systems, ensuring high performance, reliability, and security
  • Integrate various machine learning models (e.g., LLMs, reinforcement learning, predictive models) to enhance agent capabilities and decision-making
  • Develop agents that can automate complex tasks, optimize workflows, and solve real-world problems across various domains
  • Utilize and contribute to agentic AI frameworks and development tools
  • Design and implement metrics and evaluation strategies for agent performance, continuously optimizing and improving agent behavior
  • Stay abreast of the latest advancements in AI, particularly in agent-based systems, autonomous AI, and related fields, and propose innovative solutions
  • Work closely with cross-functional teams including AI researchers, data scientists, product managers, and software engineers to integrate agentic solutions into broader products and services
  • Create comprehensive technical documentation for agent designs, implementations, and operational procedures
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
  • Fulltime
Read More
Arrow Right

Senior AI Data Engineer

The Senior AI Data Engineer (Applications Development Technology Lead Analyst - ...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of professional experience in software development with a focus on AI, machine learning, or agent-based systems
  • Strong proficiency in Python, SQL
  • Java is a plus
  • Solid understanding of core AI concepts, including knowledge representation, automated planning, decision-making under uncertainty, and multi-agent systems
  • Experience with machine learning frameworks (e.g., TensorFlow, PyTorch) and relevant libraries (e.g., Scikit-Learn, NumPy, Pandas)
  • Familiarity with large language models (LLMs) and their application in agentic systems
  • Familiarity with specific agent frameworks (e.g., LangChain, AutoGen, CrewAI, RAG) or research in multi-agent reinforcement learning
  • Experience in designing and implementing APIs for AI services
  • Experience with software development best practices, including version control (Git), CI/CD pipelines, testing, and code reviews
  • Excellent analytical and problem-solving skills with a creative approach to complex challenges
Job Responsibility
Job Responsibility
  • Design and implement intelligent agents, including their perception, reasoning, planning, and action execution modules
  • Develop scalable and robust architectures for agentic systems, ensuring high performance, reliability, and security
  • Integrate various machine learning models (e.g., LLMs, reinforcement learning, predictive models) to enhance agent capabilities and decision-making
  • Develop agents that can automate complex tasks, optimize workflows, and solve real-world problems across various domains
  • Utilize and contribute to agentic AI frameworks and development tools
  • Design and implement metrics and evaluation strategies for agent performance, continuously optimizing and improving agent behavior
  • Stay abreast of the latest advancements in AI, particularly in agent-based systems, autonomous AI, and related fields, and propose innovative solutions
  • Work closely with cross-functional teams including AI researchers, data scientists, product managers, and software engineers to integrate agentic solutions into broader products and services
  • Create comprehensive technical documentation for agent designs, implementations, and operational procedures
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
  • Fulltime
Read More
Arrow Right

Research Engineering Manager, Post-Training

Meta is seeking a Research Engineering Manager to lead the Post-Training team wi...
Location
Location
United States , Menlo Park
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 4+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • 3+ years of experience managing or leading technical teams, including hiring, mentoring, and performance management
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Proven track record of leading medium to large-scale technical projects (specifically data pipelines or ML infrastructure) from conception to deployment
  • Software engineering practices including version control, testing, code review, and system design
  • Demonstrated ability to balance hands-on technical work with people management and strategic planning
  • Great communication skills with the ability to influence cross-functional stakeholders
Job Responsibility
Job Responsibility
  • Build, mentor, and grow a team of research engineers focused on full-stack post-training data infrastructure
  • Conduct performance reviews, career development conversations, and provide technical mentorship to team members
  • Foster a Culture of Engineering Excellence, data rigor, and rapid iteration within the team
  • Partner with recruiting to hire world-class research engineering talent
  • Oversee the development and scaling of data collection pipelines for high-value domains (STEM, GDP-valuable tasks, finance, legal, health) and complex agentic workflows (deep research, computer use, shopping agents)
  • Establish and manage partnerships with external data vendors to source and securely prepare expert-level post-training datasets
  • Influence the technical roadmap for data infrastructure in collaboration with the MSL Infra team
  • Translate the strategic vision of research scientists into actionable engineering plans for synthetic data generation, SFT, and RLHF pipelines
  • Partner with research scientists, product teams, and model training teams to align data collection priorities with organizational capability goals
  • Build robust, reusable data pipelines that can rapidly deliver high-quality datasets to multiple model lines
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right
New

Research Engineer, Post-Training

Meta is seeking Research Engineers to join the Post-Training team within Meta Su...
Location
Location
United States , Menlo Park
Salary
Salary:
217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 3+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing, and completing medium to large technical features independently, without guidance
  • Software engineering practices including version control, testing, and code review practices
  • Ability to work independently and adapt to rapidly changing priorities
Job Responsibility
Job Responsibility
  • Design, build, and scale full-stack data collection pipelines for post-training (SFT, RLHF) across text, vision, and action modalities
  • Develop and implement environments to capture complex agentic trajectories, including computer use agents, research workflows, UI generation, and shopping agents
  • Collaborate with external data vendors and domain experts to source, securely ingest, and prepare high-quality datasets in fields like STEM, finance, legal, and health
  • Execute on the technical vision of research scientists to generate and filter high-quality synthetic data at scale
  • Build robust, reusable data processing pipelines that scale across multiple model lines and product areas
  • Contribute to tooling that measures and ensures the quality, variety, and safety of post-training datasets
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right
New

Research Engineer, Post-Training

Meta is seeking Research Engineers to join the Post-Training team within Meta Su...
Location
Location
United States , Menlo Park
Salary
Salary:
257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 5+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing, and completing medium to large technical features independently, without guidance
  • Demonstrated experiences in software engineering practices including version control, testing, and code review practices
  • Ability to work independently and adapt to rapidly changing priorities
Job Responsibility
Job Responsibility
  • Design, build, and scale full-stack data collection pipelines for post-training (SFT, RLHF) across text, vision, and action modalities
  • Develop and implement environments to capture complex agentic trajectories, including computer use agents, Deep research workflows, UI generation, and shopping agents
  • Collaborate with external data vendors and domain experts to source, securely ingest, and prepare high-quality datasets in fields like STEM, finance, legal, and health
  • Execute on the technical vision of research scientists to generate and filter high-quality synthetic data at scale
  • Build robust, reusable data processing pipelines that scale across multiple model lines and product areas
  • Contribute to tooling that measures and ensures the Quality, Diversity, and Safety of post-training datasets
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Research Engineer, Post-Training

Meta is seeking Research Engineers to join the Post-Training team within Meta Su...
Location
Location
United States , Menlo Park
Salary
Salary:
181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 1+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing, and completing medium to large technical features independently, without guidance
  • Demonstrated experience in software engineering practices including version control, testing, and code review practices
  • Ability to work independently and adapt to rapidly changing priorities
Job Responsibility
Job Responsibility
  • Design, build, and scale full-stack data collection pipelines for post-training (SFT, RLHF) across text, vision, and action modalities
  • Develop and implement environments to capture complex agentic trajectories, including computer use agents, Deep research workflows, UI generation, and shopping agents
  • Collaborate with external data vendors and domain experts to source, securely ingest, and prepare high-quality datasets in fields like STEM, finance, legal, and health
  • Execute on the technical vision of research scientists to generate and filter high-quality synthetic data at scale
  • Build robust, reusable data processing pipelines that scale across multiple model lines and product areas
  • Contribute to tooling that measures and ensures the Quality, Diversity, and Safety of post-training datasets
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right