CrawlJobs Logo

AI Research Scientist - Safety Alignment Team

United States, Menlo Park 184000.00 - 257000.00 USD / Year · Job Posted January 23, 2026
Apply Position
Job Link Share

Job Description

Meta is seeking AI Research Scientists to join the Safety Alignment team within Meta Superintelligence Labs, dedicated to advancing the safe development and deployment of superintelligent AI. Our mission is to pioneer robust safety alignment techniques that empower Meta’s most ambitious AI capabilities, ensuring billions of users experience our products and services securely and responsibly.

Job Responsibility

  • Design, implement, and evaluate novel safety alignment techniques for large language models and multimodal AI systems
  • Create, curate, and analyze high-quality datasets for safety alignment
  • Fine-tune and evaluate LLMs to adhere to Meta’s safety policies and evolving global standards
  • Build scalable infrastructure and tools for safety evaluation, monitoring, and rapid mitigation of emerging risks
  • Work closely with researchers, engineers, and cross-functional partners to integrate safety alignment into Meta’s products and services
  • Lead complex technical projects end-to-end

Requirements

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science, Machine Learning, or a relevant technical field
  • 3+ years of industry research experience in LLM/NLP, computer vision, or related AI/ML model training
  • Experience as a technical lead on a team and/or leading complex technical projects from end-to-end
  • Publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
  • Programming experience in Python and hands-on experience with frameworks such as PyTorch

Nice to have

  • Hands-on experience applying RL techniques (e.g., RLHF, PPO, DPO, GRPO, RLVF, reward modeling) to fine-tune large language models for safety and policy adherence
  • Experience developing, fine-tuning, or evaluating LLMs across multiple languages and modalities (text, image, voice, video)
  • Demonstrated experience to innovate in safety alignment, including custom guideline enforcement, dynamic policy adaptation, and rapid hotfixing of model vulnerabilities
  • Experience designing, curating, and evaluating safety datasets, including adversarial and borderline prompt pairs for risk mitigation
  • Experience with distributed training of LLMs (hundreds/thousands of GPUs), scalable safety mitigations, and automation of safety tooling

What we offer

  • bonus
  • equity
  • benefits

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

AI Research Scientist - Safety Alignment Team

8 matching positions

Research Scientist, Safety Post Training

As the leading data and evaluation partner for frontier AI companies, Scale play...
Location
Location
United States , San Francisco, CA; New York, NY
Salary
Salary:
216000.00 - 270000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches
  • A track record of published research in machine learning, particularly in generative AI
  • At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development
  • Strong written and verbal communication skills to operate in a cross-functional team
Job Responsibility
Job Responsibility
  • Develop and apply post-training methods and interpretability techniques to make frontier AI systems safer, and better understood by researchers and policymakers
  • Design and run post-training pipelines to study how training choices affect model safety, robustness, and alignment properties
  • Develop interpretability-informed evaluations that reveal how and why models produce unsafe, deceptive, or otherwise undesirable behaviors, and use those insights to guide targeted mitigations
  • Collaborate with policymakers, engineers, and other researchers to translate post-training and interpretability findings into actionable safety standards, evaluation benchmarks, and best practices
What we offer
What we offer
  • comprehensive health, dental and vision coverage
  • retirement benefits
  • learning and development stipend
  • generous PTO
  • commuter stipend (eligible)
  • Fulltime
Read More
Arrow Right

Research Scientist, AI Controls and Monitoring

As a Research Scientist focused on AI Controls and Monitoring, you will design m...
Location
Location
United States , San Francisco; New York
Salary
Salary:
197400.00 - 246750.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Commitment to mission of promoting safe, secure, and trustworthy AI deployments
  • Practical experience conducting technical research collaboratively
  • Comfort designing control and monitoring experiments for AI systems
  • Experience building prototype systems
  • Ability to turn research ideas into working prototypes
  • Track record of published research in machine learning, particularly generative AI
  • At least three years of experience addressing sophisticated ML problems
  • Strong written and verbal communication skills
Job Responsibility
Job Responsibility
  • Design methods, systems, and experiments to ensure advanced AI models and agents remain aligned with intended goals
  • Develop monitoring techniques and observability methods to track AI behavior in real time
  • Research mechanisms for layered control, including fail-safes, oversight protocols, and intervention methods
  • Design red-team simulations to probe weaknesses in oversight and control mechanisms
  • Build mitigations to close identified gaps
  • Collaborate with policymakers, engineers, and other researchers to establish standards and benchmarks
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Possible commuter stipend
  • Equity based compensation
  • Fulltime
Read More
Arrow Right

AI Research Scientist - Speech and Language

Reality Labs Research is Meta’s innovation engine for next-generation AR/VR, AI,...
Location
Location
United States , Redmond
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Artificial Intelligence, Generative AI, or a relevant technical field
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • Demonstrated programming skills in Python and familiarity with large-scale distributed training
  • Familiarity to learn new programming languages quickly
  • Can design, implement, and evaluate RL algorithms in production or research settings
  • Problem-solving, communication, and collaboration skills
Job Responsibility
Job Responsibility
  • Design, implement, and optimize LLM-based agents for a variety of applications, leveraging the latest advances in generative AI
  • Apply reinforcement learning algorithms to improve LLM performance, safety, and alignment
  • Integrate models and orchestrations in production
  • Collaborate with cross-functional teams (research, engineering, product) to deploy and evaluate LLM agents in real-world scenarios
  • Analyze and interpret experimental results, iterate on model architectures, and drive continuous improvement
  • Contribute to the broader AI/ML community at Meta through knowledge sharing, code reviews, and technical mentorship
  • Lead and contribute to research and development of post-training methods, including RLHF (Reinforcement Learning from Human Feedback), reward modeling, and other feedback-based approaches
  • Apply AI Models to Speech Encoding, Decoding, and Synthesis problems
  • Develop Natural Language interaction systems
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right
New

Research Scientist, Advanced Control

At Meta IDC (Infrastructure Data Center), our goal is to deliver the trusted cap...
Location
Location
United States , Menlo Park
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in a science or engineering discipline
  • 8+ years of experience spanning advanced control (e.g., MPC, optimal control, adaptive control, etc.), applied reinforcement learning or AI-driven control, and critical infrastructure control systems
  • Technical leadership experience architecting and delivering research-to-production projects
  • Working knowledge of mechanical, electrical, and thermal systems in industrial or critical infrastructure environments
  • Demonstrated track record of leading interdisciplinary research and engineering initiatives across teams or organizations
  • Experience communicating technical strategy to both technical and non-technical audiences
  • Experience driving alignment in cross-functional, matrixed organizations
Job Responsibility
Job Responsibility
  • Define and own the advanced control roadmap in IDC, building on the team's existing physical modeling capabilities
  • Shape the vision for intelligent, autonomous data center operations from advisory recommendations to governed autonomy at fleet scale
  • Lead projects from problem framing through validated, deployment-ready solutions, translating ambiguous operational challenges into well-scoped research with clear success criteria
  • Develop RL-based control strategies that enable self-optimizing data center systems — improving thermal stability, energy efficiency, and operational reliability in transient conditions
  • Shape advanced control strategies into deployable solutions that align with Meta's system architecture, operational constraints, and deployment requirements
  • Establish validation frameworks and safety guardrails that build operational trust
  • Partner with internal engineering teams and external industrial control vendors to co-develop deployable advanced control solutions
  • Drive cross-functional alignment on methodology, adoption, and integration with Meta's system architecture, operational constraints, and fleet-scale deployment challenges
  • Represent advanced control capabilities to senior stakeholders, influencing investment and prioritization decisions
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Staff Research Scientist - VLM / VLA

At General Motors, our product teams are redefining mobility. Through a human-ce...
Location
Location
United States , Mountain View
Salary
Salary:
218800.00 - 335300.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ph.D. in Machine Learning, Robotics, Computer Science, Electrical Engineering, or a related technical field
  • 5+ years of experience in AI/ML research and applied development
  • Deep expertise in modern ML architectures (transformers, generative AI, multimodal systems)
  • Strong programming skills in Python
  • Excellent communication, collaboration, and mentoring abilities, comfortable influencing technical strategy and guiding ML excellence across the organization
Job Responsibility
Job Responsibility
  • Research, design, and prototype advanced Vision-Language Models and Vision-Language-Action foundational models tailored for real-time semantic understanding and behavioral prediction in autonomous driving
  • Drive the technical strategy for onboard model optimization, leading initiatives in model quantization, pruning, knowledge distillation, and compilation to ensure high-parameter models execute with ultra-low latency on vehicle edge hardware
  • Advance multimodal alignment techniques, ensuring seamless integration of camera, radar, LiDAR, and textual/logical prompts into unified foundational architectures
  • Influence technical roadmaps and shape strategic machine learning priorities that align with safety requirements, core product milestones, and next-generation vehicle launches
  • Provide technical mentorship and long-term vision to a multidisciplinary group of machine learning engineers, software developers, and hardware specialists
  • Foster internal innovation by collaborating closely with perception, planning, and infrastructure teams to integrate foundational models into the core autonomous software stack
  • Represent the company externally to the global scientific community by publishing original research, securing patents, and presenting at top-tier artificial intelligence and robotics conferences
What we offer
What we offer
  • Medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime
Read More
Arrow Right

Principal Applied Data Scientist - AI for Good Lab

The AI for Good Lab is hiring a Principal Applied Data Scientist to join our tea...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR equivalent experience
  • Deep foundation in AI, machine learning, statistics, or related quantitative methods applied to real-world problems
  • Experience working end-to-end with data-from sourcing and exploration through modeling, interpretation, and communication
  • Proficiency in at least one scientific programming language (Python, R or equivalent languages) and experience with SQL or similar query languages
  • Excellent written and verbal communication skills, with demonstrated experience communicating complex ideas clearly and persuasively to non-technical audiences
  • Proven ability to influence outcomes and lead work in cross-functional, matrixed environments
Job Responsibility
Job Responsibility
  • Lead and develop applied AI solutions (LLMs, Agents, Computer Vision) and data science solutions by identifying and gathering data, shaping problem formulations, applying AI, machine learning, and statistical methods, and generating insight with real-world impact
  • Use AI creatively as a research and solution-building tool, combining quantitative methods, experimentation, and domain knowledge to surface patterns, test ideas, and inform decisions
  • Rapidly prototype and validate approaches using modeling, statistics and experimentation
  • select methods under real-world constraints (cost/latency, safety, privacy, maintainability)
  • Design and build reliable, maintainable, end-to-end systems spanning data pipelines, model lifecycle, evaluation/telemetry, deployment, and operations
  • Advance the AI for Good Lab research agenda by authoring technical papers and presentation, published both internally and externally
  • Work in close partnership with other researchers and research organizations, as well as policy, industry, and nonprofit stakeholders, to co-create solutions
  • Present findings with clear and compelling narratives, using impactful visualizations and storytelling to articulate insights that drive understanding and action
  • Lead through influence by shaping technical direction and standards (model evaluation, responsible AI, safety/privacy, and monitoring), aligning collaborators, navigating tradeoffs, and sustaining momentum across teams and institutions
  • Fulltime
Read More
Arrow Right

Senior Applied Scientist, AI Security

Our mission is to make Uber the industry model for a secure and trustworthy AI e...
Location
Location
United States , Sunnyvale
Salary
Salary:
190000.00 - 211000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Ph.D., MS or Bachelors degree in Statistics, Economics, Operations Research, Computer Science, Engineering, or other quantitative field
  • If Ph.D or M.S. degree, a minimum of 2+ years of industry experience as an Applied Scientist or equivalent
  • Knowledge of underlying mathematical foundations of machine learning, statistics, optimization, economics, and analytics
  • Hands-on experience building and deploying ML models
  • Knowledge of experimental design and analysis
  • Experience with exploratory data analysis, statistical analysis and testing, and model development
  • Ability to use a language like Python or R to work efficiently at scale with large data sets
  • Proficiency in technologies in one or more of the following: SQL, Spark, Hadoop
Job Responsibility
Job Responsibility
  • Develop and evaluate large-scale machine learning model systems in production with a focus on adversarial robustness, input validation, and output sanitization
  • Propose, design, and analyze large-scale online experiments to test safety guardrails and detect ecosystem vulnerabilities
  • Define and implement metrics to measure security posture, attack surface reduction, and product performance
  • Present findings on AI risks, red-teaming results, and mitigations to business and executive audiences
  • Collaborate with engineers and product managers to implement secure-by-design ideas and plan future roadmaps
  • Optimize and secure retrieval-augmented generation (RAG) systems against prompt injection, indirect injection, and data exfiltration
  • Fine-tune large language models (LLMs) to improve safety alignment, resistance to jailbreaking, and operational efficiency
  • Implement secure agentic workflows to streamline processes, ensuring safe tool-use and authorization for both internal employee agents and external user agents
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • All full-time employees are eligible to participate in a 401(k) plan
  • Eligible for various benefits (see link)
  • Fulltime
Read More
Arrow Right

Language Research Scientist

We are seeking a technically skilled GenAI scientist to join our team focused on...
Location
Location
Switzerland , Zurich
Salary
Salary:
Not provided
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Artificial Intelligence, Generative AI, or a relevant technical field
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • Good programming skills in Python and familiarity with large-scale distributed training
  • Familiarity to learn new programming languages quickly
  • Can design, implement, and evaluate RL algorithms in production or research settings
  • Problem-solving, communication, and collaboration skills
Job Responsibility
Job Responsibility
  • Design, implement, and optimize LLM-based agents for a variety of applications, leveraging the latest advances in generative AI
  • Apply reinforcement learning algorithms to improve LLM performance, safety, and alignment
  • Integrate models and orchestrations in production
  • Collaborate with cross-functional teams (research, engineering, product) to deploy and evaluate LLM agents in real-world scenarios
  • Analyze and interpret experimental results, iterate on model architectures, and drive continuous improvement
  • Contribute to the broader AI/ML community at Meta through knowledge sharing, code reviews, and technical mentorship
  • Lead and contribute to research and development of post-training methods, including RLHF (Reinforcement Learning from Human Feedback), reward modeling, and other feedback-based approaches
Read More
Arrow Right