Researcher, Trustworthy AI

OpenAI

Location:
United States, San Francisco

Contract Type:
Not provided

Salary:
Not provided

Job Description:

The Trustworthy AI team works on action-relevant or decision-relevant research to ensure we shape A(G)I with societal impacts in mind. This includes work on full-stack policy problems, such as building methods for public input into model values and understanding the impacts of anthropomorphism of AI. We aim to translate nebulous policy problems into technically tractable and measurable ones. We then use this work to inform and build interventions that increase societal readiness for increasingly intelligent systems. Our team also works on external assurances for AI, with the aim of increasing independent checks and forming additional layers of validation.

Job Responsibility:

  • Set research and strategies to study societal impacts of models in an action-relevant manner and tie this back into model design
  • Build creative methods and run experiments that enable public input into model values
  • Increase rigor of external assurances by turning external findings into robust evaluations
  • Facilitate and grow the ability to de-risk flagship model deployments effectively and in a timely manner

Requirements:

  • 3+ years of research experience (industry or similar academic experience)
  • Proficiency in Python or similar languages
  • Experience with large-scale AI systems and multimodal datasets
  • Proficiency in AI safety topics like RLHF, adversarial training, robustness, LLM evaluations
  • Past experience in interdisciplinary research
  • Enthusiasm for socio-technical topics
  • Demonstrated passion for AI safety and making cutting-edge AI models safer for real-world use
  • Alignment with OpenAI’s charter and mission
What we offer:
  • Medical, dental, and vision insurance for you and your family with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents) plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays and multiple paid coordinated company office closures
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend
  • Daily meals in offices and meal delivery credits
  • Relocation support for eligible employees
  • Charitable donation matching
  • Wellness stipends
  • Performance-related bonus(es)
  • Equity

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Full-time
Work Type:
Hybrid work

Similar Jobs for Researcher, Trustworthy AI

ML Postdoc Researcher - LLMs & Generative AI

Truveta is the world’s first health provider led data platform with a vision of ...
Location:
United States, Seattle
Salary:
50.00 - 60.00 USD / Hour
Truveta
Expiration Date:
Until further notice
Requirements:
  • Ph.D. in Computer Science, Electrical Engineering, or a related field, with a focus on machine learning, natural language processing (NLP), Large Language Models (LLMs), multi-modal foundation models, and generative AI
  • Strong theoretical and practical background in NLP including experience with state-of-the-art architectures
  • Proficiency in deep learning frameworks (e.g., PyTorch, TensorFlow, etc.) and libraries commonly used in NLP and Generative AI
  • Solid programming skills in Python and the ability to write clean, efficient, and well-documented code
  • Excellent problem-solving and troubleshooting abilities, along with a strong analytical mindset and persistence in resolving problems
  • Strong communication skills and the ability to work effectively in a collaborative research environment
Job Responsibility:
  • Collaborate with researchers and engineers to design, develop, and refine large language models and generative models for various applications
  • Utilize your expertise in machine learning and natural language processing to develop novel algorithms and methodologies for generative modeling tasks
  • Implement, train, and fine-tune LLM and GPT-like models on large-scale datasets to ensure optimal performance and accuracy
  • Stay up to date with the latest research advancements and techniques in the field of language modeling, generative modeling, and machine learning
  • Deliver the next generation of innovation in trustworthy healthcare
What we offer:
  • Competitive compensation
  • Company-issued laptop and equipment
  • Opportunities for future full-time positions

PhD Student Human Factors Explainable AI

Volkswagen Group Innovation continuously strives to develop innovative solutions...
Location:
Germany, Wolfsburg
Salary:
Not provided
Volkswagen AG
Expiration Date:
Until further notice
Requirements:
  • Good to very good university degree qualifying for doctoral studies in the fields of human factors, psychology or comparable fields
  • In-depth experience with user studies, especially study design
  • Good knowledge of data analysis and statistics
  • English language level C1
  • German language level B1
  • Knowledge of AI and Explainable AI desirable
Job Responsibility:
  • Researching approaches to making AI models explainable
  • Developing potential usage scenarios of xAI in the vehicle
  • Empirically investigating variables such as usability, acceptance, trust and preference towards xAI in the vehicle
  • Assessing the opportunities and risks of xAI in the automotive industry
  • Deriving design features for xAI
What we offer:
  • Attractive salary and 30 vacation days (plus December 24 and December 31 off)
  • 35-hour week, flexible working hours, remote work
  • Special conditions for the purchase and leasing of vehicles
  • Free seminars on scientific work and interdisciplinary qualifications
  • Participation in the doctoral network for scientific exchange with science representatives and other doctoral candidates within the Volkswagen Group
Employment Type:
Full-time

Research Scientist, AI Controls and Monitoring

As a Research Scientist focused on AI Controls and Monitoring, you will design m...
Location:
United States, San Francisco; New York
Salary:
197400.00 - 246750.00 USD / Year
Scale
Expiration Date:
Until further notice
Requirements:
  • Commitment to mission of promoting safe, secure, and trustworthy AI deployments
  • Practical experience conducting technical research collaboratively
  • Comfort designing control and monitoring experiments for AI systems
  • Experience building prototype systems
  • Ability to turn research ideas into working prototypes
  • Track record of published research in machine learning, particularly generative AI
  • At least three years of experience addressing sophisticated ML problems
  • Strong written and verbal communication skills
Job Responsibility:
  • Design methods, systems, and experiments to ensure advanced AI models and agents remain aligned with intended goals
  • Develop monitoring techniques and observability methods to track AI behavior in real time
  • Research mechanisms for layered control, including fail-safes, oversight protocols, and intervention methods
  • Design red-team simulations to probe weaknesses in oversight and control mechanisms
  • Build mitigations to close identified gaps
  • Collaborate with policymakers, engineers, and other researchers to establish standards and benchmarks
What we offer:
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Possible commuter stipend
  • Equity based compensation
Employment Type:
Full-time

Research Scientist, Agent Robustness

As a Research Scientist working on Agent Robustness you will work on the fundame...
Location:
United States, San Francisco; New York
Salary:
197400.00 - 246750.00 USD / Year
Scale
Expiration Date:
Until further notice
Requirements:
  • Commitment to mission of promoting safe, secure, and trustworthy AI deployments
  • Practical experience conducting technical research collaboratively
  • Experience building and leveraging agent scaffolding, designing evaluation harnesses, and quickly turning new ideas into working prototypes
  • Experience with post-training and RL techniques such as RLHF, DPO, GRPO
  • A track record of published research in machine learning, particularly in generative AI
  • At least three years of experience addressing sophisticated ML problems
  • Strong written and verbal communication skills
Job Responsibility:
  • Research the science of AI agent capabilities with a focus on safety, risk factors, and benchmarking methodologies
  • Design and build harnesses to test AI agents’ tendency to take harmful actions
  • Design and build exploits and mitigations for new failure modes
  • Characterize and design mitigations for potential failure modes of systems involving multiple interacting AI agents
What we offer:
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Commuter stipend
  • Equity grant
Employment Type:
Full-time

Associate or Full Professor of Trustworthy AI in Cybersecurity

The School of Cybersecurity at Old Dominion University (ODU) invites applicants ...
Location:
United States, Norfolk
Salary:
Not provided
Old Dominion University
Expiration Date:
Until further notice
Requirements:
  • Ph.D. in Computer Science, Computer Engineering, or any field closely related to Cybersecurity
  • Ability to teach at the undergraduate and graduate levels
  • Ability to work in multidisciplinary settings
  • A strong research record in Trustworthy AI, demonstrated by peer-reviewed publications and a sustained stream of externally funded multidisciplinary research projects
Job Responsibility:
  • Establishing and leading interdisciplinary research in collaboration with existing ODU faculty and other members of the cluster
  • Teach at the undergraduate and graduate levels
Employment Type:
Full-time

Research Scientist, Frontier Risk Evaluations

As a Research Scientist focused on Frontier Risk Evaluations, you will design an...
Location:
United States, San Francisco; New York; Seattle
Salary:
216000.00 - 270000.00 USD / Year
Scale
Expiration Date:
Until further notice
Requirements:
  • Commitment to mission of promoting safe, secure, and trustworthy AI deployments
  • Practical experience conducting technical research collaboratively
  • Comfort building and instrumenting ML pipelines, writing evaluation harnesses, and turning research ideas into prototypes
  • Track record of published research in machine learning, particularly generative AI
  • At least three years of experience addressing sophisticated ML problems
  • Strong written and verbal communication skills
Job Responsibility:
  • Design and create evaluation measures, harnesses and datasets for measuring risks posed by frontier AI systems
  • Design and build harnesses to test AI models and systems for dangerous capabilities
  • Work with government agencies or other labs to collectively scope and design evaluations
  • Publish evaluation methodologies and write technical reports for policymakers
What we offer:
  • Comprehensive health, dental and vision coverage
  • Retirement benefits
  • Learning and development stipend
  • Generous PTO
  • Possible commuter stipend
  • Equity based compensation
Employment Type:
Full-time

Technical Program Manager, Trustworthy AI

The Trustworthy AI team is investing in external assurances to build out a robus...
Location:
United States, San Francisco
Salary:
207000.00 - 335000.00 USD / Year
OpenAI
Expiration Date:
Until further notice
Requirements:
  • Experience running strategic academic collaborations and program management with technical and research teams
  • Understanding of AI evaluations and measurements and ability to engage with technical teams on AI evaluations
  • Experience working with and managing stakeholders external to an organization, especially academic researchers
  • Ability to create executive summaries and synthesis of technical and social science research
  • Experience working cross functionally across product, research, and engineering teams
  • Understanding and interest in frontier AI safety and policy
Job Responsibility:
  • Create strategic research partnerships
  • Proactively identify new partners for external assurances such as high quality third party evaluators and academic research labs
  • Create feedback mechanisms for translating external research into actionable product and policy recommendations
  • Communicate progress, status and risk effectively to stakeholders internally and externally
  • Drive tool and process improvements to improve efficiency
What we offer:
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
Employment Type:
Full-time