CrawlJobs Logo

Research Intern - AI Safety & Reliability for LLM Systems

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , Redmond

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

6710.00 - 13270.00 USD / Month

Job Description:

This Research Internship focuses on improving the reliability and trustworthiness of artificial intelligence (AI) systems that support complex, real-world decision-making. The Research Intern will study how large language model (LLM)–based assistants behave when relevant information is incomplete or unevenly available and explore methods for detecting such gaps and adapting system responses accordingly. The work emphasizes uncertainty awareness, responsible reasoning, and robustness, contributing to safer and more dependable AI systems in enterprise settings.

Job Responsibility:

  • Research Interns put inquiry and theory into practice
  • Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life
  • Research Interns not only advance their own careers, but they also contribute to exciting research and development strides
  • During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community

Requirements:

  • Currently enrolled in a PhD program in computer science, machine learning, statistics, human-computer interaction or a related field
  • Proficiency in Python and experience with common ML and data processing libraries
  • Experience with large language models and/or retrieval-augmented generation (RAG) or related approaches
  • Prior research experience in machine learning, NLP, or human-centered AI, demonstrated through publications, preprints, or substantial projects suitable for peer-reviewed venues such as NeurIPS, ICML, FAccT, AIES, CHI, or CSCW
  • Proficient written and verbal communication skills for presenting and documenting research
  • Interest in AI reliability, robustness, safety, or responsible AI research

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Intern - AI Safety & Reliability for LLM Systems

Director of AI Engineering

We are entering a hyper-growth phase of AI innovation and are hiring a Director ...
Location
Location
Canada; United States
Salary
Salary:
300000.00 - 450000.00 USD / Year
apollo.io Logo
Apollo.io
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10–15+ years in software engineering, with significant leadership experience owning AI/ML or applied LLM systems at scale
  • Proven history shipping LLM-powered features, agentic workflows, or AI assistants used by real customers in production
  • Deep understanding of LLM orchestration frameworks (LangChain, LlamaIndex), RAG pipelines, vector search, embeddings, and prompt engineering
  • Expert in backend & distributed systems (Python strongly preferred) and cloud infrastructure (AWS/GCP)
  • Strong experience with telemetry, observability, and cost-aware real-time inference optimizations
  • Demonstrated ability to lead senior engineers, define technical roadmaps, and deliver outcomes aligned to business metrics
  • Experience building or scaling teams working on experimentation, optimization, personalization, or ML-powered growth systems
  • Exceptional ability to simplify complex problems, set clear standards, and drive alignment across Product, Data, Design, and Engineering
  • Strong product sense, ability to weigh novelty vs. impact, focus on user value, and prioritize speed with guardrails
  • Fluent in integrating AI tools into engineering workflows for code generation, debugging, delivery velocity, and operational efficiency
Job Responsibility
Job Responsibility
  • Define the multi-year technical vision for Apollo’s AI stack, spanning agents, orchestration, inference, retrieval, and platformization
  • Prioritize high-impact AI investments by partnering with Product, Design, Research, and Data leaders to align engineering outcomes with business goals
  • Establish technical standards, evaluation criteria, and success metrics for every AI-powered feature shipped
  • Lead the architecture and deployment of long-horizon autonomous agents, multi-agent workflows, and API-driven orchestration frameworks
  • Build reusable, scalable agentic components that power GTM workflows like research, enrichment, sequencing, lead scoring, routing, and personalization
  • Own the evolution of Apollo’s internal LLM platform for high-scale, low-latency, cost-optimized inference
  • Oversee model-driven experiences for natural-language interfaces, RAG pipelines, semantic search, personalized recommendations, and email intelligence
  • Partner with Product & Design to build intuitive conversational UX that hides underlying complexity while elevating user productivity
  • Implement rigorous evaluation frameworks, including offline benchmarking, human-in-the-loop review, and online A/B experimentation
  • Ensure robust observability, monitoring, and safety guardrails for all AI systems in production
What we offer
What we offer
  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • At least 10 paid holidays per year
  • Flex PTO
  • Parental leave
  • Employee assistance program and wellbeing benefits
  • Global travel coverage
  • Life/AD&D/STD/LTD insurance
  • FSA/HSA
  • Fulltime
Read More
Arrow Right
New

Artificial Intelligence Engineer

PMGTech leads PMG’s AI strategy by building an AI-ready research environment, un...
Location
Location
India , Mumbai
Salary
Salary:
Not provided
blackrock.com Logo
BlackRock Investments
Expiration Date
February 27, 2026
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Data Science, AI/ML, or equivalent
  • 6+ years of work experience delivering ML, AI, and data-intensive systems
  • Hands-on experience building and deploying AI systems end-to-end - including LLM workflows, prompt engineering, RAG pipelines, entity extraction, embeddings/vector search, text2sql, fine-tuning, evaluation, and backend integration using Python and SQL
  • Strong written and oral communication skills and ability to work directly with investors and senior partners is a must
Job Responsibility
Job Responsibility
  • Design architectures for AI-powered research applications leveraging Generative AI capabilities (RAG, agentic workflows, search, model fine-tuning)
  • Partner with investors to translate open-ended research questions into feasible AI-driven product concepts
  • Be hands-on in leading the lifecycle from POC to MVP to production for AI applications, including data pipelines and backend integration
  • Own the end-to-end user experience of investor-facing research apps, including intuitive front-end UI designs
  • Evaluate emerging models and APIs
  • define best practices for prompts, safety, reliability, and testing with internal and external tech teams
  • Leverage enterprise data engines, orchestration frameworks, and secure/observable production practices with Engineering Hub and Platforms
  • Implement monitoring, observability, evaluation frameworks, and data-quality safeguards for GenAI-powered research applications
What we offer
What we offer
  • strong retirement plan
  • tuition reimbursement
  • comprehensive healthcare
  • support for working parents
  • Flexible Time Off (FTO)
  • Fulltime
!
Read More
Arrow Right

Manager, Machine Learning - Community Support Engineering

The Community Support Platform (CSP) at Airbnb is a critical system that drives ...
Location
Location
United States
Salary
Salary:
204000.00 - 255000.00 USD / Year
airbnb.com Logo
Airbnb
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expertise in various machine learning and AI methodologies, including LLMs and non-LLMs, tailored for user-facing products
  • Proven experience in leading teams that develop large-scale ML models and systems to improve online user experiences
  • Strong leadership skills with a track record of nurturing an innovative and collaborative team environment
  • Exceptional verbal and written communication abilities, with a keen eye for detail
  • Demonstrated capability to work effectively with stakeholders at all organizational levels, both internally and externally
  • Skilled in navigating and resolving ambiguous challenges through proactive and strategic approaches
  • PhD, or Master's degree in Computer Science, Mathematics, Statistics, or related technical field
  • 10+ years of experience in building and shipping AI models and products, including 2+ years of experience with LLMs
  • 5+ years managing machine learning teams that deliver large impact
  • Expert knowledge of machine learning algorithms and techniques
Job Responsibility
Job Responsibility
  • Lead and mentor a dynamic team of highly skilled applied scientists and machine learning engineers in the research, design and optimization of AI models and services
  • Develop and refine the overarching strategy for the ML and AI aspects of our community support products, focusing on scalability, quality, safety, performance, and reliability
  • Foster rapid development cycles without sacrificing quality, collaborating closely with platform, backend, and frontend engineers to engineer robust ML models and systems that enhance community support initiatives
  • Evaluate technical trade-offs in key decisions, ensuring optimal outcomes through data-backed strategies
  • Conduct thorough design and architecture reviews to continually elevate our standards of technical excellence
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Employee Travel Credits
  • Fulltime
Read More
Arrow Right
New

Principal Research Engineer

As a Principal Research Engineer at Microsoft, you will set the technical vision...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • PhD in AI/ML or related field with top-venue publications and/or patents
  • Experience architecting and deploying LLMs/multimodal models and multi-agent systems in production at scale
  • Familiarity with Responsible AI frameworks and bias-mitigation techniques
  • Demonstrated ability to shape product strategy and drive organizational change
  • Experience with Microsoft’s LLMOps stack: Azure AI Foundry, Azure Machine L
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Define and execute technical strategy for foundational models, multi-agent systems, and next-generation Copilot experiences, especially within Business & Industry Copilot
  • Lead cross-team efforts to deliver scalable, reliable, and responsible AI systems
  • Advance the state of the art and translate breakthroughs into measurable customer and business impact
  • Architect and deliver complex AI systems across model development, data, infra, evaluation, and deployment spanning multiple product lines
  • Set technical direction for large programs
  • drive alignment across Research, Engineering, and Product
  • Integrate LLMs, multimodal models, multi-agent architectures, and RAG into Microsoft’s ecosystem
  • Establish best practices for MLOps, governance, and Responsible AI, compliant with Microsoft principles and industry standards
  • Drive original research and thought leadership (whitepapers, internal notes, patents)
  • convert insights into shipped capabilities
  • Fulltime
Read More
Arrow Right

AI / ML Engineer, Software Engineering

iCapital is seeking an experienced and forward-thinking AI/ML Engineer Vice Pres...
Location
Location
United States , New York
Salary
Salary:
180000.00 - 220000.00 USD / Year
icapital.com Logo
iCapital Network
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in software engineering, with at least 2+ years focused on AI/ML systems
  • Proven experience in building and deploying ML models in production environments
  • Hands-on experience with AI agent frameworks (e.g., LangChain, Semantic Kernel, AutoGen, or custom-built systems)
  • Strong understanding of the ML lifecycle, including data pipelines, model training, evaluation, deployment, and monitoring
  • Familiar with MLOps tools such as MLflow, Kubeflow, or SageMaker
  • Deep understanding of LLM orchestration, prompt engineering, tool use, and memory architectures
  • Familiar with various LLM inference engines such as vLLM or SGLang
  • Experience in integrating agents with APIs, databases, and external systems
  • Familiar with retrieval-augmented generation (RAG), vector databases, and knowledge graphs
  • Experience deploying AI systems in cloud environments (AWS, GCP, Azure) and utilizing containerization tools (Docker, Kubernetes)
Job Responsibility
Job Responsibility
  • Design, build, and optimize scalable AI/ML infrastructure and services powering intelligent features across our platform
  • Lead the development of AI agents capable of autonomous decision-making, task execution, and multi-step reasoning across internal and customer-facing applications
  • Architect and implement modular agent frameworks by integrating tools, APIs, and memory systems for dynamic and context-aware behavior
  • Collaborate with product, data, and infrastructure teams to embed AI capabilities into production systems
  • Drive the architecture and development of ML pipelines, model serving frameworks, and real-time inference systems
  • Evaluate and integrate state-of-the-art AI tools and frameworks to accelerate development and deployment
  • Provide technical mentorship and guidance to engineers, contributing to team growth and best practices
  • Partner with Data Science teams to operationalize models, ensuring a smooth transition from experimentation to production
  • Contribute to technical roadmaps and help define long-term AI/ML platform and agent strategy
  • Optimize agent performance for latency, reliability, and safety in production environments
What we offer
What we offer
  • Equity for all full-time employees
  • Annual performance bonus
  • Employer matched retirement plan
  • Generously subsidized healthcare with 100% employer paid dental, vision, telemedicine, and virtual mental health counseling
  • Parental leave
  • Unlimited paid time off (PTO)
  • Fulltime
Read More
Arrow Right
New

Principal Research Engineer

As a Principal Research Engineer at Microsoft, you will set the technical vision...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Define and execute technical strategy for foundational models, multi-agent systems, and next-generation Copilot experiences, especially within Business & Industry Copilot
  • Lead cross-team efforts to deliver scalable, reliable, and responsible AI systems
  • Advance the state of the art and translate breakthroughs into measurable customer and business impact
  • Architect and deliver complex AI systems across model development, data, infra, evaluation, and deployment spanning multiple product lines
  • Set technical direction for large programs
  • drive alignment across Research, Engineering, and Product
  • Integrate LLMs, multimodal models, multi-agent architectures, and RAG into Microsoft’s ecosystem
  • Establish best practices for MLOps, governance, and Responsible AI, compliant with Microsoft principles and industry standards
  • Drive original research and thought leadership (whitepapers, internal notes, patents)
  • convert insights into shipped capabilities
  • Fulltime
Read More
Arrow Right

Senior Applied AI Engineer

We’re hiring a Senior Applied AI Engineer to join a fast‑moving, high‑ownership ...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master’s Degree AND 3+ years of experience in engineering, problem solving, model building, evaluation, data analysis OR equivalent experience
  • 2+ years shipping production-level code, models, or data analysis
  • 1+ years using AI-assisted coding and analysis techniques
  • Experience working on small teams and mid-stage startup environments
  • Experience working on AI products
  • PhD in engineering, applied math, statistics, or related analytical field
  • 4+ years shipping production-level code, models, or data analysis
  • Deep experience building from zero-to-one
  • Hands on work hillclimbing AI evaluations
Job Responsibility
Job Responsibility
  • Design and ship LLM‑powered assistant features, including conversational flows, agentic behaviors, retrieval pipelines, and multimodal interactions
  • Build prompt architectures, system instructions, and orchestration logic that ensure reliability, grounding, and personality consistency
  • Prototype new capabilities rapidly and iterate based on user signals and evaluation data
  • Build and maintain evaluation frameworks for correctness, safety, grounding, and UX quality
  • Run hillclimbing loops across prompts, models, and tool‑use strategies to continuously improve assistant performance
  • Analyze failure modes, design mitigations, and drive systematic improvements across the stack
  • Develop internal tools for prompt experimentation, model comparison telemetry and debugging automated eval pipelines
  • Create reusable frameworks that accelerate the entire AI org’s ability to ship high‑quality assistant features
  • Integrate LLMs with product surfaces, APIs, and backend systems
  • Build lightweight ML components (ranking, classification, summarization, personalization) that enhance assistant intelligence
  • Fulltime
Read More
Arrow Right
New

Product Analyst — GenAI/Agentic Insights

In support of Workato’s broader push toward becoming a wall‑to‑wall agentic comp...
Location
Location
Singapore
Salary
Salary:
Not provided
workato.com Logo
Workato
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong AI use encouraged: you’re excited to use AI tools to move faster (while staying rigorous about correctness, privacy, and safety)
  • Modern analytics skillset: strong SQL + data wrangling
  • comfortable translating messy questions into clean analysis
  • Text/qual comfort: excited to analyze and structure qualitative data (transcripts, notes, open‑ended feedback) alongside quantitative signals
  • LLM + agentic systems fundamentals: familiarity with prompting, structured outputs, tool/function calling, retrieval/RAG, and basic evaluation/guardrails
  • Builder mindset: you ship usable things: prototypes, automations, scripts, lightweight internal tools
  • not just slides
  • Product sense: you can clearly articulate tradeoffs, propose practical workflows, and align outputs to stakeholder decisions
  • Clear communication: you can make complex findings and systems legible through crisp writing, visualizations, docs, and demos
  • Graduating senior with a relevant BS/MS degree (CS, Data Science, HCI, Design Engineering, or related)
Job Responsibility
Job Responsibility
  • Multimodal analysis for product & design decisions (quant + qual): Work with qualitative and quantitative data sources to consolidate product signals across channels
  • Partner closely with Product Researchers, Designers, and Product Managers to define schemas and pipelines that make qualitative and quantitative signals joinable
  • Run analyses that inform product and design decisions: trend analysis, segmentation, lightweight experimentation readouts, measurement strategy, and narrative synthesis
  • Use LLMs to help turn messy qualitative data into structured, explainable representations
  • Build small tools/dashboards that keep decisions data‑informed
  • Help teams frame hypotheses, interpret results, and connect insights to product/design actions across Workato’s product offerings
  • Agentic insight delivery (insights that actually land): Build and iterate on a User Insight Agent that supports everyday product decisions
  • Create stakeholder‑ready outputs with traceable evidence, tailored to the decision being made
  • Design for trust and reliability: transparency, citations to source evidence, confidence/uncertainty signals, rigorous evaluation/guardrails, and human‑in‑the‑loop controls
  • Implement evaluation + observability so the system improves over time
What we offer
What we offer
  • A front‑row seat and real ownership on what “next‑gen analytics” looks like in an agentic company
  • Mentorship across Product Research, Design, PM, Engineering, and AI Lab teams
  • The chance to ship state-of-the-art Agentic products—both internally, and externally to customers—that influence real product decisions and help shape how insights flow through the org
  • Hands‑on experience building, evaluating, and hardening LLM/agent workflows for real stakeholders and real decisions
  • A stronger portfolio of practical work artifacts (insight narratives, lightweight tools, automations, evaluation setups) you can talk about
  • A flexible, trust-oriented culture
  • A vibrant and dynamic work environment
  • A multitude of benefits they can enjoy inside and outside of their work lives
  • Fulltime
Read More
Arrow Right