Research Intern - AI Evaluation and Alignment

United States, Redmond · Employment contract · 6710.00 - 13270.00 USD / Month · Job Posted February 04, 2026

Job Description

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment. The Microsoft Research and Copilot Studio teams are seeking Research Interns to help advance the quality, reliability, and evaluation of Large Language Model (LLM)-based systems. Research Interns will collaborate with applied scientists and engineers to explore new machine learning methods that improve how Artificial Intelligence (AI) systems assess and align with human expectations.

Job Responsibility

  • Co-developing a research project in collaboration with the supervisor and research mentors
  • Designing and implementing machine learning approaches, including training and fine-tuning using real-world datasets
  • Developing evaluation frameworks and benchmarking methods to assess model quality, robustness, and generalization
  • Presenting and communicating research findings

Requirements

  • Currently enrolled in a PhD program in Statistics, Computer Science, Physics, Operations Research, or a related technical field
  • At least 1 year of hands-on experience working on LLM-related projects (e.g., prompt engineering, building and evaluating LLM-based systems, reward modeling, etc.)
  • At least 1 year of experience coding in Python
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • Submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples
  • Submit a list of projects you worked on in the last 2 years, giving for each project: the start and end dates, a brief overview of what the project is about, what you did on the project, and what technologies you used

Nice to have

  • Prior experience in reward models for large language models or LLM-as-a-Judge
  • Strong experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with software engineering best practices (e.g., Git)
  • Experience with LLM post-training and evaluation or LLM-based judge systems
  • Research experience demonstrated through publications or projects
  • Ability to work independently in ambiguous or rapidly evolving situations and collaborate effectively across disciplines

Similar Jobs for Research Intern - AI Evaluation and Alignment

8 matching positions

PhD AI Research Intern

Join our cutting-edge Machine Learning Research team at Atlassian as a PhD Resea...
Location: Canada
Salary: 55.00 USD / Hour
Atlassian (https://www.atlassian.com)
Expiration Date: Until further notice
Requirements
  • Completed Bachelors degree in Computer Science or a related field
  • Currently pursuing a PhD in Computer Science or a related field at any stage of your doctoral studies
  • Strong foundation in AI/ML, LLMs, modeling and/or optimization techniques
Job Responsibility
  • Collaborate cross-functionally with Research Scientists and Machine Learning Engineers to design, implement, and evaluate experiments that advance the performance, efficiency, and scalability of modern ML and LLM systems for our AI products
  • Curate, preprocess, and manage large-scale datasets for training and evaluation, ensuring data quality, diversity, and reproducibility across experiments
  • Conduct continued training, fine-tuning, and alignment of large language models for specialized applications such as conversational AI, summarization, generative search, and multimodal agents
  • Evaluate cutting-edge ML algorithms through rigorous experimentation and provide detailed analyses highlighting performance insights, failure modes, and opportunities for improvement
  • Contribute to publications and presentations at internal workshops or top-tier academic venues, helping to drive innovation in Enterprise AI and large-scale ML systems
What we offer
  • health and wellbeing resources
  • paid volunteer days

PhD AI Research Intern

Join our cutting-edge Machine Learning Research team at Atlassian as a PhD Resea...
Location: United States, Seattle
Salary: 49.00 - 75.00 USD / Hour
Atlassian (https://www.atlassian.com)
Expiration Date: Until further notice
Requirements
  • Completed Bachelors degree in Computer Science or a related field
  • Currently pursuing a PhD in Computer Science or a related field at any stage of your doctoral studies
  • Degree completion date cannot be earlier than September 2026 - June 2027
  • Strong foundation in AI/ML, LLMs, modeling and/or optimization techniques
  • Exhibit a solid grasp of algorithms and data structures
  • Demonstrate proficiency in Python programming and ability to write clean, efficient, and well-documented code
  • Experience working with large-scale datasets, including data preprocessing, augmentation, and scaling techniques
  • Expertise in managing data using Python libraries such as NumPy, Pandas, and Matplotlib, experience leveraging models from Hugging Face, and practical knowledge of applied machine learning and deep learning frameworks such as PyTorch
  • Demonstrated exposure to natural language processing (NLP) and Computer Vision (CV)
  • Familiarity with state-of-the-art research in machine learning and AI, as evidenced by relevant coursework, publications, or projects
Job Responsibility
  • Collaborate cross-functionally with Research Scientists and Machine Learning Engineers to design, implement, and evaluate experiments that advance the performance, efficiency, and scalability of modern ML and LLM systems for our AI products
  • Curate, preprocess, and manage large-scale datasets for training and evaluation, ensuring data quality, diversity, and reproducibility across experiments
  • Conduct continued training, fine-tuning, and alignment of large language models for specialized applications such as conversational AI, summarization, generative search, and multimodal agents
  • Evaluate cutting-edge ML algorithms through rigorous experimentation and provide detailed analyses highlighting performance insights, failure modes, and opportunities for improvement
  • Contribute to publications and presentations at internal workshops or top-tier academic venues, helping to drive innovation in Enterprise AI and large-scale ML systems
What we offer
  • health and wellbeing resources
  • paid volunteer days

Director, Digital Ecosystem Applications

This position is responsible for the Software Platforms group at the Innovation ...
Location: United States, Belmont
Salary: 240000.00 - 285000.00 USD / Year
Volkswagen AG (https://www.volkswagen-group.com)
Expiration Date: Until further notice
Requirements
  • 10+ years with 2+ years in a technical leadership role
  • CS, EE, M.S. Engineering (or equivalent) REQUIRED
  • M.S. Engineering (or equivalent) or PhD PREFERRED
  • Analytical and conceptual thinking – using logic and reason, creative and strategic
  • Communication skills – interpersonal, presentation and written
  • Managing interdisciplinary teams on individual projects
  • Integration – joining people, processes or systems
  • Influencing and negotiation skills
  • Problem solving
  • Resource management
Job Responsibility
  • Define the technical mission, architecture strategy, and long‑term platform vision for the In‑Vehicle Computing & Digital Ecosystem Applications team, spanning Android Automotive OS (AAOS), in‑vehicle compute platforms, Software‑Defined Vehicle (SDV) architecture, and AI‑driven cockpit intelligence
  • Provide technical leadership across the full software stack, including Android Framework, System Services, HAL layers, middleware, connectivity stacks, media/audio frameworks, HMI toolchains, and cloud‑connected AI runtimes within an SDV‑aligned architecture
  • Lead and mentor engineering teams in platform bring‑up, system integration, performance optimization, and development of AI‑agentic features, multimodal interaction models, and next‑generation speech technologies
  • Manage multi‑year budgets for platform development, AI integration, SDV‑aligned compute evolution, SoC evaluations, cloud services, and prototype programs
  • Deliver executive‑level technical reporting on architecture decisions, platform readiness, SDV integration milestones, AI progress, risks, and strategic recommendations
  • Drive strategic planning for ICC’s infotainment and cockpit portfolio, including AAOS evolution, hybrid cloud/edge AI pipelines, intelligent mobile agent technologies, and SDV‑centric software and compute roadmaps
  • Align technical roadmaps with global VW Group Innovation teams across infotainment, connectivity, AI/ML, vehicle architecture, cloud services, and SDV platform strategy, ensuring cross‑platform consistency and shared component reuse
  • Build strategic relationships with SoC vendors, Tier‑1 suppliers, cloud providers, and AI technology partners to influence cockpit compute and SDV platform evolution
  • Maintain partnerships with Silicon Valley companies specializing in AI runtimes, LLMs, speech, multimodal interaction, and automotive‑grade SDV‑compatible software frameworks
  • Collaborate with academic and research institutions on AI‑agentic systems, embedded ML, HMI, and in‑vehicle compute architectures aligned with SDV principles
What we offer
  • Eligibility for annual performance bonus
  • Healthcare benefits
  • 401(k), with company match
  • Defined contribution retirement program
  • Tuition reimbursement
  • Company lease car program
  • Paid time off
  • Full-time

AI Program Manager, Performance

Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologie...
Location: United States, New York, NY
Salary: 258950.00 - 294800.00 USD / Year
Meta (meta.com)
Expiration Date: Until further notice
Requirements
  • Requires a Bachelor’s degree (or foreign equivalent) in Human Resources, Industrial/Organizational Psychology or a related field and 8 years of progressive, post-baccalaureate work experience in organizational behavior, human resources program management, product development, or a related occupation
  • 4 years of experience in: Leading cross-functional teams of human resources professionals, engineers, researchers, and business leaders to design and implement HR programs or products
  • Designing HR program guidelines, workflows, and prototypes that support HR or organizational performance initiatives
  • Developing and implementing compliance frameworks for data privacy, labor, and employment law in program or product design
  • Influencing senior stakeholders and cross-functional partners to adopt HR solutions and drive organizational change
  • Building long-term HR programs centered in technology to inform short- and medium-term roadmaps
  • 1 year of experience in: AI program design and cross-functional AI product development
  • Building AI-driven solutions for human resources processes, such as performance evaluation, coaching, or workforce management systems
  • Customizing large language models using organizational behavior principles for human resources management, such as feedback writing or employee evaluations
Job Responsibility
  • Responsible for technical AI product development, integrating behavioral science into automated performance management systems
  • Various product development responsibilities such as designing, testing, and deploying AI-driven tools that generate recommendations and insights for employee evaluation
  • Apply research methods to evaluate and improve AI-enabled evaluation systems
  • Coordinate and lead AI development projects across large cross-functional and highly technical teams including engineers, behavioral scientists, and legal experts to develop AI solutions
  • Develop and refine internal AI models using behavioral sciences for HR use
  • Equip managers and employees to use AI-supported tools to assess performance fairly
  • Lead communication and adoption of AI-enabled performance processes
  • Align multifunctional teams and senior stakeholders to deliver AI solutions for performance management in a 12-month horizon
  • Continuously measure, assess, and evolve AI in performance systems
What we offer
  • bonus
  • equity
  • benefits

Sr Staff Mixed Methods User Researcher

Location: Canada; United States
Salary: Not provided
Mozilla (mozilla.org)
Expiration Date: Until further notice
Requirements
  • 8+ years of applied research work experience (or equivalent), including strategic research leadership in complex organizations, with at least 6 years in the tech/SaaS industry
  • Extensive experience leading both evaluative and generative research across multiple phases of the product development lifecycle
  • Demonstrated success operating in fast-paced, agile environments, adapting to shifting priorities while maintaining research impact
  • Deep expertise in qualitative methods (e.g., interviews, concept testing, usability studies) and practical experience with quantitative approaches (e.g., survey design, data analysis)
  • Exceptional judgment in setting research priorities at the program level, making trade-offs across multiple concurrent workstreams, and advising other researchers on method, scope, and strategic framing
  • Deep, current knowledge of AI tools used in research workflows, with the expertise to evaluate their outputs critically, define organizational standards for their use, and guide the team in applying them responsibly and effectively
  • Recognized as an internal expert in research methodology, with the ability to introduce novel approaches, advance the field, and advise others where standard methods fall short
  • Collaborative and influential communicator, skilled at building partnerships with cross-functional teams and driving alignment through evidence-based insights
  • Experienced in supporting iterative, user-centered product development, ensuring research continuously informs design and strategy
  • Commitment to our values: Welcoming differences
Job Responsibility
  • Lead the most complex, ambiguous, and highest-stakes research studies, setting the standard for research excellence and directly shaping product vision at the organisational level
  • Drive user-centered decision-making at the executive level, influencing directors and VPs and ensuring user needs are embedded in the long-term product and business strategy
  • Partner cross-functionally at a strategic level with product, design, engineering, data science, and marketing to identify high-impact research opportunities and align insights to business priorities
  • Integrate diverse data sources — qualitative and quantitative research, behavioral data, and market intelligence — to build cohesive, evidence-based perspectives on user experience
  • Leverage AI tools thoughtfully across the research workflow while applying critical review to ensure outputs are accurate
  • Define standards for responsible AI use in research, establishing frameworks that the broader team adopts and that position Mozilla as a leader in ethical AI-assisted research practice
  • Elevate the research practice by setting the methodological bar, identifying capability gaps across the team, and mentoring Staff researchers and below
What we offer
  • Generous performance-based bonus plans to all eligible employees
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting (regardless of whether you contribute)
  • Quarterly all-company wellness days where everyone takes a pause together
  • Country specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
  • Full-time

Senior Research Engineer

As a Senior Research Engineer at Microsoft, you will advance Microsoft’s mission...
Location: United States, Redmond
Salary: 119800.00 - 234700.00 USD / Year
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Engineering, Mathematics, Statistics, Physics, or a related field and 4 or more years in applied ML or AI research and product engineering
  • OR Master’s degree and 3 or more years in applied ML or AI research and product engineering
  • OR PhD in a relevant field and 2 or more years with generative AI, LLMs, or related ML algorithms
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
  • Bringing State-of-the-Art Research to Products
  • Design and implement AI systems using foundation models, prompt engineering, retrieval-augmented generation, multi-agent architectures, and classic ML
  • Fine-tune large language models on domain-specific data and evaluate via offline and online methods such as A/B testing, telemetry, and shadow deployments
  • Build and harden prototypes into production-ready services using robust software engineering and MLOps practices
  • Drive original research and thought leadership (whitepapers, internal notes, patents); convert insights into shipped capabilities
  • Research Translation: Continuously review emerging work; identify high-potential methods and adapt them to Microsoft problem spaces
  • End-to-End System Development
  • ML Design & Architecture: Own end-to-end pipeline from data prep, training, evaluation, deployment, and feedback loops
  • Full-time

Senior Software Engineer

As a Senior Research Engineer at Microsoft, you will advance Microsoft’s mission...
Location: India, Hyderabad
Salary: Not provided
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Engineering, Mathematics, Statistics, Physics, or a related field and 4 or more years in applied ML or AI research and product engineering
  • OR Master’s degree and 3 or more years in applied ML or AI research and product engineering
  • OR PhD in a relevant field and 2 or more years with generative AI, LLMs, or related ML algorithms
  • Proficiency in Python and at least one deep learning framework such as PyTorch, JAX, or TensorFlow
  • Experience deploying fine-tuned LLMs or multimodal models in live production environments
  • Experience shipping and maintaining production AI systems
  • Ability to meet Microsoft, customer, and government security screening requirements
  • Microsoft Cloud Background Check upon hire or transfer and every two years thereafter
Job Responsibility
  • Bringing State-of-the-Art Research to Products
  • Design and implement AI systems using foundation models, prompt engineering, retrieval-augmented generation, multi-agent architectures, and classic ML
  • Fine-tune large language models on domain-specific data and evaluate via offline and online methods such as A/B testing, telemetry, and shadow deployments
  • Build and harden prototypes into production-ready services using robust software engineering and MLOps practices
  • Drive original research and thought leadership (whitepapers, internal notes, patents); convert insights into shipped capabilities
  • Research Translation: Continuously review emerging work; identify high-potential methods and adapt them to Microsoft problem spaces
  • End-to-End System Development
  • ML Design & Architecture: Own end-to-end pipeline from data prep, training, evaluation, deployment, and feedback loops
  • Full-time

Principal Product Manager

We are looking for a deeply technical and forward thinking Principal Product Man...
Location: United States, Redmond
Salary: 139900.00 - 274800.00 USD / Year
Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree AND 8+ years experience in product/service/program management or software development OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • Bachelor's Degree AND 12+ years experience in product/service/program management or software development OR equivalent experience
  • 4+ years experience taking a product, feature, or experience to market (e.g., design, addressing product market fit, and launch, internal tool/framework)
  • 6+ years experience improving product metrics for a product, feature, or experience in a market (e.g., growing customer base, expanding customer usage, avoiding customer churn)
  • 6+ years experience disrupting a market for a product, feature, or experience (e.g., competitive disruption, taking the place of an established competing product)
  • Demonstrated technical depth across LLMs and line of business systems, with proven experience leading AI/LLM evaluation strategy—including offline/online eval frameworks, rubric and AI judge design, and defining measurable quality bars for agentic tools and orchestration workflows
  • Cross-functional collaboration skills, with the ability to influence across engineering, research, design and business teams
  • Exceptional written and verbal communication skills, with a knack for storytelling and clear articulation of complex ideas
Job Responsibility
  • Define and own the evaluation strategy for all 1P and 3P agentic tools (such as MCP servers, skills, etc.), including tool invocation success, tool quality, trajectory evaluation, intent detection, and scenario‑level scoring
  • Develop a unified framework covering offline evals, online evals, AI‑judge‑based evals, and assertion‑based rubric design
  • Partner with engineering to evolve internal platforms like Agent 365 Evals, Agent Arena, dashboards, CI/CD‑integrated nightly evals, and metrics pipelines
  • Create grading frameworks, mapping strategies, and ground truth generation mechanisms, including automation for user‑intent derivation
  • Establish cross‑model, cross‑orchestrator eval infrastructure, i.e., ensure agentic tools work reliably across all major LLMs and orchestrators
  • Design and maintain evaluation suites that capture model regressions, tool invocation drift, and scenario fidelity as products evolve
  • Drive alignment with internal partners and ISV teams to ensure consistent evaluation approaches, shared pipelines, and consolidated quality dashboards
  • Define product readiness criteria for 1P/3P tools, aligning certification requirements for partner‑built agentic tools
  • Partner with responsible AI, security, governance, and compliance teams to ensure eval frameworks respect enterprise boundaries and safety constraints
  • Track the latest developments in multi‑agent evaluation frameworks, trajectory alignment research, and AI behavioral evals
  • Full-time