Research Intern - Training Methods for LLM Efficiency

Microsoft Corporation

Location:
United States, Mountain View

Contract Type:
Employment contract

Salary:

6710.00 - 13270.00 USD / Month

Job Description:

This Research Internship will design training algorithms and apply them to improving the quality/efficiency trade-offs of large language models, with a focus on resource-constrained environments. Possible directions of investigation include: designing new algorithms for quantized model fine-tuning; leveraging training to improve the token efficiency of reasoning models; proposing and implementing systems optimizations to scale training under resource constraints.

Job Responsibility:

  • Research Interns put inquiry and theory into practice
  • Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life
  • Research Interns not only advance their own careers, but they also contribute to exciting research and development strides
  • During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community

Requirements:

  • Currently enrolled in a PhD program in Computer Science or a related field
  • At least 1 year of experience working on AI/Machine Learning
  • Hands-on experience with ML tools and frameworks such as PyTorch
  • Experience training and evaluating models
  • Publication track record in ML conferences
  • Ability to collaborate effectively with other researchers and product teams

Nice to have:

  • Hands-on experience with ML tools and frameworks such as PyTorch
  • Experience training and evaluating models
  • Publication track record in ML conferences
  • Ability to collaborate effectively with other researchers and product teams

Additional Information:

Job Posted:
April 23, 2026

Employment Type:
Full-time
Work Type:
Hybrid work

Similar Jobs for Research Intern - Training Methods for LLM Efficiency

Member of Technical Staff, Research

As a Member of Technical Staff on the Research team, you’ll push the boundaries ...
Location:
United States, San Mateo
Salary:
175000.00 - 240000.00 USD / Year
Fireworks AI
Expiration Date:
Until further notice
Requirements:
  • Research background in Artificial Intelligence, Machine Learning, Physics, or similar field
  • Experience solving analytical problems using analytic and quantitative approaches
  • Experience communicating research to audiences with different backgrounds
  • Experience coding in C/C++, Python, or other similar languages
Job Responsibility:
  • Conduct foundational research to advance the capabilities, efficiency, and reliability of LLMs and multimodal systems
  • Design, implement, and evaluate novel model architectures, training methods, and optimization techniques
  • Collaborate with engineering teams to transition research prototypes into production-grade systems
  • Analyze empirical results, identify performance bottlenecks, and iterate quickly to improve model quality
  • Contribute to internal research strategy by identifying high-impact opportunities and emerging trends in AI
What we offer:
  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package
Employment Type:
Full-time

Senior UX Researcher

Join Handshake as a Senior UX Researcher to shape the future of human data in th...
Location:
United States, San Francisco
Salary:
175000.00 - 220000.00 USD / Year
EdTech Jobs
Expiration Date:
Until further notice
Requirements:
  • Strong understanding of AI systems, LLM development workflows, and the role of human-in-the-loop data in model training and evaluation
  • Demonstrated experience leading UX research for complex, high-impact platforms—ideally across both consumer-facing and internal operations tools
  • Proven ability to deliver fast, high impact insights that accelerate product development and drive measurable impact
  • Experience informing content strategy, design, and workflows through research
  • Experience conducting research that directly improves overall services delivered to external users and operational workflows
  • Deep understanding of usability and design principles, and how UX research directly improves system clarity and efficiency
  • Experience using lean research methods—delivering insights in days, not weeks
  • Ability to synthesize qualitative and quantitative inputs, including product analytics, to tell a compelling user story
  • Highly collaborative, with a history of effective cross-functional work across product, design, engineering, operations, marketing, and leadership
  • Track record of influencing product direction by grounding teams in user needs, pain points, and research-backed opportunities
Job Responsibility:
  • Lead end-to-end UX research that powers Handshake AI's human data products, including expert onboarding, annotation workflows, evaluation tools, and the internal operations systems that power and scale these experiences
  • Track and measure user sentiment, usability, and efficiency across systems to drive continuous improvement
  • Extract insights from qualitative research and product analytics to inform design and strategy
  • Plan and run evaluative and generative research throughout the product lifecycle, from early concepts to live tools
  • Deliver rapid, actionable insights that improve fellow experience, product quality, and operational performance
  • Recruit participants independently and manage logistics with minimal overhead
  • Produce clear, timely research deliverables and maintain a centralized repository of insights and recommendations
  • Deliver actionable insights that directly impact product improvements
  • Track how research influences key outcomes such as OKRs, engagement, task quality, throughput, and business impact
  • Build deep partnerships with cross-functional stakeholders to align research with strategic goals
What we offer:
  • Equity in a fast-growing company
  • 401(k) match, competitive compensation, financial coaching
  • Paid parental leave, fertility benefits, parental coaching
  • Medical, dental, and vision, mental health support, $500 wellness stipend
  • $2,000 learning stipend, ongoing development
  • Internet, commuting, and free lunch/gym in our SF office
  • Flexible PTO, 15 holidays + 2 flex days
  • Team outings & referral bonuses
Employment Type:
Full-time

AI Engineering Intern (LLM)

Student Exploration and Experience Development (SEED) is a 12-week internship op...
Location:
United States, Paramus
Salary:
21.00 - 25.00 USD / Hour
Veolia
Expiration Date:
Until further notice
Requirements:
  • Working towards a PhD degree in AI/ML/Computer Science
  • 3.8 cumulative GPA required
  • Strong communication skills, including written, verbal, listening, presentation and facilitation skills
  • Demonstrated ability to build collaborative relationships
  • Understanding of and experience working with commercial/proprietary LLMs such as Gemini (Google), GPT (OpenAI), and Claude Sonnet (Anthropic) for high-performance, large-context, and multimodal tasks
  • Familiarity with open-source/self-hosted LLMs like Llama from Meta and Mixtral from Mistral AI
  • Requirements Gathering: Using Confluence for documentation and collaboration
  • Architecture Design: Creating system diagrams and workflows with Lucidchart
  • Prototyping: Designing UI/UX prototypes in Figma
  • Project Management: Tracking tasks and progress in Jira
Job Responsibility:
  • Support the development and implementation of an AI-powered deep research agent
  • Gain hands-on experience with cutting-edge large language models, cloud infrastructure, and enterprise software development
  • Work on real-world projects
  • Receive mentorship from industry professionals
  • Participate in workshops and networking events
Employment Type:
Full-time

Research Scientist Intern, Agentic AI

Meta is seeking Research Interns to join our Meta Superintelligence Lab in the p...
Location:
United States, Bellevue
Salary:
7650.00 - 12134.00 USD / Month
Meta
Expiration Date:
Until further notice
Requirements:
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Artificial Intelligence, Generative AI, or a relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience with Python, C++, C, Java or other related languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility:
  • Develop novel state-of-the-art agentic AI algorithms and corresponding systems, leveraging machine learning and reinforcement learning techniques
  • Conduct research on agentic LLMs, agentic RL environments, LLM post-training, and related topics
  • Analyze and improve the efficiency, scalability, and stability of agentic AI algorithms and deployed systems
  • Advance the science and technology of intelligent, agentic machines capable of reasoning, tool use, and personalized interactions
  • Collaborate with researchers and cross-functional partners, including communicating research plans, progress, and results
  • Disseminate research results through publications, presentations, and open source contributions
  • When applicable, contribute to research that can be applied to Meta product development

Research Scientist Intern

Meta is seeking Research Interns to join our Meta Superintelligence Lab in one o...
Location:
France, Paris
Salary:
Not provided
Meta
Expiration Date:
Until further notice
Requirements:
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Artificial Intelligence, Generative AI, or a relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience with Python, C++, C, Java or other related languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility:
  • Develop novel state-of-the-art agentic AI algorithms and corresponding systems, leveraging machine learning and reinforcement learning techniques
  • Conduct research on agentic LLMs, agentic RL environments, LLM post-training, and related topics
  • Analyze and improve the efficiency, scalability, and stability of agentic AI algorithms and deployed systems
  • Advance the science and technology of intelligent, agentic machines capable of reasoning, tool use, and personalized interactions
  • Collaborate with researchers and cross-functional partners, including communicating research plans, progress, and results
  • Disseminate research results through publications, presentations, and open source contributions
  • When applicable, contribute to research that can be applied to Meta product development

AI/Data Resident

A 6 month remote internship opportunity offering you a glimpse into the real wor...
Location:
India, Bengaluru
Salary:
Not provided
Ema
Expiration Date:
Until further notice
Requirements:
  • Currently enrolled in a full-time degree program in Computer Science or related field graduating by June 2026
  • Top rankers in JEE Advanced and consistent high academic record (GPA) preferred
  • Strong interest in either: Machine Learning or Natural Language Processing (NLP), ideally in one or more of these areas: Generative AI, Natural Language Understanding (NLU), Natural Language Generation (NLG), Structured Prediction, Unsupervised Learning & Representation Learning
  • AND/OR Systems programming, data engineering, data processing frameworks, streaming databases and data visualization
  • Strong coding skills and readiness to pick up any task and run with it
  • Experience with SQL and other programming languages like Python, Scala, or R
  • Experience with programming languages like Python and familiarity with frameworks like PyTorch, TensorFlow, Keras, etc.
  • Experience with statistical methods (linear models, multivariate analysis, stochastic processes, sampling methods, etc.)
Job Responsibility:
  • Research, design, implement, optimize and deploy deep learning models that advance the state of the art in perception and control for autonomous driving
  • A typical day to day includes reading deep learning papers, implementing described models and algorithms, adapting them to our setting and driving up internal metrics
  • Play a pivotal role in developing and maintaining sophisticated enterprise-level software applications, with a focus on back-end systems, API development, and the integration of language models and NLP technologies
  • Train machine learning models at Ema
  • Develop state-of-the-art algorithms in one or more of the following areas: prompt engineering for LLMs, fine-tuning models, training open-source models, large-scale distributed training
  • Comparing and benchmarking performance of different models
  • Optimize deep neural networks and the associated preprocessing/postprocessing code to run efficiently on an embedded device
  • Conduct analysis that includes data gathering, data transformation, data processing and analysis
  • Work with large complex data sets, solve difficult non-routine analysis problems, and apply advanced analytical methods
  • Build and prototype analysis pipelines iteratively to provide insights at scale
What we offer:
  • Monthly Stipend - Rs 65,000 per month
  • Coaching and mentorship from highly accomplished leaders
  • Connections to a powerful and sought-after network of world leaders for future endeavors
Employment Type:
Part-time

Research Intern - OneDrive and SharePoint

Research Internships at Microsoft provide a dynamic environment for research car...
Location:
United States, Redmond
Salary:
6710.00 - 13270.00 USD / Month
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, Data Science, Statistics, or related fields
  • Demonstrated foundation in machine learning and artificial intelligence
  • Hands-on experience with modern deep learning techniques (e.g., transformer models, large language models)
  • Practical Python coding experience with PyTorch or similar frameworks
  • Ability to prototype and implement algorithms efficiently
  • Proficient analytical, problem solving, communication, and collaborative skills
  • Able to formulate hypotheses, drive experiments, and work effectively in a collaborative environment
  • Demonstrated research impact through publications or projects in relevant AI domains (e.g., natural language processing, information retrieval, computer vision, multimodal AI, knowledge mining)
  • Familiarity with LLM training or fine-tuning, particularly using reinforcement learning techniques or AI agent orchestrators
  • Experience working with large-scale datasets, enterprise content, or content management systems
Job Responsibility:
  • Conduct experiments and develop novel AI models and algorithms to address complex ODSP scenarios (e.g., intelligent document understanding, search/RAG, recommendation, generative experiences, proactive knowledge mining)
  • Design rigorous evaluation metrics, methodologies, and validation experiments to measure performance and quality of devised AI solutions and agentic AI workflows
  • Conduct user studies to gather qualitative and quantitative insights, ensuring solutions align with user needs and improve real-world experience
  • Build datasets (including leveraging privacy-preserving synthetic data techniques) to fine-tune and benchmark models for ODSP applications
  • Deliver models and algorithms for content understanding, enrichment, and use at scale, across a range of different modalities, including text, images, and video
  • Apply your in-depth knowledge, problem-solving skills, and drive to solve new challenges in the field and realize your ideas in products used worldwide
  • Present research findings and propose ideas in team discussions; share results through documentation and presentations and contribute to research publications
  • Embody Microsoft culture and values
Employment Type:
Full-time

Principal Engineer, ASIC Development Engineering (Frontend Architect - AI Storage Solutions)

In this Frontend Architect position, you will develop AI Storage Solutions based...
Location:
India, Bangalore
Salary:
Not provided
Sandisk
Expiration Date:
Until further notice
Requirements:
  • Bachelor's, Master's, or PhD in Computer/Electrical Engineering with 8+ years of hands-on architecture experience authoring specifications
  • Strong technical background architecting SoC and I/O subsystems involving PCIe and PCIe-DMA engines, or UCIe or CXL or UAL
  • Strong IO subsystem microarchitecture, technical, and working knowledge of the PCIe/UCIe protocol specifications
  • Knowledge of I/O Subsystem and DMA interactions with internal embedded processor-subsystems (x86, RISC-V or ARM) and external host CPU
  • Good understanding of computer/graphics architecture, ML, LLM
  • Experience architecting GPU/TPU/xPU accelerator systems with an optimized high-bandwidth memory hierarchy and frontend architecture for multi-trillion-parameter LLM training/inference, including Dense and Mixture of Experts (MoE) models with multiple modalities (text, vision, speech)
  • Deep experience optimizing large-scale ML systems, GPU architectures
  • Proficiency in principles and methods of microarchitecture, software, and hardware relevant to performance engineering
  • Multi-disciplinary experience, including familiarity with Firmware and ASIC design
  • Expertise in CUDA programming, GPU memory hierarchies, and hardware-specific optimizations
Job Responsibility:
  • Responsible for driving the SoC architecture, with a particular focus on I/O subsystems connected over UCIe, PCIe, UAL or CXL
  • Define I/O subsystem and PCIe DMA architectures, including their interactions with internal embedded processor-subsystems, Network on Chip, Memory controllers, and FPGA fabric
  • Create flexible and modular I/O subsystem architectures that can be deployed in either chiplet, monolithic or 3D form factors
  • Work with customers and cross-functional teams to scope SoC requirements, analyze PPA tradeoffs, and then define architectural requirements that meet the PPA and schedule targets
  • Define I/O subsystem and DMA hardware, software, and firmware interactions with embedded processing subsystems and SoC CPUs on the device side and Host CPUs
  • Author architecture specifications in clear and concise language. Guide and assist pre-silicon design/verification and post-silicon validation during the execution phase
  • Responsible for improving the AI/ML ASIC Architecture performance through hardware & software co-optimization, post-silicon performance analysis, and influencing the strategic product roadmap
  • LLM Workload analysis and characterization of ASIC and competitive datacenter and AI solutions to identify opportunities for performance improvement in our products
  • Experience architecting one or some components of AI/ML accelerator ASICs such as HBM, PCIe/UCIe/CXL, NoC, DMA, Firmware Interactions, NAND, xPU, fabrics, etc
  • Drive the AI Storage Solutions frontend system architecture with GPU/TPU/NPU/xPU to match or exceed next-generation HBM bandwidth
Employment Type:
Full-time