CrawlJobs Logo

Research Intern - LLM Performance Optimization

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , Redmond

Category Icon

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

6710.00 - 13270.00 USD / Month

Job Description:

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

Job Responsibility:

  • Research Interns put inquiry and theory into practice
  • learn, collaborate, and network for life
  • contribute to exciting research and development strides
  • paired with mentors
  • expected to collaborate with other Research Interns and researchers
  • present findings
  • contribute to the vibrant life of the community

Requirements:

  • Currently enrolled in a PhD program in Computer Science or a related STEM field
  • At least 1 year of experience with Large Language Model architecture or inference performance optimization
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • submit a minimum of two reference letters
  • a cover letter
  • any relevant work or research samples

Nice to have:

  • Demonstrated ability to assess and fix kernel performance bottlenecks for GPUs or other high performance parallel computer architectures
  • Familiarity with optimizing compiler architecture and intermediate representations (such as LLVMIR or MLIR)
  • Ability to think unconventionally to derive creative and innovative solutions

Additional Information:

Job Posted:
March 25, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Intern - LLM Performance Optimization

PhD AI Research Intern

Join our cutting-edge Machine Learning Research team at Atlassian as a PhD Resea...
Location
Location
Canada
Salary
Salary:
55.00 USD / Hour
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Completed Bachelors degree in Computer Science or a related field
  • Currently pursuing a PhD in Computer Science or a related field at any stage of your doctoral studies
  • Strong foundation in AI/ML, LLMs, modeling and/or optimization techniques
Job Responsibility
Job Responsibility
  • Collaborate cross-functionally with Research Scientists and Machine Learning Engineers to design, implement, and evaluate experiments that advance the performance, efficiency, and scalability of modern ML and LLM systems for our AI products
  • Curate, preprocess, and manage large-scale datasets for training and evaluation, ensuring data quality, diversity, and reproducibility across experiments
  • Conduct continued training, fine-tuning, and alignment of large language models for specialized applications such as conversational AI, summarization, generative search, and multimodal agents
  • Evaluate cutting-edge ML algorithms through rigorous experimentation and provide detailed analyses highlighting performance insights, failure modes, and opportunities for improvement
  • Contribute to publications and presentations at internal workshops or top-tier academic venues, helping to drive innovation in Enterprise AI and large-scale ML systems
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
Read More
Arrow Right

PhD AI Research Intern

Join our cutting-edge Machine Learning Research team at Atlassian as a PhD Resea...
Location
Location
United States , Seattle
Salary
Salary:
49.00 - 75.00 USD / Hour
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Completed Bachelors degree in Computer Science or a related field
  • Currently pursuing a PhD in Computer Science or a related field at any stage of your doctoral studies
  • Degree completion date cannot be earlier than September 2026 - June 2027
  • Strong foundation in AI/ML, LLMs, modeling and/or optimization techniques
  • Exhibit a solid grasp of algorithms and data structures
  • Demonstrate proficiency in Python programming and ability to write clean, efficient, and well-documented code
  • Experience working with large-scale datasets, including data preprocessing, augmentation, and scaling techniques
  • Has expertise in managing data using Python libraries such as NumPy, Pandas, Matplotlib, in addition to leveraging models from Hugging Face and has practical knowledge of applied machine learning and deep learning frameworks, like PyTorch
  • Demonstrated exposure to natural language processing (NLP) and Computer Vision (CV)
  • Familiarity with state-of-the-art research in machine learning and AI, as evidenced by relevant coursework, publications, or projects
Job Responsibility
Job Responsibility
  • Collaborate cross-functionally with Research Scientists and Machine Learning Engineers to design, implement, and evaluate experiments that advance the performance, efficiency, and scalability of modern ML and LLM systems for our AI products
  • Curate, preprocess, and manage large-scale datasets for training and evaluation, ensuring data quality, diversity, and reproducibility across experiments
  • Conduct continued training, fine-tuning, and alignment of large language models for specialized applications such as conversational AI, summarization, generative search, and multimodal agents
  • Evaluate cutting-edge ML algorithms through rigorous experimentation and provide detailed analyses highlighting performance insights, failure modes, and opportunities for improvement
  • Contribute to publications and presentations at internal workshops or top-tier academic venues, helping to drive innovation in Enterprise AI and large-scale ML systems
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
Read More
Arrow Right

Member of Technical Staff, Research

As a Member of Technical Staff on the Research team, you’ll push the boundaries ...
Location
Location
United States , San Mateo
Salary
Salary:
175000.00 - 240000.00 USD / Year
fireworks.ai Logo
Fireworks AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Research background in Artificial Intelligence, Machine Learning, Physics, or similar field
  • Experience solving analytical problems using analytic and quantitative approaches
  • Experience communicating research to audiences with different backgrounds
  • Experience coding in C/C++, Python, or other similar languages
Job Responsibility
Job Responsibility
  • Conduct foundational research to advance the capabilities, efficiency, and reliability of LLMs and multimodal systems
  • Design, implement, and evaluate novel model architectures, training methods, and optimization techniques
  • Collaborate with engineering teams to transition research prototypes into production-grade systems
  • Analyze empirical results, identify performance bottlenecks, and iterate quickly to improve model quality
  • Contribute to internal research strategy by identifying high-impact opportunities and emerging trends in AI
What we offer
What we offer
  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package
  • Fulltime
Read More
Arrow Right

Performance Architect

In this position, you will develop AI Storage Solutions based advanced system ar...
Location
Location
United States , Milpitas
Salary
Salary:
136537.00 - 193442.00 USD / Year
sandisk.com Logo
Sandisk
Expiration Date
April 28, 2026
Flip Icon
Requirements
Requirements
  • Bachelors or Masters or PhD in Computer/Electrical Engineering with 5+ years of relevant experience in Performance Modeling, Simulation, and Analysis using SystemC
  • At least 5+ years of experience with SystemC modeling
  • Good understanding of computer/graphics architecture, ML, LLM
  • Experience of simulation using System C and TLM, behavioral modeling and performance analysis
Job Responsibility
Job Responsibility
  • Build SystemC performance models for AI Storage Solutions based products covering end-to-end from GPU/TPU/NPU/xPU, host interface, memory hierarchy, basedie controller, and AI Storage Solutions using various packaging technolgies
  • Responsible for improving the AI/ML ASIC Architecture performance through hardware & software co-optimization, post-silicon performance analysis, and influencing the strategic product roadmap
  • Workload analysis and characterization of ASIC and competitive datacenter and AI solutions to identify opportunities for performance improvement in our products
  • Collaboration with Architecture team to resolve performance issues and optimize the performance and TCO of their AI Storage Solutions based datacenter technologies
  • Experience modeling one or some components of AI/ML accelerator ASICs such as AI Storage Solutions, PCIe/UCIe/CXL, NoC, DMA, Firmware Interactions, NAND, xPU, fabrics, etc
  • Performance modeling and optimization for multi-trillion parameter LLM training/inference including Dense, Mixture of Experts (MoE) with multiple modalities (text, vision, speech)
  • Model/optimize novel parallelization strategies across tensor, pipeline, context, expert and data parallel dimensions
  • Architect memory-efficient training systems utilizing techniques like structured pruning, quantization (MX formats), continuous batching/chunked prefill, speculative decoding
  • Incorporate and extend SOTA models such as GPT-4, Reasoning models like Deepseek-R1, and multi-modal architectures
  • Collaborate with internal and external stakeholders/ML researchers to disseminate results and iterate at rapid pace
What we offer
What we offer
  • Short-Term Incentive (STI) Plan
  • Long-Term Incentive (LTI) program (restricted stock units (RSUs) or cash equivalents)
  • RSU awards for eligible new hires
  • Paid vacation time
  • Paid sick leave
  • Medical/dental/vision insurance
  • Life, accident and disability insurance
  • Tax-advantaged flexible spending and health savings accounts
  • Employee assistance program
  • Other voluntary benefit programs such as supplemental life and AD&D, legal plan, pet insurance, critical illness, accident and hospital indemnity
  • Fulltime
Read More
Arrow Right

Master’s research intern

Automated Prompt Optimization for Industrial Applications Based on LLMs. As part...
Location
Location
France , Hem
Salary
Salary:
Not provided
hornetsecurity.com Logo
Hornetsecurity
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s student (or equivalent) in Computer Science, Artificial Intelligence, Machine Learning, or a related field, ideally with a research component
  • Strong interest in LLM prompt engineering and evaluation methodologies
  • Proficient in Python and modern machine learning tools
  • Research-oriented mindset, capable of critically reading and analyzing recent scientific papers, formulating hypotheses and designing experiments to test them
  • Willingness to pursue a CIFRE PhD after the internship
  • Intellectually curious, autonomous, and rigorous, able to document work clearly and communicate results to both technical and non-technical audiences
  • Located in Paris or Lille
  • Fluent in English (written and spoken)
Job Responsibility
Job Responsibility
  • Designing and evaluating an automated prompt optimization pipeline using frameworks such as DSPy
  • Exploring the state-of-the-art in prompting and optimization techniques
  • Implementing automated prompt optimization using open-source tools
  • Defining and tracking performance metrics (accuracy, recall, F1)
  • Extending evaluation with text quality metrics (fluency, correctness, faithfulness, etc.)
  • Integrating quality feedback through “LLM-as-a-judge” methods into the optimization loop
What we offer
What we offer
  • 100% reimbursement of public transportation costs
  • Meal vouchers worth €10 per working day
  • CSE benefits & Student health insurance providing effective coverage from day one
  • Fulltime
Read More
Arrow Right
New

Senior Software Engineer

The R&D of Search Ads aims to build an online advertising ecosystem of users, ad...
Location
Location
China , Beijing
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Python, CUDA, or ROCm OR equivalent experience
  • 3+ years' practical experience working on applications that use GPUs, experience in optimizing their performance
  • Practical Experience writing new GPU kernels, going beyond experience of GPU workloads with existing library kernels
  • Cross-team collaboration skills and the desire to collaborate in a team of researchers and developers
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C/C++, CUDA, or ROCm OR Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C/C++, CUDA, or ROCm OR equivalent experience
  • Experience in low-level performance analysis and optimization, including proficiency using GPU profiling tools such as NVIDIA Visual Profiler, and NVIDIA Nsight Compute
  • Technical background and solid foundation in software engineering principles and architecture design
  • Familiar with inference optimization, experience in developing popular inference framework such as TensorRT-LLM, SGLang, vLLM
  • Exposure to Deep Neural Network inference and experience in one or more deep learning frameworks such as PyTorch, Tensorflow, or ONNX Runtime
Job Responsibility
Job Responsibility
  • Design, develop, and maintain high-performance software in C/C++ and Python, including GPU programming with CUDA, ROCm, or Triton
  • Optimize model inference and training pipelines for speed, throughput, memory efficiency, and cost across GPU platforms
  • Collaborate with platform teams to integrate and tune solutions on emerging accelerator stacks and rapidly evolving toolchains
  • Profile workloads end-to-end, identify bottlenecks, and implement kernel-level and system-level performance improvements
  • Partner with internal and external stakeholders to translate requirements into scalable performance features and optimizations for state-of-the-art models
  • Validate performance, stability, and correctness through benchmarking, automated testing, and production readiness reviews
  • Fulltime
Read More
Arrow Right

Ai infrastructure engineer, model serving platform

As a Software Engineer on the ML Infrastructure team, you will design and build ...
Location
Location
United States , San Francisco; New York
Salary
Salary:
179400.00 - 224250.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience building large-scale, high-performance backend systems
  • Strong programming skills in one or more languages (e.g., Python, Go, Rust, C++)
  • Experience with LLM serving and routing fundamentals (e.g. rate limiting, token streaming, load balancing, budgets, etc.)
  • Experience with LLM capabilities and concepts such as reasoning, tool calling, prompt templates, etc.
  • Experience with containers and orchestration tools (e.g., Docker, Kubernetes)
  • Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as code (e.g., Terraform)
  • Proven ability to solve complex problems and work independently in fast-moving environments
Job Responsibility
Job Responsibility
  • Build and maintain fault-tolerant, high-performance systems for serving LLMs workloads at scale
  • Build an internal platform to empower LLM capability discovery
  • Collaborate with researchers and engineers to integrate and optimize models for production and research use cases
  • Conduct architecture and design reviews to uphold best practices in system design and scalability
  • Develop monitoring and observability solutions to ensure system health and performance
  • Lead projects end-to-end, from requirements gathering to implementation, in a cross-functional environment
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • Fulltime
Read More
Arrow Right

Senior Technical Product Manager

As a Senior Technical Product Manager, Clinical Data Platform, you will be a key...
Location
Location
United States
Salary
Salary:
Not provided
aledade.com Logo
Aledade, Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of product or technical program management experience in healthcare data platforms, interoperability, or machine learning infrastructure, with a focus on clinical data ingestion and transformation, technology-enabled services industry, or a SaaS product
  • Experience using and writing queries against data for the purposes of performing preliminary research to inform solution design and build internal business understanding
  • Strong understanding of the software development lifecycle, Agile methodologies, and cross functional collaboration across engineering, informatics and data science teams
  • Product development experience supporting LLM pipelines or retrieval-augmented generation workflows using structured and unstructured healthcare data
  • Proven ability to bridge business objectives and platform capabilities in environments requiring data standardization and semantic normalization
Job Responsibility
Job Responsibility
  • Define and drive both short and long term technical roadmaps for data pipeline infrastructure, ensuring scalable, reliable ingestion and transformation of structured and unstructured data across diverse upstream sources and downstream consumers to deliver maximum value with minimum risk
  • Partner cross functionally with engineering, analytics and key business stakeholders to identify data requirements, translate them into technical specifications and support implementation through backlog grooming, solution design and adoption oversight
  • Monitor pipeline performance and data quality metrics, proactively investigate anomalies with SQL or equivalent query tools to drive root cause analysis and implement improvements to support data completeness, timeliness, analytics and generative AI initiatives
  • Work with internal teams and end users to develop a deep understanding of requirements, perform thoughtful technical solution designs, use data to test hypotheses, and support teams throughout execution
  • Write detailed user stories for new features, capturing detailed descriptions of business rationale, requirements, and success criteria that are defined by measurable outcomes
  • Ongoing optimization of live user workflows and capabilities including monitoring of key metrics & internal user feedback
What we offer
What we offer
  • Flexible work schedules and the ability to work remotely are available for many roles
  • Health, dental and vision insurance paid up to 80% for employees, dependents and domestic partners
  • Robust time-off plan (21 days of PTO in your first year)
  • Two paid volunteer days and 11 paid holidays
  • 12 weeks paid parental leave for all new parents
  • Six weeks paid sabbatical after six years of service
  • Educational Assistant Program and Clinical Employee Reimbursement Program
  • 401(k) with up to 4% match
  • Stock options
  • Fulltime
Read More
Arrow Right