CrawlJobs Logo

Research Intern - LLM Acceleration

United States, Mountain View 5610.00 - 11010.00 USD / Month · Job Posted May 04, 2026
Apply Position
Job Link Share

Job Description

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment. Large language model applications are ubiquitous today. They present enormous opportunities for acceleration and performance improvement on custom architectures. This Research Internship is an opportunity to work at the confluence of AI, architecture and performance optimization.

Job Responsibility

  • Research Interns put inquiry and theory into practice
  • Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life
  • Research Interns not only advance their own careers, but they also contribute to exciting research and development strides
  • During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community

Requirements

  • At least 1 year of experience with computer architecture and parallel programming
  • Currently enrolled in a PhD or Masters program in Computer Science or a related STEM field
  • Other Requirements: Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples

Nice to have

  • Demonstrated ability to develop original research agendas
  • Able to collaborate effectively with other researchers and product development teams
  • Interpersonal skills, cross-group, and cross-culture collaboration
  • Ability to think unconventionally to derive creative and innovative solutions
  • Experience with PyTorch, CUDA/Triton etc.

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Research Intern - LLM Acceleration

8 matching positions

Research Intern - AI Frameworks (Network Systems and Tools)

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Redmond
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
  • Research experience in areas such as computer architecture, AI/ML systems, performance modeling, distributed systems, or hardware–software co-design
  • Programming skills in Python, C/C++ with experience building prototypes, simulators, or performance analysis tools
  • Familiarity with modern AI workloads and/or deep learning frameworks (e.g., PyTorch)
  • Demonstrated ability to define and pursue original research directions in AI systems or architecture
  • Ability to collaborate effectively with researchers across disciplines and work in cross-group, cross-cultural environments
  • Proficient communication and presentation skills for sharing complex technical insights
  • Ability to think creatively and approach system and architecture challenges with unconventional or innovative solutions
  • Experience with PyTorch, CUDA, Triton, or performance-simulation tools
  • Background in large-scale system design, AI inference bottleneck analysis, or modeling cost/performance tradeoffs
Job Responsibility
Job Responsibility
  • Investigate and evaluate emerging disaggregated KV cache architectures
  • Implement a hierarchical storage architecture with multiple tiers GPU Memory: Active working set of KV caches currently used by the model CPU DRAM: Hot cache for recently used KV chunks using pinned memory for efficient GPU-CPU transfers Local Storage: Large-scale local caching (NVMe, local disk)
  • Build Peer-to-Peer (P2P) service KV cache sharing architecture that enables direct, high-performance cache transfer between multiple LLM serving instances without requiring centralized cache servers
  • Fulltime
Read More
Arrow Right

Research Intern - Artificial Intelligence

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
Canada , Vancouver
Salary
Salary:
5460.00 - 10680.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a Master or Ph.D. program in Computer Science, Electrical Engineering, Mathematics or a related field
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples
Job Responsibility
Job Responsibility
  • Conduct research on state-of-the-art artificial intelligence methodologies and identify new opportunities to advance artificial intelligence from various aspects
  • Leverage interdisciplinary expertise and knowledge across NLP, LLM, computer vision, and other related domains to accelerate innovation in artificial intelligence
  • Develop, prototype, and optimize novel methodologies to tackle the core challenges of artificial intelligence
  • Disseminate research findings through publications in peer-reviewed journals, top-tier conferences, and other relevant venues, and present results both internally and externally
  • Collaborate with researchers at MSR or Microsoft and beyond to advance and propel the research process
  • Fulltime
Read More
Arrow Right

Research Intern - Artificial Intelligence

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
Canada , Vancouver
Salary
Salary:
5600.00 - 10000.00 CAD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a master's or PhD program in Computer Science, Electrical Engineering, Mathematics or a related field
  • Ability to work independently and collaboratively in a dynamic and vibrant research environment
  • Willingness to embrace knowledge/technique outside your field of research
  • Solid programming skill, including prototyping, implementation and optimization
  • Experience in LLM pre-training, post-training and inference (like Megatron framework)
  • Experience in world model, multimodality and AI4Code
  • Reinforcement Learning experiences and frameworks (like VeRL, rLLM)
  • Proven publication track such as CVPR, ACL, ICML, ICCV, ECCV, NeurIPS, ICLR, RSS
Job Responsibility
Job Responsibility
  • Conduct research on state-of-the-art artificial intelligence methodologies and identify new opportunities to advance artificial intelligence from various aspects
  • Leverage interdisciplinary expertise and knowledge across NLP, LLM, computer vision, and other related domains to accelerate innovation in artificial intelligence
  • Develop, prototype, and optimize novel methodologies to tackle the core challenges of artificial intelligence
  • Disseminate research findings through publications in peer-reviewed journals, top-tier conferences, and other relevant venues, and present results both internally and externally
  • Collaborate with researchers at MSR or Microsoft and beyond to advance and propel the research process
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, PyTorch Distributed

Meta is seeking a Research Scientist Intern to join our Meta PyTorch Distributed...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, PhD degree in the field of Computer Science or a related STEM field
  • Experience in one or more of the following machine learning/deep learning domains: Large scale training and inference ML Systems Research, ML theory: Basic knowledge about ML models in different modalities like LLM (Large Language Models), Vision (VITS, MVITS) and Multimodal and how scale impacts performance, ML systems: AI infrastructure, machine learning accelerators, high performance computing, machine learning compilers, GPU architecture, machine learning frameworks, distributed systems, on-device optimization
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Apply relevant AI and machine learning techniques to advance the state-of-the-art in machine learning frameworks
  • Collaborate with users of PyTorch to enable new use cases for the framework both inside and outside Meta
  • Develop novel, accurate AI algorithms and advanced systems for large scale distributed training and inference
  • Leverage graph-based and compiler-based technologies to optimize distributed training and distributed inference use-cases
Read More
Arrow Right

Research Scientist Intern, Modern Recommendation Systems

Meta is seeking Research Interns to join our “Modern Recommendation Systems” (MR...
Location
Location
United States , Bellevue
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Computer Vision, Artificial Intelligence, or relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience with Python, with experience in machine learning libraries such as Pytorch
  • Familiarity with AI/ML modeling techniques (e.g., LLM, RAG, LSTM, GRU, Transformers) and/or its acceleration for large scale use cases
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility
Job Responsibility
  • Initiate and lead efforts towards long-term ambitious research goals, while identifying intermediate milestones in the area of recommendation systems and models, user and content understanding and multi-modal (video, audio, and text) LLM analysis for classification and relevance use cases
  • Conduct original research that can eventually be applied to Meta product development, engage with the wider research community, including publishing and releasing open source software where appropriate
  • Design, train and support video understanding libraries and models to implement new features and functionality for use internally at Meta
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
Read More
Arrow Right

Research Scientist Intern

Meta is seeking Research Interns to join our Meta Recommendation Systems (MRS) R...
Location
Location
United States , Bellevue
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Computer Vision, Artificial Intelligence, or relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience with Python, with experience in machine learning libraries such as Pytorch
  • Familiarity with AI/ML modeling and algorithmic techniques (e.g., various components of multimodal LLM, RAG, LSTM, GRU, Transformers, RL and/or its acceleration for large scale use cases)
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility
Job Responsibility
  • Initiate and lead efforts towards long-term ambitious research goals, while identifying intermediate milestones in the area of recommendation systems and models, user and content understanding and multi-modal (video, audio, and text) LLM analysis for classification and relevance use cases
  • Conduct original research that can eventually be applied to Meta product development, engage with the wider research community, including publishing and releasing open source software where appropriate
  • Design, train and support AI/ML libraries and models to implement new features and functionality for use internally at Meta
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results.
Read More
Arrow Right
New

Senior Data Scientist - Agentic Systems

The Global Marketing Engines and Experiences (E&E) team within Microsoft is resp...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 1+ year(s) data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR Master's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 3+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR Bachelor's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 5+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
  • OR equivalent experience
Job Responsibility
Job Responsibility
  • AI-Native Operations
  • Design, build, and ship agentic capabilities that make the analytics team more AI-native, including co-PM agents that triage incoming work, monitor data science workstreams in Azure DevOps, and propose ticket updates that keep our backlog accurate without manual hygiene effort
  • Build PM-layer agents that read across the Azure Dev Ops (ADO) portfolio to surface risk, estimate effort on new requests, and recommend project plans that managers can adapt rather than write from scratch
  • Establish shared infrastructure and patterns — prompting, evaluation, orchestration, observability, guardrails — that let the rest of the team build downstream agents reliably
  • Analyst Delivery Acceleration
  • Develop LLM-powered internal tools and skills that compress the cycle from analytics request to delivered insight, including capabilities that draft, format, and pressure-test the standard inputs analysts produce for MBR, MMR, and leadership review rhythms
  • Identify the highest-friction parts of the analyst delivery flow and design generative AI interventions that remove rather than relocate the work
  • Partner with the analytics team to instrument adoption, measure time saved, and iterate based on real usage rather than projected value
  • Marketer-Facing Capabilities
  • Lead the design and delivery of marketer-facing generative AI capabilities, anchored by a conversational analytics agent that allows marketers across Brand, Product Marketing Management (PMM), Customer Insights, and demand generation to self-serve on the analytics questions they bring to the team today
What we offer
What we offer
  • Eligible for benefits and other compensation
  • Fulltime
Read More
Arrow Right

Senior ML Engineer (GenAI, AWS)

Provectus helps companies adopt ML/AI to transform the ways they operate, compet...
Location
Location
Colombia , Medellín; Bogotá; Cali; Barranquilla; Bucaramanga
Salary
Salary:
Not provided
provectus.com Logo
Provectus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • ML Fundamentals: supervised, unsupervised, and reinforcement learning
  • Model Development: feature engineering, model training, evaluation, hyperparameter tuning, and validation
  • ML Frameworks: classical ML libraries, TensorFlow, PyTorch, or similar frameworks
  • Deep Learning: CNNs, RNNs, Transformers
  • LLM Applications: Experience building production LLM-based applications
  • Prompt Engineering: Ability to design effective prompts and chain-of-thought strategies
  • RAG Systems: Experience building retrieval-augmented generation architectures
  • Vector Databases: Familiarity with embedding models and vector search
  • LLM Evaluation: Experience with evaluation metrics and techniques for LLM outputs
  • Python: Advanced proficiency in Python for ML applications
Job Responsibility
Job Responsibility
  • Design and implement end-to-end ML solutions from experimentation to production
  • Build scalable ML pipelines and infrastructure
  • Optimize model performance, efficiency, and reliability
  • Write clean, maintainable, production-quality code
  • Conduct rigorous experimentation and model evaluation
  • Troubleshoot and resolve complex technical challenges
  • Mentor junior and mid-level ML engineers
  • Conduct code reviews and provide constructive feedback
  • Share knowledge through documentation, presentations, and workshops
  • Collaborate with cross-functional teams (DevOps, Data Engineering, SAs)
What we offer
What we offer
  • Long-term B2B collaboration
  • Fully remote setup
  • A budget for your medical insurance
  • Paid sick leave, vacation, public holidays
  • Continuous learning support, including unlimited AWS certification sponsorship
  • Fulltime
Read More
Arrow Right