Research Intern - LLM Acceleration Job at Microsoft Corporation (Mountain View)

Research Intern - AI Frameworks (Network Systems and Tools)

Research Internships at Microsoft provide a dynamic environment for research car...

Location

United States , Redmond

Salary:

6710.00 - 13270.00 USD / Month

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
Research experience in areas such as computer architecture, AI/ML systems, performance modeling, distributed systems, or hardware–software co-design
Programming skills in Python, C/C++ with experience building prototypes, simulators, or performance analysis tools
Familiarity with modern AI workloads and/or deep learning frameworks (e.g., PyTorch)
Demonstrated ability to define and pursue original research directions in AI systems or architecture
Ability to collaborate effectively with researchers across disciplines and work in cross-group, cross-cultural environments
Proficient communication and presentation skills for sharing complex technical insights
Ability to think creatively and approach system and architecture challenges with unconventional or innovative solutions
Experience with PyTorch, CUDA, Triton, or performance-simulation tools
Background in large-scale system design, AI inference bottleneck analysis, or modeling cost/performance tradeoffs

Job Responsibility

Investigate and evaluate emerging disaggregated KV cache architectures
Implement a hierarchical storage architecture with multiple tiers GPU Memory: Active working set of KV caches currently used by the model CPU DRAM: Hot cache for recently used KV chunks using pinned memory for efficient GPU-CPU transfers Local Storage: Large-scale local caching (NVMe, local disk)
Build Peer-to-Peer (P2P) service KV cache sharing architecture that enables direct, high-performance cache transfer between multiple LLM serving instances without requiring centralized cache servers

Fulltime

Research Intern - Artificial Intelligence

Research Internships at Microsoft provide a dynamic environment for research car...

Location

Canada , Vancouver

Salary:

5460.00 - 10680.00 USD / Month

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Currently enrolled in a Master or Ph.D. program in Computer Science, Electrical Engineering, Mathematics or a related field
Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples

Job Responsibility

Conduct research on state-of-the-art artificial intelligence methodologies and identify new opportunities to advance artificial intelligence from various aspects
Leverage interdisciplinary expertise and knowledge across NLP, LLM, computer vision, and other related domains to accelerate innovation in artificial intelligence
Develop, prototype, and optimize novel methodologies to tackle the core challenges of artificial intelligence
Disseminate research findings through publications in peer-reviewed journals, top-tier conferences, and other relevant venues, and present results both internally and externally
Collaborate with researchers at MSR or Microsoft and beyond to advance and propel the research process

Fulltime

Research Intern - Artificial Intelligence

Research Internships at Microsoft provide a dynamic environment for research car...

Location

Canada , Vancouver

Salary:

5600.00 - 10000.00 CAD / Month

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Currently enrolled in a master's or PhD program in Computer Science, Electrical Engineering, Mathematics or a related field
Ability to work independently and collaboratively in a dynamic and vibrant research environment
Willingness to embrace knowledge/technique outside your field of research
Solid programming skill, including prototyping, implementation and optimization
Experience in LLM pre-training, post-training and inference (like Megatron framework)
Experience in world model, multimodality and AI4Code
Reinforcement Learning experiences and frameworks (like VeRL, rLLM)
Proven publication track such as CVPR, ACL, ICML, ICCV, ECCV, NeurIPS, ICLR, RSS

Job Responsibility

Conduct research on state-of-the-art artificial intelligence methodologies and identify new opportunities to advance artificial intelligence from various aspects
Leverage interdisciplinary expertise and knowledge across NLP, LLM, computer vision, and other related domains to accelerate innovation in artificial intelligence
Develop, prototype, and optimize novel methodologies to tackle the core challenges of artificial intelligence
Disseminate research findings through publications in peer-reviewed journals, top-tier conferences, and other relevant venues, and present results both internally and externally
Collaborate with researchers at MSR or Microsoft and beyond to advance and propel the research process

Fulltime

Research Scientist Intern, PyTorch Distributed

Meta is seeking a Research Scientist Intern to join our Meta PyTorch Distributed...

Location

United States , Menlo Park

Salary:

7650.00 - 12134.00 USD / Month

Research Scientist Intern, Modern Recommendation Systems

Meta is seeking Research Interns to join our “Modern Recommendation Systems” (MR...

Location

United States , Bellevue

Salary:

7650.00 - 12134.00 USD / Month

Research Scientist Intern

Meta is seeking Research Interns to join our Meta Recommendation Systems (MRS) R...

Location

United States , Bellevue

Salary:

7650.00 - 12134.00 USD / Month

New

Senior Data Scientist - Agentic Systems

The Global Marketing Engines and Experiences (E&E) team within Microsoft is resp...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 1+ year(s) data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
OR Master's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 3+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
OR Bachelor's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 5+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results)
OR equivalent experience

Job Responsibility

AI-Native Operations
Design, build, and ship agentic capabilities that make the analytics team more AI-native, including co-PM agents that triage incoming work, monitor data science workstreams in Azure DevOps, and propose ticket updates that keep our backlog accurate without manual hygiene effort
Build PM-layer agents that read across the Azure Dev Ops (ADO) portfolio to surface risk, estimate effort on new requests, and recommend project plans that managers can adapt rather than write from scratch
Establish shared infrastructure and patterns — prompting, evaluation, orchestration, observability, guardrails — that let the rest of the team build downstream agents reliably
Analyst Delivery Acceleration
Develop LLM-powered internal tools and skills that compress the cycle from analytics request to delivered insight, including capabilities that draft, format, and pressure-test the standard inputs analysts produce for MBR, MMR, and leadership review rhythms
Identify the highest-friction parts of the analyst delivery flow and design generative AI interventions that remove rather than relocate the work
Partner with the analytics team to instrument adoption, measure time saved, and iterate based on real usage rather than projected value
Marketer-Facing Capabilities
Lead the design and delivery of marketer-facing generative AI capabilities, anchored by a conversational analytics agent that allows marketers across Brand, Product Marketing Management (PMM), Customer Insights, and demand generation to self-serve on the analytics questions they bring to the team today

What we offer

Eligible for benefits and other compensation

Fulltime

Senior ML Engineer (GenAI, AWS)

Provectus helps companies adopt ML/AI to transform the ways they operate, compet...

Location

Colombia , Medellín; Bogotá; Cali; Barranquilla; Bucaramanga

Salary:

Not provided

Provectus

Expiration Date

Until further notice

Requirements

ML Fundamentals: supervised, unsupervised, and reinforcement learning
Model Development: feature engineering, model training, evaluation, hyperparameter tuning, and validation
ML Frameworks: classical ML libraries, TensorFlow, PyTorch, or similar frameworks
Deep Learning: CNNs, RNNs, Transformers
LLM Applications: Experience building production LLM-based applications
Prompt Engineering: Ability to design effective prompts and chain-of-thought strategies
RAG Systems: Experience building retrieval-augmented generation architectures
Vector Databases: Familiarity with embedding models and vector search
LLM Evaluation: Experience with evaluation metrics and techniques for LLM outputs
Python: Advanced proficiency in Python for ML applications

Job Responsibility

Design and implement end-to-end ML solutions from experimentation to production
Build scalable ML pipelines and infrastructure
Optimize model performance, efficiency, and reliability
Write clean, maintainable, production-quality code
Conduct rigorous experimentation and model evaluation
Troubleshoot and resolve complex technical challenges
Mentor junior and mid-level ML engineers
Conduct code reviews and provide constructive feedback
Share knowledge through documentation, presentations, and workshops
Collaborate with cross-functional teams (DevOps, Data Engineering, SAs)

What we offer

Long-term B2B collaboration
Fully remote setup
A budget for your medical insurance
Paid sick leave, vacation, public holidays
Continuous learning support, including unlimited AWS certification sponsorship

Fulltime

Select Country

Research Intern - LLM Acceleration

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?