CrawlJobs Logo

Research Intern - AI System Architecture Modeling and Performance

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , Hillsboro

Category Icon

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

6710.00 - 13270.00 USD / Month

Job Description:

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment. The Azure Hardware and Systems Infrastructure organization is central to defining Microsoft's first-party Artificial Intelligence (AI) infrastructure architecture and strategy. This is a dynamic and fast-paced environment that in close partnership with sister organizations helps define System on Chip (SoC) designs, interconnect topologies, memory hierarchies, and much more, all in the context of enabling and optimizing workload optimized data flows for large-scale AI models. This organization plays a critical role in roadmap definition all the way from concept to silicon to hyperscale integration.

Job Responsibility:

  • Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life
  • Research Interns not only advance their own careers, but they also contribute to exciting research and development strides
  • During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community
  • As a Research Intern, you will be at the forefront of hardware/software co-design and have a direct impact in answering critical questions around designing an optimized AI system and evaluating real-world impact on the Azure’s supporting hyperscale infrastructure
  • This role will evaluate opportunities to co-optimize central processing unit (CPU), graphics processing unit (GPU) and networking infrastructure for the Maia accelerator ecosystem
  • You will be expected to identify system stress points, propose novel architectural ideas, and create methodologies using a combination of workload characterization, modeling and benchmarking to evaluate their effectiveness

Requirements:

  • Accepted or currently enrolled in a PhD program in Computer Science or related STEM field
  • At least 1 year of experience with performance analysis tools and methodologies, optimization and modeling
  • Proficiency with frameworks such as PyTorch, SGLang, Dynamo, and AI accelerator programming models/compilers such as CUDA and Triton
  • Deep understanding of GPU and AI architectures including memory hierarchies, compute-communication interplay, kernel scheduling and interconnect properties
  • Familiarity with CPU/server architectures including understanding of PCIe topologies and accelerator/NIC/peripheral demand. Solid understanding of CPU involvement in dispatching, scheduling and orchestration of input data pipelines to AI accelerators
  • Hands-on experience with benchmarking, profiling, identifying perf bottlenecks and performance analysis and optimization, including trace generation, event monitoring and instrumentation
  • Familiarity with roofline performance modeling, detailed performance simulations and awareness of speed vs accuracy tradeoffs in various performance modeling methodologies
  • Ability to apply the appropriate performance analysis methodology including devising new or combinatorial approaches in evaluating complex system architecture what-if scenarios
  • Solid verbal and written communication skills

Additional Information:

Job Posted:
March 25, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Intern - AI System Architecture Modeling and Performance

AI Research Lab Research Associate

We are currently seeking highly qualified interns to accelerate research towards...
Location
Location
United States , Milpitas
Salary
Salary:
43.27 - 93.15 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
May 26, 2026
Flip Icon
Requirements
Requirements
  • Pursuing PhD degree (or other degree with significant research and innovation experience) in a relevant discipline (e.g. machine learning, computer science, electrical engineering, math, statistics, etc.)
  • Track record of world-class innovative contributions and ideas in machine learning
  • Experience with innovative solution development, such as developing proofs-of-concept, first-of-a-kind solutions, and/or technology transfer
  • Experience in deep learning research
  • Experience in developing deep learning software with high proficiency in data structures and algorithms
  • Strong programming skills and experience with Python, C/C++, and preferably Java
  • Software development experience in Deep Learning, GPU acceleration, and Model Optimization
  • Experience in Deep Learning and Machine Learning frameworks and models like Tensorflow, PyTorch
  • Experience in Transformer Neural Network architectures for Generative AI and natural language processing
  • Experience with Agentic AI and Generative AI workflows - desired
Job Responsibility
Job Responsibility
  • Conduct research and come up with solutions with a fast turnaround time
  • Build the software and applications for Neural Networks and Machine Learning
  • Work with system programming, Deep Learning frameworks and models, GPU acceleration, Model optimization, real-time streaming data, distributed computing, and deployment
  • Provide thought leadership and technical influence both internally and externally to HPE
  • Collaborate with HPE Labs research teams as well as external partners
  • Work in alignment with HPE's broader innovation community.
What we offer
What we offer
  • Health & Wellbeing benefits including physical, financial and emotional wellbeing support
  • Personal and professional development programs
  • Unconditional inclusion and flexibility to manage work and personal needs.
  • Fulltime
Read More
Arrow Right

Research Intern - AI Systems & Architecture

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Mountain View
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples
Job Responsibility
Job Responsibility
  • Investigate emerging AI system architectures and analyze how hardware, software, and model behavior interact across large-scale inference workloads
  • Develop and evaluate analytical or simulation-based performance models to identify system bottlenecks, scalability limits, and optimization opportunities
  • Prototype or assess new inference mechanisms, including disaggregated execution, sparse/expert model scaling, and hierarchical attention techniques
  • Explore next-generation accelerator, memory-architecture, and interconnect technologies, assessing their architectural trade-offs and cost implications
  • Conduct experiments, synthesize research findings, and communicate results to mentors and collaborating researchers
  • Collaborate with fellow interns and researchers to advance new ideas in AI systems and architectural design
  • Fulltime
Read More
Arrow Right

Director, Digital Ecosystem Applications

This position is responsible for the Software Platforms group at the Innovation ...
Location
Location
United States , Belmont
Salary
Salary:
240000.00 - 285000.00 USD / Year
https://www.volkswagen-group.com Logo
Volkswagen AG
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years with 2+ years in a technical leadership role
  • CS, EE, M.S. Engineering (or equivalent) REQUIRED
  • M.S. Engineering (or equivalent) or PhD PREFERRED
  • Analytical and conceptual thinking – using logic and reason, creative and strategic
  • Communication skills – interpersonal, presentation and written
  • Managing interdisciplinary teams on individual projects
  • Integration – joining people, processes or systems
  • Influencing and negotiation skills
  • Problem solving
  • Resource management
Job Responsibility
Job Responsibility
  • Define the technical mission, architecture strategy, and long‑term platform vision for the In‑Vehicle Computing & Digital Ecosystem Applications team, spanning Android Automotive OS (AAOS), in‑vehicle compute platforms, Software‑Defined Vehicle (SDV) architecture, and AI‑driven cockpit intelligence
  • Provide technical leadership across the full software stack, including Android Framework, System Services, HAL layers, middleware, connectivity stacks, media/audio frameworks, HMI toolchains, and cloud‑connected AI runtimes within an SDV‑aligned architecture
  • Lead and mentor engineering teams in platform bring‑up, system integration, performance optimization, and development of AI‑agentic features, multimodal interaction models, and next‑generation speech technologies
  • Manage multi‑year budgets for platform development, AI integration, SDV‑aligned compute evolution, SoC evaluations, cloud services, and prototype programs
  • Deliver executive‑level technical reporting on architecture decisions, platform readiness, SDV integration milestones, AI progress, risks, and strategic recommendations
  • Drive strategic planning for ICC’s infotainment and cockpit portfolio, including AAOS evolution, hybrid cloud/edge AI pipelines, intelligent mobile agent technologies, and SDV‑centric software and compute roadmaps
  • Align technical roadmaps with global VW Group Innovation teams across infotainment, connectivity, AI/ML, vehicle architecture, cloud services, and SDV platform strategy, ensuring cross‑platform consistency and shared component reuse
  • Build strategic relationships with SoC vendors, Tier‑1 suppliers, cloud providers, and AI technology partners to influence cockpit compute and SDV platform evolution
  • Maintain partnerships with Silicon Valley companies specializing in AI runtimes, LLMs, speech, multimodal interaction, and automotive‑grade SDV‑compatible software frameworks
  • Collaborate with academic and research institutions on AI‑agentic systems, embedded ML, HMI, and in‑vehicle compute architectures aligned with SDV principles
What we offer
What we offer
  • Eligibility for annual performance bonus
  • Healthcare benefits
  • 401(k), with company match
  • Defined contribution retirement program
  • Tuition reimbursement
  • Company lease car program
  • Paid time off
  • Fulltime
Read More
Arrow Right

Research Intern - AI Frameworks (Network Systems and Tools)

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Redmond
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
  • Research experience in areas such as computer architecture, AI/ML systems, performance modeling, distributed systems, or hardware–software co-design
  • Programming skills in Python, C/C++ with experience building prototypes, simulators, or performance analysis tools
  • Familiarity with modern AI workloads and/or deep learning frameworks (e.g., PyTorch)
  • Demonstrated ability to define and pursue original research directions in AI systems or architecture
  • Ability to collaborate effectively with researchers across disciplines and work in cross-group, cross-cultural environments
  • Proficient communication and presentation skills for sharing complex technical insights
  • Ability to think creatively and approach system and architecture challenges with unconventional or innovative solutions
  • Experience with PyTorch, CUDA, Triton, or performance-simulation tools
  • Background in large-scale system design, AI inference bottleneck analysis, or modeling cost/performance tradeoffs
Job Responsibility
Job Responsibility
  • Investigate and evaluate emerging disaggregated KV cache architectures
  • Implement a hierarchical storage architecture with multiple tiers GPU Memory: Active working set of KV caches currently used by the model CPU DRAM: Hot cache for recently used KV chunks using pinned memory for efficient GPU-CPU transfers Local Storage: Large-scale local caching (NVMe, local disk)
  • Build Peer-to-Peer (P2P) service KV cache sharing architecture that enables direct, high-performance cache transfer between multiple LLM serving instances without requiring centralized cache servers
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Research

As a Member of Technical Staff on the Research team, you’ll push the boundaries ...
Location
Location
United States , San Mateo
Salary
Salary:
175000.00 - 240000.00 USD / Year
fireworks.ai Logo
Fireworks AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Research background in Artificial Intelligence, Machine Learning, Physics, or similar field
  • Experience solving analytical problems using analytic and quantitative approaches
  • Experience communicating research to audiences with different backgrounds
  • Experience coding in C/C++, Python, or other similar languages
Job Responsibility
Job Responsibility
  • Conduct foundational research to advance the capabilities, efficiency, and reliability of LLMs and multimodal systems
  • Design, implement, and evaluate novel model architectures, training methods, and optimization techniques
  • Collaborate with engineering teams to transition research prototypes into production-grade systems
  • Analyze empirical results, identify performance bottlenecks, and iterate quickly to improve model quality
  • Contribute to internal research strategy by identifying high-impact opportunities and emerging trends in AI
What we offer
What we offer
  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package
  • Fulltime
Read More
Arrow Right

AI Engineer

Reporting to the AI & Technology Oversight Manager, the AI Engineer is responsib...
Location
Location
India , Mumbai
Salary
Salary:
Not provided
waystone.com Logo
Waystone Governance Ltd.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep understanding of the distinction between Generative AI and Agentic AI, including their foundations, capabilities, and appropriate use cases
  • Strong understanding of AI, ML and LLM concepts, including prompt engineering, prompt grounding, iterative loop techniques, context windows, embeddings, RAG, agentic workflows
  • Proven ability to integrate AI capabilities both into low-code automation flows and high-code stacks, including, applications, APIs, microservices, distributed systems, and development or testing tools
  • Solid software development background with hands-on coding experience in one or more engineering ecosystem such as .NET (C#), Python, or TypeScript
  • Excellent communication skills, with the ability to translate complex AI concepts for non‑experts and to effectively influence and collaborate with stakeholders at all levels, both technical and non‑technical
  • Strong writing skills, with the ability to contribute to AI literacy and AI fluency documentation
  • Strong understanding of responsible AI principles, including governance, bias mitigation, compliance, and risk-based decision-making
  • Analytical thinking with excellent problem‑solving ability and keen attention to details
  • Ability to mentor developers and testers, and to drive innovation across engineering, QA, and architecture
  • Ability to assess AI‑enabled capabilities in third‑party SaaS platforms (e.g., Appian, Salesforce,etc) and provide guidance on responsible, effective adoption
Job Responsibility
Job Responsibility
  • Hands-on contributor to the design and development of AI-enabled solutions, capable of writing both production-quality code and rapid experimental prototypes
  • Develop and implement AI‑enabled microservices, APIs, applications, and internal tools
  • Integrate AI capabilities following secure, scalable engineering best practices
  • Design, build and validate AI‑driven solutions leveraging providers such as OpenAI and Anthropic
  • Enhance low‑code/no‑code automation platforms (e.g., Power Automate, n8n, Workato) by embedding intelligent processing and applying agentic patterns where relevant
  • Implement Model Context Protocol (MCP) servers for secure AI‑to‑system connectivity
  • Lead AI‑based document parsing and intelligent data extraction initiatives
  • Contribute to educating and enabling Enterprise Capabilities areas, including Integration and Automation, by providing guidance, training, and best practices, e.g., on effective use of n8n agents
  • Engage with business stakeholders to understand requirements, constraints, and key drivers, identifying and implementing high‑value AI opportunities across Waystone
  • Prototype AI features and iterate towards production‑ready capabilities
  • Fulltime
Read More
Arrow Right

AI Engineer

Reporting to the AI & Technology Oversight Manager, the AI Engineer is responsib...
Location
Location
United Kingdom , Leeds
Salary
Salary:
Not provided
waystone.com Logo
Waystone Governance Ltd.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep understanding of the distinction between Generative AI and Agentic AI, including their foundations, capabilities, and appropriate use cases
  • Strong understanding of AI, ML and LLM concepts, including prompt engineering, prompt grounding, iterative loop techniques, context windows, embeddings, RAG, agentic workflows
  • Proven ability to integrate AI capabilities both into low-code automation flows and high-code stacks, including, applications, APIs, microservices, distributed systems, and development or testing tools
  • Solid software development background with hands-on coding experience in one or more engineering ecosystem such as .NET (C#), Python, or TypeScript
  • Excellent communication skills, with the ability to translate complex AI concepts for non‑experts and to effectively influence and collaborate with stakeholders at all levels, both technical and non‑technical
  • Strong writing skills, with the ability to contribute to AI literacy and AI fluency documentation
  • Strong understanding of responsible AI principles, including governance, bias mitigation, compliance, and risk-based decision-making
  • Analytical thinking with excellent problem‑solving ability and keen attention to details
  • Ability to mentor developers and testers, and to drive innovation across engineering, QA, and architecture
  • Ability to assess AI‑enabled capabilities in third‑party SaaS platforms (e.g., Appian, Salesforce,etc) and provide guidance on responsible, effective adoption
Job Responsibility
Job Responsibility
  • Hands-on contributor to the design and development of AI-enabled solutions, capable of writing both production-quality code and rapid experimental prototypes
  • Develop and implement AI‑enabled microservices, APIs, applications, and internal tools
  • Integrate AI capabilities following secure, scalable engineering best practices
  • Design, build and validate AI‑driven solutions leveraging providers such as OpenAI and Anthropic
  • Enhance low‑code/no‑code automation platforms (e.g., Power Automate, n8n, Workato) by embedding intelligent processing and applying agentic patterns where relevant
  • Implement Model Context Protocol (MCP) servers for secure AI‑to‑system connectivity
  • Lead AI‑based document parsing and intelligent data extraction initiatives
  • Contribute to educating and enabling Enterprise Capabilities areas, including Integration and Automation, by providing guidance, training, and best practices, e.g., on effective use of n8n agents
  • Engage with business stakeholders to understand requirements, constraints, and key drivers, identifying and implementing high‑value AI opportunities across Waystone
  • Prototype AI features and iterate towards production‑ready capabilities
  • Fulltime
Read More
Arrow Right

Research Associate

The role involves accelerating research towards new applications, core methodolo...
Location
Location
United States , Milpitas
Salary
Salary:
43.27 - 93.15 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
May 26, 2026
Flip Icon
Requirements
Requirements
  • Pursuing a PhD degree (or other degree with significant research and innovation experience) in a relevant discipline (e.g. electrical engineering, computer science, machine learning, applied physics, mathematics, statistics, etc.)
  • Track record of world-class innovative contributions and ideas in AI/ML and/or related areas
  • Experience with innovative solution development, such as developing proofs-of-concept, first-of-a-kind solutions, and/or technology transfer
  • Experience with in-memory computing accelerators
  • Experience with micro architecture design for custom accelerators
  • Experience in deep learning research, algorithms, and data structures
  • Experience in design and test of custom integrated circuit IP for AI/ML applications
  • Experience with emerging analog memory devices for computing applications such as memristor, ReRAM, PCM, and others
  • Experience in system software, GPU acceleration, DL model execution and performance optimization
  • Self-motivated, proactive, with leadership qualities
Job Responsibility
Job Responsibility
  • Investigation of high-performance accelerators which combine CMOS and emerging ReRAM device technologies (or memristors) for computing applications including machine learning, neuromorphic computing, network security, finite automata, and other novel computational models
  • Design of prototype systems and/or integrated circuits
  • Invention of new architectures, circuits, and/or devices to take advantage of physical hardware systems for acceleration of target computations
  • Operation of existing hardware platforms
  • Performance evaluations with competing systems
  • Provide thought leadership and technical influence both internally and externally to HPE
  • Contribute along the full range from initial novel ideas to design, development, implementation, evaluation, and technology transfer
  • Collaborate with HPE Labs research teams as well as with external partners
  • Work in alignment with HPE's broader innovation community
What we offer
What we offer
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Programs catered to helping employees reach career goals
  • Unconditional inclusion in work environment
  • Fulltime
Read More
Arrow Right