CrawlJobs Logo

Research Intern - AI System Architecture Modeling and Performance

United States, Hillsboro Employment contract 6710.00 - 13270.00 USD / Month · Job Posted March 25, 2026
Apply Position
Job Link Share

Job Description

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment. The Azure Hardware and Systems Infrastructure organization is central to defining Microsoft's first-party Artificial Intelligence (AI) infrastructure architecture and strategy. This is a dynamic and fast-paced environment that in close partnership with sister organizations helps define System on Chip (SoC) designs, interconnect topologies, memory hierarchies, and much more, all in the context of enabling and optimizing workload optimized data flows for large-scale AI models. This organization plays a critical role in roadmap definition all the way from concept to silicon to hyperscale integration.

Job Responsibility

  • Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life
  • Research Interns not only advance their own careers, but they also contribute to exciting research and development strides
  • During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community
  • As a Research Intern, you will be at the forefront of hardware/software co-design and have a direct impact in answering critical questions around designing an optimized AI system and evaluating real-world impact on the Azure’s supporting hyperscale infrastructure
  • This role will evaluate opportunities to co-optimize central processing unit (CPU), graphics processing unit (GPU) and networking infrastructure for the Maia accelerator ecosystem
  • You will be expected to identify system stress points, propose novel architectural ideas, and create methodologies using a combination of workload characterization, modeling and benchmarking to evaluate their effectiveness

Requirements

  • Accepted or currently enrolled in a PhD program in Computer Science or related STEM field
  • At least 1 year of experience with performance analysis tools and methodologies, optimization and modeling
  • Proficiency with frameworks such as PyTorch, SGLang, Dynamo, and AI accelerator programming models/compilers such as CUDA and Triton
  • Deep understanding of GPU and AI architectures including memory hierarchies, compute-communication interplay, kernel scheduling and interconnect properties
  • Familiarity with CPU/server architectures including understanding of PCIe topologies and accelerator/NIC/peripheral demand. Solid understanding of CPU involvement in dispatching, scheduling and orchestration of input data pipelines to AI accelerators
  • Hands-on experience with benchmarking, profiling, identifying perf bottlenecks and performance analysis and optimization, including trace generation, event monitoring and instrumentation
  • Familiarity with roofline performance modeling, detailed performance simulations and awareness of speed vs accuracy tradeoffs in various performance modeling methodologies
  • Ability to apply the appropriate performance analysis methodology including devising new or combinatorial approaches in evaluating complex system architecture what-if scenarios
  • Solid verbal and written communication skills

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Research Intern - AI System Architecture Modeling and Performance

8 matching positions

Research Intern - AI Systems & Architecture

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Mountain View
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples
Job Responsibility
Job Responsibility
  • Investigate emerging AI system architectures and analyze how hardware, software, and model behavior interact across large-scale inference workloads
  • Develop and evaluate analytical or simulation-based performance models to identify system bottlenecks, scalability limits, and optimization opportunities
  • Prototype or assess new inference mechanisms, including disaggregated execution, sparse/expert model scaling, and hierarchical attention techniques
  • Explore next-generation accelerator, memory-architecture, and interconnect technologies, assessing their architectural trade-offs and cost implications
  • Conduct experiments, synthesize research findings, and communicate results to mentors and collaborating researchers
  • Collaborate with fellow interns and researchers to advance new ideas in AI systems and architectural design
  • Fulltime
Read More
Arrow Right

Research Intern - AI Frameworks (Network Systems and Tools)

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Redmond
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
  • Research experience in areas such as computer architecture, AI/ML systems, performance modeling, distributed systems, or hardware–software co-design
  • Programming skills in Python, C/C++ with experience building prototypes, simulators, or performance analysis tools
  • Familiarity with modern AI workloads and/or deep learning frameworks (e.g., PyTorch)
  • Demonstrated ability to define and pursue original research directions in AI systems or architecture
  • Ability to collaborate effectively with researchers across disciplines and work in cross-group, cross-cultural environments
  • Proficient communication and presentation skills for sharing complex technical insights
  • Ability to think creatively and approach system and architecture challenges with unconventional or innovative solutions
  • Experience with PyTorch, CUDA, Triton, or performance-simulation tools
  • Background in large-scale system design, AI inference bottleneck analysis, or modeling cost/performance tradeoffs
Job Responsibility
Job Responsibility
  • Investigate and evaluate emerging disaggregated KV cache architectures
  • Implement a hierarchical storage architecture with multiple tiers GPU Memory: Active working set of KV caches currently used by the model CPU DRAM: Hot cache for recently used KV chunks using pinned memory for efficient GPU-CPU transfers Local Storage: Large-scale local caching (NVMe, local disk)
  • Build Peer-to-Peer (P2P) service KV cache sharing architecture that enables direct, high-performance cache transfer between multiple LLM serving instances without requiring centralized cache servers
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, AI Research - Multimodal Pretraining

Meta is seeking Research Scientist Interns in the multimodal pretraining team in...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Machine Learning, Computer Vision, Artificial Intelligence, or relevant technical field
  • Past projects/publications in the general domain of neural scaling laws, model architectures, image/text modeling, vision-language modeling
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience in PyTorch, Triton, or other related programming languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility
Job Responsibility
  • Perform research to advance the frontiers of multimodal (images, video, text, audio, and other modalities) pretraining, to develop the next generation of multimodal architectures
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Publish research results and contribute to research that can be applied to Meta product development
Read More
Arrow Right

Research Scientist Intern, AI & Compute Foundation - MTIA Software

The MTIA (Meta Training & Inference Accelerator) Software team is part of the AI...
Location
Location
Canada , Toronto
Salary
Salary:
6240.00 - 10334.00 CAD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, PhD degree in the field of Computer Science or a related STEM field
  • C/C++ programming skills
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
  • Knowledge of Computer Architecture and Distributed systems with interest in one or more of High Performance Computing, Numerics, Performance and AI hardware including compute, networking and storage
Job Responsibility
Job Responsibility
  • Development of Software stack with one of the following core focus areas: AI frameworks, compiler stack, high performance kernel development and acceleration onto next generation of hardware architectures
  • Contribute to the development of the industry-leading PyTorch AI framework core compilers to support new state of the art inference and training AI hardware accelerators and optimize their performance
  • Analyze deep learning networks, develop & implement compiler optimization algorithms
  • Collaborating with AI research scientists to accelerate the next generation of deep learning models such as Recommendation systems, Generative AI, Computer vision, NLP etc
  • Performance tuning and optimizations of deep learning framework & software components
Read More
Arrow Right

Ai And Application Development Engineer

The Applications Development Senior Programmer Analyst is an intermediate level ...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Core Java: Strong understanding of Java (JDK 8+, preferably Java 11/17), including multithreading, collections, garbage collection, and JVM internals
  • Frameworks: Extensive experience with Spring Framework (Spring Boot, Spring MVC, Spring Data JPA, Spring Security)
  • Middleware: Proven experience in designing and developing RESTful APIs and microservices
  • Database Technologies (Must): Relational Databases: Strong proficiency in SQL and experience with Oracle databases, including schema design, query optimization, and stored procedures
  • NoSQL Databases: Experience with MongoDB, including data modeling, querying, and performance tuning
  • CI/CD & DevOps: Hands-on experience with CI/CD tools and practices (e.g., Jenkins, GitLab CI, GitHub Actions, Maven/Gradle, Docker, Kubernetes)
  • Version Control: Proficiency with Git and standard branching strategies (e.g., Gitflow)
  • Testing: Experience with unit testing frameworks (JUnit, Mockito) and integration testing
  • Web Technologies (Beneficial): Familiarity with web services (SOAP/REST), XML, JSON
  • AI Tools & Methodologies (Must): Demonstrable exposure and practical experience with AI development tools such as Devin, GitHub Copilot, Claude, Anti Gravity, and Codex
Job Responsibility
Job Responsibility
  • Lead the design, development, and implementation of complex middleware applications using Java and Spring Boot: Utilize AI-powered code generation tools (e.g., Devin, Copilot, Codex) to accelerate development, automate boilerplate code, suggest optimal implementations, and enforce architectural patterns
  • Architect and optimize database interactions with Oracle, SQL, and MongoDB, ensuring high performance and data integrity: Employ AI to analyze database query performance, suggest advanced indexing strategies, optimize schema designs, and generate efficient SQL/NoSQL queries
  • Drive the adoption and continuous improvement of CI/CD pipelines to facilitate rapid and reliable software delivery: Integrate AI into CI/CD processes for intelligent test case generation, predictive failure analysis, automated code vulnerability scanning, and optimization of pipeline execution times based on historical data
  • Collaborate with cross-functional teams, including product management, QA, and operations, to define requirements, design solutions, and deliver high-quality software: Use AI-powered communication and summarization tools (e.g., Claude) to streamline requirement gathering, document analysis, and stakeholder communication
  • Mentor and provide technical guidance to junior and mid-level software engineers, fostering a culture of technical excellence and continuous learning: Leverage AI platforms for personalized learning paths, automated code feedback, and explanations of complex technical concepts
  • Actively research and experiment with AI technologies to identify opportunities for enhancing developer productivity, automating tasks, and improving software quality: Continuously explore emerging AI tools and techniques (such as Anti Gravity for complex problem-solving) and assess their applicability to our development ecosystem
  • Participate in code reviews, ensuring adherence to coding standards, best practices, and architectural guidelines: Utilize AI-powered code analysis tools to pre-scan code for potential bugs, security vulnerabilities, performance bottlenecks, and style deviations, allowing human reviewers to focus on higher-level logic and design
  • Troubleshoot and resolve complex technical issues, ensuring the stability and performance of production systems: Implement AI-driven anomaly detection in monitoring systems, leverage AI for rapid log analysis and root cause identification, and automate incident response workflows
  • Contribute to the strategic planning and technical roadmap for our middleware platforms: Employ AI to analyze industry trends, forecast technology evolution, assess the impact of new features, and prioritize roadmap initiatives based on data-driven insights
  • Conduct tasks related to feasibility studies, time and cost estimates, IT planning, risk technology, applications development, and model development: Utilize AI for data synthesis, predictive modeling for estimations, identification of potential IT risks, and accelerated model prototyping and validation
  • Fulltime
Read More
Arrow Right

Research Intern - Copilot Tuning

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Redmond
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Accepted or currently enrolled in a PhD program in Computer Science or related STEM field
  • Ability to work in a dynamic, collaborative environment, through effective communication, with a multi-disciplinary team
  • Track record of publications in the field of Artificial Intelligence, Machine Learning, and/or scientific journals
  • Interest in real-world applications and impact
  • Demonstrable ability to define an ambitious, original research agenda
  • Experience in Python software development, ideally demonstrated by published software projects (e.g., Github)
  • Experience implementing new AI systems or algorithms and running experiments and analyses to study and evaluate their performance
  • Experience with training or post-training of LLMs, multimodal models, diffusion models, reasoning models, computer use agents, or building other related architectures or AI systems
  • Willingness to embrace knowledge outside your field of research interest
Job Responsibility
Job Responsibility
  • Perform cutting-edge and thorough research in collaboration with other researchers, applied scientists, and engineers
  • Prepare detailed report of methodologies and findings, present results and insights, and demonstrate working prototypes
  • Apply your in-depth knowledge, problem-solving skills, and drive to innovate to solve new challenges in the field and be given an opportunity to realize your ideas in products and services used worldwide
  • Embody our culture and values
Read More
Arrow Right

AI Research Engineer

We are looking for an AI Research Engineer to join the PAIR team and play a cent...
Location
Location
North Macedonia , Skopje
Salary
Salary:
Not provided
hornetsecurity.com Logo
Hornetsecurity
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Applied AI research engineer with at least 3 years of experience in backend development or AI in production
  • Strong command of Go (microservices, REST/gRPC, high performance) and Python for AI/ML
  • Solid experience with cloud-native architectures: Docker, Kubernetes, CI/CD, observability, distributed systems, and real-time services
  • AI/ML/NLP skills: LLMs, embeddings, classification, text generation, model evaluation
  • Proven ability to design, optimize, and deploy scalable AI services in production
  • Scientific curiosity, autonomy, rigor, and strong teamwork skills
  • Excellent communication skills, documentation abilities, and the capacity to simplify complex topics
  • Professional fluency in English, both written and spoken
Job Responsibility
Job Responsibility
  • End-to-End Ownership of AI Solutions: Design, develop, and maintain AI services from prototype to production
  • Ensure robustness, performance, scalability, and operational reliability of solutions in industrial settings
  • Rigorous Experimentation & Applied Research: Methodically test and benchmark AI models (standards, metrics, comparisons)
  • Document results and propose innovative solutions tailored to cybersecurity challenges
  • Innovation & Technology Watch: Maintain active and structured monitoring of advances in AI/ML, LLMs, agents, NLP, as well as DevOps and MLOps best practices
  • Anticipate technological developments and contribute to the technical roadmap
  • Technical Leadership, Documentation & Collaboration: Be a key contributor to technical quality, knowledge sharing, and internal communication
  • Produce clear documentation and provide technical support to teams
What we offer
What we offer
  • Free space for innovation and independent action in a fast-growing international company
  • Short decision paths and flat hierarchies in an open work atmosphere
  • Extensive onboarding with a welcome kit, 2-day Onboarding Bootcamp, a Mentoring Program, and regular feedback meetings
  • Temporary Employee Exchange Program – we provide the ability for you to work at our global office locations and explore the world (e.g. Berlin, Madrid, Malta, Montréal)
  • Home-office-option (in a hybrid setting) and flexible working time
  • Team events like Laser Tag, Office Movie Nights, Foodie Fridays and much more
  • Fit Kit subscription and private insurance for your health
  • Referral Bonus – 1500€ for each successful referral
  • Fulltime
Read More
Arrow Right

Gen AI Intern

Gen AI Intern
Location
Location
India , Bangalore Area
Salary
Salary:
Not provided
airbus.com Logo
Airbus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Python skills
  • Experience with Data Wrangling and Preprocessing
  • Familiarity with LangChain, LlamaIndex, and VectorDB
  • Good understanding of transformer architecture, embeddings, and tokenizers
  • Experience with machine learning frameworks like TensorFlow or PyTorch
  • Understanding of LLMs, fine-tuning techniques, and RAG methodologies
  • Proficiency in version control systems (e.g., Git) and software development best practices
  • Pursuing technical Under Graduate or post graduate in B.E/B.Tech OR M.Tech/M.E in Computer Science, Electricals, Electronics or relevant degree
  • Awareness of any potential compliance risks and a commitment to act with integrity
Job Responsibility
Job Responsibility
  • Engage in research and implementation of Retrieval-Augmented Generation (RAG) and other advanced techniques to enhance Language model capabilities
  • Participate in data preparation for fine-tuning processes
  • Assist in setting up and optimizing the pipeline for fine-tuning, Quantization
  • Participate in the training of small transformer models, ensuring performance and efficiency
  • Contribute to the development of an internal orchestration framework for LLM
  • Collaborate with senior engineers to integrate LLM S\W Components into the existing system architecture
  • Support the deployment and monitoring of models in production environments
  • Document processes, workflows, and findings to ensure knowledge sharing and reproducibility
  • Fulltime
Read More
Arrow Right