Research Intern - Azure Storage

Microsoft Corporation

Location:
United States, Redmond

Contract Type:
Not provided

Salary:
6710.00 - 13270.00 USD / Month

Job Description:

Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment. As a Research Intern you will be working in the Azure Storage group, conducting research on leading edge storage systems. You will have an opportunity to optimize the performance and efficiency of storage systems at enormous scale and work on extremely challenging problems. Current work includes designing storage systems optimized for artificial intelligence (AI) workloads including training, checkpointing and inferencing. You will gain insights about these AI workloads from our work with leading edge AI customers, both external and internal.

Job Responsibility:

  • Put inquiry and theory into practice
  • Learn, collaborate, and network for life
  • Contribute to exciting research and development strides
  • Work with an assigned mentor, collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community

Requirements:

  • At least 3 years of research and/or experience in storage systems
  • Currently enrolled in a PhD program in Computer Science
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • Applicants must submit a minimum of two reference letters for this position, as well as a cover letter and any relevant work or research samples

Nice to have:

  • A PhD advisor who does research in storage systems

Additional Information:

Job Posted:
February 13, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work

Similar Jobs for Research Intern - Azure Storage

Machine Learning Platform / Backend Engineer

We are seeking a Machine Learning Platform/Backend Engineer to design, build, an...
Location:
Serbia; Romania, Belgrade; Timișoara
Salary:
Not provided
Everseen
Expiration Date:
Until further notice
Requirements:
  • 4-5+ years of work experience in either ML infrastructure, MLOps, or Platform Engineering
  • Bachelor's degree or equivalent focusing on the computer science field is preferred
  • Excellent communication and collaboration skills
  • Expert knowledge of Python
  • Experience with CI/CD tools (e.g., GitLab, Jenkins)
  • Hands-on experience with Kubernetes, Docker, and cloud services
  • Understanding of ML training pipelines, data lifecycle, and model serving concepts
  • Familiarity with workflow orchestration tools (e.g., Airflow, Kubeflow, Ray, Vertex AI, Azure ML)
  • A demonstrated understanding of the ML lifecycle, model versioning, and monitoring
  • Experience with ML frameworks (e.g., TensorFlow, PyTorch)
Job Responsibility:
  • Design, build, and maintain scalable infrastructure that empowers data scientists and machine learning engineers
  • Own the design and implementation of the internal ML platform, enabling end-to-end workflow orchestration, resource management, and automation using cloud-native technologies (GCP/Azure)
  • Design and manage Kubernetes-based infrastructure for multi-tenant GPU and CPU workloads with strong isolation, quota control, and monitoring
  • Integrate and extend orchestration tools (Airflow, Kubeflow, Ray, Vertex AI, Azure ML or custom schedulers) to automate data processing, training, and deployment pipelines
  • Develop shared services for model behavior/performance tracking, data/dataset versioning, and artifact management (MLflow, DVC, or custom registries)
  • Build out documentation covering architecture, policies, and operations runbooks
  • Share skills, knowledge, and expertise with members of the data engineering team
  • Foster a culture of collaboration and continuous learning by organizing training sessions, workshops, and knowledge-sharing sessions
  • Collaborate and drive progress with cross-functional teams to design and develop new features and functionalities
  • Ensure that the developed solutions meet project objectives and enhance user experience
Employment Type:
Fulltime

Principal Software Engineer - Azure Blob Storage

The Azure Blob Storage team is focused on building the data foundation that supp...
Location:
United States, Multiple Locations
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
  • Bachelor's Degree in Computer Science OR related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, OR Python OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 3+ years of experience in designing and developing reliable, high-performance distributed systems with a strong focus on scalability and resilience
  • Demonstrated experience with distributed systems and large-scale storage, including solving complex systems problems and working on storage internals such as availability, remote replication, durability schemes, and high-performance I/O
  • Practical knowledge in one or more technical domains: database internals, indexing and search techniques, caching and data placement strategies, and metadata/namespace management at scale
  • Ability to influence technical direction and contribute to architecture decisions across teams, fostering collaboration and innovation
Job Responsibility:
  • Design and deliver next-generation distributed storage systems optimized for AI/ML workloads, from training to inferencing
  • Provide technical leadership across architecture, development, testing, and deployment at hyperscale
  • Drive innovations in scalability, performance, security, and efficiency to meet the demands of zettabyte-scale storage
  • Collaborate with cross-functional teams, from AI researchers to cloud infrastructure engineers, to ensure storage meets the needs of cutting-edge AI workloads
  • Mentor and guide other engineers, fostering a culture of technical excellence, innovation, and collaboration
  • Continuously evaluate emerging technologies and trends to anticipate the future of AI-native storage
Employment Type:
Fulltime

Staff Software Engineer - AI/ML Infra

GEICO AI platform and Infrastructure team is seeking an exceptional Senior ML Pl...
Location:
United States, Palo Alto
Salary:
90000.00 - 300000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science, Engineering, or related technical field (or equivalent experience)
  • 8+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
  • 3+ years of hands-on experience with machine learning infrastructure and deployment at scale
  • 2+ years of experience working with Large Language Models and transformer architectures
  • Proficient in Python; strong skills in Go, Rust, or Java preferred
  • Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
  • Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
  • Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
  • Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Job Responsibility:
  • Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
  • Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
  • Design, implement, and maintain feature stores for ML model training and inference pipelines
  • Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
  • Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
  • Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
  • Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
  • Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
  • Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
  • Evaluate and potentially implement hybrid cloud solutions with AWS/GCP as backup or specialized use cases
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Workplace flexibility, as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Staff Software Engineer - AI/ML Platform

GEICO AI platform and Infrastructure team is seeking an exceptional Senior ML Pl...
Location:
United States, Chevy Chase; New York City; Palo Alto
Salary:
115000.00 - 300000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science, Engineering, or related technical field (or equivalent experience)
  • 8+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
  • 3+ years of hands-on experience with machine learning infrastructure and deployment at scale
  • 2+ years of experience working with Large Language Models and transformer architectures
  • Proficient in Python; strong skills in Go, Rust, or Java preferred
  • Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
  • Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
  • Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
  • Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Job Responsibility:
  • Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
  • Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
  • Design, implement, and maintain feature stores for ML model training and inference pipelines
  • Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
  • Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
  • Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
  • Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
  • Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
  • Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
  • Evaluate and potentially implement hybrid cloud solutions with AWS/GCP as backup or specialized use cases
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Workplace flexibility, as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Staff Software Engineer - AI/ML Infra

GEICO AI platform and Infrastructure team is seeking an exceptional Senior ML Pl...
Location:
United States, Chevy Chase; New York City; Palo Alto
Salary:
115000.00 - 300000.00 USD / Year
Geico
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science, Engineering, or related technical field (or equivalent experience)
  • 8+ years of software engineering experience with focus on infrastructure, platform engineering, or MLOps
  • 3+ years of hands-on experience with machine learning infrastructure and deployment at scale
  • 2+ years of experience working with Large Language Models and transformer architectures
  • Proficient in Python; strong skills in Go, Rust, or Java preferred
  • Proven experience working with open source LLMs (Llama 2/3, Qwen, Mistral, Gemma, Code Llama, etc.)
  • Proficient in Kubernetes including custom operators, helm charts, and GPU scheduling
  • Deep expertise in Azure services (AKS, Azure ML, Container Registry, Storage, Networking)
  • Experience implementing and operating feature stores (Chronon, Feast, Tecton, Azure ML Feature Store, or custom solutions)
Job Responsibility:
  • Design and implement scalable infrastructure for training, fine-tuning, and serving open source LLMs (Llama, Mistral, Gemma, etc.)
  • Architect and manage Kubernetes clusters for ML workloads, including GPU scheduling, autoscaling, and resource optimization
  • Design, implement, and maintain feature stores for ML model training and inference pipelines
  • Build and optimize LLM inference systems using frameworks like vLLM, TensorRT-LLM, and custom serving solutions
  • Ensure 99.9%+ uptime for ML platforms through robust monitoring, alerting, and incident response procedures
  • Design and implement ML platforms using DataRobot, Azure Machine Learning, Azure Kubernetes Service (AKS), and Azure Container Instances
  • Develop and maintain infrastructure using Terraform, ARM templates, and Azure DevOps
  • Implement cost-effective solutions for GPU compute, storage, and networking across Azure regions
  • Ensure ML platforms meet enterprise security standards and regulatory compliance requirements
  • Evaluate and potentially implement hybrid cloud solutions with AWS/GCP as backup or specialized use cases
What we offer:
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Workplace flexibility, as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Sr Software Development Engineer - ML OPs

Everseen is a leader in vision AI. We are transforming business operations for g...
Location:
Serbia, Belgrade
Salary:
Not provided
Everseen
Expiration Date:
Until further notice
Requirements:
  • 4-5 years of work experience in a relevant role at a global SaaS company
  • Bachelor's degree or equivalent focusing on the computer science field is preferred
  • Excellent communication and cross-functional collaboration skills
  • Comfort working in ambiguous and fast-evolving environments
  • Expert knowledge of Python
  • Experience with CI/CD tools (e.g., GitLab, Jenkins)
  • Hands-on experience with Kubernetes, Docker, and cloud services
  • Understanding of ML training pipelines, data lifecycle, and model serving concepts
  • Familiarity with workflow orchestration tools (e.g., Airflow, Kubeflow, Ray, Vertex AI, Azure ML)
  • A demonstrated understanding of the ML lifecycle, model versioning, and monitoring
Job Responsibility:
  • Share skills, knowledge, and expertise with members of the data engineering team
  • Foster a culture of collaboration and continuous learning by organizing training sessions, workshops, and knowledge-sharing sessions
  • Collaborate and drive progress with cross-functional teams to design and develop new features and functionalities
  • Ensure that the developed solutions meet project objectives and enhance user experience
  • Influence the technology stack and internal technical improvements, contributing to strategic decision-making
  • Based on requirements and a longer-term product and feature strategy, design and implement reusable, testable, efficient, and elegant code
  • Ensure adherence to coding standards and best practices
  • Create, maintain, and run unit tests for new and existing applications and services
  • Aim to deliver defect-free and well-tested solutions
  • Analyze and collect data from various sources such as log files, application stack traces, and thread dumps
Employment Type:
Fulltime

Software Engineer II - Office AI Platform

Come join Microsoft Office AI team to work on state-of-the-art services, AI solu...
Location:
Serbia, Belgrade
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science OR related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, TypeScript, C#, Java, JavaScript, or Python OR equivalent experience
  • Distributed systems and cloud scale: proven ability to design, implement and operate scalable, fault-tolerant microservices and distributed storage on cloud platforms such as Azure; experience building APIs and services, designing data pipelines and feedback loops, and implementing secure, compliant solutions
  • AI and machine learning mastery: hands-on experience with state-of-the-art generative AI and ML frameworks and deep understanding of large language models, embeddings, prompt engineering and model fine-tuning
  • Technical leadership: demonstrated experience leading complex initiatives from concept to production, mentoring engineers, and driving technical decision-making while working with cross-functional teams
  • Security and compliance: ability to meet Microsoft’s and customers’ security and background requirements and to ensure that solutions adhere to responsible AI and data privacy standards
  • Demonstrated passion for mentoring, fostering an inclusive team culture, and collaborating closely with product, research and design partners to drive strategy and deliver user value
Job Responsibility:
  • Design and implement scalable, reliable, and secure AI services on Azure, optimizing performance, cost and compliance
  • Apply knowledge of generative AI, large language models, and modern frameworks to develop intelligent features and automation within the service
  • Participate actively in code reviews, bug/issue triage with the feature teams, and support well informed decisions towards business and engineering goals
  • Contribute to fixing, enhancing, and supporting our services in production, including periodic on-call duties where necessary
  • Have a sense of ownership and drive on well-defined but ambiguous problems
  • Proactively seek new knowledge and adapt to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products
  • Work with international teams within Microsoft
Employment Type:
Fulltime

Senior Software Engineer - Office AI Platform

Come join Microsoft Office AI team to work on state-of-the-art services, AI solu...
Location:
Serbia, Belgrade
Salary:
Not provided
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science OR related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, TypeScript, C#, Java, JavaScript, or Python OR equivalent experience
  • Distributed systems and cloud scale: proven ability to design, implement and operate scalable, fault-tolerant microservices and distributed storage on cloud platforms such as Azure; experience building APIs and services, designing data pipelines and feedback loops, and implementing secure, compliant solutions
  • AI and machine learning mastery: hands-on experience with state-of-the-art generative AI and ML frameworks and deep understanding of large language models, embeddings, prompt engineering and model fine-tuning
  • Technical leadership: demonstrated experience leading complex initiatives from concept to production, mentoring engineers, and driving technical decision-making while working with cross-functional teams
  • Security and compliance: ability to meet Microsoft’s and customers’ security and background requirements and to ensure that solutions adhere to responsible AI and data privacy standards
  • Demonstrated passion for mentoring, fostering an inclusive team culture, and collaborating closely with product, research and design partners to drive strategy and deliver user value
Job Responsibility:
  • Design and implement scalable, reliable, and secure AI services on Azure, optimizing performance, cost and compliance
  • Apply knowledge of generative AI, large language models, and modern frameworks to develop intelligent features and automation within the service
  • Participate actively in code reviews, bug/issue triage with the feature teams, and support well informed decisions towards business and engineering goals
  • Contribute to fixing, enhancing, and supporting our services in production, including periodic on-call duties where necessary
  • Drive features, projects, and open-ended problems towards resolution
  • Take initiative to identify and fix bottlenecks, and proactively propose further improvements to the system, process, and owned product
  • Provide mentorship, technical guidance, and peer coaching to other engineers, fostering a culture of innovation, continuous learning, and inclusion
  • Encourage best practices in code quality, security and responsible AI while elevating the skills of fellow team members
  • Work with international teams within Microsoft
Employment Type:
Fulltime