CrawlJobs Logo

Member of Technical Staff, High Performance Computing Engineer

United Kingdom, London · Job Posted March 01, 2026
Apply Position
Job Link Share

Job Description

Microsoft AI is looking for experienced Member of Technical Staff, High Performance Computing Engineers to help build and scale the infrastructure that trains our frontier models and powers the next evolution of our personal AI, Copilot. This role offers the unique opportunity to work on some of the largest scale supercomputers in the world.

Job Responsibility

  • Design, operate, and maintain large-scale HPC environments
  • Own the deployment, configuration, and day-to-day operation of HPC schedulers (e.g., SLURM, Kubernetes)
  • Serve as a technical owner for at least one core HPC domain (GPU compute, high-performance storage, networking, or similar)
  • Develop and maintain automation and tooling using Bash and/or Python
  • Partner closely with researchers and engineers to support their workloads, troubleshoot cluster usage issues, and triage failed or underperforming jobs
  • Drive work forward independently by navigating ambiguity and technical roadblocks
  • Enjoy working in a fast-paced, design-driven product development environment
  • Embody our Culture and Values

Requirements

  • Bachelor’s degree in computer science, or related technical field AND 4+ years technical engineering experience with deploying or operating on-premise or cloud high-performance clusters
  • 4+ years experience working with high-scale training clusters (ex. working with frameworks/tools such as nvidia InfiniBand clusters, SLURM, Kubernetes, Ray, etc.)
  • 4+ years experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP
  • OR equivalent experience

Nice to have

  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with deploying or operating on-premise or cloud high-performance clusters
  • 6+ years experience working with high-scale training clusters
  • 6+ years experience building scalable services on top of public cloud infrastructure
  • OR equivalent experience
  • Experience with LLM training clusters
  • Experience working with AI platforms, frameworks, and APIs
  • Experience using Machine Learning frameworks, including experience using, deploying, and scaling language learning models
  • Experience working with large-scale HPC or GPU systems (ex. NVIDIA H100/GB200 or equivalent)
  • Ability to identify, analyze, and resolve complex technical issues
  • Dedication to writing clean, maintainable, and well-documented code
  • Demonstrated interpersonal skills and ability to work closely with cross-functional teams
  • Ability to clearly communicate complex technical concepts
  • Passion for learning new technologies
  • Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements
  • Proven ability to collaborate and contribute to a positive, inclusive work environment

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Member of Technical Staff, High Performance Computing Engineer

8 matching positions

Member of Technical Staff, High Performance Computing Engineer

As Microsoft AI we are pushing the boundaries of technology. We are creating uni...
Location
Location
United States , Mountain View
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, or related technical discipline AND 6+ years technical engineering experience building web services with coding in languages including, but not limited to, Python, C#, C++, Rust, Java
  • OR equivalent experience
  • 6+ years of experience working with high-scale training clusters (ex. working with frameworks/tools such as nvidia InfiniBand clusters, SLURM, Kubernetes, Ray, etc.)
  • 6+ years' experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP
Job Responsibility
Job Responsibility
  • Build secure and performant AI Platform services that power Copilot
  • Work collaboratively with other Platform, infrastructure, application engineers as well as AI Researchers to build next generation AI products and services
  • Ship high-quality, well-tested, secure, and maintainable code
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - GPU Performance Engineer

Our models and workflows require performance work that generic frameworks don’t ...
Location
Location
United States , San Francisco; Boston
Salary
Salary:
Not provided
liquid.ai Logo
Liquid AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Authored custom CUDA kernels (not only calling cuDNN/cuBLAS)
  • Strong understanding of GPU architecture and performance: memory hierarchy, warps, shared memory/register pressure, bandwidth vs compute limits
  • Proficiency with low-level profiling (Nsight Systems/Compute) and performance methodology
  • Strong C/C++ skills
Job Responsibility
Job Responsibility
  • Write high-performance GPU kernels for our novel model architectures
  • Integrate kernels into PyTorch pipelines (custom ops, extensions, dispatch, benchmarking)
  • Profile and optimize training and inference workflows to eliminate bottlenecks
  • Build correctness tests and numerics checks
  • Build/maintain performance benchmarks and guardrails to prevent regressions
  • Collaborate closely with researchers to turn promising ideas into shipped speedups
What we offer
What we offer
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
  • Fulltime
Read More
Arrow Right

Member Of Technical Staff - Security Engineer

Copilot is becoming an agentic system: it can plan, reason, and take actions acr...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate in Statistics, Mathematics, Computer Science, or related field AND 3+ years of experience OR Master’s Degree AND 4+ years of experience OR Bachelor’s Degree AND 6+ years of experience in security engineering, secure software development, large-scale computing, threat modeling, or applied security analytics, including experience designing or building systems to detect, prevent, or mitigate security threats, or equivalent experience.
Job Responsibility
Job Responsibility
  • Design and build secure, high‑performance platform components that support Copilot’s agentic workflows across cloud and device environments
  • Develop novel security mechanisms for agentic AI systems, including real‑time intent validation, information‑flow controls, isolation boundaries, and abuse‑resistant orchestration
  • Eliminate entire classes of vulnerabilities by creating secure‑by‑default APIs, sandboxing layers, and hardened system interfaces
  • Build and operate offensive security tooling and agents that continuously probe Copilot’s autonomy, reasoning paths, and trust boundaries
  • Partner closely with AI researchers, platform engineers, and product teams to translate research and prototypes into production‑ready security features
  • Write high‑quality, well‑tested code across backend services, platform layers, and AI‑adjacent systems
  • Use telemetry, signals, and data‑driven analysis to detect abuse, anomalous agent behavior, and emerging threat patterns
  • Navigate ambiguity, make sound engineering tradeoffs, and ship iteratively in a fast‑paced product environment
  • Contribute to a culture of high ownership, technical excellence, and inclusive collaboration.
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Backend Engineer

Microsoft AI is looking for a talented Backend engineer to help build the next w...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 4+ years' experience building backend API for mobile apps such as GraphQL/Rest APIs/Protobuf/Thrift, and streaming protocols such as websocket/SSE/WebRTC with familiarity in backend and mobile data schema code generation or consistency, version control for mobile releases, analytics, feature flags, a/b testing framework
  • 4+ years' experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP. Extensive use datastores like RDBMS, key-value stores, etc
  • 4+ years' experience building distributed systems at scale and extensive systems knowledge that spans bare-metal hosts to containers to networking
Job Responsibility
Job Responsibility
  • Build secure and performant APIs that power Copilot apps
  • Work collaboratively with other product engineers, Product Managers, and platform engineers to take ambiguous projects and mold them into amazing experiences
  • Ship high-quality, well-tested, secure, and maintainable code
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Infrastructure Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, or related technical discipline AND 4+ years technical engineering experience building services and products in languages such as Python, C#, C++, Rust, Java
  • OR equivalent experience
  • 4+ years’ experience building scalable platforms on public cloud infrastructure like Azure, AWS, or GCP with extensive use of technologies like Docker, Kubernetes, nginx, RDBMS, key-value stores, etc
  • 4+ years’ experience in building and releasing production software at the platform level
  • Solid knowledge of APIs, data flows, systems, and services
Job Responsibility
Job Responsibility
  • Design, develop, and maintain performant and secure AI Platform services that power Copilot
  • Work collaboratively with platform, infrastructure, application engineers, and AI researchers to build next generation AI products and services
  • Ship high-quality and maintainable code, and ensure the reliability, scalability, and performance of platform components
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Data Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for i...
Location
Location
United States , New York
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • 4+ years technical engineering experience building data processing applications (batch and streaming) with coding in languages including, but not limited to, Python, Java, Spark, SQL
  • Experience working with Apache Hadoop eco system, Kafka, NoSQL, etc
  • 3+ years experience with data governance, data compliance and/or data security
  • 2+ years' experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP. Extensive use datastores like RDBMS, key-value stores, etc
  • 2+ years' experience building distributed systems at scale and extensive systems knowledge that spans bare-metal hosts to containers to networking
  • Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience
  • Dedication to writing clean, maintainable, and well-documented code with a focus on application quality, performance, and security
Job Responsibility
Job Responsibility
  • Build scalable data pipelines for sourcing, transforming and publishing data assets for AI use cases
  • Work collaboratively with other Platform, infrastructure, application engineers as well as AI Researchers to build next generation data platform products and services
  • Ship high-quality, well-tested, secure, and maintainable code
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member Of Technical Staff, Principal Software Engineer - Windows Copilot

As Microsoft continues to push the boundaries of AI we are on the lookout for pa...
Location
Location
United States , Redmond
Salary
Salary:
142800.00 - 331200.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Extensive hands-on experience using modern AI coding agents (e.g., Claude Code, Codex, Github Copilot, or similar tools), with a track record of: Acting as a team expert or go-to resource for AI-assisted/agentic development workflows
  • Driving adoption of agentic coding tools to improve engineering velocity and code quality
  • Defining best practices for prompt design, tool usage, and human-in-the-loop systems
Job Responsibility
Job Responsibility
  • Design and develop next generation products and features for Copilot experiences on Windows
  • Design and develop secure and performant platform services that support Copilot experiences on Windows
  • Work collaboratively with platform, infrastructure, application engineers and researchers to build next generation AI products and services
  • Ship high-quality, well-tested, secure, and maintainable code
  • Overcome obstacles to deliver work quickly and iteratively to users
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Fullstack Education Engineer

Empower Every Learner. Shape the Future of Education with Microsoft AI.Microsoft...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, JavaScript, TypeScript, React, CSS, Node.js
  • Experience building fullstack applications from the ground up, including architecture and deployment
  • Work in consumer-facing education products at scale
  • Working knowledge of web browsers, web protocols, UI/UX principles, application architecture, and performance profiling
  • Experience designing or implementing backend APIs, distributed systems, or service architectures
  • Demonstrated success driving growth outcomes—acquisition, engagement, retention—with concrete examples of features or systems you’ve built
  • Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in mobile development and AI
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
Job Responsibility
Job Responsibility
  • Build outstanding consumer-grade web applications using modern JavaScript/TypeScript, React, CSS, Node.js, and related technologies
  • Design, build, and maintain secure, reliable, and scalable backend APIs powering Copilot’s education experiences across platforms
  • Work collaboratively with our Designers, Product Managers, and AI Researchers to take ambiguous projects and mold them into amazing experiences
  • Use data insights to analyze user behavior, inform product decisions, and identify opportunities to unlock learner motivation, mastery, and delight
  • Ship high-quality, well-tested, secure, and maintainable code across the full stack
  • Find creative paths around ambiguity and roadblocks to ship value quickly and iteratively
  • Thrive in a fast-paced, design-driven, product development environment with rapid experimentation and tight feedback loops
  • Contribute to a team culture grounded in empathy, collaboration, inclusion, curiosity, and a growth mindset
  • Fulltime
Read More
Arrow Right