Member of Technical Staff, Infrastructure Engineer

United States, Mountain View · 119800.00 - 234700.00 USD / Year · Job Posted April 01, 2026

Job Description

As Microsoft continues to push the boundaries of AI, we are on the lookout for passionate individuals to work with us on the most interesting and challenging AI questions of our time. Our vision is bold and broad — to build systems that have true artificial intelligence across agents, applications, services, and infrastructure. It’s also inclusive: we aim to make AI accessible to all — consumers, businesses, developers — so that everyone can realize its benefits. Our Platform Infrastructure team is responsible for building and scaling the backend platform at the core of Microsoft consumer products, the integrations with our AI models and the tools that our engineers use. We collaborate closely with cross-functional engineering, product management, and AI research, empowering all Microsoft Copilot teams to more effectively bring cutting-edge AI research to production. We’re seeking experienced Platform Infrastructure Engineers who are passionate about AI, are deeply proficient in scaling backend technologies, and possess a mastery of templating to architect solutions that stand the test of time.

Job Responsibility

  • Design, develop, and maintain performant and secure AI Platform services that power Copilot
  • Work collaboratively with platform, infrastructure, application engineers, and AI researchers to build next generation AI products and services
  • Ship high-quality and maintainable code, and ensure the reliability, scalability, and performance of platform components
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values

Requirements

  • Bachelor’s degree in computer science or a related technical discipline AND 4+ years of technical engineering experience building services and products in languages such as Python, C#, C++, Rust, or Java, OR equivalent experience
  • 4+ years’ experience building scalable platforms on public cloud infrastructure like Azure, AWS, or GCP with extensive use of technologies like Docker, Kubernetes, nginx, RDBMS, key-value stores, etc.
  • 4+ years’ experience in building and releasing production software at the platform level
  • Solid knowledge of APIs, data flows, systems, and services

Nice to have

  • Experience managing high scale, multi-region, production environments on Kubernetes in cloud environments
  • Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience
  • Dedication to writing clean, maintainable, and well-documented code with a focus on reliability, security and ease of use
  • Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers and other engineers
  • Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders
  • Ability to work in a fast-paced environment, manage multiple priorities, and adapt to changing requirements and deadlines
  • Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in web development and AI
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team

Similar Jobs for Member of Technical Staff, Infrastructure Engineer

8 matching positions

Member of Technical Staff, Infrastructure

The Infra team at LlamaIndex owns the foundations that our product is built upon...
Location: United States, San Francisco
Salary: Not provided
Company: LlamaIndex (llamaindex.ai)
Expiration Date: Until further notice
Requirements
  • 5+ years of engineering experience
  • Worked on Platform or Infrastructure teams on significant projects involving infrastructure components (Terraform/CDKTF, Kubernetes, Helm, test infrastructure, release management, observability, etc.)
  • Experience in optimizing cloud resource utilization
  • Proficient in tuning Kubernetes clusters and cloud resources for cost and performance efficiency
  • Willing to build LlamaIndex's engineering culture as we grow
  • You can balance speed and pragmatism and build the appropriate solutions for each stage of the company's growth
Job Responsibility
  • Collaborate with other engineering teams to build and maintain foundational systems that empower developers and support the company's rapid growth
  • Design and implement scalable infrastructure solutions for various deployment models, including SaaS, single-tenant, and private deployments
  • Manage and optimize cloud resources and Kubernetes clusters for cost-effectiveness and performance
  • Enable external customer deployment success through maintaining clear infrastructure boundaries and principles
  • Optimize and improve the release and deployment processes to enhance efficiency and reliability
  • Ensure compliance with relevant regulations and implement robust security measures across different deployment environments
What we offer
  • Competitive base salary and equity compensation
  • Comprehensive medical/dental/vision coverage for you and your family
  • Unlimited paid time off policy
  • Daily catered lunch and snacks in the San Francisco office
Employment Type: Full-time

Member of Technical Staff - Infrastructure & Engineering

The Member of Technical Staff (MTS) - Systems is a senior individual contributor...
Location: United States, Austin
Salary: Not provided
Company: Aptiv plc (aptiv.com)
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Electrical Engineering, or related field
  • 8+ years of software engineering experience
  • 5+ years of experience with embedded Linux or systems programming
  • Experience leading technical projects and mentoring engineers
  • Strong background in C/C++ programming
  • Expert-level proficiency in C/C++ programming
  • Deep understanding of Linux kernel architecture and internals
  • Experience with embedded systems development
  • Knowledge of build systems (Yocto, Buildroot, or similar)
  • Strong debugging and problem-solving skills
Job Responsibility
  • Serve as technical lead for major features and projects
  • Design and architect complex system components and solutions
  • Provide technical guidance and mentorship to junior engineers
  • Review code, designs, and architecture decisions
  • Drive technical standards and best practices within the team
  • Develop and maintain embedded Linux systems software
  • Work on user space applications, kernel modules, or toolchain components
  • Implement new features and enhancements based on requirements
  • Debug and resolve complex technical issues
  • Write high-quality, maintainable code following team standards
What we offer
  • Hybrid work model for workplace flexibility
  • Comprehensive health, dental, and life insurance
  • Short and long-term disability coverage
  • RRSP matching for financial security
  • Flexible time-off policies for work-life balance
  • Employee assistance program for mental well-being
  • Learning benefits, including a LinkedIn Learning subscription and seminars
Employment Type: Full-time

Senior Member of technical staff (Infrastructure)

About the Team: The Infrastructure team aims to make it seamless for our researc...
Location: United Kingdom; France, London; Paris
Salary: Not provided
Company: H Company (hcompany.ai)
Expiration Date: Until further notice
Requirements
  • Infrastructure as code (CDK, Terraform, ...)
  • Experience architecting and deploying distributed systems on public cloud (AWS, Azure, GCP)
  • Observability and monitoring (Datadog, Prometheus, Grafana, …)
  • Good knowledge of a modern programming language (ideally Python or JS/TypeScript)
Job Responsibility
  • Designing and managing the infrastructure to support Research efforts in Model and Agent development incl. training infrastructure, data pipelines and inference
  • Designing and managing the infrastructure to support Product Engineering efforts on H Company’s agent platform including client-facing APIs and agent runtimes within various deployment scenarios (multi-tenant and on-prem)
  • Set up and maintain observability and monitoring strategies
  • Mentor and grow other engineers in infrastructure-related topics as well as general engineering practices
What we offer
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
Employment Type: Full-time

Member of technical staff (Infrastructure)

About H: H exists to push the boundaries of superintelligence with agentic AI. B...
Location: France; United Kingdom, Paris; London
Salary: Not provided
Company: H Company (hcompany.ai)
Expiration Date: Until further notice
Requirements
  • Observability and monitoring (Datadog, Prometheus, Grafana, …)
  • Good knowledge of a modern programming language (ideally Python or JS/TypeScript)
Job Responsibility
  • Designing and managing the infrastructure to support Research efforts in Model and Agent development incl. training infrastructure, data pipelines and inference
  • Designing and managing the infrastructure to support Product Engineering efforts on H Company’s agent platform including client-facing APIs and agent runtimes within various deployment scenarios (multi-tenant and on-prem)
  • Set up and maintain observability and monitoring strategies
What we offer
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
Employment Type: Full-time

Member of Technical Staff, Infrastructure Data & Analytics

We are seeking experienced Infrastructure Data & Analytics Engineers to join our...
Location: United States, Multiple Locations; Mountain View; San Francisco Bay area; New York City metropolitan area
Salary: 139900.00 - 274800.00 USD / Year
Company: Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in computer science or a related technical field AND 8+ years of technical engineering experience in data engineering, analytics, or data science, with increasing technical ownership in a startup environment, AND 6+ years of experience with distributed data processing frameworks and large-scale data systems, OR equivalent experience
  • Master's degree in computer science or a related technical field AND 12+ years of technical engineering experience in data engineering, analytics, or data science, with increasing technical ownership in a startup environment, AND 10+ years of experience with distributed data processing frameworks and large-scale data systems, OR equivalent experience
  • Proven technical leadership in data engineering, analytics platforms, or large-scale telemetry systems
  • Hands-on experience with ETL orchestration frameworks such as Airflow, Dagster, or similar
  • Strong communication skills; can explain complex systems clearly to senior leaders
Job Responsibility
  • Act as the technical lead and owner for infrastructure analytics across compute, storage, and networking
  • Design and build durable, scalable data pipelines that ingest telemetry from clusters, schedulers, health systems, and capacity trackers into Data Warehouse
  • Define and standardize core metrics and semantics (e.g., utilization, occupancy, MFU, goodput, capacity readiness, delivery-to-production)
  • Architect and maintain self-service dashboards and APIs for fleet, cluster, and squad-level visibility
  • Partner closely with stakeholders across Supercomputing Infra, Researchers, Strategy and Executives to ensure metrics reflect operational and business reality
  • Implement robust and fault-tolerant systems for data ingestion and processing
  • Lead data architecture and engineering decisions, applying strong technical judgment to proactively shape executive-level discussions and decisions
  • Identify data gaps and instrumentation issues; drive fixes by influencing upstream engineering teams
  • Establish data quality, validation, documentation, and governance so metrics are trusted and repeatable
Employment Type: Full-time

Member Of Technical Staff - Software Engineer, Health AI

At Microsoft AI, our health team is on a mission to help millions of users bette...
Location: United Kingdom, London
Salary: Not provided
Company: Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's or Higher Degree in Computer Science, or related technical discipline AND strong software engineering experience with coding in languages/frameworks including, but not limited to, C#, C, C++, Java, Python, Rust, Typescript, Swift, Kotlin
  • Demonstrated expertise building products at scale, with domain expertise in one or more of distributed systems, cloud infrastructure, web, mobile, GenAI
  • Experience collaborating in cross functional teams, working through ambiguity to deliver high quality products
  • Have 0-to-1 experience, with a bias towards shipping and learning while maintaining a high quality bar
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
Job Responsibility
  • Collaborate with AI researchers, product managers, and designers to bring a world-class AI health companion to the world
  • Own the end-to-end development of features, from ideation and specification through to deployment and iteration
  • Design, build, and optimize production-grade code, delivering robust features within a much larger existing architecture
  • Work independently across a wide range of our stack, shipping delightful user experiences
  • Ensure resilience, maintainability, and security above all else
  • Build the hiring pipelines, onboarding frameworks, or software development best practices as needed to scale an engineering team around you
  • Guide peers, contributing to a culture of technical excellence and continuous improvement
Employment Type: Full-time

Member of Technical Staff, Software Engineer

Help build the infrastructure that powers training, evaluation, and data platfor...
Location: Switzerland, Zürich
Salary: Not provided
Company: Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Strong software engineering background building reliable, scalable production systems (Python preferred)
  • Hands‑on experience supporting large‑scale ML / LLM training, evaluation, or experimentation infrastructure
  • Operating GPU‑heavy workloads in cloud environments using Docker and Kubernetes (scheduling, utilization, isolation)
  • Designing and running data / compute pipelines and orchestration (e.g., Airflow, Argo) with object storage (Azure Blob / S3)
  • Platform reliability and operability: observability, metrics, logging, tracing, alerting (Prometheus, Grafana, OpenTelemetry)
Job Responsibility
  • Design and build core platform services for scalable training and evaluation, including cluster orchestration, job scheduling, data and compute pipelines, and artifact management
  • Standardize containerized workflows by maintaining Docker images, CI/CD, and runtime configurations; advocate for best practices in security, reproducibility, and cost efficiency
  • Implement end-to-end observability and operations through metrics, tracing, logging, dashboard development, monitoring, and automated alerts for model training and platform health (using Prometheus, Grafana, OpenTelemetry)
  • Architect and operate services on Azure cloud platforms, managing infrastructure-as-code (Terraform/Helm), secrets, networking, and storage
  • Enhance developer experience by creating tools, CLIs, and portals that simplify job submission, metrics analysis, and experiment management for generalist software engineering and research teams
  • Enforce security and compliance policies for data access, container hardening, and supply-chain integrity, and partner with security and privacy teams to maintain robust practices in multi-tenant environments and secret management
  • Collaborate cross-functionally with data, model, and product teams to align infrastructure roadmaps with training needs, evaluation protocols, and Copilot product goals
Employment Type: Full-time

Member of Technical Staff - Backend Engineer

Microsoft AI is looking for a talented Backend engineer to help build the next w...
Location: United States, Redmond
Salary: 119800.00 - 234700.00 USD / Year
Company: Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 4+ years' experience building backend APIs for mobile apps (such as GraphQL, REST, Protobuf, or Thrift) and streaming protocols (such as WebSocket, SSE, or WebRTC), with familiarity in backend and mobile data schema code generation or consistency, version control for mobile releases, analytics, feature flags, and A/B testing frameworks
  • 4+ years' experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP, with extensive use of datastores like RDBMS, key-value stores, etc.
  • 4+ years' experience building distributed systems at scale and extensive systems knowledge that spans bare-metal hosts to containers to networking
Job Responsibility
  • Build secure and performant APIs that power Copilot apps
  • Work collaboratively with other product engineers, Product Managers, and platform engineers to take ambiguous projects and mold them into amazing experiences
  • Ship high-quality, well-tested, secure, and maintainable code
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
Employment Type: Full-time