Datacenter Hardware Operations Lead

OpenAI

Location:
United States, San Francisco

Contract Type:
Not provided

Salary:

125600.00 - 228000.00 USD / Year

Job Description:

We are seeking a Datacenter Hardware Operations Lead focused on hardware operations, network operations, and logistics, with 15+ years of experience managing complex, mission-critical data center environments. You will oversee and optimize day-to-day physical hardware operations, including material movement and hardware maintenance, across our expanding global footprint. This role includes designing scalable repair and logistics systems, managing vendors, and ensuring seamless coordination across facilities, supply chain, and engineering.

Job Responsibility:

  • Collaborate with internal and external teams to establish the hardware operations strategy, critical metrics and SLAs
  • Lead daily physical operations for data center campuses, from commissioning through ongoing maintenance
  • Design and implement robust logistics systems for material movement, repairs, and operational workflows
  • Collaborate with engineering, construction, supply chain, and operations teams to streamline processes and resolve bottlenecks
  • Develop tools and practices for improved traceability, throughput, and vendor coordination
  • Manage relationships with external partners and ensure alignment with operational goals
  • Apply best practices in logistics and operational planning to support scalable infrastructure growth

Requirements:

  • 15+ years of experience in physical operations and logistics for mission-critical infrastructure and data centers
  • Proven ability to manage complex logistics systems and coordinate across disciplines
  • Deep understanding of operational processes for large-scale facilities including maintenance, construction support, and warehousing
  • Experience leading cross-functional initiatives and working with third-party vendors
  • Bachelor's degree in Engineering, Logistics, Operations Management, or a related field (advanced certifications preferred)

Nice to have:

  • 20+ years of experience managing global-scale data center operations and logistics
  • Expertise in supply chain logistics and systems implementation for hyperscale environments
  • Strong leadership in dynamic, high-pressure settings with evolving technical needs

What we offer:

  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Relocation support for eligible employees
  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided
  • Equity
  • Performance-related bonus(es) for eligible employees

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Full-time
Work Type:
Hybrid work

Similar Jobs for Datacenter Hardware Operations Lead

Principal Engineer

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the...
Location:
United States, Redmond
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Master's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, OR related field AND 7+ years technical engineering experience OR Bachelor's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, OR related field AND 8+ years technical engineering experience OR equivalent experience
  • 5+ years of technical leadership experience as a platform, software, or validation architect, as a lead debug engineer, or in an equivalent industry leadership position
  • Deep understanding of modern server or datacenter architectures or System on Chip features like virtualization technologies or major architectural blocks like Memory Controllers or Central Processing Units or Storage or Networking solutions for Cloud or Datacenter infrastructures
  • Experience leading technical deep dives into datacenter software solutions used in at scale environments or datacenter infrastructure and data systems, cloud native operating systems, or virtualization technologies
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility:
  • Lead development and implementation of end to end debug solutions for @scale datacenter systems
  • Lead collaboration projects with hardware, firmware and software teams that drive root cause analysis
  • Accountable for successful execution of targeted defect reduction projects
  • Provide technical recommendations on at scale test content deployment technologies
  • Lead resolution of complex problems based on technical and business understanding
  • Develop world class at scale debug methodologies, test strategies and test routines in data center solutions
  • Solve problems relating to mission critical services and build automation to drive debug efficiency
  • Effectively communicate with partners and stakeholders for planning and progress on initiatives using data
  • Embody our culture and values
Employment Type:
Full-time

Principal Product Manager - Private Cloud and Flex Infrastructure

Experienced and strategic Principal Product Manager to lead the Private Cloud an...
Location:
United States
Salary:
148000.00 - 340500.00 USD / Year
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree or equivalent in computer science, engineering or related field of study
  • MBA or advanced degree in computer science or engineering preferred
  • 10+ years of work experience in related field
  • Strong teamwork skills, including the ability to drive and influence work cross-functionally through others and to mentor and lead teams to achieve results on complex, ambiguous projects
  • Extensive skills in cost efficient solution building, financial performance metric creation and analysis
  • Extensive business acumen and knowledge of root cause analysis and problem detection
  • Technical understanding and knowledge of the relevant industry and ability to provide product specific technical training to the team
Job Responsibility:
  • Independently leads and drives the end-to-end strategy and operational product roadmap for one or more complex products or a product portfolio
  • Builds and delivers the value proposition, target customer segments, and business case to bring innovative and disruptive products to market for a product portfolio with respect to the whole company product portfolio
  • Synthesizes market requirements (MRD) into marketing/customer details through having intimate customer knowledge and business, financial and industry market acumen
  • Guides key stakeholders on the portfolio strategy across all phases of the lifecycle
  • Creates and drives goal alignment and collaborates across one or more products' value chain partners to optimize margins and enable success of products per plans across the product lifecycle
What we offer:
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment Type:
Full-time

Engineering Build Manager

We’re looking for an Engineering Build Manager to serve as the primary liaison b...
Location:
Taiwan, Taipei
Salary:
Not provided
Etched
Expiration Date:
Until further notice
Requirements:
  • 5+ years of experience in hardware operations, manufacturing program management, or similar build-focused roles
  • Strong cross-functional communication skills and experience working with engineering and supply chain teams
  • Familiarity with BOM structures, ERP systems, and material planning workflows
  • Experience managing builds with ODM partners, including PCB assembly and SMT processes
  • Ability to manage multiple concurrent builds while maintaining clarity on priorities and timelines
  • Experience supporting NPI to mass production transitions
  • Exposure to datacenter hardware, PCB development, or complex system integration programs
  • An electrical or mechanical engineering background, or equivalent hands-on technical experience
  • Experience troubleshooting material shortages, allocation challenges, or factory readiness issues
Job Responsibility:
  • Partner closely with Platform TPMs to develop build schedules and execution timelines across vendors and contract manufacturers
  • Monitor and track logistics throughout the build process, proactively identifying and escalating potential issues
  • Work with factory supply chain teams to ensure materials are clear-to-build (CTB) and aligned with production schedules
  • Support and participate in on-site build activities at CMs and vendor locations
  • Collaborate with platform engineering teams to identify build risks and define mitigation strategies
  • Lead recurring factory execution calls to track progress and manage key build milestones
  • Coordinate with cross-functional engineering teams to define prototype quantities required for each build phase
  • Act as the primary engineering representative when engaging with suppliers and manufacturing partners
  • Optimize engineering build planning, including prototype quoting, scheduling, logistics, execution, and reporting with CM/OEM partners
  • Identify build risks and develop and execute risk management and workaround plans
What we offer:
  • Competitive compensation packages, including generous equity packages
  • Comprehensive insurance coverage and other top-of-market benefits
Employment Type:
Full-time

VMware, Storage & Datacenter Engineer

Ryanair Labs are currently recruiting for Systems Engineer to join Europe’s Larg...
Location:
Ireland, Dublin
Salary:
Not provided
Ryanair - Europe's Favourite Airline
Expiration Date:
Until further notice
Requirements:
  • 5+ years experience in a similar role
  • VMware vSphere, ESXi, vCenter (design, build, lifecycle management)
  • VMware NSX-T Data Center (security, micro‑segmentation, routing, overlays)
  • Enterprise storage (Dell PowerStore preferred; Pure/NetApp acceptable)
  • SAN technologies (FC, iSCSI, zoning, LUN masking, multipathing)
  • Physical server hardware (Dell, HPE, UCS, etc.)
  • Multi-datacenter architecture (replication, failover, DR)
  • Cold/isolated vault environments (CyberVault, Commvault, or similar)
  • Backup technologies — Veeam Backup & Replication, immutable repositories
Job Responsibility:
  • Lead the technical design, implementation and delivery of VMware, storage, and datacenter solutions
  • Architect, deploy, and manage VMware vSphere, ESXi clusters, vCenter and associated ecosystem platforms
  • Design & operate NSX-T network virtualization technologies and micro‑segmentation
  • Lead multi‑datacenter architecture design including replication, failover, DR, and HA strategies
  • Design and manage Dell PowerStore, SAN/NAS systems, storage performance/IOPS optimization, and replication technologies
  • Lead the build-out and management of isolated/cold environments (Dell CyberVault / Commvault / air‑gapped recovery sites)
  • Provide advanced Level 3 engineering support for VMware, storage, and physical server issues
  • Maintain and operate enterprise hardware including Dell, HPE or Lenovo physical servers
  • Ensure datacenter environments (power, cooling, cabling, racks, security) are maintained to best practice
  • Monitor and analyse system performance, capacity, and reliability across virtualized and physical platforms
What we offer:
  • Discounted and unlimited travel to over 250 destinations
  • Defined Contribution Pension Scheme – Matched up to 5% or €5,000
  • Death in Service Benefit – Up to 2 times of annual basic salary
  • 20 Days Annual Leave – Increasing to 22 days after 12 months and 25 days after 3 years of continuous service
  • Option for up to 5 additional unpaid leave days per year
  • Cycle 2 Work Scheme
  • Unrivalled career progression

Senior Research Engineer

The HPE HPC & AI EMEA Research Lab (ERL) is characterized by a unique blend of i...
Location:
Germany, Munich, Berlin
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • Development experience in compiled languages such as C, C++ or Fortran and experience with interpreted environments such as Python
  • At least a B.Sc. equivalent in a Science, Technology, Engineering or Mathematical discipline
  • Parallel programming experience, with programming models such as OpenMP, MPI, CUDA, OpenACC, HIP, PGAS languages, etc.
  • An understanding of AI/ML frameworks; experience with frameworks such as TensorFlow or PyTorch is highly desirable
  • An interest in system and data center monitoring and operational data analysis
  • Professional language skills in English and German
Job Responsibility:
  • Perform world-class research while also shaping products of the future
  • Work with the most esteemed research partners across Europe
  • Enable high performance research software on pre-Exascale and Exascale supercomputers
  • Provide new environments/abstractions to support application developers to build, deploy, and run applications taking advantage of leading-edge hardware at scale
  • Make and operate HPC/AI systems and datacenters in a sustainable way
  • Manage modern data-intensive workloads in high performance environments
What we offer:
  • Competitive salary and extensive benefits package (pension scheme, insurances, bike and car leasing, and other fringe benefits)
  • Work-life balance (flexible working time and hybrid workplace model, 30 vacation days, four HPE Wellness-Fridays, up to six months paid parental leave)
  • Support for education, training, and career development
  • Diverse and dynamic work environment

Director Product Management (Artificial Intelligence Hardware)

Do you want to be at the forefront of innovating the latest hardware designs to ...
Location:
United States, Redmond
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree AND 8+ years experience in product/service/project/program management or software development OR equivalent experience
  • 3+ years of experience working on AI systems as an architect or a product manager
  • 7+ years of technical product management experience, including products within datacenter Hardware systems and/or Cloud infrastructure
  • 7+ years experience creating product roadmap(s) from conception to launch, driving end-to-end program execution, defining product go-to-market strategy, and leading program direction discussions
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility:
  • Collaborate with customers and partner organizations to define future generations of Artificial Intelligence (AI) Hardware for Azure at Microsoft
  • Lead the strategic product vision, roadmap and product requirements for our next generations of AI hardware platforms
  • Identify and prioritize customer needs, market opportunities, and competitive gaps, and translate them into clear and actionable product requirements and specifications
  • Drive executive decision making for new investments, including competitive analysis, program goals and business requirements, architectural concepts, risk management strategies, financial analysis, schedule and hardware strategy
  • Lead technical programs from concept to execution, collaborating with architecture, engineering and business teams to develop and drive end-to-end product development
  • Develop and maintain a high level of technical proficiency in AI workload requirements, AI technology landscape and AI Industry roadmaps
  • Engage with senior leadership, highlighting risks across functional teams and providing recommendations to support product level decisions
  • Operate effectively in ambiguity. Apply process where it creates value, and design process where it’s needed. Recognize the situations where each approach is most appropriate
Employment Type:
Full-time

Lead Infrastructure Engineer - Digital

Lead Infrastructure Engineer for Digital Site Services. Responsible for deliveri...
Location:
Sweden, Boden
Salary:
Not provided
Stegra
Expiration Date:
Until further notice
Requirements:
  • A University degree or equivalent experience, Bachelor or Master’s in Engineering or Computer Science
  • Expertise with administration of Linux and Windows based systems
  • Relevant experience in datacenter operations
  • Hands-on experience with configuring and managing virtualization environments such as VMware / Nutanix
  • Hands-on experience in Infrastructure provisioning and configuration management via infrastructure as code (IaC) – Ansible / Terraform
  • Hands-on experience with Kubernetes for container management, deployment, and scaling
  • Knowledge of cybersecurity best practices, and experience in analyzing possible threats and vulnerabilities, embedding security standards, monitoring adherence to them, and implementing disaster recovery plans
  • Experience with Active Directory and user management systems
  • Strong experience with AWS service configuration and management
  • Experience with performance monitoring and troubleshooting for hosts, servers, databases, virtual machines, containers
Job Responsibility:
  • Oversee the deployment, commissioning, and configuration of digital infrastructure, including both hardware and software
  • Configure and operate physical servers, storage, backup systems, datacenter support functions, as well as virtualization environments (both on-premises and in the cloud)
  • Work closely with team members and vendors to design digital infrastructure and implement best practices
  • Define operational processes for monitoring and maintaining infrastructure, ensuring secure, efficient, and reliable operations
  • Develop plans for maintenance and support to minimize downtime and ensure smooth operations
  • Collaborate with stakeholders to understand business needs, prioritize infrastructure requirements, and ensure the safe and robust platform while improving productivity, safety, and sustainability in the factory
  • Create and document best practices, and mentor team members to foster growth and improve infrastructure operations
  • Participate in an on-call rotation together with other team members to support the 24/7 operation of critical infrastructure
What we offer:
  • Fair, competitive compensation aligned with collective agreements
  • Up to 30 days of paid vacation
  • Occupational pension
  • Parental benefits
  • Insurance
  • Relocation and immigration support
  • Subsidized gym memberships
  • Bike leases
Employment Type:
Full-time

Principal Software Engineer

Microsoft Azure High Performance Computing & AI Engineering (HPC & AI Eng) team ...
Location:
United States, Multiple Locations
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python - OR equivalent experience
  • 5+ years of hands-on experience designing and developing high-volume, low-latency pipelines using products such as AzPubSub, Event Hubs, Azure Stream Analytics, Kafka, Grafana, Prometheus, or equivalent products
  • 3+ years of experience with one of AI/HPC system management OR High-Speed Networks OR HPC Storage OR managing Cloud Infrastructure
  • Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility:
  • Architect, design and develop high volume low latency end to end event pipelines that can provide first-to-know-insights on events causing job interrupts and job reliability
  • Conduct analysis of existing event pipelines to evaluate fidelity, granularity and latency of critical events
  • Contribute to improving key metrics such as Job Mean Time to Interrupt, Nodes in Service, Mean Time to Resolve on flagship supercomputers by enabling data scientists and domain experts to use the telemetry to identify events & issues at the intersection of datacenter and hardware, develop hypothesis, conduct A/B tests and synthesize results
  • Partner with cross-organizational teams to evaluate available telemetry and latency, and drive the architecture, design, development, and deployment of end-to-end solutions to manage core infrastructure, including current and next-generation datacenter, IT hardware, power, and cooling technologies
  • Drive engineering and operational excellence based on issues and learnings from strategic customers on their usage scenarios to improve product features and capabilities
  • Partner with teams on continuous learning and continuous improvement programs by leading the resolution of complex incidents, driving root cause analyses and championing initiatives to minimize future customer impact
Employment Type:
Full-time