This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Crusoe is expanding our hyperscale AI and high-performance computing (HPC) data center portfolio across the U.S. and internationally. As our footprint grows, engineering excellence and system reliability are fundamental to ensuring world-class uptime, performance, and scalability for AI workloads. The Director of Engineering & Reliability will lead the standards, frameworks, and technical governance behind Crusoe’s mechanical, electrical, and critical infrastructure systems. This leader owns engineering design standards, reliability strategy, system performance modeling, and asset lifecycle programs across our 50–400 MW hyperscale campuses. This is a high-impact role partnering closely with Construction, Facility Operations, Commissioning, Design/Engineering, and Executive Leadership to ensure Crusoe’s data centers achieve world-class reliability, efficiency, and operational readiness.
Job Responsibility:
Build and govern Crusoe’s enterprise engineering design standards for mechanical, electrical, and critical infrastructure systems
Lead reliability engineering programs including FMEA, RCM, RCA, uptime strategy, and risk modeling
Develop asset lifecycle strategies, predictive maintenance programs, and long-term capital planning
Model power, cooling, airflow, and liquid-loop performance to optimize system capacity and readiness
Serve as L3 escalation for complex MEP issues and major incidents
Lead technical audits, quality assurance programs, and engineering evaluations across all campuses
Partner with Construction, Commissioning, and Operations to enable scalable, high-density AI workloads
Build and lead a team of MEP and reliability engineers
Requirements:
10+ years of engineering experience in mission-critical facilities or hyperscale data centers
Strong technical expertise in mechanical and electrical systems (MV distribution, UPS, generators, cooling plants, CRAC/CRAH, liquid cooling)
Experience implementing RCM, FMEA, RCA, and reliability engineering programs
Ability to govern engineering standards across multi-site portfolios
Strong analytical, modeling, and systems-thinking capabilities
Nice to have:
PE license (Mechanical or Electrical)
CMRP/CRE certification
Experience with >50 MW data centers
Familiarity with AI/HPC cooling and electrical challenges
Experience in high-growth environments
What we offer:
Restricted Stock Units
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability