CrawlJobs Logo

Director, Site Reliability Engineering

earnin.com Logo

EarnIn

Location Icon

Location:
United States , Mountain View

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

315000.00 - 385000.00 USD / Year

Job Description:

The Director of Site Reliability Engineering (SRE) will provide strategic leadership and technical direction for the reliability, scalability, and performance of our mission‑critical systems and services. This role combines deep SRE expertise with strong engineering leadership to drive organizational transformation toward reliability-first principles. The ideal candidate brings a strong software engineering foundation, a passion for automation, and a proven ability to develop and lead high‑performing teams. The Director will partner with engineering, product, operations, and business stakeholders to design, deliver, and operate resilient, high‑availability systems that support our customers and business objectives at scale.

Job Responsibility:

  • Drive organizational transformation toward SRE principles and own the strategic direction for reliability maturity, cultivating a culture centered on reliability, efficiency, and continuous improvement
  • Develop and oversee automation strategies, tools, and frameworks that improve system reliability, reduce operational toil, and enhance team productivity
  • Architect and evolve robust observability, monitoring, and alerting systems
  • champion chaos engineering and resilience testing practices to proactively validate system behavior under failure conditions
  • Partner with engineering, product, and operations teams to embed SRE practices throughout the development lifecycle and influence architectural decisions for reliability
  • Build, mentor, and develop a high‑performing global SRE organization, fostering technical excellence, career growth, and a strong culture of knowledge sharing
  • Oversee capacity planning, scalability assessments, and future‑state demand forecasting across critical systems
  • Lead and govern high‑severity incident response practices—ensuring rapid triage, thorough root cause analysis, and follow‑through on corrective and preventative actions

Requirements:

  • BS, MS, or PhD degree in Computer Science, Engineering, or related field, or related experience
  • 7+ years of experience in the field, including 3+ years leading SRE teams or a team in a similar role
  • Strong experience with container orchestration (Kubernetes), infrastructure as code (Terraform), and CI/CD pipelines
  • Hands-on experience with observability platforms (e.g., Datadog, Prometheus, Grafana) and incident management tools (e.g., incident.io, PagerDuty)
  • Proficiency in at least one programming language (Python, Go, or Java) with the ability to review code and guide system design decisions
  • Proven experience in architecting and managing highly available, scalable, and fault-tolerant systems
  • Ability to define a clear reliability vision and inspire teams and stakeholders toward long‑term reliability goals
  • Demonstrated sound judgment and calm decision‑making under pressure, particularly during high‑severity incidents
  • Strong people leadership skills, with experience coaching and mentoring engineering talent, developing future leaders, and aligning peer engineering managers and leaders on reliability best practices
  • Strategic planning skills with a track record of aligning technical direction with organizational objectives
  • Excellent communication skills
  • able to translate complex technical issues into clear, actionable insights for executive and non‑technical audiences
  • Highly collaborative, with the ability to work effectively across engineering, product, operations, and business functions and leaders
What we offer:

equity and benefits

Additional Information:

Job Posted:
February 17, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Director, Site Reliability Engineering

Director SRE & Operations

Director SRE & Operations for E-business / Digital at PUMA in Herzogenaurach, Ge...
Location
Location
Germany , Herzogenaurach
Salary
Salary:
Not provided
about.puma.com Logo
Puma Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10–15 years of experience in technology operations, site reliability engineering, or platform engineering within large-scale digital or eCommerce environments
  • Proven track record owning platform reliability, availability, and operational performance for consumer-facing systems
  • Strong experience with cloud infrastructure, incident management, observability, and operational readiness in high-traffic, peak-driven environments
  • Demonstrated ability to embed SRE practices (SLOs, SLIs, incident response, automation) across engineering teams
  • Experienced leader of global operations or SRE teams, comfortable working in on-call and 24/7 operational models
  • Calm, decisive leader with a strong focus on stability, resilience, and continuous operational improvement
Job Responsibility
Job Responsibility
  • Leadership: Responsible for all aspects of the performance management and professional development of the team, including recruitment, development plans, providing constructive feedback, appraisals and exit processes
  • Foster a positive and inclusive team culture by actively engaging team members, promoting open communication, and implementing initiatives that enhance employee satisfaction and well-being
  • Compliance with and implementation of legal and operational requirements regarding occupational health and safety within your own area of responsibility
  • Global Site Reliability & Operations Strategy: Define and execute a global Site Reliability Engineering (SRE) and Technology Operations strategy aligned with PUMA’s D2C growth, peak trading demands, and omnichannel ambitions
  • Establish reliability, availability, performance, and scalability targets across all D2C platforms (eCommerce, in-store integrations, APIs, data platforms)
  • Own the end-to-end operational health of consumer-facing and business-critical platforms
  • Platform Reliability, Resilience & Performance: Drive a reliability-first mindset across engineering, embedding SRE principles such as SLIs, SLOs, SLAs, error budgets, and resilience-by-design
  • Ensure platforms are engineered to handle peak events (campaigns, drops, seasonal peaks) with minimal risk and rapid recovery
  • Lead incident management, major incident response, root cause analysis, and post-incident reviews with a strong focus on learning and prevention
  • Continuously improve platform observability, monitoring, alerting, and performance management
  • Fulltime
Read More
Arrow Right

Director of Engineering & Reliability

Crusoe is expanding our hyperscale AI and high-performance computing (HPC) data ...
Location
Location
United States , San Francisco
Salary
Salary:
216000.00 - 260000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of engineering experience in mission-critical facilities or hyperscale data centers
  • Strong technical expertise in mechanical and electrical systems (MV distribution, UPS, generators, cooling plants, CRAC/CRAH, liquid cooling)
  • Experience implementing RCM, FMEA, RCA, and reliability engineering programs
  • Ability to govern engineering standards across multi-site portfolios
  • Strong analytical, modeling, and systems-thinking capabilities
Job Responsibility
Job Responsibility
  • Build and govern Crusoe’s enterprise engineering design standards for mechanical, electrical, and critical infrastructure systems
  • Lead reliability engineering programs including FMEA, RCM, RCA, uptime strategy, and risk modeling
  • Develop asset lifecycle strategies, predictive maintenance programs, and long-term capital planning
  • Model power, cooling, airflow, and liquid-loop performance to optimize system capacity and readiness
  • Serve as L3 escalation for complex MEP issues and major incidents
  • Lead technical audits, quality assurance programs, and engineering evaluations across all campuses
  • Partner with Construction, Commissioning, and Operations to enable scalable, high-density AI workloads
  • Build and lead a team of MEP and reliability engineers
What we offer
What we offer
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Director of Engineering, Cloud Availability

As the Director of Engineering, Cloud Availability, you will lead our engineerin...
Location
Location
Ireland , Dublin
Salary
Salary:
Not provided
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of engineering leadership experience with a proven track record of managing high-performing technical teams
  • Deep technical knowledge of public cloud infrastructure and experience building or operating large-scale platforms (Public, Private, or Hybrid)
  • Expert-level understanding of availability, observability, SLIs/SLOs, and modern incident management frameworks
  • Proven ability to lead remote teams and successfully collaborate with US-based engineering organizations
  • Demonstrated success navigating and leading within a matrix organizational structure
  • Strong familiarity with virtual and managed Kubernetes platforms, such as EKS, GKE, or AKS
  • The ability to balance long-term organizational strategy with the immediate tactical needs of a fast-growing engineering site
Job Responsibility
Job Responsibility
  • Organizational Leadership: Partner closely with Data Center, Network, and SRE teams to build and scale a world-class engineering organization in Dublin
  • Site Leadership & Culture: Serve as the primary point of contact and face of Crusoe leadership in Dublin, proactively managing office sentiment and ensuring the team remains focused on high-impact objectives
  • Global Strategic Alignment: Build high-trust partnerships with US-based leadership to ensure local priorities are perfectly synchronized with the global business roadmap
  • Operational Excellence: Implement and refine "follow-the-sun" protocols to enable smooth hand-offs between time zones, ensuring zero customer disruption and 24/7 reliability
  • Unified Team Vision: Foster a "one-team" mindset across geographic boundaries, breaking down silos and promoting deep collaboration between Dublin and US offices
  • Talent Development: Level up the Dublin engineering team by identifying individual strengths and establishing a culture of mentorship to grow the next generation of Engineering Leads and ICs
  • Reliability Initiatives: Lead the development of SRE functions for IaaS and managed services, including Inference, SLURM, and automated cluster management
What we offer
What we offer
  • pension contributions
  • private health and dental insurance
  • income protection
  • life assurance
  • Fulltime
Read More
Arrow Right

Senior Director of Engineering, SRE

We are looking for a Senior Director of Site Reliability Engineering (SRE) to de...
Location
Location
United States
Salary
Salary:
186000.00 - 255000.00 USD / Year
alpha-sense.com Logo
AlphaSense
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Several years of Senior leadership experience in Site Reliability Engineering capacity
  • Deep knowledge of SRE principles and practices (SLIs/SLOs, error budgets, reliability economics)
  • Experience building self-service systems through platform engineering
  • Strong background in distributed systems and microservices
  • Production experience operating Kubernetes-based platforms
  • Solid understanding of cloud-native networking fundamentals
  • Experience running systems in multi-cloud environments (AWS and at least one of GCP or Azure)
  • Proven success scaling SRE practices across large engineering organizations
  • Demonstrated experience building, mentoring, and developing high-performing SRE teams
  • Ability to grow and sustain an inclusive, resilient engineering culture
Job Responsibility
Job Responsibility
  • Lead reliability and operational excellence across AlphaSense’s platforms and products
  • Scale SRE practices in a “you build it, you run it” engineering organization
  • Lead and grow a follow-the-sun SRE team across multiple time zones
  • Build, mentor, and develop high-performing SRE engineers
  • Own incident management, on-call operations, and post-incident learning
  • Cultivate an awareness and culture of reliability throughout the engineering organization
  • Set direction for observability and operational tooling
  • Enable teams to operate production systems safely and confidently
  • Embed reliability into the whole software delivery lifecycle in collaboration with Product, Platform, Cloud, and Security
  • Reduce systemic risk through toil reduction and continuous improvement
What we offer
What we offer
  • equity
  • a generous benefits program
  • Fulltime
Read More
Arrow Right
New

Site Engineering Director

DS Smith Paper is seeking an outstanding Engineering Director to lead the Engine...
Location
Location
United Kingdom , Kemsley
Salary
Salary:
Not provided
dssmith.com Logo
DS Smith
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Chartered member of relevant professional body or equivalent experience
  • Extensive engineering experience within a busy organisation is essential
  • Significant successful experience in management in a dynamic complex environment
  • Recognised maintenance leadership capability
  • Able to demonstrate in depth knowledge and experience of health and safety management and compliance management processes
  • Proven ability to influence decision making and build cross functional / cross company relationships
  • Experience managing budgets, operations, and forecasting
  • Strong problem-solving skills with a focus on continuous improvement
Job Responsibility
Job Responsibility
  • Lead and role-model a strong safety culture, ensuring all engineering work is carried out safely and in line with risk assessments and legal requirements
  • Build the right team structure and ensure all engineers are trained, competent and fully compliant
  • Develop and deliver the site’s long-term engineering strategy, with a clear focus on improving reliability and reducing unplanned downtime
  • Embed continuous improvement across the Engineering function and support wider operational improvements
  • Create engineering investment plans and provide accurate financial forecasts for budgets and long-term planning
  • Deliver clear, concise communication and reporting to support effective decision-making across the site
  • Strengthen maintenance and asset management practices to achieve world-class levels of reliability and asset availability
  • Improve planned and preventive maintenance systems, including SAP utilisation, spares management and workshop standards
  • Build strong relationships across the DS Smith Paper Division and with key suppliers to give the site rapid access to expertise and industry best practice
  • Ensure all engineering activities meet statutory requirements and follow DS Smith standards
What we offer
What we offer
  • Competitive salary
  • Qualifying Sick Pay scheme
  • Pension scheme & Life insurance
  • Share Save scheme
  • Income Protection
  • 25 days holiday plus Bank Holidays
  • Employee Assistance Programme
  • Virtual GP, Occupational Health & free Flu vaccine
  • Cycle to Work and shopping discounts
  • Fulltime
Read More
Arrow Right
New

Senior Director, Product Management

We’re seeking a Senior Director of Product Management to own and evolve our corp...
Location
Location
United States of America , Newton
Salary
Salary:
165000.00 - 205000.00 USD / Year
brighthorizons.com Logo
Bright Horizons
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree
  • 10+ years of Product Management experience
  • 5+ years senior leadership experience with direct management experience of three person or larger teams
  • 5 years of practical experience of product development lifecycle and Agile methodology
  • 5 years of experience with SEO/ AEO, CRO, analytics and CMS systems
Job Responsibility
Job Responsibility
  • Take a leadership role in defining the product vision and strategy for the corporate website across multi-audiences
  • Work closely with CX and UX to define audience‑specific journeys, value propositions, and content architectures
  • prioritize a unified roadmap that balances short‑term growth with long‑term platform scalability
  • Partner with Content/Brand to develop role‑based narratives and thought leadership that moves visitors from insight to action
  • Collaborate with Marketing and Analytics on campaigns, messaging, and measurement
  • Work with Performance and Engineering teams on SEO strategy (technical, on‑page, content) and site health (core web vitals, crawl/indexing, schema) for both traditional SEO and AIO/AEO
  • Own CRO: experimentation roadmap (A/B, multivariate), landing page optimization, forms, funnel instrumentation
  • Partner with Engineering for site reliability, performance, and scalability
  • Champion user research and translate insights into roadmaps
  • Lead Product Management team, which includes a team of 3-4 high performing product managers and analysts
What we offer
What we offer
  • Medical, dental, and vision insurance
  • Paid vacation, sick, holiday, and parental bonding leave
  • 401(k) retirement plan
  • Long-term and short-term disability insurance
  • Life insurance
  • Money-saving discounts and financial planning tools
  • Tuition assistance and education coaching
  • Caregiving support and resources for the children and adults in your family
  • Bonus
  • RSUs
  • Fulltime
Read More
Arrow Right
New

Director, Principal Software Architect

At Schwab, you’re empowered to make an impact on your career. Here, innovative t...
Location
Location
United States , Southlake; Austin
Salary
Salary:
204000.00 - 235000.00 USD / Year
schwab.com Logo
Charles Schwab
Expiration Date
February 25, 2026
Flip Icon
Requirements
Requirements
  • Champions architectural strategies across teams and enterprises
  • Builds enablement tools to improve time to market for delivery teams
  • Has developed and demonstrated AI strategies that can be multipliers for technical leads
  • Directs the strategic priorities for organizational capability improvement
  • Shapes strategic frameworks for innovative, system design and architecture
  • Demonstrated strong coaching and mentorship skills for junior associates creating a culture of curious learning
  • Proven to be iterative and strategic in solution designs with a focus on Site Reliability Engineering tenants
  • Excellent interpersonal, communication, and presentation skills
  • 10+ years experience with Java / Spring
  • 5+ years experience with front end technologies – Angular, React, JavaScript
Job Responsibility
Job Responsibility
  • Leads architectural initiatives from design to support
  • Establishes methods to optimize enterprise architecture alignment and drive tactical execution
  • Champions efforts that optimize system reliability through advanced life cycle management techniques
  • Guides transformative initiatives to optimize organizational capabilities for impactful results
  • Leads the execution of system design strategies for cloud-based environments
What we offer
What we offer
  • 401(k) with company match and Employee stock purchase plan
  • Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
  • Paid parental leave and family building benefits
  • Tuition reimbursement
  • Health, dental, and vision insurance
  • Bonus or incentive opportunities
  • Fulltime
!
Read More
Arrow Right

Director, Principal Architect

At Schwab, you’re empowered to make an impact on your career. Here, innovative t...
Location
Location
United States , Southlake
Salary
Salary:
204000.00 - 235000.00 USD / Year
schwab.com Logo
Charles Schwab
Expiration Date
February 17, 2026
Flip Icon
Requirements
Requirements
  • Champions architectural strategies across teams and enterprises
  • Builds enablement tools to improve time to market for delivery teams
  • Has developed and demonstrated AI strategies that can be multipliers for technical leads
  • Directs the strategic priorities for organizational capability improvement
  • Shapes strategic frameworks for innovative, system design and architecture
  • Demonstrated strong coaching and mentorship skills for junior associates creating a culture of curious learning
  • Proven to be iterative and strategic in solution designs with a focus on Site Reliability Engineering tenants
  • Excellent interpersonal, communication, and presentation skills
  • 10+ years experience with Java / Spring
  • 5+ years experience with front end technologies – Angular, React, JavaScript
Job Responsibility
Job Responsibility
  • Leads architectural initiatives from design to support
  • Establishes methods to optimize enterprise architecture alignment and drive tactical execution
  • Champions efforts that optimize system reliability through advanced life cycle management techniques
  • Guides transformative initiatives to optimize organizational capabilities for impactful results
  • Leads the execution of system design strategies for cloud-based environments
What we offer
What we offer
  • 401(k) with company match and Employee stock purchase plan
  • Paid time for vacation, volunteering, and 28-day sabbatical after every 5 years of service for eligible positions
  • Paid parental leave and family building benefits
  • Tuition reimbursement
  • Health, dental, and vision insurance
  • Bonus or incentive opportunities
  • Fulltime
Read More
Arrow Right