CrawlJobs Logo

Senior Engineering Manager, SRE

abridge.com Logo

Abridge

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

250000.00 - 290000.00 USD / Year

Job Description:

Abridge’s services and engineering teams are in hyperscale mode, and multiplying rapidly with our customer base and new product launches. We are looking for a seasoned leader who can harness this growth across the organization through reliability and performance engineering, engineering velocity, software replatforming and rearchitecture, and application security. You’ll lead and build an extremely fast growing organization, iteratively scope and execute a company-wide application reliability roadmap, and lead development and improvement of SLOs across the entire company and spanning multi-region and multi-cloud. The combination of security, scale, uptime, and timeline requirements Abridge has has never been executed before in tech. This is a rapidly expanding role that sits at the intersection of AI, reliability engineering, security, and healthcare.

Job Responsibility:

  • Visionary leadership: Scope, resource, evangelize, and execute a company-wide reliability and engineering velocity roadmap across environments and clouds, real-time streaming infrastructure under immense scale, compute as well as AI -at-edge infrastructure, and the most ambitious cloud security roadmap in the entire tech industry
  • Collaborate with department heads across product engineering, security, product management, commercial, and more to develop, align, and execute an extremely ambitious strategic roadmap
  • Gifted tactician: Work at the level of small tiger teams to unblock, enable, and drive execution and solutioning
  • Juggle several ambiguous and tricky problems at a time
  • Recruiter extraordinaire: Scale out your team to meet this roadmap - both ICs and managers
  • Attract top talent and hire quickly while maintaining a consistently high bar
  • Iterate on the hiring process along with other leaders, improve diversity and equity, retain and maximize the effectiveness of an extremely senior team, and make strategic bets on the people that will take us to the next level
  • Mentor to the mentors: Develop their careers, create top-of-ladder development opportunities, and continuously raise the bar for your staff as well as your peers and leaders in their abilities and awareness
  • Earn their trust, lead by example, be a doctor rather than a judge for organizational and people challenges, and help establish and maintain a hivemind, de-siloed culture across all engineering pods

Requirements:

  • 6+ years as a manager in rapidly growing organizations including at least 1 year as a manager of managers
  • Seeking an extremely challenging role that will push you beyond your limits, where failures are inevitable and not to be feared
  • Seeking a senior leadership role to develop people, environments, and impact - not ego, accolades, or ladder climbing
  • Able to ask for help, fail fast and admit defeat
  • get yourself and others out of their comfort zone
  • Track record of leading performance engineering including load test and chaos engineering, large scale distributed telemetry implementation, major architectural and software refactors, engineering velocity, and full stack development
  • Experience running production workloads in more than one cloud provider (at a time, or across your experience)
  • Experience managing workloads across containerized solutions, Kubernetes, and CNCF-approved tooling such as Argo, istio, OTel, and more
  • Thought leader in platform building, with a strong desire to represent Abridge as a reliability engineering leader in the tech industry
  • Genuine passion for Abridge’s mission to improve healthcare in America and across the world
What we offer:
  • Generous Time Off: 14 paid holidays, flexible PTO for salaried employees, and accrued time off for hourly employees
  • Comprehensive Health Plans: Medical, Dental, and Vision coverage for all full-time employees and their families
  • Generous HSA Contribution: If you choose a High Deductible Health Plan, Abridge makes monthly contributions to your HSA
  • Paid Parental Leave: Generous paid parental leave for all full-time employees
  • Family Forming Benefits: Resources and financial support to help you build your family
  • 401(k) Matching: Contribution matching to help invest in your future
  • Personal Device Allowance: Tax free funds for personal device usage
  • Pre-tax Benefits: Access to Flexible Spending Accounts (FSA) and Commuter Benefits
  • Lifestyle Wallet: Monthly contributions for fitness, professional development, coworking, and more
  • Mental Health Support: Dedicated access to therapy and coaching to help you reach your goals
  • Sabbatical Leave: Paid Sabbatical Leave after 5 years of employment
  • Compensation and Equity: Competitive compensation and equity grants for full time employees

Additional Information:

Job Posted:
January 20, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Engineering Manager, SRE

Engineering Manager, Infrastructure

As an Engineering Manager for the Infrastructure team, you’ll lead the engineers...
Location
Location
Canada; United States
Salary
Salary:
195000.00 - 285000.00 USD / Year
apollo.io Logo
Apollo.io
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of hands-on software or infrastructure engineering experience
  • 2+ years of experience leading teams of senior and staff-level engineers in platform, SRE, or infrastructure domains
  • Proven ability to design and operate large-scale distributed systems in cloud environments (preferably GCP or AWS)
  • Expertise with Kubernetes, Docker, Terraform, Ubuntu, and CI/CD pipelines
  • Familiarity with observability tools (Grafana, Prometheus, ELK, Datadog, NewRelic) and performance tuning
  • Strong grounding in networking, security, and reliability principles
  • Experience managing infrastructure costs, availability SLAs, and high-throughput systems at scale
Job Responsibility
Job Responsibility
  • Lead, coach, and grow a distributed team of high-impact Infrastructure Engineers
  • Partner with senior engineering leadership on strategic initiatives such as cloud migration, infrastructure scaling, platform reliability, and cost efficiency
  • Define and implement modern operational excellence practices, including SLOs, error budgets, incident reviews, and performance monitoring
  • Guide technical decision-making across key areas like Kubernetes, GCP, observability, networking, CI/CD, and IaC (Terraform, Ansible)
  • Collaborate with AI, Data, and Product Engineering teams to ensure infrastructure scalability for ML and AI-native workloads
  • Run effective 1:1s, career development conversations, and quarterly performance reviews
  • Support recruiting efforts to attract top engineering talent across time zones
What we offer
What we offer
  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • At least 10 paid holidays per year
  • Flex PTO
  • Parental leave
  • Employee assistance program and wellbeing benefits
  • Global travel coverage
  • Life/AD&D/STD/LTD insurance
  • FSA/HSA and medical, dental, and vision benefits
  • Fulltime
Read More
Arrow Right

Engineering Manager

The Engineering Manager is responsible for leading a team of software engineers ...
Location
Location
France , Paris
Salary
Salary:
65000.00 - 80000.00 EUR / Year
beamy.io Logo
Beamy
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 7 years of coding experience (back-end / full stack)
  • At least one significant prior experience managing a full-stack engineering squad (2-3 years)
  • Strong leadership to guide your team
  • Extensive technical expertise for coaching
  • Pragmatic mindset for identifying clear, effective solutions
  • Comfortable building, running and continuously iterating on squad rituals
  • Comfortable collaborating with cross-functional counterparts: PM, Product Designer
  • Comfortable supporting and coaching your IC direct reports, and giving them regular feedback
  • Comfortable managing in both French and English, in a remote context
  • Comfortable being hands-on in a full-stack context, through code and design document review or code contributions
Job Responsibility
Job Responsibility
  • Technical Leadership at Squad Level: Collaborate with Product Manager & Designer on roadmap, project management, and strategic planning
  • Provide technical guidance and architectural oversight, with selective hands-on coding (20-30%)
  • Enable engineers through architectural guidance, technical decision-making, and removing technical blockers
  • Oversee code review process and standards
  • Ensure team meets velocity and quality targets through effective prioritization and resource allocation
  • Define and communicate technical direction and architecture decisions for the squad
  • Analyze requirements, assess feasibility, and ensure appropriate technical documentation
  • Guarantee project goal achievement, identify risks early, and orchestrate solutions to blockers
  • Champion best practices (unit testing, TDD, CI/CD, etc.) in collaboration with the QA team
  • Drive implementation of clean code principles, testing standards, release processes, and pair programming culture
What we offer
What we offer
  • Four-day week
  • Professional development plan
  • Sick child leave
  • Mental health benefits
  • Employee Resource Groups (ERG)
  • Fulltime
Read More
Arrow Right

Senior Platform Engineer

Glide is looking for a Senior Platform Engineer to join our Infrastructure team ...
Location
Location
Salary
Salary:
Not provided
glideapps.com Logo
Glide
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience as a platform engineer/SRE
  • 3+ years experience building and maintaining highly available and scalable distributed data sources
  • Experience with Google Cloud Platform services like Cloud SQL, Cloud Run, AlloyDB, or equivalent
  • Experience orchestrating complex systems with Kubernetes
  • Proficiency in TypeScript development
  • Strong SQL skills
  • can speak to covering index optimization strategies
  • Experience designing, building and running data-intensive event-driven architectures
  • You are a clear and effective communicator, be it when you write code, write emails, or explain complex technical issues to non-technical co-workers
  • Passionate and self-motivated, with a demonstrated ability to work in a fast-paced and evolving environment
Job Responsibility
Job Responsibility
  • Managing our existing infrastructure in GCP
  • Driving our platform evolution as the complexity and sophistication of our product only increases
  • Managing our Github/GH Actions based build pipeline
  • Provide build, test, and runtime infrastructure to service teams
  • Ensure patterns are established (e.g., for database throttling, request rate limiting, etc…) to protect Glide’s uptime
  • Monitor infrastructure costs and coordinate improvements when necessary
  • Drive SRE tooling and best practices around observability and alerting
  • Write, review, and maintain code primarily in TypeScript
  • Write architecture briefs and proposals, carry out code experiments, and build prototypes to learn how we can achieve reliable scale with our systems
  • Provide technical leadership, mentorship, pairing opportunities, and code review to encourage the growth of others
What we offer
What we offer
  • competitive salary and benefits package
  • a supportive and dynamic remote work environment
  • opportunities for career growth
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Infrastructure

You’ll help shape the future of infrastructure automation for law enforcement sy...
Location
Location
United States , Seattle; Boston
Salary
Salary:
141000.00 - 225600.00 USD / Year
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience)
  • 8+ years of professional software development experience
  • Strong background building cloud-native, distributed solutions
  • Experience designing tooling and automation to simplify the operational management of SaaS/PaaS systems
  • Proficiency in backend services with multiple managed languages (e.g., Java, Scala, Go, C#, or similar)
  • Expertise with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation) and building modular, reusable, testable components
  • Familiarity with Kubernetes platforms (e.g., AKS, EKS, or similar)
  • Hands-on experience with CI/CD platforms for automating infrastructure, builds, testing, and releases
  • Strong collaboration and communication skills, with empathy for the needs of engineering teams
Job Responsibility
Job Responsibility
  • Lead engineering architecture design reviews
  • Set a high technical bar for the team through code and architecture design reviews
  • Mentoring engineers
  • Working across teams with Product, Design, and Engineering to create integrated solutions that delight our customers
  • Improve our Engineering process, including long-term thinking, sprint planning and stand-ups
  • Building services that adhere to our high bar on availability and latency in this mission-critical space
  • Working with the latest open source technologies
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Engineer - Postgres

ClickHouse is expanding its cloud data platform across AWS, GCP, and Azure—addin...
Location
Location
United States
Salary
Salary:
140000.00 - 208000.00 USD / Year
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years in SRE, DevOps, or infrastructure engineering, with a track record of running distributed, production-grade systems
  • Solid understanding of Postgres operations, scaling, and performance tuning
  • Deep hands-on experience across AWS, with exposure to GCP and Azure
  • comfortable navigating multi-cloud topologies
  • Proficient with Terraform, Kubernetes, and container-based infrastructure
  • Strong Go development skills (or willingness to write and own production Go code)
  • Familiar with tools like Prometheus, Grafana, Loki, OpenTelemetry, or equivalents
  • Deep understanding of SLOs, incident response, and continuous improvement in service reliability
  • You operate with a founder’s mentality — hands-on, resourceful, and willing to dive deep to get things done. You take pride in hard work, autonomy, and shipping impactful systems
Job Responsibility
Job Responsibility
  • Lead reliability and operations for ClickHouse’s Postgres integration — upgrades, patching, maintenance, and scaling
  • Design and implement automation for provisioning, deployments, and service lifecycle management across AWS, GCP, and Azure
  • Develop infrastructure-as-code using Terraform and modern CI/CD tooling to ensure consistent, repeatable deployments
  • Contribute Go-based tooling and services that improve automation, observability, and developer experience
  • Own observability and monitoring, ensuring robust alerting, metrics, and tracing across environments
  • Drive incident management and postmortem practices that strengthen reliability and learning loops
  • Collaborate cross-functionally with platform, networking, and product teams to improve service operability
  • Mentor and enable engineers, helping the team scale effectively as customer adoption grows
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Engineer - Postgres

ClickHouse is expanding its cloud data platform across AWS, GCP, and Azure—addin...
Location
Location
India
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years in SRE, DevOps, or infrastructure engineering, with a track record of running distributed, production-grade systems
  • Solid understanding of Postgres operations, scaling, and performance tuning
  • Deep hands-on experience across AWS, with exposure to GCP and Azure
  • comfortable navigating multi-cloud topologies
  • Proficient with Terraform, Kubernetes, and container-based infrastructure
  • Strong Go development skills (or willingness to write and own production Go code)
  • Familiar with tools like Prometheus, Grafana, Loki, OpenTelemetry, or equivalents
  • Deep understanding of SLOs, incident response, and continuous improvement in service reliability
  • You operate with a founder’s mentality — hands-on, resourceful, and willing to dive deep to get things done. You take pride in hard work, autonomy, and shipping impactful systems
Job Responsibility
Job Responsibility
  • Lead reliability and operations for ClickHouse’s Postgres integration — upgrades, patching, maintenance, and scaling
  • Design and implement automation for provisioning, deployments, and service lifecycle management across AWS, GCP, and Azure
  • Develop infrastructure-as-code using Terraform and modern CI/CD tooling to ensure consistent, repeatable deployments
  • Contribute Go-based tooling and services that improve automation, observability, and developer experience
  • Own observability and monitoring, ensuring robust alerting, metrics, and tracing across environments
  • Drive incident management and postmortem practices that strengthen reliability and learning loops
  • Collaborate cross-functionally with platform, networking, and product teams to improve service operability
  • Mentor and enable engineers, helping the team scale effectively as customer adoption grows
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Senior DevOps Engineer

We're looking for a seasoned Sr DevOps Engineer to help drive the reliability, s...
Location
Location
Bulgaria , Sofia
Salary
Salary:
Not provided
brandwatch.com Logo
Brandwatch
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5-7 years of experience in DevOps, SRE, or Software Engineering roles, with increasing responsibility in system design and operations
  • Extensive experience with containerization (Docker) and orchestration (Kubernetes) in production environments, including managing and scaling clusters
  • Proficiency in Infrastructure as Code (Terraform, CloudFormation, etc.) and configuration management tools (Ansible, Puppet) to automate infrastructure provisioning
  • Strong coding and scripting skills in languages like Python, Go, or Ruby, with the ability to build automation tools for system management
  • Deep knowledge of cloud platforms (AWS and/or GCP) and their services, with experience designing and operating cloud-based infrastructure at scale
  • Solid understanding of networking and security fundamentals in cloud and on-prem environments
  • Experience setting up and tuning monitoring/alerting systems (Prometheus, Grafana, etc.), and a thorough understanding of SRE best practices (SLIs, SLOs, incident management)
  • Strong problem-solving and communication skills, with a track record of working effectively in collaborative team environments
Job Responsibility
Job Responsibility
  • Oversee the reliability, performance, and security of critical production services from design to deployment, ensuring they meet our uptime and performance targets
  • Collaborate with development, QA, and product teams to build and maintain resilient infrastructure and efficient deployment pipelines
  • Automate infrastructure provisioning and software deployments using Infrastructure as Code and CI/CD tools, reducing manual work and errors
  • Participate in and improve our 24×7 on-call process, swiftly troubleshooting incidents and performing root cause analysis to prevent recurrence
  • Document and standardize processes and configurations, sharing knowledge to uplift the entire engineering team’s capabilities
Read More
Arrow Right

Senior SRE/DevOps Engineer

Metabase is the easiest way for people to get insights from their data. We're lo...
Location
Location
Salary
Salary:
Not provided
metabase.com Logo
Metabase
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Is thoughtful and careful
  • Compulsively automates everything and documents it
  • Is able to make solid technical judgements and back them up articulately
  • Has at least 5 years of experience building and operating production infrastructure, ideally on public cloud
  • Strong Kubernetes and AWS experience
  • Strong experience with IaC and Terraform
  • Can write high quality and readable code in a modern language (e.g. Python, Go, etc.)
  • Experience with modern monitoring stacks (e.g Prometheus/Grafana/Datadog)
Job Responsibility
Job Responsibility
  • Own and operate our application stack and AWS infrastructure to orchestrate and manage our hosted customer instances of Metabase
  • Debug runtime issues across the different levels of our application stack and hosting stack
  • Develop and build our internal tooling and automation to manage the lifecycle of a hosted Metabase installation, from purchase to deployment, zero-downtime upgrades, and general operational health
  • Continuously improve our automated deployments and testing
What we offer
What we offer
  • Flexibility (define your own schedule and work from wherever you want)
  • autonomy
  • an environment that fosters growth, learning, and development
  • Fulltime
Read More
Arrow Right