CrawlJobs Logo

Senior SRE Manager

aiven.io Logo

Aiven Deutschland GmbH

Location Icon

Location:
New Zealand , Auckland

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are in search of a Senior Site Reliability Engineering Manager to build and lead an ANZ-based team of talented Site Reliability Engineers. The team will provide reliable services for Aiven customers by proactively looking after the health of the Aiven platform during business hours of the 24/7/365 Aiven global operation. The team will also handle incidents and coordination to fix those in cooperation with other units. The team's work is heavily oriented toward software development and automation.

Job Responsibility:

  • Coordinate and develop global operations work together with our EMEA and NA based SRE Managers and their respective teams
  • Lead a team of ANZ-based SREs
  • Enable the team to drive important software and process initiatives for the Aiven platform plan on-call shift rotations, ensuring adequate team availability in Aiven's global follow-the-sun operation
  • Run a tight, efficient operation where decision-making is based on metrics and data and work is prioritized accordingly

Requirements:

  • Experience managing or leading software development or operations team and defining, setting measurable success criteria for engineering roles with a focus on hiring, training and leading a team of engineers
  • Prior experience working as an engineer at a software company, enabling you to contribute to technical discussions at a senior level
  • Previous knowledge working with other internal teams to successfully resolve complex cases with excellent communication skills

Nice to have:

  • Development application or some technical hands-on capacity with two or more technologies in Aiven's portfolio for multiple years
  • Troubleshooting, root-causing, and triaging support and/or operational cases
  • Incident management and facilitating retrospectives and postmortems
What we offer:
  • Participate in Aiven's equity plan
  • Balance work and life with our hybrid work policy
  • Choose the equipment you need to set yourself up for success
  • Use your Professional Development Plan budget for learning opportunities
  • Receive holistic wellbeing support through our global Employee Assistance Program
  • Inquire about our Global Time Off Commitment (Parental and Sick Leave, as well as Personal Time)
  • Enjoy country-specific benefits for our global cast

Additional Information:

Job Posted:
April 24, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior SRE Manager

Senior Product Manager

As an important member of the Dev Solutions product management team, you will co...
Location
Location
United States , San Francisco; Seattle; Austin; New York
Salary
Salary:
134500.00 - 216000.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years in Product Management with 0-1 products in the software engineering/DevOps market
  • Familiarity with engineering, APIs, integrations, and event-driven systems
  • Familiarity with software engineering, DevOps, or IT operations markets
  • Experience in Agile, Software Development, DevOps, SRE, or developer-focused products
Job Responsibility
Job Responsibility
  • Report to Head of Product for Compass
  • Inspire and energize the team by creating future product visions, sharing customer challenges, analyzing market trends, and making strategic decisions
  • Lead software teams to create value for customers and the business by iterating on functionality through experimentation
  • Identify customer problems, highlight trends and deliver solutions quickly
  • Scope and define product experiments that improve organisational goals and customer outcomes
  • Oversee the entire lifecycle of ideas from idea to impact assessment and iteration
  • Collaborate with design and engineering to deliver delightful products
  • Work with teams across Atlassian to leverage and enhance platform capabilities
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Senior Product Manager

Be part of a team where your work takes center stage, shaping the future of soft...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
jfrog.com Logo
JFrog
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of industry experience
  • minimum 6+ years of Product Management experience in the Tech Product / SaaS industry
  • Proven success working on large-scale products with thousands of customers
  • Technical experience in Engineering, DevOps, SRE, or Tech Support (Advantage)
  • A customer-centric, data-driven approach to product development
  • Strong interpersonal skills and an ability for influencing diverse stakeholders
Job Responsibility
Job Responsibility
  • Drive Product Strategy: Identify market opportunities, define product vision, and gather customer requirements
  • Own the Lifecycle: Lead your product from ideation to launch, ensuring alignment with customer needs and high-quality execution
  • Leverage Data: Design, experiment, and iterate based on customer feedback, market research, and usage metrics
  • Collaborate Cross-Functionally: Partner with Sales, Support, Marketing, and Engineering to deliver impactful results
  • Build Relationships: Foster trust and alignment across internal and external stakeholders to bring your vision to life
Read More
Arrow Right

Senior Product Manager - AppTrust

At JFrog, we’re reinventing DevOps to help the world’s greatest companies innova...
Location
Location
Israel , Netanya/Tel Aviv
Salary
Salary:
Not provided
jfrog.com Logo
JFrog
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in E2E Product Management, preferably in B2B products and SaaS platforms
  • Experience driving elements of the product development lifecycle such as product vision, go-to-market strategy, driving requirements, UX, and product launch
  • Experience with user-facing products
  • solid understanding of UX and product design
  • Technical experience in Engineering, DevOps, SRE, and Tech Support — a huge advantage
  • Experience in driving strategic initiatives in a cross-organization environment
  • Excellent analytical, interpersonal, and problem-solving skills
Job Responsibility
Job Responsibility
  • Own the full cycle of product development including ideation, competitive analysis, client validation, discovery with R&D, spec writing, launching and monitoring
  • Understand customer needs and gather product requirements, identify market opportunities, and define product vision and strategy
  • Work closely with multiple teams within the company to deliver a high-quality B2D product on schedule, including Sales, Support, Marketing, and Engineering
  • Master the product and lead the requirements through the full lifecycle, from ideation to development and launch
  • Build positive relationships and trust through strong cross-team interactions, and get buy-in for the product vision across internal and external stakeholders
  • Identify, design, experiment, and iterate product decisions by leveraging data and evidence gathered from customer usage and interviews, market research, and usage/adoption metrics
Read More
Arrow Right

FX Applications Support Senior Analyst

As an OpsTech Application Support Analyst, the candidate will play a pivotal rol...
Location
Location
Australia , Sydney
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5-8 years experience in an Application Support role
  • experience installing, configuring or supporting business applications
  • experience with some programming languages and willingness/ability to learn
  • advanced execution capabilities and ability to adjust quickly to changes and re-prioritization
  • effective written and verbal communications including ability to explain technical issues in simple terms that non-IT staff can understand
  • demonstrated analytical skills
  • issue tracking and reporting using tools
  • knowledge/experience of problem Management Tools
  • good all-round technical skills
  • effectively share information with other support team members and with other technology teams
Job Responsibility
Job Responsibility
  • Provide technical and business support for users of Citi Applications
  • maintain application systems
  • manage, maintain and support applications
  • perform start of day checks, continuous monitoring, and regional handover
  • develop and maintain technical support documentation
  • maximize the potential of applications
  • assess risk and impact of production issues and escalate
  • ensure storage and archiving procedures are functioning correctly
  • formulate and define scope and objectives for complex application enhancements
  • prioritize bug fixes and support tooling requirements
What we offer
What we offer
  • Rewarding work in a supportive environment
  • clear opportunities for progression
  • exciting company benefits
  • Fulltime
Read More
Arrow Right

Engineering Manager, Infrastructure

As an Engineering Manager for the Infrastructure team, you’ll lead the engineers...
Location
Location
Canada; United States
Salary
Salary:
195000.00 - 285000.00 USD / Year
apollo.io Logo
Apollo.io
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of hands-on software or infrastructure engineering experience
  • 2+ years of experience leading teams of senior and staff-level engineers in platform, SRE, or infrastructure domains
  • Proven ability to design and operate large-scale distributed systems in cloud environments (preferably GCP or AWS)
  • Expertise with Kubernetes, Docker, Terraform, Ubuntu, and CI/CD pipelines
  • Familiarity with observability tools (Grafana, Prometheus, ELK, Datadog, NewRelic) and performance tuning
  • Strong grounding in networking, security, and reliability principles
  • Experience managing infrastructure costs, availability SLAs, and high-throughput systems at scale
Job Responsibility
Job Responsibility
  • Lead, coach, and grow a distributed team of high-impact Infrastructure Engineers
  • Partner with senior engineering leadership on strategic initiatives such as cloud migration, infrastructure scaling, platform reliability, and cost efficiency
  • Define and implement modern operational excellence practices, including SLOs, error budgets, incident reviews, and performance monitoring
  • Guide technical decision-making across key areas like Kubernetes, GCP, observability, networking, CI/CD, and IaC (Terraform, Ansible)
  • Collaborate with AI, Data, and Product Engineering teams to ensure infrastructure scalability for ML and AI-native workloads
  • Run effective 1:1s, career development conversations, and quarterly performance reviews
  • Support recruiting efforts to attract top engineering talent across time zones
What we offer
What we offer
  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • At least 10 paid holidays per year
  • Flex PTO
  • Parental leave
  • Employee assistance program and wellbeing benefits
  • Global travel coverage
  • Life/AD&D/STD/LTD insurance
  • FSA/HSA and medical, dental, and vision benefits
  • Fulltime
Read More
Arrow Right

Senior DevOps Engineer

We're looking for a seasoned Sr DevOps Engineer to help drive the reliability, s...
Location
Location
Bulgaria , Sofia
Salary
Salary:
Not provided
brandwatch.com Logo
Brandwatch
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5-7 years of experience in DevOps, SRE, or Software Engineering roles, with increasing responsibility in system design and operations
  • Extensive experience with containerization (Docker) and orchestration (Kubernetes) in production environments, including managing and scaling clusters
  • Proficiency in Infrastructure as Code (Terraform, CloudFormation, etc.) and configuration management tools (Ansible, Puppet) to automate infrastructure provisioning
  • Strong coding and scripting skills in languages like Python, Go, or Ruby, with the ability to build automation tools for system management
  • Deep knowledge of cloud platforms (AWS and/or GCP) and their services, with experience designing and operating cloud-based infrastructure at scale
  • Solid understanding of networking and security fundamentals in cloud and on-prem environments
  • Experience setting up and tuning monitoring/alerting systems (Prometheus, Grafana, etc.), and a thorough understanding of SRE best practices (SLIs, SLOs, incident management)
  • Strong problem-solving and communication skills, with a track record of working effectively in collaborative team environments
Job Responsibility
Job Responsibility
  • Oversee the reliability, performance, and security of critical production services from design to deployment, ensuring they meet our uptime and performance targets
  • Collaborate with development, QA, and product teams to build and maintain resilient infrastructure and efficient deployment pipelines
  • Automate infrastructure provisioning and software deployments using Infrastructure as Code and CI/CD tools, reducing manual work and errors
  • Participate in and improve our 24×7 on-call process, swiftly troubleshooting incidents and performing root cause analysis to prevent recurrence
  • Document and standardize processes and configurations, sharing knowledge to uplift the entire engineering team’s capabilities
Read More
Arrow Right

Senior Infrastructure Engineer - Postgres

ClickHouse is expanding its cloud data platform across AWS, GCP, and Azure—addin...
Location
Location
India
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years in SRE, DevOps, or infrastructure engineering, with a track record of running distributed, production-grade systems
  • Solid understanding of Postgres operations, scaling, and performance tuning
  • Deep hands-on experience across AWS, with exposure to GCP and Azure
  • comfortable navigating multi-cloud topologies
  • Proficient with Terraform, Kubernetes, and container-based infrastructure
  • Strong Go development skills (or willingness to write and own production Go code)
  • Familiar with tools like Prometheus, Grafana, Loki, OpenTelemetry, or equivalents
  • Deep understanding of SLOs, incident response, and continuous improvement in service reliability
  • You operate with a founder’s mentality — hands-on, resourceful, and willing to dive deep to get things done. You take pride in hard work, autonomy, and shipping impactful systems
Job Responsibility
Job Responsibility
  • Lead reliability and operations for ClickHouse’s Postgres integration — upgrades, patching, maintenance, and scaling
  • Design and implement automation for provisioning, deployments, and service lifecycle management across AWS, GCP, and Azure
  • Develop infrastructure-as-code using Terraform and modern CI/CD tooling to ensure consistent, repeatable deployments
  • Contribute Go-based tooling and services that improve automation, observability, and developer experience
  • Own observability and monitoring, ensuring robust alerting, metrics, and tracing across environments
  • Drive incident management and postmortem practices that strengthen reliability and learning loops
  • Collaborate cross-functionally with platform, networking, and product teams to improve service operability
  • Mentor and enable engineers, helping the team scale effectively as customer adoption grows
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Senior Infrastructure Engineer - Postgres

ClickHouse is expanding its cloud data platform across AWS, GCP, and Azure—addin...
Location
Location
United States
Salary
Salary:
140000.00 - 208000.00 USD / Year
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years in SRE, DevOps, or infrastructure engineering, with a track record of running distributed, production-grade systems
  • Solid understanding of Postgres operations, scaling, and performance tuning
  • Deep hands-on experience across AWS, with exposure to GCP and Azure
  • comfortable navigating multi-cloud topologies
  • Proficient with Terraform, Kubernetes, and container-based infrastructure
  • Strong Go development skills (or willingness to write and own production Go code)
  • Familiar with tools like Prometheus, Grafana, Loki, OpenTelemetry, or equivalents
  • Deep understanding of SLOs, incident response, and continuous improvement in service reliability
  • You operate with a founder’s mentality — hands-on, resourceful, and willing to dive deep to get things done. You take pride in hard work, autonomy, and shipping impactful systems
Job Responsibility
Job Responsibility
  • Lead reliability and operations for ClickHouse’s Postgres integration — upgrades, patching, maintenance, and scaling
  • Design and implement automation for provisioning, deployments, and service lifecycle management across AWS, GCP, and Azure
  • Develop infrastructure-as-code using Terraform and modern CI/CD tooling to ensure consistent, repeatable deployments
  • Contribute Go-based tooling and services that improve automation, observability, and developer experience
  • Own observability and monitoring, ensuring robust alerting, metrics, and tracing across environments
  • Drive incident management and postmortem practices that strengthen reliability and learning loops
  • Collaborate cross-functionally with platform, networking, and product teams to improve service operability
  • Mentor and enable engineers, helping the team scale effectively as customer adoption grows
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right