SRE Engineer Job at ResMed

Sre Engineer

We are looking for a highly motivated SRE Engineer with strong hands-on experien...

Location

Poland

Salary:

15109.00 - 22658.00 PLN / Month

HSBC

Expiration Date

July 15, 2026

Requirements

Proven experience in production/application support for business-critical systems, with strong troubleshooting and prioritization skills
Strong hands-on experience in Java application support and development with knowledge of Spring Boot, REST APIs and microservices architecture
Good understanding of SQL and database technologies such as MySQL, PostgreSQL, Oracle, or MongoDB
Experience with CI/CD tools such as Jenkins, GitHub Actions, GitLab CI or similar
Hands on exposure to DevOps and automation practices. Experience working with Docker, Kubernetes, and containerized deployments
Experience with monitoring and observability tools like Grafana, Prometheus, ELK, Splunk
Familiarity with Linux/Unix environments and scripting knowledge in Shell, Python or Bash
Understanding of ITIL processes like Incident Management, Problem Management, Root Cause Analysis, Release Management, and Production Support processes
Ability to work independently in a fast-paced environment with strong ownership and accountability. Experience improving operational maturity (monitoring strategy, alert quality, postmortems, reducing MTTR)
Strong ownership mindset: track actions to closure and ensure outcomes are properly evidenced and documented

Job Responsibility

Support, maintain, troubleshoot an enhanced enterprise applications built using Java and related technologies
Participate in application enhancement deployment release activities, maintenance, and production support. Troubleshoot application infrastructure and production issues across distributed systems
Own production incidents end to end including triage impact assessment stakeholder communication resolution RCA and follow up actions
Build and maintain CI/CD pipelines for automated build testing deployment and release processes. Collaborate with DevOps and infrastructure teams to improve deployment automation and environment stability. Work with cloud and containerized environments to support deployments environment stability and operational efficiency
Perform performance tuning debugging and optimization of applications and services
Drive continuous improvement initiatives focused on reliability automation observability and operational excellence
Maintain strong technical documentations including runbooks deployment guides troubleshooting steps and post incident reports
Collaborate effectively with global stakeholders and team across different time zones

What we offer

Additional bonuses for recognition awards
Multisport card
Private medical care
Life insurance
One-time reimbursement of home office set-up (up to 800 PLN)
Cafeteria platform
Employee assistance program
Additional contributions to PPK scheme
Corporate parties & events
CSR initiatives

Fulltime

New

Sre engineer

This is a challenging and exciting opportunity to work on Collaboration Service ...

Location

India , Chennai

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Must be a self-starter, effective listener & communicator, problem solver and team player
5- 8 yrs of technical support experience
Experience in providing technical support for globally distributed & complex service platform
Experience in software deployment on large global infrastructure
Strong understanding of collaboration technology stack and at least one years of hands-on experience
2+ years of experience with Linux (shell/batch scripting & server management), Java, LDAP, Active Directory, Oracle/SQL Server database (SQL, backup & recovery)
Good understanding of SaaS service and managing such service as an enterprise level consumer of vendor service
Bachelor’s Degree in Computer Science or a Related Field

Job Responsibility

Support Management: Technical Support for Collaboration Service Platform
Technical Support for Jira integration with developer pipeline service
Technical support for Jira on-premise to Jira Cloud migration
Troubleshoot technical issues and manage customer expectation
Reduce recurring issue using root cause analysis
Proactively monitor and manage infrastructure stability and performance
Technology Management: Upgrade existing product along with Engineering
Adopt new technology that gives competitive advantage to Citigroup Developers
Identify automation opportunity and implement solution to improve operational efficiency and user experience
Adopt AI solution to improve developer experience

Fulltime

New

SRE Engineer

This is a challenging and exciting opportunity to work on Code Quality Service (...

Location

India , Chennai

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

5-8 years of experience in SRE
Must be a self-starter, effective listener & communicator, problem solver and team player
Experience in providing technical support for globally distributed & complex service platform
Experience in software deployment on large global infrastructure
Good understanding of developer pipeline concepts & Linux shell scripting
Minimum of 5 years of technical support experience
Strong understanding of Code Quality (like SonarQube) technology stack and at least two years of hands-on experience
Bachelor’s Degree in Computer Science or a Related Field

Job Responsibility

Operational SME for Code Quality Service Platform (like SonarQube)
Operational SME for Developer Pipeline integration with Code Quality Service
Troubleshoot technical issues and manage customer expectation
Reduce recurring issue using root cause analysis
Proactively monitor and manage infrastructure stability and performance
Upgrade existing product along with Engineering
Adopt new AI enabled technology that gives competitive advantage to Citigroup Developers as well as improved support engineer productivity
Identify automation opportunity and implement solution to improve operational efficiency and user experience
Manage infrastructure level risk & compliance issues as per Citi guidelines

Fulltime

Cloud Engineer / Site Reliability Engineer (SRE)

Location

United States , Orlando

Salary:

75.00 USD / Hour

Beacon Hill

Expiration Date

Until further notice

Requirements

Strong hands-on AWS experience with solid understanding of core AWS services
Experience supporting and troubleshooting AWS and Azure cloud environments
Terraform experience for Infrastructure as Code
Docker/containerization experience
Strong troubleshooting and problem-solving skills
Ability to translate requirements into technical execution
Experience performing cloud architecture and diagramming
Experience supporting deployments, environments, and site standups
Strong communication and collaboration skills

Job Responsibility

Support cloud infrastructure and deployments across AWS and Azure
Troubleshoot infrastructure and application-related cloud issues
Build and maintain Terraform-based infrastructure
Support Docker/containerized environments
Create architecture diagrams and technical documentation
Work closely with engineering and project teams to execute cloud initiatives
Assist with automation and operational improvement efforts

Fulltime

New

Senior Research Engineer (SRE) - Automation Transformation

The purpose of the Automation Transformation, Senior Advisor (SRE) role is to wo...

Location

United Kingdom , Coventry; Liverpool

Salary:

55000.00 - 65000.00 GBP / Year

Manufacturing Technology Centre

Expiration Date

Until further notice

Requirements

Communication
Innovation
Knowledge: automation and robotics technologies, solutions and typical applications
business/manufacturing improvement techniques e.g. value stream mapping, process mapping, lean techniques
manufacturing operational efficiency improvement methods e.g. flow and layout
business process re-engineering for innovation
leadership and change management
coaching and mentoring for performance improvement
supply chain assessments and strategy
Microsoft office 365 product set

Job Responsibility

leading projects with manufacturing clients
running client discovery workshops to identify information, systems requirements, and opportunity for Adopting Automation
using the MTC structured engagement process
Leadership of Client facing projects, to include leading project Teams, comprising engineers from the Automation Transformation Team and other MTC Technology Groups
Project and technical leadership of project team members
Technical governance of project outcomes
Understanding client problems and identifying how MTC can support their resolution
Identification of appropriate Automation technologies
Using assessment tools to identify the current state of a customer’s readiness to adopt Automation
Running workshops to identify Automation/Technology opportunities to support strategic growth

What we offer

Competitive Salary
Excellent Pension Scheme
Flexible Working

Fulltime

Senior Software Engineer - SRE

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform ...

Location

India , Bengaluru

Salary:

Not provided

Roku

Expiration Date

Until further notice

Requirements

Preferably 8+ years of experience in DevOps/SRE roles, with demonstrated expertise in implementing SRE principles, SLO/SLI frameworks, and error budget policies in production environments
Deep experience with observability and monitoring platforms such as Prometheus, Grafana, Datadog, New Relic, or equivalent, including experience building custom dashboards, alerts, and SLO-based monitoring
Strong background in incident management, including experience as an Incident Commander, conducting blameless postmortems, and implementing systematic reliability improvements based on incident learnings
Strong understanding of distributed systems and reliability engineering, including failure modes, fault tolerance patterns, circuit breakers, bulkheads, rate limiting, and graceful degradation strategies
Experience with a number of the following: Kubernetes, Docker, Service Mesh such as Istio, Envoy, Linkerd, Solo & ECS
Experience in cloud-focused software development, preferably in Go, Python, or other object-oriented programming languages
Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation
Experience with CI/CD automation, including GitLab pipelines and other related tools
Strong hands-on experience with cloud platforms such as AWS, GCP or Azure
Proven track record of implementing scalable, high-performance infrastructure solutions in fast-paced, dynamic environments

Job Responsibility

Design & Infrastructure
Contribute to postmortem culture by facilitating comprehensive, blameless post-incident reviews that identify root causes, contributing factors, and actionable remediation items. Track incident trends to identify systemic issues and prioritize reliability improvements
Implement chaos engineering practices to proactively identify failure modes, validate system resilience, and build confidence in recovery procedures. Conduct game days and disaster recovery exercises
SRE Process & Principles Implementation
Deploy and evolve SRE practices across the organization by establishing core SRE principles, frameworks, and methodologies. Define and implement service reliability practices, including Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets, to balance innovation velocity with system reliability
Manage Error Budgets as a mechanism for making data-driven decisions about feature velocity vs. reliability. Track, report, and enforce error budget policies, facilitating conversations between engineering and product teams about risk tolerance and release decisions
Reliability Engineering & Infrastructure
Reduce toil through automation by identifying repetitive operational work and systematically eliminating it through infrastructure-as-code, automation frameworks, and intelligent tooling. Measure and track toil reduction efforts, aiming to keep toil below 50% of team time
Implement capacity planning processes that ensure systems have adequate headroom to meet SLOs during peak traffic, unexpected load spikes, and degraded states. Develop predictive models and automated scaling mechanisms
Observability, Monitoring & Reporting

What we offer

global access to mental health and financial wellness support and resources
healthcare (medical, dental, and vision)
life, accident, disability, commuter, and retirement options (401(k)/pension)
time off in accordance with local leave policies

Fulltime

Site Reliability Engineer (SRE)

Wissen Technology is hiring for Site Reliability Engineer (SRE). At Wissen Techn...

Location

India , Bangalore South

Salary:

Not provided

Wissen

Expiration Date

Until further notice

Requirements

Strong experience in Java Application Support
Proven expertise with Terraform
Solid Cloud knowledge (AWS, Azure, or GCP)
9+ years of professional experience in SRE or related roles
Hands-on experience with MongoDB and Kafka
Experience with GitHub Actions for CI/CD automation
Strong problem-solving skills and ability to work independently during critical incidents
Excellent communication and stakeholder management skills

Job Responsibility

Ensure reliability, scalability, and performance of mission-critical systems
Provide Java application support and troubleshoot production issues
Implement and maintain Infrastructure as Code (IaC) using Terraform
Manage and optimize cloud infrastructure across AWS, Azure, or GCP
Automate CI/CD pipelines using GitHub Actions
Administer and support MongoDB and Kafka clusters
Drive incident response, root cause analysis, and postmortem documentation
Collaborate with cross-functional teams to enhance observability, monitoring, and alerting capabilities

Fulltime

Senior Site Reliability Engineer (SRE)

The Senior SRE is responsible for deployment, updates, and operational support f...

Location

India , Chennai

Salary:

Not provided

Dalet

Expiration Date

Until further notice

Requirements

Cloud platforms: AWS, Azure
Containerisation & Orchestration: Kubernetes
Infrastructure as Code: Terraform
Configuration Management: Ansible
Packaging & Deployment: Helm
Databases: MariaDB, MongoDB
Monitoring, observability, networking, and cloud security.

Job Responsibility

Act as a senior technical authority for APAC Site Reliability Engineering activities
Drive best practices in reliability, operations, and engineering standards
Promote technical excellence, collaboration, and accountability across stakeholders
Make infrastructure complexity transparent to both internal teams and customers, ensuring a consistently excellent client experience
Implement, track, and evolve service performance measures such as SLAs, SLOs, and SLIs
Anticipate risks related to service availability, capacity, performance regressions, and security vulnerabilities
Drive continuous improvement, including leading and facilitating Root Cause Analysis (RCA) activities
Ensure timely execution of deployments, upgrades, maintenance activities, and change requests
Anticipate workload, plan deliverables, and ensure qualification/validation of upcoming tasks
Collaborate closely with engineering to improve platform components, automation, and operational processes

What we offer

Great career opportunities around the world
Truly collaborative environment with supportive leadership
Cutting edge technologies (AI, Cloud, Cybersecurity...)
Talented and passionate team members
Fun working environment

Fulltime

Select Country

SRE Engineer

Job Description

Job Responsibility

Requirements

Looking for more opportunities?

SRE Engineer

Sre Engineer

Sre engineer

SRE Engineer

Cloud Engineer / Site Reliability Engineer (SRE)

Senior Research Engineer (SRE) - Automation Transformation

Senior Software Engineer - SRE

Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

Our AI answers in your language