CrawlJobs Logo

Sre Engineer

Poland Employment contract 15109.00 - 22658.00 PLN / Month · Job Posted June 16, 2026
Apply Position
Job Link Share

Job Description

We are looking for a highly motivated SRE Engineer with strong hands-on experience in application support service management production stability and DevOps practices the role primarily focuses on the platform is compile with the global, regional and market specific controls, while ensuring reliability performance automation, operational excellence across environments and continuously focusing on observability of the platform. Toil Reduction - Continuously eliminate manual operational work through AI-powered runbooks, automated pipelines, and intelligent guardrails - freeing the team to focus on higher-value stability initiatives. The candidate will work closely with cross functional teams to deliver stable, secure, and high performing applications.

Job Responsibility

  • Support, maintain, troubleshoot an enhanced enterprise applications built using Java and related technologies
  • Participate in application enhancement deployment release activities, maintenance, and production support. Troubleshoot application infrastructure and production issues across distributed systems
  • Own production incidents end to end including triage impact assessment stakeholder communication resolution RCA and follow up actions
  • Build and maintain CI/CD pipelines for automated build testing deployment and release processes. Collaborate with DevOps and infrastructure teams to improve deployment automation and environment stability. Work with cloud and containerized environments to support deployments environment stability and operational efficiency
  • Perform performance tuning debugging and optimization of applications and services
  • Drive continuous improvement initiatives focused on reliability automation observability and operational excellence
  • Maintain strong technical documentations including runbooks deployment guides troubleshooting steps and post incident reports
  • Collaborate effectively with global stakeholders and team across different time zones

Requirements

  • Proven experience in production/application support for business-critical systems, with strong troubleshooting and prioritization skills
  • Strong hands-on experience in Java application support and development with knowledge of Spring Boot, REST APIs and microservices architecture
  • Good understanding of SQL and database technologies such as MySQL, PostgreSQL, Oracle, or MongoDB
  • Experience with CI/CD tools such as Jenkins, GitHub Actions, GitLab CI or similar
  • Hands on exposure to DevOps and automation practices. Experience working with Docker, Kubernetes, and containerized deployments
  • Experience with monitoring and observability tools like Grafana, Prometheus, ELK, Splunk
  • Familiarity with Linux/Unix environments and scripting knowledge in Shell, Python or Bash
  • Understanding of ITIL processes like Incident Management, Problem Management, Root Cause Analysis, Release Management, and Production Support processes
  • Ability to work independently in a fast-paced environment with strong ownership and accountability. Experience improving operational maturity (monitoring strategy, alert quality, postmortems, reducing MTTR)
  • Strong ownership mindset: track actions to closure and ensure outcomes are properly evidenced and documented

Nice to have

Solid knowledge of derivatives trading systems and the trade lifecycle (front-to-back understanding) is definitely a plus!

What we offer

  • Additional bonuses for recognition awards
  • Multisport card
  • Private medical care
  • Life insurance
  • One-time reimbursement of home office set-up (up to 800 PLN)
  • Cafeteria platform
  • Employee assistance program
  • Additional contributions to PPK scheme
  • Corporate parties & events
  • CSR initiatives
  • Nursery discounts
  • Financial support with trainings and education
  • Social fund
  • Flexible working hours
  • Free parking

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Sre Engineer

8 matching positions

SRE Engineer

We’re looking for a Site Reliability Engineer (SRE) to join our Global SRE team ...
Location
Location
United States
Salary
Salary:
Not provided
resmed.com Logo
ResMed
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering roles
  • Experience operating Kubernetes-based production systems
  • Hands-on experience with AWS and infrastructure-as-code tools
  • Experience designing and supporting CI/CD pipelines and automated deployments
  • Proficiency in Python for automation, tooling, or backend services
  • Solid understanding of distributed systems and networking concepts
  • Experience with monitoring and observability platforms such as Datadog and CloudWatch
Job Responsibility
Job Responsibility
  • Ensure the reliability, availability, and resiliency of Resmed’s digital products by designing and operating fault-tolerant systems
  • Partner with product and platform teams to define and improve service health using operational and customer-experience metrics
  • Design, implement, and maintain monitoring, alerting, logging, and tracing solutions that provide real-time visibility into system behavior and customer experience
  • Analyze system performance, scalability, and capacity, and drive optimizations to improve efficiency and stability in cloud environments
  • Build automation and tooling to support deployments, scaling, incident response, and operational workflows
  • Participate in an on-call rotation as part of a globally distributed team, lead incident response efforts, troubleshoot production issues, conduct postmortems, and drive continuous improvement initiatives
  • Collaborate with security and compliance partners to support secure, privacy-aware, and compliant operations
  • Work closely with engineering teams to improve developer experience, operational maturity, and overall customer experience
  • Fulltime
Read More
Arrow Right

Cloud Engineer / Site Reliability Engineer (SRE)

Location
Location
United States , Orlando
Salary
Salary:
75.00 USD / Hour
bhsg.com Logo
Beacon Hill
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong hands-on AWS experience with solid understanding of core AWS services
  • Experience supporting and troubleshooting AWS and Azure cloud environments
  • Terraform experience for Infrastructure as Code
  • Docker/containerization experience
  • Strong troubleshooting and problem-solving skills
  • Ability to translate requirements into technical execution
  • Experience performing cloud architecture and diagramming
  • Experience supporting deployments, environments, and site standups
  • Strong communication and collaboration skills
Job Responsibility
Job Responsibility
  • Support cloud infrastructure and deployments across AWS and Azure
  • Troubleshoot infrastructure and application-related cloud issues
  • Build and maintain Terraform-based infrastructure
  • Support Docker/containerized environments
  • Create architecture diagrams and technical documentation
  • Work closely with engineering and project teams to execute cloud initiatives
  • Assist with automation and operational improvement efforts
  • Fulltime
Read More
Arrow Right

Senior Software Engineer - SRE

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform ...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
roku.com Logo
Roku
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Preferably 8+ years of experience in DevOps/SRE roles, with demonstrated expertise in implementing SRE principles, SLO/SLI frameworks, and error budget policies in production environments
  • Deep experience with observability and monitoring platforms such as Prometheus, Grafana, Datadog, New Relic, or equivalent, including experience building custom dashboards, alerts, and SLO-based monitoring
  • Strong background in incident management, including experience as an Incident Commander, conducting blameless postmortems, and implementing systematic reliability improvements based on incident learnings
  • Strong understanding of distributed systems and reliability engineering, including failure modes, fault tolerance patterns, circuit breakers, bulkheads, rate limiting, and graceful degradation strategies
  • Experience with a number of the following: Kubernetes, Docker, Service Mesh such as Istio, Envoy, Linkerd, Solo & ECS
  • Experience in cloud-focused software development, preferably in Go, Python, or other object-oriented programming languages
  • Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation
  • Experience with CI/CD automation, including GitLab pipelines and other related tools
  • Strong hands-on experience with cloud platforms such as AWS, GCP or Azure
  • Proven track record of implementing scalable, high-performance infrastructure solutions in fast-paced, dynamic environments
Job Responsibility
Job Responsibility
  • Design & Infrastructure
  • Contribute to postmortem culture by facilitating comprehensive, blameless post-incident reviews that identify root causes, contributing factors, and actionable remediation items. Track incident trends to identify systemic issues and prioritize reliability improvements
  • Implement chaos engineering practices to proactively identify failure modes, validate system resilience, and build confidence in recovery procedures. Conduct game days and disaster recovery exercises
  • SRE Process & Principles Implementation
  • Deploy and evolve SRE practices across the organization by establishing core SRE principles, frameworks, and methodologies. Define and implement service reliability practices, including Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets, to balance innovation velocity with system reliability
  • Manage Error Budgets as a mechanism for making data-driven decisions about feature velocity vs. reliability. Track, report, and enforce error budget policies, facilitating conversations between engineering and product teams about risk tolerance and release decisions
  • Reliability Engineering & Infrastructure
  • Reduce toil through automation by identifying repetitive operational work and systematically eliminating it through infrastructure-as-code, automation frameworks, and intelligent tooling. Measure and track toil reduction efforts, aiming to keep toil below 50% of team time
  • Implement capacity planning processes that ensure systems have adequate headroom to meet SLOs during peak traffic, unexpected load spikes, and degraded states. Develop predictive models and automated scaling mechanisms
  • Observability, Monitoring & Reporting
What we offer
What we offer
  • global access to mental health and financial wellness support and resources
  • healthcare (medical, dental, and vision)
  • life, accident, disability, commuter, and retirement options (401(k)/pension)
  • time off in accordance with local leave policies
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer (SRE)

Wissen Technology is hiring for Site Reliability Engineer (SRE). At Wissen Techn...
Location
Location
India , Bangalore South
Salary
Salary:
Not provided
votredircom.fr Logo
Wissen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience in Java Application Support
  • Proven expertise with Terraform
  • Solid Cloud knowledge (AWS, Azure, or GCP)
  • 9+ years of professional experience in SRE or related roles
  • Hands-on experience with MongoDB and Kafka
  • Experience with GitHub Actions for CI/CD automation
  • Strong problem-solving skills and ability to work independently during critical incidents
  • Excellent communication and stakeholder management skills
Job Responsibility
Job Responsibility
  • Ensure reliability, scalability, and performance of mission-critical systems
  • Provide Java application support and troubleshoot production issues
  • Implement and maintain Infrastructure as Code (IaC) using Terraform
  • Manage and optimize cloud infrastructure across AWS, Azure, or GCP
  • Automate CI/CD pipelines using GitHub Actions
  • Administer and support MongoDB and Kafka clusters
  • Drive incident response, root cause analysis, and postmortem documentation
  • Collaborate with cross-functional teams to enhance observability, monitoring, and alerting capabilities
  • Fulltime
Read More
Arrow Right

Senior Site Reliability Engineer (SRE)

The Senior SRE is responsible for deployment, updates, and operational support f...
Location
Location
India , Chennai
Salary
Salary:
Not provided
dalet.com Logo
Dalet
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Cloud platforms: AWS, Azure
  • Containerisation & Orchestration: Kubernetes
  • Infrastructure as Code: Terraform
  • Configuration Management: Ansible
  • Packaging & Deployment: Helm
  • Databases: MariaDB, MongoDB
  • Monitoring, observability, networking, and cloud security.
Job Responsibility
Job Responsibility
  • Act as a senior technical authority for APAC Site Reliability Engineering activities
  • Drive best practices in reliability, operations, and engineering standards
  • Promote technical excellence, collaboration, and accountability across stakeholders
  • Make infrastructure complexity transparent to both internal teams and customers, ensuring a consistently excellent client experience
  • Implement, track, and evolve service performance measures such as SLAs, SLOs, and SLIs
  • Anticipate risks related to service availability, capacity, performance regressions, and security vulnerabilities
  • Drive continuous improvement, including leading and facilitating Root Cause Analysis (RCA) activities
  • Ensure timely execution of deployments, upgrades, maintenance activities, and change requests
  • Anticipate workload, plan deliverables, and ensure qualification/validation of upcoming tasks
  • Collaborate closely with engineering to improve platform components, automation, and operational processes
What we offer
What we offer
  • Great career opportunities around the world
  • Truly collaborative environment with supportive leadership
  • Cutting edge technologies (AI, Cloud, Cybersecurity...)
  • Talented and passionate team members
  • Fun working environment
  • Fulltime
Read More
Arrow Right

Senior Software Engineer - Sre

Hybrid: This role is categorized as hybrid and is expected to report to Austin ...
Location
Location
United States , Austin; Warren
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science or a related field, or equivalent work experience
  • 7-10 years software experience with strong proficiency in PostgreSQL and at least one other (Oracle, SQL Server) database technologies
  • Proficiency in at least one programming language (e.g., Python, Go, Java) and familiarity with multiple language ecosystems
  • Solid understanding of operating systems, networking, distributed systems, databases, and storage architectures
  • Deep understanding of how code runs on underlying hardware, including operating systems, algorithms, and data structures
  • Ability to optimize or troubleshoot code by understanding its execution and the impact on system resources
  • Experience handling production incidents, including root cause analysis, mitigation, and working through complex system failures
  • Strong communication skills, with an ability to explain technical concepts to both engineering and business stakeholders
  • Commitment to collaborative problem-solving and shared ownership of services
  • Proven experience in automating manual processes, building deployment pipelines, or managing configuration systems
Job Responsibility
Job Responsibility
  • Develop tools and software to automate operational processes, improve system reliability, and reduce manual intervention
  • Lead, Implement and improve monitoring and observability frameworks, enabling proactive detection and resolution of incidents
  • Participate in an on-call rotation to diagnose, troubleshoot, and mitigate production incidents, ensuring minimal downtime and swift resolution
  • Work alongside developers to ensure the quality, scalability, and reliability of our database services
  • Practice shared ownership of services in production, fostering a "You build it, you run it" culture
  • Manage Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) to manage reliability expectations effectively
  • Conduct deep-dive analyses of incidents and collaborate on post-incident reviews to derive learnings and prevent recurrence
  • Champion a culture of continuous improvement
  • Evaluate system performance and advocate for optimizations that reduce infrastructure costs while maintaining service reliability
  • Fulltime
Read More
Arrow Right

DevOps Engineer / SRE

We're Fundraise Up - a global fundraising platform built to make donating to non...
Location
Location
Spain
Salary
Salary:
4900.00 - 6200.00 EUR / Month
fundraiseup.com Logo
Fundraise Up
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years as a DevOps Engineer / SRE (or very close responsibilities)
  • Real, hands-on experience with servers (VMs, bare metal) at the OS level and below: configuring, troubleshooting, digging into 'why it's broken'
  • Confident Linux skills (we use Ubuntu). We expect you to be comfortable with the core tools from Linux Crisis Tools
  • Solid understanding of networking basics
  • ability to configure and troubleshoot iptables
  • Ansible + Git
  • Experience with Bash or Python scripting for automation/observability
  • Production/on‑call experience: diagnosing incidents, restoring service, participating in post‑mortems
  • Ownership and attention to detail. Downtime is expensive: five years ago, 10 minutes of downtime cost us $100k — today it's even more
Job Responsibility
Job Responsibility
  • Work primarily with on‑premise infrastructure (bare metal and VMs): setup, maintenance, troubleshooting
  • Drive clarity in ambiguous situations by defining requirements, assumptions, and next steps
  • Own automation projects end‑to‑end (design → rollout → maintenance)
  • Improve how we operate: harden and tune systems and also improve the way the team works in terms of operational hygiene
  • Keep the platform stable, fast, and secure: servers, web servers, databases, queues
  • Investigate production incidents across OS / networking / infrastructure layers, apply temporary mitigations, coordinate with developers and participate in post‑mortems
  • Participate in on‑call rotations
  • Use AI in all aspects of day‑to‑day work: researching, troubleshooting, developing
What we offer
What we offer
  • Private medical insurance for the employee and their family
  • 23 paid vacation days per year
  • 11 paid public holidays per year
  • 5 company-paid sick leave days
  • English learning courses
  • Relevant professional education
  • Gym or swimming pool
  • Home Office Setup Assistance: the company offers assistance with purchasing furniture (office chair, office desk, monitor) and other items to create a comfortable workspace
  • Co-working
  • Remote working
  • Fulltime
Read More
Arrow Right

DevOps Engineer / SRE

You will join a small team responsible for the stability, performance, and secur...
Location
Location
Armenia
Salary
Salary:
Not provided
fundraiseup.com Logo
Fundraise Up
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years as a DevOps Engineer / SRE (or very close responsibilities)
  • Real, hands-on experience with servers (VMs, bare metal) at the OS level and below: configuring, troubleshooting, digging into "why it’s broken"
  • Confident Linux skills (we use Ubuntu). We expect you to be comfortable with the core tools from Linux Crisis Tools
  • Solid understanding of networking basics
  • ability to configure and troubleshoot iptables
  • Ansible + Git
  • Experience with Bash or Python scripting for automation/observability
  • Production/on‑call experience: diagnosing incidents, restoring service, participating in post‑mortems
  • Ownership and attention to detail. Downtime is expensive: five years ago, 10 minutes of downtime cost us $100k — today it’s even more
Job Responsibility
Job Responsibility
  • Work primarily with on‑premise infrastructure (bare metal and VMs): setup, maintenance, troubleshooting
  • Drive clarity in ambiguous situations by defining requirements, assumptions, and next steps
  • Own automation projects end‑to‑end (design → rollout → maintenance)
  • Improve how we operate: harden and tune systems and also improve the way the team works in terms of operational hygiene
  • Keep the platform stable, fast, and secure: servers, web servers, databases, queues
  • Investigate production incidents across OS / networking / infrastructure layers, apply temporary mitigations, coordinate with developers and participate in post‑mortems
  • Participate in on‑call rotations
  • Use AI in all aspects of day‑to‑day work: researching, troubleshooting, developing
What we offer
What we offer
  • 31 days off
  • 100% paid telemedicine plan
  • Home Office Setup Assistance: the company offers assistance with purchasing furniture (office chair, office desk, monitor) and other items to create a comfortable workspace
  • English learning courses
  • Relevant professional education
  • Gym or swimming pool
  • Co-working
  • Remote working
  • Fulltime
Read More
Arrow Right