CrawlJobs Logo

SRE Engineer

jfrog.com Logo

JFrog

Location Icon

Location:
Israel , Netanya/Tel Aviv

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

At JFrog, we’re reinventing DevOps to help the world’s greatest companies innovate -- and we want you along for the ride. This is a special place with a unique combination of brilliance, spirit and just all-around great people. Here, if you’re willing to do more, your career can take off. And since software plays a central role in everyone’s lives, you’ll be part of an important mission. Thousands of customers, including the majority of the Fortune 100, trust JFrog to manage, accelerate, and secure their software delivery from code to production -- a concept we call “liquid software.” Wouldn't it be amazing if you could join us on our journey? We are looking for a Site Reliability Engineer to join our SaaS Production team and help us ensure high availability, performance, and reliability across our global cloud environments.

Job Responsibility:

  • Support the operation and reliability of JFrog’s large-scale, multi-cloud, Kubernetes-based SaaS environments
  • Troubleshoot complex production issues across distributed systems and work closely with Engineering and Cloud teams to resolve them
  • Contribute to improving system reliability, performance, scalability, and observability
  • Apply SRE best practices, including incident response, service monitoring, capacity considerations, and continuous reliability improvements
  • Participate in on-call rotations and take part in incident investigations and postmortems
  • Build and enhance automation tools (primarily in Python or Go) to reduce operational toil and improve efficiency
  • Assist in improving CI/CD workflows and deployment safety
  • Design and develop AI-based tools and automation to improve operational efficiency and productivity for JFrog’s internal engineering and SaaS teams
  • Support resilience initiatives, including disaster recovery validation and service readiness improvements
  • Continuously learn and explore new technologies that improve operational excellence

Requirements:

  • 1-3 years of experience in SRE, DevOps, Production Engineering, or a similar role in a production environment
  • Hands-on experience operating Kubernetes-based containerized workloads in production
  • Experience with at least one public cloud provider (AWS, GCP, or Azure)
  • Strong troubleshooting and analytical skills with the ability to debug production issues methodically

Additional Information:

Job Posted:
March 20, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for SRE Engineer

Cloud Engineer II - SRE

Cloud Engineer II - SRE role at Hewlett Packard Enterprise, part of the 24X7 ope...
Location
Location
India
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, engineering, information systems, or closely related quantitative discipline
  • Master's desirable
  • Typically 3-5 years' experience
  • Strong Experience in Ubuntu & K8s platforms
  • Experience in programming skills in Scripting / Python / Golang/ Ansible/ Terraform
  • Strong experience in DevOps practices like continuous integration/continuous deployment (CI/CD)
  • Knowledge on Git Ops model
  • Working experience in cloud platforms, especially AWS
  • Ability to quickly learn new skills and technologies
  • Strong system debugging skills
Job Responsibility
Job Responsibility
  • Part of the 24X7 operations group working in shifts managing an application or multiple applications
  • Monitor & remediate alerts and maintain uptime
  • Develops and maintains automated systems to improve operational efficiency and ensure compliance with security policies
  • Executes automation and debugs issues as required
  • Leverage CI/CD & Git Ops for managing the application platform
  • Patching security vulnerabilities
  • Manage public cloud infrastructure
  • Shares and reviews innovative technical ideas with peers
  • Analyses incidents / problems to develop and implement solutions to complex application problems
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

Intermediate Software Engineer SRE – AI

At PointClickCare our mission is simple: to help providers deliver exceptional c...
Location
Location
Canada , Mississauga
Salary
Salary:
115000.00 - 128000.00 CAD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years' experience in software engineering
  • Experience with SRE principles
  • Experience with AI/ML in production environments
  • A passion for automation, intelligent systems, and operational excellence
  • Strong debugging, problem-solving, and system design skills
  • Languages: Python, Java, Bash, Terraform
  • Platforms: Azure, Kubernetes, Docker
  • Tools: Datadog, Prometheus, AppDynamics, ELK, GitHub Actions
  • ML/AI: MCP framework, AI agents, Vector store, Agent orchestration (LangChain), RAG
  • CI/CD: Jenkins, ArgoCD, Spinnaker
Job Responsibility
Job Responsibility
  • Build ML-based anomaly detection and pattern recognition systems
  • Enhance telemetry with smart tagging and metadata for better AI insights
  • Develop event-driven workflows and self-healing systems using AI triggers
  • Automate incident response with generative AI and custom AI agent orchestration
  • Use time-series forecasting and predictive modelling to anticipate failures
  • Optimise infrastructure with AI-powered autoscaling and cost-aware resource allocation
  • Build scalable, fault-tolerant systems in a cloud-native environment
  • Participate in on-call rotations and lead incident response for critical systems
  • Skilled in API integration for streamlined data exchange and system connectivity
  • Run internal AIOps workshops and help teams adopt AI maturity models
What we offer
What we offer
  • Benefits starting from Day 1
  • Retirement Plan Matching
  • Flexible Paid Time Off
  • Wellness Support Programs and Resources
  • Parental & Caregiver Leaves
  • Fertility & Adoption Support
  • Continuous Development Support Program
  • Employee Assistance Program
  • Allyship and Inclusion Communities
  • Employee Recognition … and more
  • Fulltime
Read More
Arrow Right

Site Reliability Engineering (SRE)

Fyld is a Portuguese consulting company specializing in IT services. We bring hi...
Location
Location
Portugal , Lisboa; Porto
Salary
Salary:
Not provided
https://www.fyld.pt Logo
Fyld
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree in Computer Science, Information Technology, Engineering, or a related
  • Previous experience working as an SRE or in a similar role within DevOps, system administration, or software engineering
  • Familiarity with industry-specific applications and regulatory requirements (e.g., HIPAA, GDPR)
  • Proficiency in system administration for Linux/Unix and Windows systems
  • Strong understanding of networking concepts, including TCP/IP, DNS, load balancing, and firewalls
  • Proficiency in programming languages such as Python, Go, Java, or C++
  • Strong skills in scripting languages like Bash, Perl, or Ruby
  • Experience with automation tools such as Ansible, Puppet, Chef, or Terraform
  • Knowledge of Infrastructure as Code (IaC) principles and practices
  • Experience with monitoring and logging tools such as Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), or Splunk
  • Fulltime
Read More
Arrow Right

Cloud Engineer SRE

Cloud Engineer SRE role at Hewlett Packard Enterprise, part of the Operations te...
Location
Location
India
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, engineering, information systems, or closely related quantitative discipline
  • Master's desirable
  • Typically 5-8 years' experience
  • Strong Experience in ubuntu & K8s platforms
  • Experience in programming skills in Scripting/Python/Golang/Ansible/Terraform
  • Expert experience in DevOps practices like continuous integration/continuous deployment (CI/CD)
  • Knowledge on Git Ops model
  • Strong working experience in cloud platforms, especially AWS
  • Ability to quickly learn new skills and technologies
  • Strong system debugging skills
Job Responsibility
Job Responsibility
  • Part of the 24X7 operations group working in shifts managing an application or multiple applications
  • Monitor & remediate alerts and maintain uptime
  • Develops and maintains automated systems to improve operational efficiency and ensure compliance with security policies
  • Executes automation and debugs issues as required
  • Leverage CI/CD & Git Ops for managing the application platform
  • Patching security vulnerabilities
  • Manage public cloud infrastructure using Automation
  • Shares and reviews innovative technical ideas with peers
  • Analyses incidents/problems to develop and implement solutions to complex application problems
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

VP - Cloud Security Reliability Engineer (SRE)

This role sits within the Cloud Security team which is responsible for Private a...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent work experience
  • 6+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills
  • Certification or formal training in site reliability engineering concepts and practices
  • Prior experience working towards SLIs, SLOs and observability capabilities at a large scale
  • 4+ years experience in Python (preferable) or Java, on large scale systems alongside Linux based scripting languages
  • Experience working on observability, logging and metrics toolsets: Prometheus, Grafana, Splunk, Elk
  • Experience of k8s and container technologies: Docker, Openshift and EKS
  • Experience with public cloud technologies: AWS, GCP or Azure
  • Experience with Secrets products: HashiCorp Vault or CyberArk
Job Responsibility
Job Responsibility
  • Working across Container products and Secrets products, across Public and Private Cloud, as well as Cloud native specific products
  • Architecting and building tools and platforms that provide capabilities for SRE
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organisation
  • Actively owning production level incidents till resolution
  • Fulltime
Read More
Arrow Right

VP - Cloud Security Reliability Engineer (SRE)

This role sits within the Cloud Security team which is responsible for Private a...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or equivalent work experience
  • 6+ years of relevant work experience
  • Highly motivated self-starter with excellent interpersonal and communication skills
  • Certification or formal training in site reliability engineering concepts and practices
  • Prior experience working towards SLIs, SLOs and observability capabilities at a large scale
  • 4+ years experience in Python (preferable) or Java, on large scale systems alongside Linux based scripting languages
  • Experience working on observability, logging and metrics toolsets: Prometheus, Grafana, Splunk, Elk
  • Experience of k8s and container technologies: Docker, Openshift and EKS
  • Experience with public cloud technologies: AWS, GCP or Azure
  • Experience with Secrets products: HashiCorp Vault or CyberArk
Job Responsibility
Job Responsibility
  • Working across Container products and Secrets products, across Public and Private Cloud, as well as Cloud native specific products
  • Architecting and building tools and platforms that provide capabilities for SRE
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organisation
  • Actively owning production level incidents till resolution
  • Fulltime
Read More
Arrow Right

DevOps and SRE Engineer

The DevOps and SRE Engineer will be responsible for building and maintaining hig...
Location
Location
Salary
Salary:
Not provided
aciinfotech.com Logo
ACI Infotech
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science or related field
  • 5+ years of experience in DevOps/SRE roles supporting high-availability SaaS
  • Proven expertise in Kubernetes administration (EKS, GKE, or AKS)
  • Strong experience with Terraform, Helm, and GitOps pipelines
  • Skilled in CI/CD pipeline design and maintenance
  • Knowledge of monitoring, alerting, and logging (Prometheus, Grafana, OpenTelemetry)
  • Strong fundamentals in cloud networking and security
  • Calm, methodical, and automation-first mindset
Job Responsibility
Job Responsibility
  • Design and maintain CI/CD pipelines with progressive delivery
  • Operate and scale EKS, GKE, or AKS clusters with strong multi-tenancy
  • Instrument systems using Prometheus, Grafana, and OpenTelemetry
  • Run incident response, postmortems, and capacity planning
  • Harden networking, IAM, and secret management
  • Deliver automated, repeatable environments using GitOps and IaC
  • Ensure clear SLOs with meaningful alerting and manage error budgets
  • Drive cloud cost efficiency per transaction while maintaining reliability
  • Fulltime
Read More
Arrow Right

Devops engineer / sre

Your role is to join our IAM (Identity and Access Management) team, which is par...
Location
Location
Czech Republic , Praha
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
March 30, 2026
Flip Icon
Requirements
Requirements
  • 2-4 years of relevant experience with a solid technical background
  • familiarity with the DevOps technology stack and a basic understanding of cloud-based IT infrastructure
  • fluent in English
  • experience with Elasticsearch for central logging solutions
  • working knowledge of container technologies, especially Kubernetes
  • good understanding of AWS services, including EKS, VPC, IAM, CloudWatch, and S3
Job Responsibility
Job Responsibility
  • support the team by building and maintaining infrastructure components in our cloud solution for internal applications
  • work with a team that manages Microservices and Open-Source Applications for core IAM solutions in containerized environments
  • research and implement innovative solutions to drive the business forward
What we offer
What we offer
  • 25 days of vacation + sick days
  • annual bonus
  • pension contribution
  • cafeteria budget
  • international environment, projects and modern tech stack
  • opportunity to grow and develop, trainings, educational courses and more
Read More
Arrow Right