Groupon is a marketplace where customers discover new experiences and services every day and local businesses thrive. To date, we have worked with over a million merchant partners worldwide, connecting over 16 million customers with deals across various categories. In a world often dominated by e-commerce giants, we stand out as one of the few platforms uniquely committed to helping local businesses succeed on a performance basis. Groupon is on a radical journey to transform our business with a relentless pursuit of results. Even with thousands of employees spread across multiple continents, we still maintain a culture that inspires innovation, rewards risk-taking and celebrates success. The impact here can be immediate due to our scale and the speed of our transformation. We're a "best of both worlds" kind of company. We're big enough to have the resources and scale, but small enough that a single person has a surprising amount of autonomy and can make a meaningful impact.
Job Responsibility:
Architect and maintain fault-tolerant systems, ensuring uptime SLAs of 99.9% or higher
drive automation in infrastructure management and deployment using Terraform, Ansible, Kubernetes, and similar tools
create and optimize CI/CD pipelines to ensure reliable, secure, and efficient software delivery
build and enhance comprehensive observability solutions, including monitoring, logging, and alerting systems using Prometheus, Grafana, and the ELK stack
collaborate with stakeholders to define and achieve SLIs, SLOs, and error budgets aligned with business needs
lead incident response during on-call rotations, ensuring rapid resolution and root cause analysis for critical issues
design and execute performance testing, capacity planning, and scalability strategies for evolving workloads
proactively identify and resolve bottlenecks, increasing system performance and developer efficiency
mentor junior engineers, fostering a collaborative and growth-oriented team environment
guide architectural decisions that drive innovation and enhance system reliability
Requirements:
10+ years in systems engineering
at least 5 years in SRE or DevOps roles
expertise in cloud platforms (GCP, AWS) and container orchestration (Kubernetes, Docker)
proficiency in programming and scripting languages like Python, Go, and Bash
advanced knowledge of Infrastructure as Code (IaC) tools such as Terraform and Ansible
deep understanding of networking, DNS, load balancing, and security principles
proven track record of managing high-availability systems in demanding environments
exceptional analytical and problem-solving skills
Nice to have:
Certifications in cloud or container technologies (e.g., AWS/GCP/Azure, Kubernetes CKA)
experience in industries like eCommerce, FinTech, or SaaS
familiarity with Agile development processes and frameworks
What we offer:
The opportunity to work with cutting-edge technologies in a transformative environment
a collaborative and innovative work environment that values your expertise and contributions
professional growth and leadership development pathways tailored to your aspirations
a chance to leave a lasting impact by shaping the future of reliable and scalable systems