This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for an SRE Full-Stack Engineer who is equally comfortable writing application code and improving platform reliability. This role focuses on building reliability into the system, not just operating it.
Job Responsibility:
Write production-grade code (Java / Node) with reliability in mind
Embed observability, metrics, and logging into services
Improve service performance, fault tolerance, and error handling
Build internal tools and dashboards for monitoring and diagnostics
Contribute to CI/CD improvements and release safety mechanisms
Implement health checks, readiness probes, and resilience patterns
Design and improve monitoring using metrics, logs, and traces
Build dashboards and alerts aligned with SLOs
Help reduce alert noise and false positives
Participate in on-call rotations
Troubleshoot production issues across application, database, and infrastructure layers
Contribute to root cause analysis and post-incident improvements
Requirements:
4–7 years of experience in backend or full-stack development
Strong hands-on experience with Java and/or Node.js
Experience building REST APIs and microservices
Working knowledge of Docker and Kubernetes
AWS fundamentals
PostgreSQL and MongoDB
Strong debugging and problem-solving skills
Nice to have:
React experience for internal tools or dashboards
Exposure to distributed tracing (OpenTelemetry, etc.)
Prior experience working with SRE or platform teams