Devops Sre Engineer Job at Acuver Consulting (Bengaluru)

Job Description

We are looking for a mid-senior SRE/DevOps Engineer (5–8 years) to build and scale a cloud-native, event-driven platform powering high-throughput logistics and fulfillment systems. This role will be responsible for establishing infrastructure foundations, CI/CD pipelines, observability, and system reliability, while working closely with backend, data, and architecture teams to ensure production stability and scalability.

Job Responsibility

Design and implement robust CI/CD pipelines (GitLab CI, Jenkins, or similar)
Enable automated build, test, and deployment workflows
Implement blue-green / canary deployments for zero-downtime releases
Ensure release traceability, rollback mechanisms, and deployment governance
Design, provision, and manage infrastructure on AWS (primary) and/or GCP
Build infrastructure using Infrastructure as Code (Terraform preferred)
Create reusable modules for scalable, secure, and standardized environments
Optimize cost, performance, and scalability of cloud resources
Deploy and manage applications using Docker & Kubernetes
Manage Kubernetes workloads using Helm charts
Implement auto-scaling, resource optimization, and high availability patterns
Ensure platform readiness for high-throughput microservices
Define and implement SLIs, SLOs, and SLAs
Drive improvements in system reliability, uptime, and performance
Lead incident response, debugging, RCA (root cause analysis), and postmortems
Build resilient systems with self-healing and fault-tolerant mechanisms
Implement end-to-end observability across services: Metrics (Prometheus / Cloud Monitoring)
Logs (ELK / Kibana / Cloud Logging)
Tracing (OpenTelemetry / Jaeger)
Build actionable alerting systems to reduce noise and improve response time
Enable faster production debugging and performance analysis
Support and scale event-driven architectures (Kafka, Pub/Sub, SQS/SNS or similar)
Ensure reliability of asynchronous workflows and message processing systems
Work closely with backend teams to: Improve service resilience and fault handling
Optimize event processing and throughput
Support distributed microservices architecture
Work with PostgreSQL (RDS) for: Performance tuning
High availability and failover setups
Backup and recovery strategies
Collaborate with data teams supporting Snowflake / data pipelines (nice to have)
Drive production stabilization efforts for high-growth systems
Identify and resolve bottlenecks in performance and scalability
Improve MTTR (Mean Time to Recovery) and incident response efficiency
Enable platform readiness for scale and high transaction volumes
Implement secure DevOps practices
Manage IAM roles, secrets, and access controls
Ensure adherence to cloud security best practices

Requirements

5–8 years of experience in DevOps / SRE roles
Strong hands-on experience with AWS (preferred) and/or GCP
Expertise in: Kubernetes & Docker
Terraform (Infrastructure as Code)
CI/CD tools (GitLab, Jenkins, or similar)
Experience with: Event-driven / asynchronous architectures (Kafka, Pub/Sub, etc.)
Monitoring & logging tools (Prometheus, Grafana, ELK, etc.)
Microservices and distributed systems
Solid understanding of: Networking, load balancing, scaling strategies
High availability and fault-tolerant systems

Nice to have

Experience with service mesh (Istio / Linkerd)
Working knowledge of PostgreSQL / AWS RDS operations
Exposure to Snowflake or data platforms
Experience in logistics / supply chain domain
Familiarity with cost optimization and cloud governance

Acuver Consulting - All Job Offers

Select Country

Devops Sre Engineer

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?

Devops Sre Engineer

SRE / DevOps Engineer

DevOps Engineer / SRE

DevOps Engineer / SRE

DevOps Engineer / SRE

DevOps Engineer / Sre

DevOps Engineer / Sre

Sre Devops Engineer

DevOps Engineer / SRE

Our AI answers in your language