This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
RUCKUS Networks is seeking an experienced Site Reliability Engineering (SRE) Manager to lead our NAM and APAC operations teams in transforming traditional operations into modern SRE practices. This high-impact leadership role will drive operational excellence, mentor engineering managers, and spearhead SRE transformation across our largest regional operations. As a key member of the engineering leadership team, you will shape people and processes, implement scalable reliability strategies, and ensure 24/7 operational stability for our infrastructure. You’ll manage engineering managers and their teams, fostering a culture of automation, reliability, and continuous improvement while collaborating with global operations counterparts.
Job Responsibility:
Lead and develop engineering managers and technical operations engineers across India and APAC time zones
Build a collaborative team culture that emphasizes knowledge sharing, automation, and operational excellence
Mentor engineering managers to strengthen leadership capabilities and technical expertise
Set clear performance expectations and provide ongoing coaching for growth
Partner cross-functionally with Product, Security, Development, and global operations teams
Own 24/7 operational stability for India/APAC, including incident response, escalation, and post-incident reviews
Drive comprehensive incident management: alert handling, outage response, and root cause analysis (RCA/CAR)
Transform traditional operations into modern SRE practices using SLOs, error budgets, and reliability engineering
Implement robust monitoring and alerting with APM tools, dashboards, and automation frameworks
Lead technical project delivery with clear timelines, resource planning, and stakeholder communication
Apply structured project management methodologies for visibility to senior leadership
Drive automation initiatives to reduce manual toil and improve reliability
Oversee change management processes with proper documentation and training
Manage infrastructure automation using Terraform, Kubernetes, and cloud platforms (GCP/AWS)
Develop operational roadmaps aligned with business and technical strategies
Provide executive-level reporting on operational metrics, project status, and team performance
Collaborate with US operations teams to ensure seamless global coverage
Build high-performing technical teams through effective hiring, development, and retention strategies
Requirements:
12+ years in Site Reliability Engineering (SRE), with 6+ years leading SRE, DevOps, or infrastructure teams
Proven experience mentoring engineering managers and developing leadership talent
Track record of transforming traditional operations or NOC teams into modern SRE organizations
Strong project management skills with Agile/Kanban experience and JIRA proficiency
Excellent communication skills, including executive-level presentations
Deep SRE expertise: incident management, on-call systems, monitoring, and reliability engineering
Infrastructure automation experience with Terraform, Kubernetes, Docker, and CI/CD pipelines
Cloud platform proficiency (GCP/AWS), including networking, security, and cost optimization
Monitoring and observability experience with Prometheus, Grafana, APM tools, and log aggregation
24/7 operations experience with global team coordination and escalation management
Change management and compliance experience (SOC2, security reviews, audits)
Strong foundation in Linux systems, networking protocols, and security practices
What we offer:
medical, dental, and vision plans
life and accidental death insurance
a 401(k) plan
participation in the Company’s Incentive Plan
eleven paid holidays in a full calendar year
two weeks of paid vacation (prorated based on start date)