This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Palo Alto Networks CDSS group is looking for a seasoned platformization and cloud automation engineer to design, develop and deliver next-generation technologies within our CASB teams. We are looking for leaders who take ownership of their area of focus, and who are driven to solve challenging technical problems using best practices, and state of the art technologies. Collaboration and teamwork are at the foundation of our culture, and we need engineers who can communicate at a high level, and work well with others towards achieving a common goal. If you have the passion to solve challenging cloud infrastructure and DevOps engineering problems, if you are interested in pushing your boundaries as an engineer, and working at the cusp of delivering Data Security at huge scale, and state of the art technologies within a quality focussed dynamic engineering culture, talk to us.
Job Responsibility:
Work with development teams to ensure that applications have scalability and reliability built-in from day one
Design, review and enhance software architecture to improve scalability, service reliability, cost, and performance
Drive platformization by building standardized, self-service infrastructure platforms that improve developer productivity, scalability, and operational efficiency
Deploy automation for provisioning and operating infrastructure at large scale
Partner with teams to improve CI/CD processes and technology
Mentor members of the staff on large scale cloud deployments
Drive the adoption of observability practices and a data-driven mindset
Setup processes like on-call rotations, Postmortems, Run books to continue supporting the infrastructure owned by the SRE team while finding ways to reduce the time to resolution and improve the reliability of services
Support, optimize and deploy mission critical, front-end and back-end production
Improving site performance, monitoring, and overall stability of our infrastructure
Requirements:
Bachelors/Masters degree in Computer Science or a related field
5+ years of industry experience in engineering
Fluent scripting skills (preferably Python or Bash) with deep experience in Unix/Linux systems from kernel to shell and beyond
4+ years of working with Microservices architectures on Kubernetes
HandsOn experience with container native tools like Docker, Helm for managing workloads running in Kubernetes
Experience managing AWS and GCP at scale, with knowledge of cloud-neutral connectivity between platforms
Experience designing and maintaining API specifications using Swagger/OpenAPI, and working with API frameworks such as Apigee to enable secure, scalable integrations
HandsOn experience with infrastructure-as-code and automation tools such as Terraform, Ansible, etc.
Proficient in CI/CD platforms like GitlabCI, Jenkins, ArgoCD, CircleCI etc.
In-depth knowledge of operating systems (processes, threads, concurrency, etc)
Implement and enforce network and cloud security best practices, serving as a security champion within the team
Drive observability initiatives by implementing distributed tracing, standardized logging, dashboards, and profiling, leveraging tools such as Prometheus, Grafana, and OpenTelemetry to ensure SLA/SLO compliance
Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
Experience with setting up and troubleshooting Nginx or Ingress Nginx
Hands-on production experience with Kafka, MongoDB, Amazon Redshift, BigQuery
Expertise in tuning, performance, and guiding long-term strategy for self-hosted or managed solutions a strong plus
The exceptional communicator in and across teams, taking the lead
Evaluate and integrate AI-powered tools to streamline workflows and boost productivity