CrawlJobs Logo

Principal DevOps Engineer

United States, Santa Clara Employment contract 147000.00 - 237500.00 USD / Year · Job Posted June 29, 2026
Apply Position
Job Link Share

Job Description

The Cortex team builds and delivers the industry’s most advanced SecOps platform, consisting of XDR, XSIAM, XSOAR, and XPANSE. As a Principal DevOps Engineer for the Cortex platform, you will serve as a technical pillar and visionary, architecting, scaling, and optimizing a massive-scale, multi-region GCP environment. In this high-impact role, you will leverage a deep, comprehensive DevOps and Site Reliability Engineering skillset to define the future of our cloud infrastructure. You will not just operate systems; you will set engineering standards, pioneer infrastructure-as-code paradigms, and drive organizational alignment across product engineering and specialized CI/CD groups. Your mission is to champion absolute system resilience, eliminate engineering friction at scale, and guarantee that our next-generation SecOps platform remains secure, cost-optimized, and highly available.

Job Responsibility

  • Architectural Leadership & IaC Strategy: Design and govern the global, multi-region cloud infrastructure strategy using Infrastructure as Code (IaC) principles (Terraform). Establish blueprints and modular architectures that scale seamlessly across the enterprise
  • Strategic CI/CD & Platform Engineering: Partner with engineering leadership and specialized pipeline teams to architect robust, secure, and self-healing continuous delivery workflows, radically reducing time-to-market for Cortex features
  • Cloud Economics & Optimization: Drive the long-term optimization strategy for our global GCP footprint. Architect for maximum performance, multi-zone reliability, and sophisticated cost-efficiency/FinOps models
  • Consultative Engineering & Influence: Serve as a trusted advisor to Product Engineering teams. Influence the core application architecture early in the lifecycle to ensure services are natively containerized, horizontally scalable, and highly observable
  • Systemic Reliability & Incident Governance: Act as a critical escalation point for complex, systemic outages. Lead post-mortems, identify systemic vulnerabilities, and design architectural mitigations to prevent recurring incidents across the entire platform
  • Advanced Tooling & R&D: Spearhead the creation of internal platforms, automated remediation frameworks, and intelligent auto-scaling solutions to eliminate manual operational toil
  • Technology Evangelism: Continually evaluate emerging technologies, paradigms, and open-source tools. Define the technical roadmap for the DevOps organization and mentor senior engineers across the team

Requirements

  • 10+ years of experience in DevOps, Site Reliability Engineering, or Cloud Architecture, with a proven track record of owning large-scale, business-critical production environments
  • Cloud Infrastructure: Deep, authoritative expertise in Google Cloud Platform (GCP) or Amazon Web Services (AWS), including complex networking, IAM governance, and multi-region architectures
  • Container Orchestration: Expert-level mastery of Kubernetes (GKE/EKS) and the broader cloud-native ecosystem (Service Meshes, Ingress controllers, advanced scheduling)
  • Automation & Software Engineering: High proficiency in Python or Go, advanced Linux internals, and robust shell scripting
  • Expert-level experience designing maintainable, dry, and scalable Terraform modules
  • Enterprise CI/CD: Deep architectural understanding of enterprise-scale software delivery pipelines (e.g., GitLab CI, GitHub Actions, Jenkins) and GitOps methodologies (e.g., ArgoCD)
  • Observability & Telemetry: Proven experience designing comprehensive, platform-wide observability strategies using tools like Prometheus, Grafana, OpenTelemetry, and PagerDuty

Nice to have

  • Technical Leadership: Demonstrated ability to lead cross-functional initiatives, influence engineering directors, and align technical roadmaps across distributed global teams
  • Complex Problem Solving: Exceptional capacity for troubleshooting deeply complex, distributed systems, networking bottlenecks, and cloud infrastructure anomalies
  • Autonomy & Decision Making: A proven track record of operating with absolute autonomy, making high-stakes architectural decisions, and owning the long-term outcomes

What we offer

  • restricted stock units
  • bonus

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Principal DevOps Engineer

8 matching positions

Principal DevOps Engineer

Riverstone Enterprise Solutions, an Envision Innovative Solutions Company, deliv...
Location
Location
United States , Annapolis Junction
Salary
Salary:
200000.00 - 220000.00 USD / Year
rivsol.com Logo
Riverstone Enterprise Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree or higher is required in either Engineering (i.e. Computer, Electrical, Mechanical, Aerospace, etc.) or Computer Science with a minimum of ten (10) years of related experience. Five (5) years of additional DevOps experience may be substituted for a bachelor's degree
  • Must be fluent with Git
  • Strong knowledge of Linux and Linux environments (RHEL 617/8, RHCSNRHCE CentOS)
  • Experience with Windows system administration, system monitoring, instrumentation, resiliency and performance
  • Experience integrating Jenkins/Bamboo Docker, and Kubernetes for automated deployment preferred
  • Experience with caching technologies (Memcache, Active MQ, Redis, APC, etc.)
  • Experience with MySQL (Clusters, Replication, and Tuning) and Elasticsearch (Kibana a plus)
  • Knowledge of security practices, networking protocols, firewalls, PCI compliance etc.
  • Experience managing/monitoring AWS cloud and virtualized servers for optimal performance while working in a Platform as a Service (PaaS) environment
  • Familiarity with software development life cycle models, agile, and DevOps programming methodologies
Job Responsibility
Job Responsibility
  • Support the development life cycle of platform architectural design, deployment and debugging
  • Develop & maintain sound version control best practices-based CM systems (GIT), including branching and merging strategies
  • Serve as a technical lead for an Agile team and actively participate in all Agile ceremonies
  • Participate in all team ceremonies including planning, grooming, product demonstration and team retrospectives
  • Ability to automate release deployments across development, test, staging, Quality Assurance and production stacks using a combination of scripting languages and other automation toolkits
  • Set-up up new sites and applications via configuration management such as Puppet and Ansible
  • Maintain / upgrade/ patch tracking and documentation software (Confluence / Jira)
  • Create, Assist, and Implement design and maintenance web service infrastructure and deployment
  • Leverage programming Languages such as Python, Ruby, Perl, and Java
  • Proficient with DevOps or Site Reliability Engineering methodologies
  • Fulltime
Read More
Arrow Right

Principal DevOps Engineer

We are looking for Principal Engineer to join our Cloud-NGFW engineering team. Y...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience
  • 10+ years of professional experience in DevOps, SRE, or Infrastructure Engineering, with a Bachelor's degree
  • Strong proficiency in Linux/Unix systems administration, internals, networking, and troubleshooting
  • Expertise in at least one programming/scripting language (e.g., Python, Go, Bash)
  • Hands-on experience with at least one major cloud platform (AWS, Azure) and its core services
  • Proven experience with containerization and orchestration technologies
  • Demonstrable experience building and managing CI/CD pipelines (e.g., GitLab Actions, Jenkins)
  • Strong hands-on experience with infrastructure as code (e.g., Terraform, Ansible)
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable, highly available, and secure infrastructure to support our global security services
  • Spearhead the transition to autonomous operational processes by developing and implementing Infrastructure as Code (IaC) practices
  • Define and govern SLIs/SLOs/SLAs to ensure rigorous service standards and lead the 'Error Budget' conversation to balance feature velocity with system stability
  • Build and optimize CI/CD pipelines that empower developers to ship code multiple times a day with high confidence and reliability
  • Implement and enhance monitoring, alerting, and observability solutions to improve system visibility and proactively reduce Mean Time to Resolution (MTTR)
  • Drive a culture of continuous improvement through blameless postmortems and root cause analysis, and collaborate with cross-functional teams to implement preventative measures
  • Automate repetitive operational tasks using scripting and configuration management tools to improve efficiency and reduce manual error
  • Fulltime
Read More
Arrow Right

Principal DevOps Engineer

Riverstone Enterprise Solutions, a PD Systems company, delivers mission-focused ...
Location
Location
United States , Annapolis Junction
Salary
Salary:
Not provided
rivsol.com Logo
Riverstone Enterprise Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or higher is required in either Engineering (i. e. Computer, Electrical, Mechanical, Aerospace, etc.) or Computer Science with a minimum of ten (10) years of related experience
  • Must be fluent with Git
  • Strong knowledge of Linux and Linux environments (RHEL 617/8, RHCSNRHCE CentOS)
  • Experience with Windows system administration, system monitoring, instrumentation, resiliency and performance
  • Experience integrating Jenkins/Bamboo Docker, and Kubernetes for automated deployment preferred
  • Experience with caching technologies (Memcache, Active MQ, Redis, APC, etc.)
  • Experience with MySQL (Clusters, Replication, and Tuning) and Elasticsearch (Kibana a plus)
  • Knowledge of security practices, networking protocols, firewalls, PCI compliance etc.
  • Experience managing/monitoring AWS cloud and virtualized servers for optimal performance while working in a Platform as a Service (PaaS) environment
  • Familiarity with software development life cycle models, agile, and DevOps programming methodologies
Job Responsibility
Job Responsibility
  • Support the development life cycle of platform architectural design, deployment and debugging
  • Develop & maintain sound version control best practices-based CM systems (GIT), including branching and merging strategies
  • Serve as a technical lead for an Agile team and actively participate in all Agile ceremonies
  • Ability to automate release deployments across development, test, staging, Quality Assurance and production stacks using a combination of scripting languages and other automation toolkits
  • Set-up new sites and applications via configuration management such as Puppet and Ansible
  • Maintain / upgrade/ patch tracking and documentation software (Confluence / Jira)
  • Create, Assist, and Implement design and maintenance web service infrastructure and deployments
  • Analyze service stack and make recommendations for further improvements
  • Identify processes and capabilities that can be streamlined and automated
  • Communicate effectively to help bridge stakeholders and development requirements
  • Fulltime
Read More
Arrow Right

Principal DevOps Engineer

We are seeking a highly experienced and visionary Principal DevOps Engineer to l...
Location
Location
United States , Santa Clara
Salary
Salary:
151600.00 - 245300.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of proven experience in DevOps, Site Reliability Engineering (SRE), or Systems Engineering, with a track record of operating at a senior or principal level
  • Deep expertise in advanced Git operations (complex merges, interactive rebasing, history management) and enterprise repository administration using GitLab or Bitbucket
  • Deep expertise in deploying and managing microservices using OpenShift or Kubernetes in production environments
  • Extensive hands-on experience building complex, automated pipelines using Jenkins
  • Advanced Linux administration skills (Red Hat, Ubuntu, CentOS, etc.)
  • Strong network troubleshooting skills, with a deep understanding of TCP/IP, DNS, HTTP/S, routing, and network security protocols
  • Hands-on experience integrating static code analysis and security testing into automated workflows, with specific expertise in Coverity (or similar enterprise SAST tools)
  • Strong proficiency in Python for scripting, automation, and API integrations
  • Exceptional analytical skills with the ability to troubleshoot complex, inter-dependent platform issues during critical incidents
Job Responsibility
Job Responsibility
  • Architect & Scale Infrastructure: Design, deploy, and maintain highly available and resilient containerized infrastructure utilizing Kubernetes or Red Hat OpenShift
  • Version Control Leadership: Establish and enforce advanced Git workflows, optimal branching strategies (e.g., GitFlow, trunk-based development), and governance policies across enterprise GitLab or Bitbucket environments
  • CI/CD Leadership: Own and continuously improve enterprise-scale CI/CD pipelines using Jenkins, integrating seamlessly with our source control repositories to ensure fast, reliable, and automated software delivery
  • DevSecOps Integration: Champion secure coding practices by integrating and managing static analysis (SAST) tools, specifically Coverity, directly into the build pipelines to catch vulnerabilities early
  • Systems Engineering & Administration: Serve as the subject matter expert for Linux administration, performance tuning, and capacity planning across all environments
  • Network Operations: Lead advanced network troubleshooting, managing configurations, firewalls, load balancing, and resolving complex connectivity issues across distributed systems
  • Automation & Tooling: Write robust, scalable automation scripts and internal tooling using Python to eliminate manual toil and optimize system performance
  • Mentorship & Strategy: Act as a technical mentor to the broader engineering team, establish best practices for DevOps, and guide the architectural direction of our platform
  • Fulltime
Read More
Arrow Right

Principal DevOps Engineer

The Principal DevOps Engineer owns the clarity, reliability, security, and repea...
Location
Location
United States , Columbus
Salary
Salary:
Not provided
revelit.com Logo
Revel IT
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-10 years of hands-on experience in DevOps, infrastructure, or platform engineering supporting production systems
  • Advanced programming experience (Python, Go, Ruby, etc.)
  • Proficiency with Linux/Unix administration, scripting, and programming (bash, Python, Ruby, etc.)
  • Deep hands-on expertise with core DevOps technologies such as Docker, Terraform, Ansible, and CloudFormation
  • Strong experience building and improving CI/CD workflows for provisioning, deployment, and scaling
  • Hands-on experience managing application-level networking, VPN configurations, load balancers, and connectivity required for secure, distributed environments
  • Experience implementing test automation and use of AI-assisted tooling to improve deployment quality, reliability, and operational efficiency
  • Strong troubleshooting and monitoring skills for Linux operating systems
  • Hands-on experience implementing monitoring and log aggregation platforms (ELK, Graylog, Graphite, Prometheus, etc.)
  • Experience deploying and managing web/ application servers, load balancers, queues, and caches
Job Responsibility
Job Responsibility
  • Own and execute deployment processes end-to-end, ensuring they are secure, repeatable, transparent, and well documented with clear failure signals and automated rollback strategies
  • Design, build, and maintain automated, scalable, secure, and cost-effective infrastructure across production, development, and test environments
  • Build, operate, and continuously improve CI/CD pipelines with clear failure signals, recovery paths, and rollback strategies
  • Own application-level networking and infrastructure concerns, including network configuration, access controls, and connectivity required to support development and production environments
  • Own all infrastructure and networking concerns, including the configuration and troubleshooting of site-to-site VPNs, firewall rules, and secure connectivity required for county-level integrations and remote access
  • Own day-to-day DevOps operations, including infrastructure health, monitoring, logging, patching, security posture, and maintenance, ensuring systems are observable and failures are diagnosable through strong metrics, logging, root-cause visibility, and effective incident response
  • Perform regular access analysis across all systems, managing secrets, credentials, and IAM roles to ensure strict adherence to security best practices
  • Proactively support compliance requirements (such as SOC 2) by maintaining auditable operational practices and generating technical evidence/reports for software and security audits
  • Enforce security posture through proactive patching, encryption, and vulnerability management across web servers, load balancers, and data stores
  • Partner with software engineers during deployments and operational work to build shared understanding and enable safe, independent troubleshooting
Read More
Arrow Right

Principal DevOps Engineer

We are looking for a Senior DevOps engineer to build and maintain the infrastruc...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
Plasma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of operating distributed systems at scale, ideally at companies where uptime was existential: big tech, HFT, exchanges, or blockchain infrastructure
  • system design depth: architected platforms for fault tolerance
  • fluent in Kubernetes, Terraform, and cloud infrastructure
  • led reliability transformations before, taking production systems from 'good enough' to 'financial-grade'
  • lead through influence, not authority
Job Responsibility
Job Responsibility
  • Own the reliability roadmap: lead the architectural evolution of our production systems toward higher availability, graceful degradation, and predictable failure recovery
  • Redesign for resilience: identify brittleness in existing infrastructure and drive systematic improvements, such as chaos engineering, redundancy patterns, and blast-radius reduction
  • Elevate observability: transform our monitoring from system health to risk awareness, ensuring we catch issues before they escalate to incidents
  • Strengthen operational rigour: improve our runbooks, incident response protocols, and post-mortem practices to compound reliability gains over time
  • Mentor and uplift the team: share hard-won lessons from operating at scale, raise the collective bar, and nurture a culture of ownership
  • Partner across engineering: work with protocol, client, and backend teams to bake reliability in at the design phase
What we offer
What we offer
  • Above market salary plus token compensation
  • Premium health insurance for you and your family fully covered by Plasma
  • Monthly wellness budget, whether for the gym, therapy, sauna & massage
  • A beautiful London HQ with gym access and daily food
  • All the tools and tech you need to operate at your best
  • Visa sponsorship and relocation support if you are joining the London office from abroad
  • Fulltime
Read More
Arrow Right

Principal DevOps Engineer (Prisma Browser Platform)

Location
Location
United States , Santa Clara
Salary
Salary:
147000.00 - 237500.00 USD / Year
paloaltonetworks.it Logo
Palo Alto Networks Italia
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years as DevOps engineer with a passion for technology, strong motivation and responsibility
  • Proficiency in DevOps and Platform Engineering with expertise in AWS, GCP, Terraform, ArgoCD, Kubernetes, and related tools
  • Experience in developing and maintaining CI/CD pipelines for continuous delivery in agile environments
  • Skilled in managing cloud infrastructure, particularly with AWS and GCP, and adept in infrastructure as code practices using Terraform/Terragrunt
  • Demonstrated capability in supporting high-scale SaaS applications, focusing on scalability, reliability, and performance
  • Strong communication, strategic thinking, and problem-solving skills
Job Responsibility
Job Responsibility
  • Implement and optimize CI/CD pipelines and cloud infrastructure using our technology stack, ensuring efficient and reliable deployment to production
  • Participate in the deployment of monitoring and alerting systems to maintain high system performance and reliability
  • Collaborate with software development and other cross-functional teams to streamline and enhance processes, aiming for efficiency and alignment with business goals
  • Contribute to the management of the cloud infrastructure, utilizing Infrastructure as Code principles
What we offer
What we offer
  • Restricted stock units
  • Bonus
  • Employee benefits
  • Fulltime
Read More
Arrow Right

Principal DevOps Engineer (Prisma Browser Platform)

Location
Location
United States , Santa Clara
Salary
Salary:
147000.00 - 237500.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years as DevOps engineer with a passion for technology, strong motivation and responsibility
  • Proficiency in DevOps and Platform Engineering with expertise in AWS, GCP, Terraform, ArgoCD, Kubernetes, and related tools
  • Experience in developing and maintaining CI/CD pipelines for continuous delivery in agile environments
  • Skilled in managing cloud infrastructure, particularly with AWS and GCP, and adept in infrastructure as code practices using Terraform/Terragrunt
  • Demonstrated capability in supporting high-scale SaaS applications, focusing on scalability, reliability, and performance
  • Strong communication, strategic thinking, and problem-solving skills
Job Responsibility
Job Responsibility
  • Implement and optimize CI/CD pipelines and cloud infrastructure using our technology stack, ensuring efficient and reliable deployment to production
  • Participate in the deployment of monitoring and alerting systems to maintain high system performance and reliability
  • Collaborate with software development and other cross-functional teams to streamline and enhance processes, aiming for efficiency and alignment with business goals
  • Contribute to the management of the cloud infrastructure, utilizing Infrastructure as Code principles
What we offer
What we offer
  • Restricted stock units
  • Bonus
  • Employee benefits
  • Fulltime
Read More
Arrow Right