CrawlJobs Logo

Principal DevOps Engineer

United States, Santa Clara 151600.00 - 245300.00 USD / Year · Job Posted April 16, 2026
Apply Position
Job Link Share

Job Description

We are seeking a highly experienced and visionary Principal DevOps Engineer to lead the design, implementation, and scaling of our core infrastructure and deployment pipelines. As a technical leader, you will be instrumental in bridging the gap between software development and IT operations, ensuring high availability, robust security, and seamless continuous integration and delivery (CI/CD). You will champion automation, embed security practices into our development lifecycle, and mentor junior engineers while driving our technical strategy forward.

Job Responsibility

  • Architect & Scale Infrastructure: Design, deploy, and maintain highly available and resilient containerized infrastructure utilizing Kubernetes or Red Hat OpenShift
  • Version Control Leadership: Establish and enforce advanced Git workflows, optimal branching strategies (e.g., GitFlow, trunk-based development), and governance policies across enterprise GitLab or Bitbucket environments
  • CI/CD Leadership: Own and continuously improve enterprise-scale CI/CD pipelines using Jenkins, integrating seamlessly with our source control repositories to ensure fast, reliable, and automated software delivery
  • DevSecOps Integration: Champion secure coding practices by integrating and managing static analysis (SAST) tools, specifically Coverity, directly into the build pipelines to catch vulnerabilities early
  • Systems Engineering & Administration: Serve as the subject matter expert for Linux administration, performance tuning, and capacity planning across all environments
  • Network Operations: Lead advanced network troubleshooting, managing configurations, firewalls, load balancing, and resolving complex connectivity issues across distributed systems
  • Automation & Tooling: Write robust, scalable automation scripts and internal tooling using Python to eliminate manual toil and optimize system performance
  • Mentorship & Strategy: Act as a technical mentor to the broader engineering team, establish best practices for DevOps, and guide the architectural direction of our platform

Requirements

  • 8+ years of proven experience in DevOps, Site Reliability Engineering (SRE), or Systems Engineering, with a track record of operating at a senior or principal level
  • Deep expertise in advanced Git operations (complex merges, interactive rebasing, history management) and enterprise repository administration using GitLab or Bitbucket
  • Deep expertise in deploying and managing microservices using OpenShift or Kubernetes in production environments
  • Extensive hands-on experience building complex, automated pipelines using Jenkins
  • Advanced Linux administration skills (Red Hat, Ubuntu, CentOS, etc.)
  • Strong network troubleshooting skills, with a deep understanding of TCP/IP, DNS, HTTP/S, routing, and network security protocols
  • Hands-on experience integrating static code analysis and security testing into automated workflows, with specific expertise in Coverity (or similar enterprise SAST tools)
  • Strong proficiency in Python for scripting, automation, and API integrations
  • Exceptional analytical skills with the ability to troubleshoot complex, inter-dependent platform issues during critical incidents

Nice to have

  • Experience migrating repositories or CI/CD pipelines between major platforms (e.g., Bitbucket to GitLab, Jenkins to GitLab CI)
  • Experience with Infrastructure as Code (IaC) tools such as Terraform or Ansible
  • Experience with AI-related skills, including Large Language Models (LLM), AI agents, and Claude code
  • Experience operating within major public cloud providers (AWS, GCP, or Azure)
  • Familiarity with modern observability and monitoring stacks (Prometheus, Grafana, Datadog, ELK)
  • Proven ability to integrate with existing toolchains to boost developer productivity, coupled with previous involvement in open-source projects

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Principal DevOps Engineer

8 matching positions

Principal Software Engineer

Principal Software Engineer (Golang | Distributed Systems) to join a high-growth...
Location
Location
United Kingdom , London
Salary
Salary:
170000.00 GBP / Year
weareorbis.com Logo
Orbis Consultants
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years’ backend engineering experience, ideally at Staff / Principal / Tech Lead level
  • Expert-level proficiency with Golang in production
  • Proven track record designing distributed systems and event-driven architectures (Kafka, RabbitMQ, WebSockets)
  • Deep understanding of PostgreSQL, Redis, and high-performance data systems
  • Strong DevOps mindset – CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry)
  • Exceptional communicator, able to influence architecture and direction across teams
Job Responsibility
Job Responsibility
  • Architect and scale high-throughput, event-driven systems built in Go
  • Lead the evolution of real-time APIs and data platforms handling billions of requests
  • Stay deeply hands-on with Golang while influencing design and long-term technical strategy
  • Drive improvements in observability, testing, and performance across all services
  • Mentor senior engineers and play a key role in shaping engineering culture
What we offer
What we offer
  • 25% bonus
  • excellent benefits
  • Fulltime
Read More
Arrow Right

Principal Engineer

The Principal AI/ML Operations Engineer leads the architecture, automation, and ...
Location
Location
United States , Pleasanton, California
Salary
Salary:
251000.00 - 314500.00 USD / Year
blackline.com Logo
BlackLine
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field
  • 10+ years in ML infrastructure, DevOps, and software system architecture
  • 4+ years in leading MLOps or AI Ops platforms
  • Strong programming skills in languages such as Python, Java, or Scala
  • Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
  • Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
  • Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
  • Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
  • Hands-on with observability stacks (Prometheus, Grafana, Newrelic) for model and agent performance tracking
  • Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
Job Responsibility
Job Responsibility
  • Define enterprise-level standards and reference architectures for ML-Ops and AIOps systems
  • Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
  • Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
  • Lead incident response and reliability strategies for ML/AI systems
  • Lead the deployment of AI models and systems in various environments
  • Collaborate with development teams to integrate AI solutions into existing workflows and applications
  • Ensure seamless integration with different platforms and technologies
  • Define and manage MCP Registry for agentic component onboarding, lifecycle versioning, and dependency governance
  • Build CI/CD pipelines automating LLM agent deployment, policy validation, and prompt evaluation of workflows
  • Develop and operationalize experimentation frameworks for agent evaluations, scenario regression, and performance analytics
What we offer
What we offer
  • short-term and long-term incentive programs
  • robust offering of benefit and wellness plans
  • Fulltime
Read More
Arrow Right

Senior Principal Solution Engineer

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location
Location
United States , All
Salary
Salary:
157500.00 - 361500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically, 10+ year's experience
  • Experience designing and developing software systems design tools and languages
  • Excellent analytical and problem solving skills
  • Experience in overall architecture of software systems for products and solutions
  • Designing and integrating software systems running on multiple platform types into overall architecture
  • Evaluating and selecting forms and processes for software systems testing and methodology, including writing and execution of test plans, debugging, and testing scripts and tools
  • History of innovation with multiple patents or deployed solutions in the field of software design
  • Excellent written and verbal communication skills
  • mastery in English and local language
Job Responsibility
Job Responsibility
  • Develops organization-wide architectures and methodologies for software systems design and development across multiple platforms and organizations within the Global Business Unit
  • Identifies and evaluates new technologies, innovations, and outsourced development partner relationships for alignment with technology roadmap and business value
  • creates plans for integration and update into architecture
  • Reviews and evaluates designs and project activities for compliance with development guidelines and standards
  • provides tangible feedback to improve product quality and mitigate failure risk
  • Leverages recognized domain expertise, business acumen, and experience to influence decisions of executive business leadership, outsourced development partners, and industry standards groups
  • Provides guidance and mentoring to less- experienced staff members to set an example of software systems design and development innovation and excellence
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Sr Staff/Principal Devops Engineer

Balbix is looking for a DevOps Sr Staff/Principal Engineer to join our growing t...
Location
Location
India , Delhi
Salary
Salary:
Not provided
balbix.com Logo
Balbix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science or a related field
  • 10+ years of experience in DevOps for Sr Staff or 12-15 years for Principal
  • 4+ years of experience setting up and managing infrastructure in AWS for a product development organization
  • Ability to independently architect, design, document, and implement complex platforms and complex DevOps systems
  • Solid understanding of AWS infrastructure and services such as load balancers (ALB/ELB), IAM, KMS, Networking, EC2, CloudWatch, CloudTrail, CloudFormation, Lambda, etc.
  • 4+ years of experience building infrastructure using Terraform
  • 3+ years of solid experience with Kubernetes and Helm
  • Expert-level programming experience with Python for scripting and automation
  • Excellent knowledge of working on configuration management systems such as Ansible
  • Hands-on experience with CI/CD code management and deployment technologies like GitLab, Jenkins, or similar
Job Responsibility
Job Responsibility
  • Lead the development of critical DevOps projects, set technical direction, and influence the organization's technical strategy
  • Solve complex problems, mentor senior engineers, and collaborate with cross-functional teams to deliver high-impact DevOps solutions
  • Design and develop IaC components for Balbix solutions and internal engineering tools running in AWS
  • Build and deploy a state-of-the-art security SaaS platform using the latest CI/CD techniques, ensuring it is fully automated, repeatable, and secure
  • Secure infrastructure using best practices (e.g., TLS, bastion hosts, certificate management, authentication and authorization, network segmentation)
  • Design and develop a scalable, cost-efficient deployment infrastructure on Kubernetes
  • Design and implement consistent observability systems for Balbix solutions
  • Participate in on-call rotation
  • Fulltime
Read More
Arrow Right

Principal Security Engineer

We’re seeking a Principal Security Engineer with deep expertise in cloud securit...
Location
Location
United States , San Francisco
Salary
Salary:
136000.00 - 241000.00 USD / Year
ethoslife.com Logo
Ethos
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in security engineering or architecture roles
  • Bachelor’s degree in Cybersecurity, Information Technology, Computer Science, or related field from a reputable institution
  • Deep expertise in cloud platforms (particularly AWS), including infrastructure-as-code (e.g., Terraform, CloudFormation)
  • Strong experience in secure software development and application security (e.g., OWASP Top 10, SAST, DAST, threat modeling)
  • Experience designing and implementing zero-trust architectures, secure API gateways, and identity/access controls
  • Proficient in scripting or development languages (e.g., Python, Go, JavaScript) and secure coding practices
  • Demonstrated leadership in cross-functional security initiatives and technical mentorship
  • Ability to come into our San Francisco, CA office once a week
Job Responsibility
Job Responsibility
  • Design and implement secure architectures for applications, APIs, microservices, and containerized workloads
  • Develop and enforce application security best practices across SDLC
  • partner with DevOps and engineering teams to integrate security into CI/CD pipelines
  • Conduct threat modeling, security design reviews, and risk assessments for new and existing systems
  • Evaluate and implement cloud security tools, controls, and frameworks (e.g., CSPM, CWPP, IAM, KMS, logging, and monitoring)
  • Provide technical leadership and mentorship to security engineers, software developers, and DevOps personnel
  • Lead response to complex security incidents or architectural flaws
  • conduct root cause analysis and recommend strategic remediations
  • Contribute to and influence security policies, standards, and governance
  • Stay current with emerging threats, vulnerabilities, and security technologies, advising stakeholders on evolving risks and mitigations
  • Fulltime
Read More
Arrow Right

Principal Software QA Engineer

Principal Software QA Engineer to lead test architecture and automation strategy...
Location
Location
Puerto Rico , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of hands-on QA experience
  • Designing and building test automation frameworks from scratch
  • Non-functional testing (scale, reliability, performance, security)
  • Strong coding skills in Python, Java, or Go
  • Experience with Pytest, TestNG, JUnit, Playwright or similar tools
  • Deep understanding of Cloud platforms (AWS, Azure, GCP)
  • Microservices, Containers (Docker, Kubernetes)
  • Infrastructure & Data Center management
  • Linux/VM environments, Storage, Compute, Networking
  • REST APIs, JSON, SQL/NoSQL
Job Responsibility
Job Responsibility
  • Design, automate, and execute system-level test cases focused on scale, reliability, security, and performance
  • Lead the test automation strategy
  • evaluate and integrate new tools to improve efficiency and coverage
  • Collaborate closely with product, development, support, and platform engineering teams to ensure full lifecycle quality coverage
  • Provide technical leadership and mentorship to QA engineers and partners across teams
  • Contribute to design reviews with a QA lens to ensure testability and risk mitigation
  • Maintain and manage multiple product test configurations aligned with diverse deployment environments
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

Principal Engineer

We are hiring a Principal Engineer for a client that provides a cloud-native Saa...
Location
Location
United States , New York
Salary
Salary:
175000.00 - 185000.00 USD / Year
resourcefultalentgroup.com Logo
Resourceful Talent Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience building SaaS applications
  • 3+ years in a Principal Engineer or Tech Lead role
  • Strong C#, ASP.NET, .NET Core, and xUnit experience
  • 1+ years AWS experience: Lambda, API Gateway, S3, CloudFormation, RDS, DynamoDB, EC2, Cognito
  • Strong system architecture and distributed systems expertise
  • Ability to review and guide React/TypeScript development
  • Excellent leadership and collaboration skills
  • Bachelor’s degree in Computer Science or related field (preferred)
  • Must be a U.S. Citizen or Green Card holder
Job Responsibility
Job Responsibility
  • Lead the architectural direction and modernization of a large-scale SaaS platform
  • Improve system scalability, performance, and reliability
  • Build features in .NET and AWS (Lambda, API Gateway, DynamoDB, RDS, etc.)
  • Guide backend, frontend, DevOps, QA, and offshore engineers
  • Implement and uphold engineering best practices and code standards
  • Collaborate with Product leadership to deliver high-quality releases
  • Fulltime
Read More
Arrow Right

Principal Product Support Engineer

Hewlett Packard Enterprise is seeking a master-level Principal Product Support E...
Location
Location
United States , Oklahoma City; Dallas; Houston
Salary
Salary:
152000.00 - 349000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in a technical field or equivalent experience demonstrating advanced expertise
  • US Citizenship with active Secret clearance
  • Industry-recognized certifications, including CompTIA Security+ (or higher, such as CASP or CISSP)
  • Cloud certifications (e.g., AWS Certified Solution Architect, Microsoft Azure Solution Architect, Google Professional Cloud Architect)
  • 10+ years of hands-on experience in IT support, cloud architecture, virtualization, or related areas, with a proven record of resolving deeply technical issues and leading support for federal customers
Job Responsibility
Job Responsibility
  • Serve as the top-tier escalation point for the most challenging technical issues within HPE Private Cloud and related technologies
  • Lead in-depth troubleshooting across multi-cloud, virtualization, and infrastructure platforms (AWS, Azure, Google Cloud, VMware ESX, Kubernetes)
  • Collaborate directly with BU engineering teams and managed services personnel to drive resolution of systemic, high-impact issues and develop critical patches and product enhancements
  • Analyze, identify, and architect solutions for recurring or complex customer issues, ensuring permanent resolution and knowledge transfer
  • Demonstrate mastery across all supported platforms, infrastructure, and technologies, acting as the subject matter expert for internal teams and federal customers
  • Develop and review automated solutions leveraging DevOps principles, CI/CD pipelines, and Infrastructure as Code
  • Lead compliance efforts for DISA STIGs and other federal standards, ensuring audit readiness and system hardening
  • Mentor and guide technical support engineers, sharing expertise and ensuring best practices are followed across teams
  • Work closely with federal customer stakeholders to understand business needs, translate requirements, and deliver innovative technical solutions
  • Engage regularly with product management, engineering, and BU teams to influence and prioritize product fixes, enhancements, and updates
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion culture
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right