CrawlJobs Logo

Principal DevOps Engineer

revelit.com Logo

Revel IT

Location Icon

Location:
United States , Columbus

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

The Principal DevOps Engineer owns the clarity, reliability, security, and repeatability of how our systems are built, deployed, and operated. This role designs and maintains automated, scalable, secure, and cost-effective infrastructure across production, development, and test environments. This is a deeply hands-on role responsible for executing and improving deployments, observability, and core operational practices to reduce risk caused by opaque processes, undocumented knowledge, and single points of failure.

Job Responsibility:

  • Own and execute deployment processes end-to-end, ensuring they are secure, repeatable, transparent, and well documented with clear failure signals and automated rollback strategies
  • Design, build, and maintain automated, scalable, secure, and cost-effective infrastructure across production, development, and test environments
  • Build, operate, and continuously improve CI/CD pipelines with clear failure signals, recovery paths, and rollback strategies
  • Own application-level networking and infrastructure concerns, including network configuration, access controls, and connectivity required to support development and production environments
  • Own all infrastructure and networking concerns, including the configuration and troubleshooting of site-to-site VPNs, firewall rules, and secure connectivity required for county-level integrations and remote access
  • Own day-to-day DevOps operations, including infrastructure health, monitoring, logging, patching, security posture, and maintenance, ensuring systems are observable and failures are diagnosable through strong metrics, logging, root-cause visibility, and effective incident response
  • Perform regular access analysis across all systems, managing secrets, credentials, and IAM roles to ensure strict adherence to security best practices
  • Proactively support compliance requirements (such as SOC 2) by maintaining auditable operational practices and generating technical evidence/reports for software and security audits
  • Enforce security posture through proactive patching, encryption, and vulnerability management across web servers, load balancers, and data stores
  • Partner with software engineers during deployments and operational work to build shared understanding and enable safe, independent troubleshooting
  • Deploy, manage, and scale web and application servers, load balancers, queues, and caches through automated, repeatable workflows
  • Identify, prioritize, and deliver improvements that reduce operational risk, remove bottlenecks, improve efficiency, and increase delivery confidence
  • Document systems and processes with a focus on explaining both how they work and why
  • Take proactive ownership of workload while ensuring strong coordination and transparency across the team
  • Perform other job-related duties as assigned to support departmental goals and continuous improvement initiatives

Requirements:

  • 6-10 years of hands-on experience in DevOps, infrastructure, or platform engineering supporting production systems
  • Advanced programming experience (Python, Go, Ruby, etc.)
  • Proficiency with Linux/Unix administration, scripting, and programming (bash, Python, Ruby, etc.)
  • Deep hands-on expertise with core DevOps technologies such as Docker, Terraform, Ansible, and CloudFormation
  • Strong experience building and improving CI/CD workflows for provisioning, deployment, and scaling
  • Hands-on experience managing application-level networking, VPN configurations, load balancers, and connectivity required for secure, distributed environments
  • Experience implementing test automation and use of AI-assisted tooling to improve deployment quality, reliability, and operational efficiency
  • Strong troubleshooting and monitoring skills for Linux operating systems
  • Hands-on experience implementing monitoring and log aggregation platforms (ELK, Graylog, Graphite, Prometheus, etc.)
  • Experience deploying and managing web/ application servers, load balancers, queues, and caches
  • proven ability to scale and build resiliency into web architectures
  • Proficiency with source control systems and IaC to create scriptable, repeatable processes
  • Must be authorized to work in the U.S.

Nice to have:

  • AWS experience (EC2, S3, Route 53, VPC, IAM, EKS)
  • Database administration experience
  • Experience supporting or deploying 12-Factor applications

Additional Information:

Job Posted:
March 21, 2026

Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Principal DevOps Engineer

Principal DevOps Engineer

As an Engineer well into your career, we know you're an expert at what you do an...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors, Masters, or PhD in Computer science in a related technical field or similar experience
  • 10+ years of experience in software development and architecture
  • Expert-level experience with one or more prominent languages such as Java, C# or C/C++ is crucial
  • An expert in at least one technical topic/domain
  • Passion for collaborating with and mentoring junior members of the team
  • A real appetite for helping others learn and grow
  • Considers the customer impact when making technical decisions
Job Responsibility
Job Responsibility
  • Regularly tackle the largest and most complex problems on the team, from technical design to launch
  • Deliver solutions that are used by other teams and products
  • Determine plans-of-attack on large projects
  • Routinely tackle complex architecture challenges and apply architectural standards and start using them on new projects
  • Lead code reviews & documentation as well as take on complex bug fixes, especially on high-risk problems
  • Set the standard for thorough, meaningful code reviews
  • Partner across engineering teams to take on company-wide initiatives spanning multiple projects
  • Transfer your depth of knowledge from your current language to excel as a Software Engineer
  • Mentor more junior members
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Principal Software Engineer – Cloud Security

Principal Software Engineer – Cloud Security role at Hewlett Packard Enterprise,...
Location
Location
Israel , Tel Aviv
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master’s degree in computer science, engineering, information systems, or closely related quantitative discipline
  • Typically, 10-15 years’ experience
  • Deep expertise in software systems design, development methodologies, and integration across diverse platforms and technologies
  • Strong business acumen, focusing on aligning technological initiatives with business goals and driving sustainable growth and profitability
  • Exceptional analytical and problem-solving skills, with the ability to navigate complex technical challenges and drive impactful solutions
  • Track record of driving technological innovation, with a portfolio of patents and successful product deployments
  • Exceptional communication and stakeholder management skills, with the ability to effectively convey complex technical concepts to non-technical audiences and influence decision-making at the executive level
Job Responsibility
Job Responsibility
  • Leads the identification, evaluation, and adoption of cutting-edge technologies, innovations, and strategic partnerships to drive growth and competitiveness
  • Drives developing and implementing robust methodologies, standards, and best practices for software systems design, development, and integration
  • Leverages recognized domain expertise and experience to influence decisions
  • Collaborates with executive leadership to align technology initiatives with business objectives, ensuring technology investments deliver measurable value and impact
  • Champion a culture of continuous innovation, thought leadership, and excellence in software systems design and help build technical community
  • Provides strategic guidance and mentorship to senior technical teams, fostering a culture of collaboration, creativity, and high-performance outcomes
  • Analyzes science, engineering, business, and other data processing problems to develop and implement solutions to complex application problems, system administration issues, or network concerns
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Principal Software Engineer

Principal Software Engineer (Golang | Distributed Systems) to join a high-growth...
Location
Location
United Kingdom , London
Salary
Salary:
170000.00 GBP / Year
weareorbis.com Logo
Orbis Consultants
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years’ backend engineering experience, ideally at Staff / Principal / Tech Lead level
  • Expert-level proficiency with Golang in production
  • Proven track record designing distributed systems and event-driven architectures (Kafka, RabbitMQ, WebSockets)
  • Deep understanding of PostgreSQL, Redis, and high-performance data systems
  • Strong DevOps mindset – CI/CD, infrastructure as code, observability (Grafana, Prometheus, OpenTelemetry)
  • Exceptional communicator, able to influence architecture and direction across teams
Job Responsibility
Job Responsibility
  • Architect and scale high-throughput, event-driven systems built in Go
  • Lead the evolution of real-time APIs and data platforms handling billions of requests
  • Stay deeply hands-on with Golang while influencing design and long-term technical strategy
  • Drive improvements in observability, testing, and performance across all services
  • Mentor senior engineers and play a key role in shaping engineering culture
What we offer
What we offer
  • 25% bonus
  • excellent benefits
  • Fulltime
Read More
Arrow Right

Principal Engineer

The Principal AI/ML Operations Engineer leads the architecture, automation, and ...
Location
Location
United States , Pleasanton, California
Salary
Salary:
251000.00 - 314500.00 USD / Year
blackline.com Logo
BlackLine
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field
  • 10+ years in ML infrastructure, DevOps, and software system architecture
  • 4+ years in leading MLOps or AI Ops platforms
  • Strong programming skills in languages such as Python, Java, or Scala
  • Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
  • Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
  • Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
  • Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
  • Hands-on with observability stacks (Prometheus, Grafana, Newrelic) for model and agent performance tracking
  • Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
Job Responsibility
Job Responsibility
  • Define enterprise-level standards and reference architectures for ML-Ops and AIOps systems
  • Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
  • Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
  • Lead incident response and reliability strategies for ML/AI systems
  • Lead the deployment of AI models and systems in various environments
  • Collaborate with development teams to integrate AI solutions into existing workflows and applications
  • Ensure seamless integration with different platforms and technologies
  • Define and manage MCP Registry for agentic component onboarding, lifecycle versioning, and dependency governance
  • Build CI/CD pipelines automating LLM agent deployment, policy validation, and prompt evaluation of workflows
  • Develop and operationalize experimentation frameworks for agent evaluations, scenario regression, and performance analytics
What we offer
What we offer
  • short-term and long-term incentive programs
  • robust offering of benefit and wellness plans
  • Fulltime
Read More
Arrow Right

Senior Principal Solution Engineer

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location
Location
United States , All
Salary
Salary:
157500.00 - 361500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically, 10+ year's experience
  • Experience designing and developing software systems design tools and languages
  • Excellent analytical and problem solving skills
  • Experience in overall architecture of software systems for products and solutions
  • Designing and integrating software systems running on multiple platform types into overall architecture
  • Evaluating and selecting forms and processes for software systems testing and methodology, including writing and execution of test plans, debugging, and testing scripts and tools
  • History of innovation with multiple patents or deployed solutions in the field of software design
  • Excellent written and verbal communication skills
  • mastery in English and local language
Job Responsibility
Job Responsibility
  • Develops organization-wide architectures and methodologies for software systems design and development across multiple platforms and organizations within the Global Business Unit
  • Identifies and evaluates new technologies, innovations, and outsourced development partner relationships for alignment with technology roadmap and business value
  • creates plans for integration and update into architecture
  • Reviews and evaluates designs and project activities for compliance with development guidelines and standards
  • provides tangible feedback to improve product quality and mitigate failure risk
  • Leverages recognized domain expertise, business acumen, and experience to influence decisions of executive business leadership, outsourced development partners, and industry standards groups
  • Provides guidance and mentoring to less- experienced staff members to set an example of software systems design and development innovation and excellence
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Sr Staff/Principal Devops Engineer

Balbix is looking for a DevOps Sr Staff/Principal Engineer to join our growing t...
Location
Location
India , Delhi
Salary
Salary:
Not provided
balbix.com Logo
Balbix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science or a related field
  • 10+ years of experience in DevOps for Sr Staff or 12-15 years for Principal
  • 4+ years of experience setting up and managing infrastructure in AWS for a product development organization
  • Ability to independently architect, design, document, and implement complex platforms and complex DevOps systems
  • Solid understanding of AWS infrastructure and services such as load balancers (ALB/ELB), IAM, KMS, Networking, EC2, CloudWatch, CloudTrail, CloudFormation, Lambda, etc.
  • 4+ years of experience building infrastructure using Terraform
  • 3+ years of solid experience with Kubernetes and Helm
  • Expert-level programming experience with Python for scripting and automation
  • Excellent knowledge of working on configuration management systems such as Ansible
  • Hands-on experience with CI/CD code management and deployment technologies like GitLab, Jenkins, or similar
Job Responsibility
Job Responsibility
  • Lead the development of critical DevOps projects, set technical direction, and influence the organization's technical strategy
  • Solve complex problems, mentor senior engineers, and collaborate with cross-functional teams to deliver high-impact DevOps solutions
  • Design and develop IaC components for Balbix solutions and internal engineering tools running in AWS
  • Build and deploy a state-of-the-art security SaaS platform using the latest CI/CD techniques, ensuring it is fully automated, repeatable, and secure
  • Secure infrastructure using best practices (e.g., TLS, bastion hosts, certificate management, authentication and authorization, network segmentation)
  • Design and develop a scalable, cost-efficient deployment infrastructure on Kubernetes
  • Design and implement consistent observability systems for Balbix solutions
  • Participate in on-call rotation
  • Fulltime
Read More
Arrow Right

Principal Engineer, GenAI Innovation

The full stack Principal Engineer will work with a team of other software engine...
Location
Location
United States , Bellevue
Salary
Salary:
129400.00 - 233400.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or Engineering
  • 7+ years of proven experience in full stack web development
  • Technical engineering experience
  • Crafting database schemas, writing SQL
  • 3+ years of DevOps experience with infrastructure as code
  • 4-7 years using cloud services from AWS, Azure, or GCP
  • 2+ years of experience working with Generative AI models, APIs
  • Experience coaching and mentoring team members
  • Knowledge of HTML, CSS, Webpack, JavaScript, at least one front-end framework, and one back-end framework
  • Understanding of database modeling and SQL
Job Responsibility
Job Responsibility
  • Designs and builds full stack web solutions including both the back end and front end
  • Code Review and mentoring of other team members
  • Designs / crafts sophisticated scheduled jobs and micro-services defining new patterns and orchestrations
  • Designs / implements detailed data storage mechanisms using relational and non-relational data stores
  • Explores, creates and configures cloud services using infrastructure as code
  • Recommends new cloud services and patterns
  • Presents new ideas which improve an existing system/process/service
  • Collaborates with team to break down features into user stories and estimate them
  • Awareness of technology roadmap
  • Updates job knowledge by monitoring and understanding emerging engineering practices
What we offer
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off
  • Up to 12 paid holidays
  • Paid parental and family leave
  • Fulltime
Read More
Arrow Right

Principal Security Engineer

We’re seeking a Principal Security Engineer with deep expertise in cloud securit...
Location
Location
United States , San Francisco
Salary
Salary:
136000.00 - 241000.00 USD / Year
ethoslife.com Logo
Ethos
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in security engineering or architecture roles
  • Bachelor’s degree in Cybersecurity, Information Technology, Computer Science, or related field from a reputable institution
  • Deep expertise in cloud platforms (particularly AWS), including infrastructure-as-code (e.g., Terraform, CloudFormation)
  • Strong experience in secure software development and application security (e.g., OWASP Top 10, SAST, DAST, threat modeling)
  • Experience designing and implementing zero-trust architectures, secure API gateways, and identity/access controls
  • Proficient in scripting or development languages (e.g., Python, Go, JavaScript) and secure coding practices
  • Demonstrated leadership in cross-functional security initiatives and technical mentorship
  • Ability to come into our San Francisco, CA office once a week
Job Responsibility
Job Responsibility
  • Design and implement secure architectures for applications, APIs, microservices, and containerized workloads
  • Develop and enforce application security best practices across SDLC
  • partner with DevOps and engineering teams to integrate security into CI/CD pipelines
  • Conduct threat modeling, security design reviews, and risk assessments for new and existing systems
  • Evaluate and implement cloud security tools, controls, and frameworks (e.g., CSPM, CWPP, IAM, KMS, logging, and monitoring)
  • Provide technical leadership and mentorship to security engineers, software developers, and DevOps personnel
  • Lead response to complex security incidents or architectural flaws
  • conduct root cause analysis and recommend strategic remediations
  • Contribute to and influence security policies, standards, and governance
  • Stay current with emerging threats, vulnerabilities, and security technologies, advising stakeholders on evolving risks and mitigations
  • Fulltime
Read More
Arrow Right