CrawlJobs Logo

Principal Infrastructure Automation Engineer

cloud.com Logo

Cloud Software Group

Location Icon

Location:
United States , San Ramon

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

143957.00 - 259122.00 USD / Year

Job Description:

A Principal Infrastructure Automation Engineer is a senior technical leader who designs, develops, and implements large-scale, automated solutions to streamline IT operations and manage complex infrastructure environments. This role balances deep technical expertise with strategic planning and mentorship responsibilities.

Job Responsibility:

  • Define the strategy and long-term goals for automation and control systems across the organization
  • Lead the design and development of robust, scalable, and secure automation frameworks and infrastructure blueprints
  • Oversee the implementation of automation solutions for provisioning, configuration management, and application deployment, often within CI/CD pipelines
  • Serve as the subject matter expert (SME) for highly complex issues, performing root-cause analysis and providing technical consultation
  • Mentor and guide junior engineers and cross-functional teams on automation best practices, code quality, and design patterns
  • Ensure all automated systems comply with internal security standards and external regulatory requirements
  • Implement comprehensive monitoring and logging systems, using the gathered data to drive continuous improvement in efficiency, reliability, and performance

Requirements:

  • Typically requires 10+ years of experience in software development, IT automation, and/or infrastructure engineering
  • A Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field
  • Extensive experience with tools like Ansible, Puppet, Chef, Terraform, or AWS CloudFormation
  • Strong skills in languages such as Python, Java, PowerShell, or Bash for automation and tooling
  • Expertise in managing scalable and reliable cloud infrastructure in environments such as AWS, Azure, or Google Cloud Platform (GCP)
  • Proficiency with CI/CD pipelines, Docker, Kubernetes, and monitoring tools like Prometheus or Grafana
  • Exceptional problem-solving, analytical, and critical thinking skills
  • Strong communication and leadership abilities, with a proven ability to influence senior levels and build alignment across teams
  • Strong project management skills to manage complex, large-scale projects from concept to completion
What we offer:
  • Healthcare
  • Life insurance
  • Disability benefits
  • 401(k) plan and company match

Additional Information:

Job Posted:
January 31, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Principal Infrastructure Automation Engineer

Principal Infrastructure Engineer

The Principal Infrastructure Engineer, Electronic Trading is responsible for sys...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience
  • experience in delivering infrastructure technologies products and services
  • experience in financial services or large complex and/or global environment preferred
  • experience developing projects for the identification of best practices (design of metrics, analytical tools, benchmarking activities and related reporting)
  • consistently demonstrate clear and concise written and verbal communication with ability to communicate technical concepts to a non-technical audience
  • proven analytical, diagnostic, and multitasking skills with focus on execution and attention to detail
  • demonstrated ability to both work independently and partner with virtual teams in a high-pressure matrix environment
  • demonstrated ability to take ownership of various parts of a project/initiative with tight deadlines or unexpected changes in expectation/requirements
  • bachelor's degree/university degree or equivalent experience
  • master’s degree preferred
Job Responsibility
Job Responsibility
  • conduct work on a variety of high-impact, high-profile problems/projects driving technology infrastructure aligned to the business
  • identify and resolve issues, engaging in Root Cause Analysis (RCA) if escalation
  • conduct responsibilities such as quality control, work allocation, coaching/mentoring, ensuring ongoing compliance with regulatory requirements
  • evaluate controls to help mitigate negative outcomes through prevention, detection, and correction
  • design and create complex processes and reporting streams, participate in the review and approval of requirement documents
  • examine and update processes and procedures for hardware acquisition toward automation
  • understand diverse stakeholder needs and share and influence stakeholder expectations
  • appropriately assess risk when business decisions are made, demonstrating consideration for the firm’s reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
What we offer
What we offer
  • professional development opportunities
  • equal opportunity employer
  • work-life balance programs
  • Fulltime
Read More
Arrow Right

Principal Software Engineer - Research Infrastructure Team

We are seeking a highly motivated and experienced Senior Software Engineer, pass...
Location
Location
Israel , Tel Aviv
Salary
Salary:
Not provided
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS in Computer Science or equivalent knowledge or equivalent military experience required
  • 5+ years of software engineering experience
  • Expertise in Python and Python internals
  • Experience in designing, building and maintaining a user facing application/API
  • Experience with Git or other source controls
  • Good communication skills
  • Self-driven with the ability to work independently, take initiative, and drive processes end-to-end
Job Responsibility
Job Responsibility
  • Responsible for the complete software development life cycle including requirement analysis, design, development and deployment
  • Take part in integrating the newest features and technologies, automate workflows, and create user friendly tools and frameworks for researchers
  • Produce elegant, generic, modular and extendable code
  • Actively influence the processes and methods for researchers, affecting their day to day life
  • Fulltime
Read More
Arrow Right

Principal QA Automation Engineer w/ AI experience

We seek a Principal QA Automation Engineer with a strong background in Cypress, ...
Location
Location
Argentina , Gran Buenos Aires; Capital Federal; Mar del Plata
Salary
Salary:
Not provided
basicagency.com Logo
BASIC/DEPT®
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of hands-on experience with Cypress for UI and end-to-end automation testing
  • Expert-level proficiency in TypeScript, JavaScript and Python
  • Proven experience testing AI-powered or machine learning applications, including AI model validation techniques
  • Strong understanding of AWS services (e.g., Lambda, S3, SQS, CloudWatch, ECS/EKS) and how to validate applications deployed in cloud environments
  • Experience with threaded/multi-agent AI tools and how they impact test design and validation
  • Familiarity with version control (Git), containerization (Docker), and CI/CD pipelines (e.g., GitHub Actions, Jenkins, or CircleCI)
  • Strong communication, leadership, and mentoring skills
Job Responsibility
Job Responsibility
  • Lead the design and implementation of end-to-end test automation frameworks using Cypress with TypeScript
  • Define quality strategies for applications with AI/ML components, including deterministic and non-deterministic testing approaches
  • Collaborate with engineering, DevOps, and AI/ML teams to ensure quality across AI-infused features in production environments
  • Build and scale testing strategies for threaded AI applications running in AWS cloud infrastructure
  • Integrate automated tests into CI/CD pipelines to support frequent, reliable deployments
  • Mentor and guide mid- and senior-level QA engineers, setting best practices and driving a culture of quality-first development
  • Evaluate and introduce new tools, libraries, and frameworks to improve test coverage, performance, and developer experience
  • Participate in architectural discussions to ensure testability and reliability are baked into software designs from the start
  • Analyze test results, track quality metrics, and communicate risk and coverage to stakeholders
What we offer
What we offer
  • Premium healthcare through OSDE for the employee and their immediate family members
  • Mendel prepaid card with a monthly allowance for grocery purchases
  • Monthly reimbursements for Wi-Fi/electricity expenses
  • Monthly reimbursements for training/English classes
  • 100% covered “Plan Total” membership at Sportclub
  • Access to a our benefits platform through Bonda
  • A flexible vacation policy
  • Fulltime
Read More
Arrow Right

Principal Cloud Infrastructure Engineer

As Highspot continues to scale rapidly, building a robust and efficient platform...
Location
Location
United States , Seattle
Salary
Salary:
188696.00 - 282609.00 USD / Year
highspot.com Logo
Highspot
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of experience in software or infrastructure engineering
  • At least 5 years focused on platform engineering or cloud infrastructure at scale
  • Proven success designing and operating internal developer platforms in AWS environments
  • Expert-level experience with Kubernetes, including provisioning, cluster lifecycle management, workload orchestration, and multi-tenant design
  • Strong expertise in Terraform, GitOps tools (e.g., ArgoCD), and CI/CD systems (e.g., GitHub Actions, Spinnaker)
  • Deep understanding of cloud networking, IAM, service meshes, and container orchestration at scale
  • Familiar with the CNCF landscape and how to leverage open-source tools to solve platform problems
  • Passion for developer experience
  • Track record of technical leadership, mentoring, and influencing engineering culture at a large scale
  • Bachelor's or Master’s in Computer Science or related discipline, or equivalent practical experience
Job Responsibility
Job Responsibility
  • Design and build scalable platform capabilities that empower engineering teams to ship features reliably, securely, and quickly
  • Create and maintain developer-facing tools and paved paths (e.g., CI/CD pipelines, Kubernetes platforms, observability stacks, secrets management)
  • Implement Infrastructure-as-Code and GitOps patterns to promote consistency, automation, and compliance across environments
  • Collaborate with product, security, and compliance stakeholders to build platform services that meet SLAs and governance standards
  • Drive efforts to standardize and simplify infrastructure across cloud environments (AWS, Azure), enabling secure multi-cloud operation
  • Lead incident response, reliability engineering, and observability improvements that ensure platform uptime and performance
  • Act as a technical mentor and thought leader, guiding teams on infrastructure architecture, platform adoption, and best practices
  • Define and execute on a strategic roadmap to evolve the internal platform in line with company growth and technology direction
What we offer
What we offer
  • Comprehensive medical, dental, vision, disability, and life benefits
  • Health Savings Account (HSA) with employer contribution
  • 401(k) Matching with immediate vesting on employer match
  • Flexible PTO
  • 8 paid holidays and 5 paid days for Annual Holiday Week
  • Quarterly Recharge Fridays (paid days off for mental health recharge)
  • 18 weeks paid parental leave
  • Access to Coaches and Therapists through Modern Health
  • 2 volunteer days per year
  • Commuting benefits
  • Fulltime
Read More
Arrow Right

Principal Cloud Infrastructure Engineer

As Highspot continues to scale rapidly, building a robust and efficient platform...
Location
Location
Canada , Vancouver
Salary
Salary:
170435.00 - 230435.00 CAD / Year
highspot.com Logo
Highspot
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of experience in software or infrastructure engineering
  • At least 5 years focused on platform engineering or cloud infrastructure at scale
  • Proven success designing and operating internal developer platforms in AWS and/or Azure environments
  • Expert-level experience with Kubernetes, including provisioning, cluster lifecycle management, workload orchestration, and multi-tenant design
  • Strong expertise in Terraform, GitOps tools (e.g., ArgoCD), and CI/CD systems (e.g., GitHub Actions, Spinnaker)
  • Deep understanding of cloud networking, IAM, service meshes, and container orchestration at scale
  • Familiar with the CNCF landscape and how to leverage open-source tools to solve platform problems
  • Passion for developer experience
  • Track record of technical leadership, mentoring, and influencing engineering culture at a large scale
  • Bachelor's or Master’s in Computer Science or related discipline, or equivalent practical experience
Job Responsibility
Job Responsibility
  • Design and build scalable platform capabilities that empower engineering teams to ship features reliably, securely, and quickly
  • Create and maintain developer-facing tools and paved paths (e.g., CI/CD pipelines, Kubernetes platforms, observability stacks, secrets management)
  • Implement Infrastructure-as-Code and GitOps patterns to promote consistency, automation, and compliance across environments
  • Collaborate with product, security, and compliance stakeholders to build platform services that meet SLAs and governance standards
  • Drive efforts to standardize and simplify infrastructure across cloud environments (AWS, Azure), enabling secure multi-cloud operation
  • Lead incident response, reliability engineering, and observability improvements that ensure platform uptime and performance
  • Act as a technical mentor and thought leader, guiding teams on infrastructure architecture, platform adoption, and best practices
  • Define and execute on a strategic roadmap to evolve the internal platform in line with company growth and technology direction
What we offer
What we offer
  • Comprehensive medical, dental, vision, disability, and life benefits
  • Group Retirement Savings Plan (RRSP) and matching employer contributions (DPSP) with immediate vesting
  • Flexible PTO
  • Generous Holiday Schedule + 5 Days for Annual Holiday Week
  • Quarterly Recharge Fridays (paid days off for mental health recharge)
  • Flexible work schedules
  • Access to Coaches and Therapists through Modern Health
  • 2 Volunteer days per year
  • Monthly transportation allowance for employees that work in our Vancouver Hub location
  • Eligible for bonuses and stock options
  • Fulltime
Read More
Arrow Right

Principal Platform Engineer

Principal Platform Engineer role at Endor Labs building the Application Security...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.endorlabs.com Logo
Endor Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of Site Reliability Engineering or Platform Engineering experience
  • Deep hands-on expertise with Kubernetes and CNCF ecosystem in production environments
  • Significant experience with at least one major cloud provider (Azure, Google Cloud, or AWS)
  • Strong experience managing large infrastructure deployments using Terraform, OpenTofu, or Terragrunt
  • Hands-on experience with open source observability tools (Prometheus, Grafana, Mimir, Pyroscope)
  • Self-driven problem solver with initiative
  • Customer-focused engineering mindset
  • Clear communication skills across technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Build Cloud Infrastructure at Scale on Azure, Google Cloud, and AWS
  • Master Kubernetes & CNCF Ecosystem with multi-tenant clusters
  • Scale Observability Platform with Prometheus, Grafana, Mimir, and Pyroscope
  • Transform Developer Experience with self-service tools and automation
  • Drive Infrastructure as Code with Terraform/OpenTofu
  • Solve Complex Technical Challenges like zero-downtime migrations and cost optimization
  • Collaborate Across Teams with Security, Backend, and Product Engineering
  • Iterate and Innovate in fast-paced environment
  • Fulltime
Read More
Arrow Right

Principal Cloud Engineer

As the Principal Cloud Engineer, you will play a pivotal role in leading the arc...
Location
Location
United States
Salary
Salary:
Not provided
https://seamless.ai/ Logo
Seamless.AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 7 years of experience managing AWS cloud infrastructure at scale
  • Strong understanding of core AWS services (EC2, S3, RDS, Lambda, VPC, etc.) and expertise in designing and managing multi-region, scalable cloud architectures
  • Hands-on experience with Infrastructure as Code (IAC) tools like Terraform or CloudFormation
  • Proven track record of managing and optimizing cloud costs, using tools like AWS Cost Explorer, Trusted Advisor, or other cost-management platforms
  • Experience scaling large data systems (including databases, data lakes, and big data platforms) across distributed cloud environments
  • Expertise in disaster recovery planning, implementation, and management within a cloud infrastructure
  • Solid understanding of cloud security, including IAM policies, encryption, network security, and proactive threat and vulnerability mitigation strategies
  • Experience with monitoring and logging tools (e.g., CloudWatch, ELK stack, Prometheus) to ensure infrastructure health and performance
  • Ability to communicate complex technical concepts to a variety of stakeholders, including non-technical team members
  • Bachelor's degree in Computer Science, Information Systems, or a related field, or equivalent years of work experience
Job Responsibility
Job Responsibility
  • Design, implement, and manage highly scalable, secure, and cost-optimized AWS cloud infrastructure
  • Lead the automation of Infrastructure as Code (IAC) using tools like Terraform, CloudFormation, or similar technologies
  • Ensure high availability and reliability of systems, implementing disaster recovery and failover strategies
  • Collaborate with software development and data teams to optimize cloud architecture for large-scale data systems
  • Implement and maintain security best practices, including monitoring, threat detection, and vulnerability mitigation
  • Work on optimizing AWS costs while ensuring the infrastructure meets performance and scalability requirements
  • Stay current with the latest cloud technologies, and continuously improve the cloud environment with new tools and services
  • Provide technical leadership and mentorship to other engineers, promoting best practices in cloud operations and architecture
  • Monitor and respond to infrastructure incidents, ensuring timely resolutions and minimal downtime
  • Fulltime
Read More
Arrow Right

Principal Software QA Engineer

Principal Software QA Engineer to lead test architecture and automation strategy...
Location
Location
Puerto Rico , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of hands-on QA experience
  • Designing and building test automation frameworks from scratch
  • Non-functional testing (scale, reliability, performance, security)
  • Strong coding skills in Python, Java, or Go
  • Experience with Pytest, TestNG, JUnit, Playwright or similar tools
  • Deep understanding of Cloud platforms (AWS, Azure, GCP)
  • Microservices, Containers (Docker, Kubernetes)
  • Infrastructure & Data Center management
  • Linux/VM environments, Storage, Compute, Networking
  • REST APIs, JSON, SQL/NoSQL
Job Responsibility
Job Responsibility
  • Design, automate, and execute system-level test cases focused on scale, reliability, security, and performance
  • Lead the test automation strategy
  • evaluate and integrate new tools to improve efficiency and coverage
  • Collaborate closely with product, development, support, and platform engineering teams to ensure full lifecycle quality coverage
  • Provide technical leadership and mentorship to QA engineers and partners across teams
  • Contribute to design reviews with a QA lens to ensure testability and risk mitigation
  • Maintain and manage multiple product test configurations aligned with diverse deployment environments
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right