Cloud Site Reliability Engineer Job at Airbus (Toulouse)

Cloud Site Reliability Engineer

Airbus Commercial Aircraft is looking for a Cloud Site Reliability Engineer (f/m...

Location

France , Toulouse

Salary:

Not provided

Airbus

Expiration Date

Until further notice

Requirements

Expert-level proficiency in GCP Deployment Manager, Google Cloud CDK (Cloud Development Kit), and Python
Comprehensive and hands-on knowledge of a wide range of GCP services and their architectural best practices
Strong background and practical experience in cloud security principles and compliance frameworks
Proven experience with DevOps methodologies and working within agile environments
Demonstrated ability to architect and implement scalable infrastructure solutions
A strategic thinker with a keen focus on optimizing infrastructure efficiency, scalability, and cost-effectiveness
A collaborative team player with excellent communication skills, capable of working effectively across engineering teams.

Job Responsibility

Spearhead the development and maintenance of our cloud infrastructure using GCP Deployment Manager templates and CDK scripts for efficient resource provisioning
Architect and implement highly scalable, secure, and resilient cloud infrastructure solutions across various GCP services
Collaborate closely with development and operations teams to deeply integrate IaC practices into our entire software development lifecycle
Conduct thorough code reviews and mentor team members, ensuring adherence to best practices in cloud architecture, security, and operational excellence
Drive continuous improvement in our infrastructure by identifying opportunities for automation, optimization, and enhanced reliability.

What we offer

Financial rewards: Attractive salary, agreements on success and profit sharing schemes, employee savings plan abounded by Airbus and employee stock purchase plan on a voluntary basis
Work / Life Balance: Extra days-off for special occasions, holiday transfer option, a Staff council offering many social, cultural and sport activities and other services
Wellbeing / Health: Complementary health insurance coverage (disability, invalidity, death). Depending on the site: health services center, concierge services, gym, carpooling application
Individual development: Great upskilling opportunities and development prospects with unlimited access to +10.000 e-learning courses to develop your employability, certifications, expert career path, accelerated development programmes, national and international mobility.

Fulltime

Cloud Engineer / Site Reliability Engineer (SRE)

Location

United States , Orlando

Salary:

75.00 USD / Hour

Beacon Hill

Expiration Date

Until further notice

Requirements

Strong hands-on AWS experience with solid understanding of core AWS services
Experience supporting and troubleshooting AWS and Azure cloud environments
Terraform experience for Infrastructure as Code
Docker/containerization experience
Strong troubleshooting and problem-solving skills
Ability to translate requirements into technical execution
Experience performing cloud architecture and diagramming
Experience supporting deployments, environments, and site standups
Strong communication and collaboration skills

Job Responsibility

Support cloud infrastructure and deployments across AWS and Azure
Troubleshoot infrastructure and application-related cloud issues
Build and maintain Terraform-based infrastructure
Support Docker/containerized environments
Create architecture diagrams and technical documentation
Work closely with engineering and project teams to execute cloud initiatives
Assist with automation and operational improvement efforts

Fulltime

Senior Site Reliability Engineer Cloud Platform

Zilliz is a fast-growing startup developing the industry’s leading vector databa...

Location

Salary:

175000.00 - 225000.00 USD / Year

Zilliz

Expiration Date

Until further notice

Requirements

4+ years of experience in site reliability engineering or similar roles with a focus on cloud-native systems
Proficiency in scripting languages such as Python, Go, or Java
Strong knowledge of container orchestration technologies like Kubernetes and Docker
Expertise with cloud platforms such as AWS, GCP, or Azure, and their respective monitoring and management tools
Experience with infrastructure as code tools such as Terraform or Ansible
Familiarity with CI/CD tools such as Jenkins, GitLab CI, or Argo
Proven ability to troubleshoot complex distributed systems and resolve issues promptly
Bachelor’s degree or above in computer science, software engineering, or other relevant disciplines
Ability to thrive in a fast-paced, startup environment and handle multiple projects simultaneously

Job Responsibility

Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms
Ensure the reliability, availability, and performance of Zilliz’s distributed database systems
Develop and implement strategies for monitoring, incident management, and disaster recovery
Automate system operations and maintenance tasks to improve efficiency and reduce manual intervention
Design and build tools to manage and monitor infrastructure, ensuring scalability and robustness
Collaborate with software engineers to enhance system reliability, scalability, and performance
Maintain and improve the CI/CD pipeline to ensure smooth and rapid deployment of changes
Actively contribute to the Milvus Vector Database open-source community, focusing on improving reliability and operational efficiency

Fulltime

New

Principal Site Reliability Engineer (Sovereign Cloud)

Location

Bulgaria , Sofia

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

6+ years as DevOps engineer with a passion for technology, strong motivation and responsibility
Proficiency in DevOps and Platform Engineering with expertise in AWS, GCP, Terraform, ArgoCD, Kubernetes, and related tools
Experience in developing and maintaining CI/CD pipelines for continuous delivery in agile environments
Skilled in managing cloud infrastructure, particularly with AWS and GCP, and adept in infrastructure as code practices using Terraform/Terragrunt
Demonstrated capability in supporting high-scale SaaS applications, focusing on scalability, reliability, and performance
Strong communication, strategic thinking, and problem-solving skills
Self-disciplined, self-managed, self-motivated, strong sense of ownership, urgency, and drive
Ready to understand and dissect new technology stacks quickly

Job Responsibility

Implement and optimize CI/CD pipelines and cloud infrastructure using our technology stack, ensuring efficient and reliable deployment to production
Participate in the deployment of monitoring and alerting systems to maintain high system performance and reliability
Collaborate with software development and other cross-functional teams to streamline and enhance processes, aiming for efficiency and alignment with business goals
Contribute to the management of the cloud infrastructure, utilizing Infrastructure as Code principles
Participate in on-call rotations to support critical business and production systems

Fulltime

New

Sr Principal Site Reliability Engineer (Sovereign Cloud)

The Prisma Access team is seeking a seasoned Principal Site Reliability Engineer...

Location

Bulgaria , Sofia

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

10+ years of experience in Infrastructure, SRE, or DevOps roles
BS or MS in Computer Science, a related field, or equivalent professional experience
7+ years of experience with GCP, and expertise in their architecture, services and PKI concepts for cloud security
Expert troubleshooting skills to resolve cloud infrastructure and service issues, effectively identifying root cause and devising effective solutions
Proficiency in automation using Python and shell scripting
Expertise in Infrastructure as Code (IaC) with Terraform and Helm, leveraging AI tools for development
Solid experience with Kubernetes, container networking, and container workloads
Strong Linux administration skills
Proficiency with CI/CD pipelines, GitOps principles, and tooling like GitLab and Jenkins
Excellent written and verbal communication skills, with the ability to collaborate effectively to drive outcomes

Job Responsibility

Design, build, and operate reliable, secure Cloud infrastructure across multi-cloud environments for our sovereign customers
Lead cross-functional initiatives to ensure applications are production-ready, scalable, secure, and resilient
Develop expertise in new technologies, embracing continuous learning and the adoption of AI tools
Develop tools and automation frameworks, championing Infrastructure as Code (IaC) and Monitoring as Code (MaC) principles
Automate robust deployments and orchestrate end-to-end monitoring and alerting solutions
Participate in on-call rotations to support critical business and production systems
Lead root cause analysis of critical issues, driving improvements and preventing recurrence
Champion the success of SRE and DevOps initiatives, aligning technical decisions with business goals

Fulltime

New

Sr Principal Site Reliability Engineer (Sovereign Cloud)

Palo Alto Networks runs a large infrastructure and is one of the largest GCP cus...

Location

Bulgaria , Sofia

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

10+ years as an engineer in Infrastructure, Operations, DevOps, or System Engineering
7+ years building high availability, scalable cloud-native applications on AWS and GCP
BS or MS in Computer Science, a related field, or equivalent professional experience required
Expertise in configuration management with a framework such as Ansible, Terraform, Helm
Passion for infrastructure and monitoring as code
Solid experience in container workloads and Kubernetes
Familiarity with PKI concepts, Networking concepts
In-depth knowledge of different security controls ( app-id, user-id, security profile, url category, content, ssl decryption, firewall MFA etc)
Linux administration, internals, and network troubleshooting
Proficiency with programming languages like Golang or Python along with shell scripting to automate tasks

Job Responsibility

Contribute to the success of SRE and DevOps
Develop expertise in new technologies
Work with developers, researchers, data scientists, and security experts
Design, build and operate reliable, secure Cloud infrastructure
Ensure that applications are production-ready, scalable, and reliable
Develop tools and automation frameworks
Automate robust deployment of robust services
Orchestrate end-to-end monitoring and alerting
Participate in on-call rotations to support critical business and production systems
Lead root cause analysis of critical business and production issues

Fulltime

New

Principal Site Reliability Engineer (Sovereign Cloud)

As a Principal Site Reliability Engineer, you will serve as the technical author...

Location

Bulgaria , Sofia

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

7+ years of experience in Infrastructure, SRE, or DevOps roles
BS or MS in Computer Science, a related field, or equivalent professional experience
Kubernetes Mastery: Expert-level experience (6+ years) managing production K8s workloads (preferably within GKE, but will also consider EKS)
Deep understanding of Networking, Storage, and RBAC
CI/CD & GitOps: Hands-on expertise with ArgoCD and modern pipeline runners (GitHub Actions, GitLab CI, or Jenkins)
Programming: Proficient in Python for systems programming and automation
Security Mindset: Proven experience integrating security scanning and compliance checks within a containerized environment
Modern Workflow: Experience (or strong desire) using AI-pair programming tools like Cursor and Claude to multiply personal and team productivity
Excellent written and verbal communication, able to collaborate and rally support
Self-disciplined, self-managed, self-motivated, strong sense of ownership, urgency, and drive

Job Responsibility

Infrastructure Leadership: Architect and oversee large-scale Kubernetes clusters in GKE, ensuring high availability, performance tuning, and cost optimization
GitOps & Orchestration: Design and refine complex CI/CD lifecycles using ArgoCD, moving toward a fully declarative infrastructure-as-code model
Security Engineering: Implement and manage security scanning tools (e.g., Prisma Cloud, Snyk, or GKE native security) to ensure container integrity and shift-left security compliance
Automation & Tooling: Develop sophisticated automation scripts and internal tools using Python to eliminate manual toil and improve system observability
AI-Driven Development: Lean into the future of engineering by utilizing Cursor and Claude to accelerate coding, debugging, and documentation tasks
Incident Management: Act as a final escalation point for complex infrastructure outages, conducting blameless post-mortems to drive systemic improvements
Participate in on-call rotations to support critical business and production systems

Fulltime

New

Principal Site Reliability Engineer (Sovereign Cloud)

We are looking for a Principal Engineer to join our SDWAN engineering team. You ...

Location

Bulgaria , Sofia

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

6+ years as DevOps engineer with a passion for technology, strong motivation and responsibility
Proficiency in DevOps and Platform Engineering with expertise in AWS, GCP, Terraform, ArgoCD, Kubernetes, and related tools
Experience in developing and maintaining CI/CD pipelines for continuous delivery in agile environments
Skilled in managing cloud infrastructure, particularly with AWS and GCP, and adept in infrastructure as code practices using Terraform/Terragrunt
Demonstrated capability in supporting high-scale SaaS applications, focusing on scalability, reliability, and performance
Excellent written and verbal communication, able to collaborate and rally support
Self-disciplined, self-managed, self-motivated, strong sense of ownership, urgency, and drive
Passion for infrastructure and monitoring as code
Ready to understand and dissect new technology stacks quickly

Job Responsibility

Implement and optimize CI/CD pipelines and cloud infrastructure using our technology stack, ensuring efficient and reliable deployment to production
Participate in the deployment of monitoring and alerting systems to maintain high system performance and reliability
Collaborate with software development and other cross-functional teams to streamline and enhance processes, aiming for efficiency and alignment with business goals
Contribute to the management of the cloud infrastructure, utilizing Infrastructure as Code principles
Participate in on-call rotations to support critical business and production systems

Fulltime

Select Country

Cloud Site Reliability Engineer

Job Description

Job Responsibility

Requirements

What we offer

Looking for more opportunities?

Cloud Site Reliability Engineer

Cloud Site Reliability Engineer

Cloud Engineer / Site Reliability Engineer (SRE)

Senior Site Reliability Engineer Cloud Platform

Principal Site Reliability Engineer (Sovereign Cloud)

Sr Principal Site Reliability Engineer (Sovereign Cloud)

Sr Principal Site Reliability Engineer (Sovereign Cloud)

Principal Site Reliability Engineer (Sovereign Cloud)

Principal Site Reliability Engineer (Sovereign Cloud)

Our AI answers in your language