This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Senior Infrastructure Engineer, you will be a key technical leader, driving the reliability, scalability, and efficiency of our cloud-based infrastructure and SaaS products. This role combines deep technical expertise with a passion for mentoring and sharing knowledge. You will leverage your deep experience with AWS, SRE principles, automation, and infrastructure management to improve our systems, while also guiding and supporting your colleagues in their technical growth. Your focus will be on hands-on technical work, mentoring, and contributing to the overall improvement of our SRE practices, with a strong emphasis on automation using tools like Terraform and Bitbucket Pipelines.
Job Responsibility:
Serve as a technical expert and mentor to other engineers, sharing knowledge and best practices
Lead by example, demonstrating strong technical proficiency in SRE principles and practices, specifically within the AWS ecosystem
Contribute to the development and implementation of SRE standards and guidelines, tailored to AWS best practices
Foster a culture of continuous learning and improvement within the team
Help others to grow their automation skillsets
Design, build, and maintain robust and scalable infrastructure using Terraform, leveraging AWS services effectively
Develop and optimize CI/CD pipelines using Bitbucket Pipelines, integrating seamlessly with AWS deployment strategies
Implement and maintain monitoring and logging solutions to ensure system observability, utilizing AWS monitoring tools
Automate infrastructure and operational tasks to reduce toil and improve efficiency, with a focus on AWS automation
Contribute to the development and maintenance of automation tools and scripts
Troubleshoot complex infrastructure and application issues within the AWS environment
Respond to on-call Sev 1 incidents, particularly those occurring during the Australian (AU) time zone, and participate in a 24/7 on-call rotation approximately once per month
Participate in incident response and root cause analysis, contributing to the resolution of critical issues on AWS
Define and monitor SLOs/SLAs to ensure system reliability, using AWS metrics and monitoring
Contribute to disaster recovery planning and testing, utilizing AWS disaster recovery capabilities
Analyze system performance and identify areas for improvement within AWS
Proactively find and resolve potential issues before they become incidents
Collaborate with development, operations, and other teams to ensure smooth and efficient operations on AWS
Contribute to code reviews and technical discussions
Identify and implement process improvements to enhance team efficiency and effectiveness
Document best practices and create knowledge-sharing resources
Participate in agile ceremonies
Requirements:
Deep experience with AWS, including core services like EC2, S3, RDS, Lambda, CloudWatch, EKS, and a solid understanding of AWS networking (VPC, Security Groups) and security fundamentals (IAM)
4+ years of experience working with public cloud technologies (AWS preferred)
4+ years of experience developing monitoring and log analysis tools, including proficiency with Grafana and New Relic
Deep understanding of Site Reliability Engineering (SRE) principles, platforms, and tools
Proven experience with Terraform and Bitbucket Pipelines
Strong understanding of CI/CD pipelines and SDLC
Experience with Docker and Kubernetes
Proficiency in scripting languages (bash, Python)
Experience implementing and managing security controls and tools
Understanding of security systems and best practices
Experience with git and code branching/merging strategies
Experience with Agile methodologies (Scrum, Kanban)
Strong problem-solving and troubleshooting skills
Excellent communication and collaboration skills
Passion for mentoring and sharing knowledge
Automation-first mindset
Ability to own medium to large technical projects
Candidates MUST resides in Australia to be considered