This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Senior Cloud Platform Engineer based in our Sydney Head Office, you will be the foundational architect and operational steward of Nuix's SaaS delivery model. This is a critical leadership role for an engineer who thrives at the intersection of strategic design and hands-on execution. You will be instrumental in defining, implementing, and managing the multi-cloud infrastructure (leveraging AWS, Azure, GCP, etc.) that hosts our cutting-edge SaaS platform. Your mandate will be to ensure the platform is secure, highly available, and scalable for our global customers, while relentlessly driving cost optimization and operational excellence (SLOs). This role bridges the gap between our strategic business objectives and technical delivery, requiring a deep commitment to Infrastructure as Code (IaC), advanced DevOps practices, and effective cross-functional collaboration. You won't just build systems; you will architect for the future, mentor team members, and act as an Incident Commander to ensure business continuity and resilience. If you are passionate about owning the platform lifecycle and driving innovation through continuous learning, this is your opportunity to significantly impact Nuix's cloud journey.
Job Responsibility:
Infrastructure Design and Implementation: Designing the cloud environment (utilizing AWS, Azure, GCP, etc.) and defining architecture patterns and standards that support a multi-tenant, modular SaaS model
Ensuring Operational Excellence: Establishing and applying consistent procedures and best practices for managing and delivering the cloud platform to meet agreed-upon Service-Level Objectives (SLOs)
Scalability and Reliability: Architecting systems with redundancy, fault tolerance, and high availability to handle fluctuating workloads and minimize downtime, potentially leveraging microservices and serverless computing
Security and Compliance: Designing and implementing robust security controls (e.g., identity and access management, encryption, firewalls), monitoring for threats, and ensuring compliance with industry regulations like GDPR or HIPAA
Performance Monitoring and Optimization: Setting up comprehensive monitoring and observability tools to track key metrics and performance bottlenecks. Continuously optimizing resource allocation (e.g., rightsizing, auto-scaling) and application code to improve performance and responsiveness
Cost Management and Optimization: Evaluating and optimizing cloud service costs and usage. This includes implementing tagging strategies, managing budgets, and working with finance teams to ensure cost efficiency
Automation and DevOps Practices: Leveraging Infrastructure as Code (IaC) tools like Terraform and implementing CI/CD pipelines to automate deployment, configuration, and management tasks, reducing manual errors and speeding up delivery
Disaster Recovery and Business Continuity: Designing and testing disaster recovery (DR) and business continuity (BC) plans, including automated backups and recovery procedures, to mitigate the impact of outages. Act as the Incident Commander when required
Collaboration and Communication: Working closely with software developers, DevOps & CloudOps engineers, security teams, and business stakeholders to align technical solutions with organizational goals and provide technical guidance and mentorship
Continuous Learning: Staying abreast of new cloud technologies, trends, and provider updates (e.g., new AWS/Azure/GCP services) to drive innovation and maintain a cutting-edge platform
Requirements:
Deep Cloud Expertise: Extensive hands-on experience designing, deploying, and managing complex, production-grade cloud environments across at least one major provider (AWS is preferred, Azure, or GCP), with exposure to multi-cloud concepts
Infrastructure as Code (IaC): Advanced proficiency with IaC tools such as Terraform for managing and provisioning infrastructure in a declarative, repeatable manner
DevOps and CI/CD: Proven ability to implement and manage robust CI/CD pipelines (e.g., GitLab CI, Jenkins, Ansible) to automate configuration, testing, and deployment processes
Networking and Security: Expert knowledge of cloud networking (VPCs, VNETs, routing, load balancing) and security principles, including IAM, encryption standards, firewall rules, and robust security monitoring implementation
Containerization & Orchestration: Strong working knowledge of container technologies (Docker) and orchestration platforms (Kubernetes) is highly desirable, especially in a microservices context
Monitoring and Observability: Experience setting up and utilizing comprehensive monitoring and logging systems (e.g., Sumologic, Grafana, ELK Stack, Datadog) to establish and enforce SLOs
Scripting/Programming: Proficiency in scripting languages (e.g., Python, Bash, PowerShell) for automation and system management tasks
Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
7+ years of progressive experience in cloud engineering, SRE, or DevOps roles, with a significant tenure in a senior capacity
Strong Analytical and Cost Management Skills: Demonstrated ability to analyze cloud consumption patterns, implement tagging strategies, and drive significant cost optimization initiatives
Operational Leadership: Experience as an Incident Commander or in leading the response, resolution, and root cause analysis (RCA) process for major platform outages
Excellent Communication and Collaboration: Ability to articulate complex technical concepts to both technical and non-technical stakeholders, and to provide technical guidance and mentorship to junior team members
System Design: Proven track record in designing high-availability, fault-tolerant, and disaster recovery architectures for mission-critical SaaS applications