CrawlJobs Logo

Cloud Platform DevOps Engineer

Canada, Mississauga 94300.00 - 141500.00 USD / Year · Job Posted May 15, 2026
Apply Position
Job Link Share

Job Description

We are seeking an experienced (5+ years), motivated, and hands-on Cloud Platform DevOps Engineer to join our North American AI and DevOps Platform Engineering team. In this critical role, you will be responsible for enhancing the stability, reliability, and performance of our AI and DevOps platforms, which support a diverse ecosystem of AI applications, developer tools, and CI/CD pipeline technologies across the organization. You will actively contribute to infrastructure design, implementation, and maintenance, and facilitate agile development within the team. The ideal candidate is a strong technical leader who champions agile practices, drives continuous improvement, and excels in both coding and coaching, possessing a deep understanding of infrastructure and operational considerations for Artificial Intelligence and Machine Learning initiatives, with proven hands-on experience in DevOps tools and technologies such as Kubernetes, Docker, HELM, Ansible, DevOps tools, or similar CI/CD platforms, and proficiency in scripting and automation (e.g., Python, Bash). We are looking for someone with a track record of implementing scalable, resilient, and high-performance solutions, coupled with strong communication and collaboration skills, and an ability to mentor and guide junior team members, as you join a dynamic team committed to fostering innovation and collaboration.

Job Responsibility

  • Lead the design, implementation, and ongoing management of secure, scalable, and resilient infrastructure components
  • Administer and maintain secret and certificate management solutions using HashiCorp Vault
  • Perform hands-on administration and optimization of database systems (PostgreSQL, Oracle, MongoDB)
  • Deploy, monitor, and troubleshoot data orchestration workflows using Apache Airflow
  • Implement and manage messaging queues such as Kafka and IBM MQ
  • Develop, maintain, and troubleshoot RESTful API and SOAP integrations
  • Implement and optimize build and deployment processes using Gradle
  • Design, implement, and manage container orchestration platforms with Kubernetes and Helm
  • Configure and manage persistent storage solutions including PVC, SONiC NAS, and S3
  • Set up and maintain load balancing solutions (e.g., Nginx, HAProxy, AWS ELB/ALB, Kubernetes Ingress controllers)
  • Implement, configure, and utilize comprehensive monitoring and logging solutions (Prometheus, Grafana, ELK Stack)
  • Develop robust automation scripts and tools using Python, Bash, Go, or similar languages
  • Participate actively in on-call rotations
  • Create and maintain technical documentation, architecture diagrams, and runbooks
  • Proactively identify and resolve technical impediments and process bottlenecks
  • Collaborate closely with stakeholders to ensure a well-defined and prioritized backlog
  • Drive continuous improvement in the team's agile and DevOps practices

Requirements

  • 5+ years of experience
  • Experience in infrastructure design, implementation, and maintenance
  • Agile development
  • Understanding of infrastructure and operational considerations for AI and Machine Learning initiatives
  • Hands-on experience with Kubernetes, Docker, HELM, Ansible, DevOps tools, or similar CI/CD platforms
  • Proficiency in scripting and automation (e.g., Python, Bash)
  • Track record of implementing scalable, resilient, and high-performance solutions
  • Strong communication and collaboration skills
  • Ability to mentor and guide junior team members
  • Proven hands-on experience with HashiCorp Vault
  • Strong hands-on experience with at least two of PostgreSQL, Oracle, or MongoDB
  • Hands-on experience deploying, managing, and developing DAGs for Apache Airflow
  • Solid hands-on experience with Kafka and/or IBM MQ
  • In-depth hands-on experience with Kubernetes and Helm, including YAML configuration, troubleshooting PODs/Jobs/Deployments, and integrations with secrets management (CyberArk, HashiCorp)
  • Practical experience with Kubernetes PVCs, Persistent Volumes, S3, and/or enterprise NAS solutions (e.g., SONiC NAS)
  • Strong hands-on experience with Prometheus, Grafana, and the ELK Stack
  • High proficiency in Python, Bash, or Go for automation
  • Extensive hands-on experience with at least one major cloud provider (AWS, Azure, GCP)
  • Proficiency with IaC tools such as Terraform or Ansible
  • Experience designing, implementing, and maintaining CI/CD pipelines (e.g., Jenkins, GitLab CI, GitHub Actions)
  • Experience with RESTful API and SOAP web services
  • Proficiency with Gradle for build automation
  • Understanding of specific infrastructure requirements for deploying, managing, and scaling AI and Machine Learning workloads
  • Awareness of data management strategies and data governance principles relevant to AI/ML models
  • Familiarity with metrics and monitoring approaches for the performance and health of AI/ML applications
  • Proven experience acting as a Scrum Master within a technical team
  • In-depth knowledge and practical application of Agile principles and the Scrum framework
  • Excellent facilitation, coaching, and mentoring skills within a technical context
  • Strong verbal and written communication skills
  • Ability to guide technical discussions, influence architectural decisions, and drive best practices

Nice to have

  • Certified ScrumMaster (CSM) or Professional Scrum Master (PSM) certification
  • Relevant cloud certifications (e.g., AWS Certified DevOps Engineer, Azure DevOps Engineer Expert, GCP Professional Cloud DevOps Engineer)
  • Experience with site reliability engineering (SRE) principles and practices
  • Familiarity with other Agile scaling frameworks (e.g., SAFe, LeSS)
  • Exposure to MLOps platforms or tools (e.g., Kubeflow, MLflow)

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Cloud Platform DevOps Engineer

8 matching positions

DevOps Engineer - Google Cloud Platform

Your mission: Build and evolve our developer platforms to enable fast feedback, ...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
smartclip.tv Logo
Smartclip Europe GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of DevOps engineering experience
  • Expert in Google Cloud Platform (GCP)
  • Confident work with Kubernetes and Linux
  • Use of Infrastructure as Code, Docker/CRI-O, and shell scripting
  • Ownership of stack end-to-end
Job Responsibility
Job Responsibility
  • Build and evolve our developer platforms to enable fast feedback, high quality, and maximum flexibility
  • Design and automate developer platforms that empower teams to move fast and ship with confidence
  • Integrate modern tools into CI/CD pipelines – always with a focus on functional safety and reliability
  • Take ownership of our GKE / Kubernetes infrastructure on Google Cloud – and actively shape it
  • Harden systems proactively through strong security engineering – don’t wait for a pen test to find issues
  • Continuously improve our platform to make development faster, safer, and more scalable
What we offer
What we offer
  • 30 days of vacation + Dec 24 & 31 off
  • Smart Fridays (4 days week possible)
  • Mobility (Germany ticket & JobRad)
  • Sports & health offerings
  • Mental health support
  • Corporate benefits
  • RTL+ access
  • Fulltime
Read More
Arrow Right

Cloud DevOps & Data Platform Engineer

We are seeking a highly skilled Cloud DevOps & Data Platform Engineer with stron...
Location
Location
India , Mangalore
Salary
Salary:
Not provided
abottstech.com Logo
Abotts
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience in DevOps tools and practices (CI/CD, automation, monitoring)
  • Hands-on experience with Google Cloud Platform (GCP)
  • Proficiency in Linux system administration
  • Solid expertise in Power BI (reports, dashboards, DAX, data modeling)
  • Strong knowledge of PL/SQL and relational DBMS concepts
  • Experience with Python scripting for automation and data processing
  • Understanding of cloud security, IAM, and networking concepts
  • Experience working in Agile / DevOps environments
  • 3+ Year Experience
  • Availability for night shifts as needed to support global teams
Job Responsibility
Job Responsibility
  • Design, implement, and maintain CI/CD pipelines and cloud infrastructure using DevOps best practices
  • Manage and optimize GCP cloud services, ensuring high availability, scalability, and cost efficiency
  • Develop, deploy, and monitor applications and services on Linux-based systems
  • Support and enhance Power BI dashboards, datasets, and data models for business reporting
  • Write, optimize, and maintain PL/SQL queries, procedures, and database objects
  • Perform database administration tasks related to DBMS performance, tuning, and reliability
  • Develop automation scripts and utilities using Python for infrastructure and data workflows
  • Implement monitoring, logging, and alerting for infrastructure and applications
  • Ensure security, access control, and compliance across cloud and data platforms
  • Collaborate with engineering, analytics, and business teams to deliver scalable solutions
  • Fulltime
Read More
Arrow Right

Platform Engineer / DevOps Engineer – Trading

My client are seeking a knowledgeable Platform Engineer to work on their low lat...
Location
Location
United States , New York
Salary
Salary:
150000.00 - 300000.00 USD / Year
hunterbond.com Logo
Hunter Bond
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Linux experience
  • Experience with Chef, Puppet or Ansible
  • Experience with Kubernetes, Docker or Podman
  • Strong CI/CD knowledge
  • Python scripting experience
Job Responsibility
Job Responsibility
  • Platform Engineering: Building, designing and architecting automated solutions for scalable deployment across private and public cloud infrastructure
  • Low Latency: Supporting and optimising a low latency Linux environment
  • Project & BAU: Working across both project work and BAU activities, automating wherever possible
  • Linux Engineering: Managing and supporting Linux systems
  • Configuration Management: Working with Chef, Puppet or Ansible
  • Containers: Supporting Kubernetes, Docker or Podman environments
  • CI/CD: Building and maintaining CI/CD pipelines
  • Automation: Writing and maintaining Python scripts
What we offer
What we offer
  • Bonus
  • Fulltime
Read More
Arrow Right

Platform Engineer / DevOps Engineer

My client are seeking a knowledgeable Platform Engineer to work on their low lat...
Location
Location
Canada , Montreal
Salary
Salary:
125000.00 - 250000.00 SGD / Year
hunterbond.com Logo
Hunter Bond
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Linux experience
  • Experience with Chef, Puppet or Ansible
  • Kubernetes, Docker, or Podman experience
  • Strong CI/CD skills
  • Python, Ruby or Go
Job Responsibility
Job Responsibility
  • Platform Engineering: Building, designing and architecting automated solutions for scalable deployment in private and public cloud infrastructure
  • DevOps & SRE: Working across both project and BAU capacity, automating wherever possible
  • Linux: Managing and supporting Linux environments
  • Configuration Management: Using Chef, Puppet or Ansible
  • Containers & Orchestration: Working with Kubernetes, Docker or Podman
  • CI/CD: Implementing and maintaining CI/CD pipelines
  • Scripting & Development: Using Python, Ruby or Go for automation and tooling
What we offer
What we offer
  • Bonus
  • Fulltime
Read More
Arrow Right

Devops engineer & cloud engineer

At ADEPTIC Reply, we are at the forefront of innovation in the Cloud-Edge Contin...
Location
Location
Italy , Roma, Milano, Torino, Bari
Salary
Salary:
Not provided
likereply.com Logo
Like Reply
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with cloud providers such as AWS, Azure or GCloud
  • Experience with Cloud platform such as Open Stack
  • Cloud infra related networking knowledge
  • Experience in cloud-based solution offerings of K8 like Amazon EKS
  • Familiar with infrastructure as code, like Terraform and Ansible
  • Configuration and deployment experience
  • Familiarity with setting up and maintaining Kubernetes clusters and Go
  • Experience on supporting services like NATs gateway/load balancers (load-balancer provisioning via k8s)
  • Experience working in Agile or Scrum environments (e.g., Jira, OpenProject)
  • Teamwork and problem-solving attitude, proactivity and strong motivation
Job Responsibility
Job Responsibility
  • Design and provide scalable platforms for the deployment of applications and infrastructure
  • Building and setting up new development tools and infrastructures
  • Understanding the needs of stakeholders and conveying this to developers
  • Working on ways to automate and improve development and release processes
  • Ensuring that systems are safe and secure against cybersecurity threats
  • Identifying technical problems and developing software updates and ‘fixes’
What we offer
What we offer
  • Excellent and real growth opportunities
  • Cross Market and Industry Projects
  • Tailored certification program
  • Opportunity to travel around Europe
Read More
Arrow Right

Platform Engineer / DevOps Engineer – Trading

My client are seeking a knowledgeable Platform Engineer to work on their low lat...
Location
Location
Hong Kong , Hong Kong
Salary
Salary:
700000.00 - 1500000.00 HKD / Year
hunterbond.com Logo
Hunter Bond
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Linux experience
  • Experience with Chef, Puppet or Ansible
  • Experience with Kubernetes, Docker or Podman
  • Strong CI/CD knowledge
  • Python scripting experience
Job Responsibility
Job Responsibility
  • Platform Engineering: Building, designing and architecting automated solutions for scalable deployment across private and public cloud infrastructure
  • Low Latency: Supporting and optimising a low latency Linux environment
  • Project & BAU: Working across both project work and BAU activities, automating wherever possible
  • Linux Engineering: Managing and supporting Linux systems
  • Configuration Management: Working with Chef, Puppet or Ansible
  • Containers: Supporting Kubernetes, Docker or Podman environments
  • CI/CD: Building and maintaining CI/CD pipelines
  • Automation: Writing and maintaining Python scripts
What we offer
What we offer
  • Bonus
  • Fulltime
Read More
Arrow Right

Platform Engineer DevOps Engineer Trading

The role sits between Platform Engineering, DevOps, Linux Systems Administration...
Location
Location
Canada , Montreal
Salary
Salary:
125000.00 - 250000.00 SGD / Year
hunterbond.com Logo
Hunter Bond
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep expertise in managing and tuning the Linux estate
  • Proficiency in Chef, Puppet, or Ansible
  • Hands-on experience with Kubernetes, Docker, or Podman
  • Experience with continuous integration and deployment workflows
  • Proficiency in Python, Ruby, or Go
Job Responsibility
Job Responsibility
  • Build, design, and architect automated solutions for scalable deployment in private and public cloud infrastructure
  • Work in both a project and BAU capacity, automating wherever possible
What we offer
What we offer
  • Bonus
  • Fulltime
Read More
Arrow Right

Platform Engineer / DevOps Engineer – Trading

My client are seeking a knowledgeable Platform Engineer to work on their low lat...
Location
Location
United States , Houston
Salary
Salary:
130000.00 - 200000.00 USD / Year
hunterbond.com Logo
Hunter Bond
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Linux experience
  • Experience with Chef, Puppet or Ansible
  • Experience with Kubernetes, Docker or Podman
  • Strong CI/CD knowledge
  • Python scripting experience
Job Responsibility
Job Responsibility
  • Platform Engineering: Building, designing and architecting automated solutions for scalable deployment across private and public cloud infrastructure
  • Low Latency: Supporting and optimising a low latency Linux environment
  • Project & BAU: Working across both project work and BAU activities, automating wherever possible
  • Linux Engineering: Managing and supporting Linux systems
  • Configuration Management: Working with Chef, Puppet or Ansible
  • Containers: Supporting Kubernetes, Docker or Podman environments
  • CI/CD: Building and maintaining CI/CD pipelines
  • Automation: Writing and maintaining Python scripts
  • Fulltime
Read More
Arrow Right