Platform Engineer Job at Randstad (Vancouver)

Job Description

We are seeking a highly progressive Platform Engineer specializing in AI infrastructure and agentic execution environments to join our core cloud enablement team in Vancouver. In this role, you will bridge the gap between traditional Site Reliability Engineering (SRE) and cutting-edge Agentic AI operations. You will design, build, and operate secure multi-cloud landing zones, developer "golden paths," and reusable automated frameworks that empower application teams to safely deploy AI agents at scale.

Job Responsibility

Build integration patterns, API mediation layers, and approval workflows supporting autonomous AI agent tool execution and runtime function calling
Integrate advanced distributed telemetry for agent runs (execution traces, evaluation metrics, latency logs, and token cost analytics)
Establish runtime safety controls for AI applications, embedding automated rollback scripts, cost control ceilings, and master kill-switches
Build and scale highly secure, automated multi-cloud landing zones (AWS and Azure) utilizing reusable Terraform modules
Construct and maintain robust GitLab CI/CD pipelines, package registries, and automated infrastructure release strategies
Implement strict automated infrastructure guardrails using Open Policy Agent (OPA), Conftest, or Azure Policies to guarantee security without breaking developer velocity
Embed least-privileged access, zero-trust network segmentation, private endpoints, KMS encryption keys, and advanced secrets management
Champion Site Reliability Engineering standards by managing Service Level Objectives (SLOs), calculating error budgets, configuring autoscaling matrices, and leading chaos engineering simulations
Apply cloud financial management protocols (structured resource tagging, budget alarms, anomaly detection, and cluster right-sizing)
Author clear, accessible developer guides and self-service templates that streamline the adoption of core AI platform features
Form part of a formal production on-call rotation, managing real-time incident resolution and driving exhaustive post-mortem evaluations

Requirements

3-5 years of dedicated cloud platform engineering or SRE experience working with high-volume distributed systems natively in AWS and Azure
Elite proficiency with Terraform, with an emphasis on creating modular, reusable code structures and multi-environment pipelines
Coding proficiency in Python or Go, with a solid history of integrating with complex REST/JSON APIs
Strong operational working knowledge of GitLab CI/CD, Docker containerization, and cloud orchestration layers
Proven, hands-on exposure to AI/LLM development concepts (advanced prompting, tool/skill integration, and Retrieval-Augmented Generation [RAG])
Extensive experience leveraging AI and Agentic Coding tools to accelerate software delivery and maintain platform scripts

Nice to have

Direct experience building or operating internal Agent Frameworks (tool catalogs, runtime orchestration layers, prompt management)
Hands-on tracing setup using monitoring tools such as Datadog or Splunk
Formal background in public sector or retail e-commerce compliance (PII protection, data masking rules)
Post-secondary degree in Computer Science, Software Engineering, or an equivalent technical field

What we offer

Pioneering Technical Landscape
Elite Multi-Cloud Exposure
High Extensibility Indicators
Premier Workspace

Randstad - All Job Offers

Select Country

Platform Engineer

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?