Elevate Global Operations as our Next Cloud Site Reliability Engineer (OpenTelemetry)! Are you ready to lead an OTel-first strategy and redefine reliability for a global industrial technology leader? Trimble is looking for a visionary Cloud Site Reliability Engineer to manage our massive-scale observability platform, ensuring our digital and physical solutions remain performant and resilient. This is your chance to use cutting-edge automation and OpenTelemetry to make a tangible impact on the world's most critical industries.
Job Responsibilities
Lead a global "OTel First" strategy, implementing OpenTelemetry at scale across a diverse technological landscape
Spearhead the development of automation scripts and Infrastructure as Code using Terraform to ensure seamless, reproducible platform delivery
Optimize platform performance and cost-efficiency, ensuring our observability tools scale economically as our data grows
Collaborate with engineering teams to embed reliability and security standards into new features from the ground up
Drive root cause analysis and problem management to proactively prevent incidents and improve the customer experience
Requirements
Hands-on experience with the OpenTelemetry Collector, APIs, and SDKs
Extensive experience with observability tools like New Relic, Datadog, or Splunk
Strong proficiency in Infrastructure as Code (Terraform, Ansible) and cloud platforms (AWS, GCP, or Azure)
Deep understanding of containerization and orchestration using Docker and Kubernetes
Advanced coding skills in Python, Go, or Java for building robust automation and monitoring tools
Experience leveraging AI coding assistants like GitHub Copilot to accelerate development