Lead Site Reliability Engineer Job at Trimble Inc. (Chennai)

Job Description

Trimble is looking for a Site Reliability Engineering Lead to join Business Systems. Our team is building the platform fuelling Trimble's digital transformation. We take a cloud-first approach to deliver customer-centric experiences & platform web services that are used by Trimble product teams and Trimble partners. As a Solutions Engineer, you'll be a vital part of Trimble's Engagement Team for Digital Transformation. This team enables and aids Trimble's product teams and partners with adopting and integrating Trimble cloud services with a customer-centric ideology always in mind. You will be an expert of Business Systems services, building, proving out, and communicating the value of the platform that is enabling Trimble's Digital Transformation.

Job Responsibility

Become well-versed in the opportunities and challenges of the business and Trimble's customers
Become an expert in Business Systems services, especially the interfaces—APIs, protocols (e.g. OAuth), and user interfaces
Establish, then utilize tight working relationships with stakeholders across the company, especially Trimble's engineering community
Prototype and create proofs of concept as required
Scope and deploy new integrations
Investigate, diagnose, and solve customer integration issues
Effectively communicate technical issues with stakeholders in non-technical language
Contribute to utilities and SDKs to help integration and migration efforts

Requirements

Bachelor's or Master's degree in Computer Engineering, Computer Science, or a related field
7+ years in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles with at least 2+ years in a leadership or mentoring capacity
Deep AWS expertise (EC2, S3, RDS, IAM, VPC, Lambda, CloudFormation/Terraform, etc.)
Strong knowledge of Infrastructure-as-Code (IaC) using Terraform, AWS CDK, or CloudFormation
Proven experience with CI/CD tools (Jenkins, GitHub Actions, GitLab CI, or similar)
Proficiency in containerization and orchestration (Docker, Kubernetes, ECS, or EKS)
Expertise in monitoring and observability tools (Datadog, New Relic, Prometheus, Grafana, ELK, CloudWatch, etc.)
Strong scripting or programming background (Python, Bash, or Go)
Sound understanding of networking, security, and identity/access management in the cloud
Experience designing high-availability and disaster recovery strategies for critical workloads
Excellent communication, problem-solving, and leadership skills with the ability to influence across teams

Nice to have

AWS or other Cloud Certification (Solutions Architect, DevOps Engineer, etc.)
Experience with AIOps, Serverless Architectures, and event-driven systems
Familiarity with FinOps practices and cost optimization frameworks
Experience with SaaS monitoring tools (Datadog, New Relic, Sumo Logic, PagerDuty)
Exposure to Atlassian tools (Jira, Confluence, Bitbucket)
Experience with SQL/NoSQL databases
Proven track record of leading cross-functional reliability initiatives or platform-wide automation projects

Trimble Inc. - All Job Offers

Select Country

Lead Site Reliability Engineer

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?