SRE Sr. Engineer

Mexico, Guadalajara · Job Posted June 29, 2026

Job Description

We are currently seeking an SRE Sr. Engineer to join our team in Guadalajara, Jalisco (MX-JAL), Mexico (MX). The SRE Sr. Engineer will own the cloud infrastructure, build an infrastructure-as-code environment, and monitor overall system and infrastructure health to help the company continue advancing its innovation. The successful candidate will have Cloud and DevOps culture embedded in their work habits and will help drive these practices in the overall transformation of operational support.

Job Responsibility

  • Maintain public cloud infrastructure using at least one cloud platform (Azure or AWS)
  • Build and maintain cloud infrastructure automation (IaC) using Terraform, ARM Templates or similar
  • Build and maintain IT automation using tools like Ansible or Chef, and manage complex container-based applications with tools like Helm for Kubernetes
  • Build, deliver and deploy using modern technologies like Git, GitHub Actions, Jenkins, Octopus, Ansible, Docker, Kubernetes or similar
  • Build and maintain observability and monitoring across different IT platforms using Grafana, Prometheus, Elastic, Datadog or similar
  • Oversee all planned outages, assess RCA and assist with major upgrades to ensure minimum downtime
  • Be part of an L3 team, spread across 3 time zones, that supports troubleshooting of IT platform issues when required (escalated from support levels L1 and L2)

Requirements

  • Bachelor's degree in Computer Science, Engineering or a related field
  • Minimum of 3 years of working experience with at least one public cloud platform
  • Minimum of 5 years of Windows / Linux experience
  • Minimum of 2 years of experience with Terraform or other IaC platforms
  • Strong knowledge of Elastic, Grafana, Prometheus or other observability platforms (Datadog, Dynatrace, etc.)
  • Proven experience running and/or managing large IT platform services across multiple availability regions
  • Experience with container platforms such as Docker or Kubernetes, or similar
  • Strong English communication (written and oral) skills are required.

Nice to have

  • Public Cloud (Azure or AWS) Certifications – Professional level preferred
  • Comfort with both Linux and Windows administration
  • Background with scripting technologies: PowerShell, Bash or Python
  • Knowledge of monitoring and logging systems (e.g. ELK stack, Grafana, Icinga2 or similar).

Similar Jobs for SRE Sr. Engineer

Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

Shape the future of trust in the age of AI. At Oscilar, we're building the most ...
Location: United Kingdom
Salary: Not provided
Company: Oscilar (oscilar.com)
Expiration Date: Until further notice
Requirements
  • Proven track record as a senior SRE or Infrastructure Engineer in high-scale environments
  • Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform)
  • Strong programming ability in Go or Python. We use Go
  • Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture
  • Mastery of container orchestration (Kubernetes) and production debugging
  • Strong sense of ownership, and the judgment to balance velocity with reliability
Job Responsibility
  • Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes)
  • Lead initiatives to improve availability, latency, and performance at scale
  • Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability
  • Define the metrics, alerts, and runbooks that form our observability backbone
  • Run chaos experiments and failure simulations to harden the platform
  • Mentor engineers and set best practices for SRE across the company
Employment type: Full-time

Sr./Staff - Infrastructure/Site Reliability Engineer (SRE)

Shape the future of trust in the age of AI. At Oscilar, we're building the most ...
Location: Poland
Salary: Not provided
Company: Oscilar (oscilar.com)
Expiration Date: Until further notice
Requirements
  • Proven track record as a senior SRE or Infrastructure Engineer in high-scale environments
  • Expert-level skills in AWS and Infrastructure as Code (Pulumi, Terraform)
  • Strong programming ability in Go or Python. We use Go
  • Deep understanding of distributed systems (Kafka, ClickHouse) and microservices architecture
  • Mastery of container orchestration (Kubernetes) and production debugging
  • Strong sense of ownership, and the judgment to balance velocity with reliability
Job Responsibility
  • Architect and operate resilient cloud infrastructure (AWS, Pulumi, Kubernetes)
  • Lead initiatives to improve availability, latency, and performance at scale
  • Design and evolve our CI/CD pipelines to optimize for speed, safety, and repeatability
  • Define the metrics, alerts, and runbooks that form our observability backbone
  • Run chaos experiments and failure simulations to harden the platform
  • Mentor engineers and set best practices for SRE across the company
Employment type: Full-time

Sr. Software Engineer - QA / Test Automation Engineer

Location: India, Gurgaon
Salary: Not provided
Company: Randstad (https://www.randstad.com)
Expiration Date: July 09, 2026
Requirements
  • 8+ years of experience in QA automation, SDET, or software engineering roles focused on test automation for distributed or cloud-based systems
  • Strong understanding of QA methodologies, test design, and systems validation
  • Proficiency in .NET 8/C#, Node.js, Python, or TypeScript for automation scripting
  • Hands-on experience with Selenium, Playwright, Cypress, REST API automation, and integration testing frameworks
  • Experience running tests in AWS environments with strong understanding of CI/CD pipelines using Azure DevOps
  • Familiarity with IaC, containerized test execution, and observability tools
  • Experience testing SQL Server 2022, Snowflake, PostgreSQL data flows
  • Ability to validate ETL pipelines, schema changes, and data quality through automation
  • Expertise in automated testing (unit, integration, contract, E2E, regression)
  • Familiarity with blue/green and canary release testing
Job Responsibility
  • Contribute to the design of scalable, maintainable QA automation frameworks for API, UI, integration, and performance testing
  • Implement automated test scenarios across microservices, APIs, data workflows, and distributed systems
  • Participate in design discussions to ensure testability, document risks, and propose automation strategies aligned with engineering standards
  • Produce clean, reusable, and maintainable automation scripts following best practices
  • Implement unit, integration, contract, and E2E tests integrated with CI/CD pipelines
  • Conduct root-cause analysis for defects and drive preventive quality improvements
  • Perform debugging, reliability analysis, and optimization of automation suites
  • Own test execution pipelines from development through deployment and monitoring
  • Create automated dashboards, alerts, and quality signals to validate release readiness
  • Collaborate in production issue investigations by building automated repros and validation scripts
Employment type: Full-time

Site Reliability Engineer Sr. Staff

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location: Puerto Rico, San Juan
Salary: Not provided
Company: Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Minimum of 10 years of hands-on experience in Infra Ops, Dev Ops, or Site Reliability Engineering (SRE)
  • Proficiency with Linux systems, especially Debian-based distributions
  • Strong experience with cloud platforms such as AWS and GCP
  • Expertise in Infrastructure as Code tools like Terraform, Packer, and Ansible
  • Solid programming skills in Python and/or Golang
  • Deep understanding of containerization (Docker, Container) and orchestration tools (AWS EKS, GCP GKE)
  • Experience with GitOps workflows
  • Proven track record in implementing and maintaining CI/CD pipelines
  • Strong background in security and familiarity with security programs
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK)
Job Responsibility
  • Enhance Infrastructure as Code (IAC) and enforce best practices
  • Optimize cloud infrastructure for scalability, security, and cost-effectiveness
  • Develop internal tools to support and streamline cloud platform operations
  • Improve CI/CD pipelines and deployment workflows using FluxCD and Jenkins
  • Address container image vulnerabilities and standardize remediation processes
  • Build Amazon Machine Images (AMIs) aligned with CIS and STIG benchmarks
  • Strengthen monitoring, alerting, and observability using Prometheus, Grafana, and logging tools
  • Troubleshoot complex production issues to ensure system reliability and customer satisfaction
  • Fine-tune distributed systems such as Apache Kafka and Cassandra
  • Collaborate with development, security, and operations teams to align infrastructure with application needs.
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment type: Full-time

Sr. Manager SRE

We're building a Site Reliability Engineering center in Mexico City, and we're h...
Location: Mexico, Mexico City
Salary: Not provided
Company: Capital One (capitalone.com)
Expiration Date: Until further notice
Requirements
  • Professional English fluency
  • Bachelor's degree
  • At least 8 years of experience in SRE, production operations, or reliability engineering
  • Experience in DevOps Engineering (internship experience does not apply)
  • 8+ years of experience in at least one of the following: Java, Python, Go
  • At least 6 years of experience with Cloud Native technologies (Amazon Web Services, Microsoft Azure, Google Cloud Platform)
  • 5+ years of experience with container orchestration services including Docker or Kubernetes
  • Experience with Shell or Bash scripting
  • At least 5 years of Unix or Linux system administration experience
Job Responsibility
  • Define and maintain a 12-18 month technical vision and roadmap for GPN SRE in Mexico City - decompose destination architecture into deliverable steps, sequence investments, and align execution across teams
  • Drive reliability transformation across settlement, observability, and automation domains - establish SLOs, error budgets, severity frameworks, and operational standards that teams build against
  • Pioneer AI and agentic automation approaches - design and build AI-driven solutions (using Claude Code, Copilot CLI, and LLM frameworks) for alert classification, runbook generation, automated remediation, and incident analysis; set patterns that other engineers extend
  • Own the technical strategy for domain-specific knowledge ramp-up: identify which domain expertise requires deep engineering investment vs. documentation, and architect systems that reduce reliance on tribal knowledge
  • Lead cross-team technical initiatives - drive observability platform convergence, standardize on COF tooling, and eliminate arbitrary uniqueness across towers
  • Serve as the senior escalation point for complex production incidents - diagnose cascading failures across distributed systems (storage, network, application), drive resolution, and ensure durable fixes land
  • Architect automation for high-risk operational processes - certificate rotation, compliance artifact generation, settlement cycle validation - ensuring security and reliability are built in from design
  • Mentor and elevate engineers across teams - conduct design reviews, establish engineering standards, coach on debugging and system thinking, and create an environment where Principal Associates and Managers grow into domain experts
  • Introduce and advocate for engineering practices that raise the bar - AI engineering, innersourcing, reuse over rebuild, open source contribution, blameless postmortems, and chaos engineering
Employment type: Full-time

Sr. Software Engineer, Cloud Storage

As a Software Engineer, you will play a key role in delivering an enterprise‑cla...
Location: United States, Morrisville
Salary: 170000.00 - 220000.00 USD / Year
Company: NetApp (netapp.com)
Expiration Date: Until further notice
Requirements
  • 8+ years of software engineering/development experience
  • Strong experience in software design, development, and system-level architecture
  • Proficiency in programming languages such as Go, Python, C++, or C
  • Deep knowledge of Kubernetes
  • Hands-on experience building or deploying microservices using Docker and Kubernetes
  • Practical experience with public cloud providers such as GCP, Azure, or AWS
  • Solid understanding of data structures, algorithms, multithreading, distributed systems, and modern programming practices
  • Strong collaboration and communication skills (verbal and written)
  • Demonstrated ability to lead features or small teams independently
  • Quick learner with the ability to adapt to new technologies and complex systems
Job Responsibility
Job Responsibility
  • Design, develop, and test new product features involving complex and interdependent distributed systems
  • Deliver high‑quality, maintainable code across cloud‑native storage components
  • Independently drive feature development from design to completion
  • Participate in technical discussions within the team and across partner groups
  • Collaborate with cloud hyperscalers and internal stakeholders on solutions built for first party cloud native platforms
  • Work closely with SRE, Product Management, and cross-functional engineering teams to align on design, requirements, and execution
  • Contribute to design reviews, architectural discussions, and problem investigations
  • Mentor junior engineers in best practices and technical execution
  • Ensure solutions meet scalability, reliability, and performance goals for enterprise-class cloud storage systems
What we offer
  • Health Insurance
  • Life Insurance
  • Retirement or Pension Plans
  • Paid Time Off
  • Various leave options
  • Performance-Based Incentives
  • Employee stock purchase plan
  • Restricted stock units (RSUs)
  • Volunteer time off: 40 hours of paid volunteer time each year
  • Well-being: Employee Assistance Program, fitness, and mental health resources
Employment type: Full-time

Sr Staff Engineer Software (Prisma AIRS - Runtime BackEnd)

As a Senior Staff Software Engineer on the Prisma AIRS Runtime Security team, yo...
Location: United States, Santa Clara
Salary: 126000.00 - 204500.00 USD / Year
Company: Palo Alto Networks (paloaltonetworks.com)
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science or a related field with 5+ years of experience, or a Master's degree with 3+ years of experience, or a PhD
  • Expertise in building scalable distributed systems with excellent Python or Golang programming skills
  • Proven experience with modern backend frameworks, relational databases (SQL), and cloud platforms, specifically GCP (Google Cloud Platform)
  • Demonstrated ability to work collaboratively with senior and junior engineers in a dynamic, fast-paced environment
  • Experience with container platforms like Kubernetes, CI/CD pipelines (GitLab pipeline, ArgoCD), and observability and monitoring solutions like Grafana and Prometheus
Job Responsibility
  • Lead cross-functionally with Product Management, SRE, Software, and Quality Engineering teams to deliver new security as a service offerings in a timely fashion
  • Analyze and solve complex problems by evaluating requirements and applying advanced engineering techniques to achieve high-quality results
  • Proactively identify problems and opportunities, proposing and developing simple, attainable solutions to enhance the team's development process and product quality
  • Evangelize and implement engineering best practices within the team, including test-driven development and spec-driven development
  • Lead the architectural design and implementation of new features, ensuring the scalability, performance, and maintainability of the backend codebase
  • Mentor junior engineers, fostering a culture of technical excellence and continuous learning within the team
What we offer
  • Restricted stock units
  • Bonus
Employment type: Full-time