Site Reliability Engineer – Foundation Team

Global-e

Location:
Ireland, Dublin

Contract Type:
Not provided

Salary:

Not provided

Job Description:

As a DevOps/SRE Engineer on our Foundation team, you will help build, operate, and continuously improve the platform infrastructure that underpins all of Global-e's microservices. You'll work at the intersection of infrastructure engineering and reliability — automating deployments, hardening systems, and ensuring our platform scales reliably to handle millions of transactions worldwide. This is a mid-level role for someone who is hands-on with cloud infrastructure and eager to grow their skills across a modern, high-traffic production environment. You'll collaborate closely with software engineers, platform architects, and product teams to keep our systems fast, resilient, and operationally excellent. This role reports to the Engineering Manager for the Foundation team and is based out of our Dublin, Ireland office.

Job Responsibility:

  • Own Infrastructure as Code: Write, maintain, and improve Terraform modules to provision and manage cloud resources on AWS (EKS, RDS, Kinesis, and more)
  • Support and Improve CI/CD Pipelines: Build and maintain reliable deployment pipelines that enable engineering teams to ship with speed and confidence
  • Ensure Platform Reliability: Define and track SLOs/SLAs, respond to incidents, conduct post-mortems, and drive systemic improvements to reduce toil and prevent recurrence
  • Monitor and Observe: Implement and maintain observability tooling — metrics, logging, alerting, and dashboards — to provide clear visibility into system health
  • Scale Kubernetes Workloads: Help manage and evolve our EKS clusters, ensuring workloads are performant, cost-efficient, and fault-tolerant
  • Embrace AI-Augmented Operations: Leverage and help expand our growing use of AI tooling — from AIOps and anomaly detection to AI-assisted incident response and infrastructure optimisation — as we invest heavily in bringing AI into our day-to-day engineering workflows
  • Collaborate Across Teams: Partner with software engineers to bridge the gap between development and operations — advising on best practices, reviewing infrastructure changes, and supporting teams during rollouts
  • Improve Security Posture: Contribute to hardening cloud environments, managing secrets, and enforcing least-privilege access controls
  • Automate Everything: Identify manual processes and replace them with robust, repeatable automation

Requirements:

  • 2+ Years of Experience in a DevOps, SRE, or platform/infrastructure engineering role
  • Solid Cloud Experience: Hands-on with AWS services (EKS, RDS, Kinesis, S3, IAM, and related services)
  • Terraform Proficiency: Comfortable writing and maintaining production-grade infrastructure as code
  • Kubernetes Familiarity: Experience deploying, debugging, and operating containerised workloads on Kubernetes
  • CI/CD Experience: Familiarity with pipeline tooling (e.g. GitHub Actions, ArgoCD, Jenkins, or similar)
  • Observability Skills: Experience with monitoring and alerting tools (e.g. Datadog, Prometheus/Grafana, or equivalent)
  • Scripting Ability: Comfortable with Python, Bash, or similar for automation and tooling tasks
  • A Reliability Mindset: Understanding of SRE principles — SLOs, error budgets, incident management, and blameless post-mortems

What we offer:

  • Impact at Global Scale
  • Modern Technology Stack
  • Be Part of Our AI Journey
  • Growth & Development

Additional Information:

Job Posted:
May 03, 2026

Work Type:
On-site work

Similar Jobs for Site Reliability Engineer – Foundation Team

Site Reliability Engineering Manager

Hewlett Packard Enterprise (HPE) is looking for a Site Reliability Engineering M...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements
  • 7–10 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles
  • Minimum 2 years of experience managing or leading cloud operations teams
  • Deep understanding of cloud platforms (AWS, GCP, or Azure) and cloud-native architectures
  • Hands-on experience with Kubernetes, containers, infrastructure as code (e.g., Terraform), and configuration management tools
  • Strong foundation in observability (monitoring, logging, tracing), automation using Python, and incident response
  • Familiarity with modern CI/CD automation and tools
  • Excellent communication, stakeholder management, and team-building skills
  • Experience scaling SRE practices in high-growth or large-scale environments
  • Ability to balance long-term reliability initiatives with short-term delivery needs.
Job Responsibility
  • Lead and mentor a team of Site Reliability Engineers, supporting their growth, performance, and well-being
  • Own the reliability strategy for SASE cloud infrastructure systems, including incident management, SLIs/SLOs, and capacity planning
  • Partner with Engineering, Product, and Security teams to design and deliver highly available, scalable, and resilient cloud-native services
  • Guide the team in building automation, improving observability, and improving the operational efficiency of our cloud infrastructure
  • Drive adoption of best practices in monitoring, alerting, on-call operations, and runbook development
  • Build and maintain a strong engineering culture based on ownership, collaboration, and continuous learning
  • Define and track key reliability metrics, and report on team performance and system health to leadership
  • Contribute to hiring, onboarding, and career development for SREs.
What we offer
  • Health & Wellbeing benefits for physical, financial, and emotional wellbeing
  • Personal & Professional Development programs
  • Unconditional inclusion in the workplace.
  • Fulltime

Site Reliability Engineer

At Instabase, our Site Reliability and Platform Engineering team is at the heart...
Location:
India, Bengaluru
Salary:
Not provided
Instabase
Expiration Date
Until further notice
Requirements
  • 5+ years of experience in Site Reliability Engineering, Software Engineering, or Production Engineering
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
  • Demonstrated experience in managing and sustaining SaaS production environments
  • Hands-on experience with major cloud providers such as AWS and Azure
  • Hands-on experience with a high-level programming language such as Python, Java, or Go
  • Proficient in containerization technologies like Docker
  • Expertise in container orchestration platforms, especially Kubernetes
  • Skilled in overseeing and managing software release processes
  • A systematic approach to solving platform and production issues, with strong problem-solving abilities and a passion for automation
Job Responsibility
  • Define and steer the technical direction for your team, collaborating with cross-functional partners to drive impactful results
  • Develop and execute comprehensive short- and long-term roadmaps, balancing business needs, user experience, and robust technical foundations
  • Oversee cloud infrastructure and deployment automation, ensuring efficient and reliable operations
  • Guarantee uptime and reliability for production systems through proactive monitoring and support
  • Manage vulnerability assessments and facilitate prompt remediation to maintain security and integrity
  • Maintain and enhance build and CI/CD infrastructure to support seamless development workflows
  • Implement and optimize tools that enhance developer productivity and streamline processes
  • Drive improvements in release management processes and tooling to ensure smooth and reliable software delivery
  • Fulltime

Security Reliability Engineering Lead

This is a new, bootstrap team focused on applying strong Site Reliability Engine...
Location:
United States, San Francisco
Salary:
293000.00 - 385000.00 USD / Year
OpenAI
Expiration Date
Until further notice
Requirements
  • 10 or more years of experience operating and architecting mission critical infrastructure in high reliability environments
  • Have led the design and maturation of complex on prem, hybrid, or cloud integrated systems, setting durable architectural patterns used by multiple teams
  • Apply Site Reliability Engineering principles at scale, using observability, automation, and incident learnings to materially reduce risk and operational toil
  • Operate comfortably in ambiguity, making sound architectural decisions under pressure while staying close to technical detail
  • Influence cross functional partners across security, identity, network, and platform teams to land reliability improvements without direct authority
Job Responsibility
  • Set direction and establish strong foundations
  • Define and evolve infrastructure patterns for on prem and hybrid environments, including self hosted platforms, vendor supported systems, and lab environments
  • Establish standardized, production grade deployment and operational models that replace bespoke implementations
  • Partner with IT, Security, Identity, and Network teams to ensure infrastructure meets reliability, security, and access requirements by design
  • Design and mature the production architecture for IAM adjacent platforms such as Microsoft Entra using SRE principles
  • Establish common management rules and shared resources within Azure subscriptions to ensure consistent, policy aligned operations
  • Build, operate, and scale reliably
  • Own the full lifecycle of infrastructure systems, including deployment, upgrades, patching, recovery, and ongoing operations
  • Operate and harden shared infrastructure provisioned through Infra Terraform, ensuring repeatability, auditability, and safe change management
  • Design and implement infrastructure as code and configuration management to support shared services, identity adjacent systems, and endpoint platforms using tools like Chef, Ansible and Terraform
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime

Senior Site Reliability Engineer

Join us in shaping the future of infrastructure automation for mission-critical ...
Location:
United Kingdom, London
Salary:
Not provided
Axon
Expiration Date
Until further notice
Requirements
  • 5+ years of experience in software engineering or site reliability
  • Experience building and scaling complex and impactful software products in a team environment
  • Deep skill in driving technical solutions across multiple teams
  • Strong experience with Terraform and CI/CD
  • Strong experience managing infrastructure in the cloud (AWS or Azure)
  • Experience using languages such as Go, Python, C#, Java, or similar
  • Experience designing tooling to simplify the operational management of SaaS/PaaS systems
  • Empathy to support the needs of software engineers
Job Responsibility
  • Build robust, easy-to-use foundational platforms and tools that enable engineering teams to provision infrastructure rapidly, consistently, and securely across multiple cloud providers
  • Write code in Go that is performant, maintainable, clear, and concise
  • Champion and enforce Infrastructure as Code (IaC) best practices and coding standards
  • Employ strong problem-solving skills, with the ability to debug problems in cloud native distributed systems
  • Influence and educate the engineering organization to adopt new and improved architectural patterns
  • Provide robust documentation for use by engineers to promote self-service
What we offer
  • Competitive base salary and RSUs
  • Comprehensive pension plan with matching contribution
  • Private health insurance & cash plans
  • 30 days paid holiday + UK public holidays
  • Enhanced maternity/paternity leave
  • GymPass subscription
  • Life assurance & income protection
  • Career growth support and wellness resources
  • Fulltime

Senior Manager Manufacturing Systems Engineering

Join Amgen’s Mission of Serving Patients. At Amgen, if you feel like you’re part...
Location:
United States, Holly Springs
Salary:
148715.15 - 201202.85 USD / Year
Amgen
Expiration Date
Until further notice
Requirements
  • High school diploma / GED and 12 years of engineering experience OR Associate’s degree and 10 years of engineering experience OR Bachelor’s degree and 8 years of engineering experience OR Master’s degree and 6 years of engineering experience OR Doctorate degree and 2 years of engineering experience
  • Minimum of 2 years' experience directly managing people and/or leadership experience leading teams, projects, programs, or directing the allocation of resources
  • Degree in Electrical Engineering, Computer Science, Chemical Engineering or Biotech Engineering
  • Extensive experience with Emerson DeltaV, Process Control Network design including network segregation, Process Control Systems: Virtual Infrastructure design and implementation, System Integration using OPC, Foundation Fieldbus, and Profibus
  • Hands-on experience with IO-Link technology for smart sensor/actuator integration in automated manufacturing environments
  • Familiarity with Process Analytical Technology (PAT) tools and strategies for real-time process monitoring and control in biopharmaceutical manufacturing
  • Direct knowledge of integrating various OEM automation software and field instrumentation technologies
  • In-depth knowledge of industry standards such as 21 CFR Part 11, ASTM 2500, S88, S95, GAMP, GDP, and GMP
  • Experience working in a regulated environment (e.g. cGMP, OSHA, EPA, etc.), experience interacting with regulatory agencies and inspectors, and familiarity with GMP quality systems/processes such as change control, non-conformances, corrective and preventative actions, and qualifications/validation
  • Experience with Tech Transfer, Process Design, and Commissioning & Qualification
Job Responsibility
  • Oversee the Automation Engineering team, lead technical projects, and drive innovation and support manufacturing’s automation needs
  • Solve complex engineering problems and ensure the company's engineering efforts align with strategic objectives
  • Provide leadership and guidance to the Automation Engineering team supporting 24/7 manufacturing and facility operations
  • Build and lead a team of Automation engineering professionals and serve as the main point of contact for the Automation engineering function
  • Support design reviews while partnering closely with the corporate engineering team
  • Execute and support the commissioning and qualification of process equipment automation in alignment with GMP requirements
  • Collaborate with the site process engineering team to understand key process requirements and develop process control automation solutions primarily utilizing Emerson DeltaV Distributed Control System (DCS)
  • Support commissioning and qualification efforts including Automation Installation Verification/Automation Check Out (IV/ACO)
  • Partner with various organizational units to support tasks including operational readiness, document reviews, deviation investigation and change controls
  • Provide mentoring to develop and accelerate site technical automation capabilities
What we offer
  • Competitive and comprehensive Total Rewards Plans that are aligned with local industry standards
  • Fulltime

Senior Site Reliability Manager

RUCKUS Networks is seeking an experienced Site Reliability Engineering (SRE) Man...
Location:
United States, Sunnyvale
Salary:
135600.00 - 200000.00 USD / Year
CommScope
Expiration Date
Until further notice
Requirements
  • 12+ years in Site Reliability Engineering (SRE), with 6+ years leading SRE, DevOps, or infrastructure teams
  • Proven experience mentoring engineering managers and developing leadership talent
  • Track record of transforming traditional operations or NOC teams into modern SRE organizations
  • Strong project management skills with Agile/Kanban experience and JIRA proficiency
  • Excellent communication skills, including executive-level presentations
  • Deep SRE expertise: incident management, on-call systems, monitoring, and reliability engineering
  • Infrastructure automation experience with Terraform, Kubernetes, Docker, and CI/CD pipelines
  • Cloud platform proficiency (GCP/AWS), including networking, security, and cost optimization
  • Monitoring and observability experience with Prometheus, Grafana, APM tools, and log aggregation
  • 24/7 operations experience with global team coordination and escalation management
Job Responsibility
  • Lead and develop engineering managers and technical operations engineers across India and APAC time zones
  • Build a collaborative team culture that emphasizes knowledge sharing, automation, and operational excellence
  • Mentor engineering managers to strengthen leadership capabilities and technical expertise
  • Set clear performance expectations and provide ongoing coaching for growth
  • Partner cross-functionally with Product, Security, Development, and global operations teams
  • Own 24/7 operational stability for India/APAC, including incident response, escalation, and post-incident reviews
  • Drive comprehensive incident management: alert handling, outage response, and root cause analysis (RCA/CAR)
  • Transform traditional operations into modern SRE practices using SLOs, error budgets, and reliability engineering
  • Implement robust monitoring and alerting with APM tools, dashboards, and automation frameworks
  • Lead technical project delivery with clear timelines, resource planning, and stakeholder communication
What we offer
  • medical, dental, and vision plans
  • life and accidental death insurance
  • a 401(k) plan
  • participation in the Company’s Incentive Plan
  • eleven paid holidays in a full calendar year
  • two weeks of paid vacation (prorated based on start date)
  • other leave options
  • Fulltime

Product Design Engineering Manager

We're seeking an experienced Product Design Engineering Manager (Mechanical) to ...
Location:
United States, Sunnyvale
Salary:
173000.00 - 245000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • BS in Mechanical Engineering or equivalent degree
  • 8+ years of experience shipping consumer products, with an in-depth foundation in mechanical engineering fundamentals
  • 3+ years of team management experience, leading mechanical engineers
  • Experience leading hardware teams to ship consumer devices, and enthusiasm for building and developing high-performing teams
  • Proven experience collaborating with Asian contract manufacturers and suppliers, both remotely and on-site
  • Skilled in tackling ambiguous issues, providing strategic direction, and applying conceptual thinking
  • Proficient in state-of-the-art mechanical design and simulation tools, prototyping tools, manufacturing technologies, and materials
  • Experience driving technical contributions to innovation-driven product design teams
  • Demonstrated skill in communicating technical information to stakeholders, driving understanding and buy-in
Job Responsibility
  • Lead a team of engineers developing complex consumer hardware devices, providing technical guidance and oversight
  • Collaborate with recruiting staff to expand the team through conferences, events, and onboarding new employees
  • Develop comprehensive plans for prioritizing technical and resourcing challenges by understanding technical architectures, long-term and short-term planning, hardware roadmapping, and other key issues
  • Regularly assess employee performance, address under-performance, and recognize and promote team members
  • Foster effective collaboration between teams by partnering with product management, technical program management, manufacturing, operations, and other engineering groups
  • Support engineer career development by assigning projects tailored to their skill levels, strengths, and work styles, and promoting long-term skill growth
  • Travel domestically and internationally (up to 10%) to support builds, meet with suppliers, and work with multi-site hardware teams
  • Work effectively with cross-functional teams, including UX, ID, manufacturing, reliability, and others
What we offer
  • bonus
  • equity
  • benefits

Principal Software Engineering Manager - Business Enablement

Xbox's Creator Onboarding, Release, and Support (CORS) organization is on a miss...
Location:
United States, Redmond
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 4+ years people management experience
  • Experience with CRM platforms (e.g., Dynamics 365), data engineering pipelines, or business process automation
  • Experience leading distributed and geographically diverse engineering teams, including vendor partner management
  • Familiarity with cloud-scale service operations, incident management, and SRE practices
  • Track record of building inclusive teams and fostering psychological safety in engineering organizations
  • Experience with security compliance frameworks and secure-by-design engineering practices
Job Responsibility
  • Lead, coach, and grow a diverse engineering organization spanning software engineering, CRM/Dynamics 365, data engineering, and site reliability across multiple geographies and vendor partnerships, fostering a culture of belonging, continuous learning, and high performance
  • Own the technical vision and engineering roadmap for Xbox's core business systems, including partner onboarding, developer identity, CRM, contract automation, and data platforms, ensuring alignment with CORS's strategic priorities and long-term organizational goals
  • Drive production reliability and security excellence across a broad service portfolio, championing Secure Future Initiative (SFI) compliance, incident readiness, and operational rigor as foundational engineering practices
  • Lead AI-driven modernization of business processes, replacing manual workflows with scalable, intelligent automation that accelerates the creator journey and unlocks new partner capabilities
  • Collaborate across CORS and Xbox Business teams with Product Management, Business Development, and peer engineering teams to align on priorities, resolve cross-team dependencies, and deliver cohesive end-to-end creator experiences
  • Fulltime