CrawlJobs Logo

Site Reliability Engineer III

allianceautomotive.co.uk Logo

Alliance Automotive UK LV Ltd

Location Icon

Location:
United States , Birmingham

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Under limited supervision, the Site Reliability Engineer III is responsible for improving system reliability and resilience. This role focuses on building automation to reduce manual effort and prevent service-impacting incidents. The SRE combines software and systems engineering to build and support large-scale, distributed, fault-tolerant systems. This role ensures that critical platforms are available, reliable, and able to support a fast rate of improvement. This role relies on monitoring platforms and is continually taking a holistic view of system health and performance. The SRE will enhance and support cloud-based transformations and is focused on pushing capabilities forward, staying ahead of customer needs, and innovating for continuous improvement. The SRE provides operational support and engineering for multiple large-scale distributed software applications.

Job Responsibility:

  • Gathers and analyzes metrics from monitoring platforms to assist in performance tuning and fault tolerance
  • Partners with development teams to improve services through testing and release procedures
  • Participates in system design, platform management and capacity planning
  • Balances feature development speed and reliability with service-level objectives
  • Works closely with the incident response team and restoring service to normal operation
  • Understands debugging and applying troubleshooting skills
  • Investigates, blocks and rate-limits unwanted traffic
  • Utilizes monitoring systems and dashboards for proactive changes and alerting
  • Establishes continuous process improvement cycles where the process, performance, and supporting technologies are reviewed and enhanced where applicable
  • Performs other duties as assigned

Requirements:

  • Typically requires a bachelor's degree and five (5) or more years of related experience or an equivalent combination
  • Understanding of Kubernetes, containers, clusters, and elastic scalability
  • Expertise in SRE principles
  • Mindset of continually finding ways to drive scalability, stability, and performance
  • Cloud Services experience with Google Cloud Platform (GCP)
  • Experience with API, service-based or microservice-based architecture
  • Proficiency in infrastructure, network, database, operating systems, or security troubleshooting and remediation
  • Architecture-level knowledge of Windows and Linux and Infrastructure systems
  • Experience with production deployment, monitoring, and operational support for enterprise-class applications (Dynatrace a plus)
  • Experience working with Continuous Integration/ Continuous Deployment tools
  • Experience in performance diagnostics, capacity planning, performance architecture design, performance tuning, and performance monitoring
  • A strong mix of software engineering and operational support skills
  • Knowledge of web technologies – HTTP, proxy, java, etc.
  • Experience with Azure DevOps (ADO), Dynatrace, Prometheus, Terraform and Grafana
  • You must be eligible to work in the US without Visa Sponsorship
What we offer:

options for healthcare coverage, 401(k), tuition reimbursement, vacation, sick, and holiday pay

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Site Reliability Engineer III

Project Engineer III

We are seeking an experienced Civil Engineer specializing in water treatment pro...
Location
Location
United States , Columbus
Salary
Salary:
Not provided
wesslerengineering.com Logo
Wessler Engineering
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A minimum of 4 years of experience in water treatment engineering and design
  • Bachelor of Science Degree in Civil Engineering from an ABET accredited institution
  • Professional Engineer (PE) license preferred or the ability to obtain within 12 months
  • Team-oriented with good communication and organizational skills
  • Strong analytical and problem-solving skills with a focus on technical innovation
  • Ability to successfully complete tasks independently with guidance from managers
  • Proficiency in industry-standard computer software for engineering analysis and design (e.g., AutoCAD, Water CAD, etc.)
Job Responsibility
Job Responsibility
  • Develop, evaluate, and design water treatment systems, including conceptual planning, detailed engineering, and process optimization
  • Conduct hydraulic and process modeling to ensure the efficiency and reliability of treatment systems
  • Collaborate with multidisciplinary teams to integrate treatment systems with broader infrastructure projects
  • Provide technical expertise and guidance during the construction and commissioning of water treatment facilities
  • Analyze system performance, troubleshoot issues, and propose solutions for operational improvements
  • Stay updated on emerging technologies and advancements in water treatment processes
  • Prepare technical reports, specifications, and proposals
  • Participate in site visits to evaluate facilities and processes, meet with operations staff, and collect system information useful for evaluation and design
  • Ensure compliance with local, state, and federal regulations
Read More
Arrow Right

Site Reliability Engineer III

The GCF5 Track Lead is the senior technical leader for one capability pillar—Ent...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS+8 / MS+6 / PhD in CS/Engineering/Data disciplines
  • Demonstrated production delivery experience in data/ML/HPC at scale
  • Demonstrated literacy in a relevant scientific domain (e.g., biology, chemistry, therapeutic discovery)
  • Depth in the assigned pillar (EDF/CDM, Agentic‑ML, or HPC)
  • Kubernetes and continuous integration/continuous delivery (CI/CD) at scale
  • observability, performance tuning, and security-by-design
  • Evidence of standard‑setting and cross‑team influence
  • mentoring experience
Job Responsibility
Job Responsibility
  • Own the pillar roadmap and backlog
  • plan, prioritize, and deliver multi‑team initiatives to agreed Objectives and Key Results (OKRs)
  • Define, document, and govern standards/patterns (APIs, schemas, contracts, security, observability, testing)
  • Lead designs and architecture reviews
  • ensure solutions align with enterprise guardrails and regulatory posture
  • Mentor and develop GCF4 engineers
  • set expectations for code quality, reviews, testing, and incident response
  • Establish SLAs/SLOs and error budgets
  • drive reliability, performance, and cost efficiency for the pillar
  • Partner with scientists and platform teams to translate lab/scientific workflows into scalable data/ML/HPC solutions
Read More
Arrow Right

Site Reliability Engineer III

Under limited supervision, the Site Reliability Engineer III is responsible for ...
Location
Location
United States , Birmingham, Alabama
Salary
Salary:
Not provided
genpt.com Logo
Genuine Parts Company
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Typically requires a bachelor's degree and five (5) or more years of related experience or an equivalent combination
  • Understanding of Kubernetes, containers, clusters, and elastic scalability
  • Expertise in SRE principles
  • Mindset of continually finding ways to drive scalability, stability, and performance
  • Cloud Services experience with Google Cloud Platform (GCP)
  • Experience with API, service-based or microservice-based architecture
  • Proficiency in infrastructure, network, database, operating systems, or security troubleshooting and remediation
  • Architecture-level knowledge of Windows and Linux and Infrastructure systems
  • Experience with production deployment, monitoring, and operational support for enterprise-class applications (Dynatrace a plus)
  • Experience working with Continuous Integration/ Continuous Deployment tools
Job Responsibility
Job Responsibility
  • Gathers and analyzes metrics from monitoring platforms to assist in performance tuning and fault tolerance
  • Partners with development teams to improve services through testing and release procedures
  • Participates in system design, platform management and capacity planning
  • Balances feature development speed and reliability with service-level objectives
  • Works closely with the incident response team and restoring service to normal operation
  • Understands debugging and applying troubleshooting skills
  • Investigates, blocks and rate-limits unwanted traffic
  • Utilizes monitoring systems and dashboards for proactive changes and alerting
  • Establishes continuous process improvement cycles where the process, performance, and supporting technologies are reviewed and enhanced where applicable
  • Performs other duties as assigned.
What we offer
What we offer
  • Options for healthcare coverage, 401(k), tuition reimbursement, vacation, sick, and holiday pay.
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer III

Zuora’s Cloud Engineering teams are responsible for Cloud infrastructures, monit...
Location
Location
India , Chennai
Salary
Salary:
Not provided
zuora.com Logo
Zuora
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-8 years of relevant experience on SRE/DevOps
  • Proven hands-on working experience with core AWS services (e.g., EC2, VPC, S3, RDS, IAM, CloudWatch, EKS/ECS)
  • Deep expertise in infrastructure-as-code principles using Terraform for provisioning and state management
  • Expert-level knowledge and practical experience with configuration management tools such as Puppet and/or Ansible
  • Strong experience setting up, maintaining, and enhancing Continuous Integration/Continuous Deployment pipelines using Jenkins
  • Proficiency in scripting languages, particularly Python and/or Shell scripting, for developing automation tools and performing system administration tasks
  • Advanced knowledge of Linux operating systems, including performance tuning, troubleshooting, security, and networking fundamentals
  • Working knowledge and operational experience with distributed messaging queues, specifically Kafka
Job Responsibility
Job Responsibility
  • Maintain and improve the reliability, scalability, and performance of our production systems, targeting a high-availability environment
  • Design, implement, and maintain automation solutions for infrastructure provisioning, deployment, configuration management, and monitoring using Terraform and Jenkins
  • Administer, manage, and optimize our cloud infrastructure primarily hosted on AWS, focusing on cost efficiency and secure operations
  • Develop and maintain infrastructure-as-code using Puppet and/or Ansible to ensure consistent and reproducible environments
  • Participate in on-call rotation, troubleshoot and resolve critical production incidents, and conduct comprehensive post-mortems to prevent recurrence
  • Apply strong Linux administration skills to manage, patch, and secure operating systems and underlying infrastructure
  • Manage and optimize distributed messaging systems, specifically Kafka, ensuring high throughput and data integrity
What we offer
What we offer
  • Competitive compensation, variable bonus and performance reward opportunities, and retirement programs
  • Medical Insurance
  • Generous, flexible time off
  • Paid holidays, “wellness” days and company wide end of year break
  • Learning & Development stipend
  • Opportunities to volunteer and give back, including charitable donation match
  • Free resources and support for your mental wellbeing
Read More
Arrow Right
New

Software Engineer Level III – Forward Deployed

We are seeking a skilled Software Engineer who will design, build, and maintain ...
Location
Location
China , Shanghai; Dalian; Wuhan
Salary
Salary:
Not provided
pfizer.de Logo
Pfizer
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or related field with 5-8 years of relevant experience
  • AI-Augmented Development: integrate AI tools strategically into development workflow, review AI-generated code with rigor
  • Business Immersion: apply deep domain knowledge to technical solutions, bridge business and technology conversations
  • Data Integration: integrate multiple data sources independently, clean messy datasets
  • Full-Stack Development: deliver complete features end-to-end independently—frontend, backend, database, and infrastructure
  • Multi-Audience Communication: present complex topics clearly to any audience, translate between technical and business language
  • Problem Discovery: navigate ambiguous problem spaces independently, discover requirements through observation
  • Rapid Prototyping & Validation: deliver working solutions rapidly (days not weeks)
  • Site Reliability Engineering: design observability strategies for services, lead incident response
  • Stakeholder Management: manage multiple stakeholders with different interests
Job Responsibility
Job Responsibility
  • Delivery: Own feature delivery from design through deployment, making sound technical trade-offs to ship value on time
  • AI: Integrate AI capabilities into solutions, critically evaluate AI-generated code
  • People: Mentor junior engineers on technical topics, contribute to hiring through interviews
  • Business: Translate business needs into technical solutions, manage stakeholder expectations
  • Process: Contribute to process improvement, maintain team workflows
  • Documentation: Create clear documentation for features you build, contribute to team knowledge bases
  • Fulltime
Read More
Arrow Right

Site Reliability Operations III

The Command & Control Center is the nerve center for Walmart Global Technology. ...
Location
Location
United States of America , Bentonville
Salary
Salary:
80000.00 - 155000.00 USD / Year
walmart.com Logo
Walmart
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong communication and interpersonal skills
  • Experience with Jira, Looper, and Kubernetes
  • Familiarity with Grafana and ability to write queries (PromQL)
  • GitHub experience
  • Database knowledge is preferable but not required
  • Ability to work independently and make decisions with guidance
  • Comprehension of changes to methodologies and resources, and ability to articulate the same
  • Experience with cloud applications and ability to pull logs
  • Strong analytical and problem-solving skills
  • Ability to work collaboratively with cross-functional teams
Job Responsibility
Job Responsibility
  • Monitor and alert on software or system performance, determining thresholds for monitoring metrics and triggers alerts based on thresholds
  • Supervise specific procedures to proactively check the health of applications and infrastructure, including a variety of operating systems, hardware, and software
  • Investigate and diagnose incidents to restore a failed IT service as quickly as possible and within specified SLAs
  • Document troubleshooting steps and service restoration details for knowledge management
  • Liaison between Tech and external support to resolve escalated incidents and ensure timely closure
  • Record and classify received incidents and undertake immediate corrective action for moderate complexity queries under moderate supervision
  • Research and recommend alternative actions for incident resolution
  • Contribute to command-and-control related activities focused on restoration of complex outages
  • Conduct complex maintenance procedures for applications independently
  • Monitor and evaluate the performance of the application by tracking and analyzing appropriate metrics
What we offer
What we offer
  • Multiple health plan options, including vision & dental plans for you & dependents
  • Financial benefits including 401(k), stock purchase plans, life insurance and more
  • Associate discounts in-store and online
  • Education assistance for Associate and dependents
  • Parental Leave
  • Pay during military service
  • Paid Time off - to include vacation, sick, parental
  • Short-term and long-term disability for when you can't work because of injury, illness, or childbirth
  • incentive awards for your performance
  • maternity and parental leave, PTO, health benefits
  • Fulltime
Read More
Arrow Right

Engineer III – ASIC Validation & Automation

On behalf of our client, a global technology leader focused on developing innova...
Location
Location
United States , Milpitas
Salary
Salary:
50.00 - 55.00 USD / Hour
tpsmithgroup.com Logo
Tucker Parker Smith Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • B.S. or M.S. in Electrical Engineering, Electronics Engineering, Computer Engineering, or equivalent practical experience
  • 5 years of hands-on experience in ASIC development, validation, or related engineering disciplines
  • Strong proficiency in automation and scripting, including Python, API integration, and workflow orchestration
  • Demonstrated expertise with validation frameworks, testing methodologies, and agent development
  • Strong communication skills with the ability to explain complex technical concepts clearly
Job Responsibility
Job Responsibility
  • Translate architectural and design specifications into production-ready agents and implementations
  • Lead ASIC testing and validation efforts across multiple projects and engineering sites
  • Execute and manage validation workflows, automation, and framework development
  • Partner closely with platform, design, and validation teams to offload implementation work and accelerate delivery timelines
  • Develop and maintain automation solutions using Python, APIs, and workflow orchestration tools
  • Identify, debug, and resolve complex system-level issues to ensure high reliability and performance
  • Contribute to best practices, documentation, and continuous improvement initiatives
Read More
Arrow Right
New

Structural Engineer III

Bowman has an opportunity for a Structural Engineer III to join our team in Chic...
Location
Location
United States , Chicago
Salary
Salary:
Not provided
bowman.com Logo
Bowman
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in civil engineering required
  • Completed coursework in structural engineering required
  • Master’s degree in structural or civil engineering preferred
  • Five or more (5+) years of professional engineering experience with an emphasis on structural engineering or related field
  • EIT Registration required, or alternatively, six or more (6+) years of successful structural engineering experience required
  • Professional Engineer (PE) Registration or ability to receive within six (6) months required
  • Structural Engineer (SE) Registration or ability to sit for 16-hour NCEES Structural exam within six (6) months preferred
  • Approved Team Leader by IDOT for overseeing and conducting NBIS Bridge Inspections preferred
  • Experience with the preparation of construction plans and/or studies for structural projects required
  • Experience directing tasks and performing QA/QC required
Job Responsibility
Job Responsibility
  • Execute complex technical structural engineering techniques, procedures and criteria for transportation and infrastructure projects
  • Receive broad guidance relating to overall key objectives, critical issues, new concepts, and policy matters and broad parameters for execution
  • Independently apply extensive and diversified knowledge of engineering principles and practices in broad areas of assignments and related fields
  • Review work produced by staff for quality assurance
  • May serve as a lead/resource among team of colleagues in equivalent roles to share technical proficiency, guidance, mentorship and delegation of assignments
  • May occasionally provide feedback to managers
  • Work closely with more senior members to learn about and assist with structural engineering and planning work
  • Assist with the marketing of the firm’s capabilities to establish new clients and enhance relationships
  • Coordinate with other disciplines and internal services/groups/offices as necessary
  • Work side by side with the project team to develop detailed structural designs
What we offer
What we offer
  • Medical, dental, vision, life, and disability insurance
  • 401(k) retirement savings plan with company match
  • Paid time off, sick leave, and paid holidays
  • Tuition reimbursement and professional development support
  • Discretionary bonuses and other performance-based incentives
  • Employee Assistance Program (EAP), wellness initiatives, and employee discounts
  • Fulltime
Read More
Arrow Right