Dependability, Reliability & Maintainability Engineer Job at Defence Equipment & Support (Bristol)

Database Reliability Engineer

The Database Reliability Engineer (DBRE) is responsible for managing, building, ...

Location

United States

Salary:

120000.00 - 179000.00 USD / Year

PointClickCare

Expiration Date

Until further notice

Requirements

3+ years of experience working with relational database systems
Strong hands-on experience with MySQL (administration, performance tuning, replication, HA/DR)
1+ years in a DBRE or database-focused engineering role
Experience working in cloud environments (AWS, GCP, or Azure — Azure preferred)
Coding and automation experience (Python, PowerShell, SQL, etc.)
Experience with Infrastructure-as-Code tools such as Ansible and Terraform
Experience working with source control systems such as Git
MySQL experience preferred
PostgreSQL is a plus
Experience working with VLDBs (1+ TB) and managing large database fleets (100+ instances)

Job Responsibility

Managing, building, maintaining, monitoring, and troubleshooting the cloud-based MySQL database infrastructure that our mission-critical SaaS application depends on
Focus heavily on automation and coding to reduce operational toil
Collaborate closely with Engineering and SRE teams to support new product development and ensure reliable database integration across the platform
Work on observability of MySQL database metrics and ensure database performance and reliability objectives are consistently met
Work with the DBA team to identify areas of operational toil and implement automations/processes to manage PCC’s MySQL database systems at scale
Apply a data-driven approach to performance tuning, availability improvements, and operational optimization
Provide database support to Engineering and SRE teams, including review of database migrations, query performance, schema/design improvements, and standardizing MySQL configuration and deployment patterns
Assist the DBA team with performance troubleshooting and root-cause analysis

What we offer

Benefits starting from Day 1!
Retirement Plan Matching
Flexible Paid Time Off
Wellness Support Programs and Resources
Parental & Caregiver Leaves
Fertility & Adoption Support
Continuous Development Support Program
Employee Assistance Program
Allyship and Inclusion Communities
Employee Recognition … and more!

Fulltime

Site Reliability Engineer

The Silver Edge team brings the power of Azure to the edge for our customers, ta...

Location

United States, Redmond

Salary:

100600.00 - 199000.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience
Security Clearance Requirements: Candidates must be able to meet Microsoft, customer, and/or government security screening requirements for this role
The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph
The ability to meet Microsoft, customer, and/or government security screening requirements is required pre-offer and post-hire for this role
This position requires successful verification of the stated security clearance to meet federal government customer requirements
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
This position requires verification of U.S. citizenship due to citizenship-based legal restrictions

Job Responsibility

Support customer deployments and use of Azure Local and Azure Local disconnected operations
Maintain Azure Service reliability including deployment, availability, security, performance and customer satisfaction for sovereign environments
Leverages technical expertise in cloud technologies and specific products, as well as objective insights drawn from analyses of production telemetry data, to suggest changes or add-ons to product features or automation that improve the availability, security, quality, observability, reliability, efficiency, and performance of product components or features supported by their team
Engages with product engineering teams by participating in code/design reviews, regular meetings, on-call rotations, and incident responses throughout product development and operations cycles
Utilizes technical knowledge of systems/platforms and insights drawn from product engineering teams, security best practices, artificial intelligence (AI)/machine learning (ML), and telemetry analyses to suggest potential improvements in code base and designs across components and features of one or more products
Leverages technical expertise and telemetry analysis alongside advanced artificial intelligence (AI) and machine learning (ML) algorithms across a range of components and/or features to identify patterns and opportunities to implement configuration and data changes for one or more platforms, systems, or products in production using code, tooling, and automation
Independently writes code or scripts that automate the performance of scalable operations processes (e.g., monitoring, alerting, deploying products and updates) across components and features of products operating at scale
Shares insights and best practices via documented artifacts that can be applied to improve development and operations of system, platform, or product components and features by participating in code/design reviews, incident drills and debriefs, and regular meetings, as well as interactions with more experienced SREs and members of product engineering teams
Develops alerts and instrumentation across components and features to monitor product capacity, related security risk, and resource demands and analyze telemetry data using existing capacity planning models
Draws insights from analyses of capacity and resource data to optimize component and feature code to manage resources and capacity across limited range of use conditions and system parameters

Fulltime

CloudOps Engineer

As a Cloud Operations Engineer, you will support our internal teams by managing ...

Location

Poland

Salary:

Not provided

RTB House

Expiration Date

Until further notice

Requirements

Solid knowledge of foundational Cloud concepts - including IAM, networking, VPC Service Controls, project and service configuration, and integrations - and the ability to guide users through setup, troubleshooting, and best practices
Experience assisting teams with CI/CD pipelines, including configuration, troubleshooting, and optimization, as well as automating repetitive operational tasks to improve efficiency and reliability
Ability to take ownership of operational tasks, diagnose and resolve issues across cloud services, and deliver clear, practical solutions even when requirements are incomplete or ambiguous
Strong communication and coordination skills to support internal users from IT and non-IT backgrounds, collaborate with engineering, security, and platform teams, and maintain clear documentation and operational guidance
Commitment to consistent, dependable operations through maintaining high standards of security and reliability, identifying areas for improvement, and contributing to streamlined workflows, runbooks, and service enablement
3-5+ years of experience in cloud operations, IT operations, or similar technical support/engineering roles
Solid hands-on experience with Google Cloud Platform (GCP) or other major Cloud providers, including IAM, networking, resource management, and service configuration
Practical experience troubleshooting cloud services, integrations, networking issues, and IAM
Good working knowledge of CI/CD systems (preferably GitHub Actions), with the ability to assist teams in configuring pipelines, troubleshooting issues, and maintaining smooth deployment workflows
Solid understanding of infrastructure-as-code principles and practical experience with Terraform, sufficient to read, modify, and operate Terraform configurations in day-to-day cloud operations

Job Responsibility

Support our internal teams by managing day-to-day operations and providing technical guidance across our Google Cloud Platform (GCP) environments
Assist internal users - from IT and engineering to business teams - by advising on project setups, configuring services, enabling integrations, supporting CI/CD workflows, and ensuring that cloud resources follow best practices in security, networking, and governance
Combine operational support with small project work, helping teams onboard to GCP, troubleshoot issues, and efficiently adopt platform capabilities
Collaborate closely with DevOps, Platform Engineering, Security, and development teams to maintain a reliable, compliant, and well-structured cloud environment

What we offer

Competitive Compensation: We offer an attractive salary package with significant growth opportunities
Cutting-edge Technology: Engage with the latest technologies on large-scale, dynamic projects
Wellbeing: extremely flexible working conditions - you work when it is convenient for you and devote as much time as you can; you can work fully remotely
Purpose-Driven Work: being at the heart of the system, your growing knowledge and competencies will be used in practical applications directly connected to business results

Lead Infrastructure Engineer - Solace

Wells Fargo is seeking a Lead Infrastructure Engineer–Solace to join the Solace ...

Location

United States, ISELIN; CHARLOTTE; IRVING

Salary:

119000.00 - 224000.00 USD / Year

Wells Fargo

Expiration Date

April 07, 2026

Requirements

5+ years of Technology Infrastructure Engineering and Solutions experience, or an equivalent combination of education and experience
5+ years of middleware engineering or administration experience using Solace PubSub+ hardware appliance offerings
5+ years of experience building and managing enterprise-scale infrastructure using automation
5+ years of overall infrastructure management experience
3+ years of advanced coding or scripting skills, with a strong focus on infrastructure automation (e.g., Python or similar)

Job Responsibility

Lead complex initiatives to design, build, and engineer infrastructure solutions supporting mission-critical business applications
Drive an automation-first infrastructure engineering strategy, ensuring infrastructure is designed to be programmatically provisioned, configured, and managed
Apply AI-assisted engineering capabilities to accelerate infrastructure development, including code generation, configuration validation, design analysis, and standardization
Participate in projects to modernize and evolve Solace infrastructure architecture, aligning with target-state engineering principles
Evaluate internal and external technologies, including AI-enabled engineering platforms, to support infrastructure build, automation, and architectural goals
Design infrastructure patterns and tooling that reduce manual effort and improve repeatability, reliability, and engineering quality
Design, build, deploy, and maintain infrastructure solutions through collaboration with engineering teams
Design, code, test, debug, and document infrastructure automation and tooling using Agile engineering practices
Make technical decisions related to architecture, automation frameworks, AI usage, implementation plans, and engineering tradeoffs
Identify engineering risks and dependencies early and define mitigation strategies through design and automation

What we offer

Health benefits
401(k) Plan
Paid time off
Disability benefits
Life insurance, critical illness insurance, and accident insurance
Parental leave
Critical caregiving leave
Discounts and savings
Commuter benefits
Tuition reimbursement

Fulltime

Principal Group Engineering Manager

Microsoft Specialized Clouds combines the power of edge platforms, devices, and ...

Location

India, Bangalore

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

15+ years of professional software engineering experience, including designing, building, and operating distributed, cloud-scale services
5+ years of engineering leadership experience, including managing managers and leading multi-team engineering organizations (M2+)
Deep experience with network device platforms — specifically Arista (EOS, eAPI, CloudVision) and/or Cisco (NX-OS, DCNM/NDFC) — including device programming, configuration management, and automation
Strong background in device programming and network automation — building systems that programmatically configure, validate, and manage network device state at scale
Experience with Azure Resource Provider (RP) engineering — ARM resource modeling, deployment pipelines, control-plane architecture, and resource lifecycle management
Solid understanding of L2/L3 networking fundamentals: spine-leaf architecture, VXLAN, overlay/underlay networking, BGP, and data center network design
Proven ability to set technical direction and architectural strategy for complex platforms spanning multiple components and partner teams
Demonstrated success owning end-to-end delivery of customer-critical services, including design, development, release, and live-site operations
Strong experience driving operational excellence, including reliability, incident management, automation, and cost optimization for production services
Proven track record of leading organizational transformation — such as quality resets, reliability turnarounds, code yellow resolution, or engineering culture change across an engineering org

Job Responsibility

Lead engineering teams through the design, architecture, development, testing, and operations of the Network Fabric platform — the cloud-managed networking layer for Azure Operator Nexus and Azure Local
Drive execution excellence across the full software lifecycle: semester planning, feature delivery, release management, and live-site operations
Own engineering commitments across multiple workstreams including network device programming, Azure Resource Provider development, fabric orchestration, and network configuration management
Ensure services meet Microsoft standards for quality, reliability, security, and operational readiness
Establish and enforce engineering best practices — including test-driven development, automated validation, secure development lifecycle (SDL/SFI), and continuous integration
Continue and accelerate the ongoing engineering transformation: driving quality resets, improving release predictability, and reducing customer-impacting incidents
Own the resolution of code yellow and equivalent quality escalations, driving root cause analysis and systemic remediation across the engineering organization
Champion a culture of engineering fundamentals — ensuring that quality, security, and operational maturity are embedded into every sprint, not treated as afterthoughts
Drive measurable reduction in support costs through automation, improved test coverage, and process optimization
Provide technical leadership across device programming (Arista EOS, Cisco NX-OS), network fabric orchestration, and Azure Resource Provider engineering

Fulltime

Senior/Middle DevOps Engineer

Our client is among the top-5 health insurance companies in the USA, serving ove...

Location

Argentina, Buenos Aires

Salary:

Not provided

ELEKS

Expiration Date

Until further notice

Requirements

4+ years of experience as a DevOps Engineer, Site Reliability Engineer (SRE), or similar role focused on CI/CD and release automation
Hands-on experience designing, configuring, and maintaining CI/CD pipelines with GitHub Actions, Azure DevOps, Jenkins, UCD
Strong focus on pipeline automation, rollback strategies, traceability, testing, and deployment validation
Practical knowledge of access management, release governance, and audit/compliance controls in CI/CD workflows
Familiarity with scripting (Bash, Python) to automate build/test/deployment steps
Understanding of containerization and orchestration (Docker, Kubernetes, OpenShift) for deployment pipelines
Ability to implement security and quality gates in pipelines to prevent faulty code from reaching production
Experience with monitoring and logging tools (Grafana, Prometheus, ELK) to validate and audit deployments
Excellent troubleshooting skills for resolving build and deployment issues quickly
Strong communication and collaboration skills to work with developers, QA, and operations teams

Job Responsibility

Own the administration & integration of all Software Development Lifecycle (SDLC) tooling to support developers and DevOps teams
Develop tool integrations that allow for end-to-end traceability for application development
Create auditable solutions for builds and deployments using tools like GitHub Enterprise, Jenkins, Sonar, Nexus and/or UCD
Support all development tooling platforms used by product teams, and provide technical expertise in tools development to enable automation
Gather tools requirements to create an integrated toolset across portfolios
Formulate a strategy to identify application dependencies across multiple applications
Collaborate with customers to understand the problem statement and translate it into deliverable units of work
Define, implement, and maintain IT processes for integrating and deploying applications
Design, implement, and maintain automated CI/CD pipelines and processes to build and deploy code, content, services, and product environments
Demonstrated understanding of cloud concepts and technologies such as AWS and Azure

What we offer

Close cooperation with a customer
Challenging tasks
Competence development
Ability to influence project technologies
Team of professionals
Dynamic environment with low level of bureaucracy

Director of Engineering

The Director of Engineering is the senior technical execution leader responsible...

Location

United States, Aberdeen Proving Ground

Salary:

Not provided

VES

Expiration Date

Until further notice

Requirements

Bachelor's degree in Engineering, Computer Science, or a related technical field (Master's degree preferred)
15+ years of engineering experience, including significant hands-on technical responsibility for complex systems
7+ years in senior technical leadership roles, such as Principal Engineer, Chief Engineer, Lead Architect, or equivalent
Demonstrated ability to independently solve complex, cross-domain technical problems involving software, systems, infrastructure, and security
Strong understanding of software engineering, systems engineering, integration practices, and modern deployment environments
Experience implementing and enforcing SDLC, configuration management, and quality standards
Experience working in a government contracting or regulated environment, including DoD or Federal programs
Ability to communicate complex technical concepts clearly to engineers, program leadership, executives, and customers
Excellent written and oral communication skills with respect to the above requirements
Ability to obtain and maintain a U.S. Government security clearance

Job Responsibility

Lead and oversee engineering execution across multiple concurrent programs, ensuring solutions meet cost, schedule, performance, quality, and architectural expectations
Serve as the primary technical execution lead across the organization, with authority to make technical decisions necessary to unblock delivery and resolve engineering challenges
Act as the first escalation point for complex technical problems, integration failures, and cross-program dependencies, independently driving solutions for the majority of issues before CTO involvement is required
Apply deep systems-level technical judgment to diagnose, frame, and resolve difficult engineering problems spanning software, systems, infrastructure, deployment, and security
Ensure engineering decisions made under delivery pressure preserve long-term system maintainability, reliability, and scalability
Develop and maintain a deep understanding of VES engineering processes, standards, and technical expectations, and ensure they are applied consistently across programs
Partner with Principal Engineers to review and approve system architectures, technical approaches, and major design decisions
Ensure architectural consistency and technical coherence across programs while allowing appropriate flexibility to meet mission and customer needs
Identify systemic technical issues, recurring failure modes, and architectural debt across the portfolio and drive corrective action
Work closely with Principal Engineers (Mission Command, Land Systems, Emerging Technologies, Cyber Security, Systems Engineering) as domain technical authorities

What we offer

401(k) match
Highly Competitive Salary
Up to 15 Paid Vacation days / year
11 Paid Holidays
Flexible work/life balance culture

Fulltime

Principal Consultant A2 - Infra

Microsoft Industry Solution - Global Center Innovation and Delivery Center (GCID...

Location

India, Hyderabad

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s degree in Computer Science, Engineering, or a related field AND 3+ years of leadership experience in a relevant area of business (higher education preferred)
OR Master’s degree in Computer Science, Information Technology, Engineering, or a related field AND 6+ years of experience in technology solutions, practice development, architecture, consulting, and/or the Cloud Infrastructure domain
Strong customer-facing project experience (minimum of 10 years) involving solution design, project envisioning, planning, development, and deployment of complex solutions
Must have a proven record of delivering technical solutions
2+ years managing multiple projects or portfolios
1+ year(s) experience leading blended, multidisciplinary teams
Preferred Qualifications: 20+ years of overall industry experience
Technical or Professional Certification in Cloud Infrastructure domain
Open to travel domestically and internationally and work with different cultures and customers
Technical certifications based on domain/service line (e.g., Azure, Security, Dynamics)

Job Responsibility

AI-First Delivery Leadership: Embed AI-first principles into delivery workflows, leveraging automation and intelligent orchestration where applicable
Lead end-to-end delivery of complex projects, ensuring solutions are scalable, robust, and aligned with client business outcomes
Drive engineering excellence through reusable components, accelerators, and scalable architecture
Oversee technical execution across multiple projects, ensuring adherence to best practices, quality standards, and compliance requirements
Collaborate with clients and internal stakeholders to define strategies, delivery plans, milestones, and risk mitigation approaches
Act as a technical point of contact for clients, translating business requirements into scalable technical solutions
Ensure delivery models are optimized for modern, AI-native execution, including integration of automation and intelligent processes
Ability to step into at-risk projects, quickly assess issues, and establish a credible path to recovery or exit
Engineering Excellence: Champion high-quality engineering practices across all delivery engagements
Ensure adherence to coding standards, architectural integrity, and performance benchmarks

Fulltime

Dependability, Reliability & Maintainability Engineer

Defence Equipment & Support

Location:
United Kingdom, Bristol

Category:
IT - Administration

Contract Type:
Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Additional Information:

Job Posted:
January 20, 2026
