Dependability, Reliability & Maintainability Engineer

Defence Equipment & Support

Location:
United Kingdom, Bristol

Contract Type:
Not provided

Salary:

46400.00 GBP / Year

Job Description:

Are you passionate about the defence of the United Kingdom, with a good moral compass and a drive to protect people? Do you want to work in an environment where you can develop your skills and experience whilst working to support the UK’s Armed Forces? Do you have experience working in a Reliability and Maintainability environment, and are you excited by the opportunity to work on military platforms and systems? If so, opportunities have arisen for Reliability Engineers to join us at Defence Equipment & Support (DE&S). In joining one of our integrated teams of highly skilled professionals, you will have an exciting opportunity to support the UK Armed Forces, working with military colleagues and industry on some of the most important and challenging work the UK Government is undertaking. In this rare opportunity, you will join the Task based Engineering Resource (TBER) team in Support Chain Services, delivering Reliability & Maintainability (R&M) engineering services to DE&S customers across the Army, Navy and Royal Air Force, to ensure that equipment and contracted services deliver value for money and meet R&M contractual requirements.

Job Responsibility:

  • Own the generation of Initial R&M Cases to support technical requirements, defining technical deliverables/artifacts and updating other documentation for multiple systems
  • Analyse dependability related data to identify areas for improvement or to demonstrate dependability performance levels
  • Identify and analyse technical hazards and project impacts, then contribute to the identification and evaluation of risk reduction measures
  • Manage the review of the dependability aspects of a design ensuring that the R&M case demonstrates that the item under consideration will meet the user needs
  • Collaborate and manage trade-offs with other sub-system teams and specialists, over technical requirements, and design compromises
  • Review dependability performance and conduct root cause analysis to identify design weaknesses
  • Manage and participate in assurance, audit, and review activities. Support the development of specialist elements of assurance tools such as GEAR and SSDT

Requirements:

  • Experience of providing R&M engineering services at a senior level and leading the delivery of successful R&M engineering outcomes in multidisciplinary projects
  • Knowledge and experience of project R&M programmes
  • As a minimum, be professionally registered as one of the following: Incorporated Engineer (IEng), Registered Scientist (RSci), Chartered Mathematician (CMath), or Advanced Data Science Professional; or hold an RQF Level 6 qualification in a related Mathematics subject and be an Associate Member (AM) of the Institute of Mathematics and its Applications (IMA)
What we offer:
  • 25 days’ annual leave, rising by 1 day a year up to 30 days, plus 8 bank holidays and a day off for the King’s birthday
  • Flexible and hybrid working options
  • Market-leading average employer pension contribution of 28.97%
  • Annual performance-based bonus and recognition awards
  • Access to specialist training and funded qualifications
  • Support for progression
  • Huge range of discounts
  • Volunteering days
  • Enhanced parental leave schemes

Additional Information:

Job Posted:
January 20, 2026

Work Type:
Hybrid work

Similar Jobs for Dependability, Reliability & Maintainability Engineer

Database Reliability Engineer

The Database Reliability Engineer (DBRE) is responsible for managing, building, ...
Location
United States
Salary
120000.00 - 179000.00 USD / Year
PointClickCare
Expiration Date
Until further notice
Requirements
  • 3+ years of experience working with relational database systems
  • Strong hands-on experience with MySQL (administration, performance tuning, replication, HA/DR)
  • 1+ years in a DBRE or database-focused engineering role
  • Experience working in cloud environments (AWS, GCP, or Azure — Azure preferred)
  • Coding and automation experience (Python, PowerShell, SQL, etc.)
  • Experience with Infrastructure-as-Code tools such as Ansible and Terraform
  • Experience working with source control systems such as Git
  • MySQL experience preferred
  • PostgreSQL is a plus
  • Experience working with VLDBs (1+ TB) and managing large database fleets (100+ instances)
Job Responsibility
  • Managing, building, maintaining, monitoring, and troubleshooting the cloud-based MySQL database infrastructure that our mission-critical SaaS application depends on
  • Focus heavily on automation and coding to reduce operational toil
  • Collaborate closely with Engineering and SRE teams to support new product development and ensure reliable database integration across the platform
  • Work on observability of MySQL database metrics and ensure database performance and reliability objectives are consistently met
  • Work with the DBA team to identify areas of operational toil and implement automations/processes to manage PCC’s MySQL database systems at scale
  • Apply a data-driven approach to performance tuning, availability improvements, and operational optimization
  • Provide database support to Engineering and SRE teams, including review of database migrations, query performance, schema/design improvements, and standardizing MySQL configuration and deployment patterns
  • Assist the DBA team with performance troubleshooting and root-cause analysis
What we offer
  • Benefits starting from Day 1!
  • Retirement Plan Matching
  • Flexible Paid Time Off
  • Wellness Support Programs and Resources
  • Parental & Caregiver Leaves
  • Fertility & Adoption Support
  • Continuous Development Support Program
  • Employee Assistance Program
  • Allyship and Inclusion Communities
  • Employee Recognition … and more!
  • Fulltime

Site Reliability Engineer

The Silver Edge team brings the power of Azure to the edge for our customers, ta...
Location
United States, Redmond
Salary
100600.00 - 199000.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration OR equivalent experience
  • Security Clearance Requirements: Candidates must be able to meet Microsoft, customer and/or government security screening requirements for this role
  • The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph
  • The ability to meet Microsoft, customer and/or government security screening requirements is required pre-offer and post-hire for this role
  • This position requires successful verification of the stated security clearance to meet federal government customer requirements
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • This position requires verification of U.S. citizenship due to citizenship-based legal restrictions
Job Responsibility
  • Support customer deployments and use of Azure Local and Azure Local disconnected operations
  • Maintain Azure Service reliability including deployment, availability, security, performance and customer satisfaction for sovereign environments
  • Leverages technical expertise in cloud technologies and specific products, as well as objective insights drawn from analyses of production telemetry data, to suggest changes or add-ons to product features or the automation to improve the availability, security, quality, observability, reliability, efficiency, and performance of product components or features supported by their team
  • Engages with product engineering teams by participating in code/design reviews, regular meetings, on-call rotations and incident responses throughout product development and operations cycles
  • Utilizes technical knowledge of systems/platforms and insights drawn from product engineering teams, security best practices, artificial intelligence (AI)/machine learning (ML), and telemetry analyses to suggest potential improvements in code base and designs across components and features of one or more products
  • Leverages technical expertise and telemetry analysis alongside advanced artificial intelligence (AI) and machine learning (ML) algorithms across a range of components and/or features to identify patterns and opportunities to implement configuration and data changes for one or more platforms, systems, or products in production using code, tooling, and automation
  • Independently writes code or scripts that automate the performance of scalable operations processes (e.g., monitoring, alerting, deploying products and updates) across components and features of products operating at scale
  • Shares insights and best practices via documented artifacts that can be applied to improve development and operations of system, platform, or product components and features by participating in code/design reviews, incident drills and debriefs, and regular meetings, as well as interactions with more experienced SREs and members of product engineering teams
  • Develops alerts and instrumentation across components and features to monitor product capacity, related security risk, and resource demands and analyze telemetry data using existing capacity planning models
  • Draws insights from analyses of capacity and resource data to optimize component and feature code to manage resources and capacity across a limited range of use conditions and system parameters
  • Fulltime

CloudOps Engineer

As a Cloud Operations Engineer, you will support our internal teams by managing ...
Location
Poland
Salary
Not provided
RTB House
Expiration Date
Until further notice
Requirements
  • Solid knowledge of foundational Cloud concepts - including IAM, networking, VPC Service Controls, project and service configuration, and integrations - and the ability to guide users through setup, troubleshooting, and best practices
  • Experience assisting teams with CI/CD pipelines, including configuration, troubleshooting, and optimization, as well as automating repetitive operational tasks to improve efficiency and reliability
  • Ability to take ownership of operational tasks, diagnose and resolve issues across cloud services, and deliver clear, practical solutions even when requirements are incomplete or ambiguous
  • Strong communication and coordination skills to support internal users from IT and non-IT backgrounds, collaborate with engineering, security, and platform teams, and maintain clear documentation and operational guidance
  • Commitment to consistent, dependable operations through maintaining high standards of security and reliability, identifying areas for improvement, and contributing to streamlined workflows, runbooks, and service enablement
  • 3-5+ years of experience in cloud operations, IT operations, or similar technical support/engineering roles
  • Solid hands-on experience with Google Cloud Platform (GCP) or other major Cloud providers, including IAM, networking, resource management, and service configuration
  • Practical experience troubleshooting cloud services, integrations, networking issues, and IAM
  • Good working knowledge of CI/CD systems (preferably GitHub Actions), with the ability to assist teams in configuring pipelines, troubleshooting issues, and maintaining smooth deployment workflows
  • Solid understanding of infrastructure-as-code principles and practical experience with Terraform, sufficient to read, modify, and operate Terraform configurations in day-to-day cloud operations
Job Responsibility
  • Support our internal teams by managing day-to-day operations and providing technical guidance across our Google Cloud Platform (GCP) environments
  • Assist internal users - from IT and engineering to business teams - by advising on project setups, configuring services, enabling integrations, supporting CI/CD workflows, and ensuring that cloud resources follow best practices in security, networking, and governance
  • Combine operational support with small project work, helping teams onboard to GCP, troubleshoot issues, and efficiently adopt platform capabilities
  • Collaborate closely with DevOps, Platform Engineering, Security, and development teams to maintain a reliable, compliant, and well-structured cloud environment
What we offer
  • Competitive Compensation: We offer an attractive salary package with significant growth opportunities
  • Cutting-edge Technology: Engage with the latest technologies on large-scale, dynamic projects
  • Wellbeing: extremely flexible working conditions. You work when it is convenient for you and devote as much time as you can, and you can work fully remotely
  • Purpose-Driven Work: being at the heart of the system, your growing knowledge and competencies will be used in practical applications directly connected to business results

Lead Infrastructure Engineer - Solace

Wells Fargo is seeking a Lead Infrastructure Engineer–Solace to join the Solace ...
Location
United States, Iselin; Charlotte; Irving
Salary
119000.00 - 224000.00 USD / Year
Wells Fargo
Expiration Date
April 07, 2026
Requirements
  • 5+ years of Technology Infrastructure Engineering and Solutions experience, or an equivalent combination of education and experience
  • 5+ years of middleware engineering or administration experience using Solace PubSub+ hardware appliance offerings
  • 5+ years of experience building and managing enterprise-scale infrastructure using automation
  • 5+ years of overall infrastructure management experience
  • 3+ years of advanced coding or scripting skills, with a strong focus on infrastructure automation (e.g., Python or similar)
Job Responsibility
  • Lead complex initiatives to design, build, and engineer infrastructure solutions supporting mission-critical business applications
  • Drive an automation-first infrastructure engineering strategy, ensuring infrastructure is designed to be programmatically provisioned, configured, and managed
  • Apply AI-assisted engineering capabilities to accelerate infrastructure development, including code generation, configuration validation, design analysis, and standardization
  • Participate in projects to modernize and evolve Solace infrastructure architecture, aligning with target-state engineering principles
  • Evaluate internal and external technologies, including AI-enabled engineering platforms, to support infrastructure build, automation, and architectural goals
  • Design infrastructure patterns and tooling that reduce manual effort and improve repeatability, reliability, and engineering quality
  • Design, build, deploy, and maintain infrastructure solutions through collaboration with engineering teams
  • Design, code, test, debug, and document infrastructure automation and tooling using Agile engineering practices
  • Make technical decisions related to architecture, automation frameworks, AI usage, implementation plans, and engineering tradeoffs
  • Identify engineering risks and dependencies early and define mitigation strategies through design and automation
What we offer
  • Health benefits
  • 401(k) Plan
  • Paid time off
  • Disability benefits
  • Life insurance, critical illness insurance, and accident insurance
  • Parental leave
  • Critical caregiving leave
  • Discounts and savings
  • Commuter benefits
  • Tuition reimbursement
  • Fulltime

Principal Group Engineering Manager

Microsoft Specialized Clouds combines the power of edge platforms, devices, and ...
Location
India, Bangalore
Salary
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • 15+ years of professional software engineering experience, including designing, building, and operating distributed, cloud-scale services
  • 5+ years of engineering leadership experience, including managing managers and leading multi-team engineering organizations (M2+)
  • Deep experience with network device platforms — specifically Arista (EOS, eAPI, CloudVision) and/or Cisco (NX-OS, DCNM/NDFC) — including device programming, configuration management, and automation
  • Strong background in device programming and network automation — building systems that programmatically configure, validate, and manage network device state at scale
  • Experience with Azure Resource Provider (RP) engineering — ARM resource modeling, deployment pipelines, control-plane architecture, and resource lifecycle management
  • Solid understanding of L2/L3 networking fundamentals: spine-leaf architecture, VXLAN, overlay/underlay networking, BGP, and data center network design
  • Proven ability to set technical direction and architectural strategy for complex platforms spanning multiple components and partner teams
  • Demonstrated success owning end-to-end delivery of customer-critical services, including design, development, release, and live-site operations
  • Strong experience driving operational excellence, including reliability, incident management, automation, and cost optimization for production services
  • Proven track record of leading organizational transformation — such as quality resets, reliability turnarounds, code yellow resolution, or engineering culture change across an engineering org
Job Responsibility
  • Lead engineering teams through the design, architecture, development, testing, and operations of the Network Fabric platform — the cloud-managed networking layer for Azure Operator Nexus and Azure Local
  • Drive execution excellence across the full software lifecycle: semester planning, feature delivery, release management, and live-site operations
  • Own engineering commitments across multiple workstreams including network device programming, Azure Resource Provider development, fabric orchestration, and network configuration management
  • Ensure services meet Microsoft standards for quality, reliability, security, and operational readiness
  • Establish and enforce engineering best practices — including test-driven development, automated validation, secure development lifecycle (SDL/SFI), and continuous integration
  • Continue and accelerate the ongoing engineering transformation: driving quality resets, improving release predictability, and reducing customer-impacting incidents
  • Own the resolution of code yellow and equivalent quality escalations, driving root cause analysis and systemic remediation across the engineering organization
  • Champion a culture of engineering fundamentals — ensuring that quality, security, and operational maturity are embedded into every sprint, not treated as afterthoughts
  • Drive measurable reduction in support costs through automation, improved test coverage, and process optimization
  • Provide technical leadership across device programming (Arista EOS, Cisco NX-OS), network fabric orchestration, and Azure Resource Provider engineering
  • Fulltime

Senior/Middle DevOps Engineer

Our client is among the top-5 health insurance companies in the USA, serving ove...
Location
Argentina, Buenos Aires
Salary
Not provided
ELEKS
Expiration Date
Until further notice
Requirements
  • 4+ years of experience as a DevOps Engineer, Site Reliability Engineer (SRE), or similar role focused on CI/CD and release automation
  • Hands-on experience designing, configuring, and maintaining CI/CD pipelines with GitHub Actions, Azure DevOps, Jenkins, UCD
  • Strong focus on pipeline automation, rollback strategies, traceability, testing, and deployment validation
  • Practical knowledge of access management, release governance, and audit/compliance controls in CI/CD workflows
  • Familiarity with scripting (Bash, Python) to automate build/test/deployment steps
  • Understanding of containerization and orchestration (Docker, Kubernetes, OpenShift) for deployment pipelines
  • Ability to implement security and quality gates in pipelines to prevent faulty code from reaching production
  • Experience with monitoring and logging tools (Grafana, Prometheus, ELK) to validate and audit deployments
  • Excellent troubleshooting skills for resolving build and deployment issues quickly
  • Strong communication and collaboration skills to work with developers, QA, and operations teams
Job Responsibility
  • Own the administration & integration of all Software Development Lifecycle (SDLC) tooling to support developers and DevOps teams
  • Develop tool integrations that allow for end-to-end traceability for application development
  • Create auditable solutions for builds and deployments using tools like GitHub Enterprise, Jenkins, Sonar, Nexus and/or UCD
  • Support all development tooling platforms being used by product teams. Provide technical expertise in regard to tools development to enable automation
  • Gather tool requirements to create an integrated toolset across portfolios
  • Formulate strategy to identify application dependencies across multiple applications
  • Collaborate with customers to understand problem statement and translate into deliverable units of work
  • Define, implement, and maintain IT processes for integrating and deploying applications
  • Design, implement, and maintain automated CI/CD pipelines and processes to build and deploy code, content, services, and product environments
  • Demonstrated understanding of cloud concepts and technologies such as AWS and Azure
What we offer
  • Close cooperation with a customer
  • Challenging tasks
  • Competence development
  • Ability to influence project technologies
  • Team of professionals
  • Dynamic environment with low level of bureaucracy

Director of Engineering

The Director of Engineering is the senior technical execution leader responsible...
Location
United States, Aberdeen Proving Ground
Salary
Not provided
VES
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Engineering, Computer Science, or a related technical field (Master's degree preferred)
  • 15+ years of engineering experience, including significant hands-on technical responsibility for complex systems
  • 7+ years in senior technical leadership roles, such as Principal Engineer, Chief Engineer, Lead Architect, or equivalent
  • Demonstrated ability to independently solve complex, cross-domain technical problems involving software, systems, infrastructure, and security
  • Strong understanding of software engineering, systems engineering, integration practices, and modern deployment environments
  • Experience implementing and enforcing SDLC, configuration management, and quality standards
  • Experience working in a government contracting or regulated environment, including DoD or Federal programs
  • Ability to communicate complex technical concepts clearly to engineers, program leadership, executives, and customers
  • Excellent written and oral communication skills with respect to the above requirements
  • Ability to obtain and maintain a U.S. Government security clearance
Job Responsibility
  • Lead and oversee engineering execution across multiple concurrent programs, ensuring solutions meet cost, schedule, performance, quality, and architectural expectations
  • Serve as the primary technical execution lead across the organization, with authority to make technical decisions necessary to unblock delivery and resolve engineering challenges
  • Act as the first escalation point for complex technical problems, integration failures, and cross-program dependencies, independently driving solutions for the majority of issues before CTO involvement is required
  • Apply deep systems-level technical judgment to diagnose, frame, and resolve difficult engineering problems spanning software, systems, infrastructure, deployment, and security
  • Ensure engineering decisions made under delivery pressure preserve long-term system maintainability, reliability, and scalability
  • Develop and maintain a deep understanding of VES engineering processes, standards, and technical expectations, and ensure they are applied consistently across programs
  • Partner with Principal Engineers to review and approve system architectures, technical approaches, and major design decisions
  • Ensure architectural consistency and technical coherence across programs while allowing appropriate flexibility to meet mission and customer needs
  • Identify systemic technical issues, recurring failure modes, and architectural debt across the portfolio and drive corrective action
  • Work closely with Principal Engineers (Mission Command, Land Systems, Emerging Technologies, Cyber Security, Systems Engineering) as domain technical authorities
What we offer
  • 401(k) match
  • Highly Competitive Salary
  • Up to 15 Paid Vacation days / year
  • 11 Paid Holidays
  • Flexible work/life balance culture
  • Fulltime

Principal Consultant A2 - Infra

Microsoft Industry Solution - Global Center Innovation and Delivery Center (GCID...
Location
India, Hyderabad
Salary
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor’s degree in computer science, Engineering, or related field AND 3+ years leadership experience in relevant area of business. Higher Education Preferred
  • OR master’s degree in computer science, Information Technology, Engineering, or related field AND 6+ years’ experience in technology solutions, practice development, architecture, consulting, and/or Cloud Infrastructure domain
  • At least 10 years of customer-facing project experience involving solution design, project envisioning, planning, development, and deployment of complex solutions
  • Must have a proven record of delivering technical solutions
  • 2+ years managing multiple projects or portfolios
  • 1+ year(s) experience leading blended, multidisciplinary teams
  • Preferred Qualifications: 20+ years of overall industry experience
  • Technical or Professional Certification in Cloud Infrastructure domain
  • Open to travel domestically and internationally and work with different cultures and customers
  • Technical certifications based on domain/service line (e.g., Azure, Security, Dynamics)
Job Responsibility
  • AI-First Delivery Leadership: Embed AI-first principles into delivery workflows, leveraging automation and intelligent orchestration where applicable
  • Lead end-to-end delivery of complex projects, ensuring solutions are scalable, robust, and aligned with client business outcomes
  • Drive engineering excellence through reusable components, accelerators, and scalable architecture
  • Oversee technical execution across multiple projects, ensuring adherence to best practices, quality standards, and compliance requirements
  • Collaborate with clients and internal stakeholders to define strategies, delivery plans, milestones, and risk mitigation approaches
  • Act as a technical point of contact for clients, translating business requirements into scalable technical solutions
  • Ensure delivery models are optimized for modern, AI-native execution, including integration of automation and intelligent processes
  • Ability to step into at-risk projects, quickly assess issues, and establish a credible path to recovery or exit
  • Engineering Excellence: Champion high-quality engineering practices across all delivery engagements
  • Ensure adherence to coding standards, architectural integrity, and performance benchmarks
  • Fulltime