This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Join us as an Infrastructure Engineer Barclays, responsible for supporting the successful delivery of Location Strategy projects to plan, budget, agreed quality and governance standards. You'll spearhead the evolution of our digital landscape, driving innovation and excellence. You will harness cutting-edge technology to revolutionise our digital offerings, ensuring unparalleled customer experiences. To build and maintain infrastructure platforms and products that support applications and data systems, using hardware, software, networks, and cloud computing platforms as required with the aim of ensuring that the infrastructure is reliable, scalable, and secure. Ensure the reliability, availability, and scalability of the systems, platforms, and technology through the application of software engineering techniques, automation, and best practices in incident response.
Job Responsibility:
Build Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfil business requirements ensuring measurable reliability, performance, availability, and ease of use
Incident Management: Monitoring of IT infrastructure and system performance to measure, identify, address, and resolve any potential issues, vulnerabilities, or outages
Automation: Development and implementation of automated tasks and processes to improve efficiency and reduce manual intervention
Security: Implementation of a secure configuration and measures to protect infrastructure against cyber-attacks, vulnerabilities, and other security threats
Teamwork: Cross-functional collaboration with product managers, architects, and other engineers to define IT Infrastructure requirements, devise solutions, and ensure seamless integration
Learning: Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities
Requirements:
SRE Fundamentals: Incident management and RCA for production systems
SLAs for mission-critical platforms
Capacity planning and performance tuning
Change and release risk management
Observability (Baseline): Nonstop Measure
System and application monitoring
Alerting, threshold management, and event correlation
Modern Observability: Grafana dashboards
Prometheus (metrics concepts, exporters)
Log aggregation (ELK / OpenSearch)
Custom metric extraction and integration
DevOps & Integration: Git-based workflows
CI/CD tools (Jenkins, GitHub Actions, DevOps)
API integration (REST/SOAP)
Messaging systems (Kafka, MQ, JMS)
Programming & Automation: TAL / pTAL, COBOL, C/C++
OSS shell scripting
Python for automation and monitoring
Job scheduling and workload automation
HP NonStop Platform: HP NonStop OS (Guardian & OSS)
NonStop performance, capacity, and availability management