This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Barclays is seeking a Site Reliability Engineer to join its Securitized Products team within the Markets organization. This role focuses on improving the reliability, scalability, and performance of critical Securitized Products systems through automation, and engineering practices. You will apply software engineering, automation, and incident, change and best practices to ensure the reliability, availability, and scalability of Barclays technology platforms. You will act as a subject‑matter specialist for SRE practices, partnering with infrastructure and engineering teams to embed reliability into platforms across cloud, on-prem, compute, storage, networking, and databases.
Job Responsibility:
Development and delivery of high-quality software solutions by using industry aligned programming languages, frameworks, and tools
Cross-functional collaboration with product managers, designers, and other engineers to define software requirements, devise solution strategies, and ensure seamless integration and alignment with business objectives
Collaboration with peers, participate in code reviews, and promote a culture of code quality and knowledge sharing
Stay informed of industry technology trends and innovations and actively contribute to the organization’s technology communities to foster a culture of technical excellence and growth
Adherence to secure coding practices to mitigate vulnerabilities, protect sensitive data, and ensure secure software solutions
Implementation of effective unit testing practices to ensure proper code design, readability, and reliability
Requirements:
Programming or scripting experience (Python, Go, PowerShell, Bash, or similar) and SQL
Linux/Unix/Windows systems and systems engineering fundamentals
Client-server model architecture and scalability knowledge monitoring high traffic by distributing load across multiple backend servers
Performance monitoring and reducing latency in request-response cycles
Containers and orchestration (Docker, Kubernetes)
Networking (TCP/IP, DNS, HTTP, SFTP) and relational databases
and monitoring and observability tools (Geneos ITRS, Prometheus, Grafana, APM, Observe)
Nice to have:
Communication skills with the ability to influence teams and drive adoption of SRE practices
Familiarity with cloud platforms such as AWS, Azure, or GCP
Ability to troubleshoot multi-faceted, distributed systems with contending priorities
Trading floor support experience and basic understanding of Securitized Products business
Implementing redundancy, such as server clustering and failover mechanisms, to reduce application downtime and improve recovery time goals