This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
GEICO is seeking an experienced Distinguished Engineer with a passion for building high-performance, low maintenance, zero-downtime platforms and applications. You will help drive our enterprise transformation by establishing engineering excellence as a core mission, with a specific focus on organizational resilience, strategic risk management, and rigorous technical governance. This role demands mastery of reliability, availability, software engineering, and best practices in BCDR.
Job Responsibility:
Drive the technical BCDR strategy, ensuring it aligns with critical business and regulatory goals
Conduct comprehensive risk assessments
Lead the architecture of highly resilient systems
Define organization-wide Recovery Time Objective (RTO) and Recovery Point Objective (RPO) metrics
Validate recovery targets by overseeing regular BCDR simulations and Chaos Engineering programs
Serve as a key leader within the Architecture Review Board
Set and rigorously enforce architectural standards, policies, and blueprints
Ensure all major technology investments are strategically aligned with business objectives and compliance requirements
Enforce domain consistency across architecture layers
Drive strategic modernization efforts to maximize scalability and coherence
Lead the SRE strategy by establishing and monitoring Service Level Objectives (SLOs) and error budgets
Develop and maintain comprehensive incident response plans, runbooks, and playbooks
Drive automation to achieve low Mean Time To Resolution (MTTR)
Analyze post-incident results to eradicate architectural flaws
Act as a trusted advisor to executive stakeholders on resilience and governance matters
Serve as a role model and mentor to coach senior and principal engineering talent
Analyze cost and forecast data, playing a critical role in strategic financial stewardship, particularly in Cloud Spend Optimization
Requirements:
Fluency and specialization in software development and best practices using modern programming languages
Deep knowledge of SRE practices, methodologies, and principles, along with a solid understanding of cloud-based compute, network, and storage technologies
Strong background in incident management (a core function of Case Management in platform operations), including the ability to create incident response playbooks, runbooks, and perform rigorous post-incident analysis
Expertise in distributed systems architecture, replication topologies, and distributed consistency patterns to meet stringent RTO and RPO requirements
Understanding of SQL and NoSQL databases, including stateful services management, storage, and optimization strategies for resilience and cloud cost efficiency
In-depth knowledge of hybrid cloud architecture, IaaS and PaaS technologies, container orchestration platforms (e.g., Kubernetes), and cloud efficiency
Experience with infrastructure automation, tooling, and configuration management frameworks (e.g., Ansible, Terraform)
Exceptional leadership and communication skills, with a passion for mentoring and fostering professional growth
Visionary thinker with the ability to anticipate future challenges and opportunities in resilience and governance
Proven track record of successfully leading, designing, and delivering complex engineering projects in large and complex organizations
12+ years of professional software development experience
10+ years of experience with architecture and design
6+ years of experience in open-source frameworks
6+ years of experience with AWS, GCP, Azure, or another cloud service
Bachelor's degree in computer science, Information Systems, or equivalent education or work experience
What we offer:
Comprehensive Total Rewards program
401K savings plan vested from day one that offers a 6% match
Performance and recognition-based incentives
Tuition assistance
Access to additional benefits like mental healthcare as well as fertility and adoption assistance
Workplace flexibility
GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year